psychometric test scores: Topics by WorldWideScience.org

Sample records for psychometric test scores

Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Science.gov (United States)

Kolen, Michael J.; Lee, Won-Chan

2011-01-01

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Psychometric Evaluation of the Lower Extremity Computerized Adaptive Test, the Modified Harris Hip Score, and the Hip Outcome Score.

Science.gov (United States)

Hung, Man; Hon, Shirley D; Cheng, Christine; Franklin, Jeremy D; Aoki, Stephen K; Anderson, Mike B; Kapron, Ashley L; Peters, Christopher L; Pelt, Christopher E

2014-12-01

The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypothesis was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Cohort study (diagnosis); Level of evidence, 2. Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior in all psychometric aspects examined in this study. Future
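
The Cronbach alpha and ceiling/floor percentages reported in the record above can be illustrated with a minimal sketch. The function names and the toy data layout (one list of item scores per respondent) are assumptions made here for illustration; the study's own software and scoring rules are not described in the abstract.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    # Sample variance of the summed (total) score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def ceiling_floor_pct(totals, min_score, max_score):
    """Percentage of respondents at the maximum (ceiling) and minimum (floor) score."""
    n = len(totals)
    ceiling = 100 * sum(t == max_score for t in totals) / n
    floor = 100 * sum(t == min_score for t in totals) / n
    return ceiling, floor
```

A high ceiling percentage means many respondents sit at the top of the scale, so the instrument cannot distinguish among them; this is the concern the abstract raises for the HOS-ADL and mHHS.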
Normalization of the psychometric hepatic encephalopathy score for ...

African Journals Online (AJOL)

Aim: To construct normal values for the tests of the psychometric hepatic encephalopathy score (PHES) and evaluate the prevalence of minimal hepatic encephalopathy (MHE) among Turkish patients with liver cirrhosis. Materials and Methods: One hundred and eighty-five healthy subjects and sixty patients with liver ...
Smoking habit and psychometric scores: a community study.

Science.gov (United States)

Waal-Manning, H J; de Hamel, F A

1978-09-13

During the Milton health survey subjects completed a psychometric inventory consisting of the 48 questions of the Middlesex Hospital questionnaire (MHQ) and 26 from the hostility and direction of hostility questionnaire (HDHQ) designed to examine nine psychological dimensions. The 1209 subjects were classified into smoking categories and the scores for each psychometric trait were calculated. Women scored higher than men and heavy smokers scored higher than "never smokers". The psychometric traits and the scores of the four smoking categories after correcting for age and Quetelet's index showed statistically significant differences by analysis of variance in respect of somatic anxiety and depression for both men and women; and free-floating anxiety, phobic anxiety, hysteria, acting out hostility, self criticism and guilt in women. For somatic anxiety the increase in score almost exactly paralleled the increasing quantity of tobacco consumed.
Psychometric properties of the Cumulated Ambulation Score

DEFF Research Database (Denmark)

Ferriero, Giorgio; Kristensen, Morten T; Invernizzi, Marco

2018-01-01

INTRODUCTION: In the geriatric population, independent mobility is a key factor in determining readiness for discharge following acute hospitalization. The Cumulated Ambulation Score (CAS) is a potentially valuable score that allows day-to-day measurements of basic mobility. The CAS was developed... and validated in older patients with hip fracture as an early postoperative predictor of short-term outcome, but it is also used to assess geriatric in-patients with acute medical illness. Despite the fast-accumulating literature on the CAS, to date no systematic review synthesizing its psychometric properties... Of 49 studies identified, 17 examined the psychometric properties of the CAS. EVIDENCE SYNTHESIS: Most papers dealt with patients after hip fracture surgery, and only 4 studies assessed the CAS psychometric characteristics also in geriatric in-patients with acute medical illness. Two versions of CAS...
Assessment of Minimal HE (with emphasis on computerized psychometric tests)

Science.gov (United States)

Kappus, Matthew R; Bajaj, Jasmohan S

2012-01-01

Synopsis: Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life and driving accidents. The detection of MHE requires specialized testing since it cannot, by definition, be diagnosed on standard clinical examination. Psychometric (paper-pencil or computerized or a combination) and neuro-physiological techniques are often used to test for MHE. Paper-pencil psychometric batteries like the Psychometric Hepatic Encephalopathy Score (PHES) have been validated in several countries but do not have US normative values. Computerized tests such as the inhibitory control test (ICT), cognitive drug research system and Scan test have proven useful to diagnose MHE and predict outcomes. The specificity and sensitivity of these tests are similar to the recommended gold standards. Neuro-physiological tests such as the EEG and its interpretations, evoked potentials and Critical Flicker Frequency (CFF) also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike and the testing strategies depend on the normative data available, patient comfort and local expertise. PMID:22321464
Psychometric challenges and proposed solutions when scoring facial emotion expression codes

OpenAIRE

Olderbak, Sally; Hildebrandt, Andrea; Pinkpank, Thomas; Sommer, Werner; Wilhelm, Oliver

2013-01-01

Coding of facial emotion expressions is increasingly performed by automated emotion expression scoring software; however, there is limited discussion on how best to score the resulting codes. We present a discussion of facial emotion expression theories and a review of contemporary emotion expression coding methodology. We highlight methodological challenges pertinent to scoring software-coded facial emotion expression codes and present important psychometric research questions centered on co...
[Standardization of the Test Your Memory and evaluation of their concordance with the outcome of the psychometric examination].

Science.gov (United States)

Ferrero-Arias, J; Turrión-Rojo, M A

2016-05-01

To explore the relationship between scores on the Test Your Memory (TYM) battery and findings from a more exhaustive neurocognitive assessment. The TYM and fourteen psychometric tests were administered to 84 subjects aged 50 or older who attended an outpatient neurology clinic due to cognitive symptoms. Each patient's cognitive state was determined independently from his/her score on the TYM (CDR 0, n=25; CDR 0.5, n=45; CDR 1, n=14). We analysed concurrent validity of TYM scores and results from the psychometric tests, as well as the degree of concordance between the two types of measurement, by contrasting normalised data from each instrument. Although the intraclass correlation coefficient was 0.67 (confidence interval 95%, 0.53-0.77), analysis of the Bland-Altman plot and the curve on the survival-agreement plot (Luiz et al. method) demonstrates that the individual distances between the two methods exhibit excessive dispersion from a clinical viewpoint. TYM-based predictions of the mean z-score on psychometric tests differed substantially from real results in 30% of the subjects. Concordance of 95% can only be achieved by accepting absolute inter-instrument differences of up to 0.87 as identical values. Furthermore, the TYM underestimates cognitive performance for low values and overestimates it for high values. The TYM is a cognitive screening test which should not be used to predict results on psychometric tests or to detect cognitive changes in clinical trials. Copyright © 2014 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.
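
The Bland-Altman analysis mentioned in the record above rests on the differences between paired measurements. The following is a minimal sketch of the conventional 95% limits of agreement (bias ± 1.96 standard deviations of the differences); the function name is hypothetical, and the abstract does not state exactly how the authors computed their limits or the survival-agreement curve.

```python
from statistics import mean, stdev


def limits_of_agreement(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)   # systematic difference between the two methods
    sd = stdev(diffs)    # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd
```

Wide limits relative to what is clinically tolerable indicate poor individual-level agreement even when a summary coefficient such as the intraclass correlation looks moderate, which is the pattern the abstract describes.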
The use of test scores from large-scale assessment surveys: psychometric and statistical considerations

Directory of Open Access Journals (Sweden)

Henry Braun

2017-11-01

Background: Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV) methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT or ACT. These differences have important implications both for utilization and interpretation. Although much has been written about PVs, it appears that there are still misconceptions about whether and how to employ them in secondary analyses. Methods: We address a range of technical issues, including those raised in a recent article that was written to inform economists using these databases. First, an extensive review of the relevant literature was conducted, with particular attention to key publications that describe the derivation and psychometric characteristics of such achievement measures. Second, a simulation study was carried out to compare the statistical properties of estimates based on the use of PVs with those based on other, commonly used methods. Results: It is shown, through both theoretical analysis and simulation, that under fairly general conditions appropriate use of PVs yields approximately unbiased estimates of model parameters in regression analyses of large-scale survey data. The superiority of the PV methodology is particularly evident when measures of student achievement are employed as explanatory variables. Conclusions: The PV methodology used to report student test performance in large-scale surveys remains the state of the art for secondary analyses of these databases.
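
As an illustration of what "appropriate use" of plausible values usually involves, the sketch below pools results from analyses run once per plausible value using the standard multiple-imputation combining rules (Rubin's rules). This is an assumption about common practice, not a description of the paper's simulation; the function name and inputs are hypothetical.

```python
import math
from statistics import mean, variance


def pool_plausible_values(estimates, sampling_variances):
    """Combine one estimate per plausible value into a pooled estimate and SE.

    estimates: the same statistic (e.g. a regression coefficient) computed
        separately with each of the M plausible values.
    sampling_variances: the squared standard error from each of those analyses.
    """
    m = len(estimates)
    pooled = mean(estimates)              # final point estimate
    within = mean(sampling_variances)     # average sampling variance
    between = variance(estimates)         # variance across plausible values
    total = within + (1 + 1 / m) * between
    return pooled, math.sqrt(total)
```

Averaging the plausible values first and analysing the average once would drop the between-PV term and understate uncertainty, which is one of the misuses the PV literature warns about.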
A Psychometric Evaluation of the Mayer-Salovey-Caruso Emotional Intelligence Test Version 2.0

Science.gov (United States)

Palmer, B.R.; Gignac, G.; Manocha, R.; Stough, C.

2005-01-01

There has been some debate recently over the scoring, reliability and factor structure of ability measures of emotional intelligence (EI). This study examined these three psychometric properties with the most recent ability test of EI, the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT V2.0; Mayer, Salovey, & Caruso,…
Patient activation in Europe: an international comparison of psychometric properties and patients' scores on the short form Patient Activation Measure (PAM-13).

Science.gov (United States)

Rademakers, Jany; Maindal, Helle Terkildsen; Steinsbekk, Aslak; Gensichen, Jochen; Brenk-Franz, Katja; Hendriks, Michelle

2016-10-12

To allow better assessment of patients' individual competencies for self-management, the Patient Activation Measure (PAM) has been developed in the USA. Because the American studies have shown the PAM to be a valuable tool, several European countries have translated the instrument into their native languages (Danish, Dutch, German, Norwegian). The aim was to compare the psychometric properties in studies from the different countries and establish whether the scores on the PAM vary between the studies. Data from the four separate studies were subjected to the same data cleaning procedures and statistical analyses. The psychometric properties of the instruments were established with measures of data quality and scale structure. The mean patient activation score and distribution across four predefined activation levels were described and the differences between the four studies were tested with ANOVA (unadjusted and adjusted) followed by a post-hoc Tukey HSD test and the Pearson chi-squared test respectively. The total N of the four studies was 5184. The percentage of missing values was low in all datasets, confirming the good quality of the datasets. Factor analyses revealed moderate to strong factor loadings on the first factor in all datasets. Cronbach's α was high for all versions, ranging from .80 (German) to .88 (Dutch). Item-rest correlations varied between .32 and .66, indicating a moderate to strong correlation of the individual items to the sum scale. Both the mean PAM score and the distribution across activation levels differed between the four datasets. After adjustment of the PAM score, patients in Norway in particular had a higher patient activation level. The European translations of PAM-13 (into Danish, Dutch, German and Norwegian) resulted in four instruments with good psychometric capabilities for measuring patient activation. The mean PAM score and the distribution across activation levels differed between the four datasets.
A New Clinical Pain Knowledge Test for Nurses: Development and Psychometric Evaluation.

Science.gov (United States)

Bernhofer, Esther I; St Marie, Barbara; Bena, James F

2017-08-01

All nurses care for patients with pain, and pain management knowledge and attitude surveys for nurses have been around since 1987. However, no validated knowledge test exists to measure postlicensure clinicians' knowledge of the core competencies of pain management in current complex patient populations. To develop and test the psychometric properties of an instrument designed to measure pain management knowledge of postlicensure nurses. Psychometric instrument validation. Four large Midwestern U.S. hospitals. Registered nurses employed full time and part time August 2015 to April 2016, aged M = 43.25 years; time as RN, M = 16.13 years. Prospective survey design using e-mail to invite nurses to take an electronic multiple choice pain knowledge test. Content validity of initial 36-item test "very good" (95.1% agreement). Completed tests that met analysis criteria, N = 747. Mean initial test score, 69.4% correct (range 27.8-97.2). After revision/removal of 13 unacceptable questions, mean test score was 50.4% correct (range 8.7-82.6). Initial test item percent difficulty range was 15.2%-98.1%; discrimination values range, 0.03-0.50; final test item percent difficulty range, 17.6%-91.1%, discrimination values range, -0.04 to 1.04. Split-half reliability final test was 0.66. A high decision consistency reliability was identified, with test cut-score of 75%. The final 23-item Clinical Pain Knowledge Test has acceptable discrimination, difficulty, decision consistency, reliability, and validity in the general clinical inpatient nurse population. This instrument will be useful in assessing pain management knowledge of clinical nurses to determine gaps in education, evaluate knowledge after pain management education, and measure research outcomes. Copyright © 2017 American Society for Pain Management Nursing. Published by Elsevier Inc. All rights reserved.
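
The item statistics named in the record above (percent difficulty and split-half reliability) can be sketched as follows. The odd/even split and the Spearman-Brown correction are assumptions chosen for illustration, since the abstract does not say how the halves were formed or whether a correction was applied; all function names are hypothetical.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (non-zero variance assumed)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def item_difficulty(responses):
    """Percent of respondents answering each item correctly (1 = correct, 0 = wrong)."""
    n = len(responses)
    k = len(responses[0])
    return [100 * sum(r[i] for r in responses) / n for i in range(k)]


def split_half_reliability(responses):
    """Odd/even split-half correlation, stepped up with the Spearman-Brown formula."""
    odd = [sum(r[0::2]) for r in responses]   # items 1, 3, 5, ...
    even = [sum(r[1::2]) for r in responses]  # items 2, 4, 6, ...
    r = pearson(odd, even)
    return 2 * r / (1 + r)
```

Note that "percent difficulty" as computed here is the share of correct answers, so higher values mean easier items.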
Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

Science.gov (United States)

Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

2011-08-15

Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female, with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery and those who do not.
Testing Psychometrics of Healthcare Empowerment Questionnaires ...

African Journals Online (AJOL)

Testing Psychometrics of Healthcare Empowerment Questionnaires (HCEQ) among Iranian ... translation and backtranslation procedures, pilot testing, and getting views of expert panel.
Raven's Test Performance of Sub-Saharan Africans: Average Performance, Psychometric Properties, and the Flynn Effect

Science.gov (United States)

Wicherts, Jelte M.; Dolan, Conor V.; Carlson, Jerry S.; van der Maas, Han L. J.

2010-01-01

This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's test scores as measures of general intelligence.…
Nonparametric tests for equality of psychometric functions.

Science.gov (United States)

García-Pérez, Miguel A; Núñez-Antón, Vicente

2017-12-07

Many empirical studies measure psychometric functions (curves describing how observers' performance varies with stimulus magnitude) because these functions capture the effects of experimental conditions. To assess these effects, parametric curves are often fitted to the data and comparisons are carried out by testing for equality of mean parameter estimates across conditions. This approach is parametric and, thus, vulnerable to violations of the implied assumptions. Furthermore, testing for equality of means of parameters may be misleading: Psychometric functions may vary meaningfully across conditions on an observer-by-observer basis with no effect on the mean values of the estimated parameters. Alternative approaches to assess equality of psychometric functions per se are thus needed. This paper compares three nonparametric tests that are applicable in all situations of interest: The existing generalized Mantel-Haenszel test, a generalization of the Berry-Mielke test that was developed here, and a split variant of the generalized Mantel-Haenszel test also developed here. Their statistical properties (accuracy and power) are studied via simulation and the results show that all tests are indistinguishable as to accuracy but they differ non-uniformly as to power. Empirical use of the tests is illustrated via analyses of published data sets and practical recommendations are given. The computer code in MATLAB and R to conduct these tests is available as Electronic Supplemental Material.
Predicting psychopharmacological drug effects on actual driving performance (SDLP) from psychometric tests measuring driving-related skills.

Science.gov (United States)

Verster, Joris C; Roth, Thomas

2012-03-01

There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving are often not determined. The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a constant speed and steady lateral position within the right traffic lane. Standard deviation of lateral position (SDLP), i.e., the weaving of the car, was determined. The subjects also performed a psychometric test battery including the DSST, Sternberg memory scanning test, a tracking test, and a divided attention test. Difference scores from placebo for parameters of the psychometric tests and SDLP were computed and correlated with each other. A stepwise linear regression analysis determined the predictive validity of the laboratory test battery to SDLP. Stepwise regression analyses revealed that the combination of five parameters, hard tracking, tracking and reaction time of the divided attention test, and reaction time and percentage of errors of the Sternberg memory scanning test, together had a predictive validity of 33.4%. The psychometric tests in this test battery showed insufficient predictive validity to replace the on-the-road driving test during normal traffic.
Translation, cross-cultural adaptation, and psychometric properties of the German version of the hip disability and osteoarthritis outcome score.

Science.gov (United States)

Blasimann, Angela; Dauphinee, Sharon Wood; Staal, J Bart

2014-12-01

Clinical measurement. To translate and cross-culturally adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) from English into German, and to study its psychometric properties in patients after hip surgery. There is no specific hip questionnaire in German that not only measures symptoms and function but also contains items about hip-related quality of life. The translation and cross-cultural adaptation involved forward translation, harmonization, cognitive debriefing, back translation, and comparison to the original HOOS following international guidelines. The German version was tested in 51 Swiss inpatients 8 weeks after different types of hip surgery, mainly total hip replacement. The mean age of the participants was 62.5 years, and the age range was from 27 to 87 years. Thirty (58.8%) of the participants were women. Internal consistency and test-retest reliability were estimated using Cronbach alpha and intraclass correlation coefficients for agreement. For construct validity, total scores of the German HOOS were correlated with those of the Western Ontario and McMaster Universities Osteoarthritis Index. The HOOS was also compared to the Medical Outcomes Study 36-Item Short-Form Health Survey. Cronbach alpha values for all German HOOS subscales were between .87 and .93. For test-retest reliability, the intraclass correlation coefficient for agreement was 0.85 for the total scores of the German HOOS. The Spearman rho for the Medical Outcomes Study 36-Item Short-Form Health Survey physical functioning subscale compared to the sum of all HOOS subscales was 0.71, and that for the Medical Outcomes Study 36-Item Short-Form Health Survey physical component summary was 0.97. The German HOOS has demonstrated adequate reliability and validity. Use of the German HOOS is recommended for assessment of patients after hip surgery, with the proviso that additional psychometric testing should be done in future research.
Psychometric Properties of the Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite Scale.

Science.gov (United States)

Barnett, Carolina; Merkies, Ingemar S J; Katzberg, Hans; Bril, Vera

2015-09-02

The Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite are two commonly used outcome measures in Myasthenia Gravis. So far, their measurement properties have not been compared, so we aimed to study their psychometric properties using the Rasch model. 251 patients with stable myasthenia gravis were assessed with both scales, and 211 patients returned for a second assessment. We studied fit to the Rasch model at the first visit, and compared item fit, thresholds, differential item functioning, local dependence, person separation index, and tests for unidimensionality. We also assessed test-retest reliability and estimated the Minimal Detectable Change. Neither scale fit the Rasch model (χ² p […]) […] Myasthenia Gravis Composite had lower discrimination properties than the Quantitative Myasthenia Gravis Scale (Person Separation Index: 0.14 and 0.7). There was local dependence in both scales, as well as differential item functioning for ocular and generalized disease. Disordered thresholds were found in 6 (60%) items of the Myasthenia Gravis Composite and in 4 (31%) of the Quantitative Myasthenia Gravis Score. Both tools had adequate test-retest reliability (ICCs >0.8). The minimally detectable change was 4.9 points for the Myasthenia Gravis Composite and 4.3 points for the Quantitative Myasthenia Gravis Score. Neither scale fulfilled Rasch model expectations. The Quantitative Myasthenia Gravis Score has higher discrimination than the Myasthenia Gravis Composite. Both tools have items with disordered thresholds, differential item functioning and local dependency. There was evidence of multidimensionality in the QMGS. The minimal detectable change values are higher than previous studies on the minimal significant change. These findings might inform future modifications of these tools.
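
The record above reports a Minimal Detectable Change derived from test-retest data. One widely used way to obtain such a value is from the standard error of measurement, sketched below; the abstract does not state which formula the authors used, so the formula, the 95% confidence multiplier, and the function name are all assumptions.

```python
import math


def minimal_detectable_change(sd, icc, z=1.96):
    """MDC at the confidence level implied by z (1.96 gives the common MDC95).

    sd:  standard deviation of scores in the sample
    icc: test-retest intraclass correlation coefficient
    """
    sem = sd * math.sqrt(1 - icc)   # standard error of measurement
    return z * math.sqrt(2) * sem   # sqrt(2): two measurements, each with error
```

Under this formula a change smaller than the MDC cannot be distinguished from measurement error, which is why the abstract compares its MDC values with previously reported minimal significant changes.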
Psychometric testing and Human Resource Management

Directory of Open Access Journals (Sweden)

R. P. van der Merwe

2002-09-01

This is a cumulative report on the findings of various exploratory studies that were done with regard to the practice of psychometric testing in the Eastern Cape. Recent and ongoing developments in the South African labour legislation, and especially the implications of the Employment Equity Act, highlight once again the importance of the validation of all instruments to be used for human assessment and selection purposes. Information was gathered to establish which psychometric tests are used, and for what purposes, in industry today. Biographical information on each organisation is supplied, including the number of employees. The role of psychometric testing in the selection procedure is discussed. The different tests used, as well as the test users, are also indicated. The findings of other, related research, as well as comments, recommendations and shortcomings, are discussed.

Psychometric testing of the Spiritual Well-Being Scale-Mandarin version in Taiwanese cancer patients.

Science.gov (United States)

Tang, Woung-Ru; Kao, Chen-Yi

2017-06-01

The spiritual well-being of terminally ill cancer patients is an important indicator of the quality of their lives and of the quality of hospice care, but no validated tools are available for assessing this indicator in Taiwan. The present cross-sectional study validated the Spiritual Well-Being Scale-Mandarin version (SWBS-M) by testing its psychometric properties in 243 cancer patients from five teaching hospitals throughout Taiwan. Construct validity was tested by factor analysis and hypothesis testing. Patients' spiritual well-being and quality of life were assessed using the SWBS-M and the McGill Quality of Life Questionnaire (MQoL), respectively. Overall, the SWBS-M had an internal consistency/reliability of 0.89. Exploratory factor analysis showed that the SWBS-M had an underlying two-factor structure, explaining 46.94% of the variance. SWBS-M scores correlated moderately with MQoL scores (r = 0.48, p […]) […] spiritual well-being was inversely related to their average pain level during the previous 24 hours (r = -0.183, p = 0.006). Cancer patients' spiritual well-being also differed significantly with their experience of pain (t = -3.67, p […]) […] spiritual well-being than those without pain. Our findings support a two-factor model for the SWBS-M in terminally ill Taiwanese cancer patients. We recommend testing the psychometric properties of the SWBS-M in different patient populations to verify its factorial structure in other Asian countries.
Psychometrics evaluation of Charcot-Marie-Tooth Neuropathy Score (CMTNSv2) second version, using Rasch analysis.

Science.gov (United States)

Sadjadi, Reza; Reilly, Mary M; Shy, Michael E; Pareyson, Davide; Laura, Matilde; Murphy, Sinead; Feely, Shawna M E; Grider, Tiffany; Bacon, Chelsea; Piscosquito, Giuseppe; Calabrese, Daniela; Burns, Ted M

2014-09-01

Charcot-Marie-Tooth Neuropathy Score second version (CMTNSv2) is a validated clinical outcome measure developed for use in clinical trials to monitor disease impairment and progression in affected CMT patients. Currently, all items of CMTNSv2 have identical contribution to the total score. We used Rasch analysis to further explore psychometric properties of CMTNSv2, and in particular, category response functioning, and their weight on the overall disease progression. Weighted category responses represent a more accurate estimate of actual values measuring disease severity and therefore could potentially be used in improving the current version. © 2014 Peripheral Nerve Society.
Psychometric properties of the Turkish versions of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in the prison setting.

Science.gov (United States)

Evren, Cuneyt; Ogel, Kultegin; Evren, Bilge; Bozkurt, Muge

2014-01-01

The aim of this study was to evaluate psychometric properties of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in prisoners with (n = 124) or without (n = 78) drug use disorder. Participants were evaluated with the DUDIT, the DAST-10, and the Addiction Profile Index-Short (API-S). The DUDIT and the DAST-10 were found to be psychometrically sound drug abuse screening measures with high convergent validity when compared with each other (r = 0.86), and API-S (r = 0.88 and r = 0.84, respectively), and to have a Cronbach's α of 0.93 and 0.87, respectively. In addition, a single component accounted for 58.28% of total variance for DUDIT, whereas this was 47.10% for DAST-10. The DUDIT had sensitivity and specificity scores of 0.95 and 0.79, respectively, when using the optimal cut-off score of 10, whereas these scores were 0.88 and 0.74 for the DAST-10 when using the optimal cut-off score of 4. Additionally, both the DUDIT and the DAST-10 showed good discriminant validity as they differentiated prisoners with drug use disorder from those without. Findings support the Turkish versions of both the DUDIT and the DAST-10 as reliable and valid drug abuse screening instruments that measure unidimensional constructs.
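
The sensitivity and specificity figures in the record above depend on the chosen cut-off score. A minimal sketch of that calculation follows; treating a score at or above the cut-off as a positive screen is an assumption made here, as the abstract does not say whether the cut-off is inclusive, and the function name and argument layout are hypothetical.

```python
def sensitivity_specificity(scores, has_disorder, cutoff):
    """Sensitivity and specificity of a screening score at a given cut-off.

    scores:       screening totals, one per participant
    has_disorder: matching True/False reference diagnoses
    cutoff:       a score >= cutoff counts as a positive screen (assumed inclusive)
    """
    pairs = list(zip(scores, has_disorder))
    tp = sum(1 for s, d in pairs if s >= cutoff and d)
    fn = sum(1 for s, d in pairs if s < cutoff and d)
    tn = sum(1 for s, d in pairs if s < cutoff and not d)
    fp = sum(1 for s, d in pairs if s >= cutoff and not d)
    return tp / (tp + fn), tn / (tn + fp)
```

Repeating the calculation over every candidate cut-off and picking the best trade-off is the usual route to an "optimal" cut-off such as those reported for the DUDIT and the DAST-10.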
Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

Science.gov (United States)

Sawaki, Yasuyo; Sinharay, Sandip

2013-01-01

This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…
Evidence for the Psychometric Validity, Internal Consistency and Measurement Invariance of Warwick Edinburgh Mental Well-being Scale Scores in Scottish and Irish Adolescents.

Science.gov (United States)

McKay, Michael T; Andretta, James R

2017-09-01

Mental well-being is an important indicator of current, but also the future health of adolescents. The 14-item Warwick Edinburgh Mental Well-being Scale (WEMWBS) has been well validated in adults world-wide, but less work has been undertaken to examine the psychometric validity and internal consistency of WEMWBS scores in adolescents. In particular, little research has examined scores on the short 7-item version of the WEMWBS. The present study used two large samples of school children in Scotland and Northern Ireland and found that for both forms of the WEMWBS, scores were psychometrically valid, internally consistent, factor saturated, and measurement invariant by country. Using the WEMWBS full form, males reported significantly higher scores than females, and Northern Irish adolescents reported significantly higher scores than their Scottish counterparts. Last, the lowest overall levels of well-being were observed among Scottish females. Copyright © 2017. Published by Elsevier B.V.
A Psychometric Review of Measures Assessing Discrimination Against Sexual Minorities.

Science.gov (United States)

Morrison, Todd G; Bishop, C J; Morrison, Melanie A; Parker-Taneo, Kandice

2016-08-01

Discrimination against sexual minorities is widespread and has deleterious consequences on victims' psychological and physical wellbeing. However, a review of the psychometric properties of instruments measuring lesbian, gay, and bisexual (LGB) discrimination has not been conducted. The results of this review, which involved evaluating 162 articles, reveal that most have suboptimal psychometric properties. Specifically, myriad scales possess questionable content validity as (1) items are not created in collaboration with sexual minorities; (2) measures possess a small number of items and, thus, may not sufficiently represent the domain of interest; and (3) scales are "adapted" from measures designed to examine race- and gender-based discrimination. Additional limitations include (1) summed scores are computed, often in the absence of scale score reliability metrics; (2) summed scores operate from the questionable assumption that diverse forms of discrimination are necessarily interrelated; (3) the dimensionality of instruments presumed to consist of subscales is seldom tested; (4) tests of criterion-related validity are routinely omitted; and (5) formal tests of measures' construct validity are seldom provided, necessitating that one infer validity based on the results obtained. The absence of "gold standard" measures, the attendant difficulty in formulating a coherent picture of this body of research, and suggestions for psychometric improvements are noted.
Development and psychometric testing of a Clinical Reasoning Evaluation Simulation Tool (CREST) for assessing nursing students' abilities to recognize and respond to clinical deterioration.

Science.gov (United States)

Liaw, Sok Ying; Rashasegaran, Ahtherai; Wong, Lai Fun; Deneen, Christopher Charles; Cooper, Simon; Levett-Jones, Tracy; Goh, Hongli Sam; Ignacio, Jeanette

2018-03-01

The development of clinical reasoning skills in recognising and responding to clinical deterioration is essential in pre-registration nursing education. Simulation has been increasingly used by educators to develop this skill. To develop and evaluate the psychometric properties of a Clinical Reasoning Evaluation Simulation Tool (CREST) for measuring clinical reasoning skills in recognising and responding to clinical deterioration in a simulated environment. A scale development with psychometric testing and mixed methods study. Nursing students and academic staff were recruited at a university. A three-phase prospective study was conducted. Phase 1 involved the development and content validation of the CREST; Phase 2 included the psychometric testing of the tool with 15 second-year and 15 third-year nursing students who undertook the simulation-based assessment; Phase 3 involved the usability testing of the tool with nine academic staff through a survey questionnaire and focus group discussion. A 10-item CREST was developed based on a model of clinical reasoning. A content validity of 0.93 was obtained from the validation of 15 international experts. The construct validity was supported as the third-year students demonstrated significantly higher clinical reasoning scores than the second-year students. The concurrent validity was also supported with significant positive correlations between global rating scores and almost all subscale scores, and the total scores. The predictive validity was supported with an existing tool. The internal consistency was high with a Cronbach's alpha of 0.92. A high inter-rater reliability was demonstrated with an intraclass correlation coefficient of 0.88. The usability of the tool was rated positively by the nurse educators but the need to ease the scoring process was highlighted. A valid and reliable tool was developed to measure the effectiveness of simulation in developing clinical reasoning skills for recognising and responding to
Psychometric testing of an instrument to measure the experience of home.

Science.gov (United States)

Molony, Sheila L; McDonald, Deborah Dillon; Palmisano-Mills, Christine

2007-10-01

Research related to quality of life in long-term care has been hampered by a paucity of measurement tools sensitive to environmental interventions. The primary aim of this study was to test the psychometric properties of a new instrument, the Experience of Home (EOH) Scale, designed to measure the strength of the experience of meaningful person-environment transaction. The instrument was administered to 200 older adults in diverse dwelling types. Principal components analysis provided support for construct validity, eliciting a three-factor solution accounting for 63.18% of variance in scores. Internal consistency reliability was supported with Cronbach's alpha of .96 for the entire scale. The EOH Scale is a unique research tool to evaluate interventions to improve quality of living in residential environments.
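Cronbach's alpha, reported here and in several other records in this listing, can be computed from a respondents-by-items score table. The following is a minimal sketch using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the toy data are hypothetical and unrelated to the EOH Scale data.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(item_scores: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha; item_scores[r][i] is respondent r's score on item i."""
    k = len(item_scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in item_scores]) for i in range(k)]
    # Sample variance of the summed (total) score.
    total_var = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical toy data: three respondents, two items.
alpha_toy = cronbach_alpha([[1, 2], [2, 1], [3, 3]])

# Perfectly consistent items give alpha = 1.
alpha_parallel = cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]])
```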
Translation, Cross-cultural Adaptation, and Psychometric Properties of the German Version of the Hip Disability and Osteoarthritis Outcome Score

NARCIS (Netherlands)

Blasimann, A.; Dauphinee, S.W.; Staal, J.B.

2014-01-01

Study Design: Clinical measurement. Objectives: To translate and cross-culturally adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) from English into German, and to study its psychometric properties in patients after hip surgery. Background: There is no specific hip questionnaire in
Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

Science.gov (United States)

Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

2016-04-01

The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
A Psychometric Review of Norm-Referenced Tests Used to Assess Phonological Error Patterns

Science.gov (United States)

Kirk, Celia; Vigeland, Laura

2014-01-01

Purpose: The authors provide a review of the psychometric properties of 6 norm-referenced tests designed to measure children's phonological error patterns. Three aspects of the tests' psychometric adequacy were evaluated: the normative sample, reliability, and validity. Method: The specific criteria used for determining the psychometric…
Assessment of minimal hepatic encephalopathy (with emphasis on computerized psychometric tests).

Science.gov (United States)

Kappus, Matthew R; Bajaj, Jasmohan S

2012-02-01

Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life, and driving accidents. The detection of MHE requires specialized testing because it cannot, by definition, be diagnosed on standard clinical examination. Psychometric and neurophysiologic techniques are often used to test for MHE. Paper-pencil psychometric batteries and computerized tests have proved useful in diagnosing MHE and predicting its outcomes. Neurophysiologic tests also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike. Testing strategies depend on the normative data available, patient comfort, and local expertise. Copyright © 2012 Elsevier Inc. All rights reserved.
Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

Science.gov (United States)

Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

2015-12-01

The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.
Psychometric properties including reliability, validity and responsiveness of the Majeed pelvic score in patients with chronic sacroiliac joint pain.

Science.gov (United States)

Bajada, Stefan; Mohanty, Khitish

2016-06-01

The Majeed scoring system is a disease-specific outcome measure that was originally designed to assess pelvic injuries. The aim of this study was to determine the psychometric properties of the Majeed scoring system for chronic sacroiliac joint pain. Internal consistency, content validity, criterion validity, construct validity and responsiveness to change were assessed prospectively for the Majeed scoring system in a cohort of 60 patients diagnosed with sacroiliac joint pain. This diagnosis was confirmed with CT-guided sacroiliac joint anaesthetic block. The overall Majeed score showed acceptable internal consistency (Cronbach alpha = 0.63). Similarly, it showed acceptable floor (0 %) and ceiling (0 %) effects. On the other hand, the domains of pain, work, sitting and sexual intercourse had high (>30 %) floor effects. Significant correlation with the physical component of the Short Form-36 (p = 0.005) and Oswestry disability index (p ≤ 0.001) was found, indicating acceptable criterion validity. The overall Majeed score showed acceptable construct validity with all five developed hypotheses showing significance (p ≤ 0.05). The overall Majeed score showed acceptable responsiveness to change with a large (≥0.80) effect size and standardized response mean. Overall, the Majeed scoring system demonstrated acceptable psychometric properties for outcome assessment in chronic sacroiliac joint pain. Thus, its use in this condition is adequate. However, some domains demonstrated suboptimal performance, indicating that improvement might be achieved with the development of an outcome measure specific for sacroiliac joint dysfunction and degeneration.
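Floor and ceiling effects and the two responsiveness indices named above can be sketched as follows. The definitions used are the common textbook ones (effect size = mean change / SD of baseline scores; standardized response mean = mean change / SD of change scores); the abstract does not spell out its formulas, so this is an assumption, and the function names and toy data are hypothetical.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def floor_ceiling_percent(
    scores: Sequence[float], min_score: float, max_score: float
) -> Tuple[float, float]:
    """Percentage of respondents at the lowest and at the highest possible score."""
    n = len(scores)
    floor = 100.0 * sum(1 for s in scores if s == min_score) / n
    ceiling = 100.0 * sum(1 for s in scores if s == max_score) / n
    return floor, ceiling


def effect_size(baseline: Sequence[float], follow_up: Sequence[float]) -> float:
    """Mean change divided by the standard deviation of baseline scores."""
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(baseline)


def standardized_response_mean(
    baseline: Sequence[float], follow_up: Sequence[float]
) -> float:
    """Mean change divided by the standard deviation of the change scores."""
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Hypothetical toy data on a 0-100 scale.
floor_pct, ceiling_pct = floor_ceiling_percent([0, 50, 100, 100], 0, 100)
pre = [40, 50, 60, 70]
post = [50, 70, 70, 90]
es = effect_size(pre, post)
srm = standardized_response_mean(pre, post)
```

Under the thresholds quoted in the abstract, a floor or ceiling percentage above 30 would be flagged as high, and an effect size or standardized response mean of 0.80 or more would count as large.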
Psychometrics

NARCIS (Netherlands)

Borsboom, D.; Molenaar, D.; Wright, J.D.

2015-01-01

Psychometrics is a scientific discipline concerned with the construction of measurement models for psychological data. In these models, a theoretical construct (e.g., intelligence) is systematically coordinated with observables (e.g., IQ scores). This is often done through latent variable models,
[Development of patient-reported outcome scale for myasthenia gravis: a psychometric test].

Science.gov (United States)

Chen, Xin-lin; Liu, Feng-bin; Guo, Li; Liu, Xiao-bin

2010-02-01

To investigate the scientific soundness of a patient-reported outcome (PRO) scale for myasthenia gravis (MG), which was used to evaluate the clinical effects of traditional Chinese and Western medicine treatment on MG patients, and to evaluate the psychometric performance of the MG-PRO scale. A total of 100 MG patients and 100 healthy people were interviewed face-to-face by well-trained investigators, and the data of the MG-PRO scale were collected. Classical test theory (CTT) and item response theory (IRT) methods were used to analyze psychometric performance, including validity, reliability, person separation index (PSI) and differential item functioning (DIF), of the MG-PRO scale. The results of CTT analysis showed that the split-half reliabilities of the MG-PRO scale and each dimension were greater than 0.7. In the analysis of internal consistency of each dimension, the Cronbach's alpha was greater than 0.8. Each facet had greater correlation with its dimension than with the other dimensions. Four principal components were extracted by exploratory factor analysis, which represented all dimensions of the scale, and the cumulative variance was 55.54%. The scores of each of the 8 facets differed between MG patients and healthy people (P […]) […] definition and connotation of quality of life and contains special issues of MG patients as well, and shows good reliability (split-half reliability, Cronbach's alpha) and validity (content validity, construct validity, discriminant validity) from the results of CTT, and good psychometric performance from the results of IRT.
Applying modern psychometric techniques to melodic discrimination testing: Item response theory, computerised adaptive testing, and automatic item generation.

Science.gov (United States)

Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel

2017-06-15

Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.
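The link between item response theory and computerised adaptive testing can be illustrated with a minimal sketch. It assumes a two-parameter logistic (2PL) model and a maximum-information selection rule; the abstract does not say which model or rule the authors used, and the item bank, parameter values, and function names below are hypothetical.

```python
import math
from typing import List, Set, Tuple

Item = Tuple[float, float]  # (discrimination a, difficulty b)


def p_correct(theta: float, a: float, b: float) -> float:
    """2PL probability of a correct response at ability theta."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta: float, a: float, b: float) -> float:
    """Fisher information of a 2PL item at ability theta: a^2 * p * (1 - p)."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


def pick_next_item(theta: float, item_bank: List[Item], administered: Set[int]) -> int:
    """Index of the unused item that is most informative at the current estimate."""
    candidates = [i for i in range(len(item_bank)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, *item_bank[i]))


# Hypothetical bank: an easy, a medium, and a hard item with equal discrimination.
bank = [(1.0, -2.0), (1.0, 0.0), (1.0, 2.0)]
first = pick_next_item(0.0, bank, administered=set())
second = pick_next_item(1.5, bank, administered={1})
```

A full adaptive test would re-estimate theta after each response and stop once the standard error falls below a target; those steps are omitted here.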
Psychometrics behind Computerized Adaptive Testing.

Science.gov (United States)

Chang, Hua-Hua

2015-03-01

The paper provides a survey of 18 years' progress that my colleagues, students (both former and current) and I made in a prominent research area in psychometrics: Computerized Adaptive Testing (CAT). We start with a historical review of the establishment of a large sample foundation for CAT. It is worth noting that the asymptotic results were derived under the framework of Martingale Theory, a very theoretical perspective of Probability Theory, which may seem unrelated to educational and psychological testing. In addition, we address a number of issues that emerged from large scale implementation and show how theoretical work can help solve these problems. Finally, we propose that CAT technology can be very useful to support individualized instruction on a mass scale. We show that even paper and pencil based tests can be made adaptive to support classroom teaching.
Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

Science.gov (United States)

Jongen, Stefan; Perrier, Joy; Vuurman, Eric F; Ramaekers, Johannes G; Vermeeren, Annemiek

2015-01-01

To assess drug-induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance-impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation. Twenty-four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am) and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am). On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP) of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, the largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT). Large effect sizes were also found in the Divided Attention Test (DAT), the Attention Network Test (ANT), and the test for Useful Field of View (UFOV) at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV. From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.
Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

Directory of Open Access Journals (Sweden)

Stefan Jongen

To assess drug-induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance-impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation. Twenty-four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am) and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am). On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP) of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, the largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT). Large effect sizes were also found in the Divided Attention Test (DAT), the Attention Network Test (ANT), and the test for Useful Field of View (UFOV) at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV. From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.

Future of Psychometrics: Ask What Psychometrics Can Do for Psychology

Science.gov (United States)

Sijtsma, Klaas

2012-01-01

I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…
The 1-min Screening Test for Reading Problems in College Students: Psychometric Properties of the 1-min TIL.

Science.gov (United States)

Fernandes, Tânia; Araújo, Susana; Sucena, Ana; Reis, Alexandra; Castro, São Luís

2017-02-01

Reading is a central cognitive domain, but little research has been devoted to standardized tests for adults. We, thus, examined the psychometric properties of the 1-min version of Teste de Idade de Leitura (Reading Age Test; 1-min TIL), the Portuguese version of Lobrot L3 test, in three experiments with college students: typical readers in Experiment 1A and B, dyslexic readers and chronological age controls in Experiment 2. In Experiment 1A, test-retest reliability and convergent validity were evaluated in 185 students. Reliability was >.70, and phonological decoding underpinned 1-min TIL. In Experiment 1B, internal consistency was assessed by presenting two 45-s versions of the test to 19 students, and performance in these versions was significantly associated (r = .78). In Experiment 2, construct validity, criterion validity and clinical utility of 1-min TIL were investigated. A multiple regression analysis corroborated construct validity; both phonological decoding and listening comprehension were reliable predictors of 1-min TIL scores. Logistic regression and receiver operating characteristics analyses revealed the high accuracy of this test in distinguishing dyslexic from typical readers. Therefore, the 1-min TIL, which assesses reading comprehension and potential reading difficulties in college students, has the necessary psychometric properties to become a useful screening instrument in neuropsychological assessment and research. Copyright © 2017 John Wiley & Sons, Ltd.
[The methodological assessment and qualitative evaluation of psychometric performance tests based on the example of modern tests that assess reading and spelling skills].

Science.gov (United States)

Galuschka, Katharina; Rothe, Josefine; Schulte-Körne, Gerd

2015-09-01

This article looks at a means of objectively evaluating the quality of psychometric tests. This approach enables users to evaluate psychometric tests based on their methodological characteristics, in order to decide which instrument should be used. Reading and spelling assessment tools serve as examples. The paper also provides a review of German psychometric tests for the assessment of reading and spelling skills. This method facilitates the identification of psychometric tests of high methodological quality which can be used for the assessment of reading and spelling skills. Reading performance should ideally be assessed with the following instruments: ELFE 1-6, LGVT 6-12, LESEN 6-7, LESEN 8-9, or WLLP-R. The tests to be used for the evaluation of spelling skills are DERET 1-2+, DERET 3-4+, WRT 1+, WRT 2+, WRT 3+, WRT 4+ or HSP 1-10.
Scoring and psychometric validation of the Perception of Anticoagulant Treatment Questionnaire (PACT-Q©)

Directory of Open Access Journals (Sweden)

Essers B

2009-04-01

Background: The 'Perception of Anti-Coagulant Treatment Questionnaire' (PACT-Q) was developed to assess patients' expectations of, and satisfaction with their anticoagulant treatment. This questionnaire needs to be finalised and psychometrically validated. Methods: The PACT-Q was included in the United States, the Netherlands and France into three phase III multinational clinical trials conducted to evaluate efficacy and safety of a new long-acting anticoagulant drug (idraparinux) compared to vitamin K antagonist (VKA). PACT-Q was administered to patients with deep venous thrombosis (DVT), atrial fibrillation (AF) or pulmonary embolism (PE) at Day 1, to assess patients' expectations, and at 3 and 6 months to assess patients' satisfaction and treatment convenience and burden. The final structure of the PACT-Q (Principal Component Analysis – PCA – with Varimax Rotation) was first determined and its psychometric properties were then measured with validity of the structure (Multitrait analysis), internal consistency reliability (Cronbach's alpha coefficients) and known-group validity. Results: PCA and multitrait analyses showed the multidimensionality of the "Treatment Expectations" dimension, comprising 7 items that had to be scored independently. The "Convenience" and "Burden of Disease and Treatment" dimensions of the hypothesised original structure of the questionnaire were combined, thus resulting in 13 items grouped into the single dimension "Convenience". The "Anticoagulant Treatment Satisfaction" dimension remained unchanged and included 7 items. All items of the "Convenience" and "Anticoagulant Treatment Satisfaction" dimensions displayed good convergent and discriminant validity. The internal consistency reliability was good, with a Cronbach's alpha of 0.84 for the "Convenience" dimension, and 0.76 for the "Anticoagulant Treatment Satisfaction" dimension. Known-group validity was good, especially with regard to occurrence of
Normalization of the Psychometric Hepatic Encephalopathy score ...

African Journals Online (AJOL)

2016-05-09

May 9, 2016 ... influenced by age, education levels, and gender.[5] Till date, the PHES ... and death. MHE also increases the risk of development ... large circles beginning from each row on the left and working to the right. The test score is the ...
Development and psychometric validation of the verbal affective memory test

DEFF Research Database (Denmark)

Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S

2015-01-01

We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non... Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.
The factor structure and psychometric properties of the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test.

Science.gov (United States)

Sanchez-Garcia, Manuel; Extremera, Natalio; Fernandez-Berrocal, Pablo

2016-11-01

This research examined evidence regarding the reliability and validity of scores on the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test, Version 2.0 (MSCEIT; Mayer, Salovey, & Caruso, 2002). In Study 1, we found a close convergence of the Spanish consensus scores and the general and expert consensus scores determined with Mayer, Salovey, Caruso, and Sitarenios (2003) data. The MSCEIT also demonstrated adequate evidence of reliability of test scores as estimated by internal consistency and test-retest correlation after 12 weeks. Confirmatory factor analysis supported a 3-level higher factor model with 8 manifest variables (task scores), 4 first-level factors (corresponding to the 4-branch model of Mayer & Salovey [1997], with 2 tasks for each branch), 2 second-level factors (experiential and strategic areas, with 2 branches for each area), and 1 third-level factor (overall emotional intelligence [EI]), and multigroup analyses supported MSCEIT cross-gender invariance. Study 2 found evidence for the discriminant validity of scores on the MSCEIT subscales, which were differentially related to personality and self-reported EI. Study 3 provided evidence of the incremental validity of scores on the MSCEIT, which added significant variance to the prospective prediction of psychological well-being after controlling for personality traits. The psychometric properties of the Spanish MSCEIT are similar to those of the original English version, supporting its use for assessing emotional abilities in the Spanish population. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Psychometric evaluation of ADAS-Cog and NTB for measuring drug response.

Science.gov (United States)

Karin, A; Hannesdottir, K; Jaeger, J; Annas, P; Segerdahl, M; Karlsson, P; Sjögren, N; von Rosen, T; Miller, F

2014-02-01

To conduct a psychometric analysis to determine the adequacy of instruments that measure cognition in Alzheimer's disease trials. Both the Alzheimer's Disease Assessment Scale - Cognition (ADAS-Cog) and the Neuropsychological Test Battery (NTB) are validated outcome measures for clinical trials in Alzheimer's disease and are approved also for regulatory purposes. However, it is not clear how comparable they are in measuring cognitive function. In fact, many recent trials in Alzheimer's disease patients have failed and it has been questioned if ADAS-Cog still is a sensitive measure. The present paper examines the psychometric properties of ADAS-Cog and NTB, based on a post hoc analysis of data from a clinical trial (NCT01024660), which was conducted by AstraZeneca, in mild-to-moderate Alzheimer's disease (AD) patients, with a Mini Mental State Examination (MMSE) Total score 16-24. Acceptability, reliability, different types of validity and ability to detect change were assessed using relevant statistical methods. Total scores of both tests, as well as separate domains of both tests, including the Wechsler Memory Scale (WMS), Rey Auditory Verbal Learning Test (RAVLT) and Delis-Kaplan Executive Function System (D-KEFS) Verbal Fluency Condition, were analyzed. Overall, NTB performed well, with acceptable reliability and ability to detect change, while ADAS-Cog had insufficient psychometric properties, including ceiling effects in 8 out of a total of 11 ADAS-Cog items in mild AD patients, as well as low test-retest reliability in some of the items. Based on a direct comparison on the same patient sample, we see advantages of the NTB compared with the ADAS-Cog for the evaluation of cognitive function in the population of mild-to-moderate AD patients. The results suggest that not all of ADAS-Cog items are relevant for both mild and moderate AD population. This validation study demonstrates satisfactory psychometric properties of the NTB, while ADAS-Cog was found to be
Health Belief Model Scale for Human Papilloma Virus and its Vaccination: Adaptation and Psychometric Testing.

Science.gov (United States)

Guvenc, Gulten; Seven, Memnun; Akyuz, Aygul

2016-06-01

To adapt and psychometrically test the Health Belief Model Scale for Human Papilloma Virus (HPV) and Its Vaccination (HBMS-HPVV) for use in a Turkish population and to assess the Human Papilloma Virus Knowledge score (HPV-KS) among female college students. Instrument adaptation and psychometric testing study. The sample consisted of 302 nursing students at a nursing school in Turkey between April and May 2013. Questionnaire-based data were collected from the participants. Information regarding HBMS-HPVV and HPV knowledge and descriptive characteristics of participants was collected using translated HBMS-HPVV and HPV-KS. Test-retest reliability was evaluated, Cronbach α was used to assess internal consistency reliability, and exploratory factor analysis was used to assess construct validity of the HBMS-HPVV. The scale consists of 4 subscales that measure 4 constructs of the Health Belief Model covering the perceived susceptibility and severity of HPV and the benefits and barriers. The final 14-item scale had satisfactory validity and internal consistency. Cronbach α values for the 4 subscales ranged from 0.71 to 0.78. Total HPV-KS ranged from 0 to 8 (scale range, 0-10; 3.80 ± 2.12). The HBMS-HPVV is a valid and reliable instrument for measuring young Turkish women's beliefs and attitudes about HPV and its vaccination. Copyright © 2015 North American Society for Pediatric and Adolescent Gynecology. Published by Elsevier Inc. All rights reserved.
An instrument assessing satisfaction with iron chelation therapy: Psychometric testing from an open-label clinical trial.

Science.gov (United States)

Rofail, Diana; Viala, Muriel; Gater, Adam; Abetz-Webb, Linda; Baladi, Jean-Francois; Cappellini, Maria Domenica

2010-08-01

The Satisfaction with Iron Chelation Therapy (SICT) instrument was developed based on a literature review, in-depth patient and clinician interviews, and cognitive debriefing interviews. An open-label, single-arm, multicenter trial evaluating the efficacy and safety of deferasirox in patients diagnosed with transfusion-dependent iron overload provided an opportunity to assess the psychometric measurement properties of the instrument. Psychometric analyses were performed using data at baseline from 273 patients with a range of transfusion-dependent iron overload conditions who were participating in a multinational study. Responsiveness was further evaluated for all patients who also had subsequent satisfaction domain scores collected at week 4. Baseline SICT domain scores had acceptable floor and ceiling effects and internal consistency reliability (Cronbach's alpha: 0.75-0.85). Item discriminant and item convergent validity were both excellent, although one item in each analysis did not meet the specified criterion. Small to moderate correlations were observed between SICT and Short Form 36 Health Survey (SF-36) domain scores. Patients with the highest levels of serum ferritin at baseline (>3100 ng/mL) were the least satisfied about the Perceived Effectiveness of ICT and vice versa. Satisfaction improved in all patients, although there were no clear differences observed between groups of patients defined according to changes in serum ferritin levels from baseline to week 4 (stable, improved, or worsened). The SICT domains are reliable and valid. Further testing using a more specific criterion (such as assessing patient global ratings of change in satisfaction domains that correspond to the SICT domains) could help to establish with greater confidence the responsiveness of the instrument.
Development and psychometric pilot-testing of a questionnaire for the evaluation of satisfaction with continuing education in infection control nurses.

Science.gov (United States)

Meng, Michael; Peter, Daniel; Mattner, Frauke; Igel, Christoph; Kugler, Christiane

2018-05-16

Satisfaction with continuing education can be defined as positive attitudes towards educational programs, which has potential to strengthen learning outcomes. A multi-dimensional construct may enhance continuing education program evaluation processes. The objective is to describe the development and psychometric testing of the 'affective - behavioral - cognitive - satisfaction questionnaire' (ABC-SAT) for assessing participants' satisfaction with a continuing education program for nurses in infection control. The multi-staged development resulted in a satisfaction questionnaire comprising three subscales. The pilot tool was administered to a nationwide sample of 126 infection control nurses to assess satisfaction after participating in a continuing education program. Satisfaction scores were calculated and psychometric testing was performed to determine reliability, using Cronbach's alpha, face validity, objectivity, and economy. A principal component analysis using varimax rotation and Kaiser normalization was performed. The analysis led to a three-factor solution of the questionnaire with 11 items, explaining 61.4% of the variance. Internal consistency of the three scales using Cronbach's alpha was 0.83, 0.60, and 0.66, respectively. Selectivity coefficients varied between 0.39 and 0.70. Participants needed approximately three minutes to complete the questionnaire. Initial findings point to a satisfactory scale structure and internal consistency of the 3-dimensional ABC-SAT questionnaire. Further research is required to confirm the questionnaire's psychometric properties. Copyright © 2018 Elsevier Ltd. All rights reserved.
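Several records in this listing report internal consistency as Cronbach's alpha. As a reading aid, the conventional formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores), can be sketched in Python as below. The function name `cronbach_alpha` and the response data are hypothetical illustrations; none of the listed studies publish their item-level data or analysis code here.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha from item scores.

    `items` is a list of k lists, one per item; each inner list holds
    that item's scores for the same respondents, in the same order.
    """
    k = len(items)
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Total score per respondent (sum across the k items).
    totals = [sum(scores) for scores in zip(*items)]
    # Sum of the sample variances of the individual items.
    item_variance_sum = sum(variance(item) for item in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


# Hypothetical responses: 3 items answered by 5 respondents (made-up data).
example_items = [
    [4, 3, 5, 2, 4],
    [5, 3, 4, 2, 5],
    [4, 2, 5, 3, 4],
]
print(cronbach_alpha(example_items))
```

Alpha rises with both the number of items and the average inter-item correlation, which is one reason short subscales often show lower values than long ones.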
Clock Drawing Test and the diagnosis of amnestic mild cognitive impairment: can more detailed scoring systems do the work?

Science.gov (United States)

Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin

2014-01-01

The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.
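The sensitivity and specificity figures in the record above can be reproduced from simple counts. The sketch below is a reading aid, not the authors' analysis: the cell counts are inferred from the reported percentages on the assumption that all 48 patients and all 48 controls had valid scores, and the function names are hypothetical.

```python
def sensitivity(true_pos, false_neg):
    """Proportion of actual cases that the test flags as positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Proportion of actual non-cases that the test flags as negative."""
    return true_neg / (true_neg + false_pos)


# Inferred counts (assumption): 48 patients and 48 controls per group.
# 42 of 48 patients detected gives the reported sensitivity of 87.5%.
print(sensitivity(42, 6))    # 0.875
# 15 of 48 controls correctly cleared matches 31.3% for the MMSE.
print(specificity(15, 33))   # 0.3125
# 6 of 48 controls correctly cleared matches 12.5% for the 17-point CDT scale.
print(specificity(6, 42))    # 0.125
```

Under these assumed counts, 33 of 48 controls (MMSE) and 42 of 48 controls (17-point scale) would be false positives, which is consistent with the abstract's remark about high false-positive rates.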
The Free and Cued Selective Reminding Test: evidence of psychometric adequacy

Directory of Open Access Journals (Sweden)

KATJA OCEPEK-WELIKSON

2009-09-01

These analyses examine the psychometric properties of the Free and Cued Selective Reminding Test with Immediate Recall (FCSRT-IR). FCSRT-IR is a measure of memory under conditions that control attention and cognitive processing in order to obtain an assessment of memory unconfounded by normal age-related changes in cognition. FCSRT-IR performance has been associated with preclinical and early dementia in several longitudinal epidemiological studies. Factor and item response theory analyses were applied to FCSRT-IR data from patients at a geriatric primary care center who had independently established clinical diagnoses. The results provide supporting evidence for the psychometric adequacy of the FCSRT-IR in terms of reliability, essential (sufficient) unidimensionality, information across the continuum of memory disability/ability, and classification accuracy. The psychometric adequacy of the FCSRT-IR adds further validity to its use as a case finding strategy for dementia.
Enhancing rigour in the validation of patient reported outcome measures (PROMs): bridging linguistic and psychometric testing

Directory of Open Access Journals (Sweden)

Roberts Gwerfyl

2012-06-01

Background: A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs) and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings: Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II) and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115) and patients with depression (n = 37) concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96); item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94) and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002) and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions: Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.
Effects of common chronic medical conditions on psychometric tests used to diagnose minimal hepatic encephalopathy

DEFF Research Database (Denmark)

Lauridsen, M M; Poulsen, L; Rasmussen, C K

2016-01-01

Many chronic medical conditions are accompanied by cognitive disturbances but these have only to a very limited extent been psychometrically quantified. An exception is liver cirrhosis where hepatic encephalopathy is an inherent risk and mild forms are diagnosed by psychometric tests. The preferred...... diagnostic test battery in cirrhosis is often the Continuous Reaction Time (CRT) and the Portosystemic Encephalopathy (PSE) tests but the effect on these of other medical conditions is not known. We aimed to examine the effects of common chronic (non-cirrhosis) medical conditions on the CRT and PSE tests. We...
Psychometric validation of the Persian Bergen Social Media Addiction Scale using classic test theory and Rasch models.

Science.gov (United States)

Lin, Chung-Ying; Broström, Anders; Nilsen, Per; Griffiths, Mark D; Pakpour, Amir H

2017-12-01

Background and aims: The Bergen Social Media Addiction Scale (BSMAS), a six-item self-report scale, is a brief and effective psychometric instrument for assessing at-risk social media addiction on the Internet. However, its psychometric properties in Persian have never been examined and no studies have applied Rasch analysis for the psychometric testing. This study aimed to verify the construct validity of the Persian BSMAS using confirmatory factor analysis (CFA) and Rasch models among 2,676 Iranian adolescents. Methods: In addition to construct validity, measurement invariance in CFA and differential item functioning (DIF) in Rasch analysis across gender were tested for in the Persian BSMAS. Results: Both CFA [comparative fit index (CFI) = 0.993; Tucker-Lewis index (TLI) = 0.989; root mean square error of approximation (RMSEA) = 0.057; standardized root mean square residual (SRMR) = 0.039] and Rasch (infit MnSq = 0.88-1.28; outfit MnSq = 0.86-1.22) confirmed the unidimensionality of the BSMAS. Moreover, measurement invariance was supported in multigroup CFA including metric invariance (ΔCFI = -0.001; ΔSRMR = 0.003; ΔRMSEA = -0.005) and scalar invariance (ΔCFI = -0.002; ΔSRMR = 0.005; ΔRMSEA = 0.001) across gender. No item displayed DIF (DIF contrast = -0.48 to 0.24) in Rasch across gender. Conclusions: Given that the Persian BSMAS was unidimensional, it is concluded that the instrument can be used to assess how an adolescent is addicted to social media on the Internet. Moreover, users of the instrument may comfortably compare the sum scores of the BSMAS across gender.
Testing the psychometric properties of a Chinese version of the level of expressed emotion scale.

Science.gov (United States)

Chien, Wai Tong; Chan, Zenobia Chung-Yee; Chan, Sally Wai-Chi

2014-01-01

This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at a 2-week interval of the Chinese version were examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76-0.95 and ICC = 0.81-0.92), test-retest reliability (r = 0.89-0.95, P < 0.01), and internal consistency (Cronbach's α = 0.86-0.92). Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement), accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ²/df = 1.93, P = 0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78) to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = -0.54) and family burden (r = 0.49) and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90). The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P < 0.01). The Chinese version demonstrates sound psychometric properties to measure families' expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
Methods for Examining the Psychometric Quality of Subscores: A Review and Application

Science.gov (United States)

Wedman, Jonathan; Lyrén, Per-Erik

2015-01-01

When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…
What Do Test Scores Really Mean? A Latent Class Analysis of Danish Test Score Performance

DEFF Research Database (Denmark)

Munk, Martin D.; McIntosh, James

2014-01-01

Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores...... of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture and possible incentive problems make it more difficult to understand what the tests measure....
The Alliance Negotiation Scale: A psychometric investigation.

Science.gov (United States)

Doran, Jennifer M; Safran, Jeremy D; Muran, J Christopher

2016-08-01

This study investigates the utility and psychometric properties of a new measure of psychotherapy process, the Alliance Negotiation Scale (ANS; Doran, Safran, Waizmann, Bolger, & Muran, 2012). The ANS was designed to operationalize the theoretical construct of negotiation (Safran & Muran, 2000), and to extend our current understanding of the working alliance concept (Bordin, 1979). The ANS was also intended to improve upon existing measures such as the Working Alliance Inventory (WAI; Horvath & Greenberg, 1986, 1989) and its short form (WAI-S; Tracey & Kokotovic, 1989) by expanding the emphasis on negative therapy process. The present study investigates the psychometric validity of the ANS test scores and interpretation, including confirming its original factor structure and evaluating its internal consistency and construct validity. Construct validity was examined through the ANS' convergence and divergence with several existing scales that measure theoretically related constructs. The results bolster and extend previous findings about the psychometric integrity of the ANS, and begin to illuminate the relationship between negotiation and other important variables in psychotherapy research. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

Development and psychometric properties of the Basic Amputee Mobility Score for use in patients with a major lower extremity amputation

DEFF Research Database (Denmark)

Kristensen, Morten Tange; Nielsen, Anni Østergaard; Topp, Ulla Madsen

2018-01-01

AIM: To develop and examine the psychometric properties, including responsiveness and interrater reliability, of a new outcome measure for the evaluation of basic mobility activities after a major lower extremity amputation - The Basic Amputee Mobility Score (BAMS). METHODS: The four following es...... a large responsiveness, excellent interrater reliability and with a change of 1 point indicating a real change in performances. Geriatr Gerontol Int 2017; ••: ••-••....
Disentangling Gratitude: A Theoretical and Psychometric Examination of the Gratitude Resentment and Appreciation Test-Revised Short (GRAT-RS).

Science.gov (United States)

Hammer, Joseph H; Brenner, Rachel E

2017-07-14

This study extended our theoretical and applied understanding of gratitude through a psychometric examination of the most popular multidimensional measure of gratitude, the Gratitude, Resentment, and Appreciation Test-Revised Short form (GRAT-RS). Namely, the dimensionality of the GRAT-RS, the model-based reliability of the GRAT-RS total score and 3 subscale scores, and the incremental evidence of validity for its latent factors were assessed. Dimensionality measures (e.g., explained common variance) and confirmatory factor analysis results with 426 community adults indicated that the GRAT-RS conformed to a multidimensional (bifactor) structure. Model-based reliability measures (e.g., omega hierarchical) provided support for the future use of the Lack of a Sense of Deprivation raw subscale score, but not for the raw GRAT-RS total score, Simple Appreciation subscale score, or Appreciation of Others subscale score. Structural equation modeling results indicated that only the general gratitude factor and the lack of a sense of deprivation specific factor accounted for significant variance in life satisfaction, positive affect, and distress. These findings support the 3 pillars of gratitude conceptualization of gratitude over competing conceptualizations, the position that the specific forms of gratitude are theoretically distinct, and the argument that appreciation is distinct from the superordinate construct of gratitude.
Psychometric Properties of the Persian Adaptation of Mini-Cog Test in Iranian Older Adults.

Science.gov (United States)

Rezaei, Mohammad; Rashedi, Vahid; Lotfi, Gohar; Shirinbayan, Peymaneh; Foroughan, Mahshid

2018-04-01

The aim of this study was to assess the psychometric properties of the Mini-Cog in Iranian older adults. It was a cross-sectional study; 50 older people with dementia and 50 without dementia, matched for age, gender, and education, entered the study. The Diagnostic and Statistical Manual of Mental Disorders criteria for dementia were used as the gold standard. A battery of scales including the abbreviated mental test score (AMTS), the Geriatric Depression Scale, and the Mini-Cog was administered. Validity and reliability of the Mini-Cog were determined using the Pearson product-moment correlation coefficient (Pearson's r), Cronbach's alpha, and Receiver Operating Characteristic (ROC) curve analysis. The Persian version of Mini-Cog showed a good inter-rater reliability ( K = 0.76, p Mini-Cog have an acceptable sensitivity, specificity, and substantial overall agreement with the AMTS.
The Psychometric Properties of English and Spanish Versions of the Life Orientation Test-Revised in Hispanic Americans.

Science.gov (United States)

Pan, Tonya M; Mills, Sarah D; Fox, Rina S; Baik, Sharon H; Harry, Kadie M; Roesch, Scott C; Sadler, Georgia Robins; Malcarne, Vanessa L

2017-12-01

The Life Orientation Test-Revised (LOT-R) is a widely used measure of optimism and pessimism, with three positively worded and three negatively worded content items. This study examined the structural validity and invariance, internal consistency reliability, and convergent and divergent validity of the English and Spanish versions of the LOT-R among Hispanic Americans. A community sample of Hispanic Americans (N = 422) completed self-report measures, including the LOT-R, Patient Health Questionnaire-9, and Generalized Anxiety Disorder-7, in their preferred language of English or Spanish. Based on the literature, four structural models were tested: one-factor, oblique two-factor, orthogonal two-factor method effects with positive specific factor, and orthogonal two-factor method effects with negative specific factor. Baseline support for both of the English and Spanish versions was not achieved for any model; in all models, the negatively worded items in Spanish had non-significant factor loadings. Therefore, the positively worded three-item optimism subscale of the LOT-R was examined separately and fit the data, with factor loadings equivalent across language-preference groups. Coefficient alphas for the optimism subscale were consistent across both language-preference groups (αs = .61 [English] and .66 [Spanish]). In contrast, the six-item total score and three-item pessimism subscale demonstrated extremely low or inconsistent alphas. Convergent and divergent validity were established for the optimism subscale in both languages. In sum, the optimism subscale of the LOT-R demonstrated minimally acceptable to good psychometric properties across English and Spanish language-preference groups. However, neither the total score nor the pessimism subscale showed adequate psychometric properties for Spanish-speaking Hispanic Americans, likely due to translation and cultural adaptation issues, and thus are not supported for use with this population.
Testing the Psychometric Properties of a Chinese Version of the Level of Expressed Emotion Scale

Directory of Open Access Journals (Sweden)

Wai Tong Chien

2014-01-01

This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at a 2-week interval of the Chinese version were examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76–0.95 and ICC = 0.81–0.92), test-retest reliability (r = 0.89–0.95, P<0.01), and internal consistency (Cronbach’s α = 0.86–0.92). Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement), accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ2/df = 1.93, P=0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78) to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = −0.54) and family burden (r = 0.49) and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90). The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P<0.01). The Chinese version demonstrates sound psychometric properties to measure families’ expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
[Can Psychometric Tests Predict Success in the Selection Interview for Medical School? A Cross-Sectional Study at One German Medical School].

Science.gov (United States)

Kötter, T; Obst, K U; Brüheim, L; Eisemann, N; Voltmer, E; Katalinic, A

2017-07-01

Background: The final exam grade is the main selection criterion for medical school application in Germany. For academic success, it seems to be a reliable predictor. Its use as the only selection criterion is, however, criticised. At some universities, personal interviews are part of the selection process. However, these are very time consuming and are of doubtful validity. The (additional) use of appropriate psychometric instruments could reduce the cost and increase the validity. This study investigates the extent to which psychometric instruments can predict the outcome of a personal selection interview. Methods: This is a cross-sectional study on the correlation of the results of psychometric instruments with those of the personal selection interview as part of the application process. As the outcome, the score of the selection interview was used. The NEO Five-Factor Inventory, the Hospital Anxiety and Depression Scale (HADS) and the questionnaire to identify work-related behaviour and experience patterns (AVEM) were used as psychometric instruments. Results: There was a statistically significant correlation with the results of the personal selection interview for the sum score of the depression scale from the HADS and the sum score for the dimension of life satisfaction of the AVEM. In addition, those participants who had not previously completed application training achieved a better result in the selection interview. Conclusion: The instruments used measure different aspects than the interviews and cannot replace them. It remains to be seen whether the selected parameters are able to predict academic success. © Georg Thieme Verlag KG Stuttgart · New York.
Do Test Scores Buy Happiness?

Science.gov (United States)

McCluskey, Neal

2017-01-01

Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…
A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.

Science.gov (United States)

Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael

2014-01-01

This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.
Psychometric properties of the Plutchik's EPI test (Emotions Profile Index)

Directory of Open Access Journals (Sweden)

Ana Trebovc

2005-04-01

The authors report a study on the psychometric properties of Plutchik's test, called the Emotions Profile Index (EPI). A new Slovene translation and adaptation of the English version of the test, consisting of combinations (pairs) of 12 words reflecting eight different emotional conditions, was prepared and compared to the old one. Both versions, as well as the Big Five Questionnaire (BFQ), were administered to a sample of 239 participants. Different statistical analyses were performed to examine the psychometric features of both versions of the EPI. Discriminative power was tested by cluster analysis and analysis of frequency distributions; reliability was studied via an internal consistency index and the correlation between the two versions; and validity was examined by correlating EPI dimensions with BFQ dimensions and subdimensions, by comparing profiles of groups on both versions of the EPI and the BFQ, and by fitting the theoretical model proposed by Plutchik to the data. The discriminative power of the EPI seems to be affected by avoiding (not choosing) the socially desirable expressions in the test, and parallel reliability seems to be susceptible to the use of different words (expressions) in the new version of the EPI that have the same meaning as words in the old version. Dimensions expected to reflect similar constructs in the BFQ and EPI do not correlate satisfactorily. Data gathered with the EPI cannot be fully explained by the model proposed by Plutchik's theory.
Psychometric properties of the Satisfaction With Life Scale in Parkinson's disease.

Science.gov (United States)

Rosengren, L; Jonasson, S B; Brogårdh, C; Lexell, J

2015-09-01

The Satisfaction With Life Scale (SWLS) is a global measure of life satisfaction (LS). The objective of this study was to evaluate the psychometric properties (data completeness, scaling assumptions, targeting and reliability) of the SWLS in a sample of people with Parkinson's disease (PD). A postal survey including a Swedish version of the SWLS and demographic information was administered to 174 persons with PD; 97 responded and received a second survey after 2 weeks. The mean (SD) age and PD duration of the 97 responders were 73 (8) and 7 (6) years, respectively. Data completeness was 92% to 97% for the five items in the SWLS and 92% for the total score (5-35 points). The mean score of the SWLS was 24.2 points (7.7), indicating that this group had an average LS. The items' means and SDs were roughly parallel and the score distribution was even. The internal consistency reliability (Cronbach's alpha) was 0.90. The test-retest reliability, assessed by the intraclass correlation coefficient, was 0.78. The scale showed no systematic difference between the first and second response. The standard error of measurement was 3.6 points, and the smallest detectable difference was 10.0 points. This evaluation of the psychometric properties of the SWLS shows that the scale has good data completeness, scaling assumptions and targeting and that the internal consistency reliability and the test-retest reliability are acceptable. Thus, the SWLS is a psychometrically sound and suitable tool to assess LS in people with PD. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
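The abstract above does not say how the standard error of measurement (SEM) and the smallest detectable difference (SDD) were derived. The following is a minimal sketch, assuming the commonly used conventions SEM = SD × sqrt(1 − ICC) and SDD = 1.96 × sqrt(2) × SEM, applied to the reported SD (7.7) and ICC (0.78). The function names are hypothetical and the choice of formulas is an assumption, not the authors' stated method.

```python
import math


def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """SEM under the conventional formula SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1.0 - reliability)


def smallest_detectable_difference(sem: float, z: float = 1.96) -> float:
    """SDD at the 95% level: z * sqrt(2) * SEM (two measurements, each with error)."""
    return z * math.sqrt(2.0) * sem


# Values reported in the abstract: SD of the total score and test-retest ICC.
sem = standard_error_of_measurement(sd=7.7, reliability=0.78)
sdd = smallest_detectable_difference(sem)

print(round(sem, 1), round(sdd, 1))  # prints: 3.6 10.0
```

Under these assumed formulas the output agrees with the reported 3.6 and 10.0 points, which suggests, but does not confirm, that the study used the same conventions.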
Questionnaire-based assessment of executive functioning: Psychometrics.

Science.gov (United States)

Castellanos, Irina; Kronenberger, William G; Pisoni, David B

2018-01-01

The psychometric properties of the Learning, Executive, and Attention Functioning (LEAF) scale were investigated in an outpatient clinical pediatric sample. As a part of clinical testing, the LEAF scale, which broadly measures neuropsychological abilities related to executive functioning and learning, was administered to parents of 118 children and adolescents referred for psychological testing at a pediatric psychology clinic; 85 teachers also completed LEAF scales to assess reliability across different raters and settings. Scores on neuropsychological tests of executive functioning and academic achievement were abstracted from charts. Psychometric analyses of the LEAF scale demonstrated satisfactory internal consistency, parent-teacher inter-rater reliability in the small to large effect size range, and test-retest reliability in the large effect size range, similar to values for other executive functioning checklists. Correlations between corresponding subscales on the LEAF and other behavior checklists were large, while most correlations with neuropsychological tests of executive functioning and achievement were significant but in the small to medium range. Results support the utility of the LEAF as a reliable and valid questionnaire-based assessment of delays and disturbances in executive functioning and learning. Applications and advantages of the LEAF and other questionnaire measures of executive functioning in clinical neuropsychology settings are discussed.
A call for policy guidance on psychometric testing in doping control in sport.

Science.gov (United States)

Petróczi, Andrea; Backhouse, Susan H; Barkoukis, Vassilis; Brand, Ralf; Elbe, Anne-Marie; Lazuras, Lambros; Lucidi, Fabio

2015-11-01

One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping) into organised intelligence to protect clean sport. Beyond the foreseeable consequences of a positive identification as a doping user, this task is further complicated by the discrepancy between what constitutes a doping offence in the World Anti-Doping Code and how it is operationalized in doping research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. In both research and forensic settings, we must acknowledge that (1) socially desirable responding confounds self-reported psychometric test results and (2) the cognitive complexity surrounding test performance means that the response-time based measures and the lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals; nor can these psychometric measures be used for individual diagnostics. In this paper, we present a position statement calling for policy guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that, to date, both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better understanding of doping behaviour, not to establish evidence for doping. A false 'positive' psychological profile for doping affects not only the individual
Propriedades psicométricas apresentadas em manuais de testes de inteligência / Psychometric parameters in intelligence test directions

Directory of Open Access Journals (Sweden)

Ana Paula Porto Noronha

2003-06-01

This research aimed to verify the psychometric parameters presented in the manuals of 19 intelligence tests. The psychometric properties included in the analysis were item analysis, validity, reliability, and norming studies. The results indicated that 89.5% of the 19 tests presented norming studies. The sample selection procedure was mostly non-random (62.2% of the tests). Construct validity was the most frequent type of validity among the tests (94.7%). All tests presented reliability studies, most of them using the internal consistency method (78.9%). It is concluded that, although the authors agree that all tests need studies to verify psychometric parameters and to obtain regional norms, this practice is not yet fully widespread in Brazilian psychological assessment.
Predicting occupational personality test scores.

Science.gov (United States)

Furnham, A; Drakeley, R

2000-01-01

The relationship between students' actual test scores and their self-estimated scores on the Hogan Personality Inventory (HPI; R. Hogan & J. Hogan, 1992), an omnibus personality questionnaire, was examined. Despite being given descriptive statistics and explanations of each of the dimensions measured, the students tended to overestimate their scores; yet all correlations between actual and estimated scores were positive and significant. Correlations between self-estimates and actual test scores were highest for sociability, ambition, and adjustment (r = .62 to r = .67). The results are discussed in terms of employers' use and abuse of personality assessment for job recruitment.
The Development and Psychometric Properties of the Immigration Law Concerns Scale (ILCS) for HIV Testing.

Science.gov (United States)

Lechuga, Julia; Galletly, Carol L; Broaddus, Michelle R; Dickson-Gomez, Julia B; Glasman, Laura R; McAuliffe, Timothy L; Vega, Miriam Y; LeGrand, Sarah; Mena, Carla A; Barlow, Morgan L; Valera, Erik; Montenegro, Judith I

2017-11-08

To develop, pilot test, and conduct psychometric analyses of an innovative scale measuring the influence of perceived immigration laws on Latino migrants' HIV-testing behavior. The Immigration Law Concerns Scale (ILCS) was developed in three phases: Phase 1 involved a review of law and literature, generation of scale items, consultation with project advisors, and subsequent revision of the scale. Phase 2 involved systematic translation/back-translation and consensus-based editorial processes conducted by members of a bilingual and multi-national study team. In Phase 3, 339 sexually active, HIV-negative, Spanish-speaking, non-citizen Latino migrant adults (both documented and undocumented) completed the scale via audio computer-assisted self-interview. The psychometric properties of the scale were tested with exploratory factor analysis, and estimates of reliability coefficients were generated. Bivariate correlations were conducted to test the discriminant and predictive validity of identified factors. Exploratory factor analysis revealed a three-factor, 17-item scale. Subscale reliability ranged from 0.72 to 0.79. There were significant associations between the ILCS and the HIV-testing behaviors of participants. Results of the pilot test and psychometric analysis of the ILCS are promising. The scale is reliable and significantly associated with the HIV-testing behaviors of participants. Subscales related to unwanted government attention and concerns about meeting moral character requirements should be refined.
Do medical students’ scores using different assessment instruments predict their scores in clinical reasoning using a computer-based simulation?

Directory of Open Access Journals (Sweden)

Fida M

2015-02-01

Mariam Fida (Department of Molecular Medicine, College of Medicine and Medical Sciences, Arabian Gulf University, Manama, Bahrain); Salah Eldin Kassab (Department of Medical Education, Faculty of Medicine, Suez Canal University, Ismailia, Egypt). Purpose: The development of clinical problem-solving skills evolves over time and requires structured training and background knowledge. Computer-based case simulations (CCS) have been used for teaching and assessment of clinical reasoning skills. However, previous studies examining the psychometric properties of CCS as an assessment tool have been controversial. Furthermore, studies reporting the integration of CCS into problem-based medical curricula have been limited. Methods: This study examined the psychometric properties of using CCS software (DxR Clinician) for assessment of medical students (n=130) studying in a problem-based, integrated multisystem module (Unit IX) during the academic year 2011–2012. Internal consistency reliability of CCS scores was calculated using Cronbach's alpha statistics. The relationships between students' scores in CCS components (clinical reasoning, diagnostic performance, and patient management) and their scores in other examination tools at the end of the unit, including multiple-choice questions, short-answer questions, objective structured clinical examination (OSCE), and real patient encounters, were analyzed using stepwise hierarchical linear regression. Results: Internal consistency reliability of CCS scores was high (α=0.862). Inter-item correlations between students' scores in different CCS components and their scores in CCS and other test items were statistically significant. Regression analysis indicated that OSCE scores predicted 32.7% and 35.1% of the variance in clinical reasoning and patient management scores, respectively (P<0.01). Multiple-choice question scores, however, predicted only 15.4% of the variance in diagnostic performance scores (P<0.01), while
Exploring a Source of Uneven Score Equity across the Test Score Range

Science.gov (United States)

Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D.

2018-01-01

Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…
Cross-cultural adaptation and psychometric analysis of the Arabic version of the oxford knee score in adult male with knee osteoarthritis.

Science.gov (United States)

Alghadir, Ahmad H; Al-Eisa, Einas S; Anwer, Shahnawaz

2017-05-15

A variety of self-assessment questionnaires are used for the evaluation of pain, functional disability, and health-related quality of life in individuals with knee osteoarthritis (OA). The present study aimed to adapt and translate the Oxford Knee Score into Arabic and to investigate its psychometric properties in adult males with knee OA. Ninety-seven adult males (mean age 57.55 ± 11.49 years) with knee OA participated. Patients were asked to complete the adapted Arabic version of the Oxford Knee Score (OKS-Ar), the reduced Western Ontario and McMaster Universities Index (WOMAC), and the visual analogue scale (VAS). Patients were asked to complete a second OKS-Ar form at least 1 week later to assess the reproducibility of the score. The OKS was adapted and translated into Arabic by two independent native Arabic speakers (one a rehabilitation professional with experience of knee OA patients and the other a trained translator) according to international guidelines. All participants completed the second OKS-Ar form (response rate 100%). Reliability and internal consistency were high, with an ICC of 0.97 and a Cronbach's alpha coefficient of 0.987, respectively. A significant relationship between the OKS-Ar and the WOMAC and VAS scores confirmed the construct validity (p < 0.001). The standard error of measurement (SEM) and the minimum detectable change (MDC) were 2.2 and 6.2, respectively. The adapted Arabic version of the OKS demonstrated acceptable psychometric properties, including reliability, internal consistency, and validity. The present study indicates that the OKS-Ar is a suitable questionnaire to measure pain and physical function in Arabic-speaking adult male patients with knee OA.
Psychometric properties of the NOMO 1.0 tested among adult powered-mobility users

DEFF Research Database (Denmark)

Sund, Terje; Brandt, Åse; Anttila, Heidi

2017-01-01

(Participation Repertoire). PURPOSE: This study aimed to investigate a range of psychometric properties of the NOMO 1.0 in a sample of adult powered mobility device (PMD) users. METHOD: Data collected from PMD users ( N = 248) in Denmark, Finland, and Norway as part of a larger study were analyzed using state...... scale and six components of the Frequency scale. IMPLICATIONS: The NOMO 1.0 should be used for research purposes and not for clinical practice. Better reliability should be established for the Need for Assistance and Ease/Difficulty scales prior to further psychometric testing to establish the validity...
Psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C) for patients with alcoholism.

Science.gov (United States)

Hsueh, Yu-Jung; Chu, Hsin; Huang, Chang-Chih; Ou, Keng-Liang; Chen, Chiung-Hua; Chou, Kuei-Ru

2014-04-01

The aim of this study was to examine the psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C). The sensitivity, specificity, and positive and negative predictive values for the MAST-C were examined in this study. The MAST-C had an internal consistency of 0.83 and a test-retest reliability of 0.89. It had a good content validity index of 0.92. Factor analysis identified four factors and the optimal cutoff point for the MAST-C was a score of 6/7, which yielded a sensitivity of 0.92, a specificity of 0.83, a positive predictive value of 0.92, and a negative predictive value of 0.83. The MAST-C provides a fast, accurate, and sensitive method for clinically diagnosing alcoholism and clinical management. © 2013 Wiley Periodicals, Inc.
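The four screening statistics reported for the MAST-C cutoff all come from one 2×2 table of test result against true diagnosis. The sketch below shows that calculation. The abstract does not report the underlying counts, so the counts used here are hypothetical, chosen only so that the output matches the reported figures; the function name is likewise an assumption.

```python
def screening_metrics(tp: int, fn: int, fp: int, tn: int) -> dict:
    """Compute the four standard screening statistics from a 2x2 table.

    tp/fn: cases with the condition that screen positive/negative.
    fp/tn: cases without the condition that screen positive/negative.
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases detected
        "specificity": tn / (tn + fp),  # share of non-cases correctly cleared
        "ppv": tp / (tp + fp),          # share of positives that are true cases
        "npv": tn / (tn + fn),          # share of negatives that are non-cases
    }


# Hypothetical counts (NOT from the study), picked to reproduce the reported values.
metrics = screening_metrics(tp=92, fn=8, fp=8, tn=39)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

With these illustrative counts the false positives equal the false negatives, which is one way sensitivity can coincide with the positive predictive value (0.92) and specificity with the negative predictive value (0.83), as in the reported results. Unlike sensitivity and specificity, the predictive values depend on how common the condition is in the sample.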

Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

Science.gov (United States)

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to take part in the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument, with a mean intraclass correlation coefficient of 0.90. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1, with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patients' knowledge that is ready to be used in clinical practice. PMID:27995149
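The abstract reports item-level and scale-level content validity indices without describing the calculation. A minimal sketch of the usual approach follows, assuming experts rate each item on a 4-point relevance scale, that a rating of 3 or 4 counts as "relevant", and that the scale-level index is the average of the item-level values. The panel of six raters and all ratings below are hypothetical and are not data from the study.

```python
def item_cvi(ratings: list, relevant_threshold: int = 3) -> float:
    """Item-level CVI: proportion of experts rating the item as relevant."""
    return sum(r >= relevant_threshold for r in ratings) / len(ratings)


def scale_cvi_average(ratings_by_item: list) -> float:
    """Scale-level CVI (averaging method): mean of the item-level CVIs."""
    item_values = [item_cvi(ratings) for ratings in ratings_by_item]
    return sum(item_values) / len(item_values)


# Hypothetical ratings from six experts for three items (1 = not relevant, 4 = highly relevant).
ratings_by_item = [
    [4, 4, 3, 4, 3, 4],  # all six experts rate the item relevant
    [4, 3, 4, 2, 4, 3],  # five of six experts rate the item relevant
    [3, 4, 4, 4, 3, 4],  # all six experts rate the item relevant
]

for ratings in ratings_by_item:
    print(round(item_cvi(ratings), 2))
print(round(scale_cvi_average(ratings_by_item), 2))
```

With a six-person panel, one dissenting expert gives an item-level value of 5/6 ≈ 0.83, which happens to equal the lower bound reported above; the actual panel size is not stated in the abstract.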
Psychometric properties of the Albanian version of the Orofacial Esthetic Scale: OES-ALB.

Science.gov (United States)

Bimbashi, Venera; Čelebić, Asja; Staka, Gloria; Hoxha, Flurije; Peršić, Sanja; Petričević, Nikola

2015-08-26

The aim was to adapt the Orofacial Esthetic Scale (OES) and to test psychometric properties of the Albanian language version in the cultural environment of the Republic of Kosovo. The OES questionnaire was translated from the original English version according to the accepted techniques. The reliability (internal consistency), and validity (construct, convergent and discriminative) were tested in 169 subjects, test-retest in 61 dental students (DS), and responsiveness in 51 prosthodontic patients with treatment needs (PPTN). The corrected item correlation coefficients of OES-ALB ranged from 0.686 to 0.909. The inter-item correlation coefficient ranged between 0.572 and 0.919. The Cronbach's alpha was 0.961 and IIC 0.758. Test-retest was confirmed by good ICCs and by no significant differences of the OES scores through the period of 14 days without any orofacial changes (p > 0.05). Construct validity was proved by the presence of one-factor composition that assumed 79.079% of the variance. Convergent validity showed significant correlation between one general question about satisfaction with orofacial esthetics and the OES summary score, as well as between the sum of the 3 OHIP-ALB49 questions related to orofacial aesthetics and the OES summary score. Discriminative validity was confirmed with statistically significant differences between DS, prosthodontic patients without treatment need and PPTN (p < 0.01). Responsiveness was confirmed by a significant increase of OES scores after PPTN patients received new fixed partial or removable dentures (p < 0.001). The results proved excellent psychometric properties of the OES-ALB questionnaire in the Republic of Kosovo.
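Cronbach's alpha and the mean inter-item correlation are linked through the standardized alpha formula, α = k·r̄ / (1 + (k − 1)·r̄), where k is the number of items and r̄ the mean inter-item correlation. The sketch below applies that formula under two assumptions that the abstract does not confirm: that "IIC" denotes the mean inter-item correlation, and that the scale has eight items. The function name is hypothetical.

```python
def standardized_alpha(n_items: int, mean_inter_item_r: float) -> float:
    """Standardized Cronbach's alpha from item count and mean inter-item correlation."""
    k, r = n_items, mean_inter_item_r
    return (k * r) / (1.0 + (k - 1) * r)


# Assumed inputs: 8 items (not stated in the abstract) and the reported IIC of 0.758.
alpha = standardized_alpha(n_items=8, mean_inter_item_r=0.758)
print(f"{alpha:.2f}")  # prints: 0.96
```

The result is close to the reported alpha of 0.961. A small difference is expected because a reported alpha is usually computed from item covariances, not from the average correlation.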
Psychometric evaluation of the Social Interaction Phobia Scale.

Science.gov (United States)

Reilly, Alison R; Carleton, R Nicholas; Weeks, Justin W

2012-01-01

The present study evaluated the psychometric properties of a novel measure of social anxiety symptoms, the Social Interaction Phobia Scale (SIPS), as a stand-alone item set, using an undergraduate sample (N=512). The 14-item SIPS has three subscales assessing Social Interaction Anxiety, Fear of Overt Evaluation, and Fear of Attracting Attention. Confirmatory factor analyses replicated the three-factor structure for the SIPS originally reported by Carleton et al. All SIPS scores demonstrated good internal consistency. The convergent validity of the SIPS was supported by strong and positive correlations between all SIPS scores and measures of social anxiety and fear of evaluation; the finding that the relationships between all SIPS scores and a social anxiety measure were stronger than relationships between all SIPS scores and measures of other constructs supported the discriminant validity of the SIPS. Results suggest that the SIPS possesses excellent psychometric properties.
Evaluation of a new computerized psychometric test battery: Effects of zolpidem and caffeine

OpenAIRE

Raveendranadh Pilli; MUR Naidu; Usharani Pingali; J C Shobha

2013-01-01

Objective: To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. Materials and Methods: The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of...
Hurlbert Index of Sexual Assertiveness: a study of psychometric properties in a Spanish sample.

Science.gov (United States)

Santos-Iglesias, Pablo; Sierra, Juan Carlos

2010-08-01

The study analyzed psychometric properties of a Spanish version of the Hurlbert Index of Sexual Assertiveness in a Spanish sample of 400 men and 453 women who had had a partner for the last 6 mo. or longer at the time of the study. Exploratory and confirmatory factor analyses suggested a two-factor solution with the factors Initiation and No shyness/Refusal. Internal consistency values for total scores were .87 and .83 for the factors, respectively. Convergent validity tests were also satisfactory. It is therefore reasonable to conclude that the Spanish version of the scale has appropriate psychometric properties.
Evaluation of a new computerized psychometric test battery: Effects of zolpidem and caffeine.

Science.gov (United States)

Pilli, Raveendranadh; Naidu, Mur; Pingali, Usharani; Shobha, Jc

2013-10-01

To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with those obtained using pencil- and paper-based techniques. The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by having a single experimenter perform the tests on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of zolpidem (5 mg) and caffeine (500 mg) versus placebo were studied in 24 healthy participants in a randomized, double-blind three-way crossover design. Psychometric tests were performed at baseline and at 1, 2, and 3 h after administration of study medication. The effects of zolpidem and caffeine on psychomotor performance were most pronounced 1 h after administration. At this time, a significant impairment of performance in the simple reaction test (SRT), choice discrimination test (CDT), digit symbol substitution test (DSST), digit vigilance test (DVT), and card sorting test (CST) was observed with zolpidem. In contrast, caffeine showed a significant improvement in performance in CDT and DVT only. The results suggest that the tests of the computerized system are more sensitive and reliable than the pencil and paper tests in detecting the effects of centrally acting agents and are suitable for use in clinical areas to conduct studies with patients.
Psychometric Properties of “Community Assessment of Psychic Experiences”: Review and Meta-analyses

Science.gov (United States)

Mark, Winifred; Toulopoulou, Timothea

2016-01-01

The Community Assessment of Psychic Experiences (CAPE) has been used extensively as a measurement for psychosis proneness in clinical and research settings. However, no prior review and meta-analysis have comprehensively examined psychometric properties (reliability and validity) of CAPE scores across different studies. To study CAPE’s internal reliability—ie, how well scale items correlate with one another—111 studies were reviewed. Of these, 18 reported unique internal reliability coefficients using data at hand, which were aggregated in a meta-analysis. Furthermore, to confirm the number and nature of factors tapped by CAPE, 17 factor analytic studies were reviewed and subjected to meta-analysis in cases of discrepancy. Results suggested that CAPE scores were psychometrically reliable—ie, scores obtained could be attributed to true score variance. Our review of factor analytic studies supported a 3-factor model for CAPE consisting of “Positive”, “Negative”, and “Depressive” subscales; and a tripartite structure for the Negative dimension consisting of “Social withdrawal”, “Affective flattening”, and “Avolition” subdimensions. Meta-analysis of factor analytic studies of the Positive dimension revealed a tridimensional structure consisting of “Bizarre experiences”, “Delusional ideations”, and “Perceptual anomalies”. Information on reliability and validity of CAPE scores is important for ensuring accurate measurement of the psychosis proneness phenotype, which in turn facilitates early detection and intervention for psychotic disorders. Apart from enhancing the understanding of psychometric properties of CAPE scores, our review revealed questionable reporting practices possibly reflecting insufficient understanding regarding the significance of psychometric properties. We recommend increased focus on psychometrics in psychology programmes and clinical journals. PMID:26150674
Psychometric evaluation of the Chinese version of short-form Test of Functional Health Literacy in Adolescents.

Science.gov (United States)

Chang, Li-Chun; Hsieh, Pei-Lin; Liu, Chieh-Hsing

2012-09-01

The purpose of this study is to develop and evaluate the psychometric properties of the Chinese version of the short-form Test of Functional Health Literacy in Adolescents. Assessing health literacy is vital for designing health education programmes; however, no measurement tools exist for use specifically in Chinese adolescents. A non-experimental design was used to test the psychometric properties of the Test of Functional Health Literacy in Adolescents. The short-form Test of Functional Health Literacy in Adolescents was translated and back translated into a Chinese language version. Thirty high school students were recruited to validate the scenario of the Test of Functional Health Literacy in Adolescents. Based on the multiple-stage stratified random sampling method, 300 high school students from four counties in Taiwan were invited to participate in this study to evaluate the psychometric properties of the Test of Functional Health Literacy in Adolescents. The Test of Functional Health Literacy in Adolescents had good internal consistency reliability and excellent test-retest reliability. Confirmatory factor analysis resulted in a one-factor solution. Contrary to the original version of the Test of Functional Health Literacy in Adolescents, the findings revealed that the 36-item, one-factor model for the Test of Functional Health Literacy in Adolescents is the best-fit model. This is a suitable instrument to assess health literacy levels in Chinese adolescents before health education programmes can be appropriately planned, implemented and evaluated. © 2012 Blackwell Publishing Ltd.
Psychometric properties and clinical usefulness of the Oswestry Disability Index.

Science.gov (United States)

Vianin, Michael

2008-12-01

Outcome measures with good reliability, validity, responsiveness, and low burden of administration are clinically useful. The Oswestry Disability Index (ODI) is one of the most commonly used outcome measures for individuals with low back pain. Psychometric properties of the ODI will determine the questionnaire's suitability as a useful clinical tool. A literature search of relevant databases on psychometric evaluation of the ODI was performed. The search was done using the key words disability evaluation, and low back pain, and questionnaires, and reproducibility of results, and the term Oswestry. Inclusion criterion was direct reference regarding psychometric property, interpretability, and burden being included in the abstract. Eight articles met the inclusion criterion. The ODI shows good construct validity; internal consistency is rated as acceptable; test-retest reliability and responsiveness have been shown to be high; and burden of administration is low. The ODI is a valid, reliable, and responsive condition-specific assessment tool that is suited for use in clinical practice. It is easy to administer and score, objectifies clients' complaints, and monitors effects of therapy.
IQs Are Very Strong but Imperfect Indicators of Psychometric "g": Results from Joint Confirmatory Factor Analysis

Science.gov (United States)

Farmer, Ryan L.; Floyd, Randy G.; Reynolds, Matthew R.; Kranzler, John H.

2014-01-01

IQs, the most global scores yielded by intelligence tests, are supported by substantial validity evidence and have historically been central to the identification of intellectual disabilities, learning disabilities, and giftedness. This study examined the extent to which IQs measure the ability they target, psychometric "g." Data from…
Psychometrics of social cognitive measures for psychosis treatment research.

Science.gov (United States)

Davidson, Charlie A; Lesser, Rebecca; Parente, Lori T; Fiszdon, Joanna M

2018-03-01

Social cognition represents an important treatment target, closely linked to everyday social function. While a number of social cognitive interventions have recently been developed, measures used to evaluate these treatments are only beginning to receive psychometric scrutiny. Study goals were to replicate recently-published psychometrics for several social cognitive measures, and to provide information for additional social cognitive measures not included in recent reports. Forty-eight outpatients with psychotic-spectrum disorders completed measures of emotion perception, theory of mind, and attributional bias on two occasions, one month apart. Measures were tested for distributional characteristics, test-retest reliability, utility as a repeated measure, and relationship to symptoms and functioning. For a subgroup of participants, information about sensitivity to social cognitive treatment was also available. We replicated aspects of prior work, including largely favorable psychometric characteristics for the Bell-Lysaker Emotion Recognition Task, and promising but weaker characteristics for The Awareness of Social Inferences Test subscales and Reading the Mind in the Eyes Task. The Hinting Task had adequate test-retest statistics but a more pronounced ceiling effect. Ambiguous Intentions and Hostility Questionnaire data showed evidence of validity but were limited by inconsistency over time. Our results strongly support the Davos Assessment of Cognitive Biases Scale for future evaluation as a social cognitive treatment outcome measure. Its scores were adequately distributed, consistent over time, related to symptoms and functioning, and sensitive to treatment effects. Other relatively novel assessments of attributional bias and theory of mind showed some promise, although more work is needed. Published by Elsevier B.V.
Development and Psychometric Testing of the Dogs and WalkinG Survey (DAWGS)

Science.gov (United States)

Richards, Elizabeth A.; McDonough, Meghan H.; Edwards, Nancy E.; Lyle, Roseann M.; Troped, Philip J.

2013-01-01

Purpose: Dog owners represent 40% of the population, a promising audience to increase population levels of physical activity. The purpose of this study was to develop and test the psychometric properties of a new instrument to assess social-cognitive theory constructs related to dog walking. Method: Dog owners ("N" = 431) completed the…
Psychometric properties of a scale to measure alexithymia.

Science.gov (United States)

Blanchard, E B; Arena, J G; Pallmeyer, T P

1981-01-01

Four studies were conducted on a sample of 230 undergraduates to determine the psychometric properties of a measure of alexithymia, the Schalling-Sifneos Scale. In the first study it was found that scores on the scale are approximately normally distributed for each sex with 8.2% of males and 1.8% of females in the alexithymia range. In the second study a factor analysis of the scale revealed three distinct factors: (1) 'difficulty in expression of feelings'; (2) 'the importance of feelings especially about people'; (3) 'day-dreaming or introspection'. In the second factor analytic study, scores from several standard psychological tests on the same subjects were introduced with the scale items. Two factors in this analysis were composed almost entirely of the other test scores: a 'general psychological distress factor' and a 'concerns about physical symptoms factor'. The other two factors were similar to factors 1 and 2 above in terms of items. The Rathus Assertiveness Scale loaded positively on the equivalent of factor 1. In the last study, it was shown that the Schalling-Sifneos Scale score is relatively orthogonal to other psychological tests with the exception of a Psychosomatic Symptom Checklist and thus is measuring something other than depression, anxiety, etc.
Psychometric properties of Persian version of the Sustained Auditory Attention Capacity Test in children with attention deficit-hyperactivity disorder.

Science.gov (United States)

Soltanparast, Sanaz; Jafari, Zahra; Sameni, Seyed Jalal; Salehi, Masoud

2014-01-01

The purpose of the present study was to evaluate the psychometric properties (validity and reliability) of the Persian version of the Sustained Auditory Attention Capacity Test in children with attention deficit hyperactivity disorder. The Persian version of the Sustained Auditory Attention Capacity Test was constructed to assess sustained auditory attention using the method provided by Feniman and colleagues (2007). In this test, comments were provided to assess the child's attentional deficit by determining inattention and impulsiveness errors, the total score of the Sustained Auditory Attention Capacity Test, and the attention span reduction index. To determine validity and reliability, 46 normal children and 41 children with attention deficit hyperactivity disorder (ADHD), of both genders, all right-handed and aged between 7 and 11, were evaluated with both the Rey Auditory Verbal Learning test and the Persian version of the Sustained Auditory Attention Capacity Test (SAACT). In determining convergent validity, a significant negative correlation was found between the three parts of the Rey Auditory Verbal Learning test (first, fifth, and immediate recall) and all indicators of the SAACT except attention span reduction. By comparing the test scores between the normal and ADHD groups, discriminant validity analysis showed significant differences in all indicators of the test except for attention span reduction (p… Attention Capacity test has good validity and reliability that match those of other reliable tests, and it can be used to identify children with attention deficits and those suspected of having attention deficit hyperactivity disorder.
Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

Science.gov (United States)

Cecilio-Fernandes, Dario; Medema, Harro; Collares, Carlos Fernando; Schuwirth, Lambert; Cohen-Schotanus, Janke; Tio, René A

2017-11-09

Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods. A 2 × 2 crossover design was used in a study where 298 students from four medical schools participated. A sample of 200 previously used questions from the progress tests was selected. The data were analyzed using the Rasch model, which provides fit parameters, reliability coefficients, and response option analysis. The fit parameters were in the optimal interval ranging from 0.50 to 1.50, and the means were around 1.00. The person and item reliability coefficients were higher in the number-right condition than in the formula-scoring condition. The response option analysis showed that the majority of dysfunctional items emerged in the formula-scoring condition. The findings of this study support the use of number-right scoring over formula scoring. Rasch model analyses showed that tests with number-right scoring have better psychometric properties than formula scoring. However, choosing the appropriate scoring method should depend not only on psychometric properties but also on self-directed test-taking strategies and metacognitive skills.
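The two scoring rules compared in the abstract above can be illustrated with a minimal sketch. The abstract does not state which formula was applied; the version below assumes the classic correction-for-guessing rule (right answers minus wrong answers divided by the number of options minus one), with a blank answer recorded as `None`. The function names, answer key, and responses are hypothetical.

```python
def number_right_score(responses, key):
    """Number-right scoring: one point per correct answer.

    Wrong answers and blanks (None) both earn zero, so guessing is never penalized.
    """
    return sum(1 for r, k in zip(responses, key) if r == k)


def formula_score(responses, key, n_options):
    """Formula scoring (assumed correction-for-guessing rule).

    score = right - wrong / (n_options - 1); blanks (None) earn zero,
    so admitting ignorance costs less than a wrong guess.
    """
    right = sum(1 for r, k in zip(responses, key) if r is not None and r == k)
    wrong = sum(1 for r, k in zip(responses, key) if r is not None and r != k)
    return right - wrong / (n_options - 1)


# Hypothetical five-item test with four options per item.
key = ["A", "C", "B", "D", "A"]
responses = ["A", "C", "D", None, "B"]  # 2 right, 2 wrong, 1 left blank

print(number_right_score(responses, key))
print(formula_score(responses, key, n_options=4))
```

Under this assumed rule the same response pattern yields a lower score with formula scoring, because each wrong answer subtracts a fraction of a point while the blank subtracts nothing.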
Child's Challenging Behaviour Scale, Version 2 (CCBS-2): Psychometric Evaluation With Young Children.

Science.gov (United States)

Bourke-Taylor, Helen; Pallant, Julie; Cordier, Reinie

In this article, we evaluate psychometric properties of the Child's Challenging Behaviour Scale, Version 2 (CCBS-2) with mothers of young, typically developing children. A cross-sectional mail survey with Australian mothers (N = 337) included the CCBS-2, the Depression Anxiety Stress Scales, and the Parents' Evaluation of Developmental Status scale. Internal consistency was good, and no gender differences in CCBS-2 scores were significant. Significant results included differences between CCBS-2 scores: among children grouped according to age, among children grouped according to pre- and post-school entry, among mothers grouped according to extent of any symptom type, and between this sample and a previously collected age-matched sample of children with disabilities. Of the properties tested, results support sound psychometrics. The CCBS-2 can be used to differentiate children according to age, school entry, and disability as well as to identify families for potential services in behavior management and mental health. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Multidimensional daily diary of fatigue-fibromyalgia-17 items (MDF-fibro-17): part 2 psychometric evaluation in fibromyalgia patients.

Science.gov (United States)

Li, Y; Morris, S; Cole, J; Dube', S; Smith, J A M; Burbridge, C; Symonds, T; Hudgens, S; Wang, W

2017-05-18

The Multidimensional Daily Diary of Fatigue-Fibromyalgia-17 instrument (MDF-Fibro-17) has been developed for use in fibromyalgia (FM) clinical studies and includes 5 domains: Global Fatigue Experience, Cognitive Fatigue, Physical Fatigue, Motivation, and Impact on Function. Psychometric properties of the MDF-Fibro-17 needed to demonstrate the appropriateness of using this instrument in clinical studies are presented. Psychometric analyses were conducted to evaluate the factor structure, reliability, validity, and responsiveness of the MDF-Fibro-17 using data from a Phase 2 clinical study of FM patients (N = 381). Confirmatory factor analyses (CFA) were performed to ensure understanding of the multidimensional domain structure, and a secondary factor analysis of the domains examined the appropriateness of calculating a total score in addition to domain scores. Longitudinal psychometric analyses (test-retest reliability and responder analysis) were also conducted on the data from Baseline to Week 6. The CFA supported the 17-item, 5 domain structure of this instrument as the best fit of the data: comparative fit index (CFI) and non-normed fit index (NNFI) were 0.997 and 0.992 respectively, standardized root mean square residual (SRMR) was 0.010 and the root mean square error of approximation (RMSEA) was 0.06. In addition, total score (CFI and NNFI both 0.95) met required standards. For the total and 5 domain scores, reliability and validity data were acceptable: test-retest and internal consistency were above 0.9; correlations were as expected with the Global Fatigue Index (GFI) (0.62-0.75), Fibromyalgia Impact Questionnaire (FIQ) Total (0.59-0.71), and 36-Item Short Form Health Survey (SF-36) vitality (VT) (0.43-0.53); and discrimination was shown using quintile scores for the GFI, FIQ Total, and Pain Numeric Rating Scale (NRS) quartiles. In addition, sensitivity to change was demonstrated with an overall mean responder score of -2.59 using anchor-based methods
Psychometric properties of the List of Threatening Experiences--LTE and its association with psychosocial factors and mental disorders according to different scoring methods.

Science.gov (United States)

Motrico, Emma; Moreno-Küstner, Berta; de Dios Luna, Juan; Torres-González, Francisco; King, Michael; Nazareth, Irwin; Montón-Franco, Carmen; Gilde Gómez-Barragán, María Josefa; Sánchez-Celaya, Marta; Díaz-Barreiros, Miguel Ángel; Vicens, Catalina; Moreno-Peral, Patricia; Bellón, Juan Ángel

2013-09-25

The List of Threatening Experiences (LTE) questionnaire is frequently used to assess stressful events; however, studies of its psychometric properties are scarce. We examined the LTE's reliability, factorial structure, and construct validity, and explored the association between LTE scores and psychosocial variables and mental disorders. This study involved interviewing 5442 primary care attendees from Spain. Associations between four different methods of quantifying LTE scores, psychosocial factors, major depression (CIDI), anxiety disorders (PRIME-MD), alcohol misuse and dependence (AUDIT) were measured. The LTE showed high test-retest reliability (Kappa range=0.61-0.87) and low internal consistency (α=0.44). Tetrachoric factorial analysis yielded four factors (spousal and relational problems; employment and financial problems; personal problems; illness and bereavement in close persons). Logistic multilevel regression found a strong association between greater social support and a lower occurrence of stressful events (OR range=0.36-0.79). The association between religious-spiritual beliefs and the LTE was weaker. The association between mental disorders and LTE scores was greater for depression (OR range=1.64-2.57) than anxiety (OR range=1.35-1.97), though the highest ORs were obtained with alcohol dependence (OR range=2.86-4.80). The ordinal score (ordinal regression) was more sensitive in detecting the strength of association with mental disorders. We are unable to distinguish the direction of the association between stressful events, psychosocial factors and mental disorders because of the cross-sectional design of the study. The LTE is a valid and reliable measure of stress in mental health, and the strength of association with mental disorders depends on the method of quantifying LTE scores. © 2013 Elsevier B.V. All rights reserved.
Psychometric Properties of the Autism-Spectrum Quotient for Assessing Low and High Levels of Autistic Traits in College Students.

Science.gov (United States)

Stevenson, Jennifer L; Hart, Kari R

2017-06-01

The current study systematically investigated the effects of scoring and categorization methods on the psychometric properties of the Autism-Spectrum Quotient. Four hundred and three college students completed the Autism-Spectrum Quotient at least once. Total scores on the Autism-Spectrum Quotient had acceptable internal consistency and test-retest reliability using a binary or Likert scoring method, but the results were more varied for the subscales. Overall, Likert scoring yielded higher internal consistency and test-retest reliability than binary scoring. However, agreement in categorization of low and high autistic traits was poor over time (except for a median split on Likert scores). The results support using Likert scoring and administering the Autism-Spectrum Quotient at the same time as the task of interest with neurotypical participants.
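As a rough illustration of how internal consistency can be compared across scoring methods, the sketch below computes Cronbach's alpha for the same responses scored on a 4-point Likert scale and collapsed to binary. Everything here is an assumption for illustration: the data are made up, the cutoff of 3 is hypothetical, and the helper names are not taken from the study. The toy result says nothing about the Autism-Spectrum Quotient itself.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores)
    """
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def to_binary(rows, cutoff=3):
    """Collapse Likert responses to 0/1 (hypothetical cutoff: >= 3 scores 1)."""
    return [[1 if x >= cutoff else 0 for x in row] for row in rows]


# Made-up responses: 5 respondents x 4 items on a 1-4 Likert scale.
likert = [
    [1, 2, 1, 2],
    [2, 2, 3, 2],
    [3, 3, 2, 3],
    [4, 3, 4, 4],
    [2, 1, 2, 1],
]

alpha_likert = cronbach_alpha(likert)
alpha_binary = cronbach_alpha(to_binary(likert))
print(round(alpha_likert, 3), round(alpha_binary, 3))
```

Collapsing to binary discards the distinction between adjacent response categories, which is one reason the two scoring methods can give different reliability estimates for identical responses.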
Psychometric and screening properties of the WHO-5 well-being index in adult outpatients with Type 1 or Type 2 diabetes mellitus

DEFF Research Database (Denmark)

Hajós, Tibor R S; Pouwer, F; Skovlund, S E

2013-01-01

OBJECTIVE: The 5-item World Health Organization well-being index is a commonly used measure of emotional well-being, but research on psychometric properties in outpatients with diabetes is scarce. We examined psychometric and screening properties for depression of this index in a large sample...... of the WHO-5 index was determined by Cronbach's alpha. The factor structure was tested by confirmatory factor analysis. Concurrent validity was assessed by correlations with the Patient Health Questionnaire, Problem Areas in Diabetes and the Short Form-12 mental component scores. Sensitivity and specificity...... and Type 2 diabetes. Moderate to strong correlations were observed between the WHO-5 index and the Patient Health Questionnaire scores, the Problem Areas in Diabetes scores and the Short Form-12 mental component scores (r = 0.55-0.69, P

The "Reading the Mind in the Eyes" Test: Investigation of Psychometric Properties and Test-Retest Reliability of the Persian Version

Science.gov (United States)

Khorashad, Behzad S.; Baron-Cohen, Simon; Roshan, Ghasem M.; Kazemian, Mojtaba; Khazai, Ladan; Aghili, Zahra; Talaei, Ali; Afkhamizadeh, Mozhgan

2015-01-01

The psychometric properties of the Persian "Reading the Mind in the Eyes" test were investigated, so were the predictions from the Empathizing-Systemizing theory of psychological sex differences. Adults aged 16-69 years old (N = 545, female = 51.7%) completed the test online. The analysis of items showed them to be generally acceptable.…
A test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS): Normative data and psychometric properties

Directory of Open Access Journals (Sweden)

Giorgio Arcara

2016-02-01

The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS) test is a new tool to evaluate pragmatic abilities in patients with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2) and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total). Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, an excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts’ evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and ease of administration make the test especially suitable for use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient’s cognitive profile, given its vital role for communication and social interaction in daily life. The
Applying cognitive acuity theory to the development and scoring of situational judgment tests.

Science.gov (United States)

Leeds, J Peter

2017-11-09

The theory of cognitive acuity (TCA) treats the response options within items as signals to be detected and uses psychophysical methods to estimate the respondents' sensitivity to these signals. Such a framework offers new methods to construct and score situational judgment tests (SJT). Leeds (2012) defined cognitive acuity as the capacity to discern correctness and distinguish between correctness differences among simultaneously presented situation-specific response options. In this study, SJT response options were paired in order to offer the respondent a two-option choice. The contrast in correctness valence between the two options determined the magnitude of signal emission, with larger signals portending a higher probability of detection. A logarithmic relation was found between correctness valence contrast (signal stimulus) and its detectability (sensation response). Respondent sensitivity to such signals was measured and found to be related to the criterion variables. The linkage between psychophysics and elemental psychometrics may offer new directions for measurement theory.
Psychological theory testing versus psychometric nay-saying: comment on Neuberg et al.'s (1997) critique of the need for closure scale.

Science.gov (United States)

Kruglanski, A W; Atash, M N; DeGrada, E; Mannetti, L; Pierro, A; Webster, D M

1997-11-01

S. L. Neuberg, T. N. Judice, and S. G. West (1997) faulted our work with the Need for Closure Scale (NFCS) on grounds that the NFCS lacks discriminant validity relative to S. L. Neuberg's and J. T. Newsom's (1993) Personal Need for Structure (PNS) Scale and is multidimensional, which, so they claim, renders the use of its total score inadmissible. By contrast, the present authors show that neither of the above assertions is incompatible with the underlying need for closure theory. Relations between NFCS and the PNS are to be expected, as these were designed to operationalize the very same construct (of need for closure). Furthermore, no unidimensionality of the NFCS has been claimed, and none is required to use its total score for testing various theoretically derived predictions. An instrument's ultimate utility hinges on theoretical considerations and empirical evidence rather than on questionable psychometric dogma unrelated to the substantive matters at hand.
[Quality of advanced practice nurse counseling in home care settings (APN-BQ): psychometric testing of the instrument].

Science.gov (United States)

Petry, Heidi; Suter-Riederer, Susanne; Kerker-Specker, Carmen; Imhof, Lorenz

2014-12-01

Patient-centred and individually tailored counselling of older people with a chronic condition who live at home is a useful intervention to support their independence. The paper presents the development and psychometric testing of the APN-BQ instrument to measure patient-centredness. To measure the quality of an in-home counselling intervention, a 23-item questionnaire was developed and tested with 206 people 80 years and older. Principal component analysis with Varimax rotation was conducted (n = 206). Analysis revealed a four factor (fs = 0.91) model scoring in 19 items. All factors loaded > 0.45. Cronbach's alpha was 0.86. The utility and acceptance of the instrument were confirmed by the high response rate (100 %) and the fact that participants answered 98.8 % of all questions. The APN-BQ has been shown to be a reliable instrument with good content and construct validity. It is a tool for APNs to measure structure, process, and outcome quality of a patient-centred and individually tailored counselling program, including the degree of patient participation and patient empowerment.
Validating the Interpretations and Uses of Test Scores

Science.gov (United States)

Kane, Michael T.

2013-01-01

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Prediction of true test scores from observed item scores and ancillary data.

Science.gov (United States)

Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

2015-05-01

In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE® General Analytical Writing and until 2009 in the case of TOEFL® iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater®. In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.
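The idea of a best linear predictor of a true score can be illustrated in its simplest form. The sketch below is not the multivariate method the abstract describes; it assumes a single observed score and uses the standard classical-test-theory result (often called Kelley's formula), in which the observed score is shrunk toward the group mean in proportion to the score's reliability. The function name and numbers are hypothetical.

```python
def kelley_true_score(observed, group_mean, reliability):
    """Single-predictor best linear predictor of the true score (Kelley's formula).

    predicted_true = group_mean + reliability * (observed - group_mean)
    With reliability 1 the observed score is returned unchanged;
    with reliability 0 every examinee is predicted at the group mean.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return group_mean + reliability * (observed - group_mean)


# Hypothetical essay score of 4.5 in a group averaging 3.5, reliability 0.7.
print(kelley_true_score(observed=4.5, group_mean=3.5, reliability=0.7))
```

Adding further predictors, such as automated scores or scores on related tests, turns this into a multiple-regression problem that needs the error variances and covariances the abstract says are hard to estimate.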
The Mental Vulnerability Questionnaire: a psychometric evaluation

DEFF Research Database (Denmark)

Eplov, Lene Falgaard; Petersen, Janne; Jørgensen, Torben

2010-01-01

The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim of the pre...... 0.30 for the 12 and the 22 item scales. All five Mental Vulnerability scales had positively skewed score distributions which were associated significantly with both SCL-90-R symptom scores and NEO-PI-R personality scales (primarily Neuroticism and Extraversion). Coefficient alpha was highest...
Psychometric characteristics of the MATRICS Consensus Cognitive Battery in a large pooled cohort of stable schizophrenia patients.

Science.gov (United States)

Georgiades, Anastasia; Davis, Vicki G; Atkins, Alexandra S; Khan, Anzalee; Walker, Trina W; Loebel, Antony; Haig, George; Hilt, Dana C; Dunayevich, Eduardo; Umbricht, Daniel; Sand, Michael; Keefe, Richard S E

2017-12-01

The MATRICS Consensus Cognitive Battery (MCCB) was developed to assess cognitive treatment effects in schizophrenia clinical trials, and is considered the FDA gold standard outcome measure for that purpose. The aim of the present study was to establish pre-treatment psychometric characteristics of the MCCB in a large pooled sample. The dataset included 2616 stable schizophrenia patients enrolled in 15 different clinical trials between 2007 and 2016 within the United States (94%) and Canada (6%). The MCCB was administered twice prior to the initiation of treatment in 1908 patients. Test-retest reliability and practice effects of the cognitive composite score, the neurocognitive composite score, which excludes the domain Social Cognition, and the subtests/domains were examined using Intra-Class Correlations (ICC) and Cohen's d. Simulated regression models explored which domains explained the greatest portion of variance in composite scores. Test-retest reliability was high (ICC=0.88) for both composite scores. Practice effects were small for the cognitive (d=0.15) and neurocognitive (d=0.17) composites. Simulated bootstrap regression analyses revealed that 3 of the 7 domains explained 86% of the variance for both composite scores. The domains that entered most frequently in the top 3 positions of the regression models were Speed of Processing, Working Memory, and Visual Learning. Findings provide definitive psychometric characteristics and a benchmark comparison for clinical trials using the MCCB. The test-retest reliability of the MCCB composite scores is considered excellent and the learning effects are small, fulfilling two of the key criteria for outcome measures in cognition clinical trials. Copyright © 2017 Elsevier B.V. All rights reserved.
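A practice effect expressed as Cohen's d can be sketched as follows. The abstract does not say which variant of d was used; this example assumes the mean change between the two administrations divided by the pooled standard deviation of the two sets of scores. The function name and scores are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def cohens_d(first, second):
    """Cohen's d for the change from a first to a second administration.

    Assumed variant: (mean of second - mean of first) / pooled SD,
    where the pooled SD is the root mean of the two sample variances.
    """
    pooled_sd = sqrt((stdev(first) ** 2 + stdev(second) ** 2) / 2)
    return (mean(second) - mean(first)) / pooled_sd


# Hypothetical composite scores for five patients tested twice.
first_admin = [40, 45, 50, 55, 60]
second_admin = [42, 47, 52, 57, 62]

print(round(cohens_d(first_admin, second_admin), 3))
```

A positive d here means scores rose on retest. Other variants divide by the standard deviation of the difference scores instead, which gives a different value for the same data.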
Test/score/report: Simulation techniques for automating the test process

Science.gov (United States)

Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

1994-01-01

A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary
Construction and psychometric testing of the EMPATHIC questionnaire measuring parent satisfaction in the pediatric intensive care unit

NARCIS (Netherlands)

Latour, Jos M.; van Goudoever, Johannes B.; Duivenvoorden, Hugo J.; Albers, Marcel J. I. J.; van Dam, Nicolette A. M.; Dullaart, Eugenie; van Heerde, Marc; de Neef, Marjorie; Verlaat, Carin W. M.; van Vught, Elise M.; Hazelzet, Jan A.

2011-01-01

To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument
Psychometric Evaluation of the Italian Adaptation of the Test of Inferential and Creative Thinking

Science.gov (United States)

Faraci, Palmira; Hell, Benedikt; Schuler, Heinz

2016-01-01

This article describes the psychometric properties of the Italian adaptation of the "Analyse des Schlussfolgernden und Kreativen Denkens" (ASK; Test of Inferential and Creative Thinking) for measuring inferential and creative thinking. The study aimed to (a) supply evidence for the factorial structure of the instrument, (b) describe its…
Adaptive testing with equated number-correct scoring

NARCIS (Netherlands)

van der Linden, Willem J.

1999-01-01

A constrained CAT algorithm is presented that automatically equates the number-correct scores on adaptive tests. The algorithm can be used to equate number-correct scores across different administrations of the same adaptive test as well as to an external reference test. The constraints are derived
The Dutch-Flemish PROMIS Physical Function item bank exhibited strong psychometric properties in patients with chronic pain.

Science.gov (United States)

Crins, Martine H P; Terwee, Caroline B; Klausch, Thomas; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis A; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Roorda, Leo D

2017-07-01

The objective of this study was to assess the psychometric properties of the Dutch-Flemish Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank in Dutch patients with chronic pain. A bank of 121 items was administered to 1,247 Dutch patients with chronic pain. Unidimensionality was assessed by fitting a one-factor confirmatory factor analysis and evaluating the resulting fit statistics. Items were calibrated with the graded response model and its fit was evaluated. Cross-cultural validity was assessed by testing items for differential item functioning (DIF) based on language (Dutch vs. English). Construct validity was evaluated by calculating correlations between scores on the Dutch-Flemish PROMIS Physical Function measure and scores on generic and disease-specific measures. Results supported the Dutch-Flemish PROMIS Physical Function item bank's unidimensionality (Comparative Fit Index = 0.976, Tucker Lewis Index = 0.976) and model fit. Item thresholds targeted a wide range of the physical function construct (threshold-parameters range: -4.2 to 5.6). Cross-cultural validity was good, as only four items showed DIF for language and their impact on item scores was minimal. Physical Function scores were strongly associated with scores on all other measures (all correlations ≤ -0.60 as expected). The Dutch-Flemish PROMIS Physical Function item bank exhibited good psychometric properties. Development of a computer adaptive test based on the large bank is warranted. Copyright © 2017 Elsevier Inc. All rights reserved.
Psychometric characteristics of single-word tests of children's speech sound production.

Science.gov (United States)

Flipsen, Peter; Ogiela, Diane A

2015-04-01

Our understanding of test construction has improved since the now-classic review by McCauley and Swisher (1984). The current review article examines the psychometric characteristics of current single-word tests of speech sound production in an attempt to determine whether our tests have improved since then. It also provides a resource that clinicians may use to help them make test selection decisions for their particular client populations. Ten tests published since 1990 were reviewed to determine whether they met the 10 criteria set out by McCauley and Swisher (1984), as well as 7 additional criteria. All of the tests reviewed met at least 3 of McCauley and Swisher's (1984) original criteria, and 9 of 10 tests met at least 5 of them. Most of the tests met some of the additional criteria as well. The state of the art for single-word tests of speech sound production in children appears to have improved in the last 30 years. There remains, however, room for improvement.
ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

Science.gov (United States)

Allalouf, Avi

2014-01-01

The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…
Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

Science.gov (United States)

McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

2010-01-01

Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p […]). […] profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate, for three factors, between respondents with differing EBP exposures.
An Examination of Psychometric Properties of Positive Functional Attitudes Scale

Directory of Open Access Journals (Sweden)

Saide Umut ZEYBEK

2017-08-01

The aim of this study is to investigate the applicability of the Coping Attitudes Scale: Measure of Positive Attitudes in Depression (CAS) in a Turkish young adult community sample and to determine the psychometric properties (validity and reliability) of this scale. The study was conducted with 419 students attending different departments of the Faculty of Education at Mugla Sitki Kocman University in the spring semester of the 2015-2016 academic year. The Positive Functional Attitudes Scale (PFAS), Beck Depression Scale, Beck Hopelessness Scale, Automatic Thoughts Scale, Positivity Scale and Developed Automatic Thoughts Scale were used as data collection tools. Confirmatory factor analysis (CFA) was used to investigate the psychometric properties of the PFAS. Criterion-related validity, test-retest validity, and internal consistency were also calculated. The CFA results showed that standardized item estimates of the CAS ranged between 0.45 and 0.47 and that the original factor structure of the PFAS was confirmed in the Turkish sample. Internal consistency was calculated using the total community sample's PFAS scores. Cronbach's alpha coefficient for the total scale (.93) was high. Test-retest results of the subscales were 0.76. The findings showed that the factor structure of the PFAS (life perspective, personal accomplishment, positive future, self-worth, coping with problems) had psychometric quality in the Turkish version. As a result of the study, the Turkish version of the PFAS has good validity and reliability for a young adult community sample. [JCBPR 2017; 6(2): 59-66]
Development and psychometric testing of the Carter Assessment of Critical Thinking in Midwifery (Preceptor/Mentor version).

Science.gov (United States)

Carter, Amanda G; Creedy, Debra K; Sidebotham, Mary

2016-03-01

To develop and test a tool designed for use by preceptors/mentors to assess undergraduate midwifery students' critical thinking in practice. A descriptive cohort design was used. Participants worked in a range of maternity settings in Queensland, Australia. Participants were 106 midwifery clinicians who had acted in the role of preceptor for undergraduate midwifery students. This study followed a staged model for tool development recommended by DeVellis (2012). This included generation of items, content validity testing through mapping of draft items to critical thinking concepts and expert review, administration of items to a convenience sample of preceptors, and psychometric testing. A 24-item tool titled the Carter Assessment of Critical Thinking in Midwifery (CACTiM) was completed by registered midwives in relation to students they had recently preceptored in the clinical environment. Ratings by experts revealed a content validity index score of 0.97, representing good content validity. An evaluation of construct validity through factor analysis generated three factors: 'partnership in practice', 'reflection on practice' and 'practice improvements'. The scale demonstrated good internal reliability with a Cronbach alpha coefficient of 0.97. The mean total score for the CACTiM scale was 116.77 (SD=16.68) with a range of 60-144. Total and subscale scores correlated significantly. The CACTiM (Preceptor/Mentor version) was found to be a valid and reliable tool for use by preceptors to assess critical thinking in undergraduate midwifery students. Given the importance of critical thinking skills for midwifery practice, mapping and assessing critical thinking development in students' practice across an undergraduate programme is vital. The CACTiM (Preceptor/Mentor version) has utility for clinical education, research and practice. The tool can inform and guide preceptors' assessment of students' critical thinking in practice. The availability of a reliable and valid tool can be used to

Psychometric analysis of subjective sedation scales used for critically ill paediatric patients.

Science.gov (United States)

Ge, Xiaohua; Zhang, Tingting; Zhou, Lingling

2018-01-01

This study evaluated the psychometric properties of subjective sedation scales using one psychometric scoring system to identify the appropriate scale that is most suitable for clinical care practice. A number of published sedation assessment scales for paediatric patients are currently used to attempt to achieve a moderate depth of sedation to avoid the undesirable effects caused by over- or undersedation. However, there has been no systematic review of these scales. We searched the Cochrane Library, PubMed, EMBASE, the Cumulative Index to Nursing and Allied Health Literature, etc., to obtain relevant articles. The quality of the selected studies was evaluated according to the Consensus-based Standards for the Selection of Health Measurement Instruments checklist. Articles that had been published or were in press and discussed the psychometric properties of sedation scales were included. The population comprised critically ill infants and non-verbal children ranging in age from 0 to 18 years who underwent sedation in an intensive care unit. Data were independently extracted by two investigators using a standard data extraction checklist: 43 articles were included in this review, and 13 sedation scales were examined. The quality of the psychometric evidence for the Comfort Scale and Comfort Behaviour Scale was 'very good', with the Comfort Scale having a higher quality (total weighted scores, Comfort Scale = 17·3 and Comfort Behaviour Scale = 15·5). We suggest that the scales be systematically and comprehensively tested in terms of development method, reliability, validation, feasibility and correlation with clinical outcome. The Comfort Scale and Comfort Behaviour Scale are useful tools for measuring sedation in paediatric patients. Nursing staff should choose one subjective sedation scale that is suitable for assessing paediatric patients' depth of sedation. We recommend the Comfort Scale and Comfort Behaviour Scale as optimal choices if the clinical
Measurement of overgeneral autobiographical memory: Psychometric properties of the autobiographical memory test in young and older populations.

Science.gov (United States)

Ros, Laura; Romero, Dulce; Ricarte, Jorge J; Serrano, Juan P; Nieto, Marta; Latorre, Jose M

2018-01-01

The Autobiographical Memory Test (AMT) is the most widely used measure of overgeneral autobiographical memory (OGM). The AMT appears to have good psychometric properties, but more research is needed on the influence and applicability of individual cue words in different languages and populations. To date, no studies have evaluated its usefulness as a measure of OGM in Spanish or older populations. This work aims to analyze the applicability of the AMT in young and older Spanish samples. We administered a Spanish version of the AMT to samples of young (N = 520) and older adults (N = 155). We conducted confirmatory factor analysis (CFA), item response theory-based analysis (IRT) and differential item functioning (DIF) analysis. Results confirm the one-factor structure for the AMT. IRT analysis suggests that both groups find the AMT easy, given that they generally perform well, and that it is more precise in individuals who score low on memory specificity. DIF analysis finds that three items differ in their functioning depending on age group. The differential functioning of these items affects the overall AMT scores and, thus, they should be excluded from the AMT in studies comparing young and older samples. We discuss the possible implications of the samples and cue words used.
Summary of Score Changes (in other Tests).

Science.gov (United States)

Cleary, T. Anne; McCandless, Sam A.

Scholastic Aptitude Test (SAT) scores have declined during the last 14 years. Similar score declines have been observed in many different testing programs, many groups, and tested areas. The declines, while not large in any given year, have been consistent over time, area, and group. The period around 1965 is critical for the interpretation of…
Psychometric Properties of the Drive for Muscularity Attitudes Questionnaire Among Irish Men

Directory of Open Access Journals (Sweden)

Travis A. Ryan

2014-09-01

The Drive for Muscularity Attitudes Questionnaire (DMAQ) was developed to measure men's desire to attain an idealized muscular body. To date, the cross-cultural suitability of this measure has received limited attention. The current study addressed this omission by testing the psychometric properties of the DMAQ using an online sample of Irish men (N = 327). Confirmatory factor analysis revealed that a unidimensional model adequately matched observed data (i.e., fit indices suggested acceptable model fit). Analyses also showed that the DMAQ yielded reliable and construct valid scores, suggesting that the scale holds promise as an indicant of the drive for muscularity among Irish men. Strengths and limitations associated with this study are discussed, such as advantages and disadvantages of Internet research. Directions for future research are given, including the need for more psychometric work.
Psychometric investigation of the abbreviated concussion symptom inventory in a sample of U.S. Marines returning from combat.

Science.gov (United States)

Campbell, Justin S; Pulos, Steven; Haran, F Jay; Tsao, Jack W; Alphonso, Aimee L

2015-01-01

This study describes the psychometric investigation of an 11-item symptom checklist, the Abbreviated Concussion Symptom Inventory (ACSI). The ACSI is a dichotomously scored list of postconcussive symptoms associated with mild traumatic brain injury. The ACSI was administered to Marines (N = 1,435) within the 1st month of their return from combat deployments to Afghanistan. Psychometric analyses based upon nonparametric item response theory supported scoring the ACSI via simple summation of symptom endorsements; doing so produced a total score with good reliability (α = .802). Total scores were also found to significantly differentiate between different levels of head injury complexity during deployment, F(3, 1,431) = 100.75, p < .001. The findings support the use of the ACSI in research settings requiring a psychometrically reliable measure of postconcussion symptoms.
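The summation scoring and reliability coefficient mentioned in the record above can be sketched as follows. This is only an illustration, not the authors' analysis: the response matrix is made up, and the function names `total_scores` and `cronbach_alpha` are hypothetical. Each row is one respondent and each column one dichotomously scored item (1 = symptom endorsed).

```python
from statistics import pvariance

def total_scores(responses):
    """Sum each respondent's item scores (simple summation scoring)."""
    return [sum(row) for row in responses]

def cronbach_alpha(responses):
    """Cronbach's alpha: k/(k-1) * (1 - sum of item variances / variance of totals)."""
    k = len(responses[0])                      # number of items
    items = list(zip(*responses))              # transpose: one tuple per item
    item_var = sum(pvariance(col) for col in items)
    total_var = pvariance(total_scores(responses))
    return k / (k - 1) * (1 - item_var / total_var)

# Hypothetical data: 4 respondents, 3 dichotomous items.
responses = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(total_scores(responses))
print(round(cronbach_alpha(responses), 3))
```

The variance ratio is the same whether population or sample variances are used, provided the choice is consistent for items and totals. The sketch assumes the total scores are not all identical; otherwise the denominator is zero.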
The development and psychometric testing of a Disaster Response Self-Efficacy Scale among undergraduate nursing students.

Science.gov (United States)

Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling

2017-12-01

Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ²/df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, p […]). […] self-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Psychometric evaluation of the ostomy complication severity index.

Science.gov (United States)

Pittman, Joyce; Bakas, Tamilyn; Ellett, Marsha; Sloan, Rebecca; Rawl, Susan M

2014-01-01

The purpose of this study was to evaluate the psychometric properties of a new instrument to measure incidence and severity of ostomy complications early in the postoperative period. 71 participants were enrolled; most were men (52%), white (96%), and married or partnered (55%). The mean age of participants was 57 ± 15.09 years (mean ± SD). Fifty-two participants (84%) experienced at least 1 ostomy complication in the 60-day postoperative period. The research setting was 3 acute care settings within a large healthcare system in the Midwestern United States. We developed an evidence-based conceptual model to guide development and evaluation of a new instrument, the Pittman Ostomy Complication Severity Index (OCSI). The OCSI format includes a Likert-like scale with 9 individual items scored 0 to 3 and a total score computed by summing the individual items. Higher scores indicate more severe ostomy complications. This study consisted of 2 phases: (1) an expert review, conducted to establish content validity; and (2) a prospective, longitudinal study design, to examine psychometric properties of the instrument. A convenience sample of 71 adult patients who underwent surgery to create a new fecal ostomy was recruited from 3 hospitals. Descriptive analyses, content validity indices, interrater reliability testing, and construct validity testing were employed. Common complications included leakage (60%), peristomal moisture-associated dermatitis (50%), stomal pain (42%), retraction (39%), and bleeding (32%). The OCSI demonstrated acceptable evidence of content validity index (CVI = 0.9) and interrater reliability for individual items (k = 0.71-1.0), as well as almost perfect agreement for total scores among raters (ICC = 0.991, P ≤ .001). Construct validity of the OCSI was supported by significant correlations among variables in the conceptual model (complications, risk factors, stoma care self-efficacy, and ostomy adjustment). OCSI demonstrated acceptable validity and
Cross-cultural adaptation and psychometric evaluations of the Turkish version of Parkinson Fatigue Scale.

Science.gov (United States)

Ozturk, Erhan Arif; Kocer, Bilge Gonenli; Umay, Ebru; Cakci, Aytul

2018-06-07

The objectives of the present study were to translate and cross-culturally adapt the English version of the Parkinson Fatigue Scale into Turkish, to evaluate its psychometric properties, and to compare them with those of other language versions. A total of 144 patients with idiopathic Parkinson disease were included in the study. The Turkish version of the Parkinson Fatigue Scale was evaluated for data quality, scaling assumptions, acceptability, reliability, and validity. The questionnaire response rate was 100% for both test and retest. The percentage of missing data was zero for items, and the percentage of computable scores was full. Floor and ceiling effects were absent. The Parkinson Fatigue Scale showed acceptable internal consistency (Cronbach's alpha was 0.974 for the first test and 0.964 for the retest, and corrected item-to-total correlations ranged from 0.715 to 0.906) and test-retest reliability (Cohen's kappa coefficients ranged from 0.632 to 0.786 for individual items, and the intraclass correlation coefficient was 0.887 for the overall Parkinson Fatigue Scale Score). An exploratory factor analysis of the items revealed a single factor explaining 71.7% of variance. The goodness-of-fit statistics for the one-factor confirmatory factor analysis were Tucker Lewis index = 0.961, comparative fit index = 0.971 and root mean square error of approximation = 0.077. The average Parkinson Fatigue Scale Score was correlated significantly with sociodemographic data, clinical characteristics and scores of rating scales. The Turkish version of the Parkinson Fatigue Scale seems to be culturally well adapted and to have good psychometric properties. The scale can be used in further studies to assess fatigue in patients with Parkinson's disease.
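Two of the statistics named in the record above, floor/ceiling effects and corrected item-to-total correlations, can be illustrated with a short sketch. It rests on assumptions that are not taken from the study: the data are invented, the helper names are hypothetical, a floor or ceiling effect is taken as the percentage of respondents at the minimum or maximum possible total score, and the corrected item-to-total correlation is taken as the Pearson correlation between one item and the sum of the remaining items.

```python
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def corrected_item_total(responses, item):
    """Correlate one item with the total of all other items."""
    item_scores = [row[item] for row in responses]
    rest_scores = [sum(row) - row[item] for row in responses]
    return pearson(item_scores, rest_scores)

def floor_ceiling(totals, min_score, max_score):
    """Percentage of respondents at the lowest and highest possible score."""
    n = len(totals)
    floor_pct = 100 * sum(t == min_score for t in totals) / n
    ceiling_pct = 100 * sum(t == max_score for t in totals) / n
    return floor_pct, ceiling_pct

# Hypothetical data: 3 respondents, 3 items scored 0-2.
responses = [[0, 0, 0], [1, 1, 1], [2, 2, 2]]
print(corrected_item_total(responses, 0))
print(floor_ceiling([0, 3, 6, 6], min_score=0, max_score=6))
```

Removing the item from the total before correlating avoids inflating the coefficient by correlating the item with itself. The sketch assumes neither sequence is constant, since a zero standard deviation would make the correlation undefined.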
Field Psychometric Testing of the Instrument for Assessment of Psychological Predictors of Well-Being and Quality of Life in People with HIV or AIDS.

Science.gov (United States)

Remor, Eduardo; Fuster-RuizdeApodaca, Maria José; Ballester-Arnal, Rafael; Gómez-Martínez, Sandra; Fumaz, Carmina R; González-Garcia, Marian; Ubillos-Landa, Silvia; Aguirrezabal-Prado, Arrate; Molero, Fernando; Ruzafa-Martínez, Maria

2016-06-01

The Screenphiv, a screening measure for psychological issues related to HIV, was psychometrically tested in a study involving 744 HIV-infected people in Spain. Participants ages 18-82 (M = 43.04, 72% men, 28% women) completed an assessment protocol that included the Screenphiv and the MOS-HIV. A trained interviewer also collected relevant illness-related clinical data and socio-demographics from the participants. A confirmatory factor analysis was used to evaluate the goodness of fit of the Screenphiv's theoretical model and confirmed six first-order factors and two second-order factors [RMSEA (90% CI) = 0.07 (0.07-0.08)]. No floor or ceiling effects were observed for the scores. Cronbach's alphas were acceptable for all of the factors (from 0.65 to 0.92). Criterion-related validity was also achieved; Screenphiv scores were related to socio-demographic and clinical variables and MOS-HIV summary scores. The Screenphiv is a reliable and valid measure, ready to use in research and clinical settings in Spain.
Psychometric characteristics of health-related quality-of-life questionnaires in oropharyngeal dysphagia.

Science.gov (United States)

Timmerman, Angelique A; Speyer, Renée; Heijnen, Bas J; Klijn-Zwijnenberg, Iris R

2014-04-01

Dysphagia can have severe consequences for the patient's health, influencing health-related quality of life (HRQoL). Sound psychometric properties of HRQoL questionnaires are a precondition for assessing the impact of dysphagia, the focus of this study, resulting in recommendations for the appropriate use of these questionnaires in both clinical practice and research contexts. We performed a systematic review starting with a search for and retrieval of all full-text articles on the development of HRQoL questionnaires related to oropharyngeal dysphagia and/or their psychometric validation from the electronic databases PubMed and Embase published up to June 2011. Psychometric properties were judged according to quality criteria proposed for health status questionnaires. Eight questionnaires were included in this study. Four are aimed solely at HRQoL in oropharyngeal dysphagia: the deglutition handicap index (DHI), dysphagia handicap index (DHI'), M.D. Anderson Dysphagia Inventory (MDADI), and SWAL-QOL, while the EDGQ, EORTC QLQ-STO 22, EORTC QLQ-OG 25 and EORTC QLQ-H&N35 focus on other primary diseases resulting in dysphagia. The psychometric properties of the DHI, DHI', MDADI, and SWAL-QOL were evaluated. For appropriate applicability of HRQoL questionnaires, strong scores on the psychometric criteria of face validity, criterion validity, and interpretability are prerequisites. The SWAL-QOL has the strongest ratings for these criteria, while the DHI' is the easiest to apply given its 25 items and the use of a uniform scoring format. For optimal use of HRQoL questionnaires in diverse settings, it is necessary to combine psychometric and utility approaches.
Developing a model of competence in the operating theatre: psychometric validation of the perceived perioperative competence scale-revised.

Science.gov (United States)

Gillespie, Brigid M; Polit, Denise F; Hamlin, Lois; Chaboyer, Wendy

2012-01-01

This paper describes the development and validation of the Perceived Perioperative Competence Scale-Revised (PPCS-R). There is a lack of psychometrically sound, tested self-assessment tools to measure nurses' perceived competence in the operating room. Content validity was established by a panel of international experts and the original 98-item scale was pilot tested with 345 nurses in Queensland, Australia. Following the removal of several items, a national sample that included all 3209 nurses who were members of the Australian College of Operating Room Nurses was surveyed using the 94-item version. Psychometric testing assessed content validity using exploratory factor analysis, internal consistency using Cronbach's alpha, and construct validity using the "known groups" technique. During item reduction, several preliminary factor analyses were performed on two random halves of the sample (n=550). Usable data for psychometric assessment were obtained from 1122 nurses. The original 94-item scale was reduced to 40 items. The final factor analysis using the entire sample resulted in a 40-item six-factor solution. Cronbach's alpha for the 40-item scale was .96. Construct validation demonstrated significant differences (p […]) in perceived competence scores relative to years of operating room experience and receipt of specialty education. On the basis of these results, the psychometric properties of the PPCS-R were considered encouraging. Further testing of the tool in different samples of operating room nurses is necessary to enable cross-cultural comparisons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Psychometric evaluation of the 10-item Short Opiate Withdrawal Scale-Gossop (SOWS-Gossop) in patients undergoing opioid detoxification.

Science.gov (United States)

Vernon, Margaret K; Reinders, Stefan; Mannix, Sally; Gullo, Kristen; Gorodetzky, Charles W; Clinch, Thomas

2016-09-01

The Short Opiate Withdrawal Scale (SOWS)-Gossop is a 10-item questionnaire developed to evaluate opioid withdrawal symptom severity. The scale was derived from the original 32-item Opiate Withdrawal Scale in order to reduce redundancy while providing an equally sensitive measure of opioid withdrawal symptom severity appropriate for research and clinical practice. The objective of this study was to examine the psychometric properties and provide score interpretation guidelines for the SOWS-Gossop 10-item version. Blinded, pooled data from two trials assessing the efficacy of lofexidine hydrochloride in reducing withdrawal symptoms in patients undergoing opioid detoxification were used to evaluate the quantitative psychometric properties and score interpretation of the SOWS-Gossop. Five hundred fifty-five (N=555) observations were available at baseline with numbers decreasing to n=213 at day 7. Mean (standard deviation) SOWS-Gossop scores were 10.4 (6.86) at baseline, 8.7 (6.49) on day 1, 10.5 (7.21) on day 2, and 3.1 (3.95) on day 7. Confirmatory factor analysis indicated that the SOWS-Gossop items loaded on a single factor consistent with a single total score. Intra-class correlations (95% confidence interval) were 0.78 (0.70-0.85) between baseline and day 1, 0.84 (0.79-0.89) between days 4 and 5, and 0.88 (0.83-0.91) between days 6 and 7, demonstrating good test-retest reliability. Mean SOWS-Gossop scores varied significantly (p […]). […] opioid withdrawal and has excellent psychometric properties. The SOWS-Gossop is an appropriate, precise, and sensitive measure to evaluate the symptoms of acute opioid withdrawal in research or clinical settings. Copyright © 2016 Elsevier Ltd. All rights reserved.
Data-driven efficient score tests for deconvolution hypotheses

NARCIS (Netherlands)

Langovoy, M.

2008-01-01

We consider testing statistical hypotheses about densities of signals in deconvolution models. A new approach to this problem is proposed. We constructed score tests for the deconvolution density testing with the known noise density and efficient score tests for the case of unknown density. The
Psychometric properties of the Fagerström Test for Nicotine Dependence As propriedades psicométricas do Teste de Fagerström para Dependência de Nicotina

Directory of Open Access Journals (Sweden)

Izilda Carolina de Meneses-Gaya

2009-01-01

OBJECTIVE: The Fagerström Test for Nicotine Dependence (FTND) is a screening instrument for physical nicotine dependence and is extensively used in various countries. The objective of the present report was to review articles related to the psychometric properties of the FTND. METHODS: A systematic search for articles published up through December of 2007 was carried out in various electronic databases. The following search terms were used: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; and "factors". We included articles published in English, Spanish or Portuguese in which the psychometric properties of the FTND were evaluated. RESULTS: Twenty-six studies related to the psychometric properties of the FTND were identified in the indexed literature. Analysis of the studies confirmed the reliability of the FTND for the assessment of nicotine dependence in different settings and populations. CONCLUSIONS: Further validation studies using previously validated instruments as a comparative measure are needed before the extensive use of the FTND can be justified on the basis of its psychometric qualities.
Bayesian psychometric scaling

NARCIS (Netherlands)

Fox, Gerardus J.A.; van den Berg, Stéphanie Martine; Veldkamp, Bernard P.; Irwing, P.; Booth, T.; Hughes, D.

2015-01-01

In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item
Psychometric properties of the Danish MCMI-I translation

DEFF Research Database (Denmark)

Mortensen, E L; Simonsen, E

1990-01-01

A translation of the MCMI-I has been in use in Denmark for some years. An untested assumption in the interpretation of the pattern of test results is that the psychometric characteristics of the Danish and American versions are similar. The purpose of this study was to evaluate the psychometric properties of the questionnaire by using traditional psychometric analysis techniques on the results of a sample consisting of 423 patients and 179 normal controls. Coefficient alpha was calculated for the 20 clinical subscales of the test and the Danish results were strikingly similar to the original coefficients reported by Millon. Furthermore, factor analysis of the subscales showed a factor structure very similar to American findings, and it is concluded that the psychometric properties of the Danish MCMI are not significantly different from the original....
The new Intragroup Conflict Scale: testing and psychometric properties.

Science.gov (United States)

Cox, Kathleen B

2014-01-01

The importance of healthy work environments has received attention. Health care organizations are plagued with conflict which is detrimental to work environments. Thus, conflict must be studied. The purpose of this article is to describe the testing of a measure of conflict. A survey was used to evaluate the psychometric properties. The sample consisted of 430 nurses at an academic medical center. Using principal component analysis (PCA) with varimax rotation, a six-factor solution (30 items) that explained 74.3% of variance emerged. Coefficient alpha ranged from .81 to .95. Correlations with existing scales supported construct validity (r = -.32 to -.58). The results are encouraging. Use of the scale may provide insight into the impact of conflict on patient, staff, and organizational outcomes.
Development and psychometric testing of a new geriatric spiritual well-being scale.

Science.gov (United States)

Dunn, Karen S

2008-09-01

Aims and objectives. Assess the psychometric properties of a new geriatric spiritual well-being scale (GSWS), specifically designed for older adults. Background. Religiosity and spiritual wellness must be measured as two distinct concepts to prevent confounding them as synonymous among atheist and agnostic populations. Design. A test-retest survey design was used to estimate the psychometric properties. Methods. A convenience sample of 138 community-dwelling older adults was drawn from the inner city of Detroit. Data were collected using telephone survey interviews. Data analyses included descriptive statistics, structural equation modelling, reliability analyses, and point-biserial correlations. Results. The factorial validity of the proposed model was not supported by the data. Fit indices were χ² = 185.98, d.f. = 98, P atheists have spiritual needs that do not include religious beliefs or practices. Thus, assessing patients' religious beliefs and practices prior to assessing spiritual well-being is essential to prevent bias. © 2008 The Author. Journal compilation © 2008 Blackwell Publishing Ltd.
Psychometric characteristics and dimensionality of a Persian version of Rosenberg Self-esteem Scale.

Science.gov (United States)

Shapurian, R; Hojat, M; Nayerahmadi, H

1987-08-01

The Rosenberg Self-esteem scale was translated into Persian and 12 Iranian bilingual judges confirmed the soundness of translation. The psychometric properties of the Persian version of Rosenberg Self-esteem Scale were studied in two samples of Iranian college students separately. Sample I consisted of 232 Iranian students in American universities, and Sample II comprised 305 Iranian students in Iranian universities. Criterion measures of loneliness, depression, anxiety, neuroticism, psychoticism, misanthropy, locus of control, tendency to dissimulate, and measures of relationship with parents, peers, and academic achievement were obtained. Item-total score correlations and alpha reliabilities supported the internal consistency of the scale. Test-retest reliabilities indicated the stability of the scores, and correlations between scores of the scale, and criterion measures supported the concurrent validity of the Rosenberg scale. Factor analysis of the Rosenberg scores confirmed the unidimensionality of the scale.
Psychometric properties and construct validity of the Muscle Appearance Satisfaction Scale among Hungarian men.

Science.gov (United States)

Babusa, Bernadett; Urbán, Róbert; Czeglédi, Edit; Túry, Ferenc

2012-01-01

Limited studies have evaluated the psychometric properties of the Muscle Appearance Satisfaction Scale (MASS), a measure of muscle dysmorphia, in different cultures and languages. The aims were to examine the psychometric properties of the Hungarian version of the MASS (MASS-HU), and to investigate its relationship with self-esteem and exercise-related variables. Two independent samples of male weight lifters (ns=289 and 43), and a sample of undergraduates (n=240) completed the MASS, Eating Disorder Inventory, and Rosenberg Self-esteem Scale. Exploratory factor analysis supported the original five-factor structure of the MASS only in the weight lifter sample. The MASS-HU had excellent scale score reliability and good test-retest reliability. The construct validity of the MASS-HU was tested with multivariate regression analyses which indicated an inverse relationship between self-esteem and muscle dysmorphia. The 18-item MASS-HU was found to be a useful measure for the assessment of muscle dysmorphia among male weight lifters. Copyright © 2011 Elsevier Ltd. All rights reserved.

Workplace nutrition knowledge questionnaire: psychometric validation and application.

Science.gov (United States)

Guadagnin, Simone C; Nakano, Eduardo Y; Dutra, Eliane S; de Carvalho, Kênia M B; Ito, Marina K

2016-11-01

Workplace dietary intervention studies in low- and middle-income countries using psychometrically sound measures are scarce. This study aimed to validate a nutrition knowledge questionnaire (NQ) and its utility in evaluating the changes in knowledge among participants of a Nutrition Education Program (NEP) conducted at the workplace. A NQ was tested for construct validity, internal consistency and discriminant validity. It was applied in a NEP conducted at six workplaces, in order to evaluate the effect of an interactive or a lecture-based education programme on nutrition knowledge. Four knowledge domains comprising twenty-three items were extracted in the final version of the NQ. Internal consistency of each domain was significant, with Kuder-Richardson formula values > 0.60. These four domains presented a good fit in the confirmatory factor analysis. In the discriminant validity test, both the Expert and Lay groups scored > 0.52, but the Expert group scores were significantly higher than those of the Lay group in all domains. When the NQ was applied in the NEP, the overall questionnaire scores increased significantly because of the NEP intervention, in both groups (Pnutrition knowledge among participants of NEP at the workplace. According to the NQ, an interactive nutrition education had a higher impact on nutrition knowledge than a lecture programme.
Psychometric Properties of Questionnaires on Functional Health Status in Oropharyngeal Dysphagia: A Systematic Literature Review

Science.gov (United States)

Speyer, Renée; Cordier, Reinie; Kertscher, Berit; Heijnen, Bas J

2014-01-01

Introduction. Questionnaires on Functional Health Status (FHS) are part of the assessment of oropharyngeal dysphagia. Objective. To conduct a systematic review of the literature on the psychometric properties of English-language FHS questionnaires in adults with oropharyngeal dysphagia. Methods. A systematic search was performed using the electronic databases Pubmed and Embase. The psychometric properties of the questionnaires were determined based on the COSMIN taxonomy of measurement properties and definitions for health-related patient-reported outcomes and the COSMIN checklist using preset psychometric criteria. Results. Three questionnaires were included: the Eating Assessment Tool (EAT-10), the Swallowing Outcome after Laryngectomy (SOAL), and the Self-report Symptom Inventory. The Sydney Swallow Questionnaire (SSQ) proved to be identical to the Modified Self-report Symptom Inventory. All FHS questionnaires obtained poor overall methodological quality scores for most measurement properties. Conclusions. The retrieved FHS questionnaires need psychometric reevaluation; if the overall methodological quality shows satisfactory improvement on most measurement properties, the use of the questionnaires in daily clinic and research can be justified. However, in case of insufficient validity and/or reliability scores, new FHS questionnaires need to be developed using and reporting on preestablished psychometric criteria as recommended in literature. PMID:24877095
Development and psychometric testing of a new instrument to measure the caring behaviour of nurses in Italian acute care settings.

Science.gov (United States)

Piredda, Michela; Ghezzi, Valerio; Fenizia, Elisa; Marchetti, Anna; Petitti, Tommasangelo; De Marinis, Maria Grazia; Sili, Alessandro

2017-12-01

To develop and psychometrically test the Italian-language Nurse Caring Behaviours Scale, a short measure of nurse caring behaviour as perceived by inpatients. Patient perceptions of nurses' caring behaviours are a predictor of care quality. Caring behaviours are culture-specific, but no measure of patient perceptions has previously been developed in Italy. Moreover, existing tools show unclear psychometric properties, are burdensome for respondents, or are not widely applicable. Instrument development and psychometric testing. Item generation included identifying and adapting items from existing measures of caring behaviours as perceived by patients. A pool of 28 items was evaluated for face validity. Content validity indexes were calculated for the resulting 15-item scale; acceptability and clarity were pilot tested with 50 patients. To assess construct validity, a sample of 2,001 consecutive adult patients admitted to a hospital in 2014 completed the scale and was split into two groups. Reliability was evaluated using nonlinear structural equation modelling coefficients. Measurement invariance was tested across subsamples. Item 15 loaded poorly in the exploratory factor analysis (n = 983) and was excluded from the final solution, positing a single latent variable with 14 indicators. This model fitted the data moderately. The confirmatory factor analysis (n = 1018) returned similar results. Internal consistency was excellent in both subsamples. Full scalar invariance was reached, and no significant latent mean differences were detected across subsamples. The new instrument shows reasonable psychometric properties and is a promising short and widely applicable measure of inpatient perceptions of nurse caring behaviours. © 2017 John Wiley & Sons Ltd.
Psychometric properties of a pictorial scale measuring correct condom use.

Science.gov (United States)

Li, Qing; Li, Xiaoming; Stanton, Bonita; Wang, Bo

2011-02-01

This study was designed to assess the psychometric properties of a pictorial scale of correct condom use (PSCCU) using data from female sex workers (FSWs) in China. The psychometric properties assessed in this study include construct validity by correlations and known-group validation. The study sample included 396 FSWs in Guangxi, China. The results demonstrate adequate validity of the PSCCU among the study population. FSWs with a higher level of education scored significantly higher on the PSCCU than those with a lower level of education. FSWs who self-reported appropriate condom use with stable partners scored significantly higher on PSCCU than their counterparts. The PSCCU should provide HIV/STI prevention researchers and practitioners with a valid alternative assessment tool among high-risk populations, especially in resource-limited settings.
[The psychometric properties of the Proverb-Metaphor Test].

Science.gov (United States)

Szajer, Katarzyna; Karakuła, Hanna; Grzywa, Anna; Parnas, Josef; Perzyńska, Aneta; Zaborska, Anna; Pawezka, Justyna; Sekunda, Agnieszka; Piszczek, Rafał; Skórska, Małgorzata

2007-01-01

Abstract thinking is among the intellectual abilities at the highest level of evolutionary development; it makes operations such as classification, systematisation and comparison possible. The study analysed the psychometric properties of the Proverb-Metaphor Test (TPM), which has been used in German-speaking countries since 2001. The TPM underwent a process of translation, retranslation and adaptation for use in clinical conditions in Poland. The sample comprised 60 patients of the Department of Psychiatry, Medical University of Lublin, diagnosed with paranoid schizophrenia (according to ICD-10 criteria). The PANSS and TPM were administered to 15 patients at the beginning of hospitalisation (the first stage of the research) and to all patients during remission of symptoms (the second stage). The WAIS-R (PL) was used in the second stage. 1. The TPM is a reliable instrument with high criterion validity. 2. The evaluated test is a relatively homogeneous research tool. 3. Thanks to its simple construction and short administration time, the TPM is a practical method of evaluating abstract thinking. 4. The TPM may be a useful instrument for long-term prognosis.
Psychometrics of the chronic liver disease questionnaire for Southern Chinese patients with chronic hepatitis B virus infection

Science.gov (United States)

Lam, Elegance Ting Pui; Lam, Cindy Lo Kuen; Lai, Ching Lung; Yuen, Man Fung; Fong, Daniel Yee Tak

2009-01-01

AIM: To test the psychometric properties of a Chinese [(Hong Kong) HK] translation of the chronic liver disease questionnaire (CLDQ). METHODS: A Chinese (HK) translation of the CLDQ was developed by iterative translation and cognitive debriefing. It was then administered to 72 uncomplicated and 78 complicated chronic hepatitis B (CHB) patients in Hong Kong together with a structured questionnaire on service utilization, and the Chinese (HK) SF-36 Health Survey Version 2 (SF-36v2). RESULTS: Scaling success was ≥ 80% for all but three items. A new factor assessing sleep was found and items of two (Fatigue and Systemic Symptoms) subscales tended to load on the same factor. Internal consistency and test-retest reliabilities ranged from 0.58-0.90 for different subscales. Construct validity was confirmed by the expected correlations between the SF-36v2 Health Survey and CLDQ scores. Mean scores of CLDQ were significantly lower in complicated compared with uncomplicated CHB, supporting sensitivity in detecting differences between groups. CONCLUSION: The Chinese (HK) CLDQ is valid, reliable and sensitive for patients with CHB. Some modifications to the scaling structure might further improve its psychometric properties. PMID:19598306
Psychometric Evaluation of the Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF).

Science.gov (United States)

Gelhorn, Heather L; Roberts, Laurie J; Khandelwal, Nikhil; Revicki, Dennis A; DeRogatis, Leonard R; Dobs, Adrian; Hepp, Zsolt; Miller, Michael G

2017-08-01

The Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF) is a patient-reported outcome measurement designed to evaluate the symptoms of hypogonadism. The HIS-Q-SF is an abbreviated version including 17 items from the original 28-item HIS-Q. To conduct item analyses and reduction, evaluate the psychometric properties of the HIS-Q-SF, and provide guidance on score interpretation. A 12-week observational longitudinal study of hypogonadal men was conducted as part of the original HIS-Q psychometric evaluation. Participants completed the original HIS-Q every 2 weeks. Blood samples were collected to evaluate testosterone levels. Participants completed the Aging Male's Symptoms Scale, the International Index of Erectile Function, the Short Form-12, and the PROMIS Sexual Activity, Satisfaction with Sex Life, Sleep Disturbance, and Applied Cognition Scales (baseline and weeks 6 and 12). Clinicians completed the Clinical Global Impression of Severity and Change scales and a clinical form. Item performance was evaluated using descriptive statistics and Rasch analyses. Reliability (internal consistency and test-retest), validity (concurrent and known groups), and responsiveness were assessed. One hundred seventy-seven men participated (mean age = 54.1 years, range = 23-83). Similar to the full HIS-Q, the final abbreviated HIS-Q-SF instrument includes five domains (sexual, energy, sleep, cognition, and mood) with two sexual subdomains (libido and sexual function). For key domains, test-retest reliability was very good, and construct validity was good for all domains. Known-groups validity was demonstrated for all domain scores, subdomain scores, and total score based on the Clinical Global Impression-Severity. All domains and subdomains were responsive to change based on patient-rated anchor questions. The HIS-Q-SF could be a useful tool in clinical practice, epidemiologic studies, and other academic research settings. Careful consideration was given to the
D-Catch instrument : development and psychometric testing of a measurement instrument for nursing documentation in hospitals

NARCIS (Netherlands)

Paans, Wolter; Sermeus, Walter; Nieweg, Roos; van der Schans, Cees P.

AIM: This paper is a report of the development and testing of the psychometric properties of an instrument to measure the accuracy of nursing documentation in general hospitals. BACKGROUND: Little information is available about the accuracy of nursing documentation. None of the existing instruments
Sexual Self-Schema Scale for Women—Validation and Psychometric Properties of the Polish Version

Directory of Open Access Journals (Sweden)

Krzysztof Nowosielski, MD

2018-06-01

Introduction: The sexual self-schema is a part of a broader concept of the self that is believed to be crucial for intrapersonal and interpersonal sexual relationships. Aim: To develop and perform psychometric validation of the Polish version of the Sexual Self-Schema Scale for Women (SSSS-W-PL). Methods: 561 women 18 to 55 years old were included in the final analysis. Linguistic validation was performed in 4 steps in line with the MAPI Institute guidelines. Convergent validity was calculated using the Pearson r product-moment coefficient between different measures of sexuality (attitudes and experience, behavior, arousal, romantic relationship) and SSSS-W-PL total and factor scores. To test discriminant validity, we applied hierarchical regression analyses predicting the number of lifetime sexual partners, self-rating as a sexual person (1 item, “I feel sexually attractive”; on a 5-point Likert scale), and arousability, with independent variables being extraversion (Ten-Item Personality Inventory), self-esteem (Rosenberg Self-Esteem Scale), and the SSSS-W-PL (total and factor scores). Main Outcomes Measures: Sexual self-schema was measured by the SSSS-W-PL, whereas arousability was measured by the arousal/excitement scale of the Changes in Sexual Functioning Questionnaire. Results: The mean age of the study population was 29.0 ± 7.6 years. The final scale consisted of 24 adjectives grouped within 4 factors: romantic, passionate, direct, and embarrassed. The 4-factor model accounted for 39% of the variance. The Cronbach α was 0.74 for the SSSS-W-PL total score and 0.61 to 0.84 for individual factors. Test-retest reliability of the scale after 2- to 8-week intervals was 0.87 (95% CI = 0.82–0.86, P < .001). The increment variances were statistically significant and ranged from 3.8% to 11.6%. Conclusion: The analysis showed good psychometric properties and internal validity of the SSSS-W-PL. The SSSS-W-PL might be helpful in consulting and
A Human Capital Model of Educational Test Scores

DEFF Research Database (Denmark)

McIntosh, James; D. Munk, Martin

Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelated with observable parental attributes and, thus, are environmental rather than genetic in origin. We show that the test scores measure manifest or measured ability as it has evolved over the life of the respondent and is, thus, more a product of the human capital formation process than some latent or fundamental measure of pure cognitive ability. We find that variables which are not closely associated with traditional notions of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture, attitudes...
Analyzing Test-Taking Behavior: Decision Theory Meets Psychometric Theory.

Science.gov (United States)

Budescu, David V; Bo, Yuanchao

2015-12-01

We investigate the implications of penalizing incorrect answers to multiple-choice tests, from the perspective of both test-takers and test-makers. To do so, we use a model that combines a well-known item response theory model with prospect theory (Kahneman and Tversky, Prospect theory: An analysis of decision under risk, Econometrica 47:263-91, 1979). Our results reveal that when test-takers are fully informed of the scoring rule, the use of any penalty has detrimental effects for both test-takers (they are always penalized in excess, particularly those who are risk averse and loss averse) and test-makers (the bias of the estimated scores, as well as the variance and skewness of their distribution, increase as a function of the severity of the penalty).
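As a rough illustration of the scoring rule discussed in this abstract, the sketch below computes the expected gain from answering (rather than omitting) a multiple-choice item under a penalty for wrong answers. It assumes a risk-neutral test-taker and a correct answer worth one point; it is not the authors' combined item response theory and prospect theory model, and the function names are hypothetical.

```python
def expected_gain(p_correct: float, penalty: float) -> float:
    """Expected score change from answering instead of omitting an item.

    Assumes +1 for a correct answer, -penalty for a wrong one, 0 for an omit.
    """
    return p_correct * 1.0 - (1.0 - p_correct) * penalty


def answer_threshold(penalty: float) -> float:
    """Smallest probability of being correct at which answering breaks even
    for a risk-neutral test-taker (illustrative assumption)."""
    return penalty / (1.0 + penalty)


if __name__ == "__main__":
    k = 4                      # number of response options (example value)
    penalty = 1.0 / (k - 1)    # classical "formula scoring" penalty
    print(expected_gain(1.0 / k, penalty))  # blind guessing breaks even
    print(answer_threshold(penalty))
```

With the classical penalty of 1/(k - 1), blind guessing has zero expected gain, so under these assumptions any partial knowledge makes answering worthwhile. The abstract's point is that risk-averse and loss-averse test-takers depart from this risk-neutral baseline.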
Development, Testing, and Psychometric Qualities of the Nash Duty to Care Scale for Disaster Response.

Science.gov (United States)

Nash, Tracy Jeanne

2017-08-01

Although nurses struggle with the decision to report for work during disaster events, there are no instruments to measure nurses' duty to care for disaster situations. The purpose of this study was to describe the development, testing, and psychometric qualities of the Nash Duty to Care Scale. A convenience sample of 409 registered nurses were recruited from 3 universities in the United States. Exploratory factor analysis resulted in a 19-item, 4-factor model explaining 67.34% of the variance. Internal consistency reliability was supported by Cronbach's alpha ranging from .81 to .91 for the 4-factor subscales and .92 for the total scale. The psychometrically sound instrument for measuring nurses' perceived duty to care for disasters is applicable to contemporary nursing practice, institutional disaster management plans, and patient health outcomes worldwide.
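Several records in this listing report Cronbach's alpha as the internal consistency statistic. A minimal sketch of the standard formula, alpha = k/(k - 1) × (1 - sum of item variances / variance of total scores), is given below. The data are made up for illustration and the function name is hypothetical; none of it comes from the studies listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for items and for total scores.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    items = list(zip(*scores))                      # one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


if __name__ == "__main__":
    # Four hypothetical respondents answering three items.
    data = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 5, 4]]
    print(round(cronbach_alpha(data), 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as evidence of internal consistency rather than of unidimensionality.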
Psychometric Quality of the Dutch Version of the Children's Eating Attitude Test in a Community Sample and a Sample of Overweight Youngsters

Directory of Open Access Journals (Sweden)

Lotte Theuwis

2010-12-01

Introduction. Disturbed eating attitudes may be important precursors of pathological eating patterns and therefore need to be researched adequately. The Children's Eating Attitude Test (ChEAT) is indicated for detecting at-risk attitudes and concerns in youngsters. Method. The present study was designed to provide a preliminary psychometric evaluation of the Dutch version of the ChEAT, by examining reliability and validity in a sample of 166 youngsters. Results. Generally the ChEAT seems to be a reliable instrument. Concurrent validity was demonstrated by positive correlations with measures assessing pathological eating behaviour and with related psychological problems. The discriminant validity was good. Based on ChEAT scores we can distinguish overweight youngsters from the community sample and “dieters” from “non dieters”. Divergent validity and factor structure still show shortcomings. Discussion. The Dutch version of the ChEAT seems to be a promising screening and research instrument. Future prospective research could focus on a cut-off score for identifying at-risk youngsters.
Psychometric properties of a Swedish translation of the VISA-P outcome score for patellar tendinopathy.

Science.gov (United States)

Frohm, Anna; Saartok, Tönu; Edman, Gunnar; Renström, Per

2004-12-18

Self-administered patient outcome scores are increasingly recommended for evaluation of primary outcome in clinical studies. The VISA-P score, developed at the Victorian Institute of Sport Assessment in Melbourne, Australia, is a questionnaire developed for patients with patellar tendinopathy and the patients assess severity of symptoms, function and ability to participate in sport. The aim of this study was to translate the questionnaire into Swedish and to study the reliability and validity of the translated questionnaire and resultant scores. The questionnaire was translated into Swedish according to internationally recommended guidelines for cross-cultural adaptation of self-report measures. The reliability and validity were tested in three different populations. The populations used were healthy students (n = 17), members of the Swedish male national basketball team (n = 17), considered as a population at risk, and a group of non-surgically treated patients (n = 17) with clinically diagnosed patellar tendinopathy. The questionnaire was completed by 51 subjects altogether. The translated VISA-P questionnaire showed very good test-retest reliability (ICC = 0.97). The mean (± SD) of the VISA-P score, at both the first and second test occasions, was highest in the healthy student group, 83 (± 13) and 81 (± 15), respectively. The score of the basketball players was 79 (± 24) and 80 (± 23), while the patient group scored significantly (p < 0.05) lower, 48 (± 20) and 52 (± 19). The translated version of the VISA-P questionnaire was linguistically and culturally equivalent to the original version. The translated score showed good reliability.
Psychometric properties of a Swedish translation of the VISA-P outcome score for patellar tendinopathy

Directory of Open Access Journals (Sweden)

Edman Gunnar

2004-12-01

Background: Self-administered patient outcome scores are increasingly recommended for evaluation of primary outcome in clinical studies. The VISA-P score, developed at the Victorian Institute of Sport Assessment in Melbourne, Australia, is a questionnaire developed for patients with patellar tendinopathy and the patients assess severity of symptoms, function and ability to participate in sport. The aim of this study was to translate the questionnaire into Swedish and to study the reliability and validity of the translated questionnaire and resultant scores. Methods: The questionnaire was translated into Swedish according to internationally recommended guidelines for cross-cultural adaptation of self-report measures. The reliability and validity were tested in three different populations. The populations used were healthy students (n = 17), members of the Swedish male national basketball team (n = 17), considered as a population at risk, and a group of non-surgically treated patients (n = 17) with clinically diagnosed patellar tendinopathy. The questionnaire was completed by 51 subjects altogether. Results: The translated VISA-P questionnaire showed very good test-retest reliability (ICC = 0.97). The mean (± SD) of the VISA-P score, at both the first and second test occasions, was highest in the healthy student group, 83 (± 13) and 81 (± 15), respectively. The score of the basketball players was 79 (± 24) and 80 (± 23), while the patient group scored significantly (p < 0.05) lower, 48 (± 20) and 52 (± 19). Conclusions: The translated version of the VISA-P questionnaire was linguistically and culturally equivalent to the original version. The translated score showed good reliability.
Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

Science.gov (United States)

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

2017-01-01

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…
The standardised copy of pentagons test

Directory of Open Access Journals (Sweden)

Terzoglou Vassiliki A

2011-04-01

Background: The 'double-diamond copy' task is a simple paper and pencil test that is part of the Bender-Gestalt Test and the Mini Mental State Examination (MMSE). Although it is a widely used test, its method of scoring is crude and its psychometric properties are not adequately known. The aim of the present study was to develop a sensitive and reliable method of administration and scoring. Methods: The study sample included 93 normal control subjects (53 women and 40 men, aged 35.87 ± 12.62) and 127 patients suffering from schizophrenia (54 women and 73 men, aged 34.07 ± 9.83). Results: The scoring method was based on the frequencies of responses of healthy controls and proved to be relatively reliable, with Cronbach's α equal to 0.61, test-retest correlation coefficient equal to 0.41 and inter-rater reliability equal to 0.52. The factor analysis produced two indices and six subscales of the Standardised Copy of Pentagons Test (SCPT). The total score as well as most of the individual items and subscales distinguished between controls and patients. The discriminant function correctly classified 63.44% of controls and 75.59% of patients. Discussion: The SCPT seems to be a satisfactory, reliable and valid instrument, which is easy to administer, suitable for use in non-organic psychiatric patients and demands minimal time. Further research is necessary to test its psychometric properties and its usefulness and applications as a neuropsychological test.
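The two classification rates in this abstract can be combined into an overall accuracy using the reported group sizes. The short sketch below does that arithmetic. The per-group counts are inferred by rounding the published percentages and are an assumption, not figures stated in the abstract; the helper name is hypothetical.

```python
def recover_count(percent: float, n: int) -> int:
    """Infer how many of n cases a reported percentage corresponds to."""
    return round(percent / 100 * n)


controls_n, patients_n = 93, 127                 # group sizes from the abstract
controls_ok = recover_count(63.44, controls_n)   # inferred correct controls
patients_ok = recover_count(75.59, patients_n)   # inferred correct patients

# Overall accuracy weights each group by its size.
overall = (controls_ok + patients_ok) / (controls_n + patients_n)

if __name__ == "__main__":
    print(controls_ok, patients_ok, round(100 * overall, 2))
```

Because the patient group is larger, the overall figure sits closer to the patient classification rate than a simple average of the two percentages would suggest.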
Modeling the Test-Taking Motivation Construct through Investigation of Psychometric Properties of an Expectancy-Value-Based Questionnaire

Science.gov (United States)

Knekta, Eva; Eklöf, Hanna

2015-01-01

The aim of this study was to evaluate the psychometric properties of an expectancy-value-based questionnaire measuring five aspects of test-taking motivation (effort, expectancies, importance, interest, and test anxiety). The questionnaire was distributed to a sample of Swedish Grade 9 students taking a low-stakes (n = 1,047) or a high-stakes (n =…
The Youth Psychopathic Traits Inventory: Measurement Invariance and Psychometric Properties among Portuguese Youths

Directory of Open Access Journals (Sweden)

Pedro Pechorro

2016-08-01

The aim of the present study was to examine the psychometric properties of the Youth Psychopathic Traits Inventory (YPI) among a mixed-gender sample of 782 Portuguese youth (M = 15.87 years; SD = 1.72), in a school context. Confirmatory factor analysis revealed the expected three-factor first-order structure. Cross-gender measurement invariance and cross-sample measurement invariance using a forensic sample of institutionalized males were also confirmed. The Portuguese version of the YPI demonstrated generally adequate psychometric properties of internal consistency, mean inter-item correlation, convergent validity, discriminant validity, and criterion-related validity of statistically significant associations with conduct disorder symptoms, alcohol abuse, drug use, and unprotected sex. In terms of known-groups validity, males scored higher than females, and males from the school sample scored lower than institutionalized males. The use of the YPI among the Portuguese male and female youth population is psychometrically justified, and it can be a useful measure to identify adolescents with high levels of psychopathic traits.
Development and Psychometric Testing of a Sexual Concerns Questionnaire for Kidney Transplant Recipients.

Science.gov (United States)

Muehrer, Rebecca J; Lanuza, Dorothy M; Brown, Roger L; Djamali, Arjang

2015-01-01

This study describes the development and psychometric testing of the Sexual Concerns Questionnaire (SCQ) in kidney transplant (KTx) recipients. Construct validity was assessed using the Kroonenberg and Lewis exploratory/confirmatory procedure and testing hypothesized relationships with established questionnaires. Configural and weak invariance were examined across gender, dialysis history, relationship status, and transplant type. Reliability was assessed with Cronbach's alpha, composite reliability, and test-retest reliability. Factor analysis resulted in a 7-factor solution and suggests good model fit. Construct validity was also supported by the tests of hypothesized relationships. Configural and weak invariance were supported for all subgroups. Reliability of the SCQ was also supported. Findings indicate the SCQ is a valid and reliable measure of KTx recipients' sexual concerns.

Measurement of ability emotional intelligence: results for two new tests.

Science.gov (United States)

Austin, Elizabeth J

2010-08-01

Emotional intelligence (EI) has attracted considerable interest amongst both individual differences researchers and those in other areas of psychology who are interested in how EI relates to criteria such as well-being and career success. Both trait (self-report) and ability EI measures have been developed; the focus of this paper is on ability EI. The associations of two new ability EI tests with psychometric intelligence, emotion perception, and the Mayer-Salovey-Caruso EI test (MSCEIT) were examined. The new EI tests were the Situational Test of Emotion Management (STEM) and the Situational Test of Emotional Understanding (STEU). Only the STEU and the MSCEIT Understanding Emotions branch were significantly correlated with psychometric intelligence, suggesting that only understanding emotions can be regarded as a candidate new intelligence component. These understanding emotions tests were also positively correlated with emotion perception tests, and STEM and STEU scores were positively correlated with MSCEIT total score and most branch scores. Neither the STEM nor the STEU were significantly correlated with trait EI tests, confirming the distinctness of trait and ability EI. Taking the present results as a starting-point, approaches to the development of new ability EI tests and models of EI are suggested.
The Truth about Scores Children Achieve on Tests.

Science.gov (United States)

Brown, Jonathan R.

1989-01-01

The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
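The abstract above does not give the calculation itself. A minimal sketch using the conventional classical-test-theory formula, SEM = SD × √(1 − reliability), is shown here; the score scale (SD = 15), the reliability (0.91) and the observed score of 100 are hypothetical values chosen for illustration, and the band is placed around the observed score as an approximation.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """Classical test theory SEM: SD * sqrt(1 - reliability)."""
    return sd * math.sqrt(1.0 - reliability)


def score_band(observed, sd, reliability, z=1.96):
    """Approximate band around an observed score (z = 1.96 for about 95%)."""
    sem = standard_error_of_measurement(sd, reliability)
    return observed - z * sem, observed + z * sem


# Hypothetical test with SD = 15 and reliability = 0.91.
sem = standard_error_of_measurement(15, 0.91)   # about 4.5 score points
low, high = score_band(100, 15, 0.91)           # about 91.2 to 108.8
```

Reporting the band rather than the single score makes the point of the article concrete: with these assumed values, two children scoring 98 and 103 cannot be distinguished reliably.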
Psychometric properties and convergent and predictive validity of an executive function test battery for two-year-olds

Directory of Open Access Journals (Sweden)

Hanna Mulder

2014-07-01

Executive function (EF) is an important predictor of numerous developmental outcomes, such as academic achievement and behavioral adjustment. Although a plethora of measurement instruments exists to assess executive function in children, only a few of these are suitable for toddlers, and even fewer have undergone psychometric evaluation. The present study evaluates the psychometric properties and validity of an assessment battery for measuring EF in two-year-olds. A sample of 2437 children were administered the assessment battery at a mean age of 2;4 years (SD = 0;3 years) in a large-scale field study. Measures of both hot EF (snack and gift delay tasks) and cool EF (six boxes, memory for location, and visual search task) were included. Confirmatory Factor Analyses showed that a two-factor hot and cool EF model fitted the data better than a one-factor model. Measurement invariance was supported across groups differing in age, gender, socioeconomic status (SES), home language, and test setting. Criterion and convergent validity were evaluated by examining relationships between EF and age, gender, SES, home language, and parent and teacher reports of children’s attention and inhibitory control. Predictive validity of the test battery was investigated by regressing children’s pre-academic skills and behavioral problems at age three on the latent hot and cool EF factors at age two years. The test battery showed satisfactory psychometric quality and criterion, convergent, and predictive validity. Whereas cool EF predicted both pre-academic skills and behavior problems one year later, hot EF predicted behavior problems only. These results show that EF can be assessed with psychometrically sound instruments in children as young as two years, and that EF tasks can be reliably applied in large scale field research. The current instruments offer new opportunities for investigating EF in early childhood, and for evaluating interventions targeted at improving
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

Science.gov (United States)

Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

2015-12-01

To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
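The abstract above reports SEM and SDC as pairs without stating how one was derived from the other. A minimal sketch, assuming the commonly used relation SDC = 1.96 × √2 × SEM (an assumption, not something the abstract confirms), reproduces the reported pairs to within rounding:

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level, assuming SDC = z * sqrt(2) * SEM.

    sqrt(2) reflects that a change score combines the error of two measurements.
    """
    return z * math.sqrt(2) * sem


sdc_psst = smallest_detectable_change(5.5)   # about 15.2 (abstract reports 15.3)
sdc_poss = smallest_detectable_change(6.8)   # about 18.8 (abstract reports 18.8)
```

The small gap for the PSST is what one would expect if the published SEM of 5.5 was itself rounded before the SDC was computed.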
A psychometric appraisal of the DREEM

Directory of Open Access Journals (Sweden)

Hammond Sean M

2012-01-01

Background: The quality of the educational environment is a key determinant of a student-centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation, use of a comprehensive, valid and reliable instrument is essential. One of the most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM). Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods: 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results: Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89), subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming's indices) shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions: There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the
A psychometric appraisal of the DREEM

LENUS (Irish Health Repository)

Hammond, Sean M

2012-01-12

Background: The quality of the educational environment is a key determinant of a student-centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation, use of a comprehensive, valid and reliable instrument is essential. One of the most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM). Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods: 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results: Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89), subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming's indices) shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions: There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the device in future
Evaluation of psychometric properties of Tinetti performance-oriented mobility assessment scale in subjects with knee osteoarthritis

OpenAIRE

Parveen, Huma; Noohu, Majumi M.

2017-01-01

Objective: The objective of this study was to determine the psychometric properties of the Tinetti Performance-Oriented Mobility Assessment (POMA) scale to measure balance and gait impairments in individuals with knee osteoarthritis (OA). Methods: A convenient sample of 25 individuals with bilateral OA knee were recruited. The convergent validity was determined by correlation analysis between scores of Berg Balance Scale (BBS) with balance subscale (POMA-B) and the Timed Up and Go Test (TU...
The Psychometric Properties of Turkish Version of Depression Anxiety Stress Scale-21 (DASS-21) in Community and Clinical Samples

Directory of Open Access Journals (Sweden)

Hakan SARICAM

2018-04-01

This paper presents the Turkish version of the Depression Anxiety Stress Scale-21 (DASS-21) in community and clinical samples and examines its psychometric properties. Construct validity and concurrent validity were examined in the validity studies. The Depression Anxiety Stress Scale-42 (DASS-42) was used for concurrent validity. In the reliability analysis, the instrument's internal consistency and test-retest reliability were studied. Results of exploratory factor analyses demonstrated that the 21 items yielded three factors. Results of confirmatory factor analyses for the three-dimensional model showed acceptable fit index values in the community sample and perfect fit index values in the clinical sample. Factor loadings ranged from .42 to .72. For concurrent validity, significant positive relationships were found between the DASS-42 and the DASS-21. The Cronbach alpha internal consistency coefficient was α = .87 for the depression sub-scale, α = .85 for the anxiety sub-scale and α = .81 for the stress sub-scale in the clinical sample. Moreover, the test-retest reliability coefficient was r = .68 for the depression sub-scale, r = .66 for the anxiety sub-scale and r = .61 for the stress sub-scale in the community sample, and corrected item-total correlations ranged from .43 to .77 in the clinical sample. In the second study, the DASS-21 discriminated well between the patients (depression mean score = 10.83; anxiety mean score = 10.39; stress mean score = 11.85) and the healthy subjects (depression mean score = 5.88; anxiety mean score = 5.37; stress mean score = 7.90) (U = 5310.50; 4748.50; 5562.50, p = 0.00). According to its psychometric properties, the DASS-21 is a reliable and valid instrument for the assessment of depression, anxiety, and stress levels. [JCBPR 2018; 7(1): 19-30]
Evaluating the Psychometric Quality of Social Skills Measures: A Systematic Review.

Science.gov (United States)

Cordier, Reinie; Speyer, Renée; Chen, Yu-Wei; Wilkes-Gillan, Sarah; Brown, Ted; Bourke-Taylor, Helen; Doma, Kenji; Leicht, Anthony

2015-01-01

Impairments in social functioning are associated with an array of adverse outcomes. Social skills measures are commonly used by health professionals to assess and plan the treatment of social skills difficulties. There is a need to comprehensively evaluate the quality of psychometric properties reported across these measures to guide assessment and treatment planning. To conduct a systematic review of the literature on the psychometric properties of social skills and behaviours measures for both children and adults. A systematic search was performed using four electronic databases: CINAHL, PsycINFO, Embase and Pubmed; the Health and Psychosocial Instruments database; and grey literature using PsycExtra and Google Scholar. The psychometric properties of the social skills measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Thirty-Six studies and nine manuals were included to assess the psychometric properties of thirteen social skills measures that met the inclusion criteria. Most measures obtained excellent overall methodological quality scores for internal consistency and reliability. However, eight measures did not report measurement error, nine measures did not report cross-cultural validity and eleven measures did not report criterion validity. The overall quality of the psychometric properties of most measures was satisfactory. The SSBS-2, HCSBS and PKBS-2 were the three measures with the most robust evidence of sound psychometric quality in at least seven of the eight psychometric properties that were appraised. A universal working definition of social functioning as an overarching construct is recommended. There is a need for ongoing research in the area of the psychometric properties of social skills and behaviours instruments.
Evaluating the Psychometric Quality of Social Skills Measures: A Systematic Review

Science.gov (United States)

Brown, Ted; Bourke-Taylor, Helen; Doma, Kenji; Leicht, Anthony

2015-01-01

Introduction: Impairments in social functioning are associated with an array of adverse outcomes. Social skills measures are commonly used by health professionals to assess and plan the treatment of social skills difficulties. There is a need to comprehensively evaluate the quality of psychometric properties reported across these measures to guide assessment and treatment planning. Objective: To conduct a systematic review of the literature on the psychometric properties of social skills and behaviours measures for both children and adults. Methods: A systematic search was performed using four electronic databases: CINAHL, PsycINFO, Embase and Pubmed; the Health and Psychosocial Instruments database; and grey literature using PsycExtra and Google Scholar. The psychometric properties of the social skills measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Results: Thirty-Six studies and nine manuals were included to assess the psychometric properties of thirteen social skills measures that met the inclusion criteria. Most measures obtained excellent overall methodological quality scores for internal consistency and reliability. However, eight measures did not report measurement error, nine measures did not report cross-cultural validity and eleven measures did not report criterion validity. Conclusions: The overall quality of the psychometric properties of most measures was satisfactory. The SSBS-2, HCSBS and PKBS-2 were the three measures with the most robust evidence of sound psychometric quality in at least seven of the eight psychometric properties that were appraised. A universal working definition of social functioning as an overarching construct is recommended. There is a need for ongoing research in the area of the psychometric properties of social skills and behaviours instruments. PMID: 26151362
Evaluating the Psychometric Quality of Social Skills Measures: A Systematic Review.

Directory of Open Access Journals (Sweden)

Reinie Cordier

Impairments in social functioning are associated with an array of adverse outcomes. Social skills measures are commonly used by health professionals to assess and plan the treatment of social skills difficulties. There is a need to comprehensively evaluate the quality of psychometric properties reported across these measures to guide assessment and treatment planning. To conduct a systematic review of the literature on the psychometric properties of social skills and behaviours measures for both children and adults. A systematic search was performed using four electronic databases: CINAHL, PsycINFO, Embase and Pubmed; the Health and Psychosocial Instruments database; and grey literature using PsycExtra and Google Scholar. The psychometric properties of the social skills measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Thirty-Six studies and nine manuals were included to assess the psychometric properties of thirteen social skills measures that met the inclusion criteria. Most measures obtained excellent overall methodological quality scores for internal consistency and reliability. However, eight measures did not report measurement error, nine measures did not report cross-cultural validity and eleven measures did not report criterion validity. The overall quality of the psychometric properties of most measures was satisfactory. The SSBS-2, HCSBS and PKBS-2 were the three measures with the most robust evidence of sound psychometric quality in at least seven of the eight psychometric properties that were appraised. A universal working definition of social functioning as an overarching construct is recommended. There is a need for ongoing research in the area of the psychometric properties of social skills and behaviours instruments.
Herth hope index: psychometric testing of the Chinese version.

Science.gov (United States)

Chan, Keung Sum; Li, Ho Cheung William; Chan, Sally Wai-Chi; Lopez, Violeta

2012-09-01

This article is a report on psychometric testing of the Chinese version of the Herth Hope Index. The availability of a valid and reliable instrument that accurately measures the level of hope in patients with heart failure is crucial before any hope-enhancing interventions can be appropriately planned and evaluated. There is no such instrument for Chinese people. A test-retest, within-subjects design was used. A purposive sample of 120 Hong Kong Chinese patients with heart failure between the ages of 60 and 80 years admitted to two medical wards was recruited during an 8-month period in 2009. Participants were asked to respond to the Chinese version of the Herth Hope Index, the Hamilton Depression Rating Scale and Rosenberg's Self-Esteem Scale. The internal consistency, content validity, construct validity and test-retest reliability of the Chinese version of the Herth Hope Index were assessed. The newly translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly translated scale can be used as a self-report assessment tool in assessing the level of hope in Hong Kong Chinese patients with heart failure. © 2011 Blackwell Publishing Ltd.
Conditional standard errors of measurement for composite scores on the Wechsler Preschool and Primary Scale of Intelligence-Third Edition.

Science.gov (United States)

Price, Larry R; Raju, Nambury; Lurie, Anna; Wilkins, Charles; Zhu, Jianjun

2006-02-01

A specific recommendation of the 1999 Standards for Educational and Psychological Testing by the American Educational Research Association, the American Psychological Association, and the National Council on Measurement in Education is that test publishers report estimates of the conditional standard error of measurement (SEM). Procedures for calculating the conditional (score-level) SEM based on raw scores are well documented; however, few procedures have been developed for estimating the conditional SEM of subtest or composite scale scores resulting from a nonlinear transformation. Item response theory provided the psychometric foundation to derive the conditional standard errors of measurement and confidence intervals for composite scores on the Wechsler Preschool and Primary Scale of Intelligence-Third Edition.
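To make the idea of a conditional (score-level) SEM concrete, the sketch below uses the standard item response theory result that the standard error at ability θ is 1/√I(θ), where I(θ) is the test information. It assumes a two-parameter logistic (2PL) model with hypothetical item parameters; the abstract does not say which IRT model was used, and the additional steps needed to carry the error through a nonlinear transformation to composite scale scores are not shown.

```python
import math


def p_2pl(theta, a, b):
    """Probability of a correct response under a 2PL model (assumed here)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def conditional_sem(theta, items):
    """Conditional SEM on the theta scale: 1 / sqrt(test information).

    items is a list of (a, b) pairs: discrimination and difficulty.
    """
    info = 0.0
    for a, b in items:
        p = p_2pl(theta, a, b)
        info += a * a * p * (1.0 - p)     # 2PL item information
    return 1.0 / math.sqrt(info)


# Hypothetical four-item test, all items centred at theta = 0.
items = [(1.0, 0.0)] * 4
sem_center = conditional_sem(0.0, items)   # information is highest here
sem_tail = conditional_sem(3.0, items)     # larger error far from the items
```

The point of the conditional SEM is visible in the two calls: precision is not constant across the score range, so a single test-wide SEM understates error for examinees far from where the items are targeted.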
Time perspective in hereditary cancer: psychometric properties of a short form of the Zimbardo Time Perspective Inventory in a community and clinical sample.

Science.gov (United States)

Wakefield, Claire E; Homewood, Judi; Taylor, Alan; Mahmut, Mehmet; Meiser, Bettina

2010-10-01

We aimed to assess the psychometric properties of a 25-item short form of the Zimbardo Time Perspective Inventory in a community sample (N = 276) and in individuals with a strong family history of cancer, considering genetic testing for cancer risk (N = 338). In the community sample, individuals with high past-negative or present-fatalistic scores had higher levels of distress, as measured by depression, anxiety, and aggression. Similarly, in the patient sample, past-negative time perspective was positively correlated with distress, uncertainty, and postdecision regret when making a decision about genetic testing. Past-negative-oriented individuals were also more likely to be undecided about, or against, genetic testing. Hedonism was associated with being less likely to read the educational materials they received at their clinic, and fatalism was associated with having lower knowledge levels about genetic testing. The assessment of time perspective in individuals at increased risk of cancer can provide valuable clinical insights. However, further investigation of the psychometric properties of the short form of this scale is warranted, as it did not meet the currently accepted criteria for psychometric validation studies.
Psychometric properties of the Spanish version of the mindful attention awareness scale (MAAS) in patients with fibromyalgia

Directory of Open Access Journals (Sweden)

Cebolla Ausias

2013-01-01

Background: Mindfulness-based interventions improve functioning and quality of life in fibromyalgia (FM) patients. The aim of the study is to perform a psychometric analysis of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in a sample of patients diagnosed with FM. Methods: The following measures were administered to 251 Spanish patients with FM: the Spanish version of MAAS, the Chronic Pain Acceptance Questionnaire, the Pain Catastrophising Scale, the Injustice Experience Questionnaire, the Psychological Inflexibility in Pain Scale, the Fibromyalgia Impact Questionnaire and the Euroqol. Factorial structure was analysed using Confirmatory Factor Analyses (CFA). Cronbach's α coefficient was calculated to examine internal consistency, and the intraclass correlation coefficient (ICC) was calculated to assess the test-retest reliability of the measures. Pearson’s correlation tests were run to evaluate univariate relationships between scores on the MAAS and criterion variables. Results: The MAAS scores in our sample were low (M = 56.7; SD = 17.5). CFA confirmed a two-factor structure, with the following fit indices [sbX2 = 172.34 (p … Conclusion: Psychometric properties of the Spanish version of the MAAS in patients with FM are adequate. The dimensionality of the MAAS found in this sample and directions for future research are discussed.
Psychometric properties of the Turkish version of the Internet Addiction Test (IAT).

Science.gov (United States)

Boysan, Murat; Kuss, Daria J; Barut, Yaşar; Ayköse, Nafi; Güleç, Mustafa; Özdemir, Osman

2017-01-01

Of many instruments developed to assess Internet addiction, the Internet Addiction Test (IAT), an expanded version of the Internet Addiction Diagnostic Questionnaire (IADQ), has been the most widely used scale in English and non-English speaking populations. In this study, our aim was to investigate the psychometric properties of short and expanded versions of the IAT in a Turkish undergraduate sample. Overall, 455 undergraduate students from Turkey aged between 18 and 30 participated in the study (63.53% were females). Exploratory and confirmatory factor analytic procedures investigated factor structures of the IADQ and IAT. The Internet Addiction Scale (IAS), Coping Inventory for Stressful Situations (CISS), Obsessive Compulsive Inventory-Revised (OCI-R) and Dissociative Experiences Scale (DES) were administered to assess convergent and divergent validities of the IADQ and IAT. Internal consistency and 15-day test-retest reliability were computed. In the factorial analytic investigation, we found that a unidimensional factor structure for each measure fit the current data best. Significant but weak to moderate correlations of the IADQ and the IAT with the CISS, OCI-R and DES provided empirical evidence for divergent validity, whereas strong associations with the subscales of the IAS pointed to the convergent validity of Young's Internet addiction construct. Internal consistency of the IADQ was weak (α=0.67) and of the IAT was high (α=0.93). Temporal reliability of both instruments was very high (α=0.81 and α=0.87, respectively). The IAT revealed promising and sound psychometric properties in a Turkish sample. Copyright © 2015 Elsevier Ltd. All rights reserved.
Development and psychometric testing of the Nursing Workplace Relational Environment Scale (NWRES).

Science.gov (United States)

Duddle, Maree; Boughton, Maureen

2009-03-01

The aim of this study was to develop and test the psychometric properties of the Nursing Workplace Relational Environment Scale (NWRES). A positive relational environment in the workplace is characterised by a sense of connectedness and belonging, support and cooperation among colleagues, open communication and effectively managed conflict. A poor relational environment in the workplace may contribute to job dissatisfaction and early turnover of staff. Quantitative survey. A three-stage process was used to design and test the NWRES. In Stage 1, an extensive literature review was conducted on professional working relationships and the nursing work environment. Three key concepts (collegiality, workplace conflict and job satisfaction) were identified and defined. In Stage 2, a pool of items was developed from the dimensions of each concept and formulated into a 35-item scale which was piloted on a convenience sample of 31 nurses. In Stage 3, the newly refined 28-item scale was administered randomly to a convenience sample of 150 nurses. Psychometric testing was conducted to establish the construct validity and reliability of the scale. Exploratory factor analysis resulted in a 22-item scale. The factor analysis indicated a four-factor structure (collegial behaviours, relational atmosphere, outcomes of conflict and job satisfaction), which explained 68.12% of the total variance. Cronbach's alpha coefficient for the NWRES was 0.872 and the subscales ranged from 0.781 to 0.927. The results of the study confirm the reliability and validity of the NWRES. Replication of this study with a larger sample is indicated to determine relationships among the subscales. The results of this study have implications for health managers in terms of understanding the impact of the relational environment of the workplace on job satisfaction and retention.
The Italian version of the 16-item prodromal questionnaire (iPQ-16): Field-test and psychometric features.

Science.gov (United States)

Pelizza, Lorenzo; Azzali, Silvia; Paterlini, Federica; Garlassi, Sara; Scazza, Ilaria; Pupo, Simona; Raballo, Andrea

2018-03-20

Among current early screeners for psychosis-risk states, the Prodromal Questionnaire-16 items (PQ-16) is often used. We aimed to assess validity and reliability of the Italian version of the PQ-16 in a young adult help-seeking population. We included 154 individuals aged 18-35 years seeking help at the Reggio Emilia outpatient mental health services in a large semirural catchment area (550,000 inhabitants). Participants completed the Italian version of the PQ-16 (iPQ-16) and were subsequently evaluated with the Comprehensive Assessment of At-Risk Mental States (CAARMS). We examined diagnostic accuracy (i.e. specificity, sensitivity, negative and positive likelihood ratios, and negative and positive predictive values) and content, convergent, and concurrent validity between PQ-16 and CAARMS using Cronbach's alpha, Spearman's rho, and Cohen's kappa, respectively. We also tested the validity of the adopted PQ-16 cut-offs through Receiver Operating Characteristic (ROC) curves plotted against CAARMS diagnoses and the 1-year predictive validity of the PQ-16. The iPQ-16 showed high internal consistency and acceptable diagnostic accuracy and concurrent validity. ROC analyses pointed to a cut-off score of ≥5 as best cut-off. After 12 months of follow-up, 8.7% of participants with a PQ-16 symptom total score of ≥5 who were below the CAARMS psychosis threshold at the baseline developed a psychotic disorder. Psychometric properties of the iPQ-16 were satisfactory. Copyright © 2018. Published by Elsevier B.V.
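The diagnostic accuracy indices named in the abstract above all follow from a 2×2 table of screener result against reference diagnosis. The sketch below shows the standard definitions; the counts are hypothetical and are not the study's data, which the abstract does not report.

```python
def diagnostic_accuracy(tp, fp, fn, tn):
    """Standard accuracy indices from a 2x2 table.

    tp/fn: reference-positive cases screened positive/negative.
    fp/tn: reference-negative cases screened positive/negative.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),                        # positive predictive value
        "npv": tn / (tn + fn),                        # negative predictive value
        "lr_positive": sensitivity / (1 - specificity),
        "lr_negative": (1 - sensitivity) / specificity,
    }


# Hypothetical counts for a screener cut-off against a reference interview.
result = diagnostic_accuracy(tp=40, fp=20, fn=10, tn=80)
```

Repeating the calculation for each candidate cut-off gives the (1 − specificity, sensitivity) pairs that make up a ROC curve, which is how a "best" cut-off is usually compared against alternatives.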
78th Annual Meeting of the Psychometric Society

CERN Document Server

Bolt, Daniel; Ark, L; Wang, Wen-Chung

2015-01-01

The 78th Annual Meeting of the Psychometric Society (IMPS) builds on the Psychometric Society's mission to share quantitative methods relevant to psychology. The chapters of this volume present cutting-edge work in the field. Topics include studies of item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Additional psychometric topics relate to structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis, among others. The papers in this volume will be especially useful for researchers in the social sciences who use quantitative methods. Prior knowledge of statistical methods is recommended. The 78th annual meeting took place in Arnhem, The Netherlands between July 22nd and 26th, 2013. The previous volume to showcase work from the Psychometric Society’s Meeting is New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 201...
Psychometric properties of the PTSD Checklist for Diagnostic and Statistical Manual of Mental Disorders-Fifth Edition (PCL-5) in veterans.

Science.gov (United States)

Bovin, Michelle J; Marx, Brian P; Weathers, Frank W; Gallagher, Matthew W; Rodriguez, Paola; Schnurr, Paula P; Keane, Terence M

2016-11-01

This study examined the psychometric properties of the posttraumatic stress disorder (PTSD) Checklist for Diagnostic and Statistical Manual of Mental Disorders-Fifth Edition (PCL-5; Weathers, Litz, et al., 2013b) in 2 independent samples of veterans receiving care at a Veterans Affairs Medical Center (N = 468). A subsample of these participants (n = 140) was used to define a valid diagnostic cutoff score for the instrument using the Clinician-Administered PTSD Scale for DSM-5 (CAPS-5; Weathers, Blake, et al., 2013) as the reference standard. The PCL-5 test scores demonstrated good internal consistency (α = .96), test-retest reliability (r = .84), and convergent and discriminant validity. Consistent with previous studies (Armour et al., 2015; Liu et al., 2014), confirmatory factor analysis revealed that the data were best explained by a 6-factor anhedonia model and a 7-factor hybrid model. Signal detection analyses using the CAPS-5 revealed that PCL-5 scores of 31 to 33 were optimally efficient for diagnosing PTSD (κ(.5) = .58). Overall, the findings suggest that the PCL-5 is a psychometrically sound instrument that can be used effectively with veterans. Further, by determining a valid cutoff score using the CAPS-5, the PCL-5 can now be used to identify veterans with probable PTSD. However, findings also suggest the need for research to evaluate cluster structure of DSM-5. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

Psychometrics of the neonatal oral motor assessment scale.

Science.gov (United States)

Zarem, Cori; Kidokoro, Hiroyuki; Neil, Jeffrey; Wallendorf, Michael; Inder, Terrie; Pineda, Roberta

2013-12-01

To establish the psychometrics of the Neonatal Oral Motor Assessment Scale (NOMAS). In this prospective cohort study of 75 preterm infants (39 females, 36 males) born at or before 30 weeks gestation (mean gestational age 26.56 wks, SD 1.90, range 23-30 wks; mean birthweight 967.33 g, SD 288.54, range 480-2240), oral feeding was videotaped before discharge from the neonatal intensive care unit (NICU). The NOMAS was used to classify feeding as normal, disorganized, or dysfunctional. Neurobehavior was assessed at term equivalent, and infants underwent magnetic resonance imaging. Children returned for developmental testing at 2 years corrected age. Associations between NOMAS scores and (1) neurobehavior; (2) cerebral injury and metrics; and (3) developmental outcome were investigated using χ² analyses, t-tests, and linear regression. For reliability, six certified NOMAS evaluators rated five randomly selected NOMAS recordings and re-scored them 2 weeks later in a second randomized order. Reliability was calculated with Cohen's kappa statistics. Dysfunctional NOMAS scores were associated with lower Dubowitz scores [t=-2.14; mean difference -2.32 (95% confidence interval [CI] -0.157 to -4.49); p=0.036], higher stress on the NICU Network Neurobehavioral Scale (t=2.61; mean difference 0.073 [95% CI 0.017-0.129]; p=0.0110), and decreased transcerebellar diameter (t=-2.22; mean difference -2.04 [CI=-3.89 to -0.203]; p=0.03). No significant associations were found between NOMAS scores and 2-year outcome. Some concurrent validity was established with associations between NOMAS scores and measures of infant behavior and cerebral structure. The NOMAS did not show predictive validity in this study of preterm infants at high risk of developmental delay. Reliability was variable and suboptimal. © 2013 Mac Keith Press.
Psychometric Properties of the Korean Version of the Overactive Bladder Questionnaire (OAB-q) in a Korean Population

Directory of Open Access Journals (Sweden)

Seung-June Oh

2012-06-01

Purpose: Psychometric properties of the overactive bladder questionnaire (OAB-q) were recently examined. However, since the cross-cultural adaptation of a non-English version of the OAB-q has never been demonstrated, we evaluated the psychometric properties of a Korean version of the OAB-q in a Korean population with OAB. Methods: A prospective cohort study involving 116 women with 58 OAB and 58 control subjects was performed and convergent validity was assessed. Total and subscale OAB-q scores of the control and OAB groups were compared to their sensitivity to score changes before and after administering anti-cholinergic medication for 12 weeks. Short form 36 and King's health questionnaire (KHQ) were also used for comparison or correlation. Results: Assessment of face validity showed that the Korean version of the OAB-q was reasonable, with OAB-q subscale scores being significantly different between the control and patient groups. Significant correlation (range, -0.29 to -0.81) was found between the OAB-q scores and KHQ results for the OAB patients. Cronbach's alpha coefficients (range, 0.77 to 0.95) indicated excellent internal consistency, and test-retest analysis involving 35 OAB patients showed that each question as well as the subscale scores were reproducible. Each score of the OAB-q also showed statistically significant sensitivity to changes following anti-muscarinic treatment for OAB (n=27, P<0.001 except for social, P=0.059). Conclusions: The Korean version of the OAB-q is a valid and reliable instrument to measure outcomes in Korean patients with OAB.
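Several records in this listing report Cronbach's alpha as evidence of internal consistency. As a minimal sketch of the usual formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), the code below uses only the Python standard library. The function name and the toy responses are hypothetical and do not come from any study listed here.

```python
from statistics import variance


def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: one inner list per respondent, each holding k item scores.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])
    # Transpose so each tuple holds all respondents' scores on one item.
    items = list(zip(*scores))
    sum_item_variances = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum_item_variances / total_variance)


# Toy data: 5 respondents answering 3 items (made up for illustration).
toy_scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 4, 5],
    [3, 3, 4],
]
alpha = cronbach_alpha(toy_scores)
print(round(alpha, 3))
```

Alpha rises with the number of items as well as with their intercorrelation, so values are best compared between scales of similar length.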
The psychometric properties of an Iranian translation of the Work Ability Index (WAI) questionnaire.

Science.gov (United States)

Abdolalizadeh, M; Arastoo, A A; Ghsemzadeh, R; Montazeri, A; Ahmadi, K; Azizi, A

2012-09-01

This study was carried out to evaluate the psychometric properties of an Iranian translation of the Work Ability Index (WAI) questionnaire. In this methodological study, nurses and healthcare workers aged 40 years and older who worked in educational hospitals in Ahvaz (236 workers) in 2010 completed the questionnaire, and 60 of the workers filled out the WAI questionnaire a second time to assess test-retest reliability. The forward-backward method was applied to translate the questionnaire from English into Persian. The psychometric properties of the Iranian translation of the WAI were assessed using the following tests: internal consistency (to test reliability), test-retest analysis, exploratory factor analysis (construct validity), discriminant validity by comparing the mean WAI score in two groups of employees that had different levels of sick leave, and criterion validity by determining the correlation between the Persian version of the short form health survey (SF-36) and the WAI score. Cronbach's alpha coefficient was estimated to be 0.79, and it was concluded that the internal consistency was high enough. The intraclass correlation coefficient was 0.92. Factor analysis indicated three factors in the structure of work ability: self-perceived work ability (24.5% of the variance), mental resources (22.23% of the variance), and presence of disease and health-related limitation (18.55% of the variance). Statistical tests showed that this questionnaire was capable of discriminating two groups of employees who had different levels of sick leave. Criterion validity analysis showed that this instrument and all dimensions of the Iranian version of the SF-36 were correlated significantly. Item correlation corrected for overlap showed that the items had a good correlation except for one. The findings of the study showed that the Iranian version of the WAI is a reliable and valid measure of work ability and can be used both in research and practical
French validation of the internet addiction test.

Science.gov (United States)

Khazaal, Yasser; Billieux, Joël; Thorens, Gabriel; Khan, Riaz; Louati, Youssr; Scarlatti, Elisa; Theintz, Florence; Lederrey, Jerome; Van Der Linden, Martial; Zullino, Daniele

2008-12-01

The main goal of the present study is to investigate the psychometric properties of a French version of the Internet Addiction Test (IAT) and to assess its relationship with both time spent on Internet and online gaming. The French version of the Young's Internet Addiction Test (IAT) was administered to a sample of 246 adults. Exploratory and confirmatory analyses were carried out. We discovered that a one-factor model of the IAT has good psychometric properties and fits the data well, which is not the case of a six-factor model as found in previous studies using exploratory methods. Correlation analysis revealed positive significant relationships between IAT scores and both the daily duration of Internet use and the fact of being an online player. In addition, younger people scored higher on the IAT. The one-factor model found in this study has to be replicated in other IAT language versions.
The spider and the snake - A psychometric study of two phobias and insights from the Hungarian validation.

Science.gov (United States)

Zsido, Andras N

2017-11-01

Specific phobias, particularly zoophobias, are prevalent worldwide and can have fairly dramatic health consequences. Self-report measurements play a crucial role in phobia research studies; thus, it is important to have a reliable tool in different languages. The present investigation examined the psychometric properties of the Hungarian version of two commonly used measures of fear: the Spider Phobia Questionnaire (i.e. SPQ) and the Snake Questionnaire (i.e. SNAQ). The SPQ and SNAQ scores both demonstrated excellent reliability, including a test-retest over a 4-week period. Supportive evidence for the validity of the SPQ and SNAQ scores was found using questions assessing fainting and avoidance history, regarding snakes and spiders, based on DSM-V criteria. Both questionnaires could discriminate between participants who reported such an event and those who did not. Further analyses also revealed a sex difference, with women scoring higher than men on both scales. Moreover, 9.5% and 4.24% of the respondents reached the cut-off point, set by previous studies, for spider and snake phobias, respectively. These findings suggest that the SPQ and SNAQ have excellent psychometric properties, making them suitable for use in further cross-cultural research and epidemiological studies. Copyright © 2017 Elsevier B.V. All rights reserved.
Psychometric properties of the Hebrew translation of the patient activation measure (PAM-13).

Science.gov (United States)

Magnezi, Racheli; Glasser, Saralee

2014-01-01

"Patient activation" reflects involvement in managing one's health. This cross-sectional study assessed the psychometric properties of the Hebrew translation (PAM-H) of the PAM-13. A nationally representative sample of 203 Hebrew-speaking Israeli adults answered the PAM-H, PHQ-9 depression scale, SF-12, and Self-efficacy Scale via telephone. Mean PAM-H scores were 70.7±15.4. Rasch analysis indicated that the PAM-H is a good measure of activation. There were no differences in PAM-H scores based on gender, age or education. Subjects with chronic disease scored lower than those without. Scores correlated with the Self-efficacy Scale (0.47), Total SF-12 (0.39) and PHQ-9 (-0.35, PPAM-H score of those who scored below 10 (72.1±14.8) on the PHQ-9 (not depressed) compared to those scoring ≥10 (i.e. probable depression) (59.2±15.8; t 3.75; P = 0.001). The PAM-H psychometric properties indicate its usefulness with the Hebrew-speaking Israeli population. PAM-H can be useful for assessing programs aimed at effecting changes in patient compliance, health behaviors, etc. Researchers in Israel should use a single translation of the PAM-13 so that findings can be compared, increasing understanding of patient activation.
Investigation of Psychometric Properties of the Test for Creative Thinking-Drawing Production: Evidence from Study in Latvia

Science.gov (United States)

Kalis, Emils; Roke, Liga; Krumina, Indra

2016-01-01

The Test for Creative Thinking-Drawing Production (TCT-DP) is designed as an effective drawing-based instrument for measuring creative potential. Many studies report adaptation efforts in other cultures, pointing out good psychometric properties of the instrument while also revealing some trouble spots. The present study includes adaptation…
Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests

Science.gov (United States)

Sinharay, Sandip

2018-01-01

Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…
The development and psychometric validation of the Ethical Awareness Scale.

Science.gov (United States)

Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela

2018-04-19

To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February and April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.
Reconsidering the psychometrics of quality of life assessment in light of response shift and appraisal

Directory of Open Access Journals (Sweden)

Schwartz Carolyn E

2004-03-01

The increasing evidence for response shift phenomena in quality of life (QOL) assessment points to the necessity to reconsider both the measurement model and the application of psychometric analyses. The proposed psychometric model posits that the QOL true score is always contingent upon parameters of the appraisal process. This new model calls into question existing methods for establishing the reliability and validity of QOL assessment tools and suggests several new approaches for describing the psychometric properties of these scales. Recommendations for integrating the assessment of appraisal into QOL research and clinical practice are discussed.
Reconsidering the psychometrics of quality of life assessment in light of response shift and appraisal

Science.gov (United States)

Schwartz, Carolyn E; Rapkin, Bruce D

2004-01-01

The increasing evidence for response shift phenomena in quality of life (QOL) assessment points to the necessity to reconsider both the measurement model and the application of psychometric analyses. The proposed psychometric model posits that the QOL true score is always contingent upon parameters of the appraisal process. This new model calls into question existing methods for establishing the reliability and validity of QOL assessment tools and suggests several new approaches for describing the psychometric properties of these scales. Recommendations for integrating the assessment of appraisal into QOL research and clinical practice are discussed. PMID:15038830
Psychometric and Clinimetric Properties of the Melbourne Assessment 2 in Children With Cerebral Palsy.

Science.gov (United States)

Wang, Tien-Ni; Liang, Kai-Jie; Liu, Yi-Chia; Shieh, Jeng-Yi; Chen, Hao-Ling

2017-09-01

To examine the psychometric and clinimetric properties of the Melbourne Assessment 2 (MA2), an outcome measurement that is increasingly used in clinical studies. Psychometric and clinimetric study. Community. Seventeen children with cerebral palsy (CP) from 5 to 12 years were recruited for the estimation of the test-retest reliability and minimal detectable change (MDC). Thirty-five children with CP were recruited to receive an 8-week intensive neurorehabilitation intervention to estimate the validity, responsiveness, and minimal clinically important difference (MCID). Thirty-five children with CP received upper limb neurorehabilitation programs for 8 weeks. The MA2 and the criterion measures, including the Bruininks-Oseretsky Test of Motor Proficiency, 2nd edition (BOT-2), the Box and Blocks Test (BBT), and the Pediatric Motor Activity Log-Revised (PMAL-R), were evaluated at pretreatment and posttreatment. The MA2 has 4 subscales: range of motion, fluency, accuracy, and dexterity. The test-retest reliability of the MA2 is high (intraclass correlation coefficient, .92-.98). The significant relationships between the MA2 and BBT, BOT-2, and PMAL-R support its validity. The significance of paired t test results (PMA2. The MDC values of the 4 subscales of the MA2 are 2.85, 1.63, 1.97, and 1.84, respectively, and the suggested MCID values of these 4 subscales are 2.35, 3.20, 2.09, and 2.22, respectively, indicating the minimum scores of improvement to be interpreted as both statistically significant and clinically important. The study findings indicate that the MA2 has sound psychometric and clinimetric properties and is thus an adequate measurement for research and clinical applications. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
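The abstract reports minimal detectable change (MDC) values but does not state how they were computed. One widely used convention derives MDC from test-retest reliability: SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM. The sketch below assumes that convention; the function names and input values are hypothetical and are not the MA2 study data.

```python
import math


def standard_error_of_measurement(sd: float, icc: float) -> float:
    """SEM from the between-subject SD and a test-retest ICC."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sd: float, icc: float, z: float = 1.96) -> float:
    """MDC at the confidence level implied by z (1.96 gives MDC95).

    The sqrt(2) factor accounts for measurement error in both the
    first and the second assessment.
    """
    return z * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical inputs: SD of 5 score points and an ICC of 0.92.
mdc95 = minimal_detectable_change(sd=5.0, icc=0.92)
print(round(mdc95, 2))
```

Under this convention, a change score smaller than the MDC cannot be distinguished from measurement error, which is why it is typically read alongside the MCID rather than in place of it.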
Non invasive blood flow measurement in cerebellum detects minimal hepatic encephalopathy earlier than psychometric tests.

Science.gov (United States)

Felipo, Vicente; Urios, Amparo; Giménez-Garzó, Carla; Cauli, Omar; Andrés-Costa, Maria-Jesús; González, Olga; Serra, Miguel A; Sánchez-González, Javier; Aliaga, Roberto; Giner-Durán, Remedios; Belloch, Vicente; Montoliu, Carmina

2014-09-07

To assess whether non invasive blood flow measurement by arterial spin labeling in several brain regions detects minimal hepatic encephalopathy. Blood flow (BF) was analyzed by arterial spin labeling (ASL) in different brain areas of 14 controls, 24 cirrhotic patients without and 16 cirrhotic patients with minimal hepatic encephalopathy (MHE). Images were collected using a 3 Tesla MR scanner (Achieva 3T-TX, Philips, Netherlands). Pulsed ASL was performed. Patients showing MHE were detected using the Psychometric Hepatic Encephalopathy Score (PHES) battery, consisting of five tests. Different cognitive and motor functions were also assessed: alterations in selective attention were evaluated using the Stroop test. Patients and controls also performed visuo-motor and bimanual coordination tests. Several biochemical parameters were measured: serum pro-inflammatory interleukins (IL-6 and IL-18), 3-nitrotyrosine, cGMP and nitrates+nitrites in plasma, and blood ammonia. Bivariate correlations were evaluated. In patients with MHE, BF was increased in cerebellar hemisphere (P = 0.03) and vermis (P = 0.012) and reduced in occipital lobe (P = 0.017). BF in cerebellar hemisphere was also increased in patients without MHE (P = 0.02). Bimanual coordination was impaired in patients without MHE (P = 0.05) and much more in patients with MHE (P battery and with CFF. BF in cerebellar hemisphere correlates with plasma cGMP and nitric oxide (NO) metabolites. BF in cerebellar vermis also correlates with NO metabolites and with 3-nitrotyrosine. IL-18 in plasma correlates with BF in thalamus and occipital lobe. Non invasive BF determination in cerebellum using ASL may detect MHE earlier than the PHES. An altered NO-cGMP pathway seems to be associated with altered BF in cerebellum.
Psychometric properties of the Knee injury and Osteoarthritis Outcome Score for Children (KOOS-Child) in children with knee disorders

DEFF Research Database (Denmark)

Ortqvist, Maria; Iversen, Maura D; Janarv, Per-Mats

2014-01-01

-Child was developed. This study aims to evaluate psychometric properties of the final KOOS-Child when used in children with knee disorders. METHODS: 115 children (boys/girls 51/64, 7-16 years) with knee disorders were recruited. All children (n=115) completed the KOOS-Child, the Child-Health Assessment Questionnaire...... better. CONCLUSIONS: The final KOOS-Child demonstrates good psychometric properties and supports the use of the KOOS-Child when evaluating children with knee disorders....
PedsQL™ 4.0 Generic Core Scales for adolescents in the Yoruba language: translation and general psychometric properties.

Science.gov (United States)

Atilola, Olayinka; Stevanović, Dejan

2014-04-01

Quality of life (QOL) is a universally accepted concept for measuring the impact of different aspects of life on general well-being. Adaptation of existing QOL instruments to local cultures has been identified as a better strategy than development of new ones. To translate and adapt the Paediatric Quality of Life Inventory™ Version 4.0 Generic Core Scales (PedsQL™) to the Yoruba language and culture and to test the psychometric properties of the adapted instrument among adolescents. Psychometric properties including internal consistency reliability, construct and factorial validity of the Yoruba version of PedsQL™ were evaluated using standard procedures. The self report and proxy scales of the Yoruba PedsQL™ were developed with good cultural relevance and semantic/conceptual equivalence. Results from 527 adolescents revealed a Cronbach's coefficient which exceeded 0.7 for internal consistency reliability for all scores. The healthy subjects reported higher PedsQL™ scores than those with mental health and physical problems, which confirmed construct validity. Confirmatory factor analysis revealed a good model fit for the Psychosocial Health score, but not for the other measures. The Yoruba PedsQL™ is culturally appropriate and with good internal consistency, reliability and construct validity. More work is needed regarding its factorial validity.
A Psychometric Evaluation of the Danish Version of the Theory of Mind Storybook for 8-14 Year-Old Children

DEFF Research Database (Denmark)

Clemmensen, Lars; Bartels-Velthuis, Agna A.; Jespersen, Rókur av F.

2016-01-01

BACKGROUND: Theory-of-Mind (ToM) keeps on developing in late childhood and early adolescence, and the study of ToM development later in childhood had to await the development of sufficiently sensitive tests challenging more mature children. The current study aimed to investigate the psychometric......M-Frederik and the Social Emotional Evaluation (SEE) Total score were analyzed. RESULTS: A significantly higher ToM-Frederik score was observed in the TD group compared to the HFASD group. Furthermore, the convergent validity of ToM-Frederik as a measure of ToM was supported by significant and positive associations...
Development and preliminary psychometric properties of the multidimensional neglectful behavior scale-child report.

Science.gov (United States)

Kantor, Glenda Kaufman; Holt, Melissa K; Mebert, Carolyn J; Straus, Murray A; Drach, Kerry M; Ricci, Lawrence R; MacAllum, Crystal A; Brown, Wendy

2004-11-01

This article describes the development and psychometric properties of the Multidimensional Neglectful Behavior Scale-Child Report (MNBS-CR). The measure is broadly conceptualized to tap child neglect across four core domains: cognitive, emotional, physical and supervisory neglect, and it assesses exposure to violence, alcohol-related neglect, abandonment, and children's appraisals of parenting. Features include pictorial items, audio computer-assisted testing, and programming by age and gender of the child and caregiver. A clinical sample of 144 children, age 6 to 15 years, and a comparison sample of 87 children were tested. Results showed that the MNBS-CR has high reliability, with higher reliability found for older children (alpha = .94) than for younger children (alpha = .66). Among older children, the MNBS-CR Supervisory scale was significantly associated with the Child Behavior Check List (CBCL), and total MNBS-CR scores were significantly associated with clinician reports of behavioral disorders. Younger and older neglected children scored significantly higher on the MNBS-CR than community children.
Development and psychometric evaluation of the Thirst Distress Scale for patients with heart failure.

Science.gov (United States)

Waldréus, Nana; Jaarsma, Tiny; van der Wal, Martje Hl; Kato, Naoko P

2018-03-01

Patients with heart failure can experience thirst distress. However, there is no instrument to measure this in patients with heart failure. The aim of the present study was to develop the Thirst Distress Scale for patients with Heart Failure (TDS-HF) and to evaluate psychometric properties of the scale. The TDS-HF was developed to measure thirst distress in patients with heart failure. Face and content validity was confirmed using expert panels including patients and healthcare professionals. Data on the TDS-HF was collected from patients with heart failure at outpatient heart failure clinics and hospitals in Sweden, the Netherlands and Japan. Psychometric properties were evaluated using data from 256 heart failure patients (age 72±11 years). Concurrent validity of the scale was assessed using a thirst intensity visual analogue scale. Patients did not have any difficulties answering the questions, and time taken to answer the questions was about five minutes. Factor analysis of the scale showed one factor. After psychometric testing, one item was deleted. For the eight item TDS-HF, a single factor explained 61% of the variance and Cronbach's alpha was 0.90. The eight item TDS-HF was significantly associated with the thirst intensity score ( r=0.55, pfailure.
Cross-Cultural Adaptation and Psychometric Testing of the Brazilian Version of the Self-Care of Heart Failure Index Version 6.2

Science.gov (United States)

Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane

2013-01-01

Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765
Turkish adaptation and psychometric characteristics of the Nursing Authority and Autonomy Scale.

Science.gov (United States)

Basaran Acil, Seher; Dinç, Leyla

2018-04-14

To adapt the Nursing Authority and Autonomy Scale (NAAS) to Turkish and assess its psychometric properties for Turkish nurses and nurse managers. The NAAS is a tool that specifically measures nursing authority and autonomy from the perspectives of nurses and nurse managers. The study sample consisted of 160 nurse managers and 266 staff nurses. Content validity was assessed using expert approval. Construct validity was assessed using confirmatory factor analysis. Internal consistency was assessed using Cronbach's α, and the test-retest reliability was assessed using Pearson's correlation coefficients. The model achieved a good fit. The internal reliability of the NAAS' authority and autonomy in nursing practice and importance of nursing practice subscales were .84. The Cronbach's α of the instrument was .88. The test-retest scores within an interval of 3 weeks were statistically not significant. The Turkish version of the NAAS has good psychometric properties and this scale can be employed to measure nurses' authority and autonomy. Nurse managers and educators should use an appropriate scale such as NAAS in order to assess nurses' clinical authority and autonomy to improve patient outcomes and develop nurses. © 2018 John Wiley & Sons Ltd.

The Clinician-Administered PTSD Scale for DSM-5 (CAPS-5): Development and initial psychometric evaluation in military veterans.

Science.gov (United States)

Weathers, Frank W; Bovin, Michelle J; Lee, Daniel J; Sloan, Denise M; Schnurr, Paula P; Kaloupek, Danny G; Keane, Terence M; Marx, Brian P

2018-03-01

The Clinician-Administered PTSD Scale (CAPS) is an extensively validated and widely used structured diagnostic interview for posttraumatic stress disorder (PTSD). The CAPS was recently revised to correspond with PTSD criteria in the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5; American Psychiatric Association, 2013). This article describes the development of the CAPS for DSM-5 (CAPS-5) and presents the results of an initial psychometric evaluation of CAPS-5 scores in 2 samples of military veterans (Ns = 165 and 207). CAPS-5 diagnosis demonstrated strong interrater reliability (κ = .78 to 1.00, depending on the scoring rule) and test-retest reliability (κ = .83), as well as strong correspondence with a diagnosis based on the CAPS for DSM-IV (CAPS-IV; κ = .84 when optimally calibrated). CAPS-5 total severity score demonstrated high internal consistency (α = .88) and interrater reliability (ICC = .91) and good test-retest reliability (ICC = .78). It also demonstrated good convergent validity with total severity score on the CAPS-IV (r = .83) and PTSD Checklist for DSM-5 (r = .66) and good discriminant validity with measures of anxiety, depression, somatization, functional impairment, psychopathy, and alcohol abuse (rs = .02 to .54). Overall, these results indicate that the CAPS-5 is a psychometrically sound measure of DSM-5 PTSD diagnosis and symptom severity. Importantly, the CAPS-5 strongly corresponds with the CAPS-IV, which suggests that backward compatibility with the CAPS-IV was maintained and that the CAPS-5 provides continuity in evidence-based assessment of PTSD in the transition from DSM-IV to DSM-5 criteria. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
The Motivated Strategies for Learning Questionnaire: score validity among medicine residents.

Science.gov (United States)

Cook, David A; Thompson, Warren G; Thomas, Kris G

2011-12-01

The Motivated Strategies for Learning Questionnaire (MSLQ) purports to measure motivation using the expectancy-value model. Although it is widely used in other fields, this instrument has received little study in health professions education. The purpose of this study was to evaluate the validity of MSLQ scores. We conducted a validity study evaluating the relationships of MSLQ scores to other variables and their internal structure (reliability and factor analysis). Participants included 210 internal medicine and family medicine residents participating in a web-based course on ambulatory medicine at an academic medical centre. Measurements included pre-course MSLQ scores, pre- and post-module motivation surveys, post-module knowledge test and post-module Instructional Materials Motivation Survey (IMMS) scores. Internal consistency was universally high for all MSLQ items together (Cronbach's α = 0.93) and for each domain (α ≥ 0.67). Total MSLQ scores showed statistically significant positive associations with post-test knowledge scores. For example, a 1-point rise in total MSLQ score was associated with a 4.4% increase in post-test scores (β = 4.4; p motivation and satisfaction. Scores on MSLQ domains demonstrated associations that generally aligned with our hypotheses. Self-efficacy and control of learning belief scores demonstrated the strongest domain-specific relationships with knowledge scores (β = 2.9 for both). Confirmatory factor analysis showed a borderline model fit. Follow-up exploratory factor analysis revealed the scores of five factors (self-efficacy, intrinsic interest, test anxiety, extrinsic goals, attribution) demonstrated psychometric and predictive properties similar to those of the original scales. Scores on the MSLQ are reliable and predict meaningful outcomes. However, the factor structure suggests a simplified model might better fit the empiric data. Future research might consider how assessing and responding to motivation could enhance
Psychometric properties of the Social Problem Solving Inventory-Revised Short-Form in a South African population.

Science.gov (United States)

Sorsdahl, Katherine; Stein, Dan J; Myers, Bronwyn

2017-04-01

The Social Problem Solving Inventory-Revised Short-Form (SPSI-R:SF) has been used in several countries to identify problem-solving deficits among clinical and general populations in order to guide cognitive-behavioural interventions. Yet, very few studies have evaluated its psychometric properties. Three language versions of the questionnaire were administered to a general population sample comprising 1000 participants (771 English-, 178 Afrikaans- and 101 Xhosa-speakers). Of these participants, 210 were randomly selected to establish test-retest reliability (70 in each language). Principal component analysis was performed to examine the applicability of the factor structure of the original questionnaire to the South African data. Supplementary psychometric analyses were performed, including internal consistency and test-retest reliability. Collectively, results provide initial evidence of the reliability and validity of the SPSI-R:SF for the assessment of problem solving deficits in South Africa. Further studies that explore how the Afrikaans language version of the SPSI-R:SF can be improved and that establish the predictive validity of scores on the SPSI-R:SF are needed. © 2015 International Union of Psychological Science.
Psychometric properties of the PROMIS Physical Function item bank in patients receiving physical therapy.

Directory of Open Access Journals (Sweden)

Martine H P Crins

The Patient-Reported Outcomes Measurement Information System (PROMIS) is a universally applicable set of instruments, including item banks, short forms and computer adaptive tests (CATs), measuring patient-reported health across different patient populations. PROMIS CATs are highly efficient and their use in practice is considered feasible with little administration time, offering standardized and routine patient monitoring. Before an item bank can be used as CAT, the psychometric properties of the item bank have to be examined. Therefore, the objective was to assess the psychometric properties of the Dutch-Flemish PROMIS Physical Function item bank (DF-PROMIS-PF) in Dutch patients receiving physical therapy. Cross-sectional study. 805 patients >18 years, who received any kind of physical therapy in primary care in the past year, completed the full DF-PROMIS-PF (121 items). Unidimensionality was examined by Confirmatory Factor Analysis and local dependence and monotonicity were evaluated. A Graded Response Model was fitted. Construct validity was examined with correlations between DF-PROMIS-PF T-scores and scores on two legacy instruments (SF-36 Health Survey Physical Functioning scale [SF36-PF10] and the Health Assessment Questionnaire Disability-Index [HAQ-DI]). Reliability (standard errors of theta) was assessed. The results for unidimensionality were mixed (scaled CFI = 0.924, TLI = 0.923, RMSEA = 0.045, first factor explained 61.5% of variance). Some local dependence was found (8.2% of item pairs). The item bank showed a broad coverage of the physical function construct (threshold-parameters range: -4.28 to 2.33) and good construct validity (correlation with SF36-PF10 = 0.84 and HAQ-DI = -0.85). Furthermore, the DF-PROMIS-PF showed greater reliability over a broader score-range than the SF36-PF10 and HAQ-DI. The psychometric properties of the DF-PROMIS-PF item bank are sufficient. The DF-PROMIS-PF can now be used as short forms or CAT to measure the level of
79th Annual Meeting of the Psychometric Society

CERN Document Server

Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Chow, Sy-Miin

2015-01-01

These research articles from the 79th Annual Meeting of the Psychometric Society (IMPS) cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 79th annual meeting took place in Madison, WI between July 21st and 25th, 2014. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013) and Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015).
80th Annual Meeting of the Psychometric Society

CERN Document Server

Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Wiberg, Marie

2016-01-01

The research articles in this volume cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 80th annual meeting took place in Beijing, China, between the 12th and 16th of July, 2015. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013), Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015), and Quantitative Psychology Research: The 79th Annual Meeting of the Psychometric Society, Wisconsin, USA, 2014 (Springer, 2015).
Sexual Self-Schema Scale for Women-Validation and Psychometric Properties of the Polish Version.

Science.gov (United States)

Nowosielski, Krzysztof; Jankowski, Konrad S; Kowalczyk, Robert; Kurpisz, Jacek; Normantowicz-Zakrzewska, Małgorzata; Krasowska, Aleksandra

2018-06-01

The sexual self-schema is a part of a broader concept of the self that is believed to be crucial for intrapersonal and interpersonal sexual relationships. To develop and perform psychometric validation of the Polish version of the Sexual Self-Schema Scale for Women (SSSS-W-PL). 561 women 18 to 55 years old were included in the final analysis. Linguistic validation was performed in 4 steps in line with the MAPI Institute guidelines. Convergent validity was calculated using the Pearson r product-moment coefficient between different measures of sexuality (attitudes and experience, behavior, arousal, romantic relationship) and SSSS-W-PL total and factor scores. To test discriminant validity, we applied hierarchical regression analyses predicting the number of lifetime sexual partners, self-rating as a sexual person (1 item, "I feel sexually attractive"; on a 5-point Likert scale), and arousability, with independent variables being extraversion (Ten-Item Personality Inventory), self-esteem (Rosenberg Self-Esteem Scale), and the SSSS-W-PL (total and factor scores). Sexual self-schema was measured by the SSSS-W-PL, whereas arousability was measured by the arousal/excitement scale of the Changes in Sexual Functioning Questionnaire. The mean age of the study population was 29.0 ± 7.6 years. The final scale consisted of 24 adjectives grouped within 4 factors: romantic, passionate, direct, and embarrassed. The 4-factor model accounted for 39% of the variance. The Cronbach α was 0.74 for the SSSS-W-PL total score and 0.61 to 0.84 for individual factors. Test-retest reliability of the scale after 2- to 8-week intervals was 0.87 (95% CI = 0.82-0.86, P Self-Schema Scale for Women-Validation and Psychometric Properties of the Polish Version. Sex Med 2018;6:131-142. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Psychometric Properties of the Croatian Language Version of the Dental Environment Stress Questionnaire on Dental Medicine Students.

Science.gov (United States)

Laktić, Martina; Kuftinec, Krešimir; Čelebić, Asja; Kovačić, Ines; Alhajj, Mohamed Nasser; Kiršić, Sanja Peršić

2017-09-01

To develop the Croatian version of the 41-item Dental Environment Stress questionnaire (DES) for stress assessment of dental students in both preclinical and clinical years of study and to test its psychometric properties in the Croatian dental student population. The English version of the 41-item DES questionnaire was first translated into the Croatian language. Subsequently, it was posted on Google Drive and filled out by a total of 202 students from the School of Dental Medicine, University of Zagreb and 30 additional students from other faculties. Students also assessed their overall level of stress on a Likert scale (1=no stress, 5=highest level of stress). Internal consistency was tested on 202 dental students; test-retest reliability on 30 dental students who filled out the same questionnaire twice; convergent validity on 202 dental students; and divergent validity on 202 dental students and 30 students from faculties not belonging to the biomedicine group. Internal consistency showed a high Cronbach alpha coefficient (0.9) and test-retest reliability showed no significant difference (P>0.05) within a period of 14 days when stress level had not changed (vacation). Convergent validity was confirmed by the significant association between the DES summary scores and the self-perceived level of stress (Spearman's rho=0.881; P<0.001). Divergent validity was confirmed by significantly lower DES summary scores in students not belonging to the biomedicine group (t=7.5, P<0.001). Excellent psychometric properties of the Croatian version of the DES questionnaire enable its utilization for assessment of stress level in Croatian dental students.
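Internal consistency in records like the one above is reported as Cronbach's alpha. As a minimal sketch of the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores), the snippet below computes it from a small response matrix. The function name and the response data are hypothetical; the abstract does not describe how the authors computed their coefficient.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha; `responses` holds one row of item scores per respondent."""
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    n_items = len(responses[0])
    if n_items < 2 or any(len(row) != n_items for row in responses):
        raise ValueError("every respondent must answer the same two or more items")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*responses)]
    # Sample variance of the summed scale score across respondents.
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("alpha is undefined when all total scores are identical")
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers from five respondents to a four-item scale (1-5 Likert).
answers = [
    [4, 5, 4, 5],
    [2, 2, 3, 2],
    [3, 3, 3, 4],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
]
print(round(cronbach_alpha(answers), 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as evidence that the items measure a common construct.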
Initial psychometric evaluation of the Moral Injury Questionnaire--Military version.

Science.gov (United States)

Currier, Joseph M; Holland, Jason M; Drescher, Kent; Foy, David

2015-01-01

Moral injury is an emerging construct related to negative consequences associated with war-zone stressors that transgress military veterans' deeply held values/beliefs. Given the newness of the construct, there is a need for instrumentation that might assess morally injurious experiences (MIEs) in this population. Drawing on a community sample of 131 Iraq and/or Afghanistan Veterans and clinical sample of 82 returning Veterans, we conducted an initial psychometric evaluation of the newly developed Moral Injury Questionnaire-Military version (MIQ-M)-a 20-item self-report measure for assessing MIEs. Possibly due to low rates of reporting, an item assessing sexual trauma did not yield favourable psychometric properties and was excluded from analyses. Veterans in the clinical sample endorsed significantly higher scores across MIQ-M items. Factor analytic results for the final 19 items supported a unidimensional structure, and convergent validity analyses revealed that higher scores (indicative of more MIEs) were correlated with greater general combat exposure, impairments in work/social functioning, posttraumatic stress and depression in the community sample. In addition, when controlling for demographics, deployment-related factors and exposure to life threat stressors associated with combat, tests of incremental validity indicated that MIQ-M scores were also uniquely linked with suicide risk and other mental health outcomes. These findings provide preliminary evidence for the validity of the MIQ-M and support the applicability of this measure for further research and clinical work with Veterans. Military service can confront service members with experiences that undermine their core sense of humanity and violate global values and beliefs. These types of experiences increase the risk for posttraumatic maladjustment in this population, even when accounting for rates of exposure to life threat traumas. Moral injury is an emerging construct to more fully capture the many
My Vocational Situation (MVS): Case Example and Psychometric Review.

Science.gov (United States)

Nitsch, Kristian P; Pedersen, Jessica; Miliotto, Alexandra; Petersen, Brett; Robbins, Samantha; Garcia, Ana; Hoisington, Molly Ansel; The, Kimberly J; Smiley, Jill; Janikowski, Timothy

This case report provides an overview of the psychometric properties and clinical utility of the My Vocational Situation (MVS) instrument. The accompanying hypothetical case description illustrates how clinicians could use the MVS to evaluate vocational preferences and outcomes and how the MVS can be used to inform treatment planning and rehabilitation decision making. The information contained in this report is intended to familiarize clinicians with the administration and scoring of the MVS, the psychometric information necessary to interpret results obtained from the MVS, and how the results could be used to provide comprehensive, patient-centered care. It is important to note that the information provided represents only a sample of the available research literature on the MVS. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Psychometric Evaluation of the Brachial Assessment Tool Part 1: Reproducibility.

Science.gov (United States)

Hill, Bridget; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea

2018-04-01

To evaluate reproducibility (reliability and agreement) of the Brachial Assessment Tool (BrAT), a new patient-reported outcome measure for adults with traumatic brachial plexus injury (BPI). Prospective repeated-measure design. Outpatient clinics. Adults with confirmed traumatic BPI (N=43; age range, 19-82y). People with BPI completed the 31-item 4-response BrAT twice, 2 weeks apart. Results for the 3 subscales and summed score were compared at time 1 and time 2 to determine reliability, including systematic differences using paired t tests, test-retest using intraclass correlation coefficient model 1,1 (ICC 1,1), and internal consistency using Cronbach α. Agreement parameters included standard error of measurement, minimal detectable change, and limits of agreement. BrAT. Test-retest reliability was excellent (ICC 1,1 = .90-.97). Internal consistency was high (Cronbach α = .90-.98). Measurement error was relatively low (standard error of measurement range, 3.1-8.8). A change of >4 for subscale 1, >6 for subscale 2, >4 for subscale 3, and >10 for the summed score is indicative of change over and above measurement error. Limits of agreement ranged from ±4.4 (subscale 3) to 11.61 (summed score). These findings support the use of the BrAT as a reproducible patient-reported outcome measure for adults with traumatic BPI with evidence of appropriate reliability and agreement for both individual and group comparisons. Further psychometric testing is required to establish the construct validity and responsiveness of the BrAT. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
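The agreement parameters named above, standard error of measurement (SEM) and minimal detectable change (MDC), are commonly derived from a reliability coefficient. The sketch below uses one widely used pair of formulas, SEM = SD × √(1 − ICC) and MDC95 = 1.96 × √2 × SEM. Whether the BrAT authors used exactly these formulas is an assumption, since the abstract does not state them, and the input values are hypothetical.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """SEM from a score standard deviation and a reliability coefficient (e.g. ICC)."""
    if sd < 0 or not 0.0 <= reliability <= 1.0:
        raise ValueError("sd must be non-negative and reliability must lie in [0, 1]")
    return sd * math.sqrt(1.0 - reliability)


def minimal_detectable_change(sem, z=1.96):
    """MDC at the confidence level implied by z (1.96 gives the usual MDC95)."""
    # sqrt(2) reflects that a change score combines the error of two measurements.
    return z * math.sqrt(2.0) * sem


# Hypothetical subscale: SD of 10 points, test-retest ICC of .91.
sem = standard_error_of_measurement(10.0, 0.91)
mdc95 = minimal_detectable_change(sem)
print(round(sem, 2), round(mdc95, 2))
```

Under these assumptions, a score difference smaller than the MDC cannot be distinguished from measurement error for an individual respondent.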
Improving personality facet scores with multidimensional computer adaptive testing

DEFF Research Database (Denmark)

Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A W

2013-01-01

personality tests contain many highly correlated facets. This article investigates the possibility of increasing the precision of the NEO PI-R facet scores by scoring items with multidimensional item response theory and by efficiently administering and scoring items with multidimensional computer adaptive...
The Effect of Mock Tests on Iranian EFL learners’ Test Scores

Directory of Open Access Journals (Sweden)

Hossein Khodabakhshzadeh

2016-07-01

The effect of using tests in test preparation courses has been subject to debate. While some scholars such as Yang and Badger (2015) believe it is a cause of positive washback effect, others argue that this issue is tentative and context-bound (Green, 2007). Therefore, this study investigated the effect of using mock tests in International English Language Testing System (IELTS) preparation courses on students’ overall IELTS scores. Fifty-one IELTS students were selected non-randomly through the quota sampling approach out of 76 students at Mahan Language Institute in Birjand, Iran. These participants were distributed into Group 1 (n=25) and Group 2 (n=26). A complete IELTS test was administered to ensure that the groups were homogeneous and to serve as a pretest. After 10 sessions of intervention, a different IELTS test was administered as a posttest. The results of between-subject analysis through an independent samples t-test revealed that using mock tests in IELTS preparation courses can positively affect participants’ scores on the IELTS exam. Pedagogical implications are discussed.
Psychometric evaluation of self-report outcome measures for prosthetic applications.

Science.gov (United States)

Hafner, Brian J; Morgan, Sara J; Askew, Robert L; Salem, Rana

2016-01-01

Documentation of clinical outcomes is increasingly expected in delivery of prosthetic services and devices. However, many outcome measures suitable for use in clinical care and research have not been psychometrically tested with prosthesis users. The aim of this study was to determine test-retest reliability, mode-of-administration (MoA) equivalence, standard error of measurement (SEM), and minimal detectable change (MDC) of standardized, self-report instruments that assess constructs of importance to people with lower limb loss. Prosthesis users (n = 201) were randomly assigned to groups based on MoA (i.e., paper, electronic, or mixed-mode). Participants completed two surveys 2 to 3 d apart. Instruments included the Prosthetic Limb Users Survey of Mobility, Prosthesis Evaluation Questionnaire-Mobility Subscale, Activities-Specific Balance Confidence Scale, Quality of Life in Neurological Conditions-Applied Cognition/General Concerns, Patient-Reported Outcomes Measurement Information System Profile, and Socket Comfort Score. Intraclass correlation coefficients indicated all instruments are appropriate for group-level comparisons and select instruments are suitable for individual-level applications. Several instruments showed evidence of possible floor and ceiling effects. All were equivalent across MoAs. SEM and MDC were quantified to facilitate interpretation of outcomes and change scores. These results can enhance clinicians' and researchers' ability to select, apply, and interpret scores from instruments administered to prosthesis users.
Cross-cultural validation and psychometric testing of the Norwegian version of the TeamSTEPPS® teamwork perceptions questionnaire.

Science.gov (United States)

Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise

2017-12-02

Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test-retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ2 (df) 969.46 (546), p teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.
'Mechanical restraint-confounders, risk, alliance score': testing the clinical validity of a new risk assessment instrument.

Science.gov (United States)

Deichmann Nielsen, Lea; Bech, Per; Hounsgaard, Lise; Alkier Gildberg, Frederik

2017-08-01

Unstructured risk assessment, as well as confounders (underlying reasons for the patient's risk behaviour and alliance), risk behaviour, and parameters of alliance, have been identified as factors that prolong the duration of mechanical restraint among forensic mental health inpatients. To clinically validate a new, structured short-term risk assessment instrument called the Mechanical Restraint-Confounders, Risk, Alliance Score (MR-CRAS), with the intended purpose of supporting the clinicians' observation and assessment of the patient's readiness to be released from mechanical restraint. The content and layout of MR-CRAS and its user manual were evaluated using face validation by forensic mental health clinicians, content validation by an expert panel, and pilot testing within two, closed forensic mental health inpatient units. The three sub-scales (Confounders, Risk, and a parameter of Alliance) showed excellent content validity. The clinical validations also showed that MR-CRAS was perceived and experienced as a comprehensible, relevant, comprehensive, and useable risk assessment instrument. MR-CRAS contains 18 clinically valid items, and the instrument can be used to support the clinical decision-making regarding the possibility of releasing the patient from mechanical restraint. The present three studies have clinically validated a short MR-CRAS scale that is currently being psychometrically tested in a larger study.
Psychometric properties of the reassurance-seeking scale in a Turkish sample.

Science.gov (United States)

Gençöz, Tülin; Gençöz, Faruk

2005-02-01

This study examined the psychometric properties of the Reassurance-Seeking Scale in a sample of 102 Turkish undergraduate students. High internal consistency reliability was found for the Reassurance-Seeking Scale (alpha=.86). Factor analysis of the scale identified a single component that accounted for 71% of the total variance. The scale was significantly positively correlated with the Beck Depression Inventory and Beck Anxiety Inventory and had a significantly negative correlation with the Rosenberg Self-esteem Scale. Partial correlations of Reassurance-seeking with Depression scores as controlled by Anxiety scores and with Anxiety scores as controlled by Depression scores indicated that Reassurance-seeking scores maintained association with Depression but not with Anxiety. All these findings were in line with expectations.
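The partial correlations described above (Reassurance-seeking with Depression, controlling for Anxiety, and the reverse) can be illustrated with the first-order partial correlation formula, r_xy.z = (r_xy − r_xz·r_yz) / √((1 − r_xz²)(1 − r_yz²)). This is only a sketch: the abstract reports no correlation values, so the numbers below are hypothetical and chosen solely to show how an association can shrink once a third variable is controlled.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, controlling for z."""
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denominator == 0.0:
        raise ValueError("undefined when x or y is perfectly correlated with z")
    return (r_xy - r_xz * r_yz) / denominator


# Hypothetical zero-order correlations (not taken from the study):
# x = reassurance-seeking, y = anxiety, z = depression.
r_reassurance_anxiety = 0.30
r_reassurance_depression = 0.45
r_anxiety_depression = 0.60
print(round(partial_correlation(r_reassurance_anxiety,
                                r_reassurance_depression,
                                r_anxiety_depression), 3))
```

With these made-up inputs the partial correlation falls close to zero, the kind of pattern the record describes for Anxiety once Depression is controlled.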
Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

Science.gov (United States)

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

2010-01-01

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Evaluating the psychometric properties of the Polish version of the Body Appreciation Scale-2.

Science.gov (United States)

Razmus, Magdalena; Razmus, Wiktor

2017-12-01

This study aimed to investigate the factor structure and psychometric properties of a Polish version of the Body Appreciation Scale-2 (BAS-2; Tylka & Wood-Barcalow, 2015). Data were collected from 721 individuals residing in various regions of Poland. There were two subsamples (n=336, age M=34.95, SD=10.83; and n=385, age M=35.38, SD=10.83). Both principal-axis and confirmatory factor analyses supported the one-dimensional structure of BAS-2 scores. Moreover, full scalar invariance of the BAS-2 in Poland across sex was demonstrated. Scores on the Polish BAS-2 had adequate internal consistency. Convergent validity was demonstrated through significant correlations between BAS-2 scores and variables related to body image (body and appearance self-conscious emotions), well-being (self-esteem, positive affect, and positive orientation), and body mass index. These results indicate that the Polish BAS-2 is an appropriate and psychometrically-sound measure of body appreciation. Copyright © 2017 Elsevier Ltd. All rights reserved.
The Cervical Dystonia Impact Profile (CDIP-58): Can a Rasch developed patient reported outcome measure satisfy traditional psychometric criteria?

Directory of Open Access Journals (Sweden)

Bhatia Kailash P

2008-08-01

Background. The United States Food and Drug Administration (FDA) is currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs) in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58), a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: (1) provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; (2) enable researchers and clinicians to compare it with existing dystonia PROMs; and (3) help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing. Methods. We evaluated traditional psychometric properties of data quality, scaling assumptions, targeting, reliability and validity in a group of 391 people with CD. The main outcome measures used were the CDIP-58, Medical Outcome Study Short Form-36, the 28-item General Health Questionnaire, and Hospital Anxiety and Depression Scale. Results. A total of 391 people returned completed questionnaires (corrected response rate 87%). Analyses showed: (1) data quality was high (low missing data ≤ 4%; subscale scores could be computed for > 96% of the sample); (2) item groupings passed tests for scaling assumptions; (3) good targeting (except for the Sleep subscale, ceiling effect = 27%); (4) good reliability (Cronbach's alpha ≥ 0.92, test-retest intraclass correlations ≥ 0.83); and (5) validity was supported. Conclusion. This study has shown that new psychometric methods can produce a PROM that stands up to traditional criteria and supports the clinical advantages of Rasch analysis.

Psychometric Properties of Farsi Version of the Wish to be Dead Scale.

Science.gov (United States)

Dadfar, Mahboubeh; Lester, David; Atef Vahid, Mohammad Kazem; Abdel-Khalek, Ahmed M

2017-11-01

The Wish to be Dead Scale (WDS) is a new scale to measure precursors to suicidal ideation, and the aim of the present study was to examine the psychometric characteristics of a Farsi version of the WDS. The sample was a convenience sample of 145 Iranian female undergraduates and postgraduates selected from different faculties at Iran University of Medical Sciences, Iran. Using a principal component analysis and a varimax rotation with Kaiser normalization, three factors were identified and labeled: (a) lack of purpose and usefulness in life, (b) lack of interest in living, and (c) fantasizing about being dead. The WDS had good inter-item and test-retest reliability and significant positive correlations with scores on the Kessler Psychological Distress Scale-10 and the Rosenberg Self-Esteem Scale, and negative correlations with scores on the Adult Hope Scale, the Satisfaction with Life Scale, the General Self-Efficacy Scale, the Love of Life Scale, the Life Orientation Test, and the Oxford Happiness Questionnaire. We conclude that the WDS may prove to be useful in clinical practice and research into suicide.
Revision and psychometric testing of the Incivility in Nursing Education (INE) survey: introducing the INE-R.

Science.gov (United States)

Clark, Cynthia M; Barbosa-Leiker, Celestina; Gill, Larecia Money; Nguyen, Danh

2015-06-01

Academic incivility is a serious challenge for nursing education, which needs to be empirically measured and fully addressed. A convenience sample of nursing faculty and students from 20 schools of nursing in the United States participated in a mixed-methods study to test the psychometric properties of the Incivility in Nursing Education-Revised (INE-R) Survey. A factor analysis and other reliability analyses support the use of the INE-R as a valid and reliable measurement of student and faculty perceptions of incivility in nursing education. The INE-R is a psychometrically sound instrument to measure faculty and student perceptions of incivility; to examine differences regarding levels of nursing education, program type, gender, age, and ethnicity; to compare perceptions of incivility between and among adjunct, clinical, teaching, and research faculty; and to conduct pre- and postassessments of the perceived levels of faculty and student incivility in nursing programs to inform evidence-based interventions. Copyright 2015, SLACK Incorporated.
Clinimetrics and clinical psychometrics: macro- and micro-analysis.

Science.gov (United States)

Tomba, Elena; Bech, Per

2012-01-01

Clinimetrics was introduced three decades ago to specify the domain of clinical markers in clinical medicine (indexes or rating scales). In this perspective, clinical validity is the platform for selecting the various indexes or rating scales (macro-analysis). Psychometric validation of these indexes or rating scales is the measuring aspect (micro-analysis). Clinical judgment analysis by experienced psychiatrists is included in the macro-analysis and the item response theory models are especially preferred in the micro-analysis when using the total score as a sufficient statistic. Clinical assessment tools covering severity of illness scales, prognostic measures, issues of co-morbidity, longitudinal assessments, recovery, stressors, lifestyle, psychological well-being, and illness behavior have been identified. The constructive dialogue in clinimetrics between clinical judgment and psychometric validation procedures is outlined for generating developments of clinical practice in psychiatry. Copyright © 2012 S. Karger AG, Basel.
Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

OpenAIRE

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff; Lenon, George Binh

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization’s guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability...
Achievement Testing with the Wechsler Quicktest: An Examination of Its Psychometric Properties and Applied Utility with a Greek-Cypriot Sample

Science.gov (United States)

Vrachimi-Souroulla, Andry; Panayiotou, Georgia; Kokkinos, Constantinos M.; Lamprianou, Iasonas

2011-01-01

The study aimed to field-test a Greek version of the Wechsler Quicktest and to examine its psychometric properties. The Quicktest was individually administered to 208 students, aged 5-14 years, along with a reading test. Based on the Rasch analysis, data for the Quicktest subtests showed acceptable fit to the model. Also, correlations were found…
A validation study of the psychometric properties of the Groningen Reflection Ability Scale

DEFF Research Database (Denmark)

Andersen, Nina Bjerre; O'Neill, Lotte; Gormsen, Lise Kirstine

2014-01-01

Background Reflection, the ability to examine critically one’s own learning and functioning, is considered important for ‘the good doctor’. The Groningen Reflection Ability Scale (GRAS) is an instrument measuring student reflection, which has not yet been validated beyond the original Dutch study....... The aim of this study was to adapt GRAS for use in a Danish setting and to investigate the psychometric properties of GRAS-DK. Methods We performed a cross-cultural adaptation of GRAS from Dutch to Danish. Next, we collected primary data online, performed a retest, analysed data descriptively, estimated...... measurement error, performed an exploratory and a confirmatory factor analysis to test the proposed three-factor structure. Results 361 (69%) of 523 invited students completed GRAS-DK. Their mean score was 88 (SD = 11.42; scale maximum 115). Scores were approximately normally distributed. Measurement error...
Development and psychometric validation of a scale to assess information needs in cardiac rehabilitation: the INCR Tool.

Science.gov (United States)

Ghisi, Gabriela Lima de Melo; Grace, Sherry L; Thomas, Scott; Evans, Michael F; Oh, Paul

2013-06-01

To develop and psychometrically validate a tool to assess information needs in cardiac rehabilitation (CR) patients. After a literature search, 60 information items divided into 11 areas of needs were identified. To establish content validity, they were reviewed by an expert panel (N=10). Refined items were pilot-tested in 34 patients on a 5-point Likert-scale from 1 "really not helpful" to 5 "very important". A final version was generated and psychometrically tested in 203 CR patients. Test-retest reliability was assessed via the intraclass correlation coefficient (ICC), the internal consistency using Cronbach's alpha, and criterion validity was assessed with regard to patient's education and duration in CR. Five items were excluded after ICC analysis as well as one area of needs. All 10 areas were considered internally consistent (Cronbach's alpha>0.7). Criterion validity was supported by significant differences in mean scores by educational level (pinformation need. The INCR Tool was demonstrated to have good reliability and validity. This is an appropriate tool for application in clinical and research settings, assessing patients' needs during CR and as part of education programming. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Psychometric properties for the Balanced Inventory of Desirable Responding: dichotomous versus polytomous conventional and IRT scoring.

Science.gov (United States)

Vispoel, Walter P; Kim, Han Yi

2014-09-01

[Correction Notice: An Erratum for this article was reported in Vol 26(3) of Psychological Assessment (see record 2014-16017-001). The mean, standard deviation and alpha coefficient originally reported in Table 1 should be 74.317, 10.214 and .802, respectively. The validity coefficients in the last column of Table 4 are affected as well. Correcting this error did not change the substantive interpretations of the results, but did increase the mean, standard deviation, alpha coefficient, and validity coefficients reported for the Honesty subscale in the text and in Tables 1 and 4. The corrected versions of Tables 1 and Table 4 are shown in the erratum.] Item response theory (IRT) models were applied to dichotomous and polytomous scoring of the Self-Deceptive Enhancement and Impression Management subscales of the Balanced Inventory of Desirable Responding (Paulhus, 1991, 1999). Two dichotomous scoring methods reflecting exaggerated endorsement and exaggerated denial of socially desirable behaviors were examined. The 1- and 2-parameter logistic models (1PLM, 2PLM, respectively) were applied to dichotomous responses, and the partial credit model (PCM) and graded response model (GRM) were applied to polytomous responses. For both subscales, the 2PLM fit dichotomous responses better than did the 1PLM, and the GRM fit polytomous responses better than did the PCM. Polytomous GRM and raw scores for both subscales yielded higher test-retest and convergent validity coefficients than did PCM, 1PLM, 2PLM, and dichotomous raw scores. Information plots showed that the GRM provided consistently high measurement precision that was superior to that of all other IRT models over the full range of both construct continuums. Dichotomous scores reflecting exaggerated endorsement of socially desirable behaviors provided noticeably weak precision at low levels of the construct continuums, calling into question the use of such scores for detecting instances of "faking bad." Dichotomous
Psychometric Properties of the Psychosocial Assessment Tool-Chronic Pain Version in Families of Children With Headache.

Science.gov (United States)

Woods, Kristine; Ostrowski-Delahanty, Sarah

2017-07-01

Children with headache disorders are at increased psychosocial risk, and no validated screening measures exist to succinctly assess for risk. This study examined the psychometric properties of the Psychosocial Assessment Tool-Chronic Pain, a previously adapted screening measure of risk, in a retrospective sample of families of children diagnosed with headaches. Participants included 127 children and caregivers presenting for behavioral health evaluation of headache. Children and their primary caregivers completed several psychosocial assessment measures. Internal consistency for the Psychosocial Assessment Tool-Chronic Pain total score was high (α = 0.80), and all subscale scores had moderate to high internal consistency (α = 0.597-0.88), with the exception of the caregiver beliefs subscale (α = 0.443). The total score and the majority of subscale scores on the Psychosocial Assessment Tool-Chronic Pain were correlated with caregiver- and child-reported scores on study measures. The results demonstrate that the Psychosocial Assessment Tool-Chronic Pain has adequate psychometric properties, and because of the brief administration time, ease of scoring, and accessibility of the measure, it is a promising measure of screening for psychosocial risk in this population.
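Note on the internal-consistency figures above: the α values reported in this record are Cronbach's alpha coefficients. As a minimal sketch of how such a coefficient is commonly computed from an item-by-respondent score table, the following uses the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total score). The function name `cronbach_alpha` and the toy data are illustrative assumptions and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of scores.

    scores: list of respondents, each a list of k item scores.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    items = list(zip(*scores))  # one tuple of responses per item
    item_var_sum = sum(variance(item) for item in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical toy data: 4 respondents, 3 items that rise together.
toy = [
    [1, 2, 3],
    [2, 3, 4],
    [3, 4, 5],
    [4, 5, 6],
]
print(round(cronbach_alpha(toy), 3))
```

Because the three toy items move in lockstep, the coefficient reaches its upper bound of 1; real scales such as the one described above fall below that.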
Prediction of IOI-HA Scores Using Speech Reception Thresholds and Speech Discrimination Scores in Quiet

DEFF Research Database (Denmark)

Brännström, K Jonas; Lantz, Johannes; Nielsen, Lars Holme

2014-01-01

), and speech discrimination scores (SDSs) in quiet or in noise are common assessments made prior to hearing aid (HA) fittings. It is not known whether SRT and SDS in quiet relate to HA outcome measured with the International Outcome Inventory for Hearing Aids (IOI-HA). PURPOSE: The aim of the present study...... COLLECTION AND ANALYSIS: The psychometric properties were evaluated and compared to previous studies using the IOI-HA. The associations and differences between the outcome scores and a number of descriptive variables (age, gender, fitted monaurally/binaurally with HA, first-time/experienced HA users, years...
No apparent influence of psychometrically-defined schizotypy on orientation-dependent contextual modulation of visual contrast detection

Directory of Open Access Journals (Sweden)

Damien J. Mannion

2017-01-01

We investigated the relationship between psychometrically-defined schizotypy and the ability to detect a visual target pattern. Target detection is typically impaired by a surrounding pattern (context) with an orientation that is parallel to the target, relative to a surrounding pattern with an orientation that is orthogonal to the target (orientation-dependent contextual modulation). Based on reports that this effect is reduced in those with schizophrenia, we hypothesised that there would be a negative relationship between the relative score on psychometrically-defined schizotypy and the relative effect of orientation-dependent contextual modulation. We measured visual contrast detection thresholds and scores on the Oxford-Liverpool Inventory of Feelings and Experiences (O-LIFE) from a non-clinical sample (N = 100). Contrary to our hypothesis, we find an absence of a monotonic relationship between the relative magnitude of orientation-dependent contextual modulation of visual contrast detection and the relative score on any of the subscales of the O-LIFE. The apparent difference of this result with previous reports on those with schizophrenia suggests that orientation-dependent contextual modulation may be an informative condition in which schizophrenia and psychometrically-defined schizotypy are dissociated. However, further research is also required to clarify the strength of orientation-dependent contextual modulation in those with schizophrenia.
The psychometrics of mental workload: multiple measures are sensitive but divergent.

Science.gov (United States)

Matthews, Gerald; Reinerman-Jones, Lauren E; Barber, Daniel J; Abich, Julian

2015-02-01

A study was run to test the sensitivity of multiple workload indices to the differing cognitive demands of four military monitoring task scenarios and to investigate relationships between indices. Various psychophysiological indices of mental workload exhibit sensitivity to task factors. However, the psychometric properties of multiple indices, including the extent to which they intercorrelate, have not been adequately investigated. One hundred fifty participants performed in four task scenarios based on a simulation of unmanned ground vehicle operation. Scenarios required threat detection and/or change detection. Both single- and dual-task scenarios were used. Workload metrics for each scenario were derived from the electroencephalogram (EEG), electrocardiogram, transcranial Doppler sonography, functional near infrared, and eye tracking. Subjective workload was also assessed. Several metrics showed sensitivity to the differing demands of the four scenarios. Eye fixation duration and the Task Load Index metric derived from EEG were diagnostic of single-versus dual-task performance. Several other metrics differentiated the two single tasks but were less effective in differentiating single- from dual-task performance. Psychometric analyses confirmed the reliability of individual metrics but failed to identify any general workload factor. An analysis of difference scores between low- and high-workload conditions suggested an effort factor defined by heart rate variability and frontal cortex oxygenation. General workload is not well defined psychometrically, although various individual metrics may satisfy conventional criteria for workload assessment. Practitioners should exercise caution in using multiple metrics that may not correspond well, especially at the level of the individual operator.
Development and psychometric evaluation of a cardiovascular risk and disease management knowledge assessment tool.

Science.gov (United States)

Rosneck, James S; Hughes, Joel; Gunstad, John; Josephson, Richard; Noe, Donald A; Waechter, Donna

2014-01-01

This article describes the systematic construction and psychometric analysis of a knowledge assessment instrument for phase II cardiac rehabilitation (CR) patients measuring risk modification disease management knowledge and behavioral outcomes derived from national standards relevant to secondary prevention and management of cardiovascular disease. First, using adult curriculum based on disease-specific learning outcomes and competencies, a systematic test item development process was completed by clinical staff. Second, a panel of educational and clinical experts used an iterative process to identify test content domain and arrive at consensus in selecting items meeting criteria. Third, the resulting 31-question instrument, the Cardiac Knowledge Assessment Tool (CKAT), was piloted in CR patients to ensure use of application. Validity and reliability analyses were performed on 3638 adults before test administrations with additional focused analyses on 1999 individuals completing both pretreatment and posttreatment administrations within 6 months. Evidence of CKAT content validity was substantiated, with 85% agreement among content experts. Evidence of construct validity was demonstrated via factor analysis identifying key underlying factors. Estimates of internal consistency, for example, Cronbach's α = .852 and Spearman-Brown split-half reliability = 0.817 on pretesting, support test reliability. Item analysis, using point biserial correlation, measured relationships between performance on single items and total score (P knowledge instrument specifically designed for an adult CR population was systematically developed and tested in a large representative patient population, satisfying psychometric parameters, including validity and reliability.
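Note on the split-half figure above: a Spearman-Brown split-half reliability is usually obtained by correlating scores on two halves of the test and then "stepping up" that correlation to full test length with r_full = 2r / (1 + r). The sketch below assumes an odd/even item split and a plain Pearson correlation; the abstract does not state how the halves were formed, and the function names are illustrative.

```python
from math import sqrt


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to full-length reliability."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(scores):
    """scores: list of respondents, each a list of item scores.

    Assumption: halves are the odd- and even-numbered items.
    """
    odd = [sum(row[0::2]) for row in scores]
    even = [sum(row[1::2]) for row in scores]
    return spearman_brown(pearson(odd, even))


# A half-test correlation of 0.5 steps up to 2/3.
print(spearman_brown(0.5))
```

The step-up is needed because each half contains only half the items, so the raw correlation between halves understates the reliability of the full-length instrument.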
An investigation into the psychometric properties of the Hospital Anxiety and Depression Scale in patients with breast cancer

Science.gov (United States)

Rodgers, Jacqui; Martin, Colin R; Morse, Rachel C; Kendell, Kate; Verrill, Mark

2005-01-01

Background To determine the psychometric properties of the Hospital Anxiety and Depression Scale (HADS) in patients with breast cancer and determine the suitability of the instrument for use with this clinical group. Methods A cross-sectional design was used. The study used a pooled data set from three breast cancer clinical groups. The dependent variables were HADS anxiety and depression sub-scale scores. Exploratory and confirmatory factor analyses were conducted on the HADS to determine its psychometric properties in 110 patients with breast cancer. Seven models were tested to determine model fit to the data. Results Both factor analysis methods indicated that three-factor models provided a better fit to the data compared to two-factor (anxiety and depression) models for breast cancer patients. Clark and Watson's three factor tripartite and three factor hierarchical models provided the best fit. Conclusion The underlying factor structure of the HADS in breast cancer patients comprises three distinct, but correlated factors, negative affectivity, autonomic anxiety and anhedonic depression. The clinical utility of the HADS in screening for anxiety and depression in breast cancer patients may be enhanced by using a modified scoring procedure based on a three-factor model of psychological distress. This proposed alternate scoring method involving regressing autonomic anxiety and anhedonic depression factors onto the third factor (negative affectivity) requires further investigation in order to establish its efficacy. PMID:16018801
A process dissociation approach to objective-projective test score interrelationships.

Science.gov (United States)

Bornstein, Robert F

2002-02-01

Even when self-report and projective measures of a given trait or motive both predict theoretically related features of behavior, scores on the 2 tests correlate modestly with each other. This article describes a process dissociation framework for personality assessment, derived from research on implicit memory and learning, which can resolve these ostensibly conflicting results. Research on interpersonal dependency is used to illustrate 3 key steps in the process dissociation approach: (a) converging behavioral predictions, (b) modest test score intercorrelations, and (c) delineation of variables that differentially affect self-report and projective test scores. Implications of the process dissociation framework for personality assessment and test development are discussed.
A prognostic scoring system for arm exercise stress testing.

Science.gov (United States)

Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

2016-01-01

Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% of patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all pstatistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.
Psychometric properties of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in patients with fibromyalgia.

Science.gov (United States)

Cebolla, Ausias; Luciano, Juan V; DeMarzo, Marcelo Piva; Navarro-Gil, Mayte; Campayo, Javier Garcia

2013-01-14

Mindful-based interventions improve functioning and quality of life in fibromyalgia (FM) patients. The aim of the study is to perform a psychometric analysis of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in a sample of patients diagnosed with FM. The following measures were administered to 251 Spanish patients with FM: the Spanish version of MAAS, the Chronic Pain Acceptance Questionnaire, the Pain Catastrophising Scale, the Injustice Experience Questionnaire, the Psychological Inflexibility in Pain Scale, the Fibromyalgia Impact Questionnaire and the Euroqol. Factorial structure was analysed using Confirmatory Factor Analyses (CFA). Cronbach's α coefficient was calculated to examine internal consistency, and the intraclass correlation coefficient (ICC) was calculated to assess the test-retest reliability of the measures. Pearson's correlation tests were run to evaluate univariate relationships between scores on the MAAS and criterion variables. The MAAS scores in our sample were low (M = 56.7; SD = 17.5). CFA confirmed a two-factor structure, with the following fit indices [sbX2 = 172.34 (p < 0.001), CFI = 0.95, GFI = 0.90, SRMR = 0.05, RMSEA = 0.06]. MAAS was found to have high internal consistency (Cronbach's α = 0.90) and adequate test-retest reliability at a 1-2 week interval (ICC = 0.90). It showed significant and expected correlations with the criterion measures with the exception of the Euroqol (Pearson = 0.15). Psychometric properties of the Spanish version of the MAAS in patients with FM are adequate. The dimensionality of the MAAS found in this sample and directions for future research are discussed.
A Psychometric Analysis of the Reading the Mind in the Eyes Test: Towards a Brief Form for Research and Applied Settings

Directory of Open Access Journals (Sweden)

Sally Olderbak

2015-10-01

The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum). However, little is known about the test’s psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test’s reliance on one’s vocabulary and evaluate the short-form structure in clinical populations.
Psychometric characteristics of the chronic Otitis media questionnaire 12 (COMQ - 12): stability of factor structure and replicability shown by the Serbian version.

Science.gov (United States)

Bukurov, Bojana; Arsovic, Nenad; Grujicic, Sandra Sipetic; Haggard, Mark; Spencer, Helen; Marinkovic, Jelena Eric

2017-10-23

Recently, demand for and supply of short-form patient-reported outcome measures (PROMs) have risen throughout healthcare worldwide. Our contribution to meeting that demand has been translating and culturally adapting the Chronic Otitis Media Questionnaire-12 (COMQ-12) for adults into Serbian and enhancing its psychometric base on the relatively large Serbian COM caseload. Chronic otitis media can seriously affect quality of life progressively and in the long term, and it remains the major source of hearing problems in the developing world. The translated questionnaire was given twice to 60 adult patients with chronic otitis media of three types (inactive, active mucosal and active squamous disease) and to 60 healthy volunteers. Both patients and volunteers also completed the generic Short-Form 36 questionnaire (SF-36). Conventional statistical procedures were used in strategically driven development of scoring. Additionally, item responses were scaled by linear mapping against the provisional total score. Generalizability, detailed factor interpretation and supportability of scores were the criteria for the best compromise factor solution. Test-retest reliability was very high (0.924 to 0.989, depending on score). The a priori content dimensions of the questionnaire were strongly supported by 3-factor exploratory and confirmatory factor analyses for content validity, separating (i) ear symptoms from (ii) hearing problems, from (iii) daily activity restriction plus healthcare uptake. The 3-factor structure was furthermore highly stable on replication. The very large effect sizes when contrasting patients with healthy volunteers, and active with inactive disease established construct validity for the total score. A strong association with disease activity and a moderate one with generic health-related quality of life (HRQoL), the SF-36, supported construct validity for two of three factors extracted (ear symptoms, and impact on daily activities plus healthcare uptake). Given

The Functionality Appreciation Scale (FAS): Development and psychometric evaluation in U.S. community women and men.

Science.gov (United States)

Alleva, Jessica M; Tylka, Tracy L; Kroon Van Diest, Ashley M

2017-12-01

Body functionality has been identified as an important dimension of body image that has the potential to be useful in the prevention and treatment of negative body image and in the enhancement of positive body image. Specifically, cultivating appreciation of body functionality may offset appearance concerns. However, a scale assessing this construct has yet to be developed. Therefore, we developed the Functionality Appreciation Scale (FAS) and examined its psychometric properties among three online community samples totalling 1042 women and men (ns=490 and 552, respectively). Exploratory factor analyses revealed a unidimensional structure with seven items. Confirmatory factor analysis upheld its unidimensionality and invariance across gender. The internal consistency, test-retest reliability, criterion-related, and construct (convergent, discriminant, incremental) validity of its scores were upheld. The FAS is a psychometrically sound measure that is unique from existing positive body image measures. Scholars will find the FAS applicable within research and clinical settings. Copyright © 2017 Elsevier Ltd. All rights reserved.
Measuring psychological trauma after spinal cord injury: Development and psychometric characteristics of the SCI-QOL Psychological Trauma item bank and short form.

Science.gov (United States)

Kisala, Pamela A; Victorson, David; Pace, Natalie; Heinemann, Allen W; Choi, Seung W; Tulsky, David S

2015-05-01

To describe the development and psychometric properties of the SCI-QOL Psychological Trauma item bank and short form. Using a mixed-methods design, we developed and tested a Psychological Trauma item bank with patient and provider focus groups, cognitive interviews, and item response theory based analytic approaches, including tests of model fit, differential item functioning (DIF) and precision. We tested a 31-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Veterans Administration hospital. A total of 716 individuals with SCI completed the trauma items. The 31 items fit a unidimensional model (CFI=0.952; RMSEA=0.061) and demonstrated good precision (theta range between 0.6 and 2.5). Nine items demonstrated negligible DIF with little impact on score estimates. The final calibrated item bank contains 19 items. The SCI-QOL Psychological Trauma item bank is a psychometrically robust measurement tool from which a short form and a computer adaptive test (CAT) version are available.
Psychometric Properties of the Croatian Language Version of the Dental Environment Stress Questionnaire on Dental Medicine Students

Directory of Open Access Journals (Sweden)

Martina Laktić

2017-01-01

Objective: To develop the Croatian version of the 41-item Dental Environment Stress questionnaire (DES) for stress assessment of dental students in both preclinical and clinical years of study and to test its psychometric properties in the Croatian dental student population. Materials and Methods: The English version of the 41-item DES questionnaire was first translated into the Croatian language. Subsequently, it was placed on Google Drive and filled out by a total of 202 students from the School of Dental Medicine, University of Zagreb and 30 additional students from other faculties. Students also assessed their overall level of stress on a Likert scale (1=no stress, 5=highest level of stress). Internal consistency was tested on 202 dental students; test-retest reliability on 30 dental students who filled out the same questionnaire twice; convergent validity on 202 dental students; and divergent validity on 202 dental students and 30 students from faculties not belonging to the biomedicine group. Results: Internal consistency showed a high Cronbach alpha coefficient (0.9) and test-retest reliability showed no significant difference (P>0.05) within the period of 14 days when stress level had not changed (vacation). Convergent validity was confirmed by the significant association between the DES summary scores and the self-perceived level of stress (Spearman’s rho=0.881; P<0.001). Divergent validity was confirmed by significantly lower DES summary scores in students not belonging to the biomedicine group (t=7.5, P<0.001). Conclusion: Excellent psychometric properties of the Croatian version of the DES questionnaire enable its utilization for assessment of stress level in Croatian dental students.
Measuring pregnancy planning: A psychometric evaluation and comparison of two scales.

Science.gov (United States)

Drevin, Jennifer; Kristiansson, Per; Stern, Jenny; Rosenblad, Andreas

2017-11-01

To psychometrically test the London Measure of Unplanned Pregnancy and compare it with the Swedish Pregnancy Planning Scale. The incidence of unplanned pregnancies is an important indicator of reproductive health. The London Measure of Unplanned Pregnancy measures pregnancy planning by taking contraceptive use, timing, intention to become pregnant, desire for pregnancy, partner agreement, and pre-conceptual preparations into account. It has, however, previously not been psychometrically evaluated using confirmatory factor analysis. The Likert-scored single-item Swedish Pregnancy Planning Scale has been developed to measure the woman's own view of pregnancy planning level. Cross-sectional design. In 2012-2013, 5493 pregnant women living in Sweden were invited to participate in the Swedish Pregnancy Planning study, of whom 3327 (61%) agreed to participate and answered a questionnaire. A test-retest pilot study was conducted in 2011-2012. Thirty-two participants responded to the questionnaire on two occasions 14 days apart. Data were analysed using confirmatory factor analysis, Cohen's weighted kappa and Spearman's correlation. All items of the London Measure of Unplanned Pregnancy contributed to measuring pregnancy planning, but four items had low item-reliability. The London Measure of Unplanned Pregnancy and Swedish Pregnancy Planning Scale corresponded reasonably well with each other and both showed good test-retest reliability. The London Measure of Unplanned Pregnancy may benefit from item reduction and its usefulness may be questioned. The Swedish Pregnancy Planning Scale is time-efficient and shows acceptable reliability and construct validity, which makes it more useful for measuring pregnancy planning. © 2017 John Wiley & Sons Ltd.
Preliminary psychometric properties of the Acceptance and Action Questionnaire-II: a revised measure of psychological inflexibility and experiential avoidance.

Science.gov (United States)

Bond, Frank W; Hayes, Steven C; Baer, Ruth A; Carpenter, Kenneth M; Guenole, Nigel; Orcutt, Holly K; Waltz, Tom; Zettle, Robert D

2011-12-01

The present research describes the development and psychometric evaluation of a second version of the Acceptance and Action Questionnaire (AAQ-II), which assesses the construct referred to as, variously, acceptance, experiential avoidance, and psychological inflexibility. Results from 2,816 participants across six samples indicate the satisfactory structure, reliability, and validity of this measure. For example, the mean alpha coefficient is .84 (.78-.88), and the 3- and 12-month test-retest reliability is .81 and .79, respectively. Results indicate that AAQ-II scores concurrently, longitudinally, and incrementally predict a range of outcomes, from mental health to work absence rates, that are consistent with its underlying theory. The AAQ-II also demonstrates appropriate discriminant validity. The AAQ-II appears to measure the same concept as the AAQ-I (r=.97) but with better psychometric consistency. Copyright © 2011. Published by Elsevier Ltd.
What do educational test scores really measure?

DEFF Research Database (Denmark)

McIntosh, James; D. Munk, Martin

Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelate......, and possible incentive problems make it more difficult to elicit true values of what the tests measure....
Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

Science.gov (United States)

Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

2016-03-01

This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (pcorrelation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
Psychometric properties of a driving self-efficacy scale – short form in Argentinean drivers

Directory of Open Access Journals (Sweden)

Mario Alberto Trogolo

2017-06-01

The purpose of this study was to translate and examine the psychometric properties of a driving self-efficacy scale developed by Dorn and Machin (2004). The factor structure, reliability and external validity of the scale were examined in a sample of 447 drivers from Cordoba, Argentina. In addition, measurement invariance across sex was also tested. Results from a confirmatory factor analysis support the unidimensional structure of the scale and the invariance of its parameters (configural, metric and scalar) between men and women. Reliability analyses using alpha and omega coefficients revealed high internal consistency (coefficients equal to 0.81 in both cases) and satisfactory evidence of external validity of the scale scores, with measures of risk perception, risky driving, history of traffic crashes and fines. Finally, results also showed that the scale seems to be relatively robust against response biases due to social desirability. In summary, findings support the validity and reliability of the scale in Argentina. However, further studies analyzing additional psychometric properties are needed.
Psychometric properties of startle and corrugator response in NPU, affective picture viewing, and resting state tasks.

Science.gov (United States)

Kaye, Jesse T; Bradford, Daniel E; Curtin, John J

2016-08-01

The current study provides a comprehensive evaluation of critical psychometric properties of commonly used psychophysiology laboratory tasks/measures within the NIMH RDoC. Participants (N = 128) completed the no-shock, predictable shock, unpredictable shock (NPU) task, affective picture viewing task, and resting state task at two study visits separated by 1 week. We examined potentiation/modulation scores in NPU (predictable or unpredictable shock vs. no-shock) and affective picture viewing tasks (pleasant or unpleasant vs. neutral pictures) for startle and corrugator responses with two commonly used quantification methods. We quantified startle potentiation/modulation scores with raw and standardized responses. We quantified corrugator potentiation/modulation in the time and frequency domains. We quantified general startle reactivity in the resting state task as the mean raw startle response during the task. For these three tasks, two measures, and two quantification methods, we evaluated effect size robustness and stability, internal consistency (i.e., split-half reliability), and 1-week temporal stability. The psychometric properties of startle potentiation in the NPU task were good, but concerns were noted for corrugator potentiation in this task. Some concerns also were noted for the psychometric properties of both startle and corrugator modulation in the affective picture viewing task, in particular, for pleasant picture modulation. Psychometric properties of general startle reactivity in the resting state task were good. Some salient differences in the psychometric properties of the NPU and affective picture viewing tasks were observed within and across quantification methods. © 2016 The Authors. Psychophysiology published by Wiley Periodicals, Inc. on behalf of Society for Psychophysiological Research.
Content Validity and Psychometric Characteristics of the "Knowledge about Older Patients Quiz" for Nurses Using Item Response Theory.

Science.gov (United States)

Dikken, Jeroen; Hoogerduijn, Jita G; Kruitwagen, Cas; Schuurmans, Marieke J

2016-11-01

To assess the content validity and psychometric characteristics of the Knowledge about Older Patients Quiz (KOP-Q), which measures nurses' knowledge regarding older hospitalized adults and their certainty regarding this knowledge. Cross-sectional. Content validity: general hospitals. Psychometric characteristics: nursing school and general hospitals in the Netherlands. Content validity: 12 nurse specialists in geriatrics. Psychometric characteristics: 107 first-year and 78 final-year bachelor of nursing students, 148 registered nurses, and 20 nurse specialists in geriatrics. Content validity: The nurse specialists rated each item of the initial KOP-Q (52 items) on relevance. Ratings were used to calculate Item-Content Validity Index and average Scale-Content Validity Index (S-CVI/ave) scores. Items with insufficient content validity were removed. Psychometric characteristics: Ratings of students, nurses, and nurse specialists were used to test for differential item functioning (DIF) and unidimensionality before item characteristics (discrimination and difficulty) were examined using Item Response Theory. Finally, norm references were calculated and nomological validity was assessed. Content validity: Forty-three items remained after assessing content validity (S-CVI/ave = 0.90). Psychometric characteristics: Of the 43 items, two demonstrating ceiling effects and 11 distorting ability estimates (DIF) were subsequently excluded. Item characteristics were assessed for the remaining 30 items, all of which demonstrated good discrimination and difficulty parameters. Knowledge was positively correlated with certainty about this knowledge. The final 30-item KOP-Q is a valid, psychometrically sound, comprehensive instrument that can be used to assess the knowledge of nursing students, hospital nurses, and nurse specialists in geriatrics regarding older hospitalized adults. It can identify knowledge and certainty deficits for research purposes or serve as a tool in educational
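The Item-Content Validity Index and S-CVI/ave mentioned in the record above can be illustrated with a short sketch. It assumes the common convention that experts rate relevance on a 4-point scale and that ratings of 3 or 4 count as "relevant"; the abstract does not state the rating scale or cut-off used, so both are assumptions here, as are the function names and the example ratings.

```python
def item_cvi(ratings, relevant_min=3):
    """Item-level CVI: share of experts who rated the item as relevant.

    `relevant_min` is an assumed cut-off (3 on a 4-point relevance scale).
    """
    return sum(r >= relevant_min for r in ratings) / len(ratings)


def scale_cvi_ave(ratings_by_item, relevant_min=3):
    """S-CVI/ave: the mean of the item-level CVIs across all items."""
    cvis = [item_cvi(r, relevant_min) for r in ratings_by_item]
    return sum(cvis) / len(cvis)


# Hypothetical ratings from four experts for two items.
ratings = [[4, 4, 3, 2], [4, 4, 4, 4]]
print(item_cvi(ratings[0]), scale_cvi_ave(ratings))
```

Items whose item-level CVI falls below a chosen threshold would be candidates for removal before the scale-level average is reported.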
Modeling Floor Effects in Standardized Vocabulary Test Scores in a Sample of Low SES Hispanic Preschool Children under the Multilevel Structural Equation Modeling Framework

Directory of Open Access Journals (Sweden)

Leina Zhu

2017-12-01

Researchers and practitioners often use standardized vocabulary tests such as the Peabody Picture Vocabulary Test-4 (PPVT-4; Dunn and Dunn, 2007) and its companion, the Expressive Vocabulary Test-2 (EVT-2; Williams, 2007), to assess English vocabulary skills as an indicator of children's school readiness. Despite their psychometric excellence in the norm sample, issues arise when standardized vocabulary tests are used to assess children from culturally, linguistically and ethnically diverse backgrounds (e.g., Spanish-speaking English language learners) or children who are delayed in some manner. One of the biggest challenges is establishing the appropriateness of these measures with non-English or non-standard English speaking children, as they often score one to two standard deviations below expected levels (e.g., Lonigan et al., 2013). This study re-examines the issues in analyzing the PPVT-4 and EVT-2 scores in a sample of 4-to-5-year-old low SES Hispanic preschool children who were part of a larger randomized clinical trial on the effects of a supplemental English shared-reading vocabulary curriculum (Pollard-Durodola et al., 2016). It was found that the data exhibited strong floor effects and that the presence of floor effects made it difficult to differentiate the intervention group and the control group on their vocabulary growth in the intervention. A simulation study is then presented under the multilevel structural equation modeling (MSEM) framework, and results revealed that in regular multilevel data analysis, ignoring floor effects in the outcome variables led to biased results in parameter estimates, standard error estimates, and significance tests. Our findings suggest caution in analyzing and interpreting scores of ethnically and culturally diverse children on standardized vocabulary tests (e.g., floor effects). It is recommended that appropriate analytical methods that take into account floor effects in outcome variables be considered.
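To make concrete why floor effects can obscure a group difference, here is a deliberately simplified sketch. It is not the MSEM simulation described in the record above; it only draws two normally distributed groups, clips every score at an assumed floor, and compares the mean difference before and after clipping. All parameter values and names are hypothetical.

```python
import random
from statistics import mean


def simulate_floor_effect(n=5000, true_shift=0.5, floor=0.0, seed=0):
    """Return (raw difference, floored difference) in group means.

    Scores below `floor` are recorded as `floor`, mimicking a test that
    cannot distinguish among the lowest-scoring children.
    """
    rng = random.Random(seed)
    control = [rng.gauss(0.0, 1.0) for _ in range(n)]
    treated = [rng.gauss(true_shift, 1.0) for _ in range(n)]
    raw_diff = mean(treated) - mean(control)
    floored_diff = (mean(max(x, floor) for x in treated)
                    - mean(max(x, floor) for x in control))
    return raw_diff, floored_diff


raw, floored = simulate_floor_effect()
print(f"raw difference: {raw:.3f}, difference with floor: {floored:.3f}")
```

In this toy setup the difference computed on floored scores is smaller than the difference on the underlying scores, which is the kind of attenuation that an analysis ignoring the floor would carry into its estimates.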
Vietnamese validation of the short version of Internet Addiction Test

Directory of Open Access Journals (Sweden)

Bach Xuan Tran

2017-12-01

Background and aims: The main goal of the present study was to examine the psychometric properties of a Vietnamese version of the short version of the Internet Addiction Test (s-IAT) and to assess the relationship between s-IAT scores and demographics, health-related quality of life and perceived stress scores in young Vietnamese. Methods: The Vietnamese version of s-IAT was administered to a sample of 589 participants. Exploratory factor and reliability analyses were performed. Regression analysis was used to identify the associated factors. Results: The two-factor model of the Vietnamese version of s-IAT demonstrated good psychometric properties. The internal consistency of Factor 1 (loss of control/time management) was high (Cronbach's alpha=0.82) and that of Factor 2 (craving/social problems) was satisfactory (Cronbach's alpha=0.75). Findings indicated that 20.9% of youths were addicted to the Internet. Regression analysis revealed significant associations between Internet addiction and having problems in self-care, lower quality of life and high perceived stress scores. Discussion and conclusions: The Vietnamese version of s-IAT is a valid and reliable instrument to assess IA in the Vietnamese population. Due to the high prevalence of IA among Vietnamese youths, IA should receive attention in future intervention programs. s-IAT can be a useful screening tool for IA to promptly inform and treat IA among Vietnamese youths. Keywords: Factor analysis, Short-version, Internet Addiction Test, Psychometric properties, Vietnamese
R in Psychometrics and Psychometrics in R

OpenAIRE

Leeuw, Jan de

2006-01-01

In psychometrics, and in the closely related fields of quantitative methods for the social and educational sciences, R is not yet used very often. Traditional mainframe packages such as SAS and SPSS are still dominant at the user-level, Stata has made inroads at the teaching level, and Matlab is quite prominent at the research level. In this paper we define the most visible techniques in the psychometrics area, we give an overview of what is available in R, and we discuss what is m...
A note on contemporary psychometrics.

Science.gov (United States)

Vitoratou, Silia; Pickles, Andrew

2017-12-01

Psychometrics provide the mathematical underpinnings for psychological assessment. From the late 19th century, a plethora of methodological research achievements equipped researchers and clinicians with efficient tools whose practical value becomes more evident in the era of the internet and big data. Nowadays, powerful probabilistic models exist for most types of data and research questions. As the usability of the psychometric scales is better comprehended, there is an increased interest in applied research outcomes. Paradoxically, while the interest in applications for psychometric scales increases, publishing research on the development and/or evaluation of those scales per se, is not welcomed by many relevant journals. This special issue in psychometrics is therefore a great opportunity to briefly review the main ideas and methods used in psychometrics, and to discuss the challenges in contemporary applied psychometrics.
NIH Toolbox Cognitive Function Battery (CFB): Composite Scores of Crystallized, Fluid, and Overall Cognition

Science.gov (United States)

Akshoomoff, Natacha; Beaumont, Jennifer L.; Bauer, Patricia J.; Dikmen, Sureyya; Gershon, Richard; Mungas, Dan; Slotkin, Jerry; Tulsky, David; Weintraub, Sandra; Zelazzo, Philip; Heaton, Robert K.

2014-01-01

The NIH Toolbox Cognitive Function Battery (CFB) includes 7 tests covering 8 cognitive abilities considered to be important in adaptive functioning across the lifespan (from early childhood to late adulthood). Here we present data on psychometric characteristics in children (N = 208; ages 3–15 years) of a total summary score and composite scores reflecting two major types of cognitive abilities: “crystallized” (more dependent upon past learning experiences) and “fluid” (capacity for new learning and information processing in novel situations). Both types of cognition are considered important in everyday functioning, but are thought to be differently affected by brain health status throughout life, from early childhood through older adulthood. All three Toolbox composite scores showed excellent test-retest reliability, robust developmental effects across the childhood age range considered here, and strong correlations with established, “gold standard” measures of similar abilities. Additional preliminary evidence of validity includes significant associations between all three Toolbox composite scores and maternal reports of children’s health status and school performance. PMID:23952206
Effects of white noise on Callsign Acquisition Test and Modified Rhyme Test scores.

Science.gov (United States)

Blue-Terry, Misty; Letowski, Tomasz

2011-02-01

The Callsign Acquisition Test (CAT) is a speech intelligibility test developed by the US Army Research Laboratory. The test has been used to evaluate speech transmission through various communication systems but has not been yet sufficiently standardised and validated. The aim of this study was to compare CAT and Modified Rhyme Test (MRT) performance in the presence of white noise across a range of signal-to-noise ratios (SNRs). A group of 16 normal-hearing listeners participated in the study. The speech items were presented at 65 dB(A) in the background of white noise at SNRs of -18, -15, -12, -9 and -6 dB. The results showed a strong positive association (75.14%) between the two tests, but significant differences between the CAT and MRT absolute scores in the range of investigated SNRs. Based on the data, a function to predict CAT scores based on existing MRT scores and vice versa was formulated. STATEMENT OF RELEVANCE: This work compares performance data of a common speech intelligibility test (MRT) with a new test (CAT) in the presence of white noise. The results here can be used as a part of the standardisation procedures and provide insights to the predictive capabilities of the CAT to quantify speech intelligibility communication in high-noise military environments.
Development, validation and psychometric properties of the Arabic version of the Orofacial Esthetic Scale: OES-Ar.

Science.gov (United States)

Alhajj, Mohammed Nasser; Amran, Abdullah Ghalib; Halboub, Esam; Al-Basmi, Abdulghani Ali; Al-Ghabri, Fawaz Abdullah

2017-07-01

This study aimed at developing the Arabic version of the Orofacial Esthetic Scale (OES-Ar) and to investigate its psychometric properties among Arabic-speaking population with and without esthetic impairments. Translation and cross-cultural adaptation was done according to the standard guidelines. Internal consistency was assessed on 230 participants. For test-retest reliability, 50 subjects with natural teeth were recalled within a period of 2 weeks. Validity of the OES-Ar was tested by construct, convergent, and discriminant validity tests. Responsiveness to esthetic changes was assessed in 60 patients. The results showed excellent internal consistency with Cronbach's alpha value of 0.92 and inter-item correlation average value of 0.60. The ICC values ranged from 0.87 to 0.96 which indicated excellent agreement. Construct validity of the OES-Ar was confirmed to be one-factor structure (one-dimensional). For convergent validity, a significant correlation was found between OES summary score and overall impression of the orofacial esthetic as well as between OES summary score and the summary score of the three questions of the OHIP-49Ar related to esthetic. The discriminant validity test revealed significant differences between different study groups (Pesthetics in Arabic-speaking patients. Copyright © 2016 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Disruptive Finance : Using Psychometrics to Overcome Collateral Constraints in Ethiopia

OpenAIRE

Alibhai, Salman; Buehren, Niklas; Coleman, Rachel; Goldstein, Markus; Strobbe, Francesco

2018-01-01

This case study tells the story of the evolution of psychometric credit scoring as an innovative solution in a World Bank operation, from its humble beginnings as a small pilot in Ethiopia, to the current movement to replicate its use for similar challenges in countries across the continent in Tanzania, Zimbabwe, Madagascar, and beyond. Fintech is commonly defined as an industry composed ...
Psychometric evaluation of the Danish version of Satisfaction with Daily Occupations (SDO)

DEFF Research Database (Denmark)

Eklund, Mona; Morville, Anne-Le

2014-01-01

AIMS: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. METHODS: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed....
Vitamin B12 deficiency: Characterization of psychometrics and MRI morphometrics.

Science.gov (United States)

Hsu, Yen-Hsuan; Huang, Ching-Feng; Lo, Chung-Ping; Wang, Tzu-Lan; Tu, Min-Chien

2016-01-01

Vitamin B12 is essential for the integrity of the central nervous system. However, performances in different cognitive domains relevant to vitamin B12 deficiency remain to be detailed. To date, there have been limited studies that examined the relationships between cognitions and structural neuroimaging in a single cohort of low-vitamin B12 status. The present study aimed to depict psychometrics and magnetic resonance imaging (MRI) morphometrics among patients with vitamin B12 deficiency, and to examine their inter-relations. We compared 34 consecutive patients with vitamin B12 deficiency (serum level ≤ 250 pg/ml) to 34 demographically matched controls by their cognitive performances and morphometric indices of brain MRI. The correlations between psychometrics and morphometrics were analyzed. The vitamin B12 deficiency group had lower scores than the controls on total scores of Mini-Mental Status Examination (MMSE) and Cognitive Abilities Screening Instrument (CASI) (both P psychometric and morphometric indices, pronounced correlations between bicaudate ratio and long-term memory, mental manipulation, orientation, language, and verbal fluency were noted (all P < 0.01). Vitamin B12 deficiency is associated with a global cognition decline with language, orientation, and mental manipulation selectively impaired. Preferential atrophy in frontal regions is the main neuroimaging feature. Although the frontal ratio highlights the relevant atrophy among patients, the bicaudate ratio might be the best index on the basis of its strong association with global cognition and related cognitive domains, implying dysfunction of fronto-subcortical circuits as the fundamental pathogenesis related to vitamin B12 deficiency.

Psychometric properties of a single-item scale to assess sleep quality among individuals with fibromyalgia

Directory of Open Access Journals (Sweden)

Sadosky Alesia B

2009-06-01

Background: Sleep disturbances are a common and bothersome symptom of fibromyalgia (FM). This study reports psychometric properties of a single-item scale to assess sleep quality among individuals with FM. Methods: Analyses were based on data from two randomized, double-blind, placebo-controlled trials of pregabalin (studies 1056 and 1077). In a daily diary, patients reported the quality of their sleep on a numeric rating scale ranging from 0 ("best possible sleep") to 10 ("worst possible sleep"). Test re-test reliability of the Sleep Quality Scale was evaluated by computing intraclass correlation coefficients. Pearson correlation coefficients were computed between baseline Sleep Quality scores and baseline pain diary and Medical Outcomes Study (MOS) Sleep scores. Responsiveness to treatment was evaluated by standardized effect sizes computed as the difference between least squares mean changes in Sleep Quality scores in the pregabalin and placebo groups divided by the standard deviation of Sleep Quality scores across all patients at baseline. Results: Studies 1056 and 1077 included 748 and 745 patients, respectively. Most patients were female (study 1056: 94.4%; study 1077: 94.5%) and white (study 1056: 90.2%; study 1077: 91.0%). Mean ages were 48.8 years (study 1056) and 50.1 years (study 1077). Test re-test reliability coefficients of the Sleep Quality Scale were 0.91 and 0.90 in the 1056 and 1077 studies, respectively. Pearson correlation coefficients between baseline Sleep Quality scores and baseline pain diary scores were 0.64 (p Conclusion: These results provide evidence of the reproducibility, convergent validity, and responsiveness to treatment of the Sleep Quality Scale and provide a foundation for its further use and evaluation in FM patients.
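The standardized effect size described in the Methods of the record above (difference in least squares mean change between treatment and placebo, divided by the baseline standard deviation across all patients) can be written out as a small sketch. The function name and the numbers are hypothetical, and the use of the sample standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
from statistics import stdev


def standardized_effect_size(ls_mean_change_treatment,
                             ls_mean_change_placebo,
                             baseline_scores):
    """(treatment change - placebo change) / SD of pooled baseline scores.

    `baseline_scores` holds the baseline scores of all patients, both arms.
    The sample SD (n - 1 denominator) is an assumed choice.
    """
    difference = ls_mean_change_treatment - ls_mean_change_placebo
    return difference / stdev(baseline_scores)


# Hypothetical values on a 0-10 scale where lower scores mean better sleep.
es = standardized_effect_size(-2.0, -1.0, [4, 6, 8])
print(es)
```

On a scale where lower is better, a negative value indicates a larger improvement in the treatment group than in the placebo group.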
Psychometric testing of the Chinese version of the medical outcomes study social support survey (MOS-SSS-C).

Science.gov (United States)

Yu, Doris S F; Lee, Diana T F; Woo, Jean

2004-04-01

The purpose of this study was to assess the psychometric properties of the Chinese version of the Medical Outcomes Study Social Support Survey (MOS-SSS-C) in a sample of 110 patients. Criterion-related and construct validities of the MOS-SSS-C were evaluated by correlations with the Chinese version of the Multidimensional Perceived Social Support Survey (r = .82) and the Hospital Anxiety and Depression Scale (r = -.58). Confirmatory factor analysis affirmed the four-factor structure of the MOS-SSS-C in measuring the functional aspects of perceived social support. Cronbach's alphas for the subscales ranged from .93 to .96, whereas the alpha for the overall scale was .98. The 2-week test-retest reliability of the MOS-SSS-C as measured by the intraclass correlation coefficient was .84. The MOS-SSS-C is a psychometrically sound multidimensional measure for the evaluation of functional aspects of perceived social support by Chinese patients with chronic disease. Copyright 2004 Wiley Periodicals, Inc.
The MCCB impairment profile for schizophrenia outpatients: results from the MATRICS psychometric and standardization study.

Science.gov (United States)

Kern, Robert S; Gold, James M; Dickinson, Dwight; Green, Michael F; Nuechterlein, Keith H; Baade, Lyle E; Keefe, Richard S E; Mesholam-Gately, Raquelle I; Seidman, Larry J; Lee, Cathy; Sugar, Catherine A; Marder, Stephen R

2011-03-01

The MATRICS Psychometric and Standardization Study was conducted as a final stage in the development of the MATRICS Consensus Cognitive Battery (MCCB). The study included 176 persons with schizophrenia or schizoaffective disorder and 300 community residents. Data were analyzed to examine the cognitive profile of clinically stable schizophrenia patients on the MCCB. Secondarily, the data were analyzed to identify which combination of cognitive domains and corresponding cut-off scores best discriminated patients from community residents, and patients competitively employed vs. those not. Raw scores on the ten MCCB tests were entered into the MCCB scoring program which provided age- and gender-corrected T-scores on seven cognitive domains. To test for between-group differences, we conducted a 2 (group)×7 (cognitive domain) MANOVA with follow-up independent t-tests on the individual domains. Classification and regression trees (CART) were used for the discrimination analyses. Examination of patient T-scores across the seven cognitive domains revealed a relatively compact profile with T-scores ranging from 33.4 for speed of processing to 39.3 for reasoning and problem-solving. Speed of processing and social cognition best distinguished individuals with schizophrenia from community residents; speed of processing along with visual learning and attention/vigilance optimally distinguished patients competitively employed from those who were not. The cognitive profile findings provide a standard to which future studies can compare results from other schizophrenia samples and related disorders; the classification results point to specific areas and levels of cognitive impairment that may advance work rehabilitation efforts. Published by Elsevier B.V.
ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

Science.gov (United States)

Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C.

2016-01-01

Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…
The End-Stage Renal Disease Adherence Questionnaire (ESRD-AQ): testing the psychometric properties in patients receiving in-center hemodialysis.

OpenAIRE

Kim, Y; Evangelista, LS; Phillips, LR; Pavlish, C; Kopple, JD

2010-01-01

Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and v...
The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.

Science.gov (United States)

Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary

2017-10-01

Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. Our results suggest a statistically significant difference in ImPACT composite scores between
Quantitative Psychology : the 82nd Annual Meeting of the Psychometric Society

CERN Document Server

Culpepper, Steven; Janssen, Rianne; González, Jorge; Molenaar, Dylan

2018-01-01

This proceedings book highlights the latest research and developments in psychometrics and statistics. Featuring contributions presented at the 82nd Annual Meeting of the Psychometric Society (IMPS), organized by the University of Zurich and held in Zurich, Switzerland from July 17 to 21, 2017, its 34 chapters address a diverse range of psychometric topics including item response theory, factor analysis, causal inference, Bayesian statistics, test equating, cognitive diagnostic models and multistage adaptive testing. The IMPS is one of the largest international meetings on quantitative measurement in psychology, education and the social sciences, attracting over 500 participants and 250 paper presentations from around the world every year. This book gathers the contributions of selected presenters, which were subsequently expanded and peer-reviewed.
Psychometric properties of Spanish-language adult dental fear measures

Directory of Open Access Journals (Sweden)

Heaton Lisa J

2008-05-01

Abstract. Background: It would be useful to have psychometrically-sound measures of dental fear for Hispanics, who comprise the largest ethnic minority in the United States. We report on the psychometric properties of Spanish-language versions of two common adult measures of dental fear (Modified Dental Anxiety Scale, MDAS; Dental Fear Survey, DFS), as well as a measure of fear of dental injections (Needle Survey, NS). Methods: Spanish versions of the measures were administered to 213 adults attending Hispanic cultural festivals, 31 students (who took the questionnaire twice, for test-retest reliability), and 100 patients at a dental clinic. We also administered the questionnaire to 136 English-speaking adults at the Hispanic festivals and 58 English-speaking students at the same college where we recruited the Spanish-speaking students, to compare the performance of the English and Spanish measures in the same populations. Results: The internal reliabilities of the Spanish MDAS ranged from 0.80 to 0.85. Values for the DFS ranged from 0.92 to 0.96, and values for the NS ranged from 0.92 to 0.94. The test-retest reliabilities (intra-class correlations) for the three measures were 0.69, 0.86, and 0.94 for the MDAS, DFS, and NS, respectively. The three measures showed moderate correlations with one another in all three samples, providing evidence for construct validity. Patients with higher scores on the measures were rated as being more anxious during dental procedures. Similar internal reliabilities and correlations were found in the English-version analyses. The test-retest values were also similar in the English students for the DFS and NS; however, the English test-retest value for the MDAS was better than that found in the Spanish students. Conclusion: We found evidence for the internal reliability, construct validity, and criterion validity for the Spanish versions of the three measures, and evidence for the test-retest reliability of the Spanish
Psychometric properties of the Hare Psychopathy Checklist-Revised (PCL-R) in a representative sample of Canadian federal offenders.

Science.gov (United States)

Storey, Jennifer E; Hart, Stephen D; Cooke, David J; Michie, Christine

2016-04-01

The Hare Psychopathy Checklist-Revised (PCL-R; Hare, 2003) is a commonly used psychological test for assessing traits of psychopathic personality disorder. Despite the abundance of research using the PCL-R, the vast majority of research used samples of convenience rather than systematic methods to minimize sampling bias and maximize the generalizability of findings. This potentially complicates the interpretation of test scores and research findings, including the "norms" for offenders from the United States and Canada included in the PCL-R manual. In the current study, we evaluated the psychometric properties of PCL-R scores for all male offenders admitted to a regional reception center of the Correctional Service of Canada during a 1-year period (n = 375). Because offenders were admitted for assessment prior to institutional classification, they comprise a sample that was heterogeneous with respect to correctional risks and needs yet representative of all offenders in that region of the service. We examined the distribution of PCL-R scores, classical test theory indices of its structural reliability, the factor structure of test items, and the external correlates of test scores. The findings were highly consistent with those typically reported in previous studies. We interpret these results as indicating it is unlikely any sampling limitations of past research using the PCL-R resulted in findings that were, overall, strongly biased or unrepresentative. (c) 2016 APA, all rights reserved.
Do Students Behave Rationally in Multiple Choice Tests? Evidence from a Field Experiment

OpenAIRE

María Paz Espinosa; Javier Gardeazabal

2013-01-01

A disadvantage of multiple choice tests is that students have incentives to guess. To discourage guessing, it is common to use scoring rules that either penalize wrong answers or reward omissions. In psychometrics, penalty and reward scoring rules are considered equivalent. However, experimental evidence indicates that students behave differently under penalty or reward scoring rules. These differences have been attributed to the different framing (penalty versus reward). In this paper, we mo...
Does the COPD assessment test (CAT(TM)) questionnaire produce similar results when self- or interviewer administered?

Science.gov (United States)

Agusti, A; Soler-Cataluña, J J; Molina, J; Morejon, E; Garcia-Losa, M; Roset, M; Badia, X

2015-10-01

The COPD assessment test (CAT) is a questionnaire that assesses the impact of chronic obstructive pulmonary disease (COPD) on health status, but some patients have difficulties filling it out by themselves. We examined whether the mode of administration of the Spanish version of CAT (self vs. interviewer) influences its scores and/or psychometric properties. Observational, prospective study in 49 Spanish centers that includes clinically stable COPD patients (n = 153) and patients hospitalized because of an exacerbation (ECOPD; n = 224). The CAT was self-administered (CAT-SA) or administered by an interviewer (CAT-IA) based on the investigator's judgment of the patient's capacity. To assess convergent validity, the Saint George's Respiratory Disease Questionnaire (SGRQ) and the London Chest Activity of Daily Living (LCADL) instrument were also administered. Psychometric properties were compared across modes of administration. A total of 118 patients (31%) completed the CAT-SA and 259 (69%) the CAT-IA. Multiple regression analysis showed that mode of administration did not affect CAT scores. The CAT showed excellent psychometric properties in both modes of administration. Internal consistency coefficients (Cronbach's alpha) were high (0.86 for CAT-SA and 0.85 for CAT-IA) as was test-retest reliability (intraclass correlation coefficients of 0.83 for CAT-SA and CAT-IA). Correlations with SGRQ and LCADL were moderate to strong both in CAT-SA and CAT-IA, indicating good convergent validity. Similar results were observed when testing longitudinal validity. The mode of administration does not influence CAT scores or its psychometric properties. Hence, both modes of administration can be used in clinical practice depending on the physician's judgment of the patient's capacity.
A Comparison of EQ-5D-3L Index Scores Using Malaysian, Singaporean, Thai, and UK Value Sets in Indonesian Cervical Cancer Patients.

Science.gov (United States)

Endarti, Dwi; Riewpaiboon, Arthorn; Thavorncharoensap, Montarat; Praditsitthikorn, Naiyana; Hutubessy, Raymond; Kristina, Susi Ari

2018-05-01

To gain insight into the most suitable foreign value set among Malaysian, Singaporean, Thai, and UK value sets for calculating the EuroQol five-dimensional questionnaire index score (utility) among patients with cervical cancer in Indonesia. Data from 87 patients with cervical cancer recruited from a referral hospital in Yogyakarta province, Indonesia, from an earlier study of health-related quality of life were used in this study. The differences among the utility scores derived from the four value sets were determined using the Friedman test. Performance of the psychometric properties of the four value sets versus visual analogue scale (VAS) was assessed. Intraclass correlation coefficients and Bland-Altman plots were used to test the agreement among the utility scores. Spearman ρ correlation coefficients were used to assess convergent validity between utility scores and patients' sociodemographic and clinical characteristics. With respect to known-group validity, the Kruskal-Wallis test was used to examine the differences in utility according to the stages of cancer. There was significant difference among utility scores derived from the four value sets, among which the Malaysian value set yielded higher utility than the other three value sets. Utility obtained from the Malaysian value set had more agreements with VAS than the other value sets versus VAS (intraclass correlation coefficients and Bland-Altman plot tests results). As for the validity, the four value sets showed equivalent psychometric properties as those that resulted from convergent and known-group validity tests. In the absence of an Indonesian value set, the Malaysian value set was more preferable to be used compared with the other value sets. Further studies on the development of an Indonesian value set need to be conducted. Copyright © 2018. Published by Elsevier Inc.
The Effect of Mock Tests on Iranian EFL learners’ Test Scores

OpenAIRE

Hossein Khodabakhshzadeh; Reza Zardkanloo

2016-01-01

The effect of using tests in test preparation courses has been subject to debate. While some scholars such as Yang and Badger (2015) believe it is a cause of positive washback effect, others argue that this issue is tentative and context-bound (Green, 2007). Therefore, this study investigated the effect of using Mock tests in International English Language Testing System (IELTS) preparation courses on students’ overall IELTS scores. Fifty one IELTS students were selected non-randomly through ...
Biering-Sorensen test scores in coal miners

Energy Technology Data Exchange (ETDEWEB)

Tekin, Y.; Ortancil, O.; Ankarali, H.; Basaran, A.; Sarikaya, S.; Ozdolap, S. [Zonguldak Karaelmas University, Zonguldak (Turkey)

2009-05-15

The Biering-Sorensen test is an isometric back endurance test. Biering-Sorensen test scores have varied in different cultural and occupational groups. The aims of this study were to collect normative data on Biering-Sorensen holding times, to determine the discriminative ability of the Biering-Sorensen test in Turkish coal miners, and to examine the association between Biering-Sorensen test result and functional disability. One hundred and fifty male coal miners participated in this study. Trunk extensor muscle strength was measured using the Biering-Sorensen test. Oswestry disability index was used to measure the functional disability level of low back pain. The mean Biering-Sorensen holding time for the total subject group was 107.3 ± 22.5 s. The mean Biering-Sorensen holding times of the subjects with and without low back pain were 99.9 ± 19.8 and 128.6 ± 15.2 s, respectively. The difference between the subjects with and without low back pain was statistically significant (p < 0.001). There was a statistically significant negative correlation between Oswestry functional disability score and Biering-Sorensen holding time (R = -0.824, p < 0.001). Turkish coal miners have low mean back extensor endurance holding times. Biering-Sorensen test had a good discriminative ability in our study group. Trunk muscle strength has a significant effect on the disability level of low back pain. Thus trunk muscle endurance training exercise therapy may be effective for the reduction of disability in patients with low back pain.
Psychometric evaluation of the Danish version of Satisfaction with Daily Occupations (SDO)

DEFF Research Database (Denmark)

Eklund, Mona; Morville, Anne-Le

2013-01-01

Aims: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. Methods: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed. Key words: validity, reliability, health, Activity...
The development and psychometric testing of a theory-based instrument to evaluate nurses' perception of clinical reasoning competence.

Science.gov (United States)

Liou, Shwu-Ru; Liu, Hsiu-Chen; Tsai, Hsiu-Min; Tsai, Ying-Huang; Lin, Yu-Ching; Chang, Chia-Hao; Cheng, Ching-Yu

2016-03-01

The purpose of the study was to develop and psychometrically test the Nurses Clinical Reasoning Scale. Clinical reasoning is an essential skill for providing safe and quality patient care. Identifying pre-graduates' and nurses' needs and designing training courses to improve their clinical reasoning competence becomes a critical task. However, there is no instrument focusing on clinical reasoning in the nursing profession. Cross-sectional design was used. This study included the development of the scale, a pilot study that preliminarily tested the readability and reliability of the developed scale and a main study that implemented and tested the psychometric properties of the developed scale. The Nurses Clinical Reasoning Scale was developed based on the Clinical Reasoning Model. The scale includes 15 items using a Likert five-point scale. Data were collected from 2013-2014. Two hundred and fifty-one participants comprising clinical nurses and nursing pre-graduates completed and returned the questionnaires in the main study. The instrument was tested for internal consistency and test-retest reliability. Its validity was tested with content, construct and known-groups validity. One factor emerged from the factor analysis. The known-groups validity was confirmed. The Cronbach's alpha for the entire instrument was 0·9. The reliability and validity of the Nurses Clinical Reasoning Scale were supported. The scale is a useful tool and can be easily administered for the self-assessment of clinical reasoning competence of clinical nurses and future baccalaureate nursing graduates. Study limitations and further recommendations are discussed. © 2015 John Wiley & Sons Ltd.
A Psychometric Review of the Personality Inventory for DSM-5 (PID-5): Current Status and Future Directions.

Science.gov (United States)

Al-Dajani, Nadia; Gralnick, Tara M; Bagby, R Michael

2016-01-01

The paradigm of personality psychopathology is shifting from one that is purely categorical in nature to one grounded in dimensional individual differences. Section III (Emerging Measures and Models) of the Diagnostic and Statistical Manual of Mental Disorders (5th ed. [DSM-5]; American Psychiatric Association, 2013), for example, includes a hybrid categorical/dimensional model of personality disorder classification. To inform the hybrid model, the DSM-5 Personality and Personality Disorders Work Group developed a self-report instrument to assess pathological personality traits, the Personality Inventory for the DSM-5 (PID-5). Since its recent introduction, 30 papers (39 samples) have been published examining various aspects of its psychometric properties. In this article, we review the psychometric characteristics of the PID-5 using the Standards for Educational and Psychological Testing as our framework. The PID-5 demonstrates adequate psychometric properties, including a replicable factor structure, convergence with existing personality instruments, and expected associations with broadly conceptualized clinical constructs. More research is needed with specific consideration to clinical utility, additional forms of reliability and validity, relations with psychopathological personality traits using clinical samples, alternative methods of criterion validation, effective employment of cut scores, and the inclusion of validity scales to propel this movement forward.
Adapting the helpful responses questionnaire to assess communication skills involved in delivering contingency management: preliminary psychometrics.

Science.gov (United States)

Hartzler, Bryan

2015-08-01

A paper/pencil instrument, adapted from Miller and colleagues' (1991) Helpful Responses Questionnaire (HRQ), was developed to assess clinician skill with core communicative aspects involved in delivering contingency management (CM). The instrument presents a single vignette consisting of six points of client dialogue to which respondents write 'what they would say next.' In the context of an implementation/effectiveness hybrid trial, 19 staff clinicians at an opiate treatment program completed serial training outcome assessments before, following, and three months after CM training. Assessments included this adaptation of the HRQ, a multiple-choice CM knowledge test, and a recorded standardized patient encounter scored for CM skillfulness. Study results reveal promising psychometric properties for the instrument, including strong scoring reliability, internal consistency, concurrent and predictive validity, test-retest reliability and sensitivity to training effects. These preliminary findings suggest the instrument is a viable, practical method to assess clinician skill in communicative aspects of CM delivery. Copyright © 2015 Elsevier Inc. All rights reserved.
Psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale: A Rasch rating scale analysis and confirmatory factor analysis.

Science.gov (United States)

Pilatti, Angelina; Lozano, Oscar M; Cyders, Melissa A

2015-12-01

The present study was aimed at determining the psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale in a sample of college students. Participants were 318 college students (36.2% men; mean age = 20.9 years, SD = 6.4 years). The psychometric properties of this Spanish version were analyzed using the Rasch model, and the factor structure was examined using confirmatory factor analysis. The verification of the global fit of the data showed adequate indexes for persons and items. The reliability estimates were high for both items and persons. Differential item functioning across gender was found for 23 items, which likely reflects known differences in impulsivity levels between men and women. The factor structure of the Spanish version of the UPPS-P replicates previous work with the original UPPS-P Scale. Overall, results suggest that test scores from the Spanish version of the UPPS-P show adequate psychometric properties to accurately assess the multidimensional model of impulsivity, which represents the most exhaustive measure of this construct. (c) 2015 APA, all rights reserved.
Examination of Psychometric Properties of a Translated Social-Emotional Screening Test: The Taiwanese Version of The Ages and Stages Questionnaires: Social-Emotional

Science.gov (United States)

Chen, Chieh-Yu

2017-01-01

Investigating the psychometric properties of a screening instrument for young children is necessary to ascertain its quality and accuracy. In light of the important role culture plays on human beliefs and parenting styles, a newly translated and adapted test needs to be studied. Evaluating outcomes on a translated version of a test may reveal…

The development and psychometric analysis of the Chinese HIV-Related Fatigue Scale.

Science.gov (United States)

Li, Su-Yin; Wu, Hua-Shan; Barroso, Julie

2016-04-01

To develop a Chinese version of the human immunodeficiency virus-related Fatigue Scale and examine its reliability and validity. Fatigue is found in more than 70% of people infected with human immunodeficiency virus. However, a scale to assess fatigue in human immunodeficiency virus-positive people has not yet been developed for use in Chinese-speaking countries. A methodologic study involving instrument development and psychometric evaluation was used. The human immunodeficiency virus-related Fatigue Scale was examined through a two-step procedure: (1) translation and back translation and (2) psychometric analysis. A sample of 142 human immunodeficiency virus-positive patients was recruited from the Infectious Disease Outpatient Clinic in central Taiwan. Their fatigue data were analysed with Cronbach's α for internal consistency. Two weeks later, the data of a random sample of 28 patients from the original 142 were analysed for test-retest reliability. The correlation between the World Health Organization Quality of Life Assessment-Human Immunodeficiency Virus and the Chinese version of the human immunodeficiency virus-related Fatigue Scale was analysed for concurrent validity. The Chinese version of the human immunodeficiency virus-related Fatigue Scale scores of human immunodeficiency virus-positive patients with highly active antiretroviral therapy and those without were compared to demonstrate construct validity. The internal consistency and test-retest reliability of the Chinese version of the human immunodeficiency virus-related Fatigue Scale were 0·97 and 0·686, respectively. In regard to concurrent validity, a negative correlation was found between the scores of the Chinese version of the human immunodeficiency virus-related Fatigue Scale and the World Health Organization Quality of Life Assessment-Human Immunodeficiency Virus. Additionally, the Chinese version of the human immunodeficiency virus-related Fatigue Scale could be used to effectively
Increased correlation coefficient between the written test score and tutors’ performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia

Directory of Open Access Journals (Sweden)

Heethal Jaiprakash

2016-03-01

This paper is aimed at finding if there was a change of correlation between the written test score and tutors’ performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group’s tutors did not receive tutor training, while the second group’s tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors’ performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors’ scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
The Spanish-Version of the Subjective Vitality Scale: Psychometric Properties and Evidence of Validity.

Science.gov (United States)

Castillo, Isabel; Tomás, Inés; Balaguer, Isabel

2017-06-05

The Subjective Vitality Scale (SVS) assesses the subjective experience of being full of energy and alive, a clinically relevant outcome measure of positive psychological well-being. The purpose of this paper was to translate the 7-item SVS into Spanish and examine its psychometric properties. In Study 1 (n = 790 adolescents) and Study 2 (n = 130 athletes) reliability and exploratory factor analysis (EFA) were carried out. In Study 1 and Study 3 (n = 197 dancers) evidence of validity of inferences based on SVS scores estimating relationships with other variables (life satisfaction, global self-esteem and emotional and physical exhaustion) was obtained. In Study 2 invariance across time was tested. Finally, in Study 3, the factorial structure was cross-validated using confirmatory factor analysis (CFA). Results of EFA showed a one-factor solution. CFA also supported a unidimensional factor structure for the Spanish 6-item SVS (RMSEA = .050 (90% CI = .00, .080); NNFI = .993; CFI = .996). Reliability analysis indicated a strong internal consistency in all study samples (α ranged from .82 to .89). Further, results from multi-sample analysis supported the replicability of SVS factor structure across time. Finally, the SVS scores showed the expected correlation patterns (all of them significant, p < .01) with the measured outcomes. In conclusion, the Spanish version of the SVS demonstrated adequate psychometric properties, indicating that the scale can be confidently used to measure the experience of possessing energy and aliveness; furthermore, comparisons across time can be meaningfully carried out.
Psychometrics of Multiple Choice Questions with Non-Functioning Distracters: Implications to Medical Education.

Science.gov (United States)

Deepak, Kishore K; Al-Umran, Khalid Umran; AI-Sheikh, Mona H; Dkoli, B V; Al-Rubaish, Abdullah

2015-01-01

The functionality of distracters in a multiple choice question plays a very important role. We examined the frequency and impact of functioning and non-functioning distracters on psychometric properties of 5-option items in clinical disciplines. We analyzed item statistics of 1115 multiple choice questions from 15 summative assessments of undergraduate medical students and classified the items into five groups by their number of non-functioning distracters. We analyzed the effect of varying degree of non-functionality ranging from 0 to 4, on test reliability, difficulty index, discrimination index and point biserial correlation. The non-functionality of distracters inversely affected the test reliability and quality of items in a predictable manner. The non-functioning distracters made the items easier and lowered the discrimination index significantly. Three non-functional distracters in a 5-option MCQ significantly affected all psychometric properties (p psychometrically as effective as 5-option items. Our study reveals that a multiple choice question with 3 functional options provides lower most limit of item format that has adequate psychometric property. The test containing items with less number of functioning options have significantly lower reliability. The distracter function analysis and revision of nonfunctioning distracters can serve as important methods to improve the psychometrics and reliability of assessment.
Psychometric Evaluation of an Instrument for Measuring Organizational Climate for Quality: Evidence From a National Sample of Infection Preventionists.

Science.gov (United States)

Pogorzelska-Maziarz, Monika; Nembhard, Ingrid M; Schnall, Rebecca; Nelson, Shanelle; Stone, Patricia W

2016-09-01

In recent years, there has been increased interest in measuring the climate for infection prevention; however, reliable and valid instruments are lacking. This study tested the psychometric properties of the Leading a Culture of Quality for Infection Prevention (LCQ-IP) instrument measuring the infection prevention climate in a sample of 972 infection preventionists from acute care hospitals. An exploratory principal component analysis showed that the instrument had structural validity and captured 4 factors related to the climate for infection prevention: Psychological Safety, Prioritization of Quality, Supportive Work Environment, and Improvement Orientation. LCQ-IP exhibited excellent internal consistency, with a Cronbach α of .926. Criterion validity was supported with overall LCQ-IP scores, increasing with the number of evidence-based prevention policies in place (P = .047). This psychometrically sound instrument may be helpful to researchers and providers in assessing climate for quality related to infection prevention. © The Author(s) 2015.
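Several records in this listing report internal consistency as Cronbach's α (for example, .926 for the LCQ-IP above). As a minimal sketch of how that coefficient is usually computed from a respondents-by-items score table, the following uses the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). The function name `cronbach_alpha` and the toy data are illustrative assumptions and are not taken from any of the cited studies.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table of scores.

    rows: list of respondents, each a list of item scores (same length).
    Uses sample variances (n - 1 denominator) throughout.
    """
    n_items = len(rows[0])
    if n_items < 2:
        raise ValueError("alpha needs at least two items")
    # Transpose so that each tuple holds one item's scores across respondents.
    items = list(zip(*rows))
    item_variance_sum = sum(variance(col) for col in items)
    # Variance of each respondent's total (summed) score.
    total_variance = variance([sum(r) for r in rows])
    return n_items / (n_items - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical toy data: 3 respondents, 2 items.
toy_scores = [[1, 2], [2, 4], [3, 3]]
alpha = cronbach_alpha(toy_scores)
```

The sketch assumes complete data and a non-constant total score; real analyses typically also handle missing responses and report confidence intervals.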
Psychometric validation of the functional assessment of cancer therapy--brain (FACT-Br) for assessing quality of life in patients with brain metastases.

Science.gov (United States)

Thavarajah, Nemica; Bedard, Gillian; Zhang, Liying; Cella, David; Beaumont, Jennifer L; Tsao, May; Barnes, Elizabeth; Danjoux, Cyril; Sahgal, Arjun; Soliman, Hany; Chow, Edward

2014-04-01

This study aimed to test the reliability, psychometric, and clinical validity of the use of the Functional Assessment of Cancer Therapy--Brain (FACT-Br) in patients with brain metastases. Patients with brain metastases were interviewed using the FACT-Br (including the FACT-general) 1 week prior to treatment. All patients completed a follow-up assessment 1 month post-treatment. Patients with a good performance status and receiving stereotactic radiosurgery completed an additional 1 week follow-up assessment after the initial baseline interview to assess test-retest reliability. Forty patients had complete 1 month follow-up data. Ten of these patients also completed the 1 week follow-up assessment from baseline. The median Karnofsky performance status of patients was 80 and the median age was 64 years. All subscales of the FACT-Br were found to be conceptually related (except for two correlations) using the following subscales: physical well-being (PWB), social/family well-being (SWB), emotional well-being (EWB), functional well-being (FWB), FACT-G total score, brain cancer subscale (BrC), and the FACT-Br total score. All FACT-Br scores demonstrated excellent reliability, except for the SWB scale which revealed good reliability. The FACT-Br scores showed no significant change in the quality of life (QoL) of patients from baseline to 1 month follow-up. The use of the combined FACT-G and FACT-Br Subscale to assess QoL specifically in patients with brain metastases has successfully undergone psychometric validation. Future clinical trials should use the FACT-G and FACT-Br Subscale to assess QoL in this patient population.
Measurement characteristics of the childhood Asthma-Control Test and a shortened, child-only version.

Science.gov (United States)

Bime, Christian; Gerald, Joe K; Wei, Christine Y; Holbrook, Janet T; Teague, William G; Wise, Robert A; Gerald, Lynn B

2016-10-20

The childhood Asthma-Control Test (C-ACT) is validated for assessing asthma control in paediatric asthma. Among children aged 4-11 years, the C-ACT requires the simultaneous presence of both parent and child. There is an unmet need for a tool that can be used to assess asthma control in children when parents or caregivers are not present such as in the school setting. We assessed the psychometric properties and estimated the minimally important difference (MID) of the C-ACT and a modified version, comprising only the child responses (C-ACTc). Asthma patients aged 6-11 years (n=161) from a previously completed multicenter randomised trial were included. Demographic information, spirometry and questionnaire scores were obtained at baseline and during follow-up. Participants or their guardians kept a daily asthma diary. Internal consistency reliabilities of the C-ACT and C-ACTc were 0.76 and 0.67 (Cronbach's α), respectively. Test-retest reliabilities of the C-ACT and C-ACTc were 0.72 and 0.66 (intra-class correlation), respectively. Significant correlations were noted between C-ACT scores and ACQ scores (Spearman's correlation r=-0.56, 95% CI (-0.66, -0.44), Pasthma patients aged 6-11 years, the C-ACT had good psychometric properties. The psychometric properties of a shortened child-only version (C-ACTc), although acceptable, are not as strong.
Analysing model fit of psychometric process models: An overview, a new test and an application to the diffusion model.

Science.gov (United States)

Ranger, Jochen; Kuhn, Jörg-Tobias; Szardenings, Carsten

2017-05-01

Cognitive psychometric models embed cognitive process models into a latent trait framework in order to allow for individual differences. Due to their close relationship to the response process, the models allow for profound conclusions about the test takers. However, before such a model can be used its fit has to be checked carefully. In this manuscript we give an overview of existing tests of model fit and show their relation to the generalized moment test of Newey (Econometrica, 53, 1985, 1047) and Tauchen (J. Econometrics, 30, 1985, 415). We also present a new test, the Hausman test of misspecification (Hausman, Econometrica, 46, 1978, 1251). The Hausman test consists of a comparison of two estimates of the same item parameters which should be similar if the model holds. The performance of the Hausman test is evaluated in a simulation study. In this study we illustrate its application to two popular models in cognitive psychometrics, the Q-diffusion model and the D-diffusion model (van der Maas, Molenaar, Maris, Kievit, & Borsboom, Psychol Rev., 118, 2011, 339; Molenaar, Tuerlinckx, & van der Maas, J. Stat. Softw., 66, 2015, 1). We also compare the performance of the test to four alternative tests of model fit, namely the M2 test (Molenaar et al., J. Stat. Softw., 66, 2015, 1), the moment test (Ranger et al., Br. J. Math. Stat. Psychol., 2016) and the test for binned time (Ranger & Kuhn, Psychol. Test. Assess., 56, 2014b, 370). The simulation study indicates that the Hausman test is superior to the latter tests. The test closely adheres to the nominal Type I error rate and has higher power in most simulation conditions. © 2017 The British Psychological Society.
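The comparison of two estimates described in this record can be illustrated with the classical form of the Hausman statistic, H = d' (V1 - V2)^(-1) d, where d is the difference between a robust and an efficient estimate and V1, V2 are their covariance matrices. The sketch below is an assumption-laden illustration of that textbook form, not the authors' implementation for diffusion models; all names and numbers are hypothetical.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def hausman_statistic(b_robust, b_efficient, v_robust, v_efficient):
    """Classical Hausman statistic d' (V_robust - V_efficient)^-1 d."""
    d = [r - e for r, e in zip(b_robust, b_efficient)]
    v_diff = [
        [vr - ve for vr, ve in zip(row_r, row_e)]
        for row_r, row_e in zip(v_robust, v_efficient)
    ]
    x = solve(v_diff, d)
    return sum(di * xi for di, xi in zip(d, x))


# Hypothetical estimates of two item parameters (illustrative values only).
h = hausman_statistic(
    [1.0, 2.0], [0.8, 2.1],
    [[0.05, 0.0], [0.0, 0.04]],
    [[0.03, 0.0], [0.0, 0.02]],
)
```

Under the null hypothesis that the model holds, H is referred to a chi-square distribution with as many degrees of freedom as there are compared parameters; with two parameters the 5% critical value is about 5.99.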
Psychometric Properties of the Persian Version of the Love of Life Scale.

Science.gov (United States)

Vahid, Mohammad Kazem Atef; Dadfar, Mahboubeh; Abdel-Khalek, Ahmed M; Lester, David

2016-10-01

A love of life is defined as an overall positive attitude toward life and a liking for life. The present study was designed to evaluate the psychometric characteristics of a Persian version of the Love of Life Scale using a convenience sample of 145 Iranian female volunteer college students (M age = 23.0 years, SD = 3.4). The mean score on the Love of Life Scale was 61.08 (SD = 11.40). A principal component analysis with a Varimax rotation yielded two factors labeled (a) Positive Attitude Towards Life and Happy Consequences of Love of Life and (b) Meaningfulness of Life. Cronbach's alpha was .94 and the one-week test-retest reliability was .85. Love of Life Scale scores had significant positive correlations with scores on the Oxford Happiness Questionnaire, the Satisfaction with Life Scale, the General Self-Efficacy Scale, and the Adult Hope Scale. The scale displayed negative correlations with the Kessler Psychological Distress Scale and the Wish to be Dead Scale. It was concluded that the Persian form of the Love of Life Scale can be recommended for future research on positive psychology. © The Author(s) 2016.
Psychometric properties of patient-reported outcome measures for hip arthroscopic surgery

DEFF Research Database (Denmark)

Kemp, Joanne L; Collins, Natalie J; Roos, Ewa M.

2013-01-01

Patient-reported outcomes (PROs) are considered the gold standard when evaluating outcomes in a surgical population. While the psychometric properties of some PROs have been tested, the properties of newer PROs in patients undergoing hip arthroscopic surgery remain somewhat unknown...
Psychometric testing of the modified Care Dependency Scale (Neuro-CDS).

Science.gov (United States)

Piredda, Michela; Biagioli, Valentina; Gambale, Giulia; Porcelli, Elisa; Barbaranelli, Claudio; Palese, Alvisa; De Marinis, Maria Grazia

2016-01-01

Effective measures of nursing care dependency in neurorehabilitation are warranted to plan nursing interventions to help patients avoid increasing dependency. The Care Dependency Scale (CDS) is a theory-based, comprehensive tool to evaluate functional disability. This study aimed to modify the CDS for neurological and neurorehabilitation patients (Neuro-CDS) and to test its psychometric properties in adult neurorehabilitation inpatients. Exploratory factor analysis (EFA) was performed using a Maximum Likelihood robust (MLR) estimator. The Barthel Index (BI) was used to evaluate concurrent validity. Stability was measured using the Intra-class Correlation Coefficient (ICC). The sample included 124 patients (mean age = 69.7 years, 54% male). The EFA revealed a two-factor structure with good fit indexes, Factor 1 (Physical care dependence) loaded by 11 items and Factor 2 (Psycho-social care dependence) loaded by 4 items. The correlation between factors was 0.61. Correlations between Factor 1 and the BI and between Factor 2 and the BI were r = 0.843 and r = 0.677, respectively (p dependence in neurorehabilitation patients as a basis for individualized and holistic care.
Psychometrics in action, science as practice.

Science.gov (United States)

Pearce, Jacob

2017-07-27

Practitioners in health sciences education and assessment regularly use a range of psychometric techniques to analyse data, evaluate models, and make crucial progression decisions regarding student learning. However, a recent editorial entitled "Is Psychometrics Science?" highlighted some core epistemological and practical problems in psychometrics, and brought its legitimacy into question. This paper attempts to address these issues by applying some key ideas from history and philosophy of science (HPS) discourse. I present some of the conceptual developments in HPS that have bearing on the psychometrics debate. Next, by shifting the focus onto what constitutes the practice of science, I discuss psychometrics in action. Some incorrectly conceptualize science as an assemblage of truths, rather than an assemblage of tools and goals. Psychometrics, however, seems to be an assemblage of methods and techniques. Psychometrics in action represents a range of practices using specific tools in specific contexts. This does not render the practice of psychometrics meaningless or futile. Engaging in debates about whether or not we should regard psychometrics as 'scientific' is, however, a fruitless enterprise. The key question and focus should be whether, on what grounds, and in what contexts, the existing methods and techniques used by psychometricians can be justified or criticized.
Gender, Stereotype Threat and Mathematics Test Scores

OpenAIRE

Ming Tsui; Xiao Y. Xu; Edmond Venator

2011-01-01

Problem statement: Stereotype threat has repeatedly been shown to depress women's scores on difficult math tests. An attempt to replicate these findings in China found no support for the stereotype threat hypothesis. Our math test was characterized as being personally important for the student participants, an atypical condition in most stereotype threat laboratory research. Approach: To evaluate the effects of this personal demand, we conducted three experiments. Results: ...
Psychometric properties of the Adolescent Sleep Hygiene Scale.

Science.gov (United States)

Storfer-Isser, Amy; Lebourgeois, Monique K; Harsh, John; Tompsett, Carolyn J; Redline, Susan

2013-12-01

This study evaluated the psychometric properties of the Adolescent Sleep Hygiene Scale (ASHS), a self-report measure assessing sleep practices theoretically important for optimal sleep. Data were collected on a community sample of 514 adolescents (16-19; 17.7 ± 0.4 years; 50% female) participating in the late adolescent examination of a longitudinal study on sleep and health. Sleep hygiene and daytime sleepiness were obtained from adolescent reports, behavior from caretaker reports, and sleep-wake estimation on weekdays from wrist actigraphy. Confirmatory factor analysis indicated the empirical and conceptually based factor structure were similar for six of the eight proposed sleep hygiene domains. Internal consistency of the revised scale (ASHSr) was α = 0.84; subscale alphas were: physiological: α = 0.60; behavioural arousal: α = 0.62; cognitive/emotional: α = 0.81; sleep environment: α = 0.61; sleep stability: α = 0.68; daytime sleep: α = 0.78. Sleep hygiene scores were associated positively with sleep duration (r = 0.16) and sleep efficiency (r = 0.12) and negatively with daytime sleepiness (r = -0.26). Results of extreme-groups analyses comparing ASHSr scores in the lowest and highest quintile provided further evidence for concurrent validity. Correlations between sleep hygiene scores and caretaker reports of school competence, internalizing and externalizing behaviours provided support for convergent validity. These findings indicate that the ASHSr has satisfactory psychometric properties for a research instrument and is a useful research tool for assessing sleep hygiene in adolescents. © 2013 European Sleep Research Society.
Psychometric testing of the properties of the spiritual health scale short form.

Science.gov (United States)

Hsiao, Ya-Chu; Chiang, Yi-Chien; Lee, Hsiang-Chun; Han, Chin-Yen

2013-11-01

To further examine the psychometric properties of the spiritual health scale short form, including its reliability and validity. Spirituality is one of the main factors associated with good health outcomes. A reliable and valid instrument to measure spirituality is essential to identify the spiritual needs of an individual and to evaluate the effect of spiritual care. A cross-sectional study design was used. The study was conducted in six nursing schools in northern, central and southern Taiwan. The inclusion criterion for participants was nursing students with clinical practice experience. Initially, 1141 participants were recruited for the study, but 67 were absent and 48 did not complete the questionnaires. A total of 1026 participants were finally recruited, indicating a response rate of 89·9%. The psychometric testing of the spiritual health scale short form included construct validity with confirmatory factor analysis, known-group validity and internal consistency reliability. The results of the confirmatory factor analysis supported the five-factor model as an acceptable model fit. In the known-group validity, the results indicated that people who are in the category of primary religious affiliation have better spiritual health than people in the category of secondary religious affiliation and atheism. The result also indicated that the 24-item spiritual health scale short form achieved an acceptable internal consistency coefficient. The findings suggest that the spiritual health scale short form is a valid and reliable instrument for the appraisal of individual spiritual health. The spiritual health scale short form could provide useful information to guide clinical practice in assessing and managing people's spiritual health in Taiwan. © 2013 John Wiley & Sons Ltd.
Translation and Psychometric Testing of the Persian Version of the Spiritual Needs Questionnaire Among Elders With Chronic Diseases.

Science.gov (United States)

Moeini, Babak; Zamanian, Hadi; Taheri-Kharameh, Zahra; Ramezani, Tahereh; Saati-Asr, Mohamadhasan; Hajrahimian, Mohamadhasan; Amini-Tehrani, Mohammadali

2018-01-01

Spirituality plays an important role in coping with chronic diseases for patients and they often report unmet spiritual and existential needs, which should be considered for a holistic view of their health. Studying spiritual needs in this generation requires culturally appropriate and valid instruments. The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The "forward-backward" procedure was applied to translate the SpNQ from English into Persian. The SpNQ-Persian Version (SpNQ-PV) was checked in terms of validity and reliability with a convenience sample of 100 elders with chronic diseases who were recruited from the inpatient wards at two university hospitals in Qom, Iran. The validity was assessed using content, face, and construct validity. The Cronbach alpha and test-retest were used to assess the reliability of the questionnaire. The results of the exploratory factor analysis indicated a five-factor solution for the questionnaire, which included religious needs, existential needs, forgiveness/generativity needs, need for inner peace, and emotional needs. These accounted for 60.1% of the total observed variance. One item was removed (factor loading Spiritual Well-being Scale. Cronbach alpha of the subscales ranged from 0.56 to 0.78 and the test-retest reliability ranged from 0.72 to 0.91, which indicated an acceptable range of reliability. The SpNQ-PV showed a minor difference in structuring and indicated good psychometric properties, which can be used to assess the spiritual needs of Iranian elders suffering from chronic diseases. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Attitudes toward Science: Measurement and Psychometric Properties of the Test of Science-Related Attitudes for Its Use in Spanish-Speaking Classrooms

Science.gov (United States)

Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina

2016-01-01

Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the "Test of Science-Related Attitudes" (TOSRA), such as its construct validity, its…
Computational Psychometrics for the Measurement of Collaborative Problem Solving Skills

Science.gov (United States)

Polyak, Stephen T.; von Davier, Alina A.; Peterschmidt, Kurt

2017-01-01

This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses. PMID:29238314
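The continuous Bayesian evidence tracing described in this record amounts to repeatedly applying Bayes' rule: the current probabilities over skill levels are multiplied by the conditional probability of the observed response option at each level and then renormalized. A minimal sketch follows. The level names, the conditional probability table, and the response labels are hypothetical and are not taken from the paper.

```python
def update_skill_belief(prior, cpt, response):
    """One Bayes'-rule update of skill-level probabilities.

    prior:    dict mapping skill level -> current probability
    cpt:      dict mapping skill level -> {response option: P(option | level)}
    response: the response option that was observed
    """
    unnormalized = {lvl: prior[lvl] * cpt[lvl][response] for lvl in prior}
    evidence = sum(unnormalized.values())
    return {lvl: p / evidence for lvl, p in unnormalized.items()}


# Hypothetical two-level skill with a two-option dialog item.
belief = {"low": 0.5, "high": 0.5}  # limited prior knowledge of mastery
item_cpt = {
    "low": {"a": 0.8, "b": 0.2},
    "high": {"a": 0.3, "b": 0.7},
}

# Trace evidence across a sequence of observed responses.
for observed in ["b", "b", "a"]:
    belief = update_skill_belief(belief, item_cpt, observed)
```

Each pass through the loop uses the previous posterior as the new prior, which is what makes the tracing continuous as new events arrive.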
Computational Psychometrics for the Measurement of Collaborative Problem Solving Skills

Directory of Open Access Journals (Sweden)

Stephen T. Polyak

2017-11-01

This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses.
Cross-Cultural Adaptation and Psychometric Properties of the AUDIT and CAGE Questionnaires in Tanzanian Swahili for a Traumatic Brain Injury Population.

Science.gov (United States)

Vissoci, Joao Ricardo Nickenig; Hertz, Julian; El-Gabri, Deena; Andrade Do Nascimento, José Roberto; Pestillo De Oliveira, Leonardo; Mmbaga, Blandina Theophil; Mvungi, Mark; Staton, Catherine A

2018-01-01

To develop Swahili versions of the Alcohol Use Disorders Identification Test (AUDIT) and CAGE questionnaires and evaluate their psychometric properties in a traumatic brain injury (TBI) population in Tanzania. Swahili versions of the AUDIT and CAGE were developed through translation and back-translation by a panel of native speakers of both English and Swahili. The translated instruments were administered to a sample of Tanzanian adults from a TBI registry. The validity and reliability were analyzed using standard statistical methods. The translated versions of both the AUDIT and CAGE questionnaires were found to have excellent language clarity and domain coherence. Reliability was acceptable (>0.85) for all tested versions. Confirmatory factor analysis of one-, two-, and three-factor solutions for the AUDIT and a one-factor solution for the CAGE showed adequate results. AUDIT and CAGE scores were strongly correlated with each other (R > 0.80), and AUDIT scores were significantly lower in non-drinkers compared to drinkers. This article presents the first Swahili and Tanzanian adaptations of the AUDIT and CAGE instruments as well as the first validation of these questionnaires with TBI patients. Both instruments were found to have acceptable psychometric properties, resulting in two new useful tools for medical and social research in this setting. © The Author 2017. Medical Council on Alcohol and Oxford University Press. All rights reserved.
Validation of Patient-Reported Outcomes Measurement Information System Computerized Adaptive Tests Against the Foot and Ankle Outcome Score for 6 Common Foot and Ankle Pathologies.

Science.gov (United States)

Koltsov, Jayme C B; Greenfield, Stephen T; Soukup, Dylan; Do, Huong T; Ellis, Scott J

2017-08-01

The field of foot and ankle surgery lacks a widely accepted gold-standard patient-reported outcome instrument. With the changing infrastructure of the medical profession, more efficient patient-reported outcome tools are needed to reduce respondent burden and increase participation while providing consistent and reliable measurement across multiple pathologies and disciplines. The primary purpose of the present study was to validate 3 Patient-Reported Outcomes Measurement Information System computer adaptive tests (CATs) most relevant to the foot and ankle discipline against the Foot and Ankle Outcome Score (FAOS) and the Short Form 12 general health status survey in patients with 6 common foot and ankle pathologies. Patients (n = 240) indicated for operative treatment for 1 of 6 common foot and ankle pathologies completed the CATs, FAOS, and Short Form 12 at their preoperative surgical visits, 1 week subsequently (before surgery), and at 6 months postoperatively. The psychometric properties of the instruments were assessed and compared. The Patient-Reported Outcomes Measurement Information System CATs each took less than 1 minute to complete, whereas the FAOS took 6.5 minutes, and the Short Form 12 took 3 minutes. CAT scores were more normally distributed and had fewer floor and ceiling effects than those on the FAOS, which reached as high as 24%. The CATs were more precise than the FAOS and had similar responsiveness and test-retest reliability. The physical function and mobility CATs correlated strongly with the activities subscale of the FAOS, and the pain interference CAT correlated strongly with the pain subscale of the FAOS. The CATs and FAOS were responsive to changes with operative treatment for 6 common foot and ankle pathologies. The CATs performed as well as or better than the FAOS in all aspects of psychometric validity. The Patient-Reported Outcomes Measurement Information System CATs show tremendous potential for improving the study of patient
Evaluating the Effectiveness of Collaborative Computer-Intensive Projects in an Undergraduate Psychometrics Course

Science.gov (United States)

Barchard, Kimberly A.; Pace, Larry A.

2010-01-01

Undergraduate psychometrics classes often use computer-intensive active learning projects. However, little research has examined active learning or computer-intensive projects in psychometrics courses. We describe two computer-intensive collaborative learning projects used to teach the design and evaluation of psychological tests. Course…
The Shame and Guilt Scales of the Test of Self-Conscious Affect-Adolescent (TOSCA-A): Psychometric Properties for Responses from Children, and Measurement Invariance Across Children and Adolescents

Science.gov (United States)

Watson, Shaun D.; Gomez, Rapson; Gullone, Eleonora

2016-01-01

This study examined various psychometric properties of the items comprising the shame and guilt scales of the Test of Self-Conscious Affect-Adolescent (TOSCA-A) in a group of children between 8 and 11 years of age. A total of 699 children (367 females and 332 males) completed these scales, and also measures of depression and empathy. Confirmatory factor analysis (CFA) provided support for an oblique two-factor model, with the originally proposed shame and guilt items comprising shame and guilt factors, respectively. There was good internal consistency reliability for the shame and guilt scales, with omega coefficient values of 0.77 and 0.81 for shame and guilt, respectively. Also, shame correlated with depression symptoms positively (0.34, p Guilt correlated with depression symptoms negatively (-0.28, p guilt factors. Multiple-group CFA comparing this group of children with a separate group of adolescents (320 females and 242 males), based on the chi-square difference test, supported full metric invariance, the intercept invariance of 17 of the 30 shame and guilt items, and higher latent mean scores among children for both shame and guilt. The non-equivalency for intercepts and mean scores were of small effect sizes. Comparisons based on the difference in root mean squared error of approximation values supported full measurement invariance and no group difference for latent mean scores. The findings in the current study support the use of the TOSCA-A in children and the valid comparison of scores between children and adolescents, thereby opening up the possibility of evaluating change in the TOSCA-A shame and guilt factors over these developmental age groups. PMID:27242573
Psychometric evaluation of the HIV symptom distress scale

Science.gov (United States)

Marc, Linda G.; Wang, Ming-Mei; Testa, Marcia A.

2012-01-01

The objective of this paper is to psychometrically validate the HIV Symptom Distress Scale (SDS), an instrument that can be used to measure overall HIV symptom distress or clinically relevant groups of HIV symptoms. A secondary data analysis was conducted using the Collaborations in HIV Outcomes Research U.S. Cohort (CHORUS). Inclusion criteria required study participants (N=5,521) to have a valid baseline measure of the AIDS Clinical Trial Group Symptom Distress Module, with an SF-12 or SF-36 completed on the same day. Psychometric testing assessed unidimensionality, internal consistency and factor structure using exploratory and confirmatory factor analysis, and structural equation modeling (SEM). Construct validity examined whether the new measure discriminates across clinical significance (CD4 and HIV viral load). Findings show that the SDS has high reliability (α=0.92), and SEM supports a correlated second-order factor model (physical and mental distress) with acceptable fit (GFI=0.88, AGFI=0.85, NFI=0.99, NNFI=0.99; RMSEA=0.06, [90% CI 0.06 – 0.06]; Satorra-Bentler scaled χ²=3274.20; p=0.0). Construct validity shows significant differences across categories for HIV-1 viral load (p< 0.001) and CD4 (p< 0.001). Differences in mean SDS scores exist across gender (p< 0.001), race/ethnicity (p< 0.05) and educational attainment (p < 0.001). Hence, the HIV Symptom Distress Scale is a reliable and valid instrument, which measures overall HIV symptoms or clinically relevant groups of symptoms. PMID:22409246
Dutch validation of the low anterior resection syndrome score.

Science.gov (United States)

Hupkens, B J P; Breukink, S O; Olde Reuver Of Briel, C; Tanis, P J; de Noo, M E; van Duijvendijk, P; van Westreenen, H L; Dekker, J W T; Chen, T Y T; Juul, T

2018-04-21

The aim of this study was to validate the Dutch translation of the low anterior resection syndrome (LARS) score in a population of Dutch rectal cancer patients. Patients who underwent surgery for rectal cancer received the LARS score questionnaire, a single quality of life (QoL) category question and the European Organization for Research and Treatment of Cancer (EORTC) QLQ-C30 questionnaire. A subgroup of patients received the LARS score twice to assess the test-retest reliability. A total of 165 patients were included in the analysis, identified in six Dutch centres. The response rate was 62.0%. The percentage of patients who reported 'major LARS' was 59.4%. There was a high proportion of patients with a perfect or moderate fit between the QoL category question and the LARS score, showing a good convergent validity. The LARS score was able to discriminate between patients with or without neoadjuvant radiotherapy (P = 0.003), between total and partial mesorectal excision (P = 0.008) and between age groups (P = 0.039). There was a statistically significant association between a higher LARS score and an impaired function on the global QoL subscale and the physical, role, emotional and social functioning subscales of the EORTC QLQ-C30 questionnaire. The test-retest reliability of the LARS score was good, with an intraclass correlation coefficient of 0.79. The good psychometric properties of the Dutch version of the LARS score are comparable overall to the earlier validations in other countries. Therefore, the Dutch translation can be considered to be a valid tool for assessing LARS in Dutch rectal cancer patients. Colorectal Disease © 2018 The Association of Coloproctology of Great Britain and Ireland.
Psychometric evaluation of the adolescent and parent versions of the Gaming Addiction Identification Test (GAIT).

Science.gov (United States)

Vadlin, Sofia; Åslund, Cecilia; Rehn, Mattias; Nilsson, Kent W

2015-12-01

The objective of the study is to evaluate the psychometric properties of the Gaming Addiction Identification Test (GAIT) and its parent version (GAIT-P), in a representative community sample of adolescents and parents in Västmanland, Sweden. Self-rated and parent-rated gaming addictive symptoms identified by GAIT and GAIT-P were analyzed for frequency of endorsement, internal consistency, concordance, factor structure, prevalence of Internet gaming disorder (IGD), concurrence with the Gaming Addiction Scale for Adolescents, 7-item version (GAS) and the parent version of GAS (GAS-P), and for sex differences. The 12-month prevalence of IGD was found to be 1.3% with GAIT and 2.4% with GAIT-P. Results also indicate promising psychometric results within this population, with high internal consistency, and high concurrent validity with GAS and GAS-P. Concordance between adolescents' and parents' ratings was high, although moderate in girls. Although exploratory factor analysis indicated poor model fit, it also indicated unidimensionality and high factor loadings in all analyses. GAIT and GAIT-P are suitable for continued use in measuring gaming addiction in adolescents, and, with the additional two items, they now cover all nine IGD criteria. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
The assessment of fatigue: Psychometric qualities and norms for the Checklist individual strength.

Science.gov (United States)

Worm-Smeitink, M; Gielissen, M; Bloot, L; van Laarhoven, H W M; van Engelen, B G M; van Riel, P; Bleijenberg, G; Nikolaus, S; Knoop, H

2017-07-01

The Checklist Individual Strength (CIS) measures four dimensions of fatigue: Fatigue severity, concentration problems, reduced motivation and activity. On the fatigue severity subscale, a cut-off score of 35 is used. This study 1) investigated the psychometric qualities of the CIS; 2) validated the cut-off score for severe fatigue and 3) provided norms. Representatives of the Dutch general population (n=2288) completed the CIS. The factor structure was investigated using an exploratory factor analysis. Internal consistency and test-retest reliability were determined. Concurrent validity was assessed in two additional samples by correlating the CIS with other fatigue scales (Chalder Fatigue Questionnaire, MOS Short form-36 Vitality subscale, EORTC QLQ-C30 fatigue subscale). To validate the fatigue severity cut-off score, a Receiver Operating Characteristics analysis was performed with patients referred to a chronic fatigue treatment centre (n=5243) and a healthy group (n=1906). Norm scores for CIS subscales were calculated for the general population, patients with chronic fatigue syndrome (CFS; n=1407) and eight groups with other medical conditions (n=1411). The original four-factor structure of the CIS was replicated. Internal consistency (α=0.84-0.95) and test-retest reliability (r=0.74-0.86) of the subscales were high. Correlations with other fatigue scales were moderate to high. The 35 points cut-off score for severe fatigue is appropriate, but, given the 17% false positive rate, should be adjusted to 40 for research in CFS. The CIS is a valid and reliable tool for the assessment of fatigue, with a validated cut-off score for severe fatigue that can be used in clinical practice. Copyright © 2017. Published by Elsevier Inc.
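Validating a cut-off with a Receiver Operating Characteristics analysis, as described in this record, rests on computing sensitivity and the false positive rate at each candidate threshold. The sketch below shows that calculation for a single cut-off. The scores are invented for illustration, and treating a score at or above the cut-off as a positive classification is an assumption, not a rule taken from the CIS manual.

```python
def rates_at_cutoff(patient_scores, healthy_scores, cutoff):
    """Return (sensitivity, false positive rate) for one cut-off.

    Assumption: a score greater than or equal to the cut-off counts as positive.
    """
    true_positives = sum(s >= cutoff for s in patient_scores)
    false_positives = sum(s >= cutoff for s in healthy_scores)
    sensitivity = true_positives / len(patient_scores)
    false_positive_rate = false_positives / len(healthy_scores)
    return sensitivity, false_positive_rate


# Hypothetical fatigue severity scores (illustrative values only).
patients = [50, 42, 38, 30]
healthy = [20, 36, 25, 41, 18]

at_35 = rates_at_cutoff(patients, healthy, 35)
at_40 = rates_at_cutoff(patients, healthy, 40)
```

Raising the cut-off lowers the false positive rate at the cost of sensitivity, which is the trade-off behind recommending a stricter threshold for research use.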
The paced auditory serial addition test for working memory assessment: Psychometric properties

Science.gov (United States)

Nikravesh, Maryam; Jafari, Zahra; Mehrpour, Masoud; Kazemi, Roozbeh; Amiri Shavaki, Younes; Hossienifar, Shamim; Azizi, Mohamad Parsa

2017-01-01

Background: The paced auditory serial addition test (PASAT) was primarily developed to assess the effects of traumatic brain injury on cognitive functioning. Working memory (WM) is one of the most important aspects of cognitive function, and WM impairment is one of the clinically remarkable signs of aphasia. To develop the Persian version of PASAT, an initial version was used in individuals with aphasia (IWA). Methods: In this study, 25 individuals with aphasia (29-60 years) and 85 controls (18-60 years) were included. PASAT was presented as a recording of 61 single-digit numbers (1 to 9). The participants repeatedly added the 2 most recent digits. The psychometric properties of PASAT, including convergent validity (using the digit memory span tasks), divergent validity (using results in the control group and IWA group), and face validity, were investigated. Test-retest reliability was considered as well. Results: The relationship between the PASAT and digit memory span tests was moderate to strong in the control group (forward digit memory span test: r = 0.52, p < 0.0001; backward digit memory span test: r = 0.48, p < 0.0001). A strong relationship was found in IWA (forward digit memory span test: r = 0.72, p < 0.0001; backward digit memory span test: r = 0.53, p = 0.006). Also, strong test-retest reliability (intraclass correlation = 0.95, p < 0.0001) was observed. Conclusion: According to our results, the PASAT is a valid and reliable test to assess working memory, particularly in IWA. It could be used as a feasible tool for clinical and research applications. PMID:29445690
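The task described in the Methods (adding each digit to the one heard just before it) can be sketched as follows. The function names and the scoring rule (one point per correct sum, omissions counted as errors) are assumptions for illustration; the abstract does not state how responses were scored.

```python
def pasat_expected_answers(digits):
    """Correct response for each consecutive pair of presented digits."""
    return [a + b for a, b in zip(digits, digits[1:])]


def pasat_score(digits, responses):
    """Count correct responses; None marks an omitted response (assumed error)."""
    expected = pasat_expected_answers(digits)
    return sum(1 for e, r in zip(expected, responses) if r == e)


# Hypothetical short sequence: 4 digits give 3 expected sums.
digits = [3, 5, 2, 9]
print(pasat_expected_answers(digits))       # [8, 7, 11]
print(pasat_score(digits, [8, None, 11]))   # 2
```

With 61 presented digits, this yields 60 expected sums.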
Psychometric testing of the Agitation Severity Scale for acute presentation behavioral management patients in the emergency department.

Science.gov (United States)

Strout, Tania D

2014-01-01

Agitation is a vexing problem frequently observed in emergency department acute psychiatric patients, yet no instruments to measure agitation in this setting and population were found upon review of the literature. Previously developed agitation rating scales are limited by the length of observation they require, their need for participation by the patient, complexity in scoring, and a lack of validity in this setting and population. The purpose of this study was to psychometrically evaluate and refine an observation-based agitation scale for use with emergency department acute psychiatric patients. Using a methodological design, the 21-item Agitation Severity Scale was utilized to assess 270 adult psychiatric patients in the emergency setting in a prospective, observational fashion. Reliability analysis, item analysis, exploratory factor analysis, and validity assessments were completed. The relationship between Agitation Severity Scale scores and scores on the previously established Overt Agitation Severity Scale was evaluated. The instrument was reduced to 17 items representing four factors (Aggressive Behaviors, Interpersonal Behaviors, Involuntary Motor Behaviors, and Physical Stance) that accounted for nearly 70% of observed variance, Cronbach's α = 0.91. Evidence of internal consistency reliability, equivalence reliability, construct validity, and convergent validity was established. Through this study, the 17-item Agitation Severity Scale demonstrated acceptable levels of reliability and validity when used with acute psychiatric patients in the emergency setting. This instrument holds promise as a method of enhancing clinical communication about agitation, evaluating the efficacy of interventions aimed at decreasing agitation, and as a research tool.
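Cronbach's α, reported here and in many of the other records, is commonly computed as α = k/(k−1) · (1 − Σ item variances / variance of total scores). A minimal sketch of that textbook formula follows; the function name and the data are hypothetical, and this is not the study's own analysis code.

```python
from statistics import variance


def cronbach_alpha(scores):
    """scores: one list per respondent, each holding k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the respondents' total scores.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical data: three respondents, three perfectly consistent items.
print(cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]]))  # 1.0
```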
Psychometric evaluation of a motor control test battery of the craniofacial region.

Science.gov (United States)

von Piekartz, H; Stotz, E; Both, A; Bahn, G; Armijo-Olivo, S; Ballenberger, N

2017-12-01

The primary objective of this study was to determine the structural and known-group validity as well as the inter-rater reliability of a test battery to evaluate the motor control of the craniofacial region. Seventy volunteers without TMD and 25 subjects with TMD (Axes I) per the DC/TMD were asked to execute a test battery consisting of eight tests. The tests were video-taped in the same sequence in a standardised manner. Two experienced physical therapists participated in this study as blinded assessors. We used exploratory factor analysis to identify the underlying component structure of the eight tests. Internal consistency (Cronbach's α), inter-rater reliability (intra-class correlation coefficient) and construct validity (ie, hypothesis testing-known-group validity) (receiver operating curves) were also explored for the test battery. The structural validity showed the presence of one factor underlying the construct of the test battery. The internal consistency was excellent (0.90) as well as the inter-rater reliability. All values of reliability were close to 0.9 or above, indicating very high inter-rater reliability. The area under the curve (AUC) was 0.93 for rater 1 and 0.94 for rater 2, indicating excellent discrimination between subjects with TMD and healthy controls. The results of the present study support the psychometric properties of the test battery to measure motor control of the craniofacial region when evaluated through videotaping. This test battery could be used to differentiate between healthy subjects and subjects with musculoskeletal impairments in the cervical and oro-facial regions. In addition, this test battery could be used to assess the effectiveness of management strategies in the craniofacial region. © 2017 John Wiley & Sons Ltd.
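An AUC such as the 0.93 and 0.94 above can be read as the probability that a randomly chosen case receives a higher score than a randomly chosen control, with ties counted as one half. The sketch below computes it that way. The function name, the data, and the assumption that higher scores indicate impairment are illustrative only.

```python
def auc(case_scores, control_scores):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    wins = 0.0
    for c in case_scores:
        for h in control_scores:
            if c > h:
                wins += 1.0
            elif c == h:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical scores: perfect separation, then no separation.
print(auc([3, 4], [1, 2]))  # 1.0
print(auc([1, 2], [1, 2]))  # 0.5
```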
Psychometric properties of the Alcohol Use Disorders Identification Test (AUDIT) and prevalence of alcohol use among Iranian psychiatric outpatients.

Science.gov (United States)

Noorbakhsh, Simasadat; Shams, Jamal; Faghihimohamadi, Mohamadmahdi; Zahiroddin, Hanieh; Hallgren, Mats; Kallmen, Hakan

2018-01-30

Iran is a developing and Islamic country where the consumption of alcoholic beverages is banned. However, psychiatric disorders and alcohol use disorders are often co-occurring. We used the Alcohol Use Disorders Identification Test (AUDIT) to estimate the prevalence of alcohol use and examined the psychometric properties of the test among psychiatric outpatients in Teheran, Iran. AUDIT was completed by 846 consecutive (sequential) patients. Descriptive statistics, internal consistency (Cronbach alpha), confirmatory and exploratory factor analyses were used to analyze the prevalence of alcohol use, reliability and construct validity. 12% of men and 1% of women were hazardous alcohol consumers. Internal reliability of the Iranian version of AUDIT was excellent. Confirmatory factor analyses showed that the construct validity and the fit of previous factor structures (1, 2 and 3 factors) to data were not good and seemingly contradicted results from the explorative principal axis factoring, which showed that a 1-factor solution explained 77% of the co-variances. We could not reproduce the suggested factor structure of AUDIT, probably due to the skewed distribution of alcohol consumption. Only 19% of men and 3% of women scored above 0 on AUDIT. This could be explained by the fact that alcohol is illegal in Iran. In conclusion the AUDIT exhibited good internal reliability when used as a single scale. The prevalence estimates according to AUDIT were somewhat higher among psychiatric patients compared to what was reported by WHO regarding the general population.
Vietnamese validation of the short version of Internet Addiction Test

OpenAIRE

Tran, Bach Xuan; Mai, Hue Thi; Nguyen, Long Hoang; Nguyen, Cuong Tat; Latkin, Carl A.; Zhang, Melvyn W.B.; Ho, Roger C.M.

2017-01-01

Background and aims: The main goal of the present study was to examine the psychometric properties of a Vietnamese version of the short version of the Internet Addiction Test (s-IAT) and to assess the relationship between s-IAT scores and demographics, health-related quality of life and perceived stress scores in young Vietnamese. Methods: The Vietnamese version of s-IAT was administered to a sample of 589 participants. Exploratory factor and reliability analyses were performed. Regression analys...
Psychometric properties of an innovative self-report measure: The Social Anxiety Questionnaire for adults.

Science.gov (United States)

Caballo, Vicente E; Arias, Benito; Salazar, Isabel C; Irurtia, María Jesús; Hofmann, Stefan G

2015-09-01

This article presents the psychometric properties of a new measure of social anxiety, the Social Anxiety Questionnaire for adults (SAQ), composed of 30 items that were developed based on participants from 16 Latin American countries, Spain, and Portugal. Two groups of participants were included in the study: a nonclinical group involving 18,133 persons and a clinical group comprising 334 patients with a diagnosis of social anxiety disorder (social phobia). Exploratory and confirmatory factor analyses supported a 5-factor structure of the questionnaire. The factors were labeled as follows: (1) Interactions with strangers, (2) Speaking in public/talking with people in authority, (3) Interactions with the opposite sex, (4) Criticism and embarrassment, and (5) Assertive expression of annoyance, disgust, or displeasure. Psychometric evidence supported the internal consistency, convergent validity, and measurement invariance of the SAQ. To facilitate clinical applications, a receiver operating characteristics (ROC) analysis identified cut scores for men and women for each factor and for the global score. (c) 2015 APA, all rights reserved.
Development and psychometric testing the Health of Body, Mind and Spirit Scale for assessing individuals who have drug abuse histories.

Science.gov (United States)

Sun, Fan-Ko; Chiang, Chun-Ying; Lu, Chu-Yun; Yu, Pei-Jane; Liao, Tzu-Chiao; Lan, Chu-Mei

2018-03-01

To develop the Health of Body, Mind and Spirit Scale (HBMSS), which was designed to assess drug abusers' health condition. Helping drug abusers to become healthy is important to healthcare professionals. However, no instrument exists to assess drug abusers' state of health. A cross-sectional questionnaire survey was implemented to examine the validity of the HBMSS. Data were collected from 2015 to 2016 at one drug abuse prevention centre in Taiwan. Participants (N = 320) who had abused drugs were invited to complete a preliminary 64-item version of the HBMSS. An item analysis, criterion-related validity analysis (using the Relapse Prediction Scale [RPS] score), split-half reliability testing and confirmatory factor analysis (CFA) were conducted to examine the psychometric properties of the HBMSS. The final version of the HBMSS contained 15 items that were divided into three subscales: the health of the body, mind and spirit. Cronbach's α and split-half reliability coefficients were all above .85. The factor loading of each item was between .74 and .95. The HBMSS had satisfactory criterion-related validity with the RPS score (r = -.50, p < .001). A second-order CFA was conducted on the HBMSS. The fit indexes were good, χ² = 184.060, df = 94, χ²/df = 1.958 (p = .000). The entire HBMSS and the subscales had satisfactory reliability and validity. Healthcare professionals could use the HBMSS to evaluate the condition of the health of individuals with a drug abuse history. © 2017 John Wiley & Sons Ltd.
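The reported ratio of chi-square to degrees of freedom follows directly from the two figures given in the abstract, as this worked check shows:

```latex
\[
\frac{\chi^{2}}{df} = \frac{184.060}{94} \approx 1.958
\]
```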
Psychometric study of the Required Care Levels for People with Severe Mental Disorder Assessment Scale (ENAR-TMG).

Science.gov (United States)

Lascorz, David; López, Victoria; Pinedo, Carmen; Trujols, Joan; Vegué, Joan; Pérez, Víctor

2016-03-08

People with severe mental disorder have significant difficulties in everyday life that involve the need for continued support. These needs are not easily measurable with the currently available tools. Therefore, a multidimensional scale that assesses the different levels of need for care is proposed, including a study of its psychometric properties. One-hundred and thirty-nine patients (58% men) with a severe mental disorder were assessed using the Required Care Levels for People with Severe Mental Disorder Assessment Scale (ENAR-TMG), the Camberwell Assessment of Need scale, and the Health of the Nation Outcome Scales. ENAR-TMG's psychometric features were examined by: a) evaluating 2 sources of validity evidence (evidence based on internal structure and evidence based on relations to other variables), and b) estimating the internal consistency, temporal stability, inter-rater reliability, and sensitivity to change of scores of the ENAR-TMG's subscales. Exploratory factor analyses revealed a one-factor structure for each of the theoretical dimensions of the scale, in which all but one showed a significant and positive correlation with the Camberwell Assessment of Need (range of r: 0.143-0.557) and Health of the Nation Outcome Scales (range of r: 0.241-0.474) scales. ENAR-TMG subscale scores showed acceptable internal consistency (range of ordinal α coefficients: 0.682-0.804), excellent test-retest (range of intraclass correlation coefficients: 0.889-0.999) and inter-rater reliabilities (range of intraclass correlation coefficients: 0.926-0.972), and satisfactory sensitivity to treatment-related changes (range of η²: 0.003-0.103). The satisfactory psychometric behaviour of the ENAR-TMG makes the scale a promising tool to assess global functioning in people with a severe mental disorder. Copyright © 2016 SEP y SEPB. Published by Elsevier España. All rights reserved.
The Weighted Airman Promotion System: Standardizing Test Scores

Science.gov (United States)

2008-01-01

[Figure: Top 3/E6 ratio, authorized and inventory] ... AFHRL convened a panel to identify the relevant factors to consider, and then sit as a promotion board and rank ... Costs: If the Air Force decided to standardize test scores, there would be three basic types of costs: implementation costs, marketing costs, and
Psychometric Properties of the Persian Translation of the Sexual Quality of Life–Male Questionnaire

Science.gov (United States)

Maasoumi, Raziyeh; Mokarami, Hamidreza; Nazifi, Morteza; Stallones, Lorann; Taban, Abrahim; Yazdani Aval, Mohsen; Samimi, Kazem

2016-01-01

Sexual dysfunction has been demonstrated to be related to a poor quality of life. These dysfunctions are especially prevalent among men. This cross-sectional study aimed to investigate the psychometric properties of the Persian translation of the Sexual Quality of Life–Male (SQOL-M), translated and adapted to measure sexual quality of life among Iranian men. Forward–backward procedures were applied in translating the original SQOL-M into Persian, and then the psychometric properties of the Persian translation of the SQOL-M were studied. A total of 181 participants (23-60 years old) were included in the study. Validity was assessed by construct validity using confirmatory factor analysis, convergent validity, and content validity. The International Index of Erectile Function (IIEF) and the Work Ability Index were used to study the convergent validity. Reliability was evaluated through internal consistency and test–retest reliability analyses. The results from confirmatory factor analysis confirmed a one-factor solution for the Persian version of the SQOL-M. Content validity of the translated measure was endorsed by 10 specialists. Pearson correlations indicated that Work Ability Index score, dimensions of the IIEF, and the IIEF total score were positively correlated with the Persian version of the SQOL-M (p […] Persian version of the SQOL-M has good to excellent psychometric properties and can be used to assess the sexual quality of life among Iranian men. PMID:26856758
The Convergent, Discriminant, and Concurrent Validity of Scores on the Abbreviated Self-Leadership Questionnaire

Directory of Open Access Journals (Sweden)

Faruk Şahin

2015-10-01

The present study reports the psychometric properties of a short measure of self-leadership in the Turkish context: the Abbreviated Self-Leadership Questionnaire (ASLQ). The ASLQ was examined using two samples and showed sound psychometric properties. Confirmatory factor analysis showed that the nine-item ASLQ measured a single construct of self-leadership. The results supported the convergent and discriminant validity of the one-factor model of the ASLQ in relation to the 35-item Revised Self-Leadership Questionnaire and General Self-Efficacy scale, respectively. With regard to internal consistency and test-retest reliability, the ASLQ showed acceptable results. Furthermore, the results provided evidence that scores on the ASLQ positively predicted individuals' self-reported task performance and that self-efficacy mediated this relationship. Taken together, these findings suggest that the Turkish version of the ASLQ is a reliable and valid measure that can be used to measure self-leadership as one variable of interest in future studies.
Measuring Orthorexia Nervosa: Psychometric Limitations of the ORTO-15.

Science.gov (United States)

Roncero, María; Barrada, Juan Ramón; Perpiñá, Conxa

2017-09-20

Orthorexia nervosa has recently been defined as excessive preoccupation with healthy eating, causing significant nutritional deficiencies and social and personal impairments. The ORTO-15 is the most widely used instrument to evaluate orthorexia nervosa, although previous studies obtained inconsistent results about its psychometric properties, and there are no data on the Spanish version. Thus, the main objective of the present study was to analyze the psychometric properties of the Spanish adaptation of the ORTO-15. In order to cross-validate the results, two independent samples were used (Sample 1: n = 807, 74.1% women; Sample 2: n = 242, 63.2% women). The results did not support the original recoding and reversal of the items; thus, the original scores were maintained. The analysis of the internal structure showed that the best interpretable solution was unidimensional, and due to low loadings, four items were removed. The internal consistency (α = .74) and temporal stability (r = .92; p […] orthorexia nervosa.

Factor structure and psychometric properties of a Romanian translation of the Body Appreciation Scale-2.

Science.gov (United States)

Swami, Viren; Tudorel, Otilia; Goian, Cosmin; Barron, David; Vintila, Mona

2017-12-01

We examined the psychometric properties of a Romanian translation of the 10-item Body Appreciation Scale-2 (BAS-2). A total of 453 university students from Romania completed the BAS-2, along with measures of disordered eating, self-esteem, satisfaction with life, and subjective happiness. In addition, a separate sample of university students (N=109) completed only the BAS-2 at two time-points three weeks apart. Principal-axis factor analysis indicated that BAS-2 scores had a one-dimensional factor structure in both women and men. Confirmatory factor analysis indicated that this factor structure had adequate fit, but invariance across sex was not supported. Further analyses indicated that BAS-2 scores evidenced internal consistency, convergent validity, and test-retest reliability in both women and men. These results suggest that BAS-2 scores reduce to one dimension in Romanian adults, but the lack of sex invariance may indicate that the same latent construct is not being measured in women and men. Copyright © 2017 Elsevier Ltd. All rights reserved.
Psychometric Properties of the Physical Activity Questionnaire for Older Children in Italy: Testing the Validity among a General and Clinical Pediatric Population.

Directory of Open Access Journals (Sweden)

Erica Gobbi

The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric populations. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In Studies 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05). Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Psychometric Properties of the Physical Activity Questionnaire for Older Children in Italy: Testing the Validity among a General and Clinical Pediatric Population.

Science.gov (United States)

Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio

2016-01-01

The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric populations. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In Studies 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05). Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Patient-Reported Outcome questionnaires for hip arthroscopy: a systematic review of the psychometric evidence

Science.gov (United States)

2011-01-01

Background: Hip arthroscopies are often used in the treatment of intra-articular hip injuries. Patient-reported outcomes (PRO) are an important parameter in evaluating treatment. It is unclear which PRO questionnaires are specifically available for hip arthroscopy patients. The aim of this systematic review was to investigate which PRO questionnaires are valid and reliable in the evaluation of patients undergoing hip arthroscopy. Methods: A search was conducted in PubMed, Medline, CINAHL, the Cochrane Library, PEDro, EMBASE and Web of Science from 1931 to October 2010. Studies assessing the quality of PRO questionnaires in the evaluation of patients undergoing hip arthroscopy were included. The quality of the questionnaires was evaluated by the psychometric properties of the outcome measures. The quality of the articles investigating the questionnaires was assessed by the COSMIN list. Results: Five articles identified three questionnaires: the Modified Harris Hip Score (MHHS), the Nonarthritic Hip Score (NAHS) and the Hip Outcome Score (HOS). The NAHS scored best on the content validity, whereas the HOS scored best on agreement, internal consistency, reliability and responsiveness. The quality of the articles describing the HOS scored highest. The NAHS is the best quality questionnaire. The articles describing the HOS are the best quality articles. Conclusions: This systematic review shows that there is no conclusive evidence for the use of a single patient-reported outcome questionnaire in the evaluation of patients undergoing hip arthroscopy. Based on available psychometric evidence we recommend using a combination of the NAHS and the HOS for patients undergoing hip arthroscopy. PMID:21619610
Psychometric Properties of the MMPI-2-RF Somatic Complaints (RC1) Scale

Science.gov (United States)

Thomas, Michael L.; Locke, Dona E. C.

2010-01-01

The MMPI-2 Restructured Form (MMPI-2-RF; Tellegen & Ben-Porath, 2008) was designed to be psychometrically superior to its MMPI-2 counterpart. However, the test has yet to be extensively evaluated in diverse clinical settings. The purpose of this study was to examine the psychometric properties of the MMPI-2-RF Somatic Complaints (RC1) scale in…
Evaluation of the psychometric properties of the Nighttime Symptoms of COPD Instrument.

Science.gov (United States)

Mocarski, Michelle; Zaiser, Erica; Trundell, Dylan; Make, Barry J; Hareendran, Asha

2015-01-01

Nighttime symptoms can negatively impact the quality of life of patients with chronic obstructive pulmonary disease (COPD). The Nighttime Symptoms of COPD Instrument (NiSCI) was designed to measure the occurrence and severity of nighttime symptoms in patients with COPD, the impact of symptoms on nighttime awakenings, and rescue medication use. The objective of this study was to explore item reduction, inform scoring recommendations, and evaluate the psychometric properties of the NiSCI. COPD patients participating in a Phase III clinical trial completed the NiSCI daily. Item analyses were conducted using weekly mean and single day scores. Descriptive statistics (including percentage of respondents at floor/ceiling and inter-item correlations), factor analyses, and Rasch model analyses were conducted to examine item performance and scoring. Test-retest reliability was assessed for the final instrument using the intraclass correlation coefficient (ICC). Correlations with assessments conducted during study visits were used to evaluate convergent and known-groups validity. Data from 1,663 COPD patients aged 40-93 years were analyzed. Item analyses supported the generation of four scores. A one-factor structure was confirmed with factor analysis and Rasch analysis for the symptom severity score. Test-retest reliability was confirmed for the six-item symptom severity (ICC, 0.85), number of nighttime awakenings (ICC, 0.82), and rescue medication (ICC, 0.68) scores. Convergent validity was supported by significant correlations between the NiSCI, St George's Respiratory Questionnaire, and Exacerbations of Chronic Obstructive Pulmonary Disease Tool-Respiratory Symptoms scores. The results suggest that the NiSCI can be used to determine the severity of nighttime COPD symptoms, the number of nighttime awakenings due to COPD symptoms, and the nighttime use of rescue medication. The NiSCI is a reliable and valid instrument to evaluate these concepts in COPD patients in clinical
Cross-cultural adaptation and validation of the Turkish version of Oxford hip score.

Science.gov (United States)

Tuğay, Baki Umut; Tuğay, Nazan; Güney, Hande; Hazar, Zeynep; Yüksel, İnci; Atilla, Bülent

2015-06-01

The purpose of this study was to translate the Oxford hip score (OHS) into Turkish and to evaluate the psychometric properties by testing the internal consistency, reproducibility, construct validity, and responsiveness in patients with hip osteoarthritis (OA). The Oxford hip score was translated and culturally adapted according to the guidelines in the literature. Seventy patients (mean age 61.45 ± 9.29 years) with hip osteoarthritis participated in the study. Patients completed the Turkish Oxford hip score (OHS-TR), the Short-Form 36 (SF-36), and the Western Ontario and McMaster Universities Index (WOMAC). Internal consistency was tested using Cronbach's α coefficient. Patients completed the OHS-TR questionnaire twice in 7 days to determine the reproducibility. Correlation between the total results of both tests was determined by the Pearson correlation coefficient and intraclass correlation coefficient (ICC). Validity was assessed by calculating the Pearson correlation coefficient between the OHS-TR and WOMAC and SF-36 scores. Floor and ceiling effects were analyzed. The internal consistency was high (Cronbach's α 0.93). The construct validity showed a significant correlation between the OHS-TR and WOMAC and related SF-36 domains (p < 0.001). The ICCs ranged between 0.80 and 0.99. There was no floor or ceiling effect in total OHS-TR score. The OHS-TR questionnaire is valid, reliable, and responsive for the Turkish-speaking patients with hip OA.
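Floor and ceiling effects, mentioned here and in several other records, are usually expressed as the percentage of respondents at the lowest and highest possible score. A minimal sketch follows; the function name, the 0 to 48 score range in the example, and the data are assumptions, since the abstract does not give the scoring range.

```python
def floor_ceiling(scores, min_score, max_score):
    """Return (floor %, ceiling %): share of respondents at each extreme."""
    n = len(scores)
    floor_pct = 100 * sum(s == min_score for s in scores) / n
    ceiling_pct = 100 * sum(s == max_score for s in scores) / n
    return floor_pct, ceiling_pct


# Hypothetical total scores on an assumed 0-48 scale.
print(floor_ceiling([0, 10, 48, 48], 0, 48))  # (25.0, 50.0)
```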
Modern psychometrics for assessing achievement goal orientation: a Rasch analysis.

Science.gov (United States)

Muis, Krista R; Winne, Philip H; Edwards, Ordene V

2009-09-01

A program of research is needed that assesses the psychometric properties of instruments designed to quantify students' achievement goal orientations to clarify inconsistencies across previous studies and to provide a stronger basis for future research. We conducted traditional psychometric and modern Rasch-model analyses of the Achievement Goals Questionnaire (AGQ, Elliot & McGregor, 2001) and the Patterns of Adaptive Learning Scale (PALS, Midgley et al., 2000) to provide an in-depth analysis of the two most popular instruments in educational psychology. For Study 1, 217 undergraduate students enrolled in educational psychology courses participated. Thirty-four were male and 181 were female (two did not respond). Participants completed the AGQ in the context of their educational psychology class. For Study 2, 126 undergraduate students enrolled in educational psychology courses participated. Thirty were male and 95 were female (one did not respond). Participants completed the PALS in the context of their educational psychology class. Traditional psychometric assessments of the AGQ and PALS replicated previous studies. For both, reliability estimates ranged from good to very good for raw subscale scores and fit for the models of goal orientations were good. Based on traditional psychometrics, the AGQ and PALS are valid and reliable indicators of achievement goals. Rasch analyses revealed that estimates of reliability for items were very good but respondent ability estimates varied from poor to good for both the AGQ and PALS. These findings indicate that items validly and reliably reflect a group's aggregate goal orientation, but using either instrument to characterize an individual's goal orientation is hazardous.
THE MODERN RACISM SCALE: PSYCHOMETRIC

Directory of Open Access Journals (Sweden)

MANUEL CÁRDENAS

2007-08-01

An adaptation of McConahay, Hardee and Batts' (1981) modern racism scale is presented for the Chilean population, and its psychometric properties (reliability and validity) are studied, along with its relationship with other psychosocial variables relevant in studies on prejudice and ethnic discrimination (authoritarianism, religiousness, political position, etc.), as well as with other forms of prejudice (gender stereotypes and homophobia). The sample consisted of 120 participants, psychology students residing in the city of Antofagasta (a geographical zone with a high number of Latin American immigrants). Our findings show that the scale seems to be a reliable instrument for measuring prejudice towards Bolivian immigrants in our social environment. Likewise, important differences in the psychosocial variables used are detected between subjects with high and low scores.
Psychometric properties of the motor diagnostics in the German football talent identification and development programme.

Science.gov (United States)

Höner, Oliver; Votteler, Andreas; Schmid, Markus; Schultz, Florian; Roth, Klaus

2015-01-01

The utilisation of motor performance tests for talent identification in youth sports is discussed intensively in talent research. This article examines the reliability, differential stability and validity of the motor diagnostics conducted nationwide by the German football talent identification and development programme and provides reference values for a standardised interpretation of the diagnostics results. Highly selected players (the top 4% of their age groups, U12-U15) took part in the diagnostics at 17 measurement points between spring 2004 and spring 2012 (N = 68,158). The heterogeneous test battery measured speed abilities and football-specific technical skills (sprint, agility, dribbling, ball control, shooting, juggling). For all measurement points, the overall score and the speed tests showed high internal consistency, high test-retest reliability and satisfying differential stability. The diagnostics demonstrated satisfying factorial-related validity with plausible and stable loadings on the two empirical factors "speed" and "technical skills". The score, and the technical skills dribbling and juggling, differentiated the most among players of different performance levels and thus showed the highest criterion-related validity. Satisfactory psychometric properties for the diagnostics are an important prerequisite for a scientifically sound rating of players' actual motor performance and for the future examination of the prognostic validity for success in adulthood.
Measuring patients’ satisfaction with their anti-TNF treatment in severe Crohn’s disease: scoring and psychometric validation of the Satisfaction for PAtients in Crohn’s diseasE Questionnaire (SPACE-Q©)

Directory of Open Access Journals (Sweden)

Gilet H

2014-12-01

Hélène Gilet,1 Benoit Arnould,1 Fatoumata Fofana,1 Pierre Clerson,2 Jean-Frédéric Colombel,10 Olivier D’Hondt,2 Patrick Faure,4 Hervé Hagège,5 Maria Nachury,3 Stéphane Nahon,6 Gilbert Tucat,7 Luc Vandromme,8 Ines Cazala-Telinge,9 Emmanuel Thibout9 1HEOR and Strategic Market Access, Mapi, Lyon, France; 2Orgamétrie, Roubaix, France; 3Hôpital Claude Huriez, Lille, France; 4Clinique Saint-Jean du Languedoc, Toulouse, France; 5Centre Hospitalier Intercommunal, Créteil, France; 6Centre Hospitalier Intercommunal, Le Raincy Montfermeil, France; 7Gastroenterologist, Private Clinical Practice, Paris, France; 8Gastroenterologist, Private Clinical Practice, Reims, France; 9Abbvie France, Rungis, France; 10Icahn School of Medicine at Mount Sinai, New York, NY, USA. Background: Severe Crohn’s disease management includes anti-tumor necrosis factor (anti-TNF) drugs that differ from early-stage treatments regarding efficacy, safety, and convenience. This study aimed to finalize and psychometrically validate the Satisfaction for PAtients in Crohn’s diseasE Questionnaire (SPACE-Q©), developed to measure satisfaction with anti-TNF treatment in patients with severe Crohn’s disease. Methods: A total of 279 patients with severe Crohn’s disease receiving anti-TNF therapy completed the SPACE-Q 62-item pilot version at inclusion and 12 and 13 weeks after first anti-TNF injection. The final SPACE-Q scoring was defined using multitrait and regression analyses and clinical relevance considerations. Psychometric validation included clinical validity against Harvey–Bradshaw score, concurrent validity against Treatment Satisfaction Questionnaire for Medication (TSQM), internal consistency reliability, test–retest reliability, and responsiveness against the patient global impression of change (PGIC). Results: Quality of completion was good (55%–67% of patients completed all items). Four items were removed from the questionnaire. Eleven scores were defined
Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

Science.gov (United States)

Kim, Seonghoon

2013-01-01

With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…
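The recursion described in the abstract above can be sketched for the original dichotomous case as follows. This Python example implements the classic Lord and Wingersky recursion for number-correct scores at a fixed proficiency; it is not the generalized real-number-score algorithm that the article itself presents. The two-parameter logistic item model and all item parameters below are assumptions chosen only for illustration.

```python
import math


def p_correct_2pl(theta, a, b):
    """Probability of a correct response under a 2PL model (assumed here)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def lord_wingersky(probs):
    """Conditional distribution of the number-correct score.

    probs: per-item probabilities of a correct response at one proficiency.
    Returns a list where entry s is P(score == s | proficiency).
    """
    dist = [1.0]  # before any item is added, score 0 has probability 1
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for score, mass in enumerate(dist):
            new[score] += mass * (1.0 - p)   # item answered incorrectly
            new[score + 1] += mass * p       # item answered correctly
        dist = new
    return dist


if __name__ == "__main__":
    # Hypothetical (a, b) parameters for four items.
    items = [(1.0, -1.0), (1.2, 0.0), (0.8, 0.5), (1.5, 1.0)]
    theta = 0.0
    probs = [p_correct_2pl(theta, a, b) for a, b in items]
    for score, mass in enumerate(lord_wingersky(probs)):
        print(score, round(mass, 4))
```

Each item is folded in once, so the work grows with the square of the number of items, which is far cheaper than enumerating every response pattern.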
Online pre-race education improves test scores for volunteers at a marathon.

Science.gov (United States)

Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

2017-09-01

This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5 df = 53 p online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.
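As a plausibility check on the paired comparison reported above, a paired t statistic can be recomputed from the summary figures alone (mean difference 3.8, standard deviation 2.7, 54 paired subjects). The Python sketch below assumes that the reported ± value is the standard deviation of the paired differences, which the abstract does not state explicitly; the function name is hypothetical.

```python
import math


def paired_t_from_summary(mean_diff, sd_diff, n):
    """Paired t statistic and degrees of freedom from summary statistics.

    Assumes sd_diff is the standard deviation of the paired differences.
    """
    standard_error = sd_diff / math.sqrt(n)
    return mean_diff / standard_error, n - 1


if __name__ == "__main__":
    t, df = paired_t_from_summary(mean_diff=3.8, sd_diff=2.7, n=54)
    print(round(t, 2), df)
```

Under that assumption the statistic comes out at about 10.3 on 53 degrees of freedom, close to the reported t = 10.5; the small gap could simply reflect rounding of the published mean and standard deviation.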
A Psychometric Evaluation of the Threadgold Communication Tool for Persons with Dementia

Directory of Open Access Journals (Sweden)

Benedicte Sørensen Strøm

2016-04-01

Background: The objective of this study was to investigate the psychometric properties of the Threadgold Communication Tool (TCT). Method: Internal consistency reliability was measured using Cronbach's α coefficient and inter-item correlation. Test-retest was performed to examine the instrument's stability. Exploratory principal component analysis (PCA) with oblimin rotation was carried out to evaluate construct validity. Finally, the score on each item of the TCT was correlated with the person's Mini Mental State Examination (MMSE) and Barthel Index of activities of daily living scores. Results: A total of 51 persons participated, with a mean age of 86.7 (SD 6.6) years, of whom 46 were women with moderate-to-severe dementia [mean MMSE score 7.5 (SD 6.7)]. There were two measurement points 2 weeks apart. The results showed a satisfactory level for internal consistency and a high test-retest reliability (r = 0.76). The corrected item-total correlation ranged between 0.50 and 0.87, and a two-factor structure was revealed at the PCA. 'Vocalizing' seemed to measure another aspect of communication and was the only item which was negatively loaded. Conclusion: Despite the low sample size in this study, the results revealed the TCT as a reliable and valid instrument, suitable for measuring communication among people with dementia. We suggest clarifying the understanding of 'vocalizing' before considering removing it from the scale.
The Psychometric Parameters of the Farsi Form of the Arabic Scale of Death Anxiety.

Science.gov (United States)

Dadfar, Mahboubeh; Abdel-Khalek, Ahmed M; Lester, David; Atef Vahid, Mohammad Kazem

2017-01-01

The aim of this study was to describe the psychometric properties of the Farsi Form of the Arabic Scale of Death Anxiety (ASDA). The original scale was first translated into Farsi by language experts using the back translation procedure and then administered to a total of 252 Iranian college students and 52 psychiatric outpatients from psychiatric and psychological clinics. The one-week test-retest reliability of the Farsi version in a sample of college students was 0.78, indicating good temporal stability and corroborating the trait-like nature of scores. Cronbach's α was 0.90 for the college students and 0.92 for the psychiatric outpatients, indicating high internal consistency. Scale scores correlated 0.46 with Death Obsession Scale scores, 0.56 with Death Depression Scale scores, 0.41 with Death Anxiety Scale scores, and 0.40 with Wish to be Dead Scale scores, indicating good construct and criterion-related validity. A principal component analysis with a Varimax rotation yielded four factors in the sample of Iranian college students, indicating a lack of homogeneity in the content of the scale. Male students obtained a significantly higher mean score than did females. It was concluded that the Farsi ASDA had good internal consistency, temporal stability, criterion-related validity, and a factor structure reflecting important features of death anxiety. In general, the Farsi ASDA could be recommended for use in research on death anxiety among Iranian college students and psychiatric outpatients.
The Psychometric Parameters of the Farsi Form of the Arabic Scale of Death Anxiety

Directory of Open Access Journals (Sweden)

Mahboubeh Dadfar

2017-01-01

The aim of this study was to describe the psychometric properties of the Farsi Form of the Arabic Scale of Death Anxiety (ASDA). The original scale was first translated into Farsi by language experts using the back translation procedure and then administered to a total of 252 Iranian college students and 52 psychiatric outpatients from psychiatric and psychological clinics. The one-week test-retest reliability of the Farsi version in a sample of college students was 0.78, indicating good temporal stability and corroborating the trait-like nature of scores. Cronbach’s α was 0.90 for the college students and 0.92 for the psychiatric outpatients, indicating high internal consistency. Scale scores correlated 0.46 with Death Obsession Scale scores, 0.56 with Death Depression Scale scores, 0.41 with Death Anxiety Scale scores, and 0.40 with Wish to be Dead Scale scores, indicating good construct and criterion-related validity. A principal component analysis with a Varimax rotation yielded four factors in the sample of Iranian college students, indicating a lack of homogeneity in the content of the scale. Male students obtained a significantly higher mean score than did females. It was concluded that the Farsi ASDA had good internal consistency, temporal stability, criterion-related validity, and a factor structure reflecting important features of death anxiety. In general, the Farsi ASDA could be recommended for use in research on death anxiety among Iranian college students and psychiatric outpatients.
Measuring resilience after spinal cord injury: Development, validation and psychometric characteristics of the SCI-QOL Resilience item bank and short form.

Science.gov (United States)

Victorson, David; Tulsky, David S; Kisala, Pamela A; Kalpakjian, Claire Z; Weiland, Brian; Choi, Seung W

2015-05-01

To describe the development and psychometric properties of the Spinal Cord Injury--Quality of Life (SCI-QOL) Resilience item bank and short form. Using a mixed-methods design, we developed and tested a resilience item bank through the use of focus groups with individuals with SCI and clinicians with expertise in SCI, cognitive interviews, and item-response theory based analytic approaches, including tests of model fit and differential item functioning (DIF). We tested a 32-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Department of Veterans Affairs medical center. A total of 717 individuals with SCI completed the Resilience items. A unidimensional model was observed (CFI=0.968; RMSEA=0.074) and measurement precision was good (theta range between -3.1 and 0.9). Ten items were flagged for DIF, however, after examination of effect sizes we found this to be negligible with little practical impact on score estimates. The final calibrated item bank resulted in 21 retained items. This study indicates that the SCI-QOL Resilience item bank represents a psychometrically robust measurement tool. Short form items are also suggested and computer adaptive tests are available.
The Test Anxiety Measure for Adolescents (TAMA): Examination of the Reliability and Validity of the Scores of a New Multidimensional Measure of Test Anxiety for Middle and High School Students

Science.gov (United States)

Lowe, Patricia A.

2014-01-01

A new multidimensional measure of test anxiety, the Test Anxiety Measure for Adolescents (TAMA), specifically designed for U.S. adolescents in Grades 6 to 12 was developed and its psychometric properties were examined. The TAMA consists of five scales (Cognitive Interference, Physiological Hyperarousal, Social Concerns, Task Irrelevant Behavior,…
Development and psychometric testing of the Supportive Supervisory Scale.

Science.gov (United States)

McGilton, Katherine S

2010-06-01

To describe the development and psychometric testing of the Supportive Supervisory Scale (SSS). The development of the items of the scale was based on Winnicott's relationship theory and on focus groups with 26 healthcare aides (HCAs) and 30 supervisors from six long-term care (LTC) facilities in Ontario, Canada. Content validity of the 15-item instrument was established by a panel of experts. Based on a secondary analysis of data collected from 222 HCAs in 10 LTC facilities in Ontario, Canada, the SSS was subjected to principal components analysis with oblique rotation. A two-factor solution was accepted, which is consistent with the theoretical conceptualization of the instrument. Factor I was labeled Respects Uniqueness and Factor II was labeled Being Reliable. Internal consistency of Factor I was .95, and that of Factor II was .91. Discriminant validity was also established. The focus groups revealed that "being available to staff" while "recognizing the HCA as an individual, and taking a moment to get to know them" was essential to feeling supported by their supervisor. The SSS is a reliable and valid measure of the supportiveness of supervisors working in LTC facilities. At the core of supportive supervision is the supervisor's ability to develop and maintain positive relationships with each HCA. It is through respecting the uniqueness of each HCA and being reliable that supervisor-HCA relationships can flourish. Supportive leadership in LTC settings is a major contributor to HCAs' job satisfaction and retention and to quality of patient care. Therefore, a tool developed and tested to measure supervisors' supportive capacities in LTC is essential for evaluating the effectiveness of supervisors in these environments.
Psychometric Investigation of the Raven's Colored Progressive Matrices Test in a Sample of Preschool Children.

Science.gov (United States)

Lúcio, Patrícia Silva; Cogo-Moreira, Hugo; Puglisi, Marina; Polanczyk, Guilherme Vanoni; Little, Todd D

2017-11-01

The present study investigated the psychometric properties of the Raven's Colored Progressive Matrices (CPM) test in a sample of preschoolers from Brazil (n = 582; age: mean = 57 months, SD = 7 months; 46% female). We investigated the plausibility of unidimensionality of the items (confirmatory factor analysis) and differential item functioning (DIF) for sex and age (multiple indicators multiple causes method). We tested four unidimensional models and the one with the best-fit index was a reduced form of the Raven's CPM. The DIF analysis was carried out with the reduced form of the test. A few items presented DIF (two for sex and one for age), confirming that the Raven's CPM items are mostly measurement invariant. There was no effect of sex on the general factor, but increasing age was associated with higher values of the g factor. Future research should indicate if the reduced form is suitable for evaluating the general ability of preschoolers.

Psychometric properties of three measures assessing advanced theory of mind: Evidence from people with schizophrenia.

Science.gov (United States)

Chen, Kuan-Wei; Lee, Shih-Chieh; Chiang, Hsin-Yu; Syu, Ya-Cing; Yu, Xiao-Xuan; Hsieh, Ching-Lin

2017-11-01

Patients with schizophrenia tend to have deficits in advanced Theory of Mind (ToM). The "Reading the mind in the eyes" test (RMET), the Faux Pas Task, and the Strange Stories are commonly used for assessing advanced ToM. However, most of the psychometric properties of these 3 measures in patients with schizophrenia are unknown. The aims of this study were to validate the psychometric properties of the 3 advanced ToM measures in patients with schizophrenia, including: (1) test-retest reliability; (2) random measurement error; (3) practice effect; (4) concurrent validity; and (5) ecological validity. We recruited 53 patients with schizophrenia, who completed the 3 measures twice, 4 weeks apart. The Revised Social Functioning Scale-Taiwan short version (R-SFST) was completed within 3 days of first session of assessments. We found that the intraclass correlation coefficients of the RMET, Strange Stories, and Faux Pas Task were 0.24, 0.5, and 0.76. All 3 advanced ToM measures had large random measurement error, trivial to small practice effects, poor concurrent validity, and low ecological validity. We recommend that the scores of the 3 advanced ToM measures be interpreted with caution because these measures may not provide reliable and valid results on patients' advanced ToM abilities.
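To see why intraclass correlation coefficients of this size imply large random measurement error, one commonly used relation is SEM = SD × sqrt(1 − ICC), where SEM is the standard error of measurement and SD is the between-person standard deviation. The Python sketch below applies that relation to the three reported coefficients and expresses the SEM as a fraction of the SD. Whether the study quantified measurement error in exactly this way is not stated in the abstract, so this is only an illustrative calculation with hypothetical function names.

```python
import math


def sem_as_fraction_of_sd(icc):
    """Standard error of measurement divided by the score SD.

    Uses the common relation SEM = SD * sqrt(1 - ICC).
    """
    return math.sqrt(1.0 - icc)


if __name__ == "__main__":
    reported_icc = {"RMET": 0.24, "Strange Stories": 0.5, "Faux Pas Task": 0.76}
    for measure, icc in reported_icc.items():
        print(measure, round(sem_as_fraction_of_sd(icc), 3))
```

With an ICC of 0.24, the measurement error is nearly as large as the spread between people (about 0.87 SD), whereas an ICC of 0.76 still leaves error of roughly half an SD.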
CONSTRUCT VALIDITY AND SCORING METHODS OF THE WORLD HEALTH ORGANIZATION- HEALTH AND WORK PERFORMANCE QUESTIONNAIRE AMONG WORKERS WITH ARTHRITIS AND RHEUMATOLOGICAL CONDITIONS

Science.gov (United States)

AlHeresh, Rawan; LaValley, Michael P.; Coster, Wendy; Keysor, Julie J.

2017-01-01

Objective: To evaluate construct validity and scoring methods of the World Health Organization Health and Work Performance Questionnaire (HPQ) for people with arthritis. Methods: Construct validity was examined through hypothesis testing using the recommended guidelines of the Consensus-based Standards for the selection of health Measurement Instruments (COSMIN). Results: The HPQ using the absolute scoring method showed moderate construct validity, as 4 of the 7 hypotheses were met. The HPQ using the relative scoring method had weak construct validity, as only one of the 7 hypotheses was met. Conclusion: The absolute scoring method for the HPQ is superior in construct validity to the relative scoring method in assessing work performance among people with arthritis and related rheumatic conditions; however, more research is needed to further explore other psychometric properties of the HPQ. PMID:28598938
Construct Validity and Scoring Methods of the World Health Organization: Health and Work Performance Questionnaire Among Workers With Arthritis and Rheumatological Conditions.

Science.gov (United States)

AlHeresh, Rawan; LaValley, Michael P; Coster, Wendy; Keysor, Julie J

2017-06-01

To evaluate construct validity and scoring methods of the World Health Organization Health and Work Performance Questionnaire (HPQ) for people with arthritis. Construct validity was examined through hypothesis testing using the recommended guidelines of the Consensus-based Standards for the selection of health Measurement Instruments (COSMIN). The HPQ using the absolute scoring method showed moderate construct validity, as four of the seven hypotheses were met. The HPQ using the relative scoring method had weak construct validity, as only one of the seven hypotheses was met. The absolute scoring method for the HPQ is superior in construct validity to the relative scoring method in assessing work performance among people with arthritis and related rheumatic conditions; however, more research is needed to further explore other psychometric properties of the HPQ.
Elders Health Empowerment Scale: Spanish adaptation and psychometric analysis.

Science.gov (United States)

Serrani Azcurra, Daniel Jorge Luis

2014-01-01

Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. Important as the concept is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish-speaking people. The aim was to translate and adapt the Health Empowerment Scale (HES) for a sample of Spanish-speaking older adults and to perform its psychometric validation. The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and psychometric analyses were conducted on 648 urban-dwelling seniors. The HES had acceptable internal consistency, with a Cronbach's α of 0.89. Convergent validity was supported by significant Pearson correlations between the HES total and item scores and the General Self Efficacy Scale (r = 0.77), Swedish Rheumatic Disease Empowerment Scale (r = 0.69) and Making Decisions Empowerment Scale (r = 0.70). Construct validity was evaluated using item analysis, a split-half test and corrected item-to-total correlation coefficients, with good internal consistency (α > 0.8). Content validity was supported by Scale and Item Content Validity Indices of 0.98 and 1.0, respectively. The HES had acceptable face validity and reliability coefficients, which, added to its ease of administration and users' unbiased comprehension, could make it a suitable tool for evaluating elders' outpatient empowerment-based medical education programs.
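The split-half approach mentioned in the abstract above is usually paired with the Spearman-Brown correction, which steps the correlation between two half-tests up to an estimate of full-length reliability: r_full = 2r / (1 + r). The Python sketch below shows that correction with a made-up half-test correlation; the abstract does not report the study's own split-half value, and the function name is hypothetical.

```python
def spearman_brown(r_half):
    """Full-test reliability estimated from the correlation of two halves."""
    return 2.0 * r_half / (1.0 + r_half)


if __name__ == "__main__":
    # Hypothetical correlation between the two halves of a scale.
    print(round(spearman_brown(0.80), 3))
```

The correction always raises a positive half-test correlation, reflecting the fact that a test twice as long is expected to be more reliable than either half.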
Using Raters from India to Score a Large-Scale Speaking Test

Science.gov (United States)

Xi, Xiaoming; Mollaun, Pam

2011-01-01

We investigated the scoring of the Speaking section of the Test of English as a Foreign Language™ Internet-based (TOEFL iBT®) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…
Factor structure and psychometric properties of a Spanish translation of the Body Appreciation Scale-2 (BAS-2).

Science.gov (United States)

Swami, Viren; García, Antonio Alías; Barron, David

2017-09-01

We examined the psychometric properties of a Spanish translation of the Body Appreciation Scale-2 (BAS-2) in a community sample of 411 women and 389 men in Almería, Spain. Participants completed the 10-item BAS-2 along with measures of appearance evaluation, body areas satisfaction, self-esteem, life satisfaction, and self-reported body mass index (BMI). Exploratory factor analyses with one split-half subsample revealed that BAS-2 scores had a one-dimensional factor structure in women and men. Confirmatory factor analysis with a second split-half subsample showed the one-dimensional factor structure had acceptable fit and was invariant across sex. There were no significant sex differences in BAS-2 scores. BAS-2 scores were significantly and positively correlated with appearance evaluation, body areas satisfaction, self-esteem, and life satisfaction. Body appreciation was significantly and negatively correlated with BMI in men, but associations in women were only significant in the second subsample. Results suggest that the Spanish BAS-2 has adequate psychometric properties.
Determining the Sensitivity of CAT-ASVAB (Computerized Adaptive Testing- Armed Services Vocational Aptitude Battery) Scores to Changes in Item Response Curves with the Medium of Administration

Science.gov (United States)

1986-08-01

most examinees. Therefore it appears psychometrically acceptable for the CAT-ASVAB project to proceed without item recalibration based on...MEMORANDUM DETERMINING THE SENSITIVITY OF CAT-ASVAB SCORES TO CHANGES IN ITEM RESPONSE CURVES WITH THE MEDIUM OF ADMINISTRATION D. R. Divgi...Subj: Center for Naval Analyses Research Memorandum 86-189 End: (1) CNA Research Memorandum 86-189, "Determining the Sensitivity of CAT-ASVAB
Psychometric Properties and Normative Data of the Zuckerman-Kuhlman Personality Questionnaire in a Psychiatric Outpatient Sample.

Science.gov (United States)

Martínez Ortega, Yolanda; Gomà-I-Freixanet, Montserrat; Valero, Sergi

2017-01-01

The Zuckerman-Kuhlman Personality Questionnaire (ZKPQ; Zuckerman, Kuhlman, Joireman, Teta, & Kraft, 1993 ) was designed for the assessment of personality. The goal of this work was to determine the psychometric properties of the ZKPQ, as well as to establish normative data by gender and age in an outpatient sample attending primary mental health care services. We administered the questionnaire to 314 participants (34.7% males) 18 to 81 years old. The most prevalent primary diagnoses were mood (37.9%) and adjustment disorders (35.0%). Concerning the psychometric properties of the ZKPQ, the pattern of internal consistencies was similar to that previously found among general population, student, or clinical samples. Regarding gender differences, a general pattern was found, with women scoring higher on neuroticism and sociability, and lower on aggression-hostility. As for age, in general, scores declined with age. Norm-based decision making has the potential for significant and long-lasting consequences, and the quality of decisions based on score comparisons can be improved when scores are compared to norms fitted to the group of reference. The availability of the ZKPQ norms by gender and age in mental health care will benefit the accuracy of assessment and therapeutic decision making, providing more effective treatment planning overall.
The pornography craving questionnaire: psychometric properties.

Science.gov (United States)

Kraus, Shane; Rosenberg, Harold

2014-04-01

Despite the prevalence of pornography use, and recent conceptualization of problematic use as an addiction, we could find no published scale to measure craving for pornography. Therefore, we conducted three studies employing young male pornography users to develop and evaluate such a questionnaire. In Study 1, we had participants rate their agreement with 20 potential craving items after reading a control script or a script designed to induce craving to watch pornography. We dropped eight items because of low endorsement. In Study 2, we revised both the questionnaire and cue exposure stimuli and then evaluated several psychometric properties of the modified questionnaire. Item loadings from a principal components analysis, a high internal consistency reliability coefficient, and a moderate mean inter-item correlation supported interpreting the 12 revised items as a single scale. Correlations of craving scores with preoccupation with pornography, sexual history, compulsive internet use, and sensation seeking provided support for convergent validity, criterion validity, and discriminant validity, respectively. The enhanced imagery script did not impact reported craving; however, more frequent users of pornography reported higher craving than less frequent users regardless of script condition. In Study 3, craving scores demonstrated good one-week test-retest reliability and predicted the number of times participants used pornography during the following week. This questionnaire could be applied in clinical settings to plan and evaluate therapy for problematic users of pornography and as a research tool to assess the prevalence and contextual triggers of craving among different types of pornography users.
Cross-cultural adaptation and validation of the French version of the Knee injury and Osteoarthritis Outcome Score (KOOS) in knee osteoarthritis patients

DEFF Research Database (Denmark)

Ornetti, P; Parratte, S; Gossec, L

2008-01-01

OBJECTIVE: To adapt the Knee injury and Osteoarthritis Outcome Score (KOOS) into French and to evaluate the psychometric properties of this new version. METHODS: The French version of the KOOS was developed according to cross-cultural guidelines by using the "translation-back translation" method...... to ensure content validity. KOOS data were then obtained in patients with symptomatic knee osteoarthritis (OA). The translated questionnaire was evaluated in two knee OA population groups, one with no indication for joint replacement (medicine), and the other waiting for joint replacement (surgery......). The psychometric properties evaluated were feasibility: percentage of responses, floor and ceiling effects; construct validity: internal consistency using Cronbach's alpha, correlations with osteoarthritis knee and hip quality of life domains using Spearman's rank test, and known group comparison between medicine...
Group differences in the heritability of items and test scores

NARCIS (Netherlands)

Wicherts, J.M.; Johnson, W.

2009-01-01

It is important to understand potential sources of group differences in the heritability of intelligence test scores. On the basis of a basic item response model we argue that heritabilities which are based on dichotomous item scores normally do not generalize from one sample to the next. If groups
Governing by Testing: Circulation, Psychometric Knowledge, Experts and the "Alliance for Progress" in Latin America during the 1960s and 1970s

Science.gov (United States)

Alarcón, Cristina

2015-01-01

This paper analyzes the activities, members, and effects of an inter-American expert network for the diffusion of psychometric knowledge, specifically of standardized aptitude testing for university admission in Latin America during the 1960s and 1970s. Within the framework of educational transfer studies, the role of international,…
Psychometric Features of the General Aptitude Test-Verbal Part (GAT-V): A Large-Scale Assessment of High School Graduates in Saudi Arabia

Science.gov (United States)

Dimitrov, Dimiter M.; Shamrani, Abdul Rahman

2015-01-01

This study examines the psychometric features of a General Aptitude Test-Verbal Part, which is used with assessments of high school graduates in Saudi Arabia. The data supported a bifactor model, with one general factor and three content domains (Analogy, Sentence Completion, and Reading Comprehension) as latent aspects of verbal aptitude.
Psychometrically equivalent bisyllabic words for speech recognition threshold testing in Vietnamese.

Science.gov (United States)

Harris, Richard W; McPherson, David L; Hanson, Claire M; Eggett, Dennis L

2017-08-01

This study identified, digitally recorded, edited and evaluated 89 bisyllabic Vietnamese words with the goal of identifying homogeneous words that could be used to measure the speech recognition threshold (SRT) in native talkers of Vietnamese. Native male and female talker productions of 89 Vietnamese bisyllabic words were recorded, edited and then presented at intensities ranging from -10 to 20 dBHL. Logistic regression was used to identify the best words for measuring the SRT. Forty-eight words were selected and digitally edited to have 50% intelligibility at a level equal to the mean pure-tone average (PTA) for normally hearing participants (5.2 dBHL). Twenty normally hearing native Vietnamese participants listened to and repeated bisyllabic Vietnamese words at intensities ranging from -10 to 20 dBHL. A total of 48 male and female talker recordings of bisyllabic words with steep psychometric functions (>9.0%/dB) were chosen for the final bisyllabic SRT list. Only words homogeneous with respect to threshold audibility with steep psychometric function slopes were chosen for the final list. Digital recordings of bisyllabic Vietnamese words are now available for use in measuring the SRT for patients whose native language is Vietnamese.
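As a rough illustration of how a logistic fit yields the two quantities this abstract relies on (the 50% threshold and the slope of the psychometric function), the following minimal Python sketch assumes a fitted model of the form p(x) = 1 / (1 + exp(-(a + b*x))), where x is the presentation level in dB HL. The coefficients a and b, the function names, and the example values are hypothetical; the abstract does not state the study's own regression parametrization.

```python
import math

def percent_correct(level_db, a, b):
    """Predicted proportion correct at a presentation level (logistic model)."""
    return 1.0 / (1.0 + math.exp(-(a + b * level_db)))

def threshold_50(a, b):
    """Level (dB) at which the model predicts 50% intelligibility."""
    return -a / b

def slope_at_50(a, b):
    """Slope of the logistic curve at its 50% point, in percentage points per dB."""
    return 100.0 * b / 4.0

# Hypothetical coefficients for one word, for illustration only (not study data).
a, b = -2.08, 0.4
print(threshold_50(a, b), slope_at_50(a, b))
```

Under this assumed parametrization, a word with a steeper slope moves from rarely heard to almost always heard over a narrower range of levels, which is why steep, homogeneous words are preferred for threshold testing.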
Translation of the Manchester Clinical Supervision Scale (MCSS) into Danish and a preliminary psychometric validation

DEFF Research Database (Denmark)

Buus, Niels; Gonge, Henrik

2013-01-01

for the translation of the MCSS from English into Danish and to present a preliminary psychometric validation of the Danish version of the scale. Methods included a formal translation/back-translation procedure and statistical analyses. The sample consisted of MCSS scores from 139 Danish mental health nursing staff...
Psychometric properties of revised Thought-Action Fusion questionnaire (TAF-R) in an Iranian population.

Science.gov (United States)

Pourfaraj, Majid; Mohammadi, Nourallah; Taghavi, Mohammadreza

2008-12-01

The purpose of this study is to examine the psychometric properties of the Thought-Action Fusion revised scale (TAF-R; Amir, N., Freshman, M., Ramsey, B., Neary, E., & Brigidi, B. (2001). Thought-action fusion in individuals with OCD symptoms. Behaviour Research and Therapy, 39, 765-776) in a sample of 565 (321 female) students of Shiraz University. Factor analysis using varimax rotation yielded eight factors that explained 80% of the variance of the total scale. These factors are labeled: moral TAF, responsibility for positive thoughts, likelihood negative events, likelihood positive events, responsibility for negative thoughts, responsibility for harm avoidance, likelihood harm avoidance and likelihood self, respectively. The reliability coefficients of the total scale were calculated by two methods, internal consistency and test-retest, and were 0.81 and 0.61, respectively. Concurrent validity analyses showed that TAF-R scores correlate positively and significantly with responsibility, guilt and obsessive-compulsive symptoms. As expected, people with high obsessive-compulsive symptoms had higher TAF-R scores than those with low symptoms. Moreover, the correlations between subscales were low, whereas the correlations of the subscales with the TAF-R total score were moderate.
Combination of classical test theory (CTT) and item response theory (IRT) analysis to study the psychometric properties of the French version of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF).

Science.gov (United States)

Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric

2015-02-01

The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.
Initial development and psychometric testing of an instrument to measure the quality of children's end-of-life care.

Science.gov (United States)

Widger, Kimberley; Tourangeau, Ann E; Steele, Rose; Streiner, David L

2015-01-01

The field of pediatric palliative care is hindered by the lack of a well-defined, reliable, and valid method for measuring the quality of end-of-life care. The study purpose was to develop and test an instrument to measure mothers' perspectives on the quality of care received before, at the time of, and following a child's death. In Phase 1, key components of quality end-of-life care for children were synthesized through a comprehensive review of research literature. These key components were validated in Phase 2 and then extended through focus groups with bereaved parents. In Phase 3, items were developed to assess structures, processes, and outcomes of quality end-of-life care then tested for content and face validity with health professionals. Cognitive testing was conducted through interviews with bereaved parents. In Phase 4, bereaved mothers were recruited through 10 children's hospitals/hospices in Canada to complete the instrument, and psychometric testing was conducted. Following review of 67 manuscripts and 3 focus groups with 10 parents, 141 items were initially developed. The overall content validity index for these items was 0.84 as rated by 7 health professionals. Based on feedback from health professionals and cognitive testing with 6 parents, a 144-item instrument was finalized for further testing. In Phase 4, 128 mothers completed the instrument, 31 of whom completed it twice. Test-retest reliability, internal consistency, and construct validity were demonstrated for six subscales: Connect With Families, Involve Parents, Share Information With Parents, Share Information Among Health Professionals, Support Parents, and Provide Care at Death. Additional items with content validity were grouped in four domains: Support the Child, Support Siblings, Provide Bereavement Follow-up, and Structures of Care. Forty-eight items were deleted through psychometric testing, leaving a 95-item instrument. There is good initial evidence for the reliability and
Psychometric properties of the SCOFF questionnaire (Chinese version) for screening eating disorders in Hong Kong secondary school students: a cross-sectional study.

Science.gov (United States)

Leung, Sau Fong; Lee, Ka Li; Lee, Sze Man; Leung, Sik Chi; Hung, Wing Sze; Lee, Wai Leng; Leung, Yuen Yee; Li, Man Wai; Tse, Tak Kin; Wong, Hoi Kei; Wong, Yuen Ni

2009-02-01

Eating disorders are affecting an increasing number of high school students in Western and Asian countries. The availability of an effective screening tool is crucial for early detection and prompt intervention. The objective of this study was to examine the validity and reliability of the SCOFF questionnaire for screening eating disorders in Hong Kong high school students. This study adopted a cross-sectional design to examine the psychometric properties of the SCOFF questionnaire. A panel of 7 experts and 936 students of a high school participated in the study. The SCOFF questionnaire was translated into Chinese and back-translated into English to ensure linguistic equivalence. The panel of 7 experts was involved in the content validation of the SCOFF questionnaire. The Eating Disorder Examination-Questionnaire (EDE-Q) was used as the "reference standard" to assess its concurrent validity in the 936 students. Its reliability was examined by internal consistency and by the test-retest method at a 2-week interval with 38 students. The SCOFF questionnaire achieved an agreement of 86-100% among the experts for content relevance. Among the 812 students (86.8%) who responded to this study, SCOFF scores correlated significantly with global scores on the EDE-Q (r=0.5, P < …). Students identified by SCOFF as having eating disorders had significantly higher scores on the EDE-Q than those not identified as such by SCOFF. The SCOFF questionnaire demonstrated moderate test-retest reliability (ICC=0.66) and an acceptable internal consistency reliability (Cronbach's alpha=0.44-0.57) in comparison with previous studies. The SCOFF questionnaire has acceptable psychometric properties in the Chinese culture. It will be useful for detecting potential eating disorders and assisting health promotion activity.
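For readers unfamiliar with the internal-consistency statistic reported in this abstract, the sketch below computes Cronbach's alpha from item-level responses using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). It is a minimal illustration in Python, not the authors' analysis; the function name and the yes/no response data are hypothetical.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(responses[0])  # number of items
    # Sample variance of each item across respondents.
    item_vars = [variance([r[i] for r in responses]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(r) for r in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical yes/no (1/0) answers from six respondents to five items.
demo = [
    [1, 1, 1, 0, 1],
    [0, 0, 0, 0, 0],
    [1, 0, 1, 1, 0],
    [0, 0, 1, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 1, 0, 0, 0],
]
print(round(cronbach_alpha(demo), 2))
```

With only a handful of dichotomous items, as in a short screening questionnaire, alpha tends to be lower than for long scales, which is one reason short-form values are usually judged against other studies of the same instrument.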
Psychometric assessment of the Behavior and Attitudes Questionnaire for Healthy Habits: measuring parents' views on food and physical activity.

Science.gov (United States)

Henry, Beverly W; Smith, Thomas J; Ahmad, Saadia

2014-05-01

To assess parents' perspectives of their home environments to establish the validity of scores from the Behavior and Attitudes Questionnaire for Healthy Habits (BAQ-HH). In the present descriptive study, we surveyed a cross-sectional sample of parents of pre-school children. Questionnaire items developed in an iterative process with community-based programming addressed parents' knowledge/awareness, attitudes/concerns and behaviours about healthy foods and physical activity habits with 6-point rating scales. Exploratory and confirmatory factor analyses were used to psychometrically evaluate scores from the scales. English and Spanish versions of the BAQ-HH were administered at parent-teacher conferences for pre-school children at ten Head Start centres across a five-county agency in autumn 2010. From 672 families with pre-school children, 532 parents provided responses to the BAQ-HH (79 % response rate). The majority was female (83 %), Hispanic (66 %) or white (16 %), and ages ranged from 20 to 39 years (85 %). Exploratory and confirmatory analyses revealed a knowledge scale (seven items), an attitude scale (four items) and three behaviour subscales (three items each). Correlations were identified between parents' perceptions of home activities and reports of children's habits. Differences were identified by gender and ethnicity groupings. As a first step in psychometric testing, the dimensionality of each of the three scales (Knowledge, Attitudes and Behaviours) was identified and scale scores were related to other indicators of child behaviours and parents' demographic characteristics. This questionnaire offers a method to measure parents' views to inform planning and monitoring of obesity-prevention education programmes.

Reducing the number of options on multiple-choice questions: response time, psychometrics and standard setting.

Science.gov (United States)

Schneid, Stephen D; Armour, Chris; Park, Yoon Soo; Yudkowsky, Rachel; Bordage, Georges

2014-10-01

Despite significant evidence supporting the use of three-option multiple-choice questions (MCQs), these are rarely used in written examinations for health professions students. The purpose of this study was to examine the effects of reducing four- and five-option MCQs to three-option MCQs on response times, psychometric characteristics, and absolute standard setting judgements in a pharmacology examination administered to health professions students. We administered two versions of a computerised examination containing 98 MCQs to 38 Year 2 medical students and 39 Year 3 pharmacy students. Four- and five-option MCQs were converted into three-option MCQs to create two versions of the examination. Differences in response time, item difficulty and discrimination, and reliability were evaluated. Medical and pharmacy faculty judges provided three-level Angoff (TLA) ratings for all MCQs for both versions of the examination to allow the assessment of differences in cut scores. Students answered three-option MCQs an average of 5 seconds faster than they answered four- and five-option MCQs (36 seconds versus 41 seconds; p = 0.008). There were no significant differences in item difficulty and discrimination, or test reliability. Overall, the cut scores generated for three-option MCQs using the TLA ratings were 8 percentage points higher (p = 0.04). The use of three-option MCQs in a health professions examination resulted in a time saving equivalent to the completion of 16% more MCQs per 1-hour testing period, which may increase content validity and test score reliability, and minimise construct under-representation. The higher cut scores may result in higher failure rates if an absolute standard setting method, such as the TLA method, is used. The results from this study provide a cautious indication to health professions educators that using three-option MCQs does not threaten validity and may strengthen it by allowing additional MCQs to be tested in a fixed amount
The Psychometric Toolbox: An Excel Package for Use in Measurement and Psychometrics Courses

Science.gov (United States)

Ferrando, Pere J.; Masip-Cabrera, Antoni; Navarro-González, David; Lorenzo-Seva, Urbano

2017-01-01

The Psychometric Toolbox (PT) is a user-friendly, non-commercial package mainly intended to be used for instructional purposes in introductory courses of educational and psychological measurement, psychometrics and statistics. The PT package is organized in six separate modules or sub-programs: Data preprocessor (descriptive analyses and data…
The psychometric properties of a shortened corporate entrepreneurship assessment instrument

Directory of Open Access Journals (Sweden)

Renier Steyn

2017-08-01

Aim: The aim of this research was to evaluate the psychometric properties of a measure of entrepreneurial climate. Entrepreneurial climate was measured using a shortened version of the Hornsby, Kuratko and Zahra (2002) instrument, called the Corporate Entrepreneurship Assessment Instrument (CEAI). Making information on the psychometric properties of the instrument available directly relates to its utility. Setting: The setting was medium to large South African companies. A random sample of employees was drawn from 53 selected companies across South Africa, with 60 respondents per company (N = 3 180). Methods: A cross-sectional survey design was used. Several instruments were administered, including the shortened version of the CEAI. Cronbach’s alpha was used to test for reliability and several methods were used to test for validity. Correlation analysis was used to test for concurrent validity, convergent validity and divergent validity. Principal component factor analysis was used to test for factorial validity and a t-test to test for known-group validity. Results: The results showed that the reliability for the total score of the shortened version of the CEAI was acceptable at 0.758. The results also showed some evidence of concurrent validity, as well as homogeneity among the items. With regard to factorial validity, all items loaded in accordance with the subscales of the instrument. The measure was able to distinguish, as expected, between government organisations and private business entities, suggesting known-group validity. Convergent validity and divergent validity were also assessed. Interesting to note was that entrepreneurship climate correlates more with general employee attitude (e.g. employee engagement, R = 0.420, p < 0.001, and organisational commitment, R = 0.331, p < 0.001) than with self-reported innovation (R = 0.277, p < 0.001 and R = 0.267, p < 0.001). Contribution: This paper not only provided information on the reliability
Adapting tests of sign language assessment for other sign languages--a review of linguistic, cultural, and psychometric problems.

Science.gov (United States)

Haug, Tobias; Mann, Wolfgang

2008-01-01

Given the current lack of appropriate assessment tools for measuring deaf children's sign language skills, many test developers have used existing tests of other sign languages as templates to measure the sign language used by deaf people in their country. This article discusses factors that may influence the adaptation of assessment tests from one natural sign language to another. Two tests which have been adapted for several other sign languages are focused upon: the Test for American Sign Language and the British Sign Language Receptive Skills Test. A brief description is given of each test as well as insights from ongoing adaptations of these tests for other sign languages. The problems reported in these adaptations were found to be grounded in linguistic and cultural differences, which need to be considered for future test adaptations. Other reported shortcomings of test adaptation are related to the question of how well psychometric measures transfer from one instrument to another.
Psychometric analysis of the TRANSIT quality indicators for cardiovascular disease prevention in primary care.

Science.gov (United States)

Khanji, Cynthia; Bareil, Céline; Hudon, Eveline; Goudreau, Johanne; Duhamel, Fabie; Lussier, Marie-Thérèse; Perreault, Sylvie; Lalonde, Gilles; Turcotte, Alain; Berbiche, Djamal; Martin, Élisabeth; Lévesque, Lise; Gagnon, Marie-Mireille; Lalonde, Lyne

2017-12-01

To assess a selection of psychometric properties of the TRANSIT indicators. Using medical records, indicators were documented retrospectively during the 14 months preceding the end of the TRANSIT study. Primary care in Quebec, Canada. Indicators were documented in a random subsample (n = 123 patients) of the TRANSIT study population (n = 759). For every patient, the mean compliance to all indicators of a category (subscale score) and to the complete set of indicators (overall scale score) were established. To evaluate test-retest and inter-rater reliabilities, indicators were applied twice, two months apart, by the same evaluator and independently by different evaluators, respectively. To evaluate convergent validity, correlations between TRANSIT indicators, Burge et al. indicators and Institut national d'excellence en santé et en services sociaux (INESSS) indicators were examined. Test-retest reliability, inter-rater reliability, and convergent validity. Test-retest reliability, as measured by intraclass correlation coefficients (ICCs), was equal to 0.99 (0.99-0.99) for the overall scale score, while inter-rater reliability was equal to 0.95 (0.93-0.97) for the overall scale score. Convergent validity, as measured by Pearson's correlation coefficients, was equal to 0.77 (P < …) when TRANSIT indicators were compared to Burge et al. indicators and to 0.82 (P < …) when TRANSIT indicators were compared to INESSS indicators. Reliability was excellent except for eleven indicators, while convergent validity was strong except for domains related to the management of CVD risk factors. © The Author 2017. Published by Oxford University Press in association with the International Society for Quality in Health Care. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Psychometric Testing of the Persian Version of the Perceived Perioperative Competence Scale-Revised.

Science.gov (United States)

Ajorpaz, Neda Mirbagher; Tafreshi, Mansoureh Zagheri; Mohtashami, Jamileh; Zayeri, Farid; Rahemi, Zahra

2017-12-01

The clinical competence of nursing students in operating room (OR) is an important issue in nursing education. The purpose of this study was to evaluate the psychometric properties of the Persian Perceived Perioperative Competence Scale-Revised (PPCS-R) instrument. This cross-sectional study was conducted across 12 universities in Iran. The psychometric properties and factor structure of the PPCS-R for OR students was examined. Based on the results of factor analysis, seven items were removed from the original version of the scale. The fitness indices of the Persian scale include comparative fit index (CFI) = .90, goodness-of-fit-index (GFI) = .86, adjusted goodness-of-fit index (AGFI) = .90, normed fit index (NFI) = .84, and root mean square error of approximation (RMSEA) = .04. High validity and reliability indicated the scale's value for measuring perceived perioperative competence of Iranian OR students.
Psychometric analysis of the PTSD Checklist-5 (PCL-5) among treatment-seeking military service members.

Science.gov (United States)

Wortmann, Jennifer H; Jordan, Alexander H; Weathers, Frank W; Resick, Patricia A; Dondanville, Katherine A; Hall-Clark, Brittany; Foa, Edna B; Young-McCaughan, Stacey; Yarvis, Jeffrey S; Hembree, Elizabeth A; Mintz, Jim; Peterson, Alan L; Litz, Brett T

2016-11-01

The Posttraumatic Stress Disorder Checklist (PCL-5; Weathers et al., 2013) was recently revised to reflect the changed diagnostic criteria for posttraumatic stress disorder (PTSD) in the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5; American Psychiatric Association, 2013). We investigated the psychometric properties of PCL-5 scores in a large cohort (N = 912) of military service members seeking PTSD treatment while stationed in garrison. We examined the internal consistency, convergent and discriminant validity, and DSM-5 factor structure of PCL-5 scores, their sensitivity to clinical change relative to PTSD Symptom Scale-Interview (PSS-I; Foa, Riggs, Dancu, & Rothbaum, 1993) scores, and their diagnostic utility for predicting a PTSD diagnosis based on various measures and scoring rules. PCL-5 scores exhibited high internal consistency. There was strong agreement between the order of hypothesized and observed correlations among PCL-5 and criterion measure scores. The best-fitting structural model was a 7-factor hybrid model (Armour et al., 2015), which demonstrated closer fit than all other models evaluated, including the DSM-5 model. The PCL-5's sensitivity to clinical change, pre- to posttreatment, was comparable with that of the PSS-I. Optimally efficient cut scores for predicting PTSD diagnosis were consistent with prior research with service members (Hoge, Riviere, Wilk, Herrell, & Weathers, 2014). The results indicate that the PCL-5 is a psychometrically sound measure of DSM-5 PTSD symptoms that is useful for identifying provisional PTSD diagnostic status, quantifying PTSD symptom severity, and detecting clinical change over time in PTSD symptoms among service members seeking treatment. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Testing the applicability of the SASS5 scoring procedure for ...

African Journals Online (AJOL)

A study was undertaken between 29th January and 17th February 2004 to test the applicability of the South African Scoring System Version 5 (SASS5) scoring and calculation procedure in nutrient-enriched palustrine wetlands in the midlands of KwaZulu-Natal, South Africa. Four reference wetlands and three dairy-effluent ...
Evaluating the Predictive Validity of Graduate Management Admission Test Scores

Science.gov (United States)

Sireci, Stephen G.; Talento-Miller, Eileen

2006-01-01

Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test® (GMAT®) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…
Health-related quality of life questionnaire for polycystic ovary syndrome (PCOSQ-50): development and psychometric properties.

Science.gov (United States)

Nasiri-Amiri, Fatemeh; Ramezani Tehrani, Fahimeh; Simbar, Masoumeh; Montazeri, Ali; Mohammadpour, Reza Ali

2016-07-01

The determinants of the health-related quality of life of women with polycystic ovary syndrome are not fully understood. The aim of this study was to develop a comprehensive instrument to assess the health-related quality of life of Iranian women with PCOS and to assess its psychometric properties. We used a mixed-method, sequential, exploratory design including both qualitative [in-depth interview to define the components of health-related quality of life questionnaire (PCOSQ)] and quantitative approaches (to assess the psychometric properties of PCOSQ). A preliminary questionnaire was developed including 147 items which emerged from the qualitative phase of the study. Considering the optimum cutoff points for content validity ratio (CVR), content validity index (CVI), and impact score, items of the preliminary questionnaire were reduced from 147 to 88 items. Finally, by excluding highly correlated items using the exploratory factor analysis, a 50-item questionnaire was obtained. The Kaiser criteria (eigenvalues >1) and Scree plot tests demonstrated that six factors were optimum with an estimated 47.3 % of variance. Assessment of the psychometric properties of the questionnaire demonstrated a mean CVI = 0.92, CVR = 0.91, Cronbach's alpha for whole questionnaire = 0.88 (0.61-0.88 for subscales), Spearman's correlation coefficients of test-retest = 0.75, and the intra-class correlation coefficient for the PCOS questionnaire subscales ranging from 0.57 to 0.88. Eventually the final questionnaire included 50 items in six domains, 'psychosocial and emotional,' 'fertility,' 'sexual function,' 'obesity and menstrual disorders,' 'hirsutism,' and 'coping' and rated on a 5-point Likert scale. The PCOSQ-50 is a valid and reliable instrument for the assessment of quality of life of women with PCOS, capable of assessing some obscure aspects overlooked by previous HRQL questionnaires.
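The item-reduction step in this abstract depends on two expert-panel indices. As a hedged sketch, the Python below computes Lawshe's content validity ratio, CVR = (n_e - N/2) / (N/2), where n_e is the number of experts rating an item "essential" and N is the panel size, and an item-level content validity index taken as the share of experts rating the item relevant. The 4-point relevance scale, the cutoff of 3, the function names, and the panel data are assumptions for illustration; the abstract does not give the study's rating scales or cutoffs.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR: (n_e - N/2) / (N/2), ranging from -1 to 1."""
    half = n_experts / 2
    return (n_essential - half) / half

def item_cvi(relevance_ratings, relevant_from=3):
    """Item-level CVI: share of experts rating the item as relevant
    (assumed here to mean 3 or 4 on a 4-point relevance scale)."""
    hits = sum(1 for r in relevance_ratings if r >= relevant_from)
    return hits / len(relevance_ratings)

# Hypothetical panel: 9 of 10 experts call the item essential,
# and five experts give these relevance ratings.
print(content_validity_ratio(9, 10))
print(item_cvi([4, 3, 4, 2, 4]))
```

Items falling below a chosen cutoff on either index are candidates for removal, which is the kind of screening that reduces a large preliminary item pool before factor analysis.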
Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

Science.gov (United States)

Zou, Xiao-Ling; Chen, Yan-Min

2016-01-01

The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…
Validity, Reliability, and the Questionable Role of Psychometrics in Plastic Surgery

Science.gov (United States)

2014-01-01

Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Validity, Reliability, and the Questionable Role of Psychometrics in Plastic Surgery

Directory of Open Access Journals (Sweden)

Eric Swanson, MD

2014-06-01

Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
The Multimedia Piers-Harris Children's Self-Concept Scale 2: Its Psychometric Properties, Equivalence with the Paper-and-Pencil Version, and Respondent Preferences

Science.gov (United States)

Flahive, Mon-hsin Wang; Chuang, Ying-Chih; Li, Chien-Mo

2015-01-01

A multimedia version of Piers-Harris Children's Self-Concept Scale 2 (Piers-Harris 2) was created with audio and cartoon animation to facilitate the measurement of self-concept among younger children. This study aimed to assess the psychometric qualities of the computer version of Piers-Harris 2 scores, examine its score equivalence with the paper-and-pencil version, and survey the respondent preference of the two versions. Two hundred and forty eight Taiwanese students from the first to fourth grade were recruited. In regard to the psychometric properties, high internal consistency (α = .91) was found for the total score of multimedia Piers-Harris 2. High interscale correlations (.77 to .83) of the multimedia Piers-Harris 2 scores and the results of confirmatory factor analysis suggested the multimedia Piers-Harris 2 contained good structural characteristics. The scores of the multimedia Piers-Harris 2 also had significant correlations with the scores of the Elementary School Children’s Self Concept Scale. The equality of convergence and criterion-related validities of Piers-Harris 2 scores for the multimedia and paper-and-pencil versions and the results of ICCs between the scores of the multimedia and paper-and-pencil Piers-Harris 2 suggested their high level of equivalence. Participants showed more positive attitudes towards the multimedia version. PMID:26252499
Validation of online psychometric instruments for common mental health disorders: a systematic review.

Science.gov (United States)

van Ballegooijen, Wouter; Riper, Heleen; Cuijpers, Pim; van Oppen, Patricia; Smit, Johannes H

2016-02-25

Online questionnaires for measuring common mental health disorders such as depression and anxiety disorders are increasingly used. The psychometrics of several pen-and-paper questionnaires have been re-examined for online use and new online instruments have been developed and tested for validity as well. This study aims to review and synthesise the literature on this subject and provide a framework for future research. We searched Medline and PsycINFO for psychometric studies on online instruments for common mental health disorders and extracted the psychometric data. Studies were coded and assessed for quality by independent raters. We included 56 studies on 62 online instruments. For common instruments such as the CES-D, MADRS-S and HADS there is mounting evidence for adequate psychometric properties. Further results are scattered over different instruments and different psychometric characteristics. Few studies included patient populations. We found at least one online measure for each of the included mental health disorders and symptoms. A small number of online questionnaires have been studied thoroughly. This study provides an overview of online instruments to refer to when choosing an instrument for assessing common mental health disorders online, and can structure future psychometric research.
Does breastfeeding contribute to the racial gap in reading and math test scores?

Science.gov (United States)

Peters, Kristen E; Huang, Jin; Vaughn, Michael G; Witko, Christopher

2013-10-01

The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0-12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented. Copyright © 2013 Elsevier Inc. All rights reserved.
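As a rough worked reading of those figures, the reported percentage reductions can be applied to the reported raw gaps. This assumes the 17% and 9% reductions apply directly to the 10.6- and 10.9-point gaps; the abstract does not state the adjusted gaps, so the results below are illustrative only.

```latex
\[
\text{reading: } 10.6 \times (1 - 0.17) \approx 8.8 \text{ points}, \qquad
\text{math: } 10.9 \times (1 - 0.09) \approx 9.9 \text{ points}
\]
```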
Validation of new prognostic and predictive scores by sequential testing approach

International Nuclear Information System (INIS)

Nieder, Carsten; Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid

2010-01-01

Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)
Validation of new prognostic and predictive scores by sequential testing approach

Energy Technology Data Exchange (ETDEWEB)

Nieder, Carsten [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway); Inst. of Clinical Medicine, Univ. of Tromso (Norway); Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway)

2010-03-15

Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)
Accountancy, teaching methods, sex, and American College Test scores.

Science.gov (United States)

Heritage, J; Harper, B S; Harper, J P

1990-10-01

This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.

Psychometric evaluation of Persian Nomophobia Questionnaire: Differential item functioning and measurement invariance across gender.

Science.gov (United States)

Lin, Chung-Ying; Griffiths, Mark D; Pakpour, Amir H

2018-03-01

Background and aims: Research examining problematic mobile phone use has increased markedly over the past 5 years and has been related to "no mobile phone phobia" (so-called nomophobia). The 20-item Nomophobia Questionnaire (NMP-Q) is the only instrument that assesses nomophobia with an underlying theoretical structure and robust psychometric testing. This study aimed to confirm the construct validity of the Persian NMP-Q using Rasch and confirmatory factor analysis (CFA) models. Methods: After ensuring the linguistic validity, Rasch models were used to examine the unidimensionality of each Persian NMP-Q factor among 3,216 Iranian adolescents and CFAs were used to confirm its four-factor structure. Differential item functioning (DIF) and multigroup CFA were used to examine whether males and females interpreted the NMP-Q similarly, including item content and NMP-Q structure. Results: Each factor was unidimensional according to the Rasch findings, and the four-factor structure was supported by CFA. Two items did not quite fit the Rasch models (Item 14: "I would be nervous because I could not know if someone had tried to get a hold of me;" Item 9: "If I could not check my smartphone for a while, I would feel a desire to check it"). No DIF items were found across gender and measurement invariance was supported in multigroup CFA across gender. Conclusions: Due to the satisfactory psychometric properties, it is concluded that the Persian NMP-Q can be used to assess nomophobia among adolescents. Moreover, NMP-Q users may compare its scores between genders in the knowledge that there are no score differences contributed by different understandings of NMP-Q items.
Reduce, Reuse, Recycle: The Longitudinal Value of Local Cut Scores Using State Test Data

Science.gov (United States)

Nelson, Peter M.; Van Norman, Ethan R.; VanDerHeyden, Amanda

2017-01-01

We used existing reading (n = 1,498) and math (n = 2,260) data to evaluate state test scores for screening middle school students. In Phase 1, state test data were used to create a research-derived cut score that was optimal for predicting state test performance the following year. In Phase 2, those cut scores were applied with future cohorts.…
Measuring self-esteem after spinal cord injury: Development, validation and psychometric characteristics of the SCI-QOL Self-esteem item bank and short form.

Science.gov (United States)

Kalpakjian, Claire Z; Tate, Denise G; Kisala, Pamela A; Tulsky, David S

2015-05-01

To describe the development and psychometric properties of the Spinal Cord Injury-Quality of Life (SCI-QOL) Self-esteem item bank. Using a mixed-methods design, we developed and tested a self-esteem item bank through the use of focus groups with individuals with SCI and clinicians with expertise in SCI, cognitive interviews, and item-response theory-(IRT) based analytic approaches, including tests of model fit, differential item functioning (DIF) and precision. We tested a pool of 30 items at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital, and the James J. Peters/Bronx Department of Veterans Affairs hospital. A total of 717 individuals with SCI completed the self-esteem items. A unidimensional model was observed (CFI=0.946; RMSEA=0.087) and measurement precision was good (theta range between -2.7 and 0.7). Eleven items were flagged for DIF; however, effect sizes were negligible with little practical impact on score estimates. The final calibrated item bank resulted in 23 retained items. This study indicates that the SCI-QOL Self-esteem item bank represents a psychometrically robust measurement tool. Short form items are also suggested and computer adaptive tests are available.
Design and Psychometric Evaluation of the Quality of Life in Patients With Anal Fistula Questionnaire.

Science.gov (United States)

Ferrer-Márquez, Manuel; Espínola-Cortés, Natalia; Reina-Duarte, Angel; Granero-Molina, José; Fernández-Sola, Cayetano; Hernández-Padilla, José Manuel

2017-10-01

Quality of life is often considered when deciding and evaluating the treatment strategy for patients diagnosed with anal fistula. The purpose of this study was to develop and psychometrically test the Quality of Life in Patients with Anal Fistula Questionnaire. This was an observational cross-sectional study for the development and validation of a psychometric tool. The study was conducted at a general hospital in the southeast of Spain. A convenience sample included 54 patients diagnosed with anal fistula. The reliability of the tool was assessed through its internal consistency (Cronbach α) and temporal stability (Spearman correlation coefficient (r) between test-retest). The content validity index of the items and the scale was calculated. Correlation analysis and an ordinal regression analysis between the developed tool and the Short Form 12 Health Survey examined its concurrent validity. Principal component analysis and known-group analysis using the Kruskal-Wallis test examined its construct validity. The reliability of the developed questionnaire was very high (α = 0.908; r = 0.861; p questionnaire to detect expected differences in patients presenting with different symptomatology. The major limitations of this study were the use of a small sample of Spanish-speaking patients, not including patients in the initial development of the questionnaire, and developing the scoring system using a summation method. The Quality of Life in Patients with Anal Fistula Questionnaire has proven to be a valid, reliable, and concise tool that could contribute to the evaluation of quality of life among patients with an anal fistula. See Video Abstract at http://links.lww.com/DCR/A368.
A Protocol for Advanced Psychometric Assessment of Surveys

Science.gov (United States)

Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.

2013-01-01

Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
Psychometric considerations in the measurement of event-related brain potentials: Guidelines for measurement and reporting.

Science.gov (United States)

Clayson, Peter E; Miller, Gregory A

2017-01-01

Failing to consider psychometric issues related to reliability and validity, differential deficits, and statistical power potentially undermines the conclusions of a study. In research using event-related brain potentials (ERPs), numerous contextual factors (population sampled, task, data recording, analysis pipeline, etc.) can impact the reliability of ERP scores. The present review considers the contextual factors that influence ERP score reliability and the downstream effects that reliability has on statistical analyses. Given the context-dependent nature of ERPs, it is recommended that ERP score reliability be formally assessed on a study-by-study basis. Recommended guidelines for ERP studies include 1) reporting the threshold of acceptable reliability and reliability estimates for observed scores, 2) specifying the approach used to estimate reliability, and 3) justifying how trial-count minima were chosen. A reliability threshold for internal consistency of at least 0.70 is recommended, and a threshold of 0.80 is preferred. The review also advocates the use of generalizability theory for estimating score dependability (the generalizability theory analog to reliability) as an improvement on classical test theory reliability estimates, suggesting that the latter is less well suited to ERP research. To facilitate the calculation and reporting of dependability estimates, an open-source Matlab program, the ERP Reliability Analysis Toolbox, is presented. Copyright © 2016 Elsevier B.V. All rights reserved.
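The recommended thresholds can be illustrated with a minimal sketch that computes Cronbach's alpha and compares it with the 0.70 and 0.80 cut-offs. This is a classical test theory estimate, not the generalizability theory dependability estimate the review advocates, and it is not the ERP Reliability Analysis Toolbox; the function names and the sample data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    if len(scores) < 2 or len(scores[0]) < 2:
        raise ValueError("need at least two respondents and two items")
    k = len(scores[0])
    # Sample variance of each item across respondents
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the respondents' total scores
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


def reliability_label(alpha, acceptable=0.70, preferred=0.80):
    """Classify an estimate against the thresholds named in the abstract."""
    if alpha >= preferred:
        return "preferred"
    if alpha >= acceptable:
        return "acceptable"
    return "below threshold"


# Hypothetical data: five respondents, three items
sample = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 5, 4], [3, 3, 4]]
alpha = cronbach_alpha(sample)
print(round(alpha, 3), reliability_label(alpha))
```

The thresholds are passed as parameters so that a study can report and justify its own cut-off, as the guidelines recommend.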
Psychometric Properties of the Chinese Version of the Arabic Scale of Death Anxiety.

Science.gov (United States)

Qiu, Qi; Zhang, Shengyu; Lin, Xiang; Ban, Chunxia; Yang, Haibo; Liu, Zhengwen; Wang, Jingrong; Wang, Tao; Xiao, Shifu; Abdel-Khalek, Ahmed M; Li, Xia

2016-06-25

Death anxiety is regarded as a risk and maintaining factor of psychopathology. While the Arabic Scale of Death Anxiety (ASDA) is a brief, commonly used assessment, such a tool is lacking in Chinese clinical practice. The current study was conducted to develop a Chinese version of the ASDA, i.e., the ASDA(C), using a multistage back-translation technique, and examine the psychometric properties of the scale. A total of 1372 participants from hospitals and universities located in three geographic areas of China were recruited for this study. To calculate the criterion-related validity of the ASDA(C) compared to the Chinese version of the longer-form Multidimensional Orientation toward Dying and Death Inventory (MODDI-F/chin), 49 undergraduates were randomly assigned to complete both questionnaires. Of the total participants, 56 were randomly assigned to retake the ASDA(C) in order to estimate the one-week, test-retest reliability of the ASDA(C). The overall Cronbach's alpha was 0.91 for the whole scale. The one-week, test-retest reliability was 0.96. Exploratory Factor Analysis (EFA) revealed three factors, "fear of dead people and tombs," "fear of lethal disease," and "fear of postmortem events," accounted for 57.09% of the total variance. Factor structure for the three-factor model was sound. The correlation between the total scores on the ASDA(C) and the MODDI-F/chin was 0.54, indicating acceptable concurrent validity. ASDA(C) has adequate psychometrics and properties that make it a reliable and valid scale to assess death anxiety in Mandarin-speaking Chinese.
Psychometric properties of a culture-adapted Spanish version of AIDA (Assessment of Identity Development in Adolescence) in Mexico.

Science.gov (United States)

Kassin, Moises; De Castro, Filipa; Arango, Ivan; Goth, Kirstin

2013-01-01

The construct "identity" was discussed to be integrated as an important criterion for diagnosing personality disorders in DSM-5. According to Kernberg, identity diffusion is one of the relevant underlying structures in terms of personality organization for developing psychopathology, especially borderline personality disorder. Therefore, it would be important to differentiate healthy from pathological development already in adolescence. With the questionnaire termed AIDA (Assessment of Identity Development in Adolescence), a reliable and valid self-rating inventory was introduced by Goth, Foelsch, Schlueter-Mueller, & Schmeck (2012) to assess pathology-related identity development in healthy and disturbed adolescents. To test the usefulness of the questionnaire in Mexico, we contributed to the development of a culture-specific Spanish translation of AIDA and tested the reliability and aspects of validity of the questionnaire in a juvenile Mexican sample. An adapted Spanish translation of AIDA was developed by an expert panel from Chile, Mexico, and Spain in cooperation with the original authors, focusing on content equivalence and comprehensibility by considering specific idioms, life circumstances, and culture-specific aspects. The psychometric properties of the Spanish version were first tested in Mexico. Participants were 265 students from a state school (N = 110) and private school (N = 155), aged between 12 and 19 years (mean 14.15 years). Of these, 44.9% were boys and 55.1% were girls. Item characteristics were analyzed by several parameters, scale reliability by Cronbach's Alpha, and systematic effects of gender, age, and socioeconomics by an analysis of variance (ANOVA). We evaluated aspects of criterion validity in a juvenile justice system sample (N = 41) of adolescent boys in conflict with the law who displayed various types of behavioral problems by comparing the AIDA scores of a subgroup with signs for borderline pathology (N = 14
Development and psychometric evaluation of a clinical global impression for schizoaffective disorder scale.

Science.gov (United States)

Allen, Michael H; Daniel, David G; Revicki, Dennis A; Canuso, Carla M; Turkoz, Ibrahim; Fu, Dong-Jing; Alphs, Larry; Ishak, K Jack; Bartko, John J; Lindenmayer, Jean-Pierre

2012-01-01

The Clinical Global Impression for Schizoaffective Disorder scale is a new rating scale adapted from the Clinical Global Impression scale for use in patients with schizoaffective disorder. The psychometric characteristics of the Clinical Global Impression for Schizoaffective Disorder are described. Content validity was assessed using an investigator questionnaire. Inter-rater reliability was determined with 12 sets of videotaped interviews rated independently by two trained individuals. Test-retest reliability was assessed using 30 randomly selected raters from clinical trials who evaluated the same videos on separate occasions two weeks apart. Convergent and divergent validity and effect size were evaluated by comparing scores between the Clinical Global Impression for Schizoaffective Disorder and the Positive and Negative Syndrome Scale, 21-item Hamilton Rating Scale for Depression, and Young Mania Rating Scale using pooled patient data from two clinical trials. Clinical Global Impression for Schizoaffective Disorder scores were then linked to corresponding Positive and Negative Syndrome Scale scores. Content validity was strong. Inter-rater agreement was good to excellent for most scales and subscales (intra-class correlation coefficient ≥ 0.50). Test-retest showed good reproducibility, with intraclass correlation coefficients ranging from 0.444 to 0.898. Spearman correlations between Clinical Global Impression for Schizoaffective Disorder domains and corresponding symptom scales were 0.60 or greater, and effect sizes for Clinical Global Impression for Schizoaffective Disorder overall and domain scores were similar to Positive and Negative Syndrome Scale, Young Mania Rating Scale, and 21-item Hamilton Rating Scale for Depression scores. Raters anticipated that the scale might be less effective in distinguishing negative from depressive symptoms, and, in fact, the results here may reflect that clinical reality. Multiple lines of evidence support the
A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

Science.gov (United States)

Kosinski, Andrzej S

2013-03-15

Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.
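For orientation, the quantities being compared can be sketched as follows. This only computes positive and negative predictive values for two tests and their differences; it is not the weighted generalized score statistic proposed in the abstract, and the function name and the counts are hypothetical.

```python
def predictive_values(tp, fp, fn, tn):
    """Return (PPV, NPV) from the counts of a 2x2 diagnostic table."""
    if tp + fp == 0 or tn + fn == 0:
        raise ValueError("no positive or no negative test results")
    ppv = tp / (tp + fp)  # share of positive results that are truly diseased
    npv = tn / (tn + fn)  # share of negative results that are truly disease-free
    return ppv, npv


# Hypothetical counts for two tests applied to the same 100 patients
ppv_a, npv_a = predictive_values(tp=40, fp=10, fn=5, tn=45)
ppv_b, npv_b = predictive_values(tp=36, fp=4, fn=9, tn=51)

print(f"Test A: PPV={ppv_a:.2f}, NPV={npv_a:.2f}")
print(f"Test B: PPV={ppv_b:.2f}, NPV={npv_b:.2f}")
print(f"Differences: PPV={ppv_a - ppv_b:.2f}, NPV={npv_a - npv_b:.2f}")
```

Because both tests are applied to the same patients, the two estimates are correlated, which is why a simple independent-samples comparison of these values is not appropriate in a paired design.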
A Systematic Review of the Psychometric Properties of Composite LGBT Prejudice and Discrimination Scales.

Science.gov (United States)

Morrison, Melanie A; Bishop, C J; Morrison, Todd G

2018-01-08

Prejudice and discrimination against LGBT individuals is widespread and has been shown to have negative consequences for sexual and gender minority persons' physical and psychological wellbeing. A recent and problematic trend in the literature is to compositely measure prejudice toward and discrimination against LGBT persons. As such, a review of the psychometric properties of scales assessing, in a combinatory fashion, negative attitudes and/or behaviors toward LGBT persons is warranted. In the current study, 32 scales were identified, and their psychometric properties were evaluated. Most of the scales reviewed did not provide sufficient information regarding item development and refinement, scale dimensionality, scale score reliability, or validity. Properties of the reviewed scales are summarized, and recommendations for better measurement practice are articulated.
Validity of GRE General Test scores and TOEFL scores for graduate admission to a technical university in Western Europe

Science.gov (United States)

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while additional 7% were explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed only little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.
The Performance of the Upper Limb scores correlate with pulmonary function test measures and Egen Klassifikation scores in Duchenne muscular dystrophy.

Science.gov (United States)

Lee, Ha Neul; Sawnani, Hemant; Horn, Paul S; Rybalsky, Irina; Relucio, Lani; Wong, Brenda L

2016-01-01

The Performance of the Upper Limb scale was developed as an outcome measure specifically for ambulant and non-ambulant patients with Duchenne muscular dystrophy and is implemented in clinical trials needing longitudinal data. The aim of this study is to determine whether this novel tool correlates with functional ability using pulmonary function test, cardiac function test and Egen Klassifikation scale scores as clinical measures. In this cross-sectional study, 43 non-ambulatory Duchenne males from ages 10 to 30 years and on long-term glucocorticoid treatment were enrolled. Cardiac and pulmonary function test results were analyzed to assess cardiopulmonary function, and Egen Klassifikation scores were analyzed to assess functional ability. The Performance of the Upper Limb scores correlated with pulmonary function measures and had inverse correlation with Egen Klassifikation scores. There was no correlation with left ventricular ejection fraction and left ventricular dysfunction. Body mass index and decreased joint range of motion affected total Performance of the Upper Limb scores and should be considered in clinical trial designs. Copyright © 2016 Elsevier B.V. All rights reserved.
Relative Merits of Four Methods for Scoring Cloze Tests.

Science.gov (United States)

Brown, James Dean

1980-01-01

Describes study comparing merits of exact answer, acceptable answer, clozentropy and multiple choice methods for scoring tests. Results show differences among reliability, mean item facility, discrimination and usability, but not validity. (BK)
Psychometric evaluation and validation of the Serbian version of “Reading the mind in the eyes” test

Directory of Open Access Journals (Sweden)

Đorđević Jelena

2017-01-01

The “Reading the Mind in the Eyes” test (RMET) is one of the most popular and widely used measures of individual differences in Theory of Mind (ToM) capabilities. Despite demonstrating good validity in differentiating various clinical groups exhibiting ToM deficits from unimpaired controls, previous studies raised the question of the RMET’s homogeneity, latent structure, and reliability. The aim of this study is to provide evidence on psychometric properties, latent structure, and validity of the newly adapted Serbian version of the RMET. In total, 260 participants (61.9% females) took part in the study. The sample consisted of both unimpaired controls (76.5%) and a clinical group of participants that are believed to demonstrate ToM deficits (23.5%), namely, persons diagnosed with schizophrenia and bipolar disorder (54.1% females). The RMET demonstrated fair psychometric properties (KMO = .723; α = .747; H1 = .076; H5 = .465), successfully differentiating between the clinical group and controls [F(1,254) = 26.175, p < .001, ηp² = .093], while typical gender differences in performance were found only in the control group. Tests of several models based on the previous literature revealed that the affect-specific factors underlying performance on RMET demonstrate poor fit. The best fitting model obtained included a reduced scale with a single factor underlying the test’s performance (TLI = .953, CFI = .958, RMSEA = .020). Based on the fit parameters we propose an 18-item short form of the Serbian version of RMET (KMO = .797; α = .728; H1 = .129; H5 = .677) for economic, reliable and valid measurement of ToM abilities.
Development and Psychometric Evaluation of Scales: A Survey of Published Articles

Directory of Open Access Journals (Sweden)

Foroozan Atashzadeh-Shoorideh

2016-01-01

Background and purpose: Using valid and reliable instruments is an important way of collecting data in qualitative research. This paper reports a study conducted to examine the extent to which the psychometric properties of scales were assessed in research papers published in the Journal of Advanced Nursing. Methods: The Journal of Advanced Nursing was chosen for systematic review. All articles published in this journal during 2007-2009 were collected, and articles related to instrument development were selected. Each article was reviewed in full to identify the methods used to assess instrument validity and reliability. Results: Of 980 articles published in the Journal of Advanced Nursing during 2007-2009, 41 (4.18%) were about research methodology. Of these, 12 (29.27%) were related to developing an instrument. Review of these 12 articles showed that some did not measure psychometric properties properly; thus, some of the developed scales still need other necessary types of validity to be assessed. In addition, reliability testing needs to be performed on each instrument used in a study before other statistical analyses are performed. All 12 articles measured and reported Cronbach’s alpha, but four of them did not measure test-retest reliability. Conclusions: Although researchers put great emphasis on methodology and statistical analysis, they pay less attention to the psychometric properties of their new instruments. The authors of this article hope to draw the attention of researchers to the importance of measuring the psychometric properties of new instruments. Keywords: PSYCHOMETRIC, SCALES, CRITICAL REVIEW
Decay of Iconic Memory Traces Is Related to Psychometric Intelligence: A Fixed-Links Modeling Approach

Science.gov (United States)

Miller, Robert; Rammsayer, Thomas H.; Schweizer, Karl; Troche, Stefan J.

2010-01-01

Several memory processes have been examined regarding their relation to psychometric intelligence with the exception of sensory memory. This study examined the relation between decay of iconic memory traces, measured with a partial-report task, and psychometric intelligence, assessed with the Berlin Intelligence Structure test, in 111…
The Harris hip score: Do ceiling effects limit its usefulness in orthopedics?

NARCIS (Netherlands)

Wamper, K.E.; Sierevelt, I.N.; Poolman, R.W.; Bhandari, M.; Haverkamp, D.

2010-01-01

The Harris hip score (HHS), a disease-specific health status scale that is frequently used to measure the outcome of total hip arthroplasty, has never been validated properly. A questionnaire is suitable only when all 5 psychometric properties are of sufficient quality. We questioned the usefulness
The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach.

Science.gov (United States)

Xu, Jian

2017-01-01

The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers' listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.
The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

Directory of Open Access Journals (Sweden)

Jian Xu

2017-12-01

The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers’ listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.

Untimed Design Fluency in Aging and Alzheimer's Disease: Psychometrics and Normative Data.

Science.gov (United States)

Sunderaraman, Preeti; Sokolov, Elisaveta; Cines, Sarah; Sullo, Elizabeth; Orly, Aidan; Lerer, Bianca; Karlawish, Jason; Huey, Edward; Cosentino, Stephanie

2015-01-01

Design fluency tests, commonly used in both clinical and research contexts to evaluate nonverbal concept generation, have the potential to offer useful information in the differentiation of healthy versus pathological aging. Although normative data for older adults (OAs) are available for multiple timed versions of this test, similar data have been unavailable for a previously published untimed test, the Graphic Pattern Generation Test (GPG). Time constraints common to almost all of the available design fluency tests may cloud interpretation of higher-level executive abilities, for example in individuals with slow processing speed. The current study examined the psychometric properties of the GPG and presents normative data in a sample of 167 healthy OAs and 110 individuals diagnosed with Alzheimer's disease (AD). Results suggest that a brief version of the GPG can be administered reliably and that this short form has high test-retest and interrater reliability. Number of perseverations was higher in individuals with AD as compared with OAs. A cutoff score of 4 or more perseverations showed a moderate degree of sensitivity (76%) and specificity (37%) in distinguishing individuals with AD and OAs. Finally, perseverations were associated with nonmemory indexes, thereby underscoring the nonverbal nature of this error in OAs and individuals with AD.
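As an illustration of how a cutoff such as "4 or more perseverations" translates into sensitivity and specificity, the following is a minimal sketch. The function name and the perseveration counts are hypothetical and invented for this example; they are not data from the study.

```python
def sensitivity_specificity(patient_scores, control_scores, cutoff):
    """Treat a score >= cutoff as test-positive; return (sensitivity, specificity)."""
    # Sensitivity: share of patients correctly flagged as positive.
    true_pos = sum(1 for s in patient_scores if s >= cutoff)
    # Specificity: share of controls correctly left below the cutoff.
    true_neg = sum(1 for s in control_scores if s < cutoff)
    return true_pos / len(patient_scores), true_neg / len(control_scores)


# Hypothetical perseveration counts (illustration only, not study data).
ad_counts = [6, 4, 2, 9, 5, 3, 7, 4]
oa_counts = [1, 4, 0, 5, 2, 3, 6, 1]

sens, spec = sensitivity_specificity(ad_counts, oa_counts, cutoff=4)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Raising the cutoff generally trades sensitivity for specificity, which is why a single cutoff is usually reported together with both values.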
The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

Science.gov (United States)

Longenbecker, Sueann; Wood, Peter H.

1984-01-01

Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)
A speech reception in noise test for preschool children (the Galker-test)

DEFF Research Database (Denmark)

Lauritsen, Maj-Britt Glenn; Kreiner, Svend; Söderström, Margareta

2015-01-01

Purpose: This study evaluates initial validity and reliability of the “Galker test of speech reception in noise”, developed for Danish preschool children suspected to have problems with hearing or understanding speech, against strict psychometric standards and assesses acceptance by the children. ... Methods: The Galker test is an audio-visual, computerised, word discrimination test in background noise, originally comprising 50 word pairs. Three hundred and eighty-eight children attending ordinary day care centres and aged 3–5 years were included. With multiple regression and the Rasch item response ... model it was examined whether the total score of the Galker test validly reflected item responses across subgroups defined by sex, age, bilingualism, tympanometry, audiometry and verbal comprehension. Results: A total of 370 children (95%) accepted testing and 339 (87%) completed all 50 items...
Psychometric properties of the Sexual Adjustment Questionnaire (SAQ) in the Iranian population with spinal cord injury.

Science.gov (United States)

Merghati-Khoei, E; Maasoumi, R; Rahdari, F; Bayat, A; Hajmirzaei, S; Lotfi, S; Hajiaghababaei, M; Emami-Razavi, S H; Korte, J E; Atoof, F

2015-11-01

This is a cross-sectional study. The objective of this study was to examine the psychometric properties of the Sexual Adjustment Questionnaire (SAQ) for Iranian people with spinal cord injury. This study was conducted in the Brain and Spinal Cord Injury Research Center, Tehran University of Medical Sciences, Tehran, Iran. We assessed the psychometric properties of the SAQ, with 200 participants (men=146, women=54) completing the scale. An evaluation of its test-retest reliability was performed over a 2-week period, on a subsample of 30 patients recruited from the overall group. Cronbach's α-coefficient was computed for assessment of internal consistency reliability. In addition, content and face validity were examined by an expert committee. Construct validity was assessed by examining convergent and discriminant validity. Finally, exploratory factor analysis was used to extract the factor structure of the questionnaire. The Cronbach's α and intraclass correlation coefficient were 0.77 and 0.72, respectively. With regard to construct validity, there was a significant (P=0.009) negative correlation (r=-0.28) between the SAQ score and age. Those with lower levels of education scored significantly lower on the SAQ (P=0.04). The exploratory factor analysis indicated a four-factor structure for the questionnaire, accounting for 68.9% of the observed variance. The expert committee approved the face and content validity of the developed measure. The SAQ is a valid measure for assessing sexual adjustment in people with spinal cord injury. The evaluation of sexual well-being may be useful in clinical trials and practical settings.
Psychometric characteristics of Clinical Reasoning Problems (CRPs) and its correlation with routine multiple choice question (MCQ) in Cardiology department.

Science.gov (United States)

Derakhshandeh, Zahra; Amini, Mitra; Kojuri, Javad; Dehbozorgian, Marziyeh

2018-01-01

Clinical reasoning is one of the most important skills in the process of training a medical student to become an efficient physician. Assessment of reasoning skills in a medical school program is important to direct students' learning. One of the tests for measuring clinical reasoning ability is Clinical Reasoning Problems (CRPs). The major aim of this study is to measure the psychometric qualities of CRPs and determine the correlation between this test and the routine MCQ in the cardiology department of Shiraz medical school. This was a descriptive study conducted on all cardiology residents of Shiraz Medical School. The study population consisted of 40 residents in 2014. The routine CRPs and MCQ tests were designed based on similar objectives and were carried out simultaneously. Reliability, item difficulty, item discrimination, and the correlation between each item and the total CRPs score were all measured with Excel and SPSS software to check the psychometric properties of the CRPs test. Furthermore, we calculated the correlation between the CRPs test and the MCQ test. The mean differences in CRPs test scores between residents' academic years [second, third and fourth year] were also evaluated by analysis of variance (one-way ANOVA) using SPSS software (version 20) (α=0.05). The mean and standard deviation of the CRPs score was 10.19±3.39 out of 20; for the MCQ, it was 13.15±3.81 out of 20. Item difficulty was in the range of 0.27-0.72; item discrimination was 0.30-0.75, with question No. 3 being the exception (0.24). The correlation between each item and the total CRPs score was 0.26-0.87; the correlation between the CRPs test and the MCQ test was 0.68 (p […] reasoning in residents. It can be included in cardiology residency assessment programs.
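For readers unfamiliar with the item statistics named above, here is a minimal sketch of two common definitions: item difficulty as the proportion of correct answers, and item discrimination as the upper-lower group index. The abstract does not say which discrimination formula was used, so the 27% upper-lower method is an assumption, and the response matrix is hypothetical.

```python
def item_difficulty(responses, item):
    """Proportion of examinees answering the item correctly (0/1 scoring)."""
    return sum(row[item] for row in responses) / len(responses)


def item_discrimination(responses, item, fraction=0.27):
    """Upper-lower index: item difficulty in the top-scoring group minus
    item difficulty in the bottom-scoring group (assumed method)."""
    ranked = sorted(responses, key=sum, reverse=True)  # rank by total score
    k = max(1, round(len(ranked) * fraction))          # size of each group
    upper, lower = ranked[:k], ranked[-k:]
    return item_difficulty(upper, item) - item_difficulty(lower, item)


# Hypothetical 0/1 responses: 8 examinees (rows) x 4 items (columns).
responses = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]

for i in range(4):
    print(i, item_difficulty(responses, i), item_discrimination(responses, i))
```

A difficulty near 0.5 and a discrimination well above zero are usually considered desirable; the thresholds applied in any given study are a matter of convention.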
Evaluation of the psychometric properties of the Nighttime Symptoms of COPD Instrument

Directory of Open Access Journals (Sweden)

Mocarski M

2015-03-01

Michelle Mocarski (1), Erica Zaiser (2), Dylan Trundell (2), Barry J Make (3), Asha Hareendran (2). (1) Forest Research Institute, Inc., an affiliate of Actavis, Inc., Jersey City, NJ, USA; (2) Evidera, London, UK; (3) National Jewish Health, Denver, CO, USA. Background: Nighttime symptoms can negatively impact the quality of life of patients with chronic obstructive pulmonary disease (COPD). The Nighttime Symptoms of COPD Instrument (NiSCI) was designed to measure the occurrence and severity of nighttime symptoms in patients with COPD, the impact of symptoms on nighttime awakenings, and rescue medication use. The objective of this study was to explore item reduction, inform scoring recommendations, and evaluate the psychometric properties of the NiSCI. Methods: COPD patients participating in a Phase III clinical trial completed the NiSCI daily. Item analyses were conducted using weekly mean and single day scores. Descriptive statistics (including percentage of respondents at floor/ceiling and inter-item correlations), factor analyses, and Rasch model analyses were conducted to examine item performance and scoring. Test–retest reliability was assessed for the final instrument using the intraclass correlation coefficient (ICC). Correlations with assessments conducted during study visits were used to evaluate convergent and known-groups validity. Results: Data from 1,663 COPD patients aged 40–93 years were analyzed. Item analyses supported the generation of four scores. A one-factor structure was confirmed with factor analysis and Rasch analysis for the symptom severity score. Test–retest reliability was confirmed for the six-item symptom severity (ICC, 0.85), number of nighttime awakenings (ICC, 0.82), and rescue medication (ICC, 0.68) scores. Convergent validity was supported by significant correlations between the NiSCI, St George’s Respiratory Questionnaire, and Exacerbations of Chronic Obstructive Pulmonary Disease Tool-Respiratory Symptoms scores. Conclusion: The
Psychometric intelligence and P3 of the event-related potentials studied with a 3-stimulus auditory oddball task

NARCIS (Netherlands)

Wronka, E.A.; Kaiser, J.; Coenen, A.M.L.

2013-01-01

The relationship between psychometric intelligence measured with Raven's Advanced Progressive Matrices (RAPM) and event-related potentials (ERP) was examined using a 3-stimulus oddball task. Subjects who had scored higher on RAPM exhibited a larger amplitude of the P3a component. Additional analysis using the
Evaluating the validity of an integrity-based situational judgement test for medical school admissions.

Science.gov (United States)

Husbands, Adrian; Rodgerson, Mark J; Dowell, Jon; Patterson, Fiona

2015-09-02

While the construct of integrity has emerged as a front-runner amongst the desirable attributes to select for in medical school admissions, it is less clear how best to assess this characteristic. A potential solution lies in the use of Situational Judgement Tests (SJTs), which have gained popularity due to robust psychometric evidence and potential for large-scale administration. This study aims to explore the psychometric properties of an SJT designed to measure the construct of integrity. Ten SJT scenarios, each with five response stems, were developed from critical incident interviews with academic and clinical staff. 200 of 520 (38.5%) Multiple Mini Interview (MMI) candidates at Dundee Medical School participated in the study during the 2012-2013 admissions cycle. Participants were asked to rate the appropriateness of each SJT response on a 4-point Likert scale as well as complete the HEXACO personality inventory and a face validity questionnaire. Pearson's correlations and descriptive statistics were used to examine the associations between SJT score, HEXACO personality traits, pre-admissions measures, namely academic and United Kingdom Clinical Aptitude Test (UKCAT) scores, as well as acceptability. Cronbach's alpha reliability for the SJT was .64. Statistically significant correlations ranging from .16 to .36 (.22 to .53 disattenuated) were observed between SJT score and the honesty-humility (integrity), conscientiousness, extraversion and agreeableness dimensions of the HEXACO inventory. A significant correlation of .32 (.47 disattenuated) was observed between SJT and MMI scores and no significant relationship with the UKCAT. Participant reactions to the SJTs were generally positive. Initial findings are encouraging regarding the psychometric robustness of an integrity-based SJT for medical student selection, with significant associations found between the SJTs, integrity, other desirable personality traits and the MMI. The SJTs showed little or no redundancy with
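The "disattenuated" correlations quoted above are conventionally obtained with Spearman's correction for attenuation, which divides the observed correlation by the square root of the product of the two reliabilities. The sketch below assumes that standard formula. The SJT reliability of .64 comes from the abstract, but the reliability of the second measure (0.72) is a hypothetical value chosen only for illustration.

```python
import math


def disattenuate(r_xy, rel_x, rel_y):
    """Spearman's correction for attenuation:
    r_true = r_xy / sqrt(rel_x * rel_y)."""
    if not (0 < rel_x <= 1 and 0 < rel_y <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


# Observed r = .32 and SJT alpha = .64 are taken from the abstract;
# rel_y = 0.72 is a HYPOTHETICAL reliability for the second measure.
corrected = disattenuate(0.32, rel_x=0.64, rel_y=0.72)
print(round(corrected, 2))
```

Because the correction divides by a quantity no greater than 1, a disattenuated correlation is never smaller in magnitude than the observed one, and low reliabilities can inflate it substantially.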
Contributions of Hamstring Stiffness to Straight-Leg-Raise and Sit-and-Reach Test Scores.

Science.gov (United States)

Miyamoto, Naokazu; Hirata, Kosuke; Kimura, Noriko; Miyamoto-Mikami, Eri

2018-02-01

The passive straight-leg-raise (PSLR) and the sit-and-reach (SR) tests have been widely used to assess hamstring extensibility. However, it remains unclear to what extent hamstring stiffness (a measure of material properties) contributes to PSLR and SR test scores. Therefore, we aimed to clarify the relationship between hamstring stiffness and PSLR and SR scores using ultrasound shear wave elastography. Ninety-eight healthy subjects completed the study. Each subject completed PSLR testing, and classic and modified SR testing of the right leg. Muscle shear modulus of the biceps femoris, semitendinosus, and semimembranosus was quantified as an index of muscle stiffness. The relationships between shear modulus of each muscle and PSLR or SR scores were calculated using Pearson's product-moment correlation coefficients. Shear modulus of the semitendinosus and semimembranosus showed negative correlations with the two PSLR and two SR scores (absolute r value≤0.484). Shear modulus of the biceps femoris was significantly correlated with the PSLR score determined by the examiner and the modified SR score (absolute r value≤0.308). The present findings suggest that PSLR and SR test scores are strongly influenced by factors other than hamstring stiffness and therefore might not accurately evaluate hamstring stiffness. © Georg Thieme Verlag KG Stuttgart · New York.
Manual for Scoring the Test of Directed Imagination.

Science.gov (United States)

Veldman, Donald J.; And Others

A scoring manual for the Directed Imagination Test, a projective technique wherein the subject is instructed to write four fictional stories (four minutes are allowed for each) about teachers and their experiences, is presented. The manual provides detailed instructions for rating each story by fifteen dimensions relevant to teacher education…
Cross-cultural adaptation and validation of Persian Achilles tendon Total Rupture Score.

Science.gov (United States)

Ansari, Noureddin Nakhostin; Naghdi, Soofia; Hasanvand, Sahar; Fakhari, Zahra; Kordi, Ramin; Nilsson-Helander, Katarina

2016-04-01

To cross-culturally adapt the Achilles tendon Total Rupture Score (ATRS) to the Persian language and to preliminarily evaluate the reliability and validity of a Persian ATRS. A cross-sectional and prospective cohort study was conducted to translate and cross-culturally adapt the ATRS to the Persian language (ATRS-Persian) following steps described in guidelines. Thirty patients with total Achilles tendon rupture and 30 healthy subjects participated in this study. Psychometric properties of floor/ceiling effects (responsiveness), internal consistency reliability, test-retest reliability, standard error of measurement (SEM), smallest detectable change (SDC), construct validity, and discriminant validity were tested. Factor analysis was performed to determine the ATRS-Persian structure. There were no floor or ceiling effects, which indicates the content and responsiveness of the ATRS-Persian. Internal consistency was high (Cronbach's α 0.95). Item-total correlations exceeded the acceptable standard of 0.3 for all items (0.58-0.95). The test-retest reliability was excellent [(ICC)agreement 0.98]. SEM and SDC were 3.57 and 9.9, respectively. Construct validity was supported by a significant correlation between the ATRS-Persian total score and the Persian Foot and Ankle Outcome Score (PFAOS) total score and PFAOS subscales (r = 0.55-0.83). The ATRS-Persian significantly discriminated between patients and healthy subjects. Explanatory factor analysis revealed 1 component. The ATRS was cross-culturally adapted to Persian and demonstrated to be a reliable and valid instrument to measure functional outcomes in Persian patients with Achilles tendon rupture. II.
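The reported SEM (3.57) and SDC (9.9) are consistent with the commonly used relation SDC = 1.96 × √2 × SEM. Whether the authors applied exactly this formula is an assumption, since the abstract does not state it. A minimal sketch:

```python
import math


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level under the common convention
    SDC = z * sqrt(2) * SEM; sqrt(2) reflects the two measurements
    (test and retest) that each carry measurement error."""
    return z * math.sqrt(2) * sem


# SEM value taken from the abstract; the formula itself is assumed.
sdc = smallest_detectable_change(3.57)
print(round(sdc, 1))
```

Under this convention, a change in an individual's score smaller than the SDC cannot be distinguished from measurement error at the 95% level.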
Psychometric Properties of the Persian Version of Self-Transcendence Scale: Adolescent Version.

Science.gov (United States)

Farahani, Azam Shirinabadi; Rassouli, Maryam; Yaghmaie, Farideh; Majd, Hamid Alavi; Sajjadi, Moosa

2016-04-01

Given the greater tendency during adolescence toward risk-taking, identifying and measuring the factors affecting adolescents' health is highly important to ensure the efficacy of health-promoting interventions. One of these factors is self-transcendence. The aim of this study was to assess the psychometric features of the Self-Transcendence Scale (adolescents' version) in students in Tehran, the capital city of Iran. This research was conducted in 2015. For this purpose, 1210 high school students were selected through the multistage cluster sampling method. After backward-forward translation, the psychometric properties of the scale were examined through assessment of its validity (face and construct) and reliability (internal consistency and stability). Construct validity was assessed using two methods: factor analysis and convergence of the scale with the Hopefulness Scale for Adolescents. The face validity assessment resulted in minor modifications to some words. The exploratory factor analysis resulted in the extraction of two dimensions, collectively explaining 52.79% of the variance. In determining convergent validity, the correlation between the hopefulness score and the self-transcendence score was r=0.47 (P […] Self-Transcendence Scale showed an acceptable validity and reliability and can be used in the assessment of self-transcendence in Iranian adolescents.
GamTest: Psychometric Evaluation and the Role of Emotions in an Online Self-Test for Gambling Behavior.

Science.gov (United States)

Jonsson, Jakob; Munck, Ingrid; Volberg, Rachel; Carlbring, Per

2017-06-01

Recent increases in the number of online gambling sites have made gambling more available, which may contribute to an increase in gambling problems. At the same time, online gambling provides opportunities to introduce measures intended to prevent problem gambling. GamTest is an online test of gambling behavior that provides information that can be used to give players individualized feedback and recommendations for action. The aim of this study is to explore the dimensionality of GamTest and validate it against the Problem Gambling Severity Index (PGSI) and the gambler's own perceived problems. A recent psychometric approach, exploratory structural equation modeling (ESEM) is used. Well-defined constructs are identified in a two-step procedure fitting a traditional exploratory factor analysis model as well as a so-called bifactor model. Using data collected at four Nordic gambling sites in the autumn of 2009 (n = 10,402), the GamTest ESEM analyses indicate high correspondence with the players' own understanding of their problems and with the PGSI, a validated measure of problem gambling. We conclude that GamTest captures five dimensions of problematic gambling (i.e., overconsumption of money and time, and monetary, social and emotional negative consequences) with high reliability, and that the bifactor approach, composed of a general factor and specific residual factors, reproduces all these factors except one, the negative consequences emotional factor, which contributes to the dominant part of the general factor. The results underscore the importance of tailoring feedback and support to online gamblers with a particular focus on how to handle emotions in relation to their gambling behavior.
An initial psychometric evaluation and exploratory cross-sectional study of the body checking questionnaire among Brazilian women.

Directory of Open Access Journals (Sweden)

Angela Nogueira Neves Betanho Campana

Body checking is considered an expression of an excessive preoccupation with appearance. The first aim of this study was to evaluate the psychometric properties of a Brazilian Portuguese version of the Body Checking Questionnaire (BCQ). Additionally, we wanted to examine the questionnaire's associations with body avoidance behaviour, body mass index, dietary habits, and the intensity, frequency, and length of physical exercise. Finally, we also examined the differences between the total BCQ score and the individual BCQ factor scores. Differences between active and sedentary persons and between non-dieters and those on weight-loss diets were also analyzed. For the psychometric study, 546 female public university students from four different courses were surveyed. Two smaller samples, of university students and of women with eating disorders, were also recruited. In the second part of the study, 403 women were recruited from weight-loss programs, gyms, and a university. All participants were verbally invited to participate in the research and voluntarily took part. Confirmatory factor analysis showed a good fit to the original model for the Brazilian BCQ, which retained all 23 items. Satisfactory evidence of construct validity and internal consistency was also generated through analysis of factor loadings, t-values, Cronbach's alpha, and construct reliability tests. The results also showed associations among body checking and body avoidance, body satisfaction, social anxiety, body mass index, and the frequency and intensity of physical exercise. Significant differences were found between non-dieters and weight-loss dieters for all BCQ factors and the total BCQ score. For physically active and sedentary persons, a significant difference was only observed for idiosyncratic checking behaviour. In conclusion, the BCQ appears to be a valid and reliable scale for Brazilian research, and the associations and differences found in this study suggest that women at gyms
The quality improvement attitude survey: Development and preliminary psychometric characteristics.

Science.gov (United States)

Dunagan, Pamela B

2017-12-01

To report the development of a tool to measure nurses' attitudes about quality improvement in their practice setting and to examine preliminary psychometric characteristics of the Quality Improvement Nursing Attitude Scale. Human factors such as nursing attitudes of complacency have been identified as root causes of sentinel events. Attitudes of nurses concerning use of Quality and Safety Education for Nurses competencies can be most challenging to teach and to change. No tool has been developed measuring attitudes of nurses concerning their role in quality improvement. A descriptive study design with preliminary psychometric evaluation was used to examine the preliminary psychometric characteristics of the Quality Improvement Nursing Attitude Scale. Registered bedside clinical nurses comprised the sample for the study (n = 57). Quantitative data were analysed using descriptive statistics and Cronbach's alpha reliability. Total score and individual item statistics were evaluated. Two open-ended items were used to collect statements about nurses' feelings regarding their experience in quality improvement efforts. Strong support for the internal consistency reliability and face validity of the Quality Improvement Nursing Attitude Scale was found. Total scale scores were high, indicating nurse participants valued Quality and Safety Education for Nurses competencies in practice. However, item-level statistics indicated nurses felt powerless when other nurses deviate from care standards. Additionally, the sample indicated they did not consistently report patient safety issues and did not have a feeling of value in efforts to improve care. Findings suggested organisational culture fosters nurses' reporting safety issues and feeling valued in efforts to improve care. Participants' narrative comments and item analysis revealed the need to generate new items for the Quality Improvement Nursing Attitude Scale focused on nurses' perception of their importance in quality and
AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

Science.gov (United States)

Cech, Scott J.

2008-01-01

More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…
Psychometrics of the Kansas City Cardiomyopathy Questionnaire Adapted for Family Caregiver/Significant Other.

Science.gov (United States)

Tucker, Rebecca; Quinn, Jill R; Chen, Ding-Geng; Chen, Leway

2016-12-01

The Kansas City Cardiomyopathy Questionnaire (KCCQ) was adapted to be administered to the family caregiver/significant other (FC/SO) of hospitalized patients with heart failure (HF). The objective was to examine the psychometrics of the adapted scale (KCCQ-SO). Factor analysis, Cronbach's alpha, and correlations were used. A 5-factor solution was found that explained 67.9% of the variance. The internal consistency coefficients of the KCCQ-SO factors were all greater than .70. Patient and FC/SO perceived health status scores were significantly related. Because the scores were found to have high internal consistency and correlated with patient scores on the KCCQ, there is evidence that the FC/SOs' reports may be used in circumstances when the patient is unable or unwilling to answer questions.
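Cronbach's alpha, used here and in many of the records in this listing as the index of internal consistency, can be computed from item variances and the variance of the total score. The following is a minimal sketch of the standard formula, α = k/(k−1) × (1 − Σ item variances / total-score variance). The response data are hypothetical and are not drawn from any of the studies listed.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents x items score matrix,
    using sample variances (n - 1 denominator)."""
    k = len(rows[0])  # number of items
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings: 5 respondents (rows) x 3 items (columns).
scores = [
    [4, 5, 4],
    [3, 3, 2],
    [5, 5, 4],
    [2, 3, 3],
    [4, 4, 5],
]

alpha = cronbach_alpha(scores)
print(round(alpha, 3))
```

Values above roughly .70 are conventionally read as acceptable internal consistency, which is the threshold the abstract above refers to.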
Preliminary Psychometric Examination of the Davidson Trauma Scale: A Study on Chilean Adolescents

Directory of Open Access Journals (Sweden)

Cristobal Guerra

2013-12-01

The Davidson Trauma Scale (DTS) measures the frequency and severity of posttraumatic stress disorder (PTSD). Since Chile has limited data about the validity and reliability of instruments to measure PTSD, this study evaluated the psychometric properties of the scale in a sample of 130 adolescents between 13 and 18 years (M = 15.78; SD = 1.40). Some of them were traumatized patients and others were from the general population. They answered the DTS, a depression scale and an anxiety scale. The scale obtained adequate internal consistency scores, showed convergent validity (the DTS score was associated moderately, directly and significantly with depression and anxiety scores), and discriminated between the clinical sample and the general population. The DTS seems to be a valid and reliable instrument in Chilean adolescents.
Psychometric Properties of the Chinese Version of the Eating Attitudes Test in Young Female Patients with Eating Disorders in Mainland China.

Science.gov (United States)

Kang, Qing; Chan, Raymond C K; Li, Xiaoping; Arcelus, Jon; Yue, Ling; Huang, Jiabin; Gu, Lian; Fan, Qing; Zhang, Haiyin; Xiao, Zeping; Chen, Jue

2017-11-01

The study aimed to investigate the reliability and validity of the Chinese version of the eating attitudes test (EAT-26) among female adolescents and young adults in Mainland China. This scale was administered to 396 female eating disorder patients and 406 noneating disorder healthy controls; in addition, 35 healthy controls completed a retest after a 4-week interval. Tests for reliability, convergent validity and receiver operating characteristic analysis were performed to detect the psychometric properties. The EAT-26 demonstrated good internal consistency (Cronbach's alpha = 0.822-0.922), test-retest reliability (interclass correlation coefficient = 0.817) and convergent validity (r = 0.450-0.750). The receiver operating characteristic analysis showed that the cut-off 14 for anorexia nervosa and 15 for bulimia nervosa represented good compromises with approximate sensitivity (0.66-0.68) and specificity (0.85-0.86). Our findings provided evidence that the Chinese version of the EAT-26 was a psychometrically reliable and valid self-rating instrument for identifying people suffering from an eating disorder in Mainland China. A clinical cut-off range between 14 and 15 could be used, but caution should be exercised because of the low sensitivity of the tool. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association.
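The trade-off between sensitivity and specificity at a screening cut-off, as described in this abstract, can be illustrated with a short sketch. The scores below are invented for illustration and are not EAT-26 data. Picking the cut-off by Youden's J (sensitivity + specificity - 1) is an assumption; the abstract only says the chosen cut-offs were "good compromises" and does not name its criterion.

```python
def sens_spec(case_scores, control_scores, cutoff):
    """Sensitivity and specificity when 'screen positive' means score >= cutoff."""
    true_pos = sum(s >= cutoff for s in case_scores)
    true_neg = sum(s < cutoff for s in control_scores)
    return true_pos / len(case_scores), true_neg / len(control_scores)

def best_cutoff(case_scores, control_scores):
    """Cut-off that maximises Youden's J over all observed score values."""
    candidates = sorted(set(case_scores) | set(control_scores))
    return max(
        candidates,
        key=lambda c: sum(sens_spec(case_scores, control_scores, c)) - 1,
    )

# Hypothetical questionnaire totals for patients and healthy controls.
cases = [12, 17, 19, 21, 24, 28, 33, 36]
controls = [4, 6, 8, 11, 13, 15, 16, 23]

cut = best_cutoff(cases, controls)
print(cut, sens_spec(cases, controls, cut))
```

Raising the cut-off in this sketch trades sensitivity for specificity, which is the same compromise the authors caution about when they note the tool's low sensitivity.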
Validity of GRE General Test Scores and TOEFL Scores for Graduate Admission to a Technical University in Western Europe

Science.gov (United States)

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the…

The Motivation Analysis Test: an historical and contemporary evaluation.

Science.gov (United States)

Bernard, Larry C; Walsh, R Patricia; Mills, Michael

2005-04-01

This is an historical review and contemporary empirical evaluation of the Motivation Analysis Test (MAT), one of the first tests to take a psychometric approach to the assessment of motivation. Reviews were quite positive, but the test is now over 50 years old. Nevertheless, it employs innovations in measurement not widely used in objective measurement then or now: (1) subtests with different formats, (2) disguised items, (3) speeded administration procedures, and (4) ipsative format and scoring procedures. These issues are discussed, and a contemporary sample (N = 360) was obtained to evaluate the Motivation Analysis Test in light of its innovative characteristics.
The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

Science.gov (United States)

Molenaar, Dylan; Borsboom, Denny

2013-01-01

Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…
Psychometric analysis of simulated psychopathology during sick leave

Directory of Open Access Journals (Sweden)

Ignacio Jáuregui Lobera

2018-01-01

The view of simulation has shifted from a categorical or diagnostic perspective to a more dimensional one, so that different “levels” of simulation can be established. In order to analyse, from a psychometric perspective, the possible prediction of simulated behaviour based on common measures of general psychopathology, the objective of the current study was to analyse possible predictors of the Structured Symptomatic Simulation Inventory (SIMS) scores, considering as dependent variables the total SIMS score, the SIMS subscale scores, and the cut-off points usually suggested to discriminate between “no suspected simulation” and “suspected simulation”, which usually are 14 and 16. In terms of possible predictors, two sets of variables were established: (a) categorical (sex; type of treatment: psychopharmacological, psychotherapeutic, or combined; type of work activity; being self-employed or not; presence or absence of a history of psychopathology, both familial and personal; presence or absence of associated physical pathology; diagnosis according to ICD-10; and the final proposal: return to work, extended sick leave, or proposal of permanent work incapacity); and (b) continuous (perceived stress, general and current; self-esteem; results of a screening questionnaire for personality disorders; and scores on a symptoms questionnaire). In addition, a descriptive study of all variables was carried out and possible gender differences were analysed.
Development and psychometric evaluation of the Undergraduate Clinical Education Environment Measure (UCEEM).

Science.gov (United States)

Strand, Pia; Sjöborg, Karolina; Stalmeijer, Renée; Wichmann-Hansen, Gitte; Jakobsson, Ulf; Edgren, Gudrun

2013-12-01

There is a paucity of instruments designed to evaluate the multiple dimensions of the workplace as an educational environment for undergraduate medical students. The aim was to develop and psychometrically evaluate an instrument to measure how undergraduate medical students perceive the clinical workplace environment, based on workplace learning theories and empirical findings. Development of the instrument relied on established standards including theoretical and empirical grounding, systematic item development and expert review at various stages to ensure content validity. Qualitative and quantitative methods were employed using a series of steps from conceptualization through psychometric analysis of scores in a Swedish medical student population. The final result was a 25-item instrument with two overarching dimensions, experiential learning and social participation, and four subscales that coincided well with theory and empirical findings: Opportunities to learn in and through work & quality of supervision; Preparedness for student entry; Workplace interaction patterns & student inclusion; and Equal treatment. Evidence from various sources supported content validity, construct validity and reliability of the instrument. The Undergraduate Clinical Education Environment Measure represents a valid, reliable and feasible multidimensional instrument for evaluation of the clinical workplace as a learning environment for undergraduate medical students. Further validation in different populations using various psychometric methods is needed.
Work environment impact scale: testing the psychometric properties of the Swedish version.

Science.gov (United States)

Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée

2014-01-01

The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.
Psychometric validation of the Columbia-Suicide Severity rating scale in Spanish-speaking adolescents.

Science.gov (United States)

Serrani Azcurra, Daniel

2017-12-30

Adolescent suicide is a major public health issue, and early and accurate detection is of great concern. There are many reliable instruments for this purpose, such as the Columbia-Suicide Severity Rating Scale (C-SSRS), but no validation exists for Spanish-speaking Latin American adolescents. The aim was to assess the psychometric properties and cut-off scores of the C-SSRS in Spanish-speaking adolescents. Exploratory assessment with principal component analysis (PCA) and Varimax rotation, and confirmatory factor analysis (CFA), were performed on two groups with 782 and 834 participants respectively (N=1616). Mean age was 24.8 years. A receiver operating characteristic (ROC) analysis was applied to distinguish between control and suicide-risk adolescent subgroups. Promax rotation yielded two 10-item factors, for suicide ideation and behavior respectively. The C-SSRS was positively correlated with other suicide risk scales, such as the Beck Depression Inventory-II, the Suicidal Behaviors Questionnaire-Revised, and the PHQ-9. Confirmatory factor analysis yielded a two-factor solution as the best-fitting model. The C-SSRS showed adequate ability to detect the suicide risk group, with a positive predictive value of 68.3%. ROC analyses showed cutoff scores of ≥ 6 and ≥ 4 for the suicide ideation and behavior scales respectively. This research offers data supporting the psychometric validity and reliability of the C-SSRS in nonclinical Spanish-speaking students. Added benefits are flexible scoring and ease of use. This questionnaire yields data on distinct aspects of suicidality and is more parsimonious than administering several questionnaires separately.
Attitudes toward science: measurement and psychometric properties of the Test of Science-Related Attitudes for its use in Spanish-speaking classrooms

Science.gov (United States)

Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina

2016-06-01

Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the Test of Science-Related Attitudes (TOSRA), such as its construct validity, its discriminant and concurrent validity, and its reliability. The evidence presented suggests that TOSRA, in its Spanish-adapted version, has adequate construct validity regarding its theoretical referents, as well as good indexes of reliability. In addition, it determines the attitudes toward science of secondary school students in Santiago de Chile (n = 664) and analyzes the sex variable as a differentiating factor in such attitudes. The analysis by sex revealed low-relevance gender difference. The results are contrasted with those obtained in English-speaking countries. This TOSRA sample showed good psychometric parameters for measuring and evaluating attitudes toward science, which can be used in classrooms of Spanish-speaking countries or with immigrant populations with limited English proficiency.
Psychometric test of the Team Climate Inventory-short version investigated in Dutch quality improvement teams

Directory of Open Access Journals (Sweden)

Nieboer Anna P

2009-07-01

Background: Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods: This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study, 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Results: Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales – vision, participative safety, task orientation and support for innovation – acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected, the four subscales were found to be stable over time, i.e. without significant changes between baseline and end measurement. Conclusion: The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.
Psychometric test of the Team Climate Inventory-short version investigated in Dutch quality improvement teams.

Science.gov (United States)

Strating, Mathilde M H; Nieboer, Anna P

2009-07-24

Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales--vision, participative safety, task orientation and support for innovation--acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.
Psychometric properties of the Flemish translation of the NEECHAM Confusion Scale

Directory of Open Access Journals (Sweden)

Abraham Ivo L

2005-03-01

Background: Determination of a patient's cognitive status by use of a valid and reliable screening instrument is of major importance, as early recognition and accurate diagnosis of delirium are necessary for effective management. This study determined the reliability, validity and diagnostic value of the Flemish translation of the NEECHAM Confusion Scale. Methods: A sample of 54 elderly hip fracture patients with a mean age of 80.9 years (SD = 7.85) was included. To test the psychometric properties of the NEECHAM Confusion Scale, performance on the NEECHAM was compared to the Confusion Assessment Method (CAM) and the Mini-Mental State Examination (MMSE), using aggregated data based on 5 data collection measurement points (repeated measures). The CAM and MMSE served as gold standards. Results: The alpha coefficient for the total NEECHAM score was high (0.88). Principal components analysis yielded a two-component solution accounting for 70.8% of the total variance. High correlations were found between the total NEECHAM scores and the total MMSE (0.75) and total CAM severity scores (-0.73), respectively. Diagnostic values using the CAM algorithm as gold standard showed 76.9% sensitivity, 64.6% specificity, 13.5% positive and 97.5% negative predictive values, respectively. Conclusion: This validation of the Flemish version of the NEECHAM Confusion Scale adds to previous evidence suggesting that this scale holds promise as a valuable screening instrument for delirium in clinical practice. Further validation studies in diverse clinical populations, however, are needed.
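The gap between the low positive and high negative predictive values reported here follows from Bayes' rule when the condition is uncommon. The sketch below shows the arithmetic. The prevalence of 6.7% is an assumption: it is not stated in the abstract and was chosen only because it approximately reproduces the reported predictive values from the reported sensitivity and specificity. The function name is hypothetical.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Positive and negative predictive values via Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    true_neg = specificity * (1 - prevalence)
    false_neg = (1 - sensitivity) * prevalence
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv

# Sensitivity and specificity as reported; prevalence is an assumed value.
ppv, npv = predictive_values(0.769, 0.646, 0.067)
print(round(ppv, 3), round(npv, 3))
```

With the same sensitivity and specificity, a higher assumed prevalence would raise the positive predictive value and lower the negative one, which is why predictive values do not transfer directly between populations.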
Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

Science.gov (United States)

Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

2015-11-01

In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. (c) 2015 APA, all rights reserved.
Psychometric function for NU-6 word recognition in noise: effects of first language and dominant language.

Science.gov (United States)

Shi, Lu-Feng; Zaki, Nancy A

2014-01-01

The present study attempted to establish psychometric function in individuals whose first language is not English. Psychometric function was obtained for one of the most commonly used clinical tests, the Northwestern University Auditory Test No. 6 (Tillman & Carhart 1966), so that findings could be directly applied to everyday clinical practice. Five groups of 14 normal-hearing, adult listeners differing in their first language and dominant language (English monolinguals, English- and Arabic-dominant Arabic-English bilinguals, and English- and Russian-dominant Russian-English bilinguals) participated. Both forms of the Northwestern University Auditory Test No. 6 test (8 lists of 50 monosyllabic English words) were presented. The lists were randomly assigned to eight signal-to-noise ratios (-3 to 18 dB in 3 dB steps). Listeners responded verbally and in writing. Psychometric functions were derived via logistic regression and described by two parameters: the 50% correct performance level (θ) and the slope (k). Both English-dominant bilingual groups obtained psychometric functions comparable with monolinguals. The θ and k of the functions for these three groups of participants were consistent with the literature. Compared with these three groups, non-English-dominant bilinguals' functions grew significantly more gradually (i.e., a significantly higher θ and a significantly lower k). No differences in either θ or k were found between bilinguals with the same dominant language but different first languages. Bilinguals reporting themselves to be dominant in English generate monolingual-like psychometric functions. By contrast, a different set of psychometric properties describes the function of bilinguals dominant in their first language. Because first language did not appear to be a significant factor in determining bilinguals' functions, it is concluded that English learning history and English proficiency are more important variables than first language for
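The two parameters named in this abstract, the 50% correct level θ and the slope k, can be read off a logistic curve. The sketch below assumes the common two-parameter form P(x) = 1 / (1 + exp(-k(x - θ))); the abstract does not give the exact parameterisation used, and the numeric values are hypothetical.

```python
import math

def psychometric(snr_db, theta, k):
    """Proportion correct at a given signal-to-noise ratio (dB).

    theta: SNR at which performance is 50% correct.
    k:     slope; larger values give a steeper function.
    """
    return 1.0 / (1.0 + math.exp(-k * (snr_db - theta)))

# Hypothetical listeners: a higher theta and lower k give a function
# that grows more gradually, as described for the non-English-dominant groups.
steep = [psychometric(x, theta=6.0, k=0.30) for x in range(-3, 19, 3)]
gradual = [psychometric(x, theta=9.0, k=0.20) for x in range(-3, 19, 3)]
print([round(p, 2) for p in steep])
print([round(p, 2) for p in gradual])
```

The range(-3, 19, 3) call generates the eight signal-to-noise ratios described in the abstract (-3 to 18 dB in 3 dB steps).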
Development of a computerized adaptive test for Schizotypy assessment.

Directory of Open Access Journals (Sweden)

Eduardo Fonseca-Pedrero

BACKGROUND: Schizotypal traits in adolescents from the general population represent the behavioral expression of liability for psychotic disorders. Schizotypy assessment in this sector of the population has advanced considerably in the last few years; however, it is necessary to incorporate recent advances in psychological and educational measurement. OBJECTIVE: The main goal of this study was to develop a Computerized Adaptive Test (CAT) to evaluate schizotypy through "The Oviedo Questionnaire for Schizotypy Assessment" (ESQUIZO-Q) in non-clinical adolescents. METHODS: The final sample consisted of 3,056 participants, 1,469 males, with a mean age of 15.9 years (SD = 1.2). RESULTS: The results indicated that the ESQUIZO-Q scores presented adequate psychometric properties under both Classical Test Theory and Item Response Theory. The Information Function estimated using the Graded Response Model indicated that the item pool effectively assesses schizotypy at the high end of the latent trait. The correlation between the CAT total scores and the paper-and-pencil test was 0.92. The mean number of items presented in the CAT with the standard error fixed at ≤ 0.30 was 34. CONCLUSION: The CAT showed adequate psychometric properties for schizotypy assessment in the general adolescent population. The ESQUIZO-Q adaptive version could be used as a screening method for the detection of adolescents at risk for psychosis in both educational and mental health settings.
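The stopping rule mentioned in this abstract (standard error ≤ 0.30) rests on the usual item response theory relation SE(θ) = 1 / sqrt(test information), so the test must accumulate information of at least 1 / 0.30² ≈ 11.1. The sketch below shows only that stopping logic. It is a simplification: the item information values are hypothetical, and a real CAT would re-estimate the trait and select the most informative remaining item after each response.

```python
import math

def items_until_precise(item_infos, se_target=0.30):
    """Number of items needed before SE = 1/sqrt(total information)
    drops to se_target or below; None if the pool is exhausted first."""
    total_info = 0.0
    for n, info in enumerate(item_infos, start=1):
        total_info += info
        if 1.0 / math.sqrt(total_info) <= se_target:
            return n
    return None

# Hypothetical pool in which every administered item contributes 0.5
# units of information at the examinee's trait level.
print(items_until_precise([0.5] * 30))
```

Examinees located where the item pool is less informative need more items to reach the same precision, which is one reason the number of administered items varies across people.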
Psychometric properties of the Hebrew version of the Oswestry Disability Index.

Science.gov (United States)

Gamus, Dorit; Glasser, Saralee; Langner, Elisheva; Beth-Hakimian, Aliza; Caspi, Israel; Carmel, Narin; Siev-Ner, Itzhak; Amir, Hagai; Ziv, A; Papa, M; Lerner-Geva, Liat

2016-06-17

Low back pain (LBP) is one of the most common health complaints, with lifetime prevalence rates as high as 84%. The Oswestry Disability Index (ODI) is often the measure of choice for LBP in both research and clinical settings and, as such, has been translated into 29 languages and dialects. Currently, however, there is no validated version of Hebrew-translated ODI (ODI-H). To examine the psychometric properties of the ODI-H. Cross-culturally appropriate translation into Hebrew was conducted. A convenience sample of 115 participants (Case Group) with LBP and 68 without LBP (Control Group) completed the ODI-H, SF-36 Health Survey, and two Visual Analog Scales (VAS). Internal consistency was α = 0.94 and test-retest reliability for 18 participants repeating the ODI-H was 0.97. No floor or ceiling effects were noted for Cases, although there was a floor effect for the Control Group. Scores were significantly different for the two groups, indicating discriminant validity. Concurrent validity was reflected by significant correlations with SF-36 scores, particularly the Physical Functioning and Bodily Pain subscales (-0.83 and -0.79, respectively) and with the VAS (0.84 and 0.79). The ODI-H is a valid and reliable measure of low back pain-related disability for the Hebrew-speaking public.
International field testing of the psychometric properties of an EORTC quality of life module for oral health: the EORTC QLQ-OH15.

Science.gov (United States)

Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B

2016-09-01

This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients from 10 countries with mixed cancers, in different treatment phases, completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54% female) were analyzed. Completion took issues were addressed. Analyses suggested a revision of the phase III hypothesized scale structure. Two items were deleted based on a high degree of item misfit, together with negative patient feedback. The remaining 15 items formed one eight-item scale named OH-QoL score, a two-item information scale, a two-item scale regarding dentures, and three single items (sticky saliva/mouth soreness/sensitivity to food/drink). Face and convergent validity and internal consistency were confirmed. Test-retest reliability (n = 60) was demonstrated as was RCA for patients undergoing chemotherapy (n = 117; p = 0.06). The resulting QLQ-OH15 discriminated between clinically distinct patient groups, e.g., low performance status vs. higher (p < 0.001), and head-and-neck cancer versus other cancers (p < 0.03). The EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.
The Mini-Social Phobia Inventory: psychometric properties in an adolescent general population sample.

Science.gov (United States)

Ranta, Klaus; Kaltiala-Heino, Riittakerttu; Rantanen, Päivi; Marttunen, Mauri

2012-07-01

Onset of social phobia (SP) typically occurs in adolescence. Short screening instruments for its assessment are needed for use in primary health and school settings. The 3-item Mini-Social Phobia Inventory (SPIN) has demonstrated effectiveness in screening for generalized SP (GSP) in adults. This study examined the psychometrics of the Mini-SPIN in an adolescent general population sample. Three hundred fifty adolescents aged 12 to 17 years were clinically interviewed using the Schedule for Affective Disorders and Schizophrenia for School-Age Children-Present and Lifetime Version for identification of SP and other Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Axis I disorders, blind to their Mini-SPIN status. Associations between SP; subclinical SP; other anxiety, depressive, and disruptive disorders; and Mini-SPIN scores were examined, and diagnostic efficiency statistics were calculated. The association between Mini-SPIN scores and the generalized subtype of SP was also examined. As in adults, the Mini-SPIN items differentiated subjects with SP from those without. A score of 6 points or greater was found optimal in predicting SP with a sensitivity of 86%, specificity of 84%, and positive and negative predictive values of 26% and 99%. The Mini-SPIN also possessed discriminative validity, as scores were higher for adolescents with SP than they were for those with depressive, disruptive, and other anxiety disorders. The Mini-SPIN was also able to differentiate adolescents with GSP from the rest of the sample. The Mini-SPIN has good psychometrics for screening SP in adolescents from the general population and may have value in screening for GSP. Copyright © 2012 Elsevier Inc. All rights reserved.
Psychometric evaluation of the Sheehan Disability Scale in adult patients with attention-deficit/hyperactivity disorder

Directory of Open Access Journals (Sweden)

Coles T

2014-05-01

Theresa Coles,¹ Cheryl Coon,¹ Carla DeMuro,¹ Lori McLeod,¹ Ari Gnanasakthy² (¹Patient-Reported Outcomes, RTI Health Solutions, Research Triangle Park, NC; ²Novartis Pharmaceuticals, East Hanover, NJ, USA). Abstract: Inattention and impulsivity symptoms are common among adults with attention-deficit/hyperactivity disorder (ADHD), which can lead to difficulty concentrating, restlessness, difficulty completing tasks, disorganization, impatience, and impulsiveness. Many adults with ADHD find it difficult to focus and prioritize. Resulting outcomes, such as missed deadlines and forgotten engagements, may ultimately impact the ability to function at work, school, home, or in a social environment. The European Medicines Agency guidelines for evaluating medicinal products for ADHD recommend inclusion of both functional outcomes, such as school, social, or work functioning, and outcomes related to symptoms of ADHD in clinical studies of novel medication primary efficacy endpoints. Due to its performance in other disease areas and the relevance of its items as evidenced by content validity analyses, the Sheehan Disability Scale (SDS) was chosen to assess functional impairment in ADHD. The aim of this study was to investigate the psychometric properties of the SDS, used as a brief measure of functional impairment in a number of psychiatric disorders, in adult patients with ADHD. To the authors' knowledge, this is the first study to evaluate the reliability of the SDS (based on Cronbach's coefficient alpha and test-retest reliability), its validity (construct and known-groups validity), and its ability to detect change in this patient population. This study also established a preliminary responder definition for the SDS in this study population to determine when change can be considered clinically beneficial in a clinical trial setting. The psychometric results support the use of the SDS subscales (items 1–3) and total score (sum of items 1–3) in an ADHD
A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

Science.gov (United States)

Kamens, David H.

2015-01-01

This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…
On the Psychometric Study of Human Life History Strategies.

Science.gov (United States)

Richardson, George B; Sanning, Blair K; Lai, Mark H C; Copping, Lee T; Hardesty, Patrick H; Kruger, Daniel J

2017-01-01

This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS), provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a) use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b) confirm the dimensionality of life history indicators, and (c) establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested: there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
On the Psychometric Study of Human Life History Strategies

Directory of Open Access Journals (Sweden)

George B. Richardson

2017-02-01

Full Text Available This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS), provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a) use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b) confirm the dimensionality of life history indicators, and (c) establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested: there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.

The Role of Psychometrics in Individual Differences Research in Cognition: A Case Study of the AX-CPT

Directory of Open Access Journals (Sweden)

Shelly R. Cooper

2017-09-01

Full Text Available Investigating individual differences in cognition requires addressing questions not often thought about in standard experimental designs, especially regarding the psychometric properties of the task. Using the AX-CPT cognitive control task as a case study example, we address four concerns that one may encounter when researching the topic of individual differences in cognition. First, we demonstrate the importance of variability in task scores, which in turn directly impacts reliability, particularly when comparing correlations in different populations. Second, we demonstrate the importance of variability and reliability for evaluating potential failures to replicate predicted correlations, even within the same population. Third, we demonstrate how researchers can turn to evaluating psychometric properties as a way of evaluating the feasibility of utilizing the task in new settings (e.g., online administration). Lastly, we show how the examination of psychometric properties can help researchers make informed decisions when designing a study, such as determining the appropriate number of trials for a task.
The Role of Psychometrics in Individual Differences Research in Cognition: A Case Study of the AX-CPT

Science.gov (United States)

Cooper, Shelly R.; Gonthier, Corentin; Barch, Deanna M.; Braver, Todd S.

2017-01-01

Investigating individual differences in cognition requires addressing questions not often thought about in standard experimental designs, especially regarding the psychometric properties of the task. Using the AX-CPT cognitive control task as a case study example, we address four concerns that one may encounter when researching the topic of individual differences in cognition. First, we demonstrate the importance of variability in task scores, which in turn directly impacts reliability, particularly when comparing correlations in different populations. Second, we demonstrate the importance of variability and reliability for evaluating potential failures to replicate predicted correlations, even within the same population. Third, we demonstrate how researchers can turn to evaluating psychometric properties as a way of evaluating the feasibility of utilizing the task in new settings (e.g., online administration). Lastly, we show how the examination of psychometric properties can help researchers make informed decisions when designing a study, such as determining the appropriate number of trials for a task. PMID:28928690
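The link between score variability and reliability described in this record can be illustrated with the standard classical test theory definitions. The sketch below is not taken from the article; the function names and the numbers are hypothetical, and it assumes the usual decomposition of observed variance into true-score variance plus error variance.

```python
def reliability(true_var, error_var):
    """Classical test theory: share of observed variance that is true-score variance."""
    return true_var / (true_var + error_var)


def attenuated_correlation(true_r, rel_x, rel_y):
    """Observed correlation expected when both measures are imperfectly reliable."""
    return true_r * (rel_x * rel_y) ** 0.5


# Hypothetical example: the same task (same error variance) given to two populations.
# The population with less between-person variability yields lower reliability.
wide_sample = reliability(true_var=4.0, error_var=1.0)
narrow_sample = reliability(true_var=1.0, error_var=1.0)
print(wide_sample, narrow_sample)

# A true correlation of 0.5 shrinks when one measure has reliability 0.64.
print(attenuated_correlation(0.5, 0.64, 1.0))
```

Under these assumptions, a failure to replicate a correlation in a more homogeneous sample may reflect reduced reliability and not the absence of an effect.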
Psychometric evaluation of an item bank for computerized adaptive testing of the EORTC QLQ-C30 cognitive functioning dimension in cancer patients

DEFF Research Database (Denmark)

Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J. B.

2017-01-01

on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). METHODS: In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients...... model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study...... sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. CONCLUSION: A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient...
Center for Epidemiologic Studies Depression Scale for Children: psychometric testing of the Chinese version.

Science.gov (United States)

Li, Ho Cheung William; Chung, Oi Kwan Joyce; Ho, Ka Yan

2010-11-01

This paper is a report of psychometric testing of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children. The availability of a valid and reliable instrument that accurately detects depressive symptoms in children is crucial before any psychological intervention can be appropriately planned and evaluated. There is no such instrument for Chinese children. A test-retest, within-subjects design was used. A total of 313 primary school students between the ages of 8 and 12 years were invited to participate in the study in 2009. Participants were asked to respond to the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children, the short form of the State Anxiety Scale for Children and Rosenberg's Self-Esteem Scale. The internal consistency, content validity, construct validity and test-retest reliability of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children were assessed. The newly-translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly-translated scale can be used as a self-report assessment tool in detecting depressive symptoms of Chinese children aged between 8 and 12 years. © 2010 Blackwell Publishing Ltd.
Mental Illness Related Internalized Stigma: Psychometric Properties of the Brief ISMI Scale in Greece.

Science.gov (United States)

Paraskevoulakou, Alexia; Vrettou, Kassiani; Pikouli, Katerina; Triantafillou, Evgenia; Lykou, Anastasia; Economou, Marina

2017-09-01

Since evaluation regarding the impact of mental illness related internalized stigma is scarce, there is a great need for psychometric instruments which could contribute to understanding its adverse effects among Greek patients with severe mental illness. The Brief Internalized Stigma of Mental Illness (ISMI) scale is one of the most widely used measures designed to assess the subjective experience of stigma related to mental illness. The present study aimed to investigate the psychometric properties of the Greek version of the Brief ISMI scale. In addition to presenting psychometric findings, we explored the relationship of the Greek version of the Brief ISMI subscales with indicators of self-esteem and quality of life. 272 outpatients (108 males, 164 females) meeting the DSM-IV TR criteria for severe mental disorder (schizophrenia, bipolar disorder, major depression) completed the Brief ISMI, the RSES and the WHOQOL-BREF scales. Patients reported age and educational level. A retest was conducted with 124 patients. The Cronbach's alpha coefficient was 0.83. The test-retest reliability coefficients varied from 0.81 to 0.91, indicating substantial agreement. The ICC was 0.83 for the total score and 0.69 and 0.77 for the two factors, respectively. Factor analysis provided strong evidence for a two factor model. Factors 1 and 2 were named respectively "how others view me" and "how I view myself". They were negatively correlated with both RSES and WHOQOL-BREF scales, as well as with educational level. Factor 2 was significantly associated with the type of diagnosis. The Greek version of the Brief ISMI scale can be used as a reliable and valid tool for assessing mental illness related internalized stigma among Greek patients with severe mental illness.
Scores of a web-based version of the seasonal pattern assessment questionnaire in Brazil

Directory of Open Access Journals (Sweden)

Denis Martinez

2015-12-01

Full Text Available Introduction: Seasonal affective disorder (SAD) is a proposed mental disorder that remains controversial. This condition is prevalent in northern latitudes, but few studies have been conducted at locations in the southern hemisphere. It is usually assessed by the Seasonal Pattern Assessment Questionnaire (SPAQ). This study aimed to evaluate, through an on-line questionnaire, the hypothesis that, in the Brazilian population, latitude and longitude influence SPAQ scores. Methods: An advertisement was posted on a sleep medicine website inviting visitors to investigate seasonal patterns of behavior and mood, using a Brazilian Portuguese version of the SPAQ. The geographic coordinates of the place of residence of each respondent were analyzed as a continuous variable or distributed in quartiles of latitude and longitude. The psychometric properties of the SPAQ were assessed by reliability and factor analyses. Results: Answers from 1001 respondents out of 1045 were considered eligible. High SPAQ scores were observed in 287 respondents, equally distributed among all latitude and longitude quartiles. Data collected in different seasons and during daylight saving time did not differ significantly in any of the scores for SPAQ dimensions. No correlations between SPAQ scores and latitude or longitude were observed. Psychometric properties of the SPAQ were preserved in all geographic locations. Conclusion: The finding of similar SPAQ scores at a wide latitude range defies the concept of SAD symptoms as latitude or longitude-dependent phenomena.
Measuring anxiety after spinal cord injury: Development and psychometric characteristics of the SCI-QOL Anxiety item bank and linkage with GAD-7.

Science.gov (United States)

Kisala, Pamela A; Tulsky, David S; Kalpakjian, Claire Z; Heinemann, Allen W; Pohlig, Ryan T; Carle, Adam; Choi, Seung W

2015-05-01

To develop a calibrated item bank and computer adaptive test to assess anxiety symptoms in individuals with spinal cord injury (SCI), transform scores to the Patient Reported Outcomes Measurement Information System (PROMIS) metric, and create a statistical linkage with the Generalized Anxiety Disorder (GAD)-7, a widely used anxiety measure. Grounded-theory based qualitative item development methods; large-scale item calibration field testing; confirmatory factor analysis; graded response model item response theory analyses; statistical linking techniques to transform scores to a PROMIS metric; and linkage with the GAD-7. Setting Five SCI Model System centers and one Department of Veterans Affairs medical center in the United States. Participants Adults with traumatic SCI. Spinal Cord Injury-Quality of Life (SCI-QOL) Anxiety Item Bank Seven hundred sixteen individuals with traumatic SCI completed 38 items assessing anxiety, 17 of which were PROMIS items. After 13 items (including 2 PROMIS items) were removed, factor analyses confirmed unidimensionality. Item response theory analyses were used to estimate slopes and thresholds for the final 25 items (15 from PROMIS). The observed Pearson correlation between the SCI-QOL Anxiety and GAD-7 scores was 0.67. The SCI-QOL Anxiety item bank demonstrates excellent psychometric properties and is available as a computer adaptive test or short form for research and clinical applications. SCI-QOL Anxiety scores have been transformed to the PROMIS metric and we provide a method to link SCI-QOL Anxiety scores with those of the GAD-7.
Psychometric properties of the Female Sexual Distress Scale-Revised among a sample of non-clinical Iranian women.

Science.gov (United States)

Ghassami, Maryam; Asghari, Ali; Shaeeri, Mohammad R; Soltaninejad, Zahra; Safarinejad, Mohammad R

2014-10-01

The present study aimed to investigate the psychometric properties of a Persian version of the Female Sexual Distress Scale-Revised (P-FSDS-R) among a sample of healthy Iranian women. A total of 562 healthy Iranian women completed a battery of questionnaires, including the P-FSDS-R, Depression Anxiety Stress Scales (DASS), Positive and Negative Affect Scales (PANAS) and Locke-Wallace Marital Adjustment Test (LWMAT). The factor structure and the convergent and divergent validity of the P-FSDS-R were examined, using exploratory and confirmatory factor analysis and Pearson product-moment correlations, respectively. To examine the discriminant validity of the P-FSDS-R, data collected from 562 healthy participants were compared with data from 108 women with sexual problems who completed the P-FSDS-R measure. The results of exploratory and confirmatory factor analyses indicate that the P-FSDS-R is conceptualized within a one-factor model. The results also indicate that the P-FSDS-R has good internal consistency and test-retest reliability. Significant correlations in the predicted directions between the P-FSDS-R scores and the scores of DASS, PANAS and LWMAT support both the convergent and divergent validity of the FSDS-R. The results also indicate that the P-FSDS-R scores significantly differentiated women with and without sexual problems. In general, these findings support the reliability and the validity of the P-FSDS-R among Iranian women.
The Body Appreciation Scale-2: item refinement and psychometric evaluation.

Science.gov (United States)

Tylka, Tracy L; Wood-Barcalow, Nichole L

2015-01-01

Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Psychometric properties of the Vulnerability to Abuse Screening Scale for screening abuse of older adults

Directory of Open Access Journals (Sweden)

Raquel Batista Dantas

Full Text Available ABSTRACT OBJECTIVE Adapt and evaluate the psychometric properties of the Vulnerability to Abuse Screening Scale to identify risk of domestic violence against older adults in Brazil. METHODS The instrument was adapted and validated in a sample of 151 older adults from a geriatric reference center in the municipality of Belo Horizonte, State of Minas Gerais, in 2014. We collected sociodemographic, clinical, and abuse-related information, and verified reliability by reproducibility in a sample of 55 older people, who underwent re-testing of the instrument seven days after the first application. Descriptive and comparative analyses were performed for all variables, with a significance level of 5%. The construct validity was analyzed by the principal components method with a tetrachoric correlation matrix, the reliability of the scale by the weighted Kappa (Kp) statistic, and the internal consistency by the Kuder-Richardson estimator formula 20 (KR-20). RESULTS The average age of the participants was 72.1 years (SD = 6.96; 95%CI 70.94–73.17), with a maximum of 92 years, and they were predominantly female (76.2%; 95%CI 69.82–83.03). When analyzing the relationship between the scores of the Vulnerability to Abuse Screening Scale, categorized by presence (score > 3) or absence (score < 3) of vulnerability to abuse, with clinical and health conditions, we found statistically significant differences for self-perception of health (p = 0.002), depressive symptoms (p = 0.000), and presence of rheumatism (p = 0.003). There were no statistically significant differences between sexes. The Vulnerability to Abuse Screening Scale acceptably evaluated validity in the transcultural adaptation process, demonstrating dimensionality coherent with the original proposal (four factors). In the internal consistency analysis, the instrument presented good results (KR-20 = 0.69) and the reliability via reproducibility was considered excellent for the global scale (Kp = 0
Psychometric assessment of the Spiritual Climate Scale Arabic version for nurses in Saudi Arabia.

Science.gov (United States)

Cruz, Jonas Preposi; Albaqawi, Hamdan Mohammad; Alharbi, Sami Melbes; Alicante, Jerico G; Vitorino, Luciano M; Abunab, Hamzeh Y

2017-12-07

To assess the psychometric properties of the Spiritual Climate Scale Arabic version for Saudi nurses. Evidence showed that a high level of spiritual climate in the workplace is associated with increased productivity and performance, enhanced emotional intelligence, organisational commitment and job satisfaction among nurses. A convenience sample of 165 Saudi nurses was surveyed in this descriptive, cross-sectional study. Cronbach's α and intraclass correlation coefficient of the 2-week test-retest scores were computed to establish reliability. Exploratory factor analysis was performed to support the validity of the Spiritual Climate Scale Arabic version. The Spiritual Climate Scale Arabic version manifested excellent content validity. Exploratory factor analysis supported a single factor with an explained variance of 73.2%. The Cronbach's α values of the scale ranged from .79 to .88, while the intraclass correlation coefficient value was .90. The perceived spiritual climate was associated with the respondents' hospital, gender, age and years of experience. Findings of this study support the sound psychometric properties of the Spiritual Climate Scale Arabic version. The Spiritual Climate Scale Arabic version can be used by nurse managers to assess the nurses' perception of the spiritual climate in any clinical area. This process can lead to spiritually centred interventions, thereby ensuring a clinical climate that accepts and respects different spiritual beliefs and practices. © 2017 John Wiley & Sons Ltd.
A systematic review protocol investigating tests for physical or physiological qualities and game-specific skills commonly used in rugby and related sports and their psychometric properties.

Science.gov (United States)

Chiwaridzo, Matthew; Ferguson, Gillian D; Smits-Engelsman, Bouwien C M

2016-07-27

Scientific focus on rugby has increased over the recent years, providing evidence of the physical or physiological characteristics and game-specific skills needed in the sport. Identification of tests commonly used to measure these characteristics is important for the development of test batteries, which in turn may be used for talent identification and injury prevention programmes. Although there are a number of tests available in the literature to measure physical or physiological variables and game-specific skills, there is limited information available on the psychometric properties of the tests. Therefore, the purpose of this study is to systematically review the literature for tests commonly used in rugby to measure physical or physiological characteristics and rugby-specific skills, documenting evidence of reliability and validity of the identified tests. A systematic review will be conducted. Electronic databases such as Scopus, MEDLINE via EBSCOhost and PubMed, Academic Search Premier, CINAHL and Africa-Wide Information via EBSCOhost will be searched for original research articles published in English from January 1, 1995, to December 31, 2015, using a pre-defined search strategy. The principal investigator will select potentially relevant articles from titles and abstracts. To minimise bias, full text of titles and abstracts deemed potentially relevant will be retrieved and reviewed by two independent reviewers based on the inclusion criteria. Data extraction will be conducted by the principal investigator and verified by two independent reviewers. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist will be used to assess the methodological quality of the selected studies. Choosing an appropriate test to be included in the screening test battery should be based on sound psychometric properties of the test available. This systematic review will provide an overview of the tests commonly used in rugby union
Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

Science.gov (United States)

Haverinen-Shaughnessy, Ulla; Shaughnessy, Richard J

2015-01-01

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.
"Psychometric properties of the PTSD Checklist for Diagnostic and Statistical Manual of Mental Disorders-Fifth Edition (PCL-5) in veterans": Correction to Bovin et al. (2016).

Science.gov (United States)

2017-06-01

Reports an error in "Psychometric properties of the PTSD Checklist for Diagnostic and Statistical Manual of Mental Disorders-Fifth Edition (PCL-5) in veterans" by Michelle J. Bovin, Brian P. Marx, Frank W. Weathers, Matthew W. Gallagher, Paola Rodriguez, Paula P. Schnurr and Terence M. Keane (Psychological Assessment, 2016[Nov], Vol 28[11], 1379-1391). In the article, the departments and affiliations were incorrectly listed for authors Michelle J. Bovin, Brian P. Marx, Matthew W. Gallagher, Paola Rodriguez, Paula P. Schnurr, and Terence M. Keane. The first department and affiliation for authors Michelle J. Bovin, Brian P. Marx, Matthew W. Gallagher, Paola Rodriguez, and Terence M. Keane should have read "National Center for PTSD at VA Boston Healthcare System, Boston, Massachusetts". The first department and affiliation for author Paula P. Schnurr should have read "National Center for PTSD, White River Junction, Vermont." The online version of this article has been corrected. (The following abstract of the original article appeared in record 2015-55809-001.) This study examined the psychometric properties of the posttraumatic stress disorder (PTSD) Checklist for Diagnostic and Statistical Manual of Mental Disorders-Fifth Edition (PCL-5; Weathers, Litz, et al., 2013b) in 2 independent samples of veterans receiving care at a Veterans Affairs Medical Center (N = 468). A subsample of these participants (n = 140) was used to define a valid diagnostic cutoff score for the instrument using the Clinician-Administered PTSD Scale for DSM-5 (CAPS-5; Weathers, Blake, et al., 2013) as the reference standard. The PCL-5 test scores demonstrated good internal consistency (α = .96), test-retest reliability (r = .84), and convergent and discriminant validity. Consistent with previous studies (Armour et al., 2015; Liu et al., 2014), confirmatory factor analysis revealed that the data were best explained by a 6-factor anhedonia model and a 7-factor hybrid model. Signal
Development and psychometric evaluation of the self-assessment of psoriasis symptoms (SAPS) - clinical trial and the SAPS - real world patient-reported outcomes.

Science.gov (United States)

Armstrong, April W; Banderas, Benjamin; Foley, Catherine; Stokes, Jonathan; Sundaram, Murali; Shields, Alan L

2017-09-01

The Self-Assessment of Psoriasis Symptoms - Clinical Trials (SAPS-CT) and SAPS - Real World (SAPS-RW) were simultaneously created to assess the experience of plaque psoriasis in two unique contexts. Qualitative and quantitative research was conducted in four phases: concept elicitation, questionnaire construction, content evaluation and psychometric evaluation. Following concept elicitation, 18 concepts were selected to inform questionnaire construction of the SAPS-CT and SAPS-RW. To accommodate each context of use, the SAPS-CT asks respondents to rate the target symptom "at its worst" in the 24 h prior to assessment, while the SAPS-RW asks respondents to rate the target symptom "on average" in the 7 days prior to assessment. Cognitive debriefing confirmed that patients could comprehend and provide meaningful responses to both versions and, after minor modifications, resulted in 11-item questionnaires administered in an observational study (N = 200). Results from the observational study informed further item reduction (SAPS-RW to six items and SAPS-CT to nine items) and demonstrated that scores from each were reliable (Cronbach's α > 0.90, test-retest intraclass correlation coefficient > 0.70), construct valid and able to differentiate among clinically distinct groups. The SAPS-CT and SAPS-RW are content-valid PRO questionnaires capable of producing psychometrically sound scores when administered to chronic plaque psoriasis patients.
Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

Science.gov (United States)

Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

2011-01-01

The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

Science.gov (United States)

Tepe, Rodger; Tepe, Chabha

2015-03-01

To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
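The internal-consistency statistics reported in this record (Cronbach's α and KR20) follow standard textbook formulas. The sketch below is illustrative only: the function names and the toy response matrix are hypothetical, not data from the study, and population variances are assumed throughout, under which KR-20 is simply Cronbach's α applied to 0/1 items.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """scores: one row per respondent, one column per item."""
    k = len(scores[0])
    item_var_sum = sum(pvariance(col) for col in zip(*scores))
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


def kr20(scores):
    """Kuder-Richardson formula 20 for dichotomous (0/1) items."""
    k = len(scores[0])
    n = len(scores)
    pq_sum = 0.0
    for col in zip(*scores):
        p = sum(col) / n          # proportion answering the item correctly
        pq_sum += p * (1 - p)     # item variance for a 0/1 item
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - pq_sum / total_var)


# Hypothetical responses: 5 respondents x 4 right/wrong items.
toy = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(kr20(toy), 3))            # 0.8
print(round(cronbach_alpha(toy), 3))  # 0.8
```

Statistical packages may use sample variances instead of population variances; for α the choice cancels in the ratio, but KR-20 values can differ slightly depending on the convention.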
Psychometric properties of the original and short versions of the Falls Efficacy Scale-International (FES-I) in people with Parkinson's disease.

Science.gov (United States)

Jonasson, Stina B; Nilsson, Maria H; Lexell, Jan

2017-05-31

Fear of falling is common in people with Parkinson's disease (PD) and is associated with an increased risk for future falls, activity limitations and a reduced quality of life. The Falls Efficacy Scale-International (FES-I) assesses fear of falling conceptualized as concerns about falling. The original FES-I has good psychometric properties in people with PD, but whether this applies also for the short version of FES-I remains to be shown. The aim of the present study was to evaluate the psychometric properties of the short FES-I and to compare these with the original FES-I in the same sample of people with PD. The investigated psychometric properties included known groups validity, data completeness, scaling assumptions, targeting and reliability. A postal survey, which included the original, full-length FES-I, was distributed to 174 people with PD. Responders received a second survey after two weeks. From these data, short FES-I total scores were calculated by extracting the items that are included in the short version of the scale. Median age and PD duration of the 101 responders (43% women) were 73 and 5 years, respectively. The original as well as the short FES-I scores were able to discriminate (p falling, activity avoidance, falls, near falls, and with various self-rated PD severity, respectively. Both versions of FES-I had a high level of data completeness (0.7 to 0.9% missing item responses). Scaling assumptions were acceptable for the original as well as the short FES-I. While the short FES-I had 19% floor effect, the original version was better targeted. Both versions were reliable and obtained high values for internal consistency (Cronbach's alpha >0.8) and test-retest reliability (Intraclass Correlation Coefficient > 0.9). Both the original and short FES-I revealed generally good psychometric properties in people with PD, although the original scale was better targeted. Due to the higher floor effect in the short FES-I, the present findings favors
Testing statistical significance scores of sequence comparison methods with structure similarity

Directory of Open Access Journals (Sweden)

Leunissen Jack AM

2006-10-01

Background: In recent years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search are determined not only by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested in whether this could be validated when applied to existing, evolutionarily related protein sequences. Results: All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. Conclusion: The compute-intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.
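As an illustration of what a Monte-Carlo Z-score for an alignment involves, the following is a minimal sketch and not the implementation used by CluSTr, Protein World, or the study above. It assumes a toy scoring scheme (match +2, mismatch -1, linear gap -2) in place of a real substitution matrix, and the function names and the shuffle count are hypothetical. The observed Smith-Waterman score is compared with the scores obtained after repeatedly shuffling one of the sequences.

```python
import random
import statistics


def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score with a linear gap penalty (toy scoring)."""
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Local alignment: a cell never drops below zero.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def monte_carlo_z_score(a, b, n_shuffles=200, seed=0):
    """Z = (observed - mean of shuffled scores) / std of shuffled scores."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    observed = smith_waterman_score(a, b)
    letters = list(b)
    null_scores = []
    for _ in range(n_shuffles):
        rng.shuffle(letters)  # keeps composition, destroys residue order
        null_scores.append(smith_waterman_score(a, "".join(letters)))
    mu = statistics.mean(null_scores)
    sigma = statistics.pstdev(null_scores)
    if sigma == 0:
        return float("inf") if observed > mu else 0.0
    return (observed - mu) / sigma
```

Each Z-score requires one alignment per shuffle, which is why the abstract describes the approach as compute-intensive compared with an analytically estimated e-value.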
Development and psychometric testing of the Nurse Practitioner Primary Care Organizational Climate Questionnaire.

Science.gov (United States)

Poghosyan, Lusine; Nannini, Angela; Finkelstein, Stacey R; Mason, Emanuel; Shaffer, Jonathan A

2013-01-01

Policy makers and healthcare organizations are calling for expansion of the nurse practitioner (NP) workforce in primary care settings to assure timely access and high-quality care for the American public. However, many barriers, including those at the organizational level, exist that may undermine NP workforce expansion and their optimal utilization in primary care. This study developed a new NP-specific survey instrument, the Nurse Practitioner Primary Care Organizational Climate Questionnaire (NP-PCOCQ), to measure organizational climate in primary care settings and conducted its psychometric testing. Using instrument development design, the organizational climate domain pertinent for primary care NPs was identified. Items were generated from the evidence and qualitative data. Face and content validity were established through two expert meetings. Content validity index was computed. The 86-item pool was reduced to 55 items, which were pilot tested with 81 NPs using mailed surveys and then field-tested with 278 NPs in New York State. SPSS 18 and Mplus software were used for item analysis, reliability testing, and maximum likelihood exploratory factor analysis. The Nurse Practitioner Primary Care Organizational Climate Questionnaire had face and content validity. The content validity index was .90. Twenty-nine items loaded on four subscale factors: professional visibility, NP-administration relations, NP-physician relations, and independent practice and support. The subscales had high internal consistency reliability. Cronbach's alphas ranged from .87 to .95. Having a strong instrument is important to promote future research. Also, administrators can use it to assess organizational climate in their clinics and propose interventions to improve it, thus promoting NP practice and the expansion of NP workforce.
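Several records in this listing report internal consistency as Cronbach's alpha. The sketch below shows the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), under the assumption of complete data (no missing responses). The function name and the input layout (one row per respondent) are illustrative choices, not taken from any of the studies.

```python
import statistics


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows, one row of k item scores per respondent."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Transpose rows into per-item columns and sum the sample variances.
    item_columns = list(zip(*responses))
    item_variance_sum = sum(statistics.variance(col) for col in item_columns)
    # Variance of each respondent's total (summed) score.
    total_variance = statistics.variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)
```

With perfectly consistent items, for example `cronbach_alpha([[1, 1, 1], [2, 2, 2], [3, 3, 3]])`, the item variances sum to 3 and the total-score variance is 9, so alpha works out to 1.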

A Computer-Based Sustained Visual Attention Test for Pre-School Children: Design, Development and Psychometric Properties

Directory of Open Access Journals (Sweden)

Roohollah Zahedian Nasb

2016-06-01

Background: Sustained visual attention is a prerequisite for learning and memory. Early evaluation of attention in childhood is essential for children's future school and career success. The aim of this study was to design and develop the computer-based sustained visual attention test (SuVAT) for healthy preschool children aged 4-6 with their special needs, and to investigate its psychometric properties (content, face and convergent validity, and test-retest and internal consistency reliability). Methods: This study was carried out in two stages. In the first stage, the computer-based SuVAT was developed in two versions, original and parallel. Then test-retest and internal consistency reliability were examined using intra-class correlation and Cronbach's alpha coefficients, respectively; face validity was assessed by gathering ideas from 10 preschool children, content validity was evaluated using the CVI and CVR methods, and convergent validity of the SuVAT with the CPT was assessed using Pearson correlation. Results: The developed test showed good content and face validity and excellent test-retest reliability. In addition, the assessment of internal consistency indicated high internal consistency (Cronbach's alpha = 0.869). The SuVAT and the CPT were positively correlated in the convergent validity testing. Conclusion: The SuVAT, with good reliability and validity, could be used as an acceptable assessment of sustained attention in preschool children.
The Alzheimer's Disease Knowledge Scale: Development and Psychometric Properties

Science.gov (United States)

Carpenter, Brian D.; Balsis, Steve; Otilingam, Poorni G.; Hanson, Priya K.; Gatz, Margaret

2009-01-01

Purpose: This study provides preliminary evidence for the acceptability, reliability, and validity of the new Alzheimer's Disease Knowledge Scale (ADKS), a content and psychometric update to the Alzheimer's Disease Knowledge Test. Design and Methods: Traditional scale development methods were used to generate items and evaluate their psychometric…
Psychometric evaluation of the impact of weight on quality of life-lite questionnaire (IWQOL-lite) in a community sample.

Science.gov (United States)

Kolotkin, Ronette L; Crosby, Ross D

2002-03-01

The short form of impact of weight on quality of life (IWQOL)-Lite is a 31-item, self-report, obesity-specific measure of health-related quality of life (HRQOL) that consists of a total score and scores on each of five scales--physical function, self-esteem, sexual life, public distress, and work--and that exhibits strong psychometric properties. This study was undertaken in order to assess test-retest reliability and discriminant validity in a heterogeneous sample of individuals not in treatment. Individuals were recruited from the community to complete questionnaires that included the IWQOL-Lite, SF-36, Rosenberg self-esteem (RSE) scale, Marlowe-Crowne social desirability scale, global ratings of quality of life, and sexual functioning and public distress ratings. Persons currently enrolled in weight loss programs or with a body mass index (BMI) of less than 18.5 were dropped from the analyses, leaving 341 females and 153 males for analysis, with an average BMI of 27.4. For test-retest reliability, 112 participants completed the IWQOL-Lite again. ANOVA revealed significant main effects for BMI for all IWQOL-Lite scales and total score. Females showed greater impairment than males on all scales except public distress. Internal consistency ranged from 0.816 to 0.944 for IWQOL-Lite scales and was 0.958 for total score. Test-retest reliability ranged from 0.814 to 0.877 for scales and was 0.937 for total score. Internal consistency and test-retest results for overweight/obese subjects were similar to those obtained for the total sample. There was strong evidence for convergent and discriminant validity of the IWQOL-Lite in overweight/obese subjects. As in previous studies conducted on treatment-seeking obese persons, the IWQOL-Lite appears to be a reliable and valid measure of obesity-specific quality of life in overweight/obese persons not seeking treatment.
High Test Scores: The Wrong Road to National Economic Success

Science.gov (United States)

Baker, Keith

2011-01-01

A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…
Relationships between spatial activities and scores on the mental rotation test as a function of sex.

Science.gov (United States)

Ginn, Sheryl R; Pickens, Stefanie J

2005-06-01

Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.
Development and Initial Psychometric Assessment of the Plant Attitude Questionnaire

Science.gov (United States)

Fančovičová, Jana; Prokop, Pavol

2010-10-01

Plants are integral parts of ecosystems which determine life on Earth. People's attitudes toward them are, however, largely overlooked. Here we present an initial psychometric assessment of the self-constructed Plant Attitude Scale (PAS), which was administered to a sample of 310 Slovakian students aged 10-15 years living in rural areas. The final version of the PAS consists of 29 Likert-scale items that loaded on four distinct dimensions (Interest, Importance, Urban trees and Utilization). Mean scores revealed that Slovakian students lack positive attitudes toward plants and that gender had no effect on their mean attitude scores. Living in a family with a garden was associated with a more positive attitude toward plants. Further correlative research on diverse samples containing urban children and experimental research examining the impact of gardening in schools on student attitudes toward plants is required.
Negative symptoms in bipolar disorder and schizophrenia: A psychometric evaluation of the brief negative symptom scale across diagnostic categories.

Science.gov (United States)

Strauss, Gregory P; Vertinski, Mary; Vogel, Sally J; Ringdahl, Erik N; Allen, Daniel N

2016-02-01

Past studies have demonstrated that the Brief Negative Symptom Scale (BNSS) has excellent psychometric properties in patients with schizophrenia. In the current study, we extended this literature by examining psychometric properties of the BNSS in outpatients diagnosed with bipolar disorder (n=46), outpatients with schizophrenia (n=50), and healthy controls (n=27). Participants completed neuropsychological testing and a clinical interview designed to assess negative, positive, disorganized, mood, and general psychiatric symptoms. Results indicated differences among the 3 groups in the severity of all BNSS items, with SZ and BD scoring higher than CN; however, SZ and BD only differed on blunted affect and alogia items, not anhedonia, avolition, or asociality. BD patients with a history of psychosis did not differ from those without a history of psychosis on negative symptom severity. The BNSS had excellent internal consistency in SZ, BD, and CN groups. Good convergent and discriminant validity was apparent in SZ and BD groups, as indicated by relationships between the BNSS and other clinical rating scales. These findings support the validity of the BNSS in broadly defined serious mental illness populations. Copyright © 2015 Elsevier B.V. All rights reserved.
[Using projective tests in forensic psychiatry may lead to wrong conclusions. Only empirically tested tests should be used].

Science.gov (United States)

Trygg, L; Dåderman, A M; Wiklund, N; Meurling, A W; Lindgren, M; Lidberg, L; Levander, S

2001-06-27

The use of projective and psychometric psychological tests at the Department of Forensic Psychiatry in Stockholm (Huddinge), Sweden, was studied for a population of 60 men, including many patients with neuropsychological disabilities and multiple psychiatric disorders. The results showed that the use of projective tests like Rorschach, Object Relations Test, and House-Tree-Person was more frequent than the use of objective psychometric tests. Neuropsychological test batteries like the Halstead-Reitan Neuropsychological Test Battery or Luria-Nebraska Neuropsychological Battery were not used. The majority of patients were, however, assessed by intelligence scales like the WAIS-R. The questionable reliability and validity of the projective tests, and the risk of subjective interpretations, raise a problem when such tests are used in a forensic setting, since the courts' decisions about a sentence to prison or psychiatric care are based on the forensic psychiatric assessment. The use of objective psychometric neuropsychological tests and personality tests is recommended.
Challenges in assessing depressive symptoms in Fiji: A psychometric evaluation of the CES-D.

Science.gov (United States)

Opoliner, April; Blacker, Deborah; Fitzmaurice, Garrett; Becker, Anne

2014-06-01

The CES-D is a commonly used self-report assessment for depressive symptomatology. However, its psychometric properties have not been evaluated in Fiji. This study aims to evaluate the reliability and validity of English language and Fijian vernacular versions in ethnic Fijian adolescent schoolgirls. As part of the HEALTHY Fiji study, ethnic Fijian female adolescents (N = 523) completed the CES-D. Participants selected to respond in English or the local vernacular. Reliability (internal consistency, item-total score correlation, and test-retest estimates), validity (associations with other proxies for depression) and factor structure were assessed. Evaluations considered differences between language versions. In this sample, the CES-D had a Cronbach's α of 0.81 and item-total score correlation coefficients ranged between 0.2 and 0.63. One week test-retest reliability (ICC(2)) was 0.57. CES-D scores were higher among individuals who endorsed feelings of depression and suicidality compared to those who did not. ROC analyses of the CES-D versus binary depression and suicidality variables produced AUCs around 0.70 and did not support a discrete cut-off for significant disturbance. Findings were similar across the two language groups. The CES-D has acceptable reliability and validity among ethnic Fijian female adolescents in English and in the Fijian vernacular language. Findings support its utility as a dimensional measure for depressive symptomatology in this study population. Further examination of its clinical utility for case finding for depression in Fijian school-based and community populations is warranted. © The Author(s) 2013.
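The AUC values reported in ROC analyses such as the one above can be read as the probability that a randomly chosen case scores higher than a randomly chosen non-case, with ties counted as one half. The sketch below computes AUC directly from that definition. The function name and the two-list input are illustrative assumptions, and the quadratic pairwise loop is chosen for clarity over speed.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve as P(positive > negative) + 0.5 * P(tie)."""
    if not positive_scores or not negative_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))
```

Under this reading, an AUC of 0.5 means the scale does not separate the groups at all and 1.0 means perfect separation, so values around 0.70 indicate moderate discrimination.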
The Cross-Cultural Loss Scale: development and psychometric evaluation.

Science.gov (United States)

Wang, Kenneth T; Wei, Meifen; Zhao, Ran; Chuang, Chih-Chun; Li, Feihan

2015-03-01

The Cross-Cultural Loss Scale (CCLS), a measure of loss associated with crossing national boundaries, was developed across 2 samples of international students. With Sample 1 (N = 262), exploratory factor analyses were used to select the 14 CCLS items and to determine 3 factors: Belonging-Competency (α = .87), National Privileges (α = .68), and Access to Home Familiarity (α = .72). With Sample 2, confirmatory factor analyses (N = 256) cross-validated the 3-factor oblique model as well as a bifactor model. Cronbach alphas of CCLS subscale scores in Sample 2 ranged from .73 to .87. The validity of the CCLS scores was supported by its associations with related variables in the expected directions. Perceived cross-cultural losses were positively associated with negative affect, migration grief and loss, and discrimination and were negatively associated with life satisfaction, positive affect, general self-efficacy, and social connection with mainstream society. Moreover, the CCLS total and 2 subscale scores added significant incremental variance in predicting subjective well-being over and above related constructs. The results indicated measurement invariance and validity equivalency for the CCLS scores between men and women. The overall results from these 2 samples support CCLS as a psychometrically strong measure. 2015 APA, all rights reserved
Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

Directory of Open Access Journals (Sweden)

Ulla Haverinen-Shaughnessy

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.
Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

Science.gov (United States)

2015-01-01

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643
German version of the intuitive eating scale: Psychometric evaluation and application to an eating disordered population.

Science.gov (United States)

van Dyck, Zoé; Herbert, Beate M; Happ, Christian; Kleveman, Gillian V; Vögele, Claus

2016-10-01

Intuitive eating has been described to represent an adaptive eating behaviour that is characterised by eating in response to physiological hunger and satiety cues, rather than situational and emotional stimuli. The Intuitive Eating Scale-2 (IES-2) has been developed to measure such attitudes and behaviours on four subscales: unconditional permission to eat (UPE), eating for physical rather than emotional reasons (EPR), reliance on internal hunger and satiety cues (RHSC), and body-food choice congruence (B-FCC). The present study aimed at validating the psychometric properties of the German translation of the IES-2 in a large German-speaking sample. A second objective was to assess levels of intuitive eating in participants with an eating disorder diagnosis (anorexia nervosa, bulimia nervosa, or binge eating disorder). The proposed factor structure of the IES-2 could be confirmed for the German translation of the questionnaire. The total score and most subscale scores were negatively related to eating disorder symptomatology, problems in appetite and emotional awareness, body dissatisfaction, and self-objectification. Women with eating disorders had significantly lower values on all IES-2 subscale scores and the total score than women without an eating disorder diagnosis. Women with a binge eating disorder (BED) diagnosis had higher scores on the UPE subscale compared to participants with anorexia nervosa (AN) or bulimia nervosa (BN), and those diagnosed with AN had higher scores on the EPR subscale than individuals with BN or BED. We conclude that the German IES-2 constitutes a useful self-report instrument for the assessment of intuitive eating in German-speaking samples. Further studies are warranted to evaluate psychometric properties of the IES-2 in different samples, and to investigate its application in a clinical setting. Copyright © 2016 Elsevier Ltd. All rights reserved.
Eccentricity dimension of the Dimensional Clinical Personality Inventory: Review and psychometric properties

Directory of Open Access Journals (Sweden)

Lucas de Francisco Carvalho

We aimed to review the Eccentricity dimension of the Dimensional Clinical Personality Inventory (IDCP) in two steps. The first focused on developing new items and the second on testing the psychometric properties in a sample of 225 subjects (70.1% females), aged between 18 and 66 years, mostly undergraduate students (58.9%). The subjects answered the IDCP and the Brazilian versions of the NEO-PI-R, PID-5 and MIS. The first step resulted in 42 items, of which 22 were new. The second step resulted in a composite of 18 items, grouped into six interpretable factors (Interpersonal detachment, Eccentric style, Paranormality, Persecutory style, Depersonalization and Emotional inexpressiveness), with internal consistency coefficients of .85 for the total score and between .60 and .82 for the factors. The correlations between instruments revealed consistent and expected relations. The data suggested adequacy of the new Eccentricity dimension of the IDCP.
Cross cultural comparison of JTCI inventory of temperament and character scores of 11-13 year olds

Directory of Open Access Journals (Sweden)

Dukanac Vesna

2008-01-01

The study compares characteristics of Serbian and American children on the dimensions of temperament and character on the Junior TCI (JTCI) for assessment of 9 to 13 year olds, based on Robert Cloninger's Psychobiological model of temperament and character. Given the lack of assessment tools for this age group, the goal of the present study was to test the factor structure and main psychometric characteristics of the JTCI in order to determine the applicability of this questionnaire to Serbian children. The sample consisted of 222 boys and girls from the normal population, aged 11 to 13, who attended grades 6 to 8. The results showed significant differences between the Serbian and American samples. Namely, Serbian children had higher scores on Novelty Seeking and Harm Avoidance and lower scores on Reward Dependence and Persistence. As to the Character dimensions, Serbian children had lower scores on Reward Dependence and Persistence, and significantly lower scores on Self-directedness and Cooperativeness. Scores on Self-transcendence were higher among the Serbian children. The differences on Character dimensions between children from different cultures are presumed to be primarily a result of the socialization process. They reflect a lower level of maturity, cooperation and probably compensatory reliance on religion. Although it is a temperament dimension, being prone to negative emotions (higher scores on Harm Avoidance) may also be a result of situational sensitivity. This result could be interpreted as a reflection of the negative effects that the general sociocultural milieu had on children who grew up during the social crisis and transitional periods of our society. The results did not confirm a seven-factor personality structure in children of this age group. It is likely that at the age of 11 to 13, dimensions of character and temperament are not yet clearly differentiated. Finally, poor reliability of the JTCI scales imposes
Sudanese Students' Perceptions of Their Class Activities: Psychometric Properties and Measurement Invariance of My Class Activities--Arabic Language Version

Science.gov (United States)

Pereira, Nielsen; Bakhiet, Salaheldin Farah; Gentry, Marcia; Balhmar, Tahani Abdulrahman; Hakami, Sultan Mohammed

2017-01-01

This study examined the psychometric properties and measurement invariance of the Arabic version of "My Class Activities" (MCA), an instrument designed to measure students' perceptions of interest, challenge, choice, and enjoyment in classrooms. Scores of 3,516 Sudanese students in Grades 2 to 8 were used. Confirmatory factor analysis…
Assessment of nutrition and physical activity environments in family child care homes: modification and psychometric testing of the Environment and Policy Assessment and Observation

Directory of Open Access Journals (Sweden)

Amber E. Vaughn

2017-08-01

development (r = 0.21), and nutrition policy (r = 0.18). Child MVPA was significantly associated with overall time provided for activity (r = 0.18) and outdoor playtime (r = 0.20). There was also an unexpected negative association between child MVPA and screen time (−0.16) and screen time practices (r = −0.21). Conclusions: The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. Trial registration: NCT01814215, March 15, 2013.
Assessment of nutrition and physical activity environments in family child care homes: modification and psychometric testing of the Environment and Policy Assessment and Observation.

Science.gov (United States)

Vaughn, Amber E; Mazzucca, Stephanie; Burney, Regan; Østbye, Truls; Benjamin Neelon, Sara E; Tovar, Alison; Ward, Dianne S

2017-08-29

= 0.18). Child MVPA was significantly associated with overall time provided for activity (r = 0.18) and outdoor playtime (r = 0.20). There was also an unexpected negative association between child MVPA and screen time (-0.16) and screen time practices (r = -0.21). The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. NCT01814215 , March 15, 2013.
Diabetes Fear of Injecting and Self-Testing Questionnaire

DEFF Research Database (Denmark)

Mollema, E D; Snoek, Frank J; Pouwer, F

2000-01-01

OBJECTIVE: To study the psychometric properties of the Diabetes Fear of Injecting and Self-Testing Questionnaire (D-FISQ). RESEARCH DESIGN AND METHODS: Two groups of patients were studied. Sample A consisted of 252 insulin-treated diabetes patients. Sample B incorporated 24 insulin-treated patients......-injecting or self-testing had higher scores on FSI (P = 0.095) and FST (P = 0.01). EFA yielded 2 separate factors, FSI and FST. CONCLUSIONS: Results from this study support reliability and validity of the D-FISQ, a self-report instrument that can be used for both clinical and research purposes....
Investigating the psychometric properties of the Geriatric Suicide Ideation Scale (GSIS) among community-residing older adults.

Science.gov (United States)

Heisel, Marnin J; Flett, Gordon L

2016-01-01

To investigate the psychometric properties of the Geriatric Suicide Ideation Scale (GSIS) among community-residing older adults. We recruited 173 voluntary participants, 65 years and older, into a 2+ year longitudinal study of the onset or exacerbation of depressive symptoms and suicide ideation. We assessed the internal consistency of the GSIS and its four component subscales, and its shorter and longer duration test-retest reliability, convergent (depression, social hopelessness, and loneliness), divergent (psychological well-being, life satisfaction, perceived social support, and self-rated health), discriminant (basic and instrumental activities of daily living and social desirability), criterion (history of suicide behavior), and predictive validity (future suicide ideation). The GSIS demonstrated strong test-retest reliability and internal consistency. Baseline GSIS scores were significantly positively associated with suicide risk factors, negatively associated with potential resiliency factors, and not associated with functional impairment or social desirability. GSIS scores significantly differentiated between participants with as compared to those without a history of suicide behavior. Baseline GSIS scores significantly predicted suicide ideation at a 2+ year follow-up assessment. Findings suggest strong measurement characteristics for the GSIS with community-residing older adults, including impressive consistency over time. These results are consistent with research attesting to the empirical and pragmatic strengths of this measure. These findings have implications for the monitoring of suicide risk when aiming to enhance mental health and well-being and prevent suicide in later life.

Psychometric Properties of the Satisfaction with Life Scale among Turkish University Students, Correctional Officers, and Elderly Adults

Science.gov (United States)

Durak, Mithat; Senol-Durak, Emre; Gencoz, Tulin

2010-01-01

This study aims to extensively examine the psychometric properties of adapted version of the Satisfaction with Life Scale (SWLS) in different Turkish samples. In order to test the psychometric properties of the SWLS three separate and independent samples are utilized in this study, namely university students (n = 547), correctional officers (n =…
Item-level psychometrics of the ADL instrument of the Korean National Survey on persons with physical disabilities.

Science.gov (United States)

Hong, Ickpyo; Lee, Mi Jung; Kim, Moon Young; Park, Hae Yean

2017-10-01

The aim of this study is to investigate the psychometrics of the 12 items of an instrument assessing activities of daily living (ADL) using an item response theory model. A total of 648 adults with physical disabilities and having difficulties in ADLs were retrieved from the 2014 Korean National Survey on People with Disabilities. The psychometric testing included factor analysis, internal consistency, precision, and differential item functioning (DIF) across categories including sex, older age, marital status, and physical impairment area. The sample had a mean age of 69.7 years old (SD = 13.7). The majority of the sample had lower extremity impairments (62.0%) and had at least 2.1 chronic conditions. The instrument demonstrated unidimensional construct and good internal consistency (Cronbach's alpha = 0.95). The instrument precisely estimated person measures within a wide range of theta values (-2.22 logits 5.0%). Our findings indicate that the dressing item would need to be modified to improve its psychometrics. Overall, the ADL instrument demonstrates good psychometrics, and thus, it may be used as a standardized instrument for measuring disability in rehabilitation contexts. However, the findings are limited to adults with physical disabilities. Future studies should replicate psychometric testing for survey respondents with other disorders and for children.
Disruptive behavior scale for adolescents (DISBA): development and psychometric properties.

Science.gov (United States)

Karimy, Mahmood; Fakhri, Ahmad; Vali, Esmaeel; Vali, Farzaneh; Veiga, Feliciano H; Stein, L A R; Araban, Marzieh

2018-01-01

Growing evidence indicates that if disruptive behavior is left unidentified and untreated, a significant proportion of these problems will persist and may develop into problems linked with delinquency, substance abuse, and violence. Research is needed to develop valid and reliable measures of disruptive behavior to assist in recognizing it and in assessing the impact of treatments. The aim of this study was to develop and evaluate the psychometric properties of a scale for disruptive behavior in adolescents. Six hundred high school students (50% girls), aged 15-18 years, were selected through multistage random sampling. Psychometrics of the disruptive behavior scale for adolescents (DISBA) (Persian version) were assessed through content validity, exploratory factor analysis (EFA) using Varimax rotation, and confirmatory factor analysis (CFA). The reliability of this scale was assessed via internal consistency and test-retest reliability. EFA revealed four factors accounting for 59% of observed variance. The final 29-item scale contained four factors: (1) aggressive school behavior, (2) classroom defiant behavior, (3) unimportance of school, and (4) defiance to school authorities. Furthermore, CFA produced a sufficient Goodness of Fit Index > 0.90. Test-retest and internal consistency reliabilities were acceptable at 0.85 and 0.89, respectively. The findings from this study suggest that the Iranian version of the DISBA questionnaire has content validity. Further studies are needed to evaluate stronger psychometric properties for DISBA.
Generalized Network Psychometrics : Combining Network and Latent Variable Models

NARCIS (Netherlands)

Epskamp, S.; Rhemtulla, M.; Borsboom, D.

2017-01-01

We introduce the network model as a formal psychometric model, conceptualizing the covariance between psychometric indicators as resulting from pairwise interactions between observable variables in a network structure. This contrasts with standard psychometric models, in which the covariance between
A knowledge-based theory of rising scores on "culture-free" tests.

Science.gov (United States)

Fox, Mark C; Mitchum, Ainsley L

2013-08-01

Secular gains in intelligence test scores have perplexed researchers since they were documented by Flynn (1984, 1987). Gains are most pronounced on abstract, so-called culture-free tests, prompting Flynn (2007) to attribute them to problem-solving skills availed by scientifically advanced cultures. We propose that recent-born individuals have adopted an approach to analogy that enables them to infer higher level relations requiring roles that are not intrinsic to the objects that constitute initial representations of items. This proposal is translated into item-specific predictions about differences between cohorts in pass rates and item-response patterns on the Raven's Matrices (Flynn, 1987), a seemingly culture-free test that registers the largest Flynn effect. Consistent with predictions, archival data reveal that individuals born around 1940 are less able to map objects at higher levels of relational abstraction than individuals born around 1990. Polytomous Rasch models verify predicted violations of measurement invariance, as raw scores are found to underestimate the number of analogical rules inferred by members of the earlier cohort relative to members of the later cohort who achieve the same overall score. The work provides a plausible cognitive account of the Flynn effect, furthers understanding of the cognition of matrix reasoning, and underscores the need to consider how test-takers select item responses. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Translation and psychometric testing of the Korean Versions of the Spiritual Perspective Scale and the Self-transcendence Scale in Korean elders.

Science.gov (United States)

Kim, Suk Sun; Reed, Pamela G; Kang, Youngmi; Oh, Jina

2012-12-01

The purpose of this study was to translate the Spiritual Perspective Scale (SPS) and Self-transcendence Scale (STS) into Korean and test the psychometric properties of the instruments with Korean elders. A cross-sectional survey design was used to implement the three stages of the study. Stage I consisted of translating and reviewing the scales by six experts. In Stage II, equivalence was tested by comparing the responses between the Korean and English versions among 71 bilingual adults. Stage III established the psychometric properties of the Korean versions SPS-K and STS-K among 154 Korean elders. Cronbach's alpha was .97 for the SPS-K and .85 for the STS-K among Korean elders. Factor analysis showed that the SPS-K had one factor; the STS-K had four factors with one factor clearly representing self-transcendence as theorized. Both scales showed good reliability and validity for the translated Korean versions. However, continued study of the construct validity of the STS-K is needed. Study findings indicate that the SPS-K and the STS-K could be useful for nurses and geriatric researchers to assess a broadly defined spirituality, and to conduct research on spirituality and health among Korean elders. Use of these scales within a theory-based study may contribute to further knowledge about the role of spirituality in the health and well-being of Korean people facing health crises.
[Psychometric properties of the Activities Daily Life Scale (ADL)].

Science.gov (United States)

Boyer, L; Murcia, A; Belzeaux, R; Loundou, A; Azorin, J-M; Chabannes, J-M; Dassa, D; Naudin, J; Samuelian, J-C; Lancon, C

2010-10-01

analysis with varimax rotation identified a 2-factor structure accounting for 82% of the total variance. The first dimension (ADL 1) comprised four items and represented personal care activities. The second dimension (ADL 2) comprised two items and represented social functioning. A floor effect was reported for ADL 1 and its unidimensionality was not satisfactory: two items showed an INFIT statistic outside the acceptable range. Internal consistency was satisfactory for the two dimensions: each item achieved the 0.40 standard for item-internal consistency. The correlation of each item with its contributory dimension was higher than with the other (item discriminant validity). Cronbach's alpha coefficients exceeded 0.70 in the whole sample. Concerning external validity, positive correlations were not systematically found between ADL and ASSS dimensions. The score of ADL 1 had medium to high correlations with four dimension scores of the ASSS, while the score of ADL 2 was not at all or only weakly correlated with ASSS dimension scores. Globally, ADL did not cover sentimental life and social relationships. There were statistical associations between ADL and age or gender: women and subjects older than 60 had a higher level of dependency. We did not find any association with marital status or diagnoses. The ADL scale presented good reproducibility but was not sensitive to change. The psychometric properties of the ADL scale were not sufficient for several parameters such as validity or sensitivity to change, contrary to other available French scales. The use of a heteroquestionnaire rather than a self-administered questionnaire should be discussed by professionals and the French authorities. These results should be taken into account in the use of the ADL scale for the economic and administrative management of psychiatry. Further research should be conducted to confirm these results. Copyright © 2010 L'Encéphale, Paris. Published by Elsevier Masson SAS. All rights reserved.
Development and psychometric properties of a measure of catheter burden with bladder drainage after pelvic reconstructive surgery.

Science.gov (United States)

Carpenter, Janet S; Heit, Michael; Rand, Kevin L

2017-04-01

Catheter burden after pelvic reconstructive surgery is an important patient-reported quality of life outcome in research and clinical practice. However, existing tools focus on long-term catheter users rather than short-term postoperative patients. The study aim was to evaluate the psychometric properties of a modified version of the intermittent self-catheterization questionnaire (ISC-Q) in postoperative pelvic reconstructive patients. After experts convened to discuss and modify the ISC-Q items based on their knowledge of women's experiences and clinical practices, 178 women (108 with transurethral and 70 with suprapubic catheters) completed the modified scale and other measures as part of a larger parent study designed to assess health-related quality of life (HRQoL) following pelvic reconstructive surgery requiring bladder drainage. During psychometric testing, the modified ISC-Q was reduced to six items encompassing two factors: a three-item difficulty of use factor and a three-item embarrassment factor. The new scale was named the short-term catheter burden questionnaire (STCBQ). The two-factor model was robust in both subsamples. Only scores within and not between subsamples can be meaningfully compared due to a lack of scalar invariance. Correlations among STCBQ total scores, subscores, and a single satisfaction item indicated good construct validity. Correlations with patient demographics provided further information about the scale. The STCBQ is a short, efficient assessment of short-term catheter burden following pelvic reconstructive surgery. The scale can be used as an important patient-reported outcome measure in clinical practice and research. Neurourol. Urodynam. 36:1140-1146, 2017. © 2016 Wiley Periodicals, Inc.
A Latent Class Approach to Estimating Test-Score Reliability

Science.gov (United States)

van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

2011-01-01

This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
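Two of the single-administration coefficients named in the abstract above, Cronbach's alpha and Guttman's lambda-2, can be computed directly from an item-score matrix. The following is a minimal sketch under stated assumptions: the function names and the toy data are hypothetical, the input is assumed to be a complete respondents-by-items table with no missing values, and this is not the latent class reliability method the authors propose.

```python
from statistics import variance


def _covariance(x, y):
    """Sample covariance (n - 1 denominator) of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (len(x) - 1)


def cronbach_alpha(scores):
    """Cronbach's alpha; `scores` is a list of respondent rows of k item scores."""
    k = len(scores[0])
    items = list(zip(*scores))  # one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


def guttman_lambda2(scores):
    """Guttman's lambda-2 for the same respondents-by-items layout."""
    k = len(scores[0])
    items = list(zip(*scores))
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    # Sum of squared off-diagonal covariances (each pair counted twice).
    off_diag_sq = sum(
        _covariance(items[i], items[j]) ** 2
        for i in range(k)
        for j in range(k)
        if i != j
    )
    lambda1 = 1 - item_var_sum / total_var
    return lambda1 + (k / (k - 1) * off_diag_sq) ** 0.5 / total_var


# Hypothetical toy data: 5 respondents, 3 items.
toy_scores = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 4, 4], [3, 3, 4]]
alpha = cronbach_alpha(toy_scores)
lambda2 = guttman_lambda2(toy_scores)
```

Lambda-2 is never smaller than alpha for the same data, which is one reason it is sometimes reported alongside alpha as a tighter lower bound on reliability.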
Cross-cultural adaptation and validation of the French version of the Hip disability and Osteoarthritis Outcome Score (HOOS) in hip osteoarthritis patients

DEFF Research Database (Denmark)

Ornetti, P; Parratte, S; Gossec, L

2010-01-01

OBJECTIVE: To translate and adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) into French and to evaluate the psychometric properties of this new version, by testing feasibility, internal consistency, construct validity, reliability and responsiveness, in patients with hip osteoarthritis (OA). METHODS: The French version of the HOOS was developed according to published international guidelines to ensure content validity. The new version was then evaluated in two symptomatic hip OA populations, one with no indication for joint replacement (medical group), and the other waiting...
Computerized scoring algorithms for the Autobiographical Memory Test.

Science.gov (United States)

Takano, Keisuke; Gutenbrunner, Charlotte; Martens, Kris; Salmon, Karen; Raes, Filip

2018-02-01

Reduced specificity of autobiographical memories is a hallmark of depressive cognition. Autobiographical memory (AM) specificity is typically measured by the Autobiographical Memory Test (AMT), in which respondents are asked to describe personal memories in response to emotional cue words. Due to this free descriptive responding format, the AMT relies on experts' hand scoring for subsequent statistical analyses. This manual coding potentially impedes research activities in big data analytics such as large epidemiological studies. Here, we propose computerized algorithms to automatically score AM specificity for the Dutch (adult participants) and English (youth participants) versions of the AMT by using natural language processing and machine learning techniques. The algorithms showed reliable performances in discriminating specific and nonspecific (e.g., overgeneralized) autobiographical memories in independent testing data sets (area under the receiver operating characteristic curve > .90). Furthermore, outcome values of the algorithms (i.e., decision values of support vector machines) showed a gradient across similar (e.g., specific and extended memories) and different (e.g., specific memory and semantic associates) categories of AMT responses, suggesting that, for both adults and youth, the algorithms well capture the extent to which a memory has features of specific memories. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Psychometric Properties and Factor Structure of the German Version of the Clinician-Administered PTSD Scale for DSM-5.

Science.gov (United States)

Müller-Engelmann, Meike; Schnyder, Ulrich; Dittmann, Clara; Priebe, Kathlen; Bohus, Martin; Thome, Janine; Fydrich, Thomas; Pfaltz, Monique C; Steil, Regina

2018-05-01

The Clinician-Administered PTSD Scale (CAPS) is a widely used diagnostic interview for posttraumatic stress disorder (PTSD). Following fundamental modifications in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5), the CAPS had to be revised. This study examined the psychometric properties (internal consistency, interrater reliability, convergent and discriminant validity, and structural validity) of the German version of the CAPS-5 in a trauma-exposed sample (n = 223 with PTSD; n = 51 without PTSD). The results demonstrated high internal consistency (αs = .65-.93) and high interrater reliability (ICCs = .81-.89). With regard to convergent and discriminant validity, we found high correlations between the CAPS severity score and both the Posttraumatic Diagnostic Scale sum score (r = .87) and the Beck Depression Inventory total score (r = .72). Regarding the underlying factor structure, the hybrid model demonstrated the best fit, followed by the anhedonia model. However, we encountered some nonpositive estimates for the correlations of the latent variables (factors) for both models. The model with the best fit without methodological problems was the externalizing behaviors model, but the results also supported the DSM-5 model. Overall, the results demonstrate that the German version of the CAPS-5 is a psychometrically sound measure.
Efficient clinical evaluation of guideline quality: development and testing of a new tool

Science.gov (United States)

2014-01-01

Background: Evaluating the methodological quality of clinical practice guidelines is essential before deciding which ones could best inform policy or practice. One current method of evaluating clinical guideline quality is the research-focused AGREE II instrument. This uses 23 questions scored 1–7, arranged in six domains, requires at least two independent testers, and uses a formulaic weighted domain scoring system. Following feedback from time-poor clinicians, policy-makers and managers that this instrument did not suit clinical need, we developed and tested a simpler, shorter, binary scored instrument (the iCAHE Guideline Quality Checklist) designed for single users. Methods: Content and construct validity, inter-tester reliability and clinical utility were tested by comparing the new iCAHE Guideline Quality Checklist with the AGREE II instrument. Firstly, the questions and domains in both instruments were compared. Six randomly-selected guidelines on a similar theme were then assessed by three independent testers with different experience in guideline quality assessment, using both instruments. Per guideline, weighted domain and total AGREE II scores were calculated, using the scoring rubric for three testers. Total iCAHE scores were calculated per guideline, per tester. The linear relationship between iCAHE and AGREE II scores was assessed using Pearson r correlation coefficients. Score differences between testers were assessed for the iCAHE Guideline Quality Checklist. Results: There were congruent questions in each instrument in four domains (Scope & Purpose, Stakeholder involvement, Underlying evidence/Rigour, Clarity). The iCAHE and AGREE II scores were moderately to strongly correlated for the six guidelines. There was generally good agreement between testers for iCAHE scores, irrespective of their experience. The iCAHE instrument was preferred by all testers, and took significantly less time to administer than the AGREE II instrument. However
Loosening Psychometric Constraints on Educational Assessments

Science.gov (United States)

Kane, Michael T.

2017-01-01

In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Robust joint score tests in the application of DNA methylation data analysis.

Science.gov (United States)

Li, Xuan; Fu, Yuejiao; Wang, Xiaogang; Qiu, Weiliang

2018-05-18

Recently, differential variability has been shown to be valuable in evaluating the association of DNA methylation with the risks of complex human diseases. The statistical tests based on both differential methylation level and differential variability can be more powerful than those based only on differential methylation level. Anh and Wang (2013) proposed a joint score test (AW) to simultaneously detect differential methylation and differential variability. However, AW's method seems to be quite conservative and has not been fully compared with existing joint tests. We proposed three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM, and have made extensive comparisons with the joint likelihood ratio test (jointLRT), the Kolmogorov-Smirnov (KS) test, and the AW test. Systematic simulation studies showed that: 1) the three improved tests performed better (i.e., having larger power, while keeping nominal Type I error rates) than the other three tests for data with outliers and having different variances between cases and controls; 2) for data from normal distributions, the three improved tests had slightly lower power than jointLRT and AW. The analyses of two Illumina HumanMethylation27 data sets GSE37020 and GSE20080 and one Illumina Infinium MethylationEPIC data set GSE107080 demonstrated that the three improved tests had higher true validation rates than those from jointLRT, KS, and AW. The three proposed joint score tests are robust against violation of the normality assumption and the presence of outlying observations in comparison with the other three existing tests. Among the three proposed tests, iAW.BF seems to be the most robust and effective one for all simulated scenarios and also in real data analyses.
[Validating the Spanish version of the Nursing Activities Score].

Science.gov (United States)

Sánchez-Sánchez, M M; Arias-Rivera, S; Fraile-Gamo, M P; Thuissard-Vasallo, I J; Frutos-Vivar, F

2015-01-01

Validating workload scores ensures that they are appropriate for the purpose for which they were developed. To validate the Nursing Activities Score (NAS) Spanish version. Observational and prospective study. 1,045 patients who were admitted to a medical-surgical unit and a serious burns unit in 2006 were included. The nurse in charge assessed patient workloads by Nine Equivalent of Nursing Manpower use Score and NAS. To assess the internal consistency of the measurements of NAS, item-test correlations, Cronbach's α and Cronbach's α corrected by omitting each of the items were calculated. The intraobserver and interobserver reliability were assessed with the intraclass correlation coefficient by viewing recordings and Kappa (interobserver reliability) was estimated. For the analysis of internal validity, a factorial principal components analysis was performed. Convergent validity was assessed using the Spearman correlation coefficient values obtained from the Nine Equivalent of Nursing Manpower use Score and Spanish-NAS scales. For internal consistency, 164 questionnaires were analysed and a Cronbach's α of 0.373 was calculated. The intraclass correlation coefficient for intraobserver reliability estimate was 0.837 (95% CI: 0.466-0.950) and 0.662 (95% CI: 0.033-0.882) for interobserver reliability. The estimated kappa was 0.371. For internal validity, exploratory factor analysis showed that the first item explained 58.9% of the variance of the questionnaire. For convergent validity 1006 questionnaires were included and a Spearman correlation coefficient of 0.746 was observed. The psychometric properties of Spanish-NAS are acceptable. Copyright © 2014 Elsevier España, S.L.U. y SEEIUC. All rights reserved.
Linguistic adaptation and psychometric evaluation of original Oral Health Literacy-Adult Questionnaire (OHL-AQ).

Science.gov (United States)

Vyas, Shaleen; Nagarajappa, Sandesh; Dasar, Pralhad L; Mishra, Prashant

2016-10-01

Linguistically adapted oral health literacy tools help assess oral health literacy in local populations with clarity and understandability. The original Oral Health Literacy Adult Questionnaire was published in English (2013) and consists of 17 items under 4 domains. The present study sought to culturally adapt and validate the Oral Health Literacy Adult Questionnaire in Hindi. Thus, we aimed to translate the Oral Health Literacy Adult Questionnaire into Hindi and test its psychometric properties, such as reliability and validity, among primary school teachers. The Oral Health Literacy Adult Questionnaire was translated into the Oral Health Literacy Adult Questionnaire - Hindi Version using the World Health Organization recommended translation back-translation protocol. During pre-testing, an expert panel assessed content validity of the questionnaire. Face validity was assessed on a small sample of 10 individuals. A cross-sectional study was conducted (June-July 2015) and the OHL-AQ-H was administered to a convenience sample of 170 primary school teachers. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and the intra-class correlation coefficient (ICC), respectively, with a 2-week interval to ascertain adherence to the questionnaire response. Predictive validity was tested by comparing OHL-AQ-H scores with clinical indicators like oral hygiene scores and dental caries scores. The concurrent and discriminant validity was assessed through self-reported oral health and through negative association with sociodemographic variables. The data were analyzed by descriptive tests using chi-square and bivariate logistic regression in SPSS software, version 20 and pLiteracy Adult Questionnaire - Hindi Version were 0.94 and 0.70, respectively. Comparisons of varying levels of oral health literacy with self-reported oral health established significant concurrent validity (p=0.01). Significant
Psychometric assessment of the Adult-Adolescent Parenting Inventory in a sample of low-income single mothers.

Science.gov (United States)

Lutenbacher, M

2001-01-01

The Adult-Adolescent Parenting Inventory (AAPI) is a 32-item inventory widely used to identify adolescents and adults at risk for inadequate parenting behaviors. It includes four subscales representing the most frequent patterns associated with abusive parenting: (a) Inappropriate Expectations; (b) Lack of Empathy; (c) Parental Value of Corporal Punishment; and (d) Parent-Child Role Reversal. Although it has been used in a variety of samples, the psychometric properties of the AAPI have not been examined in low-income single mothers. The purposes of this study were to: (a) examine the reliability and validity of the AAPI in a sample of 206 low-income single mothers; (b) assess the mothers' risk for inadequate parenting by comparing their AAPI subscale scores with normative subscale scores on the AAPI; (c) assess the construct validity of the AAPI by testing the hypothesis that mothers with lower AAPI scores have a higher level of depressive symptoms and lower self-esteem in comparison to mothers with higher AAPI scores; and (d) determine whether the 4-factor structure proposed by Bavolek (1984) could be replicated. AAPI scores indicated these mothers were at high risk for child abuse when compared with normative data for parents with no known history of abuse. Higher risk for abusive parenting was associated with a higher level of depressive symptoms, less education, and unemployment. The subscales Inappropriate Expectations and Parental Value of Corporal Punishment demonstrated poor internal consistency, with Cronbach's alphas of .40 and .54, respectively. Hypothesis testing supported the construct validity of the AAPI. Bavolek's 4-factor structure was not supported. A 19-item modified version of the AAPI with three dimensions was identified. This modified version of the AAPI may provide a more efficacious tool for use with low-income single mothers.
The Heteroscedastic Graded Response Model with a Skewed Latent Trait: Testing Statistical and Substantive Hypotheses Related to Skewed Item Category Functions

Science.gov (United States)

Molenaar, Dylan; Dolan, Conor V.; de Boeck, Paul

2012-01-01

The Graded Response Model (GRM; Samejima, "Estimation of ability using a response pattern of graded scores," Psychometric Monograph No. 17, Richmond, VA: The Psychometric Society, 1969) can be derived by assuming a linear regression of a continuous variable, Z, on the trait, θ, to underlie the ordinal item scores (Takane & de Leeuw in…
Psychometric Validation of the Bahasa Malaysia Version of the EORTC QLQ-CR29.

Science.gov (United States)

Magaji, Bello Arkilla; Moy, Foong Ming; Roslani, April Camilla; Law, Chee Wei; Raduan, Farhana; Sagap, Ismail

2015-01-01

This study examined the psychometric properties of the Bahasa Malaysia (BM) version of the European Organization for Research and Treatment of Cancer (EORTC) Colorectal Cancer-specific Quality Of Life Questionnaire (QLQ-CR29). We studied 93 patients recruited from University Malaya and Universiti Kebangsaan Medical Centers, Kuala Lumpur, Malaysia using a self-administered method. Tools included QLQ-C30, QLQ-CR29 and Karnofsky Performance Scales (KPS). Statistical analyses included Cronbach's alpha, test-retest correlations, multi-trait scaling and known-groups comparisons. A p value ≤ 0.05 was considered significant. The internal consistency coefficients for body image, urinary frequency, blood and mucus and stool frequency scales were acceptable (Cronbach's α ≥ 0.65). However, the coefficients were low for the blood and mucus and stool frequency scales in patients with a stoma bag (α = 0.46). Test-retest correlation coefficients were moderate to high (range: r = 0.51 to 1.00) for most of the scales except anxiety, urinary frequency, buttock pain, hair loss, stoma care related problems, and dyspareunia (r ≤ 0.49). Convergent and discriminant validities were achieved in all scales. Patients with a stoma reported significantly higher symptoms of blood and mucus in the stool, flatulence, faecal incontinence, sore skin, and embarrassment due to the frequent need to change the stoma bag (p < 0.05) compared to patients without a stoma. None of the scales distinguished between patients based on the KPS scores. There were no overlaps between scales in the QLQ-C30 and QLQ-CR29 (r < 0.40). The BM version of the QLQ-CR29 showed acceptable psychometric properties in most of the scales, similar to the original validation study. This questionnaire could be used to complement the QLQ-C30 in assessing HRQOL among the BM-speaking population with colorectal cancer.
