Testing Psychometrics of Healthcare Empowerment Questionnaires (HCEQ) among Iranian ... PROMOTING ACCESS TO AFRICAN RESEARCH ... translation and backtranslation procedures, pilot testing, and getting views of expert panel.
The paper provides a survey of 18 years' progress that my colleagues, students (both former and current) and I made in a prominent research area in Psychometrics-Computerized Adaptive Testing (CAT). We start with a historical review of the establishment of a large sample foundation for CAT. It is worth noting that the asymptotic results were derived under the framework of Martingale Theory, a very theoretical perspective of Probability Theory, which may seem unrelated to educational and psychological testing. In addition, we address a number of issues that emerged from large scale implementation and show that how theoretical works can be helpful to solve the problems. Finally, we propose that CAT technology can be very useful to support individualized instruction on a mass scale. We show that even paper and pencil based tests can be made adaptive to support classroom teaching.
García-Pérez, Miguel A; Núñez-Antón, Vicente
Many empirical studies measure psychometric functions (curves describing how observers' performance varies with stimulus magnitude) because these functions capture the effects of experimental conditions. To assess these effects, parametric curves are often fitted to the data and comparisons are carried out by testing for equality of mean parameter estimates across conditions. This approach is parametric and, thus, vulnerable to violations of the implied assumptions. Furthermore, testing for equality of means of parameters may be misleading: Psychometric functions may vary meaningfully across conditions on an observer-by-observer basis with no effect on the mean values of the estimated parameters. Alternative approaches to assess equality of psychometric functions per se are thus needed. This paper compares three nonparametric tests that are applicable in all situations of interest: The existing generalized Mantel-Haenszel test, a generalization of the Berry-Mielke test that was developed here, and a split variant of the generalized Mantel-Haenszel test also developed here. Their statistical properties (accuracy and power) are studied via simulation and the results show that all tests are indistinguishable as to accuracy but they differ non-uniformly as to power. Empirical use of the tests is illustrated via analyses of published data sets and practical recommendations are given. The computer code in MATLAB and R to conduct these tests is available as Electronic Supplemental Material.
R. P. van der Merwe
Full Text Available This is a cumulative report on the findings of various exploratory research that were done with regard to the practice of psychometric testing in the Eastern Cape. Recent and ongoing developments in the South African labour legislation, and especially the implications of the Employment Equity Act, highlight once again the importance of the validation of all instruments to be used for human assessment and selection purposes. Information was gathered to establish which psychometric tests are used, and for what purposes, in industry today. Biographical information on each organisation is supplied, including the number of employees. The role of psychometric testing in the selection procedure is discussed. The different tests used, as well as the test users, are also indicated. The findings of other, related research, as well as comments, recommendations and shortcomings, are discussed. Opsomming Hierdie is ‘n kumulatiewe verslag wat die resultate verstrek van verskeie verkennende ondersoeke wat gedoen is na die aanwending van psigometriese toetsing in die Oos-Kaap. Onlangse en voortdurende ontwikkelinge in die Suid-Afrikaanse arbeidswetgewing, en veral die implikasies van die Wet op Gelyke Indiensneming, beklemtoon weer eens die belangrikheid van die validering van enige instrumente wat gebruik word vir evaluerings- en keuringsdoeleindes van individue. Inligting is ingewin om te bepaal watter psigometriese toetse, sowel as vir watter doel, vandag in die bedryf gebruik word. Biografiese inligting oor die onderskeie organisasies, insluitende hul aantal werknemers, word verstrek. Die rol van psigometriese toetsing in die keuringsproses word bespreek. Die verskillende toetse wat deur die organisasies gebruik word, sowel as die toetsge-bruikers, word ook aangedui. Die bevindinge van ander, relevante navorsing, sowel as opmerkings, aanbevelings en tekortkominge word bespreek.
Borsboom, D.; Molenaar, D.; Wright, J.D.
Psychometrics is a scientific discipline concerned with the construction of measurement models for psychological data. In these models, a theoretical construct (e.g., intelligence) is systematically coordinated with observables (e.g., IQ scores). This is often done through latent variable models,
Kappus, Matthew R; Bajaj, Jasmohan S
Synopsis Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life and driving accidents. The detection of MHE requires specialized testing since it cannot by definition, be diagnosed on standard clinical examination. Psychometric (paper-pencil or computerized or a combination) and neuro-physiological techniques are often used to test for MHE. Paper-pencil psychometric batteries like the Psychometric Hepatic Encephalopathy Score (PHES) have been validated in several countries but do not have US normative values. Computerized tests such as the inhibitory control test (ICT), cognitive drug research system and Scan test have proven useful to diagnose MHE and predict outcomes. The specificity and sensitivity of these tests are similar to the recommended gold standards. Neuro-physiological tests such as the EEG and its interpretations, evoked potentials and Critical Flicker Frequency (CFF) also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike and the testing strategies depend on the normative data available, patient comfort and local expertise. PMID:22321464
Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S
. Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory...... psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.......We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non...
Ortiz, T; Fernández, A; Martínez-Castillo, E; Maestú, F; Martínez-Arias, R; López-Ibor, J J
Several pathologies (i.e. Alzheimer's disease) that courses with memory alterations, appears in a context of impaired cognitive status and mobility. In recent years, several investigations were carried out in order to design short batteries that detect those subjects under risk of dementia. Some of this batteries were also design to be administrated over the telephone, trying to overcome the accessibility limitations of this patients. In this paper we present a battery (called Autotest de Memoria) essentially composed by episodic and semantic memory tests, administered both over the telephone and face to face. This battery was employed in the cognitive assessment of healthy controls and subjects diagnosed as probable Alzheimer's disease patients. Results show the capability of this battery in order to discriminate patients and healthy controls, a great sensibility and specificity, and a nearly absolute parallelism of telephone and face to face administrations. These data led us to claim the usefulness and practicality of our so called Memoria>.
Full Text Available Authors report a study on psychometric properties of Plutchik's test, called Emotions Profile Index (EPI. A new Slovene translation and adaptation of English version of the test, consisting of combinations (pairs of 12 words reflecting eight different emotional conditions, was prepared and compared to the old one. Both versions as well as the Big Five Questionnaire (BFQ were administered on the sample of 239 participants. Different statistical analyses were performed examining psychometric features of both versions of EPI. Discriminative power was tested by cluster analysis and analysis of frequency distributions, reliability was studied via internal consistency index and correlation between the two versions, and validity was examined by correlating PIE dimensions with BFQ dimensions and subdimensions, by comparing profiles of groups on both versions of EPI and BFQ and by fitting the theoretical model proposed by Plutchik to the data. Discriminative power of EPI seems to be affected by avoiding (not choosing the socially desirable expressions in the test, parallel reliability seems to be susceptible to the use of different words (expressions in the new version of EPI having the same meaning as words in the old version. Dimensions expected to reflect similar constructs in BFQ and EPI do not correlate satisfactory. Data gathered with EPI cannot be fully explained with the model proposed by Plutchik's theory.
Kirk, Celia; Vigeland, Laura
Purpose: The authors provide a review of the psychometric properties of 6 norm-referenced tests designed to measure children's phonological error patterns. Three aspects of the tests' psychometric adequacy were evaluated: the normative sample, reliability, and validity. Method: The specific criteria used for determining the psychometric…
Cox, Kathleen B
The importance of healthy work environments has received attention. Health care organizations are plagued with conflict which is detrimental to work environments. Thus, conflict must be studied. The purpose of this article is to describe the testing of a measure of conflict. A survey was used to evaluate the psychometric properties. The sample consisted of 430 nurses at an academic medical center. Using principal component analysis (PCA) with varimax rotation, a six-factor solution (30 items) that explained 74.3% of variance emerged. Coefficient alpha ranged from .95 to .81. Correlations with existing scales supported construct validity (r = -.32(-)-.58). The results are encouraging. Use of the scale may provide insight into the impact of conflict on patient, staff, and organizational outcomes.
Kolen, Michael J.; Lee, Won-Chan
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Chan, Keung Sum; Li, Ho Cheung William; Chan, Sally Wai-Chi; Lopez, Violeta
This article is a report on psychometric testing of the Chinese version of the herth hope index. The availability of a valid and reliable instrument that accurately measures the level of hope in patients with heart failure is crucial before any hope-enhancing interventions can be appropriately planned and evaluated. There is no such instrument for Chinese people. A test-retest, within-subjects design was used. A purposive sample of 120 Hong Kong Chinese patients with heart failure between the ages of 60 and 80 years admitted to two medical wards was recruited during an 8-month period in 2009. Participants were asked to respond to the Chinese version of the herth hope index, Hamilton depression rating scale and Rosenberg's self-esteem scale. The internal consistency, content validity and construct validity and test-retest reliability of the Chinese version of the herth hope index were assessed. The newly translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly translated scale can be used as a self-report assessment tool in assessing the level of hope in Hong Kong Chinese patients with heart failure. © 2011 Blackwell Publishing Ltd.
Szajer, Katarzyna; Karakuła, Hanna; Grzywa, Anna; Parnas, Josef; Perzyńska, Aneta; Zaborska, Anna; Pawezka, Justyna; Sekunda, Agnieszka; Piszczek, Rafał; Skórska, Małgorzata
Abstract thinking belongs to intellectual abilities of the highest level of the evolutionary development, thanks to which operations such a classification, systematisation and comparison are possible. An analysis of the psychometric properties of the Proverb-Metaphor Test (TPM) which has been used in the German speaking countries since 2001. The TPM was subject to the process of translation--retranslation--travesty in order to be adapted to clinical conditions in Poland. 60 patients of the Department of Psychiatry, Medical University of Lublin with diagnosed paranoid schizophrenia (according to ICD-10 criteria). PANSS and TPM was carried out amongst 15 patients at the beginning of the hospitalisation (the first stage of the research) and among all persons during the remission of syndromes (the second stage). The WAIS-R (PL) was used in the second stage. 1. The TPM is a reliable instrument, of high criteria propriety. 2. The evaluated test is a relatively homogeneous research tool. 3. The TPM is, thanks to its simple construction and the short carrying out time, a practical method of abstract thinking evaluation. 4. The TPM may be a useful instrument enabling long term prognosis.
McGilton, Katherine S
To describe the development and psychometric testing of the Supportive Supervisory Scale (SSS). The development of the items of the scale was based on Winnicott's relationship theory and on focus groups with 26 healthcare aides (HCAs) and 30 supervisors from six long-term care (LTC) facilities in Ontario, Canada. Content validity of the 15-item instrument was established by a panel of experts. Based on a secondary analysis of data collected from 222 HCAs in 10 LTC facilities in Ontario, Canada, the SSS was subjected to principal components analysis with oblique rotation. A two-factor solution was accepted, which is consistent with the theoretical conceptualization of the instrument. Factor I was labeled Respects Uniqueness and Factor II was labeled Being Reliable. Internal consistency of Factor I was .95, and that of Factor II was .91. Discriminant validity was also established. The focus groups revealed that "being available to staff" while "recognizing the HCA as an individual, and taking a moment to get to know them" was essential to feeling supported by their supervisor. The SSS is a reliable and valid measure of supervisory support of supervisors working in LTC facilities. At the core of supportive supervision is the supervisor's ability to develop and maintain positive relationships with each HCA. It is through respecting the uniqueness of each HCA and being reliable that supervisor-HCA relationships can flourish. Supportive leadership in LTC settings is a major contributor to HCAs' job satisfaction and retention and to quality of patient care. Therefore, a tool developed and tested to measure supervisors' supportive capacities in LTC is primal to evaluate the effectiveness of supervisors in these environments.
Kappus, Matthew R; Bajaj, Jasmohan S
Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life, and driving accidents. The detection of MHE requires specialized testing because it cannot, by definition, be diagnosed on standard clinical examination. Psychometric and neurophysiologic techniques are often used to test for MHE. Paper-pencil psychometric batteries and computerized tests have proved useful in diagnosing MHE and predicting its outcomes. Neurophysiologic tests also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike. Testing strategies depend on the normative data available, patient comfort, and local expertise. Copyright © 2012 Elsevier Inc. All rights reserved.
Lauridsen, M M; Poulsen, L; Rasmussen, C K
Many chronic medical conditions are accompanied by cognitive disturbances but these have only to a very limited extent been psychometrically quantified. An exception is liver cirrhosis where hepatic encephalopathy is an inherent risk and mild forms are diagnosed by psychometric tests. The preferred...... diagnostic test battery in cirrhosis is often the Continuous Reaction Time (CRT) and the Portosystemic Encephalopathy (PSE) tests but the effect on these of other medical conditions is not known. We aimed to examine the effects of common chronic (non-cirrhosis) medical conditions on the CRT and PSE tests. We...
Palmer, B.R.; Gignac, G.; Manocha, R.; Stough, C.
and discussed.There has been some debate recently over the scoring, reliability and factor structure of ability measures of emotional intelligence (EI). This study examined these three psychometric properties with the most recent ability test of EI, the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT V2.0; Mayer, Salovey, & Caruso,…
Jongen, Stefan; Perrier, Joy; Vuurman, Eric F; Ramaekers, Johannes G; Vermeeren, Annemiek
To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation. Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am) and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am). On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP) of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT). Large effects sizes were also found in the Divided Attention Test (DAT), the Attention Network Test (ANT), and the test for Useful Field of View (UFOV) at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV. From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.
Full Text Available To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation.Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am.On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT. Large effects sizes were also found in the Divided Attention Test (DAT, the Attention Network Test (ANT, and the test for Useful Field of View (UFOV at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV.From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.
Raveendranadh Pilli; MUR Naidu; Usharani Pingali; J C Shobha
Objective: To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. Materials and Methods: The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of...
Sund, Terje; Brandt, Åse; Anttila, Heidi
(Participation Repertoire). PURPOSE: This study aimed to investigate a range of psychometric properties of the NOMO 1.0 in a sample of adult powered mobility device (PMD) users. METHOD: Data collected from PMD users ( N = 248) in Denmark, Finland, and Norway as part of a larger study were analyzed using state...... scale and six components of the Frequency scale. IMPLICATIONS: The NOMO 1.0 should be used for research purposes and not for clinical practice. Better reliability should be established for the Need for Assistance and Ease/Difficulty scales prior to further psychometric testing to establish the validity...
Full Text Available These analyses examine the psychometric properties of the Free and Cued Selective Reminding Test with Immediate Recall (FCSRT-IR. FCSRT-IR is a measure of memory under conditions that control attention and cognitive processing in order to obtain an assessment of memory unconfounded by normal agerelated changes in cognition. FCSRT-IR performance has been associated with preclinical and early dementia in several longitudinal epidemiological studies. Factor and item response theory analyses were applied to FCSRT-IR data from patients at a geriatric primary care center who had independently established clinical diagnoses. The results provide supporting evidence for the psychometric adequacy of the FCSR-IR in terms of reliability, essential (sufficient unidimensionality, information across the continuum of memory disability/ability, and classification accuracy. The psychometric adequacy of the FCSRT-IR adds further validity to its use as a case finding strategy for dementia.
Faraci, Palmira; Hell, Benedikt; Schuler, Heinz
This article describes the psychometric properties of the Italian adaptation of the "Analyse des Schlussfolgernden und Kreativen Denkens" (ASK; Test of Inferential and Creative Thinking) for measuring inferential and creative thinking. The study aimed to (a) supply evidence for the factorial structure of the instrument, (b) describe its…
Wicherts, Jelte M.; Dolan, Conor V.; Carlson, Jerry S.; van der Maas, Han L. J.
This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's test scores as measures of general intelligence.…
Richards, Elizabeth A.; McDonough, Meghan H.; Edwards, Nancy E.; Lyle, Roseann M.; Troped, Philip J.
Purpose: Dog owners represent 40% of the population, a promising audience to increase population levels of physical activity. The purpose of this study was to develop and test the psychometric properties of a new instrument to assess social-cognitive theory constructs related to dog walking. Method: Dog owners ("N" = 431) completed the…
Lechuga, Julia; Galletly, Carol L; Broaddus, Michelle R; Dickson-Gomez, Julia B; Glasman, Laura R; McAuliffe, Timothy L; Vega, Miriam Y; LeGrand, Sarah; Mena, Carla A; Barlow, Morgan L; Valera, Erik; Montenegro, Judith I
To develop, pilot test, and conduct psychometric analyses of an innovative scale measuring the influence of perceived immigration laws on Latino migrants' HIV-testing behavior. The Immigration Law Concerns Scale (ILCS) was developed in three phases: Phase 1 involved a review of law and literature, generation of scale items, consultation with project advisors, and subsequent revision of the scale. Phase 2 involved systematic translation- back translation and consensus-based editorial processes conducted by members of a bilingual and multi-national study team. In Phase 3, 339 sexually active, HIV-negative Spanish-speaking, non-citizen Latino migrant adults (both documented and undocumented) completed the scale via audio computer-assisted self-interview. The psychometric properties of the scale were tested with exploratory factor analysis and estimates of reliability coefficients were generated. Bivariate correlations were conducted to test the discriminant and predictive validity of identified factors. Exploratory factor analysis revealed a three-factor, 17-item scale. subscale reliability ranged from 0.72 to 0.79. There were significant associations between the ILCS and the HIV-testing behaviors of participants. Results of the pilot test and psychometric analysis of the ILCS are promising. The scale is reliable and significantly associated with the HIV-testing behaviors of participants. Subscales related to unwanted government attention and concerns about meeting moral character requirements should be refined.
Budescu, David V; Bo, Yuanchao
We investigate the implications of penalizing incorrect answers to multiple-choice tests, from the perspective of both test-takers and test-makers. To do so, we use a model that combines a well-known item response theory model with prospect theory (Kahneman and Tversky, Prospect theory: An analysis of decision under risk, Econometrica 47:263-91, 1979). Our results reveal that when test-takers are fully informed of the scoring rule, the use of any penalty has detrimental effects for both test-takers (they are always penalized in excess, particularly those who are risk averse and loss averse) and test-makers (the bias of the estimated scores, as well as the variance and skewness of their distribution, increase as a function of the severity of the penalty).
Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff; Lenon, George Binh
Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization’s guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability...
Pilli, Raveendranadh; Naidu, Mur; Pingali, Usharani; Shobha, Jc
To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of zolpidem (5 mg) and caffeine (500 mg) versus placebo were studied in 24 healthy participants in a randomized, double-blind three-way crossover design. Psychometric tests were performed at baseline and at 1, 2, and 3 h after administration of study medication. The effects of zolpidem and caffeine on the psychomotor performance were most pronounced 1 h after administration. At this time, a significant impairment of performance in the simple reaction test (SRT), choice discrimination test (CDT), digit symbol substitution test (DSST), digit vigilance test (DVT), and card sorting test (CST) was observed with zolpidem. In contrast, caffeine showed a significant improvement in performance in CDT and DVT only. The results suggest that the tests of the computerized system are more sensitive and reliable then the pencil and paper tests in detecting the effects of central acting agents and are suitable for use in clinical areas to conduct studies with patients.
Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel
Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.
Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de
The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study
Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de
The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study
Nash, Tracy Jeanne
Although nurses struggle with the decision to report for work during disaster events, there are no instruments to measure nurses' duty to care for disaster situations. The purpose of this study was to describe the development, testing, and psychometric qualities of the Nash Duty to Care Scale. A convenience sample of 409 registered nurses were recruited from 3 universities in the United States. Exploratory factor analysis resulted in a 19-item, 4-factor model explaining 67.34% of the variance. Internal consistency reliability was supported by Cronbach's alpha ranging from .81 to .91 for the 4-factor subscales and .92 for the total scale. The psychometrically sound instrument for measuring nurses' perceived duty to care for disasters is applicable to contemporary nursing practice, institutional disaster management plans, and patient health outcomes worldwide.
Bajrakova, A.; Vasilev, G.; Khristova, M. N.; Chobanova, N.; Tsenova, T.; Jordanova, M.; Lalova, J.; Vasileva, F.; Mikhajlova, Z.; Trifonova, S.
The investigation involved 50 children aged median 6 years and 6 months. The group was selected in view of the critical period for occurrence of radiation-related deviations in mental development (8-15 gestation weeks) and the period of maximum irradiation during the Chernobyl accident. Assessment of the individual exposure and analysis of possible impacts from non-radiation risk factors were based on guided parental history reports. The dose of accidental irradiation was determined using the radiological data for the country. A Bulgarian standardization of the Wechsler Intelligence Scale for Children (WISC-R) was used. The procedure includes 5 verbal and 5 nonverbal subtests. Results were compared with those from a countrywide control group of children (including a large city, a small town, a village). The analysis indicated higher mean IQ scores in the investigated children. The children were additionally studied by original tests for attention and gnosis-praxis functions using tactile and visual modalities. The tests included intra- and transmodal versions, bilateral simultaneous presentation of stimuli with verbal and nonverbal characteristics in applying analytical and global strategies. Comparisons were made with results for children in the same age range, who had been studied prior to the Chernobyl accident. The evidence surprisingly varied, taking into account the small size of the investigation group. A longitudinal follow-up of this population thus appears to be appropriate. (author)
van Rooij, Antonius J; Schoenmakers, Tim M; van den Eijnden, Regina J J M; Vermulst, Ad A; van de Mheen, Dike
The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study used data (n=2,894) from a large-sample paper-and-pencil questionnaire study, conducted in 2009 on secondary schools in Netherlands. Thus, the main source of data was a large sample of schoolchildren (aged 13-16 years). Measurements included the proposed VAT, the Compulsive Internet Use Scale, weekly hours spent on various game types, and several psychosocial variables. The VAT demonstrated excellent reliability, excellent construct validity, a one-factor model fit, and a high degree of measurement invariance across gender, ethnicity, and learning year, indicating that the scale outcomes can be compared across different subgroups with little bias. In summary, the VAT can be helpful in the further study of video game addiction, and it contributes to the debate on possible inclusion of behavioral addictions in the upcoming DSM-V.
Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115 and patients with depression (n = 37 concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96; item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94 and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002; and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.
Petróczi, Andrea; Backhouse, Susan H; Barkoukis, Vassilis; Brand, Ralf; Elbe, Anne-Marie; Lazuras, Lambros; Lucidi, Fabio
One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping) into organised intelligence to protect clean sport. Beyond the foreseeable consequences of a positive identification as a doping user, this task is further complicated by the discrepancy between what constitutes a doping offence in the World Anti-Doping Code and operationalized in doping research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. In both research and forensic settings, we must acknowledge that (1) socially desirable responding confounds self-reported psychometric test results and (2) that the cognitive complexity surrounding test performance means that the response-time based measures and the lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals; nor these psychometric measures used for individual diagnostics. In this paper, we present a position statement calling for policy guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that, to date, both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better understanding of doping behaviour, not to establish evidence for doping. A false 'positive' psychological profile for doping affects not only the individual
Galuschka, Katharina; Rothe, Josefine; Schulte-Körne, Gerd
This article looks at a means of objectively evaluating the quality of psychometric tests. This approach enables users to evaluate psychometric tests based on their methodological characteristics, in order to decide which instrument should be used. Reading and spelling assessment tools serve as examples. The paper also provides a review of German psychometric tests for the assessment of reading and spelling skills. This method facilitates the identification of psychometric tests.of high methodological quality which can be used for the assessment of reading and spelling skills. Reading performance should ideally be assessed with the following instruments: ELFE 1-6, LGVT 6-12, LESEN 6-7, LESEN 8-9, or WLLP-R. The tests to be used for the evaluation of spelling skills are DERET 1-2+, DERET 3-4+, WRT 1+, WRT 2+, WRT 3+, WRT 4+ or HSP 1-10.
Bernhofer, Esther I; St Marie, Barbara; Bena, James F
All nurses care for patients with pain, and pain management knowledge and attitude surveys for nurses have been around since 1987. However, no validated knowledge test exists to measure postlicensure clinicians' knowledge of the core competencies of pain management in current complex patient populations. To develop and test the psychometric properties of an instrument designed to measure pain management knowledge of postlicensure nurses. Psychometric instrument validation. Four large Midwestern U.S. hospitals. Registered nurses employed full time and part time August 2015 to April 2016, aged M = 43.25 years; time as RN, M = 16.13 years. Prospective survey design using e-mail to invite nurses to take an electronic multiple choice pain knowledge test. Content validity of initial 36-item test "very good" (95.1% agreement). Completed tests that met analysis criteria, N = 747. Mean initial test score, 69.4% correct (range 27.8-97.2). After revision/removal of 13 unacceptable questions, mean test score was 50.4% correct (range 8.7-82.6). Initial test item percent difficulty range was 15.2%-98.1%; discrimination values range, 0.03-0.50; final test item percent difficulty range, 17.6%-91.1%, discrimination values range, -0.04 to 1.04. Split-half reliability final test was 0.66. A high decision consistency reliability was identified, with test cut-score of 75%. The final 23-item Clinical Pain Knowledge Test has acceptable discrimination, difficulty, decision consistency, reliability, and validity in the general clinical inpatient nurse population. This instrument will be useful in assessing pain management knowledge of clinical nurses to determine gaps in education, evaluate knowledge after pain management education, and measure research outcomes. Copyright © 2017 American Society for Pain Management Nursing. Published by Elsevier Inc. All rights reserved.
Guvenc, Gulten; Seven, Memnun; Akyuz, Aygul
To adapt and psychometrically test the Health Belief Model Scale for Human Papilloma Virus (HPV) and Its Vaccination (HBMS-HPVV) for use in a Turkish population and to assess the Human Papilloma Virus Knowledge score (HPV-KS) among female college students. Instrument adaptation and psychometric testing study. The sample consisted of 302 nursing students at a nursing school in Turkey between April and May 2013. Questionnaire-based data were collected from the participants. Information regarding HBMS-HPVV and HPV knowledge and descriptive characteristic of participants was collected using translated HBMS-HPVV and HPV-KS. Test-retest reliability was evaluated and Cronbach α was used to assess internal consistency reliability, and exploratory factor analysis was used to assess construct validity of the HBMS-HPVV. The scale consists of 4 subscales that measure 4 constructs of the Health Belief Model covering the perceived susceptibility and severity of HPV and the benefits and barriers. The final 14-item scale had satisfactory validity and internal consistency. Cronbach α values for the 4 subscales ranged from 0.71 to 0.78. Total HPV-KS ranged from 0 to 8 (scale range, 0-10; 3.80 ± 2.12). The HBMS-HPVV is a valid and reliable instrument for measuring young Turkish women's beliefs and attitudes about HPV and its vaccination. Copyright © 2015 North American Society for Pediatric and Adolescent Gynecology. Published by Elsevier Inc. All rights reserved.
McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen
Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Muehrer, Rebecca J; Lanuza, Dorothy M; Brown, Roger L; Djamali, Arjang
This study describes the development and psychometric testing of the Sexual Concerns Questionnaire (SCQ) in kidney transplant (KTx) recipients. Construct validity was assessed using the Kroonenberg and Lewis exploratory/confirmatory procedure and testing hypothesized relationships with established questionnaires. Configural and weak invariance were examined across gender, dialysis history, relationship status, and transplant type. Reliability was assessed with Cronbach's alpha, composite reliability, and test-retest reliability. Factor analysis resulted in a 7-factor solution and suggests good model fit. Construct validity was also supported by the tests of hypothesized relationships. Configural and weak invariance were supported for all subgroups. Reliability of the SCQ was also supported. Findings indicate the SCQ is a valid and reliable measure of KTx recipients' sexual concerns.
Chen, Xin-lin; Liu, Feng-bin; Guo, Li; Liu, Xiao-bin
To investigate the scientificity of patient-reported outcome (PRO) scale for myasthenia gravis (MG), which was used to evaluate the clinical effects of traditional Chinese and Western medicine treatment on MG patients. Psychometric performance of the MG-PRO scale was also expected to be evaluated in this study. A total of 100 MG patients and 100 healthy people were face-to-face interviewed by well-trained investigators, and the data of MG-PRO scale were collected. The classical theory test (CTT) and item response theory (IRT) methods were used to analyze the psychometric performance such as validity, reliability, person separation index (PSI) and differential item functioning (DIF) in the MG-PRO scale. The results of CTT analysis showed that the split-half reliabilities of the MG-PRO scale and each dimension were greater than 0.7. In the analysis of internal consistency of each dimension, the Cronbach's alpha was greater than 0.8. Each facet had greater correlation with its dimension than the other dimensions. Four principal components were extracted by exploratory factor analysis, which represented all dimensions of the scale, and the cumulative variance was 55.54%. The scores of each of the 8 facets between MG patients and healthy people were different (Pdefinition and connotation of quality of life and contains special issues of MG patients as well, and shows good reliability (split-half reliability, Cronbach's alpha), validity (content validity, construct validity, discriminate validity) from the results of CTT, and good psychometric performance from the results of IRT.
Lúcio, Patrícia Silva; Cogo-Moreira, Hugo; Puglisi, Marina; Polanczyk, Guilherme Vanoni; Little, Todd D
The present study investigated the psychometric properties of the Raven's Colored Progressive Matrices (CPM) test in a sample of preschoolers from Brazil ( n = 582; age: mean = 57 months, SD = 7 months; 46% female). We investigated the plausibility of unidimensionality of the items (confirmatory factor analysis) and differential item functioning (DIF) for sex and age (multiple indicators multiple causes method). We tested four unidimensional models and the one with the best-fit index was a reduced form of the Raven's CPM. The DIF analysis was carried out with the reduced form of the test. A few items presented DIF (two for sex and one for age), confirming that the Raven's CPM items are mostly measurement invariant. There was no effect of sex on the general factor, but increasing age was associated with higher values of the g factor. Future research should indicate if the reduced form is suitable for evaluating the general ability of preschoolers.
Vadlin, Sofia; Åslund, Cecilia; Rehn, Mattias; Nilsson, Kent W
The objective of the study is to evaluate the psychometric properties of the Gaming Addiction Identification Test (GAIT) and its parent version (GAIT-P), in a representative community sample of adolescents and parents in Västmanland, Sweden. Self-rated and parent-rated gaming addictive symptoms identified by GAIT and GAIT-P were analyzed for frequency of endorsement, internal consistency, concordance, factor structure, prevalence of Internet gaming disorder (IGD), concurrence with the Gaming Addiction Scale for Adolescents, 7-item version (GAS) and the parent version of GAS (GAS-P), and for sex differences. The 12-month prevalence of IGD was found to be 1.3% with GAIT and 2.4% with GAIT-P. Results also indicate promising psychometric results within this population, with high internal consistency, and high concurrent validity with GAS and GAS-P. Concordance between adolescents and parents ratings was high, although moderate in girls. Although exploratory factor analysis indicated poor model fit, it also indicated unidimensionality and high factor loadings in all analyses. GAIT and GAIT-P are suitable for continued use in measuring gaming addiction in adolescents, and, with the additional two items, they now cover all nine IGD criteria. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
Dunn, Karen S
Aims and objectives. Assess the psychometric properties of a new geriatric spiritual well-being scale (GSWS), specifically designed for older adults. Background. Religiosity and spiritual wellness must be measured as two distinct concepts to prevent confounding them as synonymous among atheist and agnostic population. Design. A test-retest survey design was used to estimate the psychometric properties. Methods. A convenience sample of 138 community-dwelling older adults was drawn from the inner city of Detroit. Data were collected using telephone survey interviews. Data analyses included descriptive statistics, structural equation modelling, reliability analyses, and point-biserial correlations. Results. The factorial validity of the proposed model was not supported by the data. Fit indices were χ(2) = 185.98, d.f. = 98, P atheists have spiritual needs that do not include religious beliefs or practices. Thus, assessing patients' religious beliefs and practices prior to assessing spiritual well-being is essential to prevent bias. © 2008 The Author. Journal compilation © 2008 Blackwell Publishing Ltd.
Latour, Jos M.; van Goudoever, Johannes B.; Duivenvoorden, Hugo J.; Albers, Marcel J. I. J.; van Dam, Nicolette A. M.; Dullaart, Eugenie; van Heerde, Marc; de Neef, Marjorie; Verlaat, Carin W. M.; van Vught, Elise M.; Hazelzet, Jan A.
To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument
Latour, J.M.; van Goudoever, J.B.; Duivenvoorden, H.J.; Albers, M.J.I.J.; van Dam, N.A.M.; Dullaart, E.; van Heerde, M.; de Neef, M.; Verlaat, C.W.M.; van Vught, E.M.; Hazelzet, J.A.
To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument
Knekta, Eva; Eklöf, Hanna
The aim of this study was to evaluate the psychometric properties of an expectancy-value-based questionnaire measuring five aspects of test-taking motivation (effort, expectancies, importance, interest, and test anxiety). The questionnaire was distributed to a sample of Swedish Grade 9 students taking a low-stakes (n = 1,047) or a high-stakes (n =…
Chien, Wai Tong; Chan, Zenobia Chung-Yee; Chan, Sally Wai-Chi
This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76-0.95 and ICC = 0.81-0.92), test-retest reliability (r = 0.89-0.95, P Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement), accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ (2)/df = 1.93, P = 0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78) to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = -0.54) and family burden (r = 0.49) and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90). The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P Chinese version demonstrates sound psychometric properties to measure families' expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
Tang, Woung-Ru; Kao, Chen-Yi
The spiritual well-being of terminally ill cancer patients is an important indicator of the quality of their lives and of the quality of hospice care, but no validated tools are available for assessing this indicator in Taiwan. The present cross-sectional study validated the Spiritual Well-Being Scale-Mandarin version (SWBS-M) by testing its psychometric properties in 243 cancer patients from five teaching hospitals throughout Taiwan. Construct validity was tested by factor analysis and hypothesis testing. Patients' spiritual well-being and quality of life were assessed using the SWBS-M and the McGill Quality of Life Questionnaire (MQoL), respectively. Overall, the SWBS-M had an internal consistency/reliability of 0.89. Exploratory factor analysis showed that the SWBS-M had an underlying two-factor structure, explaining 46.94% of the variance. SWBS-M scores correlated moderately with MQoL scores (r = 0.48, p spiritual well-being was inversely related to their average pain level during the previous 24 hours (r = -0.183, p = 0.006). Cancer patients' spiritual well-being also differed significantly with their experience of pain (t = -3.67, p spiritual well-being than those without pain. Our findings support a two-factor model for the SWBS-M in terminally ill Taiwanese cancer patients. We recommend testing the psychometric properties of the SWBS-M in different patient populations to verify its factorial structure in other Asian countries.
Wai Tong Chien
Full Text Available This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76–0.95 and ICC = 0.81–0.92, test-retest reliability (r = 0.89–0.95, P<0.01, and internal consistency (Cronbach’s α = 0.86–0.92. Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement, accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ2/df = 1.93, P=0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78 to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = −0.54 and family burden (r = 0.49 and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90. The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P<0.01. The Chinese version demonstrates sound psychometric properties to measure families’ expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
Flipsen, Peter; Ogiela, Diane A
Our understanding of test construction has improved since the now-classic review by McCauley and Swisher (1984). The current review article examines the psychometric characteristics of current single-word tests of speech sound production in an attempt to determine whether our tests have improved since then. It also provides a resource that clinicians may use to help them make test selection decisions for their particular client populations. Ten tests published since 1990 were reviewed to determine whether they met the 10 criteria set out by McCauley and Swisher (1984), as well as 7 additional criteria. All of the tests reviewed met at least 3 of McCauley and Swisher's (1984) original criteria, and 9 of 10 tests met at least 5 of them. Most of the tests met some of the additional criteria as well. The state of the art for single-word tests of speech sound production in children appears to have improved in the last 30 years. There remains, however, room for improvement.
Ana Paula Porto Noronha
Full Text Available A pesquisa teve como objetivo verificar quais os parâmetros psicométricos apresentados nos manuais de 19 instrumentos de avaliação da inteligência. Os elementos avaliados nos instrumentos foram: análise de itens, padronização, validade e precisão. Os resultados encontrados mostraram que, dos 19 testes avaliados, 89,5% apresentaram estudos de padronização, sendo que o procedimento mais utilizado na escolha dos sujeitos foi o não aleatório (62,2% dos testes. No que se refere à validade, a de construto foi a mais freqüente dentre os testes (94,7%. Observou-se que todos os instrumentos apresentaram verifica��ão da precisão, sendo o método de consistência interna o mais aplicado (78,9%. Conclui-se que, embora os autores concordem que todos os testes devam realizar estudos de verificação dos parâmetros psicométricos e devam possuir normas regionais, tal prática ainda não se encontra totalmente difundida na avaliação psicológica brasileira,This research aimed to verify the psychometric parameters presented in manuals of 19 intelligence tests. The psychometric properties included in the analysis were: item analysis, validity, reliability, and norms studies. The results indicated that 89.5% of the 19 tests presented norming studies. The procedure of sample selection was mostly non-random (62.2% of the tests. Construct validity was the most frequent method used among the studies (94.7%. All tests presented reliability studies, most of them using internal consistency coefficient (78.9%. It is concluded that although the authors agree that all tests need studies to verify psychometric parameters and studies to obtain regional norms this action isn’t divulged totally yet in the Brazilian psychological assessment.
Duddle, Maree; Boughton, Maureen
The aim of this study was to develop and test the psychometric properties of the Nursing Workplace Relational Environment Scale (NWRES). A positive relational environment in the workplace is characterised by a sense of connectedness and belonging, support and cooperation among colleagues, open communication and effectively managed conflict. A poor relational environment in the workplace may contribute to job dissatisfaction and early turnover of staff. Quantitative survey. A three-stage process was used to design and test the NWRES. In Stage 1, an extensive literature review was conducted on professional working relationships and the nursing work environment. Three key concepts; collegiality, workplace conflict and job satisfaction were identified and defined. In Stage 2, a pool of items was developed from the dimensions of each concept and formulated into a 35-item scale which was piloted on a convenience sample of 31 nurses. In Stage 3, the newly refined 28-item scale was administered randomly to a convenience sample of 150 nurses. Psychometric testing was conducted to establish the construct validity and reliability of the scale. Exploratory factor analysis resulted in a 22-item scale. The factor analysis indicated a four-factor structure: collegial behaviours, relational atmosphere, outcomes of conflict and job satisfaction which explained 68.12% of the total variance. Cronbach's alpha coefficient for the NWRES was 0.872 and the subscales ranged from 0.781-0.927. The results of the study confirm the reliability and validity of the NWRES. Replication of this study with a larger sample is indicated to determine relationships among the subscales. The results of this study have implications for health managers in terms of understanding the impact of the relational environment of the workplace on job satisfaction and retention.
Molony, Sheila L; McDonald, Deborah Dillon; Palmisano-Mills, Christine
Research related to quality of life in long-term care has been hampered by a paucity of measurement tools sensitive to environmental interventions. The primary aim of this study was to test the psychometric properties of a new instrument, the Experience of Home (EOH) Scale, designed to measure the strength of the experience of meaningful person-environment transaction. The instrument was administered to 200 older adults in diverse dwelling types. Principal components analysis provided support for construct validity, eliciting a three-factor solution accounting for 63.18% of variance in scores. Internal consistency reliability was supported with Cronbach's alpha of .96 for the entire scale. The EOH Scale is a unique research tool to evaluate interventions to improve quality of living in residential environments.
Rezaei, Mohammad; Rashedi, Vahid; Lotfi, Gohar; Shirinbayan, Peymaneh; Foroughan, Mahshid
The aim of this study was to assess the psychometric properties of the Mini-Cog in Iranian older adults. It was a cross-sectional study; 50 older people with dementia and 50 without dementia who matched for age, gender, and education entered the study. The diagnostic and statistical manual of mental disorders criteria for dementia were used as gold standard. A battery of scales included the abbreviated mental test score (AMTS), the Geriatric Depression Scale, and the Mini-Cog was performed. Validity and reliability of the Mini-Cog determined using the Pearson product-moment correlation coefficient (Pearson's r), Cronbach's alpha, and Receiver Operating Characteristic (ROC) curve analysis. The Persian version of Mini-Cog showed a good inter-rater reliability ( K = 0.76, p Mini-Cog have an acceptable sensitivity, specificity, and substantial overall agreement with the AMTS.
Verster, Joris C; Roth, Thomas
There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving is often not determined. The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a constant speed and steady lateral position within the right traffic lane. Standard deviation of lateral position (SDLP), i.e., the weaving of the car, was determined. The subjects also performed a psychometric test battery including the DSST, Sternberg memory scanning test, a tracking test, and a divided attention test. Difference scores from placebo for parameters of the psychometric tests and SDLP were computed and correlated with each other. A stepwise linear regression analysis determined the predictive validity of the laboratory test battery to SDLP. Stepwise regression analyses revealed that the combination of five parameters, hard tracking, tracking and reaction time of the divided attention test, and reaction time and percentage of errors of the Sternberg memory scanning test, together had a predictive validity of 33.4%. The psychometric tests in this test battery showed insufficient predictive validity to replace the on-the-road driving test during normal traffic.
Boysan, Murat; Kuss, Daria J; Barut, Yaşar; Ayköse, Nafi; Güleç, Mustafa; Özdemir, Osman
Of many instruments developed to assess Internet addiction, the Internet Addiction Test (IAT), an expanded version of the Internet Addiction Diagnostic Questionnaire (IADQ), has been the most widely used scale in English and non-English speaking populations. In this study, our aim was to investigate the psychometric properties of short and expanded versions of the IAT in a Turkish undergraduate sample. Overall, 455 undergraduate students from Turkey aged between 18 and 30 participated in the study (63.53% were females). Explanatory and confirmatory factor analytic procedures investigated factor structures of the IADQ and IAT. The Internet Addiction Scale (IAS), Coping Inventory for Stressful Situations (CISS), Obsessive Compulsive Inventory-Revised (OCI-R) and Dissociative Experiences Scale (DES) were administered to assess convergent and divergent validities of the IADQ and IAT. Internal consistency and 15-day test-retest reliability were computed. In the factorial analytic investigation, we found a unidimensional factor structure for each measure fit the current data best. Significant but weak to moderate correlations of the IADQ and the IAT with the CISS, OCI-R and DES provided empirical evidence for divergent validity, whereas strong associations with the subscales of the IAS pointed to the convergent validity of Young's Internet addiction construct. Internal consistency of the IADQ was weak (α=0.67) and of the IAT was high (α=0.93). Temporal reliability of both instruments was very high (α=0.81 and α=0.87; respectively). The IAT revealed promising and sound psychometric properties in a Turkish sample. Copyright © 2015 Elsevier Ltd. All rights reserved.
Kalis, Emils; Roke, Liga; Krumina, Indra
The Test for Creative Thinking-Drawing Production (TCT-DP) is designed as an effective drawing-based instrument for measuring creative potential. Many studies report adaptation efforts in other cultures pointing out good psychometric properties of the instrument nonetheless revealing also some trouble spots. The present study includes adaptation…
Paans, Wolter; Sermeus, Walter; Nieweg, Roos; van der Schans, Cees P.
AIM: This paper is a report of the development and testing of the psychometric properties of an instrument to measure the accuracy of nursing documentation in general hospitals. BACKGROUND: Little information is available about the accuracy of nursing documentation. None of the existing instruments
von Piekartz, H; Stotz, E; Both, A; Bahn, G; Armijo-Olivo, S; Ballenberger, N
The primary objective of this study was to determine the structural and known-group validity as well as the inter-rater reliability of a test battery to evaluate the motor control of the craniofacial region. Seventy volunteers without TMD and 25 subjects with TMD (Axes I) per the DC/TMD were asked to execute a test battery consisting of eight tests. The tests were video-taped in the same sequence in a standardised manner. Two experienced physical therapists participated in this study as blinded assessors. We used exploratory factor analysis to identify the underlying component structure of the eight tests. Internal consistency (Cronbach's α), inter-rater reliability (intra-class correlation coefficient) and construct validity (ie, hypothesis testing-known-group validity) (receiver operating curves) were also explored for the test battery. The structural validity showed the presence of one factor underlying the construct of the test battery. The internal consistency was excellent (0.90) as well as the inter-rater reliability. All values of reliability were close to 0.9 or above indicating very high inter-rater reliability. The area under the curve (AUC) was 0.93 for rater 1 and 0.94 for rater two, respectively, indicating excellent discrimination between subjects with TMD and healthy controls. The results of the present study support the psychometric properties of test battery to measure motor control of the craniofacial region when evaluated through videotaping. This test battery could be used to differentiate between healthy subjects and subjects with musculoskeletal impairments in the cervical and oro-facial regions. In addition, this test battery could be used to assess the effectiveness of management strategies in the craniofacial region. © 2017 John Wiley & Sons Ltd.
Nieboer Anna P
Full Text Available Abstract Background Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Results Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales – vision, participative safety, task orientation and support for innovation – acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. Conclusion The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.
Strating, Mathilde M H; Nieboer, Anna P
Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales--vision, participative safety, task orientation and support for innovation--acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.
Hsiao, Ya-Chu; Chiang, Yi-Chien; Lee, Hsiang-Chun; Han, Chin-Yen
To further examine the psychometric properties of the spiritual health scale short form, including its reliability and validity. Spirituality is one of the main factors associated with good health outcomes. A reliable and valid instrument to measure spirituality is essential to identify the spiritual needs of an individual and to evaluate the effect of spiritual care. A cross-sectional study design was used. The study was conducted in six nursing schools in northern, central and southern Taiwan. The inclusion criterion for participants was nursing students with clinical practice experience. Initially, 1141 participants were recruited for the study, but 67 were absent and 48 did not complete the questionnaires. A total of 1026 participants were finally recruited, indicating a response rate of 89·9%. The psychometric testing of the spiritual health scale short form included construct validity with confirmatory factor analysis, known-group validity and internal consistency reliability. The results of the confirmatory factor analysis supported the five-factor model as an acceptable model fit. In the known-group validity, the results indicated that people who are in the category of primary religious affiliation have better spiritual health than people in the category of secondary religious affiliation and atheism. The result also indicated that the 24-item spiritual health scale short form achieved an acceptable internal consistency coefficient. The findings suggest that the spiritual health scale short form is a valid and reliable instrument for the appraisal of individual spiritual health. The spiritual health scale short form could provide useful information to guide clinical practice in assessing and managing people's spiritual health in Taiwan. © 2013 John Wiley & Sons Ltd.
Chang, Li-Chun; Hsieh, Pei-Lin; Liu, Chieh-Hsing
The purpose of this study is to develop and evaluate the psychometric properties of the Chinese version of short-form Test of Functional Health Literacy in Adolescents. Assessing health literacy is vital to design health education programme; however, there are no measurement tools exist for use specifically in Chinese adolescents. A non-experimental design was used to test the psychometric properties of the Test of Functional Health Literacy in Adolescents. The short-form Test of Functional Health Literacy in Adolescents was translated and back translated into a Chinese language version. Thirty high school students were recruited to validate the scenario of Test of Functional Health Literacy in Adolescents. Based on the multiple-stage stratified random sampling method, 300 high school students from four counties in Taiwan were invited to participate in this study to evaluate the psychometric properties of Test of Functional Health Literacy in Adolescents. The Functional Health Literacy in Adolescents had good internal consistency reliability and excellent test-retest reliability. Confirmatory factor analysis resulted in a one-factor solution. Contrary to the original version of the Test of Functional Health Literacy in Adolescents, the findings revealed that the 36-item, one-factor model for the Test of Functional Health Literacy in Adolescents is the best-fit model. This is a suitable instrument to assess health literacy levels in Chinese adolescents before health education programmes can be appropriately planned, implemented and evaluated. © 2012 Blackwell Publishing Ltd.
Nikravesh, Maryam; Jafari, Zahra; Mehrpour, Masoud; Kazemi, Roozbeh; Amiri Shavaki, Younes; Hossienifar, Shamim; Azizi, Mohamad Parsa
Background: The paced auditory serial addition test (PASAT) was primarily developed to assess the effects of traumatic brain injury on cognitive functioning. Working memory (WM) is one of the most important aspects of cognitive function, and WM impairment is one of the clinically remarkable signs of aphasia. To develop the Persian version of PASAT, an initial version was used in individuals with aphasia (IWA). Methods: In this study, 25 individuals with aphasia (29-60 years) and 85 controls (18-60 years) were included. PASAT was presented in the form of recorded 61 single-digit numbers (1 to 9). The participants repeatedly added the 2 recent digits. The psychometric properties of PASAT including convergent validity (using the digit memory span tasks), divergent validity (using results in the control group and IWA group), and face validity were investigated. Test-retest reliability was considered as well. Results: The relationship between the PASAT and digit memory span tests was moderate to strong in the control group (forward digit memory span test: r= 0.52, p< 0.0001; backward digit memory span test: r = 0.48, p< 0.0001). A strong relationship was found in IWA (forward digit memory span test: r= 0.72, p< 0.0001; backward digit memory span test: r= 0.53, p= 0.006). Also, strong testretest reliability (intraclass correlation= 0.95, p< 0.0001) was observed. Conclusion: According to our results, the PASAT is a valid and reliable test to assess working memory, particularly in IWA. It could be used as a feasible tool for clinical and research applications. PMID:29445690
Li, Ho Cheung William; Chung, Oi Kwan Joyce; Ho, Ka Yan
This paper is a report of psychometric testing of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children. The availability of a valid and reliable instrument that accurately detects depressive symptoms in children is crucial before any psychological intervention can be appropriately planned and evaluated. There is no such an instrument for Chinese children. A test-retest, within-subjects design was used. A total of 313 primary school students between the ages of 8 and 12 years were invited to participate in the study in 2009. Participants were asked to respond to the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children, short form of the State Anxiety Scale for Children and Rosenberg's Self-Esteem Scale. The internal consistency, content validity and construct validity and test-retest reliability of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children were assessed. The newly-translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly-translated scale can be used as a self-report assessment tool in detecting depressive symptoms of Chinese children aged between 8 and 12 years. © 2010 Blackwell Publishing Ltd.
Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff
Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149
Full Text Available Abstract Background Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT or ACT. These differences have important implications both for utilization and interpretation. Although much has been written about PVs, it appears that there are still misconceptions about whether and how to employ them in secondary analyses. Methods We address a range of technical issues, including those raised in a recent article that was written to inform economists using these databases. First, an extensive review of the relevant literature was conducted, with particular attention to key publications that describe the derivation and psychometric characteristics of such achievement measures. Second, a simulation study was carried out to compare the statistical properties of estimates based on the use of PVs with those based on other, commonly used methods. Results It is shown, through both theoretical analysis and simulation, that under fairly general conditions appropriate use of PV yields approximately unbiased estimates of model parameters in regression analyses of large scale survey data. The superiority of the PV methodology is particularly evident when measures of student achievement are employed as explanatory variables. Conclusions The PV methodology used to report student test performance in large scale surveys remains the state-of-the-art for secondary analyses of these databases.
Grant, Marcia; Ferrell, Betty; Dean, Grace; Uman, Gwen; Chu, David; Krouse, Robert
Ostomies may be performed for bowel or urinary diversion, and occur in both cancer and non-cancer patients. Impact on physical, psychological, social and spiritual well-being is not unexpected, but has been minimally described in the literature. The City of Hope Quality of Life (COH-QOL)-Ostomy Questionnaire is an adult patient self-report instrument designed to assess quality of life. This report focuses on the revision and psychometric testing of this questionnaire. The revised COH-QOL-Ostomy Questionnaire involved in-depth patient interviews and expert panel review. The format consisted of a 13-item disease and demographic section, a 34-item forced-choice section, and a 41-item linear analogue scaled section. A mailed survey to California members of the United Ostomy Association resulted in a 62% response rate (n = 1513). Factor analysis was conducted to refine the instrument. Construct validity involved testing a number of hypotheses identifying contrasting groups. Factor analysis confirmed the conceptual framework. Reliability of subscales ranged from 0.77 to 0.90. The questionnaire discriminated between subpopulations with specific concerns. Overall, the analyses provide evidence for the validity and reliability of the COH-QOL-Ostomy Questionnaire as a comprehensive, multidimensional self-report questionnaire for measuring quality of life in patients with intestinal ostomies.
Ferrero-Arias, J; Turrión-Rojo, M A
To explore the relationship between scores on the Test Your Memory (TYM) battery and findings from a more exhaustive neurocognitive assessment. The TYM and fourteen psychometric tests were administered to 84 subjects aged 50 or older who attended an outpatient neurology clinic due to cognitive symptoms. Each patient's cognitive state was determined independently from his/her score on the TYM (CDR 0, n=25; CDR 0.5, n=45; CDR 1, n=14). We analysed concurrent validity of TYM scores and results from the psychometric tests, as well as the degree of concordance between the two types of measurement, by contrasting normalised data from each instrument. Although the intraclass correlation coefficient was 0.67 (confidence interval 95%, 0.53-0.77), analysis of the Bland-Altman plot and the curve on the survival-agreement plot (Luiz et al. method) demonstrates that the individual distances between the two methods exhibit excessive dispersion from a clinical viewpoint. TYM-based predictions of the mean z-score on psychometric tests differed substantially from real results in 30% of the subjects. Concordance of 95% can only be achieved by accepting absolute inter-instrument differences of up to 0.87 as identical values. Furthermore, the TYM underestimates cognitive performance for low values and overestimates it for high values. The TYM is a cognitive screening test which should not be used to predict results on psychometric tests or to detect cognitive changes in clinical trials. Copyright © 2014 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.
Thanakwang, Kattika; Isaramalai, Sang-Arun; Hatthakit, Urai
Active aging is central to enhancing the quality of life for older adults, but its conceptualization is not often made explicit for Asian elderly people. Little is known about active aging in older Thai adults, and there has been no development of scales to measure the expression of active aging attributes. The aim of this study was to develop a culturally relevant composite scale of active aging for Thai adults (AAS-Thai) and to evaluate its reliability and validity. EIGHT STEPS OF SCALE DEVELOPMENT WERE FOLLOWED: 1) using focus groups and in-depth interviews, 2) gathering input from existing studies, 3) developing preliminary quantitative measures, 4) reviewing for content validity by an expert panel, 5) conducting cognitive interviews, 6) pilot testing, 7) performing a nationwide survey, and 8) testing psychometric properties. In a nationwide survey, 500 subjects were randomly recruited using a stratified sampling technique. Statistical analyses included exploratory factor analysis, item analysis, and measures of internal consistency, concurrent validity, and test-retest reliability. Principal component factor analysis with varimax rotation resulted in a final 36-item scale consisting of seven factors of active aging: 1) being self-reliant, 2) being actively engaged with society, 3) developing spiritual wisdom, 4) building up financial security, 5) maintaining a healthy lifestyle, 6) engaging in active learning, and 7) strengthening family ties to ensure care in later life. These factors explained 69% of the total variance. Cronbach's alpha coefficient for the overall AAS-Thai was 0.95 and varied between 0.81 and 0.91 for the seven subscales. Concurrent validity and test-retest reliability were confirmed. The AAS-Thai demonstrated acceptable overall validity and reliability for measuring the multidimensional attributes of active aging in a Thai context. This newly developed instrument is ready for use as a screening tool to assess active aging levels among older
Poghosyan, Lusine; Nannini, Angela; Finkelstein, Stacey R; Mason, Emanuel; Shaffer, Jonathan A
Policy makers and healthcare organizations are calling for expansion of the nurse practitioner (NP) workforce in primary care settings to assure timely access and high-quality care for the American public. However, many barriers, including those at the organizational level, exist that may undermine NP workforce expansion and their optimal utilization in primary care. This study developed a new NP-specific survey instrument, Nurse Practitioner Primary Care Organizational Climate Questionnaire (NP-PCOCQ), to measure organizational climate in primary care settings and conducted its psychometric testing. Using instrument development design, the organizational climate domain pertinent for primary care NPs was identified. Items were generated from the evidence and qualitative data. Face and content validity were established through two expert meetings. Content validity index was computed. The 86-item pool was reduced to 55 items, which was pilot tested with 81 NPs using mailed surveys and then field-tested with 278 NPs in New York State. SPSS 18 and Mplus software were used for item analysis, reliability testing, and maximum likelihood exploratory factor analysis. Nurse Practitioner Primary Care Organizational Climate Questionnaire had face and content validity. The content validity index was .90. Twenty-nine items loaded on four subscale factors: professional visibility, NP-administration relations, NP-physician relations, and independent practice and support. The subscales had high internal consistency reliability. Cronbach's alphas ranged from.87 to .95. Having a strong instrument is important to promote future research. Also, administrators can use it to assess organizational climate in their clinics and propose interventions to improve it, thus promoting NP practice and the expansion of NP workforce.
Kuo, Shu-Fen; Chang, Wen-Yin; Chang, Lu-I; Chou, Yu-Hua; Chen, Ching-Min
This is a report of development and psychometric testing of the East Asian Acculturation Measure-Chinese version (EAAM-C) scale. An instrument validation design with a cross-sectional survey was conducted. The process was carried in two phases. In Phase 1, Barry's East Asian Acculturation Measure was translated and back translated to evaluate its content, face validity, and feasibility validity. In Phase 2, the 16-item EAAM-C was pilot-tested among 485 female immigrants for test-retest reliability, internal consistency, theoretically-supported construct validity and concurrent validity. The pilot work and the survey results indicated the tools possessed adequate content and face validity. The Cronbach's Alphas for the EAAM-C was 0.72, and 0.76-0.79 for its subscales, and the correlation of test-retest reliability (at 3 weeks) was 0.75. After dropping one item, four theoretically-supported factors which explained 61.82% of the variance were abstracted using exploratory factor analysis: assimilation, integration, separation, and marginalization. Based on the underlying four-factor theoretical structures of the EAAM, the confirmatory factor analysis of the EAAM-C was further examined. The analysis revealed that the four-factor model was an acceptable fit for the data which demonstrated adequate finding in its construct validity. These factors were inter-correlated, and showed statistically significant correlation with the Chinese Health Questionnaire, indicating adequate concurrent validity. The scale shows acceptable validity and consistency, and suggests that immigrant acculturation is a complex construct. This quick evaluation instrument can be applied to assess clients' acculturation and in further developing certain interventions to improve their health.
Wocial, Lucia D; Weaver, Michael T
To report the development and psychometric testing of the Moral Distress Thermometer. The Moral Distress Thermometer is a new screening tool to measure moral distress in nurses who practise in the hospital setting. Moral distress occurs when one knows the ethically correct thing to do, but is prevented from acting on that perceived obligation. It is a well documented phenomenon with negative consequences that may be experienced by nurses. Creating an instrument to effectively and efficiently measure moral distress in a timely way has been identified as a priority for nursing. This study used a cross-sectional survey design. Data collection for this research occurred in 2009. Participants simultaneously completed either the adult or pediatric version of the Moral Distress Scale version 2009 and the Moral Distress Thermometer. A total of 529 participants from various clinical areas completed both tools. Coefficients alpha were adequate for both Adult (0·90) and Pediatric (0·92) Moral Distress Scale 2009 scales. Statistically significant Pearson correlations were found for the Moral Distress Thermometer with Adult Moral Distress Scale 2009 and Pediatric Moral Distress Scale 2009 and higher Moral Distress Thermometer, Adult Moral Distress Scale 2009 and Pediatric Moral Distress Scale 2009 means for participants who had left or who considered leaving a position because of moral distress. These findings provide support for the validity of the Moral Distress Thermometer. © 2012 Blackwell Publishing Ltd.
Piredda, Michela; Biagioli, Valentina; Gambale, Giulia; Porcelli, Elisa; Barbaranelli, Claudio; Palese, Alvisa; De Marinis, Maria Grazia
Effective measures of nursing care dependency in neurorehabilitation are warranted to plan nursing interventions to help patients avoid increasing dependency. The Care Dependency Scale (CDS) is a theory-based, comprehensive tool to evaluate functional disability. This study aimed to modify the CDS for neurological and neurorehabilitation patients (Neuro-CDS) and to test its psychometric properties in adult neurorehabilitation inpatients. Exploratory factor analysis (EFA) was performed using a Maximum Likelihood robust (MLR) estimator. The Barthel Index (BI) was used to evaluate concurrent validity. Stability was measured using the Intra-class Correlation Coefficient (ICC). The sample included 124 patients (mean age = 69.7 years, 54% male). The EFA revealed a two-factor structure with good fit indexes, Factor 1 (Physical care dependence) loaded by 11 items and Factor 2 (Psycho-social care dependence) loaded by 4 items. The correlation between factors was 0.61. Correlations between Factor 1 and the BI and between Factor 2 and the BI were r = 0.843 and r = 0.677, respectively (p dependence in neurorehabilitation patients as a basis for individualized and holistic care.
Molle, Elizabeth; Froman, Robin
Computerized interdisciplinary plans of care have revitalized nurse-centric care plans into dynamic and meaningful electronic documents. To maximize the benefits of these documents, it is important to understand healthcare professionals' attitudes, specifically their confidence, for making computerized interdisciplinary care plans useful and meaningful documents. The purpose of the study was to test the psychometric properties of the Self-Efficacy for Interdisciplinary Plans of Care instrument intended to measure healthcare professionals' self-efficacy for using such documents. Content validity was assessed by an expert review panel. Content validity indices ranged from 0.75 to 1.00, with a scale CVI of 0.94. A sample of 389 healthcare providers completed the 14-item instrument. Principal axis factoring was used to assess factor structure. The exploratory factor analysis yielded a single-factor structure accounting for 71.76% of covariance. Cronbach internal consistency coefficient for the single factor solution was .97. The corrected item-total correlations ranged from 0.71 to 0.90. The coefficient of stability, during a 2-week period, with a subset of the sample (n = 38), was estimated at 0.82. The results of this study suggest that the Self-Efficacy for Interdisciplinary Plans of Care has sturdy reliability and validity for measuring the self-efficacy of healthcare providers to make computerized interdisciplinary plans of care meaningful and useful documents.
Felipo, Vicente; Urios, Amparo; Giménez-Garzó, Carla; Cauli, Omar; Andrés-Costa, Maria-Jesús; González, Olga; Serra, Miguel A; Sánchez-González, Javier; Aliaga, Roberto; Giner-Durán, Remedios; Belloch, Vicente; Montoliu, Carmina
To assess whether non invasive blood flow measurement by arterial spin labeling in several brain regions detects minimal hepatic encephalopathy. Blood flow (BF) was analyzed by arterial spin labeling (ASL) in different brain areas of 14 controls, 24 cirrhotic patients without and 16 cirrhotic patients with minimal hepatic encephalopathy (MHE). Images were collected using a 3 Tesla MR scanner (Achieva 3T-TX, Philips, Netherlands). Pulsed ASL was performed. Patients showing MHE were detected using the battery Psychometric Hepatic Encephalopathy Score (PHES) consisting of five tests. Different cognitive and motor functions were also assessed: alterations in selective attention were evaluated using the Stroop test. Patients and controls also performed visuo-motor and bimanual coordination tests. Several biochemical parameters were measured: serum pro-inflammatory interleukins (IL-6 and IL-18), 3-nitrotyrosine, cGMP and nitrates+nitrites in plasma, and blood ammonia. Bivariate correlations were evaluated. In patients with MHE, BF was increased in cerebellar hemisphere (P = 0.03) and vermis (P = 0.012) and reduced in occipital lobe (P = 0.017). BF in cerebellar hemisphere was also increased in patients without MHE (P = 0.02). Bimanual coordination was impaired in patients without MHE (P = 0.05) and much more in patients with MHE (P battery and with CFF. BF in cerebellar hemisphere correlates with plasma cGMP and nitric oxide (NO) metabolites. BF in vermis cerebellar also correlates with NO metabolites and with 3-nitrotyrosine. IL-18 in plasma correlates with BF in thalamus and occipital lobe. Non invasive BF determination in cerebellum using ASL may detect MHE earlier than the PHES. Altered NO-cGMP pathway seems to be associated to altered BF in cerebellum.
Petrova, Tatjana; Kavookjian, Jan; Madson, Michael B; Dagley, John; Shannon, David; McDonough, Sharon K
Motivational interviewing (MI) has demonstrated a significant impact as an intervention strategy for addiction management, change in lifestyle behaviors, and adherence to prescribed medication and other treatments. Key elements to studying MI include training in MI of professionals who will use it, assessment of skills acquisition in trainees, and the use of a validated skills assessment tool. The purpose of this research project was to develop a psychometrically valid and reliable tool that has been designed to assess MI skills competence in health care provider trainees. The goal was to develop an assessment tool that would evaluate the acquisition and use of specific MI skills and principles, as well as the quality of the patient-provider therapeutic alliance in brief health care encounters. To address this purpose, specific steps were followed, beginning with a literature review. This review contributed to the development of relevant conceptual and operational definitions, selecting a scaling technique and response format, and methods for analyzing validity and reliability. Internal consistency reliability was established on 88 video recorded interactions. The inter-rater and test-retest reliability were established using randomly selected 18 from the 88 interactions. The assessment tool Motivational Interviewing Skills for Health Care Encounters (MISHCE) and a manual for use of the tool were developed. Validity and reliability of MISHCE were examined. Face and content validity were supported with well-defined conceptual and operational definitions and feedback from an expert panel. Reliability was established through internal consistency, inter-rater reliability, and test-retest reliability. The overall internal consistency reliability (Cronbach's alpha) for all fifteen items was 0.75. MISHCE demonstrated good inter-rater reliability and good to excellent test-retest reliability. MISHCE assesses the health provider's level of knowledge and skills in brief
Piredda, Michela; Ghezzi, Valerio; Fenizia, Elisa; Marchetti, Anna; Petitti, Tommasangelo; De Marinis, Maria Grazia; Sili, Alessandro
To develop and psychometrically test the Italian-language Nurse Caring Behaviours Scale, a short measure of nurse caring behaviour as perceived by inpatients. Patient perceptions of nurses' caring behaviours are a predictor of care quality. Caring behaviours are culture-specific, but no measure of patient perceptions has previously been developed in Italy. Moreover, existing tools show unclear psychometric properties, are burdensome for respondents, or are not widely applicable. Instrument development and psychometric testing. Item generation included identifying and adapting items from existing measures of caring behaviours as perceived by patients. A pool of 28 items was evaluated for face validity. Content validity indexes were calculated for the resulting 15-item scale; acceptability and clarity were pilot tested with 50 patients. To assess construct validity, a sample of 2,001 consecutive adult patients admitted to a hospital in 2014 completed the scale and was split into two groups. Reliability was evaluated using nonlinear structural equation modelling coefficients. Measurement invariance was tested across subsamples. Item 15 loaded poorly in the exploratory factor analysis (n = 983) and was excluded from the final solution, positing a single latent variable with 14 indicators. This model fitted the data moderately. The confirmatory factor analysis (n = 1018) returned similar results. Internal consistency was excellent in both subsamples. Full scalar invariance was reached, and no significant latent mean differences were detected across subsamples. The new instrument shows reasonable psychometric properties and is a promising short and widely applicable measure of inpatient perceptions of nurse caring behaviours. © 2017 John Wiley & Sons Ltd.
Full Text Available Executive function (EF is an important predictor of numerous developmental outcomes, such as academic achievement and behavioral adjustment. Although a plethora of measurement instruments exists to assess executive function in children, only few of these are suitable for toddlers, and even fewer have undergone psychometric evaluation. The present study evaluates the psychometric properties and validity of an assessment battery for measuring EF in two-year-olds. A sample of 2437 children were administered the assessment battery at a mean age of 2;4 years (SD = 0;3 years in a large-scale field study. Measures of both hot EF (snack and gift delay tasks and cool EF (six boxes, memory for location, and visual search task were included. Confirmatory Factor Analyses showed that a two-factor hot and cool EF model fitted the data better than a one-factor model. Measurement invariance was supported across groups differing in age, gender, socioeconomic status (SES, home language, and test setting. Criterion and convergent validity were evaluated by examining relationships between EF and age, gender, SES, home language, and parent and teacher reports of children’s attention and inhibitory control. Predictive validity of the test battery was investigated by regressing children’s pre-academic skills and behavioral problems at age three on the latent hot and cool EF factors at age two years. The test battery showed satisfactory psychometric quality and criterion, convergent, and predictive validity. Whereas cool EF predicted both pre-academic skills and behavior problems one year later, hot EF predicted behavior problems only. These results show that EF can be assessed with psychometrically sound instruments in children as young as two years, and that EF tasks can be reliably applied in large scale field research. The current instruments offer new opportunities for investigating EF in early childhood, and for evaluating interventions targeted at improving
Clark, Cynthia M; Barbosa-Leiker, Celestina; Gill, Larecia Money; Nguyen, Danh
Academic incivility is a serious challenge for nursing education, which needs to be empirically measured and fully addressed. A convenience sample of nursing faculty and students from 20 schools of nursing in the United States participated in a mixed-methods study to test the psychometric properties of the Incivility in Nursing Education-Revised (INE-R) Survey. A factor analysis and other reliability analyses support the use of the INE-R as a valid and reliable measurement of student and faculty perceptions of incivility in nursing education. The INE-R is a psychometrically sound instrument to measure faculty and student perceptions of incivility; to examine differences regarding levels of nursing education, program type, gender, age, and ethnicity; to compare perceptions of incivility between and among adjunct, clinical, teaching, and research faculty; and to conduct pre- and postassessments of the perceived levels of faculty and student incivility in nursing programs to inform evidence-based interventions. Copyright 2015, SLACK Incorporated.
Full Text Available Abstract Background Validated instruments are needed to evaluate the programmatic impact of Evidence Based Practice (EBP training and to document the competence of individual trainees. This study aimed to translate the Fresno test into Spanish and subsequently validate it, in order to ensure the equivalence of the Spanish version against the original English version. Methods Before and after study performed between October 2007 and June 2008. Three groups of participants: (a Mentors of family medicine residents (expert group (n = 56; (b Family medicine physicians (intermediate experience group (n = 17; (c Family medicine residents (novice group (n = 202; Medical residents attended an EBP course, and two sets of the test were administered before and after the course. The Fresno test is a performance based measure for use in medical education that assesses EBP skills. The outcome measures were: inter-rater and intra-rater reliability, internal consistency, item analyses, construct validity, feasibility of administration, and responsiveness. Results Inter-rater correlations were 0.95 and 0.85 in the pre-test and the post-test respectively. The overall intra-rater reliability was 0.71 and 0.81 in the pre-test and post-test questionnaire, respectively. Cronbach's alpha was 0.88 and 0.77, respectively. 152 residents (75.2% returned both sets of the questionnaire. The observed effect size for the residents was 1.77 (CI 95%: 1.57-1.95, the standardised response mean was 1.65 (CI 95%:1.47-1.82. Conclusions The Spanish version of the Fresno test is a useful tool in assessing the knowledge and skills of EBP in Spanish-speaking residents of Family Medicine.
Jonsson, Jakob; Munck, Ingrid; Volberg, Rachel; Carlbring, Per
Recent increases in the number of online gambling sites have made gambling more available, which may contribute to an increase in gambling problems. At the same time, online gambling provides opportunities to introduce measures intended to prevent problem gambling. GamTest is an online test of gambling behavior that provides information that can be used to give players individualized feedback and recommendations for action. The aim of this study is to explore the dimensionality of GamTest and validate it against the Problem Gambling Severity Index (PGSI) and the gambler's own perceived problems. A recent psychometric approach, exploratory structural equation modeling (ESEM) is used. Well-defined constructs are identified in a two-step procedure fitting a traditional exploratory factor analysis model as well as a so-called bifactor model. Using data collected at four Nordic gambling sites in the autumn of 2009 (n = 10,402), the GamTest ESEM analyses indicate high correspondence with the players' own understanding of their problems and with the PGSI, a validated measure of problem gambling. We conclude that GamTest captures five dimensions of problematic gambling (i.e., overconsumption of money and time, and monetary, social and emotional negative consequences) with high reliability, and that the bifactor approach, composed of a general factor and specific residual factors, reproduces all these factors except one, the negative consequences emotional factor, which contributes to the dominant part of the general factor. The results underscore the importance of tailoring feedback and support to online gamblers with a particular focus on how to handle emotions in relation to their gambling behavior.
Haug, Tobias; Mann, Wolfgang
Given the current lack of appropriate assessment tools for measuring deaf children's sign language skills, many test developers have used existing tests of other sign languages as templates to measure the sign language used by deaf people in their country. This article discusses factors that may influence the adaptation of assessment tests from one natural sign language to another. Two tests which have been adapted for several other sign languages are focused upon: the Test for American Sign Language and the British Sign Language Receptive Skills Test. A brief description is given of each test as well as insights from ongoing adaptations of these tests for other sign languages. The problems reported in these adaptations were found to be grounded in linguistic and cultural differences, which need to be considered for future test adaptations. Other reported shortcomings of test adaptation are related to the question of how well psychometric measures transfer from one instrument to another.
M.M.H. Strating (Mathilde); A.P. Nieboer (Anna)
textabstractAbstract BACKGROUND: Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate
Khorashad, Behzad S.; Baron-Cohen, Simon; Roshan, Ghasem M.; Kazemian, Mojtaba; Khazai, Ladan; Aghili, Zahra; Talaei, Ali; Afkhamizadeh, Mozhgan
The psychometric properties of the Persian "Reading the Mind in the Eyes" test were investigated, so were the predictions from the Empathizing-Systemizing theory of psychological sex differences. Adults aged 16-69 years old (N = 545, female = 51.7%) completed the test online. The analysis of items showed them to be generally acceptable.…
Fernandes, Tânia; Araújo, Susana; Sucena, Ana; Reis, Alexandra; Castro, São Luís
Reading is a central cognitive domain, but little research has been devoted to standardized tests for adults. We, thus, examined the psychometric properties of the 1-min version of Teste de Idade de Leitura (Reading Age Test; 1-min TIL), the Portuguese version of Lobrot L3 test, in three experiments with college students: typical readers in Experiment 1A and B, dyslexic readers and chronological age controls in Experiment 2. In Experiment 1A, test-retest reliability and convergent validity were evaluated in 185 students. Reliability was >.70, and phonological decoding underpinned 1-min TIL. In Experiment 1B, internal consistency was assessed by presenting two 45-s versions of the test to 19 students, and performance in these versions was significantly associated (r = .78). In Experiment 2, construct validity, criterion validity and clinical utility of 1-min TIL were investigated. A multiple regression analysis corroborated construct validity; both phonological decoding and listening comprehension were reliable predictors of 1-min TIL scores. Logistic regression and receiver operating characteristics analyses revealed the high accuracy of this test in distinguishing dyslexic from typical readers. Therefore, the 1-min TIL, which assesses reading comprehension and potential reading difficulties in college students, has the necessary psychometric properties to become a useful screening instrument in neuropsychological assessment and research. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Ranger, Jochen; Kuhn, Jörg-Tobias; Szardenings, Carsten
Cognitive psychometric models embed cognitive process models into a latent trait framework in order to allow for individual differences. Due to their close relationship to the response process the models allow for profound conclusions about the test takers. However, before such a model can be used its fit has to be checked carefully. In this manuscript we give an overview over existing tests of model fit and show their relation to the generalized moment test of Newey (Econometrica, 53, 1985, 1047) and Tauchen (J. Econometrics, 30, 1985, 415). We also present a new test, the Hausman test of misspecification (Hausman, Econometrica, 46, 1978, 1251). The Hausman test consists of a comparison of two estimates of the same item parameters which should be similar if the model holds. The performance of the Hausman test is evaluated in a simulation study. In this study we illustrate its application to two popular models in cognitive psychometrics, the Q-diffusion model and the D-diffusion model (van der Maas, Molenaar, Maris, Kievit, & Boorsboom, Psychol Rev., 118, 2011, 339; Molenaar, Tuerlinckx, & van der Maas, J. Stat. Softw., 66, 2015, 1). We also compare the performance of the test to four alternative tests of model fit, namely the M 2 test (Molenaar et al., J. Stat. Softw., 66, 2015, 1), the moment test (Ranger et al., Br. J. Math. Stat. Psychol., 2016) and the test for binned time (Ranger & Kuhn, Psychol. Test. Asess. , 56, 2014b, 370). The simulation study indicates that the Hausman test is superior to the latter tests. The test closely adheres to the nominal Type I error rate and has higher power in most simulation conditions. © 2017 The British Psychological Society.
Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling
Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, pself-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Fox, Gerardus J.A.; van den Berg, Stéphanie Martine; Veldkamp, Bernard P.; Irwing, P.; Booth, T.; Hughes, D.
In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item
Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M
The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…
Full Text Available Alessandra Gorini,1,2 Ketti Mazzocco,1,2 Sara Gandini,2 Elisabetta Munzone,2 Gordon McVie,2 Gabriella Pravettoni1,2 1Department of Health Science, University of Milan, Milan, Italy; 2European Institute of Oncology, Milan, Italy Introduction: The advent of “personalized medicine” has been driven by technological advances in genomics. Concentration at the subcellular level of a patient's cancer cells has meant inevitably that the “person” has been overlooked. For this reason, we think there is an urgent need to develop a truly personalized approach focusing on each patient as an individual, assessing his/her unique mental dimensions and tailoring interventions to his/her individual needs and preferences. The aim of this study was to develop and test the psychometric properties of the ALGA-Breast Cancer (ALGA-BC, a new multidimensional questionnaire that assesses the breast cancer patient's physical and mental characteristics in order to provide physicians, prior to the consultation, with a patient's profile that is supposed to facilitate subsequent communication, interaction, and information delivery between the doctor and the patient. Methods: The specific validation processes used were: content and face validity, construct validity using factor analysis, reliability and internal consistency using test–retest reliability, and Cronbach's alpha correlation coefficient. The exploratory analysis included 100 primary breast cancer patients and 730 healthy subjects. Results: The exploratory factor analysis revealed eight key factors: global self-rated health, perceived physical health, anxiety, self-efficacy, cognitive closure, memory, body image, and sexual life. Test–retest reliability and internal consistency were good. Comparing patients with a sample of healthy subjects, we also observed a general ability of the ALGA-BC questionnaire to discriminate between the two. Conclusion: The ALGA-BC questionnaire with 29 items is a valid
Ajorpaz, Neda Mirbagher; Tafreshi, Mansoureh Zagheri; Mohtashami, Jamileh; Zayeri, Farid; Rahemi, Zahra
The clinical competence of nursing students in operating room (OR) is an important issue in nursing education. The purpose of this study was to evaluate the psychometric properties of the Persian Perceived Perioperative Competence Scale-Revised (PPCS-R) instrument. This cross-sectional study was conducted across 12 universities in Iran. The psychometric properties and factor structure of the PPCS-R for OR students was examined. Based on the results of factor analysis, seven items were removed from the original version of the scale. The fitness indices of the Persian scale include comparative fit index (CFI) = .90, goodness-of-fit-index (GFI) = .86, adjusted goodness-of-fit index (AGFI) = .90, normed fit index (NFI) = .84, and root mean square error of approximation (RMSEA) = .04. High validity and reliability indicated the scale's value for measuring perceived perioperative competence of Iranian OR students.
Nieboer Anna P; Strating Mathilde MH
Abstract Background Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods This study included quality improvement teams participating in the Care for Better improvement program for...
Tseng, Hsu-Min; Liu, F-C; West, Michael
Understanding the feasibility of applying the Team Climate Inventory (TCI) in non-Western cultures is essential for researchers attempting to understand the influence of culture on workers' perceived climate. This study describes the application of the TCI in such a setting using data from 203 administrators employed in a Taiwanese medical center. Reliability and factor analyses were performed to establish the feasibility and psychometric properties of the TCI Taiwan version. Reliabilities of...
Moeini, Babak; Zamanian, Hadi; Taheri-Kharameh, Zahra; Ramezani, Tahereh; Saati-Asr, Mohamadhasan; Hajrahimian, Mohamadhasan; Amini-Tehrani, Mohammadali
Spirituality plays an important role in coping with chronic diseases for patients and they often report unmet spiritual and existential needs, which should be considered for a holistic view of their health. Studying spiritual needs in this generation requires culturally appropriate and valid instruments. The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The "forward-backward" procedure was applied to translate the SpNQ from English into Persian. The SpNQ-Persian Version (SpNQ-PV) was checked in terms of validity and reliability with a convenience sample of 100 elders with chronic diseases who were recruited from the inpatient wards at two university hospitals in Qom, Iran. The validity was assessed using content, face, and construct validity. The Cronbach alpha and test-retest were used to assess the reliability of the questionnaire. The results of the exploratory factor analysis indicated a five-factor solution for the questionnaire, which included religious needs, existential needs, forgiveness/generativity needs, need for inner peace, and emotional needs. These accounted for 60.1% of the total observed variance. One item was removed (factor loading Spiritual Well-being Scale. Cronbach alpha of the subscales ranged from 0.56 to 0.78 and the test-retest reliability ranged from 0.72 to 0.91, which indicated an acceptable range of reliability. The SpNQ-PV showed a minor difference in structuring and indicated good psychometric properties, which can be used to assess the spiritual needs of Iranian elders suffering from chronic diseases. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Lin, Chung-Ying; Broström, Anders; Nilsen, Per; Griffiths, Mark D; Pakpour, Amir H
Background and aims The Bergen Social Media Addiction Scale (BSMAS), a six-item self-report scale that is a brief and effective psychometric instrument for assessing at-risk social media addiction on the Internet. However, its psychometric properties in Persian have never been examined and no studies have applied Rasch analysis for the psychometric testing. This study aimed to verify the construct validity of the Persian BSMAS using confirmatory factor analysis (CFA) and Rasch models among 2,676 Iranian adolescents. Methods In addition to construct validity, measurement invariance in CFA and differential item functioning (DIF) in Rasch analysis across gender were tested for in the Persian BSMAS. Results Both CFA [comparative fit index (CFI) = 0.993; Tucker-Lewis index (TLI) = 0.989; root mean square error of approximation (RMSEA) = 0.057; standardized root mean square residual (SRMR) = 0.039] and Rasch (infit MnSq = 0.88-1.28; outfit MnSq = 0.86-1.22) confirmed the unidimensionality of the BSMAS. Moreover, measurement invariance was supported in multigroup CFA including metric invariance (ΔCFI = -0.001; ΔSRMR = 0.003; ΔRMSEA = -0.005) and scalar invariance (ΔCFI = -0.002; ΔSRMR = 0.005; ΔRMSEA = 0.001) across gender. No item displayed DIF (DIF contrast = -0.48 to 0.24) in Rasch across gender. Conclusions Given the Persian BSMAS was unidimensional, it is concluded that the instrument can be used to assess how an adolescent is addicted to social media on the Internet. Moreover, users of the instrument may comfortably compare the sum scores of the BSMAS across gender.
Bekhet, Abir K; Zauszniewski, Jaclene A
Positive thinking interventions improve adaptive functioning and quality of life in many populations. However, no direct measure of positive thinking skills taught during intervention exists. This psychometric study of a convenience sample of 109 autism spectrum disorder (ASD) caregivers examined a new eight-item Positive Thinking Skills Scale (PTSS), which measures the frequency of use of positive thinking skills. The PTSS was found to be internally consistent (α = .90). Construct validity was supported by significant correlations (p thinking skills could provide direction for future intervention.
Hung, Man; Hon, Shirley D; Cheng, Christine; Franklin, Jeremy D; Aoki, Stephen K; Anderson, Mike B; Kapron, Ashley L; Peters, Christopher L; Pelt, Christopher E
The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Cohort study (diagnosis); Level of evidence, 2. Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior in all psychometric aspects examined in this study. Future
Meyerson, Paul; Tryon, Warren W
This study evaluated the psychometric equivalency of Web-based research. The Sexual Boredom Scale was presented via the World-Wide Web along with five additional scales used to validate it. A subset of 533 participants that matched a previously published sample (Watt & Ewing, 1996) on age, gender, and race was identified. An 8 x 8 correlation matrix from the matched Internet sample was compared via structural equation modeling with a similar 8 x 8 correlation matrix from the previously published study. The Internet and previously published samples were psychometrically equivalent. Coefficient alpha values calculated on the matched Internet sample yielded reliability coefficients almost identical to those for the previously published sample. Factors such as computer administration and uncontrollable administration settings did not appear to affect the results. Demographic data indicated an overrepresentation of males by about 6% and Caucasians by about 13% relative to the U.S. Census (2000). A total of 2,230 participants were obtained in about 8 months without remuneration. These results suggest that data collection on the Web is (1) reliable, (2) valid, (3) reasonably representative, (4) cost effective, and (5) efficient.
Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée
The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.
Harris, Richard W; McPherson, David L; Hanson, Claire M; Eggett, Dennis L
This study identified, digitally recorded, edited and evaluated 89 bisyllabic Vietnamese words with the goal of identifying homogeneous words that could be used to measure the speech recognition threshold (SRT) in native talkers of Vietnamese. Native male and female talker productions of 89 Vietnamese bisyllabic words were recorded, edited and then presented at intensities ranging from -10 to 20 dBHL. Logistic regression was used to identify the best words for measuring the SRT. Forty-eight words were selected and digitally edited to have 50% intelligibility at a level equal to the mean pure-tone average (PTA) for normally hearing participants (5.2 dBHL). Twenty normally hearing native Vietnamese participants listened to and repeated bisyllabic Vietnamese words at intensities ranging from -10 to 20 dBHL. A total of 48 male and female talker recordings of bisyllabic words with steep psychometric functions (>9.0%/dB) were chosen for the final bisyllabic SRT list. Only words homogeneous with respect to threshold audibility with steep psychometric function slopes were chosen for the final list. Digital recordings of bisyllabic Vietnamese words are now available for use in measuring the SRT for patients whose native language is Vietnamese.
Liou, Shwu-Ru; Liu, Hsiu-Chen; Tsai, Hsiu-Min; Tsai, Ying-Huang; Lin, Yu-Ching; Chang, Chia-Hao; Cheng, Ching-Yu
The purpose of the study was to develop and psychometrically test the Nurses Clinical Reasoning Scale. Clinical reasoning is an essential skill for providing safe and quality patient care. Identifying pre-graduates' and nurses' needs and designing training courses to improve their clinical reasoning competence becomes a critical task. However, there is no instrument focusing on clinical reasoning in the nursing profession. Cross-sectional design was used. This study included the development of the scale, a pilot study that preliminary tested the readability and reliability of the developed scale and a main study that implemented and tested the psychometric properties of the developed scale. The Nurses Clinical Reasoning Scale was developed based on the Clinical Reasoning Model. The scale includes 15 items using a Likert five-point scale. Data were collected from 2013-2014. Two hundred and fifty-one participants comprising clinical nurses and nursing pre-graduates completed and returned the questionnaires in the main study. The instrument was tested for internal consistency and test-retest reliability. Its validity was tested with content, construct and known-groups validity. One factor emerged from the factor analysis. The known-groups validity was confirmed. The Cronbach's alpha for the entire instrument was 0·9. The reliability and validity of the Nurses Clinical Reasoning Scale were supported. The scale is a useful tool and can be easily administered for the self-assessment of clinical reasoning competence of clinical nurses and future baccalaureate nursing graduates. Study limitations and further recommendations are discussed. © 2015 John Wiley & Sons Ltd.
Soltanparast, Sanaz; Jafari, Zahra; Sameni, Seyed Jalal; Salehi, Masoud
The purpose of the present study was to evaluate the psychometric properties (validity and reliability) of the Persian version of the Sustained Auditory Attention Capacity Test in children with attention deficit hyperactivity disorder. The Persian version of the Sustained Auditory Attention Capacity Test was constructed to assess sustained auditory attention using the method provided by Feniman and colleagues (2007). In this test, comments were provided to assess the child's attentional deficit by determining inattention and impulsiveness error, the total scores of the sustained auditory attention capacity test and attention span reduction index. In the present study for determining the validity and reliability of in both Rey Auditory Verbal Learning test and the Persian version of the Sustained Auditory Attention Capacity Test (SAACT), 46 normal children and 41 children with Attention Deficit Hyperactivity (ADHD), all right-handed and aged between 7 and 11 of both genders, were evaluated. In determining convergent validity, a negative significant correlation was found between the three parts of the Rey Auditory Verbal Learning test (first, fifth, and immediate recall) and all indicators of the SAACT except attention span reduction. By comparing the test scores between the normal and ADHD groups, discriminant validity analysis showed significant differences in all indicators of the test except for attention span reduction (pAttention Capacity test has good validity and reliability, that matches other reliable tests, and it can be used for the identification of children with attention deficits and if they suspected to have Attention Deficit Hyperactivity Disorder.
Yu, Doris S F; Lee, Diana T F; Woo, Jean
The purpose of this study was to assess the psychometric properties of the Chinese version of the Medical Outcomes Study Social Support Survey (MOS-SSS-C) in a sample of 110 patients. Criterion-related and construct validities of the MOS-SSS-C were evaluated by correlations with the Chinese version of the Multidimensional Perceived Social Support Survey (r =.82) and the Hospital Anxiety and Depression Scale (r = -.58). Confirmatory factor analysis affirmed the four-factor structure of the MOS-SSS-C in measuring the functional aspects of perceived social support. Cronbach's alphas for the subscales ranged from.93 to.96, whereas the alpha for the overall scale was.98. The 2-week test-retest reliability of the MOS-SSS-C as measured by the intraclass correlation coefficient was.84. The MOS-SSS-C is a psychometrically sound multidimensional measure for the evaluation of functional aspects of perceived social support by Chinese patients with chronic disease. Copyright 2004 Wiley Periodicals, Inc.
Investigating the psychometric properties of a screening instrument for young children is necessary to ascertain its quality and accuracy. In light of the important role culture plays on human beliefs and parenting styles, a newly translated and adapted test needs to be studied. Evaluating outcomes on a translated version of a test may reveal…
Vrachimi-Souroulla, Andry; Panayiotou, Georgia; Kokkinos, Constantinos M.; Lamprianou, Iasonas
The study aimed to field-test a Greek version of the Wechsler Quicktest and to examine its psychometric properties. The Quicktest was individually administered to 208 students, aged 5-14 years, along with a reading test. Based on the Rasch analysis, data for the Quicktest subtests showed acceptable fit to the model. Also, correlations were found…
Tepe, Rodger; Tepe, Chabha
To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
Hsueh, Yu-Jung; Chu, Hsin; Huang, Chang-Chih; Ou, Keng-Liang; Chen, Chiung-Hua; Chou, Kuei-Ru
The aim of this study was to examine the psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C). The sensitivity, specificity, and positive and negative predictive values for the MAST-C were examined in this study. The MAST-C had an internal consistency of 0.83 and a test-retest reliability of 0.89. It had a good content validity index of 0.92. Factor analysis identified four factors and the optimal cutoff point for the MAST-C was a score of 6/7, which yielded a sensitivity of 0.92, a specificity of 0.83, a positive predictive value of 0.92, and a negative predictive value of 0.83. The MAST-C provides a fast, accurate, and sensitive method for clinically diagnosing alcoholism and clinical management. © 2013 Wiley Periodicals, Inc.
Roohollah Zahedian Nasb
Full Text Available Background: Sustained visual attention is a prerequisite for learning and memory. The early evaluation of attention in childhood is essential for their school and career success in the future. The aim of this study was to design, development and investigation of psychometric properties (content, face and convergent validity and test-retest and internal consistency reliability of the computer - based sustained visual attention test (SuVAT for healthy preschool children aged 4-6 with their special needs. Methods: This study was carried out in two stages: in the first stage computerbased SuVAT in two versions original and parallel were developed. Then the test-retest and internal consistency reliability using intra-class correlation and Cronbach’s alpha coefficients respectively were examined; Face validity was calculated through ideas gathering from 10 preschool children and content validity evaluated using CVI and CVR method and convergent validity of SuVAT with CPT was assessed using Pearson correlation. Results: The developed test showed a good content and faces validity, and also had excellent test-retest reliability. In addition, the assessment of internal consistency indicated the high internal consistency of the test (Cronbach’s alpha=0.869. SuVAT and CPT test demonstrated a positive correlation upon the convergent validity testing. Conclusion: SuVAT with good reliability and validity could be used as an acceptable sustained attention assessment in preschool children.
Full Text Available “Reading the Mind in the Eyes” test (RMET is one of the most popular and widely used measures of individual differences in Theory of Mind (ToM capabilities. Despite demonstrating good validity in differentiating various clinical groups exhibiting ToM deficits from unimpaired controls, previous studies raised the question of the RMET’s homogeneity, latent structure, and reliability. The aim of this study is to provide evidence on psychometric properties, latent structure, and validity of the newly adapted Serbian version of the RMET. In total, 260 participants (61.9% females took part in the study. The sample consisted of both unimpaired controls (76.5%, and a clinical group of participants that are believed to demonstrate ToM deficits (23.5%, namely, persons diagnosed with schizophrenia and bipolar disorder (54.1% females. RMET has demonstrated fair psychometric properties (KMO = .723; α = .747; H1 = .076; H5 = .465, successfully differentiating between clinical group and control [F (1,254 = 26.175, p <.001, η2 p = .093], while typical gender differences in performance were found only in control group. Tests of several models based on the previous literature revealed that the affect-specific factors underlying performance on RMET demonstrate poor fit. The best fitting model obtained included reduced scale with a single-factor underlying the test’s performance (TLI = .953, CFI = .958, RMSEA = .020. Based on the fit parameters we propose 18-item short-form of the Serbian version of RMET (KMO = .797; α = .728; H1 = .129; H5 = .677 for economic, reliable and valid measurement of ToM abilities.
Pan, Tonya M; Mills, Sarah D; Fox, Rina S; Baik, Sharon H; Harry, Kadie M; Roesch, Scott C; Sadler, Georgia Robins; Malcarne, Vanessa L
The Life Orientation Test-Revised (LOT-R) is a widely used measure of optimism and pessimism, with three positively worded and three negatively worded content items. This study examined the structural validity and invariance, internal consistency reliability, and convergent and divergent validity of the English and Spanish versions of the LOT-R among Hispanic Americans. A community sample of Hispanic Americans ( N = 422) completed self-report measures, including the LOT-R, Patient Health Questionnaire-9, and Generalized Anxiety Disorder-7, in their preferred language of English or Spanish. Based on the literature, four structural models were tested: one-factor , oblique two-factor , orthogonal two-factor method effects with positive specific factor , and orthogonal two-factor method effects with negative specific factor . Baseline support for both of the English and Spanish versions was not achieved for any model; in all models, the negatively worded items in Spanish had non-significant factor loadings. Therefore, the positively worded three-item optimism subscale of the LOT-R was examined separately and fit the data, with factor loadings equivalent across language-preference groups. Coefficient alphas for the optimism subscale were consistent across both language-preference groups (αs = .61 [English] and .66 [Spanish]). In contrast, the six-item total score and three-item pessimism subscale demonstrated extremely low or inconsistent alphas. Convergent and divergent validity were established for the optimism subscale in both languages. In sum, the optimism subscale of the LOT-R demonstrated minimally acceptable to good psychometric properties across English and Spanish language-preference groups. However, neither the total score nor the pessimism subscale showed adequate psychometric properties for Spanish-speaking Hispanic Americans, likely due to translation and cultural adaptation issues, and thus are not supported for use with this population.
Widger, Kimberley; Tourangeau, Ann E; Steele, Rose; Streiner, David L
The field of pediatric palliative care is hindered by the lack of a well-defined, reliable, and valid method for measuring the quality of end-of-life care. The study purpose was to develop and test an instrument to measure mothers' perspectives on the quality of care received before, at the time of, and following a child's death. In Phase 1, key components of quality end-of-life care for children were synthesized through a comprehensive review of research literature. These key components were validated in Phase 2 and then extended through focus groups with bereaved parents. In Phase 3, items were developed to assess structures, processes, and outcomes of quality end-of-life care then tested for content and face validity with health professionals. Cognitive testing was conducted through interviews with bereaved parents. In Phase 4, bereaved mothers were recruited through 10 children's hospitals/hospices in Canada to complete the instrument, and psychometric testing was conducted. Following review of 67 manuscripts and 3 focus groups with 10 parents, 141 items were initially developed. The overall content validity index for these items was 0.84 as rated by 7 health professionals. Based on feedback from health professionals and cognitive testing with 6 parents, a 144-item instrument was finalized for further testing. In Phase 4, 128 mothers completed the instrument, 31 of whom completed it twice. Test-retest reliability, internal consistency, and construct validity were demonstrated for six subscales: Connect With Families, Involve Parents, Share Information With Parents, Share Information Among Health Professionals, Support Parents, and Provide Care at Death. Additional items with content validity were grouped in four domains: Support the Child, Support Siblings, Provide Bereavement Follow-up, and Structures of Care. Forty-eight items were deleted through psychometric testing, leaving a 95-item instrument. There is good initial evidence for the reliability and
This paper analyzes the activities, members, and effects of an inter-American expert network for the diffusion of psychometric knowledge, specifically of standardized aptitude testing for university admission in Latin America during the 1960s and 1970s. Within the framework of educational transfer studies, the role of international,…
Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina
Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the "Test of Science-Related Attitudes" (TOSRA), such as its construct validity, its…
Dimitrov, Dimiter M.; Shamrani, Abdul Rahman
This study examines the psychometric features of a General Aptitude Test-Verbal Part, which is used with assessments of high school graduates in Saudi Arabia. The data supported a bifactor model, with one general factor and three content domains (Analogy, Sentence Completion, and Reading Comprehension) as latent aspects of verbal aptitude.
Hutchinson, Marie; Doran, Frances
Background Domestic violence (DV) is an international public health issue associated with adverse health outcomes for adults and children. There have been widespread calls to increase nurses' capacity to respond to DV and improve undergraduate nursing education in this area. However, there are few valid, reliable and contemporary measures of nursing attitudes towards and beliefs concerning DV that are suited for use in evaluating education programmes. Aim To establish the psychometric properties of a newly developed inventory designed to measure nursing students' beliefs about and attitudes towards DV. Discussion Exploratory factor analysis identified five factors, with a Cronbach's alpha of 0.646. The few factors loading>.80 suggest that the instrument has good discriminate validity. The absence of cross-loadings indicate good convergent validity. Conclusion The inventory provides one of the first validated and reliable measures for examining undergraduate nursing students' attitudes towards and beliefs about DV. Implications for practice The instrument is suited for use by nurse educators in assessing the influence of curriculum design and teaching strategies on student beliefs and attitudes. It would also be useful in studies investigating nurses' clinical practice on domestic violence.
Gutiérrez Sánchez, Daniel; Cuesta-Vargas, Antonio I
Many measurements have been developed to assess the quality of death (QoD). Among these, the Quality of Dying and Death Questionnaire (QODD) is the most widely studied and best validated. Informal carers and health professionals who care for the patient during their last days of life can complete this assessment tool. The aim of the study is to carry out a cross-cultural adaptation and a psychometric analysis of the QODD for the Spanish population. The translation was performed using a double forward and backward method. An expert panel evaluated the content validity. The questionnaire was tested in a sample of 72 Spanish-speaking adult carers of deceased cancer patients. A psychometric analysis was performed to evaluate internal consistency, divergent criterion-related validity with the Mini-Suffering State Examination (MSSE) and concurrent criterion-related validity with the Palliative Outcome Scale (POS). Some items were deleted and modified to create the Spanish version of the QODD (QODD-ESP-26). The instrument was readable and acceptable. The content validity index was 0.96, suggesting that all items are relevant for the measure of the QoD. This questionnaire showed high internal consistency (Cronbach's α coefficient = 0.88). Divergent validity with MSSE (r = -0.64) and convergent validity with POS (r = -0.61) were also demonstrated. The QODD-ESP-26 is a valid and reliable instrument for the assessment of the QoD of deceased cancer patients that can be used in a clinical and research setting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Rofail, Diana; Viala, Muriel; Gater, Adam; Abetz-Webb, Linda; Baladi, Jean-Francois; Cappellini, Maria Domenica
The Satisfaction with Iron Chelation Therapy (SICT) instrument was developed based on a literature review, in-depth patient and clinician interviews, and cognitive debriefing interviews. An, open-label, single arm, multicenter trial evaluating the efficacy and safety of deferasirox in patients diagnosed with transfusion-dependent iron overload, provided an opportunity to assess the psychometric measurement properties of the instrument. Psychometric analyses were performed using data at baseline from 273 patients with a range of transfusion-dependent iron overload conditions who were participating in a multinational study. Responsiveness was further evaluated for all patients who also had subsequent satisfaction domain scores collected at week 4. Baseline SICT domain scores had acceptable floor and ceiling effects and internal consistency reliability (Cronbach's alpha: 0.75-0.85). Item discriminant and item convergent validity were both excellent although one item in each analysis did not meet the specified criterion. Small to moderate correlations were observed between SICT and Short Form 36 Health Survey (SF-36) domain scores. Patients with the highest levels of serum ferritin at baseline (>3100 ng/mL) were the least satisfied about the Perceived Effectiveness of ICT and vice versa. Satisfaction improved in all patients, although there were no clear differences observed between groups of patients defined according to changes in serum ferritin levels from baseline to week 4 (stable, improved, or worsened). The SICT domains are reliable and valid. Further testing using a more specific criterion (such as assessing patient global ratings of change in satisfaction domains that correspond to the SICT domains) could help to establish with greater confidence the responsiveness of the instrument.
Evren, Cuneyt; Ogel, Kultegin; Evren, Bilge; Bozkurt, Muge
The aim of this study was to evaluate psychometric properties of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in prisoners with (n = 124) or without (n = 78) drug use disorder. Participants were evaluated with the DUDIT, the DAST-10, and the Addiction Profile Index-Short (API-S). The DUDIT and the DAST-10 were found to be psychometrically sound drug abuse screening measures with high convergent validity when compared with each other (r = 0.86), and API-S (r = 0.88 and r = 0.84, respectively), and to have a Cronbach's α of 0.93 and 0.87, respectively. In addition, a single component accounted for 58.28% of total variance for DUDIT, whereas this was 47.10% for DAST-10. The DUDIT had sensitivity and specificity scores of 0.95 and 0.79, respectively, when using the optimal cut-off score of 10, whereas these scores were 0.88 and 0.74 for the DAST-10 when using the optimal cut-off score of 4. Additionally, both the DUDIT and the DAST-10 showed good discriminant validity as they differentiated prisoners with drug use disorder from those without. Findings support the Turkish versions of both the DUDIT and the DAST-10 as reliable and valid drug abuse screening instruments that measure unidimensional constructs.
Dyson, Judith; Cowdell, Fiona
To develop and psychometrically test the Motivation and Self-Efficacy in Early Detection of Skin Lesions Index. Skin cancer is the most frequently diagnosed cancer worldwide. The primary strategy used to prevent skin cancer is promotion of sun avoidance and the use of sun protection. However, despite costly and extensive campaigns, cases of skin cancer continue to increase. If found and treated early, skin cancer is curable. Early detection is, therefore, very important. The study was conducted in 2013. Instrument Development. A literature review and a survey identified barriers (factors that hinder) and levers (factors that help) to skin self-examination. These were categorized according to a the Theoretical Domains Framework and this formed the basis of an instrument, which was tested for validity and reliability using confirmatory factor analysis and Cronbach's alpha respectively. A five-factor 20-item instrument was used that tested well for reliability and construct validity. Test-retest reliability was good for all items and domains. The five factors were: (i) Outcome expectancies; (ii) Intention; (iii) Self-efficacy; (iv) Social influences; (v) Memory. The Motivation and Self-Efficacy in Early Detection of Skin Lesions Index provides a reliable and valid method of assessing barriers and levers to skin self-examination. The next step is to design a theory-based intervention that can be tailored according to individual determinants to behaviour change identified by this instrument. © 2014 John Wiley & Sons Ltd.
Sanchez-Garcia, Manuel; Extremera, Natalio; Fernandez-Berrocal, Pablo
This research examined evidence regarding the reliability and validity of scores on the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test, Version 2.0 (MSCEIT; Mayer, Salovey, & Caruso, 2002). In Study 1, we found a close convergence of the Spanish consensus scores and the general and expert consensus scores determined with Mayer, Salovey, Caruso, and Sitarenios (2003) data. The MSCEIT also demonstrated adequate evidence of reliability of test scores as estimated by internal consistency and test-retest correlation after 12 weeks. Confirmatory factor analysis supported a 3-level higher factor model with 8 manifest variables (task scores), 4 first-level factors (corresponding to the 4-branch model of Mayer & Salovey , with 2 tasks for each branch), 2 second-level factors (experiential and strategic areas, with 2 branches for each area), and 1 third-level factor (overall emotional intelligence [EI]), and multigroup analyses supported MSCEIT cross-gender invariance. Study 2 found evidence for the discriminant validity of scores on the MSCEIT subscales, which were differentially related to personality and self-reported EI. Study 3 provided evidence of the incremental validity of scores on the MSCEIT, which added significant variance to the prospective prediction of psychological well-being after controlling for personality traits. The psychometric properties of the Spanish MSCEIT are similar to those of the original English version, supporting its use for assessing emotional abilities in the Spanish population. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Hirani, Shela Akbar Ali; Karmaliani, Rozina; Christie, Thomas; Parpio, Yasmin; Rafique, Ghazala
breast feeding is an essential source of nutrition among young babies; however, in Pakistan a gradual decline in prevalence of breast feeding, especially among urban working mothers, has been reported. Previous studies among Pakistani urban working mothers have revealed that ensuring exclusivity and continuation of breast feeding is challenging if social and/or workplace environmental support is minimal or absent. This problem indicated a crucial need to assess availability of breast-feeding support for Pakistani urban working mothers by using a comprehensive, reliable, and validated tool in their national language (Urdu). to develop and test the psychometric properties of the 'Perceived Breastfeeding Support Assessment Tool' (PBSAT) that can gauge Pakistani urban working mothers' perceptions about breast-feeding support. this methodological research was undertaken in five phases. During phase I, a preliminary draft of the PBSAT was developed by using the Socio-ecological model, reviewing literature, and referring to two United States based tools. In Phase II, the instrument was evaluated by seven different experts, and, in Phase III, the instrument was revised, translated, and back translated. In Phase IV, the tool was pilot tested among 20 participants and then modified on the basis of statistical analysis. In Phase V, the refined instrument was tested on 200 breast-feeding working mothers recruited through purposive sampling from the government and private health-care settings in Karachi, Pakistan. Approvals were received from the Ethical Review Committees of the identified settings. the 29-item based PBSAT revealed an acceptable inter-rater reliability of 0.95, and an internal consistency reliability coefficient (Cronbach's alpha) of 0.85. A construct validity assessment through Exploratory Factor Analysis revealed that the PBSAT has two dimensions, 'workplace environmental support' (12 items; α=0.86) and 'social environmental support' (17 items; α=0.77). the
Full Text Available The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS test is a new tool to evaluate pragmatic abilities in patients with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2 and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total. Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts’ evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient’s cognitive profile, given its vital role for communication and social interaction in daily life. The
Kim, Y; Evangelista, LS; Phillips, LR; Pavlish, C; Kopple, JD
Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and v...
Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise
Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test - retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ 2 (df) 969.46 (546), p teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.
Petry, Heidi; Suter-Riederer, Susanne; Kerker-Specker, Carmen; Imhof, Lorenz
Patient centred and individually-tailored counselling of older people with a chronic condition who live at home is a useful intervention to support their independence. The paper presents the development and psychometric testing of the APN-BQ Instrument, to measure patient-centeredness. To measure the quality of an in-home counselling intervention, a 23-item questionnaire was developed and tested with 206 people 80 years and older. Principal component analysis with Varimax Rotation was conducted (n = 206). Analysis revealed a four factor (fs = 0.91) model scoring in 19 items. All factors loaded > 0.45. Cronbach's alpha was 0.86. The utility and acceptance of the instrument was confirmed by the high response rate (100 %) and the fact that participants answered 98.8 % of all questions. The APN-BQ has shown to be a reliable Instrument with good content and construct validity. It is a tool for APNs to measure structure, process, and outcome quality of a patient-centred and individually-tailored counselling program, including the degree of patient participation, and patient empowerment.
Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C
The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.
Noorbakhsh, Simasadat; Shams, Jamal; Faghihimohamadi, Mohamadmahdi; Zahiroddin, Hanieh; Hallgren, Mats; Kallmen, Hakan
Iran is a developing and Islamic country where the consumption of alcoholic beverages is banned. However, psychiatric disorders and alcohol use disorders are often co-occurring. We used the Alcohol Use Disorders Identification Test (AUDIT) to estimate the prevalence of alcohol use and examined the psychometric properties of the test among psychiatric outpatients in Teheran, Iran. AUDIT was completed by 846 consecutive (sequential) patients. Descriptive statistics, internal consistency (Cronbach alpha), confirmatory and exploratory factor analyses were used to analyze the prevalence of alcohol use, reliability and construct validity. 12% of men and 1% of women were hazardous alcohol consumers. Internal reliability of the Iranian version of AUDIT was excellent. Confirmatory factor analyses showed that the construct validity and the fit of previous factor structures (1, 2 and 3 factors) to data were not good and seemingly contradicted results from the explorative principal axis factoring, which showed that a 1-factor solution explained 77% of the co-variances. We could not reproduce the suggested factor structure of AUDIT, probably due to the skewed distribution of alcohol consumption. Only 19% of men and 3% of women scored above 0 on AUDIT. This could be explained by the fact that alcohol is illegal in Iran. In conclusion the AUDIT exhibited good internal reliability when used as a single scale. The prevalence estimates according to AUDIT were somewhat higher among psychiatric patients compared to what was reported by WHO regarding the general population.
Ros, Laura; Romero, Dulce; Ricarte, Jorge J; Serrano, Juan P; Nieto, Marta; Latorre, Jose M
The Autobiographical Memory Test (AMT) is the most widely used measure of overgeneral autobiographical memory (OGM). The AMT appears to have good psychometric properties, but more research is needed on the influence and applicability of individual cue words in different languages and populations. To date, no studies have evaluated its usefulness as a measure of OMG in Spanish or older populations. This work aims to analyze the applicability of the AMT in young and older Spanish samples. We administered a Spanish version of the AMT to samples of young (N = 520) and older adults (N = 155). We conducted confirmatory factor analysis (CFA), item response theory-based analysis (IRT) and differential item functioning (DIF). Results confirm the one-factor structure for the AMT. IRT analysis suggests that both groups find the AMT easy given that they generally perform well, and that it is more precise in individuals who score low on memory specificity. DIF analysis finds three items differ in their functioning depending on age group. This differential functioning of these items affects the overall AMT scores and, thus, they should be excluded from the AMT in studies comparing young and older samples. We discuss the possible implications of the samples and cue words used.
García-Alandete, Joaquín; Ros, Montserrat Cañabate; Salvador, José Heliodoro Marco; Rodríguez, Sandra Pérez
The aim of this study was to analyze the psychometric properties of the Purpose-In-Life Test (PIL), as well as the age-related differences in meaning in life in women diagnosed with eating disorders. Participants were 250 Spanish women diagnosed with eating disorders who ranged from 12 to 60 years old. Confirmatory Factor Analysis, descriptive analyses, estimation of the internal consistency of the PIL, correlations between the PIL and the Beck Hopelessness Scale (BHS), Overweight Preoccupation Scale (OPS), and Body Investment Scale (BIS), and age differences were calculated. A 19-item model that showed a good fit and internal consistency, a negative correlation between the PIL and both the BHS and OPS, and a positive correlation with the BIS, as well as significant differences between the adolescents and the mature adults, were found. It would be advisable to increase the inclusion of meaning in life in psychotherapeutic interventions with women diagnosed with eating disorders. Copyright © 2017 Elsevier B.V. All rights reserved.
Hammer, Joseph H; Brenner, Rachel E
This study extended our theoretical and applied understanding of gratitude through a psychometric examination of the most popular multidimensional measure of gratitude, the Gratitude, Resentment, and Appreciation Test-Revised Short form (GRAT-RS). Namely, the dimensionality of the GRAT-RS, the model-based reliability of the GRAT-RS total score and 3 subscale scores, and the incremental evidence of validity for its latent factors were assessed. Dimensionality measures (e.g., explained common variance) and confirmatory factor analysis results with 426 community adults indicated that the GRAT-RS conformed to a multidimensional (bifactor) structure. Model-based reliability measures (e.g., omega hierarchical) provided support for the future use of the Lack of a Sense of Deprivation raw subscale score, but not for the raw GRAT-RS total score, Simple Appreciation subscale score, or Appreciation of Others subscale score. Structural equation modeling results indicated that only the general gratitude factor and the lack of a sense of deprivation specific factor accounted for significant variance in life satisfaction, positive affect, and distress. These findings support the 3 pillars of gratitude conceptualization of gratitude over competing conceptualizations, the position that the specific forms of gratitude are theoretically distinct, and the argument that appreciation is distinct from the superordinate construct of gratitude.
Kao, Mei-Hua; Tsai, Yun-Fang
Self-management of osteoarthritis (OA) of the knee is important for treating this chronic disease. This study developed and psychometrically tested a new instrument for measuring adult patients' self-management needs of knee osteoarthritis (SMNKOA). The theoretical framework of self-care guided the development of the 35-item SMNKOA scale. Participants ( N = 372) were purposively sampled from orthopedic clinics at medical centers in Taiwan. The content validity index was 0.83. Principal components analysis identified a three-factor solution, accounting for 53.19% of the variance. The divergent validity was -0.67; convergent validity was -0.51. Cronbach's alpha was .95, Pearson's correlation coefficient was .88, and the intraclass correlation coefficient was .95. The scale's reliability and validity supports the SMNKOA, as a tool to measure self-management needs of adults with knee OA. Nurses and other health care providers can use this instrument to evaluate knee OA patients and identify strategies for improving health-related outcomes and patient education.
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
MERSCH, PPA; BREUKERS, P; EMMELKAMP, PMG
The Simulated Social Interaction Test (SSIT) was translated and adjusted for use on a population of Dutch males and females. Seventy-four social phobic patients were assessed with the SSIT, a conversation test, and an interview with an independent observer. Results show that the SSIT is a relatively
Carter, Amanda G; Creedy, Debra K; Sidebotham, Mary
develop and test a tool designed for use by preceptors/mentors to assess undergraduate midwifery students׳ critical thinking in practice. a descriptive cohort design was used. participants worked in a range of maternity settings in Queensland, Australia. 106 midwifery clinicians who had acted in the role of preceptor for undergraduate midwifery students. this study followed a staged model for tool development recommended by DeVellis (2012). This included generation of items, content validity testing through mapping of draft items to critical thinking concepts and expert review, administration of items to a convenience sample of preceptors, and psychometric testing. A 24 item tool titled the XXXX Assessment of Critical Thinking in Midwifery (CACTiM) was completed by registered midwives in relation to students they had recently preceptored in the clinical environment. ratings by experts revealed a content validity index score of 0.97, representing good content validity. An evaluation of construct validity through factor analysis generated three factors: 'partnership in practice', 'reflection on practice' and 'practice improvements'. The scale demonstrated good internal reliability with a Cronbach alpha coefficient of 0.97. The mean total score for the CACTiM scale was 116.77 (SD=16.68) with a range of 60-144. Total and subscale scores correlated significantly. the CACTiM (Preceptor/Mentor version) was found to be a valid and reliable tool for use by preceptors to assess critical thinking in undergraduate midwifery students. given the importance of critical thinking skills for midwifery practice, mapping and assessing critical thinking development in students׳ practice across an undergraduate programme is vital. The CACTiM (Preceptor/Mentor version) has utility for clinical education, research and practice. The tool can inform and guide preceptors׳ assessment of students׳ critical thinking in practice. The availability of a reliable and valid tool can be used to
Full Text Available The purpose of this article is to present the content, conceptual structure and methodological steps of the latest revision of the Peabody Picture Vocabulary Test (PPVT-III, which is a highly functional and valuable vocabulary test that has been in use since 1959 in different language and cultural surroundings. On the case of the PPVT-III we are presenting the procedure of development and standardization of such vocabulary tests as well as its translation and adaptation from one language and cultural milieu to another. We also note the practical use of the PPVT-III for research purposes. In Slovenian language no vocabulary tests were developed or adapted so far; PPVT-III is presented in this context, too.
Petersen, Morten Aa; Giesinger, Johannes M; Holzner, Bernhard
Fatigue is one of the most common symptoms associated with cancer and its treatment. To obtain a more precise and flexible measure of fatigue, the EORTC Quality of Life Group has developed a computerized adaptive test (CAT) measure of fatigue. This is part of an ongoing project developing a CAT...
Lin, Shu-Hui; Huang, Yun-Chen
The purpose of this study was to validate a Chinese version of the Gratitude Resentment and Appreciation Test (GRAT) with Taiwanese students. In Study 1, a total of 2511 Taiwanese students participated and completed the translated GRAT. Exploratory factor analysis, confirmatory factor analysis and reliability analysis were undertaken to assess the…
Buck, Harleah G; Harkness, Karen; Ali, Muhammad Usman; Carroll, Sandra L; Kryworuchko, Jennifer; McGillion, Michael
Caregivers (CGs) contribute important assistance with heart failure (HF) self-care, including daily maintenance, symptom monitoring, and management. Until CGs' contributions to self-care can be quantified, it is impossible to characterize it, account for its impact on patient outcomes, or perform meaningful cost analyses. The purpose of this study was to conduct psychometric testing and item reduction on the recently developed 34-item Caregiver Contribution to Heart Failure Self-care (CACHS) instrument using classical and item response theory methods. Fifty CGs (mean age 63 years ±12.84; 70% female) recruited from a HF clinic completed the CACHS in 2014 and results evaluated using classical test theory and item response theory. Items would be deleted for low (.95) endorsement, low (.7) corrected item-total correlations, significant pairwise correlation coefficients, floor or ceiling effects, relatively low latent trait and item information function levels ( .5), and differential item functioning. After analysis, 14 items were excluded, resulting in a 20-item instrument (self-care maintenance eight items; monitoring seven items; and management five items). Most items demonstrated moderate to high discrimination (median 2.13, minimum .77, maximum 5.05), and appropriate item difficulty (-2.7 to 1.4). Internal consistency reliability was excellent (Cronbach α = .94, average inter-item correlation = .41) with no ceiling effects. The newly developed 20-item version of the CACHS is supported by rigorous instrument development and represents a novel instrument to measure CGs' contribution to HF self-care. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Khazaee-Pool, Maryam; Majlessi, Fereshteh; Montazeri, Ali; Pashaei, Tahereh; Gholami, Ali; Ponnet, Koen
Breast cancer preventive behaviors have an extreme effect on women's health. Despite the benefits of preventive behaviors regarding breast cancer, they have not been implemented as routine care for healthy women. To assess this health issue, a reliable and valid scale is needed. The aim of the present study is to develop and examine the psychometric properties of a new scale, called the ASSISTS, in order to identify factors that affect women's breast cancer prevention behaviors. A multi-phase instrument development method was performed to develop the questionnaire from February 2012 to September 2014. The item pool was generated based on secondary analyses of previous qualitative data. Then, content and face validity were applied to provide a pre-final version of the scale. The scale validation was conducted with a sample of women recruited from health centers affiliated with Tehran University of Medical Sciences. The construct validity (both exploratory and confirmatory), convergent validity, discriminate validity, internal consistency reliability and test-retest analysis of the questionnaire were tested. Fifty-eight items were initially extracted from the secondary analysis of previous qualitative data. After content validity, this was reduced to 49 items. The exploratory factor analysis revealed seven factors (Attitude, supportive systems, self-efficacy, information seeking, stress management, stimulant and self-care) containing 33 items that jointly accounted for 60.62 % of the observed variance. The confirmatory factor analysis showed a model with appropriate fitness for the data. The Cronbach's alpha coefficient for the subscales ranged from 0.68 to 0.85, and the Intraclass Correlation Coefficient (ICC) ranged from 0.71 to 0.98; which is well above the acceptable thresholds. The findings showed that the designed questionnaire was a valid and reliable instrument for assessing factors affecting women's breast cancer prevention behaviors that can be used both
Gerhardt, Andreas; Eich, Wolfgang; Janke, Susanne; Leisner, Sabine; Treede, Rolf-Detlef; Tesarz, Jonas
Whether chronic localized pain (CLP) and chronic widespread pain (CWP) have different mechanisms or to what extent they overlap in their pathophysiology is controversial. The study compared quantitative sensory testing profiles of nonspecific chronic back pain patients with CLP (n=48) and CWP (n=29) with and fibromyalgia syndrome (FMS) patients (n=90) and pain-free controls (n = 40). The quantitative sensory testing protocol of the "German-Research-Network-on-Neuropathic-Pain" was used to measure evoked pain on the painful area in the lower back and the pain-free hand (thermal and mechanical detection and pain thresholds, vibration threshold, pain sensitivity to sharp and blunt mechanical stimuli). Ongoing pain and psychometrics were captured with pain drawings and questionnaires. CLP patients did not differ from pain-free controls, except for lower pressure pain threshold (PPT) on the back. CWP and FMS patients showed lower heat pain threshold and higher wind-up ratio on the back and lower heat pain threshold and cold pain threshold on the hand. FMS showed lower PPT on back and hand, and higher comorbidity of anxiety and depression and more functional impairment than all other groups. Even after long duration CLP presents with a local hypersensitivity for PPT, suggesting a somatotopically specific sensitization of nociceptive processing. However, CWP patients show widespread ongoing pain and hyperalgesia for different stimuli that is generalized in space, suggesting the involvement of descending control systems, as also suggested for FMS patients. Because mechanisms in nonspecific chronic back pain with CLP and CWP differ, these patients should be distinguished in future research and allocated to different treatments.
Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J. B.
on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). METHODS: In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients...... model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study...... sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. CONCLUSION: A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient...
Meng, Michael; Peter, Daniel; Mattner, Frauke; Igel, Christoph; Kugler, Christiane
Satisfaction with continuing education can be defined as positive attitudes towards educational programs, which has potential to strengthen learning outcomes. A multi-dimensional construct may enhance continuing education program evaluation processes. The objective is to describe the development and psychometric testing of the 'affective - behavioral - cognitive - satisfaction questionnaire' (ABC-SAT) for assessing participants' satisfaction with a continuing education program for nurses in infection control. The multi-staged development of a satisfaction questionnaire comprised of three subscales. The pilot tool was administered to a nationwide sample of 126 infection control nurses to assess satisfaction after participating in a continuing education program. Satisfaction scores were calculated and psychometric testing was performed to determine reliability, using Cronbach's alpha, face validity, objectivity, and economy. A principle component analysis using varimax rotation and Kaiser normalization was performed. The analysis led to a three-factor solution of the questionnaire with 11 items, explaining 61.4% of the variance. Internal consistency of three scales using Cronbach's alpha was 0.83, 0.60, and 0.66, respectively. Selectivity coefficients varied between 0.39 and 0.70. Participants needed approximately three minutes to complete the questionnaire. Initial findings refer to a satisfying scale structure and internal consistency of the 3-dimensional ABC-SAT questionnaire. Further research is required to confirm the questionnaires' psychometric properties. Copyright © 2018 Elsevier Ltd. All rights reserved.
Liu, Pei-Ching; Gau, Bih-Shya; Hung, Chao-Chia
BACKGROUNG: No specific instrument has thus far been developed for measuring the caregiver burden perceived by parents of children with allergies (CWA). To determine the psychometric properties of the Caregiver Burden Index (CBI). A mixed-methods design was adopted to evaluate the psychometric properties of the scale. The content validity index was 0.89, and the internal consistency was high with a coefficient alpha of 0.98. Three factors were extracted after exploratory factor analysis. The study findings suggest that the CBI has sufficient reliability and validity to evaluate the caregiver burden of parents of CWA. Copyright © 2015 Elsevier Inc. All rights reserved.
Miller, M; Hamilton, J; Scupham, R; Matwiejczyk, L; Prichard, I; Farrer, O; Yaxley, A
Food service staff are integral to delivery of quality food in aged care homes yet measurement of their satisfaction is unable to be performed due to an absence of a valid and reliable questionnaire. The aim of this study was to develop and perform psychometric testing for a new Food Service Satisfaction Questionnaire developed in Australia specifically for use by food service staff working in residential aged care homes (Flinders FSSQFSAC). A mixed methods design utilizing both a qualitative (in-depth interviews, focus groups) and a quantitative approach (cross sectional survey) was used. Content validity was determined from focus groups and interviews with food service staff currently working in aged care homes, related questionnaires from the literature and consultation with an expert panel. The questionnaire was tested for construct validity and internal consistency using data from food service staff currently working in aged care homes that responded to an electronic invitation circulated to Australian aged care homes using a national database of email addresses. Construct validity was tested via principle components analysis and internal consistency through Cronbach's alpha. Temporal stability of the questionnaire was determined from food service staff undertaking the Flinders FSSQFSAC on two occasions, two weeks apart, and analysed using Pearson's correlations. Content validity for the Flinders FSSQFSAC was established from a panel of experts and stakeholders. Principle components analysis revealed food service staff satisfaction was represented by 61-items divided into eight domains: job satisfaction (α=0.832), food quality (α=0.871), staff training (α=0.922), consultation (α=0.840), eating environment (α=0.777), reliability (α=0.695), family expectations (α=0.781) and resident relationships (α=0.429), establishing construct validity in all domains, and internal consistency in all (α>0.5) except for "resident relationships" (α=0.429). Test
López, Mariana Beatriz; Lichtenberger, Aldana; Conde, Karina; Cremonte, Mariana
Background Considering the physical, mental and behavioral problems related to fetal alcohol exposure, prenatal clinical guides suggest a brief evaluation of alcohol consumption during pregnancy to detect alcohol intake and to adjust interventions, if required. Even if any alcohol use should be considered risky during pregnancy, identifying women with alcohol use disorders is important because they could need a more specific intervention than simple advice to abstain. Most screening tests have been developed and validated in male populations and focused on the long-term consequences of heavy alcohol use, so they might be inappropriate to assess consumption in pregnant women. Objective To analyze the internal reliability and validity of the alcohol screening instruments Alcohol Use Disorders Identification Test (AUDIT), Alcohol Use Disorders Identification Test - Consumption (AUDIT-C), Tolerance, Worried, Eye-Opener, Amnesia and Cut-Down (TWEAK), Rapid Alcohol Problems Screen - Quantity Frequency (RAPS-QF) and Tolerance, Annoyed, Cut-Down and Eye-Opener (T-ACE) to identify alcohol use disorders in pregnant women. Methods A total of 641 puerperal women were personally interviewed during the 48 hours after delivery. The receiver operating characteristics (ROC) curves and the sensitivity and specificity of each instrument using different cut-off points were analyzed. Results All instruments showed areas under the ROC curves above 0.80. Larger areas were found for the TWEAK and the AUDIT. The TWEAK, the T-ACE and the AUDIT-C showed higher sensitivity, while the AUDIT and the RAPS-QF showed higher specificity. Reliability (internal consistency) was low for all instruments, improving when optimal cut-off points were used, especially for the AUDIT, the AUDIT-C and the RAPS-QF. Conclusions In other cultural contexts, studies have concluded that T-ACE and TWEAK are the best instruments to assess pregnant women. In contrast, our results evidenced the low
Sun, Fan-Ko; Chiang, Chun-Ying; Lu, Chu-Yun; Yu, Pei-Jane; Liao, Tzu-Chiao; Lan, Chu-Mei
To develop the Health of Body, Mind and Spirit Scale (HBMSS), which was designed to assess drug abusers' health condition. Helping drug abusers to become healthy is important to healthcare professionals. However, no instrument exists to assess drug abusers' state of health. A cross-sectional questionnaire survey was implemented to examine the validity of the HBMSS. Data were collected from 2015-2016 at one drug abuse prevention centre in Taiwan. Participants (N = 320) who had abused drugs were invited to complete a preliminary 64-item version of the HBMSS. An item analysis, criterion-related validity analysis (using the Relapse Prediction Scale [RPS] score), split-half reliability testing and confirmatory factor analysis (CFA) were conducted to examine the psychometric properties of the HBMSS. The final version of the HBMSS contained 15 items that were divided into three subscales: the health of the body, mind and spirit. Cronbach's α and split-half reliability coefficients were all above .85. The factor loading of each item was between .74-.95. The HBMSS had satisfactory criterion-related validity with the RPS score (r = -.50, p < .001). A second-order CFA was conducted on the HBMSS. The fit indexes were good, χ 2 = 184.060, df = 94, χ 2 /df = 1.958 (p = .000). The entire HBMSS and the subscales had satisfactory reliability and validity. Healthcare professionals could use the HBMSS to evaluate the condition of the health of individuals with a drug abuse history. © 2017 John Wiley & Sons Ltd.
Lorenzo, Pelizza; Silvia, Azzali; Federica, Paterlini; Sara, Garlassi; Ilaria, Scazza; Pupo, Simona; Andrea, Raballo
Among current early screeners for psychosis-risk states, the Prodromal Questionnaire-16 items (PQ-16) is often used. We aimed to assess validity and reliability of the Italian version of the PQ-16 in a young adult help-seeking population. We included 154 individuals aged 18-35years seeking help at the Reggio Emilia outpatient mental health services in a large semirural catchment area (550.000 inhabitants). Participants completed the Italian version of the PQ-16 (iPQ-16) and were subsequently evaluated with the Comprehensive Assessment of At-Risk Mental States (CAARMS). We examined diagnostic accuracy (i.e. specificity, sensitivity, negative and positive likelihood ratios, and negative and positive predictive values) and content, convergent, and concurrent validity between PQ-16 and CAARMS using Cronbach's alpha, Spearman's rho, and Cohen's kappa, respectively. We also tested the validity of the adopted PQ-16 cut-offs through Receiver Operating Characteristic (ROC) curves plotted against CAARMS diagnoses and the 1-year predictive validity of the PQ-16. The iPQ-16 showed high internal consistency and acceptable diagnostic accuracy and concurrent validity. ROC analyses pointed to a cut-off score of ≥5 as best cut-off. After 12months of follow-up, 8.7% of participants with a PQ-16 symptom total score of ≥5 who were below the CAARMS psychosis threshold at the baseline, developed a psychotic disorder. Psychometric properties of the iPQ-16 were satisfactory. Copyright © 2018. Published by Elsevier B.V.
Petróczi, Andrea; Backhouse, Susan H.; Barkoukis, Vassilis
research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. At the same time, we must acknowledge that socially desirable responding...... guidance aims to protect the global athletic community against social, ethical and legal consequences from potential misuse of psychological tests, including applications as forensic diagnostic tools in both practice and research.......One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping...
Rezaul Karim, A K M; Nigar, Naima
There is growing importance of the Internet Addiction Test (IAT) in Internet addiction research around the world. Since the development of the IAT (Young, 1996, 1998), a number of validation studies have been done in various cultures. The aim of this study was to translate the instrument into Bangla and validate in Bangladeshi culture, a culture vulnerable to Internet addiction. A total of 177 Internet users (77 females and 100 males) participated in the study. Exploratory factor analysis (EFA) of the data from 172 participants (who provided complete responses) identified a four factor structure of the IAT with 18 items. The four factors namely 'Neglect of duty', 'Online dependence', 'Virtual fantasies', and 'Privacy and self-defense' together explained 55.68% of the total variance. Problematic (moderate/excessive) users on the IAT scored, on average, higher on each of the four IAT factors as compared to average or non-problematic (minimal) users consistently across genders. The IAT and its factors showed good internal consistency (Cronbach's α=.89 for the IAT, and .60-.84 for the factors), strong convergent and discriminant validity. Thus, the Bangla version IAT appears to be valid and reliable and therefore may be used in further research on Internet addiction in the country. Copyright © 2013 Elsevier B.V. All rights reserved.
Jardin, Charles; Garey, Lorra; Zvolensky, Michael J
Sexual motives refer to functions served by sexual behavior. The Sex Motivations Scale (SMS) has frequently been used to assess sexual motives. At its development, the SMS demonstrated good internal consistency; convergent, divergent, and criterion validity; and configural invariance across sex, age, and Caucasians and African Americans. Yet the metric and scalar invariance of the SMS has not been examined, nor has the measurement invariance of the SMS across Hispanic and Asian Americans, sexual minority status, and relationship status been tested. The criterion validity of the SMS also has yet to be examined for nonintercourse sexual behaviors, such as sexting. The present study aimed to address these gaps in a diverse sample of 2,201 college students (77.60% female; M age = 22.06; 27.84% Caucasian). Results further affirmed the configural, metric, and scalar invariance of the SMS. The convergent and divergent validity of the SMS was supported in relation to positive and negative affect and attachment patterns; and specific SMS subscales demonstrated associations with sexual intercourse behaviors and sexting, supporting the criterion validity of the SMS. These findings suggest the relevance of the SMS in assessing sexual motives across diverse populations and behaviors.
Wicherts, J.M.; Dolan, C.V.; Carlson, J.S.; van der Maas, H.L.J.
This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's
Kornelsen, Jude; Stoll, Kathrin; Grzybowski, Stefan
Rural pregnant woman who lack local access to maternity care due to their remote living circumstances may experience stress and anxiety related to pregnancy and parturition. The Rural Pregnancy Experience Scale (RPES) was designed to assess the unique worry and concerns reflective of the stress and anxiety of rural pregnant women related to pregnancy and parturition. The items of the scale were designed based on the results of a qualitative study of the experiences of pregnant rural women, thereby building a priori content validity into the measure. The relevancy content validity index (CVI) for this instrument was 1.0 and the clarity CVI was .91, as rated by maternity care specialists. A field test of the RPES with 187 pregnant rural women from British Columbia indicated that it had two factors: financial worries and worries/concerns about maternity care services, which were consistent with the conceptual base of the tool. Cronbach's alpha for the total RPES was .91; for the financial worries subscale and the worries/concerns about maternity care services subscale, alpha were .89 and .88, respectively. Construct validity was supported by significant correlations between the total scores of the RPES and the Depression Anxiety Stress Scales (DASS [r =.39, p DASS supporting convergent validity (correlations ranged between .20; p < .05 and .43; p < .01). Construct validity was also supported by findings that the level of access and availability of maternity care services were significantly associated with RPES scores. It was concluded that the RPES is a reliable and valid measure of worries and concerns reflective of rural pregnant women's stress and anxiety related to pregnancy and parturition.
Full Text Available Abstract Background There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. Methods and design The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU. Participants will be: (a tutors of Family Medicine residents (expert group; (b Family Medicine residents in their second year of the Family Medicine training program (novice group, and (c Family Medicine physicians (intermediate group. The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. Discussion The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Argimon-Pallàs, Josep M; Flores-Mateo, Gemma; Jiménez-Villa, Josep; Pujol-Ribera, Enriqueta; Foz, Gonçal; Bundó-Vidiella, Magda; Juncosa, Sebastià; Fuentes-Bellido, Cruz M; Pérez-Rodríguez, Belén; Margalef-Pallarès, Francesc; Villafafila-Ferrero, Rosa; Forès-Garcia, Dolors; Roman-Martínez, Josep; Vilert-Garroga, Esther
There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP) curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU). Participants will be: (a) tutors of Family Medicine residents (expert group); (b) Family Medicine residents in their second year of the Family Medicine training program (novice group), and (c) Family Medicine physicians (intermediate group). The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Liaw, Sok Ying; Rashasegaran, Ahtherai; Wong, Lai Fun; Deneen, Christopher Charles; Cooper, Simon; Levett-Jones, Tracy; Goh, Hongli Sam; Ignacio, Jeanette
The development of clinical reasoning skills in recognising and responding to clinical deterioration is essential in pre-registration nursing education. Simulation has been increasingly used by educators to develop this skill. To develop and evaluate the psychometric properties of a Clinical Reasoning Evaluation Simulation Tool (CREST) for measuring clinical reasoning skills in recognising and responding to clinical deterioration in a simulated environment. A scale development with psychometric testing and mixed methods study. Nursing students and academic staff were recruited at a university. A three-phase prospective study was conducted. Phase 1 involved the development and content validation of the CREST; Phase 2 included the psychometric testing of the tool with 15 second-year and 15 third-year nursing students who undertook the simulation-based assessment; Phase 3 involved the usability testing of the tool with nine academic staff through a survey questionnaire and focus group discussion. A 10-item CREST was developed based on a model of clinical reasoning. A content validity of 0.93 was obtained from the validation of 15 international experts. The construct validity was supported as the third-year students demonstrated significantly higher (preasoning scores than the second-year students. The concurrent validity was also supported with significant positive correlations between global rating scores and almost all subscale scores, and the total scores. The predictive validity was supported with an existing tool. The internal consistency was high with a Cronbach's alpha of 0.92. A high inter-rater reliability was demonstrated with an intraclass correlation coefficient of 0.88. The usability of the tool was rated positively by the nurse educators but the need to ease the scoring process was highlighted. A valid and reliable tool was developed to measure the effectiveness of simulation in developing clinical reasoning skills for recognising and responding to
Chiwaridzo, Matthew; Ferguson, Gillian D; Smits-Engelsman, Bouwien C M
Scientific focus on rugby has increased over the recent years, providing evidence of the physical or physiological characteristics and game-specific skills needed in the sport. Identification of tests commonly used to measure these characteristics is important for the development of test batteries, which in turn may be used for talent identification and injury prevention programmes. Although there are a number of tests available in the literature to measure physical or physiological variables and game-specific skills, there is limited information available on the psychometric properties of the tests. Therefore, the purpose of this study is to systematically review the literature for tests commonly used in rugby to measure physical or physiological characteristics and rugby-specific skills, documenting evidence of reliability and validity of the identified tests. A systematic review will be conducted. Electronic databases such as Scopus, MEDLINE via EBSCOhost and PubMed, Academic Search Premier, CINAHL and Africa-Wide Information via EBSCOhost will be searched for original research articles published in English from January 1, 1995, to December 31, 2015, using a pre-defined search strategy. The principal investigator will select potentially relevant articles from titles and abstracts. To minimise bias, full text of titles and abstracts deemed potentially relevant will be retrieved and reviewed by two independent reviewers based on the inclusion criteria. Data extraction will be conducted by the principal investigator and verified by two independent reviewers. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist will be used to assess the methodological quality of the selected studies. Choosing an appropriate test to be included in the screening test battery should be based on sound psychometric properties of the test available. This systematic review will provide an overview of the tests commonly used in rugby union
Izilda Carolina de Meneses-Gaya
Full Text Available OBJECTIVE: The Fagerström Test for Nicotine Dependence (FTND is a screening instrument for physical nicotine dependence and is extensively used in various countries. The objective of the present report was to review articles related to the psychometric properties of the FTND. METHODS: A systematic search for articles published up through December of 2007 was carried out in various electronic databases. The following search terms were used: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; and "factors". We included articles published in English, Spanish or Portuguese and in which the psychometric properties of the FTND were evaluated. RESULTS: Twenty-six studies related to the psychometric properties of the FTND were identified in the indexed literature. Analysis of the studies confirmed the reliability of the FTND for the assessment of nicotine dependence in different settings and populations. CONCLUSIONS: Further validation studies using previously validated instruments as a comparative measure are needed before the extensive use of the FTND can be justified on the basis of its psychometric qualities.OBJETIVO: O Fagerström Test for Nicotine Dependence (FTND, Teste de Fagerström para Dependência de Nicotina é um instrumento de rastreamento para dependência física de tabaco, amplamente utilizado em diversos países. Objetivou-se realizar uma revisão de artigos relacionados às propriedades psicométricas do FTND. MÉTODOS: Uma busca sistemática foi realizada usando-se vários indexadores eletrônicos até dezembro de 2007, com os seguintes descritores: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; e "factors". Foram incluídos os artigos relacionados à avaliação das propriedades psicométricas do FTND publicados em inglês, espanhol e português. RESULTADOS: Vinte e seis estudos relativos às propriedades psicom
Huang, Yun-Hsin; Wu, Chih-Hsun; Chen, Hsiu-Jung; Cheng, Yih-Ru; Hung, Fu-Chien; Leung, Kai-Kuan; Lue, Bee-Horng; Chen, Ching-Yu; Chiu, Tai-Yuan; Wu, Yin-Chang
Severe negative emotional reactions to chronic illness are maladaptive to patients and they need to be addressed in a primary care setting. The psychometric properties of a quick screening tool-the Negative Emotions due to Chronic Illness Screening Test (NECIS)-for general emotional problems among patients with chronic illness being treated in a primary care setting was investigated. Three studies including 375 of patients with chronic illness were used to assess and analyze internal consistency, test-retest reliability, criterion-related validity, a cut-off point for distinguishing maladaptive emotions and clinical application validity of NECIS. Self-report questionnaires were used. Internal consistency (Cronbach's α) ranged from 0.78 to 0.82, and the test-retest reliability was 0.71 (P analysis reference, the receiver-operating characteristic curve analysis revealed an area under the curve of 0.81 and 0.82 (ps emotions, with a sensitivity and specificity of 83.3 and 69.0%, and 68.5 and 83.0%, respectively. The clinical application validity analysis revealed that low NECIS group showed significantly better adaptation to chronic illness on the scales of subjective health, general satisfaction with life, self-efficacy of self-care for disease, illness perception and stressors in everyday life. The NECIS has satisfactory psychometric properties for use in the primary care setting. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: email@example.com.
Kang, Qing; Chan, Raymond C K; Li, Xiaoping; Arcelus, Jon; Yue, Ling; Huang, Jiabin; Gu, Lian; Fan, Qing; Zhang, Haiyin; Xiao, Zeping; Chen, Jue
The study aimed to investigate the reliability and validity of the Chinese version of the eating attitudes test (EAT-26) among female adolescents and young adults in Mainland China. This scale was administered to 396 female eating disorder patients and 406 noneating disorder healthy controls, in addition 35 healthy controls completed a retest after a 4-week intervals. Tests for reliability, convergent validity and receiver operating characteristic analysis were performed to detect the psychometric properties. The EAT-26 demonstrated good internal consistency (Cronbach's alpha = 0.822-0.922), test-retest reliability (interclass correlation coefficient = 0.817) and convergent validity(r = 0.450-0.750). The receiver operating characteristic analysis showed that the cut-off 14 for anorexia nervosa and 15 for bulimia nervosa represented good compromises with approximate sensitivity (0.66-0.68) and specificity (0.85-0.86). Our findings provided evidence that the Chinese version of the EAT-26 was a psychometrically reliable and valid self-rating instrument for identifying people suffering from an eating disorder in Mainland China. A clinical cut-off range between 14 and 15 could be used, but caution should be exercised because of the low sensitivity of the tool. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association.
Full Text Available The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum. However, little is known about the test’s psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test’s reliance on one’s vocabulary and evaluate the short-form structure in clinical populations.
Ryabukhin, Yu.S.; Ryabukhin, V.Yu.
Psychometric indicators for mental development of children in towns distinguished by radioactive contamination resulting from the Chernobyl accident are studied. Using some radiological information obtained after the Chernobyl accident, values of expected intelligence quotient (IQ) reduction have been assessed as a result of brain exposure in utero due to various components of dose. Comparing the results of examinations in Novozybkov, Klintsy and Obninsk, no confident evidence has been obtained that radiation exposure of the developing brain exerts influence on indicators for mental development [ru
Kim, Suk Sun; Reed, Pamela G; Kang, Youngmi; Oh, Jina
The purpose of this study was to translate the Spiritual Perspective Scale (SPS) and Self-transcendence Scale (STS) into Korean and test the psychometric properties of the instruments with Korean elders. A cross-sectional survey design was used to implement the three stages of the study. Stage I consisted of translating and reviewing the scales by six experts. In Stage II, equivalence was tested by comparing the responses between the Korean and English versions among 71 bilingual adults. Stage III established the psychometric properties of the Korean versions SPS-K and STS-K among 154 Korean elders. Cronbach's alpha of the SPS-K and the STS-K .97, and .85 respectively with Korean elders. Factor analysis showed that the SPS-K had one factor; the STS-K had four factors with one factor clearly representing self-transcendence as theorized. Both scales showed good reliability and validity for the translated Korean versions. However, continued study of the construct validity of the STS-K is needed. Study findings indicate that the SPS-K and the STS-K could be useful for nurses and geriatric researchers to assess a broadly defined spirituality, and to conduct research on spirituality and health among Korean elders. Use of these scales within a theory-based study may contribute to further knowledge about the role of spirituality in the health and well-being of Korean people facing health crises.
Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J B; Conroy, Thierry; Tomaszewski, Krzysztof A; Young, Teresa; Petersen, Morten Aa
The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients. This evaluation included an assessment of dimensionality, fit to the item response theory (IRT) model, differential item functioning (DIF), and measurement properties. A total of 1030 cancer patients completed the 44 candidate items on CF. Of these, 34 items could be included in a unidimensional IRT model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient assessment of HRQOL of cancer patients, without loss of comparability of results.
Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina
Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the Test of Science-Related Attitudes (TOSRA), such as its construct validity, its discriminant and concurrent validity, and its reliability. The evidence presented suggests that TOSRA, in its Spanish-adapted version, has adequate construct validity regarding its theoretical referents, as well as good indexes of reliability. In addition, it determines the attitudes toward science of secondary school students in Santiago de Chile (n = 664) and analyzes the sex variable as a differentiating factor in such attitudes. The analysis by sex revealed low-relevance gender difference. The results are contrasted with those obtained in English-speaking countries. This TOSRA sample showed good psychometric parameters for measuring and evaluating attitudes toward science, which can be used in classrooms of Spanish-speaking countries or with immigrant populations with limited English proficiency.
Kim, Youngmee; Evangelista, Lorraine S; Phillips, Linda R; Pavlish, Carol; Kopple, Joel D
Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and validity) of the 46-item ESRD-Adherence Questionnaire (ESRD-AQ) in a cohort of patients receiving maintenance HD at dialysis centers in Los Angeles County. The ESRD-AQ is the first self-report instrument to address all components of adherence behaviors of patients with ESRD. The findings support that the instrument is reliable and valid and is easy to administer. Future studies are needed in a larger sample to determine whether additional modifications are needed.
Kruglanski, A W; Atash, M N; DeGrada, E; Mannetti, L; Pierro, A; Webster, D M
S. L. Neuberg, T. N. Judice, and S. G. West (1997) faulted our work with the Need for Closure Scale (NFCS) on grounds that the NFCS lacks discriminant validity relative to S. L. Neuberg's and J. T. Newsom's (1993) Personal Need for Structure (PNS) Scale and is multidimensional, which, so they claim, renders the use of its total score inadmissible. By contrast, the present authors show that neither of the above assertions is incompatible with the underlying need for closure theory. Relations between NFCS and the PNS are to be expected, as these were designed to operationalize the very same construct (of need for closure). Furthermore, no unidimensionality of the NFCS has been claimed, and none is required to use its total score for testing various theoretically derived predictions. An instrument's ultimate utility hinges on theoretical considerations and empirical evidence rather than on questionable psychometric dogma unrelated to the substantive matters at hand.
Strout, Tania D
Agitation is a vexing problem frequently observed in emergency department acute psychiatric patients, yet no instruments to measure agitation in this setting and population were found upon review of the literature. Previously developed agitation rating scales are limited by the length of observation they require, their need for participation by the patient, complexity in scoring, and a lack of validity in this setting and population. The purpose of this study was to psychometrically evaluate and refine an observation-based agitation scale for use with emergency department acute psychiatric patients. Using a methodological design, the 21-item Agitation Severity Scale was utilized to assess 270 adult psychiatric patients in the emergency setting in a prospective, observational fashion. Reliability analysis, item analysis, exploratory factor analysis, and validity assessments were completed. The relationship between Agitation Severity Scale scores and scores on the previously established Overt Agitation Severity Scale was evaluated. The instrument was reduced to 17 items representing four factors (Aggressive Behaviors, Interpersonal Behaviors, Involuntary Motor Behaviors, and Physical Stance) that accounted for nearly 70% of observed variance, Cronbach's α = 0.91. Evidence of internal consistency reliability, equivalence reliability, construct validity, and convergent validity was established. Through this study, the 17-item Agitation Severity Scale demonstrated acceptable levels of reliability and validity when used with acute psychiatric patients in the emergency setting. This instrument holds promise as a method of enhancing clinical communication about agitation, evaluating the efficacy of interventions aimed at decreasing agitation, and as a research tool.
Full Text Available The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It. Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170 examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA and construct validity with enjoyment perception during physical activity. Study 2 (n = 59 reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry over the span of seven consecutive days. Study 3 (n = 58 examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD. In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83. Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36, with BMI (r = -.30 and -.79 for CHD simple form, and with the VO2max (r = .55 for CHD simple form. Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05. Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio
The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Alexander, Dayna S; Alfonso, Moya L; Cao, Chunhua
Currently, public health practitioners are analyzing the role that caregivers play in childhood obesity efforts. Assessing African American caregiver's perceptions of childhood obesity in rural communities is an important prevention effort. This article's objective is to describe the development and psychometric testing of a survey tool to assess childhood obesity perceptions among African American caregivers in a rural setting, which can be used for obesity prevention program development or evaluation. The Childhood Obesity Perceptions (COP) survey was developed to reflect the multidimensional nature of childhood obesity including risk factors, health complications, weight status, built environment, and obesity prevention strategies. A 97-item survey was pretested and piloted with the priority population. After pretesting and piloting, the survey was reduced to 59-items and administered to 135 African American caregivers. An exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) was conducted to test how well the survey items represented the number of Social Cognitive Theory constructs. Twenty items were removed from the original 59-item survey and acceptable internal consistency of the six factors (α=0.70-0.85) was documented for all scales in the final COP instrument. CFA resulted in a less than adequate fit; however, a multivariate Lagrange multiplier test identified modifications to improve the model fit. The COP survey represents a promising approach as a potentially comprehensive assessment for implementation or evaluation of childhood obesity programs. Copyright © 2016 Elsevier Ltd. All rights reserved.
Wiklander, Maria; Brännström, Johanna; Svedhem, Veronica; Eriksson, Lars E
Barriers to HIV testing experienced by individuals at risk for HIV can result in treatment delay and further transmission of the disease. Instruments to systematically measure barriers are scarce, but could contribute to improved strategies for HIV testing. Aims of this study were to develop and test a barriers to HIV testing scale in a Swedish context. An 18-item scale was developed, based on an existing scale with addition of six new items related to fear of the disease or negative consequences of being diagnosed as HIV-infected. Items were phrased as statements about potential barriers with a three-point response format representing not important, somewhat important, and very important. The scale was evaluated regarding missing values, floor and ceiling effects, exploratory factor analysis, and internal consistencies. The questionnaire was completed by 292 adults recently diagnosed with HIV infection, of whom 7 were excluded (≥9 items missing) and 285 were included (≥12 items completed) in the analyses. The participants were 18-70 years old (mean 40.5, SD 11.5), 39 % were females and 77 % born outside Sweden. Routes of transmission were heterosexual transmission 63 %, male to male sex 20 %, intravenous drug use 5 %, blood product/transfusion 2 %, and unknown 9 %. All scale items had <3 % missing values. The data was feasible for factor analysis (KMO = 0.92) and a four-factor solution was chosen, based on level of explained common variance (58.64 %) and interpretability of factor structure. The factors were interpreted as; personal consequences, structural barriers, social and economic security, and confidentiality. Ratings on the minimum level (suggested barrier not important) were common, resulting in substantial floor effects on the scales. The scales were internally consistent (Cronbach's α 0.78-0.91). This study gives preliminary evidence of the scale being feasible, reliable and valid to identify different types of barriers to HIV testing.
Giuffrida, Michelle A; Brown, Dorothy Cimino; Ellenberg, Susan S; Farrar, John T
OBJECTIVE To describe development and initial psychometric testing of an owner-reported questionnaire designed to standardize measurement of general quality of life (QOL) in dogs with cancer. DESIGN Key-informant interviews, questionnaire development, and field trial. SAMPLE Owners of 25 dogs with cancer for item development and pretesting and owners of 90 dogs with cancer for reliability and validity testing. PROCEDURES Standard methods for development and testing of questionnaire instruments intended to measure subjective states were used. Items were generated, selected, scaled, and pretested for content, meaning, and readability. Response items were evaluated with exploratory factor analysis and by assessing internal consistency (Cronbach α) and convergence with global QOL as determined with a visual analog scale. Preliminary tests of stability and responsiveness were performed. RESULTS The final questionnaire-which was named the Canine Owner-Reported Quality of Life (CORQ) questionnaire-contained 17 items related to observable behaviors commonly used by owners to evaluate QOL in their dogs. Several items pertaining to physical symptoms performed poorly and were omitted. The 17 items were assigned to 4 factors-vitality, companionship, pain, and mobility-on the basis of the items they contained. The CORQ questionnaire and its factors had high internal consistency (Cronbach α = 0.68 to 0.90) and moderate to strong correlations (r = 0.49 to 0.71) with global QOL as measured on a visual analog scale. Preliminary testing indicated good test-retest reliability and responsiveness to improvements in overall QOL. CONCLUSIONS AND CLINICAL RELEVANCE The CORQ questionnaire was a valid, reliable owner-reported questionnaire that measured general QOL in dogs with cancer and showed promise as a clinical trial outcome measure for quantifying changes in individual dog QOL occurring in response to cancer treatment and progression.
Conclusion: Although both the chained linear equating method and Rasch analysis can be readily applied to practical test-equating issues in medical education, Rasch analysis exhibited more versatility in test parameter estimation and item bank development for clinical curriculums.
Kolman, Nikki; Huijgen, Barbara; Kramer, Tamara; Elferink-Gemser, Marije; Visscher, Chris
This study examined the test-retest reliability, validity and feasibility of the newly developed Dutch Technical Tactical Tennis Test (D4T). This new test is relevant for talent identification and development in tennis. Thirty-two youth male tennis players (age 13.4 +/- 0.5) were classified as elite
Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B
This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients with mixed cancers, in different treatment phases from 10 countries completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54 % females) were analyzed. Completion took issues were addressed. Analyses suggested a revision of the phase III hypothesized scale structure. Two items were deleted based on a high degree of item misfit, together with negative patient feedback. The remaining 15 items formed one eight-item scale named OH-QoL score, a two-item information scale, a two-item scale regarding dentures, and three single items (sticky saliva/mouth soreness/sensitivity to food/drink). Face and convergent validity and internal consistency were confirmed. Test-retest reliability (n = 60) was demonstrated as was RCA for patients undergoing chemotherapy (n = 117; p = 0.06). The resulting QLQ-OH15 discriminated between clinically distinct patient groups, e.g., low performance status vs. higher (p < 000.1), and head-and-neck cancer versus other cancers (p < 0.03). The EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.
Leeuw, Jan de
In psychometrics, and in the closely related fields of quantititative methods for the social and educational sciences, R is not yet used very often. Traditional mainframe packages such as SAS and SPSS are still dominant at the user-level, Stata has made inroads at the teaching level, and Matlab is quite prominent at the research level. In this paper we define the most visible techniques in the psychometrics area, we give an overview of what is available in R, and we discuss what is m...
Full Text Available Introduction. Disturbed eating attitudes may be important precursors of pathological eating patterns and, therefore need to be researched adequately. The Children's Eating Attitude Test (ChEAT is indicated for detecting at-risk attitudes and concerns in youngsters. Method. The present study was designed to provide a preliminary psychometric evaluation of the Dutch version of the ChEAT, by examining reliability and validity in a sample of 166 youngsters. Results. Generally the ChEAT seems to be a reliable instrument. Concurrent validity was demonstrated by positive correlations with measures assessing pathological eating behaviour and with related psychological problems. The discriminant validity was good. Based on ChEAT scores we can distinguish overweight youngsters from the community sample and “dieters” from “non dieters”. Divergent validity and factor structure show still shortcomings. Discussion. The Dutch version of the ChEAT seems to be a promising screening- and research instrument. Future prospective research could focus on a cut-off score for identifying at-risk youngsters.
Remor, Eduardo; Fuster-RuizdeApodaca, Maria José; Ballester-Arnal, Rafael; Gómez-Martínez, Sandra; Fumaz, Carmina R; González-Garcia, Marian; Ubillos-Landa, Silvia; Aguirrezabal-Prado, Arrate; Molero, Fernando; Ruzafa-Martínez, Maria
The Screenphiv, a screening measure for psychological issues related to HIV, was psychometrically tested in a study involving 744 HIV-infected people in Spain. Participants ages 18-82 (M = 43.04, 72 % men, 28 % women) completed an assessment protocol that included the Screenphiv and the MOS-HIV. A trained interviewer also collected relevant illness-related clinical data and socio-demographics from the participants. A confirmatory factor analysis was used to evaluate the goodness of fit of the Screenphiv's theoretical model and confirmed six first-order factors and two second-order factors [RMSEA (IC 90 %) = 0.07 (0.07-0.08)]. No floor or ceiling effects were observed for the scores. Cronbach's alphas were acceptable for all of the factors (from 0.65 to 0.92). Criterion-related validity also achieved; Screenphiv scores were related to socio-demographic and clinical variables and MOS-HIV summary scores. The Screenphiv is a reliable and valid measure, ready to use in research and clinical settings in Spain.
Oliveri, María Elena; von Davier, Alina A.
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Full Text Available This study examined the test-retest reliability, validity and feasibility of the newly developed Dutch Technical-Tactical Tennis Test (D4T. This new test is relevant for talent identification and development in tennis. Thirty-two youth male tennis players (age 13.4 ± 0.5 were classified as elite (n = 15 or sub-elite (n = 17 according to their position on the national youth ranking list under 15 years (cut-off rank 50 in the Netherlands. Games, rallies and different tactical situations (i.e. offensive, neutral and defensive were simulated with a ball machine. Players had to return 72 balls to predetermined target areas. Stroke quality was recorded based on ball velocity and accuracy (VA-index, as well as percentage errors. Test-retest reliability was assessed by comparing differences between the first and second test-session (n = 10. An intraclass-correlation coefficient of .78 for the VA-index was found (p < .05, indicating excellent test-retest reliability. Independent t-tests revealed that elite players outscored sub-elite players for the VA-index, ball velocity, accuracy and percentage errors (p < .05, supporting good validity. Furthermore, a high correlation was found between the VA-index and individual positions on the youth ranking list (p = -.75; p < .001. The assessment of feasibility indicated that the D4T was applicable for instructors and coaches. In conclusion, the D4T was shown to be a reliable, valid and feasible test to measure technical-tactical characteristics of tennis performance in youth players.
Verster, Joris C.; Roth, Thomas
Rationale There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving is often not determined. Objective The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Methods Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a ...
Lahey, Benjamin B.; Applegate, Brooks; Chronis, Andrea M.; Jones, Heather A.; Williams, Stephanie Hall; Loney, Jan; Waldman, Irwin D.
Lahey and Waldman proposed a developmental propensity model in which three dimensions of children's emotional dispositions are hypothesized to transact with the environment to influence risk for conduct disorder, heterogeneity in conduct disorder, and comorbidity with other disorders. To prepare for future tests of this model, a new measure of…
Raykov, Tenko; Marcoulides, George A.; Dimitrov, Dimiter M.; Li, Tatyana
This article extends the procedure outlined in the article by Raykov, Marcoulides, and Tong for testing congruence of latent constructs to the setting of binary items and clustering effects. In this widely used setting in contemporary educational and psychological research, the method can be used to examine if two or more homogeneous…
Hakim-Larson, Julie; Parker, Alison; Lee, Catharine; Goodwin, Jacqueline; Voelker, Sylvia
Parental meta-emotion, assessed through interviews, involves parents' philosophy about emotions and has been found to be related to parenting behaviors and children's emotional and social competence (e.g., Gottman, Katz, & Hooven, 1996; Katz & Windecker-Nelson, 2004). The Emotion-Related Parenting Styles Self-Test is a true-false…
Richard R. McNeer
Full Text Available Introduction. Medical simulators are used for assessing clinical skills and increasingly for testing hypotheses. We developed and tested an approach for assessing performance in anesthesia residents using screen-based simulation that ensures expert raters remain blinded to subject identity and experimental condition. Methods. Twenty anesthesia residents managed emergencies in an operating room simulator by logging actions through a custom graphical user interface. Two expert raters rated performance based on these entries using custom Global Rating Scale (GRS and Crisis Management Checklist (CMC instruments. Interrater reliability was measured by calculating intraclass correlation coefficients (ICC, and internal consistency of the instruments was assessed with Cronbach’s alpha. Agreement between GRS and CMC was measured using Spearman rank correlation (SRC. Results. Interrater agreement (GRS: ICC = 0.825, CMC: ICC = 0.878 and internal consistency (GRS: alpha = 0.838, CMC: alpha = 0.886 were good for both instruments. Subscale analysis indicated that several instrument items can be discarded. GRS and CMC scores were highly correlated (SRC = 0.948. Conclusions. In this pilot study, we demonstrated that screen-based simulation can allow blinded assessment of performance. GRS and CMC instruments demonstrated good rater agreement and internal consistency. We plan to further test construct validity of our instruments by measuring performance in our simulator as a function of training level.
Pearcy, Benjamin T D; Roberts, Lynne D; McEvoy, Peter M
Internet Gaming Disorder (IGD) is in the early stages of recognition as a disorder, following its inclusion in the Diagnostic and Statistical Manual for Mental Disorders (DSM-5; American Psychiatric Association(1)) as a condition for further study. Existing measures of Internet gaming pathology are limited in their ability to measure IGD as defined in the DSM-5. We present the initial development and validation of a new measure derived from the proposed DSM-5 criteria for IGD, the Personal Internet Gaming Disorder Evaluation-9 (PIE-9). A student sample (n = 119) and a community sample (n = 285), sourced through a variety of online gaming forums, completed an online survey comprising the new measure, existing measures of IGD, and a range of health and demographic questions. Exploratory and confirmatory factor analysis supported a single factor structure for the 9-item PIE-9. Internal consistency (α = 0.89) and test-retest reliability (intraclass correlation [ICC] = 0.77) were high. Convergent validity was demonstrated with similar gaming addiction measures. Predictive validity was established through significant differences in distress and disability between those who met the criteria for IGD and those who did not. The distress and disability associated with meeting IGD criteria fell within the range of other common DSM-5 disorders. Preliminary testing of the PIE-9 has demonstrated that it is an efficient and straightforward measure for use in further research of IGD, and as a potential screening measure in clinical practice.
Ana Paula Porto Noronha
Full Text Available A validade e a precisão de testes psicológicos vêm sendo bastante questionadas e discutidas atualmente. O presente estudo teve como objetivo avaliar a validade, a precisão e a existência de padronização brasileira em 43 testes psicológicos comercializados no Brasil, sendo 22 de inteligência e 21 de personalidade. Os testes foram comparados quanto ao período de publicação no Brasil. Os resultados indicaram que existe maior número de instrumentos publicados nas décadas de 1980 e 1990, que os testes de inteligência apresentam mais estudos de padronização, validade e precisão, embora não tenha havido diferença significante entre os grupos de testes (inteligência e personalidade. Novos estudos devem ser desenvolvidos com o intuito de promover os testes psicológicos e a área de avaliação psicológica, como um todo.Nowadays, the validity and the reliability of psychological tests are being very questioned and discussed. The current study aims to evaluate the validity, the reliability and the existence of Brazilian standardization in 43 psychological tests commercialized in Brazil, being 22 of intelligence and 21 of personality. The tests have been compared concearning the publication period in Brazil. The results have indicated that there is a bigger number of instruments published in the 1980´s and 1990´s, that the intelligence tests present more studies of standardization, validity and reliability, even though there hasn´t been significant difference among the groups tests (intelligence and personality. New studies must be developed aiming to promote the psychological tests and the area of psychological assessment, as a whole.
Vedam, Saraswathi; Stoll, Kathrin; Martin, Kelsey; Rubashkin, Nicholas; Partridge, Sarah; Thordarson, Dana; Jolicoeur, Ganga
To develop and validate a new instrument that assesses women's autonomy and role in decision making during maternity care. Through a community-based participatory research process, service users designed, content validated, and administered a cross-sectional quantitative survey, including 31 items on the experience of decision-making. Pregnancy experiences (n = 2514) were reported by 1672 women who saw a single type of primary maternity care provider in British Columbia. They described care by a midwife, family physician or obstetrician during 1, 2 or 3 maternity care cycles. We conducted psychometric testing in three separate samples. We assessed reliability, item-to-total correlations, and the factor structure of the The Mothers' Autonomy in Decision Making (MADM) scale. We report MADM scores by care provider type, length of prenatal appointments, preferences for role in decision-making, and satisfaction with experience of decision-making. The MADM scale measures a single construct: autonomy in decision-making during maternity care. Cronbach alphas for the scale exceeded 0.90 for all samples and all provider groups. All item-to-total correlations were replicable across three samples and exceeded 0.7. Eigenvalue and scree plots exhibited a clear 90-degree angle, and factor analysis generated a one factor scale. MADM median scores were highest among women who were cared for by midwives, and 10 or more points lower for those who saw physicians. Increased time for prenatal appointments was associated with higher scale scores, and there were significant differences between providers with respect to average time spent in prenatal appointments. Midwifery care was associated with higher MADM scores, even during short prenatal appointments (maternity care. This new scale was developed and content validated by community members representing various populations of childbearing women in BC including women from vulnerable populations. MADM measures women's ability to lead
Amber E. Vaughn
development (r = 0.21, and nutrition policy (r = 0.18. Child MVPA was significantly associated with overall time provided for activity (r = 0.18 and outdoor playtime (r = 0.20. There was also an unexpected negative association between child MVPA and screen time (−0.16 and screen time practices (r = −0.21. Conclusions The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. Trial registration NCT01814215 , March 15, 2013.
Vaughn, Amber E; Mazzucca, Stephanie; Burney, Regan; Østbye, Truls; Benjamin Neelon, Sara E; Tovar, Alison; Ward, Dianne S
= 0.18). Child MVPA was significantly associated with overall time provided for activity (r = 0.18) and outdoor playtime (r = 0.20). There was also an unexpected negative association between child MVPA and screen time (-0.16) and screen time practices (r = -0.21). The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. NCT01814215 , March 15, 2013.
Kötter, T; Obst, K U; Brüheim, L; Eisemann, N; Voltmer, E; Katalinic, A
Background The final exam grade is the main selection criterion for medical school application in Germany. For academic success, it seems to be a reliable predictor. Its use as the only selection criterion is, however, criticised. At some universities, personal interviews are part of the selection process. However, these are very time consuming and are of doubtful validity. The (additional) use of appropriate psychometric instruments could reduce the cost and increase the validity. This study investigates the extent to which psychometric instruments can predict the outcome of a personal selection interview. Methods This is a cross-sectional study on the correlation of the results of psychometric instruments with those of the personal selection interview as part of the application process. As the outcome, the score of the selection interview was used. The NEO - Five Factor Inventory, the Hospital Anxiety and Depression Scale (HADS) and the questionnaire to identify work-related behaviour and experience patterns (AVEM) were used as psychometric interviews. Results There was a statistically significant correlation with the results of the personal selection interview for the sum score of the depression scale from the HADS and the sum score for the dimension of life satisfaction of the AVEM. In addition, those participants who did not previously complete an application training achieved a better result in the selection interview. Conclusion The instruments used measure different aspects than the interviews and cannot replace them. It remains to be seen whether the selected parameters are able to predict academic success. © Georg Thieme Verlag KG Stuttgart · New York.
Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane
Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765
Bolt, Daniel; Ark, L; Wang, Wen-Chung
The 78th Annual Meeting of the Psychometric Society (IMPS) builds on the Psychometric Society's mission to share quantitative methods relevant to psychology. The chapters of this volume present cutting-edge work in the field. Topics include studies of item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Additional psychometric topics relate to structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis, among others. The papers in this volume will be especially useful for researchers in the social sciences who use quantitative methods. Prior knowledge of statistical methods is recommended. The 78th annual meeting took place in Arnhem, The Netherlands between July 22nd and 26th, 2013. The previous volume to showcase work from the Psychometric Society’s Meeting is New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 201...
Davis, Frederick B.
This review of psychometric research in reading analyzes the factors which seem related to reading comprehension skills. Experimental analysis of reading comprehension by L. E. Thorndike revealed two major components: knowledge of word meanings and verbal reasoning abilities. Subsequent analysis of experimental studies of reading comprehension…
Mathias, Susan D; Bussel, James B; George, James N; McMillan, Robert; Okano, Gary J; Nichol, Janet L
The Immune Thrombocytopenic Purpura Patient Assessment Questionnaire (ITP-PAQ) was developed to assess disease-specific quality of life (QoL) in adults with ITP. It is a 44-item questionnaire that includes scales for physical health (symptoms, fatigue/sleep, bother, and activity), emotional health (psychological and fear), overall QoL, social activity, women's reproductive health, and work. A previous study reported preliminary evidence of its reliability and validity. The present study was conducted to ascertain the responsiveness (ability to detect a clinically important treatment effect), reliability, and validity of the ITP-PAQ and to corroborate the earlier findings. The women's reproductive health scale was evaluated for psychometric evidence of the existence of separate menstrual symptoms and fertility subscales. The ITP-PAQ was evaluated in the context of an ongoing open-label extension study assessing the tolerability and durability of increases in the platelet count with AMG 531 (a thrombopoiesis peptibody that increases platelet production by targeting the thrombopoietin receptor) administered by subcutaneous injection once weekly in adult patients with ITP It was self-administered at baseline and at weeks 4, 12, and 24. The responsiveness of the questionnaire was evaluated by calculating and comparing the change scores of patients who showed clinical improvement-categorized as platelet responders (those with a platelet count > or =50 x 10(9) cells/L and a doubling of baseline values at week 24) and durable platelet responders (those with a platelet count > or =50 x 10(9) cells/L and a doubling of baseline values on > or =6 occasions during weeks 17-24)-with the change scores of patients wh did not show clinical improvement. The reliability (internal consistency and test-retest) and validity (convergent, discriminant, and known groups) of the questionnaire were also evaluated. Validity was examined in terms of correlations between the ITP-PAQ and the 36
Coombes, Lee; Roberts, Martin; Zahra, Daniel; Burr, Steven
It is incumbent on medical schools to show, both to regulatory bodies and to the public at large, that their graduating students are "fit for purpose" as tomorrow's doctors. Since students graduate by virtue of passing assessments, it is vital that schools quality assure their assessment procedures, standards, and outcomes. An important part of this quality assurance process is the appropriate use of psychometric analyses. This begins with development of an empowering, evidence-based culture in which assessment validity can be demonstrated. Preparation prior to an assessment requires the establishment of appropriate rules, test blueprinting and standard setting. When an assessment has been completed, the reporting of test results should consider reliability, assessor, demographic, and long-term analyses across multiple levels, in an integrated way to ensure the information conveyed to all stakeholders is meaningful.
Franklin, Ashley E; Burns, Paulette; Lee, Christopher S
In 2006, the National League for Nursing published three measures related to novice nurses' beliefs about self-confidence, scenario design, and educational practices associated with simulation. Despite the extensive use of these measures, little is known about their reliability and validity. The psychometric properties of the Student Satisfaction and Self-Confidence in Learning Scale, Simulation Design Scale, and Educational Practices Questionnaire were studied among a sample of 2200 surveys completed by novice nurses from a liberal arts university in the southern United States. Psychometric tests included item analysis, confirmatory and exploratory factor analyses in randomly-split subsamples, concordant and discordant validity, and internal consistency. All three measures have sufficient reliability and validity to be used in education research. There is room for improvement in content validity with the Student Satisfaction and Self-Confidence in Learning and Simulation Design Scale. This work provides robust evidence to ensure that judgments made about self-confidence after simulation, simulation design and educational practices are valid and reliable. Copyright © 2014 Elsevier Ltd. All rights reserved.
Vitoratou, Silia; Pickles, Andrew
Psychometrics provide the mathematical underpinnings for psychological assessment. From the late 19th century, a plethora of methodological research achievements equipped researchers and clinicians with efficient tools whose practical value becomes more evident in the era of the internet and big data. Nowadays, powerful probabilistic models exist for most types of data and research questions. As the usability of the psychometric scales is better comprehended, there is an increased interest in applied research outcomes. Paradoxically, while the interest in applications for psychometric scales increases, publishing research on the development and/or evaluation of those scales per se, is not welcomed by many relevant journals. This special issue in psychometrics is therefore a great opportunity to briefly review the main ideas and methods used in psychometrics, and to discuss the challenges in contemporary applied psychometrics.
Cross-cultural adaptation and analysis of the psychometric properties of the Balance Evaluation Systems Test and MiniBESTest in the elderly and individuals with Parkinson's disease: application of the Rasch model
Angelica C. Maia
Full Text Available BACKGROUND: Older adults and individuals with neurological problems such as Parkinson's disease (PD exhibit balance deficits that might impair their mobility and independence. The assessment of balance must be useful in identifying the presence of instability and orient interventions. OBJECTIVE: To translate and perform a cross-cultural adaptation of the Balance Evaluation Systems Test (BESTest and MiniBESTest to Brazilian Portuguese and analyze its psychometric properties. METHOD: The tests were translated and adapted to Portuguese according to a standard method and then subjected to a test-retest reliability assessment (10 older adults; 10 individuals with PD. The psychometric properties were assessed by the Rasch model (35 older adults; 35 individuals with PD. RESULTS: The reliability coefficient of the tests relative to the items and subjects varied from 0.91 and 0.98, which is indicative of the stability and reproducibility of the measures. In the BESTest, the person (4.19 and item (5.36 separation index established six balance ability levels and seven levels of difficulty, respectively. In the MiniBESTest, the person (3.16 and item (6.41 separation index established four balance ability levels and nine levels of difficulty, respectively. Two items in the BESTest did not fit with the model expectations, but the construct validity was not compromised. No item in the MiniBESTest was erratic. CONCLUSIONS: The results corroborate the diagnostic and screening functions of the BESTest and MiniBESTest, respectively, and indicate that the Brazilian versions exhibit adequate reliability, construct validity, response stability, and capacity to distinguish among various balance ability levels in older adults and individuals with PD.
Combination of classical test theory (CTT) and item response theory (IRT) analysis to study the psychometric properties of the French version of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF).
Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric
The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.
Mortensen, E L; Simonsen, E
A translation of the MCMI-I has been in use in Denmark for some years. An untested assumption in the interpretation of the pattern of test results is that the psychometric characteristics of the Danish and American versions are similar. The purpose of this study was to evaluate the psychometric...... properties of the questionnaire by using traditional psychometric analysis techniques on the results of a sample consisting of 423 patients and 179 normal controls. Coefficient alpha was calculated for the 20 clinical subscales of the test and the Danish results were strikingly similar to the original...... coefficients reported by Millon. Furthermore, factor analysis of the subscales showed a factor structure very similar to American findings, and it is concluded that the psychometric properties of the Danish MCMI are not significantly different from the original....
Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Chow, Sy-Miin
These research articles from the 79th Annual Meeting of the Psychometric Society (IMPS) cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 79th annual meeting took place in Madison, WI between July 21nd and 25th, 2014. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013) and Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015).
Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Wiberg, Marie
The research articles in this volume cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 80th annual meeting took place in Beijing, China, between the 12th and 16th of July, 2014. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013), Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015), and Quantitative Psychology Research: The 79th Annual Meeting of the Psychometric Society, Wisconsin, USA, 2014 (Springer, 2015).
Full Text Available Abstract Background Studies on the effects of tuberculosis on a patient’s quality of life (QOL are scant. The objective of this study was to evaluate the psychometric properties of the Taiwan short version of the World Health Organization Quality of Life (WHOQOL-BREF questionnaire using patients with tuberculosis in Taiwan and healthy referents. Methods The Taiwanese short version of the WHOQOL-BREF was administered to patients with tuberculosis undergoing treatment and healthy referents from March 2007 to July 2007. Patients with tuberculosis (n = 140 and healthy referents (n = 130, matched by age, sex, and ethnicity, agreed to an interview. All participants lived in eastern Taiwan. Reliability assessments included internal consistency, whereas validity assessments included construct validity, convergent validity, and discriminant validity. Results More than half of these patients and referents were men (70.7% and 66.2%, respectively, and their average ages were 50.1 and 47.9 years, respectively. Approximately 60% of patients and referents were aboriginal Taiwanese (60.7% and 61.1%, respectively. The proportion with low socioeconomic status was greater for these patients. The internal consistency reliability coefficients were .92 and .93 for the patients and healthy referents, respectively. Exploratory factor analysis on the healthy referents displayed a 4-domain model, which was compatible with the original WHOQOL-BREF 4-domain model. However, for the TB patient group, after deleting 3 items, both exploratory and confirmatory factor analysis revealed a 6-domain model. Conclusion Psychometric evaluation of the Taiwan short version of the WHOQOL-BREF indicates that it has adequate reliability for use in research with TB patients in Taiwan. However, the factor structure generated from this TB patient sample differed from the WHO’s original 4-factor model, which raised a validity concern to apply the Taiwan short version of the WHOQOL
Eklund, Mona; Morville, Anne-Le
Aims: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. Methods: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed. Key words: validity, reliability, health, Activity...
Eklund, Mona; Morville, Anne-Le
AIMS: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. METHODS: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed....
Nowparast Rostami, Hadiseh; Sommer, Werner; Zhou, Changsong; Wilhelm, Oliver; Hildebrandt, Andrea
The enhanced N1 component in event-related potentials (ERP) to face stimuli, termed N170, is considered to indicate the structural encoding of faces. Previously, individual differences in the latency of the N170 have been related to face and object cognition abilities. By orthogonally manipulating content domain (faces vs objects) and task demands (easy/speed vs difficult/accuracy) in both psychometric and EEG tasks, we investigated the uniqueness of the processes underlying face cognition as compared with object cognition and the extent to which the N1/N170 component can explain individual differences in face and object cognition abilities. Data were recorded from N = 198 healthy young adults. Structural equation modeling (SEM) confirmed that the accuracies of face perception (FP) and memory are specific abilities above general object cognition; in contrast, the speed of face processing was not differentiable from the speed of object cognition. Although there was considerable domain-general variance in the N170 shared with the N1, there was significant face-specific variance in the N170. The brain-behavior relationship showed that faster face-specific processes for structural encoding of faces are associated with higher accuracy in both perceiving and memorizing faces. Moreover, in difficult task conditions, qualitatively different processes are additionally needed for recognizing face and object stimuli as compared with easy tasks. The difficulty-dependent variance components in the N170 amplitude were related with both face and object memory (OM) performance. We discuss implications for understanding individual differences in face cognition. Copyright © 2017 Elsevier Ltd. All rights reserved.
Nine methods for automated test construction are described. All are based on the concepts of information from item response theory. Two general kinds of methods for the construction of parallel tests are presented: (1) sequential test design; and (2) simultaneous test design. Sequential design implies that the tests are constructed one after the…
Doran, Jennifer M; Safran, Jeremy D; Muran, J Christopher
This study investigates the utility and psychometric properties of a new measure of psychotherapy process, the Alliance Negotiation Scale (ANS; Doran, Safran, Waizmann, Bolger, & Muran, 2012). The ANS was designed to operationalize the theoretical construct of negotiation (Safran & Muran, 2000), and to extend our current understanding of the working alliance concept (Bordin, 1979). The ANS was also intended to improve upon existing measures such as the Working Alliance Inventory (WAI; Horvath & Greenberg, 1986, 1989) and its short form (WAI-S; Tracey & Kokotovic, 1989) by expanding the emphasis on negative therapy process. The present study investigates the psychometric validity of the ANS test scores and interpretation-including confirming its original factor structure and evaluating its internal consistency and construct validity. Construct validity was examined through the ANS' convergence and divergence with several existing scales that measure theoretically related constructs. The results bolster and extend previous findings about the psychometric integrity of the ANS, and begin to illuminate the relationship between negotiation and other important variables in psychotherapy research. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Bielinski, John; Minnema, Jane; Thurlow, Martha
A Web-based survey of 25 experts in testing theory and large-scale assessment examined the utility of out-of-level testing for making decisions about students and schools. Survey respondents were given a series of scenarios and asked to judge the degree to which out-of-level testing would affect the reliability and validity of test scores within…
Vera Lúcia Marques de Figueiredo
psychometric properties of WISC-III items, specifically difficulty, discrimination and validity of items. Since WISC-III is a widely used test for the assessment of intelligence, knowledge on the quality of items of the test seems to be essential for professionals using it. Analyses were performed on data 801 protocols obtained in the adaptation research of a test within a Brazilian context. Analyses showed that the adapted items presented adequate psychometric features which make it a reliable diagnostic tool.
Blom, Eva Henje; Bech, Per; Högberg, Göran
of two such scales, which may be used in a two-step screening procedure, the WHO-Five Well-being Index (WHO-5) and the six-item version of Beck's Depression Inventory (BDI-6). METHOD: 66 adolescent psychiatric patients with a clinical diagnosis of major depressive disorder (MDD), 60 girls and 6 boys......, aged 14--18 years, mean age 16.8 years, completed the WHO-5 scale as well as the BDI-6. Statistical validity was tested by Mokken and Rasch analyses. RESULTS: The correlation between WHO-5 and BDI-6 was -0.49 (p=0.0001). Mokken analyses showed a coefficient of homogeneity for the WHO-5 of 0.......52 and for the BDI-6 of 0.46. Rasch analysis also accepted unidimensionality when testing males versus females (p > 0.05). CONCLUSIONS: The WHO-5 is psychometrically valid in an adolescent psychiatric context including both genders to assess the wellness dimension and applicable as a first step in screening for MDD...
Barchard, Kimberly A.; Pace, Larry A.
Undergraduate psychometrics classes often use computer-intensive active learning projects. However, little research has examined active learning or computer-intensive projects in psychometrics courses. We describe two computer-intensive collaborative learning projects used to teach the design and evaluation of psychological tests. Course…
Thomas, Michael L.; Locke, Dona E. C.
The MMPI-2 Restructured Form (MMPI-2-RF; Tellegen & Ben-Porath, 2008) was designed to be psychometrically superior to its MMPI-2 counterpart. However, the test has yet to be extensively evaluated in diverse clinical settings. The purpose of this study was to examine the psychometric properties of the MMPI-2-RF Somatic Complaints (RC1) scale in…
Miller, Robert; Rammsayer, Thomas H.; Schweizer, Karl; Troche, Stefan J.
Several memory processes have been examined regarding their relation to psychometric intelligence with the exception of sensory memory. This study examined the relation between decay of iconic memory traces, measured with a partial-report task, and psychometric intelligence, assessed with the Berlin Intelligence Structure test, in 111…
Activities of daily living (ADL), such as walking, often involve the added complexity of walking while doing other activities (i.e. dual task walking). A complex walking task may require a greater motor and mental capacity, resulting in decrements in gait performance not seen for simple walking tasks. The purpose of this study was to determine if the trail walking test (TWT), the mobile adaptation of the trail making test (TMT), could be a reliable and valid early detection tool to discriminate between non-fallers and fallers. This study examined dual task costs of a cognitive and a sensorimotor task (walking) in 94 older adults aged 50-81 years (average age M = 67.4 years, SD ± 7.34). Based on the idea of the paper and pencil TMT, participants walked along a fixed pathway (TWT-1), stepped on targets with increasing sequential numbers (i.e. 1, 2, 3, TWT-2), and increasing sequential numbers and letters (i.e. 1, A, 2, B, 3, C, TWT-3). The dual task costs were calculated for each task. Additionally, the following tests were conducted: TMT, block tapping test (BTT), timed up and go (TUG) test, 30s chair rising test, 10 m walking time test with and without head turns, German physical activity questionnaire (German PAQ-50 +) and the activities-specific balance confidence (ABC-D) scale. The TWT performance times as well as errors increased with increasing age. Reliability coefficients were high (interclass correlation ICC > 0.90). Correlations between the different TWT conditions and potential falls-related predictors were moderate to high (r = -0.430 to 0.699). Of the participants 34 % reported falling in the past year. The stepwise logistic regression analysis revealed that the dual task costs for the numbers and letters (odds ratio OR 1.162, 95 % confidence interval CI 1.058-1.277, p = 0.002), the ABC-D (OR 0.767, 95 % CI 0.651-0.904, p = 0.002) and exercise (OR 1.027, 95 % CI 1.008-1.046, p = 0.006) were significantly related to
Watson, Shaun D.; Gomez, Rapson; Gullone, Eleonora
This study examined various psychometric properties of the items comprising the shame and guilt scales of the Test of Self-Conscious Affect-Adolescent (TOSCA-A) in a group children between 8 and 11 years of age. A total of 699 children (367 females and 332 males) completed these scales, and also measures of depression and empathy. Confirmatory factor analysis (CFA) provided support for an oblique two-factor model, with the originally proposed shame and guilt items comprising shame and guilt factors, respectively. There was good internal consistency reliability for the shame and guilt scales, with omega coefficient values of 0.77 and 0.81 for shame and guilt, respectively. Also, shame correlated with depression symptoms positively (0.34, p Guilt correlated with depression symptoms negatively (-0.28, p guilt factors. Multiple-group CFA comparing this group of children with a separate group of adolescents (320 females and 242 males), based on the chi-square difference test, supported full metric invariance, the intercept invariance of 17 of the 30 shame and guilt items, and higher latent mean scores among children for both shame and guilt. The non-equivalency for intercepts and mean scores were of small effect sizes. Comparisons based on the difference in root mean squared error of approximation values supported full measurement invariance and no group difference for latent mean scores. The findings in the current study support the use of the TOSCA-A in children and the valid comparison of scores between children and adolescents, thereby opening up the possibility of evaluating change in the TOSCA-A shame and guilt factors over these developmental age groups. PMID:27242573
Clemens, Sheila M; Gailey, Robert S; Bennett, Christopher L; Pasquina, Paul F; Kirk-Sanchez, Neva J; Gaunaurd, Ignacio A
Using a custom mobile application to evaluate the reliability and validity of the Component Timed-Up-and-Go test to assess prosthetic mobility in people with lower limb amputation. Cross-sectional design. National conference for people with limb loss. A total of 118 people with non-vascular cause of lower limb amputation participated. Subjects had a mean age of 48 (±13.7) years and were an average of 10 years post amputation. Of them, 54% ( n = 64) of subjects were male. None. The Component Timed-Up-and-Go was administered using a mobile iPad application, generating a total time to complete the test and five component times capturing each subtask (sit to stand transitions, linear gait, turning) of the standard timed-up-and-go test. The outcome underwent test-retest reliability using intraclass correlation coefficients (ICCs) and convergent validity analyses through correlation with self-report measures of balance and mobility. The Component Timed-Up-and-Go exhibited excellent test-retest reliability with ICCs ranging from .98 to .86 for total and component times. Evidence of discriminative validity resulted from significant differences in mean total times between people with transtibial (10.1 (SD: ±2.3)) and transfemoral (12.76 (SD: ±5.1) amputation, as well as significant differences in all five component times ( P < .05). Convergent validity of the Component Timed-Up-and-Go was demonstrated through moderate correlations with the PLUS-M ( r s = -.56). The Component Timed-Up-and-Go is a reliable and valid clinical tool for detailed assessment of prosthetic mobility in people with non-vascular lower limb amputation. The iPad application provided a means to easily record data, contributing to clinical utility.
Aim: To construct normal values for the tests of the psychometric hepatic encephalopathy score (PHES) and evaluate the prevalence of minimal hepatic encephalopathy (MHE) among Turkish patients with liver cirrhosis. Materials and Methods: One hundred and eighty-five healthy subjects and sixty patients with liver ...
Burger, Helena; Franchignoni, Franco; Puzic, Natasa; Giordano, Andrea
The objective of this study was to evaluate by means of classical test theory and Rasch analysis the scaling characteristics and psychometric properties of the Fatigue Severity Scale (FSS) in polio survivors. A questionnaire, consisting of five general questions (sex, age, age at time of acute polio, sequelae of polio, and new symptoms), the FSS,…
Slaney, Kathleen L.; Tkatchouk, Masha; Gabriel, Stephanie M.; Maraun, Michael D.
The aim of the current study is twofold: (a) to investigate the rates at which researchers assess and report on the psychometric properties of the measures they use in their research and (b) to examine whether or not researchers appear to be generally employing sound/unsound rationales when it comes to how they conduct test evaluations. Based on a…
Monsen, Jeremy J.; Ewing, Donna L.; Boyle, James
This paper presents the psychometric properties of a questionnaire measure that updates and extends Larrivee and Cook's (1979) Opinions Relative to Mainstreaming Scale in terms of structure, terminology, and language. The revised scale was tested using a sample of 106 teachers based in inclusive mainstream schools. Using Principal Component…
Carpenter, Brian D.; Balsis, Steve; Otilingam, Poorni G.; Hanson, Priya K.; Gatz, Margaret
Purpose: This study provides preliminary evidence for the acceptability, reliability, and validity of the new Alzheimer's Disease Knowledge Scale (ADKS), a content and psychometric update to the Alzheimer's Disease Knowledge Test. Design and Methods: Traditional scale development methods were used to generate items and evaluate their psychometric…
Full Text Available Abstract Background No control tools for nasal congestion (NC are currently available in Spanish. This study aimed to adapt and validate the Congestion Quantifier Seven Item Test (CQ7 for Spain. Methods CQ7 was adapted from English following international guidelines. The instrument was validated in an observational, prospective study in allergic rhinitis patients with NC (N = 166 and a control group without NC (N = 35. Participants completed the CQ7, MOS sleep questionnaire, and a measure of psychological well-being (PGWBI. Clinical data included NC severity rating, acoustic rhinometry, and total symptom score (TSS. Internal consistency was assessed using Cronbach's alpha and test-retest reliability using the intraclass correlation coefficient (ICC. Construct validity was tested by examining correlations with other outcome measures and ability to discriminate between groups classified by NC severity. Sensitivity and specificity were assessed using Area under the Receiver Operating Curve (AUC and responsiveness over time using effect sizes (ES. Results Cronbach's alpha for the CQ7 was 0.92, and the ICC was 0.81, indicating good reliability. CQ7 correlated most strongly with the TSS (r = 0.60, p Conclusions The Spanish version of the CQ7 is appropriate for detecting, measuring, and monitoring NC in allergic rhinitis patients.
Khanjari, Sedigheh; Oskouie, Fatemeh; Langius-Eklöf, Ann
To translate and test the reliability and validity of the Persian version of the Caregiver Quality of Life Index-Cancer scale. Research across many countries has determined quality of life of cancer patients, but few attempts have been made to measure the quality of life of family caregivers of patients with breast cancer. The Caregiver Quality of Life Index-Cancer scale was developed for this purpose, but until now, it has not been translated into or tested in the Persian language. Methodological research design. After standard translation, the 35-item Caregiver Quality of Life Index-Cancer scale was administered to 166 Iranian family caregivers of patients with breast cancer. A confirmatory factor analysis was carried out using LISREL to test the scale's construct validity. Further, the internal consistency and convergent validity of the instrument were tested. For convergent validity, four instruments were used in the study: sense of coherence scale, spirituality perspective scale, health index and brief religious coping scale. The confirmatory factor analysis resulted in the same four-factor structure as the original, though, with somewhat different item loadings. The Persian version of the Caregiver Quality of Life Index-Cancer scales had satisfactory internal consistency (0·72-0·90). Tests of convergent validity showed that all hypotheses were confirmed. A hierarchical multiple regression analysis additionally confirmed the convergent validity between the total Caregiver Quality of Life Index-Cancer score and sense of coherence (β = 0·34), negative religious coping (β = -0·21), education (β = 0·24) and the more severe stage of breast cancer (β = 0·23), in total explaining 41% of the variance. The Persian version of the Caregiver Quality of Life Index-Cancer scale could be a reliable and valid measure in Iranian family caregivers of patients with breast cancer. The Persian version of the Caregiver Quality of Life Index-Cancer scale is simple to
Feinberg, Richard A.; Rubright, Jonathan D.
Simulation studies are fundamental to psychometric discourse and play a crucial role in operational and academic research. Yet, resources for psychometricians interested in conducting simulations are scarce. This Instructional Topics in Educational Measurement Series (ITEMS) module is meant to address this deficiency by providing a comprehensive…
Yang, Luke; Liu, Yung-Fang; Sun, Huey-Fang; Chiang, Hsien-Hsien; Tsai, Yu-Lun; Liaw, Jen-Jiuan
The study purpose was to examine the validities and reliabilities of the Chinese-versions Frommelt Attitudes Toward Care of the Dying Scale (Attitudes Scale) and Caregiving Behaviors Scale for End-of-Life Patients and Families (Behaviors Scale). The scales were tested in a convenience sample of 318 nurses with ≥6 months work experience at three hospitals. Cronbach's alphas of the Attitudes and Behaviors Scales were .90 and .96, respectively. Each scale had Kaiser-Meyer-Olkin index >.85 and Bartlett's test of sphericity >4000 ( p < .001). Attitudes Scale loaded on three factors: respecting and caring for dying patients and families, avoiding care of the dying, and involving patients and families in end-of-life care. The Behaviors Scale loaded on two factors: supporting dying patients and families, and helping families cope with grief. Factor loadings for both scales were ≥.49. Both Attitudes and Behaviors Scales are reliable and valid for evaluating nurses' attitudes and caregiving behaviors for the dying.
Radley, S C; Jones, G L; Tanguy, E A; Stevens, V G; Nelson, C; Mathers, N J
To develop and evaluate a Web-based, electronic pelvic floor symptoms assessment questionnaire (e-PAQ)1 for women. A cross-sectional study in primary and secondary care. Two general practices, two community health clinics and a secondary care urogynaecology clinic. A total of 432 women (204 in primary care and 228 in secondary care) were recruited between June 2003 and January 2004. The e-PAQ was located on a workstation (computer, touchscreen and printer). Women completed the e-PAQ prior to their appointment. Untreated women in primary care were asked to return seven days later to complete the e-PAQ a second time (test-retest). Factor analysis, reliability, validity, patient satisfaction, completion times and system costs. In secondary care, factor analysis identified 14 domains within the four dimensions (urinary, bowel, vaginal and sexual symptoms) with internal consistency (Cronbach's alpha)>or=0.7 in 11 of these. In primary care, alpha values were all>or=0.7 and test-retest analysis found acceptable intraclass correlations of 0.50-0.95 (PPAQ offers a user-friendly clinical tool, which provides valid and reliable data. The system offers comprehensive symptoms and quality of life evaluation and may enhance the clinical episode as well as the quality of care for women with pelvic floor disorders.
Santo, Ruth Miyuki; Ribeiro-Ferreira, Felipe; Alves, Milton Ruiz; Epstein, Jonathan; Novaes, Priscila
To provide a reliable, validated, and culturally adapted instrument that may be used in monitoring dry eye in Brazilian patients and to discuss the strategies for the enhancement of the cross-cultural adaptation and validation process of a self-report measure for dry eye. The cross-cultural adaptation process (CCAP) of the original Ocular Surface Disease Index (OSDI) into Brazilian-Portuguese was conducted using a 9-step guideline. The synthesis of translations was tested twice, for face and content validity, by different subjects (focus groups and cognitive interviews). The expert committee contributed on several steps, and back translations were based on the final rather than the prefinal version. For validation, the adapted version was applied in a prospective longitudinal study to 101 patients from the Dry Eye Clinic at the General Hospital of the University of São Paulo, Brazil. Simultaneously to the OSDI, patients answered the short form-36 health survey (SF-36) and the 25-item visual function questionnaire (VFQ-25) and underwent clinical evaluation. Internal consistency, test-retest reliability, and measure validity were assessed. Cronbach's alpha value of the cross-culturally adapted Brazilian-Portuguese version of the OSDI was 0.905, and the intraclass correlation coefficient was 0.801. There was a statistically significant difference between OSDI scores in patients with dry eye (41.15 ± 27.40) and without dry eye (17.88 ± 17.09). There was a negative association between OSDI and VFQ-25 total score (P adaptation process requires skill, knowledge, experience, and a considerable investment of time to maximize the attainment of semantic, idiomatic, experiential, and conceptual equivalence between the source and target questionnaires. A well-established guideline resulted in a culturally adapted Brazilian-Portuguese version of the OSDI, tested and validated on a sample of Brazilian population, and proved to be a valid and reliable instrument for assessing
Vescia Vieira de Alencar Caldas
Full Text Available OBJECTIVE: To validate the Leganés cognitive test (LCT for cognitive screening in low educated elderly Brazilians. METHODS: The study sample was composed of 59 elderly residents from the city of Santa Cruz, in Brazil, with low schooling levels. Reliability was analyzed with a two-day interval between assessments, and concurrent validity was assessed using the Mini Mental State Examination (MMSE. RESULTS: According to the LCT, the prevalence of dementia was 11.8%. The scale items showed a moderate to strong correlation between domains (p<0.01, and inter-rater reliability exhibited ICC=0.81, 95%CI=0.72-0.88. The factor analysis resulted in two factors: memory and orientation. Interscale agreement was considered poor (k=-0.02, supporting the hypothesis of an educational impact on final MMSE scores. CONCLUSION: The results suggest that LCT has acceptable levels of reliability for use in low-educated Brazilian elderly.
Full Text Available Abstract Background Aging entails not only a decrease in the ability to be active, but also a trend toward increased dependence to sustain basic life functions. An important aspect for appropriately elucidating the individual's care needs is the ability to measure them both simply and reliably. Since 2006 a new version of the Time in Care needs (TIC-n instrument (19-item version has been explored and used in one additional municipality with the same structure as the one described in an earlier study. Methods The TIC-n assessment was conducted on a total of 1282 care recipients. Factor analysis (principal component was applied to explore the construct validity of the TIC-n. Cronbach's alpha was calculated to test reliability and for each of the items remaining in the instrument after factor analysis, an inter-rater comparison was carried out on all recipients in both municipalities. Independently of each other, a weighted Kappa (Kw was calculated. Results. The mean of each weighted Kappa (Kw for the dimensions in the two municipalities was 0.75 and 0.76, respectively. Factor analysis showed that all 19 items had a factor loading of ≥ 0.40. Three factors (General Care, Medical Care and Cognitive Care were created. Conclusion The TIC-n instrument has now been tested for validity and reliability in two municipalities with satisfactory results. However, TIC-n can not yet be used as a golden standard, but it can be recommended for use of measurement of individual care needs in municipal elderly care.
Igwesi-Chidobe, Chinonso N; Obiekwe, Chinwe; Sorinola, Isaac O; Godfrey, Emma L
Cross-culturally adapt and validate the Igbo Roland Morris Disability Questionnaire. Cross-cultural adaptation, test-retest, and cross-sectional psychometric testing. Roland Morris Disability Questionnaire was forward and back translated by clinical/non-clinical translators. An expert committee appraised the translations. Twelve participants with chronic low back pain pre-tested the measure in a rural Nigerian community. Internal consistency using Cronbach's alpha; test-retest reliability using intra-class correlation coefficient and Bland-Altman plot; and minimal detectable change were investigated in a convenient sample of 50 people with chronic low back pain in rural and urban Nigeria. Pearson's correlation analyses using the eleven-point box scale and back performance scale, and exploratory factor analysis were used to examine construct validity in a random sample of 200 adults with chronic low back pain in rural Nigeria. Ceiling and floor effects were investigated in the two samples. Modifications gave the option of interviewer-administration and reflected Nigerian social context. The measure had excellent internal consistency (α = 0.91) and intraclass correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity, and a predominant uni-dimensional structure, with no ceiling or floor effects. Igbo Roland Morris Disability Questionnaire is a valid and reliable measure of pain-related disability. Implications for rehabilitation Low back pain is the leading cause of years lived with disability worldwide, and is particularly prevalent in rural Nigeria, but there are no self-report measures to assess its impact due to low literacy rates. This study describes the cross-cultural adaptation and validation of a core self-report back pain specific disability measure in a low-literate Nigerian population. The Igbo Roland Morris Disability Questionnaire is a reliable and valid measure of self
Lehmann, V.; Ouwens, M.A.; Braeken, J.; Danner, U.N.; van Elburg, A.A.; Bekker, M.H.J.; Breurkens, A.; van Strien, T.
The psychometric properties of the Dutch version of the Eating Disorder Inventory–3 (EDI-3) were tested in eating disordered patients (N = 514) using confirmatory factor analyses, variance decomposition, reliabilities, and receiver operating characteristic (ROC) curve analyses. Factorial validity
Vickers, Andrew J; Chen, Ling Y
New technologies to collect patient - reported outcomes have substantially solved the challenge of integrating a questionnaire in a busy clinical practice. At Memorial Sloan Kettering, we have been collecting patient reported outcomes electronically for many years. Our experience confirms the predicted benefits of obtaining patient reported outcomes but has also raised serious concerns about whether instruments developed for the research setting are appropriate for routine clinical use. We summarize four principles for a clinically - relevant psychometrics. First, minimize patient burden: the use of a large number of items for a single domain may be of interest for research but additional items have little clinical utility. Secondly, use simplified language: patients who do not have good language skills are typically excluded from research studies but will nonetheless present in clinical practice. Third, avoid dumb questions: many questionnaire items are inappropriate when applied to a more general population. Fourth, what works for the group may not work for the individual: group level statistics used to validate survey instruments can obscure problems when applied to a subgroup of patients. There is a need for a clinically-oriented psychometrics to help design, test, and evaluate questionnaires that would be used in routine practice. Developing statistical methods to optimize questionnaires will be highly challenging but needed to bring the potential of patient reported outcomes into widespread clinical use.
Kemp, Joanne L; Collins, Natalie J; Roos, Ewa M.
Patient-reported outcomes (PROs) are considered the gold standard when evaluating outcomes in a surgical population. While the psychometric properties of some PROs have been tested, the properties of newer PROs in patients undergoing hip arthroscopic surgery remain somewhat unknown.......Patient-reported outcomes (PROs) are considered the gold standard when evaluating outcomes in a surgical population. While the psychometric properties of some PROs have been tested, the properties of newer PROs in patients undergoing hip arthroscopic surgery remain somewhat unknown....
May 1, 2018 ... (HCEQ) among Iranian Reproductive Age Women: Persian Version .... satisfaction, improved interaction with others and .... Private employee 75 (13.7) ... Married. 526 (95.8). Unmarried. 23 (4.2). Frequency of receiving.
Aronow, E; Reznikoff, M; Moreland, K L
Various approaches to the Rorschach Technique are described in terms of the idiographic-nomothetic axis and the perceptual-content axis. It is suggested that it is most productive to view the Rorschach as a projective tool, with perceptual scoring a secondary factor. Current efforts at objectification of the Rorschach are not seen as useful as efforts to enhance its projective qualities. Some possible ways are discussed in which the projective value of the instrument can be maximized.
Various studies have been carried out on emotional intelligence. But few of these studies have been able to link emotional intelligence to creativity, innovation and entrepreneurship. Importantly, the fact that creativity, innovation and entrepreneurship are strongly driven by positive emotions cannot be ignored. This study ...
Durak, Mithat; Senol-Durak, Emre; Gencoz, Tulin
This study aims to extensively examine the psychometric properties of adapted version of the Satisfaction with Life Scale (SWLS) in different Turkish samples. In order to test the psychometric properties of the SWLS three separate and independent samples are utilized in this study, namely university students (n = 547), correctional officers (n =…
Richardson, George B; Sanning, Blair K; Lai, Mark H C; Copping, Lee T; Hardesty, Patrick H; Kruger, Daniel J
This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS), provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a) use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b) confirm the dimensionality of life history indicators, and (c) establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested-there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
George B. Richardson
Full Text Available This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS, provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b confirm the dimensionality of life history indicators, and (c establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested—there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
Epskamp, S.; Rhemtulla, M.; Borsboom, D.
We introduce the network model as a formal psychometric model, conceptualizing the covariance between psychometric indicators as resulting from pairwise interactions between observable variables in a network structure. This contrasts with standard psychometric models, in which the covariance between
Practitioners in health sciences education and assessment regularly use a range of psychometric techniques to analyse data, evaluate models, and make crucial progression decisions regarding student learning. However, a recent editorial entitled "Is Psychometrics Science?" highlighted some core epistemological and practical problems in psychometrics, and brought its legitimacy into question. This paper attempts to address these issues by applying some key ideas from history and philosophy of science (HPS) discourse. I present some of the conceptual developments in HPS that have bearing on the psychometrics debate. Next, by shifting the focus onto what constitutes the practice of science, I discuss psychometrics in action. Some incorrectly conceptualize science as an assemblage of truths, rather than an assemblage of tools and goals. Psychometrics, however, seems to be an assemblage of methods and techniques. Psychometrics in action represents a range of practices using specific tools in specific contexts. This does not render the practice of psychometrics meaningless or futile. Engaging in debates about whether or not we should regard psychometrics as 'scientific' is, however, a fruitless enterprise. The key question and focus should be whether, on what grounds, and in what contexts, the existing methods and techniques used by psychometricians can be justified or criticized.
Kane, Michael T.
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Caselli, G; Fernie, B; Canfora, F; Mascolo, C; Ferrari, A; Antonioni, M; Giustina, L; Donato, G; Marcotriggiani, A; Bertani, A; Altieri, A; Pellegrini, E; Spada, MM
Recent research has suggested that metacognitions may play a role across the spectrum of addictive behaviours. The goal of our studies was to develop the first self-report scale of metacognitions about gambling. We conducted three studies with one community (n = 165) and two clinical (n = 110; n = 87) samples to test the structure and psychometric properties of the Metacognitions about Gambling Questionnaire and examined its capacity to prospectively predict severity of gambling. Findings sup...
Morrison, Todd G; Bishop, C J; Morrison, Melanie A; Parker-Taneo, Kandice
Discrimination against sexual minorities is widespread and has deleterious consequences on victims' psychological and physical wellbeing. However, a review of the psychometric properties of instruments measuring lesbian, gay, and bisexual (LGB) discrimination has not been conducted. The results of this review, which involved evaluating 162 articles, reveal that most have suboptimal psychometric properties. Specifically, myriad scales possess questionable content validity as (1) items are not created in collaboration with sexual minorities; (2) measures possess a small number of items and, thus, may not sufficiently represent the domain of interest; and (3) scales are "adapted" from measures designed to examine race- and gender-based discrimination. Additional limitations include (1) summed scores are computed, often in the absence of scale score reliability metrics; (2) summed scores operate from the questionable assumption that diverse forms of discrimination are necessarily interrelated; (3) the dimensionality of instruments presumed to consist of subscales is seldom tested; (4) tests of criterion-related validity are routinely omitted; and (5) formal tests of measures' construct validity are seldom provided, necessitating that one infer validity based on the results obtained. The absence of "gold standard" measures, the attendant difficulty in formulating a coherent picture of this body of research, and suggestions for psychometric improvements are noted.
Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C.
Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric evaluation). Classical Test Theory and Item Response Theory provide two psychometric frameworks for evaluating the quality of assessment tools. We discuss how these theories can be applied to assessment tools generally and then apply them to the Digital Logic Concept Inventory (DLCI). We demonstrate that the DLCI is sufficiently reliable for research purposes when used in its entirety and as a post-course assessment of students' conceptual understanding of digital logic. The DLCI can also discriminate between students across a wide range of ability levels, providing the most information about weaker students' ability levels.
Paula Inez Cunha Gomide
Full Text Available Abstract The development of forensic evaluation scales is fundamental. This study's purpose was to explore the psychometric properties of a parental alienation scale. Forensic technicians completed 193 scales concerning parents involved in a lawsuit: 48 families with at least one parent indicated as the alienator (group A and 48 families with no parental alienation claim (group B. The scale consisted of five categories and 69 items: denying access to the child; derogatory comparisons; emotional manipulation; behavior of parent and child during assessment. The results show Cronbach's alpha = .965 and split-half = .745; KMO = .884 and Bartlett's sphericity test ( p < .001. Concurrent criterion validity applied to data showed that the scale is able to distinguish between the alienator and target parent. The results showed significant and consistent standards in the instrument's psychometric characteristics.
Karimy, Mahmood; Fakhri, Ahmad; Vali, Esmaeel; Vali, Farzaneh; Veiga, Feliciano H; Stein, L A R; Araban, Marzieh
Growing evidence indicates that if disruptive behavior is left unidentified and untreated, a significant proportion of these problems will persist and may develop into problems linked with delinquency, substance abuse, and violence. Research is needed to develop valid and reliable measures of disruptive behavior to assist recognition and impact of treatments on disruptive behavior. The aim of this study was to develop and evaluate the psychometric properties of a scale for disruptive behavior in adolescents. Six hundred high school students (50% girls), ages ranged 15-18 years old, selected through multi stage random sampling. Psychometrics of the disruptive behavior scale for adolescents (DISBA) (Persian version) was assessed through content validity, explanatory factor analysis (EFA) using Varimax rotation and confirmatory factor analysis (CFA). The reliability of this scale was assessed via internal consistency and test-retest reliability. EFA revealed four factors accounting for 59% of observed variance. The final 29-item scale contained four factors: (1) aggressive school behavior, (2) classroom defiant behavior, (3) unimportance of school, and (4) defiance to school authorities. Furthermore, CFA produced a sufficient Goodness of Fit Index > 0.90. Test-retest and internal consistency reliabilities were acceptable at 0.85 and 0.89, respectively. The findings from this study suggest that the Iranian version of DISBA questionnaire has content validity. Further studies are needed to evaluate stronger psychometric properties for DISBA.
van Ballegooijen, Wouter; Riper, Heleen; Cuijpers, Pim; van Oppen, Patricia; Smit, Johannes H
Online questionnaires for measuring common mental health disorders such as depression and anxiety disorders are increasingly used. The psychometrics of several pen-and-paper questionnaires have been re-examined for online use and new online instruments have been developed and tested for validity as well. This study aims to review and synthesise the literature on this subject and provide a framework for future research. We searched Medline and PsycINFO for psychometric studies on online instruments for common mental health disorders and extracted the psychometric data. Studies were coded and assessed for quality by independent raters. We included 56 studies on 62 online instruments. For common instruments such as the CES-D, MADRS-S and HADS there is mounting evidence for adequate psychometric properties. Further results are scattered over different instruments and different psychometric characteristics. Few studies included patient populations. We found at least one online measure for each of the included mental health disorders and symptoms. A small number of online questionnaires have been studied thoroughly. This study provides an overview of online instruments to refer to when choosing an instrument for assessing common mental health disorders online, and can structure future psychometric research.
Full Text Available An adaption of McConahay, Harder and Batts’ (1981 moderm racism scale is presented for Chilean population andits psychometric properties, (reliability and validity are studied, along with its relationship with other relevantpsychosocial variables in studies on prejudice and ethnic discrimination (authoritarianism, religiousness, politicalposition, etc., as well as with other forms of prejudice (gender stereotypes and homophobia. The sample consistedof 120 participants, students of psychology, resident in the city of Antofagasta (a geographical zone with a highnumber of Latin-American inmigrants. Our findings show that the scale seems to be a reliable instrument to measurethe prejudice towards Bolivian immigrants in our social environment. Likewise, important differences among thesubjects are detected with high and low scores in the psychosocial variables used.
陳柏熹 Po-Hsi Chen
Full Text Available 本研究目的旨在發展大學生基本素養測驗並進行信度與效度評估。藉由分析國內大專院校的通識教育目標和核心素養，並參考ATC21S 提出的21世紀現代學生需具備的10 項基本素養，歸納出大學生基本素養測驗的九項素養，分別為：溝通合作、美感素養、科學思辨、資訊素養、終身學習、創新領導、問題解決、公民社會及生涯發展。測驗形式為線上多媒體情境式題型，每個題本均包含九項素養的內容，每項素養皆有二至三個題組。研究對象為全國大專校院一至四年級學生，研究樣本來自20 校10,958名大學生。由效度評估結果可知，大學生基本素養測驗的題組效果不大，可以忽略，並採用部分計分模式來估計，幾乎所有試題與模式都能適配，顯示建構效度良好。試題發展過程均歷經嚴謹修審題程序，取得良好專家效度證據。此外，不同性別和年級的學生在各素養的表現上差異不大，和過去的文獻相符合，具有良好的效標關聯效度。信度證據方面，各素養能力估計誤差約在 .20～ .60 logit 之間，單一題本的信度高於 .69，顯示本測驗題數雖少，但信度大致良好。整體而言，大學生基本素養測驗具良好的信度與效度。 This study evaluated the psychometric properties of the General Literacy Test for University Students. To develop the assessment framework, the educational objectives of general literacy courses of universities in Taiwan as well as the core competencies of Assessment and Teaching of 21st Century Skills were all reviewed and considered. The general literacy test is composed of nine literacy domains: communication and collaboration, esthetics, information, lifelong learning, career, leadership, problem solving, social concerns and citizenship, and scientific thinking. The items of the general literacy test were developed into a
Stone, L.L.; Janssens, J.M.A.M.; Vermulst, A.A.; Maten, M.L. van der; Engels, R.C.M.E.; Otten, R.
Background The Strengths and Difficulties Questionnaire is one of the most employed screening instruments. Although there is a large research body investigating its psychometric properties, reliability and validity are not yet fully tested using modern techniques. Therefore, we investigate
Outcome measures with good reliability, validity, responsiveness, and low burden of administration are clinically useful. The Oswestry Disability Index (ODI) is one of the most commonly used outcome measures for individuals with low back pain. Psychometric properties of the ODI will determine the questionnaire's suitability as a useful clinical tool. A literature search of relevant databases on psychometric evaluation of the ODI was performed. The search was done using the key words disability evaluation, and low back pain, and questionnaires, and reproducibility of results, and the term Oswestry. Inclusion criterion was direct reference regarding psychometric property, interpretability, and burden being included in the abstract. Eight articles met the inclusion criterion. The ODI shows good construct validity; internal consistency is rated as acceptable; test-retest reliability and responsiveness have been shown to be high; and burden of administration is low. The ODI is a valid, reliable, and responsive condition-specific assessment tool that is suited for use in clinical practice. It is easy to administer and score, objectifies clients' complaints, and monitors effects of therapy.
Tylka, Tracy L; Wood-Barcalow, Nichole L
Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.
Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
Validation of the Psychometric Properties of the Self-Compassion Scale. Testing the Factorial Validity and Factorial Invariance of the Measure among Borderline Personality Disorder, Anxiety Disorder, Eating Disorder and General Populations.
Costa, Joana; Marôco, João; Pinto-Gouveia, José; Ferreira, Cláudia; Castilho, Paula
During the last years, there has been a growing interest in self-compassion. Empirical evidences show that self-compassion is associated with psychological benefits among young adults and it might be considered a buffer factor in several mental disorders. The aim of this study was to validate the psychometric properties of the Self-compassion Scale (SCS: Neff, 2003a) after the initial lack of replicating the original six-factor structure. Data were collected from the overall database of a research centre (56 men and 305 women; mean age = 25.19) and comprised four groups: borderline personality disorder, anxiety disorder, eating disorder and general population. Confirmatory factor analysis supported a two-factor model (self-compassionate attitude versus self-critical attitude) with good internal consistencies, construct-related validity and external validity. Configural, weak measurement and structural invariance of the two-factor model of SCS were also shown. Findings support the generalizability of the two-factor model and show that both properties and interpretations of scores on self-compassion are equivalent across these population groups. Copyright © 2015 John Wiley & Sons, Ltd. A two-factor structure of SCS with strong psychometric validity was supported in clinical and non-clinical samples. Helping individuals with limited experiences of compassion to develop positive internal processing systems seems to be related with better mental health, self-acceptance and self-nurturing abilities. The non-probabilistic sampling limits the generalization of our conclusions. Copyright © 2015 John Wiley & Sons, Ltd.
Deepak, Kishore K; Al-Umran, Khalid Umran; AI-Sheikh, Mona H; Dkoli, B V; Al-Rubaish, Abdullah
The functionality of distracters in a multiple choice question plays a very important role. We examined the frequency and impact of functioning and non-functioning distracters on psychometric properties of 5-option items in clinical disciplines. We analyzed item statistics of 1115 multiple choice questions from 15 summative assessments of undergraduate medical students and classified the items into five groups by their number of non-functioning distracters. We analyzed the effect of varying degree of non-functionality ranging from 0 to 4, on test reliability, difficulty index, discrimination index and point biserial correlation. The non-functionality of distracters inversely affected the test reliability and quality of items in a predictable manner. The non-functioning distracters made the items easier and lowered the discrimination index significantly. Three non-functional distracters in a 5-option MCQ significantly affected all psychometric properties (p psychometrically as effective as 5-option items. Our study reveals that a multiple choice question with 3 functional options provides lower most limit of item format that has adequate psychometric property. The test containing items with less number of functioning options have significantly lower reliability. The distracter function analysis and revision of nonfunctioning distracters can serve as important methods to improve the psychometrics and reliability of assessment.
Ferrando, Pere J.; Masip-Cabrera, Antoni; Navarro-González, David; Lorenzo-Seva, Urbano
The Psychometric Toolbox (PT) is a user-friendly, non-commercial package mainly intended to be used for instructional purposes in introductory courses of educational and psychological measurement, psychometrics and statistics. The PT package is organized in six separate modules or sub-programs: Data preprocessor (descriptive analyses and data…
Kraus, Shane; Rosenberg, Harold
Despite the prevalence of pornography use, and recent conceptualization of problematic use as an addiction, we could find no published scale to measure craving for pornography. Therefore, we conducted three studies employing young male pornography users to develop and evaluate such a questionnaire. In Study 1, we had participants rate their agreement with 20 potential craving items after reading a control script or a script designed to induce craving to watch pornography. We dropped eight items because of low endorsement. In Study 2, we revised both the questionnaire and cue exposure stimuli and then evaluated several psychometric properties of the modified questionnaire. Item loadings from a principal components analysis, a high internal consistency reliability coefficient, and a moderate mean inter-item correlation supported interpreting the 12 revised items as a single scale. Correlations of craving scores with preoccupation with pornography, sexual history, compulsive internet use, and sensation seeking provided support for convergent validity, criterion validity, and discriminant validity, respectively. The enhanced imagery script did not impact reported craving; however, more frequent users of pornography reported higher craving than less frequent users regardless of script condition. In Study 3, craving scores demonstrated good one-week test-retest reliability and predicted the number of times participants used pornography during the following week. This questionnaire could be applied in clinical settings to plan and evaluate therapy for problematic users of pornography and as a research tool to assess the prevalence and contextual triggers of craving among different types of pornography users.
Wedman, Jonathan; Lyrén, Per-Erik
When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…
Davidson, Charlie A; Lesser, Rebecca; Parente, Lori T; Fiszdon, Joanna M
Social cognition represents an important treatment target, closely linked to everyday social function. While a number of social cognitive interventions have recently been developed, measures used to evaluate these treatments are only beginning to receive psychometric scrutiny. Study goals were to replicate recently-published psychometrics for several social cognitive measures, and to provide information for additional social cognitive measures not included in recent reports. Forty-eight outpatients with psychotic-spectrum disorders completed measures of emotion perception, theory of mind, and attributional bias on two occasions, one month apart. Measures were tested for distributional characteristics, test-retest reliability, utility as a repeated measure, and relationship to symptoms and functioning. For a subgroup of participants, information about sensitivity to social cognitive treatment was also available. We replicated aspects of prior work, including largely favorable psychometric characteristics for the Bell-Lysaker Emotion Recognition Task, and promising but weaker characteristics for The Awareness of Social Inferences Test subscales and Reading the Mind in the Eyes Task. The Hinting Task had adequate test-retest statistics but a more pronounced ceiling effect. Ambiguous Intentions and Hostility Questionnaire data showed evidence of validity but were limited by inconsistency over time. Our results strongly support the Davos Assessment of Cognitive Biases Scale for future evaluation as a social cognitive treatment outcome measure. Its scores were adequately distributed, consistent over time, related to symptoms and functioning, and sensitive to treatment effects. Other relatively novel assessments of attributional bias and theory of mind showed some promise, although more work is needed. Published by Elsevier B.V.
Culpepper, Steven; Janssen, Rianne; González, Jorge; Molenaar, Dylan
This proceedings book highlights the latest research and developments in psychometrics and statistics. Featuring contributions presented at the 82nd Annual Meeting of the Psychometric Society (IMPS), organized by the University of Zurich and held in Zurich, Switzerland from July 17 to 21, 2017, its 34 chapters address a diverse range of psychometric topics including item response theory, factor analysis, causal inference, Bayesian statistics, test equating, cognitive diagnostic models and multistage adaptive testing. The IMPS is one of the largest international meetings on quantitative measurement in psychology, education and the social sciences, attracting over 500 participants and 250 paper presentations from around the world every year. This book gathers the contributions of selected presenters, which were subsequently expanded and peer-reviewed.
Castellanos, Irina; Kronenberger, William G; Pisoni, David B
The psychometric properties of the Learning, Executive, and Attention Functioning (LEAF) scale were investigated in an outpatient clinical pediatric sample. As a part of clinical testing, the LEAF scale, which broadly measures neuropsychological abilities related to executive functioning and learning, was administered to parents of 118 children and adolescents referred for psychological testing at a pediatric psychology clinic; 85 teachers also completed LEAF scales to assess reliability across different raters and settings. Scores on neuropsychological tests of executive functioning and academic achievement were abstracted from charts. Psychometric analyses of the LEAF scale demonstrated satisfactory internal consistency, parent-teacher inter-rater reliability in the small to large effect size range, and test-retest reliability in the large effect size range, similar to values for other executive functioning checklists. Correlations between corresponding subscales on the LEAF and other behavior checklists were large, while most correlations with neuropsychological tests of executive functioning and achievement were significant but in the small to medium range. Results support the utility of the LEAF as a reliable and valid questionnaire-based assessment of delays and disturbances in executive functioning and learning. Applications and advantages of the LEAF and other questionnaire measures of executive functioning in clinical neuropsychology settings are discussed.
Saltychev, Mikhail; Mattie, Ryan; McCormick, Zachary; Bärlund, Esa; Laimi, Katri
The aim of this study was to investigate the psychometric properties of the Oswestry Disability Index (ODI) in a large cross-sectional cohort of individuals with chronic low back pain by defining its internal consistency, construct structure and validity, and its ability to differentiate between different degrees of functional limitation. A total of 837 consecutive outpatient patients with low back pain were studied. The internal consistency of ODI was assessed by Cronbach's α, construct structure by exploratory factor analysis, construct validity by confirmatory factor analysis, and discrimination was determined by item response theory analysis. The ODI showed good internal consistency (α=0.85). Explanatory factor analysis showed that ODI is a unidimensional test measuring functional level and nothing else. The confirmatory factor analysis showed that the standardized regression weights of all ODI items were relatively high, varying from 0.5 to 0.7. The item response theory analysis suggested that eight out of 10 ODI items have a close to perfect ability to measure functional limitations in accordance with the actual severity of disability experienced by the respondents. Discrimination of all the items was high to perfect (1.08-2.01). The test characteristic and test information curves showed that the discriminative ability of the ODI is superior at higher levels of disability. The present data showed that the ODI is an internally consistent, unidimensional scale with overall excellent construct validity and ability to discriminate the severity of functional disability. The analysis suggests that the ODI may better distinguish between the relative degrees of function at above-average disability levels.
Eun-Hyun Lee, RN, PhD
Conclusion: Overall, the PSS is an easy-to-use questionnaire with established acceptable psychometric properties. However, future studies should evaluate these psychometric properties in greater depth, and validate the scale using diverse populations.
Lacroix, Emilie; Alberga, Angela; Russell-Mathew, Shelly; McLaren, Lindsay; von Ranson, Kristin
People living with overweight and obesity often experience weight-based stigmatization. Investigations of the prevalence and correlates of weight bias and evaluation of weight bias reduction interventions depend upon psychometrically-sound measurement. Our paper is the first to comprehensively evaluate the psychometric properties, use of people-first language within items, and suitability for use with various populations of available self-report measures of weight bias. We searched five electronic databases to identify English-language self-report questionnaires of weight bias. We rated each questionnaire's psychometric properties based on initial validation reports and subsequent use, and examined item language. Our systematic review identified 40 original self-report questionnaires. Most questionnaires were brief, demonstrated adequate internal consistency, and tapped key cognitive and affective dimensions of weight bias such as stereotypes and blaming. Current psychometric evidence is incomplete for many questionnaires, particularly with regard to the properties of test-retest reliability, sensitivity to change as well as discriminant and structural validity. Most questionnaires were developed prior to debate surrounding terminology preferences, and do not employ people-first language in the items administered to participants. We provide information and recommendations for clinicians and researchers in selecting psychometrically sound measures of weight bias for various purposes and populations, and discuss future directions to improve measurement of this construct. © 2017 The Author(s) Published by S. Karger GmbH, Freiburg.
Ferriero, Giorgio; Kristensen, Morten T; Invernizzi, Marco
INTRODUCTION: In the geriatric population, independent mobility is a key factor in determining readiness for discharge following acute hospitalization. The Cumulated Ambulation Score (CAS) is a potentially valuable score that allows day-to-day measurements of basic mobility. The CAS was developed...... and validated in older patients with hip fracture as an early postoperative predictor of short-term outcome, but it is also used to assess geriatric in-patients with acute medical illness. Despite the fast- accumulating literature on the CAS, to date no systematic review synthesizing its psychometric properties....... Of 49 studies identified, 17 examined the psychometric properties of the CAS. EVIDENCE SYNTHESIS: Most papers dealt with patients after hip fracture surgery, and only 4 studies assessed the CAS psychometric characteristics also in geriatric in-patients with acute medical illness. Two versions of CAS...
Tilden, V P; Hirsch, A M; Nelson, C A
For norm-referenced measures to be useful in social-behavioral research, investigators who develop measures face several psychometric challenges, including: (a) adequate domain specification; (b) adequate initial evidence of reliability and validity; and (c) ongoing evidence of psychometric quality. The Interpersonal Relationship Inventory (IPRI) was developed in response to gaps in measurement of social relationships, and contributed scales for reciprocity and conflict to a measure of social support. For the IPRI, the first two points were addressed during the period of instrument development. The measure now has been in use for 4 years. This article reports evidence addressing the third challenge: ongoing evidence of psychometric quality. Findings from 19 studies using the IPRI provide compelling evidence for internal consistency reliability and construct validity of the scales.
Saide Umut ZEYBEK
Full Text Available The aim of this study is to investigate the applicability of Coping Attitudes Scale: Measure of Positive Attitudes in Depression (CAS among Turkish young adult community sample and determine the psychometric properties (validity and reliability of this scale. This study was conducted with 419 students attending different departments in Mugla Sitki Kocman University, Faculty of Education in the spring semester of academic year of 2015-2016. Positive Functional Attitudes Scale, Beck Depression Scale, Beck Hopelessness Scale, Automatic Thoughts Scale, Positivity Scale and Developed Automatic Thoughts Scale.were used as data collection tools. Confirmatory factor analysis (CFA were used for investigation of the psychometric properties of the PFAS. Also, criterion-related validity, test-retest validity, and internal consistency were used calculated. The CFA results showed that standardized item estimates of the CAS ranged between 0.45 and 0.47. Also the CFA results showed that the original factor structure of the PFAS confirmed on the Turkish sample. internal consistency was calculated using the total community samples PFAS score. Cronbachs alpha coefficient ort he total scale (.93 was high. Test-retest results of the subscales were 0.76. The findings showed that factor structures of the PFAS life perspective, personal accomplishment, positive future, self-worth, coping with problems had psychometric quality in Turkish version. As a result of the study, the Turkish version of PFAS has good validity and reliability for young adult community sample. [JCBPR 2017; 6(2.000: 59-66
Santos-Iglesias, Pablo; Sierra, Juan Carlos
The study analyzed psychometric properties of a Spanish version of the Hurlbert Index of Sexual Assertiveness in a Spanish sample of 400 men and 453 women who had had a partner for the last 6 mo. or longer at the time of the study. Exploratory and confirmatory factor analyses suggested a two-factor solution with the factors Initiation and No shyness/Refusal. Internal consistency values for total scores were .87 and .83 for the factors, respectively. Convergent validity tests were also satisfactory. It is therefore reasonable to conclude that the Spanish version of the scale has appropriate psychometric properties.
Lundman, Berit; Viglund, Kerstin; Aléx, Lena; Jonsén, Elisabeth; Norberg, Astrid; Fischer, Regina Santamäki; Strandberg, Gunilla; Nygren, Björn
Four dimensions of inner strength were previously identified in a meta-theoretical analysis: firmness, creativity, connectedness, and flexibility. The aim of this study was to develop an Inner Strength Scale (ISS) based on those four dimensions and to evaluate its psychometric properties. An initial version of ISS was distributed for validation purpose with the Rosenberg Self-Esteem Scale, the resilience scale, and the sense of Coherence Scale. A convenience sample of 391 adults, aged 19-90 years participated. Principal component analysis (PCA) and confirmatory factor analysis (CFA) were used in the process of exploring, evaluating, and reducing the 63-item ISS to the 20-item ISS. Cronbach's alpha and test-retest were used to measure reliability. CFA showed satisfactory goodness-of-fit for the 20-item ISS. The analysis supported a fourfactor solution explaining 51% of the variance. Cronbach's alpha on the 20-item ISS was 0.86, and the test-retest showed stability over time (r=0.79). The ISS was found to be a valid and reliable instrument for capturing a multifaceted understanding of inner strength. Further tests of psychometric properties of the ISS will be performed in forthcoming studies. Copyright © 2011 Elsevier Ltd. All rights reserved.
Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela
To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February-April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.
Full Text Available The primary purpose of this study was to evaluate the possibility of using a psychometric approach for assessing supervisory competencies relevant to the mining and refining environment. The competency questionnaire was developed using supervisory roles and registered supervisory unit standards from the United Kingdom (UK, as no registered unit standards exist in South Africa. Twenty-four supervisors from three departments (Production, Engineering and Laboratory were evaluated by 125 raters; besides by themselves, also by their managers, peers, customers and their sub-ordinates. Based on difference scores derived from the Importance and Performance scales, a single factor was extracted with an internal reliability of 0,965. No statistical significant differences were obtained (ANOVA’s, t-test and F-statistics between groups based on biographical variables or between rater groups. The findings and their implications are further discussed. Opsomming Die primêre doel van die studie was om die moontlikheid vir die gebruik van ’n psigometriese benadering tot toesighouerbevoegdheidsbeoordeling, te evalueer. Die bevoegdheidsvraelys is ontwikkel deur gebruik te maak van toesighouersrolle en geregistreerde toesighouerseenheidstandaarde van die Verenigde Koningkryk, as gevolg van ‘n gebrek aan bestaande eenheidstandaarde in Suid-Afrika. Vier-en-twintig toesighouers van drie departemente (Produksie, Ingenieurswese en Laboratorium is deur 125 beoordelaars geëvalueer; buiten deur hulself, ook deur hul bestuurders, kollegas, kliënte en hul ondergeskiktes. ’n Enkele faktor, met ’n betroubaarheid van 0,965, gebaseer op die verskiltellings van die Prestasie- en Belangrikheidskaal, is onttrek. Geen beduidende verskille (ANOVA’s, t-toetse en F-statistiek kon tussen groepe gebaseer op biografiese veranderlikes en die onderskeie beoordelaarsgroepe gevind word nie. Hierdie bevindinge en die implikasies daarvan word verder bespreek.
Farmer, Ryan L.; Floyd, Randy G.; Reynolds, Matthew R.; Kranzler, John H.
The most global score yielded by intelligence tests, IQs, are supported by substantial validity evidence and have historically been central to the identification of intellectual disabilities, learning disabilities, and giftedness. This study examined the extent to which IQs measure the ability they target, psychometric "g." Data from…
Lovakov, Andrey V.; Agadullina, Elena R.; Schaufeli, Wilmar B.
This article aims to analyze the psychometric properties of the Russian version of the Utrecht Work Engagement Scale (UWES-9) by using a sample of 1783 employees of a large Russian organization. We conducted a series of Confirmatory Factor Analysis (CFA) tests of the factorial structure and the
Khanna, Rahul; Madhavan, S. Suresh; Smith, Michael J.; Tworek, Cindy; Patrick, Julie H.; Becker-Cottrill, Barbara
The purpose of this study was to test the psychometric properties of the Caregiver Strain Questionnaire (CGSQ) among caregivers of children with autism. The CGSQ was originally developed to assess burden experienced by parents of children and adolescents with serious emotional and behavioral disorders. Study data was collected from 304 primary…
Al-Hendawi, Maha; Keller, Clayton; Cloninger, Lea
The Child Behavior Checklist for children 6 to 18 (CBCL/6-18) is a widely used, standardized parent rating scale. However, few studies have tested the psychometric properties of this instrument in the Arab world despite the great need for such instruments to support the identification and education of children with emotional, behavioral, and…
Aronson, David M.; Baum, Steven K.
A new psychometric instrument for measuring the impact of divorce on elementary school age children was developed: the Child's Report of the Impact of Separation by Parents (CRISP). This structured projective test was specifically designed to assess children's postdivorce stress/adjustment. An initial version of the CRISP was administered to 99…
Raykov, Tenko; Marcoulides, George A.; Patelis, Thanos
A critical discussion of the assumption of uncorrelated errors in classical psychometric theory and its applications is provided. It is pointed out that this assumption is essential for a number of fundamental results and underlies the concept of parallel tests, the Spearman-Brown's prophecy and the correction for attenuation formulas as well as…
Brouwer, B.J.M. de; Kaljouw, M.J.; Schoonhoven, L.; Achterberg, T. van
AIMS AND OBJECTIVES: To develop and psychometrically test the Essentials of Magnetism II in nursing homes. BACKGROUND: Increasing numbers and complex needs of older people in nursing homes strain the nursing workforce. Fewer adequately trained staff and increased care complexity raise concerns about
Smith, Leann V.; Cokley, Kevin
The authors investigated the psychometric properties of the Social Identities and Attitudes Scale developed by Picho and Brown, which captures an individual's vulnerability to Stereotype Threat effects. Confirmatory factor analyses and group invariance tests conducted on a diverse sample of 516 college students revealed adequate reliability and…
Moussa, Miriam Taouk; Lovibond, Peter; Laube, Roy; Megahead, Hamido A.
Objective: To translate and evaluate the psychometric properties of an Arabic-language version of the Depression Anxiety Stress Scales (DASS). Method: The items were translated, back translated, refined, and tested in an Australian immigrant sample (N = 220). Results: Confirmatory factor analysis showed that the Arabic DASS discriminates between…
Kim, L. H.; McLeod, R. S.; Kiss, Z. H. T.
Objective. There have been remarkable advances over the past decade in neural prostheses to restore lost motor function. However, restoration of somatosensory feedback, which is essential for fine motor control and user acceptance, has lagged behind. With an increasing interest in using electrical stimulation to restore somatosensory sensations within the peripheral (PNS) and central nervous systems (CNS), it is critical to characterize the percepts evoked by electrical stimulation in a standardized manner with a validated psychometric questionnaire. This will allow comparison of results from applications at various nervous system levels in multiple settings. Approach. We compiled a summary of published reports of somatosensory percepts that were elicited by electrical stimulation in humans and used these to develop a new psychometric questionnaire. Results. This new questionnaire was able to characterize subjective evoked sensations with good test-retest reliability (Spearman’s correlation coefficients ranging 0.716 ⩽ ρ ⩽ 1.000, p ⩽ 0.005) in 13 subjects receiving stimulation through neural implants in both the CNS and PNS. Furthermore, the new questionnaire captured more descriptors (M = 2.65, SD = 0.91) that would have been missed by being categorized as ‘other sensations’, using a previous questionnaire (M = 1.40, SD = 0.77, t(12) = -10.24, p psychometric questionnaire will aid in establishing consistency and standardization of reporting in future studies of somatosensory neural prostheses.
Caselli, Gabriele; Fernie, Bruce; Canfora, Flaviano; Mascolo, Cristina; Ferrari, Andrea; Antonioni, Maria; Giustina, Lucia; Donato, Gilda; Marcotriggiani, Antonella; Bertani, Andrea; Altieri, Antonella; Pellegrini, Eliana; Spada, Marcantonio M
Recent research has suggested that metacognitions may play a role across the spectrum of addictive behaviours. The goal of our studies was to develop the first self-report scale of metacognitions about gambling. We conducted three studies with one community (n = 165) and two clinical (n = 110; n = 87) samples to test the structure and psychometric properties of the Metacognitions about Gambling Questionnaire and examined its capacity to prospectively predict severity of gambling. Findings supported a two factor solution consisting of positive and negative metacognitions about gambling. Internal consistency, predictive and divergent validity were acceptable. All the factors of the Metacognitions about Gambling Questionnaire correlated positively with gambling severity. Regression analyses showed that negative metacognitions about gambling were significantly associated to gambling severity over and above negative affect and gambling-specific cognitive distortions. Finally only gambling severity and negative metacognitions about gambling were significant prospective predictors of gambling severity as measured three months later. The Metacognitions about Gambling Questionnaire was shown to possess good psychometric properties, as well as predictive and divergent validity within the populations that were tested. Copyright © 2018 Elsevier B.V. All rights reserved.
Spada, Marcantonio M; Caselli, Gabriele
Recent research has suggested that metacognitions may play a role across the spectrum of addictive behaviours. The goal of our studies was to develop the first self-report scale of metacognitions about online gaming. We conducted two studies with samples of online gamers (n=225, n=348) to test the structure and psychometric properties of the Metacognitions about Online Gaming Scale and examined its capacity to predict weekly online gaming hours and Internet addiction. Exploratory and confirmatory factor analyses supported a three-factor solution: positive metacognitions about online gaming, negative metacognitions about the uncontrollability of online gaming, and negative metacognitions about the dangers of online gaming. Internal consistency, predictive and divergent validity were acceptable. All the factors of the Metacognitions about Online Gaming Scale correlated positively with weekly online gaming hours and Internet addiction. Regression analyses showed that negative metacognitions about the uncontrollability of online gaming and levels of Internet addiction were the only significant predictors of weekly online gaming hours, and that positive metacognitions about online gaming and negative metacognitions about the uncontrollability of online gaming were the only significant predictors of Internet addiction. The Metacognitions about Online Gaming Scale was shown to possess good psychometric properties, as well as predictive and divergent validity within the populations that were tested. Copyright © 2015 Elsevier Ltd. All rights reserved.
Huggins-Manley, Anne Corinne
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Introduction: The aim of study was to evaluate the psychometric indices Sternberg love scale on married men and women in Iranian society. Methods: The study type is correlation (factor analysis). In this research factor analysis was used that is an exploratory and confirmatory technique to study the structure of a set of data, ...
The present research focuses on the psychometric properties of the Birthday Party measure for ages 3-5. The Birthday Party was developed to provide a reliable, valid, and engaging measure of early mathematical content--Number and Operation, Shape, Space, and Pattern--that can be given in either a short or a long form to English and Spanish…
Cannito, Michael P.
This study examined emotional characteristics of 18 female spasmodic dysphonic subjects in comparison to matched normal controls across psychometric measures of depression, anxiety, and somatic complaints. Statistically significant differences were noted between groups for all measures and over half of the dysphonic subjects exhibited clinically…
Longo, Matthew R.; Schuur, Friederike; Kammers, Marjolein P. M.; Tsakiris, Manos; Haggard, Patrick
What is it like to have a body? The present study takes a psychometric approach to this question. We collected structured introspective reports of the rubber hand illusion, to systematically investigate the structure of bodily self-consciousness. Participants observed a rubber hand that was stroked either synchronously or asynchronously with their…
Dutch Translation and Psychometric Testing of the 9-Item Shared Decision Making Questionnaire (SDM-Q-9) and Shared Decision Making Questionnaire-Physician Version (SDM-Q-Doc) in Primary and Secondary Care.
Rodenburg-Vandenbussche, Sumayah; Pieterse, Arwen H; Kroonenberg, Pieter M; Scholl, Isabelle; van der Weijden, Trudy; Luyten, Gre P M; Kruitwagen, Roy F P M; den Ouden, Henk; Carlier, Ingrid V E; van Vliet, Irene M; Zitman, Frans G; Stiggelbout, Anne M
The SDM-Q-9 and SDM-Q-Doc measure patient and physician perception of the extent of shared decision making (SDM) during a physician-patient consultation. So far, no self-report instrument for SDM was available in Dutch, and validation of the scales in other languages has been limited. The aim of this study was to translate both scales into Dutch and assess their psychometric characteristics. Participants were patients and their treating physicians (general practitioners and medical specialists). Patients (N = 182) rated their consultation using the SDM-Q-9, 43 physicians rated their consultations using the SDM-Q-Doc (N = 201). Acceptability, reliability (internal consistency), and the factorial structure of the instruments were determined. For convergent validity the CPSpost was used. Reliabilities of both scales were high (alpha SDM-Q-9 0.88; SDM-Q-Doc 0.87). The SDM-Q-9 and SDM-Q-Doc total scores correlated as expected with the CPSpost (SDM-Q-9: r = 0.29; SDM-Q-Doc: r = 0.48) and were significantly different between the CPSpost categories, with lowest mean scores when the physician made the decision alone. Principal Component Analyses showed a two-component model for each scale. A confirmatory factor analysis yielded a mediocre, but acceptable, one-factor model, if Item 1 was excluded; for both scales the best indices of fit were obtained for a one-factor solution, if both Items 1 and 9 were excluded. The Dutch SDM-Q-9 and SDM-Q-Doc demonstrate good acceptance and reliability; they correlated as expected with the CPSpost and are suitable for use in Dutch primary and specialised care. Although the best model fit was found when excluding Items 1 and 9, we believe these items address important aspects of SDM. Therefore, also based on the coherence with theory and comparability with other studies, we suggest keeping all nine items of the scale. Further research on the SDM-concept in patients and physicians, in different clinical settings and different countries, is
Hammond Sean M
Full Text Available Abstract Background The quality of the Educational environment is a key determinant of a student centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation use of a comprehensive, valid and reliable instrument is essential. One of most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM. Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89, subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming's indices shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the
Hammond, Sean M
Abstract Background The quality of the Educational environment is a key determinant of a student centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation use of a comprehensive, valid and reliable instrument is essential. One of most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM). Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89), subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming\\'s indices) shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the device in future
Michels, Charlotte TJ; Boulton, Mary; Adams, Astrid; Wee, Bee; Peters, Michele
Background: Informal carers face many challenges in caring for patients with palliative care needs. Selecting suitable valid and reliable outcome measures to determine the impact of caring and carers’ outcomes is a common problem. Aim: To identify outcome measures used for informal carers looking after patients with palliative care needs, and to evaluate the measures’ psychometric properties. Design: A systematic review was conducted. The studies identified were evaluated by independent reviewers (C.T.J.M., M.B., M.P.). Data regarding study characteristics and psychometric properties of the measures were extracted and evaluated. Good psychometric properties indicate a high-quality measure. Data sources: The search was conducted, unrestricted to publication year, in the following electronic databases: Applied Social Sciences Index and Abstracts, Cumulative Index to Nursing and Allied Health Literature, The Cochrane Library, EMBASE, PubMed, PsycINFO, Social Sciences Citation Index and Sociological Abstracts. Results: Our systematic search revealed 4505 potential relevant studies, of which 112 studies met the inclusion criteria using 38 carer measures for informal carers of patients with palliative care needs. Psychometric properties were reported in only 46% (n = 52) of the studies, in relation to 24 measures. Where psychometric data were reported, the focus was mainly on internal consistency (n = 45, 87%), construct validity (n = 27, 52%) and/or reliability (n = 14, 27%). Of these, 24 measures, only four (17%) had been formally validated in informal carers in palliative care. Conclusion: A broad range of outcome measures have been used for informal carers of patients with palliative care needs. Little formal psychometric testing has been undertaken. Furthermore, development and refinement of measures in this field is required. PMID:26407683
Cabeza de Baca, Tomás; Black, Candace Jasmine; García, Rafael Antonio; Fernandes, Heitor Barcellos Ferreira; Wolf, Pedro Sofio Abril; Woodley of Menie, Michael Anthony
Copping, Campbell, and Muncer (2014) have recently published an article critical of the psychometric approach to the assessment of life history (LH) strategy. Their purported goal was testing for the convergent validation and examining the psychometric structure of the High-K Strategy Scale (HKSS). As much of the literature on the psychometrics of human LH during the past decade or so has emanated from our research laboratory and those of close collaborators, we have prepared this detailed response. Our response is organized into four main sections: (1) A review of psychometric methods for the assessment of human LH strategy, expounding upon the essence of our approach; (2) our theoretical/conceptual concerns regarding the critique, addressing the broader issues raised by the critique regarding the latent and hierarchical structure of LH strategy; (3) our statistical/methodological concerns regarding the critique, examining the validity and persuasiveness of the empirical case made specifically against the HKSS; and (4) our recommendations for future research that we think might be helpful in closing the gap between the psychometric and biometric approaches to measurement in this area. Clearly stating our theoretical positions, describing our existing body of work, and acknowledgintheir limitations should assist future researchers in planning and implementing more informed and prudent empirical research that will synthesize the psychometric approach to the assessment of LH strategy with complementary methods. PMID:25844774
Aurelio José Figueredo
Full Text Available Copping, Campbell, and Muncer (2014 have recently published an article critical of the psychometric approach to the assessment of life history (LH strategy. Their purported goal was testing for the convergent validation and examining the psychometric structure of the High-K Strategy Scale (HKSS. As much of the literature on the psychometrics of human LH during the past decade or so has emanated from our research laboratory and those of close collaborators, we have prepared this detailed response. Our response is organized into four main sections: (1 A review of psychometric methods for the assessment of human LH strategy, expounding upon the essence of our approach; (2 our theoretical/conceptual concerns regarding the critique, addressing the broader issues raised by the critique regarding the latent and hierarchical structure of LH strategy; (3 our statistical/methodological concerns regarding the critique, examining the validity and persuasiveness of the empirical case made specifically against the HKSS; and (4 our recommendations for future research that we think might be helpful in closing the gap between the psychometric and biometric approaches to measurement in this area. Clearly stating our theoretical positions, describing our existing body of work, and acknowledging their limitations should assist future researchers in planning and implementing more informed and prudent empirical research that will synthesize the psychometric approach to the assessment of LH strategy with complementary methods.
Psychometric properties of the portuguese version of the Jebsen-Taylor test for adults with mild hemiparesis Avaliação das propriedades pscicométricas da versão em português do teste de Jebsen Taylor para adultos com hemiparesia leve
Karina N. Ferreiro
Full Text Available OBJECTIVES: To evaluate the psychometric properties of the Portuguese version of the Jebsen-Taylor Test (JTT in patients with stroke. METHODS: Forty participants who suffered a stroke in the cerebral hemisphere were videotaped while performing the JTT. Scores were defined by the time taken to perform the tasks, and two physical therapists evaluated the performance of the participants. Intra- and inter-rater reliability was defined by intraclass correlation coefficients (ICC through videotape analysis. Cronbach's alpha and Pearson's correlation coefficient (r were used to measure the internal consistency of the scale. Confidence intervals (CI were calculated, and the influence of handedness and educational level on the JTT scores was evaluated. RESULTS: Inter-rater (ICC = 1.0; CI, 1.0-1.0 and intra-rater reliabilities (ICC=0.997; CI, 0.995-0.998 were excellent. Regarding internal consistency, Cronbach's α was 0.924. The item "writing a sentence" was less consistent than the other items (Cronbach's alpha=0.884. Pearson's r (item score - total score was lower for the item "small objects" (r=0.657. There was no significant influence of handedness or educational level on the JTT scores. CONCLUSIONS: Videotaping test performances can be a useful tool in multicenter studies if inter-rater reliability is appropriate. The inter- and intra-rater reliabilities of the Portuguese version of the JTT were excellent in patients with stroke. The JTT can be a valuable tool for evaluating dexterity in research protocols aiming at efficacy of rehabilitation interventions.OBJETIVOS: Avaliar as propriedades psicométricas da versão em Português do teste de Jebsen-Taylor (TJT em pacientes com acidente vascular encefálico (AVE. MÉTODOS: Quarenta pacientes com AVEs em hemisférios cerebrais foram filmados enquanto realizaram o TJT. A pontuação no teste é definida pelo tempo de execução de tarefas motoras. Duas fisioterapeutas avaliaram o desempenho dos
Pahlevan Sharif, Saeed
Purpose The purpose of this paper is to develop and evaluate psychometrically an instrument named the Breast Size Satisfaction Scale (BSSS) to assess breast size satisfaction. Design/methodology/approach The present scale was developed using a set of 16 computer-generated 3D images of breasts to overcome some of the limitations of existing instruments. The images were presented to participants and they were asked to select the figure that most accurately depicted their actual breast size and the figure that most closely represented their ideal breast size. Breast size satisfaction was computed by subtracting the absolute value of the difference between ideal and actual perceived size from 16, such that higher values indicate greater breast size satisfaction. Findings Study 1 ( n=65 female undergraduate students) showed good test-retest reliability and study 2 ( n=1,000 Iranian women, aged 18 years and above) provided support for convergent validity using a nomological network approach. Originality/value The BSSS demonstrated good psychometric properties and thus can be used in future studies to assess breast size satisfaction among women.
Lundgren, Tobias; Parling, Thomas
Psychological inflexibility and experiential avoidance are equivalent (with somewhat different connotations) concepts and refer to an unwillingness to remain in contact with particular private events. This concept is most often measured by the Acceptance and Action Questionnaire (AAQ-II) and is strongly related to psychopathology and behavioral effectiveness. In this study, the preliminary psychometric properties of the Swedish version of the AAQ-II (Swedish Acceptance and Action Questionnaire-SAAQ) are presented. The study is done in two steps. In the first step, the 10-item version of the AAQ-II is investigated through principal component analysis (n = 147). Secondly, due to problems with the component structure, the instrument is reduced to a six-item version and its validity and internal consistency are investigated (n = 154). The six-item version shows good concurrent and convergent validity as well as satisfying internal consistency (α = .85). Furthermore, the Swedish six-item version of the AAQ-II showed one strong component. Test-retest reliability was satisfactory (r = .80; n = 228). In future research, predictive and external validity would be important to investigate in order to further ensure that the SAAQ is a useful measure for clinical research. In conclusion, the SAAQ has satisfactory psychometric properties, but more data need to be gathered to further explore the possibilities for the instruments in Swedish contexts.
Graffigna, Guendalina; Barello, Serena; Bonanomi, Andrea; Lozza, Edoardo
Beyond the rhetorical call for increasing patients' engagement, policy makers recognize the urgency to have an evidence-based measure of patients' engagement and capture its effect when planning and implementing initiatives aimed at sustaining the engagement of consumers in their health. In this paper, authors describe the Patient Health Engagement Scale (PHE-scale), a measure of patient engagement that is grounded in rigorous conceptualization and appropriate psychometric methods. The scale was developed based on our previous conceptualization of patient engagement (the PHE-model). In particular, the items of the PHE-scale were developed based on the findings from the literature review and from interviews with chronic patients. Initial psychometric analysis was performed to pilot test a preliminary version of the items. The items were then refined and administered to a national sample of chronic patients (N = 382) to assess the measure's psychometric performance. A final phase of test-retest reliability was performed. The analysis showed that the PHE Scale has good psychometric properties with good correlation with concurrent measures and solid reliability. Having a valid and reliable measure to assess patient engagement is the first step in understanding patient engagement and its role in health care quality, outcomes, and cost containment. The PHE Scale shows a promising clinical relevance, indicating that it can be used to tailor intervention and assess changes after patient engagement interventions.
Eplov, Lene Falgaard; Petersen, Janne; Jørgensen, Torben
The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim of the pre......The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim...... 0.30 for the 12 and the 22 item scales. All five Mental Vulnerability scales had positively skewed score distributions which were associated significantly with both SCL-90-R symptom scores and NEO-PI-R personality scales (primarily Neuroticism and Extraversion). Coefficient alpha was highest...
Hafner, Brian J.; Morgan, Sara J.; Askew, Robert L.; Salem, Rana
Documentation of clinical outcomes is increasingly expected in delivery of prosthetic services and devices. However, many outcome measures suitable for use in clinical care and research have not been psychometrically tested with prosthesis users. The aim of this study was to determine test-retest reliability, mode-of-administration (MoA) equivalence, standard error of measurement (SEM), and minimal detectable change (MDC) of standardized, self-report instruments that assess constructs of impo...
Full Text Available Background and purpose: Using valid and reliable instruments is an important way for collecting data in qualitative researches. This paper is a report of a study conducted to examine the extent of psychometric properties of the scales in research papers published in Journal of Advanced Nursing.Methods: In this study, the Journal of Advanced Nursing was chosen for systematic review. All articles which were published during 2007-2009 in this journal were collected and articles related to instrument development were selected. Each article was completely reviewed to identify the methods of instrument validation and reliability.Results: From 980 articles published in Journal of Advanced Nursing during 2007-2009, 41 (4.18% articles were about research methodology. In these, 12 articles (29.27% were related to developing an instrument. In this study, review of 12 articles that published in Journal of Advanced Nursing, 2007-2009, showed that some of the articles did not measure psychometric properties properly, thus some of the developed scales need to measure other types of necessary validity. In addition, reliability testing needs to be performed on each instrument used in a study before other statistical analysis are performed. From 12 articles, all of the articles measured and reported Cronbach’s alpha, but four of them did not measure test-retest.Conclusions: Although researchers put a great emphasis on methodology and statistical analysis, they pay less attention to the psychometric properties of their new instruments. The authors of this article hope to draw the attention of researcher to the importance of measuring psychometric properties of new instruments.Keywords: PSYCHOMETRIC, SCALES, CRITICAL REVIEW
Hong, Ickpyo; Lee, Mi Jung; Kim, Moon Young; Park, Hae Yean
The aim of this study is to investigate the psychometrics of the 12 items of an instrument assessing activities of daily living (ADL) using an item response theory model. A total of 648 adults with physical disabilities and having difficulties in ADLs were retrieved from the 2014 Korean National Survey on People with Disabilities. The psychometric testing included factor analysis, internal consistency, precision, and differential item functioning (DIF) across categories including sex, older age, marital status, and physical impairment area. The sample had a mean age of 69.7 years old (SD = 13.7). The majority of the sample had lower extremity impairments (62.0%) and had at least 2.1 chronic conditions. The instrument demonstrated unidimensional construct and good internal consistency (Cronbach's alpha = 0.95). The instrument precisely estimated person measures within a wide range of theta values (-2.22 logits 5.0%). Our findings indicate that the dressing item would need to be modified to improve its psychometrics. Overall, the ADL instrument demonstrates good psychometrics, and thus, it may be used as a standardized instrument for measuring disability in rehabilitation contexts. However, the findings are limited to adults with physical disabilities. Future studies should replicate psychometric testing for survey respondents with other disorders and for children.
McGilton, Katherine S
The purpose of this study was to develop and evaluate the psychometric properties of 2 supportive leadership scales, the Charge Nurse Support Scale and the Unit Manager Support Scale, designed for long-term-care environments. These 6-item self-report scales were administered to 70 nursing staff and their internal consistency reliability, test-retest reliability, content validity, factor structure, and construct validity investigated. Content validity was established with the assistance of experts. Both scales were deemed reliable. As hypothesized, a significant relationship was found between the measure of how nursing staff related to residents and measures of charge nurses' supportive behaviours (r = .42, p = .05). Reliable and valid measures of supportive leadership could be developed for use in identifying the quality of support provided to staff in long-term-care environments.
Shi, Lu-Feng; Zaki, Nancy A
The present study attempted to establish psychometric function in individuals whose first language is not English. Psychometric function was obtained for one of the most commonly used clinical tests, the Northwestern University Auditory Test No. 6 (Tillman & Carhart 1966), so that findings could be directly applied to everyday clinical practice. Five groups of 14 normal-hearing, adult listeners differing in their first language and dominant language (English monolinguals, English- and Arabic-dominant Arabic-English bilinguals, and English- and Russian-dominant Russian-English bilinguals) participated. Both forms of the Northwestern University Auditory Test No. 6 test (8 lists of 50 monosyllabic English words) were presented. The lists were randomly assigned to eight signal-to-noise ratios (-3 to 18 dB in 3 dB steps). Listeners responded verbally and in writing. Psychometric functions were derived via logistic regression and described by two parameters: the 50% correct performance level (θ) and the slope (k). Both English-dominant bilingual groups obtained psychometric functions comparable with monolinguals. The θ and k of the functions for these three groups of participants were consistent with the literature. Compared with these three groups, non-English-dominant bilinguals' functions grew significantly more gradually (i.e., a significantly higher θ and a significantly lower k). No differences in either θ or k were found between bilinguals with the same dominant language but different first languages. Bilinguals reporting themselves to be dominant in English generate monolingual-like psychometric functions. By contrast, a different set of psychometric properties describes the function of bilinguals dominant in their first language. Because first language did not appear to be a significant factor in determining bilinguals' functions, it is concluded that English learning history and English proficiency are more important variables than first language for
Othman Mohamad Hashim
Full Text Available Urinalysis was used in previous studies among higher institution students (n=16252 in Malaysia to answer the question of whether university students are involved in drug abuse. However, the use of urinalysis had faced some problems. The problems were related to human rights issues and the cost to perform the urinalysis was expensive and quite impossible to be implemented to a large population of university students. To overcome this problem, this study was conducted to examine the effectiveness of psychometric measures in screening drug, alcohol and substance abuse. The Substance Abuse Subtle Screening Inventory A2 (SASSI-A2 was used for this purpose. SASSI-A2 is a brief screening tool designed to identify individuals who have a high probability of having a substance use disorder, including both substance abuse and substance dependence. SASSI-A2 comprises of 72 items that are rated on a two point scale with response; true and false. SASSI-A2 was translated into Malay language and it was refined through a back-translation technique and focus group approach. Psychometric testing was undertaken on a sample of 750 university students from five public universities in Malaysia. All participants were aged between 19 and 20 years. Internal consistency coefficients were calculated for the total scale and its subscales. Chronbach's alpha obtained for SASSI-A2 was 0.72. This relatively high level of Chronbach's alpha showed relatively high level of reliability. The results demonstrated that the whole SASSI-A2 meets the fundamental measurement properties and can discriminate groups of higher institution students from high to low on the substance dependency variable. The accuracy of the test has been found to be unaffected by gender, ethnicity, age and years of education. Although more rigorous validation studies are needed, it is recommended that SASSI-A2 be considered for usage to higher institution students populations when a brief, objective, and
Lundman, Berit; Årestedt, Kristofer; Norberg, Astrid; Norberg, Catharina; Fischer, Regina Santamäki; Lövheim, Hugo
This study tested the psychometric properties of a Swedish version of the Self-Transcendence Scale (STS). Cohen's weighted kappa, agreement, absolute reliability, relative reliability, and internal consistency were calculated, and the underlying structure of the STS was established by exploratory factor analysis. There were 2 samples available: 1 including 194 people aged 85-103 years and a convenience sample of 60 people aged 21-69 years. Weighted kappa values ranged from .40 to .89. The intraclass correlation coefficient for the original STS was .763, and the least significant change between repeated tests was 6.25 points. The revised STS was found to have satisfactory psychometric properties, and 2 of the 4 underlying dimensions in Reed's self-transcendence theory were supported.
Idrovo, Alvaro J; Camacho-Avila, Anabel; García-Rivas, Javier; Juárez-García, Arturo
Most studies on social capital and health are carried out with large home-based surveys, neglecting that many interactions among individuals occur in the workplace. The objective of this study was to explore the psychometric properties of a scale in Spanish used to measure social capital at work. The scale designed by Kouvonen et al was translated into Spanish and tested under classical test theory, item response theory, and confirmatory factorial analysis; 152 public health workers from different socio-cultural contexts participated in the survey. Internal consistency was high (Chronbach's alpha = 0.88). Social capital at work correlated properly with two Job Content Questionnaire dimensions. A ceiling effect was detected and item difficulty was quantified. The confirmatory factor analysis showed the expected theoretical components of social capital: bonding, bridging and trust. The scale has acceptable psychometric properties, thus it can be used in future studies.
García-Fernández, José Manuel; Espada Sánchez, José Pedro; Orgilés Amorós, Mireia; Méndez Carrillo, Xavier
This paper describes the psychometric properties of a new children's self-report measure. The School Fears Survey Scale, Form II (SFSS-II) assesses school fears in children from ages 8 to 11. The factor solution with a Spanish sample of 3,665 children isolated four factors: Fear of academic failure and punishment, fear of physical discomfort, fear of social and school assessment and anticipatory and separation anxiety. The questionnaire was tested by confirmatory factor analysis, which accounted for 55.80% of the total variance. Results indicated that the SFSS-II has a high internal consistency (alpha= .89). The results revealed high test-retest reliability and appropriate relationship with other scales. The age by gender interaction was significant. Two-way analysis of variance found that older children and girls had higher anxiety. The instrument shows adequate psychometric guarantees and can be used for the multidimensional assessment of anxiety in clinical and educational settings.
Fay, Derek M.; Levy, Roy; Mehta, Vandhana
A common practice in educational assessment is to construct multiple forms of an assessment that consists of tasks with similar psychometric properties. This study utilizes a Bayesian multilevel item response model and descriptive graphical representations to evaluate the psychometric similarity of variations of the same task. These approaches for…
Seo, Daeryong; Taherbhai, Husein; Frantz, Roger
The importance of listening in the context of English language acquisition is gaining acceptance, but its unique attributes in language performance, while substantively and qualitatively justifiable, are generally not psychometrically defined. This article psychometrically supports listening as a distinct domain among the three other domains of…
May, Keith A; Solomon, Joshua A
In a 2-alternative forced-choice (2AFC) discrimination task, observers choose which of two stimuli has the higher value. The psychometric function for this task gives the probability of a correct response for a given stimulus difference, Δx. This paper proves four theorems about the psychometric function. Assuming the observer applies a transducer and adds noise, Theorem 1 derives a convenient general expression for the psychometric function. Discrimination data are often fitted with a Weibull function. Theorem 2 proves that the Weibull "slope" parameter, β, can be approximated by β(Noise) x β(Transducer), where β(Noise) is the β of the Weibull function that fits best to the cumulative noise distribution, and β(Transducer) depends on the transducer. We derive general expressions for β(Noise) and β(Transducer), from which we derive expressions for specific cases. One case that follows naturally from our general analysis is Pelli's finding that, when d' ∝ (Δx)(b), β ≈ β(Noise) x b. We also consider two limiting cases. Theorem 3 proves that, as sensitivity improves, 2AFC performance will usually approach that for a linear transducer, whatever the actual transducer; we show that this does not apply at signal levels where the transducer gradient is zero, which explains why it does not apply to contrast detection. Theorem 4 proves that, when the exponent of a power-function transducer approaches zero, 2AFC performance approaches that of a logarithmic transducer. We show that the power-function exponents of 0.4-0.5 fitted to suprathreshold contrast discrimination data are close enough to zero for the fitted psychometric function to be practically indistinguishable from that of a log transducer. Finally, Weibull β reflects the shape of the noise distribution, and we used our results to assess the recent claim that internal noise has higher kurtosis than a Gaussian. Our analysis of β for contrast discrimination suggests that, if internal noise is stimulus
Keith A May
Full Text Available In a 2-alternative forced-choice (2AFC discrimination task, observers choose which of two stimuli has the higher value. The psychometric function for this task gives the probability of a correct response for a given stimulus difference, Δx. This paper proves four theorems about the psychometric function. Assuming the observer applies a transducer and adds noise, Theorem 1 derives a convenient general expression for the psychometric function. Discrimination data are often fitted with a Weibull function. Theorem 2 proves that the Weibull "slope" parameter, β, can be approximated by β(Noise x β(Transducer, where β(Noise is the β of the Weibull function that fits best to the cumulative noise distribution, and β(Transducer depends on the transducer. We derive general expressions for β(Noise and β(Transducer, from which we derive expressions for specific cases. One case that follows naturally from our general analysis is Pelli's finding that, when d' ∝ (Δx(b, β ≈ β(Noise x b. We also consider two limiting cases. Theorem 3 proves that, as sensitivity improves, 2AFC performance will usually approach that for a linear transducer, whatever the actual transducer; we show that this does not apply at signal levels where the transducer gradient is zero, which explains why it does not apply to contrast detection. Theorem 4 proves that, when the exponent of a power-function transducer approaches zero, 2AFC performance approaches that of a logarithmic transducer. We show that the power-function exponents of 0.4-0.5 fitted to suprathreshold contrast discrimination data are close enough to zero for the fitted psychometric function to be practically indistinguishable from that of a log transducer. Finally, Weibull β reflects the shape of the noise distribution, and we used our results to assess the recent claim that internal noise has higher kurtosis than a Gaussian. Our analysis of β for contrast discrimination suggests that, if internal noise is
Full Text Available Starting with the common origins of biometrics and psychometrics at the beginning of the twentieth century, the paper compares and contrasts subsequent developments, informed by the author's 35 years at Rothamsted Experimental Station followed by a period with the data theory group in Leiden and thereafter. Although the methods used by biometricians and psychometricians have much in common, there are important differences arising from the different fields of study. Similar differences arise wherever data are generated and may be regarded as a major driving force in the development of statistical ideas.
TEMPLE, SUSANNAH FLEUR
Functional Fluency denotes efficacy of interpersonal functioning in terms of flexibility and balance of the behavioural modes a person uses. The aim of this project is to design and create a psychometric tool for mapping the patterns of such functioning. The intention is that feedback on the test results will stimulate the insights and understanding to support and encourage positive behavioural change. This process, involving the development of self-awareness, which is a key as...
Burton, Amy L.; Hay, Phillipa; Kleitman, Sabina; Smith, Evelyn; Raman, Jayanthi; Swinbourne, Jessica; Touyz, Stephen W.; Abbott, Maree J.
Background The Eating Beliefs Questionnaire (EBQ) is a 27-item self-report measure that assesses positive and negative beliefs about binge eating. It has been validated and its factor structure explored in a non-clinical sample. This study tested the psychometric properties of the EBQ in a clinical and a non-clinical sample. Method A sample of 769 participants (573 participants recruited from the university and general community, 76 seeking treatment for an eating disorder and 120 participati...
Marc, Linda G.; Wang, Ming-Mei; Testa, Marcia A.
The objective of this paper is to psychometrically validate the HIV Symptom Distress Scale (SDS), an instrument that can be used to measure overall HIV symptom distress or clinically relevant groups of HIV symptoms. A secondary data analysis was conducted using the Collaborations in HIV Outcomes Research U.S. Cohort (CHORUS). Inclusion criteria required study participants (N=5,521) to have a valid baseline measure of the AIDS Clinical Trial Group Symptom Distress Module, with an SF-12 or SF-36 completed on the same day. Psychometric testing assessed unidimensionality, internal consistency and factor structure using exploratory and confirmatory factor analysis, and structural equation modeling (SEM). Construct validity examined whether the new measure discriminates across clinical significance (CD4 and HIV viral load). Findings show that the SDS has high reliability (α=0.92), and SEM supports a correlated second-order factor model (physical and mental distress) with acceptable fit (GFI=0.88, AGFI=0.85, NFI=0.99, NNFI=0.99; RMSEA=0.06, [90% CI 0.06 – 0.06]; Satorra Bentler Scaled, C2 =3274.20; p=0.0). Construct validity shows significant differences across categories for HIV-1 viral load (p< 0.001) and CD4 (p< 0.001). Differences in mean SDS scores exist across gender (p< 0.001), race/ethnicity (p< 0.05) and educational attainment (p < 0.001). Hence, the HIV Symptom Distress Scale is a reliable and valid instrument, which measures overall HIV symptoms or clinically relevant groups of symptoms. PMID:22409246
Serrani Azcurra, Daniel Jorge Luis
Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. As important the concept as it is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish speaking people. Translate and adapt the Health Empowerment Scale (HES) for a Spanish-speaking older adults sample and perform its psychometric validation. The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and Psychometric Analyses were conducted on 648 urban-dwelling seniors. The HES had an acceptable internal consistency with a Cronbach's α of 0.89. The convergent validity was supported by significant Pearson's Coefficient correlations between the HES total and item scores and the General Self Efficacy Scale (r= 0.77), Swedish Rheumatic Disease Empowerment Scale (r= 0.69) and Making Decisions Empowerment Scale (r= 0.70). Construct validity was evaluated using item analysis, half-split test and corrected item to total correlation coefficients; with good internal consistency (α> 0.8). The content validity was supported by Scale and Item Content Validity Index of 0.98 and 1.0, respectively. HES had acceptable face validity and reliability coefficients; which added to its ease administration and users' unbiased comprehension, could set it as a suitable tool in evaluating elder's outpatient empowerment-based medical education programs.
Manuel Salvador Ortiz Parada
Full Text Available The present study tested the psychometric properties of the Zimet et al.’ Multidimensional Scale of Perceived Social Support (MSPSS, in patients with Type 2 Diabetes Mellitus (DM (n = 76 from Temuco City, Chile. The total scale shown appropriate levels of internal consistency (0,849. An exploratory factorial analysis, with Varimax rotation, was performed over the MSPSS measures. In agreement with the original Scale, three factors were obtained explaining 66,8% of the variance. These results suggest that MSPSS is a psychometrically sound instrument that can be applied to patients with Type 2 Diabetes.
Aim: The aim of this research was to evaluate the psychometric properties of a measure of entrepreneurial climate. Entrepreneurial climate was measured using a shortened version of the Hornsby, Kuratko and Zahra (2002 instrument, called the Corporate Entrepreneurship Assessment Instrument (CEAI. Making information on the psychometric properties of the instrument available directly relates to its utility. Setting: The setting was medium to large South African companies. A random sample of employees was drawn from 53 selected companies across South Africa, with 60 respondents per company (N = 3 180. Methods: A cross-sectional survey design was used. Several instruments were administered, including the shortened version of the CEAI. Cronbach’s alpha was used to test for reliability and several methods were used to test for validity. Correlation analysis was used to test for concurrent validity, convergent validity and divergent validity. Principle component factor analysis was used to test for factorial validity and a t-test to test for known-group validity. Results: The results showed that the reliability for the total score of the shortened version of the CEAI was acceptable at 0.758. The results also showed some evidence of concurrent validity, as well as homogeneity among the items. With regard to factorial validity, all items loaded in accordance with the subscales of the instrument. The measure was able to distinguish, as expected, between government organisations and private business entities, suggesting known-group validity. Convergent validity and divergent validity were also assessed. Interesting to note was that entrepreneurship climate correlates more with general employee attitude (e.g. employee engagement; R= 0.420, p < 0.001 and organisational commitment, R = 0.331, p < 0.001 than with self-reported innovation (R = 0.277, p < 0.001 and R = 0.267, p < 0.001. Contribution: This paper not only provided information on the reliability
Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Eric Swanson, MD
Full Text Available Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
Richardson, George B; Chen, Ching-Chen; Dai, Chia-Liang; Brubaker, Michael D; Nedelec, Joseph L
Many published studies have employed the Mini-K to measure a single fast-slow life history dimension. However, the internal structure of the Mini-K has not been determined and it is not clear that a single higher order K-factor fits the data. It is also not clear that the Mini-K is measurement invariant across groups such as the sexes. To establish the construct validity of K as well as the broader usefulness of applying life history theory to humans, it is crucial that these psychometric issues are addressed as a part of measure validation efforts. Here we report on three studies that used latent variable modeling and data drawn from two college student samples ( ns = 361 and 300) to elucidate the psychometrics of the Mini-K. We found that (a) the Mini-K had a six dimensional first-order structure, (b) the K-factor provided a parsimonious explanation of the associations among the lower order factors at no significant cost to fit, (c) the Mini-K measured the same K-factor across the sexes, (d) K-factor means did not have the same meaning across the sexes and thus the first-order factors should be used in studies of mean sex differences, and finally, (e) the K-factor was only associated with environment and aspects of mating competition in females. Implications and future directions for life history research are discussed.
V. Duprez (Veerle); S.M. van Hooft (Susanne); J. Dwarswaard (Jolanda); A.L. van Staa (AnneLoes); A. Van Hecke (Ann); M.M.H. Strating (Mathilde)
markdownabstract__Aim:__ To develop and psychometrically test the self-efficacy and performance in self-management support (SEPSS) instrument. __Background:__ Facilitating persons with a chronic condition to take an active role in the management of their condition, implicates that nurses acquire
Slaney, Kathleen L.; Tkatchouk, Masha; Gabriel, Stephanie M.; Ferguson, Leona P.; Knudsen, Jared R. S.; Legere, Julien C.
The primary aim of the present study is to determine whether the psychometric evaluation practices and test-analytic rationales of researchers publishing in journals with a measurement focus differ from those of researchers publishing in journals with varying substantive research foci. Several components of two different samples of articles were…
Fazeli, Seyed Hossein
The current study aims to analyze the psychometric qualities of the Persian adapted version of Strategy Inventory for Language Learning (SILL) developed by Rebecca L. Oxford (1990). Three instruments were used: Persian adapted version of SILL, a Background Questionnaire, and Test of English as a Foreign Language. Two hundred and thirteen Iranian…
Gillespie, Brigid M; Polit, Denise F; Hamlin, Lois; Chaboyer, Wendy
This paper describes the development and validation of the Revised Perioperative Competence Scale (PPCS-R). There is a lack of a psychometrically tested sound self-assessment tools to measure nurses' perceived competence in the operating room. Content validity was established by a panel of international experts and the original 98-item scale was pilot tested with 345 nurses in Queensland, Australia. Following the removal of several items, a national sample that included all 3209 nurses who were members of the Australian College of Operating Room Nurses was surveyed using the 94-item version. Psychometric testing assessed content validity using exploratory factor analysis, internal consistency using Cronbach's alpha, and construct validity using the "known groups" technique. During item reduction, several preliminary factor analyses were performed on two random halves of the sample (n=550). Usable data for psychometric assessment were obtained from 1122 nurses. The original 94-item scale was reduced to 40 items. The final factor analysis using the entire sample resulted in a 40 item six-factor solution. Cronbach's alpha for the 40-item scale was .96. Construct validation demonstrated significant differences (pperceived competence scores relative to years of operating room experience and receipt of specialty education. On the basis of these results, the psychometric properties of the PPCS-R were considered encouraging. Further testing of the tool in different samples of operating room nurses is necessary to enable cross-cultural comparisons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Fang, Ke; Pieterse, Alex L.; Friedlander, Myrna; Cao, Junhong
This investigation tested the psychometric properties of the Attitudes Toward Seeking Professional Psychological Help Scale-Short Form (ATSPPH-SF; Fisher and Farina ["Journal of College Student Development, 36", 368-373, 1995]) in a sample of 338 Mainland Chinese college students. Using back-translation, the ATSPPH-SF was translated into…
Masters, Kevin S; Ross, Kaile M; Hooker, Stephanie A; Wooldridge, Jennalee L
There has been a notable disconnect between theories of behavior change and behavior change interventions. Because few interventions are both explicitly and adequately theory-based, investigators cannot assess the impact of theory on intervention effectiveness. Theory-based interventions, designed to deliberately engage the theory's proposed mechanisms of change, are needed to adequately test theories. Thus, systematic approaches to theory-based intervention development are needed. This article will introduce and discuss the psychometric method of developing theory-based interventions. The psychometric approach to intervention development utilizes basic psychometric principles at each step of the intervention development process in order to build a theoretically driven intervention to, subsequently, be tested in process (mechanism) and outcome studies. Five stages of intervention development are presented as follows: (i) Choice of theory; (ii) Identification and characterization of key concepts and expected relations; (iii) Intervention construction; (iv) Initial testing and revision; and (v) Empirical testing of the intervention. Examples of this approach from the Colorado Meaning-Activity Project (COMAP) are presented. Based on self-determination theory integrated with meaning or purpose, and utilizing a motivational interviewing approach, the COMAP intervention is individually based with an initial interview followed by smart phone-delivered interventions for increasing daily activity. The psychometric approach to intervention development is one method to ensure careful consideration of theory in all steps of intervention development. This structured approach supports developing a research culture that endorses deliberate and systematic operationalization of theory into behavior change intervention from the outset of intervention development.
Muller-Staub, M.; Lunney, M.; Lavin, M.A.; Needham, I.; Odenbreit, M.; Achterberg, T. van
The instrument Q-DIO was developed in the years 2005 till 2006 to measure the quality of documented nursing diagnoses, interventions, and nursing sensitive patient outcomes. Testing psychometric properties of the Q-DIO (Quality of nursing Diagnoses, Interventions and Outcomes.) was the study aim.
Heaton Lisa J
Full Text Available Abstract Background It would be useful to have psychometrically-sound measures of dental fear for Hispanics, who comprise the largest ethnic minority in the United States. We report on the psychometric properties of Spanish-language versions of two common adult measures of dental fear (Modified Dental Anxiety Scale, MDAS; Dental Fear Survey, DFS, as well as a measure of fear of dental injections (Needle Survey, NS. Methods Spanish versions of the measures were administered to 213 adults attending Hispanic cultural festivals, 31 students (who took the questionnaire twice, for test-retest reliability, and 100 patients at a dental clinic. We also administered the questionnaire to 136 English-speaking adults at the Hispanic festivals and 58 English-speaking students at the same college where we recruited the Spanish-speaking students, to compare the performance of the English and Spanish measures in the same populations. Results The internal reliabilities of the Spanish MDAS ranged from 0.80 to 0.85. Values for the DFS ranged from 0.92 to 0.96, and values for the NS ranged from 0.92 to 0.94. The test-retest reliabilities (intra-class correlations for the three measures were 0.69, 0.86, and 0.94 for the MDAS, DFS, and NS, respectively. The three measures showed moderate correlations with one another in all three samples, providing evidence for construct validity. Patients with higher scores on the measures were rated as being more anxious during dental procedures. Similar internal reliabilities and correlations were found in the English-version analyses. The test-retest values were also similar in the English students for the DFS and NS; however, the English test-retest value for the MDAS was better than that found in the Spanish students. Conclusion We found evidence for the internal reliability, construct validity, and criterion validity for the Spanish versions of the three measures, and evidence for the test-retest reliability of the Spanish
Mittelstädt, Justin M; Pecena, Yvonne; Oubaid, Viktor; Maschke, Peter
This paper investigates personality traits as potential factors for success in an astronaut selection by comparing personality profiles of unsuccessful and successful astronaut candidates in different phases of the ESA selection procedure. It is further addressed whether personality traits could predict an overall assessment rating at the end of the selection. In 2008/2009, ESA performed an astronaut selection with 902 candidates who were either psychologically recommended for mission training (N = 46) or failed in basic aptitude (N = 710) or Assessment Center and interview testing (N = 146). Candidates completed the Temperament Structure Scales (TSS) and the NEO Personality Inventory Revised (NEO-PI-R). Those candidates who failed in basic aptitude testing showed higher levels of Neuroticism (M = 49.8) than the candidates who passed that phase (M = 45.4 and M = 41.6). Additionally, candidates who failed in basic testing had lower levels of Agreeableness (M = 132.9) than recommended candidates (M = 138.1). TSS scales for Achievement (r = 0.19) and Vitality (r = 0.18) showed a significant correlation with the overall assessment rating given by a panel board after a final interview. Results indicate that a personality profile similar to Helmreich's "Right Stuff" is beneficial in astronaut selection. Influences of test anxiety on performance are discussed. Mittelstädt JM, Pecena Y, Oubaid V, Maschke P. Psychometric personality differences between candidates in astronaut selection. Aerosp Med Hum Perform. 2016; 87(11):933-939.
González, David Andrés; Boals, Adriel; Jenkins, Sharon Rae; Schuler, Eric R; Taylor, Daniel
Students and young adults have high rates of suicide and depression, thus are a population of interest. To date, there is no normative psychometric information on the IDS and QIDS in these populations. Furthermore, there is equivocal evidence on the factor structure and subscales of the IDS. Two samples of young adult students (ns=475 and 1681) were given multiple measures to test the psychometrics and dimensionality of the IDS and QIDS. The IDS, its subscales, and QIDS had acceptable internal consistencies (αs=.79-90) and favorable convergent and divergent validity correlations. A three-factor structure and two Rasch-derived subscales best fit the IDS. The samples were collected from one university, which may influence generalizability. The IDS and QIDS are desirable measures of depressive symptoms when studying young adult students. Copyright © 2013 Elsevier B.V. All rights reserved.
Griffith, James W; Sumner, Jennifer A; Raes, Filip; Barnhofer, Thorsten; Debeer, Elise; Hermans, Dirk
Autobiographical memory is a multifaceted construct that is related to psychopathology and other difficulties in functioning. Across many studies, a variety of methods have been used to study autobiographical memory. The relationship between overgeneral autobiographical memory (OGM) and psychopathology has been of particular interest, and many studies of this cognitive phenomenon rely on the Autobiographical Memory Test (AMT) to assess it. In this paper, we examine several methodological approaches to studying autobiographical memory, and focus primarily on methodological and psychometric considerations in OGM research. We pay particular attention to what is known about the reliability, validity, and methodological variations of the AMT. The AMT has adequate psychometric properties, but there is great variability in methodology across studies that use it. Methodological recommendations and suggestions for future studies are presented. Copyright © 2011 Elsevier Ltd. All rights reserved.
Travis A. Ryan
Full Text Available The Drive for Muscularity Attitudes Questionnaire (DMAQ was developed to measure men’s desire to attain an idealized muscular body. To date, the cross-cultural suitability of this measure has received limited attention. The current study addressed this omission by testing the psychometric properties of the DMAQ using an online sample of Irish men (N = 327. Confirmatory factor analysis revealed that a unidimensional model adequately matched observed data (i.e., fit indices suggested acceptable model fit. Analyses also showed that the DMAQ yielded reliable and construct valid scores, suggesting that the scale holds promise as an indicant of the drive for muscularity among Irish men. Strengths and limitations associated with this study are discussed, such as advantages and disadvantages of Internet research. Directions for future research are given, including the need for more psychometric work.
Hannan, Jean; Youngblut, JoAnne M; Brooten, Dorothy; Bazzani, Dianne; Romero, Norma R; Chavez, Blanca; Picanes, Joann
Measuring stress in Hispanic Americans, the fastest growing U.S. minority, is problematic. The Life Events Inventory (LEI) and the Daily Hassles Scale (DHS), widely used stress instruments, are not available in Spanish. To test the psychometric properties of the translated Spanish versions of the LEI and DHS. A convenience sample of 63 Hispanic women completed both instruments in Spanish and English 2 weeks apart. Internal consistency reliability and stability were strong for both instruments (.85-.97). Reliability and validity evidence for the translated Spanish versions were strong and similar to the English version. Psychometric findings suggest that the newly translated Spanish versions are good representations of the English versions and that these newly translated instruments are ready for use.
Greenslade, Kathryn J; Coggins, Truman E
This study presents an independent replication and extension of psychometric evidence supporting the Theory of Mind Inventory (ToMI). Parents of 20 children with ASD (4; 1-6; 7 years; months) and 20 with typical development (3; 1-6; 5), rated their child's theory of mind abilities in everyday situations. Other parent report and child behavioral assessments included the Social Responsiveness Scale-2, Vineland Adaptive Behavior Scales-2, Peabody Picture Vocabulary Test-4, and Clinical Evaluation of Language Fundamentals-Preschool, 2. Results revealed high internal consistency, expected developmental changes in children with typical development, expected group differences between children with and without ASD, and strong correlations with other measures of social and communication abilities. The ToMI demonstrates strong psychometrics, suggesting considerable utility in identifying theory of mind deficits in children with ASD.
Lerdal, Anners; Moe, Britt; Digre, Elin; Harding, Thomas; Kristensen, Frode; Grov, Ellen K; Bakken, Linda N; Eklund, Marthe L; Ruud, Ireen; Rossi, Joseph S
Title Stages of Change – Continuous Measure (URICA-E2): psychometrics of a Norwegian version. Aim This paper is a report of research to translate the English version of the Stages of Change continuous measure questionnaire (URICA-E2) into Norwegian and to test the validity of the questionnaire and its usefulness in predicting behavioural change. Background While the psychometric properties of the Stages of Change categorical measure have been tested extensively, evaluation of the psychometric properties of the continuous questionnaire has not been described elsewhere in the literature. Method Cross-sectional data were collected with a convenience sample of 198 undergraduate nursing students in 2005 and 2006. The English version of URICA-E2 was translated into Norwegian according to standardized procedures. Findings Principal components analysis clearly confirmed five of the dimensions of readiness to change (Precontemplation Non-Believers, Precontemplation Believers, Contemplation, Preparation and Maintenance), while the sixth dimension, Action, showed the lowest Eigenvalue (0·93). Findings from the cluster analysis indicate distinct profiles among the respondents in terms of readiness to change their exercise behaviour. Conclusion The URICA-E2 was for the most part replicated from Reed’s original work. The result of the cluster analysis of the items associated with the factor ‘Action’ suggests that these do not adequately measure the factor. PMID:19032513
Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael
This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.
Full Text Available Objective. The Geriatric Depression Scale (GDS is an evaluation tool to diagnose older adult’s depression. This questionnaire was defined by Yesavage and Brink in 1982; it was designed expressly for the older person and defines his/her degree of satisfaction, quality of life, and feelings. The objective of this study is to evaluate the psychometric properties of the Italian translation of the Geriatric Depression Scale (GDS-IT. Methods. The Italian version of the Geriatric Depression Scale was administered to 119 people (79 people with a depression diagnosis and 40 healthy ones. We examined the following psychometric characteristics: internal consistency reliability, test-retest reliability, concurrent validity, and construct validity (factor structure. Results. Cronbach’s Alpha for the GDS-IT administered to the depressed sample was 0.84. Test-retest reliability was 0.91 and the concurrent validity was 0.83. The factorial analysis showed a structure of 5 factors, and the scale cut-off is between 10 and 11. Conclusion. The GDS-IT proved to be a reliable and valid questionnaire for the evaluation of depression in an Italian population. In the present study, the GDS-IT showed good psychometric properties. Health professionals now have an assessment tool for the evaluation of depression symptoms in the Italian population.
Chang, Chih-Cheng; Su, Jian-An; Tsai, Ching-Shu; Yen, Cheng-Fang; Liu, Jiun-Horng; Lin, Chung-Ying
To examine the psychometrics of the Affiliate Stigma Scale using rigorous psychometric analysis: classical test theory (CTT) (traditional) and Rasch analysis (modern). Differential item functioning (DIF) items were also tested using Rasch analysis. Caregivers of relatives with mental illness (n = 453; mean age: 53.29 ± 13.50 years) were recruited from southern Taiwan. Each participant filled out four questionnaires: Affiliate Stigma Scale, Rosenberg Self-Esteem Scale, Beck Anxiety Inventory, and one background information sheet. CTT analyses showed that the Affiliate Stigma Scale had satisfactory internal consistency (α = 0.85-0.94) and concurrent validity (Rosenberg Self-Esteem Scale: r = -0.52 to -0.46; Beck Anxiety Inventory: r = 0.27-0.34). Rasch analyses supported the unidimensionality of three domains in the Affiliate Stigma Scale and indicated four DIF items (affect domain: 1; cognitive domain: 3) across gender. Our findings, based on rigorous statistical analysis, verified the psychometrics of the Affiliate Stigma Scale and reported its DIF items. We conclude that the three domains of the Affiliate Stigma Scale can be separately used and are suitable for measuring the affiliate stigma of caregivers of relatives with mental illness. Copyright © 2015 Elsevier Inc. All rights reserved.
Neumann, Guenter; Schaadt, Anna-Katharina; Reinhart, Stefan; Kerkhoff, Georg
Cerebral vision disorders (CVDs) are frequent after brain damage and impair the patient's outcome. Yet clinically and psychometrically validated procedures for the anamnesis of CVD are lacking. To evaluate the clinical validity and psychometric qualities of the Cerebral Vision Screening Questionnaire (CVSQ) for the anamnesis of CVD in individuals poststroke. Analysis of the patients' subjective visual complaints in the 10-item CVSQ in relation to objective visual perimetry, tests of reading, visual scanning, visual acuity, spatial contrast sensitivity, light/dark adaptation, and visual depth judgments. Psychometric analyses of concurrent validity, specificity, sensitivity, positive/negative predictive value, and interrater reliability were also done. Four hundred sixty-one patients with unilateral (39.5% left, 47.5% right) or bilateral stroke (13.0%) were included. Most patients were assessed in the chronic stage, on average 36.7 (range = 1-620) weeks poststroke. The majority of all patients (96.4%) recognized their visual symptoms within 1 week poststroke when asked for specifically. Mean concurrent validity of the CVSQ with objective tests was 0.64 (0.54-0.79, P reliability was 0.76 for a 1-week interval between both assessments (all P guides the clinician in the selection of necessary assessments and appropriate neurovisual therapies for the patient. © The Author(s) 2015.
Karin, A; Hannesdottir, K; Jaeger, J; Annas, P; Segerdahl, M; Karlsson, P; Sjögren, N; von Rosen, T; Miller, F
To conduct a psychometric analysis to determine the adequacy of instruments that measure cognition in Alzheimer's disease trials. Both the Alzheimer's Disease Assessment Scale - Cognition (ADAS-Cog) and the Neuropsychological Test Battery (NTB) are validated outcome measures for clinical trials in Alzheimer's disease and are approved also for regulatory purposes. However, it is not clear how comparable they are in measuring cognitive function. In fact, many recent trials in Alzheimer's disease patients have failed and it has been questioned if ADAS-Cog still is a sensitive measure. The present paper examines the psychometric properties of ADAS-Cog and NTB, based on a post hoc analysis of data from a clinical trial (NCT01024660), which was conducted by AstraZeneca, in mild-to-moderate Alzheimer's disease (AD) patients, with a Mini Mental State Examination (MMSE) Total score 16-24. Acceptability, reliability, different types of validity and ability to detect change were assessed using relevant statistical methods. Total scores of both tests, as well as separate domains of both tests, including the Wechsler Memory Scale (WMS), Rey Auditory Verbal Learning Test (RAVLT) and Delis-Kaplan Executive Function System (D-KEFS) Verbal Fluency Condition, were analyzed. Overall, NTB performed well, with acceptable reliability and ability to detect change, while ADAS-Cog had insufficient psychometric properties, including ceiling effects in 8 out of a total of 11 ADAS-Cog items in mild AD patients, as well as low test-retest reliability in some of the items. Based on a direct comparison on the same patient sample, we see advantages of the NTB compared with the ADAS-Cog for the evaluation of cognitive function in the population of mild-to-moderate AD patients. The results suggest that not all of ADAS-Cog items are relevant for both mild and moderate AD population. This validation study demonstrates satisfactory psychometric properties of the NTB, while ADAS-Cog was found to be
Blanchard, E B; Arena, J G; Pallmeyer, T P
Four studies were conducted on a sample of 230 undergraduates to determine the psychometric properties of a measure of alexithymia, the Schalling-Sifneos Scale. In the first study it was found that scores on the scale are approximately normally distributed for each sex with 8.2% of males and 1.8% of females in the alexithymia range. In the second study a factor analysis of the scale revealed three distinct factors: (1) 'difficulty in expression of feelings'; (2) 'the importance of feelings especially about people'; (3) 'day-dreaming or introspection'. In the second factor analytic study, scores from several standard psychological tests on the same subjects were introduced with the scale items. Two factors in this analysis were comprised almost entirely of the other test scores: a 'general psychological distress factor' and a 'concerns about physical symptoms factor'. The other two factors were similar to factors 1 and 2 above in terms of items. The Rathus Assertiveness Scale loaded positively on the equivalent of factor 1. In the lst study, it was shown that Schalling-Sifneos Scale score is relatively orthogonal to other psychological tests with the exception of a Psychosomatic Symptom Checklist and thus is measuring something other than depression, anxiety, etc.
Dikken, Jeroen; Hoogerduijn, Jita G; Kruitwagen, Cas; Schuurmans, Marieke J
To assess the content validity and psychometric characteristics of the Knowledge about Older Patients Quiz (KOP-Q), which measures nurses' knowledge regarding older hospitalized adults and their certainty regarding this knowledge. Cross-sectional. Content validity: general hospitals. Psychometric characteristics: nursing school and general hospitals in the Netherlands. Content validity: 12 nurse specialists in geriatrics. Psychometric characteristics: 107 first-year and 78 final-year bachelor of nursing students, 148 registered nurses, and 20 nurse specialists in geriatrics. Content validity: The nurse specialists rated each item of the initial KOP-Q (52 items) on relevance. Ratings were used to calculate Item-Content Validity Index and average Scale-Content Validity Index (S-CVI/ave) scores. Items with insufficient content validity were removed. Psychometric characteristics: Ratings of students, nurses, and nurse specialists were used to test for different item functioning (DIF) and unidimensionality before item characteristics (discrimination and difficulty) were examined using Item Response Theory. Finally, norm references were calculated and nomological validity was assessed. Content validity: Forty-three items remained after assessing content validity (S-CVI/ave = 0.90). Psychometric characteristics: Of the 43 items, two demonstrating ceiling effects and 11 distorting ability estimates (DIF) were subsequently excluded. Item characteristics were assessed for the remaining 30 items, all of which demonstrated good discrimination and difficulty parameters. Knowledge was positively correlated with certainty about this knowledge. The final 30-item KOP-Q is a valid, psychometrically sound, comprehensive instrument that can be used to assess the knowledge of nursing students, hospital nurses, and nurse specialists in geriatrics regarding older hospitalized adults. It can identify knowledge and certainty deficits for research purposes or serve as a tool in educational
Bech, P; Bech, P
OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety, antidepress......OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety...... psychometrics in psychiatry have been found to cover a pharmacopsychometric triangle illustrating the measurements of wanted and unwanted effects of pharmacotherapeutic drugs as well as health-related quality of life....
Full Text Available This study describes the psychometric properties of the Children's Separation Anxiety Scale (CSAS, which assesses separation anxiety symptoms in childhood. Participants in Study 1 were 1,908 schoolchildren aged between 8 and 11. Exploratory factor analysis identified four factors: worry about separation, distress from separation, opposition to separation, and calm at separation, which explained 46.91% of the variance. In Study 2, 6,016 children aged 8-11 participated. The factor model in Study 1 was validated by confirmatory factor analysis. The internal consistency (α = 0.82 and temporal stability (r = 0.83 of the instrument were good. The convergent and discriminant validity were evaluated by means of correlations with other measures of separation anxiety, childhood anxiety, depression and anger. Sensitivity of the scale was 85% and its specificity, 95%. The results support the reliability and validity of the CSAS.
Dhir, Amandeep; Chen, Sufen; Nieminen, Marko
The past few years have witnessed great developments in Internet infrastructure, which have led to increased Internet usage among people of various age groups. However, at the same time, there have been some negative implications associated with increased Internet usage for some individuals. "Internet addiction" (IA) is one such negative…
paradigm shift and suggest that qualitative research designs might be used more ... remained on quantitative methods and the meaning behind the concepts is often ... the guideline in practice, a list of common errors, and a set of references for ...
Potkin, Steven G; Bugarski-Kirola, Dragana; Edgar, Chris J; Soliman, Sherif; Le Scouiller, Stephanie; Kunovac, Jelena; Miguel Velasco, Eugenio; Garibaldi, George M
Unemployment can negatively impact quality of life among patients with schizophrenia. Employment status depends on ability, opportunity, education, and cultural influences. A clinician-rated scale of work readiness, independent of current work status, can be a valuable assessment tool. A series of studies were conducted to create and validate a Work Readiness Questionnaire (WoRQ) for clinicians to assess patient ability to engage in socially useful activity, independent of work availability. Content validity, test-retest and inter-rater reliability, and construct validity were evaluated in three separate studies. Content validity was supported. Cronbach's α was 0.91, in the excellent range. Clinicians endorsed WoRQ concepts, including treatment adherence, physical appearance, social competence, and symptom control. The final readiness decision showed good test-retest reliability and moderate inter-rater reliability. Work readiness was associated with higher function and lower levels of negative symptoms. Low positive and high negative predictive values confirmed the concept validity. The WoRQ has suitable psychometric properties for use in a clinical trial for patients with a broad range of symptom severity. The scale may be applicable to assess therapeutic interventions. It is not intended to assess eligibility for supported work interventions. The WoRQ is suitable for use in schizophrenia clinical trials to assess patient work functional potential.
Guadagnin, Simone C; Nakano, Eduardo Y; Dutra, Eliane S; de Carvalho, Kênia M B; Ito, Marina K
Workplace dietary intervention studies in low- and middle-income countries using psychometrically sound measures are scarce. This study aimed to validate a nutrition knowledge questionnaire (NQ) and its utility in evaluating the changes in knowledge among participants of a Nutrition Education Program (NEP) conducted at the workplace. A NQ was tested for construct validity, internal consistency and discriminant validity. It was applied in a NEP conducted at six workplaces, in order to evaluate the effect of an interactive or a lecture-based education programme on nutrition knowledge. Four knowledge domains comprising twenty-three items were extracted in the final version of the NQ. Internal consistency of each domain was significant, with Kuder-Richardson formula values>0·60. These four domains presented a good fit in the confirmatory factor analysis. In the discriminant validity test, both the Expert and Lay groups scored>0·52, but the Expert group scores were significantly higher than those of the Lay group in all domains. When the NQ was applied in the NEP, the overall questionnaire scores increased significantly because of the NEP intervention, in both groups (Pnutrition knowledge among participants of NEP at the workplace. According to the NQ, an interactive nutrition education had a higher impact on nutrition knowledge than a lecture programme.
Full Text Available Health literacy refers to personal competencies for the access to, understanding of, appraisal of and application of health information in order to make sound decisions in everyday life. The aim of this study was to develop and evaluate the psychometric properties of an instrument for the measurement of health literacy among adolescents (the Health Literacy Measure for Adolescents-HELMA.This study was made up of two phases, qualitative and quantitative, which were carried out in 2012-2014 in Tehran, Iran. In the qualitative part of the study, in-depth interviews with 67 adolescents aged 15-18 were carried out in 4 high schools to generate the initial item pool for the survey. The content validity of the items was then assessed by an expert panel review (n = 13 and face validity was assessed by interviewing adolescents (n = 16. In the quantitative part of the study, in order to describe the psychometric properties of the scale, validity, reliability (internal consistency and test-retest and factor analysis were assessed.An item pool made up of 104 items was generated at the qualitative stage. After content validity was considered, this decreased to 47 items. In the quantitative stage, 582 adolescents aged 15-18 participated in the study with a mean age of 16.2 years. 51.2% of participants were females. In principal component factor analysis, 8 factors were loaded, which accounted for 53.37% of the variance observed. Reliability has been approved by α = 0.93 and the test-retest of the scale at two-week intervals indicated an appropriate stability for the scale (ICC = 0.93. The final questionnaire was approved with 44 items split into eight sections. The sections were titled: gain access to, reading, understanding, appraise, use, communication, self-efficacy and numeracy.The Health Literacy Measure for Adolescents (HELMA is a valid and reliable tool for the measurement of the health literacy of adolescents aged 15-18 and can be used to evaluate
Pittman, Joyce; Bakas, Tamilyn; Ellett, Marsha; Sloan, Rebecca; Rawl, Susan M
The purpose of this study was to evaluate the psychometric properties of a new instrument to measure incidence and severity of ostomy complications early in the postoperative period. 71 participants were enrolled, most were men (52%), white (96%), and married or partnered (55%). The mean age of participants was 57 ± 15.09 years (mean ± SD). Fifty-two participants (84%) experienced at least 1 ostomy complication in the 60-day postoperative period. The research setting was 3 acute care settings within a large healthcare system in the Midwestern United States. We developed an evidence-based conceptual model to guide development and evaluation of a new instrument, the Pittman Ostomy Complication Severity Index (OCSI). The OCSI format includes Likert-like scale with 9 individual items scored 0 to 3 and a total score computed by summing the individual items. Higher scores indicate more severe ostomy complications. This study consisted of 2 phases: (1) an expert review, conducted to establish content validity; and (2) a prospective, longitudinal study design, to examine psychometric properties of the instrument. A convenience sample of 71 adult patients who underwent surgery to create a new fecal ostomy was recruited from 3 hospitals. Descriptive analyses, content validity indices, interrater reliability testing, and construct validity testing were employed. Common complications included leakage (60%), peristomal moisture-associated dermatitis (50%), stomal pain (42%), retraction (39%), and bleeding (32%). The OCSI demonstrated acceptable evidence of content validity index (CVI = 0.9) and interrater reliability for individual items (k = 0.71-1.0), as well as almost perfect agreement for total scores among raters (ICC = 0.991, P ≤ .001). Construct validity of the OCSI was supported by significant correlations among variables in the conceptual model (complications, risk factors, stoma care self-efficacy, and ostomy adjustment). OCSI demonstrated acceptable validity and
.... The more specific intent is to encourage reevaluation from a structured psychometric viewpoint. The end goal is to facilitate a uniformly higher standard of measurement quality in unidimensional scaling having complex scale step descriptors...
Bergmann Tiest, W.M.; Kappers, A.M.L.
This very brief report introduces a psychometric function, very suitable for psychophysical data that displays Weber-like behaviour, because it is antisymmetric on a logarithmic scale. © 2011 a Pion publication.
Wiberg, Marie; Culpepper, Steven; Douglas, Jeffrey; Wang, Wen-Chung
This proceedings volume compiles and expands on selected and peer reviewed presentations given at the 81st Annual Meeting of the Psychometric Society (IMPS), organized by the University of North Carolina at Greensboro, and held in Asheville, North Carolina, July 11th to 17th, 2016. IMPS is one of the largest international meetings focusing on quantitative measurement in psychology, education, and the social sciences, both in terms of participants and number of presentations. The meeting built on the Psychometric Society's mission to share quantitative methods relevant to psychology, addressing a diverse set of psychometric topics including item response theory, factor analysis, structural equation modeling, time series analysis, mediation analysis, cognitive diagnostic models, and multi-level models. Selected presenters were invited to revise and expand their contributions and to have them peer reviewed and published in this proceedings volume. Previous volumes to showcase work from the Psychometric Society�...
Matthews, Gerald; Reinerman-Jones, Lauren E; Barber, Daniel J; Abich, Julian
A study was run to test the sensitivity of multiple workload indices to the differing cognitive demands of four military monitoring task scenarios and to investigate relationships between indices. Various psychophysiological indices of mental workload exhibit sensitivity to task factors. However, the psychometric properties of multiple indices, including the extent to which they intercorrelate, have not been adequately investigated. One hundred fifty participants performed in four task scenarios based on a simulation of unmanned ground vehicle operation. Scenarios required threat detection and/or change detection. Both single- and dual-task scenarios were used. Workload metrics for each scenario were derived from the electroencephalogram (EEG), electrocardiogram, transcranial Doppler sonography, functional near infrared, and eye tracking. Subjective workload was also assessed. Several metrics showed sensitivity to the differing demands of the four scenarios. Eye fixation duration and the Task Load Index metric derived from EEG were diagnostic of single-versus dual-task performance. Several other metrics differentiated the two single tasks but were less effective in differentiating single- from dual-task performance. Psychometric analyses confirmed the reliability of individual metrics but failed to identify any general workload factor. An analysis of difference scores between low- and high-workload conditions suggested an effort factor defined by heart rate variability and frontal cortex oxygenation. General workload is not well defined psychometrically, although various individual metrics may satisfy conventional criteria for workload assessment. Practitioners should exercise caution in using multiple metrics that may not correspond well, especially at the level of the individual operator.
Caminha, Guilherme Pilla; Melo Junior, José Tavares de; Hopkins, Claire; Pizzichini, Emilio; Pizzichini, Marcia Margaret Menezes
Rhinosinusitis is a highly prevalent disease and a major cause of high medical costs. It has been proven to have an impact on the quality of life through generic health-related quality of life assessments. However, generic instruments may not be able to factor in the effects of interventions and treatments. SNOT-22 is a major disease-specific instrument to assess quality of life for patients with rhinosinusitis. Nevertheless, there is still no validated SNOT-22 version in our country. Cross-cultural adaptation of the SNOT-22 into Brazilian Portuguese and assessment of its psychometric properties. The Brazilian version of the SNOT-22 was developed according to international guidelines and was broken down into nine stages: 1) Preparation 2) Translation 3) Reconciliation 4) Back-translation 5) Comparison 6) Evaluation by the author of the SNOT-22 7) Revision by committee of experts 8) Cognitive debriefing 9) Final version. Second phase: prospective study consisting of a verification of the psychometric properties, by analyzing internal consistency and test-retest reliability. Cultural adaptation showed adequate understanding, acceptability and psychometric properties. We followed the recommended steps for the cultural adaptation of the SNOT-22 into Portuguese language, producing a tool for the assessment of patients with sinonasal disorders of clinical importance and for scientific studies.
Sköld, Annika; Janeslätt, Gunnel Kristina
Impaired ability to manage time has been shown in several diagnoses common in childhood. Impaired ability involves activities and participation domain (daily time management, DTM) and body function and structure domain (time-processing ability, TPA). DTM needs to be evaluated from an individual's own perspective. To date, there has been a lack of self-rating instruments for children that focus on DTM. The aim of this study is to describe psychometric properties of Time-S when used in children aged 10-17 years with a diagnosis of ADHD, Autism, CP or mild ID. Further, to test whether TPA correlates with self-rated DTM. Eighty-three children aged 10-17 years participated in the study. Rasch analysis was used to assess psychometric properties. Correlation analysis was performed between Time-S and a measure of TPA. The 21 items of the Time-S questionnaire fit into a unitary construct measuring self-perceived daily management of an individual's time. A non-significant, small correlation was found between TPA and DTM. The results indicate good psychometric properties for the questionnaire. The questionnaire is potentially useful in intervention planning and evaluation.
Waldréus, Nana; Jaarsma, Tiny; van der Wal, Martje Hl; Kato, Naoko P
Patients with heart failure can experience thirst distress. However, there is no instrument to measure this in patients with heart failure. The aim of the present study was to develop the Thirst Distress Scale for patients with Heart Failure (TDS-HF) and to evaluate psychometric properties of the scale. The TDS-HF was developed to measure thirst distress in patients with heart failure. Face and content validity was confirmed using expert panels including patients and healthcare professionals. Data on the TDS-HF was collected from patients with heart failure at outpatient heart failure clinics and hospitals in Sweden, the Netherlands and Japan. Psychometric properties were evaluated using data from 256 heart failure patients (age 72±11 years). Concurrent validity of the scale was assessed using a thirst intensity visual analogue scale. Patients did not have any difficulties answering the questions, and time taken to answer the questions was about five minutes. Factor analysis of the scale showed one factor. After psychometric testing, one item was deleted. For the eight item TDS-HF, a single factor explained 61% of the variance and Cronbach's alpha was 0.90. The eight item TDS-HF was significantly associated with the thirst intensity score ( r=0.55, pfailure.
McDermott, Orii; Orgeta, Vasiliki; Ridder, Hanne Mette; Orrell, Martin
Music in Dementia Assessment Scales (MiDAS), an observational outcome measure for music therapy with people with moderate to severe dementia, was developed from qualitative data of focus groups and interviews. Expert and peer consultations were conducted at each stage of the scale development to maximize its content validity. This study aimed to evaluate the psychometric properties of MiDAS. Care home residents with dementia attended weekly group music therapy for up to ten sessions. Music therapists and care home staff were requested to complete weekly MiDAS ratings. The Quality of Life Scale (QoL-AD) was completed at three time-points. A total of 629 (staff = 306, therapist = 323) MiDAS forms were completed. The statistical analysis revealed that MiDAS has high therapist inter-rater reliability, low staff inter-rater reliability, adequate staff test-retest reliability, adequate concurrent validity, and good construct validity. High factor loadings between the five MiDAS Visual Analogue Scale (VAS) items, levels of Interest, Response, Initiation, Involvement, and Enjoyment, were found. This study indicates that MiDAS has good psychometric properties despite the small sample size. Future research with a larger sample size could provide a more in-depth psychometric evaluation, including further exploration of the underlying factors. MiDAS provides a measure of engagement with musical experience and offers insight into who is likely to benefit on other outcomes such as quality of life or reduction in psychiatric symptoms.
Full Text Available Palmira Faraci,1 Michael Lock,2 Robert Wheeler2 1Faculty of Human and Social Sciences, University of Enna “Kore”, Enna, Italy; 2Formula 4 Leadership Limited, Nottingham, UK Abstract: This study aimed to validate the Italian version of the Leadership Judgement Indicator, an unconventional instrument devoted to measurement of leaders' judgments and preferred styles, ie, directive, consultative, consensual, or delegative, when dealing with a range of decision-making scenarios. After forward-translation and back-translation, its psychometric properties were estimated for 299 managers at various levels, who were asked to put themselves in the position of leader and to rate the appropriateness of certain ways of responding to challenge. Differences between several groups of managers, ranked in order of seniority, provided evidence for discriminant validity. Internal consistency was adequate. The findings show that the Italian adaptation of the Leadership Judgement Indicator has promising psychometric qualities, suggesting its suitability for use to improve outcomes in both organizational and selection settings. Keywords: Leadership Judgement Indicator, decision-making, situational test, scenarios, psychometric properties
Coluci, Marina Zambon Orpinelli; Alexandre, Neusa Maria Costa
The objectives of this study were to develop a questionnaire that evaluates the perception of nursing workers to job factors that may contribute to musculoskeletal symptoms, and to evaluate its psychometric properties. Internationally recommended methodology was followed: construction of domains, items and the instrument as a whole, content validity, and pre-test. Psychometric properties were evaluated among 370 nursing workers. Construct validity was analyzed by the factorial analysis, known-groups technique, and convergent validity. Reliability was assessed through internal consistency and stability. Results indicated satisfactory fit indices during confirmatory factor analysis, significant difference (p office workers, and moderate correlations between the new questionnaire and Numeric Pain Scale, SF-36 and WRFQ. Cronbach's alpha was close to 0.90 and ICC values ranged from 0.64 to 0.76. Therefore, results indicated that the new questionnaire had good psychometric properties for use in studies involving nursing workers. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Ge, Xiaohua; Zhang, Tingting; Zhou, Lingling
This study evaluated the psychometric properties of subjective sedation scales using one psychometric scoring system to identify the appropriate scale that is most suitable for clinical care practice. A number of published sedation assessment scales for paediatric patients are currently used to attempt to achieve a moderate depth of sedation to avoid the undesirable effects caused by over- or undersedation. However, there has been no systematic review of these scales. We searched the Cochrane Library, PubMed, EMBASE, the Cumulative Index to Nursing and Allied Health Literature, etc., to obtain relevant articles. The quality of the selected studies was evaluated according to the Consensus-based Standards for the Selection of Health Measurement Instruments checklist. Articles that had been published or were in press and discussed the psychometric properties of sedation scales were included. The population comprised critically ill infants and non-verbal children ranging in age from 0 to 18 years who underwent sedation in an intensive care unit. Data were independently extracted by two investigators using a standard data extraction checklist: 43 articles were included in this review, and 13 sedation scales were examined. The quality of the psychometric evidence for the Comfort Scale and Comfort Behaviour Scale was 'very good', with the Comfort Scale having a higher quality (total weighted scores, Comfort Scale = 17·3 and Comfort Behaviour Scale = 15·5). We suggest that the scales be systematically and comprehensively tested in terms of development method, reliability, validation, feasibility and correlation with clinical outcome. The Comfort Scale and Comfort Behaviour Scale are useful tools for measuring sedation in paediatric patients. Nursing staff should choose one subjective sedation scale that is suitable for assessing paediatric patients' depth of sedation. We recommend the Comfort Scale and Comfort Behaviour Scale as optimal choices if the clinical
Elholm, Bjarne; Larsen, Klaus; Hornnes, Nete
The study aimed to evaluate psychometrically a Danish translation of the Short Alcohol Withdrawal Scale (SAWS) in an outpatient setting in patients with Alcohol Dependence (AD) and Alcohol Withdrawal Symptoms/Syndrome (AWS).......The study aimed to evaluate psychometrically a Danish translation of the Short Alcohol Withdrawal Scale (SAWS) in an outpatient setting in patients with Alcohol Dependence (AD) and Alcohol Withdrawal Symptoms/Syndrome (AWS)....
Existing statistical tests for the fit of the Rasch model have been criticized, because they are only sensitive to specific violations of its assumptions. Contingency table methods using loglinear models have been used to test various psychometric models. In this paper, the assumptions of the Rasch
Hald, Søren Vester; Baker, Felicity A.; Ridder, Hanne Mette Ochsner
Primary objective: To evaluate the psychometric properties of two adapted versions of the interpersonal communication competence scale (ICCS) that were applied to people with acquired brain injury (ABI). Construct validity was tested for both new scales and a factor extraction was performed....... Participants with medium-to-severe ABI self-rated their interpersonal communication skills using the modified ICCS. Cronbach Alpha test was performed on both scales followed by a correlation analysis. Results: Seventeen participants with medium-to-severe ABI and staff and relatives (n¼37) were involved...... of the proxy-rating revealed six meaningful sub-groups of interpersonal communication competencies....
Wang, Tien-Ni; Liang, Kai-Jie; Liu, Yi-Chia; Shieh, Jeng-Yi; Chen, Hao-Ling
To examine the psychometric and clinimetric properties of the Melbourne Assessment 2 (MA2), an outcome measurement that is increasingly used in clinical studies. Psychometric and clinimetric study. Community. Seventeen children with cerebral palsy (CP) from 5 to 12 years were recruited for the estimation of the test-retest reliability and minimal detectable change (MDC). Thirty-five children with CP were recruited to receive an 8-week intensive neurorehabilitation intervention to estimate the validity, responsiveness, and minimal clinically important difference (MCID). Thirty-five children with CP received upper limb neurorehabilitation programs for 8 weeks. The MA2 and the criterion measures, including the Bruininks-Oseretsky Test of Motor Proficiency, 2nd edition (BOT-2), the Box and Blocks Test (BBT), and the Pediatric Motor Activity Log-Revised (PMAL-R), were evaluated at pretreatment and posttreatment. The MA2 has 4 subscales: range of motion, fluency, accuracy, and dexterity. The test-retest reliability of the MA2 is high (intraclass correlation coefficient, .92-.98). The significant relationships between the MA2 and BBT, BOT-2, and PMAL-R support its validity. The significance of paired t test results (PMA2. The MDC values of the 4 subscales of the MA2 are 2.85, 1.63, 1.97, and 1.84, respectively, and the suggested MCID values of these 4 subscales are 2.35, 3.20, 2.09, and 2.22, respectively, indicating the minimum scores of improvement to be interpreted as both statistically significant and clinically important. The study findings indicate that the MA2 has sound psychometric and clinimetric properties and is thus an adequate measurement for research and clinical applications. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Hill, Bridget; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea
To evaluate reproducibility (reliability and agreement) of the Brachial Assessment Tool (BrAT), a new patient-reported outcome measure for adults with traumatic brachial plexus injury (BPI). Prospective repeated-measure design. Outpatient clinics. Adults with confirmed traumatic BPI (N=43; age range, 19-82y). People with BPI completed the 31-item 4-response BrAT twice, 2 weeks apart. Results for the 3 subscales and summed score were compared at time 1 and time 2 to determine reliability, including systematic differences using paired t tests, test retest using intraclass correlation coefficient model 1,1 (ICC 1,1 ), and internal consistency using Cronbach α. Agreement parameters included standard error of measurement, minimal detectable change, and limits of agreement. BrAT. Test-retest reliability was excellent (ICC 1,1 =.90-.97). Internal consistency was high (Cronbach α=.90-.98). Measurement error was relatively low (standard error of measurement range, 3.1-8.8). A change of >4 for subscale 1, >6 for subscale 2, >4 for subscale 3, and >10 for the summed score is indicative of change over and above measurement error. Limits of agreement ranged from ±4.4 (subscale 3) to 11.61 (summed score). These findings support the use of the BrAT as a reproducible patient-reported outcome measure for adults with traumatic BPI with evidence of appropriate reliability and agreement for both individual and group comparisons. Further psychometric testing is required to establish the construct validity and responsiveness of the BrAT. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Lochhead, Lois E; MacMillan, Peter D
The Oswestry disability index (ODI) is the most widely used measure of perceived disability for low back conditions. It has been adopted without adaptation in functional capacity evaluation (FCE). Rigorous testing of the ODI with modern psychometric methods, in this setting, is warranted. To determine the psychometric properties of the ODI in FCE: unidimensionality; differential item functioning; item coverage and to identify poorly functioning items, allowing for improvement of these items and recalibration of the scale. Rasch analysis, specifically Masters' partial credit model, was conducted on data. 133 work-disabled individuals presenting for FCE in northern British Columbia, Canada. All items had one poorly functioning option. Items were rescaled from six categories to five, improving the psychometric properties of the ODI as a unidimensional (disability due to back pain) scale. Item difficulty range is sufficient for a population with mild to severe disability. Although two of the ten ODI items functioned marginally unsatisfactorily in the unrevised state, the 5-option revised ODI appears superior. Use in clinical settings across a broad spectrum of disability levels could help establish its psychometric properties. Health professionals should be aware that the ODI may perform differently depending on client population.
Moreira, Diana; Almeida, Fernando; Pinto, Marta; Segarra, Pilar; Barbosa, Fernando
The behavioral inhibition/behavioral activation (BIS/BAS) scales (Carver & White, 1994), which allow rating the Gray's motivational systems, were translated and adapted into Portuguese. In this study, the authors present the procedure and the psychometric analyses of the Portuguese version of the scales, which included basic item and scales psychometric characteristics, as well as confirmatory and exploratory factor analyses. After the psychometric analyses provided evidence for the quality of the Portuguese version of the scales, the normative data was provided by age and school grade. The confirmatory factor analysis of the BIS/BAS scales that the authors performed did not demonstrate satisfactory fit for the 2- or 4-factor solution. The authors also tested the more recent 5-factor model, but the fit indices remained inadequate. As fit indices were not satisfactory they proceeded with an exploratory factor analysis to examine the structure of the Portuguese scales. These psychometric analyses provided evidence of a successful translation of the original scales. Therefore these scales can now be used in future research with Portuguese or Brazilian population. (c) 2015 APA, all rights reserved.
Ringblom, Jenny; Wåhlin, Ingrid; Proczkowska, Marie
Emergence delirium and emergence agitation have been a subject of interest since the early 1960s. This behavior has been associated with increased risk of injury in children and dissatisfaction with anesthesia care in their parents. The Pediatric Anesthesia Emergence Delirium Scale is a commonly used instrument for codifying and recording this behavior. The aim of this study was to psychometrically evaluate the Pediatric Anesthesia Emergence Delirium scale, focusing on the factor structure, in a sample of children recovering from anesthesia after surgery or diagnostic procedures. The reliability of the Pediatric Anesthesia Emergence Delirium scale was also tested. One hundred and twenty-two children younger than seven years were observed at postoperative care units during recovery from anesthesia. Two or 3 observers independently assessed the children using the Pediatric Anesthesia Emergence Delirium scale. The factor analysis clearly revealed a one-factor solution, which accounted for 82% of the variation in the data. Internal consistency, calculated with Cronbach's alpha, was good (0.96). The Intraclass Correlation Coefficient, which was used to assess interrater reliability for the Pediatric Anesthesia Emergence Delirium scale sum score, was 0.97 (P Pediatric Anesthesia Emergence Delirium scale for assessing emergence delirium in children recovering from anesthesia after surgery or diagnostic procedures. The kappa statistics for the Pediatric Anesthesia Emergence Delirium scale items essentially indicated good agreement between independent raters, supporting interrater reliability. © 2018 John Wiley & Sons Ltd.
Endsley, Paige; Weobong, Benedict; Nadkarni, Abhijit
The Alcohol Use Disorders Identification Test (AUDIT) is a 10-item screening questionnaire used to detect alcohol use disorders. The AUDIT has been validated in only two studies in India and although it has been previously used in Goa, India, it has yet to be validated in that setting. In this paper, we aim to report data on the validity of the AUDIT for the screening of AUDs among men in Goa, India. Concurrent and convergent validity of the AUDIT were assessed against the Mini International Neuropsychiatric Interview (MINI) and World Health Organisation Disability Assessment Scale (WHODAS) for alcohol abuse, alcohol dependence, and functional status respectively through the secondary analysis of data from a community cohort of men from Goa, India. The AUDIT showed high internal reliability and acceptable criterion validity with adequate psychometric properties for the detection of alcohol abuse and dependence. However, all of the optimal cut-off points from ROC analyses were lower than the WHO recommended for identification of risk of all AUDs, with a score of 6-12 detecting alcohol abuse and 13 and higher alcohol dependence. In order to optimize the utility of the AUDIT, a lowered cut-off point for alcohol abuse and dependence is recommended for Goa, India. Further validation studies for the AUDIT should be conducted for continued validation of the tool in other parts of India. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Full Text Available Recent work in animals suggests that the extent of early tactile stimulation by parents of offspring is an important element in early caregiving. We evaluate the psychometric properties of a new parent-report measure designed to assess frequency of tactile stimulation across multiple caregiving domains in infancy. We describe the full item set of the Parent-Infant Caregiving Touch Scale (PICTS and, using data from a UK longitudinal Child Health and Development Study, the response frequencies and factor structure and whether it was invariant over two time points in early development (5 and 9 weeks. When their infant was 9 weeks old, 838 mothers responded on the PICTS while a stratified subsample of 268 mothers completed PICTS at an earlier 5 week old assessment (229 responded on both occasions. Three PICTS factors were identified reflecting stroking, holding and affective communication. These were moderately to strongly correlated at each of the two time points of interest and were unrelated to, and therefore distinct from, a traditional measure of maternal sensitivity at 7-months. A wholly stable psychometry over 5 and 9-week assessments was not identified which suggests that behavior profiles differ slightly for younger and older infants. Tests of measurement invariance demonstrated that all three factors are characterized by full configural and metric invariance, as well as a moderate degree of evidence of scalar invariance for the stroking factor. We propose the PICTS as a valuable new measure of important aspects of caregiving in infancy.
Recent public controversies, ranging from the 2014 Facebook 'emotional contagion' study to psychographic data profiling by Cambridge Analytica in the 2016 American presidential election, Brexit referendum and elsewhere, signal watershed moments in which the intersecting trajectories of psychology and computer science have become matters of public concern. The entangled history of these two fields grounds the application of applied psychological techniques to digital technologies, and an investment in applying calculability to human subjectivity. Today, a quantifiable psychological subject position has been translated, via 'big data' sets and algorithmic analysis, into a model subject amenable to classification through digital media platforms. I term this position the 'scalable subject', arguing it has been shaped and made legible by algorithmic psychometrics - a broad set of affordances in digital platforms shaped by psychology and the behavioral sciences. In describing the contours of this 'scalable subject', this paper highlights the urgent need for renewed attention from STS scholars on the psy sciences, and on a computational politics attentive to psychology, emotional expression, and sociality via digital media.
Rosneck, James S; Hughes, Joel; Gunstad, John; Josephson, Richard; Noe, Donald A; Waechter, Donna
This article describes the systematic construction and psychometric analysis of a knowledge assessment instrument for phase II cardiac rehabilitation (CR) patients measuring risk modification disease management knowledge and behavioral outcomes derived from national standards relevant to secondary prevention and management of cardiovascular disease. First, using adult curriculum based on disease-specific learning outcomes and competencies, a systematic test item development process was completed by clinical staff. Second, a panel of educational and clinical experts used an iterative process to identify test content domain and arrive at consensus in selecting items meeting criteria. Third, the resulting 31-question instrument, the Cardiac Knowledge Assessment Tool (CKAT), was piloted in CR patients to ensure use of application. Validity and reliability analyses were performed on 3638 adults before test administrations with additional focused analyses on 1999 individuals completing both pretreatment and posttreatment administrations within 6 months. Evidence of CKAT content validity was substantiated, with 85% agreement among content experts. Evidence of construct validity was demonstrated via factor analysis identifying key underlying factors. Estimates of internal consistency, for example, Cronbach's α = .852 and Spearman-Brown split-half reliability = 0.817 on pretesting, support test reliability. Item analysis, using point biserial correlation, measured relationships between performance on single items and total score (P knowledge instrument specifically designed for an adult CR population was systematically developed and tested in a large representative patient population, satisfying psychometric parameters, including validity and reliability.
McCauley, Rebecca J.; Strand, Edythe A.
Purpose: To review the content and psychometric characteristics of 6 published tests currently available to aid in the study, diagnosis, and treatment of motor speech disorders in children. Method: We compared the content of the 6 tests and critically evaluated the degree to which important psychometric characteristics support the tests' use for…
Akerman, Eva; Fridlund, Bengt; Samuelson, Karin; Baigi, Amir; Ersson, Anders
This is a further development of a specific questionnaire, the 3-set 4P, to be used for measuring former ICU patients' physical and psychosocial problems after intensive care and the need for follow-up. The aim was to psychometrically test and evaluate the 3-set 4P questionnaire in a larger population. The questionnaire consists of three sets: "physical", "psychosocial" and "follow-up". The questionnaires were sent by mail to all patients with more than 24-hour length of stay on four ICUs in Sweden. Construct validity was measured with exploratory factor analysis with Varimax rotation. This resulted in three factors for the "physical set", five factors for the "psychosocial set" and four factors for the "follow-up set" with strong factor loadings and a total explained variance of 62-77.5%. Thirteen questions in the SF-36 were used for concurrent validity showing Spearman's r(s) 0.3-0.6 in eight questions and less than 0.2 in five. Test-retest was used for stability reliability. In set follow-up the correlation was strong to moderate and in physical and psychosocial sets the correlations were moderate to fair. This may have been because the physical and psychosocial status changed rapidly during the test period. All three sets had good homogeneity. In conclusion, the 3-set 4P showed overall acceptable results, but it has to be further modified in different cultures before being considered a fully operational instrument for use in clinical practice. Copyright © 2012 Elsevier Ltd. All rights reserved.
Telch, Michael J; Pujols, Yasisca
Erectile dysfunction is a highly publicized and prevalent condition with marked adverse effects on men's social, emotional, and quality of life. Although several instruments have emerged for assessing erectile dysfunction and its impact on men's quality of life, none of the existing instruments provide a specific assessment of men's erectile performance anxiety. This article reports on the development and psychometric evaluation of the Erectile Performance Anxiety Index (EPAI)--a 10-item self-report scale designed to fill an important gap in the assessment of male erectile dysfunction. A total of 207 men ranging in age from 18 to 79 took part in the study. All subjects completed an online battery consisting of the EPAI, along with measures of related sexual functioning, social anxiety, state anxiety, and depressive symptoms. A small subset of study participants (N = 42) completed the EPAI a second time for determining test-retest reliability. Test-retest reliability was determined by Pearson's product-moment correlations. Internal reliability was assessed using Cronbach's alpha. Factor validity was evaluated by a maximum likelihood factor analysis with oblique rotation. Convergent and discriminant validity was assessed by comparing the strength of association between the EPAI and measures varying in their hypothesized shared variance with the construct of erectile performance anxiety. The EPAI demonstrated excellent internal consistency, with Cronbach's alpha = 0.93 and excellent test-retest reliability (r = 0.85) over an average period of 3.5 weeks. Results of an exploratory factor analysis revealed a one-factor solution that accounted for 63% of the total variance. Preliminary evidence supports the convergent and discriminant validity of the EPAI. Results support the use of the EPAI as a reliable, valid, and efficient instrument for the assessment of erectile performance anxiety. Potential research and clinical applications are discussed. © 2013 International
Silveira, Vladímir de Aquino; Souza, Givago da Silva; Gomes, Bruno Duarte; Rodrigues, Anderson Raiol; Silveira, Luiz Carlos de Lima
We used psychometric functions to estimate the joint entropy for space discrimination and spatial frequency discrimination. Space discrimination was taken as discrimination of spatial extent. Seven subjects were tested. Gábor functions comprising unidimensionalsinusoidal gratings (0.4, 2, and 10 cpd) and bidimensionalGaussian envelopes (1°) were used as reference stimuli. The experiment comprised the comparison between reference and test stimulithat differed in grating's spatial frequency or envelope's standard deviation. We tested 21 different envelope's standard deviations around the reference standard deviation to study spatial extent discrimination and 19 different grating's spatial frequencies around the reference spatial frequency to study spatial frequency discrimination. Two series of psychometric functions were obtained for 2%, 5%, 10%, and 100% stimulus contrast. The psychometric function data points for spatial extent discrimination or spatial frequency discrimination were fitted with Gaussian functions using the least square method, and the spatial extent and spatial frequency entropies were estimated from the standard deviation of these Gaussian functions. Then, joint entropy was obtained by multiplying the square root of space extent entropy times the spatial frequency entropy. We compared our results to the theoretical minimum for unidimensional Gábor functions, 1/4π or 0.0796. At low and intermediate spatial frequencies and high contrasts, joint entropy reached levels below the theoretical minimum, suggesting non-linear interactions between two or more visual mechanisms. We concluded that non-linear interactions of visual pathways, such as the M and P pathways, could explain joint entropy values below the theoretical minimum at low and intermediate spatial frequencies and high contrasts. These non-linear interactions might be at work at intermediate and high contrasts at all spatial frequencies once there was a substantial decrease in joint
Schwartz, Carolyn E; Michael, Wesley; Zhang, Jie; Rapkin, Bruce D; Sprangers, Mirjam A G
A growing body of research suggests that regularly engaging in stimulating activities across multiple domains-physical, cultural, intellectual, communal, and spiritual-builds resilience. This project investigated the psychometric characteristics of the DeltaQuest Reserve-Building Measure for use in prospective research. The study included Rare Patient Voice panel participants. The web-based survey included the Reserve-Building Measure with one-week re-test, measures of quality of life (QOL) and well-being (PROMIS General Health; NeuroQOL Cognitive Function and Positive Affect & Well-Being short-forms; Ryff Environmental Mastery subscale); and the Big Five Inventory-10 personality measure. Classical test theory and item response theory (IRT) analyses investigated psychometric characteristics of the Reserve-Building Measure. This North American sample (n = 592) included both patients and caregivers [mean age = 44, SD 19)]. Psychometric analyses revealed distinct subscales measuring current reserve-building activities (Active in the World, Games, Outdoors, Creative, Religious/Spiritual, Exercise, Inner Life, Shopping/Cooking, Passive Media Consumption,), past reserve-building activities (Childhood Activities, Achievement), and reserve-related person-factors (Perseverance, Current and Past Social Support, and Work Value). Test-retest stability (n = 101) was moderately high for 11 of 15 subscales (ICC range 0.78-0.99); four were below 0.59 indicating a need for further refinement. IRT analyses supported the item functioning of all subscales. Correlational analyses suggest the measure's subscales tap distinct constructs (range r = 0.11-0.46) which are not redundant with QOL, well-being, or personality (range r = 0.11-0.48). The Reserve-Building Measure provides a measure of activities and person-factors related to reserve that may potentially be useful in prospective research.
McCreary, Linda L.; Conrad, Karen M.; Conrad, Kendon J.; Scott, Christy K; Funk, Rodney R.; Dennis, Michael L.
Background Valid assessment of family functioning can play a vital role in optimizing client outcomes. Because family functioning is influenced by family structure, socioeconomic context, and culture, existing measures of family functioning--primarily developed with nuclear, middle class European American families--may not be valid assessments of families in diverse populations. The Family Effectiveness Measure was developed to address this limitation. Objectives To test the Family Effectiveness Measure with data from a primarily low-income African American convenience sample, using the Rasch measurement model. Method A sample of 607 adult women completed the measure. Rasch analysis was used to assess unidimensionality, response category functioning, item fit, person reliability, differential item functioning by race and parental status, and item hierarchy. Criterion-related validity was tested using correlations with five other variables related to family functioning. Results The Family Effectiveness Measure measures two separate constructs: The effective family functioning construct was a psychometrically sound measure of the target construct that was more efficient due to the deletion of 22 items. The ineffective family functioning construct consisted of 16 of those deleted items but was not as strong psychometrically. Items in both constructs evidenced no differential item functioning by race. Criterion-related validity was supported for both. Discussion In contrast to the prevailing conceptualization that family functioning is a single construct, assessed by positively and negatively worded items, use of the Rasch analysis suggested the existence of two constructs. While the effective family functioning is a strong and efficient measure of family functioning, the ineffective family functioning will require additional item development and psychometric testing. PMID:23636342
de Brouwer, Brigitte Johanna Maria; Kaljouw, Marian J; Schoonhoven, Lisette; van Achterberg, Theo
To develop and psychometrically test the Essentials of Magnetism II in nursing homes. Increasing numbers and complex needs of older people in nursing homes strain the nursing workforce. Fewer adequately trained staff and increased care complexity raise concerns about declining quality. Nurses' practice environment has been reported to affect quality of care and productivity. The Essentials of Magnetism II © measures processes and relationships of practice environments that contribute to productivity and quality of care and can therefore be useful in identifying processes requiring change to pursue excellent practice environments. However, this instrument was not explicitly evaluated for its use in nursing home settings so far. In a preparatory phase, a cross-sectional survey study focused on face validity of the essentials of magnetism in nursing homes. A second cross-sectional survey design was then used to further test the instrument's validity and reliability. Psychometric testing included evaluation of content and construct validity, and reliability. Nurses (N = 456) working at 44 units of three nursing homes were included. Respondent acceptance, relevance and clarity were adequate. Five of the eight subscales and 54 of the 58 items did meet preset psychometric criteria. All essentials of magnetism are considered relevant for nursing homes. The subscales Adequacy of Staffing, Clinically Competent Peers, Patient Centered Culture, Autonomy and Nurse Manager Support can be used in nursing homes without problems. The other subscales cannot be directly applied to this setting. The valid subscales of the Essentials of Magnetism II instrument can be used to design excellent nursing practice environments that support nurses' delivery of care. Before using the entire instrument, however, the other subscales have to be improved. © 2016 John Wiley & Sons Ltd.
Carvalho, Lucas de Francisco; Pianowski, Giselle; Silveira, Fernando José; Bacciotti, Jonatha Tiago; Vieira, Philipe Gomes
Abstract We aimed to review of the Eccentricity dimension of the Dimensional Clinical Personality Inventory (IDCP), through two steps. The first one focused on developing new items and the second on testing the psychometric properties in a sample of 225 subjects (70.1% females), aging between 18 and 66 years, mostly undergraduate students (58.9%). The subjects answered the IDCP, and the Brazilian versions of the NEO-PI-R, PID-5 and MIS. The first step resulted in 42 items, which 22 were new. ...
Zarem, Cori; Kidokoro, Hiroyuki; Neil, Jeffrey; Wallendorf, Michael; Inder, Terrie; Pineda, Roberta
To establish the psychometrics of the Neonatal Oral Motor Assessment Scale (NOMAS). In this prospective cohort study of 75 preterm infants (39 females, 36 males) born at or before 30 weeks gestation (mean gestational age 26.56 wks, SD 1.90, range 23-30 wks; mean birthweight 967.33 g, SD 288.54, range 480-2240), oral feeding was videotaped before discharge from the neonatal intensive care unit (NICU). The NOMAS was used to classify feeding as normal, disorganized, or dysfunctional. Neurobehavior was assessed at term equivalent, and infants underwent magnetic resonance imaging. Children returned for developmental testing at 2 years corrected age. Associations between NOMAS scores and (1) neurobehavior; (2) cerebral injury and metrics; and (3) developmental outcome were investigated using χ(2) -analyses, t-tests, and linear regression. For reliability, six certified NOMAS evaluators rated five randomly selected NOMAS recordings and re-scored them 2 weeks later in a second randomized order. Reliability was calculated with Cohen's kappa statistics. Dysfunctional NOMAS scores were associated with lower Dubowitz scores [t=-2.14; mean difference -2.32 (95% confidence interval [CI] -0.157 to -4.49); p=0.036], higher stress on the NICU Network Neurobehavioral Scale (t=2.61; mean difference 0.073 [95% CI 0.017-0.129]; p=0.0110), and decreased transcerebellar diameter (t=-2.22; mean difference -2.04 [CI=-3.89 to -0.203]; p=0.03). No significant associations were found between NOMAS scores and 2-year outcome. Some concurrent validity was established with associations between NOMAS scores and measures of infant behavior and cerebral structure. The NOMAS did not show predictive validity in this study of preterm infants at high risk of developmental delay. Reliability was variable and suboptimal. © 2013 Mac Keith Press.
Forslin, Mia; Kottorp, Anders; Kierkegaard, Marie; Johansson, Sverker
To translate and culturally adapt the Acceptance of Chronic Health Conditions (ACHC) Scale for people with multiple sclerosis into Swedish, and to analyse the psychometric properties of the Swedish version. Ten people with multiple sclerosis participated in translation and cultural adaptation of the ACHC Scale; 148 people with multiple sclerosis were included in evaluation of the psychometric properties of the scale. Translation and cultural adaptation were carried out through translation and back-translation, by expert committee evaluation and pre-test with cognitive interviews in people with multiple sclerosis. The psychometric properties of the Swedish version were evaluated using Rasch analysis. The Swedish version of the ACHC Scale was an acceptable equivalent to the original version. Seven of the original 10 items fitted the Rasch model and demonstrated ability to separate between groups. A 5-item version, including 2 items and 3 super-items, demonstrated better psychometric properties, but lower ability to separate between groups. The Swedish version of the ACHC Scale with the original 10 items did not fit the Rasch model. Two solutions, either with 7 items (ACHC-7) or with 2 items and 3 super-items (ACHC-5), demonstrated acceptable psychometric properties. Use of the ACHC-5 Scale with super-items is recommended, since this solution adjusts for local dependency among items.
Feenstra, H.E.M.; Vermeulen, I.E.; Murre, J.M.J.; Schagen, S.B.
OBJECTIVE: Online neuropsychological test batteries could allow for large-scale cognitive data collection in clinical studies. However, the few online neuropsychological test batteries that are currently available often still require supervision or lack proper psychometric evaluation. In this paper,
Feenstra, Heleen E M; Vermeulen, Ivar E; Murre, Jaap M J; Schagen, Sanne B
OBJECTIVE: Online neuropsychological test batteries could allow for large-scale cognitive data collection in clinical studies. However, the few online neuropsychological test batteries that are currently available often still require supervision or lack proper psychometric evaluation. In this paper,
Waal-Manning, H J; de Hamel, F A
During the Milton health survey subjects completed a psychometric inventory consisting of the 48 questions of the Middlesex Hospital questionnaire (MHQ) and 26 from the hostility and direction of hostility questionnaire (HDHQ) designed to examine nine psychological dimensions. The 1209 subjects were classified into smoking categories and the scores for each psychometric trait were calculated. Women scored higher than men and heavy smokers scored higher than "never smokers". The psychometric traits and the scores of the four smoking categories after correcting for age and Quetelet's index showed statistically significant differences by analysis of variance in respect of somatic anxiety and depression for both men and women; and free-floating anxiety, phobic anxiety, hysteria, acting out hostility, self criticism and guilt in women. For somatic anxiety the increase in score almost exactly paralleled the increasing quantity of tobacco consumed.
Evans, Travis C; Britton, Jennifer C
Abnormal threat-related attention in anxiety disorders is most commonly assessed and modified using the dot-probe paradigm; however, poor psychometric properties of reaction-time measures may contribute to inconsistencies across studies. Typically, standard attention measures are derived using average reaction-times obtained in experimentally-defined conditions. However, current approaches based on experimentally-defined conditions are limited. In this study, the psychometric properties of a novel response-based computation approach to analyze dot-probe data are compared to standard measures of attention. 148 adults (19.19 ± 1.42 years, 84 women) completed a standardized dot-probe task including threatening and neutral faces. We generated both standard and response-based measures of attention bias, attentional orientation, and attentional disengagement. We compared overall internal consistency, number of trials necessary to reach internal consistency, test-retest reliability (n = 72), and criterion validity obtained using each approach. Compared to standard attention measures, response-based measures demonstrated uniformly high levels of internal consistency with relatively few trials and varying improvements in test-retest reliability. Additionally, response-based measures demonstrated specific evidence of anxiety-related associations above and beyond both standard attention measures and other confounds. Future studies are necessary to validate this approach in clinical samples. Response-based attention measures demonstrate superior psychometric properties compared to standard attention measures, which may improve the detection of anxiety-related associations and treatment-related changes in clinical samples. Copyright © 2018 Elsevier Ltd. All rights reserved.
Selan, Denis; Jakobsson, Ulf; Condelius, Anna
The aim of this study was to further investigate the psychometric properties (with focus on construct validity and scale function) of the Swedish version of the Person-centred Care Assessment Tool (P-CAT) in a sample consisting of staff working in elderly care units (N = 142). The aim was also to further develop and psychometrically test a modified, noncontext-specific version of the instrument (mP-CAT) in a sample consisting of staff working in primary health care or within home care for older people (N = 182). Principal component analysis with varimax rotation initially suggested a three-factor solution for the P-CAT, explaining 55.96% of variance. Item 13 solely represented one factor wherefore this solution was rejected. A final 2-factor solution, without item 13, had a cumulative explained variance of 50.03%. All communalities were satisfactory (>0.3), and alpha values for both first factor (items 1-6, 11) and second factor (items 7-10, 12) were found to be acceptable. Principal component analysis with varimax rotation suggested a final 2-factor solution for the mP-CAT explaining 46.15% of the total variance with communalities ranging from 0.263 to 0.712. Cronbach's α for both factors was found to be acceptable (>0.7). This study suggests a 2-factor structure for the P-CAT and an exclusion of item 13. The results indicated that the modified noncontext-specific version, mP-CAT, seems to be a valid measure. Further psychometric testing of the mP-CAT is however needed in order to establish the instrument's validity and reliability in various contexts. © 2016 Nordic College of Caring Science.
Rosengren, L; Jonasson, S B; Brogårdh, C; Lexell, J
The Satisfaction With Life Scale (SWLS) is a global measure of life satisfaction (LS). The objective of this study was to evaluate the psychometric properties (data completeness, scaling assumptions, targeting and reliability) of the SWLS in a sample of people with Parkinson's disease (PD). A postal survey including a Swedish version of the SWLS and demographic information was administered to 174 persons with PD; 97 responded and received a second survey after 2 weeks. The mean (SD) age and PD duration of the 97 responders were 73 (8) and 7 (6) years, respectively. Data completeness was 92% to 97% for the five items in the SWLS and 92% for the total score (5-35 points). The mean score of the SWLS was 24.2 points (7.7), indicating that this group had an average LS. The items' means and SDs were roughly parallel and the score distribution was even. The internal consistency reliability (Cronbach's alpha) was 0.90. The test-retest reliability, assessed by the intraclass correlation coefficient, was 0.78. The scale showed no systematic difference between the first and second response. The standard error of measurement was 3.6 points, and the smallest detectable difference was 10.0 points. This evaluation of the psychometric properties of the SWLS shows that the scale has good data completeness, scaling assumptions and targeting and that the internal consistency reliability and the test-retest reliability are acceptable. Thus, the SWLS is a psychometrically sound and suitable tool to asses LS in people with PD. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Bidrag med en kortfattet, introducerende, perspektiverende og begrebsafklarende fremstilling af begrebet test i det pædagogiske univers.......Bidrag med en kortfattet, introducerende, perspektiverende og begrebsafklarende fremstilling af begrebet test i det pædagogiske univers....
Tagliabue,Semira; Olivari,Maria Giulia; Bacchini,Dario; Affuso,Gaetana; Confalonieri,Emanuela
The paper analyzes the psychometric properties of the G1 version of the Parenting Styles and Dimensions Questionnaire, a self-report instrument designed to investigate how adolescents or adults were parented during childhood. The sample included 1451 Italian adolescents in high school. Three studies tested the scale's structure, invariance, and convergent validity. The first found slightly acceptable fit indexes for a 40-item scale measuring three factors (authoritative, authoritarian, an...
This paper presented the Turkish version of the Depression Anxiety Stress Scale-21 (DASS-21) in community and clinical samples, examined its psychometric properties. Construct validity and concurrent validity were conducted in validity studies. Depression Anxiety Stress Scale-42 (DASS-42) was used for concurrent validity. In reliability analysis, the instruments internal consistency and re-test reliability were studied. Results of explanatory factor analyses demonstrated that 21 items yielded...
Parveen, Huma; Noohu, Majumi M.
Objective: The objective of this study was to determine the psychometric properties of the Tinetti Performance-Oriented Mobility Assessment (POMA) scale to measure balance and gait impairments in individuals with knee osteoarthritis (OA). Methods: A convenient sample of 25 individuals with bilateral OA knee were recruited. The convergent validity was determined by correlation analysis between scores of Berg Balance Scale (BBS) with balance subscale (POMA-B) and the Timed Up and Go Test (TU...
Hansen, Jacob Sander; Bendtsen, Lars; Jensen, Rigmor
The purpose of the study is to test the cross-cultural adaptation and psychometric properties of a Danish version of the Headache-Specific Locus of Control Scale (HSLC) and the Headache Management Self-Efficacy Scale (HMSE) in a tertiary headache centre. HSLC and HMSE are headache-specific measures...... with other self-report measures concerning general distress, anxiety, depression, and health-related quality of life. Internal stability of the HSLC subscales and the HMSE were analysed using Chronbach's alpha coefficient. The psychometric properties of the Danish version of the HSLC and the HMSE were...
Noor, Syed WB; Simon Rosser, B. R.; Erickson, Darin J.
Although the phenomenon of hypersexuality has been described in the literature, and scales of compulsive sexual behavior have been published, the existing measures do not assess compulsive sexually explicit media (SEM) consumption. This study tested the psychometric properties of a new scale, the Compulsive Pornography Consumption (CPC). Exploratory and confirmatory factor analyses results showed good psychometric performance of a five item two factor preoccupation-compulsivity solution. As hypothesized, the scale correlates positively with compulsive sexual behavior, internalized homonegativity, and negatively with sexual self-esteem. The scale will enable researchers to investigate the etiologic factors of compulsive SEM use, and enable clinicians to assess problematic consumption. PMID:25838755
Trygg, L; Dåderman, A M; Wiklund, N; Meurling, A W; Lindgren, M; Lidberg, L; Levander, S
The use of projective and psychometric psychological tests at the Department of Forensic Psychiatry in Stockholm (Huddinge), Sweden, was studied for a population of 60 men, including many patients with neuropsychological disabilities and multiple psychiatric disorders. The results showed that the use of projective tests like Rorschach, Object Relations Test, and House-Tree-Person was more frequent than the use of objective psychometric tests. Neuropsychological test batteries like the Halstead-Reitan Neuropsychological Test Battery or Luria-Nebraska Neuropsychological Battery were not used. The majority of patients were, however, assessed by intelligence scales like the WAIS-R. The questionable reliability and validity of the projective tests, and the risk of subjective interpretations, raise a problem when used in a forensic setting, since the courts' decisions about a sentence to prison or psychiatric care is based on the forensic psychiatric assessment. The use of objective psychometric neuropsychological tests and personality tests is recommended.
Full Text Available Purpose: Translate and adapt the Convergence Insuficiency Symptom Survey (CISS questionnaire to the Portuguese language and culture and assess the psychometric properties of the translated questionnaire (CISSvp. Methods: The CISS questionnaire was adapted according to the methodology recommended by some authors. The process involved two translations and back-translations performed by independent evaluators, evaluation of these versions, preparation of a synthesis version and its pre-test. The final version (CISSvp was applied in 70 patients (21.79 ± 2.42 years students in higher education, and at two different times, by two observers, to assess its reliability. Results: The results showed good internal consistency of the CISSvp (Cronbach's alpha - α=0.893. The test re-test revealed an average of the differences between the first and second evaluation of 0.75 points (SD ± 3.53, which indicates a minimum bias between the two administrations. The interrater reliability assessed by intraclass correlation coefficient ranged from 0.880 to 0.952, revealing that the CISSvp represents an appropriate tool for measuring the visual discomfort associated with near vision tasks with a high level of reproducibility. Conclusions: The CISS Portuguese version, showed good psychometric properties and has been sown to be applicable to the Portuguese population, to quantify the visual discomfort associated with near vision, in higher education students.
Hong, Ickpyo; Velozo, Craig A; Li, Chih-Ying; Romero, Sergio; Gruber-Baldini, Ann L; Shulman, Lisa M
The aim of this study is to investigate the psychometrics of the Patient-Reported Outcomes Measurement Information System self-efficacy for managing daily activities item bank. The item pool was field tested on a sample of 1087 participants via internet (n = 250) and in-clinic (n = 837) surveys. All participants reported having at least one chronic health condition. The 35 item pool was investigated for dimensionality (confirmatory factor analyses, CFA and exploratory factor analysis, EFA), item-total correlations, local independence, precision, and differential item functioning (DIF) across gender, race, ethnicity, age groups, data collection modes, and neurological chronic conditions (McFadden Pseudo R (2) less than 10 %). The item pool met two of the four CFA fit criteria (CFI = 0.952 and SRMR = 0.07). EFA analysis found a dominant first factor (eigenvalue = 24.34) and the ratio of first to second eigenvalue was 12.4. The item pool demonstrated good item-total correlations (0.59-0.85) and acceptable internal consistency (Cronbach's alpha = 0.97). The item pool maintained its precision (reliability over 0.90) across a wide range of theta (3.70), and there was no significant DIF. The findings indicated the item pool has sound psychometric properties and the test items are eligible for development of computerized adaptive testing and short forms.
Full Text Available BACKGROUND: The 10-item Perceived Stress Scale (PSS-10 is one of most widely used instruments to measure a global level of perceived stress in a range of clinical and research settings. This study was conducted to examine the psychometric properties of the Simplified Chinese version of the PSS-10 in policewomen. METHODOLOGY: A total of 240 policewomen were recruited in this study. The Simplified Chinese versions of the PSS-10, the Beck Depression Inventory Revised (BDI-II, and the Beck Anxiety Inventory (BAI were administered to all participants, and 36 of the participants were re-tested two weeks after the initial testing. PRINCIPAL FINDINGS: The overall Cronbach's alpha was 0.86, and the test-retest reliability coefficient was 0.68. Exploratory Factor Analysis (EFA yielded 2 factors with eigenvalues of 4.76 and 1.48, accounting for 62.41% of variance. Factor 1 consisted of 6 items representing "negative feelings"; whereas Factor 2 consisted of 4 items representing "positive feelings". The item loadings ranged from 0.72 to 0.83. The Confirmatory factor analysis (CFA indicated a very good fit of this two-factor model to this sample. The PSS-10 significantly correlated with both BDI-II and BAI, indicating an acceptable concurrent validity. CONCLUSIONS: The Simplified Chinese version of the PSS-10 demonstrated adequate psychometric properties for evaluating stress levels. The results support its use among the Chinese population.
Goossens, Joline; Verhaeghe, Sofie; Van Hecke, Ann; Barrett, Geraldine; Delbaere, Ilse; Beeckman, Dimitri
To evaluate the psychometric properties of the Dutch version of the London Measure of Unplanned Pregnancy in women with pregnancies ending in birth. A two-phase psychometric evaluation design was set-up. Phase I comprised the translation from English into Dutch and pretesting with 6 women using cognitive interviews. In phase II, the reliability and validity of the Dutch version of the LMUP was assessed in 517 women giving birth recently. Reliability (internal consistency) was assessed using Cronbach's alpha, inter-item correlations, and corrected item-total correlations. Construct validity was assessed using principal components analysis and hypothesis testing. Exploratory Mokken scale analysis was carried out. 517 women aged 15-45 completed the Dutch version of the LMUP. Reliability testing showed acceptable internal consistency (alpha = 0.74, positive inter-item correlations between all items, all corrected item-total correlations >0.20). Validity testing confirmed the unidimensional structure of the scale and all hypotheses were confirmed. The overall Loevinger's H coefficient was 0.57, representing a 'strong' scale. The Dutch version of the LMUP is a reliable and valid measure that can be used in the Dutch-speaking population in Belgium to assess pregnancy planning. Future research is necessary to assess the stability of the Dutch version of the LMUP, and to evaluate its psychometric properties in women with abortions.
Full Text Available Background. The Barkley Adult Attention Deficit/Hyperactivity Disorder (ADHD Rating Scale-IV (BAARS-IV was developed, and it demonstrated good psychometric properties. The BAARS-IV includes 27 questions on the symptoms of adult ADHD. The purpose of the present study is to investigate the psychometric testing of the Persian version of BAARS-IV among the elderlies in Tabriz City. Method. This cross-sectional study was conducted in Tabriz City—in the west of Iran—in 2015 via enrolling of 121 old-aged people. We did the process of translation and adaptation of BAARS-IV and examined its concurrent validity, internal consistency, and test-retest reliability. Result. The BAARS-IV demonstrated good internal consistency and test-retest reliability. Correlations between the BAARS-IV and the CAARS-S: SV were high and evidence supporting concurrent validity was revealed. Cronbach’s alpha for the overall scale and subscales stood at 0.89, 0.81, 0.66, 0.56, and 0.82, respectively. Conclusion. The Persian BAARS-IV showed acceptable reliability and validity. BAARS-IV was determined to be composed of internally consistent and psychometrically sound items.
One of the most common types of psychometric test used in assessment and selection procedures, The Numeracy Test Workbook provides practice questions and mock tests designed to build confidence and improve performance.
Larry H. Ludlow
Full Text Available Given the high stakes of teacher testing, there is no doubt that every teacher test should meet the industry guidelines set forth in the Standards for Educational and Psychological Testing. Unfortunately, however, there is no public or private business or governmental agency that serves to certify or in any other formal way declare that any teacher test does, in fact, meet the psychometric recommendations stipulated in the Standards. Consequently, there are no legislated penalties for faulty products (tests nor are there opportunities for test takers simply to raise questions about a test and to have their questions taken seriously by an impartial panel. The purpose of this article is to highlight some of the psychometric results reported by National Evaluation Systems (NES in their 1999 Massachusetts Educator Certification Test (MECT Technical Report, and more specifically, to identify those technical characteristics of the MECT that are inconsistent with the Standards. A second purpose of this article is to call for the establishment of a standing test auditing organization with investigation and sanctioning power. The significance of the present analysis is twofold: a psychometric results for the MECT are similar in nature to psychometric results presented as evidence of test development flaws in an Alabama class-action lawsuit dealing with teacher certification (an NES-designed testing system; and b there was no impartial enforcement agency to whom complaints about the Alabama tests could be brought, other than the court, nor is there any such agency to whom complaints about the Massachusetts tests can be brought. I begin by reviewing NES's role in Allen v. Alabama State Board of Education, 81-697-N. Next I explain the purpose and interpretation of standard item analysis procedures and statistics. Finally, I present results taken directly from the 1999 MECT Technical Report and compare them to procedures, results, and consequences of
Full Text Available Abstract Objectives Measurement of treatment satisfaction in diabetes is important as it has been shown to be associated with positive outcomes, reduced disease cost and better health. The aim of this study was to assess the construct validity and internal consistency reliability of the Greek version of the Diabetes Treatment Satisfaction Questionnaire (DTSQ. Methods A sample of type II diabetes patients (N = 172 completed the DTSQ status version, the SF-36 health survey and also provided data regarding treatment method, clinical and socio-demographic status. Instrument structure, reliability (Cronbach's a and construct validity (convergent, discriminative, concurrent and known-groups were assessed. Results The DTSQ measurement properties were confirmed in the Greek version with confirmatory factor analysis (CFA. Scale reliability was high (Cronbach's a = 0.92. Item-scale internal consistency and discriminant validity were also good, exceeding the designated success criteria. Significant correlations were observed between DTSQ items/overall score and SF-36 scales/component scores, which were hypothesized to measure similar dimensions. Known groups' comparisons yielded consistent support of the construct validity of the instrument. Conclusions The instrument was well-accepted by the patients and its psychometric properties were similar to those reported in validation studies of other language versions. Further research, incorporating a longitudinal study design, is required for examining test-retest reliability and responsiveness of the instrument, which were not addressed in this study. Overall, the present results confirm that the DTSQ status version is a reasonable choice for measuring diabetes treatment satisfaction in Greece.
Bruijning, Janna E; van Rens, Ger; Knol, Dirk; van Nispen, Ruth
In the past, rehabilitation centers for the visually impaired used unstructured or semistructured methods to assess rehabilitation needs of their patients. Recently, an extensive instrument, the Dutch ICF Activity Inventory (D-AI), was developed to systematically investigate rehabilitation needs of visually impaired adults and to evaluate rehabilitation outcomes. The purpose of this study was to investigate the underlying factor structure and other psychometric properties to shorten and improve the D-AI. The D-AI was administered to 241 visually impaired persons who recently enrolled in a multidisciplinary rehabilitation center. The D-AI uses graded scores to assess the importance and difficulty of 65 rehabilitation goals. For high-priority goals (e.g., daily meal preparation), the difficulty of underlying tasks (e.g., read recipes, cut vegetables) was assessed. To reduce underlying task items (>950), descriptive statistics were investigated and factor analyses were performed for several goals. The internal consistency reliability and test-retest reliability of the D-AI were investigated by calculating Cronbach α and Cohen (weighted) κ. Finally, consensus-based discussions were used to shorten and improve the D-AI. Except for one goal, factor analysis model parameters were at least reasonable. Internal consistency reliability was satisfactory (range, 0.74 to 0.93). In total, 60% of the 65 goal importance items and 84.4% of the goal difficulty items showed moderate to almost perfect κ values (≥0.40). After consensus-based discussions, a new D-AI was produced, containing 48 goals and less than 500 tasks. The analyses were an important step in the validation process of the D-AI and to develop a more feasible assessment tool to investigate rehabilitation needs of visually impaired persons in a systematic way. The D-AI is currently implemented in all Dutch rehabilitation centers serving all visually impaired adults with various rehabilitation needs.
Wetselaar, P.; Koutris, M.; Visscher, C.M.; Larsson, P.; John, M.T.; Lobbezoo, F.
The aim of this study was to test the psychometric properties of the Dutch version of the Orofacial Esthetic Scale (OES) in dental patients with and without self-reported tooth wear. The English version of the OES was translated into Dutch, following established guidelines for cross-cultural
Carlson, Ryan G.; Rogers, Tiffany L.; Wheeler, Naomi J.; Kelchner, Viki; Griffith, Sandy-Ann M.; Liu, Xun
Intimate partner violence (IPV) classifications have treatment implications for couples. This study tested the psychometrics of the Continuum of Conflict and Control Relationship Scale (CCC-RS) and examined differences between violence severity and CCC-RS scales. A sample of 575 low-income, ethnically diverse participants contributed data. Results…
Clemmensen, Lars; Bartels-Velthuis, Agna A.; Jespersen, Rokur Av F.; van Os, Jim; Blijd-Hoogewys, Els M. A.; Ankerstrom, Lise; Vaever, Mette; Daniel, Peter F.; Drukker, Marjan; Jeppesen, Pia; Jepsen, Jens R. M.
Background: Theory-of-Mind (ToM) keeps on developing in late childhood and early adolescence, and the study of ToM development later in childhood had to await the development of sufficiently sensitive tests challenging more mature children. The current study aimed to investigate the psychometric
Hutchins, Tiffany L.; Prelock, Patricia A.; Bonazinga, Laura
Two studies examined the psychometric properties of the Theory of Mind Inventory (ToMI). In Study One, 135 caregivers completed the ToMI for children (ages 3 through 17) with autism spectrum disorder (ASD). Findings revealed excellent test-retest reliability and internal consistency. Principle Components Analysis revealed three subscales related…
Kunz, M.; Capito, E. S.; Horn-Hofmann, C.; Baum, C.; Scheel, J.; Karmann, A. J.; Priebe, J. A.; Lautenbacher, S.
The way individuals attend to pain is known to have a considerable impact on the experience and chronification of pain. One method to assess the habitual "attention to pain" is the Pain Vigilance and Awareness Questionnaire (PVAQ). With the present study, we aimed to test the psychometric properties
Krzysztof Nowosielski, MD
Full Text Available Introduction: The sexual self-schema is a part of a broader concept of the self that is believed to be crucial for intrapersonal and interpersonal sexual relationships. Aim: To develop and perform psychometric validation of the Polish version of the Sexual Self-Schema Scale for Women (SSSS-W-PL. Methods: 561 women 18 to 55 years old were included in the final analysis. Linguistic validation was performed in 4 steps in line with the MAPI Institute guidelines. Convergent validity was calculated using the Pearson r product-moment coefficient between different measures of sexuality (attitudes and experience, behavior, arousal, romantic relationship and SSSS-W-PL total and factor scores. To test discriminant validity, we applied hierarchical regression analyses predicting the number of lifetime sexual partners, self-rating as a sexual person (1 item, “I feel sexually attractive”; on a 5-point Likert scale, and arousability, with independent variables being extraversion (Ten-Item Personality Inventory, self-esteem (Rosenberg Self-Esteem Scale, and the SSSS-W-PL (total and factor scores. Main Outcomes Measures: Sexual self-schema was measured by the SSSS-W-PL, whereas arousability was measured by the arousal/excitement scale of the Changes in Sexual Functioning Questionnaire. Results: The mean age of the study population was 29.0 ± 7.6 years. The final scale consisted of 24 adjectives grouped within 4 factors: romantic, passionate, direct, and embarrassed. The 4-factor model accounted for 39% of the variance. The Cronbach α was 0.74 for the SSSS-W-PL total score and 0.61 to 0.84 for individual factors. Test-retest reliability of the scale after 2- to 8-week intervals was 0.87 (95% CI = 0.82–0.86, P < .001. The increment variances were statistically significant and ranged from 3.8% to 11.6%. Conclusion: The analysis showed good psychometric properties and internal validity of the SSSS-W-PL. The SSSS-W-PL might be helpful in consulting and
Bhatia Kailash P
Full Text Available Abstract Background The United States Food and Drug Administration (FDA are currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58, a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: 1 provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; 2 enable researchers and clinicians to compare it with existing dystonia PROMs; and 3 help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing. Methods We evaluated traditional psychometric properties of data quality, scaling assumptions, targeting, reliability and validity in a group of 391 people with CD. The main outcome measures used were the CDIP-58, Medical Outcome Study Short Form-36, the 28-item General Health Questionnaire, and Hospital and Anxiety and Depression Scale. Results A total of 391 people returned completed questionnaires (corrected response rate 87%. Analyses showed: 1 data quality was high (low missing data ≤ 4%, subscale scores could be computed for > 96% of the sample; 2 item groupings passed tests for scaling assumptions; 3 good targeting (except for the Sleep subscale, ceiling effect = 27%; 4 good reliability (Cronbach's alpha ≥ 0.92, test-retest intraclass correlations ≥ 0.83; and 5 validity was supported. Conclusion This study has shown that new psychometric methods can produce a PROM that stands up to traditional criteria and supports the clinical advantages of Rasch analysis.
Full Text Available The OAV questionnaire has been developed to integrate research on altered states of consciousness (ASC. It measures three primary and one secondary dimensions of ASC that are hypothesized to be invariant across ASC induction methods. The OAV rating scale has been in use for more than 20 years and applied internationally in a broad range of research fields, yet its factorial structure has never been tested by structural equation modeling techniques and its psychometric properties have never been examined in large samples of experimentally induced ASC.The present study conducted a psychometric evaluation of the OAV in a sample of psilocybin (n = 327, ketamine (n = 162, and MDMA (n = 102 induced ASC that was obtained by pooling data from 43 experimental studies. The factorial structure was examined by confirmatory factor analysis, exploratory structural equation modeling, hierarchical item clustering (ICLUST, and multiple indicators multiple causes (MIMIC modeling. The originally proposed model did not fit the data well even if zero-constraints on non-target factor loadings and residual correlations were relaxed. Furthermore, ICLUST suggested that the "oceanic boundlessness" and "visionary restructuralization" factors could be combined on a high level of the construct hierarchy. However, because these factors were multidimensional, we extracted and examined 11 new lower order factors. MIMIC modeling indicated that these factors were highly measurement invariant across drugs, settings, questionnaire versions, and sexes. The new factors were also demonstrated to have improved homogeneities, satisfactory reliabilities, discriminant and convergent validities, and to differentiate well among the three drug groups.The original scales of the OAV were shown to be multidimensional constructs. Eleven new lower order scales were constructed and demonstrated to have desirable psychometric properties. The new lower order scales are most likely better suited to
Holly G Prigerson
Full Text Available Bereavement is a universal experience, and its association with excess morbidity and mortality is well established. Nevertheless, grief becomes a serious health concern for a relative few. For such individuals, intense grief persists, is distressing and disabling, and may meet criteria as a distinct mental disorder. At present, grief is not recognized as a mental disorder in the DSM-IV or ICD-10. The goal of this study was to determine the psychometric validity of criteria for prolonged grief disorder (PGD to enhance the detection and potential treatment of bereaved individuals at heightened risk of persistent distress and dysfunction.A total of 291 bereaved respondents were interviewed three times, grouped as 0-6, 6-12, and 12-24 mo post-loss. Item response theory (IRT analyses derived the most informative, unbiased PGD symptoms. Combinatoric analyses identified the most sensitive and specific PGD algorithm that was then tested to evaluate its psychometric validity. Criteria require reactions to a significant loss that involve the experience of yearning (e.g., physical or emotional suffering as a result of the desired, but unfulfilled, reunion with the deceased and at least five of the following nine symptoms experienced at least daily or to a disabling degree: feeling emotionally numb, stunned, or that life is meaningless; experiencing mistrust; bitterness over the loss; difficulty accepting the loss; identity confusion; avoidance of the reality of the loss; or difficulty moving on with life. Symptoms must be present at sufficiently high levels at least six mo from the death and be associated with functional impairment.The criteria set for PGD appear able to identify bereaved persons at heightened risk for enduring distress and dysfunction. The results support the psychometric validity of the criteria for PGD that we propose for inclusion in DSM-V and ICD-11. Please see later in the article for Editors' Summary.
Paraskevoulakou, Alexia; Vrettou, Kassiani; Pikouli, Katerina; Triantafillou, Evgenia; Lykou, Anastasia; Economou, Marina
Since evaluation regarding the impact of mental illness related internalized stigma is scarce, there is a great need for psychometric instruments which could contribute to understanding its adverse effects among Greek patients with severe mental illness. The Brief Internalized Stigma of Mental Illness (ISMI) scale is one of the most widely used measures designed to assess the subjective experience of stigma related to mental illness. The present study aimed to investigate the psychometric properties of the Greek version of the Brief ISMI scale. In addition to presenting psychometric findings, we explored the relationship of the Greek version of the Brief ISMI subscales with indicators of self-esteem and quality of life. 272 outpatients (108 males, 164 females) meeting the DSM-IV TR criteria for severe mental disorder (schizophrenia, bipolar disorder, major depression) completed the Brief ISMI, the RSES and the WHOQOL-BREF scales. Patients reported age and educational level. A retest was conducted with 124 patients. The Chronbach's alpha coefficient was 0 0.83. The test-retest reliability coefficients varied from 0.81 to 0.91, indicating substantial agreement. The ICC was for the total score 0.83 and for the two factors, 0.69 and 0.77 respectively. Factor analysis provided strong evidence for a two factor model. Factors 1 and 2 were named respectively "how others view me" and "how I view myself". They were negatively correlated with both RSES and WHOQOL-BREF scales, as well as with educational level. Factor 2 was significantly associated with the type of diagnosis. The Greek version of the Brief ISMI scale can be used as a reliable and valid tool for assessing mental illness related internalized stigma among Greek patients with severe mental illness.
de Brouwer, B J M; Kaljouw, M J; Kramer, M; Schmalenberg, C; van Achterberg, T
Translate the Essentials of Magnetism II© (EOMII; Dutch Nurses' Association, Utrecht, The Netherlands) and assess its psychometric properties in a culture different from its origin. The EOMII, developed in the USA, measures the extent to which organizations/units provide healthy, productive and satisfying work environments. As many healthcare organizations are facing difficulties in attracting and retaining staff nurses, the EOMII provides the opportunity to assess the health and effectiveness of work environments. A three-phased (respectively N = 13, N = 74 and N = 2542) combined descriptive and correlational design was undertaken for translation and evaluation validity and psychometric qualities of the EOMII for Dutch hospitals (December 2009-January 2010). We performed forward-backward translation, face and content validation via cross-sectional survey research, and semi-structured interviews on relevance, clarity, and recognizability of instruments' items. Psychometric testing included principal component analysis using varimax rotation, item-total statistics, and reliability in terms of internal consistency (Cronbach's α) for the total scale and its subscales. Face validity was confirmed. Items were recognizable, relevant and clear. Confirmatory factor analysis indicated that five of eight subscales formed clear factors. Three original subscales contained two factors. Item-total correlations ranged from 0.43 to 0.83. One item correlated weakly (0.24) with its subscale. Cronbach's α for the entire scale was 0.92 and ranged from 0.58 to 0.92 for eight subscales. Dutch-translated EOMII (D-EOMII) demonstrated acceptable reliability and validity for assessing hospital staff nurses' work environment. The D-EOMII can be useful and effective in identifying areas in which change is needed for a hospital to pursue an excellent work environment that attracts and retains well-qualified nurses. © 2013 International Council of Nurses.
Hernández-Padilla, José Manuel; Granero-Molina, José; Márquez-Hernández, Verónica V; Suthers, Fiona; Fernández-Sola, Cayetano
Arterial puncture for arterial blood gases (ABG) analysis can be a risky, painful, difficult-to-perform procedure that is often insufficiently practised and generates stress and discomfort amongst patients and healthcare professionals. Self-efficacy is a key component in the acquisition of procedural skills. Therefore, professionals' self-efficacy in arterial puncture should be measured before attempting the procedure on real patients. To develop and psychometrically assess a self-efficacy scale in arterial puncture. An observational cross-sectional design was used in this study. Faculty of Education Sciences, Nursing and Physiotherapy in a higher education institution in the south of Spain. A convenience sample of 342 nursing students entered and completed the study. All participants met the following inclusion criteria: (1) ≥18years old and (2) enrolled in a nursing degree programme during the 2014/2015 academic year. Participants were 74% female (n=254) and their age ranged from 18 to 50, with a mean age of 21.74years (SD=5.14). The Arterial Puncture Self-Efficacy Scale (APSES) was developed and psychometrically tested. Reliability and content validity were studied. Predictive validity and concurrent validity assessed criterion validity. In addition, principal component analysis and known-group analysis evaluated construct validity. Principal component analysis revealed the two-subscale structure of the final 22-item version of the Arterial Puncture Self-Efficacy Scale (APSES). A total Cronbach's alpha coefficient of 0.97 showed its high reliability. The APSES' content validity index was excellent (S-CVI/Ave=0.95). Predictive and concurrent validity analysis demonstrated the good criterion validity of the tool. Supporting the APSES' sensitivity and specificity, known-groups analysis evidenced significant differences (pgood psychometric properties for measuring self-efficacy in arterial puncture for ABG analysis. Copyright © 2016 Elsevier Ltd. All rights
Babusa, Bernadett; Urbán, Róbert; Czeglédi, Edit; Túry, Ferenc
Limited studies have evaluated the psychometric properties of the Muscle Appearance Satisfaction Scale (MASS), a measure of muscle dysmorphia, in different cultures and languages. The aims were to examine the psychometric properties of the Hungarian version of the MASS (MASS-HU), and to investigate its relationship with self-esteem and exercise-related variables. Two independent samples of male weight lifters (ns=289 and 43), and a sample of undergraduates (n=240) completed the MASS, Eating Disorder Inventory, and Rosenberg Self-esteem Scale. Exploratory factor analysis supported the original five-factor structure of the MASS only in the weight lifter sample. The MASS-HU had excellent scale score reliability and good test-retest reliability. The construct validity of the MASS-HU was tested with multivariate regression analyses which indicated an inverse relationship between self-esteem and muscle dysmorphia. The 18-item MASS-HU was found to be a useful measure for the assessment of muscle dysmorphia among male weight lifters. Copyright © 2011 Elsevier Ltd. All rights reserved.
Dijkstra, Boukje A G; Krabbe, Paul F M; Riezebos, Truus G M; van der Staak, Cees P F; De Jong, Cor A J
To evaluate the psychometric properties of the Dutch version of the 16-item Subjective Opiate Withdrawal Scale (SOWS). The SOWS measures withdrawal symptoms at the time of assessment. The Dutch SOWS was repeatedly administered to a sample of 272 opioid-dependent inpatients of four addiction treatment centers during rapid detoxification with or without general anesthesia. Examination of the psychometric properties of the SOWS included exploratory factor analysis, internal consistency, test-retest reliability, and criterion validity. Exploratory factor analysis of the SOWS revealed a general pattern of four factors with three items not always clustered in the same factors at different points of measurement. After excluding these items from factor analysis four factors were identified during detoxification (temperature dysregulation, tractus locomotorius, tractus gastro-intestinalis and facial disinhibition). The 13-item SOWS shows high internal consistency and test-retest reliability and good validity at different stages of withdrawal. The 13-item SOWS is a reliable and valid instrument to assess opioid withdrawal during rapid detoxification. Three items were deleted because their content does not correspond directly with opioid withdrawal symptoms. Copyright (c) 2007 S. Karger AG, Basel.
Basaran Acil, Seher; Dinç, Leyla
To adapt the Nursing Authority and Autonomy Scale (NAAS) into Turkish the Nursing Authority and Autonomy Scale (NAAS) to Turkish and assess its psychometric properties for Turkish nurses and nurse managers. The NAAS is a tool that specifically measures nursing authority and autonomy from the perspectives of nurses and nurse managers. The study sample consisted of 160 nurse managers and 266 staff nurses. Content validity was assessed using expert approval. Construct validity was assessed using confirmatory factor analysis. Internal consistency was assessed using Cronbach's α, and the test-retest reliability was assessed using Pearson's correlation coefficients. The model achieved a good fit. The internal reliability of the NAAS' authority and autonomy in nursing practice and importance of nursing practice subscales were .84. The Cronbach's α of the instrument was .88. The test-retest scores within an interval of 3 weeks were statistically not significant. The Turkish version of the NAAS has good psychometric properties and this scale can be employed to measure nurses' authority and autonomy. Nurse managers and educators should use an appropriate scale such as NAAS in order to assess nurses' clinical authority and autonomy to improve patient outcomes and develop nurses. © 2018 John Wiley & Sons Ltd.
A Farahani, Mansoureh; Emamzadeh Ghasemi, Hormat Sadat; Nikpaima, Nasrin; Fereidooni, Zhila; Rasoli, Maryam
Evaluation of nursing instructors' clinical teaching performance is a prerequisite to the quality assurance of nursing education. One of the most common procedures for this purpose is using student evaluations. This study was to develop and evaluate the psychometric properties of Nursing Instructors' Clinical Teaching Performance Inventory (NICTPI). The primary items of the inventory were generated by reviewing the published literature and the existing questionnaires as well as consulting with the members of the Faculties Evaluation Committee of the study setting. Psychometric properties were assessed by calculating its content validity ratio and index, and test-retest correlation coefficient as well as conducting an exploratory factor analysis and an internal consistency assessment. The content validity ratios and indices of the items were respectively higher than 0.85 and 0.79. The final version of the inventory consisted of 25 items, and in the exploratory factor analysis, items were loaded on three factors which jointly accounting for 72.85% of the total variance. The test-retest correlation coefficient and the Cronbach's alpha of the inventory were 0.93 and 0.973, respectively. The results revealed that the developed inventory is an appropriate, valid, and reliable instrument for evaluating nursing instructors' clinical teaching performance.
Barney, Lisa J; Griffiths, Kathleen M; Christensen, Helen; Jorm, Anthony F
Self-stigma may feature strongly and be detrimental for people with depression, but the understanding of its nature and prevalence is limited by the lack of psychometrically-validated measures. This study aimed to develop and validate a measure of self-stigma about depression. Items assessing self-stigma were developed from focus group discussions, and were tested and refined over three studies using surveys of 408 university students, 330 members of a depression Internet network, and 1312 members of the general Australian public. Evaluation involved item-level and bivariate analyses, and factor analytic procedures. Items performed consistently across the three surveys. The resulting Self-Stigma of Depression Scale (SSDS) comprised 16 items representing subscales of Shame, Self-Blame, Social Inadequacy, and Help-Seeking Inhibition. Construct validity, internal consistency and test-retest reliability were satisfactory. The SSDS distinguishes self-stigma from perceptions of stigma by others, yields in-depth information about self-stigma of depression, and possesses good psychometric properties. It is a promising tool for the measurement of self-stigma and is likely to be useful in further understanding self-stigma and evaluating stigma interventions. Copyright © 2010 John Wiley & Sons, Ltd.
Chen, Kuan-Wei; Lee, Shih-Chieh; Chiang, Hsin-Yu; Syu, Ya-Cing; Yu, Xiao-Xuan; Hsieh, Ching-Lin
Patients with schizophrenia tend to have deficits in advanced Theory of Mind (ToM). The "Reading the mind in the eyes" test (RMET), the Faux Pas Task, and the Strange Stories are commonly used for assessing advanced ToM. However, most of the psychometric properties of these 3 measures in patients with schizophrenia are unknown. The aims of this study were to validate the psychometric properties of the 3 advanced ToM measures in patients with schizophrenia, including: (1) test-retest reliability; (2) random measurement error; (3) practice effect; (4) concurrent validity; and (5) ecological validity. We recruited 53 patients with schizophrenia, who completed the 3 measures twice, 4 weeks apart. The Revised Social Functioning Scale-Taiwan short version (R-SFST) was completed within 3 days of first session of assessments. We found that the intraclass correlation coefficients of the RMET, Strange Stories, and Faux Pas Task were 0.24, 0.5, and 0.76. All 3 advanced ToM measures had large random measurement error, trivial to small practice effects, poor concurrent validity, and low ecological validity. We recommend that the scores of the 3 advanced ToM measures be interpreted with caution because these measures may not provide reliable and valid results on patients' advanced ToM abilities. Copyright © 2017 Elsevier B.V. All rights reserved.
Wang, Wen-Ling; Feng, Jui-Ying; Wang, Chi-Jen; Chen, Jing-Huei
This study aimed to develop a family-centered care survey for Chinese adult intensive care units and to establish the survey's psychometric properties. Family-centered care (FCC) is widely recognized as an ideal model of care. Few studies have explored FCC perceptions among family members of adult critical care patients in Asian countries, and no Chinese FCC measurement has been developed. An English version of the 3-factor family-centered care survey for adult intensive care units (FCCS-AICU) was translated into Chinese using a modified back translation procedure. Based on the literature review, two additional concepts, information and empowerment, were added to the Chinese FCCS-AICU. The psychometric properties of the Chinese FCCS-AICU were determined with 249 family members from a medical center in Taiwan and were tested for construct and convergent validity, and internal consistency. Both the monolingual and bilingual equivalence tests of the English and Chinese versions of the 3-factor FCCS-AICU were supported. Exploratory factor analysis supported the 5-factor structure of the Chinese FCCS-AICU with a total explained variance of 58.34%. The Chinese FCCS-AICU was correlated with the Chinese Critical Care Family Needs Inventory. Internal consistency, determined by Cronbach's α, for the overall scale was .94. The Chinese FCCS-AICU is a valid and reliable tool for measuring perceptions of FCC by family members of adult intensive care patients within Chinese-speaking communities. Copyright © 2015 Elsevier Inc. All rights reserved.
Full Text Available Menopause is not a disease; however the somatic and psychological symptoms that accompany it affect the life of women. Women health questionnaire (WHQ is a self-administered questionnaire that measures the physical and mental health of women ages 40 to 65 years. The purpose of this study is to provide psychometric documentation details of the translation of WHQ into the Persian language. A total of 350 peri and postmenopausal women were recruited from urban health centers in the city of Tabriz, between March and October 2015. The validity of WHQ was assessed using construct and discriminate validity. The reliability of questionnaire was assessed by test retest reliability and measuring internal consistency. The KMO was 0.791, and the Bartlett’s test of Sphericity was significant. Principle component analysis (PCA resulted in 9 factors which explained up to 55.4% of the total variance. Cronbach's coefficient was 0.799 and the Intraclass correlation coefficient (ICC of the Persian translation scale was 0.712. Evaluation of the psychometric properties showed that the Persian language translation of the 36-item version of the WHQ was appropriate when applied to middle aged women
Sami, W.; Ansari, T.; Butt, N. S.; Hamid, M. R. Ab
This research evaluated the psychometric properties of English version of dietary habits questionnaires developed for type 2 diabetic patients. There is scarcity of literature about availability of standardized questionnaires for assessing dietary habits of type 2 diabetics in Saudi Arabia. As dietary habits vary from country to country, therefore, this was an attempt to develop questionnaires that can serve as a baseline. Through intensive literature review, four questionnaires were developed / modified and subsequently tested for psychometric properties. Prior to pilot study, a pre-test was conducted to evaluate the face validity and content validity. The pilot study was conducted from 23 October - 22 November, 2016 to evaluate the questionnaires’ reliability and validity. Systematic random sampling technique was used to collect the data from 132 patients by direct investigation method. Questionnaires assessing diabetes mellitus knowledge (0.891), dietary knowledge (0.869), dietary attitude (0.841) and dietary practices (0.874) had good internal consistency reliability. Factor analysis conducted on dietary attitude questionnaire showed a valid 5 factor solution. Directions of loadings were positive and free from factorial complexity. Relying on the data obtained from type 2 diabetics, these questionnaires can be considered as reliable and valid for the assessment of dietary habits in Saudi Arabia and neighbouring Gulf countries population.
Mahdieh Sadat Khoshouei
Full Text Available Abstract: Liebowitz Social Anxiety Scale (LSAS is an instrument used to evaluate the severity of social phobia. It has been widely used in different contexts and cultures, presenting variable psychometric properties. The aim of this study is to investigate the psychometric properties of LSAS. Method: The sample consisted of 342 students (184 females, 158 males aged 19 - 34(Mean age=21.93, SD=3.44 years; Subjects were selected from eight faculties of Isfahan University, using a random clustering procedure. In order to measure test-retest reliability, LSAS was re-administered to fifty-eight students three weeks after the first session. Results: The method of Principal components analysis (Varimax Normalized Rotation was applied to evaluate structural validity of LSAS. The factor analysis defined 5 factors which covered 84.68% of the total variability of the data. The five factors extracted were: 1 speaking in a group; 2 social interaction in leisure activity; 3 activity in public; 4 attitude of disagreement or disapproval;and 5 social interaction with unknown person. The reliability coefficients (Cronbach alpha, split-half and test- retest were found to be satisfactory for the total scale and its subscales. Conclusions: The scale is valid and reliable for studies that require a standard
Bailet, Laura L.; Zettler-Greeley, Cynthia; Lewis, Kandia
Home literacy activities influence children's emergent literacy progress and readiness for reading instruction. To help parents fulfill this opportunity, we developed a new Emergent Literacy Screener (ELS) and conducted 2 studies of its psychometric properties with independent prekindergarten samples. For Study 1 (n = 812, M[subscript age] = 54.4…
Polyak, Stephen T.; von Davier, Alina A.; Peterschmidt, Kurt
This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses. PMID:29238314
Tate, Kevin A.; Bloom, Margaret L.; Tassara, Marcel H.; Caperton, William
Psychometric instruments have been underutilized by counselor educators in performance assessment and program evaluation efforts. As such, we conducted a review of the literature that revealed 41 instruments fit for such efforts. We described and critiqued these instruments along four dimensions--"Target Domain," "Format,"…
Lee, Ahram; Park, Eun Hye; Byeon, Eunji; Lee, Sang Min
This study describes the development and psychometric properties of the Counseling Supervisor's Behavior Questionnaire, designed to assess the specific behaviors of supervisors, which can be observed by supervisees during supervision sessions. Factor structure, construct and concurrent validity, and internal consistency reliability of the…
Morean, Meghan E.; de Wit, Harriet; King, Andrea C.; Sofuoglu, Mehmet; Rueger, Sandra Y.; O’Malley, Stephanie S.
Rationale The Drug Effects Questionnaire (DEQ) is widely used in studies of acute subjective response (SR) to a variety of substances, but the format of the DEQ varies widely across studies, and details of its psychometric properties are lacking. Thus, the field would benefit from demonstrating the reliability and validity of the DEQ for use across multiple substances. Objective The current study evaluated the psychometric properties of several variations of DEQ items, which assessed the extent to which participants (1) feel any substance effect(s), (2) feel high, (3) like the effects, (4) dislike the effects, and (5) want more of the substance using 100mm Visual Analog Scales. Methods DEQ data from three placebo-controlled studies were analyzed to examine SR to amphetamine, nicotine, and alcohol. We evaluated the internal structure of the DEQ for use with each substance as well as relationships between scale items, measures of similar constructs, and substance-related behaviors. Results Results provided preliminary psychometric support for items assessing each DEQ construct (FEEL, HIGH, DISLIKE, LIKE, and MORE). Conclusions Based on the study results, we identify several common limitations of extant variants of the DEQ and recommend an improved version of the measure. The simplicity and brevity of the DEQ combined with its promising psychometric properties support its use in future SR research across a variety of substances. PMID:23271193
Background: Oral health has an impact on quality of life hence for research purpose validation of a Tamil version of General Oral Health Assessment Index would enable it to be used as a valuable tool among Tamil speaking population. Aim: In this study, we aimed to assess the psychometric properties of translated Tamil ...
Stephen T. Polyak
Full Text Available This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses.
The study aimed to develop the Homophobic Bullying Scale and to investigate its psychometric properties. The items of the Homophobic Bullying Scale were created to measure high school students' bullying behaviors motivated by homophobia, including verbal bullying, relational bullying, physical bullying, property bullying, sexual harassment, and…
Conclusion According to the results, the detection protocol of malingering stuttering is of good internal consistency and concurrent validity. However, considering that the sample population was not large in the present study, it can be said that this study is a preliminary evaluation to find the psychometric features of the instruments, with the aim of laying the groundwork for further studies.
Schwabe, I.; Jonker, Wilfried; Van Den Berg, Stéphanie M.
The Wilson-Patterson conservatism scale was psychometrically evaluated using homogeneity analysis and item response theory models. Results showed that this scale actually measures two different aspects in people: on the one hand people vary in their agreement with either conservative or liberal
Khan, Anwar; Yusoff, Rosman Bin Md.; Khan, Muhammad Muddassar; Yasir, Muhammad; Khan, Faisal
A comprehensive Psychometric Analysis of Rizzo et al.'s (1970) Role Conflict & Ambiguity (RCA) scales were performed after its distribution among 600 academic staff working in six universities of Pakistan. The reliability analysis includes calculation of Cronbach Alpha Coefficients and Inter-Items statistics, whereas validity was determined by…
Bergman, R. Lindsey; Keller, Melody L.; Piacentini, John; Bergman, Andrea J.
Research on selective mutism (SM) has been limited by the absence of standardized, psychometrically sound assessment measures. The purpose of our investigation was to present two studies that examined the factor structure and initial reliability and validity of the Selective Mutism Questionnaire (SMQ), a 17-item parent report measure of failure to…
This study aims to examine the psychometric characteristics of Mooney Problem Checklist (MPCL) items using the Rasch measurement model framework in the context of polytechnics. The MPCL with eleven dimensions was administered to 252 respondents who were selected from seven polytechnic institutions in Malaysia ...
Aim: To determine the psychometric properties of the Multidimensional Anxiety Scale for Children (MASC) in Nairobi public secondary school children, Kenya. Method: Concurrent self-administration of the MASC and Children's Depression Inventory (CDI) to students in Nairobi public secondary schools. Results: The MASC ...
Lamarche, Larkin; Gammage, Kimberley L.; Sullivan, Philip J.; Gabriel, David A.
This study examined the psychometric properties of the Self-Presentational Efficacy Scale (SPES) developed by Gammage, Hall, and Martin Ginis (2004). University students (196 men and 269 women) completed the SPES and measures of social physique anxiety, fear of negative evaluation, and physical activity. Participants also completed the SPES a…
Lowman, Rodney L.; Schurman, Susan J.
The psychometric properties of a revised version of Holland's Vocational Preference Inventory were assessed using federal government employees. Factor analyses, interscale correlations, measures of internal consistency, and criterion group profiles are presented. Evidence was supportive of the validity of the revised form. (Author/BW)
van Kampen, D.
This paper examines the psychometric properties (reliability and factor structure) and validity (relationship with various self-report measures and SPEM dysfunction) of the SSQ or Schizotypic Syndrome Questionnaire, a 108-item inventory for the measurement of 12 prodromal or schizotypic symptoms
Alibhai, Salman; Buehren, Niklas; Coleman, Rachel; Goldstein, Markus; Strobbe, Francesco
This case study tells the story of the evolution of psychometric credit scoring as an innovative solution in a World Bank operation, from its humble beginnings as a small pilot in Ethiopia, to the current movement to replicate its use for similar challenges in countries across the continent in Tanzania, Zimbabwe, Madagascar, and beyond. Fintech is commonly defined as an industry composed ...
Powell, Michael; Newgent, Rebecca A.
This article describes the development and psychometrics of the Juvenile Addiction Risk Rating. The Juvenile Addiction Risk Rating is a brief screening of addiction potential based on 10 risk factors predictive of youth alcohol and drug-related problems that assists examiners in more accurate treatment planning when self-report information is…
Schwabe, Inga; Jonker, Willem; van den Berg, Stéphanie Martine
The Wilson−Patterson conservatism scale was psychometrically evaluated using homogeneity analysis and item response theory models. Results showed that this scale actually measures two different aspects in people: on the one hand people vary in their agreement with either conservative or liberal
Evaluation of Psychometric Properties of the Malay Version Perceived Stress Scale in Two Occupational Settings In Malaysia. ... Statistical analysis was carried out using statistical package for the social sciences version 16 (SPSS, Chicago, IL, USA) software. Results: Analysis yielded two factor structure of the Malay version ...
Swami, Viren; Chamorro-Premuzic, Tomas
The Satisfaction With Life Scale (SWLS) is one of the most widely used scales for the measurement of subjective well-being across the globe, but no satisfactory version exists for use among Malay-speaking populations. The present study reports on the translation of a new Malay SWLS and examines its psychometric properties in a community sample of…
Robinson, Carrie H.; Betz, Nancy E.
This study describes the psychometric evaluation of Super's Work Values Inventory--Revised (SWVI-R), an instrument comprised of 12 scales measuring the relative importance placed on the following work-related value dimensions: Achievement, Coworkers, Creativity, Income, Independence, Lifestyle, Mental Challenge, Prestige, Security, Supervision,…
of a Tamil version of General Oral Health Assessment Index would enable it to be used as a valuable ... psychometric properties, so that it can be used as an efficient tool in identifying the impact of oral ... affects a person physically and psychologically thereby .... questions 3, 5, and 7 were negatively rephrased and choices.
Clerkin, Suzanne M.; Marks, David J.; Policaro, Katia L.; Halperin, Jeffrey M.
The psychometric properties of the Alabama Parenting Questionnaire-Preschool Revision (APQ-PR) were explored in a sample of hyperactive-inattentive preschool children (N = 47) and nonimpaired controls (N = 113). A subset of parents completed the questionnaire on 2 occasions, approximately 1 year apart. Factor analysis revealed a 3-factor solution,…
Background: Due to the socio-cultural characteristics of Iranian adult men and lack of standardized questionnaires to assess their reproductive health associated with sexually transmitted diseases and HIV / AIDS, this study is done with the goal of development and psychometrics of a valid relevant instrument. Method: A ...
Tolar, Tammy D.; Barth, Amy E.; Francis, David J.; Fletcher, Jack M.; Stuebing, Karla K.; Vaughn, Sharon
Maze tasks have appealing properties as progress-monitoring tools, but there is a need for a thorough examination of the psychometric properties of Maze tasks among middle school students. We evaluated form effects, reliability, validity, and practice effects of Maze among students in Grades 6 through 8. We administered the same (familiar) and…
Polyak, Stephen T; von Davier, Alina A; Peterschmidt, Kurt
This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses.
Bond, Frank W; Hayes, Steven C; Baer, Ruth A; Carpenter, Kenneth M; Guenole, Nigel; Orcutt, Holly K; Waltz, Tom; Zettle, Robert D
The present research describes the development and psychometric evaluation of a second version of the Acceptance and Action Questionnaire (AAQ-II), which assesses the construct referred to as, variously, acceptance, experiential avoidance, and psychological inflexibility. Results from 2,816 participants across six samples indicate the satisfactory structure, reliability, and validity of this measure. For example, the mean alpha coefficient is .84 (.78-.88), and the 3- and 12-month test-retest reliability is .81 and .79, respectively. Results indicate that AAQ-II scores concurrently, longitudinally, and incrementally predict a range of outcomes, from mental health to work absence rates, that are consistent with its underlying theory. The AAQ-II also demonstrates appropriate discriminant validity. The AAQ-II appears to measure the same concept as the AAQ-I (r=.97) but with better psychometric consistency. Copyright © 2011. Published by Elsevier Ltd.
Pogorzelska-Maziarz, Monika; Nembhard, Ingrid M; Schnall, Rebecca; Nelson, Shanelle; Stone, Patricia W
In recent years, there has been increased interest in measuring the climate for infection prevention; however, reliable and valid instruments are lacking. This study tested the psychometric properties of the Leading a Culture of Quality for Infection Prevention (LCQ-IP) instrument measuring the infection prevention climate in a sample of 972 infection preventionists from acute care hospitals. An exploratory principal component analysis showed that the instrument had structural validity and captured 4 factors related to the climate for infection prevention: Psychological Safety, Prioritization of Quality, Supportive Work Environment, and Improvement Orientation. LCQ-IP exhibited excellent internal consistency, with a Cronbach α of .926. Criterion validity was supported with overall LCQ-IP scores, increasing with the number of evidence-based prevention policies in place (P = .047). This psychometrically sound instrument may be helpful to researchers and providers in assessing climate for quality related to infection prevention. © The Author(s) 2015.
Alleva, Jessica M; Tylka, Tracy L; Kroon Van Diest, Ashley M
Body functionality has been identified as an important dimension of body image that has the potential to be useful in the prevention and treatment of negative body image and in the enhancement of positive body image. Specifically, cultivating appreciation of body functionality may offset appearance concerns. However, a scale assessing this construct has yet to be developed. Therefore, we developed the Functionality Appreciation Scale (FAS) and examined its psychometric properties among three online community samples totalling 1042 women and men (ns=490 and 552, respectively). Exploratory factor analyses revealed a unidimensional structure with seven items. Confirmatory factor analysis upheld its unidimensionality and invariance across gender. The internal consistency, test-retest reliability, criterion-related, and construct (convergent, discriminant, incremental) validity of its scores were upheld. The FAS is a psychometrically sound measure that is unique from existing positive body image measures. Scholars will find the FAS applicable within research and clinical settings. Copyright © 2017 Elsevier Ltd. All rights reserved.
Cano, Stefan J; Posner, Holly B; Moline, Margaret L; Hurt, Stephen W; Swartz, Jina; Hsu, Tim; Hobart, Jeremy C
The Alzheimer's Disease Assessment Scale Cognitive Behavior Section (ADAS-cog), a measure of cognitive performance, has been used widely in Alzheimer's disease trials. Its key role in clinical trials should be supported by evidence that it is both clinically meaningful and scientifically sound. Its conceptual and neuropsychological underpinnings are well-considered, but its performance as an instrument of measurement has received less attention. Objective To examine the traditional psychometric properties of the ADAS-cog in a large sample of people with Alzheimer's disease. Data from three clinical trials of donepezil (Aricept) in mild-to-moderate Alzheimer's disease (n=1421; MMSE 10-26) were analysed at both the scale and component level. Five psychometric properties were examined using traditional psychometric methods. These methods of examination underpin upcoming Food and Drug Administration recommendations for patient rating scale evaluation. At the scale-level, criteria tested for data completeness, scaling assumptions (eg, component total correlations: 0.39-0.67), targeting (no floor or ceiling effects), reliability (eg, Cronbach's α: = 0.84; test-retest intraclass correlations: 0.93) and validity (correlation with MMSE: -0.63) were satisfied. At the component level, 7 of 11 ADAS-cog components had substantial ceiling effects (range 40-64%). Performance was satisfactory at the scale level, but most ADAS-cog components were too easy for many patients in this sample and did not reflect the expected depth and range of cognitive performance. The clinical implication of this finding is that the ADAS-cog's estimate of cognitive ability, and its potential ability to detect differences in cognitive performance under treatment, could be improved. However, because of the limitations of traditional psychometric methods, further evaluations would be desirable using additional rating scale analysis techniques to pinpoint specific improvements.
Pedro Henrique Berbert de Carvalho
Full Text Available Background The study of male body image has increased substantially, but there are few assessment tools available for this population. The Male Body Dissatisfaction Scale (MBDS has been widely used among students to research body image disturbances and eating disorders. However, the psychometric properties of this instrument have not been tested in the Brazilian context.Objectives To explore the psychometric properties (convergent validity, internal consistency, test-retest reliability and factor structure of the Brazilian version of the MBDS.Methods Two-hundred sixty-four undergraduate students were evaluated. Pearson’s correlation was used to test the convergent validity of the MBDS and the Drive for Muscularity Scale, the Swansea Muscularity Attitudes Questionnaire, the Rosenberg Self-Esteem Scale, the Beck Depression Inventory, the Eating Attitudes Test-26, and the Commitment to Exercise Scale. Test-retest reliability was evaluated using t-tests for repeated measures and by calculating the coefficient of intraclass correlation. Exploratory factor analysis was conducted, and Cronbach’s α coefficients were determined. A significance level of 5% was adopted.Results The MBDS had an adequate factor structure, with two factors explaining 52.67% of the total variance. It showed excellent internal consistency (Cronbach’s α between 0.90 and 0.92, a high intraclass correlation coefficient (0.81, and convergent validity with the drive for muscularity, the psychological commitment to exercise, low self-esteem, and eating disorder risk behaviour measures.Discussion The MBDS appears to be a valid and reliable tool for evaluating Brazilian male body image dissatisfaction.
Abdolalizadeh, M; Arastoo, A A; Ghsemzadeh, R; Montazeri, A; Ahmadi, K; Azizi, A
This study was carried out to evaluate the psychometric properties of an Iranian translation of the Work Ability Index (WAI) questionnaire. In this methodological study, nurses and healthcare workers aged 40 years and older who worked in educational hospitals in Ahvaz (236 workers) in 2010, completed the questionnaire and 60 of the workers filled out the WAI questionnaire for the second time to ensure test-retest reliability. Forward-backward method was applied to translate the questionnaire from English into Persian. The psychometric properties of the Iranian translation of the WAI were assessed using the fallowing tests: Internal consistency (to test reliability), test-retest analysis, exploratory factor analysis (construct validity), discriminate validity by comparing the mean WAI score in two groups of the employees that had different levels of sick leave, criterion validity by determining the correlation between the Persian version of short form health survey (SF-36) and WAI score. Cronbach's alpha coefficient was estimated to be 0.79 and it was concluded that the internal consistency was high enough. The intraclass correlation coefficient was recognized to be 0.92. Factor analysis indicated three factors in the structure of the work ability including self-perceived work ability (24.5% of the variance), mental resources (22.23% of the variance), and presence of disease and health related limitation (18.55% of the variance). Statistical tests showed that this questionnaire was capable of discriminating two groups of employees who had different levels of sick leave. Criterion validity analysis showed that this instrument and all dimensions of the Iranian version of SF-36 were correlated significantly. Item correlation corrective for overlap showed the items tests had a good correlation except for one. The finding of the study showed that the Iranian version of the WAI is a reliable and valid measure of work ability and can be used both in research and practical
Classen, Sherrilene; Wen, Pey-Shan; Velozo, Craig A; Bédard, Michel; Winter, Sandra M; Brumback, Babette; Lanford, Desiree N
We investigated the psychometric properties of the 68-item Safe Driving Behavior Measure (SDBM) with 80 older drivers, 80 caregivers, and 2 evaluators from two sites. Using Rasch analysis, we examined unidimensionality and local dependence; rating scale; item- and person-level psychometrics; and item hierarchy of older drivers, caregivers, and driving evaluators who had completed the SDBM. The evidence suggested the SDBM is unidimensional, but pairs of items showed local dependency. Across the three rater groups, the data showed good person (≥3.4) and item (≥3.6) separation as well as good person (≥.93) and item reliability (≥.92). Cronbach's α was ≥.96, and few items were misfitting. Some of the items did not follow the hypothesized order of item difficulty. The SDBM classified the older drivers into six ability levels, but to fully calibrate the instrument it must be refined in terms of its items (e.g., item exclusion) and then tested among participants of lesser ability. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Larissa Forni dos Santos
Full Text Available Social Anxiety Disorder (SAD is prevalent and rarely diagnosed due to the difficulty in recognizing its symptoms as belonging to a disorder. Therefore, the evaluation/screening scales are of great importance for its detection, with the most used being the Liebowitz Social Anxiety Scale (LSAS. Thus, this study proposed to evaluate the psychometric properties of internal consistency and convergent validity, as well as the confirmatory factorial analysis and reliability of the self-reported version of the LSAS (LSAS-SR, translated into Brazilian Portuguese, in a sample of the general population (N = 413 and in a SAD clinical sample (N = 252. The convergent validity with specific scales for the evaluation of SAD and a general anxiety scale presented correlations ranging from 0.21 to 0.84. The confirmatory factorial analysis did not replicate the previously indicated findings of the literature, with the difficulty being in obtaining a consensus factorial structure common to the diverse cultures in which the instrument was studied. The LSAS-SR presented excellent internal consistency (α = 0.90-0.96 and test-retest reliability (Intraclass Correlation Coefficient = 0.81; Pearson's = 0.82. The present findings support those of international studies that attest to the excellent psychometric properties of the LSAS-SR, endorsing its status as the gold standard.
Two separate paths to the concept of intelligence are discussed: the psychometric path being concerned with the measurement of intelligence, involving the methodology of norm-referenced testing; the path followed by Piaget, and others, addresses from the start the related question of how intelligence can be described, and employs a criterion-referenced methodology. The achievements of psychometrics are briefly described, with an argument that they now remain important tools of what Kuhn called 'normal science'. The criterion-referenced approach of Piaget and others is described, with evidence from intervention studies that the Genevan descriptions of children-in-action have allowed the choice of contexts within which children can profitably be challenged to go further in their thinking. Hence, Genevan psychology is also now a part of the normal science with important uses, shown both in neo-Piagetian studies and further research stemming from Geneva. Discussion of the 'Flynn effect' sheds light on both paths, with problems still unresolved. The argument is then developed that the relevance of neuroscience needs to be discussed to try to decide in what ways it may provide useful insights into intelligence.
Hultell, Daniel; Gustavsson, J Petter
Burnout and work engagement are generally defined as psychological states but the methods used to measure these constructs are more in line with methods used to assess psychological traits. Thus, a new instrument called the Scale of Work Engagement and Burnout (SWEBO) measuring the state mood of burnout and work engagement was developed during the fall of 2007. The purpose of the present study was to evaluate the SWEBO using psychometrical methods. The sample consisted of 2,266 newly graduated Swedish nurses and teachers. Measurement models of both burnout and work engagement were evaluated using confirmatory factor analysis (CFA). Both burnout and work engagement were also tested for measurement invariance across occupation and age. The fit of the measurement model of burnout was satisfactory and it was invariant across both occupation and age. The measurement model of work engagement as initially defined did not fit the data satisfactorily. The model was therefore revised and reanalyzed. The revised model had a satisfactory fit and was invariant across occupation. Analysis of its invariance across age, however, gave ambiguous results that were difficult to interpret. The SWEBO presents a psychometrically sound alternative for measuring burnout and work engagement.
Cruz, Jonas Preposi; Albaqawi, Hamdan Mohammad; Alharbi, Sami Melbes; Alicante, Jerico G; Vitorino, Luciano M; Abunab, Hamzeh Y
To assess the psychometric properties of the Spiritual Climate Scale Arabic version for Saudi nurses. Evidence showed that a high level of spiritual climate in the workplace is associated with increased productivity and performance, enhanced emotional intelligence, organisational commitment and job satisfaction among nurses. A convenient sample of 165 Saudi nurses was surveyed in this descriptive, cross-sectional study. Cronbach's α and intraclass correlation coefficient of the 2 week test-retest scores were computed to establish reliability. Exploratory factor analysis was performed to support the validity of the Spiritual Climate Scale Arabic version. The Spiritual Climate Scale Arabic version manifested excellent content validity. Exploratory factor analysis supported a single factor with an explained variance of 73.2%. The Cronbach's α values of the scale ranged from .79 to .88, while the intraclass correlation coefficient value was .90. The perceived spiritual climate was associated with the respondents' hospital, gender, age and years of experience. Findings of this study support the sound psychometric properties of the Spiritual Climate Scale Arabic version. The Spiritual Climate Scale Arabic version can be used by nurse managers to assess the nurses' perception of the spiritual climate in any clinical area. This process can lead to spiritually centred interventions, thereby ensuring a clinical climate that accepts and respects different spiritual beliefs and practices. © 2017 John Wiley & Sons Ltd.
Full Text Available Mita et al. (2010 devised a technique of comparing a visual acuity (VA change in an individual with more accurate VA than conventional VA tests by significant difference examined logarithmic (Log VA ± standard deviation (SD. Using this technique, in this study, we examined a relation between VA and the slope of the psychometric function in normal young subjects. Six occlusion foil conditions were employed (1.0, 0.8, 0.6, 0.4, 0.1 and without the foil under a full refractive correction. Ten normal young adults (22.8 years old on average who have no ophthalmologic disease except ametropia participated in the measurement. The experiment was carried out with the constant method, a series of ten Landolt rings were used and each ring was presented 20 times randomly in a measurement. A 5.6-inch type of liquid crystal display driven by a computer, which has 1,280×800 pixels spatial resolution, was used to present the stimulus. In the normal young adults, the slope of the psychometric function did not change as the VA change systematically, and there was almost no correlation between them (r = −0.103.
Research purpose: The objectives of this study were to investigate the internal validity (construct, discriminant and convergent validity, reliability and external validity (relationship with theoretically relevant variables, including job characteristics, home characteristics, burnout, ill health and life satisfaction of the instrument. Motivation for the study: Work-family interaction is a key topic receiving significant research attention. In order to facilitate comparison across work-family studies, the use of psychometrically sound instruments is of great importance. Research design, approach and method: A cross-sectional survey design was used for the target population of married employees with children working at a tertiary institution in the North West province (n = 366. In addition to the new instrument, job characteristics, home characteristics, burnout, ill health and life satisfaction were measured. Main findings: The results provided evidence for construct, discriminant and convergent validity, reliability and significant relations with external variables. Practical/managerial implications: The new instrument can be used by researchers and managers as a test under development to investigate the interference between work and different nonwork roles (i.e. parental role, spousal role, work role, domestic role and specific relations with antecedents (e.g. job/home characteristics and well-being (e.g. burnout, ill health and life satisfaction. Contribution/value-add: This study provides preliminary information on the psychometric properties of a new instrument that measures the interference between work and nonwork.
Full Text Available Background: The Daily Spiritual Experience Scale (DSES has been developed through extensive and qualitative research. Numerous studies have confirmed the reliability and validity of the DSES among different populations. Most of the studies have shown association of the DSES with physical and psychological well-being. Purpose: The current study aimed to evaluate the psychometric properties of the DSES in the Croatian population. Method: The 16-item scale was translated through standard translation/back-translation procedures. The scale was afterwards applied to a sample of 535 test subjects (49% men and 51% women, mean age 42.6 years. Results: The coefficient of reliability (Cronbach alpha = 0.945 is very high. The coefficients of discriminant validity were satisfactory for 15 items, whereas only one item (14 has a coefficient of less than 0.30. The factor analysis after oblique rotation resulted in two related factors: the relationship with God and relationship with others. Using these two factors explained the 66.1% of the variance. Conclusion: Based on the data, it can be concluded that DSES has satisfactory psychometric characteristics and can be applied to the Croatian population, but its correlation with other religious and non-religious constructs should be verified in further research.
Castillo, Isabel; Tomás, Inés; Balaguer, Isabel
The Subjective Vitality Scale (SVS) assess the subjective experience of being full of energy and alive, a clinically relevant outcome measure of positive psychological well-being. The purpose of this paper was to translate the 7-item SVS into Spanish and examine its psychometric properties. In Study 1 (n = 790 adolescents) and Study 2 (n = 130 athletes) reliability and exploratory factor analysis (EFA) were carried out. In Study 1 and Study 3 (n = 197 dancers) evidence of validity of inferences based on SVS scores estimating relationships with other variables (life satisfaction, global self-esteem and emotional and physical exhaustion) was obtained. In Study 2 invariance across time was tested. Finally in Study 3, the factorial structure was cross-validated using confirmatory factor analysis (CFA). Results of EFA showed a one-factor solution. CFA also supported a unidimensional factor structure for the Spanish 6-item SVS (RMSEA = .050 (90% CI = .00, .080); NNFI = .993; CFI = .996). Reliability analysis indicated a strong internal consistency in all study samples (α ranged from .82 to .89). Further, results from multi-sample analysis supported the replicability of SVS factor structure across time. Finally, the SVS scores showed the expected correlations patterns (all them significant, p < .01) with the measured outcomes. In conclusion, the Spanish version of the SVS demonstrated adequate psychometric properties, indicating that the scale can be confidently used to measure the experience of possessing energy and aliveness; furthermore, differences across time can be meaningfully carried out.
Morlock Robert J
Full Text Available Abstract Background Fast-acting medications for the management of anxiety are important to patients and society. Measuring early onset, however, requires a sensitive and clinically responsive tool. This study evaluates the psychometric properties of a patient-reported Global Anxiety - Visual Analog Scale (GA-VAS. Methods Data from a double-blind, randomized, placebo-controlled study of lorazepam and paroxetine in patients with Generalized Anxiety Disorder were analyzed to assess the reliability, validity, responsiveness, and utility of the GA-VAS. The GA-VAS was completed at clinic visits and at home during the first week of treatment. Targeted psychometric analyses—test-retest reliabilities, validity correlations, responsiveness statistics, and minimum important differences—were conducted. Results The GA-VAS correlates well with other anxiety measures, at Week 4, r = 0.60 (p r = 0.74 (p p p p Conclusions The GA-VAS is capable of validly and effectively capturing a reduction in anxiety as quickly as 24 hours post-dose.
Schatz, Michael; Zeiger, Robert S; Yang, Su-Jau; Chen, Wansu; Kosinski, Mark
The Asthma Impact Survey (AIS-6) is a brief disease-specific quality-of-life instrument with limited published validation data. To obtain additional validation data and psychometric properties of the AIS-6. In November, 2007, patients with persistent asthma were mailed a survey that included the AIS-6, the mini-Asthma Quality of Life Questionnaire (mAQLQ), and the Asthma Control Test (ACT). Follow-up surveys were sent in April, July, and October 2008. Year 2008 exacerbations and short-acting β-agonist (SABA) dispensings were captured from administrative data. A total of 2680 patients had complete baseline survey data. Criterion validity was demonstrated by the strong correlations of the AIS-6 with the mAQLQ (r = -0.84 to -0.86); construct validity by significant relationships (P validity by significant relationships (P reliability (intraclass correlation coefficient = 0.86-0.91) were also demonstrated. The AIS-6 demonstrated good psychometric properties in a large independent sample and could be used to assess asthma-specific quality of life in clinical practice and clinical research. Copyright © 2011 American Academy of Allergy, Asthma & Immunology. Published by Mosby, Inc. All rights reserved.
Mak, Kwok-Kei; Lai, Ching-Man; Ko, Chih-Hung; Chou, Chien; Kim, Dong-Il; Watanabe, Hiroko; Ho, Roger C M
The Revised Chen Internet Addiction Scale (CIAS-R) was developed to assess Internet addiction in Chinese populations, but its psychometric properties in adolescents have not been examined. This study aimed to evaluate the factor structure and psychometric properties of CIAS-R in Hong Kong Chinese adolescents. 860 Grade 7 to 13 students (38 % boys) completed the CIAS-R, the Young's Internet Addiction Test (IAT), and the Health of the Nation Outcome Scales for Children and Adolescents (HoNOSCA) in a survey. The prevalence of Internet addiction as assessed by CIAS-R was 18 %. High internal consistency and inter-item correlations were reported for the CIAS-R. Results from the confirmatory factor analysis suggested a four-factor structure of Compulsive Use and Withdrawal, Tolerance, Interpersonal and Health-related Problems, and Time Management Problems. Moreover, results of hierarchical multiple regression supported the incremental validity of the CIAS-R to predict mental health outcomes beyond the effects of demographic differences and self-reported time spent online. The CIAS is a reliable and valid measure of internet addiction problems in Hong Kong adolescents. Future study is warranted to validate the cutoffs of the CIAS-R for identification of adolescents with Internet use problems who may have mental health needs.
Bourke-Taylor, Helen; Pallant, Julie; Cordier, Reinie
In this article, we evaluate psychometric properties of the Child's Challenging Behaviour Scale, Version 2 (CCBS-2) with mothers of young, typically developing children. A cross-sectional mail survey with Australian mothers (N = 337) included the CCBS-2, the Depression Anxiety Stress Scales, and the Parents' Evaluation of Developmental Status scale. Internal consistency was good, and no gender differences in CCBS-2 scores were significant. Significant results included differences between CCBS-2 scores: among children grouped according to age, among children grouped according to pre- and post-school entry, among mothers grouped according to extent of any symptom type, and between this sample and a previously collected age-matched sample of children with disabilities. Of the properties tested, results support sound psychometrics. The CCBS-2 can be used to differentiate children according to age, school entry, and disability as well as to identify families for potential services in behavior management and mental health. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Schaefer, Rafaela; Zoboli, Elma Lcp; Vieira, Margarida M
Moral distress is a kind of suffering that nurses may experience when they act in ways that are considered inconsistent with moral values, leading to a perceived compromise of moral integrity. Consequences are mostly negative and include physical and psychological symptoms, in addition to organizational implications. To psychometrically test the Moral Distress Risk Scale. A methodological study was realized. Data were submitted to exploratory factorial analysis through the SPSS statistical program. Participants and research context: In total, 268 nurses from hospitals and primary healthcare settings participated in this research during the period of March to June of 2016. Ethical considerations: This research has ethics committee approval. The Moral Distress Risk Scale is composed of 7 factors and 30 items; it shows evidence of acceptable reliability and validity with a Cronbach's α = 0.913, a total variance explained of 59%, a Kaiser-Meyer-Olkin = 0.896, and a significant Bartlett <0.001. Concerns about moral distress should be beyond acute care settings, and a tool to help clarify critical points in other healthcare contexts may add value to moral distress speech. Psychometric results reveal that the Moral Distress Risk Scale can be applied in different healthcare contexts.
Martínez-Rodríguez, Silvia; Iraurgi, Ioseba; Gómez-Marroquin, Ignacio; Carrasco, María; Ortiz-Marqués, Nuria; Stevens, Alan B
Despite evidence of the numerous benefits of leisure to health and well-being appropriate tools to assess this construct are lacking. The purpose of this work was to analyse the psychometric properties of the Spanish version of the Leisure Time Satisfaction (LTS). The sample was made up of 1048 primary family caregivers of dependent people. Scale structure was subjected to exploratory and confirmatory factor analysis. Concurrent and convergent validity were assessed by correlation with validated questionnaires for measuring burden (Zarit Burden Inventory - ZBI) and health (SF-36 Health Survey). The results show a high level of internal consistency (Cronbach’s alpha = .938) suitable fit of the dimensional model tested via confirmatory factor analysis (GFI = .925, BBNNFI= .996; IFI= .998, RMSEA= .043), and appropriate convergent validity with similar constructs (r = -.44 with ZBI; and r-values between .226 and .440 with SF-36 dimensions). Psychometric results obtained from the LTS are promising and the results enable us to draw the conclusion that it is a suitable tool for assessing caregivers’ leisure time satisfaction.
Mario Alberto Trogolo
Full Text Available The purpose of this study was to translate and examine the psychometric properties of a driving self-efficacy scale developed by Dorn and Machin (2004. The factor structure, reliability and external validity of the scale were examined in a sample of 447 drivers from Cordoba, Argentina. In addition, measurement invariance across sex was also tested. Results from a confirmatory factor analysis support the unidimensional structure of the scale and the invariance of its parameters (configural, metric and scalar between men and women. Reliability analyses using alpha and omega coefficients revealed high internal consistency (coefficients equal to 0.81 in both cases and satisfactory evidence of external validity of the scale scores, with measures of risk perception, risky driving, history of traffic crashes and fines. Finally, results also showed that the scale seems to be relatively robust against response biases due to social desirability. In summary, findings support the validity and reliability of the scale in Argentina. However, further studies analyzing additional psychometric properties are needed.
Seyyed Jalal Sadrosadat
Full Text Available Objective: SNAP-IV rating scale to diagnosis Attention Deficit Hyperactivity Disorder (ADHD developed by Swanson, Nolan and Pelham. The aim of this study is determination of psychometrics specifications of this scale. Materials & Methods: This Descriptive research is a methodological, applied and validity assessment study. One thousand students at 7 to 12 age of primary school in Tehran city were selected by cluster sampling. Then the students mothers was asked to complete rating scale to consider behavior of their children.30 staff members of sample group were retest after one mounts. Diagnostic interview was administered at 36 members of sample group. Data were analyzed by using pearsonian correlation coefficient, Kolmogorof – Smirnoff and Behrens – Fisher T test. Results: Criterion validity was 48%, factor analysis was detected 3 factors that explain 56% of the total variance. Reliability coefficient was 82% . internal consistency coefficient was 90% and split –half coefficient was 76%, Cut-off point in scale and subscales was 1.57,1.47 and 1.9 respectively. Conclusion: The SNAP-IV Rating scales have fit psychometrics specifications. Therefore, it is useable in various diagnostic and therapeutic conditioning.
Champagne, Alexandra; Landreville, Philippe; Gosselin, Patrick; Carmichael, Pierre-Hugues
The Geriatric Anxiety Inventory (GAI) and a short form of this instrument (GAI-SF) were developed to assess the severity of anxiety symptoms in older adults in order to compensate for the lack of validated screening tools adapted to the elderly population. This study examined the psychometric properties of the French Canadian version of the GAI, in its complete (GAI-FC) and short form (GAI-FC-SF). A total of 331 community-dwelling seniors between 65 and 92 years old participated in this study. Both the GAI-FC and the GAI-FC-SF have sound psychometric properties with, respectively, a high internal consistency (α = .94 and .83), an adequate convergent validity (r = .50 to .86 with instruments known to evaluate constructs similar to the GAI or related to anxiety), a good test-retest reliability (r = .89 and .85), in addition to a single-factor structure. The results support the use of both the GAI-FC and the GAI-FC-SF. The GAI-FC-SF seems to be an interesting alternative to the GAI-FC as a screening tool when time available for assessment is limited.
van der Maas, H.L.J.; Wagenmakers, E.J.
This study introduces the Amsterdam Chess Test (ACT). The ACT measures chess playing proficiency through 5 tasks: a choose-a-move task (comprising two parallel tests), a motivation questionnaire, a predict-a-move task, a verbal knowledge questionnaire, and a recall task. The validity of these tasks
Full Text Available Objectives. The aims of this study were to perform a cultural translation of the DMSES and evaluate the psychometric properties of the translated scale in a Korean population with type 2 diabetics. Methods. This study was conducted in patients with diabetes recruited from university hospitals. The first stage of this study involved translating the DMSES into Korean using a forward- and backward-translation technique. The content validity was assessed by an expert group. In the second stage, the psychometric properties of the Korean version of the DMSES (K-DMSES were evaluated. Results. The content validity of the K-DMSES was satisfactory. Sixteen-items clustered into four-subscales were extracted by exploratory factor analysis, and supported by confirmatory factor analysis. The construct validity of the K-DMSES with the Summary of Diabetes Self-Care Activities scale was satisfactory (r=0.50, P<0.001. The Cronbach’s alpha and intraclass correlation coefficient were 0.92 and 0.85 (P<0.001; 95% CI=0.75–0.91, respectively, which indicate excellent internal consistency reliability and test-retest reliability. Conclusions. The K-DMSES is a brief instrument that has demonstrated good psychometric properties. It is therefore feasible to use in practice, and is ready for use in clinical research involving Korean patients with type 2 diabetes.
Al-Dajani, Nadia; Gralnick, Tara M; Bagby, R Michael
The paradigm of personality psychopathology is shifting from one that is purely categorical in nature to one grounded in dimensional individual differences. Section III (Emerging Measures and Models) of the Diagnostic and Statistical Manual of Mental Disorders (5th ed. [DSM-5]; American Psychiatric Association, 2013), for example, includes a hybrid categorical/dimensional model of personality disorder classification. To inform the hybrid model, the DSM-5 Personality and Personality Disorders Work Group developed a self-report instrument to assess pathological personality traits-the Personality Inventory for the DSM-5 (PID-5). Since its recent introduction, 30 papers (39 samples) have been published examining various aspects of its psychometric properties. In this article, we review the psychometric characteristics of the PID-5 using the Standards for Educational and Psychological Testing as our framework. The PID-5 demonstrates adequate psychometric properties, including a replicable factor structure, convergence with existing personality instruments, and expected associations with broadly conceptualized clinical constructs. More research is needed with specific consideration to clinical utility, additional forms of reliability and validity, relations with psychopathological personality traits using clinical samples, alternative methods of criterion validation, effective employment of cut scores, and the inclusion of validity scales to propel this movement forward.
Tomás, José M; Galiana, Laura; Fernández, Irene
The aim of current research is to analyze the psychometric properties of the Spanish version of the SF-8, overcoming previous shortcomings. A double line of analyses was used: competitive structural equations models to establish factorial validity, and Item Response theory to analyze item psychometric characteristics and information. 593 people aged 60 years or older, attending long life learning programs at the University were surveyed. Their age ranged from 60 to 92 years old. 67.6% were women. The survey included scales on personality dimensions, attitudes, perceptions, and behaviors related to aging. Competitive confirmatory models pointed out two-factors (physical and mental health) as the best representation of the data: χ2(13) = 72.37 (p < .01); CFI = .99; TLI = .98; RMSEA = .08 (.06, .10). Item 5 was removed because of unreliability and cross-loading. Graded response models showed appropriate fit for two-parameter logistic model both the physical and the mental dimensions. Item Information Curves and Test Information Functions pointed out that the SF-8 was more informative for low levels of health. The Spanish SF-8 has adequate psychometric properties, being better represented by two dimensions, once Item 5 is removed. Gathering evidence on patient-reported outcome measures is of crucial importance, as this type of measurement instruments are increasingly used in clinical arena.
Brandão, Tânia; Schulz, Marc S; Gross, James J; Matos, Paula Mena
Emotion regulation is thought to play an important role in adaptation to cancer. However, the emotion regulation questionnaire (ERQ), a widely used instrument to assess emotion regulation, has not yet been validated in this context. This study addresses this gap by examining the psychometric properties of the ERQ in a sample of Portuguese women with cancer. The ERQ was administered to 204 women with cancer (mean age = 48.89 years, SD = 7.55). Confirmatory factor analysis and item response theory analysis were used to examine psychometric properties of the ERQ. Confirmatory factor analysis confirmed the 2-factor solution proposed by the original authors (expressive suppression and cognitive reappraisal). This solution was invariant across age and type of cancer. Item response theory analyses showed that all items were moderately to highly discriminant and that items are better suited for identifying moderate levels of expressive suppression and cognitive reappraisal. Support was found for the internal consistency and test-retest reliability of the ERQ. The pattern of relationships with emotional control, alexithymia, emotional self-efficacy, attachment, and quality of life provided evidence of the convergent and concurrent validity for both dimensions of the ERQ. Overall, the ERQ is a psychometrically sound approach for assessing emotion regulation strategies in the oncological context. Clinical implications are discussed. Copyright © 2016 John Wiley & Sons, Ltd.
Strauss, Gregory P; Gold, James M
In 2005, the National Institute of Mental Health held a consensus development conference on negative symptoms of schizophrenia. Among the important conclusions of this meeting were that there are at least 5 commonly accepted domains of negative symptoms (blunted affect, alogia, avolition, anhedonia, asociality) and that new rating scales were needed to adequately assess these constructs. Two next-generation negative symptom scales resulted from this meeting: the Brief Negative Symptom Scale (BNSS) and Clinical Assessment Interview for Negative Symptoms (CAINS). Both measures are becoming widely used and studies have demonstrated good psychometric properties for each scale. The current study provides the first direct psychometric comparison of these scales. Participants included 65 outpatients diagnosed with schizophrenia or schizoaffective disorder who completed clinical interviews, questionnaires, and neuropsychological testing. Separate raters completed the BNSS and CAINS within the same week. Results indicated that both measures had good internal consistency, convergent validity, and discriminant validity. High correspondence was observed between CAINS and BNSS blunted affect and alogia items. Moderate convergence occurred for avolition and asociality items, and low convergence was seen among anhedonia items. Findings suggest that both scales have good psychometric properties, but that there are important distinctions among the items related to motivation and pleasure. © The Author 2016. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: firstname.lastname@example.org.
Lu, Wei; Bian, Qian; Wang, Wenzheng; Wu, Xiaoling; Wang, Zhen; Zhao, Min
Chinese university students often suffer from acute stress, which can affect their mental health. We measured and evaluated perceived stress in this population using the Simplified Chinese version of the 10-item Perceived Stress Scale (SCPSS-10). The SCPSS-10, Patient Health Questionnaire (PHQ), and Generalized Anxiety Disorder 7-item scale (GAD-7) were conducted in 1096 university students. Two weeks later, 129 participants were re-tested using the SCPSS-10. Exploratory factor analysis yielded two factors with Eigen values of 4.76 and 1.48, accounting for 62.41% of the variance. Confirmatory factor analysis demonstrated good fit of this two-factor model. The internal consistency reliability, as measured by Cronbach's α, was 0.85. The test-retest reliability coefficient was 0.7. The SCPSS-10 exhibited high correlation with the PHQ-9 and GAD-7, indicating an acceptable concurrent validity. The SCPSS-10 exhibited satisfactory psychometric properties in Chinese university students.
Blijd-Hoogewys, E M A; van Geert, P L C; Serra, M; Minderaa, R B
Although research on Theory-of-Mind (ToM) is often based on single task measurements, more comprehensive instruments result in a better understanding of ToM development. The ToM Storybooks is a new instrument measuring basic ToM-functioning and associated aspects. There are 34 tasks, tapping various emotions, beliefs, desires and mental-physical distinctions. Four studies on the validity and reliability of the test are presented, in typically developing children (n = 324, 3-12 years) and children with PDD-NOS (n = 30). The ToM Storybooks have good psychometric qualities. A component analysis reveals five components corresponding with the underlying theoretical constructs. The internal consistency, test-retest reliability, inter-rater reliability, construct validity and convergent validity are good. The ToM Storybooks can be used in research as well as in clinical settings.
Rifbjerg-Madsen, Signe; Wæhrens, Eva Elisabet Ejlersen; Danneskiold-Samsøe, Bente
that can identify underlying pain mechanisms are needed. The painDETECT questionnaire (PDQ) was originally designed to differentiate between pain phenotypes. The objectives were to evaluate the psychometric properties of the PDQ in patients with inflammatory arthritis by applying Rasch analysis...... and to explore the reliability of pain classification by test-retest. METHODS: For the Rasch analysis 900 questionnaires from patients with RA, PsA and SpA (300 per diagnosis) were extracted from 'the DANBIO painDETECT study'. The analysis was directed at the seven items assessing somatosensory symptoms...... and included: 1) the performance of the six-category Likert scale; 2) whether a unidimensional construct was defined; 3) the reliability and precision of estimates. Another group of 30 patients diagnosed with RA, PsA or SpA participated in a test-retest study. Intraclass Correlation Coefficients (ICC...
Gelhorn, Heather L; Roberts, Laurie J; Khandelwal, Nikhil; Revicki, Dennis A; DeRogatis, Leonard R; Dobs, Adrian; Hepp, Zsolt; Miller, Michael G
The Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF) is a patient-reported outcome measurement designed to evaluate the symptoms of hypogonadism. The HIS-Q-SF is an abbreviated version including17 items from the original 28-item HIS-Q. To conduct item analyses and reduction, evaluate the psychometric properties of the HIS-Q-SF, and provide guidance on score interpretation. A 12-week observational longitudinal study of hypogonadal men was conducted as part of the original HIS-Q psychometric evaluation. Participants completed the original HIS-Q every 2 weeks. Blood samples were collected to evaluate testosterone levels. Participants completed the Aging Male's Symptoms Scale, the International Index of Erectile Function, the Short Form-12, and the PROMIS Sexual Activity, Satisfaction with Sex Life, Sleep Disturbance, and Applied Cognition Scales (baseline and weeks 6 and 12). Clinicians completed the Clinical Global Impression of Severity and Change scales and a clinical form. Item performance was evaluated using descriptive statistics and Rasch analyses. Reliability (internal consistency and test-retest), validity (concurrent and know groups), and responsiveness were assessed. One hundred seventy-seven men participated (mean age = 54.1 years, range = 23-83). Similar to the full HIS-Q, the final abbreviated HIS-Q-SF instrument includes five domains (sexual, energy, sleep, cognition, and mood) with two sexual subdomains (libido and sexual function). For key domains, test-retest reliability was very good, and construct validity was good for all domains. Known-groups validity was demonstrated for all domain scores, subdomain scores, and total score based on the Clinical Global Impression-Severity. All domains and subdomains were responsive to change based on patient-rated anchor questions. The HIS-Q-SF could be a useful tool in clinical practice, epidemiologic studies, and other academic research settings. Careful consideration was given to the