Testing Psychometrics of Healthcare Empowerment Questionnaires (HCEQ) among Iranian ... PROMOTING ACCESS TO AFRICAN RESEARCH ... translation and backtranslation procedures, pilot testing, and getting views of expert panel.
The paper provides a survey of 18 years' progress that my colleagues, students (both former and current) and I made in a prominent research area in Psychometrics-Computerized Adaptive Testing (CAT). We start with a historical review of the establishment of a large sample foundation for CAT. It is worth noting that the asymptotic results were derived under the framework of Martingale Theory, a very theoretical perspective of Probability Theory, which may seem unrelated to educational and psychological testing. In addition, we address a number of issues that emerged from large scale implementation and show that how theoretical works can be helpful to solve the problems. Finally, we propose that CAT technology can be very useful to support individualized instruction on a mass scale. We show that even paper and pencil based tests can be made adaptive to support classroom teaching.
García-Pérez, Miguel A; Núñez-Antón, Vicente
Many empirical studies measure psychometric functions (curves describing how observers' performance varies with stimulus magnitude) because these functions capture the effects of experimental conditions. To assess these effects, parametric curves are often fitted to the data and comparisons are carried out by testing for equality of mean parameter estimates across conditions. This approach is parametric and, thus, vulnerable to violations of the implied assumptions. Furthermore, testing for equality of means of parameters may be misleading: Psychometric functions may vary meaningfully across conditions on an observer-by-observer basis with no effect on the mean values of the estimated parameters. Alternative approaches to assess equality of psychometric functions per se are thus needed. This paper compares three nonparametric tests that are applicable in all situations of interest: The existing generalized Mantel-Haenszel test, a generalization of the Berry-Mielke test that was developed here, and a split variant of the generalized Mantel-Haenszel test also developed here. Their statistical properties (accuracy and power) are studied via simulation and the results show that all tests are indistinguishable as to accuracy but they differ non-uniformly as to power. Empirical use of the tests is illustrated via analyses of published data sets and practical recommendations are given. The computer code in MATLAB and R to conduct these tests is available as Electronic Supplemental Material.
Fay, Derek M.; Levy, Roy; Mehta, Vandhana
A common practice in educational assessment is to construct multiple forms of an assessment that consists of tasks with similar psychometric properties. This study utilizes a Bayesian multilevel item response model and descriptive graphical representations to evaluate the psychometric similarity of variations of the same task. These approaches for…
R. P. van der Merwe
Full Text Available This is a cumulative report on the findings of various exploratory research that were done with regard to the practice of psychometric testing in the Eastern Cape. Recent and ongoing developments in the South African labour legislation, and especially the implications of the Employment Equity Act, highlight once again the importance of the validation of all instruments to be used for human assessment and selection purposes. Information was gathered to establish which psychometric tests are used, and for what purposes, in industry today. Biographical information on each organisation is supplied, including the number of employees. The role of psychometric testing in the selection procedure is discussed. The different tests used, as well as the test users, are also indicated. The findings of other, related research, as well as comments, recommendations and shortcomings, are discussed. Opsomming Hierdie is ‘n kumulatiewe verslag wat die resultate verstrek van verskeie verkennende ondersoeke wat gedoen is na die aanwending van psigometriese toetsing in die Oos-Kaap. Onlangse en voortdurende ontwikkelinge in die Suid-Afrikaanse arbeidswetgewing, en veral die implikasies van die Wet op Gelyke Indiensneming, beklemtoon weer eens die belangrikheid van die validering van enige instrumente wat gebruik word vir evaluerings- en keuringsdoeleindes van individue. Inligting is ingewin om te bepaal watter psigometriese toetse, sowel as vir watter doel, vandag in die bedryf gebruik word. Biografiese inligting oor die onderskeie organisasies, insluitende hul aantal werknemers, word verstrek. Die rol van psigometriese toetsing in die keuringsproses word bespreek. Die verskillende toetse wat deur die organisasies gebruik word, sowel as die toetsge-bruikers, word ook aangedui. Die bevindinge van ander, relevante navorsing, sowel as opmerkings, aanbevelings en tekortkominge word bespreek.
Bhatia Kailash P
Full Text Available Abstract Background The United States Food and Drug Administration (FDA are currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58, a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: 1 provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; 2 enable researchers and clinicians to compare it with existing dystonia PROMs; and 3 help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing. Methods We evaluated traditional psychometric properties of data quality, scaling assumptions, targeting, reliability and validity in a group of 391 people with CD. The main outcome measures used were the CDIP-58, Medical Outcome Study Short Form-36, the 28-item General Health Questionnaire, and Hospital and Anxiety and Depression Scale. Results A total of 391 people returned completed questionnaires (corrected response rate 87%. Analyses showed: 1 data quality was high (low missing data ≤ 4%, subscale scores could be computed for > 96% of the sample; 2 item groupings passed tests for scaling assumptions; 3 good targeting (except for the Sleep subscale, ceiling effect = 27%; 4 good reliability (Cronbach's alpha ≥ 0.92, test-retest intraclass correlations ≥ 0.83; and 5 validity was supported. Conclusion This study has shown that new psychometric methods can produce a PROM that stands up to traditional criteria and supports the clinical advantages of Rasch analysis.
Borsboom, D.; Molenaar, D.; Wright, J.D.
Psychometrics is a scientific discipline concerned with the construction of measurement models for psychological data. In these models, a theoretical construct (e.g., intelligence) is systematically coordinated with observables (e.g., IQ scores). This is often done through latent variable models,
Kappus, Matthew R; Bajaj, Jasmohan S
Synopsis Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life and driving accidents. The detection of MHE requires specialized testing since it cannot by definition, be diagnosed on standard clinical examination. Psychometric (paper-pencil or computerized or a combination) and neuro-physiological techniques are often used to test for MHE. Paper-pencil psychometric batteries like the Psychometric Hepatic Encephalopathy Score (PHES) have been validated in several countries but do not have US normative values. Computerized tests such as the inhibitory control test (ICT), cognitive drug research system and Scan test have proven useful to diagnose MHE and predict outcomes. The specificity and sensitivity of these tests are similar to the recommended gold standards. Neuro-physiological tests such as the EEG and its interpretations, evoked potentials and Critical Flicker Frequency (CFF) also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike and the testing strategies depend on the normative data available, patient comfort and local expertise. PMID:22321464
Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S
. Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory...... psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.......We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non...
Ortiz, T; Fernández, A; Martínez-Castillo, E; Maestú, F; Martínez-Arias, R; López-Ibor, J J
Several pathologies (i.e. Alzheimer's disease) that courses with memory alterations, appears in a context of impaired cognitive status and mobility. In recent years, several investigations were carried out in order to design short batteries that detect those subjects under risk of dementia. Some of this batteries were also design to be administrated over the telephone, trying to overcome the accessibility limitations of this patients. In this paper we present a battery (called Autotest de Memoria) essentially composed by episodic and semantic memory tests, administered both over the telephone and face to face. This battery was employed in the cognitive assessment of healthy controls and subjects diagnosed as probable Alzheimer's disease patients. Results show the capability of this battery in order to discriminate patients and healthy controls, a great sensibility and specificity, and a nearly absolute parallelism of telephone and face to face administrations. These data led us to claim the usefulness and practicality of our so called Memoria>.
Full Text Available Authors report a study on psychometric properties of Plutchik's test, called Emotions Profile Index (EPI. A new Slovene translation and adaptation of English version of the test, consisting of combinations (pairs of 12 words reflecting eight different emotional conditions, was prepared and compared to the old one. Both versions as well as the Big Five Questionnaire (BFQ were administered on the sample of 239 participants. Different statistical analyses were performed examining psychometric features of both versions of EPI. Discriminative power was tested by cluster analysis and analysis of frequency distributions, reliability was studied via internal consistency index and correlation between the two versions, and validity was examined by correlating PIE dimensions with BFQ dimensions and subdimensions, by comparing profiles of groups on both versions of EPI and BFQ and by fitting the theoretical model proposed by Plutchik to the data. Discriminative power of EPI seems to be affected by avoiding (not choosing the socially desirable expressions in the test, parallel reliability seems to be susceptible to the use of different words (expressions in the new version of EPI having the same meaning as words in the old version. Dimensions expected to reflect similar constructs in BFQ and EPI do not correlate satisfactory. Data gathered with EPI cannot be fully explained with the model proposed by Plutchik's theory.
Schoenherr, Jordan Richard; Hamstra, Stanley J.
Psychometrics has recently undergone extensive criticism within the medical education literature. The use of quantitative measurement using psychometric instruments such as response scales is thought to emphasize a narrow range of relevant learner skills and competencies. Recent reviews and commentaries suggest that a paradigm shift might be…
Kirk, Celia; Vigeland, Laura
Purpose: The authors provide a review of the psychometric properties of 6 norm-referenced tests designed to measure children's phonological error patterns. Three aspects of the tests' psychometric adequacy were evaluated: the normative sample, reliability, and validity. Method: The specific criteria used for determining the psychometric…
Cox, Kathleen B
The importance of healthy work environments has received attention. Health care organizations are plagued with conflict which is detrimental to work environments. Thus, conflict must be studied. The purpose of this article is to describe the testing of a measure of conflict. A survey was used to evaluate the psychometric properties. The sample consisted of 430 nurses at an academic medical center. Using principal component analysis (PCA) with varimax rotation, a six-factor solution (30 items) that explained 74.3% of variance emerged. Coefficient alpha ranged from .95 to .81. Correlations with existing scales supported construct validity (r = -.32(-)-.58). The results are encouraging. Use of the scale may provide insight into the impact of conflict on patient, staff, and organizational outcomes.
Kolen, Michael J.; Lee, Won-Chan
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Chan, Keung Sum; Li, Ho Cheung William; Chan, Sally Wai-Chi; Lopez, Violeta
This article is a report on psychometric testing of the Chinese version of the herth hope index. The availability of a valid and reliable instrument that accurately measures the level of hope in patients with heart failure is crucial before any hope-enhancing interventions can be appropriately planned and evaluated. There is no such instrument for Chinese people. A test-retest, within-subjects design was used. A purposive sample of 120 Hong Kong Chinese patients with heart failure between the ages of 60 and 80 years admitted to two medical wards was recruited during an 8-month period in 2009. Participants were asked to respond to the Chinese version of the herth hope index, Hamilton depression rating scale and Rosenberg's self-esteem scale. The internal consistency, content validity and construct validity and test-retest reliability of the Chinese version of the herth hope index were assessed. The newly translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly translated scale can be used as a self-report assessment tool in assessing the level of hope in Hong Kong Chinese patients with heart failure. © 2011 Blackwell Publishing Ltd.
Szajer, Katarzyna; Karakuła, Hanna; Grzywa, Anna; Parnas, Josef; Perzyńska, Aneta; Zaborska, Anna; Pawezka, Justyna; Sekunda, Agnieszka; Piszczek, Rafał; Skórska, Małgorzata
Abstract thinking belongs to intellectual abilities of the highest level of the evolutionary development, thanks to which operations such a classification, systematisation and comparison are possible. An analysis of the psychometric properties of the Proverb-Metaphor Test (TPM) which has been used in the German speaking countries since 2001. The TPM was subject to the process of translation--retranslation--travesty in order to be adapted to clinical conditions in Poland. 60 patients of the Department of Psychiatry, Medical University of Lublin with diagnosed paranoid schizophrenia (according to ICD-10 criteria). PANSS and TPM was carried out amongst 15 patients at the beginning of the hospitalisation (the first stage of the research) and among all persons during the remission of syndromes (the second stage). The WAIS-R (PL) was used in the second stage. 1. The TPM is a reliable instrument, of high criteria propriety. 2. The evaluated test is a relatively homogeneous research tool. 3. The TPM is, thanks to its simple construction and the short carrying out time, a practical method of abstract thinking evaluation. 4. The TPM may be a useful instrument enabling long term prognosis.
Chen, Xin-lin; Liu, Feng-bin; Guo, Li; Liu, Xiao-bin
To investigate the scientificity of patient-reported outcome (PRO) scale for myasthenia gravis (MG), which was used to evaluate the clinical effects of traditional Chinese and Western medicine treatment on MG patients. Psychometric performance of the MG-PRO scale was also expected to be evaluated in this study. A total of 100 MG patients and 100 healthy people were face-to-face interviewed by well-trained investigators, and the data of MG-PRO scale were collected. The classical theory test (CTT) and item response theory (IRT) methods were used to analyze the psychometric performance such as validity, reliability, person separation index (PSI) and differential item functioning (DIF) in the MG-PRO scale. The results of CTT analysis showed that the split-half reliabilities of the MG-PRO scale and each dimension were greater than 0.7. In the analysis of internal consistency of each dimension, the Cronbach's alpha was greater than 0.8. Each facet had greater correlation with its dimension than the other dimensions. Four principal components were extracted by exploratory factor analysis, which represented all dimensions of the scale, and the cumulative variance was 55.54%. The scores of each of the 8 facets between MG patients and healthy people were different (Pdefinition and connotation of quality of life and contains special issues of MG patients as well, and shows good reliability (split-half reliability, Cronbach's alpha), validity (content validity, construct validity, discriminate validity) from the results of CTT, and good psychometric performance from the results of IRT.
McGilton, Katherine S
To describe the development and psychometric testing of the Supportive Supervisory Scale (SSS). The development of the items of the scale was based on Winnicott's relationship theory and on focus groups with 26 healthcare aides (HCAs) and 30 supervisors from six long-term care (LTC) facilities in Ontario, Canada. Content validity of the 15-item instrument was established by a panel of experts. Based on a secondary analysis of data collected from 222 HCAs in 10 LTC facilities in Ontario, Canada, the SSS was subjected to principal components analysis with oblique rotation. A two-factor solution was accepted, which is consistent with the theoretical conceptualization of the instrument. Factor I was labeled Respects Uniqueness and Factor II was labeled Being Reliable. Internal consistency of Factor I was .95, and that of Factor II was .91. Discriminant validity was also established. The focus groups revealed that "being available to staff" while "recognizing the HCA as an individual, and taking a moment to get to know them" was essential to feeling supported by their supervisor. The SSS is a reliable and valid measure of supervisory support of supervisors working in LTC facilities. At the core of supportive supervision is the supervisor's ability to develop and maintain positive relationships with each HCA. It is through respecting the uniqueness of each HCA and being reliable that supervisor-HCA relationships can flourish. Supportive leadership in LTC settings is a major contributor to HCAs' job satisfaction and retention and to quality of patient care. Therefore, a tool developed and tested to measure supervisors' supportive capacities in LTC is primal to evaluate the effectiveness of supervisors in these environments.
Kappus, Matthew R; Bajaj, Jasmohan S
Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life, and driving accidents. The detection of MHE requires specialized testing because it cannot, by definition, be diagnosed on standard clinical examination. Psychometric and neurophysiologic techniques are often used to test for MHE. Paper-pencil psychometric batteries and computerized tests have proved useful in diagnosing MHE and predicting its outcomes. Neurophysiologic tests also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike. Testing strategies depend on the normative data available, patient comfort, and local expertise. Copyright © 2012 Elsevier Inc. All rights reserved.
Lauridsen, M M; Poulsen, L; Rasmussen, C K
Many chronic medical conditions are accompanied by cognitive disturbances but these have only to a very limited extent been psychometrically quantified. An exception is liver cirrhosis where hepatic encephalopathy is an inherent risk and mild forms are diagnosed by psychometric tests. The preferred...... diagnostic test battery in cirrhosis is often the Continuous Reaction Time (CRT) and the Portosystemic Encephalopathy (PSE) tests but the effect on these of other medical conditions is not known. We aimed to examine the effects of common chronic (non-cirrhosis) medical conditions on the CRT and PSE tests. We...
Cunningham, Jennifer; Wallston, Kenneth A; Wilkins, Consuelo H; Hull, Pamela C; Miller, Stephania T
This study describes the development and psychometric evaluation of HPV Clinical Trial Survey for Parents with Children Aged 9 to 15 (CTSP-HPV) using traditional instrument development methods and community engagement principles. An expert panel and parental input informed survey content and parents recommended study design changes (e.g., flyer wording). A convenience sample of 256 parents completed the final survey measuring parental willingness to consent to HPV clinical trial (CT) participation and other factors hypothesized to influence willingness (e.g., HPV vaccine benefits). Cronbach's a, Spearman correlations, and multiple linear regression were used to estimate internal consistency, convergent and discriminant validity, and predictively validity, respectively. Internal reliability was confirmed for all scales (a ≥ 0.70.). Parental willingness was positively associated (p < 0.05) with trust in medical researchers, adolescent CT knowledge, HPV vaccine benefits, advantages of adolescent CTs (r range 0.33-0.42), supporting convergent validity. Moderate discriminant construct validity was also demonstrated. Regression results indicate reasonable predictive validity with the six scales accounting for 31% of the variance in parents' willingness. This instrument can inform interventions based on factors that influence parental willingness, which may lead to the eventual increase in trial participation. Further psychometric testing is warranted. © 2015 Wiley Periodicals, Inc.
Palmer, B.R.; Gignac, G.; Manocha, R.; Stough, C.
and discussed.There has been some debate recently over the scoring, reliability and factor structure of ability measures of emotional intelligence (EI). This study examined these three psychometric properties with the most recent ability test of EI, the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT V2.0; Mayer, Salovey, & Caruso,…
Jongen, Stefan; Perrier, Joy; Vuurman, Eric F; Ramaekers, Johannes G; Vermeeren, Annemiek
To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation. Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am) and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am). On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP) of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT). Large effects sizes were also found in the Divided Attention Test (DAT), the Attention Network Test (ANT), and the test for Useful Field of View (UFOV) at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV. From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.
Full Text Available To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation.Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am.On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT. Large effects sizes were also found in the Divided Attention Test (DAT, the Attention Network Test (ANT, and the test for Useful Field of View (UFOV at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV.From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.
Raveendranadh Pilli; MUR Naidu; Usharani Pingali; J C Shobha
Objective: To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. Materials and Methods: The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of...
Sund, Terje; Brandt, Åse; Anttila, Heidi
(Participation Repertoire). PURPOSE: This study aimed to investigate a range of psychometric properties of the NOMO 1.0 in a sample of adult powered mobility device (PMD) users. METHOD: Data collected from PMD users ( N = 248) in Denmark, Finland, and Norway as part of a larger study were analyzed using state...... scale and six components of the Frequency scale. IMPLICATIONS: The NOMO 1.0 should be used for research purposes and not for clinical practice. Better reliability should be established for the Need for Assistance and Ease/Difficulty scales prior to further psychometric testing to establish the validity...
Full Text Available These analyses examine the psychometric properties of the Free and Cued Selective Reminding Test with Immediate Recall (FCSRT-IR. FCSRT-IR is a measure of memory under conditions that control attention and cognitive processing in order to obtain an assessment of memory unconfounded by normal agerelated changes in cognition. FCSRT-IR performance has been associated with preclinical and early dementia in several longitudinal epidemiological studies. Factor and item response theory analyses were applied to FCSRT-IR data from patients at a geriatric primary care center who had independently established clinical diagnoses. The results provide supporting evidence for the psychometric adequacy of the FCSR-IR in terms of reliability, essential (sufficient unidimensionality, information across the continuum of memory disability/ability, and classification accuracy. The psychometric adequacy of the FCSRT-IR adds further validity to its use as a case finding strategy for dementia.
Faraci, Palmira; Hell, Benedikt; Schuler, Heinz
This article describes the psychometric properties of the Italian adaptation of the "Analyse des Schlussfolgernden und Kreativen Denkens" (ASK; Test of Inferential and Creative Thinking) for measuring inferential and creative thinking. The study aimed to (a) supply evidence for the factorial structure of the instrument, (b) describe its…
Wicherts, Jelte M.; Dolan, Conor V.; Carlson, Jerry S.; van der Maas, Han L. J.
This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's test scores as measures of general intelligence.…
Richards, Elizabeth A.; McDonough, Meghan H.; Edwards, Nancy E.; Lyle, Roseann M.; Troped, Philip J.
Purpose: Dog owners represent 40% of the population, a promising audience to increase population levels of physical activity. The purpose of this study was to develop and test the psychometric properties of a new instrument to assess social-cognitive theory constructs related to dog walking. Method: Dog owners ("N" = 431) completed the…
Lechuga, Julia; Galletly, Carol L; Broaddus, Michelle R; Dickson-Gomez, Julia B; Glasman, Laura R; McAuliffe, Timothy L; Vega, Miriam Y; LeGrand, Sarah; Mena, Carla A; Barlow, Morgan L; Valera, Erik; Montenegro, Judith I
To develop, pilot test, and conduct psychometric analyses of an innovative scale measuring the influence of perceived immigration laws on Latino migrants' HIV-testing behavior. The Immigration Law Concerns Scale (ILCS) was developed in three phases: Phase 1 involved a review of law and literature, generation of scale items, consultation with project advisors, and subsequent revision of the scale. Phase 2 involved systematic translation- back translation and consensus-based editorial processes conducted by members of a bilingual and multi-national study team. In Phase 3, 339 sexually active, HIV-negative Spanish-speaking, non-citizen Latino migrant adults (both documented and undocumented) completed the scale via audio computer-assisted self-interview. The psychometric properties of the scale were tested with exploratory factor analysis and estimates of reliability coefficients were generated. Bivariate correlations were conducted to test the discriminant and predictive validity of identified factors. Exploratory factor analysis revealed a three-factor, 17-item scale. subscale reliability ranged from 0.72 to 0.79. There were significant associations between the ILCS and the HIV-testing behaviors of participants. Results of the pilot test and psychometric analysis of the ILCS are promising. The scale is reliable and significantly associated with the HIV-testing behaviors of participants. Subscales related to unwanted government attention and concerns about meeting moral character requirements should be refined.
Jonsson, Jakob; Munck, Ingrid; Volberg, Rachel; Carlbring, Per
Recent increases in the number of online gambling sites have made gambling more available, which may contribute to an increase in gambling problems. At the same time, online gambling provides opportunities to introduce measures intended to prevent problem gambling. GamTest is an online test of gambling behavior that provides information that can be used to give players individualized feedback and recommendations for action. The aim of this study is to explore the dimensionality of GamTest and validate it against the Problem Gambling Severity Index (PGSI) and the gambler's own perceived problems. A recent psychometric approach, exploratory structural equation modeling (ESEM) is used. Well-defined constructs are identified in a two-step procedure fitting a traditional exploratory factor analysis model as well as a so-called bifactor model. Using data collected at four Nordic gambling sites in the autumn of 2009 (n = 10,402), the GamTest ESEM analyses indicate high correspondence with the players' own understanding of their problems and with the PGSI, a validated measure of problem gambling. We conclude that GamTest captures five dimensions of problematic gambling (i.e., overconsumption of money and time, and monetary, social and emotional negative consequences) with high reliability, and that the bifactor approach, composed of a general factor and specific residual factors, reproduces all these factors except one, the negative consequences emotional factor, which contributes to the dominant part of the general factor. The results underscore the importance of tailoring feedback and support to online gamblers with a particular focus on how to handle emotions in relation to their gambling behavior.
Budescu, David V; Bo, Yuanchao
We investigate the implications of penalizing incorrect answers to multiple-choice tests, from the perspective of both test-takers and test-makers. To do so, we use a model that combines a well-known item response theory model with prospect theory (Kahneman and Tversky, Prospect theory: An analysis of decision under risk, Econometrica 47:263-91, 1979). Our results reveal that when test-takers are fully informed of the scoring rule, the use of any penalty has detrimental effects for both test-takers (they are always penalized in excess, particularly those who are risk averse and loss averse) and test-makers (the bias of the estimated scores, as well as the variance and skewness of their distribution, increase as a function of the severity of the penalty).
Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff; Lenon, George Binh
Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization’s guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability...
Pilli, Raveendranadh; Naidu, Mur; Pingali, Usharani; Shobha, Jc
To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of zolpidem (5 mg) and caffeine (500 mg) versus placebo were studied in 24 healthy participants in a randomized, double-blind three-way crossover design. Psychometric tests were performed at baseline and at 1, 2, and 3 h after administration of study medication. The effects of zolpidem and caffeine on the psychomotor performance were most pronounced 1 h after administration. At this time, a significant impairment of performance in the simple reaction test (SRT), choice discrimination test (CDT), digit symbol substitution test (DSST), digit vigilance test (DVT), and card sorting test (CST) was observed with zolpidem. In contrast, caffeine showed a significant improvement in performance in CDT and DVT only. The results suggest that the tests of the computerized system are more sensitive and reliable then the pencil and paper tests in detecting the effects of central acting agents and are suitable for use in clinical areas to conduct studies with patients.
Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel
Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.
Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de
The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study
Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de
The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study
Csernoch, Mária; Biró, Piroska; Máth, János; Abari, Kálmán
The Testing Algorithmic and Application Skills (TAaAS) project was launched in the 2011/2012 academic year to test first year students of Informatics, focusing on their algorithmic skills in traditional and non-traditional programming environments, and on the transference of their knowledge of Informatics from secondary to tertiary education. The…
Nash, Tracy Jeanne
Although nurses struggle with the decision to report for work during disaster events, there are no instruments to measure nurses' duty to care for disaster situations. The purpose of this study was to describe the development, testing, and psychometric qualities of the Nash Duty to Care Scale. A convenience sample of 409 registered nurses were recruited from 3 universities in the United States. Exploratory factor analysis resulted in a 19-item, 4-factor model explaining 67.34% of the variance. Internal consistency reliability was supported by Cronbach's alpha ranging from .81 to .91 for the 4-factor subscales and .92 for the total scale. The psychometrically sound instrument for measuring nurses' perceived duty to care for disasters is applicable to contemporary nursing practice, institutional disaster management plans, and patient health outcomes worldwide.
Bajrakova, A.; Vasilev, G.; Khristova, M. N.; Chobanova, N.; Tsenova, T.; Jordanova, M.; Lalova, J.; Vasileva, F.; Mikhajlova, Z.; Trifonova, S.
The investigation involved 50 children aged median 6 years and 6 months. The group was selected in view of the critical period for occurrence of radiation-related deviations in mental development (8-15 gestation weeks) and the period of maximum irradiation during the Chernobyl accident. Assessment of the individual exposure and analysis of possible impacts from non-radiation risk factors were based on guided parental history reports. The dose of accidental irradiation was determined using the radiological data for the country. A Bulgarian standardization of the Wechsler Intelligence Scale for Children (WISC-R) was used. The procedure includes 5 verbal and 5 nonverbal subtests. Results were compared with those from a countrywide control group of children (including a large city, a small town, a village). The analysis indicated higher mean IQ scores in the investigated children. The children were additionally studied by original tests for attention and gnosis-praxis functions using tactile and visual modalities. The tests included intra- and transmodal versions, bilateral simultaneous presentation of stimuli with verbal and nonverbal characteristics in applying analytical and global strategies. Comparisons were made with results for children in the same age range, who had been studied prior to the Chernobyl accident. The evidence surprisingly varied, taking into account the small size of the investigation group. A longitudinal follow-up of this population thus appears to be appropriate. (author)
van Rooij, Antonius J; Schoenmakers, Tim M; van den Eijnden, Regina J J M; Vermulst, Ad A; van de Mheen, Dike
The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study used data (n=2,894) from a large-sample paper-and-pencil questionnaire study, conducted in 2009 on secondary schools in Netherlands. Thus, the main source of data was a large sample of schoolchildren (aged 13-16 years). Measurements included the proposed VAT, the Compulsive Internet Use Scale, weekly hours spent on various game types, and several psychosocial variables. The VAT demonstrated excellent reliability, excellent construct validity, a one-factor model fit, and a high degree of measurement invariance across gender, ethnicity, and learning year, indicating that the scale outcomes can be compared across different subgroups with little bias. In summary, the VAT can be helpful in the further study of video game addiction, and it contributes to the debate on possible inclusion of behavioral addictions in the upcoming DSM-V.
Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115 and patients with depression (n = 37 concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96; item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94 and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002; and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.
Petróczi, Andrea; Backhouse, Susan H; Barkoukis, Vassilis; Brand, Ralf; Elbe, Anne-Marie; Lazuras, Lambros; Lucidi, Fabio
One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping) into organised intelligence to protect clean sport. Beyond the foreseeable consequences of a positive identification as a doping user, this task is further complicated by the discrepancy between what constitutes a doping offence in the World Anti-Doping Code and operationalized in doping research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. In both research and forensic settings, we must acknowledge that (1) socially desirable responding confounds self-reported psychometric test results and (2) that the cognitive complexity surrounding test performance means that the response-time based measures and the lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals; nor these psychometric measures used for individual diagnostics. In this paper, we present a position statement calling for policy guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that, to date, both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better understanding of doping behaviour, not to establish evidence for doping. A false 'positive' psychological profile for doping affects not only the individual
Galuschka, Katharina; Rothe, Josefine; Schulte-Körne, Gerd
This article looks at a means of objectively evaluating the quality of psychometric tests. This approach enables users to evaluate psychometric tests based on their methodological characteristics, in order to decide which instrument should be used. Reading and spelling assessment tools serve as examples. The paper also provides a review of German psychometric tests for the assessment of reading and spelling skills. This method facilitates the identification of psychometric tests.of high methodological quality which can be used for the assessment of reading and spelling skills. Reading performance should ideally be assessed with the following instruments: ELFE 1-6, LGVT 6-12, LESEN 6-7, LESEN 8-9, or WLLP-R. The tests to be used for the evaluation of spelling skills are DERET 1-2+, DERET 3-4+, WRT 1+, WRT 2+, WRT 3+, WRT 4+ or HSP 1-10.
Bernhofer, Esther I; St Marie, Barbara; Bena, James F
All nurses care for patients with pain, and pain management knowledge and attitude surveys for nurses have been around since 1987. However, no validated knowledge test exists to measure postlicensure clinicians' knowledge of the core competencies of pain management in current complex patient populations. To develop and test the psychometric properties of an instrument designed to measure pain management knowledge of postlicensure nurses. Psychometric instrument validation. Four large Midwestern U.S. hospitals. Registered nurses employed full time and part time August 2015 to April 2016, aged M = 43.25 years; time as RN, M = 16.13 years. Prospective survey design using e-mail to invite nurses to take an electronic multiple choice pain knowledge test. Content validity of initial 36-item test "very good" (95.1% agreement). Completed tests that met analysis criteria, N = 747. Mean initial test score, 69.4% correct (range 27.8-97.2). After revision/removal of 13 unacceptable questions, mean test score was 50.4% correct (range 8.7-82.6). Initial test item percent difficulty range was 15.2%-98.1%; discrimination values range, 0.03-0.50; final test item percent difficulty range, 17.6%-91.1%, discrimination values range, -0.04 to 1.04. Split-half reliability final test was 0.66. A high decision consistency reliability was identified, with test cut-score of 75%. The final 23-item Clinical Pain Knowledge Test has acceptable discrimination, difficulty, decision consistency, reliability, and validity in the general clinical inpatient nurse population. This instrument will be useful in assessing pain management knowledge of clinical nurses to determine gaps in education, evaluate knowledge after pain management education, and measure research outcomes. Copyright © 2017 American Society for Pain Management Nursing. Published by Elsevier Inc. All rights reserved.
Guvenc, Gulten; Seven, Memnun; Akyuz, Aygul
To adapt and psychometrically test the Health Belief Model Scale for Human Papilloma Virus (HPV) and Its Vaccination (HBMS-HPVV) for use in a Turkish population and to assess the Human Papilloma Virus Knowledge score (HPV-KS) among female college students. Instrument adaptation and psychometric testing study. The sample consisted of 302 nursing students at a nursing school in Turkey between April and May 2013. Questionnaire-based data were collected from the participants. Information regarding HBMS-HPVV and HPV knowledge and descriptive characteristic of participants was collected using translated HBMS-HPVV and HPV-KS. Test-retest reliability was evaluated and Cronbach α was used to assess internal consistency reliability, and exploratory factor analysis was used to assess construct validity of the HBMS-HPVV. The scale consists of 4 subscales that measure 4 constructs of the Health Belief Model covering the perceived susceptibility and severity of HPV and the benefits and barriers. The final 14-item scale had satisfactory validity and internal consistency. Cronbach α values for the 4 subscales ranged from 0.71 to 0.78. Total HPV-KS ranged from 0 to 8 (scale range, 0-10; 3.80 ± 2.12). The HBMS-HPVV is a valid and reliable instrument for measuring young Turkish women's beliefs and attitudes about HPV and its vaccination. Copyright © 2015 North American Society for Pediatric and Adolescent Gynecology. Published by Elsevier Inc. All rights reserved.
McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen
Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Muehrer, Rebecca J; Lanuza, Dorothy M; Brown, Roger L; Djamali, Arjang
This study describes the development and psychometric testing of the Sexual Concerns Questionnaire (SCQ) in kidney transplant (KTx) recipients. Construct validity was assessed using the Kroonenberg and Lewis exploratory/confirmatory procedure and testing hypothesized relationships with established questionnaires. Configural and weak invariance were examined across gender, dialysis history, relationship status, and transplant type. Reliability was assessed with Cronbach's alpha, composite reliability, and test-retest reliability. Factor analysis resulted in a 7-factor solution and suggests good model fit. Construct validity was also supported by the tests of hypothesized relationships. Configural and weak invariance were supported for all subgroups. Reliability of the SCQ was also supported. Findings indicate the SCQ is a valid and reliable measure of KTx recipients' sexual concerns.
Lúcio, Patrícia Silva; Cogo-Moreira, Hugo; Puglisi, Marina; Polanczyk, Guilherme Vanoni; Little, Todd D
The present study investigated the psychometric properties of the Raven's Colored Progressive Matrices (CPM) test in a sample of preschoolers from Brazil ( n = 582; age: mean = 57 months, SD = 7 months; 46% female). We investigated the plausibility of unidimensionality of the items (confirmatory factor analysis) and differential item functioning (DIF) for sex and age (multiple indicators multiple causes method). We tested four unidimensional models and the one with the best-fit index was a reduced form of the Raven's CPM. The DIF analysis was carried out with the reduced form of the test. A few items presented DIF (two for sex and one for age), confirming that the Raven's CPM items are mostly measurement invariant. There was no effect of sex on the general factor, but increasing age was associated with higher values of the g factor. Future research should indicate if the reduced form is suitable for evaluating the general ability of preschoolers.
Vadlin, Sofia; Åslund, Cecilia; Rehn, Mattias; Nilsson, Kent W
The objective of the study is to evaluate the psychometric properties of the Gaming Addiction Identification Test (GAIT) and its parent version (GAIT-P), in a representative community sample of adolescents and parents in Västmanland, Sweden. Self-rated and parent-rated gaming addictive symptoms identified by GAIT and GAIT-P were analyzed for frequency of endorsement, internal consistency, concordance, factor structure, prevalence of Internet gaming disorder (IGD), concurrence with the Gaming Addiction Scale for Adolescents, 7-item version (GAS) and the parent version of GAS (GAS-P), and for sex differences. The 12-month prevalence of IGD was found to be 1.3% with GAIT and 2.4% with GAIT-P. Results also indicate promising psychometric results within this population, with high internal consistency, and high concurrent validity with GAS and GAS-P. Concordance between adolescents and parents ratings was high, although moderate in girls. Although exploratory factor analysis indicated poor model fit, it also indicated unidimensionality and high factor loadings in all analyses. GAIT and GAIT-P are suitable for continued use in measuring gaming addiction in adolescents, and, with the additional two items, they now cover all nine IGD criteria. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
Dunn, Karen S
Aims and objectives. Assess the psychometric properties of a new geriatric spiritual well-being scale (GSWS), specifically designed for older adults. Background. Religiosity and spiritual wellness must be measured as two distinct concepts to prevent confounding them as synonymous among atheist and agnostic population. Design. A test-retest survey design was used to estimate the psychometric properties. Methods. A convenience sample of 138 community-dwelling older adults was drawn from the inner city of Detroit. Data were collected using telephone survey interviews. Data analyses included descriptive statistics, structural equation modelling, reliability analyses, and point-biserial correlations. Results. The factorial validity of the proposed model was not supported by the data. Fit indices were χ(2) = 185.98, d.f. = 98, P atheists have spiritual needs that do not include religious beliefs or practices. Thus, assessing patients' religious beliefs and practices prior to assessing spiritual well-being is essential to prevent bias. © 2008 The Author. Journal compilation © 2008 Blackwell Publishing Ltd.
Latour, Jos M.; van Goudoever, Johannes B.; Duivenvoorden, Hugo J.; Albers, Marcel J. I. J.; van Dam, Nicolette A. M.; Dullaart, Eugenie; van Heerde, Marc; de Neef, Marjorie; Verlaat, Carin W. M.; van Vught, Elise M.; Hazelzet, Jan A.
To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument
Latour, J.M.; van Goudoever, J.B.; Duivenvoorden, H.J.; Albers, M.J.I.J.; van Dam, N.A.M.; Dullaart, E.; van Heerde, M.; de Neef, M.; Verlaat, C.W.M.; van Vught, E.M.; Hazelzet, J.A.
To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument
Knekta, Eva; Eklöf, Hanna
The aim of this study was to evaluate the psychometric properties of an expectancy-value-based questionnaire measuring five aspects of test-taking motivation (effort, expectancies, importance, interest, and test anxiety). The questionnaire was distributed to a sample of Swedish Grade 9 students taking a low-stakes (n = 1,047) or a high-stakes (n =…
Full Text Available The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS test is a new tool to evaluate pragmatic abilities in patients with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2 and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total. Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts’ evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient’s cognitive profile, given its vital role for communication and social interaction in daily life. The
Chien, Wai Tong; Chan, Zenobia Chung-Yee; Chan, Sally Wai-Chi
This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76-0.95 and ICC = 0.81-0.92), test-retest reliability (r = 0.89-0.95, P Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement), accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ (2)/df = 1.93, P = 0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78) to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = -0.54) and family burden (r = 0.49) and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90). The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P Chinese version demonstrates sound psychometric properties to measure families' expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
Tang, Woung-Ru; Kao, Chen-Yi
The spiritual well-being of terminally ill cancer patients is an important indicator of the quality of their lives and of the quality of hospice care, but no validated tools are available for assessing this indicator in Taiwan. The present cross-sectional study validated the Spiritual Well-Being Scale-Mandarin version (SWBS-M) by testing its psychometric properties in 243 cancer patients from five teaching hospitals throughout Taiwan. Construct validity was tested by factor analysis and hypothesis testing. Patients' spiritual well-being and quality of life were assessed using the SWBS-M and the McGill Quality of Life Questionnaire (MQoL), respectively. Overall, the SWBS-M had an internal consistency/reliability of 0.89. Exploratory factor analysis showed that the SWBS-M had an underlying two-factor structure, explaining 46.94% of the variance. SWBS-M scores correlated moderately with MQoL scores (r = 0.48, p spiritual well-being was inversely related to their average pain level during the previous 24 hours (r = -0.183, p = 0.006). Cancer patients' spiritual well-being also differed significantly with their experience of pain (t = -3.67, p spiritual well-being than those without pain. Our findings support a two-factor model for the SWBS-M in terminally ill Taiwanese cancer patients. We recommend testing the psychometric properties of the SWBS-M in different patient populations to verify its factorial structure in other Asian countries.
Wai Tong Chien
Full Text Available This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76–0.95 and ICC = 0.81–0.92, test-retest reliability (r = 0.89–0.95, P<0.01, and internal consistency (Cronbach’s α = 0.86–0.92. Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement, accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ2/df = 1.93, P=0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78 to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = −0.54 and family burden (r = 0.49 and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90. The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P<0.01. The Chinese version demonstrates sound psychometric properties to measure families’ expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
Flipsen, Peter; Ogiela, Diane A
Our understanding of test construction has improved since the now-classic review by McCauley and Swisher (1984). The current review article examines the psychometric characteristics of current single-word tests of speech sound production in an attempt to determine whether our tests have improved since then. It also provides a resource that clinicians may use to help them make test selection decisions for their particular client populations. Ten tests published since 1990 were reviewed to determine whether they met the 10 criteria set out by McCauley and Swisher (1984), as well as 7 additional criteria. All of the tests reviewed met at least 3 of McCauley and Swisher's (1984) original criteria, and 9 of 10 tests met at least 5 of them. Most of the tests met some of the additional criteria as well. The state of the art for single-word tests of speech sound production in children appears to have improved in the last 30 years. There remains, however, room for improvement.
Ana Paula Porto Noronha
Full Text Available A pesquisa teve como objetivo verificar quais os parâmetros psicométricos apresentados nos manuais de 19 instrumentos de avaliação da inteligência. Os elementos avaliados nos instrumentos foram: análise de itens, padronização, validade e precisão. Os resultados encontrados mostraram que, dos 19 testes avaliados, 89,5% apresentaram estudos de padronização, sendo que o procedimento mais utilizado na escolha dos sujeitos foi o não aleatório (62,2% dos testes. No que se refere à validade, a de construto foi a mais freqüente dentre os testes (94,7%. Observou-se que todos os instrumentos apresentaram verifica��ão da precisão, sendo o método de consistência interna o mais aplicado (78,9%. Conclui-se que, embora os autores concordem que todos os testes devam realizar estudos de verificação dos parâmetros psicométricos e devam possuir normas regionais, tal prática ainda não se encontra totalmente difundida na avaliação psicológica brasileira,This research aimed to verify the psychometric parameters presented in manuals of 19 intelligence tests. The psychometric properties included in the analysis were: item analysis, validity, reliability, and norms studies. The results indicated that 89.5% of the 19 tests presented norming studies. The procedure of sample selection was mostly non-random (62.2% of the tests. Construct validity was the most frequent method used among the studies (94.7%. All tests presented reliability studies, most of them using internal consistency coefficient (78.9%. It is concluded that although the authors agree that all tests need studies to verify psychometric parameters and studies to obtain regional norms this action isn’t divulged totally yet in the Brazilian psychological assessment.
Duddle, Maree; Boughton, Maureen
The aim of this study was to develop and test the psychometric properties of the Nursing Workplace Relational Environment Scale (NWRES). A positive relational environment in the workplace is characterised by a sense of connectedness and belonging, support and cooperation among colleagues, open communication and effectively managed conflict. A poor relational environment in the workplace may contribute to job dissatisfaction and early turnover of staff. Quantitative survey. A three-stage process was used to design and test the NWRES. In Stage 1, an extensive literature review was conducted on professional working relationships and the nursing work environment. Three key concepts; collegiality, workplace conflict and job satisfaction were identified and defined. In Stage 2, a pool of items was developed from the dimensions of each concept and formulated into a 35-item scale which was piloted on a convenience sample of 31 nurses. In Stage 3, the newly refined 28-item scale was administered randomly to a convenience sample of 150 nurses. Psychometric testing was conducted to establish the construct validity and reliability of the scale. Exploratory factor analysis resulted in a 22-item scale. The factor analysis indicated a four-factor structure: collegial behaviours, relational atmosphere, outcomes of conflict and job satisfaction which explained 68.12% of the total variance. Cronbach's alpha coefficient for the NWRES was 0.872 and the subscales ranged from 0.781-0.927. The results of the study confirm the reliability and validity of the NWRES. Replication of this study with a larger sample is indicated to determine relationships among the subscales. The results of this study have implications for health managers in terms of understanding the impact of the relational environment of the workplace on job satisfaction and retention.
Molony, Sheila L; McDonald, Deborah Dillon; Palmisano-Mills, Christine
Research related to quality of life in long-term care has been hampered by a paucity of measurement tools sensitive to environmental interventions. The primary aim of this study was to test the psychometric properties of a new instrument, the Experience of Home (EOH) Scale, designed to measure the strength of the experience of meaningful person-environment transaction. The instrument was administered to 200 older adults in diverse dwelling types. Principal components analysis provided support for construct validity, eliciting a three-factor solution accounting for 63.18% of variance in scores. Internal consistency reliability was supported with Cronbach's alpha of .96 for the entire scale. The EOH Scale is a unique research tool to evaluate interventions to improve quality of living in residential environments.
Rezaei, Mohammad; Rashedi, Vahid; Lotfi, Gohar; Shirinbayan, Peymaneh; Foroughan, Mahshid
The aim of this study was to assess the psychometric properties of the Mini-Cog in Iranian older adults. It was a cross-sectional study; 50 older people with dementia and 50 without dementia who matched for age, gender, and education entered the study. The diagnostic and statistical manual of mental disorders criteria for dementia were used as gold standard. A battery of scales included the abbreviated mental test score (AMTS), the Geriatric Depression Scale, and the Mini-Cog was performed. Validity and reliability of the Mini-Cog determined using the Pearson product-moment correlation coefficient (Pearson's r), Cronbach's alpha, and Receiver Operating Characteristic (ROC) curve analysis. The Persian version of Mini-Cog showed a good inter-rater reliability ( K = 0.76, p Mini-Cog have an acceptable sensitivity, specificity, and substantial overall agreement with the AMTS.
Verster, Joris C; Roth, Thomas
There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving is often not determined. The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a constant speed and steady lateral position within the right traffic lane. Standard deviation of lateral position (SDLP), i.e., the weaving of the car, was determined. The subjects also performed a psychometric test battery including the DSST, Sternberg memory scanning test, a tracking test, and a divided attention test. Difference scores from placebo for parameters of the psychometric tests and SDLP were computed and correlated with each other. A stepwise linear regression analysis determined the predictive validity of the laboratory test battery to SDLP. Stepwise regression analyses revealed that the combination of five parameters, hard tracking, tracking and reaction time of the divided attention test, and reaction time and percentage of errors of the Sternberg memory scanning test, together had a predictive validity of 33.4%. The psychometric tests in this test battery showed insufficient predictive validity to replace the on-the-road driving test during normal traffic.
Boysan, Murat; Kuss, Daria J; Barut, Yaşar; Ayköse, Nafi; Güleç, Mustafa; Özdemir, Osman
Of many instruments developed to assess Internet addiction, the Internet Addiction Test (IAT), an expanded version of the Internet Addiction Diagnostic Questionnaire (IADQ), has been the most widely used scale in English and non-English speaking populations. In this study, our aim was to investigate the psychometric properties of short and expanded versions of the IAT in a Turkish undergraduate sample. Overall, 455 undergraduate students from Turkey aged between 18 and 30 participated in the study (63.53% were females). Explanatory and confirmatory factor analytic procedures investigated factor structures of the IADQ and IAT. The Internet Addiction Scale (IAS), Coping Inventory for Stressful Situations (CISS), Obsessive Compulsive Inventory-Revised (OCI-R) and Dissociative Experiences Scale (DES) were administered to assess convergent and divergent validities of the IADQ and IAT. Internal consistency and 15-day test-retest reliability were computed. In the factorial analytic investigation, we found a unidimensional factor structure for each measure fit the current data best. Significant but weak to moderate correlations of the IADQ and the IAT with the CISS, OCI-R and DES provided empirical evidence for divergent validity, whereas strong associations with the subscales of the IAS pointed to the convergent validity of Young's Internet addiction construct. Internal consistency of the IADQ was weak (α=0.67) and of the IAT was high (α=0.93). Temporal reliability of both instruments was very high (α=0.81 and α=0.87; respectively). The IAT revealed promising and sound psychometric properties in a Turkish sample. Copyright © 2015 Elsevier Ltd. All rights reserved.
Kalis, Emils; Roke, Liga; Krumina, Indra
The Test for Creative Thinking-Drawing Production (TCT-DP) is designed as an effective drawing-based instrument for measuring creative potential. Many studies report adaptation efforts in other cultures pointing out good psychometric properties of the instrument nonetheless revealing also some trouble spots. The present study includes adaptation…
Paans, Wolter; Sermeus, Walter; Nieweg, Roos; van der Schans, Cees P.
AIM: This paper is a report of the development and testing of the psychometric properties of an instrument to measure the accuracy of nursing documentation in general hospitals. BACKGROUND: Little information is available about the accuracy of nursing documentation. None of the existing instruments
von Piekartz, H; Stotz, E; Both, A; Bahn, G; Armijo-Olivo, S; Ballenberger, N
The primary objective of this study was to determine the structural and known-group validity as well as the inter-rater reliability of a test battery to evaluate the motor control of the craniofacial region. Seventy volunteers without TMD and 25 subjects with TMD (Axes I) per the DC/TMD were asked to execute a test battery consisting of eight tests. The tests were video-taped in the same sequence in a standardised manner. Two experienced physical therapists participated in this study as blinded assessors. We used exploratory factor analysis to identify the underlying component structure of the eight tests. Internal consistency (Cronbach's α), inter-rater reliability (intra-class correlation coefficient) and construct validity (ie, hypothesis testing-known-group validity) (receiver operating curves) were also explored for the test battery. The structural validity showed the presence of one factor underlying the construct of the test battery. The internal consistency was excellent (0.90) as well as the inter-rater reliability. All values of reliability were close to 0.9 or above indicating very high inter-rater reliability. The area under the curve (AUC) was 0.93 for rater 1 and 0.94 for rater two, respectively, indicating excellent discrimination between subjects with TMD and healthy controls. The results of the present study support the psychometric properties of test battery to measure motor control of the craniofacial region when evaluated through videotaping. This test battery could be used to differentiate between healthy subjects and subjects with musculoskeletal impairments in the cervical and oro-facial regions. In addition, this test battery could be used to assess the effectiveness of management strategies in the craniofacial region. © 2017 John Wiley & Sons Ltd.
Nieboer Anna P
Full Text Available Abstract Background Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Results Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales – vision, participative safety, task orientation and support for innovation – acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. Conclusion The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.
Strating, Mathilde M H; Nieboer, Anna P
Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales--vision, participative safety, task orientation and support for innovation--acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.
Hsiao, Ya-Chu; Chiang, Yi-Chien; Lee, Hsiang-Chun; Han, Chin-Yen
To further examine the psychometric properties of the spiritual health scale short form, including its reliability and validity. Spirituality is one of the main factors associated with good health outcomes. A reliable and valid instrument to measure spirituality is essential to identify the spiritual needs of an individual and to evaluate the effect of spiritual care. A cross-sectional study design was used. The study was conducted in six nursing schools in northern, central and southern Taiwan. The inclusion criterion for participants was nursing students with clinical practice experience. Initially, 1141 participants were recruited for the study, but 67 were absent and 48 did not complete the questionnaires. A total of 1026 participants were finally recruited, indicating a response rate of 89·9%. The psychometric testing of the spiritual health scale short form included construct validity with confirmatory factor analysis, known-group validity and internal consistency reliability. The results of the confirmatory factor analysis supported the five-factor model as an acceptable model fit. In the known-group validity, the results indicated that people who are in the category of primary religious affiliation have better spiritual health than people in the category of secondary religious affiliation and atheism. The result also indicated that the 24-item spiritual health scale short form achieved an acceptable internal consistency coefficient. The findings suggest that the spiritual health scale short form is a valid and reliable instrument for the appraisal of individual spiritual health. The spiritual health scale short form could provide useful information to guide clinical practice in assessing and managing people's spiritual health in Taiwan. © 2013 John Wiley & Sons Ltd.
Chang, Li-Chun; Hsieh, Pei-Lin; Liu, Chieh-Hsing
The purpose of this study is to develop and evaluate the psychometric properties of the Chinese version of short-form Test of Functional Health Literacy in Adolescents. Assessing health literacy is vital to design health education programme; however, there are no measurement tools exist for use specifically in Chinese adolescents. A non-experimental design was used to test the psychometric properties of the Test of Functional Health Literacy in Adolescents. The short-form Test of Functional Health Literacy in Adolescents was translated and back translated into a Chinese language version. Thirty high school students were recruited to validate the scenario of Test of Functional Health Literacy in Adolescents. Based on the multiple-stage stratified random sampling method, 300 high school students from four counties in Taiwan were invited to participate in this study to evaluate the psychometric properties of Test of Functional Health Literacy in Adolescents. The Functional Health Literacy in Adolescents had good internal consistency reliability and excellent test-retest reliability. Confirmatory factor analysis resulted in a one-factor solution. Contrary to the original version of the Test of Functional Health Literacy in Adolescents, the findings revealed that the 36-item, one-factor model for the Test of Functional Health Literacy in Adolescents is the best-fit model. This is a suitable instrument to assess health literacy levels in Chinese adolescents before health education programmes can be appropriately planned, implemented and evaluated. © 2012 Blackwell Publishing Ltd.
Nikravesh, Maryam; Jafari, Zahra; Mehrpour, Masoud; Kazemi, Roozbeh; Amiri Shavaki, Younes; Hossienifar, Shamim; Azizi, Mohamad Parsa
Background: The paced auditory serial addition test (PASAT) was primarily developed to assess the effects of traumatic brain injury on cognitive functioning. Working memory (WM) is one of the most important aspects of cognitive function, and WM impairment is one of the clinically remarkable signs of aphasia. To develop the Persian version of PASAT, an initial version was used in individuals with aphasia (IWA). Methods: In this study, 25 individuals with aphasia (29-60 years) and 85 controls (18-60 years) were included. PASAT was presented in the form of recorded 61 single-digit numbers (1 to 9). The participants repeatedly added the 2 recent digits. The psychometric properties of PASAT including convergent validity (using the digit memory span tasks), divergent validity (using results in the control group and IWA group), and face validity were investigated. Test-retest reliability was considered as well. Results: The relationship between the PASAT and digit memory span tests was moderate to strong in the control group (forward digit memory span test: r= 0.52, p< 0.0001; backward digit memory span test: r = 0.48, p< 0.0001). A strong relationship was found in IWA (forward digit memory span test: r= 0.72, p< 0.0001; backward digit memory span test: r= 0.53, p= 0.006). Also, strong testretest reliability (intraclass correlation= 0.95, p< 0.0001) was observed. Conclusion: According to our results, the PASAT is a valid and reliable test to assess working memory, particularly in IWA. It could be used as a feasible tool for clinical and research applications. PMID:29445690
Li, Ho Cheung William; Chung, Oi Kwan Joyce; Ho, Ka Yan
This paper is a report of psychometric testing of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children. The availability of a valid and reliable instrument that accurately detects depressive symptoms in children is crucial before any psychological intervention can be appropriately planned and evaluated. There is no such an instrument for Chinese children. A test-retest, within-subjects design was used. A total of 313 primary school students between the ages of 8 and 12 years were invited to participate in the study in 2009. Participants were asked to respond to the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children, short form of the State Anxiety Scale for Children and Rosenberg's Self-Esteem Scale. The internal consistency, content validity and construct validity and test-retest reliability of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children were assessed. The newly-translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly-translated scale can be used as a self-report assessment tool in detecting depressive symptoms of Chinese children aged between 8 and 12 years. © 2010 Blackwell Publishing Ltd.
Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff
Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149
Umhlanga is a ceremony celebrating virginity. In South Africa, it is practiced, among others, by the Zulu ethnic group who live mainly in the province of KwaZulu Natal. After falling into relative disuse in the Zulu community, the practice of virginity testing made a comeback some 10 years ago at around the time of the country's first democratic election and coinciding with the period when the HIV pandemic began to take hold. In July 2005 the South African Parliament passed a new Children's Bill which will prohibit virginity testing of children. The Bill has been met with outrage and public protest on the part of Zulu citizens. Traditional circumcision rites are also addressed in the new bill but are not banned. Instead, male children are given the right to refuse to participate in traditional initiation ceremonies which include circumcision. This paper asks why the practice of virginity testing is regarded as so troubling to the new democratic order that the state has chosen to take the heavy-handed route of banning it. The paper further asks why the state's approach to traditional male circumcision has been so different to its approach to virginity testing. Finally, the paper asks what these two challenging cases in the country's new democracy tell us about the nature of liberal democratic citizenship in South Africa 10 years after apartheid's formal demise.
Full Text Available Abstract Background Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT or ACT. These differences have important implications both for utilization and interpretation. Although much has been written about PVs, it appears that there are still misconceptions about whether and how to employ them in secondary analyses. Methods We address a range of technical issues, including those raised in a recent article that was written to inform economists using these databases. First, an extensive review of the relevant literature was conducted, with particular attention to key publications that describe the derivation and psychometric characteristics of such achievement measures. Second, a simulation study was carried out to compare the statistical properties of estimates based on the use of PVs with those based on other, commonly used methods. Results It is shown, through both theoretical analysis and simulation, that under fairly general conditions appropriate use of PV yields approximately unbiased estimates of model parameters in regression analyses of large scale survey data. The superiority of the PV methodology is particularly evident when measures of student achievement are employed as explanatory variables. Conclusions The PV methodology used to report student test performance in large scale surveys remains the state-of-the-art for secondary analyses of these databases.
Grant, Marcia; Ferrell, Betty; Dean, Grace; Uman, Gwen; Chu, David; Krouse, Robert
Ostomies may be performed for bowel or urinary diversion, and occur in both cancer and non-cancer patients. Impact on physical, psychological, social and spiritual well-being is not unexpected, but has been minimally described in the literature. The City of Hope Quality of Life (COH-QOL)-Ostomy Questionnaire is an adult patient self-report instrument designed to assess quality of life. This report focuses on the revision and psychometric testing of this questionnaire. The revised COH-QOL-Ostomy Questionnaire involved in-depth patient interviews and expert panel review. The format consisted of a 13-item disease and demographic section, a 34-item forced-choice section, and a 41-item linear analogue scaled section. A mailed survey to California members of the United Ostomy Association resulted in a 62% response rate (n = 1513). Factor analysis was conducted to refine the instrument. Construct validity involved testing a number of hypotheses identifying contrasting groups. Factor analysis confirmed the conceptual framework. Reliability of subscales ranged from 0.77 to 0.90. The questionnaire discriminated between subpopulations with specific concerns. Overall, the analyses provide evidence for the validity and reliability of the COH-QOL-Ostomy Questionnaire as a comprehensive, multidimensional self-report questionnaire for measuring quality of life in patients with intestinal ostomies.
Ferrero-Arias, J; Turrión-Rojo, M A
To explore the relationship between scores on the Test Your Memory (TYM) battery and findings from a more exhaustive neurocognitive assessment. The TYM and fourteen psychometric tests were administered to 84 subjects aged 50 or older who attended an outpatient neurology clinic due to cognitive symptoms. Each patient's cognitive state was determined independently from his/her score on the TYM (CDR 0, n=25; CDR 0.5, n=45; CDR 1, n=14). We analysed concurrent validity of TYM scores and results from the psychometric tests, as well as the degree of concordance between the two types of measurement, by contrasting normalised data from each instrument. Although the intraclass correlation coefficient was 0.67 (confidence interval 95%, 0.53-0.77), analysis of the Bland-Altman plot and the curve on the survival-agreement plot (Luiz et al. method) demonstrates that the individual distances between the two methods exhibit excessive dispersion from a clinical viewpoint. TYM-based predictions of the mean z-score on psychometric tests differed substantially from real results in 30% of the subjects. Concordance of 95% can only be achieved by accepting absolute inter-instrument differences of up to 0.87 as identical values. Furthermore, the TYM underestimates cognitive performance for low values and overestimates it for high values. The TYM is a cognitive screening test which should not be used to predict results on psychometric tests or to detect cognitive changes in clinical trials. Copyright © 2014 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.
Thanakwang, Kattika; Isaramalai, Sang-Arun; Hatthakit, Urai
Active aging is central to enhancing the quality of life for older adults, but its conceptualization is not often made explicit for Asian elderly people. Little is known about active aging in older Thai adults, and there has been no development of scales to measure the expression of active aging attributes. The aim of this study was to develop a culturally relevant composite scale of active aging for Thai adults (AAS-Thai) and to evaluate its reliability and validity. EIGHT STEPS OF SCALE DEVELOPMENT WERE FOLLOWED: 1) using focus groups and in-depth interviews, 2) gathering input from existing studies, 3) developing preliminary quantitative measures, 4) reviewing for content validity by an expert panel, 5) conducting cognitive interviews, 6) pilot testing, 7) performing a nationwide survey, and 8) testing psychometric properties. In a nationwide survey, 500 subjects were randomly recruited using a stratified sampling technique. Statistical analyses included exploratory factor analysis, item analysis, and measures of internal consistency, concurrent validity, and test-retest reliability. Principal component factor analysis with varimax rotation resulted in a final 36-item scale consisting of seven factors of active aging: 1) being self-reliant, 2) being actively engaged with society, 3) developing spiritual wisdom, 4) building up financial security, 5) maintaining a healthy lifestyle, 6) engaging in active learning, and 7) strengthening family ties to ensure care in later life. These factors explained 69% of the total variance. Cronbach's alpha coefficient for the overall AAS-Thai was 0.95 and varied between 0.81 and 0.91 for the seven subscales. Concurrent validity and test-retest reliability were confirmed. The AAS-Thai demonstrated acceptable overall validity and reliability for measuring the multidimensional attributes of active aging in a Thai context. This newly developed instrument is ready for use as a screening tool to assess active aging levels among older
Poghosyan, Lusine; Nannini, Angela; Finkelstein, Stacey R; Mason, Emanuel; Shaffer, Jonathan A
Policy makers and healthcare organizations are calling for expansion of the nurse practitioner (NP) workforce in primary care settings to assure timely access and high-quality care for the American public. However, many barriers, including those at the organizational level, exist that may undermine NP workforce expansion and their optimal utilization in primary care. This study developed a new NP-specific survey instrument, Nurse Practitioner Primary Care Organizational Climate Questionnaire (NP-PCOCQ), to measure organizational climate in primary care settings and conducted its psychometric testing. Using instrument development design, the organizational climate domain pertinent for primary care NPs was identified. Items were generated from the evidence and qualitative data. Face and content validity were established through two expert meetings. Content validity index was computed. The 86-item pool was reduced to 55 items, which was pilot tested with 81 NPs using mailed surveys and then field-tested with 278 NPs in New York State. SPSS 18 and Mplus software were used for item analysis, reliability testing, and maximum likelihood exploratory factor analysis. Nurse Practitioner Primary Care Organizational Climate Questionnaire had face and content validity. The content validity index was .90. Twenty-nine items loaded on four subscale factors: professional visibility, NP-administration relations, NP-physician relations, and independent practice and support. The subscales had high internal consistency reliability. Cronbach's alphas ranged from.87 to .95. Having a strong instrument is important to promote future research. Also, administrators can use it to assess organizational climate in their clinics and propose interventions to improve it, thus promoting NP practice and the expansion of NP workforce.
Kuo, Shu-Fen; Chang, Wen-Yin; Chang, Lu-I; Chou, Yu-Hua; Chen, Ching-Min
This is a report of development and psychometric testing of the East Asian Acculturation Measure-Chinese version (EAAM-C) scale. An instrument validation design with a cross-sectional survey was conducted. The process was carried in two phases. In Phase 1, Barry's East Asian Acculturation Measure was translated and back translated to evaluate its content, face validity, and feasibility validity. In Phase 2, the 16-item EAAM-C was pilot-tested among 485 female immigrants for test-retest reliability, internal consistency, theoretically-supported construct validity and concurrent validity. The pilot work and the survey results indicated the tools possessed adequate content and face validity. The Cronbach's Alphas for the EAAM-C was 0.72, and 0.76-0.79 for its subscales, and the correlation of test-retest reliability (at 3 weeks) was 0.75. After dropping one item, four theoretically-supported factors which explained 61.82% of the variance were abstracted using exploratory factor analysis: assimilation, integration, separation, and marginalization. Based on the underlying four-factor theoretical structures of the EAAM, the confirmatory factor analysis of the EAAM-C was further examined. The analysis revealed that the four-factor model was an acceptable fit for the data which demonstrated adequate finding in its construct validity. These factors were inter-correlated, and showed statistically significant correlation with the Chinese Health Questionnaire, indicating adequate concurrent validity. The scale shows acceptable validity and consistency, and suggests that immigrant acculturation is a complex construct. This quick evaluation instrument can be applied to assess clients' acculturation and in further developing certain interventions to improve their health.
Wocial, Lucia D; Weaver, Michael T
To report the development and psychometric testing of the Moral Distress Thermometer. The Moral Distress Thermometer is a new screening tool to measure moral distress in nurses who practise in the hospital setting. Moral distress occurs when one knows the ethically correct thing to do, but is prevented from acting on that perceived obligation. It is a well documented phenomenon with negative consequences that may be experienced by nurses. Creating an instrument to effectively and efficiently measure moral distress in a timely way has been identified as a priority for nursing. This study used a cross-sectional survey design. Data collection for this research occurred in 2009. Participants simultaneously completed either the adult or pediatric version of the Moral Distress Scale version 2009 and the Moral Distress Thermometer. A total of 529 participants from various clinical areas completed both tools. Coefficients alpha were adequate for both Adult (0·90) and Pediatric (0·92) Moral Distress Scale 2009 scales. Statistically significant Pearson correlations were found for the Moral Distress Thermometer with Adult Moral Distress Scale 2009 and Pediatric Moral Distress Scale 2009 and higher Moral Distress Thermometer, Adult Moral Distress Scale 2009 and Pediatric Moral Distress Scale 2009 means for participants who had left or who considered leaving a position because of moral distress. These findings provide support for the validity of the Moral Distress Thermometer. © 2012 Blackwell Publishing Ltd.
Piredda, Michela; Biagioli, Valentina; Gambale, Giulia; Porcelli, Elisa; Barbaranelli, Claudio; Palese, Alvisa; De Marinis, Maria Grazia
Effective measures of nursing care dependency in neurorehabilitation are warranted to plan nursing interventions to help patients avoid increasing dependency. The Care Dependency Scale (CDS) is a theory-based, comprehensive tool to evaluate functional disability. This study aimed to modify the CDS for neurological and neurorehabilitation patients (Neuro-CDS) and to test its psychometric properties in adult neurorehabilitation inpatients. Exploratory factor analysis (EFA) was performed using a Maximum Likelihood robust (MLR) estimator. The Barthel Index (BI) was used to evaluate concurrent validity. Stability was measured using the Intra-class Correlation Coefficient (ICC). The sample included 124 patients (mean age = 69.7 years, 54% male). The EFA revealed a two-factor structure with good fit indexes, Factor 1 (Physical care dependence) loaded by 11 items and Factor 2 (Psycho-social care dependence) loaded by 4 items. The correlation between factors was 0.61. Correlations between Factor 1 and the BI and between Factor 2 and the BI were r = 0.843 and r = 0.677, respectively (p dependence in neurorehabilitation patients as a basis for individualized and holistic care.
Molle, Elizabeth; Froman, Robin
Computerized interdisciplinary plans of care have revitalized nurse-centric care plans into dynamic and meaningful electronic documents. To maximize the benefits of these documents, it is important to understand healthcare professionals' attitudes, specifically their confidence, for making computerized interdisciplinary care plans useful and meaningful documents. The purpose of the study was to test the psychometric properties of the Self-Efficacy for Interdisciplinary Plans of Care instrument intended to measure healthcare professionals' self-efficacy for using such documents. Content validity was assessed by an expert review panel. Content validity indices ranged from 0.75 to 1.00, with a scale CVI of 0.94. A sample of 389 healthcare providers completed the 14-item instrument. Principal axis factoring was used to assess factor structure. The exploratory factor analysis yielded a single-factor structure accounting for 71.76% of covariance. Cronbach internal consistency coefficient for the single factor solution was .97. The corrected item-total correlations ranged from 0.71 to 0.90. The coefficient of stability, during a 2-week period, with a subset of the sample (n = 38), was estimated at 0.82. The results of this study suggest that the Self-Efficacy for Interdisciplinary Plans of Care has sturdy reliability and validity for measuring the self-efficacy of healthcare providers to make computerized interdisciplinary plans of care meaningful and useful documents.
: beliefs, practices, institutions, and also things. In this sense, the meaning of the term in social research is very close to its usage in common language and is not always theoretically well developed (see Shils, 1971: 123). But the concept of tradition has also been central to major theoretical debates...... on the nature of social change, especially in connection with the notion of modernity. Here tradition is linked to various forms of agency as a factor of both stability and intentional change....
Felipo, Vicente; Urios, Amparo; Giménez-Garzó, Carla; Cauli, Omar; Andrés-Costa, Maria-Jesús; González, Olga; Serra, Miguel A; Sánchez-González, Javier; Aliaga, Roberto; Giner-Durán, Remedios; Belloch, Vicente; Montoliu, Carmina
To assess whether non invasive blood flow measurement by arterial spin labeling in several brain regions detects minimal hepatic encephalopathy. Blood flow (BF) was analyzed by arterial spin labeling (ASL) in different brain areas of 14 controls, 24 cirrhotic patients without and 16 cirrhotic patients with minimal hepatic encephalopathy (MHE). Images were collected using a 3 Tesla MR scanner (Achieva 3T-TX, Philips, Netherlands). Pulsed ASL was performed. Patients showing MHE were detected using the battery Psychometric Hepatic Encephalopathy Score (PHES) consisting of five tests. Different cognitive and motor functions were also assessed: alterations in selective attention were evaluated using the Stroop test. Patients and controls also performed visuo-motor and bimanual coordination tests. Several biochemical parameters were measured: serum pro-inflammatory interleukins (IL-6 and IL-18), 3-nitrotyrosine, cGMP and nitrates+nitrites in plasma, and blood ammonia. Bivariate correlations were evaluated. In patients with MHE, BF was increased in cerebellar hemisphere (P = 0.03) and vermis (P = 0.012) and reduced in occipital lobe (P = 0.017). BF in cerebellar hemisphere was also increased in patients without MHE (P = 0.02). Bimanual coordination was impaired in patients without MHE (P = 0.05) and much more in patients with MHE (P battery and with CFF. BF in cerebellar hemisphere correlates with plasma cGMP and nitric oxide (NO) metabolites. BF in vermis cerebellar also correlates with NO metabolites and with 3-nitrotyrosine. IL-18 in plasma correlates with BF in thalamus and occipital lobe. Non invasive BF determination in cerebellum using ASL may detect MHE earlier than the PHES. Altered NO-cGMP pathway seems to be associated to altered BF in cerebellum.
Petrova, Tatjana; Kavookjian, Jan; Madson, Michael B; Dagley, John; Shannon, David; McDonough, Sharon K
Motivational interviewing (MI) has demonstrated a significant impact as an intervention strategy for addiction management, change in lifestyle behaviors, and adherence to prescribed medication and other treatments. Key elements to studying MI include training in MI of professionals who will use it, assessment of skills acquisition in trainees, and the use of a validated skills assessment tool. The purpose of this research project was to develop a psychometrically valid and reliable tool that has been designed to assess MI skills competence in health care provider trainees. The goal was to develop an assessment tool that would evaluate the acquisition and use of specific MI skills and principles, as well as the quality of the patient-provider therapeutic alliance in brief health care encounters. To address this purpose, specific steps were followed, beginning with a literature review. This review contributed to the development of relevant conceptual and operational definitions, selecting a scaling technique and response format, and methods for analyzing validity and reliability. Internal consistency reliability was established on 88 video recorded interactions. The inter-rater and test-retest reliability were established using randomly selected 18 from the 88 interactions. The assessment tool Motivational Interviewing Skills for Health Care Encounters (MISHCE) and a manual for use of the tool were developed. Validity and reliability of MISHCE were examined. Face and content validity were supported with well-defined conceptual and operational definitions and feedback from an expert panel. Reliability was established through internal consistency, inter-rater reliability, and test-retest reliability. The overall internal consistency reliability (Cronbach's alpha) for all fifteen items was 0.75. MISHCE demonstrated good inter-rater reliability and good to excellent test-retest reliability. MISHCE assesses the health provider's level of knowledge and skills in brief
Piredda, Michela; Ghezzi, Valerio; Fenizia, Elisa; Marchetti, Anna; Petitti, Tommasangelo; De Marinis, Maria Grazia; Sili, Alessandro
To develop and psychometrically test the Italian-language Nurse Caring Behaviours Scale, a short measure of nurse caring behaviour as perceived by inpatients. Patient perceptions of nurses' caring behaviours are a predictor of care quality. Caring behaviours are culture-specific, but no measure of patient perceptions has previously been developed in Italy. Moreover, existing tools show unclear psychometric properties, are burdensome for respondents, or are not widely applicable. Instrument development and psychometric testing. Item generation included identifying and adapting items from existing measures of caring behaviours as perceived by patients. A pool of 28 items was evaluated for face validity. Content validity indexes were calculated for the resulting 15-item scale; acceptability and clarity were pilot tested with 50 patients. To assess construct validity, a sample of 2,001 consecutive adult patients admitted to a hospital in 2014 completed the scale and was split into two groups. Reliability was evaluated using nonlinear structural equation modelling coefficients. Measurement invariance was tested across subsamples. Item 15 loaded poorly in the exploratory factor analysis (n = 983) and was excluded from the final solution, positing a single latent variable with 14 indicators. This model fitted the data moderately. The confirmatory factor analysis (n = 1018) returned similar results. Internal consistency was excellent in both subsamples. Full scalar invariance was reached, and no significant latent mean differences were detected across subsamples. The new instrument shows reasonable psychometric properties and is a promising short and widely applicable measure of inpatient perceptions of nurse caring behaviours. © 2017 John Wiley & Sons Ltd.
Full Text Available Executive function (EF is an important predictor of numerous developmental outcomes, such as academic achievement and behavioral adjustment. Although a plethora of measurement instruments exists to assess executive function in children, only few of these are suitable for toddlers, and even fewer have undergone psychometric evaluation. The present study evaluates the psychometric properties and validity of an assessment battery for measuring EF in two-year-olds. A sample of 2437 children were administered the assessment battery at a mean age of 2;4 years (SD = 0;3 years in a large-scale field study. Measures of both hot EF (snack and gift delay tasks and cool EF (six boxes, memory for location, and visual search task were included. Confirmatory Factor Analyses showed that a two-factor hot and cool EF model fitted the data better than a one-factor model. Measurement invariance was supported across groups differing in age, gender, socioeconomic status (SES, home language, and test setting. Criterion and convergent validity were evaluated by examining relationships between EF and age, gender, SES, home language, and parent and teacher reports of children’s attention and inhibitory control. Predictive validity of the test battery was investigated by regressing children’s pre-academic skills and behavioral problems at age three on the latent hot and cool EF factors at age two years. The test battery showed satisfactory psychometric quality and criterion, convergent, and predictive validity. Whereas cool EF predicted both pre-academic skills and behavior problems one year later, hot EF predicted behavior problems only. These results show that EF can be assessed with psychometrically sound instruments in children as young as two years, and that EF tasks can be reliably applied in large scale field research. The current instruments offer new opportunities for investigating EF in early childhood, and for evaluating interventions targeted at improving
Clark, Cynthia M; Barbosa-Leiker, Celestina; Gill, Larecia Money; Nguyen, Danh
Academic incivility is a serious challenge for nursing education, which needs to be empirically measured and fully addressed. A convenience sample of nursing faculty and students from 20 schools of nursing in the United States participated in a mixed-methods study to test the psychometric properties of the Incivility in Nursing Education-Revised (INE-R) Survey. A factor analysis and other reliability analyses support the use of the INE-R as a valid and reliable measurement of student and faculty perceptions of incivility in nursing education. The INE-R is a psychometrically sound instrument to measure faculty and student perceptions of incivility; to examine differences regarding levels of nursing education, program type, gender, age, and ethnicity; to compare perceptions of incivility between and among adjunct, clinical, teaching, and research faculty; and to conduct pre- and postassessments of the perceived levels of faculty and student incivility in nursing programs to inform evidence-based interventions. Copyright 2015, SLACK Incorporated.
Full Text Available Abstract Background Validated instruments are needed to evaluate the programmatic impact of Evidence Based Practice (EBP training and to document the competence of individual trainees. This study aimed to translate the Fresno test into Spanish and subsequently validate it, in order to ensure the equivalence of the Spanish version against the original English version. Methods Before and after study performed between October 2007 and June 2008. Three groups of participants: (a Mentors of family medicine residents (expert group (n = 56; (b Family medicine physicians (intermediate experience group (n = 17; (c Family medicine residents (novice group (n = 202; Medical residents attended an EBP course, and two sets of the test were administered before and after the course. The Fresno test is a performance based measure for use in medical education that assesses EBP skills. The outcome measures were: inter-rater and intra-rater reliability, internal consistency, item analyses, construct validity, feasibility of administration, and responsiveness. Results Inter-rater correlations were 0.95 and 0.85 in the pre-test and the post-test respectively. The overall intra-rater reliability was 0.71 and 0.81 in the pre-test and post-test questionnaire, respectively. Cronbach's alpha was 0.88 and 0.77, respectively. 152 residents (75.2% returned both sets of the questionnaire. The observed effect size for the residents was 1.77 (CI 95%: 1.57-1.95, the standardised response mean was 1.65 (CI 95%:1.47-1.82. Conclusions The Spanish version of the Fresno test is a useful tool in assessing the knowledge and skills of EBP in Spanish-speaking residents of Family Medicine.
Haug, Tobias; Mann, Wolfgang
Given the current lack of appropriate assessment tools for measuring deaf children's sign language skills, many test developers have used existing tests of other sign languages as templates to measure the sign language used by deaf people in their country. This article discusses factors that may influence the adaptation of assessment tests from one natural sign language to another. Two tests which have been adapted for several other sign languages are focused upon: the Test for American Sign Language and the British Sign Language Receptive Skills Test. A brief description is given of each test as well as insights from ongoing adaptations of these tests for other sign languages. The problems reported in these adaptations were found to be grounded in linguistic and cultural differences, which need to be considered for future test adaptations. Other reported shortcomings of test adaptation are related to the question of how well psychometric measures transfer from one instrument to another.
Leeuw, Jan de
In psychometrics, and in the closely related fields of quantititative methods for the social and educational sciences, R is not yet used very often. Traditional mainframe packages such as SAS and SPSS are still dominant at the user-level, Stata has made inroads at the teaching level, and Matlab is quite prominent at the research level. In this paper we define the most visible techniques in the psychometrics area, we give an overview of what is available in R, and we discuss what is m...
M.M.H. Strating (Mathilde); A.P. Nieboer (Anna)
textabstractAbstract BACKGROUND: Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate
Khorashad, Behzad S.; Baron-Cohen, Simon; Roshan, Ghasem M.; Kazemian, Mojtaba; Khazai, Ladan; Aghili, Zahra; Talaei, Ali; Afkhamizadeh, Mozhgan
The psychometric properties of the Persian "Reading the Mind in the Eyes" test were investigated, so were the predictions from the Empathizing-Systemizing theory of psychological sex differences. Adults aged 16-69 years old (N = 545, female = 51.7%) completed the test online. The analysis of items showed them to be generally acceptable.…
Youngjohn, James R.; And Others
Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Fernandes, Tânia; Araújo, Susana; Sucena, Ana; Reis, Alexandra; Castro, São Luís
Reading is a central cognitive domain, but little research has been devoted to standardized tests for adults. We, thus, examined the psychometric properties of the 1-min version of Teste de Idade de Leitura (Reading Age Test; 1-min TIL), the Portuguese version of Lobrot L3 test, in three experiments with college students: typical readers in Experiment 1A and B, dyslexic readers and chronological age controls in Experiment 2. In Experiment 1A, test-retest reliability and convergent validity were evaluated in 185 students. Reliability was >.70, and phonological decoding underpinned 1-min TIL. In Experiment 1B, internal consistency was assessed by presenting two 45-s versions of the test to 19 students, and performance in these versions was significantly associated (r = .78). In Experiment 2, construct validity, criterion validity and clinical utility of 1-min TIL were investigated. A multiple regression analysis corroborated construct validity; both phonological decoding and listening comprehension were reliable predictors of 1-min TIL scores. Logistic regression and receiver operating characteristics analyses revealed the high accuracy of this test in distinguishing dyslexic from typical readers. Therefore, the 1-min TIL, which assesses reading comprehension and potential reading difficulties in college students, has the necessary psychometric properties to become a useful screening instrument in neuropsychological assessment and research. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.
Mortensen, E L; Simonsen, E
A translation of the MCMI-I has been in use in Denmark for some years. An untested assumption in the interpretation of the pattern of test results is that the psychometric characteristics of the Danish and American versions are similar. The purpose of this study was to evaluate the psychometric...... properties of the questionnaire by using traditional psychometric analysis techniques on the results of a sample consisting of 423 patients and 179 normal controls. Coefficient alpha was calculated for the 20 clinical subscales of the test and the Danish results were strikingly similar to the original...... coefficients reported by Millon. Furthermore, factor analysis of the subscales showed a factor structure very similar to American findings, and it is concluded that the psychometric properties of the Danish MCMI are not significantly different from the original....
Ranger, Jochen; Kuhn, Jörg-Tobias; Szardenings, Carsten
Cognitive psychometric models embed cognitive process models into a latent trait framework in order to allow for individual differences. Due to their close relationship to the response process the models allow for profound conclusions about the test takers. However, before such a model can be used its fit has to be checked carefully. In this manuscript we give an overview over existing tests of model fit and show their relation to the generalized moment test of Newey (Econometrica, 53, 1985, 1047) and Tauchen (J. Econometrics, 30, 1985, 415). We also present a new test, the Hausman test of misspecification (Hausman, Econometrica, 46, 1978, 1251). The Hausman test consists of a comparison of two estimates of the same item parameters which should be similar if the model holds. The performance of the Hausman test is evaluated in a simulation study. In this study we illustrate its application to two popular models in cognitive psychometrics, the Q-diffusion model and the D-diffusion model (van der Maas, Molenaar, Maris, Kievit, & Boorsboom, Psychol Rev., 118, 2011, 339; Molenaar, Tuerlinckx, & van der Maas, J. Stat. Softw., 66, 2015, 1). We also compare the performance of the test to four alternative tests of model fit, namely the M 2 test (Molenaar et al., J. Stat. Softw., 66, 2015, 1), the moment test (Ranger et al., Br. J. Math. Stat. Psychol., 2016) and the test for binned time (Ranger & Kuhn, Psychol. Test. Asess. , 56, 2014b, 370). The simulation study indicates that the Hausman test is superior to the latter tests. The test closely adheres to the nominal Type I error rate and has higher power in most simulation conditions. © 2017 The British Psychological Society.
Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling
Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, pself-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Fox, Gerardus J.A.; van den Berg, Stéphanie Martine; Veldkamp, Bernard P.; Irwing, P.; Booth, T.; Hughes, D.
In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item
Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M
The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…
Full Text Available Alessandra Gorini,1,2 Ketti Mazzocco,1,2 Sara Gandini,2 Elisabetta Munzone,2 Gordon McVie,2 Gabriella Pravettoni1,2 1Department of Health Science, University of Milan, Milan, Italy; 2European Institute of Oncology, Milan, Italy Introduction: The advent of “personalized medicine” has been driven by technological advances in genomics. Concentration at the subcellular level of a patient's cancer cells has meant inevitably that the “person” has been overlooked. For this reason, we think there is an urgent need to develop a truly personalized approach focusing on each patient as an individual, assessing his/her unique mental dimensions and tailoring interventions to his/her individual needs and preferences. The aim of this study was to develop and test the psychometric properties of the ALGA-Breast Cancer (ALGA-BC, a new multidimensional questionnaire that assesses the breast cancer patient's physical and mental characteristics in order to provide physicians, prior to the consultation, with a patient's profile that is supposed to facilitate subsequent communication, interaction, and information delivery between the doctor and the patient. Methods: The specific validation processes used were: content and face validity, construct validity using factor analysis, reliability and internal consistency using test–retest reliability, and Cronbach's alpha correlation coefficient. The exploratory analysis included 100 primary breast cancer patients and 730 healthy subjects. Results: The exploratory factor analysis revealed eight key factors: global self-rated health, perceived physical health, anxiety, self-efficacy, cognitive closure, memory, body image, and sexual life. Test–retest reliability and internal consistency were good. Comparing patients with a sample of healthy subjects, we also observed a general ability of the ALGA-BC questionnaire to discriminate between the two. Conclusion: The ALGA-BC questionnaire with 29 items is a valid
Ajorpaz, Neda Mirbagher; Tafreshi, Mansoureh Zagheri; Mohtashami, Jamileh; Zayeri, Farid; Rahemi, Zahra
The clinical competence of nursing students in operating room (OR) is an important issue in nursing education. The purpose of this study was to evaluate the psychometric properties of the Persian Perceived Perioperative Competence Scale-Revised (PPCS-R) instrument. This cross-sectional study was conducted across 12 universities in Iran. The psychometric properties and factor structure of the PPCS-R for OR students was examined. Based on the results of factor analysis, seven items were removed from the original version of the scale. The fitness indices of the Persian scale include comparative fit index (CFI) = .90, goodness-of-fit-index (GFI) = .86, adjusted goodness-of-fit index (AGFI) = .90, normed fit index (NFI) = .84, and root mean square error of approximation (RMSEA) = .04. High validity and reliability indicated the scale's value for measuring perceived perioperative competence of Iranian OR students.
Nieboer Anna P; Strating Mathilde MH
Abstract Background Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods This study included quality improvement teams participating in the Care for Better improvement program for...
Tseng, Hsu-Min; Liu, F-C; West, Michael
Understanding the feasibility of applying the Team Climate Inventory (TCI) in non-Western cultures is essential for researchers attempting to understand the influence of culture on workers' perceived climate. This study describes the application of the TCI in such a setting using data from 203 administrators employed in a Taiwanese medical center. Reliability and factor analyses were performed to establish the feasibility and psychometric properties of the TCI Taiwan version. Reliabilities of...
Moeini, Babak; Zamanian, Hadi; Taheri-Kharameh, Zahra; Ramezani, Tahereh; Saati-Asr, Mohamadhasan; Hajrahimian, Mohamadhasan; Amini-Tehrani, Mohammadali
Spirituality plays an important role in coping with chronic diseases for patients and they often report unmet spiritual and existential needs, which should be considered for a holistic view of their health. Studying spiritual needs in this generation requires culturally appropriate and valid instruments. The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The "forward-backward" procedure was applied to translate the SpNQ from English into Persian. The SpNQ-Persian Version (SpNQ-PV) was checked in terms of validity and reliability with a convenience sample of 100 elders with chronic diseases who were recruited from the inpatient wards at two university hospitals in Qom, Iran. The validity was assessed using content, face, and construct validity. The Cronbach alpha and test-retest were used to assess the reliability of the questionnaire. The results of the exploratory factor analysis indicated a five-factor solution for the questionnaire, which included religious needs, existential needs, forgiveness/generativity needs, need for inner peace, and emotional needs. These accounted for 60.1% of the total observed variance. One item was removed (factor loading Spiritual Well-being Scale. Cronbach alpha of the subscales ranged from 0.56 to 0.78 and the test-retest reliability ranged from 0.72 to 0.91, which indicated an acceptable range of reliability. The SpNQ-PV showed a minor difference in structuring and indicated good psychometric properties, which can be used to assess the spiritual needs of Iranian elders suffering from chronic diseases. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Lin, Chung-Ying; Broström, Anders; Nilsen, Per; Griffiths, Mark D; Pakpour, Amir H
Background and aims The Bergen Social Media Addiction Scale (BSMAS), a six-item self-report scale that is a brief and effective psychometric instrument for assessing at-risk social media addiction on the Internet. However, its psychometric properties in Persian have never been examined and no studies have applied Rasch analysis for the psychometric testing. This study aimed to verify the construct validity of the Persian BSMAS using confirmatory factor analysis (CFA) and Rasch models among 2,676 Iranian adolescents. Methods In addition to construct validity, measurement invariance in CFA and differential item functioning (DIF) in Rasch analysis across gender were tested for in the Persian BSMAS. Results Both CFA [comparative fit index (CFI) = 0.993; Tucker-Lewis index (TLI) = 0.989; root mean square error of approximation (RMSEA) = 0.057; standardized root mean square residual (SRMR) = 0.039] and Rasch (infit MnSq = 0.88-1.28; outfit MnSq = 0.86-1.22) confirmed the unidimensionality of the BSMAS. Moreover, measurement invariance was supported in multigroup CFA including metric invariance (ΔCFI = -0.001; ΔSRMR = 0.003; ΔRMSEA = -0.005) and scalar invariance (ΔCFI = -0.002; ΔSRMR = 0.005; ΔRMSEA = 0.001) across gender. No item displayed DIF (DIF contrast = -0.48 to 0.24) in Rasch across gender. Conclusions Given the Persian BSMAS was unidimensional, it is concluded that the instrument can be used to assess how an adolescent is addicted to social media on the Internet. Moreover, users of the instrument may comfortably compare the sum scores of the BSMAS across gender.
Sapershtein, E.E.; Starodubskii, V.E.
The review considers the testing of some exotic hypotheses about the properties of the nucleon in a nuclear medium in phenomena of traditional nuclear physics. The hypothesis of nucleon swelling proposed to explain the EMC effects is considered in detail. The consequences of this hypothesis for the charge densities and cross sections for scattering of fast electrons and protons by nuclei are analyzed. Also considered are the Nolen--Schiffer anomaly, the Coulomb sum rule for inelastic electron scattering, y scaling, and some other nuclear processes. It is shown that one can estimate the possible scale of nuclear exotics by analyzing many of these phenomena. Thus, examination of high-precision data on the elastic scattering of electrons with energy 500--700 MeV using density distributions calculated on the basis of the self-consistent theory of finite Fermi systems yields a restriction on the amount of nucleon swelling: α=δr N /r N approx-lt 10%. A similar analysis for protons with energy 0.8--1.0 GeV using Glauber theory gives α approx-lt 6%. An even more stringent restriction, α approx-lt 3%, follows from data on y scaling in 56 Fe
African Traditional Medicines (ATMs) serve as a major source of primary healthcare for African people. The reasons for their use range from easy access, affordability, beliefs in traditional systems and long term safety. ATMs have been used to treat individuals infected with HIV and therefore need scientific validation; a view ...
Bekhet, Abir K; Zauszniewski, Jaclene A
Positive thinking interventions improve adaptive functioning and quality of life in many populations. However, no direct measure of positive thinking skills taught during intervention exists. This psychometric study of a convenience sample of 109 autism spectrum disorder (ASD) caregivers examined a new eight-item Positive Thinking Skills Scale (PTSS), which measures the frequency of use of positive thinking skills. The PTSS was found to be internally consistent (α = .90). Construct validity was supported by significant correlations (p thinking skills could provide direction for future intervention.
Carpenter, Brian D.; Balsis, Steve; Otilingam, Poorni G.; Hanson, Priya K.; Gatz, Margaret
Purpose: This study provides preliminary evidence for the acceptability, reliability, and validity of the new Alzheimer's Disease Knowledge Scale (ADKS), a content and psychometric update to the Alzheimer's Disease Knowledge Test. Design and Methods: Traditional scale development methods were used to generate items and evaluate their psychometric…
Hung, Man; Hon, Shirley D; Cheng, Christine; Franklin, Jeremy D; Aoki, Stephen K; Anderson, Mike B; Kapron, Ashley L; Peters, Christopher L; Pelt, Christopher E
The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Cohort study (diagnosis); Level of evidence, 2. Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior in all psychometric aspects examined in this study. Future
Meyerson, Paul; Tryon, Warren W
This study evaluated the psychometric equivalency of Web-based research. The Sexual Boredom Scale was presented via the World-Wide Web along with five additional scales used to validate it. A subset of 533 participants that matched a previously published sample (Watt & Ewing, 1996) on age, gender, and race was identified. An 8 x 8 correlation matrix from the matched Internet sample was compared via structural equation modeling with a similar 8 x 8 correlation matrix from the previously published study. The Internet and previously published samples were psychometrically equivalent. Coefficient alpha values calculated on the matched Internet sample yielded reliability coefficients almost identical to those for the previously published sample. Factors such as computer administration and uncontrollable administration settings did not appear to affect the results. Demographic data indicated an overrepresentation of males by about 6% and Caucasians by about 13% relative to the U.S. Census (2000). A total of 2,230 participants were obtained in about 8 months without remuneration. These results suggest that data collection on the Web is (1) reliable, (2) valid, (3) reasonably representative, (4) cost effective, and (5) efficient.
Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée
The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.
Harris, Richard W; McPherson, David L; Hanson, Claire M; Eggett, Dennis L
This study identified, digitally recorded, edited and evaluated 89 bisyllabic Vietnamese words with the goal of identifying homogeneous words that could be used to measure the speech recognition threshold (SRT) in native talkers of Vietnamese. Native male and female talker productions of 89 Vietnamese bisyllabic words were recorded, edited and then presented at intensities ranging from -10 to 20 dBHL. Logistic regression was used to identify the best words for measuring the SRT. Forty-eight words were selected and digitally edited to have 50% intelligibility at a level equal to the mean pure-tone average (PTA) for normally hearing participants (5.2 dBHL). Twenty normally hearing native Vietnamese participants listened to and repeated bisyllabic Vietnamese words at intensities ranging from -10 to 20 dBHL. A total of 48 male and female talker recordings of bisyllabic words with steep psychometric functions (>9.0%/dB) were chosen for the final bisyllabic SRT list. Only words homogeneous with respect to threshold audibility with steep psychometric function slopes were chosen for the final list. Digital recordings of bisyllabic Vietnamese words are now available for use in measuring the SRT for patients whose native language is Vietnamese.
Liou, Shwu-Ru; Liu, Hsiu-Chen; Tsai, Hsiu-Min; Tsai, Ying-Huang; Lin, Yu-Ching; Chang, Chia-Hao; Cheng, Ching-Yu
The purpose of the study was to develop and psychometrically test the Nurses Clinical Reasoning Scale. Clinical reasoning is an essential skill for providing safe and quality patient care. Identifying pre-graduates' and nurses' needs and designing training courses to improve their clinical reasoning competence becomes a critical task. However, there is no instrument focusing on clinical reasoning in the nursing profession. Cross-sectional design was used. This study included the development of the scale, a pilot study that preliminary tested the readability and reliability of the developed scale and a main study that implemented and tested the psychometric properties of the developed scale. The Nurses Clinical Reasoning Scale was developed based on the Clinical Reasoning Model. The scale includes 15 items using a Likert five-point scale. Data were collected from 2013-2014. Two hundred and fifty-one participants comprising clinical nurses and nursing pre-graduates completed and returned the questionnaires in the main study. The instrument was tested for internal consistency and test-retest reliability. Its validity was tested with content, construct and known-groups validity. One factor emerged from the factor analysis. The known-groups validity was confirmed. The Cronbach's alpha for the entire instrument was 0·9. The reliability and validity of the Nurses Clinical Reasoning Scale were supported. The scale is a useful tool and can be easily administered for the self-assessment of clinical reasoning competence of clinical nurses and future baccalaureate nursing graduates. Study limitations and further recommendations are discussed. © 2015 John Wiley & Sons Ltd.
Soltanparast, Sanaz; Jafari, Zahra; Sameni, Seyed Jalal; Salehi, Masoud
The purpose of the present study was to evaluate the psychometric properties (validity and reliability) of the Persian version of the Sustained Auditory Attention Capacity Test in children with attention deficit hyperactivity disorder. The Persian version of the Sustained Auditory Attention Capacity Test was constructed to assess sustained auditory attention using the method provided by Feniman and colleagues (2007). In this test, comments were provided to assess the child's attentional deficit by determining inattention and impulsiveness error, the total scores of the sustained auditory attention capacity test and attention span reduction index. In the present study for determining the validity and reliability of in both Rey Auditory Verbal Learning test and the Persian version of the Sustained Auditory Attention Capacity Test (SAACT), 46 normal children and 41 children with Attention Deficit Hyperactivity (ADHD), all right-handed and aged between 7 and 11 of both genders, were evaluated. In determining convergent validity, a negative significant correlation was found between the three parts of the Rey Auditory Verbal Learning test (first, fifth, and immediate recall) and all indicators of the SAACT except attention span reduction. By comparing the test scores between the normal and ADHD groups, discriminant validity analysis showed significant differences in all indicators of the test except for attention span reduction (pAttention Capacity test has good validity and reliability, that matches other reliable tests, and it can be used for the identification of children with attention deficits and if they suspected to have Attention Deficit Hyperactivity Disorder.
Yu, Doris S F; Lee, Diana T F; Woo, Jean
The purpose of this study was to assess the psychometric properties of the Chinese version of the Medical Outcomes Study Social Support Survey (MOS-SSS-C) in a sample of 110 patients. Criterion-related and construct validities of the MOS-SSS-C were evaluated by correlations with the Chinese version of the Multidimensional Perceived Social Support Survey (r =.82) and the Hospital Anxiety and Depression Scale (r = -.58). Confirmatory factor analysis affirmed the four-factor structure of the MOS-SSS-C in measuring the functional aspects of perceived social support. Cronbach's alphas for the subscales ranged from.93 to.96, whereas the alpha for the overall scale was.98. The 2-week test-retest reliability of the MOS-SSS-C as measured by the intraclass correlation coefficient was.84. The MOS-SSS-C is a psychometrically sound multidimensional measure for the evaluation of functional aspects of perceived social support by Chinese patients with chronic disease. Copyright 2004 Wiley Periodicals, Inc.
Investigating the psychometric properties of a screening instrument for young children is necessary to ascertain its quality and accuracy. In light of the important role culture plays on human beliefs and parenting styles, a newly translated and adapted test needs to be studied. Evaluating outcomes on a translated version of a test may reveal…
Vrachimi-Souroulla, Andry; Panayiotou, Georgia; Kokkinos, Constantinos M.; Lamprianou, Iasonas
The study aimed to field-test a Greek version of the Wechsler Quicktest and to examine its psychometric properties. The Quicktest was individually administered to 208 students, aged 5-14 years, along with a reading test. Based on the Rasch analysis, data for the Quicktest subtests showed acceptable fit to the model. Also, correlations were found…
Tepe, Rodger; Tepe, Chabha
To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
Hsueh, Yu-Jung; Chu, Hsin; Huang, Chang-Chih; Ou, Keng-Liang; Chen, Chiung-Hua; Chou, Kuei-Ru
The aim of this study was to examine the psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C). The sensitivity, specificity, and positive and negative predictive values for the MAST-C were examined in this study. The MAST-C had an internal consistency of 0.83 and a test-retest reliability of 0.89. It had a good content validity index of 0.92. Factor analysis identified four factors and the optimal cutoff point for the MAST-C was a score of 6/7, which yielded a sensitivity of 0.92, a specificity of 0.83, a positive predictive value of 0.92, and a negative predictive value of 0.83. The MAST-C provides a fast, accurate, and sensitive method for clinically diagnosing alcoholism and clinical management. © 2013 Wiley Periodicals, Inc.
Roohollah Zahedian Nasb
Full Text Available Background: Sustained visual attention is a prerequisite for learning and memory. The early evaluation of attention in childhood is essential for their school and career success in the future. The aim of this study was to design, development and investigation of psychometric properties (content, face and convergent validity and test-retest and internal consistency reliability of the computer - based sustained visual attention test (SuVAT for healthy preschool children aged 4-6 with their special needs. Methods: This study was carried out in two stages: in the first stage computerbased SuVAT in two versions original and parallel were developed. Then the test-retest and internal consistency reliability using intra-class correlation and Cronbach’s alpha coefficients respectively were examined; Face validity was calculated through ideas gathering from 10 preschool children and content validity evaluated using CVI and CVR method and convergent validity of SuVAT with CPT was assessed using Pearson correlation. Results: The developed test showed a good content and faces validity, and also had excellent test-retest reliability. In addition, the assessment of internal consistency indicated the high internal consistency of the test (Cronbach’s alpha=0.869. SuVAT and CPT test demonstrated a positive correlation upon the convergent validity testing. Conclusion: SuVAT with good reliability and validity could be used as an acceptable sustained attention assessment in preschool children.
Full Text Available “Reading the Mind in the Eyes” test (RMET is one of the most popular and widely used measures of individual differences in Theory of Mind (ToM capabilities. Despite demonstrating good validity in differentiating various clinical groups exhibiting ToM deficits from unimpaired controls, previous studies raised the question of the RMET’s homogeneity, latent structure, and reliability. The aim of this study is to provide evidence on psychometric properties, latent structure, and validity of the newly adapted Serbian version of the RMET. In total, 260 participants (61.9% females took part in the study. The sample consisted of both unimpaired controls (76.5%, and a clinical group of participants that are believed to demonstrate ToM deficits (23.5%, namely, persons diagnosed with schizophrenia and bipolar disorder (54.1% females. RMET has demonstrated fair psychometric properties (KMO = .723; α = .747; H1 = .076; H5 = .465, successfully differentiating between clinical group and control [F (1,254 = 26.175, p <.001, η2 p = .093], while typical gender differences in performance were found only in control group. Tests of several models based on the previous literature revealed that the affect-specific factors underlying performance on RMET demonstrate poor fit. The best fitting model obtained included reduced scale with a single-factor underlying the test’s performance (TLI = .953, CFI = .958, RMSEA = .020. Based on the fit parameters we propose 18-item short-form of the Serbian version of RMET (KMO = .797; α = .728; H1 = .129; H5 = .677 for economic, reliable and valid measurement of ToM abilities.
Pan, Tonya M; Mills, Sarah D; Fox, Rina S; Baik, Sharon H; Harry, Kadie M; Roesch, Scott C; Sadler, Georgia Robins; Malcarne, Vanessa L
The Life Orientation Test-Revised (LOT-R) is a widely used measure of optimism and pessimism, with three positively worded and three negatively worded content items. This study examined the structural validity and invariance, internal consistency reliability, and convergent and divergent validity of the English and Spanish versions of the LOT-R among Hispanic Americans. A community sample of Hispanic Americans ( N = 422) completed self-report measures, including the LOT-R, Patient Health Questionnaire-9, and Generalized Anxiety Disorder-7, in their preferred language of English or Spanish. Based on the literature, four structural models were tested: one-factor , oblique two-factor , orthogonal two-factor method effects with positive specific factor , and orthogonal two-factor method effects with negative specific factor . Baseline support for both of the English and Spanish versions was not achieved for any model; in all models, the negatively worded items in Spanish had non-significant factor loadings. Therefore, the positively worded three-item optimism subscale of the LOT-R was examined separately and fit the data, with factor loadings equivalent across language-preference groups. Coefficient alphas for the optimism subscale were consistent across both language-preference groups (αs = .61 [English] and .66 [Spanish]). In contrast, the six-item total score and three-item pessimism subscale demonstrated extremely low or inconsistent alphas. Convergent and divergent validity were established for the optimism subscale in both languages. In sum, the optimism subscale of the LOT-R demonstrated minimally acceptable to good psychometric properties across English and Spanish language-preference groups. However, neither the total score nor the pessimism subscale showed adequate psychometric properties for Spanish-speaking Hispanic Americans, likely due to translation and cultural adaptation issues, and thus are not supported for use with this population.
Widger, Kimberley; Tourangeau, Ann E; Steele, Rose; Streiner, David L
The field of pediatric palliative care is hindered by the lack of a well-defined, reliable, and valid method for measuring the quality of end-of-life care. The study purpose was to develop and test an instrument to measure mothers' perspectives on the quality of care received before, at the time of, and following a child's death. In Phase 1, key components of quality end-of-life care for children were synthesized through a comprehensive review of research literature. These key components were validated in Phase 2 and then extended through focus groups with bereaved parents. In Phase 3, items were developed to assess structures, processes, and outcomes of quality end-of-life care then tested for content and face validity with health professionals. Cognitive testing was conducted through interviews with bereaved parents. In Phase 4, bereaved mothers were recruited through 10 children's hospitals/hospices in Canada to complete the instrument, and psychometric testing was conducted. Following review of 67 manuscripts and 3 focus groups with 10 parents, 141 items were initially developed. The overall content validity index for these items was 0.84 as rated by 7 health professionals. Based on feedback from health professionals and cognitive testing with 6 parents, a 144-item instrument was finalized for further testing. In Phase 4, 128 mothers completed the instrument, 31 of whom completed it twice. Test-retest reliability, internal consistency, and construct validity were demonstrated for six subscales: Connect With Families, Involve Parents, Share Information With Parents, Share Information Among Health Professionals, Support Parents, and Provide Care at Death. Additional items with content validity were grouped in four domains: Support the Child, Support Siblings, Provide Bereavement Follow-up, and Structures of Care. Forty-eight items were deleted through psychometric testing, leaving a 95-item instrument. There is good initial evidence for the reliability and
This paper analyzes the activities, members, and effects of an inter-American expert network for the diffusion of psychometric knowledge, specifically of standardized aptitude testing for university admission in Latin America during the 1960s and 1970s. Within the framework of educational transfer studies, the role of international,…
Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina
Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the "Test of Science-Related Attitudes" (TOSRA), such as its construct validity, its…
Dimitrov, Dimiter M.; Shamrani, Abdul Rahman
This study examines the psychometric features of a General Aptitude Test-Verbal Part, which is used with assessments of high school graduates in Saudi Arabia. The data supported a bifactor model, with one general factor and three content domains (Analogy, Sentence Completion, and Reading Comprehension) as latent aspects of verbal aptitude.
Hutchinson, Marie; Doran, Frances
Background Domestic violence (DV) is an international public health issue associated with adverse health outcomes for adults and children. There have been widespread calls to increase nurses' capacity to respond to DV and improve undergraduate nursing education in this area. However, there are few valid, reliable and contemporary measures of nursing attitudes towards and beliefs concerning DV that are suited for use in evaluating education programmes. Aim To establish the psychometric properties of a newly developed inventory designed to measure nursing students' beliefs about and attitudes towards DV. Discussion Exploratory factor analysis identified five factors, with a Cronbach's alpha of 0.646. The few factors loading>.80 suggest that the instrument has good discriminate validity. The absence of cross-loadings indicate good convergent validity. Conclusion The inventory provides one of the first validated and reliable measures for examining undergraduate nursing students' attitudes towards and beliefs about DV. Implications for practice The instrument is suited for use by nurse educators in assessing the influence of curriculum design and teaching strategies on student beliefs and attitudes. It would also be useful in studies investigating nurses' clinical practice on domestic violence.
Gutiérrez Sánchez, Daniel; Cuesta-Vargas, Antonio I
Many measurements have been developed to assess the quality of death (QoD). Among these, the Quality of Dying and Death Questionnaire (QODD) is the most widely studied and best validated. Informal carers and health professionals who care for the patient during their last days of life can complete this assessment tool. The aim of the study is to carry out a cross-cultural adaptation and a psychometric analysis of the QODD for the Spanish population. The translation was performed using a double forward and backward method. An expert panel evaluated the content validity. The questionnaire was tested in a sample of 72 Spanish-speaking adult carers of deceased cancer patients. A psychometric analysis was performed to evaluate internal consistency, divergent criterion-related validity with the Mini-Suffering State Examination (MSSE) and concurrent criterion-related validity with the Palliative Outcome Scale (POS). Some items were deleted and modified to create the Spanish version of the QODD (QODD-ESP-26). The instrument was readable and acceptable. The content validity index was 0.96, suggesting that all items are relevant for the measure of the QoD. This questionnaire showed high internal consistency (Cronbach's α coefficient = 0.88). Divergent validity with MSSE (r = -0.64) and convergent validity with POS (r = -0.61) were also demonstrated. The QODD-ESP-26 is a valid and reliable instrument for the assessment of the QoD of deceased cancer patients that can be used in a clinical and research setting. Copyright © 2018 Elsevier Ltd. All rights reserved.
Rofail, Diana; Viala, Muriel; Gater, Adam; Abetz-Webb, Linda; Baladi, Jean-Francois; Cappellini, Maria Domenica
The Satisfaction with Iron Chelation Therapy (SICT) instrument was developed based on a literature review, in-depth patient and clinician interviews, and cognitive debriefing interviews. An, open-label, single arm, multicenter trial evaluating the efficacy and safety of deferasirox in patients diagnosed with transfusion-dependent iron overload, provided an opportunity to assess the psychometric measurement properties of the instrument. Psychometric analyses were performed using data at baseline from 273 patients with a range of transfusion-dependent iron overload conditions who were participating in a multinational study. Responsiveness was further evaluated for all patients who also had subsequent satisfaction domain scores collected at week 4. Baseline SICT domain scores had acceptable floor and ceiling effects and internal consistency reliability (Cronbach's alpha: 0.75-0.85). Item discriminant and item convergent validity were both excellent although one item in each analysis did not meet the specified criterion. Small to moderate correlations were observed between SICT and Short Form 36 Health Survey (SF-36) domain scores. Patients with the highest levels of serum ferritin at baseline (>3100 ng/mL) were the least satisfied about the Perceived Effectiveness of ICT and vice versa. Satisfaction improved in all patients, although there were no clear differences observed between groups of patients defined according to changes in serum ferritin levels from baseline to week 4 (stable, improved, or worsened). The SICT domains are reliable and valid. Further testing using a more specific criterion (such as assessing patient global ratings of change in satisfaction domains that correspond to the SICT domains) could help to establish with greater confidence the responsiveness of the instrument.
Evren, Cuneyt; Ogel, Kultegin; Evren, Bilge; Bozkurt, Muge
The aim of this study was to evaluate psychometric properties of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in prisoners with (n = 124) or without (n = 78) drug use disorder. Participants were evaluated with the DUDIT, the DAST-10, and the Addiction Profile Index-Short (API-S). The DUDIT and the DAST-10 were found to be psychometrically sound drug abuse screening measures with high convergent validity when compared with each other (r = 0.86), and API-S (r = 0.88 and r = 0.84, respectively), and to have a Cronbach's α of 0.93 and 0.87, respectively. In addition, a single component accounted for 58.28% of total variance for DUDIT, whereas this was 47.10% for DAST-10. The DUDIT had sensitivity and specificity scores of 0.95 and 0.79, respectively, when using the optimal cut-off score of 10, whereas these scores were 0.88 and 0.74 for the DAST-10 when using the optimal cut-off score of 4. Additionally, both the DUDIT and the DAST-10 showed good discriminant validity as they differentiated prisoners with drug use disorder from those without. Findings support the Turkish versions of both the DUDIT and the DAST-10 as reliable and valid drug abuse screening instruments that measure unidimensional constructs.
Pachovski, Veno; Dimova, Slobodanka; Vaneva, Marjana
A model for gathering oral answers as part of testing the speaker skills (i.e. command of language, native or foreign) is presented, as well as the software used in the experimentation. The research presented here is a result of more than six (6) months’ work with TESOL experts, based on 60 test...
Egeland, Jens; Løvstad, Marianne; Norup, Anne; Nybo, Taina; Persson, Bengt A; Rivera, Diego Fernando; Schanke, Anne-Kristine; Sigurdardottir, Solrun; Arango-Lasprilla, Juan Carlos
Historically, the neuropsychological test traditions of the four Nordic countries have spanned from the flexible and qualitative tradition of Luria-Christensen to the quantitative large battery approach of Halstead and Kløve-Matthews. This study reports current test use and discusses whether these traditions still influence attitudes toward test use and choice of tests. The study is based on survey data from 702 Nordic neuropsychologists. The average participant used 9 tests in a standard assessment, and 25 tests overall in their practice. Test use was moderated by nationality, competence level, practice profile, and by attitude toward test selection. Participants who chose their tests flexibly used fewer tests than those adhering to the flexible battery approach, but had fewer tests from which to choose. Testing patients with psychiatric disorders was associated with using more tests. IQ, memory, attention, and executive function were the domains with the largest utilization rate, while tests of motor, visual/spatial, and language were used by few. There is a lack of academic achievement tests. Screening tests played a minor role in specialized assessments, and symptom validity tests were seldom applied on a standard basis. Most tests were of Anglo-American origin. New test methods are implemented rapidly in the Nordic countries, but test selection is also characterized by the dominating position of established and much researched tests. The Halstead-Reitan and Luria traditions are currently weak, but national differences in size of test batteries seem to be influenced by these longstanding traditions.
Dyson, Judith; Cowdell, Fiona
To develop and psychometrically test the Motivation and Self-Efficacy in Early Detection of Skin Lesions Index. Skin cancer is the most frequently diagnosed cancer worldwide. The primary strategy used to prevent skin cancer is promotion of sun avoidance and the use of sun protection. However, despite costly and extensive campaigns, cases of skin cancer continue to increase. If found and treated early, skin cancer is curable. Early detection is, therefore, very important. The study was conducted in 2013. Instrument Development. A literature review and a survey identified barriers (factors that hinder) and levers (factors that help) to skin self-examination. These were categorized according to a the Theoretical Domains Framework and this formed the basis of an instrument, which was tested for validity and reliability using confirmatory factor analysis and Cronbach's alpha respectively. A five-factor 20-item instrument was used that tested well for reliability and construct validity. Test-retest reliability was good for all items and domains. The five factors were: (i) Outcome expectancies; (ii) Intention; (iii) Self-efficacy; (iv) Social influences; (v) Memory. The Motivation and Self-Efficacy in Early Detection of Skin Lesions Index provides a reliable and valid method of assessing barriers and levers to skin self-examination. The next step is to design a theory-based intervention that can be tailored according to individual determinants to behaviour change identified by this instrument. © 2014 John Wiley & Sons Ltd.
Sanchez-Garcia, Manuel; Extremera, Natalio; Fernandez-Berrocal, Pablo
This research examined evidence regarding the reliability and validity of scores on the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test, Version 2.0 (MSCEIT; Mayer, Salovey, & Caruso, 2002). In Study 1, we found a close convergence of the Spanish consensus scores and the general and expert consensus scores determined with Mayer, Salovey, Caruso, and Sitarenios (2003) data. The MSCEIT also demonstrated adequate evidence of reliability of test scores as estimated by internal consistency and test-retest correlation after 12 weeks. Confirmatory factor analysis supported a 3-level higher factor model with 8 manifest variables (task scores), 4 first-level factors (corresponding to the 4-branch model of Mayer & Salovey , with 2 tasks for each branch), 2 second-level factors (experiential and strategic areas, with 2 branches for each area), and 1 third-level factor (overall emotional intelligence [EI]), and multigroup analyses supported MSCEIT cross-gender invariance. Study 2 found evidence for the discriminant validity of scores on the MSCEIT subscales, which were differentially related to personality and self-reported EI. Study 3 provided evidence of the incremental validity of scores on the MSCEIT, which added significant variance to the prospective prediction of psychological well-being after controlling for personality traits. The psychometric properties of the Spanish MSCEIT are similar to those of the original English version, supporting its use for assessing emotional abilities in the Spanish population. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Hirani, Shela Akbar Ali; Karmaliani, Rozina; Christie, Thomas; Parpio, Yasmin; Rafique, Ghazala
breast feeding is an essential source of nutrition among young babies; however, in Pakistan a gradual decline in prevalence of breast feeding, especially among urban working mothers, has been reported. Previous studies among Pakistani urban working mothers have revealed that ensuring exclusivity and continuation of breast feeding is challenging if social and/or workplace environmental support is minimal or absent. This problem indicated a crucial need to assess availability of breast-feeding support for Pakistani urban working mothers by using a comprehensive, reliable, and validated tool in their national language (Urdu). to develop and test the psychometric properties of the 'Perceived Breastfeeding Support Assessment Tool' (PBSAT) that can gauge Pakistani urban working mothers' perceptions about breast-feeding support. this methodological research was undertaken in five phases. During phase I, a preliminary draft of the PBSAT was developed by using the Socio-ecological model, reviewing literature, and referring to two United States based tools. In Phase II, the instrument was evaluated by seven different experts, and, in Phase III, the instrument was revised, translated, and back translated. In Phase IV, the tool was pilot tested among 20 participants and then modified on the basis of statistical analysis. In Phase V, the refined instrument was tested on 200 breast-feeding working mothers recruited through purposive sampling from the government and private health-care settings in Karachi, Pakistan. Approvals were received from the Ethical Review Committees of the identified settings. the 29-item based PBSAT revealed an acceptable inter-rater reliability of 0.95, and an internal consistency reliability coefficient (Cronbach's alpha) of 0.85. A construct validity assessment through Exploratory Factor Analysis revealed that the PBSAT has two dimensions, 'workplace environmental support' (12 items; α=0.86) and 'social environmental support' (17 items; α=0.77). the
Kim, Y; Evangelista, LS; Phillips, LR; Pavlish, C; Kopple, JD
Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and v...
Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise
Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test - retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ 2 (df) 969.46 (546), p teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.
Petry, Heidi; Suter-Riederer, Susanne; Kerker-Specker, Carmen; Imhof, Lorenz
Patient centred and individually-tailored counselling of older people with a chronic condition who live at home is a useful intervention to support their independence. The paper presents the development and psychometric testing of the APN-BQ Instrument, to measure patient-centeredness. To measure the quality of an in-home counselling intervention, a 23-item questionnaire was developed and tested with 206 people 80 years and older. Principal component analysis with Varimax Rotation was conducted (n = 206). Analysis revealed a four factor (fs = 0.91) model scoring in 19 items. All factors loaded > 0.45. Cronbach's alpha was 0.86. The utility and acceptance of the instrument was confirmed by the high response rate (100 %) and the fact that participants answered 98.8 % of all questions. The APN-BQ has shown to be a reliable Instrument with good content and construct validity. It is a tool for APNs to measure structure, process, and outcome quality of a patient-centred and individually-tailored counselling program, including the degree of patient participation, and patient empowerment.
Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C
The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.
Noorbakhsh, Simasadat; Shams, Jamal; Faghihimohamadi, Mohamadmahdi; Zahiroddin, Hanieh; Hallgren, Mats; Kallmen, Hakan
Iran is a developing and Islamic country where the consumption of alcoholic beverages is banned. However, psychiatric disorders and alcohol use disorders are often co-occurring. We used the Alcohol Use Disorders Identification Test (AUDIT) to estimate the prevalence of alcohol use and examined the psychometric properties of the test among psychiatric outpatients in Teheran, Iran. AUDIT was completed by 846 consecutive (sequential) patients. Descriptive statistics, internal consistency (Cronbach alpha), confirmatory and exploratory factor analyses were used to analyze the prevalence of alcohol use, reliability and construct validity. 12% of men and 1% of women were hazardous alcohol consumers. Internal reliability of the Iranian version of AUDIT was excellent. Confirmatory factor analyses showed that the construct validity and the fit of previous factor structures (1, 2 and 3 factors) to data were not good and seemingly contradicted results from the explorative principal axis factoring, which showed that a 1-factor solution explained 77% of the co-variances. We could not reproduce the suggested factor structure of AUDIT, probably due to the skewed distribution of alcohol consumption. Only 19% of men and 3% of women scored above 0 on AUDIT. This could be explained by the fact that alcohol is illegal in Iran. In conclusion the AUDIT exhibited good internal reliability when used as a single scale. The prevalence estimates according to AUDIT were somewhat higher among psychiatric patients compared to what was reported by WHO regarding the general population.
Full Text Available Ecological validity should be the cornerstone of any assessment of cognitive functioning. For this purpose, we have developed a preliminary study to test the Art Gallery Test (AGT as an alternative to traditional neuropsychological testing. The AGT involves three visual search subtests displayed in a virtual reality (VR art gallery, designed to assess visual attention within an ecologically valid setting. To evaluate the relation between AGT and standard neuropsychological assessment scales, data were collected on a normative sample of healthy adults (n = 30. The measures consisted of concurrent paper-and-pencil neuropsychological measures [Montreal Cognitive Assessment (MoCA, Frontal Assessment Battery (FAB, and Color Trails Test (CTT] along with the outcomes from the three subtests of the AGT. The results showed significant correlations between the AGT subtests describing different visual search exercises strategies with global and specific cognitive measures. Comparative visual search was associated with attention and cognitive flexibility (CTT; whereas visual searches involving pictograms correlated with global cognitive function (MoCA.
Gray, Sarah A; Deem, Lisa P; Straja, Sorin R
The purpose of this research was to determine the predictive value of the Dental Admission Test (DAT) for clinical success using Ackerman's theory of ability determinants of skilled performance. The Ackerman theory is a valid, reliable schema in the applied psychology literature used to predict complex skill acquisition. Inconsistent stimulus-response skill acquisition depends primarily on determinants of cognitive ability. Consistent information-processing tasks have been described as "automatic," in which stimuli and responses are mapped in a manner that allows for complete certainty once the relationships have been learned. It is theorized that the skills necessary for success in the clinical component of dental schools involve a significant amount of automatic processing demands and, as such, student performance in the clinics should begin to converge as task practice is realized and tasks become more consistent. Subtest scores of the DAT of four classes were correlated with final grades in nine clinical courses. Results showed that the DAT subtest scores played virtually no role with regard to the final clinical grades. Based on this information, the DAT scores were determined to be of no predictive value in clinical achievement.
Ros, Laura; Romero, Dulce; Ricarte, Jorge J; Serrano, Juan P; Nieto, Marta; Latorre, Jose M
The Autobiographical Memory Test (AMT) is the most widely used measure of overgeneral autobiographical memory (OGM). The AMT appears to have good psychometric properties, but more research is needed on the influence and applicability of individual cue words in different languages and populations. To date, no studies have evaluated its usefulness as a measure of OMG in Spanish or older populations. This work aims to analyze the applicability of the AMT in young and older Spanish samples. We administered a Spanish version of the AMT to samples of young (N = 520) and older adults (N = 155). We conducted confirmatory factor analysis (CFA), item response theory-based analysis (IRT) and differential item functioning (DIF). Results confirm the one-factor structure for the AMT. IRT analysis suggests that both groups find the AMT easy given that they generally perform well, and that it is more precise in individuals who score low on memory specificity. DIF analysis finds three items differ in their functioning depending on age group. This differential functioning of these items affects the overall AMT scores and, thus, they should be excluded from the AMT in studies comparing young and older samples. We discuss the possible implications of the samples and cue words used.
García-Alandete, Joaquín; Ros, Montserrat Cañabate; Salvador, José Heliodoro Marco; Rodríguez, Sandra Pérez
The aim of this study was to analyze the psychometric properties of the Purpose-In-Life Test (PIL), as well as the age-related differences in meaning in life in women diagnosed with eating disorders. Participants were 250 Spanish women diagnosed with eating disorders who ranged from 12 to 60 years old. Confirmatory Factor Analysis, descriptive analyses, estimation of the internal consistency of the PIL, correlations between the PIL and the Beck Hopelessness Scale (BHS), Overweight Preoccupation Scale (OPS), and Body Investment Scale (BIS), and age differences were calculated. A 19-item model that showed a good fit and internal consistency, a negative correlation between the PIL and both the BHS and OPS, and a positive correlation with the BIS, as well as significant differences between the adolescents and the mature adults, were found. It would be advisable to increase the inclusion of meaning in life in psychotherapeutic interventions with women diagnosed with eating disorders. Copyright © 2017 Elsevier B.V. All rights reserved.
Hammer, Joseph H; Brenner, Rachel E
This study extended our theoretical and applied understanding of gratitude through a psychometric examination of the most popular multidimensional measure of gratitude, the Gratitude, Resentment, and Appreciation Test-Revised Short form (GRAT-RS). Namely, the dimensionality of the GRAT-RS, the model-based reliability of the GRAT-RS total score and 3 subscale scores, and the incremental evidence of validity for its latent factors were assessed. Dimensionality measures (e.g., explained common variance) and confirmatory factor analysis results with 426 community adults indicated that the GRAT-RS conformed to a multidimensional (bifactor) structure. Model-based reliability measures (e.g., omega hierarchical) provided support for the future use of the Lack of a Sense of Deprivation raw subscale score, but not for the raw GRAT-RS total score, Simple Appreciation subscale score, or Appreciation of Others subscale score. Structural equation modeling results indicated that only the general gratitude factor and the lack of a sense of deprivation specific factor accounted for significant variance in life satisfaction, positive affect, and distress. These findings support the 3 pillars of gratitude conceptualization of gratitude over competing conceptualizations, the position that the specific forms of gratitude are theoretically distinct, and the argument that appreciation is distinct from the superordinate construct of gratitude.
Kao, Mei-Hua; Tsai, Yun-Fang
Self-management of osteoarthritis (OA) of the knee is important for treating this chronic disease. This study developed and psychometrically tested a new instrument for measuring adult patients' self-management needs of knee osteoarthritis (SMNKOA). The theoretical framework of self-care guided the development of the 35-item SMNKOA scale. Participants ( N = 372) were purposively sampled from orthopedic clinics at medical centers in Taiwan. The content validity index was 0.83. Principal components analysis identified a three-factor solution, accounting for 53.19% of the variance. The divergent validity was -0.67; convergent validity was -0.51. Cronbach's alpha was .95, Pearson's correlation coefficient was .88, and the intraclass correlation coefficient was .95. The scale's reliability and validity supports the SMNKOA, as a tool to measure self-management needs of adults with knee OA. Nurses and other health care providers can use this instrument to evaluate knee OA patients and identify strategies for improving health-related outcomes and patient education.
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
MERSCH, PPA; BREUKERS, P; EMMELKAMP, PMG
The Simulated Social Interaction Test (SSIT) was translated and adjusted for use on a population of Dutch males and females. Seventy-four social phobic patients were assessed with the SSIT, a conversation test, and an interview with an independent observer. Results show that the SSIT is a relatively
Carter, Amanda G; Creedy, Debra K; Sidebotham, Mary
develop and test a tool designed for use by preceptors/mentors to assess undergraduate midwifery students׳ critical thinking in practice. a descriptive cohort design was used. participants worked in a range of maternity settings in Queensland, Australia. 106 midwifery clinicians who had acted in the role of preceptor for undergraduate midwifery students. this study followed a staged model for tool development recommended by DeVellis (2012). This included generation of items, content validity testing through mapping of draft items to critical thinking concepts and expert review, administration of items to a convenience sample of preceptors, and psychometric testing. A 24 item tool titled the XXXX Assessment of Critical Thinking in Midwifery (CACTiM) was completed by registered midwives in relation to students they had recently preceptored in the clinical environment. ratings by experts revealed a content validity index score of 0.97, representing good content validity. An evaluation of construct validity through factor analysis generated three factors: 'partnership in practice', 'reflection on practice' and 'practice improvements'. The scale demonstrated good internal reliability with a Cronbach alpha coefficient of 0.97. The mean total score for the CACTiM scale was 116.77 (SD=16.68) with a range of 60-144. Total and subscale scores correlated significantly. the CACTiM (Preceptor/Mentor version) was found to be a valid and reliable tool for use by preceptors to assess critical thinking in undergraduate midwifery students. given the importance of critical thinking skills for midwifery practice, mapping and assessing critical thinking development in students׳ practice across an undergraduate programme is vital. The CACTiM (Preceptor/Mentor version) has utility for clinical education, research and practice. The tool can inform and guide preceptors׳ assessment of students׳ critical thinking in practice. The availability of a reliable and valid tool can be used to
Full Text Available The purpose of this article is to present the content, conceptual structure and methodological steps of the latest revision of the Peabody Picture Vocabulary Test (PPVT-III, which is a highly functional and valuable vocabulary test that has been in use since 1959 in different language and cultural surroundings. On the case of the PPVT-III we are presenting the procedure of development and standardization of such vocabulary tests as well as its translation and adaptation from one language and cultural milieu to another. We also note the practical use of the PPVT-III for research purposes. In Slovenian language no vocabulary tests were developed or adapted so far; PPVT-III is presented in this context, too.
Petersen, Morten Aa; Giesinger, Johannes M; Holzner, Bernhard
Fatigue is one of the most common symptoms associated with cancer and its treatment. To obtain a more precise and flexible measure of fatigue, the EORTC Quality of Life Group has developed a computerized adaptive test (CAT) measure of fatigue. This is part of an ongoing project developing a CAT...
Lin, Shu-Hui; Huang, Yun-Chen
The purpose of this study was to validate a Chinese version of the Gratitude Resentment and Appreciation Test (GRAT) with Taiwanese students. In Study 1, a total of 2511 Taiwanese students participated and completed the translated GRAT. Exploratory factor analysis, confirmatory factor analysis and reliability analysis were undertaken to assess the…
Buck, Harleah G; Harkness, Karen; Ali, Muhammad Usman; Carroll, Sandra L; Kryworuchko, Jennifer; McGillion, Michael
Caregivers (CGs) contribute important assistance with heart failure (HF) self-care, including daily maintenance, symptom monitoring, and management. Until CGs' contributions to self-care can be quantified, it is impossible to characterize it, account for its impact on patient outcomes, or perform meaningful cost analyses. The purpose of this study was to conduct psychometric testing and item reduction on the recently developed 34-item Caregiver Contribution to Heart Failure Self-care (CACHS) instrument using classical and item response theory methods. Fifty CGs (mean age 63 years ±12.84; 70% female) recruited from a HF clinic completed the CACHS in 2014 and results evaluated using classical test theory and item response theory. Items would be deleted for low (.95) endorsement, low (.7) corrected item-total correlations, significant pairwise correlation coefficients, floor or ceiling effects, relatively low latent trait and item information function levels ( .5), and differential item functioning. After analysis, 14 items were excluded, resulting in a 20-item instrument (self-care maintenance eight items; monitoring seven items; and management five items). Most items demonstrated moderate to high discrimination (median 2.13, minimum .77, maximum 5.05), and appropriate item difficulty (-2.7 to 1.4). Internal consistency reliability was excellent (Cronbach α = .94, average inter-item correlation = .41) with no ceiling effects. The newly developed 20-item version of the CACHS is supported by rigorous instrument development and represents a novel instrument to measure CGs' contribution to HF self-care. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Khazaee-Pool, Maryam; Majlessi, Fereshteh; Montazeri, Ali; Pashaei, Tahereh; Gholami, Ali; Ponnet, Koen
Breast cancer preventive behaviors have an extreme effect on women's health. Despite the benefits of preventive behaviors regarding breast cancer, they have not been implemented as routine care for healthy women. To assess this health issue, a reliable and valid scale is needed. The aim of the present study is to develop and examine the psychometric properties of a new scale, called the ASSISTS, in order to identify factors that affect women's breast cancer prevention behaviors. A multi-phase instrument development method was performed to develop the questionnaire from February 2012 to September 2014. The item pool was generated based on secondary analyses of previous qualitative data. Then, content and face validity were applied to provide a pre-final version of the scale. The scale validation was conducted with a sample of women recruited from health centers affiliated with Tehran University of Medical Sciences. The construct validity (both exploratory and confirmatory), convergent validity, discriminate validity, internal consistency reliability and test-retest analysis of the questionnaire were tested. Fifty-eight items were initially extracted from the secondary analysis of previous qualitative data. After content validity, this was reduced to 49 items. The exploratory factor analysis revealed seven factors (Attitude, supportive systems, self-efficacy, information seeking, stress management, stimulant and self-care) containing 33 items that jointly accounted for 60.62 % of the observed variance. The confirmatory factor analysis showed a model with appropriate fitness for the data. The Cronbach's alpha coefficient for the subscales ranged from 0.68 to 0.85, and the Intraclass Correlation Coefficient (ICC) ranged from 0.71 to 0.98; which is well above the acceptable thresholds. The findings showed that the designed questionnaire was a valid and reliable instrument for assessing factors affecting women's breast cancer prevention behaviors that can be used both
Gerhardt, Andreas; Eich, Wolfgang; Janke, Susanne; Leisner, Sabine; Treede, Rolf-Detlef; Tesarz, Jonas
Whether chronic localized pain (CLP) and chronic widespread pain (CWP) have different mechanisms or to what extent they overlap in their pathophysiology is controversial. The study compared quantitative sensory testing profiles of nonspecific chronic back pain patients with CLP (n=48) and CWP (n=29) with and fibromyalgia syndrome (FMS) patients (n=90) and pain-free controls (n = 40). The quantitative sensory testing protocol of the "German-Research-Network-on-Neuropathic-Pain" was used to measure evoked pain on the painful area in the lower back and the pain-free hand (thermal and mechanical detection and pain thresholds, vibration threshold, pain sensitivity to sharp and blunt mechanical stimuli). Ongoing pain and psychometrics were captured with pain drawings and questionnaires. CLP patients did not differ from pain-free controls, except for lower pressure pain threshold (PPT) on the back. CWP and FMS patients showed lower heat pain threshold and higher wind-up ratio on the back and lower heat pain threshold and cold pain threshold on the hand. FMS showed lower PPT on back and hand, and higher comorbidity of anxiety and depression and more functional impairment than all other groups. Even after long duration CLP presents with a local hypersensitivity for PPT, suggesting a somatotopically specific sensitization of nociceptive processing. However, CWP patients show widespread ongoing pain and hyperalgesia for different stimuli that is generalized in space, suggesting the involvement of descending control systems, as also suggested for FMS patients. Because mechanisms in nonspecific chronic back pain with CLP and CWP differ, these patients should be distinguished in future research and allocated to different treatments.
Lunneborg, Clifford E.
A decision-making or utility theory model (which deals effectively with affirmative action goals and allows standardized tests to be placed in the service of those goals) is discussed as an alternative to traditional predictive admissions. (Author/PN)
Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J. B.
on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). METHODS: In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients...... model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study...... sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. CONCLUSION: A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient...
Meng, Michael; Peter, Daniel; Mattner, Frauke; Igel, Christoph; Kugler, Christiane
Satisfaction with continuing education can be defined as positive attitudes towards educational programs, which has potential to strengthen learning outcomes. A multi-dimensional construct may enhance continuing education program evaluation processes. The objective is to describe the development and psychometric testing of the 'affective - behavioral - cognitive - satisfaction questionnaire' (ABC-SAT) for assessing participants' satisfaction with a continuing education program for nurses in infection control. The multi-staged development of a satisfaction questionnaire comprised of three subscales. The pilot tool was administered to a nationwide sample of 126 infection control nurses to assess satisfaction after participating in a continuing education program. Satisfaction scores were calculated and psychometric testing was performed to determine reliability, using Cronbach's alpha, face validity, objectivity, and economy. A principle component analysis using varimax rotation and Kaiser normalization was performed. The analysis led to a three-factor solution of the questionnaire with 11 items, explaining 61.4% of the variance. Internal consistency of three scales using Cronbach's alpha was 0.83, 0.60, and 0.66, respectively. Selectivity coefficients varied between 0.39 and 0.70. Participants needed approximately three minutes to complete the questionnaire. Initial findings refer to a satisfying scale structure and internal consistency of the 3-dimensional ABC-SAT questionnaire. Further research is required to confirm the questionnaires' psychometric properties. Copyright © 2018 Elsevier Ltd. All rights reserved.
Liu, Pei-Ching; Gau, Bih-Shya; Hung, Chao-Chia
BACKGROUNG: No specific instrument has thus far been developed for measuring the caregiver burden perceived by parents of children with allergies (CWA). To determine the psychometric properties of the Caregiver Burden Index (CBI). A mixed-methods design was adopted to evaluate the psychometric properties of the scale. The content validity index was 0.89, and the internal consistency was high with a coefficient alpha of 0.98. Three factors were extracted after exploratory factor analysis. The study findings suggest that the CBI has sufficient reliability and validity to evaluate the caregiver burden of parents of CWA. Copyright © 2015 Elsevier Inc. All rights reserved.
Miller, M; Hamilton, J; Scupham, R; Matwiejczyk, L; Prichard, I; Farrer, O; Yaxley, A
Food service staff are integral to delivery of quality food in aged care homes yet measurement of their satisfaction is unable to be performed due to an absence of a valid and reliable questionnaire. The aim of this study was to develop and perform psychometric testing for a new Food Service Satisfaction Questionnaire developed in Australia specifically for use by food service staff working in residential aged care homes (Flinders FSSQFSAC). A mixed methods design utilizing both a qualitative (in-depth interviews, focus groups) and a quantitative approach (cross sectional survey) was used. Content validity was determined from focus groups and interviews with food service staff currently working in aged care homes, related questionnaires from the literature and consultation with an expert panel. The questionnaire was tested for construct validity and internal consistency using data from food service staff currently working in aged care homes that responded to an electronic invitation circulated to Australian aged care homes using a national database of email addresses. Construct validity was tested via principle components analysis and internal consistency through Cronbach's alpha. Temporal stability of the questionnaire was determined from food service staff undertaking the Flinders FSSQFSAC on two occasions, two weeks apart, and analysed using Pearson's correlations. Content validity for the Flinders FSSQFSAC was established from a panel of experts and stakeholders. Principle components analysis revealed food service staff satisfaction was represented by 61-items divided into eight domains: job satisfaction (α=0.832), food quality (α=0.871), staff training (α=0.922), consultation (α=0.840), eating environment (α=0.777), reliability (α=0.695), family expectations (α=0.781) and resident relationships (α=0.429), establishing construct validity in all domains, and internal consistency in all (α>0.5) except for "resident relationships" (α=0.429). Test
López, Mariana Beatriz; Lichtenberger, Aldana; Conde, Karina; Cremonte, Mariana
Background Considering the physical, mental and behavioral problems related to fetal alcohol exposure, prenatal clinical guides suggest a brief evaluation of alcohol consumption during pregnancy to detect alcohol intake and to adjust interventions, if required. Even if any alcohol use should be considered risky during pregnancy, identifying women with alcohol use disorders is important because they could need a more specific intervention than simple advice to abstain. Most screening tests have been developed and validated in male populations and focused on the long-term consequences of heavy alcohol use, so they might be inappropriate to assess consumption in pregnant women. Objective To analyze the internal reliability and validity of the alcohol screening instruments Alcohol Use Disorders Identification Test (AUDIT), Alcohol Use Disorders Identification Test - Consumption (AUDIT-C), Tolerance, Worried, Eye-Opener, Amnesia and Cut-Down (TWEAK), Rapid Alcohol Problems Screen - Quantity Frequency (RAPS-QF) and Tolerance, Annoyed, Cut-Down and Eye-Opener (T-ACE) to identify alcohol use disorders in pregnant women. Methods A total of 641 puerperal women were personally interviewed during the 48 hours after delivery. The receiver operating characteristics (ROC) curves and the sensitivity and specificity of each instrument using different cut-off points were analyzed. Results All instruments showed areas under the ROC curves above 0.80. Larger areas were found for the TWEAK and the AUDIT. The TWEAK, the T-ACE and the AUDIT-C showed higher sensitivity, while the AUDIT and the RAPS-QF showed higher specificity. Reliability (internal consistency) was low for all instruments, improving when optimal cut-off points were used, especially for the AUDIT, the AUDIT-C and the RAPS-QF. Conclusions In other cultural contexts, studies have concluded that T-ACE and TWEAK are the best instruments to assess pregnant women. In contrast, our results evidenced the low
Sun, Fan-Ko; Chiang, Chun-Ying; Lu, Chu-Yun; Yu, Pei-Jane; Liao, Tzu-Chiao; Lan, Chu-Mei
To develop the Health of Body, Mind and Spirit Scale (HBMSS), which was designed to assess drug abusers' health condition. Helping drug abusers to become healthy is important to healthcare professionals. However, no instrument exists to assess drug abusers' state of health. A cross-sectional questionnaire survey was implemented to examine the validity of the HBMSS. Data were collected from 2015-2016 at one drug abuse prevention centre in Taiwan. Participants (N = 320) who had abused drugs were invited to complete a preliminary 64-item version of the HBMSS. An item analysis, criterion-related validity analysis (using the Relapse Prediction Scale [RPS] score), split-half reliability testing and confirmatory factor analysis (CFA) were conducted to examine the psychometric properties of the HBMSS. The final version of the HBMSS contained 15 items that were divided into three subscales: the health of the body, mind and spirit. Cronbach's α and split-half reliability coefficients were all above .85. The factor loading of each item was between .74-.95. The HBMSS had satisfactory criterion-related validity with the RPS score (r = -.50, p < .001). A second-order CFA was conducted on the HBMSS. The fit indexes were good, χ 2 = 184.060, df = 94, χ 2 /df = 1.958 (p = .000). The entire HBMSS and the subscales had satisfactory reliability and validity. Healthcare professionals could use the HBMSS to evaluate the condition of the health of individuals with a drug abuse history. © 2017 John Wiley & Sons Ltd.
Lorenzo, Pelizza; Silvia, Azzali; Federica, Paterlini; Sara, Garlassi; Ilaria, Scazza; Pupo, Simona; Andrea, Raballo
Among current early screeners for psychosis-risk states, the Prodromal Questionnaire-16 items (PQ-16) is often used. We aimed to assess validity and reliability of the Italian version of the PQ-16 in a young adult help-seeking population. We included 154 individuals aged 18-35years seeking help at the Reggio Emilia outpatient mental health services in a large semirural catchment area (550.000 inhabitants). Participants completed the Italian version of the PQ-16 (iPQ-16) and were subsequently evaluated with the Comprehensive Assessment of At-Risk Mental States (CAARMS). We examined diagnostic accuracy (i.e. specificity, sensitivity, negative and positive likelihood ratios, and negative and positive predictive values) and content, convergent, and concurrent validity between PQ-16 and CAARMS using Cronbach's alpha, Spearman's rho, and Cohen's kappa, respectively. We also tested the validity of the adopted PQ-16 cut-offs through Receiver Operating Characteristic (ROC) curves plotted against CAARMS diagnoses and the 1-year predictive validity of the PQ-16. The iPQ-16 showed high internal consistency and acceptable diagnostic accuracy and concurrent validity. ROC analyses pointed to a cut-off score of ≥5 as best cut-off. After 12months of follow-up, 8.7% of participants with a PQ-16 symptom total score of ≥5 who were below the CAARMS psychosis threshold at the baseline, developed a psychotic disorder. Psychometric properties of the iPQ-16 were satisfactory. Copyright © 2018. Published by Elsevier B.V.
Petróczi, Andrea; Backhouse, Susan H.; Barkoukis, Vassilis
research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. At the same time, we must acknowledge that socially desirable responding...... guidance aims to protect the global athletic community against social, ethical and legal consequences from potential misuse of psychological tests, including applications as forensic diagnostic tools in both practice and research.......One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping...
Rezaul Karim, A K M; Nigar, Naima
There is growing importance of the Internet Addiction Test (IAT) in Internet addiction research around the world. Since the development of the IAT (Young, 1996, 1998), a number of validation studies have been done in various cultures. The aim of this study was to translate the instrument into Bangla and validate in Bangladeshi culture, a culture vulnerable to Internet addiction. A total of 177 Internet users (77 females and 100 males) participated in the study. Exploratory factor analysis (EFA) of the data from 172 participants (who provided complete responses) identified a four factor structure of the IAT with 18 items. The four factors namely 'Neglect of duty', 'Online dependence', 'Virtual fantasies', and 'Privacy and self-defense' together explained 55.68% of the total variance. Problematic (moderate/excessive) users on the IAT scored, on average, higher on each of the four IAT factors as compared to average or non-problematic (minimal) users consistently across genders. The IAT and its factors showed good internal consistency (Cronbach's α=.89 for the IAT, and .60-.84 for the factors), strong convergent and discriminant validity. Thus, the Bangla version IAT appears to be valid and reliable and therefore may be used in further research on Internet addiction in the country. Copyright © 2013 Elsevier B.V. All rights reserved.
Jardin, Charles; Garey, Lorra; Zvolensky, Michael J
Sexual motives refer to functions served by sexual behavior. The Sex Motivations Scale (SMS) has frequently been used to assess sexual motives. At its development, the SMS demonstrated good internal consistency; convergent, divergent, and criterion validity; and configural invariance across sex, age, and Caucasians and African Americans. Yet the metric and scalar invariance of the SMS has not been examined, nor has the measurement invariance of the SMS across Hispanic and Asian Americans, sexual minority status, and relationship status been tested. The criterion validity of the SMS also has yet to be examined for nonintercourse sexual behaviors, such as sexting. The present study aimed to address these gaps in a diverse sample of 2,201 college students (77.60% female; M age = 22.06; 27.84% Caucasian). Results further affirmed the configural, metric, and scalar invariance of the SMS. The convergent and divergent validity of the SMS was supported in relation to positive and negative affect and attachment patterns; and specific SMS subscales demonstrated associations with sexual intercourse behaviors and sexting, supporting the criterion validity of the SMS. These findings suggest the relevance of the SMS in assessing sexual motives across diverse populations and behaviors.
Wicherts, J.M.; Dolan, C.V.; Carlson, J.S.; van der Maas, H.L.J.
This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's
Kornelsen, Jude; Stoll, Kathrin; Grzybowski, Stefan
Rural pregnant woman who lack local access to maternity care due to their remote living circumstances may experience stress and anxiety related to pregnancy and parturition. The Rural Pregnancy Experience Scale (RPES) was designed to assess the unique worry and concerns reflective of the stress and anxiety of rural pregnant women related to pregnancy and parturition. The items of the scale were designed based on the results of a qualitative study of the experiences of pregnant rural women, thereby building a priori content validity into the measure. The relevancy content validity index (CVI) for this instrument was 1.0 and the clarity CVI was .91, as rated by maternity care specialists. A field test of the RPES with 187 pregnant rural women from British Columbia indicated that it had two factors: financial worries and worries/concerns about maternity care services, which were consistent with the conceptual base of the tool. Cronbach's alpha for the total RPES was .91; for the financial worries subscale and the worries/concerns about maternity care services subscale, alpha were .89 and .88, respectively. Construct validity was supported by significant correlations between the total scores of the RPES and the Depression Anxiety Stress Scales (DASS [r =.39, p DASS supporting convergent validity (correlations ranged between .20; p < .05 and .43; p < .01). Construct validity was also supported by findings that the level of access and availability of maternity care services were significantly associated with RPES scores. It was concluded that the RPES is a reliable and valid measure of worries and concerns reflective of rural pregnant women's stress and anxiety related to pregnancy and parturition.
Schmidt, Jesper Hvass; Brandt, Christian; Christensen-Dalsgaard, Jakob
Background: With a newly developed technique, hearing thresholds can be estimated with a system operated by the test persons themselves. This technique is based on the 2 Alternative Forced Choice paradigm known from the psychoacoustic research theory. Test persons can operate the system very......-likelihood and up-down methods has proven effective and reliable even under suboptimal test settings. In non-optimal testing conditions i.e. as a part of a hearing conservation programme the headphone Sennheiser HDA-200 has been used as it contains hearing protection. This test-method has been validated......-retest studies of 2AFC audiometry are comparable to test-retest results known from traditional audiometry under standard clinical settings. Conclusions 2 Alternative Forced Choice audiometry can be a reliable alternative to traditional audiometry especially under certain circumstances, where it can...
Full Text Available Abstract Background There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. Methods and design The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU. Participants will be: (a tutors of Family Medicine residents (expert group; (b Family Medicine residents in their second year of the Family Medicine training program (novice group, and (c Family Medicine physicians (intermediate group. The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. Discussion The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Argimon-Pallàs, Josep M; Flores-Mateo, Gemma; Jiménez-Villa, Josep; Pujol-Ribera, Enriqueta; Foz, Gonçal; Bundó-Vidiella, Magda; Juncosa, Sebastià; Fuentes-Bellido, Cruz M; Pérez-Rodríguez, Belén; Margalef-Pallarès, Francesc; Villafafila-Ferrero, Rosa; Forès-Garcia, Dolors; Roman-Martínez, Josep; Vilert-Garroga, Esther
There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP) curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU). Participants will be: (a) tutors of Family Medicine residents (expert group); (b) Family Medicine residents in their second year of the Family Medicine training program (novice group), and (c) Family Medicine physicians (intermediate group). The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Liaw, Sok Ying; Rashasegaran, Ahtherai; Wong, Lai Fun; Deneen, Christopher Charles; Cooper, Simon; Levett-Jones, Tracy; Goh, Hongli Sam; Ignacio, Jeanette
The development of clinical reasoning skills in recognising and responding to clinical deterioration is essential in pre-registration nursing education. Simulation has been increasingly used by educators to develop this skill. To develop and evaluate the psychometric properties of a Clinical Reasoning Evaluation Simulation Tool (CREST) for measuring clinical reasoning skills in recognising and responding to clinical deterioration in a simulated environment. A scale development with psychometric testing and mixed methods study. Nursing students and academic staff were recruited at a university. A three-phase prospective study was conducted. Phase 1 involved the development and content validation of the CREST; Phase 2 included the psychometric testing of the tool with 15 second-year and 15 third-year nursing students who undertook the simulation-based assessment; Phase 3 involved the usability testing of the tool with nine academic staff through a survey questionnaire and focus group discussion. A 10-item CREST was developed based on a model of clinical reasoning. A content validity of 0.93 was obtained from the validation of 15 international experts. The construct validity was supported as the third-year students demonstrated significantly higher (preasoning scores than the second-year students. The concurrent validity was also supported with significant positive correlations between global rating scores and almost all subscale scores, and the total scores. The predictive validity was supported with an existing tool. The internal consistency was high with a Cronbach's alpha of 0.92. A high inter-rater reliability was demonstrated with an intraclass correlation coefficient of 0.88. The usability of the tool was rated positively by the nurse educators but the need to ease the scoring process was highlighted. A valid and reliable tool was developed to measure the effectiveness of simulation in developing clinical reasoning skills for recognising and responding to
Chiwaridzo, Matthew; Ferguson, Gillian D; Smits-Engelsman, Bouwien C M
Scientific focus on rugby has increased over the recent years, providing evidence of the physical or physiological characteristics and game-specific skills needed in the sport. Identification of tests commonly used to measure these characteristics is important for the development of test batteries, which in turn may be used for talent identification and injury prevention programmes. Although there are a number of tests available in the literature to measure physical or physiological variables and game-specific skills, there is limited information available on the psychometric properties of the tests. Therefore, the purpose of this study is to systematically review the literature for tests commonly used in rugby to measure physical or physiological characteristics and rugby-specific skills, documenting evidence of reliability and validity of the identified tests. A systematic review will be conducted. Electronic databases such as Scopus, MEDLINE via EBSCOhost and PubMed, Academic Search Premier, CINAHL and Africa-Wide Information via EBSCOhost will be searched for original research articles published in English from January 1, 1995, to December 31, 2015, using a pre-defined search strategy. The principal investigator will select potentially relevant articles from titles and abstracts. To minimise bias, full text of titles and abstracts deemed potentially relevant will be retrieved and reviewed by two independent reviewers based on the inclusion criteria. Data extraction will be conducted by the principal investigator and verified by two independent reviewers. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist will be used to assess the methodological quality of the selected studies. Choosing an appropriate test to be included in the screening test battery should be based on sound psychometric properties of the test available. This systematic review will provide an overview of the tests commonly used in rugby union
Izilda Carolina de Meneses-Gaya
Full Text Available OBJECTIVE: The Fagerström Test for Nicotine Dependence (FTND is a screening instrument for physical nicotine dependence and is extensively used in various countries. The objective of the present report was to review articles related to the psychometric properties of the FTND. METHODS: A systematic search for articles published up through December of 2007 was carried out in various electronic databases. The following search terms were used: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; and "factors". We included articles published in English, Spanish or Portuguese and in which the psychometric properties of the FTND were evaluated. RESULTS: Twenty-six studies related to the psychometric properties of the FTND were identified in the indexed literature. Analysis of the studies confirmed the reliability of the FTND for the assessment of nicotine dependence in different settings and populations. CONCLUSIONS: Further validation studies using previously validated instruments as a comparative measure are needed before the extensive use of the FTND can be justified on the basis of its psychometric qualities.OBJETIVO: O Fagerström Test for Nicotine Dependence (FTND, Teste de Fagerström para Dependência de Nicotina é um instrumento de rastreamento para dependência física de tabaco, amplamente utilizado em diversos países. Objetivou-se realizar uma revisão de artigos relacionados às propriedades psicométricas do FTND. MÉTODOS: Uma busca sistemática foi realizada usando-se vários indexadores eletrônicos até dezembro de 2007, com os seguintes descritores: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; e "factors". Foram incluídos os artigos relacionados à avaliação das propriedades psicométricas do FTND publicados em inglês, espanhol e português. RESULTADOS: Vinte e seis estudos relativos às propriedades psicom
Huang, Yun-Hsin; Wu, Chih-Hsun; Chen, Hsiu-Jung; Cheng, Yih-Ru; Hung, Fu-Chien; Leung, Kai-Kuan; Lue, Bee-Horng; Chen, Ching-Yu; Chiu, Tai-Yuan; Wu, Yin-Chang
Severe negative emotional reactions to chronic illness are maladaptive to patients and they need to be addressed in a primary care setting. The psychometric properties of a quick screening tool-the Negative Emotions due to Chronic Illness Screening Test (NECIS)-for general emotional problems among patients with chronic illness being treated in a primary care setting was investigated. Three studies including 375 of patients with chronic illness were used to assess and analyze internal consistency, test-retest reliability, criterion-related validity, a cut-off point for distinguishing maladaptive emotions and clinical application validity of NECIS. Self-report questionnaires were used. Internal consistency (Cronbach's α) ranged from 0.78 to 0.82, and the test-retest reliability was 0.71 (P analysis reference, the receiver-operating characteristic curve analysis revealed an area under the curve of 0.81 and 0.82 (ps emotions, with a sensitivity and specificity of 83.3 and 69.0%, and 68.5 and 83.0%, respectively. The clinical application validity analysis revealed that low NECIS group showed significantly better adaptation to chronic illness on the scales of subjective health, general satisfaction with life, self-efficacy of self-care for disease, illness perception and stressors in everyday life. The NECIS has satisfactory psychometric properties for use in the primary care setting. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: firstname.lastname@example.org.
Kang, Qing; Chan, Raymond C K; Li, Xiaoping; Arcelus, Jon; Yue, Ling; Huang, Jiabin; Gu, Lian; Fan, Qing; Zhang, Haiyin; Xiao, Zeping; Chen, Jue
The study aimed to investigate the reliability and validity of the Chinese version of the eating attitudes test (EAT-26) among female adolescents and young adults in Mainland China. This scale was administered to 396 female eating disorder patients and 406 noneating disorder healthy controls, in addition 35 healthy controls completed a retest after a 4-week intervals. Tests for reliability, convergent validity and receiver operating characteristic analysis were performed to detect the psychometric properties. The EAT-26 demonstrated good internal consistency (Cronbach's alpha = 0.822-0.922), test-retest reliability (interclass correlation coefficient = 0.817) and convergent validity(r = 0.450-0.750). The receiver operating characteristic analysis showed that the cut-off 14 for anorexia nervosa and 15 for bulimia nervosa represented good compromises with approximate sensitivity (0.66-0.68) and specificity (0.85-0.86). Our findings provided evidence that the Chinese version of the EAT-26 was a psychometrically reliable and valid self-rating instrument for identifying people suffering from an eating disorder in Mainland China. A clinical cut-off range between 14 and 15 could be used, but caution should be exercised because of the low sensitivity of the tool. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association.
Full Text Available The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum. However, little is known about the test’s psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test’s reliance on one’s vocabulary and evaluate the short-form structure in clinical populations.
Ryabukhin, Yu.S.; Ryabukhin, V.Yu.
Psychometric indicators for mental development of children in towns distinguished by radioactive contamination resulting from the Chernobyl accident are studied. Using some radiological information obtained after the Chernobyl accident, values of expected intelligence quotient (IQ) reduction have been assessed as a result of brain exposure in utero due to various components of dose. Comparing the results of examinations in Novozybkov, Klintsy and Obninsk, no confident evidence has been obtained that radiation exposure of the developing brain exerts influence on indicators for mental development [ru
Kim, Suk Sun; Reed, Pamela G; Kang, Youngmi; Oh, Jina
The purpose of this study was to translate the Spiritual Perspective Scale (SPS) and Self-transcendence Scale (STS) into Korean and test the psychometric properties of the instruments with Korean elders. A cross-sectional survey design was used to implement the three stages of the study. Stage I consisted of translating and reviewing the scales by six experts. In Stage II, equivalence was tested by comparing the responses between the Korean and English versions among 71 bilingual adults. Stage III established the psychometric properties of the Korean versions SPS-K and STS-K among 154 Korean elders. Cronbach's alpha of the SPS-K and the STS-K .97, and .85 respectively with Korean elders. Factor analysis showed that the SPS-K had one factor; the STS-K had four factors with one factor clearly representing self-transcendence as theorized. Both scales showed good reliability and validity for the translated Korean versions. However, continued study of the construct validity of the STS-K is needed. Study findings indicate that the SPS-K and the STS-K could be useful for nurses and geriatric researchers to assess a broadly defined spirituality, and to conduct research on spirituality and health among Korean elders. Use of these scales within a theory-based study may contribute to further knowledge about the role of spirituality in the health and well-being of Korean people facing health crises.
Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J B; Conroy, Thierry; Tomaszewski, Krzysztof A; Young, Teresa; Petersen, Morten Aa
The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients. This evaluation included an assessment of dimensionality, fit to the item response theory (IRT) model, differential item functioning (DIF), and measurement properties. A total of 1030 cancer patients completed the 44 candidate items on CF. Of these, 34 items could be included in a unidimensional IRT model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient assessment of HRQOL of cancer patients, without loss of comparability of results.
Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina
Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the Test of Science-Related Attitudes (TOSRA), such as its construct validity, its discriminant and concurrent validity, and its reliability. The evidence presented suggests that TOSRA, in its Spanish-adapted version, has adequate construct validity regarding its theoretical referents, as well as good indexes of reliability. In addition, it determines the attitudes toward science of secondary school students in Santiago de Chile (n = 664) and analyzes the sex variable as a differentiating factor in such attitudes. The analysis by sex revealed low-relevance gender difference. The results are contrasted with those obtained in English-speaking countries. This TOSRA sample showed good psychometric parameters for measuring and evaluating attitudes toward science, which can be used in classrooms of Spanish-speaking countries or with immigrant populations with limited English proficiency.
Kim, Youngmee; Evangelista, Lorraine S; Phillips, Linda R; Pavlish, Carol; Kopple, Joel D
Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and validity) of the 46-item ESRD-Adherence Questionnaire (ESRD-AQ) in a cohort of patients receiving maintenance HD at dialysis centers in Los Angeles County. The ESRD-AQ is the first self-report instrument to address all components of adherence behaviors of patients with ESRD. The findings support that the instrument is reliable and valid and is easy to administer. Future studies are needed in a larger sample to determine whether additional modifications are needed.
Full Text Available Jean V Rubanick, Medora B Pashmakova, Micah A Bishop, James W Barr Department of Veterinary Small Animal Clinical Sciences, Texas A&M University, College Station, TX, USA Abstract: A hospital-based, prospective cross-sectional study was used to compare kaolin-activated thromboelastography (TEG parameters with traditional coagulation tests in 29 hospitalized dogs. Cases were included if the attending clinician requested coagulation testing. Blood was obtained from each dog and coagulation (prothrombin time, partial thromboplastin time, antithrombin activity, d-dimer concentration, and fibrinogen concentration and TEG analyses were performed. Hematocrit (Hct was also measured. Traditional coagulation results were evaluated for correlation with those from kaolin-activated TEG. Spearman’s correlation was used to calculate correlation coefficients. Fibrinogen was positively correlated with maximum amplitude (Pearson r=0.72, P<0.001 and global clot strength (Pearson r=0.72, P<0.001. There was no correlation between any of the remaining coagulation variables, TEG parameters, or Hct. Results of kaolin-activated TEG and traditional coagulation tests are not interchangeable means of monitoring coagulation derangements in this intensive care unit patient population. Determination of a true outcome measure is necessary to establish TEG’s clinical relevance to veterinary medicine. Keywords: TEG, thromboelastography, coagulation, hemostasis
Kruglanski, A W; Atash, M N; DeGrada, E; Mannetti, L; Pierro, A; Webster, D M
S. L. Neuberg, T. N. Judice, and S. G. West (1997) faulted our work with the Need for Closure Scale (NFCS) on grounds that the NFCS lacks discriminant validity relative to S. L. Neuberg's and J. T. Newsom's (1993) Personal Need for Structure (PNS) Scale and is multidimensional, which, so they claim, renders the use of its total score inadmissible. By contrast, the present authors show that neither of the above assertions is incompatible with the underlying need for closure theory. Relations between NFCS and the PNS are to be expected, as these were designed to operationalize the very same construct (of need for closure). Furthermore, no unidimensionality of the NFCS has been claimed, and none is required to use its total score for testing various theoretically derived predictions. An instrument's ultimate utility hinges on theoretical considerations and empirical evidence rather than on questionable psychometric dogma unrelated to the substantive matters at hand.
Antoniotto Guidobono Cavalchini
Full Text Available Compared to traditional plowing and minimum tillage, the sod seeding technique has been tested in order to evaluate the differences in energy consumption, labor and machinery requirement and CO2 emission reduction. The experiments were conducted on winter cereal seeding in a Po valley farm in October 2011. The tests were carried out as follows: wheat variety seeding, over corn and alfalfa crops, in large plots with three repetitions for each thesis. They included: sod seeding anticipated by round up weeding in the case of the plots over alfalfa; traditional plowing at 35 cm followed by rotary tillage and combined seeding (seeder plus rotary tiller; minimum tillage based on ripping at the same depth (35 cm and combined seeder ( seeder plus rotary tiller. The following farm operations - fertilizer, and other agrochemical distributionshave been the same in all the considered theses. The results, statistically significant (P<0.001 in terms of yields, highlighted slight differences: the best data in the case of the traditional plowing both in the case of wheat crop over corn and alfalfa (84.43 and 6.75 t/ha; slightly lower yields for the sod seeding (6.23 and 79.9 t/ha for corn and alfalfa respectively; lower in the case of minimum tillage (5.87; 79.77 t/ha in the two situations. Huge differences in energy and oil consumption have been recorded: in the case of succession to corn 61.47; 35.31; 4.27 kg oil/ha respectively for, traditional plowing, minimum tillage and sod seeding; in the case of alfalfa 61.2; 50.96; 5.14 kg oil/ha respectively for traditional plowing, minimum tillage and sod seeding. The innovative technique, highlighted huge energy saving with an oil consumption equal to 92% and 89% (P<0.001 of what happens in traditional plowing and minimum tillage. Large differences concern labor and machine productivity. These parameters together with oil consumption and machine size [power (kW and weight (t] lead to even greater differences in
Strout, Tania D
Agitation is a vexing problem frequently observed in emergency department acute psychiatric patients, yet no instruments to measure agitation in this setting and population were found upon review of the literature. Previously developed agitation rating scales are limited by the length of observation they require, their need for participation by the patient, complexity in scoring, and a lack of validity in this setting and population. The purpose of this study was to psychometrically evaluate and refine an observation-based agitation scale for use with emergency department acute psychiatric patients. Using a methodological design, the 21-item Agitation Severity Scale was utilized to assess 270 adult psychiatric patients in the emergency setting in a prospective, observational fashion. Reliability analysis, item analysis, exploratory factor analysis, and validity assessments were completed. The relationship between Agitation Severity Scale scores and scores on the previously established Overt Agitation Severity Scale was evaluated. The instrument was reduced to 17 items representing four factors (Aggressive Behaviors, Interpersonal Behaviors, Involuntary Motor Behaviors, and Physical Stance) that accounted for nearly 70% of observed variance, Cronbach's α = 0.91. Evidence of internal consistency reliability, equivalence reliability, construct validity, and convergent validity was established. Through this study, the 17-item Agitation Severity Scale demonstrated acceptable levels of reliability and validity when used with acute psychiatric patients in the emergency setting. This instrument holds promise as a method of enhancing clinical communication about agitation, evaluating the efficacy of interventions aimed at decreasing agitation, and as a research tool.
Samanta, Ash; Samanta, Jo
An essential component of an action in negligence against a doctor is proof that the doctor failed to provide the required standard of care under the circumstances. Traditionally the standard of care in law has been determined according to the Bolam test. This is based on the principle that a doctor does not breach the legal standard of care, and is therefore not negligent, if the practice is supported by a responsible body of similar professionals. The Bolam principle, however, has been perceived as being excessively reliant upon medical testimony supporting the defendant. The judgment given by the House of Lords in the recent case of Bolitho imposes a requirement that the standard proclaimed must be justified on a logical basis and must have considered the risks and benefits of competing options. The effect of Bolitho is that the court will take a more enquiring stance to test the medical evidence offered by both parties in litigation, in order to reach its own conclusions. Recent case law shows how the court has applied the Bolitho approach in determining the standard of care in cases of clinical negligence. An understanding of this approach and of the shift from the traditional Bolam test is relevant to all medical practitioners, particularly in a climate that is increasingly litigious.
Full Text Available The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It. Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170 examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA and construct validity with enjoyment perception during physical activity. Study 2 (n = 59 reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry over the span of seven consecutive days. Study 3 (n = 58 examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD. In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83. Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36, with BMI (r = -.30 and -.79 for CHD simple form, and with the VO2max (r = .55 for CHD simple form. Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05. Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio
The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.
Alexander, Dayna S; Alfonso, Moya L; Cao, Chunhua
Currently, public health practitioners are analyzing the role that caregivers play in childhood obesity efforts. Assessing African American caregiver's perceptions of childhood obesity in rural communities is an important prevention effort. This article's objective is to describe the development and psychometric testing of a survey tool to assess childhood obesity perceptions among African American caregivers in a rural setting, which can be used for obesity prevention program development or evaluation. The Childhood Obesity Perceptions (COP) survey was developed to reflect the multidimensional nature of childhood obesity including risk factors, health complications, weight status, built environment, and obesity prevention strategies. A 97-item survey was pretested and piloted with the priority population. After pretesting and piloting, the survey was reduced to 59-items and administered to 135 African American caregivers. An exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) was conducted to test how well the survey items represented the number of Social Cognitive Theory constructs. Twenty items were removed from the original 59-item survey and acceptable internal consistency of the six factors (α=0.70-0.85) was documented for all scales in the final COP instrument. CFA resulted in a less than adequate fit; however, a multivariate Lagrange multiplier test identified modifications to improve the model fit. The COP survey represents a promising approach as a potentially comprehensive assessment for implementation or evaluation of childhood obesity programs. Copyright © 2016 Elsevier Ltd. All rights reserved.
Dodd, Caleb D.; Burris, Scott; Fraze, Steve; Doerfert, David; McCulloch, Abigail
The incorporation of hot and cold food bars into grocery stores in an effort to capture a portion of the home meal replacement industry is presenting new challenges for retail food establishments. To ensure retail success and customer safety, employees need to be educated in food safety practices. Traditional methods of training are not meeting…
Wiklander, Maria; Brännström, Johanna; Svedhem, Veronica; Eriksson, Lars E
Barriers to HIV testing experienced by individuals at risk for HIV can result in treatment delay and further transmission of the disease. Instruments to systematically measure barriers are scarce, but could contribute to improved strategies for HIV testing. Aims of this study were to develop and test a barriers to HIV testing scale in a Swedish context. An 18-item scale was developed, based on an existing scale with addition of six new items related to fear of the disease or negative consequences of being diagnosed as HIV-infected. Items were phrased as statements about potential barriers with a three-point response format representing not important, somewhat important, and very important. The scale was evaluated regarding missing values, floor and ceiling effects, exploratory factor analysis, and internal consistencies. The questionnaire was completed by 292 adults recently diagnosed with HIV infection, of whom 7 were excluded (≥9 items missing) and 285 were included (≥12 items completed) in the analyses. The participants were 18-70 years old (mean 40.5, SD 11.5), 39 % were females and 77 % born outside Sweden. Routes of transmission were heterosexual transmission 63 %, male to male sex 20 %, intravenous drug use 5 %, blood product/transfusion 2 %, and unknown 9 %. All scale items had <3 % missing values. The data was feasible for factor analysis (KMO = 0.92) and a four-factor solution was chosen, based on level of explained common variance (58.64 %) and interpretability of factor structure. The factors were interpreted as; personal consequences, structural barriers, social and economic security, and confidentiality. Ratings on the minimum level (suggested barrier not important) were common, resulting in substantial floor effects on the scales. The scales were internally consistent (Cronbach's α 0.78-0.91). This study gives preliminary evidence of the scale being feasible, reliable and valid to identify different types of barriers to HIV testing.
Giuffrida, Michelle A; Brown, Dorothy Cimino; Ellenberg, Susan S; Farrar, John T
OBJECTIVE To describe development and initial psychometric testing of an owner-reported questionnaire designed to standardize measurement of general quality of life (QOL) in dogs with cancer. DESIGN Key-informant interviews, questionnaire development, and field trial. SAMPLE Owners of 25 dogs with cancer for item development and pretesting and owners of 90 dogs with cancer for reliability and validity testing. PROCEDURES Standard methods for development and testing of questionnaire instruments intended to measure subjective states were used. Items were generated, selected, scaled, and pretested for content, meaning, and readability. Response items were evaluated with exploratory factor analysis and by assessing internal consistency (Cronbach α) and convergence with global QOL as determined with a visual analog scale. Preliminary tests of stability and responsiveness were performed. RESULTS The final questionnaire-which was named the Canine Owner-Reported Quality of Life (CORQ) questionnaire-contained 17 items related to observable behaviors commonly used by owners to evaluate QOL in their dogs. Several items pertaining to physical symptoms performed poorly and were omitted. The 17 items were assigned to 4 factors-vitality, companionship, pain, and mobility-on the basis of the items they contained. The CORQ questionnaire and its factors had high internal consistency (Cronbach α = 0.68 to 0.90) and moderate to strong correlations (r = 0.49 to 0.71) with global QOL as measured on a visual analog scale. Preliminary testing indicated good test-retest reliability and responsiveness to improvements in overall QOL. CONCLUSIONS AND CLINICAL RELEVANCE The CORQ questionnaire was a valid, reliable owner-reported questionnaire that measured general QOL in dogs with cancer and showed promise as a clinical trial outcome measure for quantifying changes in individual dog QOL occurring in response to cancer treatment and progression.
Conclusion: Although both the chained linear equating method and Rasch analysis can be readily applied to practical test-equating issues in medical education, Rasch analysis exhibited more versatility in test parameter estimation and item bank development for clinical curriculums.
Kolman, Nikki; Huijgen, Barbara; Kramer, Tamara; Elferink-Gemser, Marije; Visscher, Chris
This study examined the test-retest reliability, validity and feasibility of the newly developed Dutch Technical Tactical Tennis Test (D4T). This new test is relevant for talent identification and development in tennis. Thirty-two youth male tennis players (age 13.4 +/- 0.5) were classified as elite
Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B
This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients with mixed cancers, in different treatment phases from 10 countries completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54 % females) were analyzed. Completion took issues were addressed. Analyses suggested a revision of the phase III hypothesized scale structure. Two items were deleted based on a high degree of item misfit, together with negative patient feedback. The remaining 15 items formed one eight-item scale named OH-QoL score, a two-item information scale, a two-item scale regarding dentures, and three single items (sticky saliva/mouth soreness/sensitivity to food/drink). Face and convergent validity and internal consistency were confirmed. Test-retest reliability (n = 60) was demonstrated as was RCA for patients undergoing chemotherapy (n = 117; p = 0.06). The resulting QLQ-OH15 discriminated between clinically distinct patient groups, e.g., low performance status vs. higher (p < 000.1), and head-and-neck cancer versus other cancers (p < 0.03). The EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.
Full Text Available Introduction. Disturbed eating attitudes may be important precursors of pathological eating patterns and, therefore need to be researched adequately. The Children's Eating Attitude Test (ChEAT is indicated for detecting at-risk attitudes and concerns in youngsters. Method. The present study was designed to provide a preliminary psychometric evaluation of the Dutch version of the ChEAT, by examining reliability and validity in a sample of 166 youngsters. Results. Generally the ChEAT seems to be a reliable instrument. Concurrent validity was demonstrated by positive correlations with measures assessing pathological eating behaviour and with related psychological problems. The discriminant validity was good. Based on ChEAT scores we can distinguish overweight youngsters from the community sample and “dieters” from “non dieters”. Divergent validity and factor structure show still shortcomings. Discussion. The Dutch version of the ChEAT seems to be a promising screening- and research instrument. Future prospective research could focus on a cut-off score for identifying at-risk youngsters.
Remor, Eduardo; Fuster-RuizdeApodaca, Maria José; Ballester-Arnal, Rafael; Gómez-Martínez, Sandra; Fumaz, Carmina R; González-Garcia, Marian; Ubillos-Landa, Silvia; Aguirrezabal-Prado, Arrate; Molero, Fernando; Ruzafa-Martínez, Maria
The Screenphiv, a screening measure for psychological issues related to HIV, was psychometrically tested in a study involving 744 HIV-infected people in Spain. Participants ages 18-82 (M = 43.04, 72 % men, 28 % women) completed an assessment protocol that included the Screenphiv and the MOS-HIV. A trained interviewer also collected relevant illness-related clinical data and socio-demographics from the participants. A confirmatory factor analysis was used to evaluate the goodness of fit of the Screenphiv's theoretical model and confirmed six first-order factors and two second-order factors [RMSEA (IC 90 %) = 0.07 (0.07-0.08)]. No floor or ceiling effects were observed for the scores. Cronbach's alphas were acceptable for all of the factors (from 0.65 to 0.92). Criterion-related validity also achieved; Screenphiv scores were related to socio-demographic and clinical variables and MOS-HIV summary scores. The Screenphiv is a reliable and valid measure, ready to use in research and clinical settings in Spain.
Oliveri, María Elena; von Davier, Alina A.
In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…
Full Text Available This study examined the test-retest reliability, validity and feasibility of the newly developed Dutch Technical-Tactical Tennis Test (D4T. This new test is relevant for talent identification and development in tennis. Thirty-two youth male tennis players (age 13.4 ± 0.5 were classified as elite (n = 15 or sub-elite (n = 17 according to their position on the national youth ranking list under 15 years (cut-off rank 50 in the Netherlands. Games, rallies and different tactical situations (i.e. offensive, neutral and defensive were simulated with a ball machine. Players had to return 72 balls to predetermined target areas. Stroke quality was recorded based on ball velocity and accuracy (VA-index, as well as percentage errors. Test-retest reliability was assessed by comparing differences between the first and second test-session (n = 10. An intraclass-correlation coefficient of .78 for the VA-index was found (p < .05, indicating excellent test-retest reliability. Independent t-tests revealed that elite players outscored sub-elite players for the VA-index, ball velocity, accuracy and percentage errors (p < .05, supporting good validity. Furthermore, a high correlation was found between the VA-index and individual positions on the youth ranking list (p = -.75; p < .001. The assessment of feasibility indicated that the D4T was applicable for instructors and coaches. In conclusion, the D4T was shown to be a reliable, valid and feasible test to measure technical-tactical characteristics of tennis performance in youth players.
Verster, Joris C.; Roth, Thomas
Rationale There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving is often not determined. Objective The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Methods Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a ...
Lahey, Benjamin B.; Applegate, Brooks; Chronis, Andrea M.; Jones, Heather A.; Williams, Stephanie Hall; Loney, Jan; Waldman, Irwin D.
Lahey and Waldman proposed a developmental propensity model in which three dimensions of children's emotional dispositions are hypothesized to transact with the environment to influence risk for conduct disorder, heterogeneity in conduct disorder, and comorbidity with other disorders. To prepare for future tests of this model, a new measure of…
Raykov, Tenko; Marcoulides, George A.; Dimitrov, Dimiter M.; Li, Tatyana
This article extends the procedure outlined in the article by Raykov, Marcoulides, and Tong for testing congruence of latent constructs to the setting of binary items and clustering effects. In this widely used setting in contemporary educational and psychological research, the method can be used to examine if two or more homogeneous…
Hakim-Larson, Julie; Parker, Alison; Lee, Catharine; Goodwin, Jacqueline; Voelker, Sylvia
Parental meta-emotion, assessed through interviews, involves parents' philosophy about emotions and has been found to be related to parenting behaviors and children's emotional and social competence (e.g., Gottman, Katz, & Hooven, 1996; Katz & Windecker-Nelson, 2004). The Emotion-Related Parenting Styles Self-Test is a true-false…
Richard R. McNeer
Full Text Available Introduction. Medical simulators are used for assessing clinical skills and increasingly for testing hypotheses. We developed and tested an approach for assessing performance in anesthesia residents using screen-based simulation that ensures expert raters remain blinded to subject identity and experimental condition. Methods. Twenty anesthesia residents managed emergencies in an operating room simulator by logging actions through a custom graphical user interface. Two expert raters rated performance based on these entries using custom Global Rating Scale (GRS and Crisis Management Checklist (CMC instruments. Interrater reliability was measured by calculating intraclass correlation coefficients (ICC, and internal consistency of the instruments was assessed with Cronbach’s alpha. Agreement between GRS and CMC was measured using Spearman rank correlation (SRC. Results. Interrater agreement (GRS: ICC = 0.825, CMC: ICC = 0.878 and internal consistency (GRS: alpha = 0.838, CMC: alpha = 0.886 were good for both instruments. Subscale analysis indicated that several instrument items can be discarded. GRS and CMC scores were highly correlated (SRC = 0.948. Conclusions. In this pilot study, we demonstrated that screen-based simulation can allow blinded assessment of performance. GRS and CMC instruments demonstrated good rater agreement and internal consistency. We plan to further test construct validity of our instruments by measuring performance in our simulator as a function of training level.
Brown, Gregory G; Thomas, Michael L; Patt, Virginie
Neuropsychology is an applied measurement field with its psychometric work primarily built upon classical test theory (CTT). We describe a series of psychometric models to supplement the use of CTT in neuropsychological research and test development. We introduce increasingly complex psychometric models as measurement algebras, which include model parameters that represent abilities and item properties. Within this framework of parametric model measurement (PMM), neuropsychological assessment involves the estimation of model parameters with ability parameter values assuming the role of test 'scores'. Moreover, the traditional notion of measurement error is replaced by the notion of parameter estimation error, and the definition of reliability becomes linked to notions of item and test information. The more complex PMM approaches incorporate into the assessment of neuropsychological performance formal parametric models of behavior validated in the experimental psychology literature, along with item parameters. These PMM approaches endorse the use of experimental manipulations of model parameters to assess a test's construct representation. Strengths and weaknesses of these models are evaluated by their implications for measurement error conditional upon ability level, sensitivity to sample characteristics, computational challenges to parameter estimation, and construct validity. A family of parametric psychometric models can be used to assess latent processes of interest to neuropsychologists. By modeling latent abilities at the item level, psychometric studies in neuropsychology can investigate construct validity and measurement precision within a single framework and contribute to a unification of statistical methods within the framework of generalized latent variable modeling.
Pearcy, Benjamin T D; Roberts, Lynne D; McEvoy, Peter M
Internet Gaming Disorder (IGD) is in the early stages of recognition as a disorder, following its inclusion in the Diagnostic and Statistical Manual for Mental Disorders (DSM-5; American Psychiatric Association(1)) as a condition for further study. Existing measures of Internet gaming pathology are limited in their ability to measure IGD as defined in the DSM-5. We present the initial development and validation of a new measure derived from the proposed DSM-5 criteria for IGD, the Personal Internet Gaming Disorder Evaluation-9 (PIE-9). A student sample (n = 119) and a community sample (n = 285), sourced through a variety of online gaming forums, completed an online survey comprising the new measure, existing measures of IGD, and a range of health and demographic questions. Exploratory and confirmatory factor analysis supported a single factor structure for the 9-item PIE-9. Internal consistency (α = 0.89) and test-retest reliability (intraclass correlation [ICC] = 0.77) were high. Convergent validity was demonstrated with similar gaming addiction measures. Predictive validity was established through significant differences in distress and disability between those who met the criteria for IGD and those who did not. The distress and disability associated with meeting IGD criteria fell within the range of other common DSM-5 disorders. Preliminary testing of the PIE-9 has demonstrated that it is an efficient and straightforward measure for use in further research of IGD, and as a potential screening measure in clinical practice.
Porcelli, Anthony John
Recently research on the relationship between stress and cognition, emotion, and behavior has greatly increased. These advances have yielded insights into important questions ranging from the nature of stress' influence on addiction(1) to the role of stress in neural changes associated with alterations in decision-making(2,3). As topics being examined by the field evolve, however, so too must the methodologies involved. In this article a practical and effective alternative to a classic stress induction technique, the cold pressor test (CPT), is presented: the cold pressor arm wrap (CPAW). CPT typically involves immersion of a participant's dominant hand in ice-cold water for a period of time(4). The technique is associated with robust activation of the sympatho-adrenomedullary (SAM) axis (and release of catecholamines; e.g. adrenaline and noradrenaline) and mild-to-moderate activation of the hypothalamic-pituitary-adrenal (HPA) axis with associated glucocorticoid (e.g. cortisol) release. While CPT has been used in a wide range of studies, it can be impractical to apply in some research environments. For example use of water during, rather than prior to, magnetic resonance imaging (MRI) has the potential to damage sensitive and expensive equipment or interfere with acquisition of MRI signal. The CPAW is a practical and effective alternative to the traditional CPT. Composed of a versatile list of inexpensive and easily acquired components, CPAW makes use of MRI-safe gelpacs cooled to a temperature similar to CPT rather than actual water. Importantly CPAW is associated with levels of SAM and HPA activation comparable to CPT, and can easily be applied in a variety of research contexts. While it is important to maintain specific safety protocols when using the technique, these are easy to implement if planned for. Creation and use of the CPAW will be discussed.
Full Text Available Cephinae is traditionally divided into three tribes and about 24 genera based on morphology and host utilization. There has been no study testing the monophyly of taxa under a strict phylogenetic criterion. A molecular phylogeny of Cephinae based on a total of 68 sequences of mtDNA COI gene, representing seven genera of Cephinae, is reconstructed to test the traditional limits and relationships of taxa. Monophyly of the traditional tribes is not supported. Monophyly of the genera are largely supported except for Pachycephus. A few host shift events are suggested based on phylogenetic relationships among taxa. These results indicate that a more robust phylogeny is required for a more plausible conclusion. We also report two species of Cephus for the first time from Turkey.
Powers, Kasey L.; Brooks, Patricia J.; Galazyn, Magdalena; Donnelly, Seamus
Online course-packs are marketed as improving grades in introductory-level coursework, yet it is unknown whether these course-packs can effectively replace, as opposed to supplement, in-class instruction. This study compared learning outcomes for Introductory Psychology students in hybrid and traditional sections, with hybrid sections replacing…
Block scheduling is the reallocation of a school day into longer class sessions to allow for more active teaching strategies and active engagement of students, in the effort to increase student performance. Various types of block scheduling exist. Traditional scheduling is when the school day is divided into six to eight sessions, with each…
Ana Paula Porto Noronha
Full Text Available A validade e a precisão de testes psicológicos vêm sendo bastante questionadas e discutidas atualmente. O presente estudo teve como objetivo avaliar a validade, a precisão e a existência de padronização brasileira em 43 testes psicológicos comercializados no Brasil, sendo 22 de inteligência e 21 de personalidade. Os testes foram comparados quanto ao período de publicação no Brasil. Os resultados indicaram que existe maior número de instrumentos publicados nas décadas de 1980 e 1990, que os testes de inteligência apresentam mais estudos de padronização, validade e precisão, embora não tenha havido diferença significante entre os grupos de testes (inteligência e personalidade. Novos estudos devem ser desenvolvidos com o intuito de promover os testes psicológicos e a área de avaliação psicológica, como um todo.Nowadays, the validity and the reliability of psychological tests are being very questioned and discussed. The current study aims to evaluate the validity, the reliability and the existence of Brazilian standardization in 43 psychological tests commercialized in Brazil, being 22 of intelligence and 21 of personality. The tests have been compared concearning the publication period in Brazil. The results have indicated that there is a bigger number of instruments published in the 1980´s and 1990´s, that the intelligence tests present more studies of standardization, validity and reliability, even though there hasn´t been significant difference among the groups tests (intelligence and personality. New studies must be developed aiming to promote the psychological tests and the area of psychological assessment, as a whole.
Vedam, Saraswathi; Stoll, Kathrin; Martin, Kelsey; Rubashkin, Nicholas; Partridge, Sarah; Thordarson, Dana; Jolicoeur, Ganga
To develop and validate a new instrument that assesses women's autonomy and role in decision making during maternity care. Through a community-based participatory research process, service users designed, content validated, and administered a cross-sectional quantitative survey, including 31 items on the experience of decision-making. Pregnancy experiences (n = 2514) were reported by 1672 women who saw a single type of primary maternity care provider in British Columbia. They described care by a midwife, family physician or obstetrician during 1, 2 or 3 maternity care cycles. We conducted psychometric testing in three separate samples. We assessed reliability, item-to-total correlations, and the factor structure of the The Mothers' Autonomy in Decision Making (MADM) scale. We report MADM scores by care provider type, length of prenatal appointments, preferences for role in decision-making, and satisfaction with experience of decision-making. The MADM scale measures a single construct: autonomy in decision-making during maternity care. Cronbach alphas for the scale exceeded 0.90 for all samples and all provider groups. All item-to-total correlations were replicable across three samples and exceeded 0.7. Eigenvalue and scree plots exhibited a clear 90-degree angle, and factor analysis generated a one factor scale. MADM median scores were highest among women who were cared for by midwives, and 10 or more points lower for those who saw physicians. Increased time for prenatal appointments was associated with higher scale scores, and there were significant differences between providers with respect to average time spent in prenatal appointments. Midwifery care was associated with higher MADM scores, even during short prenatal appointments (maternity care. This new scale was developed and content validated by community members representing various populations of childbearing women in BC including women from vulnerable populations. MADM measures women's ability to lead
Amber E. Vaughn
development (r = 0.21, and nutrition policy (r = 0.18. Child MVPA was significantly associated with overall time provided for activity (r = 0.18 and outdoor playtime (r = 0.20. There was also an unexpected negative association between child MVPA and screen time (−0.16 and screen time practices (r = −0.21. Conclusions The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. Trial registration NCT01814215 , March 15, 2013.
Vaughn, Amber E; Mazzucca, Stephanie; Burney, Regan; Østbye, Truls; Benjamin Neelon, Sara E; Tovar, Alison; Ward, Dianne S
= 0.18). Child MVPA was significantly associated with overall time provided for activity (r = 0.18) and outdoor playtime (r = 0.20). There was also an unexpected negative association between child MVPA and screen time (-0.16) and screen time practices (r = -0.21). The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. NCT01814215 , March 15, 2013.
Kötter, T; Obst, K U; Brüheim, L; Eisemann, N; Voltmer, E; Katalinic, A
Background The final exam grade is the main selection criterion for medical school application in Germany. For academic success, it seems to be a reliable predictor. Its use as the only selection criterion is, however, criticised. At some universities, personal interviews are part of the selection process. However, these are very time consuming and are of doubtful validity. The (additional) use of appropriate psychometric instruments could reduce the cost and increase the validity. This study investigates the extent to which psychometric instruments can predict the outcome of a personal selection interview. Methods This is a cross-sectional study on the correlation of the results of psychometric instruments with those of the personal selection interview as part of the application process. As the outcome, the score of the selection interview was used. The NEO - Five Factor Inventory, the Hospital Anxiety and Depression Scale (HADS) and the questionnaire to identify work-related behaviour and experience patterns (AVEM) were used as psychometric interviews. Results There was a statistically significant correlation with the results of the personal selection interview for the sum score of the depression scale from the HADS and the sum score for the dimension of life satisfaction of the AVEM. In addition, those participants who did not previously complete an application training achieved a better result in the selection interview. Conclusion The instruments used measure different aspects than the interviews and cannot replace them. It remains to be seen whether the selected parameters are able to predict academic success. © Georg Thieme Verlag KG Stuttgart · New York.
Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane
Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765
Bolt, Daniel; Ark, L; Wang, Wen-Chung
The 78th Annual Meeting of the Psychometric Society (IMPS) builds on the Psychometric Society's mission to share quantitative methods relevant to psychology. The chapters of this volume present cutting-edge work in the field. Topics include studies of item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Additional psychometric topics relate to structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis, among others. The papers in this volume will be especially useful for researchers in the social sciences who use quantitative methods. Prior knowledge of statistical methods is recommended. The 78th annual meeting took place in Arnhem, The Netherlands between July 22nd and 26th, 2013. The previous volume to showcase work from the Psychometric Society’s Meeting is New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 201...
Schmieder, Daniela A; Benítez, Hugo A; Borissov, Ivailo M; Fruciano, Carmelo
External morphology is commonly used to identify bats as well as to investigate flight and foraging behavior, typically relying on simple length and area measures or ratios. However, geometric morphometrics is increasingly used in the biological sciences to analyse variation in shape and discriminate among species and populations. Here we compare the ability of traditional versus geometric morphometric methods in discriminating between closely related bat species--in this case European horseshoe bats (Rhinolophidae, Chiroptera)--based on morphology of the wing, body and tail. In addition to comparing morphometric methods, we used geometric morphometrics to detect interspecies differences as shape changes. Geometric morphometrics yielded improved species discrimination relative to traditional methods. The predicted shape for the variation along the between group principal components revealed that the largest differences between species lay in the extent to which the wing reaches in the direction of the head. This strong trend in interspecific shape variation is associated with size, which we interpret as an evolutionary allometry pattern.
Daniela A Schmieder
Full Text Available External morphology is commonly used to identify bats as well as to investigate flight and foraging behavior, typically relying on simple length and area measures or ratios. However, geometric morphometrics is increasingly used in the biological sciences to analyse variation in shape and discriminate among species and populations. Here we compare the ability of traditional versus geometric morphometric methods in discriminating between closely related bat species--in this case European horseshoe bats (Rhinolophidae, Chiroptera--based on morphology of the wing, body and tail. In addition to comparing morphometric methods, we used geometric morphometrics to detect interspecies differences as shape changes. Geometric morphometrics yielded improved species discrimination relative to traditional methods. The predicted shape for the variation along the between group principal components revealed that the largest differences between species lay in the extent to which the wing reaches in the direction of the head. This strong trend in interspecific shape variation is associated with size, which we interpret as an evolutionary allometry pattern.
Davis, Frederick B.
This review of psychometric research in reading analyzes the factors which seem related to reading comprehension skills. Experimental analysis of reading comprehension by L. E. Thorndike revealed two major components: knowledge of word meanings and verbal reasoning abilities. Subsequent analysis of experimental studies of reading comprehension…
Ishikawa, Keisuke; Eguchi, Yusuke; Uetake, Katsuji; Tnaka, Toshio
The puppy aptitude test (PAT) is a general method for choosing a puppy. However, the reliability of the test has been in doubt because of its lack of a scientific base. In this report, we conducted PAT and some other behavioral tests before and after the establishment of socialization in order to investigate the stability of behavioral traits of puppies. Ten puppies were tested of eleven items on PAT at 57 and 140 days of age. The puppy’s behavior was videotaped and rated by four people using...
Mathias, Susan D; Bussel, James B; George, James N; McMillan, Robert; Okano, Gary J; Nichol, Janet L
The Immune Thrombocytopenic Purpura Patient Assessment Questionnaire (ITP-PAQ) was developed to assess disease-specific quality of life (QoL) in adults with ITP. It is a 44-item questionnaire that includes scales for physical health (symptoms, fatigue/sleep, bother, and activity), emotional health (psychological and fear), overall QoL, social activity, women's reproductive health, and work. A previous study reported preliminary evidence of its reliability and validity. The present study was conducted to ascertain the responsiveness (ability to detect a clinically important treatment effect), reliability, and validity of the ITP-PAQ and to corroborate the earlier findings. The women's reproductive health scale was evaluated for psychometric evidence of the existence of separate menstrual symptoms and fertility subscales. The ITP-PAQ was evaluated in the context of an ongoing open-label extension study assessing the tolerability and durability of increases in the platelet count with AMG 531 (a thrombopoiesis peptibody that increases platelet production by targeting the thrombopoietin receptor) administered by subcutaneous injection once weekly in adult patients with ITP It was self-administered at baseline and at weeks 4, 12, and 24. The responsiveness of the questionnaire was evaluated by calculating and comparing the change scores of patients who showed clinical improvement-categorized as platelet responders (those with a platelet count > or =50 x 10(9) cells/L and a doubling of baseline values at week 24) and durable platelet responders (those with a platelet count > or =50 x 10(9) cells/L and a doubling of baseline values on > or =6 occasions during weeks 17-24)-with the change scores of patients wh did not show clinical improvement. The reliability (internal consistency and test-retest) and validity (convergent, discriminant, and known groups) of the questionnaire were also evaluated. Validity was examined in terms of correlations between the ITP-PAQ and the 36
Lisette Paola Irarrazábal
Full Text Available OBJECTIVE: To compare the sensitivity and specificity of an Oral Rapid Test (ORT to that of the Enzyme-Linked Immunosorbent Assay (ELISA for HIV testing in Santiago, Chile; to track the number of study participants returning for ELISA testing results; and to analyze the participants' perceptions of the ORT compared to the ELISA. METHODS: A total of 497 people were recruited in Santiago, Chile: 153 had previously tested positive for HIV, and 344 were of unknown status. Participants were tested for HIV using both the ELISA and the ORT to examine and compare specificity and sensitivity. Qualitative data were collected from 22 participants to compare perceptions of the testing experience with ORT versus ELISA. RESULTS: The ELISA reported 184 (37% of the 497 participants as being "positive" for HIV antibodies; the ORT showed 181 (36.4% as being "reactive" for HIV. The ORT showed a sensitivity of 98.4% (95.7%-99.9%, 95% Confidence Interval and specificity of 100%. The Kappa test produced K = 0.983 (P < 0.0001. Of the 344 participants whose HIV status was unknown at the start of the study, 55 failed to return for their ELISA results. Participants positively perceived ORT as having reduced both waiting time and anxiety over obtaining their test results. ORT oral swabbing appeared more practical and less invasive than drawing blood for the ELISA. CONCLUSIONS: The ORT and ELISA were statistically equal in specificity and sensitivity. ORT provides quicker results, potentially ensuring that more people receive them, and does not require handling of or exposure to potentially hazardous blood products. Trial number: ClinicalTrials.gov identifier: NCT01733927.
Kikuti,Ana Lúcia Pereira; Marcos-Filho,Júlio
Seed vigor testing is an important component of quality control programs adopted by seed industry. The software Seed Vigor Imaging System (SVIS) has been successfully used for seed vigor assessment in different species. The objective of this research was to verify the SVIS efficiency to assess okra seed vigor in comparison to other vigor tests used for this species. Five seed lots of 'Clemson Americano' and four of 'Santa Cruz' were submitted to germination (speed and percentage), cold germin...
Chang, Chih-Cheng; Su, Jian-An; Tsai, Ching-Shu; Yen, Cheng-Fang; Liu, Jiun-Horng; Lin, Chung-Ying
To examine the psychometrics of the Affiliate Stigma Scale using rigorous psychometric analysis: classical test theory (CTT) (traditional) and Rasch analysis (modern). Differential item functioning (DIF) items were also tested using Rasch analysis. Caregivers of relatives with mental illness (n = 453; mean age: 53.29 ± 13.50 years) were recruited from southern Taiwan. Each participant filled out four questionnaires: Affiliate Stigma Scale, Rosenberg Self-Esteem Scale, Beck Anxiety Inventory, and one background information sheet. CTT analyses showed that the Affiliate Stigma Scale had satisfactory internal consistency (α = 0.85-0.94) and concurrent validity (Rosenberg Self-Esteem Scale: r = -0.52 to -0.46; Beck Anxiety Inventory: r = 0.27-0.34). Rasch analyses supported the unidimensionality of three domains in the Affiliate Stigma Scale and indicated four DIF items (affect domain: 1; cognitive domain: 3) across gender. Our findings, based on rigorous statistical analysis, verified the psychometrics of the Affiliate Stigma Scale and reported its DIF items. We conclude that the three domains of the Affiliate Stigma Scale can be separately used and are suitable for measuring the affiliate stigma of caregivers of relatives with mental illness. Copyright © 2015 Elsevier Inc. All rights reserved.
Pivato, Alberto; Gaspari, Lorenzo
Landfilling is a fundamental step in any waste management strategy, but it can constitute a hazard for the environment for a long time. The need to protect the environment from potential landfill emissions makes risk assessment a decision tool of extreme necessity. The heterogeneity of wastes and the complexity of physical, chemical and biological processes that occur in the body of a landfill need specific procedures in order to evaluate the groundwater risk for the environment. Given the complexity of the composition of landfill leachates, the exact contribution of each potential toxic substance cannot be known precisely. Some reference contaminants that constitute the hazard (toxicity) of leachate have to be found to perform the risk assessment. A preliminary ecotoxicological investigation with luminescent bacteria has been carried out on different leachates from traditional and sustainable landfills in order to rank the chemicals that better characterize the leachate (heavy metals, ammonia and dissolved organic content). The attention has been focused on ammonia because it is present in high concentration and can last for centuries and can seriously contaminate the groundwater. The results showed that the toxicity of the leachate might reliably depend on the ammonia concentration and that the leachate toxicity is considerably lower in sustainable landfills where the ammonia had been degraded. This has an important consequence because if the containment system fails (as usually occur within 30-50 yr), the risk of groundwater contamination will be calculated easier only in terms of the probability that the ammonia concentration is higher than a reference concentration
Rasmussen, Charlotte; Dalsgaard, Kristian
Sieving and sedimentation analyses by pipette or hydrometer are historically the traditional methods for determining particle size distributions (PSD). A more informative and faster alternative has for years been laser diffraction (LD). From 2003 to 2013 the authors of this paper have worked...... intensively with PSD and performed various tests and investigations, using LD, sedimentation (by pipette) and sieving. The aim was to improve and understand the relationship between these various techniques, pre-treatment effects and preferably find a unifying correlation factor. As a result, method...... comparisons of LD and sieving/sedimentation are difficult, as LD is a 3D optical volume measurement, sieving is a 2D width measurement, and sedimentation is density dependent. Platy particles like clay are generally measured to be coarser than traditional methods when LD is used. For LD the clay...
Cano, Stefan J; Posner, Holly B; Moline, Margaret L; Hurt, Stephen W; Swartz, Jina; Hsu, Tim; Hobart, Jeremy C
The Alzheimer's Disease Assessment Scale Cognitive Behavior Section (ADAS-cog), a measure of cognitive performance, has been used widely in Alzheimer's disease trials. Its key role in clinical trials should be supported by evidence that it is both clinically meaningful and scientifically sound. Its conceptual and neuropsychological underpinnings are well-considered, but its performance as an instrument of measurement has received less attention. Objective To examine the traditional psychometric properties of the ADAS-cog in a large sample of people with Alzheimer's disease. Data from three clinical trials of donepezil (Aricept) in mild-to-moderate Alzheimer's disease (n=1421; MMSE 10-26) were analysed at both the scale and component level. Five psychometric properties were examined using traditional psychometric methods. These methods of examination underpin upcoming Food and Drug Administration recommendations for patient rating scale evaluation. At the scale-level, criteria tested for data completeness, scaling assumptions (eg, component total correlations: 0.39-0.67), targeting (no floor or ceiling effects), reliability (eg, Cronbach's α: = 0.84; test-retest intraclass correlations: 0.93) and validity (correlation with MMSE: -0.63) were satisfied. At the component level, 7 of 11 ADAS-cog components had substantial ceiling effects (range 40-64%). Performance was satisfactory at the scale level, but most ADAS-cog components were too easy for many patients in this sample and did not reflect the expected depth and range of cognitive performance. The clinical implication of this finding is that the ADAS-cog's estimate of cognitive ability, and its potential ability to detect differences in cognitive performance under treatment, could be improved. However, because of the limitations of traditional psychometric methods, further evaluations would be desirable using additional rating scale analysis techniques to pinpoint specific improvements.
Berger, Kim Murray; Gese, Eric M; Berger, Joel
The traditional trophic cascades model is based on consumer resource interactions at each link in a food chain. However, trophic-level interactions, such as mesocarnivore release resulting from intraguild predation, may also be important mediators of cascades. From September 2001 to August 2004, we used spatial and seasonal heterogeneity in wolf distribution and abundance in the southern Greater Yellowstone Ecosystem to evaluate whether mesopredator release of coyotes (Canis latrans), resulting from the extirpation of wolves (Canis lupus), accounts for high rates of coyote predation on pronghorn (Antilocapra americana) fawns observed in some areas. Results of this ecological perturbation in wolf densities, coyote densities, and pronghorn neonatal survival at wolf-free and wolf-abundant sites support the existence of a species-level trophic cascade. That wolves precipitated a trophic cascade was evidenced by fawn survival rates that were four-fold higher at sites used by wolves. A negative correlation between coyote and wolf densities supports the hypothesis that interspecific interactions between the two species facilitated the difference in fawn survival. Whereas densities of resident coyotes were similar between wolf-free and wolf-abundant sites, the abundance of transient coyotes was significantly lower in areas used by wolves. Thus, differential effects of wolves on solitary coyotes may be an important mechanism by which wolves limit coyote densities. Our results support the hypothesis that mesopredator release of coyotes contributes to high rates of coyote predation on pronghorn fawns, and demonstrate the importance of alternative food web pathways in structuring the dynamics of terrestrial systems.
Coombes, Lee; Roberts, Martin; Zahra, Daniel; Burr, Steven
It is incumbent on medical schools to show, both to regulatory bodies and to the public at large, that their graduating students are "fit for purpose" as tomorrow's doctors. Since students graduate by virtue of passing assessments, it is vital that schools quality assure their assessment procedures, standards, and outcomes. An important part of this quality assurance process is the appropriate use of psychometric analyses. This begins with development of an empowering, evidence-based culture in which assessment validity can be demonstrated. Preparation prior to an assessment requires the establishment of appropriate rules, test blueprinting and standard setting. When an assessment has been completed, the reporting of test results should consider reliability, assessor, demographic, and long-term analyses across multiple levels, in an integrated way to ensure the information conveyed to all stakeholders is meaningful.
Franklin, Ashley E; Burns, Paulette; Lee, Christopher S
In 2006, the National League for Nursing published three measures related to novice nurses' beliefs about self-confidence, scenario design, and educational practices associated with simulation. Despite the extensive use of these measures, little is known about their reliability and validity. The psychometric properties of the Student Satisfaction and Self-Confidence in Learning Scale, Simulation Design Scale, and Educational Practices Questionnaire were studied among a sample of 2200 surveys completed by novice nurses from a liberal arts university in the southern United States. Psychometric tests included item analysis, confirmatory and exploratory factor analyses in randomly-split subsamples, concordant and discordant validity, and internal consistency. All three measures have sufficient reliability and validity to be used in education research. There is room for improvement in content validity with the Student Satisfaction and Self-Confidence in Learning and Simulation Design Scale. This work provides robust evidence to ensure that judgments made about self-confidence after simulation, simulation design and educational practices are valid and reliable. Copyright © 2014 Elsevier Ltd. All rights reserved.
Pera, D.; Ferrando, I.
The experimental campaign represents an example of how the careful design of the different test phases and the choice of the needed instrumentation are fundamental aspects to obtain a proper interpretation of the results, for future application on the real structures. Additionally, the present work represents a successful example of a combined monitoring system, integrating the traditional and innovative technical instrumentation for loading tests and geomatics survey techniques. The monitoring system has been designed with the aim of defining the load carrying capacity of a masonry arch bridge scaled model and to test the performances of a new retrofitting method. In particular, two different configuration have been considered: a first one with isolated arch and a second one with gravel fill on the arch.
Full Text Available The experimental campaign represents an example of how the careful design of the different test phases and the choice of the needed instrumentation are fundamental aspects to obtain a proper interpretation of the results, for future application on the real structures. Additionally, the present work represents a successful example of a combined monitoring system, integrating the traditional and innovative technical instrumentation for loading tests and geomatics survey techniques. The monitoring system has been designed with the aim of defining the load carrying capacity of a masonry arch bridge scaled model and to test the performances of a new retrofitting method. In particular, two different configuration have been considered: a first one with isolated arch and a second one with gravel fill on the arch.
Magallanes, Adriana Moreno
In response to low achievement in mathematics at a middle school, an ethnomathematic approach was used to teach coordinate planes. Whether there were achievement differences between students taught by the culturally sensitive approach and those taught by a traditional method was studied. Data were collected from the coordinate planes unit…
Vitoratou, Silia; Pickles, Andrew
Psychometrics provide the mathematical underpinnings for psychological assessment. From the late 19th century, a plethora of methodological research achievements equipped researchers and clinicians with efficient tools whose practical value becomes more evident in the era of the internet and big data. Nowadays, powerful probabilistic models exist for most types of data and research questions. As the usability of the psychometric scales is better comprehended, there is an increased interest in applied research outcomes. Paradoxically, while the interest in applications for psychometric scales increases, publishing research on the development and/or evaluation of those scales per se, is not welcomed by many relevant journals. This special issue in psychometrics is therefore a great opportunity to briefly review the main ideas and methods used in psychometrics, and to discuss the challenges in contemporary applied psychometrics.
Neguț, Alexandra; Jurma, Anda Maria; David, Daniel
Virtual-reality-based assessment may be a good alternative to classical or computerized neuropsychological assessment due to increased ecological validity. ClinicaVR: Classroom-CPT (VC) is a neuropsychological test embedded in virtual reality that is designed to assess attention deficits in children with attention deficit hyperactivity disorder (ADHD) or other conditions associated with impaired attention. The present study aimed to (1) investigate the diagnostic validity of VC in comparison to a traditional continuous performance test (CPT), (2) explore the task difficulty of VC, (3) address the effect of distractors on the performance of ADHD participants and typically-developing (TD) controls, and (4) compare the two measures on cognitive absorption. A total of 33 children diagnosed with ADHD and 42 TD children, aged between 7 and 13 years, participated in the study and were tested with a traditional CPT or with VC, along with several cognitive measures and an adapted version of the Cognitive Absorption Scale. A mixed multivariate analysis of covariance (MANCOVA) revealed that the children with ADHD performed worse on correct responses had more commissions and omissions errors than the TD children, as well as slower target reaction times . The results showed significant differences between performance in the virtual environment and the traditional computerized one, with longer reaction times in virtual reality. The data analysis highlighted the negative influence of auditory distractors on attention performance in the case of the children with ADHD, but not for the TD children. Finally, the two measures did not differ on the cognitive absorption perceived by the children.
Gurven, Michael; Jaeggi, Adrian V; von Rueden, Chris; Hooper, Paul L; Kaplan, Hillard
Sharing and exchange are common practices for minimizing food insecurity in rural populations. The advent of markets and monetization in egalitarian indigenous populations presents an alternative means of managing risk, with the potential impact of eroding traditional networks. We test whether market involvement buffers several types of risk and reduces traditional sharing behavior among Tsimane Amerindians of the Bolivian Amazon. Results vary based on type of market integration and scale of analysis (household vs. village), consistent with the notion that local culture and ecology shape risk management strategies. Greater wealth and income were unassociated with the reliance on others for food, or on reciprocity, but wealth was associated with a greater proportion of food given to others (i.e., giving intensity) and a greater number of sharing partners (i.e., sharing breadth). Across villages, greater mean income was negatively associated with reciprocity, but economic inequality was positively associated with giving intensity and sharing breadth. Incipient market integration does not necessarily replace traditional buffering strategies but instead can often enhance social capital.
Murphy, Kevin R; Wolach, Allen
Noted for its accessible approach, this text applies the latest approaches of power analysis to both null hypothesis and minimum-effect testing using the same basic unified model. Through the use of a few simple procedures and examples, the authors show readers with little expertise in statistical analysis how to obtain the values needed to carry out the power analysis for their research. Illustrations of how these analyses work and how they can be used to choose the appropriate criterion for defining statistically significant outcomes are sprinkled throughout. The book presents a simple and g
Cross-cultural adaptation and analysis of the psychometric properties of the Balance Evaluation Systems Test and MiniBESTest in the elderly and individuals with Parkinson's disease: application of the Rasch model
Angelica C. Maia
Full Text Available BACKGROUND: Older adults and individuals with neurological problems such as Parkinson's disease (PD exhibit balance deficits that might impair their mobility and independence. The assessment of balance must be useful in identifying the presence of instability and orient interventions. OBJECTIVE: To translate and perform a cross-cultural adaptation of the Balance Evaluation Systems Test (BESTest and MiniBESTest to Brazilian Portuguese and analyze its psychometric properties. METHOD: The tests were translated and adapted to Portuguese according to a standard method and then subjected to a test-retest reliability assessment (10 older adults; 10 individuals with PD. The psychometric properties were assessed by the Rasch model (35 older adults; 35 individuals with PD. RESULTS: The reliability coefficient of the tests relative to the items and subjects varied from 0.91 and 0.98, which is indicative of the stability and reproducibility of the measures. In the BESTest, the person (4.19 and item (5.36 separation index established six balance ability levels and seven levels of difficulty, respectively. In the MiniBESTest, the person (3.16 and item (6.41 separation index established four balance ability levels and nine levels of difficulty, respectively. Two items in the BESTest did not fit with the model expectations, but the construct validity was not compromised. No item in the MiniBESTest was erratic. CONCLUSIONS: The results corroborate the diagnostic and screening functions of the BESTest and MiniBESTest, respectively, and indicate that the Brazilian versions exhibit adequate reliability, construct validity, response stability, and capacity to distinguish among various balance ability levels in older adults and individuals with PD.
Combination of classical test theory (CTT) and item response theory (IRT) analysis to study the psychometric properties of the French version of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF).
Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric
The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.
Battistelli, A; Portoghese, I; Galletta, M; Pohl, S
This paper aimed to extend research on nurse turnover by developing and testing a theoretical model of turnover intention that includes two emergent key off-the-job constructs, work-family conflict (WFC) and community embeddedness (CE). Nurse turnover is considered one of the most significant issues in health care. There is a considerable body of knowledge that has focused on the study of the on-the-job factors of nurse turnover, showing the important role of job attitudes. Recently, WFC and job embeddedness (JE) have been identified as variables that could help explain levels of nurse turnover. Using structural equation modelling from a cross-sectional survey, the relationships between the variables were explored in a sample of 440 nurses from an Italian public hospital. The questionnaire measures demographic data and psychosocial factors such as job satisfaction, organizational commitment, WFC, CE and turnover intentions. The findings supported the importance of non-work dimensions in turnover models. The results suggest that when studying turnover phenomena in health organizations, the extra-work domains (WFC and JE) can contribute to a decrease in the intention to leave, in addition to the more typically emphasized attitude dimension. © 2012 The Authors. International Nursing Review © 2012 International Council of Nurses.
Sauerland, Melanie; Wolfs, Andrea C F; Crans, Samantha; Verschuere, Bruno
Direct eyewitness identification is widely used, but prone to error. We tested the validity of indirect eyewitness identification decisions using the reaction time-based concealed information test (CIT) for assessing cooperative eyewitnesses' face memory as an alternative to traditional lineup procedures. In a series of five experiments, a total of 401 mock eyewitnesses watched one of 11 different stimulus events that depicted a breach of law. Eyewitness identifications in the CIT were derived from longer reaction times as compared to well-matched foil faces not encountered before. Across the five experiments, the weighted mean effect size d was 0.14 (95% CI 0.08-0.19). The reaction time-based CIT seems unsuited for testing cooperative eyewitnesses' memory for faces. The careful matching of the faces required for a fair lineup or the lack of intent to deceive may have hampered the diagnosticity of the reaction time-based CIT.
Robert J. Taormina
Full Text Available A new measure of adherence to traditional values was created with the objective of facilitating research within and across cultures and nations. The measure was tested in China (N = 321 and Russia (N = 314 and factor analysis of the data revealed two subscales named Personal Traditional Values (10 items and Public Traditional Values (6 items. Empirical psychometric testing of the overall 16-item measure and the two subscales strongly supported the validity and reliability of all three measures. Means comparisons conducted to assess how well the measures could be used for cross-cultural comparisons revealed the Russians somewhat more than the Chinese living by traditional values overall, both nations about equal on living according to traditional values in their personal lives, and the Russians significantly more inclined to abide by traditional values in public. Also tested were several social and psychological variables as theoretical predictors of living by traditional values, and Life Satisfaction was tested as a possible correlate of living according to traditional values. Regression analyses on the combined data confirmed that Family Emotional Support, Conscientiousness, Collectivism, and Age were all significant positive predictors of living by traditional values. Additional regressions also found some unique predictors for each nation. These findings and the results of the parametric tests support the use of the new scales for measuring traditional values both within and across cultures.
Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Chow, Sy-Miin
These research articles from the 79th Annual Meeting of the Psychometric Society (IMPS) cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 79th annual meeting took place in Madison, WI between July 21nd and 25th, 2014. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013) and Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015).
Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Wiberg, Marie
The research articles in this volume cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 80th annual meeting took place in Beijing, China, between the 12th and 16th of July, 2014. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013), Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015), and Quantitative Psychology Research: The 79th Annual Meeting of the Psychometric Society, Wisconsin, USA, 2014 (Springer, 2015).
Full Text Available Abstract Background Studies on the effects of tuberculosis on a patient’s quality of life (QOL are scant. The objective of this study was to evaluate the psychometric properties of the Taiwan short version of the World Health Organization Quality of Life (WHOQOL-BREF questionnaire using patients with tuberculosis in Taiwan and healthy referents. Methods The Taiwanese short version of the WHOQOL-BREF was administered to patients with tuberculosis undergoing treatment and healthy referents from March 2007 to July 2007. Patients with tuberculosis (n = 140 and healthy referents (n = 130, matched by age, sex, and ethnicity, agreed to an interview. All participants lived in eastern Taiwan. Reliability assessments included internal consistency, whereas validity assessments included construct validity, convergent validity, and discriminant validity. Results More than half of these patients and referents were men (70.7% and 66.2%, respectively, and their average ages were 50.1 and 47.9 years, respectively. Approximately 60% of patients and referents were aboriginal Taiwanese (60.7% and 61.1%, respectively. The proportion with low socioeconomic status was greater for these patients. The internal consistency reliability coefficients were .92 and .93 for the patients and healthy referents, respectively. Exploratory factor analysis on the healthy referents displayed a 4-domain model, which was compatible with the original WHOQOL-BREF 4-domain model. However, for the TB patient group, after deleting 3 items, both exploratory and confirmatory factor analysis revealed a 6-domain model. Conclusion Psychometric evaluation of the Taiwan short version of the WHOQOL-BREF indicates that it has adequate reliability for use in research with TB patients in Taiwan. However, the factor structure generated from this TB patient sample differed from the WHO’s original 4-factor model, which raised a validity concern to apply the Taiwan short version of the WHOQOL
Eklund, Mona; Morville, Anne-Le
Aims: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. Methods: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed. Key words: validity, reliability, health, Activity...
Eklund, Mona; Morville, Anne-Le
AIMS: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. METHODS: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed....
Barnard, P.L.; Rubin, D.M.; Harney, J.; Mustain, N.
This extensive field test of an autocorrelation technique for determining grain size from digital images was conducted using a digital bed-sediment camera, or 'beachball' camera. Using 205 sediment samples and >1200 images from a variety of beaches on the west coast of the US, grain size ranging from sand to granules was measured from field samples using both the autocorrelation technique developed by Rubin [Rubin, D.M., 2004. A simple autocorrelation algorithm for determining grain size from digital images of sediment. Journal of Sedimentary Research, 74(1): 160-165.] and traditional methods (i.e. settling tube analysis, sieving, and point counts). To test the accuracy of the digital-image grain size algorithm, we compared results with manual point counts of an extensive image data set in the Santa Barbara littoral cell. Grain sizes calculated using the autocorrelation algorithm were highly correlated with the point counts of the same images (r2 = 0.93; n = 79) and had an error of only 1%. Comparisons of calculated grain sizes and grain sizes measured from grab samples demonstrated that the autocorrelation technique works well on high-energy dissipative beaches with well-sorted sediment such as in the Pacific Northwest (r2 ??? 0.92; n = 115). On less dissipative, more poorly sorted beaches such as Ocean Beach in San Francisco, results were not as good (r2 ??? 0.70; n = 67; within 3% accuracy). Because the algorithm works well compared with point counts of the same image, the poorer correlation with grab samples must be a result of actual spatial and vertical variability of sediment in the field; closer agreement between grain size in the images and grain size of grab samples can be achieved by increasing the sampling volume of the images (taking more images, distributed over a volume comparable to that of a grab sample). In all field tests the autocorrelation method was able to predict the mean and median grain size with ???96% accuracy, which is more than
Nowparast Rostami, Hadiseh; Sommer, Werner; Zhou, Changsong; Wilhelm, Oliver; Hildebrandt, Andrea
The enhanced N1 component in event-related potentials (ERP) to face stimuli, termed N170, is considered to indicate the structural encoding of faces. Previously, individual differences in the latency of the N170 have been related to face and object cognition abilities. By orthogonally manipulating content domain (faces vs objects) and task demands (easy/speed vs difficult/accuracy) in both psychometric and EEG tasks, we investigated the uniqueness of the processes underlying face cognition as compared with object cognition and the extent to which the N1/N170 component can explain individual differences in face and object cognition abilities. Data were recorded from N = 198 healthy young adults. Structural equation modeling (SEM) confirmed that the accuracies of face perception (FP) and memory are specific abilities above general object cognition; in contrast, the speed of face processing was not differentiable from the speed of object cognition. Although there was considerable domain-general variance in the N170 shared with the N1, there was significant face-specific variance in the N170. The brain-behavior relationship showed that faster face-specific processes for structural encoding of faces are associated with higher accuracy in both perceiving and memorizing faces. Moreover, in difficult task conditions, qualitatively different processes are additionally needed for recognizing face and object stimuli as compared with easy tasks. The difficulty-dependent variance components in the N170 amplitude were related with both face and object memory (OM) performance. We discuss implications for understanding individual differences in face cognition. Copyright © 2017 Elsevier Ltd. All rights reserved.
Girard, Todd A; Wilkins, Leanne K; Lyons, Kathleen M; Yang, Lixia; Christensen, Bruce K
Introduction Working-memory (WM) is a core cognitive deficit among individuals with Schizophrenia Spectrum Disorders (SSD). However, the underlying cognitive mechanisms of this deficit are less known. This study applies a modified version of the Corsi Block Test to investigate the role of proactive interference in visuospatial WM (VSWM) impairment in SSD. Methods Healthy and SSD participants completed a modified version of the Corsi Block Test involving both high (typical ascending set size from 4 to 7 items) and low (descending set size from 7 to 4 items) proactive interference conditions. Results The results confirmed that the SSD group performed worse overall relative to a healthy comparison group. More importantly, the SSD group demonstrated greater VSWM scores under low (Descending) versus high (Ascending) proactive interference; this pattern is opposite to that of healthy participants. Conclusions This differential pattern of performance supports that proactive interference associated with the traditional administration format contributes to VSWM impairment in SSD. Further research investigating associated neurocognitive mechanisms and the contribution of proactive interference across other domains of cognition in SSD is warranted.
De Boever, Wesley, E-mail: Wesley.email@example.com [UGCT/PProGRess, Dept. of Geology, Ghent University, Krijgslaan 281, 9000 Ghent (Belgium); Bultreys, Tom; Derluyn, Hannelore [UGCT/PProGRess, Dept. of Geology, Ghent University, Krijgslaan 281, 9000 Ghent (Belgium); Van Hoorebeke, Luc [UGCT/Radiation Physics, Dept. of Physics & Astronomy, Ghent University, Proeftuinstraat 86, 9000 Ghent (Belgium); Cnudde, Veerle [UGCT/PProGRess, Dept. of Geology, Ghent University, Krijgslaan 281, 9000 Ghent (Belgium)
In this paper, we examine the possibility to use on-site permeability measurements for cultural heritage applications as an alternative for traditional laboratory tests such as determination of the capillary absorption coefficient. These on-site measurements, performed with a portable air permeameter, were correlated with the pore network properties of eight sandstones and one granular limestone that are discussed in this paper. The network properties of the 9 materials tested in this study were obtained from micro-computed tomography (μCT) and compared to measurements and calculations of permeability and the capillary absorption rate of the stones under investigation, in order to find the correlation between pore network characteristics and fluid management characteristics of these sandstones. Results show a good correlation between capillary absorption, permeability and network properties, opening the possibility of using on-site permeability measurements as a standard method in cultural heritage applications. - Highlights: • Measurements of capillary absorption are compared to in-situ permeability. • We obtain pore size distribution and connectivity by using micro-CT. • These properties explain correlation between permeability and capillarity. • Correlation between both methods is good to excellent. • Permeability measurements could be a good alternative to capillarity measurement.
Nine methods for automated test construction are described. All are based on the concepts of information from item response theory. Two general kinds of methods for the construction of parallel tests are presented: (1) sequential test design; and (2) simultaneous test design. Sequential design implies that the tests are constructed one after the…
Doran, Jennifer M; Safran, Jeremy D; Muran, J Christopher
This study investigates the utility and psychometric properties of a new measure of psychotherapy process, the Alliance Negotiation Scale (ANS; Doran, Safran, Waizmann, Bolger, & Muran, 2012). The ANS was designed to operationalize the theoretical construct of negotiation (Safran & Muran, 2000), and to extend our current understanding of the working alliance concept (Bordin, 1979). The ANS was also intended to improve upon existing measures such as the Working Alliance Inventory (WAI; Horvath & Greenberg, 1986, 1989) and its short form (WAI-S; Tracey & Kokotovic, 1989) by expanding the emphasis on negative therapy process. The present study investigates the psychometric validity of the ANS test scores and interpretation-including confirming its original factor structure and evaluating its internal consistency and construct validity. Construct validity was examined through the ANS' convergence and divergence with several existing scales that measure theoretically related constructs. The results bolster and extend previous findings about the psychometric integrity of the ANS, and begin to illuminate the relationship between negotiation and other important variables in psychotherapy research. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Bielinski, John; Minnema, Jane; Thurlow, Martha
A Web-based survey of 25 experts in testing theory and large-scale assessment examined the utility of out-of-level testing for making decisions about students and schools. Survey respondents were given a series of scenarios and asked to judge the degree to which out-of-level testing would affect the reliability and validity of test scores within…
Osuji, Alice; Pharr, Jennifer R; Nwokoro, Uche; Ike, Anulika; Ali, Christiana; Ejiro, Ogheneaga; Osuyali, John; Obiefune, Michael; Fiscella, Kevin; Ezeanolue, Echezona E
Nigeria is second in the world for the number of people with HIV and has a high rate of mother-to-child transmission (MTCT). Over 60% of births in Nigeria occur outside of health care facilities, and because of this, Traditional Birth Attendants (TBAs) play a significant role in maternal and child health. It is important that TBAs be knowledgeable about HIV prevention. The purpose of this study was to determine the impact of HIV testing and counseling (HTC) knowledge on the HIV prevention practices among TBAs in Nigeria. Five hundred TBAs were surveyed. Chi-square and logistic regression were used to assess differences in HIV prevention practices between TBAs with and without HTC knowledge. TBAs with HTC knowledge are significantly more likely to engage in HIV prevention practices than TBAs without HTC. Prevention practices included: wearing gloves during delivery (p births occur outside health care facilities in Nigeria, there will be a need for TBAs. Providing TBAs with HTC training increases HIV prevention practices and can be a key to improve maternal and child health.
Vera Lúcia Marques de Figueiredo
psychometric properties of WISC-III items, specifically difficulty, discrimination and validity of items. Since WISC-III is a widely used test for the assessment of intelligence, knowledge on the quality of items of the test seems to be essential for professionals using it. Analyses were performed on data 801 protocols obtained in the adaptation research of a test within a Brazilian context. Analyses showed that the adapted items presented adequate psychometric features which make it a reliable diagnostic tool.
Collins, S; Stevenson, D; Walker, J; Bennett, A
To evaluate the usefulness of Legionella qPCR alongside traditional culture for enumeration of Legionella from water samples as part of both routine and public health investigation testing. Routine water samples (n = 2002) and samples from public health investigations (n = 215) were analysed by culture and qPCR for Legionella spp., Legionella pneumophila and L. pneumophila sg-1. A negative qPCR result was highly predictive of a negative culture result for all water systems (negative predictive values, NPV from 97·4 to 100%). Positive predictive values (PPV) were lower (0-50%). Results for qPCR were generally larger than culture with average log 10 differences of 1·1 for Legionella spp. and 1·2 for L. pneumophila. Alert and action levels of 1000 and 10 000 GU per litre, respectively, are proposed for Legionella qPCR for hot and cold water systems (HCWS). The use of qPCR significantly reduced the time to results for public health investigations by rapidly identifying potential sources and ruling out others, thus enabling a more rapid and efficient response. The high NPV of qPCR supports its use to rapidly screen out negative samples without culture. Alert and action levels for Legionella qPCR for HCWS are proposed. Quantitative PCR will be a valuable tool for both routine and public health testing. This study generated comparative data of >2000 water samples by qPCR and culture. Action and alert levels have been recommended that could enable duty holders to interpret qPCR results to facilitate timely Legionella control and public health protection. © 2017 Crown copyright. Journal of Applied Microbiology © 2017 The Society for Applied Microbiology.
Brennan, Alana T; Thea, Donald M; Semrau, Katherine; Goggin, Caitlin; Scott, Nancy; Pilingana, Portipher; Botha, Belinda; Mazimba, Arthur; Hamomba, Leoda; Seidenberg, Phil
Access to lifesaving prevention of mother-to-child transmission (PMTCT) services is problematic in rural Zambia. The simplest intervention used in Zambia has been 2-dose nevirapine (NVP) administration in the peripartum period, a regimen of 1 NVP tablet to the mother at the onset of labor and 1 dose in the form of syrup to the newborn within 4 to 72 hours after birth. This 2-dose regimen has been shown to reduce MTCT by nearly 50%. We set out to demonstrate that in-home HIV testing and NVP dosing by traditional birth attendants (TBAs) is feasible and acceptable by women in rural Zambia. This was a pilot program using TBAs to perform rapid saliva-based HIV testing and administer single-dose NVP in tablet form to the mother at the onset of labor and syrup to the infant after birth. A total of 280 pregnant women were consented and enrolled into the program, of whom 124 (44.3%) gave birth at home with the assistance of a trained TBA. Of those, 16 (12.9%) were known to be HIV positive, and 101 of the remaining 108 (93.5%) accepted a rapid HIV test. All these women tested HIV negative. In the subset of 16 mothers who were HIV positive, 13 (81.3%) took single-dose NVP administered by a TBA between 1 and 24 hours prior to birth and 100% of exposed newborns (16 of 16) received NVP syrup within 72 hours after birth, 80% of whom were dosed in the first 24 hours of life. With the substantial shortage of human resources in public health care throughout sub-Saharan Africa, it is extremely valuable to utilize lay health care workers to help extended services beyond the level of the facility. Given the high uptake of PMTCT services we believe that TBAs with proper training and support can successfully provide country-approved PMTCT. © 2013 by the American College of Nurse-Midwives.
Blom, Eva Henje; Bech, Per; Högberg, Göran
of two such scales, which may be used in a two-step screening procedure, the WHO-Five Well-being Index (WHO-5) and the six-item version of Beck's Depression Inventory (BDI-6). METHOD: 66 adolescent psychiatric patients with a clinical diagnosis of major depressive disorder (MDD), 60 girls and 6 boys......, aged 14--18 years, mean age 16.8 years, completed the WHO-5 scale as well as the BDI-6. Statistical validity was tested by Mokken and Rasch analyses. RESULTS: The correlation between WHO-5 and BDI-6 was -0.49 (p=0.0001). Mokken analyses showed a coefficient of homogeneity for the WHO-5 of 0.......52 and for the BDI-6 of 0.46. Rasch analysis also accepted unidimensionality when testing males versus females (p > 0.05). CONCLUSIONS: The WHO-5 is psychometrically valid in an adolescent psychiatric context including both genders to assess the wellness dimension and applicable as a first step in screening for MDD...
Barchard, Kimberly A.; Pace, Larry A.
Undergraduate psychometrics classes often use computer-intensive active learning projects. However, little research has examined active learning or computer-intensive projects in psychometrics courses. We describe two computer-intensive collaborative learning projects used to teach the design and evaluation of psychological tests. Course…
Thomas, Michael L.; Locke, Dona E. C.
The MMPI-2 Restructured Form (MMPI-2-RF; Tellegen & Ben-Porath, 2008) was designed to be psychometrically superior to its MMPI-2 counterpart. However, the test has yet to be extensively evaluated in diverse clinical settings. The purpose of this study was to examine the psychometric properties of the MMPI-2-RF Somatic Complaints (RC1) scale in…
Miller, Robert; Rammsayer, Thomas H.; Schweizer, Karl; Troche, Stefan J.
Several memory processes have been examined regarding their relation to psychometric intelligence with the exception of sensory memory. This study examined the relation between decay of iconic memory traces, measured with a partial-report task, and psychometric intelligence, assessed with the Berlin Intelligence Structure test, in 111…
Full Text Available Introduction Assessment of episodic memory has been traditionally used to evaluate potential cognitive impairments in senior adults. Typically, episodic memory evaluation is based on personal interviews and pen-and-paper tests. This article presents the design, development and a preliminary validation of a novel digital game to assess episodic memory intended to overcome the limitations of traditional methods, such as the cost of its administration, its intrusive character, the lack of early detection capabilities, the lack of ecological validity, the learning effect and the existence of confounding factors. Materials and Methods Our proposal is based on the gamification of the California Verbal Learning Test (CVLT and it has been designed to comply with the psychometric characteristics of reliability and validity. Two qualitative focus groups and a first pilot experiment were carried out to validate the proposal. Results A more ecological, non-intrusive and better administrable tool to perform cognitive assessment was developed. Initial evidence from the focus groups and pilot experiment confirmed the developed game’s usability and offered promising results insofar its psychometric validity is concerned. Moreover, the potential of this game for the cognitive classification of senior adults was confirmed, and administration time is dramatically reduced with respect to pen-and-paper tests. Limitations Additional research is needed to improve the resolution of the game for the identification of specific cognitive impairments, as well as to achieve a complete validation of the psychometric properties of the digital game. Conclusion Initial evidence show that serious games can be used as an instrument to assess the cognitive status of senior adults, and even to predict the onset of mild cognitive impairments or Alzheimer’s disease.
Valladares-Rodriguez, Sonia; Perez-Rodriguez, Roberto; Facal, David; Fernandez-Iglesias, Manuel J; Anido-Rifon, Luis; Mouriño-Garcia, Marcos
Assessment of episodic memory has been traditionally used to evaluate potential cognitive impairments in senior adults. Typically, episodic memory evaluation is based on personal interviews and pen-and-paper tests. This article presents the design, development and a preliminary validation of a novel digital game to assess episodic memory intended to overcome the limitations of traditional methods, such as the cost of its administration, its intrusive character, the lack of early detection capabilities, the lack of ecological validity, the learning effect and the existence of confounding factors. Our proposal is based on the gamification of the California Verbal Learning Test (CVLT) and it has been designed to comply with the psychometric characteristics of reliability and validity. Two qualitative focus groups and a first pilot experiment were carried out to validate the proposal. A more ecological, non-intrusive and better administrable tool to perform cognitive assessment was developed. Initial evidence from the focus groups and pilot experiment confirmed the developed game's usability and offered promising results insofar its psychometric validity is concerned. Moreover, the potential of this game for the cognitive classification of senior adults was confirmed, and administration time is dramatically reduced with respect to pen-and-paper tests. Additional research is needed to improve the resolution of the game for the identification of specific cognitive impairments, as well as to achieve a complete validation of the psychometric properties of the digital game. Initial evidence show that serious games can be used as an instrument to assess the cognitive status of senior adults, and even to predict the onset of mild cognitive impairments or Alzheimer's disease.
Polyak, Stephen T.; von Davier, Alina A.; Peterschmidt, Kurt
This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses. PMID:29238314
Stephen T. Polyak
Full Text Available This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses.
Polyak, Stephen T; von Davier, Alina A; Peterschmidt, Kurt
This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses.
Maza, Paul Sadiri
In recent years, technological advances such as computers have been employed in teaching gross anatomy at all levels of education, even in professional schools such as medical and veterinary medical colleges. Benefits of computer based instructional tools for gross anatomy include the convenience of not having to physically view or dissect a cadaver. Anatomy educators debate over the advantages versus the disadvantages of computer based resources for gross anatomy instruction. Many studies, case reports, and editorials argue for the increased use of computer based anatomy educational tools, while others discuss the necessity of dissection for various reasons important in learning anatomy, such as a three-dimensional physical view of the specimen, physical handling of tissues, interactions with fellow students during dissection, and differences between specific specimens. While many articles deal with gross anatomy education using computers, there seems to be a lack of studies investigating the use of computer based resources as an assessment tool for gross anatomy, specifically using the Apple application QuickTime Virtual Reality (QTVR). This study investigated the use of QTVR movie modules to assess if using computer based QTVR movie module assessments were equal in quality to actual physical specimen examinations. A gross anatomy course in the College of Veterinary Medicine at Cornell University was used as a source of anatomy students and gross anatomy examinations. Two groups were compared, one group taking gross anatomy examinations in a traditional manner, by viewing actual physical specimens and answering questions based on those specimens. The other group took the same examinations using the same specimens, but the specimens were viewed as simulated three-dimensional objects in a QTVR movie module. Sample group means for the assessments were compared. A survey was also administered asking students' perceptions of quality and user-friendliness of the QTVR
Full Text Available Recent work in animals suggests that the extent of early tactile stimulation by parents of offspring is an important element in early caregiving. We evaluate the psychometric properties of a new parent-report measure designed to assess frequency of tactile stimulation across multiple caregiving domains in infancy. We describe the full item set of the Parent-Infant Caregiving Touch Scale (PICTS and, using data from a UK longitudinal Child Health and Development Study, the response frequencies and factor structure and whether it was invariant over two time points in early development (5 and 9 weeks. When their infant was 9 weeks old, 838 mothers responded on the PICTS while a stratified subsample of 268 mothers completed PICTS at an earlier 5 week old assessment (229 responded on both occasions. Three PICTS factors were identified reflecting stroking, holding and affective communication. These were moderately to strongly correlated at each of the two time points of interest and were unrelated to, and therefore distinct from, a traditional measure of maternal sensitivity at 7-months. A wholly stable psychometry over 5 and 9-week assessments was not identified which suggests that behavior profiles differ slightly for younger and older infants. Tests of measurement invariance demonstrated that all three factors are characterized by full configural and metric invariance, as well as a moderate degree of evidence of scalar invariance for the stroking factor. We propose the PICTS as a valuable new measure of important aspects of caregiving in infancy.
Activities of daily living (ADL), such as walking, often involve the added complexity of walking while doing other activities (i.e. dual task walking). A complex walking task may require a greater motor and mental capacity, resulting in decrements in gait performance not seen for simple walking tasks. The purpose of this study was to determine if the trail walking test (TWT), the mobile adaptation of the trail making test (TMT), could be a reliable and valid early detection tool to discriminate between non-fallers and fallers. This study examined dual task costs of a cognitive and a sensorimotor task (walking) in 94 older adults aged 50-81 years (average age M = 67.4 years, SD ± 7.34). Based on the idea of the paper and pencil TMT, participants walked along a fixed pathway (TWT-1), stepped on targets with increasing sequential numbers (i.e. 1, 2, 3, TWT-2), and increasing sequential numbers and letters (i.e. 1, A, 2, B, 3, C, TWT-3). The dual task costs were calculated for each task. Additionally, the following tests were conducted: TMT, block tapping test (BTT), timed up and go (TUG) test, 30s chair rising test, 10 m walking time test with and without head turns, German physical activity questionnaire (German PAQ-50 +) and the activities-specific balance confidence (ABC-D) scale. The TWT performance times as well as errors increased with increasing age. Reliability coefficients were high (interclass correlation ICC > 0.90). Correlations between the different TWT conditions and potential falls-related predictors were moderate to high (r = -0.430 to 0.699). Of the participants 34 % reported falling in the past year. The stepwise logistic regression analysis revealed that the dual task costs for the numbers and letters (odds ratio OR 1.162, 95 % confidence interval CI 1.058-1.277, p = 0.002), the ABC-D (OR 0.767, 95 % CI 0.651-0.904, p = 0.002) and exercise (OR 1.027, 95 % CI 1.008-1.046, p = 0.006) were significantly related to
Muis, Krista R; Winne, Philip H; Edwards, Ordene V
A program of research is needed that assesses the psychometric properties of instruments designed to quantify students' achievement goal orientations to clarify inconsistencies across previous studies and to provide a stronger basis for future research. We conducted traditional psychometric and modern Rasch-model analyses of the Achievement Goals Questionnaire (AGQ, Elliot & McGregor, 2001) and the Patterns of Adaptive Learning Scale (PALS, Midgley et al., 2000) to provide an in-depth analysis of the two most popular instruments in educational psychology. For Study 1, 217 undergraduate students enrolled in educational psychology courses participated. Thirty-four were male and 181 were female (two did not respond). Participants completed the AGQ in the context of their educational psychology class. For Study 2, 126 undergraduate students enrolled in educational psychology courses participated. Thirty were male and 95 were female (one did not respond). Participants completed the PALS in the context of their educational psychology class. Traditional psychometric assessments of the AGQ and PALS replicated previous studies. For both, reliability estimates ranged from good to very good for raw subscale scores and fit for the models of goal orientations were good. Based on traditional psychometrics, the AGQ and PALS are valid and reliable indicators of achievement goals. Rasch analyses revealed that estimates of reliability for items were very good but respondent ability estimates varied from poor to good for both the AGQ and PALS. These findings indicate that items validly and reliably reflect a group's aggregate goal orientation, but using either instrument to characterize an individual's goal orientation is hazardous.
Watson, Shaun D.; Gomez, Rapson; Gullone, Eleonora
This study examined various psychometric properties of the items comprising the shame and guilt scales of the Test of Self-Conscious Affect-Adolescent (TOSCA-A) in a group children between 8 and 11 years of age. A total of 699 children (367 females and 332 males) completed these scales, and also measures of depression and empathy. Confirmatory factor analysis (CFA) provided support for an oblique two-factor model, with the originally proposed shame and guilt items comprising shame and guilt factors, respectively. There was good internal consistency reliability for the shame and guilt scales, with omega coefficient values of 0.77 and 0.81 for shame and guilt, respectively. Also, shame correlated with depression symptoms positively (0.34, p Guilt correlated with depression symptoms negatively (-0.28, p guilt factors. Multiple-group CFA comparing this group of children with a separate group of adolescents (320 females and 242 males), based on the chi-square difference test, supported full metric invariance, the intercept invariance of 17 of the 30 shame and guilt items, and higher latent mean scores among children for both shame and guilt. The non-equivalency for intercepts and mean scores were of small effect sizes. Comparisons based on the difference in root mean squared error of approximation values supported full measurement invariance and no group difference for latent mean scores. The findings in the current study support the use of the TOSCA-A in children and the valid comparison of scores between children and adolescents, thereby opening up the possibility of evaluating change in the TOSCA-A shame and guilt factors over these developmental age groups. PMID:27242573
Clemens, Sheila M; Gailey, Robert S; Bennett, Christopher L; Pasquina, Paul F; Kirk-Sanchez, Neva J; Gaunaurd, Ignacio A
Using a custom mobile application to evaluate the reliability and validity of the Component Timed-Up-and-Go test to assess prosthetic mobility in people with lower limb amputation. Cross-sectional design. National conference for people with limb loss. A total of 118 people with non-vascular cause of lower limb amputation participated. Subjects had a mean age of 48 (±13.7) years and were an average of 10 years post amputation. Of them, 54% ( n = 64) of subjects were male. None. The Component Timed-Up-and-Go was administered using a mobile iPad application, generating a total time to complete the test and five component times capturing each subtask (sit to stand transitions, linear gait, turning) of the standard timed-up-and-go test. The outcome underwent test-retest reliability using intraclass correlation coefficients (ICCs) and convergent validity analyses through correlation with self-report measures of balance and mobility. The Component Timed-Up-and-Go exhibited excellent test-retest reliability with ICCs ranging from .98 to .86 for total and component times. Evidence of discriminative validity resulted from significant differences in mean total times between people with transtibial (10.1 (SD: ±2.3)) and transfemoral (12.76 (SD: ±5.1) amputation, as well as significant differences in all five component times ( P < .05). Convergent validity of the Component Timed-Up-and-Go was demonstrated through moderate correlations with the PLUS-M ( r s = -.56). The Component Timed-Up-and-Go is a reliable and valid clinical tool for detailed assessment of prosthetic mobility in people with non-vascular lower limb amputation. The iPad application provided a means to easily record data, contributing to clinical utility.
Aim: To construct normal values for the tests of the psychometric hepatic encephalopathy score (PHES) and evaluate the prevalence of minimal hepatic encephalopathy (MHE) among Turkish patients with liver cirrhosis. Materials and Methods: One hundred and eighty-five healthy subjects and sixty patients with liver ...
Burger, Helena; Franchignoni, Franco; Puzic, Natasa; Giordano, Andrea
The objective of this study was to evaluate by means of classical test theory and Rasch analysis the scaling characteristics and psychometric properties of the Fatigue Severity Scale (FSS) in polio survivors. A questionnaire, consisting of five general questions (sex, age, age at time of acute polio, sequelae of polio, and new symptoms), the FSS,…
Slaney, Kathleen L.; Tkatchouk, Masha; Gabriel, Stephanie M.; Maraun, Michael D.
The aim of the current study is twofold: (a) to investigate the rates at which researchers assess and report on the psychometric properties of the measures they use in their research and (b) to examine whether or not researchers appear to be generally employing sound/unsound rationales when it comes to how they conduct test evaluations. Based on a…
Monsen, Jeremy J.; Ewing, Donna L.; Boyle, James
This paper presents the psychometric properties of a questionnaire measure that updates and extends Larrivee and Cook's (1979) Opinions Relative to Mainstreaming Scale in terms of structure, terminology, and language. The revised scale was tested using a sample of 106 teachers based in inclusive mainstream schools. Using Principal Component…
Full Text Available Abstract Background No control tools for nasal congestion (NC are currently available in Spanish. This study aimed to adapt and validate the Congestion Quantifier Seven Item Test (CQ7 for Spain. Methods CQ7 was adapted from English following international guidelines. The instrument was validated in an observational, prospective study in allergic rhinitis patients with NC (N = 166 and a control group without NC (N = 35. Participants completed the CQ7, MOS sleep questionnaire, and a measure of psychological well-being (PGWBI. Clinical data included NC severity rating, acoustic rhinometry, and total symptom score (TSS. Internal consistency was assessed using Cronbach's alpha and test-retest reliability using the intraclass correlation coefficient (ICC. Construct validity was tested by examining correlations with other outcome measures and ability to discriminate between groups classified by NC severity. Sensitivity and specificity were assessed using Area under the Receiver Operating Curve (AUC and responsiveness over time using effect sizes (ES. Results Cronbach's alpha for the CQ7 was 0.92, and the ICC was 0.81, indicating good reliability. CQ7 correlated most strongly with the TSS (r = 0.60, p Conclusions The Spanish version of the CQ7 is appropriate for detecting, measuring, and monitoring NC in allergic rhinitis patients.
Khanjari, Sedigheh; Oskouie, Fatemeh; Langius-Eklöf, Ann
To translate and test the reliability and validity of the Persian version of the Caregiver Quality of Life Index-Cancer scale. Research across many countries has determined quality of life of cancer patients, but few attempts have been made to measure the quality of life of family caregivers of patients with breast cancer. The Caregiver Quality of Life Index-Cancer scale was developed for this purpose, but until now, it has not been translated into or tested in the Persian language. Methodological research design. After standard translation, the 35-item Caregiver Quality of Life Index-Cancer scale was administered to 166 Iranian family caregivers of patients with breast cancer. A confirmatory factor analysis was carried out using LISREL to test the scale's construct validity. Further, the internal consistency and convergent validity of the instrument were tested. For convergent validity, four instruments were used in the study: sense of coherence scale, spirituality perspective scale, health index and brief religious coping scale. The confirmatory factor analysis resulted in the same four-factor structure as the original, though, with somewhat different item loadings. The Persian version of the Caregiver Quality of Life Index-Cancer scales had satisfactory internal consistency (0·72-0·90). Tests of convergent validity showed that all hypotheses were confirmed. A hierarchical multiple regression analysis additionally confirmed the convergent validity between the total Caregiver Quality of Life Index-Cancer score and sense of coherence (β = 0·34), negative religious coping (β = -0·21), education (β = 0·24) and the more severe stage of breast cancer (β = 0·23), in total explaining 41% of the variance. The Persian version of the Caregiver Quality of Life Index-Cancer scale could be a reliable and valid measure in Iranian family caregivers of patients with breast cancer. The Persian version of the Caregiver Quality of Life Index-Cancer scale is simple to
Feinberg, Richard A.; Rubright, Jonathan D.
Simulation studies are fundamental to psychometric discourse and play a crucial role in operational and academic research. Yet, resources for psychometricians interested in conducting simulations are scarce. This Instructional Topics in Educational Measurement Series (ITEMS) module is meant to address this deficiency by providing a comprehensive…
Clark, Khaya D; Yovanoff, Paul; Tate, Charlotte Ursula
The Racial Attitudes Index (RAI) measures a child's racial attitudes. Designed for children aged 5-9 years, the RAI is delivered over the Internet using Audio Computer Assisted Self-Interviewing (ACASI). Unlike traditional binary forced-choice instruments, the RAI uses an expanded response format permitting a more nuanced understanding of patterns of children's racial attitudes. In addition to establishing psychometric evidence of the RAI technical adequacy, hypotheses about RAI item response patterns were tested. The racial attitudes of 336 Black and White children in grades K-3 were assessed using a forced-choice instrument (Preschool Racial Attitudes Measure II) and the RAI. Findings from this study indicate measures obtained with the RAI are technically adequate, and the measure functions invariantly across racial groups. Also, patterns of children's racial attitudes measured with the RAI are more nuanced than those obtained using the forced-choice response format.
Yang, Luke; Liu, Yung-Fang; Sun, Huey-Fang; Chiang, Hsien-Hsien; Tsai, Yu-Lun; Liaw, Jen-Jiuan
The study purpose was to examine the validities and reliabilities of the Chinese-versions Frommelt Attitudes Toward Care of the Dying Scale (Attitudes Scale) and Caregiving Behaviors Scale for End-of-Life Patients and Families (Behaviors Scale). The scales were tested in a convenience sample of 318 nurses with ≥6 months work experience at three hospitals. Cronbach's alphas of the Attitudes and Behaviors Scales were .90 and .96, respectively. Each scale had Kaiser-Meyer-Olkin index >.85 and Bartlett's test of sphericity >4000 ( p < .001). Attitudes Scale loaded on three factors: respecting and caring for dying patients and families, avoiding care of the dying, and involving patients and families in end-of-life care. The Behaviors Scale loaded on two factors: supporting dying patients and families, and helping families cope with grief. Factor loadings for both scales were ≥.49. Both Attitudes and Behaviors Scales are reliable and valid for evaluating nurses' attitudes and caregiving behaviors for the dying.
Radley, S C; Jones, G L; Tanguy, E A; Stevens, V G; Nelson, C; Mathers, N J
To develop and evaluate a Web-based, electronic pelvic floor symptoms assessment questionnaire (e-PAQ)1 for women. A cross-sectional study in primary and secondary care. Two general practices, two community health clinics and a secondary care urogynaecology clinic. A total of 432 women (204 in primary care and 228 in secondary care) were recruited between June 2003 and January 2004. The e-PAQ was located on a workstation (computer, touchscreen and printer). Women completed the e-PAQ prior to their appointment. Untreated women in primary care were asked to return seven days later to complete the e-PAQ a second time (test-retest). Factor analysis, reliability, validity, patient satisfaction, completion times and system costs. In secondary care, factor analysis identified 14 domains within the four dimensions (urinary, bowel, vaginal and sexual symptoms) with internal consistency (Cronbach's alpha)>or=0.7 in 11 of these. In primary care, alpha values were all>or=0.7 and test-retest analysis found acceptable intraclass correlations of 0.50-0.95 (PPAQ offers a user-friendly clinical tool, which provides valid and reliable data. The system offers comprehensive symptoms and quality of life evaluation and may enhance the clinical episode as well as the quality of care for women with pelvic floor disorders.
Santo, Ruth Miyuki; Ribeiro-Ferreira, Felipe; Alves, Milton Ruiz; Epstein, Jonathan; Novaes, Priscila
To provide a reliable, validated, and culturally adapted instrument that may be used in monitoring dry eye in Brazilian patients and to discuss the strategies for the enhancement of the cross-cultural adaptation and validation process of a self-report measure for dry eye. The cross-cultural adaptation process (CCAP) of the original Ocular Surface Disease Index (OSDI) into Brazilian-Portuguese was conducted using a 9-step guideline. The synthesis of translations was tested twice, for face and content validity, by different subjects (focus groups and cognitive interviews). The expert committee contributed on several steps, and back translations were based on the final rather than the prefinal version. For validation, the adapted version was applied in a prospective longitudinal study to 101 patients from the Dry Eye Clinic at the General Hospital of the University of São Paulo, Brazil. Simultaneously to the OSDI, patients answered the short form-36 health survey (SF-36) and the 25-item visual function questionnaire (VFQ-25) and underwent clinical evaluation. Internal consistency, test-retest reliability, and measure validity were assessed. Cronbach's alpha value of the cross-culturally adapted Brazilian-Portuguese version of the OSDI was 0.905, and the intraclass correlation coefficient was 0.801. There was a statistically significant difference between OSDI scores in patients with dry eye (41.15 ± 27.40) and without dry eye (17.88 ± 17.09). There was a negative association between OSDI and VFQ-25 total score (P adaptation process requires skill, knowledge, experience, and a considerable investment of time to maximize the attainment of semantic, idiomatic, experiential, and conceptual equivalence between the source and target questionnaires. A well-established guideline resulted in a culturally adapted Brazilian-Portuguese version of the OSDI, tested and validated on a sample of Brazilian population, and proved to be a valid and reliable instrument for assessing
Vescia Vieira de Alencar Caldas
Full Text Available OBJECTIVE: To validate the Leganés cognitive test (LCT for cognitive screening in low educated elderly Brazilians. METHODS: The study sample was composed of 59 elderly residents from the city of Santa Cruz, in Brazil, with low schooling levels. Reliability was analyzed with a two-day interval between assessments, and concurrent validity was assessed using the Mini Mental State Examination (MMSE. RESULTS: According to the LCT, the prevalence of dementia was 11.8%. The scale items showed a moderate to strong correlation between domains (p<0.01, and inter-rater reliability exhibited ICC=0.81, 95%CI=0.72-0.88. The factor analysis resulted in two factors: memory and orientation. Interscale agreement was considered poor (k=-0.02, supporting the hypothesis of an educational impact on final MMSE scores. CONCLUSION: The results suggest that LCT has acceptable levels of reliability for use in low-educated Brazilian elderly.
Full Text Available Abstract Background Aging entails not only a decrease in the ability to be active, but also a trend toward increased dependence to sustain basic life functions. An important aspect for appropriately elucidating the individual's care needs is the ability to measure them both simply and reliably. Since 2006 a new version of the Time in Care needs (TIC-n instrument (19-item version has been explored and used in one additional municipality with the same structure as the one described in an earlier study. Methods The TIC-n assessment was conducted on a total of 1282 care recipients. Factor analysis (principal component was applied to explore the construct validity of the TIC-n. Cronbach's alpha was calculated to test reliability and for each of the items remaining in the instrument after factor analysis, an inter-rater comparison was carried out on all recipients in both municipalities. Independently of each other, a weighted Kappa (Kw was calculated. Results. The mean of each weighted Kappa (Kw for the dimensions in the two municipalities was 0.75 and 0.76, respectively. Factor analysis showed that all 19 items had a factor loading of ≥ 0.40. Three factors (General Care, Medical Care and Cognitive Care were created. Conclusion The TIC-n instrument has now been tested for validity and reliability in two municipalities with satisfactory results. However, TIC-n can not yet be used as a golden standard, but it can be recommended for use of measurement of individual care needs in municipal elderly care.
Igwesi-Chidobe, Chinonso N; Obiekwe, Chinwe; Sorinola, Isaac O; Godfrey, Emma L
Cross-culturally adapt and validate the Igbo Roland Morris Disability Questionnaire. Cross-cultural adaptation, test-retest, and cross-sectional psychometric testing. Roland Morris Disability Questionnaire was forward and back translated by clinical/non-clinical translators. An expert committee appraised the translations. Twelve participants with chronic low back pain pre-tested the measure in a rural Nigerian community. Internal consistency using Cronbach's alpha; test-retest reliability using intra-class correlation coefficient and Bland-Altman plot; and minimal detectable change were investigated in a convenient sample of 50 people with chronic low back pain in rural and urban Nigeria. Pearson's correlation analyses using the eleven-point box scale and back performance scale, and exploratory factor analysis were used to examine construct validity in a random sample of 200 adults with chronic low back pain in rural Nigeria. Ceiling and floor effects were investigated in the two samples. Modifications gave the option of interviewer-administration and reflected Nigerian social context. The measure had excellent internal consistency (α = 0.91) and intraclass correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity, and a predominant uni-dimensional structure, with no ceiling or floor effects. Igbo Roland Morris Disability Questionnaire is a valid and reliable measure of pain-related disability. Implications for rehabilitation Low back pain is the leading cause of years lived with disability worldwide, and is particularly prevalent in rural Nigeria, but there are no self-report measures to assess its impact due to low literacy rates. This study describes the cross-cultural adaptation and validation of a core self-report back pain specific disability measure in a low-literate Nigerian population. The Igbo Roland Morris Disability Questionnaire is a reliable and valid measure of self
Lehmann, V.; Ouwens, M.A.; Braeken, J.; Danner, U.N.; van Elburg, A.A.; Bekker, M.H.J.; Breurkens, A.; van Strien, T.
The psychometric properties of the Dutch version of the Eating Disorder Inventory–3 (EDI-3) were tested in eating disordered patients (N = 514) using confirmatory factor analyses, variance decomposition, reliabilities, and receiver operating characteristic (ROC) curve analyses. Factorial validity
Vickers, Andrew J; Chen, Ling Y
New technologies to collect patient - reported outcomes have substantially solved the challenge of integrating a questionnaire in a busy clinical practice. At Memorial Sloan Kettering, we have been collecting patient reported outcomes electronically for many years. Our experience confirms the predicted benefits of obtaining patient reported outcomes but has also raised serious concerns about whether instruments developed for the research setting are appropriate for routine clinical use. We summarize four principles for a clinically - relevant psychometrics. First, minimize patient burden: the use of a large number of items for a single domain may be of interest for research but additional items have little clinical utility. Secondly, use simplified language: patients who do not have good language skills are typically excluded from research studies but will nonetheless present in clinical practice. Third, avoid dumb questions: many questionnaire items are inappropriate when applied to a more general population. Fourth, what works for the group may not work for the individual: group level statistics used to validate survey instruments can obscure problems when applied to a subgroup of patients. There is a need for a clinically-oriented psychometrics to help design, test, and evaluate questionnaires that would be used in routine practice. Developing statistical methods to optimize questionnaires will be highly challenging but needed to bring the potential of patient reported outcomes into widespread clinical use.
Kemp, Joanne L; Collins, Natalie J; Roos, Ewa M.
Patient-reported outcomes (PROs) are considered the gold standard when evaluating outcomes in a surgical population. While the psychometric properties of some PROs have been tested, the properties of newer PROs in patients undergoing hip arthroscopic surgery remain somewhat unknown.......Patient-reported outcomes (PROs) are considered the gold standard when evaluating outcomes in a surgical population. While the psychometric properties of some PROs have been tested, the properties of newer PROs in patients undergoing hip arthroscopic surgery remain somewhat unknown....
Moazami, Fariborz; Bahrampour, Ehsan; Azar, Mohammad Reza; Jahedi, Farzad; Moattari, Marzieh
The importance of using technologies such as e-learning in different disciplines is discussed in the literature. Researchers have measured the effectiveness of e-learning in a number of fields.Considering the lack of research on the effectiveness of online learning in dental education particularly in Iran, the advantages of these learning methods and the positive university atmosphere regarding the use of online learning. This study, therefore, aims to compare the effects of two methods of teaching (virtual versus traditional) on student learning. This post-test only design study approached 40, fifth year dental students of Shiraz University of Medical Sciences. From this group, 35 students agreed to participate. These students were randomly allocated into two groups, experimental (virtual learning) and comparison (traditional learning). To ensure similarity between groups, we compared GPAs of all participants by the Mann-Whitney U test (P > 0.05). The experimental group received a virtual learning environment courseware package specifically designed for this study, whereas the control group received the same module structured in a traditional lecture form. The virtual learning environment consisted of online and offline materials. Two identical valid, reliable post-tests that consisted of 40 multiple choice questions (MCQs) and 4 essay questions were administered immediately (15 min) after the last session and two months later to assess for knowledge retention. Data were analyzed by SPSS version 20. A comparison of the mean knowledge score of both groups showed that virtual learning was more effective than traditional learning (effect size = 0.69). The newly designed virtual learning package is feasible and will result in more effective learning in comparison with lecture-based training. However further studies are needed to generalize the findings of this study.
May 1, 2018 ... (HCEQ) among Iranian Reproductive Age Women: Persian Version .... satisfaction, improved interaction with others and .... Private employee 75 (13.7) ... Married. 526 (95.8). Unmarried. 23 (4.2). Frequency of receiving.
Aronow, E; Reznikoff, M; Moreland, K L
Various approaches to the Rorschach Technique are described in terms of the idiographic-nomothetic axis and the perceptual-content axis. It is suggested that it is most productive to view the Rorschach as a projective tool, with perceptual scoring a secondary factor. Current efforts at objectification of the Rorschach are not seen as useful as efforts to enhance its projective qualities. Some possible ways are discussed in which the projective value of the instrument can be maximized.
Various studies have been carried out on emotional intelligence. But few of these studies have been able to link emotional intelligence to creativity, innovation and entrepreneurship. Importantly, the fact that creativity, innovation and entrepreneurship are strongly driven by positive emotions cannot be ignored. This study ...
Durak, Mithat; Senol-Durak, Emre; Gencoz, Tulin
This study aims to extensively examine the psychometric properties of adapted version of the Satisfaction with Life Scale (SWLS) in different Turkish samples. In order to test the psychometric properties of the SWLS three separate and independent samples are utilized in this study, namely university students (n = 547), correctional officers (n =…
Richardson, George B; Sanning, Blair K; Lai, Mark H C; Copping, Lee T; Hardesty, Patrick H; Kruger, Daniel J
This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS), provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a) use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b) confirm the dimensionality of life history indicators, and (c) establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested-there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
George B. Richardson
Full Text Available This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS, provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b confirm the dimensionality of life history indicators, and (c establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested—there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
Epskamp, S.; Rhemtulla, M.; Borsboom, D.
We introduce the network model as a formal psychometric model, conceptualizing the covariance between psychometric indicators as resulting from pairwise interactions between observable variables in a network structure. This contrasts with standard psychometric models, in which the covariance between
Practitioners in health sciences education and assessment regularly use a range of psychometric techniques to analyse data, evaluate models, and make crucial progression decisions regarding student learning. However, a recent editorial entitled "Is Psychometrics Science?" highlighted some core epistemological and practical problems in psychometrics, and brought its legitimacy into question. This paper attempts to address these issues by applying some key ideas from history and philosophy of science (HPS) discourse. I present some of the conceptual developments in HPS that have bearing on the psychometrics debate. Next, by shifting the focus onto what constitutes the practice of science, I discuss psychometrics in action. Some incorrectly conceptualize science as an assemblage of truths, rather than an assemblage of tools and goals. Psychometrics, however, seems to be an assemblage of methods and techniques. Psychometrics in action represents a range of practices using specific tools in specific contexts. This does not render the practice of psychometrics meaningless or futile. Engaging in debates about whether or not we should regard psychometrics as 'scientific' is, however, a fruitless enterprise. The key question and focus should be whether, on what grounds, and in what contexts, the existing methods and techniques used by psychometricians can be justified or criticized.
Oliveira, Jorge; Gamito, Pedro; Alghazzawi, Daniyal M; Fardoun, Habib M; Rosa, Pedro J; Sousa, Tatiana; Picareli, Luís Felipe; Morais, Diogo; Lopes, Paulo
This investigation sought to understand whether performance in naturalistic virtual reality tasks for cognitive assessment relates to the cognitive domains that are supposed to be measured. The Shoe Closet Test (SCT) was developed based on a simple visual search task involving attention skills, in which participants have to match each pair of shoes with the colors of the compartments in a virtual shoe closet. The interaction within the virtual environment was made using the Microsoft Kinect. The measures consisted of concurrent paper-and-pencil neurocognitive tests for global cognitive functioning, executive functions, attention, psychomotor ability, and the outcomes of the SCT. The results showed that the SCT correlated with global cognitive performance as measured with the Montreal Cognitive Assessment (MoCA). The SCT explained one third of the total variance of this test and revealed good sensitivity and specificity in discriminating scores below one standard deviation in this screening tool. These findings suggest that performance of such functional tasks involves a broad range of cognitive processes that are associated with global cognitive functioning and that may be difficult to isolate through paper-and-pencil neurocognitive tests.
Full Text Available The purpose of this study is to examine the effects of blended e-learning on electrical machinery performance (achievement test and self-assessment. Participants were two classes of 11th graders majoring in electrical engineering and taking the electrical machinery class at a vocational high school in Taiwan. The participants were randomly selected and assigned to either the experimental group (n = 33 which studied through blended e-learning or the control group (n = 32 which studied through traditional classroom learning. The experiment lasted for five weeks. The results showed that (a there were no significant differences in achievement test scores between blended e-learning and traditional learning; (b students in the experimental group obtained significantly higher scores on self-assessment than students in the control group; (c students’ scores on self-assessment were significantly higher after studying through blended e-learning than before. Overall, blended e-learning did not significantly affect students’ achievement test scores, but significantly affected their self-assessment scores.
Suliane Beatriz Rauber
Full Text Available The present study aimed to verify if blood pressure (BP reactivity could be reduced through a previous single session of active playing when compared to sedentary leisure. Sixteen pre-pubertal healthy children participated in this study. After familiarization with procedures and anthropometric evaluation, participants performed three sessions in randomized order: 1 30 min of traditional Brazilian games (PLAY; 2 30 min of video game playing (DDR; and 3 30 min of watching TV (TV. Each session lasted ~80 minutes, being 10 min of rest; 30 min of intervention activity; and 40 min of recovery. After recovery, the Cold Pressor Test (CPT was used for the assessment of acute cardiovascular reactivity. Blood Pressure (BP was recorded at 30 s and 1 min during the CPT. Analysis of variance showed post-exercise hypotension (PEH only after PLAY, and that systolic and diastolic BP were significantly increased in all conditions during CPT. However, the magnitude of the CPT-induced blood pressure response was significantly less in PLAY compared to DDR and TV. The PEH observed during recovery and the reduced BP response to CPT following playing traditional games may be due its higher cardiovascular and metabolic demand as was indicated by the increased heart rate, oxygen consumption, and blood pressure. It was concluded that BP reactivity to stress may be reduced through a previous single session of traditional games and that PEH was recorded only after this exercise form. This benefit indicates a potential role of playing strategies for cardiovascular health in childhood.
Rauber, Suliane B; Boullosa, Daniel A; Carvalho, Ferdinando O; de Moraes, José F V N; de Sousa, Ioranny R C; Simões, Herbert G; Campbell, Carmen S G
The present study aimed to verify if blood pressure (BP) reactivity could be reduced through a previous single session of active playing when compared to sedentary leisure. Sixteen pre-pubertal healthy children participated in this study. After familiarization with procedures and anthropometric evaluation, participants performed three sessions in randomized order: (1) 30 min of traditional Brazilian games (PLAY); (2) 30 min of video game playing (DDR); and (3) 30 min of watching TV (TV). Each session lasted 80 min, being 10 min of rest; 30 min of intervention activity; and 40 min of recovery. After recovery, the Cold Pressor Test (CPT) was used for the assessment of acute cardiovascular reactivity. BP was recorded at 30 s and 1 min during the CPT. Analysis of variance showed post-exercise hypotension (PEH) only after PLAY, and that systolic and diastolic BP were significantly increased in all conditions during CPT. However, the magnitude of the CPT-induced BP response was significantly less in PLAY compared to DDR and TV. The PEH observed during recovery and the reduced BP response to CPT following playing traditional games may be due its higher cardiovascular and metabolic demand as was indicated by the increased heart rate, oxygen consumption, and BP. It was concluded that BP reactivity to stress may be reduced through a previous single session of traditional games and that PEH was recorded only after this exercise form. This benefit indicates a potential role of playing strategies for cardiovascular health in childhood.
Zenhong, C.; Buwalda, P.L.
Chinese dumplings such as Jiao Zi and Bao Zi are two of the popular traditional foods in Asia. They are usually made from wheat flour dough (rice flour or starch is sometimes used) that contains fillings. They can be steamed, boiled and fried and are consumed either as a main meal or dessert. As
Rodríguez-Rosell, David; Mora-Custodio, Ricardo; Franco-Márquez, Felipe; Yáñez-García, Juan M; González-Badillo, Juan J
Rodríguez-Rosell, D, Mora-Custodio, R, Franco-Márquez, F, Yáñez-García, JM, González-Badillo, JJ. Traditional vs. sport-specific vertical jump tests: reliability, validity, and relationship with the legs strength and sprint performance in adult and teen soccer and basketball players. J Strength Cond Res 31(1): 196-206, 2017-The vertical jump is considered an essential motor skill in many team sports. Many protocols have been used to assess vertical jump ability. However, controversy regarding test selection still exists based on the reliability and specificity of the tests. The main aim of this study was to analyze the reliability and validity of 2 standardized (countermovement jump [CMJ] and Abalakov jump [AJ]) and 2 sport-specific (run-up with 2 [2-LEGS] or 1 leg [1-LEG] take-off jump) vertical jump tests, and their usefulness as predictors of sprint and strength performance for soccer (n = 127) and basketball (n = 59) players in 3 different categories (Under-15, Under-18, and Adults). Three attempts for each of the 4 jump tests were recorded. Twenty-meter sprint time and estimated 1 repetition maximum in full squat were also evaluated. All jump tests showed high intraclass correlation coefficients (0.969-0.995) and low coefficients of variation (1.54-4.82%), although 1-LEG was the jump test with the lowest absolute and relative reliability. All selected jump tests were significantly correlated (r = 0.580-0.983). Factor analysis resulted in the extraction of one principal component, which explained 82.90-95.79% of the variance of all jump tests. The 1-LEG test showed the lowest associations with sprint and strength performance. The results of this study suggest that CMJ and AJ are the most reliable tests for the estimation of explosive force in soccer and basketball players in different age categories.
Mothibe, M E; Osuch, E; Kahler-Venter, C P
The prevalent use of African traditional medicine by the general public has been reported. With commercialisation and marketing, some of the herbal medicines (HMs) used are readily available over the counter, most of them promoted as immune boosters. These commercial HMs have not been taken through clinical trials and other tests that would validate their composition and safety, and other properties such as their effect on laboratory diagnostic tests. To investigate the cross-reactivity of selected HMs with commonly tested drugs of abuse (DoA) using a qualitative rapid urinalysis assay. The six HMs selected were bought from local pharmacies. A rapid urinalysis screening test was performed with the Instant View Multi-Drug of Abuse Test kit from Labstix Diagnostics. Drug-free urine (DFU) was pooled from samples donated by healthy volunteers. Urine samples that had tested positive for DoA were obtained from a pharmacology laboratory. Aliquots of the urine samples were spiked with the HMs in neat and diluted form, and tested at various time intervals. The results for the DFU samples spiked with the HMs remained negative. There were no significant changes in pH or specific gravity of the samples. The results of samples that had tested positive for tetrahydrocannabinol (THC) were not altered by five of the HMs when spiked at 40% v/v. The HM Ngoma Herbal Tonic Immune Booster caused false-negative results for the THC test. An important finding is that the herbal mixture Ngoma Herbal Tonic Immune Booster caused false-negative results for the cannabinoid screening test. It adds to the list of substances that may be potential adulterants of urine for screening tests.
Kane, Michael T.
In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Caselli, G; Fernie, B; Canfora, F; Mascolo, C; Ferrari, A; Antonioni, M; Giustina, L; Donato, G; Marcotriggiani, A; Bertani, A; Altieri, A; Pellegrini, E; Spada, MM
Recent research has suggested that metacognitions may play a role across the spectrum of addictive behaviours. The goal of our studies was to develop the first self-report scale of metacognitions about gambling. We conducted three studies with one community (n = 165) and two clinical (n = 110; n = 87) samples to test the structure and psychometric properties of the Metacognitions about Gambling Questionnaire and examined its capacity to prospectively predict severity of gambling. Findings sup...
Cibele Cristina Osawa
Full Text Available The TBA test is essential to quality control of fat-containing food, being the test most applied to evaluate lipid peroxidation in fishery, meat and poultry products. It estimates malonaldehyde, a secondary oxidation product, by reacting with 2-thiobarbituric acid, forming a coloured complex, measured spectrophotometrically atlambda = 532 nm. Results are expressed as mg malonaldehyde per kg sample or frequently as "TBA value". There are four ways of quantifying it: by lipid extraction, direct heating, distillation or heat-acid extraction. This review intends to point out traditional, modified and alternative TBA test methods, besides enumerating advantages and drawbacks of each one.
Herschbach, Peter; Berg, Petra; Dankert, Andrea; Duran, Gabriele; Engst-Hastreiter, Ursula; Waadt, Sabine; Keller, Monika; Ukat, Robert; Henrich, Gerhard
The aim of this study was the development and psychometric testing of a new psychological questionnaire to measure the fear of progression (FoP) in chronically ill patients (cancer, diabetes mellitus and rheumatic diseases). The Fear of Progression Questionnaire (FoP-Q) was developed in four phases: (1) generation of items (65 interviews); (2) reduction of items--the initial version of the questionnaire (87 items) was presented to 411 patients, to construct subscales and test the reliability; (3) testing the convergent and discriminative validity of the reduced test version (43 items) within a new sample (n=439); (4) translation--German to English. The scale comprised five factors (Cronbach's alpha >.70): affective reactions (13 items), partnership/family (7), occupation (7), loss of autonomy (7) and coping with anxiety (9). The test-retest reliability coefficients varied between .77 and .94. There was only a medium relationship to traditional anxiety scales. This is an indication of the independence of the FoP. Significant relationships between the FoP-Q and the patient's illness behaviour indicate discriminative validity. The FoP-Q is a new and unique questionnaire developed for the chronically ill. A major problem and source of stress for this patient group has been measuring both specifically and economically the FoP of an illness. The FoP-Q was designed to resolve this problem, fulfill this need and reduce this stress.
Morrison, Todd G; Bishop, C J; Morrison, Melanie A; Parker-Taneo, Kandice
Discrimination against sexual minorities is widespread and has deleterious consequences on victims' psychological and physical wellbeing. However, a review of the psychometric properties of instruments measuring lesbian, gay, and bisexual (LGB) discrimination has not been conducted. The results of this review, which involved evaluating 162 articles, reveal that most have suboptimal psychometric properties. Specifically, myriad scales possess questionable content validity as (1) items are not created in collaboration with sexual minorities; (2) measures possess a small number of items and, thus, may not sufficiently represent the domain of interest; and (3) scales are "adapted" from measures designed to examine race- and gender-based discrimination. Additional limitations include (1) summed scores are computed, often in the absence of scale score reliability metrics; (2) summed scores operate from the questionable assumption that diverse forms of discrimination are necessarily interrelated; (3) the dimensionality of instruments presumed to consist of subscales is seldom tested; (4) tests of criterion-related validity are routinely omitted; and (5) formal tests of measures' construct validity are seldom provided, necessitating that one infer validity based on the results obtained. The absence of "gold standard" measures, the attendant difficulty in formulating a coherent picture of this body of research, and suggestions for psychometric improvements are noted.
Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C.
Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric evaluation). Classical Test Theory and Item Response Theory provide two psychometric frameworks for evaluating the quality of assessment tools. We discuss how these theories can be applied to assessment tools generally and then apply them to the Digital Logic Concept Inventory (DLCI). We demonstrate that the DLCI is sufficiently reliable for research purposes when used in its entirety and as a post-course assessment of students' conceptual understanding of digital logic. The DLCI can also discriminate between students across a wide range of ability levels, providing the most information about weaker students' ability levels.
Paula Inez Cunha Gomide
Full Text Available Abstract The development of forensic evaluation scales is fundamental. This study's purpose was to explore the psychometric properties of a parental alienation scale. Forensic technicians completed 193 scales concerning parents involved in a lawsuit: 48 families with at least one parent indicated as the alienator (group A and 48 families with no parental alienation claim (group B. The scale consisted of five categories and 69 items: denying access to the child; derogatory comparisons; emotional manipulation; behavior of parent and child during assessment. The results show Cronbach's alpha = .965 and split-half = .745; KMO = .884 and Bartlett's sphericity test ( p < .001. Concurrent criterion validity applied to data showed that the scale is able to distinguish between the alienator and target parent. The results showed significant and consistent standards in the instrument's psychometric characteristics.
Karimy, Mahmood; Fakhri, Ahmad; Vali, Esmaeel; Vali, Farzaneh; Veiga, Feliciano H; Stein, L A R; Araban, Marzieh
Growing evidence indicates that if disruptive behavior is left unidentified and untreated, a significant proportion of these problems will persist and may develop into problems linked with delinquency, substance abuse, and violence. Research is needed to develop valid and reliable measures of disruptive behavior to assist recognition and impact of treatments on disruptive behavior. The aim of this study was to develop and evaluate the psychometric properties of a scale for disruptive behavior in adolescents. Six hundred high school students (50% girls), ages ranged 15-18 years old, selected through multi stage random sampling. Psychometrics of the disruptive behavior scale for adolescents (DISBA) (Persian version) was assessed through content validity, explanatory factor analysis (EFA) using Varimax rotation and confirmatory factor analysis (CFA). The reliability of this scale was assessed via internal consistency and test-retest reliability. EFA revealed four factors accounting for 59% of observed variance. The final 29-item scale contained four factors: (1) aggressive school behavior, (2) classroom defiant behavior, (3) unimportance of school, and (4) defiance to school authorities. Furthermore, CFA produced a sufficient Goodness of Fit Index > 0.90. Test-retest and internal consistency reliabilities were acceptable at 0.85 and 0.89, respectively. The findings from this study suggest that the Iranian version of DISBA questionnaire has content validity. Further studies are needed to evaluate stronger psychometric properties for DISBA.
The commercialization of drugs started toward the end of Heian period (794-1192) when not only aristocrats and monks who were traditional patrons to drug makers, but also local clans and landlords who became powerful as a result of the disbanding of aristocratic manors accumulated enough wealth to spend money on medicine. Although traveling around the country was still a dangerous endeavor, merchants assembled groups to bring lucrative foreign drugs (mainly Chinese) to remote areas. The spread of commercial drugs to common people, however, did not happen until the early Edo period (1603-1867), when the so-called barrier system was installed nationwide to make domestic travel safe. Commercialization started in large cities and gradually spread to other areas. Many nostrums popular until recently appeared in the Genroku period (1688-1703) or later. Many such nostrums were all-cures, often consisting of such active ingredients as Saussureae radix, Agalloch, or Gambir. Even in the Edo period, many people living in agricultural or fishing villages, as well as those in the lower tier, were still poor. Much of the medication available to those people was therefore made of various plant or animal-derived substances that were traditionally used as folk medicines.
van Ballegooijen, Wouter; Riper, Heleen; Cuijpers, Pim; van Oppen, Patricia; Smit, Johannes H
Online questionnaires for measuring common mental health disorders such as depression and anxiety disorders are increasingly used. The psychometrics of several pen-and-paper questionnaires have been re-examined for online use and new online instruments have been developed and tested for validity as well. This study aims to review and synthesise the literature on this subject and provide a framework for future research. We searched Medline and PsycINFO for psychometric studies on online instruments for common mental health disorders and extracted the psychometric data. Studies were coded and assessed for quality by independent raters. We included 56 studies on 62 online instruments. For common instruments such as the CES-D, MADRS-S and HADS there is mounting evidence for adequate psychometric properties. Further results are scattered over different instruments and different psychometric characteristics. Few studies included patient populations. We found at least one online measure for each of the included mental health disorders and symptoms. A small number of online questionnaires have been studied thoroughly. This study provides an overview of online instruments to refer to when choosing an instrument for assessing common mental health disorders online, and can structure future psychometric research.
Full Text Available Malaria presents a diagnostic challenge in areas where both Plasmodium falciparum and P.vivax are co-endemic. Bivalent Rapid Diagnostic tests (RDTs showed promise as diagnostic tools for P.falciparum and P.vivax. To assist national malaria control programme in the selection of RDTs, commercially available seven malaria RDTs were evaluated in terms of their performance with special reference to heat stability.This study was undertaken in four forested districts of central India (July, 2011- March, 2012. All RDTs were tested simultaneously in field along with microscopy as gold standard. These RDTs were stored in their original packing at 25°C before transport to the field or they were stored at 35°C and 45°C upto 100 days for testing the performance of RDTs at high temperature. In all 2841 patients with fever were screened for malaria of which 26% were positive for P.falciparum, and 17% for P.vivax. The highest sensitivity of any RDT for P.falciparum was 98% (95% CI; 95.9-98.8 and lowest sensitivity was 76% (95% CI; 71.7-79.6. For P.vivax highest and lowest sensitivity for any RDT was 80% (95% CI; 94.9 - 83.9 and 20% (95% CI; 15.6-24.5 respectively. Heat stability experiments showed that most RDTs for P.falciparum showed high sensitivity at 45°C upto 90 days. While for P.vivax only two RDTs maintained good sensitivity upto day 90 when compared with RDTs kept at room temperature. Agreement between observers was excellent for positive and negative readings for both P.falciparum and P.vivax (Kappa >0.6-0.9.This is first field evaluation of RDTs regarding their temperature stability. Although RDTs are useful as diagnostic tool for P.falciparum and P.vivax even at high temperature, the quality of RDTs should be regulated and monitored more closely.
Full Text Available An adaption of McConahay, Harder and Batts’ (1981 moderm racism scale is presented for Chilean population andits psychometric properties, (reliability and validity are studied, along with its relationship with other relevantpsychosocial variables in studies on prejudice and ethnic discrimination (authoritarianism, religiousness, politicalposition, etc., as well as with other forms of prejudice (gender stereotypes and homophobia. The sample consistedof 120 participants, students of psychology, resident in the city of Antofagasta (a geographical zone with a highnumber of Latin-American inmigrants. Our findings show that the scale seems to be a reliable instrument to measurethe prejudice towards Bolivian immigrants in our social environment. Likewise, important differences among thesubjects are detected with high and low scores in the psychosocial variables used.
陳柏熹 Po-Hsi Chen
Full Text Available 本研究目的旨在發展大學生基本素養測驗並進行信度與效度評估。藉由分析國內大專院校的通識教育目標和核心素養，並參考ATC21S 提出的21世紀現代學生需具備的10 項基本素養，歸納出大學生基本素養測驗的九項素養，分別為：溝通合作、美感素養、科學思辨、資訊素養、終身學習、創新領導、問題解決、公民社會及生涯發展。測驗形式為線上多媒體情境式題型，每個題本均包含九項素養的內容，每項素養皆有二至三個題組。研究對象為全國大專校院一至四年級學生，研究樣本來自20 校10,958名大學生。由效度評估結果可知，大學生基本素養測驗的題組效果不大，可以忽略，並採用部分計分模式來估計，幾乎所有試題與模式都能適配，顯示建構效度良好。試題發展過程均歷經嚴謹修審題程序，取得良好專家效度證據。此外，不同性別和年級的學生在各素養的表現上差異不大，和過去的文獻相符合，具有良好的效標關聯效度。信度證據方面，各素養能力估計誤差約在 .20～ .60 logit 之間，單一題本的信度高於 .69，顯示本測驗題數雖少，但信度大致良好。整體而言，大學生基本素養測驗具良好的信度與效度。 This study evaluated the psychometric properties of the General Literacy Test for University Students. To develop the assessment framework, the educational objectives of general literacy courses of universities in Taiwan as well as the core competencies of Assessment and Teaching of 21st Century Skills were all reviewed and considered. The general literacy test is composed of nine literacy domains: communication and collaboration, esthetics, information, lifelong learning, career, leadership, problem solving, social concerns and citizenship, and scientific thinking. The items of the general literacy test were developed into a
Stone, L.L.; Janssens, J.M.A.M.; Vermulst, A.A.; Maten, M.L. van der; Engels, R.C.M.E.; Otten, R.
Background The Strengths and Difficulties Questionnaire is one of the most employed screening instruments. Although there is a large research body investigating its psychometric properties, reliability and validity are not yet fully tested using modern techniques. Therefore, we investigate
Norum, Jan; Grindedal, Eli Marie; Heramb, Cecilie; Karsrud, Inga; Ariansen, Sarah Louise; Undlien, Dag Erik; Schlichting, Ellen; Mæhle, Lovise
Background Identification of BRCA mutation carriers among patients with breast cancer (BC) involves costs and gains. Testing has been performed according to international guidelines, focusing on family history (FH) of breast and/or ovarian cancer. An alternative is testing all patients with BC employing sequencing of the BRCA genes and Multiplex Ligation Probe Amplification (MLPA). Patients and methods A model-based cost-effectiveness analysis, employing data from Oslo University Hospital, Ullevål (OUH-U) and a decision tree, was done. The societal and the healthcare perspectives were focused and a lifetime perspective employed. The comparators were the traditional FH approach used as standard of care at OUH-U in 2013 and the intervention (testing all patients with BC) performed in 2014 and 2015 at the same hospital. During the latter period, 535 patients with BC were offered BRCA testing with sequencing and MLPA. National 2014 data on mortality rates and costs were implemented, a 3% discount rate used and the costing year was 2015. The incremental cost-effectiveness ratio was calculated in euros (€) per life-year gained (LYG). Results The net healthcare cost (healthcare perspective) was €40 503/LYG. Including all resource use (societal perspective), the cost was €5669/LYG. The univariate sensitivity analysis documented the unit cost of the BRCA test and the number of LYGs the prominent parameters affecting the result. Diagnostic BRCA testing of all patients with BC was superior to the FH approach and cost-effective within the frequently used thresholds (healthcare perspective) in Norway (€60 000–€80 000/LYG). PMID:29682331
Norum, Jan; Grindedal, Eli Marie; Heramb, Cecilie; Karsrud, Inga; Ariansen, Sarah Louise; Undlien, Dag Erik; Schlichting, Ellen; Mæhle, Lovise
Identification of BRCA mutation carriers among patients with breast cancer (BC) involves costs and gains. Testing has been performed according to international guidelines, focusing on family history (FH) of breast and/or ovarian cancer. An alternative is testing all patients with BC employing sequencing of the BRCA genes and Multiplex Ligation Probe Amplification (MLPA). A model-based cost-effectiveness analysis, employing data from Oslo University Hospital, Ullevål (OUH-U) and a decision tree, was done. The societal and the healthcare perspectives were focused and a lifetime perspective employed. The comparators were the traditional FH approach used as standard of care at OUH-U in 2013 and the intervention (testing all patients with BC) performed in 2014 and 2015 at the same hospital. During the latter period, 535 patients with BC were offered BRCA testing with sequencing and MLPA. National 2014 data on mortality rates and costs were implemented, a 3% discount rate used and the costing year was 2015. The incremental cost-effectiveness ratio was calculated in euros (€) per life-year gained (LYG). The net healthcare cost (healthcare perspective) was €40 503/LYG. Including all resource use (societal perspective), the cost was €5669/LYG. The univariate sensitivity analysis documented the unit cost of the BRCA test and the number of LYGs the prominent parameters affecting the result.Diagnostic BRCA testing of all patients with BC was superior to the FH approach and cost-effective within the frequently used thresholds (healthcare perspective) in Norway (€60 000-€80 000/LYG).
Full Text Available Berberis hartwegii Benth., Berberidaceae, Hamelia patens Jacq., Rubiaceae, Dendropanax arboreus (L. Decne & Planch., Araliaceae, Erythrina herbacea L., Fabaceae, and Zanthoxylum caribaeum Lam., Rutaceae, acetone extracts were selected on the basis of their use in traditional Mexican medicine to treat scabies or skin diseases. Anti-dermatophyte activity in vitro was evaluated using the agar dilution assay, and the therapeutic efficacy of B. hartwegii and Z. caribaeum were tested against experimental tinea pedis. The infected animals were treated intragastrically daily for seven days with 2.5 and 5 mg/kg of acetone extracts. The acetone extract of H. patens exhibited 100% growth inhibition against T. mentagrophytes and E. floccosum at 100.0 and 50.0 µg/ml, respectively, and B. hartwegii inhibited growth of M. canis and T. mentagrophytes at 100.0 µg/ml. Effective treatments with 2.5 mg/kg of Z. caribaeum and B. hartwegii extract were comparable with 1 mg/kg of clotrimazole in mice. Liver profile tests and histological analyses did not exhibit any signs of toxicity and the Ames test indicated that both extracts were safe when evaluated in strains TA98, TA100 and TA102. Our results suggest the potential for the future development of new antifungal drugs from B. hartwegii or Z. caribaeum.
Full Text Available Almost every component of the information society is influenced by elements built on communication technology. Learning also tends to be related to the dynamic usage of computers. Nowadays, a number of applications (online or offline are also available that engage large groups of potential users and simultaneously provide a virtual environment to facilitate learning. This study introduces the self-developed interactive blind map teaching-examining e-learning system of the University of Debrecen. Results of testing the system with a control group are also presented.Both experimental and control groups of students were required to sita test of topographic knowledge following a semester of study. The pass mark for the test was 80%. The experimental group used the new digital environment to study, while the control group prepared for their exam using paper maps in the traditional way. The key research questions addressed by the study were to determine whether exam results obtained by the group using the ‘digital’ method better than those of the control's; and if there were a difference between the exam performances of the two groups, was this statistically significant and, therefore, likely to occur in other similar scenarios?
Outcome measures with good reliability, validity, responsiveness, and low burden of administration are clinically useful. The Oswestry Disability Index (ODI) is one of the most commonly used outcome measures for individuals with low back pain. Psychometric properties of the ODI will determine the questionnaire's suitability as a useful clinical tool. A literature search of relevant databases on psychometric evaluation of the ODI was performed. The search was done using the key words disability evaluation, and low back pain, and questionnaires, and reproducibility of results, and the term Oswestry. Inclusion criterion was direct reference regarding psychometric property, interpretability, and burden being included in the abstract. Eight articles met the inclusion criterion. The ODI shows good construct validity; internal consistency is rated as acceptable; test-retest reliability and responsiveness have been shown to be high; and burden of administration is low. The ODI is a valid, reliable, and responsive condition-specific assessment tool that is suited for use in clinical practice. It is easy to administer and score, objectifies clients' complaints, and monitors effects of therapy.
Tylka, Tracy L; Wood-Barcalow, Nichole L
Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.
Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
Validation of the Psychometric Properties of the Self-Compassion Scale. Testing the Factorial Validity and Factorial Invariance of the Measure among Borderline Personality Disorder, Anxiety Disorder, Eating Disorder and General Populations.
Costa, Joana; Marôco, João; Pinto-Gouveia, José; Ferreira, Cláudia; Castilho, Paula
During the last years, there has been a growing interest in self-compassion. Empirical evidences show that self-compassion is associated with psychological benefits among young adults and it might be considered a buffer factor in several mental disorders. The aim of this study was to validate the psychometric properties of the Self-compassion Scale (SCS: Neff, 2003a) after the initial lack of replicating the original six-factor structure. Data were collected from the overall database of a research centre (56 men and 305 women; mean age = 25.19) and comprised four groups: borderline personality disorder, anxiety disorder, eating disorder and general population. Confirmatory factor analysis supported a two-factor model (self-compassionate attitude versus self-critical attitude) with good internal consistencies, construct-related validity and external validity. Configural, weak measurement and structural invariance of the two-factor model of SCS were also shown. Findings support the generalizability of the two-factor model and show that both properties and interpretations of scores on self-compassion are equivalent across these population groups. Copyright © 2015 John Wiley & Sons, Ltd. A two-factor structure of SCS with strong psychometric validity was supported in clinical and non-clinical samples. Helping individuals with limited experiences of compassion to develop positive internal processing systems seems to be related with better mental health, self-acceptance and self-nurturing abilities. The non-probabilistic sampling limits the generalization of our conclusions. Copyright © 2015 John Wiley & Sons, Ltd.
Deepak, Kishore K; Al-Umran, Khalid Umran; AI-Sheikh, Mona H; Dkoli, B V; Al-Rubaish, Abdullah
The functionality of distracters in a multiple choice question plays a very important role. We examined the frequency and impact of functioning and non-functioning distracters on psychometric properties of 5-option items in clinical disciplines. We analyzed item statistics of 1115 multiple choice questions from 15 summative assessments of undergraduate medical students and classified the items into five groups by their number of non-functioning distracters. We analyzed the effect of varying degree of non-functionality ranging from 0 to 4, on test reliability, difficulty index, discrimination index and point biserial correlation. The non-functionality of distracters inversely affected the test reliability and quality of items in a predictable manner. The non-functioning distracters made the items easier and lowered the discrimination index significantly. Three non-functional distracters in a 5-option MCQ significantly affected all psychometric properties (p psychometrically as effective as 5-option items. Our study reveals that a multiple choice question with 3 functional options provides lower most limit of item format that has adequate psychometric property. The test containing items with less number of functioning options have significantly lower reliability. The distracter function analysis and revision of nonfunctioning distracters can serve as important methods to improve the psychometrics and reliability of assessment.
Hafner, J.; Uckert, G.; Graef, F.; Hoffmann, H.; Kimaro, A. A.; Sererya, O.; Sieber, S.
In Tanzania, a majority of rural residents cook using firewood-based three-stone-fire stoves. In this study, quantitative performance differences between technologically advanced improved cooking stoves and three-stone-fire stoves are analysed. We test the performance of improved cooking stoves and three-stone-fire stoves using local cooks, foods, and fuels, in the semi-arid region of Dodoma in Tanzania. We used the cooking protocol of the Controlled Cooking Test following a two-pot test design. The findings of the study suggest that improved cooking stoves use less firewood and less time than three-stone-fire stoves to conduct a predefined cooking task. In total, 40 households were assessed and ask to complete two different cooking tasks: (1) a fast cooking meal (rice and vegetables) and (2) a slow cooking meal (beans and rice). For cooking task 1, the results show a significant reduction in firewood consumption of 37.1% by improved cooking stoves compared to traditional three-stone-fire stoves; for cooking task 2 a reduction of 15.6% is found. In addition, it was found that the time needed to conduct cooking tasks 1 and 2 was significantly reduced by 26.8% and 22.8% respectively, when improved cooking stoves were used instead of three-stone-fire-stoves. We observed that the villagers altered the initial improved cooking stove design, resulting in the so-called modified improved cooking stove. In an additional Controlled Cooking Test, we conducted cooking task 3: a very fast cooking meal (maize flour and vegetables) within 32 households. Significant changes between the initial and modified improved cooking stoves regarding firewood and time consumption were not detected. However, analyses show that both firewood and time consumption during cooking was reduced when large amounts (for 6-7 household members) of food were prepared instead of small amounts (for 2-3 household members).
Mao, Wei-Chung; Chen, Li-Fen; Chi, Chia-Hsing; Lin, Ching-Hung; Kao, Yu-Chen; Hsu, Wen-Yau; Lane, Hsien-Yuan; Hsieh, Jen-Chuen
Schizophrenia is an illness that impairs a person's social cognition. The Mayer Salovey Caruso Emotional Intelligence Test (MSCEIT) is the most well-known test used to measure emotional intelligence (EI), which is a major component of social cognition. Given the absence of EI ability-based scales adapted to Chinese speakers, we translated the MSCEIT into a Traditional Chinese version (MSCEIT-TC) and validated this scale for use in schizophrenia studies. The specific aims were to validate the MSCEIT-TC, to develop a norm for the MSCEIT-TC, and use this norm to explore the EI performance of schizophrenic individuals. We included in our study seven hundred twenty-eight healthy controls and seventy-six individuals with schizophrenia. The results suggest that the MSCEIT-TC is reliable and valid when assessing EI. The results showed good discrimination and validity when comparing the two study groups. Impairment was the greatest for two branches Understanding and Managing Emotions, which implies that the deficits of schizophrenia individuals involve ToM (theory of mind) tasks. Deficits involving the negative scale of schizophrenia was related to impaired performance when the MSCEIT-TC was used (in branch 2, 3, 4, and the area Strategic). Our findings suggest that the MSCEIT-TC can be used for emotional studies in healthy Chinese and in clinical setting for investigating schizophrenic individuals. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Ferrando, Pere J.; Masip-Cabrera, Antoni; Navarro-González, David; Lorenzo-Seva, Urbano
The Psychometric Toolbox (PT) is a user-friendly, non-commercial package mainly intended to be used for instructional purposes in introductory courses of educational and psychological measurement, psychometrics and statistics. The PT package is organized in six separate modules or sub-programs: Data preprocessor (descriptive analyses and data…
Kraus, Shane; Rosenberg, Harold
Despite the prevalence of pornography use, and recent conceptualization of problematic use as an addiction, we could find no published scale to measure craving for pornography. Therefore, we conducted three studies employing young male pornography users to develop and evaluate such a questionnaire. In Study 1, we had participants rate their agreement with 20 potential craving items after reading a control script or a script designed to induce craving to watch pornography. We dropped eight items because of low endorsement. In Study 2, we revised both the questionnaire and cue exposure stimuli and then evaluated several psychometric properties of the modified questionnaire. Item loadings from a principal components analysis, a high internal consistency reliability coefficient, and a moderate mean inter-item correlation supported interpreting the 12 revised items as a single scale. Correlations of craving scores with preoccupation with pornography, sexual history, compulsive internet use, and sensation seeking provided support for convergent validity, criterion validity, and discriminant validity, respectively. The enhanced imagery script did not impact reported craving; however, more frequent users of pornography reported higher craving than less frequent users regardless of script condition. In Study 3, craving scores demonstrated good one-week test-retest reliability and predicted the number of times participants used pornography during the following week. This questionnaire could be applied in clinical settings to plan and evaluate therapy for problematic users of pornography and as a research tool to assess the prevalence and contextual triggers of craving among different types of pornography users.
Wedman, Jonathan; Lyrén, Per-Erik
When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…
Sarah E. Mennenga
Full Text Available We constructed an 11-arm, walk-through, human radial-arm maze (HRAM as a translational instrument to compare existing methodology in the areas of rodent and human learning and memory research. The HRAM, utilized here, serves as an intermediary test between the classic rat radial-arm maze (RAM and standard human neuropsychological and cognitive tests. We show that the HRAM is a useful instrument to examine working memory ability, explore the relationships between rodent and human memory and cognition models, and evaluate factors that contribute to human navigational ability. One-hundred-and-fifty-seven participants were tested on the HRAM, and scores were compared to performance on a standard cognitive battery focused on episodic memory, working memory capacity, and visuospatial ability. We found that errors on the HRAM increased as working memory demand became elevated, similar to the pattern typically seen in rodents, and that for this task, performance appears similar to Miller’s classic description of human working memory capacity of 7±2 items. Regression analysis revealed that measures of working memory capacity and visuospatial ability accounted for a large proportion of variance in HRAM scores, while measures of episodic memory and general intelligence did not serve as significant predictors of HRAM performance. We present the HRAM as a novel instrument for measuring navigational behavior in humans, as is traditionally done in basic science studies evaluating rodent learning and memory, thus providing a useful tool to help connect and translate between human and rodent models of cognitive functioning.
Davidson, Charlie A; Lesser, Rebecca; Parente, Lori T; Fiszdon, Joanna M
Social cognition represents an important treatment target, closely linked to everyday social function. While a number of social cognitive interventions have recently been developed, measures used to evaluate these treatments are only beginning to receive psychometric scrutiny. Study goals were to replicate recently-published psychometrics for several social cognitive measures, and to provide information for additional social cognitive measures not included in recent reports. Forty-eight outpatients with psychotic-spectrum disorders completed measures of emotion perception, theory of mind, and attributional bias on two occasions, one month apart. Measures were tested for distributional characteristics, test-retest reliability, utility as a repeated measure, and relationship to symptoms and functioning. For a subgroup of participants, information about sensitivity to social cognitive treatment was also available. We replicated aspects of prior work, including largely favorable psychometric characteristics for the Bell-Lysaker Emotion Recognition Task, and promising but weaker characteristics for The Awareness of Social Inferences Test subscales and Reading the Mind in the Eyes Task. The Hinting Task had adequate test-retest statistics but a more pronounced ceiling effect. Ambiguous Intentions and Hostility Questionnaire data showed evidence of validity but were limited by inconsistency over time. Our results strongly support the Davos Assessment of Cognitive Biases Scale for future evaluation as a social cognitive treatment outcome measure. Its scores were adequately distributed, consistent over time, related to symptoms and functioning, and sensitive to treatment effects. Other relatively novel assessments of attributional bias and theory of mind showed some promise, although more work is needed. Published by Elsevier B.V.
Full Text Available In a real e-commerce website, usually only a small number of users will give ratings to the items they purchased, and this can lead to the very sparse user-item rating data. The data sparsity issue will greatly limit the recommendation performance of most recommendation algorithms. However, a user may register accounts in many e-commerce websites. If such users’ historical purchasing data on these websites can be integrated, the recommendation performance could be improved. But it is difficult to align the users and items between these websites, and thus how to effectively borrow the users’ rating data of one website (source domain to help improve the recommendation performance of another website (target domain is very challenging. To this end, this paper extended the traditional one-dimensional psychometrics model to multidimension. The extended model can effectively capture users’ multiple interests. Based on this multidimensional psychometrics model, we further propose a novel transfer learning algorithm. It can effectively transfer users’ rating preferences from the source domain to the target domain. Experimental results show that the proposed method can significantly improve the recommendation performance.
Blount, Claire; Evans, Chris; Birch, Sarah; Warren, Fiona; Norton, Kingsley
Self-report measures pertinent for personality disorder are widely used and many are available. Their relative merits are usually assessed on nomothetic psychometrics and acceptability to users is neglected. We report reactions of lay, patient and professional groups to the Personality Diagnostic Questionnaire (PDQ-IV); Millon Clinical Multiaxial Inventory (MCMI-III); the Borderline Syndrome Index (BSI); Rosenberg's Self-Esteem Scale (RSE) and the Social Functioning Questionnaire (SFQ). These were sent to 148 professionals, ex-patients and lay people for comment. Thirty-six per cent were returned. Pattern-coding by three raters revealed problematic themes across all measures, including inappropriate length, vague items and language, cultural assumptions and slang, state-bias and response-set. Measures can be depressing and upsetting for some participants (both patients and non-patients), hence administration of measures should be sensitive. Treatment may make people more self-aware, which may compromise validity for outcome research. This evaluation raises issues and concerns, which are missed in traditional psychometric evaluation.
Culpepper, Steven; Janssen, Rianne; González, Jorge; Molenaar, Dylan
This proceedings book highlights the latest research and developments in psychometrics and statistics. Featuring contributions presented at the 82nd Annual Meeting of the Psychometric Society (IMPS), organized by the University of Zurich and held in Zurich, Switzerland from July 17 to 21, 2017, its 34 chapters address a diverse range of psychometric topics including item response theory, factor analysis, causal inference, Bayesian statistics, test equating, cognitive diagnostic models and multistage adaptive testing. The IMPS is one of the largest international meetings on quantitative measurement in psychology, education and the social sciences, attracting over 500 participants and 250 paper presentations from around the world every year. This book gathers the contributions of selected presenters, which were subsequently expanded and peer-reviewed.
Leung, W C; Lau, E T; Lau, W L; Tang, Rebecca; Wong, Shell Fean; Lau, T K; Tse, K T; Wong, S F; To, W K; Ng, Lucy K L; Lao, T T; Tang, Mary H Y
The application of rapid aneuploidy testing as a stand-alone approach in prenatal diagnosis is much debated. The major criticism of this targeted approach is that it will not detect other chromosomal abnormalities that will be picked up by traditional karyotyping. This study aimed to study the nature of such chromosomal abnormalities and whether parents would choose to terminate affected pregnancies. Retrospective study on a cytogenetic database. Eight public hospitals in Hong Kong. The karyotype results of 19 517 amniotic fluid cultures performed for advanced maternal age (>or=35 years) from 1997 to 2002 were classified according to whether they were detectable by rapid aneuploidy testing. The outcomes of pregnancies with abnormal karyotypes were reviewed from patient records. In all, 333 (1.7%) amniotic fluid cultures yielded abnormal karyotypes; 175 (52.6%) of these were detected by rapid aneuploidy testing, and included trisomy 21 (n=94, 28.2%), trisomy 18 or 13 (n=21, 6.3%), and sex chromosome abnormalities (n=60, 18.0%). The other 158 (47.4%) chromosomal abnormalities were not detectable by rapid aneuploidy testing, of which 63 (18.9%) were regarded to be of potential clinical significance and 95 (28.5%) of no clinical significance. Pregnancy outcomes in 327/333 (98.2%) of these patients were retrieved. In total, 143 (42.9%) of these pregnancies were terminated: 93/94 (98.9%) for trisomy 21, 20/21 (95.2%) for trisomy 18 or 13, 19/60 (31.7%) for sex chromosome abnormalities, and 11/63 (17.5%) for other chromosomal abnormalities with potential clinical significance. There were no terminations in the 95 pregnancies in which karyotyping results were regarded to be of no clinical significance. 'Knowing less' by the rapid aneuploidy stand-alone testing could miss about half of all chromosomal abnormalities detectable by amniocentesis performed for advanced maternal age. Findings from two fifths of the latter were of potential clinical significance, and the parents
Castellanos, Irina; Kronenberger, William G; Pisoni, David B
The psychometric properties of the Learning, Executive, and Attention Functioning (LEAF) scale were investigated in an outpatient clinical pediatric sample. As a part of clinical testing, the LEAF scale, which broadly measures neuropsychological abilities related to executive functioning and learning, was administered to parents of 118 children and adolescents referred for psychological testing at a pediatric psychology clinic; 85 teachers also completed LEAF scales to assess reliability across different raters and settings. Scores on neuropsychological tests of executive functioning and academic achievement were abstracted from charts. Psychometric analyses of the LEAF scale demonstrated satisfactory internal consistency, parent-teacher inter-rater reliability in the small to large effect size range, and test-retest reliability in the large effect size range, similar to values for other executive functioning checklists. Correlations between corresponding subscales on the LEAF and other behavior checklists were large, while most correlations with neuropsychological tests of executive functioning and achievement were significant but in the small to medium range. Results support the utility of the LEAF as a reliable and valid questionnaire-based assessment of delays and disturbances in executive functioning and learning. Applications and advantages of the LEAF and other questionnaire measures of executive functioning in clinical neuropsychology settings are discussed.
Saltychev, Mikhail; Mattie, Ryan; McCormick, Zachary; Bärlund, Esa; Laimi, Katri
The aim of this study was to investigate the psychometric properties of the Oswestry Disability Index (ODI) in a large cross-sectional cohort of individuals with chronic low back pain by defining its internal consistency, construct structure and validity, and its ability to differentiate between different degrees of functional limitation. A total of 837 consecutive outpatient patients with low back pain were studied. The internal consistency of ODI was assessed by Cronbach's α, construct structure by exploratory factor analysis, construct validity by confirmatory factor analysis, and discrimination was determined by item response theory analysis. The ODI showed good internal consistency (α=0.85). Explanatory factor analysis showed that ODI is a unidimensional test measuring functional level and nothing else. The confirmatory factor analysis showed that the standardized regression weights of all ODI items were relatively high, varying from 0.5 to 0.7. The item response theory analysis suggested that eight out of 10 ODI items have a close to perfect ability to measure functional limitations in accordance with the actual severity of disability experienced by the respondents. Discrimination of all the items was high to perfect (1.08-2.01). The test characteristic and test information curves showed that the discriminative ability of the ODI is superior at higher levels of disability. The present data showed that the ODI is an internally consistent, unidimensional scale with overall excellent construct validity and ability to discriminate the severity of functional disability. The analysis suggests that the ODI may better distinguish between the relative degrees of function at above-average disability levels.
Eun-Hyun Lee, RN, PhD
Conclusion: Overall, the PSS is an easy-to-use questionnaire with established acceptable psychometric properties. However, future studies should evaluate these psychometric properties in greater depth, and validate the scale using diverse populations.
Lacroix, Emilie; Alberga, Angela; Russell-Mathew, Shelly; McLaren, Lindsay; von Ranson, Kristin
People living with overweight and obesity often experience weight-based stigmatization. Investigations of the prevalence and correlates of weight bias and evaluation of weight bias reduction interventions depend upon psychometrically-sound measurement. Our paper is the first to comprehensively evaluate the psychometric properties, use of people-first language within items, and suitability for use with various populations of available self-report measures of weight bias. We searched five electronic databases to identify English-language self-report questionnaires of weight bias. We rated each questionnaire's psychometric properties based on initial validation reports and subsequent use, and examined item language. Our systematic review identified 40 original self-report questionnaires. Most questionnaires were brief, demonstrated adequate internal consistency, and tapped key cognitive and affective dimensions of weight bias such as stereotypes and blaming. Current psychometric evidence is incomplete for many questionnaires, particularly with regard to the properties of test-retest reliability, sensitivity to change as well as discriminant and structural validity. Most questionnaires were developed prior to debate surrounding terminology preferences, and do not employ people-first language in the items administered to participants. We provide information and recommendations for clinicians and researchers in selecting psychometrically sound measures of weight bias for various purposes and populations, and discuss future directions to improve measurement of this construct. © 2017 The Author(s) Published by S. Karger GmbH, Freiburg.
Ferriero, Giorgio; Kristensen, Morten T; Invernizzi, Marco
INTRODUCTION: In the geriatric population, independent mobility is a key factor in determining readiness for discharge following acute hospitalization. The Cumulated Ambulation Score (CAS) is a potentially valuable score that allows day-to-day measurements of basic mobility. The CAS was developed...... and validated in older patients with hip fracture as an early postoperative predictor of short-term outcome, but it is also used to assess geriatric in-patients with acute medical illness. Despite the fast- accumulating literature on the CAS, to date no systematic review synthesizing its psychometric properties....... Of 49 studies identified, 17 examined the psychometric properties of the CAS. EVIDENCE SYNTHESIS: Most papers dealt with patients after hip fracture surgery, and only 4 studies assessed the CAS psychometric characteristics also in geriatric in-patients with acute medical illness. Two versions of CAS...
Tilden, V P; Hirsch, A M; Nelson, C A
For norm-referenced measures to be useful in social-behavioral research, investigators who develop measures face several psychometric challenges, including: (a) adequate domain specification; (b) adequate initial evidence of reliability and validity; and (c) ongoing evidence of psychometric quality. The Interpersonal Relationship Inventory (IPRI) was developed in response to gaps in measurement of social relationships, and contributed scales for reciprocity and conflict to a measure of social support. For the IPRI, the first two points were addressed during the period of instrument development. The measure now has been in use for 4 years. This article reports evidence addressing the third challenge: ongoing evidence of psychometric quality. Findings from 19 studies using the IPRI provide compelling evidence for internal consistency reliability and construct validity of the scales.
Saide Umut ZEYBEK
Full Text Available The aim of this study is to investigate the applicability of Coping Attitudes Scale: Measure of Positive Attitudes in Depression (CAS among Turkish young adult community sample and determine the psychometric properties (validity and reliability of this scale. This study was conducted with 419 students attending different departments in Mugla Sitki Kocman University, Faculty of Education in the spring semester of academic year of 2015-2016. Positive Functional Attitudes Scale, Beck Depression Scale, Beck Hopelessness Scale, Automatic Thoughts Scale, Positivity Scale and Developed Automatic Thoughts Scale.were used as data collection tools. Confirmatory factor analysis (CFA were used for investigation of the psychometric properties of the PFAS. Also, criterion-related validity, test-retest validity, and internal consistency were used calculated. The CFA results showed that standardized item estimates of the CAS ranged between 0.45 and 0.47. Also the CFA results showed that the original factor structure of the PFAS confirmed on the Turkish sample. internal consistency was calculated using the total community samples PFAS score. Cronbachs alpha coefficient ort he total scale (.93 was high. Test-retest results of the subscales were 0.76. The findings showed that factor structures of the PFAS life perspective, personal accomplishment, positive future, self-worth, coping with problems had psychometric quality in Turkish version. As a result of the study, the Turkish version of PFAS has good validity and reliability for young adult community sample. [JCBPR 2017; 6(2.000: 59-66
Santos-Iglesias, Pablo; Sierra, Juan Carlos
The study analyzed psychometric properties of a Spanish version of the Hurlbert Index of Sexual Assertiveness in a Spanish sample of 400 men and 453 women who had had a partner for the last 6 mo. or longer at the time of the study. Exploratory and confirmatory factor analyses suggested a two-factor solution with the factors Initiation and No shyness/Refusal. Internal consistency values for total scores were .87 and .83 for the factors, respectively. Convergent validity tests were also satisfactory. It is therefore reasonable to conclude that the Spanish version of the scale has appropriate psychometric properties.
Lundman, Berit; Viglund, Kerstin; Aléx, Lena; Jonsén, Elisabeth; Norberg, Astrid; Fischer, Regina Santamäki; Strandberg, Gunilla; Nygren, Björn
Four dimensions of inner strength were previously identified in a meta-theoretical analysis: firmness, creativity, connectedness, and flexibility. The aim of this study was to develop an Inner Strength Scale (ISS) based on those four dimensions and to evaluate its psychometric properties. An initial version of ISS was distributed for validation purpose with the Rosenberg Self-Esteem Scale, the resilience scale, and the sense of Coherence Scale. A convenience sample of 391 adults, aged 19-90 years participated. Principal component analysis (PCA) and confirmatory factor analysis (CFA) were used in the process of exploring, evaluating, and reducing the 63-item ISS to the 20-item ISS. Cronbach's alpha and test-retest were used to measure reliability. CFA showed satisfactory goodness-of-fit for the 20-item ISS. The analysis supported a fourfactor solution explaining 51% of the variance. Cronbach's alpha on the 20-item ISS was 0.86, and the test-retest showed stability over time (r=0.79). The ISS was found to be a valid and reliable instrument for capturing a multifaceted understanding of inner strength. Further tests of psychometric properties of the ISS will be performed in forthcoming studies. Copyright © 2011 Elsevier Ltd. All rights reserved.
Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela
To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February-April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.
Thevis, Mario; Schänzer, Wilhelm; Geyer, Hans; Thieme, Detlef; Grosse, Joachim; Rautenberg, Claudia; Flenker, Ulrich; Beuck, Simon; Thomas, Andreas; Holland, Ruben; Dvorak, Jiri
The administration of musk extract, that is, ingredients obtained by extraction of the liquid secreted from the preputial gland or resulting grains of the male musk deer (eg, Moschus moschiferus), has been recommended in Traditional Chinese Medicine (TCM) applications and was listed in the Japanese pharmacopoeia for various indications requiring cardiovascular stimulation, anti-inflammatory medication or androgenic hormone therapy. Numerous steroidal components including cholesterol, 5α-androstane-3,17-dione, 5β-androstane-3,17-dione, androsterone, etiocholanolone, epiandrosterone, 3β-hydroxy-androst-5-en-17-one, androst-4-ene-3,17-dione and the corresponding urea adduct 3α-ureido-androst-4-en-17-one were characterised as natural ingredients of musk over several decades, implicating an issue concerning doping controls if used for the treatment of elite athletes. In the present study, the impact of musk extract administration on sports drug testing results of five females competing in an international sporting event is reported. In the course of routine doping controls, adverse analytical findings concerning the athletes' steroid profile, corroborated by isotope-ratio mass spectrometry (IRMS) data, were obtained. The athletes' medical advisors admitted the prescription of TCM-based musk pod preparations and provided musk pod samples for comparison purposes to clarify the antidoping rule violation. Steroid profiles, IRMS results, literature data and a musk sample obtained from a living musk deer of a local zoo conclusively demonstrated the use of musk pod extracts in all cases which, however, represented a doping offence as prohibited anabolic-androgenic steroids were administered.
Mitchell, Sherese A.
This researcher investigated the long- and short-term retention of information using traditional instruction versus previously tested tactual resources versus innovative tactual resources on the achievement and attitudes of second-grade students in science. The processing of new and difficult knowledge has challenged many young children who tend to be kinesthetic or tactual learners. In compliance with the National Science Education Standards, students should be actively engaged in their own learning. Therefore, to boost student achievement in science, the use of tactual materials was implemented. The sample included 67 second-grade students drawn from three heterogeneously grouped classes in a low socio-economic neighborhood. It consisted of 30 females and 37 males of which 97 percent were African American, 2 percent were Hispanic, and 1 percent Other. Students were unaware of their diagnosed learning-style preference(s) during the instruction and assessment phases of the study. Therefore, students' knowledge of their learning-style preferences could not have had any impact on their achievement or attitudes. A counterbalanced research design was employed. During the first session, Group 1 was taught with previously tested tactual resources (Electroboards, Flip Chutes, Fact Wheels, and Fact Fans), and Group 3 was taught traditionally. During the second session of instruction, Group 1 received instruction with innovative tactual resources, Group 2 received traditional instruction, Group 3 received instruction with previously tested tactual resources. During the final session of instruction, Group 1 received traditional instruction, Group 2 received instruction with previously tested tactual resources, and Group 3 received instruction with innovative tactual resources. The results indicated that the use of tactual materials, regardless of whether they were previously tested or innovative, produced higher achievement gains and more positive attitudes than traditional
Full Text Available The primary purpose of this study was to evaluate the possibility of using a psychometric approach for assessing supervisory competencies relevant to the mining and refining environment. The competency questionnaire was developed using supervisory roles and registered supervisory unit standards from the United Kingdom (UK, as no registered unit standards exist in South Africa. Twenty-four supervisors from three departments (Production, Engineering and Laboratory were evaluated by 125 raters; besides by themselves, also by their managers, peers, customers and their sub-ordinates. Based on difference scores derived from the Importance and Performance scales, a single factor was extracted with an internal reliability of 0,965. No statistical significant differences were obtained (ANOVA’s, t-test and F-statistics between groups based on biographical variables or between rater groups. The findings and their implications are further discussed. Opsomming Die primêre doel van die studie was om die moontlikheid vir die gebruik van ’n psigometriese benadering tot toesighouerbevoegdheidsbeoordeling, te evalueer. Die bevoegdheidsvraelys is ontwikkel deur gebruik te maak van toesighouersrolle en geregistreerde toesighouerseenheidstandaarde van die Verenigde Koningkryk, as gevolg van ‘n gebrek aan bestaande eenheidstandaarde in Suid-Afrika. Vier-en-twintig toesighouers van drie departemente (Produksie, Ingenieurswese en Laboratorium is deur 125 beoordelaars geëvalueer; buiten deur hulself, ook deur hul bestuurders, kollegas, kliënte en hul ondergeskiktes. ’n Enkele faktor, met ’n betroubaarheid van 0,965, gebaseer op die verskiltellings van die Prestasie- en Belangrikheidskaal, is onttrek. Geen beduidende verskille (ANOVA’s, t-toetse en F-statistiek kon tussen groepe gebaseer op biografiese veranderlikes en die onderskeie beoordelaarsgroepe gevind word nie. Hierdie bevindinge en die implikasies daarvan word verder bespreek.
Farmer, Ryan L.; Floyd, Randy G.; Reynolds, Matthew R.; Kranzler, John H.
The most global score yielded by intelligence tests, IQs, are supported by substantial validity evidence and have historically been central to the identification of intellectual disabilities, learning disabilities, and giftedness. This study examined the extent to which IQs measure the ability they target, psychometric "g." Data from…
Lovakov, Andrey V.; Agadullina, Elena R.; Schaufeli, Wilmar B.
This article aims to analyze the psychometric properties of the Russian version of the Utrecht Work Engagement Scale (UWES-9) by using a sample of 1783 employees of a large Russian organization. We conducted a series of Confirmatory Factor Analysis (CFA) tests of the factorial structure and the
Khanna, Rahul; Madhavan, S. Suresh; Smith, Michael J.; Tworek, Cindy; Patrick, Julie H.; Becker-Cottrill, Barbara
The purpose of this study was to test the psychometric properties of the Caregiver Strain Questionnaire (CGSQ) among caregivers of children with autism. The CGSQ was originally developed to assess burden experienced by parents of children and adolescents with serious emotional and behavioral disorders. Study data was collected from 304 primary…
Al-Hendawi, Maha; Keller, Clayton; Cloninger, Lea
The Child Behavior Checklist for children 6 to 18 (CBCL/6-18) is a widely used, standardized parent rating scale. However, few studies have tested the psychometric properties of this instrument in the Arab world despite the great need for such instruments to support the identification and education of children with emotional, behavioral, and…
Aronson, David M.; Baum, Steven K.
A new psychometric instrument for measuring the impact of divorce on elementary school age children was developed: the Child's Report of the Impact of Separation by Parents (CRISP). This structured projective test was specifically designed to assess children's postdivorce stress/adjustment. An initial version of the CRISP was administered to 99…
Raykov, Tenko; Marcoulides, George A.; Patelis, Thanos
A critical discussion of the assumption of uncorrelated errors in classical psychometric theory and its applications is provided. It is pointed out that this assumption is essential for a number of fundamental results and underlies the concept of parallel tests, the Spearman-Brown's prophecy and the correction for attenuation formulas as well as…
Brouwer, B.J.M. de; Kaljouw, M.J.; Schoonhoven, L.; Achterberg, T. van
AIMS AND OBJECTIVES: To develop and psychometrically test the Essentials of Magnetism II in nursing homes. BACKGROUND: Increasing numbers and complex needs of older people in nursing homes strain the nursing workforce. Fewer adequately trained staff and increased care complexity raise concerns about
Smith, Leann V.; Cokley, Kevin
The authors investigated the psychometric properties of the Social Identities and Attitudes Scale developed by Picho and Brown, which captures an individual's vulnerability to Stereotype Threat effects. Confirmatory factor analyses and group invariance tests conducted on a diverse sample of 516 college students revealed adequate reliability and…
Moussa, Miriam Taouk; Lovibond, Peter; Laube, Roy; Megahead, Hamido A.
Objective: To translate and evaluate the psychometric properties of an Arabic-language version of the Depression Anxiety Stress Scales (DASS). Method: The items were translated, back translated, refined, and tested in an Australian immigrant sample (N = 220). Results: Confirmatory factor analysis showed that the Arabic DASS discriminates between…
Kim, L. H.; McLeod, R. S.; Kiss, Z. H. T.
Objective. There have been remarkable advances over the past decade in neural prostheses to restore lost motor function. However, restoration of somatosensory feedback, which is essential for fine motor control and user acceptance, has lagged behind. With an increasing interest in using electrical stimulation to restore somatosensory sensations within the peripheral (PNS) and central nervous systems (CNS), it is critical to characterize the percepts evoked by electrical stimulation in a standardized manner with a validated psychometric questionnaire. This will allow comparison of results from applications at various nervous system levels in multiple settings. Approach. We compiled a summary of published reports of somatosensory percepts that were elicited by electrical stimulation in humans and used these to develop a new psychometric questionnaire. Results. This new questionnaire was able to characterize subjective evoked sensations with good test-retest reliability (Spearman’s correlation coefficients ranging 0.716 ⩽ ρ ⩽ 1.000, p ⩽ 0.005) in 13 subjects receiving stimulation through neural implants in both the CNS and PNS. Furthermore, the new questionnaire captured more descriptors (M = 2.65, SD = 0.91) that would have been missed by being categorized as ‘other sensations’, using a previous questionnaire (M = 1.40, SD = 0.77, t(12) = -10.24, p psychometric questionnaire will aid in establishing consistency and standardization of reporting in future studies of somatosensory neural prostheses.
Caselli, Gabriele; Fernie, Bruce; Canfora, Flaviano; Mascolo, Cristina; Ferrari, Andrea; Antonioni, Maria; Giustina, Lucia; Donato, Gilda; Marcotriggiani, Antonella; Bertani, Andrea; Altieri, Antonella; Pellegrini, Eliana; Spada, Marcantonio M
Recent research has suggested that metacognitions may play a role across the spectrum of addictive behaviours. The goal of our studies was to develop the first self-report scale of metacognitions about gambling. We conducted three studies with one community (n = 165) and two clinical (n = 110; n = 87) samples to test the structure and psychometric properties of the Metacognitions about Gambling Questionnaire and examined its capacity to prospectively predict severity of gambling. Findings supported a two factor solution consisting of positive and negative metacognitions about gambling. Internal consistency, predictive and divergent validity were acceptable. All the factors of the Metacognitions about Gambling Questionnaire correlated positively with gambling severity. Regression analyses showed that negative metacognitions about gambling were significantly associated to gambling severity over and above negative affect and gambling-specific cognitive distortions. Finally only gambling severity and negative metacognitions about gambling were significant prospective predictors of gambling severity as measured three months later. The Metacognitions about Gambling Questionnaire was shown to possess good psychometric properties, as well as predictive and divergent validity within the populations that were tested. Copyright © 2018 Elsevier B.V. All rights reserved.
Spada, Marcantonio M; Caselli, Gabriele
Recent research has suggested that metacognitions may play a role across the spectrum of addictive behaviours. The goal of our studies was to develop the first self-report scale of metacognitions about online gaming. We conducted two studies with samples of online gamers (n=225, n=348) to test the structure and psychometric properties of the Metacognitions about Online Gaming Scale and examined its capacity to predict weekly online gaming hours and Internet addiction. Exploratory and confirmatory factor analyses supported a three-factor solution: positive metacognitions about online gaming, negative metacognitions about the uncontrollability of online gaming, and negative metacognitions about the dangers of online gaming. Internal consistency, predictive and divergent validity were acceptable. All the factors of the Metacognitions about Online Gaming Scale correlated positively with weekly online gaming hours and Internet addiction. Regression analyses showed that negative metacognitions about the uncontrollability of online gaming and levels of Internet addiction were the only significant predictors of weekly online gaming hours, and that positive metacognitions about online gaming and negative metacognitions about the uncontrollability of online gaming were the only significant predictors of Internet addiction. The Metacognitions about Online Gaming Scale was shown to possess good psychometric properties, as well as predictive and divergent validity within the populations that were tested. Copyright © 2015 Elsevier Ltd. All rights reserved.
Houle, Janie; Mishara, Brian L; Chagnon, François
Men die by suicide three to four times more often than women in Western countries. The adverse impact of the traditional male gender role as well as men's reluctance to seek help are possible explanations of this gender gap, but these hypotheses have not been well documented empirically. This study compares two groups of men who experienced comparable severely stressful life events during the preceding 12 months: 40 men admitted to hospital emergency following suicide attempts, and 40 men with no history of suicide attempts. Structured interviews were conducted to measure adherence to the traditional male gender role, help seeking behaviour, social support, suicide acceptability and mental health. ANOVAS indicated that attempters are more likely to adhere to the traditional masculine gender role and regression analysis revealed that this relationship persists even when the presence of mental disorders is statistically controlled. Sequential regression analysis support the mediation model and show that the effects of the traditional male gender role on suicidal behavior are mediated through protective and risk factors for suicide, namely mental state, help seeking and social support. The traditional male gender role appears to increase the risk of suicidal behavior in men by undermining their mental state and by inhibiting the protective factors of help seeking and social support. This study underscores the importance of encouraging men to seek help.
Chambers, Brittany D; Erausquin, Jennifer Toller; Tanner, Amanda E; Nichols, Tracy R; Brown-Jeffy, Shelly
Despite decreases in infants born premature and at low birth weight in the United States (U.S.), racial disparities between Black and White women continue. In response, the purpose of this analysis was to examine associations between both traditional and novel indicators of county-level structural racism and birth outcomes among Black and White women. We merged individual-level data from the California Birth Statistical Master Files 2009-2013 with county-level data from the United States (U.S.) Census American Community Survey. We used hierarchical linear modeling to examine Black-White differences among 531,170 primiparous women across 33 California counties. Traditional (e.g., dissimilarity index) and novel indicators (e.g., Black to White ratio in elected office) were associated with earlier gestational age and lower birth weight among Black and White women. A traditional indicator was more strongly associated with earlier gestational age for Black women than for White women. This was the first study to empirically demonstrate that structural racism, measured by both traditional and novel indicators, is associated with poor health and wellbeing of infants born to Black and White women. However, findings indicate traditional indicators of structural racism, rather than novel indicators, better explain racial disparities in birth outcomes. Results also suggest the need to develop more innovative approaches to: (1) measure structural racism at the county-level and (2) reform public policies to increase integration and access to resources.
Huggins-Manley, Anne Corinne
This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Introduction: The aim of study was to evaluate the psychometric indices Sternberg love scale on married men and women in Iranian society. Methods: The study type is correlation (factor analysis). In this research factor analysis was used that is an exploratory and confirmatory technique to study the structure of a set of data, ...
The present research focuses on the psychometric properties of the Birthday Party measure for ages 3-5. The Birthday Party was developed to provide a reliable, valid, and engaging measure of early mathematical content--Number and Operation, Shape, Space, and Pattern--that can be given in either a short or a long form to English and Spanish…
Cannito, Michael P.
This study examined emotional characteristics of 18 female spasmodic dysphonic subjects in comparison to matched normal controls across psychometric measures of depression, anxiety, and somatic complaints. Statistically significant differences were noted between groups for all measures and over half of the dysphonic subjects exhibited clinically…
Longo, Matthew R.; Schuur, Friederike; Kammers, Marjolein P. M.; Tsakiris, Manos; Haggard, Patrick
What is it like to have a body? The present study takes a psychometric approach to this question. We collected structured introspective reports of the rubber hand illusion, to systematically investigate the structure of bodily self-consciousness. Participants observed a rubber hand that was stroked either synchronously or asynchronously with their…
Waugh, Mark H; Hopwood, Christopher J; Krueger, Robert F; Morey, Leslie C; Pincus, Aaron L; Wright, Aidan G C
The Diagnostic and Statistical Manual of Mental Disorders Fifth Edition (DSM-5) Section III Alternative Model for Personality Disorders (AMPD; APA, 2013) represents an innovative system for simultaneous psychiatric classification and psychological assessment of personality disorders (PD). The AMPD combines major paradigms of personality assessment and provides an original, heuristic, flexible, and practical framework that enriches clinical thinking and practice. Origins, emerging research, and clinical application of the AMPD for diagnosis and psychological assessment are reviewed. The AMPD integrates assessment and research traditions, facilitates case conceptualization, is easy to learn and use, and assists in providing patient feedback. New as well as existing tests and psychometric methods may be used to operationalize the AMPD for clinical assessments.
Malec, James F; Kragness, Miriam; Evans, Randall W; Finlay, Karen L; Kent, Ann; Lezak, Muriel D
To evaluate the internal consistency of the Mayo-Portland Adaptability Inventory (MPAI), further refine the instrument, and provide reference data based on a large, geographically diverse sample of persons with acquired brain injury (ABI). 386 persons, most with moderate to severe ABI. Outpatient, community-based, and residential rehabilitation facilities for persons with ABI located in the United States: West, Midwest, and Southeast. Rasch, item cluster, principal components, and traditional psychometric analyses for internal consistency of MPAI data and subscales. With rescoring of rating scales for 4 items, a 29-item version of the MPAI showed satisfactory internal consistency by Rasch (Person Reliability=.88; Item Reliability=.99) and traditional psychometric indicators (Cronbach's alpha=.89). Three rationally derived subscales for Ability, Activity, and Participation demonstrated psychometric properties that were equivalent to subscales derived empirically through item cluster and factor analyses. For the 3 subscales, Person Reliability ranged from.78 to.79; Item Reliability, from.98 to.99; and Cronbach's alpha, from.76 to.83. Subscales correlated moderately (Pearson r =.49-.65) with each other and strongly with the overall scale (Pearson r=.82-.86). Outcome after ABI is represented by the unitary dimension described by the MPAI. MPAI subscales further define regions of this dimension that may be useful for evaluation of clinical cases and program evaluation.
Dutch Translation and Psychometric Testing of the 9-Item Shared Decision Making Questionnaire (SDM-Q-9) and Shared Decision Making Questionnaire-Physician Version (SDM-Q-Doc) in Primary and Secondary Care.
Rodenburg-Vandenbussche, Sumayah; Pieterse, Arwen H; Kroonenberg, Pieter M; Scholl, Isabelle; van der Weijden, Trudy; Luyten, Gre P M; Kruitwagen, Roy F P M; den Ouden, Henk; Carlier, Ingrid V E; van Vliet, Irene M; Zitman, Frans G; Stiggelbout, Anne M
The SDM-Q-9 and SDM-Q-Doc measure patient and physician perception of the extent of shared decision making (SDM) during a physician-patient consultation. So far, no self-report instrument for SDM was available in Dutch, and validation of the scales in other languages has been limited. The aim of this study was to translate both scales into Dutch and assess their psychometric characteristics. Participants were patients and their treating physicians (general practitioners and medical specialists). Patients (N = 182) rated their consultation using the SDM-Q-9, 43 physicians rated their consultations using the SDM-Q-Doc (N = 201). Acceptability, reliability (internal consistency), and the factorial structure of the instruments were determined. For convergent validity the CPSpost was used. Reliabilities of both scales were high (alpha SDM-Q-9 0.88; SDM-Q-Doc 0.87). The SDM-Q-9 and SDM-Q-Doc total scores correlated as expected with the CPSpost (SDM-Q-9: r = 0.29; SDM-Q-Doc: r = 0.48) and were significantly different between the CPSpost categories, with lowest mean scores when the physician made the decision alone. Principal Component Analyses showed a two-component model for each scale. A confirmatory factor analysis yielded a mediocre, but acceptable, one-factor model, if Item 1 was excluded; for both scales the best indices of fit were obtained for a one-factor solution, if both Items 1 and 9 were excluded. The Dutch SDM-Q-9 and SDM-Q-Doc demonstrate good acceptance and reliability; they correlated as expected with the CPSpost and are suitable for use in Dutch primary and specialised care. Although the best model fit was found when excluding Items 1 and 9, we believe these items address important aspects of SDM. Therefore, also based on the coherence with theory and comparability with other studies, we suggest keeping all nine items of the scale. Further research on the SDM-concept in patients and physicians, in different clinical settings and different countries, is
Hammond Sean M
Full Text Available Abstract Background The quality of the Educational environment is a key determinant of a student centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation use of a comprehensive, valid and reliable instrument is essential. One of most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM. Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89, subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming's indices shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the
Hammond, Sean M
Abstract Background The quality of the Educational environment is a key determinant of a student centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation use of a comprehensive, valid and reliable instrument is essential. One of most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM). Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89), subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming\\'s indices) shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the device in future
Michels, Charlotte TJ; Boulton, Mary; Adams, Astrid; Wee, Bee; Peters, Michele
Background: Informal carers face many challenges in caring for patients with palliative care needs. Selecting suitable valid and reliable outcome measures to determine the impact of caring and carers’ outcomes is a common problem. Aim: To identify outcome measures used for informal carers looking after patients with palliative care needs, and to evaluate the measures’ psychometric properties. Design: A systematic review was conducted. The studies identified were evaluated by independent reviewers (C.T.J.M., M.B., M.P.). Data regarding study characteristics and psychometric properties of the measures were extracted and evaluated. Good psychometric properties indicate a high-quality measure. Data sources: The search was conducted, unrestricted to publication year, in the following electronic databases: Applied Social Sciences Index and Abstracts, Cumulative Index to Nursing and Allied Health Literature, The Cochrane Library, EMBASE, PubMed, PsycINFO, Social Sciences Citation Index and Sociological Abstracts. Results: Our systematic search revealed 4505 potential relevant studies, of which 112 studies met the inclusion criteria using 38 carer measures for informal carers of patients with palliative care needs. Psychometric properties were reported in only 46% (n = 52) of the studies, in relation to 24 measures. Where psychometric data were reported, the focus was mainly on internal consistency (n = 45, 87%), construct validity (n = 27, 52%) and/or reliability (n = 14, 27%). Of these, 24 measures, only four (17%) had been formally validated in informal carers in palliative care. Conclusion: A broad range of outcome measures have been used for informal carers of patients with palliative care needs. Little formal psychometric testing has been undertaken. Furthermore, development and refinement of measures in this field is required. PMID:26407683
Cabeza de Baca, Tomás; Black, Candace Jasmine; García, Rafael Antonio; Fernandes, Heitor Barcellos Ferreira; Wolf, Pedro Sofio Abril; Woodley of Menie, Michael Anthony
Copping, Campbell, and Muncer (2014) have recently published an article critical of the psychometric approach to the assessment of life history (LH) strategy. Their purported goal was testing for the convergent validation and examining the psychometric structure of the High-K Strategy Scale (HKSS). As much of the literature on the psychometrics of human LH during the past decade or so has emanated from our research laboratory and those of close collaborators, we have prepared this detailed response. Our response is organized into four main sections: (1) A review of psychometric methods for the assessment of human LH strategy, expounding upon the essence of our approach; (2) our theoretical/conceptual concerns regarding the critique, addressing the broader issues raised by the critique regarding the latent and hierarchical structure of LH strategy; (3) our statistical/methodological concerns regarding the critique, examining the validity and persuasiveness of the empirical case made specifically against the HKSS; and (4) our recommendations for future research that we think might be helpful in closing the gap between the psychometric and biometric approaches to measurement in this area. Clearly stating our theoretical positions, describing our existing body of work, and acknowledgintheir limitations should assist future researchers in planning and implementing more informed and prudent empirical research that will synthesize the psychometric approach to the assessment of LH strategy with complementary methods. PMID:25844774
Aurelio José Figueredo
Full Text Available Copping, Campbell, and Muncer (2014 have recently published an article critical of the psychometric approach to the assessment of life history (LH strategy. Their purported goal was testing for the convergent validation and examining the psychometric structure of the High-K Strategy Scale (HKSS. As much of the literature on the psychometrics of human LH during the past decade or so has emanated from our research laboratory and those of close collaborators, we have prepared this detailed response. Our response is organized into four main sections: (1 A review of psychometric methods for the assessment of human LH strategy, expounding upon the essence of our approach; (2 our theoretical/conceptual concerns regarding the critique, addressing the broader issues raised by the critique regarding the latent and hierarchical structure of LH strategy; (3 our statistical/methodological concerns regarding the critique, examining the validity and persuasiveness of the empirical case made specifically against the HKSS; and (4 our recommendations for future research that we think might be helpful in closing the gap between the psychometric and biometric approaches to measurement in this area. Clearly stating our theoretical positions, describing our existing body of work, and acknowledging their limitations should assist future researchers in planning and implementing more informed and prudent empirical research that will synthesize the psychometric approach to the assessment of LH strategy with complementary methods.
Psychometric properties of the portuguese version of the Jebsen-Taylor test for adults with mild hemiparesis Avaliação das propriedades pscicométricas da versão em português do teste de Jebsen Taylor para adultos com hemiparesia leve
Karina N. Ferreiro
Full Text Available OBJECTIVES: To evaluate the psychometric properties of the Portuguese version of the Jebsen-Taylor Test (JTT in patients with stroke. METHODS: Forty participants who suffered a stroke in the cerebral hemisphere were videotaped while performing the JTT. Scores were defined by the time taken to perform the tasks, and two physical therapists evaluated the performance of the participants. Intra- and inter-rater reliability was defined by intraclass correlation coefficients (ICC through videotape analysis. Cronbach's alpha and Pearson's correlation coefficient (r were used to measure the internal consistency of the scale. Confidence intervals (CI were calculated, and the influence of handedness and educational level on the JTT scores was evaluated. RESULTS: Inter-rater (ICC = 1.0; CI, 1.0-1.0 and intra-rater reliabilities (ICC=0.997; CI, 0.995-0.998 were excellent. Regarding internal consistency, Cronbach's α was 0.924. The item "writing a sentence" was less consistent than the other items (Cronbach's alpha=0.884. Pearson's r (item score - total score was lower for the item "small objects" (r=0.657. There was no significant influence of handedness or educational level on the JTT scores. CONCLUSIONS: Videotaping test performances can be a useful tool in multicenter studies if inter-rater reliability is appropriate. The inter- and intra-rater reliabilities of the Portuguese version of the JTT were excellent in patients with stroke. The JTT can be a valuable tool for evaluating dexterity in research protocols aiming at efficacy of rehabilitation interventions.OBJETIVOS: Avaliar as propriedades psicométricas da versão em Português do teste de Jebsen-Taylor (TJT em pacientes com acidente vascular encefálico (AVE. MÉTODOS: Quarenta pacientes com AVEs em hemisférios cerebrais foram filmados enquanto realizaram o TJT. A pontuação no teste é definida pelo tempo de execução de tarefas motoras. Duas fisioterapeutas avaliaram o desempenho dos
Sardá, Jamir; Nicholas, Michael K; Pimenta, Cibele A M; Asghari, Ali
Depression is a common contributor to suffering and disability in people with chronic pain. However, the assessment of depression in this population has been hampered by the presence of a number of somatic symptoms that are shared between chronic pain, treatment side-effects and traditional concepts of depression. As a result, the use of depression measures that do not contain somatic items has been encouraged. This study examined the psychometric properties of the Depression sub-scale of the Depression Anxiety and Stress Scales (DASS) in a Brazilian chronic pain patient population. Data on a number of measures were collected from 348 participants attending pain facilities. Principal components and exploratory factor analyses indicated the presence of only one factor. Item analyses indicated adequate item-scale correlations. The Cronbach alpha was .96, which suggests an excellent internal consistency. The DASS-Depression scale has adequate psychometric properties and its further use with Brazilian chronic pain populations can now be supported.
Full text of publication follows: this paper argues that perceptual control is an essential component in human risk evaluation. Control is seen as an integrative concept between the psychometric research paradigm and various psychological theories. The psychometric approach to the study of risk has mainly dealt with the intuitive judgements people do when they are asked to evaluate risky activities and technologies. It shows that people judge risk in relation to the possible consequences and probabilities related to an outcome; the former more typical for the public and the latter more often used by experts. The psychometric research tradition has concentrated on doing human risk evaluations quantifiable and the reactions predictable. This paper also relates to possible practical implications of this strategy, namely that humans react heterogeneously to different kinds of threats due to perceived control. Theoretical ability to explain and elaborate perceptions of risk, as well as individual reactions, were the main criteria for the literature selection, which includes work on e.g. attribution theory, locus of control, and learned helplessness. Thus, the paper addresses available psychological views for a contribution to a developed theoretical framework for human risk evaluation. It seeks to compare and integrate the psychometric research tradition within social psychological theories. The way in which people find their informational basis for their risk judgements, either from others or from their own perceptions is also discussed. Furthermore, the theories are related to the social and psychological reactions of the Chernobyl accident. The paper concludes that psychological theories can contribute to a more comprehensive framework for the understanding of human risk evaluation, leading to a more coherent and integrative knowledge. (author)
Pahlevan Sharif, Saeed
Purpose The purpose of this paper is to develop and evaluate psychometrically an instrument named the Breast Size Satisfaction Scale (BSSS) to assess breast size satisfaction. Design/methodology/approach The present scale was developed using a set of 16 computer-generated 3D images of breasts to overcome some of the limitations of existing instruments. The images were presented to participants and they were asked to select the figure that most accurately depicted their actual breast size and the figure that most closely represented their ideal breast size. Breast size satisfaction was computed by subtracting the absolute value of the difference between ideal and actual perceived size from 16, such that higher values indicate greater breast size satisfaction. Findings Study 1 ( n=65 female undergraduate students) showed good test-retest reliability and study 2 ( n=1,000 Iranian women, aged 18 years and above) provided support for convergent validity using a nomological network approach. Originality/value The BSSS demonstrated good psychometric properties and thus can be used in future studies to assess breast size satisfaction among women.
Lundgren, Tobias; Parling, Thomas
Psychological inflexibility and experiential avoidance are equivalent (with somewhat different connotations) concepts and refer to an unwillingness to remain in contact with particular private events. This concept is most often measured by the Acceptance and Action Questionnaire (AAQ-II) and is strongly related to psychopathology and behavioral effectiveness. In this study, the preliminary psychometric properties of the Swedish version of the AAQ-II (Swedish Acceptance and Action Questionnaire-SAAQ) are presented. The study is done in two steps. In the first step, the 10-item version of the AAQ-II is investigated through principal component analysis (n = 147). Secondly, due to problems with the component structure, the instrument is reduced to a six-item version and its validity and internal consistency are investigated (n = 154). The six-item version shows good concurrent and convergent validity as well as satisfying internal consistency (α = .85). Furthermore, the Swedish six-item version of the AAQ-II showed one strong component. Test-retest reliability was satisfactory (r = .80; n = 228). In future research, predictive and external validity would be important to investigate in order to further ensure that the SAAQ is a useful measure for clinical research. In conclusion, the SAAQ has satisfactory psychometric properties, but more data need to be gathered to further explore the possibilities for the instruments in Swedish contexts.
Graffigna, Guendalina; Barello, Serena; Bonanomi, Andrea; Lozza, Edoardo
Beyond the rhetorical call for increasing patients' engagement, policy makers recognize the urgency to have an evidence-based measure of patients' engagement and capture its effect when planning and implementing initiatives aimed at sustaining the engagement of consumers in their health. In this paper, authors describe the Patient Health Engagement Scale (PHE-scale), a measure of patient engagement that is grounded in rigorous conceptualization and appropriate psychometric methods. The scale was developed based on our previous conceptualization of patient engagement (the PHE-model). In particular, the items of the PHE-scale were developed based on the findings from the literature review and from interviews with chronic patients. Initial psychometric analysis was performed to pilot test a preliminary version of the items. The items were then refined and administered to a national sample of chronic patients (N = 382) to assess the measure's psychometric performance. A final phase of test-retest reliability was performed. The analysis showed that the PHE Scale has good psychometric properties with good correlation with concurrent measures and solid reliability. Having a valid and reliable measure to assess patient engagement is the first step in understanding patient engagement and its role in health care quality, outcomes, and cost containment. The PHE Scale shows a promising clinical relevance, indicating that it can be used to tailor intervention and assess changes after patient engagement interventions.
Eplov, Lene Falgaard; Petersen, Janne; Jørgensen, Torben
The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim of the pre......The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim...... 0.30 for the 12 and the 22 item scales. All five Mental Vulnerability scales had positively skewed score distributions which were associated significantly with both SCL-90-R symptom scores and NEO-PI-R personality scales (primarily Neuroticism and Extraversion). Coefficient alpha was highest...
Hafner, Brian J.; Morgan, Sara J.; Askew, Robert L.; Salem, Rana
Documentation of clinical outcomes is increasingly expected in delivery of prosthetic services and devices. However, many outcome measures suitable for use in clinical care and research have not been psychometrically tested with prosthesis users. The aim of this study was to determine test-retest reliability, mode-of-administration (MoA) equivalence, standard error of measurement (SEM), and minimal detectable change (MDC) of standardized, self-report instruments that assess constructs of impo...
Gardner, Ian A.; Greiner, Matthias
Receiver-operating characteristic (ROC) curves provide a cutoff-independent method for the evaluation of continuous or ordinal tests used in clinical pathology laboratories. The area under the curve is a useful overall measure of test accuracy and can be used to compare different tests (or...... different equipment) used by the same tester, as well as the accuracy of different diagnosticians that use the same test material. To date, ROC analysis has not been widely used in veterinary clinical pathology studies, although it should be considered a useful complement to estimates of sensitivity...... and specificity in test evaluation studies. In addition, calculation of likelihood ratios can potentially improve the clinical utility of such studies because likelihood ratios provide an indication of how the post-test probability changes as a function of the magnitude of the test results. For ordinal test...
Full Text Available Background and purpose: Using valid and reliable instruments is an important way for collecting data in qualitative researches. This paper is a report of a study conducted to examine the extent of psychometric properties of the scales in research papers published in Journal of Advanced Nursing.Methods: In this study, the Journal of Advanced Nursing was chosen for systematic review. All articles which were published during 2007-2009 in this journal were collected and articles related to instrument development were selected. Each article was completely reviewed to identify the methods of instrument validation and reliability.Results: From 980 articles published in Journal of Advanced Nursing during 2007-2009, 41 (4.18% articles were about research methodology. In these, 12 articles (29.27% were related to developing an instrument. In this study, review of 12 articles that published in Journal of Advanced Nursing, 2007-2009, showed that some of the articles did not measure psychometric properties properly, thus some of the developed scales need to measure other types of necessary validity. In addition, reliability testing needs to be performed on each instrument used in a study before other statistical analysis are performed. From 12 articles, all of the articles measured and reported Cronbach’s alpha, but four of them did not measure test-retest.Conclusions: Although researchers put a great emphasis on methodology and statistical analysis, they pay less attention to the psychometric properties of their new instruments. The authors of this article hope to draw the attention of researcher to the importance of measuring psychometric properties of new instruments.Keywords: PSYCHOMETRIC, SCALES, CRITICAL REVIEW
Hong, Ickpyo; Lee, Mi Jung; Kim, Moon Young; Park, Hae Yean
The aim of this study is to investigate the psychometrics of the 12 items of an instrument assessing activities of daily living (ADL) using an item response theory model. A total of 648 adults with physical disabilities and having difficulties in ADLs were retrieved from the 2014 Korean National Survey on People with Disabilities. The psychometric testing included factor analysis, internal consistency, precision, and differential item functioning (DIF) across categories including sex, older age, marital status, and physical impairment area. The sample had a mean age of 69.7 years old (SD = 13.7). The majority of the sample had lower extremity impairments (62.0%) and had at least 2.1 chronic conditions. The instrument demonstrated unidimensional construct and good internal consistency (Cronbach's alpha = 0.95). The instrument precisely estimated person measures within a wide range of theta values (-2.22 logits 5.0%). Our findings indicate that the dressing item would need to be modified to improve its psychometrics. Overall, the ADL instrument demonstrates good psychometrics, and thus, it may be used as a standardized instrument for measuring disability in rehabilitation contexts. However, the findings are limited to adults with physical disabilities. Future studies should replicate psychometric testing for survey respondents with other disorders and for children.
Traditional African healing has been in existence for many centuries yet many people still seem not to understand how it relates to God and religion/spirituality. Some people seem to believe that traditional healers worship the ancestors and not God. It is therefore the aim of this paper to clarify this relationship by discussing a chain of communication between the worshipers and the Almighty God. Other aspects of traditional healing namely types of traditional healers, training of tradition...
Risk perception has traditionally been conceived as a cognitive phenomenon, basically a question of information processing. The very term perception suggests that information processing is involved and of crucial importance. Kahneman and Tversky suggested that the use of 'heuristics' in the intuitive estimation of probabilities accounts for biased probability perception, hence claiming to explain risk perception as well. The psychometric approach of Slovic et al, a further step in in the cognitive tradition, conceives of perceived risk as a function of general properties of a hazard. However, the psychometric approach is shown here to explain only about 20% of the variance of perceived risk, even less of risk acceptability. Its claim to explanatory power is based on a statistical illusion: mean values were investigated and accounted for, across hazards. A currently popular alternative to the psychometric tradition, Cultural Theory, is even less successful and explains only about 5% of the variance of perceived risk. The claims of this approach were also based on a statistical illusion: 'significant' results were reported and interpreted as being of substantial importance. The present paper presents a new approach: attitude to the risk generating technology, general sensitivity to risks and specific risk explained well over 60% of the variance of perceived risk of nuclear waste, in a study of extensive data from a representative sample of the Swedish population. The attitude component functioning as an explanatory factor of perceived risk, rather than as a consequence of perceived risk, suggests strongly that perceived risk is something other than cognition. Implications for risk communication are discussed. (author)
McGilton, Katherine S
The purpose of this study was to develop and evaluate the psychometric properties of 2 supportive leadership scales, the Charge Nurse Support Scale and the Unit Manager Support Scale, designed for long-term-care environments. These 6-item self-report scales were administered to 70 nursing staff and their internal consistency reliability, test-retest reliability, content validity, factor structure, and construct validity investigated. Content validity was established with the assistance of experts. Both scales were deemed reliable. As hypothesized, a significant relationship was found between the measure of how nursing staff related to residents and measures of charge nurses' supportive behaviours (r = .42, p = .05). Reliable and valid measures of supportive leadership could be developed for use in identifying the quality of support provided to staff in long-term-care environments.
Shi, Lu-Feng; Zaki, Nancy A
The present study attempted to establish psychometric function in individuals whose first language is not English. Psychometric function was obtained for one of the most commonly used clinical tests, the Northwestern University Auditory Test No. 6 (Tillman & Carhart 1966), so that findings could be directly applied to everyday clinical practice. Five groups of 14 normal-hearing, adult listeners differing in their first language and dominant language (English monolinguals, English- and Arabic-dominant Arabic-English bilinguals, and English- and Russian-dominant Russian-English bilinguals) participated. Both forms of the Northwestern University Auditory Test No. 6 test (8 lists of 50 monosyllabic English words) were presented. The lists were randomly assigned to eight signal-to-noise ratios (-3 to 18 dB in 3 dB steps). Listeners responded verbally and in writing. Psychometric functions were derived via logistic regression and described by two parameters: the 50% correct performance level (θ) and the slope (k). Both English-dominant bilingual groups obtained psychometric functions comparable with monolinguals. The θ and k of the functions for these three groups of participants were consistent with the literature. Compared with these three groups, non-English-dominant bilinguals' functions grew significantly more gradually (i.e., a significantly higher θ and a significantly lower k). No differences in either θ or k were found between bilinguals with the same dominant language but different first languages. Bilinguals reporting themselves to be dominant in English generate monolingual-like psychometric functions. By contrast, a different set of psychometric properties describes the function of bilinguals dominant in their first language. Because first language did not appear to be a significant factor in determining bilinguals' functions, it is concluded that English learning history and English proficiency are more important variables than first language for
Angelopoulou, Matina V; Kavvadia, Katerina; Taoufik, Konstantina; Oulis, Constantine J
School based oral health education through traditional lecturing has been found successful only in improving oral health knowledge, while has low effectiveness in oral hygiene and gingival health. The aim of this study was to evaluate the effectiveness of experiential learning (EL) oral health education to traditional lecturing (TL), on enhancing oral health knowledge, attitude and behavior as well as oral hygiene, gingival health and caries of 10-year-old children. Eighty-four children were recruited for the EL and 100 for the TL group from 3 locations in Greece. Data regarding oral health knowledge, attitude and behavior were collected via questionnaires. Data regarding dental plaque, gingivitis and caries were collected by clinical examination. The evaluation using questionnaires and clinical examination was assessed at baseline and 6 and 18 months afterwards. Two calibrated pediatric dentists examined the students using a periodontal probe and artificial light. Modified hygiene index (HI) was used for dental plaque recording, the simplified gingival index (GI-S) was used for gingivitis and DMFT, based on BASCD criteria, for dental caries. Based on a dedicated manual, the teacher applied in the classroom the oral health educational program using EL. EL group had statistically significant better hygiene than the TL at 6 months (p 0.05) and attitude (p > 0.05) at 6 months in comparison to baseline. EL program was found more successful than TL in oral hygiene improvement. Both oral health education programs improved the oral health knowledge, attitude and behavior of children. ClinicalTrials.gov (NCT02320162).
Othman Mohamad Hashim
Full Text Available Urinalysis was used in previous studies among higher institution students (n=16252 in Malaysia to answer the question of whether university students are involved in drug abuse. However, the use of urinalysis had faced some problems. The problems were related to human rights issues and the cost to perform the urinalysis was expensive and quite impossible to be implemented to a large population of university students. To overcome this problem, this study was conducted to examine the effectiveness of psychometric measures in screening drug, alcohol and substance abuse. The Substance Abuse Subtle Screening Inventory A2 (SASSI-A2 was used for this purpose. SASSI-A2 is a brief screening tool designed to identify individuals who have a high probability of having a substance use disorder, including both substance abuse and substance dependence. SASSI-A2 comprises of 72 items that are rated on a two point scale with response; true and false. SASSI-A2 was translated into Malay language and it was refined through a back-translation technique and focus group approach. Psychometric testing was undertaken on a sample of 750 university students from five public universities in Malaysia. All participants were aged between 19 and 20 years. Internal consistency coefficients were calculated for the total scale and its subscales. Chronbach's alpha obtained for SASSI-A2 was 0.72. This relatively high level of Chronbach's alpha showed relatively high level of reliability. The results demonstrated that the whole SASSI-A2 meets the fundamental measurement properties and can discriminate groups of higher institution students from high to low on the substance dependency variable. The accuracy of the test has been found to be unaffected by gender, ethnicity, age and years of education. Although more rigorous validation studies are needed, it is recommended that SASSI-A2 be considered for usage to higher institution students populations when a brief, objective, and
Mokgobi, M G
Traditional African healing has been in existence for many centuries yet many people still seem not to understand how it relates to God and religion/spirituality. Some people seem to believe that traditional healers worship the ancestors and not God. It is therefore the aim of this paper to clarify this relationship by discussing a chain of communication between the worshipers and the Almighty God. Other aspects of traditional healing namely types of traditional healers, training of traditional healers as well as the role of traditional healers in their communities are discussed. In conclusion, the services of traditional healers go far beyond the uses of herbs for physical illnesses. Traditional healers serve many roles which include but not limited to custodians of the traditional African religion and customs, educators about culture, counselors, social workers and psychologists.
Lundman, Berit; Årestedt, Kristofer; Norberg, Astrid; Norberg, Catharina; Fischer, Regina Santamäki; Lövheim, Hugo
This study tested the psychometric properties of a Swedish version of the Self-Transcendence Scale (STS). Cohen's weighted kappa, agreement, absolute reliability, relative reliability, and internal consistency were calculated, and the underlying structure of the STS was established by exploratory factor analysis. There were 2 samples available: 1 including 194 people aged 85-103 years and a convenience sample of 60 people aged 21-69 years. Weighted kappa values ranged from .40 to .89. The intraclass correlation coefficient for the original STS was .763, and the least significant change between repeated tests was 6.25 points. The revised STS was found to have satisfactory psychometric properties, and 2 of the 4 underlying dimensions in Reed's self-transcendence theory were supported.
Idrovo, Alvaro J; Camacho-Avila, Anabel; García-Rivas, Javier; Juárez-García, Arturo
Most studies on social capital and health are carried out with large home-based surveys, neglecting that many interactions among individuals occur in the workplace. The objective of this study was to explore the psychometric properties of a scale in Spanish used to measure social capital at work. The scale designed by Kouvonen et al was translated into Spanish and tested under classical test theory, item response theory, and confirmatory factorial analysis; 152 public health workers from different socio-cultural contexts participated in the survey. Internal consistency was high (Chronbach's alpha = 0.88). Social capital at work correlated properly with two Job Content Questionnaire dimensions. A ceiling effect was detected and item difficulty was quantified. The confirmatory factor analysis showed the expected theoretical components of social capital: bonding, bridging and trust. The scale has acceptable psychometric properties, thus it can be used in future studies.
García-Fernández, José Manuel; Espada Sánchez, José Pedro; Orgilés Amorós, Mireia; Méndez Carrillo, Xavier
This paper describes the psychometric properties of a new children's self-report measure. The School Fears Survey Scale, Form II (SFSS-II) assesses school fears in children from ages 8 to 11. The factor solution with a Spanish sample of 3,665 children isolated four factors: Fear of academic failure and punishment, fear of physical discomfort, fear of social and school assessment and anticipatory and separation anxiety. The questionnaire was tested by confirmatory factor analysis, which accounted for 55.80% of the total variance. Results indicated that the SFSS-II has a high internal consistency (alpha= .89). The results revealed high test-retest reliability and appropriate relationship with other scales. The age by gender interaction was significant. Two-way analysis of variance found that older children and girls had higher anxiety. The instrument shows adequate psychometric guarantees and can be used for the multidimensional assessment of anxiety in clinical and educational settings.
Seo, Daeryong; Taherbhai, Husein; Frantz, Roger
The importance of listening in the context of English language acquisition is gaining acceptance, but its unique attributes in language performance, while substantively and qualitatively justifiable, are generally not psychometrically defined. This article psychometrically supports listening as a distinct domain among the three other domains of…
May, Keith A; Solomon, Joshua A
In a 2-alternative forced-choice (2AFC) discrimination task, observers choose which of two stimuli has the higher value. The psychometric function for this task gives the probability of a correct response for a given stimulus difference, Δx. This paper proves four theorems about the psychometric function. Assuming the observer applies a transducer and adds noise, Theorem 1 derives a convenient general expression for the psychometric function. Discrimination data are often fitted with a Weibull function. Theorem 2 proves that the Weibull "slope" parameter, β, can be approximated by β(Noise) x β(Transducer), where β(Noise) is the β of the Weibull function that fits best to the cumulative noise distribution, and β(Transducer) depends on the transducer. We derive general expressions for β(Noise) and β(Transducer), from which we derive expressions for specific cases. One case that follows naturally from our general analysis is Pelli's finding that, when d' ∝ (Δx)(b), β ≈ β(Noise) x b. We also consider two limiting cases. Theorem 3 proves that, as sensitivity improves, 2AFC performance will usually approach that for a linear transducer, whatever the actual transducer; we show that this does not apply at signal levels where the transducer gradient is zero, which explains why it does not apply to contrast detection. Theorem 4 proves that, when the exponent of a power-function transducer approaches zero, 2AFC performance approaches that of a logarithmic transducer. We show that the power-function exponents of 0.4-0.5 fitted to suprathreshold contrast discrimination data are close enough to zero for the fitted psychometric function to be practically indistinguishable from that of a log transducer. Finally, Weibull β reflects the shape of the noise distribution, and we used our results to assess the recent claim that internal noise has higher kurtosis than a Gaussian. Our analysis of β for contrast discrimination suggests that, if internal noise is stimulus
Keith A May
Full Text Available In a 2-alternative forced-choice (2AFC discrimination task, observers choose which of two stimuli has the higher value. The psychometric function for this task gives the probability of a correct response for a given stimulus difference, Δx. This paper proves four theorems about the psychometric function. Assuming the observer applies a transducer and adds noise, Theorem 1 derives a convenient general expression for the psychometric function. Discrimination data are often fitted with a Weibull function. Theorem 2 proves that the Weibull "slope" parameter, β, can be approximated by β(Noise x β(Transducer, where β(Noise is the β of the Weibull function that fits best to the cumulative noise distribution, and β(Transducer depends on the transducer. We derive general expressions for β(Noise and β(Transducer, from which we derive expressions for specific cases. One case that follows naturally from our general analysis is Pelli's finding that, when d' ∝ (Δx(b, β ≈ β(Noise x b. We also consider two limiting cases. Theorem 3 proves that, as sensitivity improves, 2AFC performance will usually approach that for a linear transducer, whatever the actual transducer; we show that this does not apply at signal levels where the transducer gradient is zero, which explains why it does not apply to contrast detection. Theorem 4 proves that, when the exponent of a power-function transducer approaches zero, 2AFC performance approaches that of a logarithmic transducer. We show that the power-function exponents of 0.4-0.5 fitted to suprathreshold contrast discrimination data are close enough to zero for the fitted psychometric function to be practically indistinguishable from that of a log transducer. Finally, Weibull β reflects the shape of the noise distribution, and we used our results to assess the recent claim that internal noise has higher kurtosis than a Gaussian. Our analysis of β for contrast discrimination suggests that, if internal noise is
Full Text Available Starting with the common origins of biometrics and psychometrics at the beginning of the twentieth century, the paper compares and contrasts subsequent developments, informed by the author's 35 years at Rothamsted Experimental Station followed by a period with the data theory group in Leiden and thereafter. Although the methods used by biometricians and psychometricians have much in common, there are important differences arising from the different fields of study. Similar differences arise wherever data are generated and may be regarded as a major driving force in the development of statistical ideas.
TEMPLE, SUSANNAH FLEUR
Functional Fluency denotes efficacy of interpersonal functioning in terms of flexibility and balance of the behavioural modes a person uses. The aim of this project is to design and create a psychometric tool for mapping the patterns of such functioning. The intention is that feedback on the test results will stimulate the insights and understanding to support and encourage positive behavioural change. This process, involving the development of self-awareness, which is a key as...
Burton, Amy L.; Hay, Phillipa; Kleitman, Sabina; Smith, Evelyn; Raman, Jayanthi; Swinbourne, Jessica; Touyz, Stephen W.; Abbott, Maree J.
Background The Eating Beliefs Questionnaire (EBQ) is a 27-item self-report measure that assesses positive and negative beliefs about binge eating. It has been validated and its factor structure explored in a non-clinical sample. This study tested the psychometric properties of the EBQ in a clinical and a non-clinical sample. Method A sample of 769 participants (573 participants recruited from the university and general community, 76 seeking treatment for an eating disorder and 120 participati...
Marc, Linda G.; Wang, Ming-Mei; Testa, Marcia A.
The objective of this paper is to psychometrically validate the HIV Symptom Distress Scale (SDS), an instrument that can be used to measure overall HIV symptom distress or clinically relevant groups of HIV symptoms. A secondary data analysis was conducted using the Collaborations in HIV Outcomes Research U.S. Cohort (CHORUS). Inclusion criteria required study participants (N=5,521) to have a valid baseline measure of the AIDS Clinical Trial Group Symptom Distress Module, with an SF-12 or SF-36 completed on the same day. Psychometric testing assessed unidimensionality, internal consistency and factor structure using exploratory and confirmatory factor analysis, and structural equation modeling (SEM). Construct validity examined whether the new measure discriminates across clinical significance (CD4 and HIV viral load). Findings show that the SDS has high reliability (α=0.92), and SEM supports a correlated second-order factor model (physical and mental distress) with acceptable fit (GFI=0.88, AGFI=0.85, NFI=0.99, NNFI=0.99; RMSEA=0.06, [90% CI 0.06 – 0.06]; Satorra Bentler Scaled, C2 =3274.20; p=0.0). Construct validity shows significant differences across categories for HIV-1 viral load (p< 0.001) and CD4 (p< 0.001). Differences in mean SDS scores exist across gender (p< 0.001), race/ethnicity (p< 0.05) and educational attainment (p < 0.001). Hence, the HIV Symptom Distress Scale is a reliable and valid instrument, which measures overall HIV symptoms or clinically relevant groups of symptoms. PMID:22409246
Serrani Azcurra, Daniel Jorge Luis
Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. As important the concept as it is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish speaking people. Translate and adapt the Health Empowerment Scale (HES) for a Spanish-speaking older adults sample and perform its psychometric validation. The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and Psychometric Analyses were conducted on 648 urban-dwelling seniors. The HES had an acceptable internal consistency with a Cronbach's α of 0.89. The convergent validity was supported by significant Pearson's Coefficient correlations between the HES total and item scores and the General Self Efficacy Scale (r= 0.77), Swedish Rheumatic Disease Empowerment Scale (r= 0.69) and Making Decisions Empowerment Scale (r= 0.70). Construct validity was evaluated using item analysis, half-split test and corrected item to total correlation coefficients; with good internal consistency (α> 0.8). The content validity was supported by Scale and Item Content Validity Index of 0.98 and 1.0, respectively. HES had acceptable face validity and reliability coefficients; which added to its ease administration and users' unbiased comprehension, could set it as a suitable tool in evaluating elder's outpatient empowerment-based medical education programs.
Manuel Salvador Ortiz Parada
Full Text Available The present study tested the psychometric properties of the Zimet et al.’ Multidimensional Scale of Perceived Social Support (MSPSS, in patients with Type 2 Diabetes Mellitus (DM (n = 76 from Temuco City, Chile. The total scale shown appropriate levels of internal consistency (0,849. An exploratory factorial analysis, with Varimax rotation, was performed over the MSPSS measures. In agreement with the original Scale, three factors were obtained explaining 66,8% of the variance. These results suggest that MSPSS is a psychometrically sound instrument that can be applied to patients with Type 2 Diabetes.
Aim: The aim of this research was to evaluate the psychometric properties of a measure of entrepreneurial climate. Entrepreneurial climate was measured using a shortened version of the Hornsby, Kuratko and Zahra (2002 instrument, called the Corporate Entrepreneurship Assessment Instrument (CEAI. Making information on the psychometric properties of the instrument available directly relates to its utility. Setting: The setting was medium to large South African companies. A random sample of employees was drawn from 53 selected companies across South Africa, with 60 respondents per company (N = 3 180. Methods: A cross-sectional survey design was used. Several instruments were administered, including the shortened version of the CEAI. Cronbach’s alpha was used to test for reliability and several methods were used to test for validity. Correlation analysis was used to test for concurrent validity, convergent validity and divergent validity. Principle component factor analysis was used to test for factorial validity and a t-test to test for known-group validity. Results: The results showed that the reliability for the total score of the shortened version of the CEAI was acceptable at 0.758. The results also showed some evidence of concurrent validity, as well as homogeneity among the items. With regard to factorial validity, all items loaded in accordance with the subscales of the instrument. The measure was able to distinguish, as expected, between government organisations and private business entities, suggesting known-group validity. Convergent validity and divergent validity were also assessed. Interesting to note was that entrepreneurship climate correlates more with general employee attitude (e.g. employee engagement; R= 0.420, p < 0.001 and organisational commitment, R = 0.331, p < 0.001 than with self-reported innovation (R = 0.277, p < 0.001 and R = 0.267, p < 0.001. Contribution: This paper not only provided information on the reliability
Chang, Chi-Cheng; Shu, Kuen-Ming; Liang, Chaoyun; Tseng, Ju-Shih; Hsu, Yu-Sheng
The purpose of this study is to examine the effects of blended e-learning on electrical machinery performance (achievement test and self-assessment). Participants were two classes of 11th graders majoring in electrical engineering and taking the electrical machinery class at a vocational high school in Taiwan. The participants were randomly…
Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Eric Swanson, MD
Full Text Available Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
Richardson, George B; Chen, Ching-Chen; Dai, Chia-Liang; Brubaker, Michael D; Nedelec, Joseph L
Many published studies have employed the Mini-K to measure a single fast-slow life history dimension. However, the internal structure of the Mini-K has not been determined and it is not clear that a single higher order K-factor fits the data. It is also not clear that the Mini-K is measurement invariant across groups such as the sexes. To establish the construct validity of K as well as the broader usefulness of applying life history theory to humans, it is crucial that these psychometric issues are addressed as a part of measure validation efforts. Here we report on three studies that used latent variable modeling and data drawn from two college student samples ( ns = 361 and 300) to elucidate the psychometrics of the Mini-K. We found that (a) the Mini-K had a six dimensional first-order structure, (b) the K-factor provided a parsimonious explanation of the associations among the lower order factors at no significant cost to fit, (c) the Mini-K measured the same K-factor across the sexes, (d) K-factor means did not have the same meaning across the sexes and thus the first-order factors should be used in studies of mean sex differences, and finally, (e) the K-factor was only associated with environment and aspects of mating competition in females. Implications and future directions for life history research are discussed.
V. Duprez (Veerle); S.M. van Hooft (Susanne); J. Dwarswaard (Jolanda); A.L. van Staa (AnneLoes); A. Van Hecke (Ann); M.M.H. Strating (Mathilde)
markdownabstract__Aim:__ To develop and psychometrically test the self-efficacy and performance in self-management support (SEPSS) instrument. __Background:__ Facilitating persons with a chronic condition to take an active role in the management of their condition, implicates that nurses acquire
Slaney, Kathleen L.; Tkatchouk, Masha; Gabriel, Stephanie M.; Ferguson, Leona P.; Knudsen, Jared R. S.; Legere, Julien C.
The primary aim of the present study is to determine whether the psychometric evaluation practices and test-analytic rationales of researchers publishing in journals with a measurement focus differ from those of researchers publishing in journals with varying substantive research foci. Several components of two different samples of articles were…
Fazeli, Seyed Hossein
The current study aims to analyze the psychometric qualities of the Persian adapted version of Strategy Inventory for Language Learning (SILL) developed by Rebecca L. Oxford (1990). Three instruments were used: Persian adapted version of SILL, a Background Questionnaire, and Test of English as a Foreign Language. Two hundred and thirteen Iranian…
Gillespie, Brigid M; Polit, Denise F; Hamlin, Lois; Chaboyer, Wendy
This paper describes the development and validation of the Revised Perioperative Competence Scale (PPCS-R). There is a lack of a psychometrically tested sound self-assessment tools to measure nurses' perceived competence in the operating room. Content validity was established by a panel of international experts and the original 98-item scale was pilot tested with 345 nurses in Queensland, Australia. Following the removal of several items, a national sample that included all 3209 nurses who were members of the Australian College of Operating Room Nurses was surveyed using the 94-item version. Psychometric testing assessed content validity using exploratory factor analysis, internal consistency using Cronbach's alpha, and construct validity using the "known groups" technique. During item reduction, several preliminary factor analyses were performed on two random halves of the sample (n=550). Usable data for psychometric assessment were obtained from 1122 nurses. The original 94-item scale was reduced to 40 items. The final factor analysis using the entire sample resulted in a 40 item six-factor solution. Cronbach's alpha for the 40-item scale was .96. Construct validation demonstrated significant differences (pperceived competence scores relative to years of operating room experience and receipt of specialty education. On the basis of these results, the psychometric properties of the PPCS-R were considered encouraging. Further testing of the tool in different samples of operating room nurses is necessary to enable cross-cultural comparisons. Copyright © 2011 Elsevier Ltd. All rights reserved.
Fang, Ke; Pieterse, Alex L.; Friedlander, Myrna; Cao, Junhong
This investigation tested the psychometric properties of the Attitudes Toward Seeking Professional Psychological Help Scale-Short Form (ATSPPH-SF; Fisher and Farina ["Journal of College Student Development, 36", 368-373, 1995]) in a sample of 338 Mainland Chinese college students. Using back-translation, the ATSPPH-SF was translated into…
Masters, Kevin S; Ross, Kaile M; Hooker, Stephanie A; Wooldridge, Jennalee L
There has been a notable disconnect between theories of behavior change and behavior change interventions. Because few interventions are both explicitly and adequately theory-based, investigators cannot assess the impact of theory on intervention effectiveness. Theory-based interventions, designed to deliberately engage the theory's proposed mechanisms of change, are needed to adequately test theories. Thus, systematic approaches to theory-based intervention development are needed. This article will introduce and discuss the psychometric method of developing theory-based interventions. The psychometric approach to intervention development utilizes basic psychometric principles at each step of the intervention development process in order to build a theoretically driven intervention to, subsequently, be tested in process (mechanism) and outcome studies. Five stages of intervention development are presented as follows: (i) Choice of theory; (ii) Identification and characterization of key concepts and expected relations; (iii) Intervention construction; (iv) Initial testing and revision; and (v) Empirical testing of the intervention. Examples of this approach from the Colorado Meaning-Activity Project (COMAP) are presented. Based on self-determination theory integrated with meaning or purpose, and utilizing a motivational interviewing approach, the COMAP intervention is individually based with an initial interview followed by smart phone-delivered interventions for increasing daily activity. The psychometric approach to intervention development is one method to ensure careful consideration of theory in all steps of intervention development. This structured approach supports developing a research culture that endorses deliberate and systematic operationalization of theory into behavior change intervention from the outset of intervention development.
Muller-Staub, M.; Lunney, M.; Lavin, M.A.; Needham, I.; Odenbreit, M.; Achterberg, T. van
The instrument Q-DIO was developed in the years 2005 till 2006 to measure the quality of documented nursing diagnoses, interventions, and nursing sensitive patient outcomes. Testing psychometric properties of the Q-DIO (Quality of nursing Diagnoses, Interventions and Outcomes.) was the study aim.
Heaton Lisa J
Full Text Available Abstract Background It would be useful to have psychometrically-sound measures of dental fear for Hispanics, who comprise the largest ethnic minority in the United States. We report on the psychometric properties of Spanish-language versions of two common adult measures of dental fear (Modified Dental Anxiety Scale, MDAS; Dental Fear Survey, DFS, as well as a measure of fear of dental injections (Needle Survey, NS. Methods Spanish versions of the measures were administered to 213 adults attending Hispanic cultural festivals, 31 students (who took the questionnaire twice, for test-retest reliability, and 100 patients at a dental clinic. We also administered the questionnaire to 136 English-speaking adults at the Hispanic festivals and 58 English-speaking students at the same college where we recruited the Spanish-speaking students, to compare the performance of the English and Spanish measures in the same populations. Results The internal reliabilities of the Spanish MDAS ranged from 0.80 to 0.85. Values for the DFS ranged from 0.92 to 0.96, and values for the NS ranged from 0.92 to 0.94. The test-retest reliabilities (intra-class correlations for the three measures were 0.69, 0.86, and 0.94 for the MDAS, DFS, and NS, respectively. The three measures showed moderate correlations with one another in all three samples, providing evidence for construct validity. Patients with higher scores on the measures were rated as being more anxious during dental procedures. Similar internal reliabilities and correlations were found in the English-version analyses. The test-retest values were also similar in the English students for the DFS and NS; however, the English test-retest value for the MDAS was better than that found in the Spanish students. Conclusion We found evidence for the internal reliability, construct validity, and criterion validity for the Spanish versions of the three measures, and evidence for the test-retest reliability of the Spanish
Mittelstädt, Justin M; Pecena, Yvonne; Oubaid, Viktor; Maschke, Peter
This paper investigates personality traits as potential factors for success in an astronaut selection by comparing personality profiles of unsuccessful and successful astronaut candidates in different phases of the ESA selection procedure. It is further addressed whether personality traits could predict an overall assessment rating at the end of the selection. In 2008/2009, ESA performed an astronaut selection with 902 candidates who were either psychologically recommended for mission training (N = 46) or failed in basic aptitude (N = 710) or Assessment Center and interview testing (N = 146). Candidates completed the Temperament Structure Scales (TSS) and the NEO Personality Inventory Revised (NEO-PI-R). Those candidates who failed in basic aptitude testing showed higher levels of Neuroticism (M = 49.8) than the candidates who passed that phase (M = 45.4 and M = 41.6). Additionally, candidates who failed in basic testing had lower levels of Agreeableness (M = 132.9) than recommended candidates (M = 138.1). TSS scales for Achievement (r = 0.19) and Vitality (r = 0.18) showed a significant correlation with the overall assessment rating given by a panel board after a final interview. Results indicate that a personality profile similar to Helmreich's "Right Stuff" is beneficial in astronaut selection. Influences of test anxiety on performance are discussed. Mittelstädt JM, Pecena Y, Oubaid V, Maschke P. Psychometric personality differences between candidates in astronaut selection. Aerosp Med Hum Perform. 2016; 87(11):933-939.
González, David Andrés; Boals, Adriel; Jenkins, Sharon Rae; Schuler, Eric R; Taylor, Daniel
Students and young adults have high rates of suicide and depression, thus are a population of interest. To date, there is no normative psychometric information on the IDS and QIDS in these populations. Furthermore, there is equivocal evidence on the factor structure and subscales of the IDS. Two samples of young adult students (ns=475 and 1681) were given multiple measures to test the psychometrics and dimensionality of the IDS and QIDS. The IDS, its subscales, and QIDS had acceptable internal consistencies (αs=.79-90) and favorable convergent and divergent validity correlations. A three-factor structure and two Rasch-derived subscales best fit the IDS. The samples were collected from one university, which may influence generalizability. The IDS and QIDS are desirable measures of depressive symptoms when studying young adult students. Copyright © 2013 Elsevier B.V. All rights reserved.
Griffith, James W; Sumner, Jennifer A; Raes, Filip; Barnhofer, Thorsten; Debeer, Elise; Hermans, Dirk
Autobiographical memory is a multifaceted construct that is related to psychopathology and other difficulties in functioning. Across many studies, a variety of methods have been used to study autobiographical memory. The relationship between overgeneral autobiographical memory (OGM) and psychopathology has been of particular interest, and many studies of this cognitive phenomenon rely on the Autobiographical Memory Test (AMT) to assess it. In this paper, we examine several methodological approaches to studying autobiographical memory, and focus primarily on methodological and psychometric considerations in OGM research. We pay particular attention to what is known about the reliability, validity, and methodological variations of the AMT. The AMT has adequate psychometric properties, but there is great variability in methodology across studies that use it. Methodological recommendations and suggestions for future studies are presented. Copyright © 2011 Elsevier Ltd. All rights reserved.
Travis A. Ryan
Full Text Available The Drive for Muscularity Attitudes Questionnaire (DMAQ was developed to measure men’s desire to attain an idealized muscular body. To date, the cross-cultural suitability of this measure has received limited attention. The current study addressed this omission by testing the psychometric properties of the DMAQ using an online sample of Irish men (N = 327. Confirmatory factor analysis revealed that a unidimensional model adequately matched observed data (i.e., fit indices suggested acceptable model fit. Analyses also showed that the DMAQ yielded reliable and construct valid scores, suggesting that the scale holds promise as an indicant of the drive for muscularity among Irish men. Strengths and limitations associated with this study are discussed, such as advantages and disadvantages of Internet research. Directions for future research are given, including the need for more psychometric work.
Hannan, Jean; Youngblut, JoAnne M; Brooten, Dorothy; Bazzani, Dianne; Romero, Norma R; Chavez, Blanca; Picanes, Joann
Measuring stress in Hispanic Americans, the fastest growing U.S. minority, is problematic. The Life Events Inventory (LEI) and the Daily Hassles Scale (DHS), widely used stress instruments, are not available in Spanish. To test the psychometric properties of the translated Spanish versions of the LEI and DHS. A convenience sample of 63 Hispanic women completed both instruments in Spanish and English 2 weeks apart. Internal consistency reliability and stability were strong for both instruments (.85-.97). Reliability and validity evidence for the translated Spanish versions were strong and similar to the English version. Psychometric findings suggest that the newly translated Spanish versions are good representations of the English versions and that these newly translated instruments are ready for use.
Greenslade, Kathryn J; Coggins, Truman E
This study presents an independent replication and extension of psychometric evidence supporting the Theory of Mind Inventory (ToMI). Parents of 20 children with ASD (4; 1-6; 7 years; months) and 20 with typical development (3; 1-6; 5), rated their child's theory of mind abilities in everyday situations. Other parent report and child behavioral assessments included the Social Responsiveness Scale-2, Vineland Adaptive Behavior Scales-2, Peabody Picture Vocabulary Test-4, and Clinical Evaluation of Language Fundamentals-Preschool, 2. Results revealed high internal consistency, expected developmental changes in children with typical development, expected group differences between children with and without ASD, and strong correlations with other measures of social and communication abilities. The ToMI demonstrates strong psychometrics, suggesting considerable utility in identifying theory of mind deficits in children with ASD.
Lerdal, Anners; Moe, Britt; Digre, Elin; Harding, Thomas; Kristensen, Frode; Grov, Ellen K; Bakken, Linda N; Eklund, Marthe L; Ruud, Ireen; Rossi, Joseph S
Title Stages of Change – Continuous Measure (URICA-E2): psychometrics of a Norwegian version. Aim This paper is a report of research to translate the English version of the Stages of Change continuous measure questionnaire (URICA-E2) into Norwegian and to test the validity of the questionnaire and its usefulness in predicting behavioural change. Background While the psychometric properties of the Stages of Change categorical measure have been tested extensively, evaluation of the psychometric properties of the continuous questionnaire has not been described elsewhere in the literature. Method Cross-sectional data were collected with a convenience sample of 198 undergraduate nursing students in 2005 and 2006. The English version of URICA-E2 was translated into Norwegian according to standardized procedures. Findings Principal components analysis clearly confirmed five of the dimensions of readiness to change (Precontemplation Non-Believers, Precontemplation Believers, Contemplation, Preparation and Maintenance), while the sixth dimension, Action, showed the lowest Eigenvalue (0·93). Findings from the cluster analysis indicate distinct profiles among the respondents in terms of readiness to change their exercise behaviour. Conclusion The URICA-E2 was for the most part replicated from Reed’s original work. The result of the cluster analysis of the items associated with the factor ‘Action’ suggests that these do not adequately measure the factor. PMID:19032513
Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael
This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.
Full Text Available Objective. The Geriatric Depression Scale (GDS is an evaluation tool to diagnose older adult’s depression. This questionnaire was defined by Yesavage and Brink in 1982; it was designed expressly for the older person and defines his/her degree of satisfaction, quality of life, and feelings. The objective of this study is to evaluate the psychometric properties of the Italian translation of the Geriatric Depression Scale (GDS-IT. Methods. The Italian version of the Geriatric Depression Scale was administered to 119 people (79 people with a depression diagnosis and 40 healthy ones. We examined the following psychometric characteristics: internal consistency reliability, test-retest reliability, concurrent validity, and construct validity (factor structure. Results. Cronbach’s Alpha for the GDS-IT administered to the depressed sample was 0.84. Test-retest reliability was 0.91 and the concurrent validity was 0.83. The factorial analysis showed a structure of 5 factors, and the scale cut-off is between 10 and 11. Conclusion. The GDS-IT proved to be a reliable and valid questionnaire for the evaluation of depression in an Italian population. In the present study, the GDS-IT showed good psychometric properties. Health professionals now have an assessment tool for the evaluation of depression symptoms in the Italian population.
Neumann, Guenter; Schaadt, Anna-Katharina; Reinhart, Stefan; Kerkhoff, Georg
Cerebral vision disorders (CVDs) are frequent after brain damage and impair the patient's outcome. Yet clinically and psychometrically validated procedures for the anamnesis of CVD are lacking. To evaluate the clinical validity and psychometric qualities of the Cerebral Vision Screening Questionnaire (CVSQ) for the anamnesis of CVD in individuals poststroke. Analysis of the patients' subjective visual complaints in the 10-item CVSQ in relation to objective visual perimetry, tests of reading, visual scanning, visual acuity, spatial contrast sensitivity, light/dark adaptation, and visual depth judgments. Psychometric analyses of concurrent validity, specificity, sensitivity, positive/negative predictive value, and interrater reliability were also done. Four hundred sixty-one patients with unilateral (39.5% left, 47.5% right) or bilateral stroke (13.0%) were included. Most patients were assessed in the chronic stage, on average 36.7 (range = 1-620) weeks poststroke. The majority of all patients (96.4%) recognized their visual symptoms within 1 week poststroke when asked for specifically. Mean concurrent validity of the CVSQ with objective tests was 0.64 (0.54-0.79, P reliability was 0.76 for a 1-week interval between both assessments (all P guides the clinician in the selection of necessary assessments and appropriate neurovisual therapies for the patient. © The Author(s) 2015.
Karin, A; Hannesdottir, K; Jaeger, J; Annas, P; Segerdahl, M; Karlsson, P; Sjögren, N; von Rosen, T; Miller, F
To conduct a psychometric analysis to determine the adequacy of instruments that measure cognition in Alzheimer's disease trials. Both the Alzheimer's Disease Assessment Scale - Cognition (ADAS-Cog) and the Neuropsychological Test Battery (NTB) are validated outcome measures for clinical trials in Alzheimer's disease and are approved also for regulatory purposes. However, it is not clear how comparable they are in measuring cognitive function. In fact, many recent trials in Alzheimer's disease patients have failed and it has been questioned if ADAS-Cog still is a sensitive measure. The present paper examines the psychometric properties of ADAS-Cog and NTB, based on a post hoc analysis of data from a clinical trial (NCT01024660), which was conducted by AstraZeneca, in mild-to-moderate Alzheimer's disease (AD) patients, with a Mini Mental State Examination (MMSE) Total score 16-24. Acceptability, reliability, different types of validity and ability to detect change were assessed using relevant statistical methods. Total scores of both tests, as well as separate domains of both tests, including the Wechsler Memory Scale (WMS), Rey Auditory Verbal Learning Test (RAVLT) and Delis-Kaplan Executive Function System (D-KEFS) Verbal Fluency Condition, were analyzed. Overall, NTB performed well, with acceptable reliability and ability to detect change, while ADAS-Cog had insufficient psychometric properties, including ceiling effects in 8 out of a total of 11 ADAS-Cog items in mild AD patients, as well as low test-retest reliability in some of the items. Based on a direct comparison on the same patient sample, we see advantages of the NTB compared with the ADAS-Cog for the evaluation of cognitive function in the population of mild-to-moderate AD patients. The results suggest that not all of ADAS-Cog items are relevant for both mild and moderate AD population. This validation study demonstrates satisfactory psychometric properties of the NTB, while ADAS-Cog was found to be
Blanchard, E B; Arena, J G; Pallmeyer, T P
Four studies were conducted on a sample of 230 undergraduates to determine the psychometric properties of a measure of alexithymia, the Schalling-Sifneos Scale. In the first study it was found that scores on the scale are approximately normally distributed for each sex with 8.2% of males and 1.8% of females in the alexithymia range. In the second study a factor analysis of the scale revealed three distinct factors: (1) 'difficulty in expression of feelings'; (2) 'the importance of feelings especially about people'; (3) 'day-dreaming or introspection'. In the second factor analytic study, scores from several standard psychological tests on the same subjects were introduced with the scale items. Two factors in this analysis were comprised almost entirely of the other test scores: a 'general psychological distress factor' and a 'concerns about physical symptoms factor'. The other two factors were similar to factors 1 and 2 above in terms of items. The Rathus Assertiveness Scale loaded positively on the equivalent of factor 1. In the lst study, it was shown that Schalling-Sifneos Scale score is relatively orthogonal to other psychological tests with the exception of a Psychosomatic Symptom Checklist and thus is measuring something other than depression, anxiety, etc.
Dikken, Jeroen; Hoogerduijn, Jita G; Kruitwagen, Cas; Schuurmans, Marieke J
To assess the content validity and psychometric characteristics of the Knowledge about Older Patients Quiz (KOP-Q), which measures nurses' knowledge regarding older hospitalized adults and their certainty regarding this knowledge. Cross-sectional. Content validity: general hospitals. Psychometric characteristics: nursing school and general hospitals in the Netherlands. Content validity: 12 nurse specialists in geriatrics. Psychometric characteristics: 107 first-year and 78 final-year bachelor of nursing students, 148 registered nurses, and 20 nurse specialists in geriatrics. Content validity: The nurse specialists rated each item of the initial KOP-Q (52 items) on relevance. Ratings were used to calculate Item-Content Validity Index and average Scale-Content Validity Index (S-CVI/ave) scores. Items with insufficient content validity were removed. Psychometric characteristics: Ratings of students, nurses, and nurse specialists were used to test for different item functioning (DIF) and unidimensionality before item characteristics (discrimination and difficulty) were examined using Item Response Theory. Finally, norm references were calculated and nomological validity was assessed. Content validity: Forty-three items remained after assessing content validity (S-CVI/ave = 0.90). Psychometric characteristics: Of the 43 items, two demonstrating ceiling effects and 11 distorting ability estimates (DIF) were subsequently excluded. Item characteristics were assessed for the remaining 30 items, all of which demonstrated good discrimination and difficulty parameters. Knowledge was positively correlated with certainty about this knowledge. The final 30-item KOP-Q is a valid, psychometrically sound, comprehensive instrument that can be used to assess the knowledge of nursing students, hospital nurses, and nurse specialists in geriatrics regarding older hospitalized adults. It can identify knowledge and certainty deficits for research purposes or serve as a tool in educational
Bech, P; Bech, P
OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety, antidepress......OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety...... psychometrics in psychiatry have been found to cover a pharmacopsychometric triangle illustrating the measurements of wanted and unwanted effects of pharmacotherapeutic drugs as well as health-related quality of life....
Full Text Available This study describes the psychometric properties of the Children's Separation Anxiety Scale (CSAS, which assesses separation anxiety symptoms in childhood. Participants in Study 1 were 1,908 schoolchildren aged between 8 and 11. Exploratory factor analysis identified four factors: worry about separation, distress from separation, opposition to separation, and calm at separation, which explained 46.91% of the variance. In Study 2, 6,016 children aged 8-11 participated. The factor model in Study 1 was validated by confirmatory factor analysis. The internal consistency (α = 0.82 and temporal stability (r = 0.83 of the instrument were good. The convergent and discriminant validity were evaluated by means of correlations with other measures of separation anxiety, childhood anxiety, depression and anger. Sensitivity of the scale was 85% and its specificity, 95%. The results support the reliability and validity of the CSAS.
Pereira, Jorge A; Silva, Pedro; Matos, Teresa J S; Patarata, Luís
Morcela de Arroz (MA) is a ready-to-eat blood and rice cooked sausage produced with pork, blood, rice, and seasonings, stuffed in natural casing and cooked above 90 °C/30 min. It is commercialized whole, not packed, with a restricted shelf life (1 wk/0 to 5 °C). The objective of this work was to establish sliced MA shelf life considering both the behavior of L. monocytogenes through a microbiological challenge test (MCT) and the consumer acceptability of MA stored: vacuum packed (VP), modified atmosphere packed (MAP: 80% CO2/20% N2 ), and aerobic packed (AP). The MCT was conducted inoculating ±3 log CFU/g of L. monocytogenes cell suspension on the MA slices. Packaged samples were stored at 3 ± 1 °C and 7 ± 1 °C until 20 d. At 3 ± 1 °C, L. monocytogenes behavior was not affected by packaging or storage time. At 7 ± 1 °C, the pathogen increased nearly 1 log CFU/g in the first 4 d. L. monocytogenes populations in AP were higher (P < 0.05) than in MAP. The pathogen may grow to hazardous levels in the 1st days if a temperature abuse occurs. Considering the acceptability by the consumers, the shelf life of MA stored at 3 ± 1 °C was 4.4 d for AP, 8.1 d for VP, and 10.4 d for MAP. The sensory shelf life established based on sensory spoilage is shorter than the shelf life to maintain the population of L. monocytogenes in safe levels. © 2015 Institute of Food Technologists®
Dhir, Amandeep; Chen, Sufen; Nieminen, Marko
The past few years have witnessed great developments in Internet infrastructure, which have led to increased Internet usage among people of various age groups. However, at the same time, there have been some negative implications associated with increased Internet usage for some individuals. "Internet addiction" (IA) is one such negative…
paradigm shift and suggest that qualitative research designs might be used more ... remained on quantitative methods and the meaning behind the concepts is often ... the guideline in practice, a list of common errors, and a set of references for ...
ZHU, YP; WOERDENBAG, HJ
Herbal medicine, acupuncture and moxibustion, and massage and the three major constituent parts of traditional Chinese medicine. Although acupuncture is well known in many Western countries, Chinese herbal medicine, the mos important part of traditional Chinese medicine, is less well known in the
Jorissen, A.J.M.; Hamer, den J.; Leijten, A.J.M.; Salenikovich, A.
Due to new possibilities traditional timber framing has become increasingly popular since the beginning of the 21e century. Although traditional timber framing has been used for centuries, the expected mechanical behaviour is not dealt with in great detail in building codes, guidelines or text
Potkin, Steven G; Bugarski-Kirola, Dragana; Edgar, Chris J; Soliman, Sherif; Le Scouiller, Stephanie; Kunovac, Jelena; Miguel Velasco, Eugenio; Garibaldi, George M
Unemployment can negatively impact quality of life among patients with schizophrenia. Employment status depends on ability, opportunity, education, and cultural influences. A clinician-rated scale of work readiness, independent of current work status, can be a valuable assessment tool. A series of studies were conducted to create and validate a Work Readiness Questionnaire (WoRQ) for clinicians to assess patient ability to engage in socially useful activity, independent of work availability. Content validity, test-retest and inter-rater reliability, and construct validity were evaluated in three separate studies. Content validity was supported. Cronbach's α was 0.91, in the excellent range. Clinicians endorsed WoRQ concepts, including treatment adherence, physical appearance, social competence, and symptom control. The final readiness decision showed good test-retest reliability and moderate inter-rater reliability. Work readiness was associated with higher function and lower levels of negative symptoms. Low positive and high negative predictive values confirmed the concept validity. The WoRQ has suitable psychometric properties for use in a clinical trial for patients with a broad range of symptom severity. The scale may be applicable to assess therapeutic interventions. It is not intended to assess eligibility for supported work interventions. The WoRQ is suitable for use in schizophrenia clinical trials to assess patient work functional potential.
Guadagnin, Simone C; Nakano, Eduardo Y; Dutra, Eliane S; de Carvalho, Kênia M B; Ito, Marina K
Workplace dietary intervention studies in low- and middle-income countries using psychometrically sound measures are scarce. This study aimed to validate a nutrition knowledge questionnaire (NQ) and its utility in evaluating the changes in knowledge among participants of a Nutrition Education Program (NEP) conducted at the workplace. A NQ was tested for construct validity, internal consistency and discriminant validity. It was applied in a NEP conducted at six workplaces, in order to evaluate the effect of an interactive or a lecture-based education programme on nutrition knowledge. Four knowledge domains comprising twenty-three items were extracted in the final version of the NQ. Internal consistency of each domain was significant, with Kuder-Richardson formula values>0·60. These four domains presented a good fit in the confirmatory factor analysis. In the discriminant validity test, both the Expert and Lay groups scored>0·52, but the Expert group scores were significantly higher than those of the Lay group in all domains. When the NQ was applied in the NEP, the overall questionnaire scores increased significantly because of the NEP intervention, in both groups (Pnutrition knowledge among participants of NEP at the workplace. According to the NQ, an interactive nutrition education had a higher impact on nutrition knowledge than a lecture programme.
Full Text Available Health literacy refers to personal competencies for the access to, understanding of, appraisal of and application of health information in order to make sound decisions in everyday life. The aim of this study was to develop and evaluate the psychometric properties of an instrument for the measurement of health literacy among adolescents (the Health Literacy Measure for Adolescents-HELMA.This study was made up of two phases, qualitative and quantitative, which were carried out in 2012-2014 in Tehran, Iran. In the qualitative part of the study, in-depth interviews with 67 adolescents aged 15-18 were carried out in 4 high schools to generate the initial item pool for the survey. The content validity of the items was then assessed by an expert panel review (n = 13 and face validity was assessed by interviewing adolescents (n = 16. In the quantitative part of the study, in order to describe the psychometric properties of the scale, validity, reliability (internal consistency and test-retest and factor analysis were assessed.An item pool made up of 104 items was generated at the qualitative stage. After content validity was considered, this decreased to 47 items. In the quantitative stage, 582 adolescents aged 15-18 participated in the study with a mean age of 16.2 years. 51.2% of participants were females. In principal component factor analysis, 8 factors were loaded, which accounted for 53.37% of the variance observed. Reliability has been approved by α = 0.93 and the test-retest of the scale at two-week intervals indicated an appropriate stability for the scale (ICC = 0.93. The final questionnaire was approved with 44 items split into eight sections. The sections were titled: gain access to, reading, understanding, appraise, use, communication, self-efficacy and numeracy.The Health Literacy Measure for Adolescents (HELMA is a valid and reliable tool for the measurement of the health literacy of adolescents aged 15-18 and can be used to evaluate
Pittman, Joyce; Bakas, Tamilyn; Ellett, Marsha; Sloan, Rebecca; Rawl, Susan M
The purpose of this study was to evaluate the psychometric properties of a new instrument to measure incidence and severity of ostomy complications early in the postoperative period. 71 participants were enrolled, most were men (52%), white (96%), and married or partnered (55%). The mean age of participants was 57 ± 15.09 years (mean ± SD). Fifty-two participants (84%) experienced at least 1 ostomy complication in the 60-day postoperative period. The research setting was 3 acute care settings within a large healthcare system in the Midwestern United States. We developed an evidence-based conceptual model to guide development and evaluation of a new instrument, the Pittman Ostomy Complication Severity Index (OCSI). The OCSI format includes Likert-like scale with 9 individual items scored 0 to 3 and a total score computed by summing the individual items. Higher scores indicate more severe ostomy complications. This study consisted of 2 phases: (1) an expert review, conducted to establish content validity; and (2) a prospective, longitudinal study design, to examine psychometric properties of the instrument. A convenience sample of 71 adult patients who underwent surgery to create a new fecal ostomy was recruited from 3 hospitals. Descriptive analyses, content validity indices, interrater reliability testing, and construct validity testing were employed. Common complications included leakage (60%), peristomal moisture-associated dermatitis (50%), stomal pain (42%), retraction (39%), and bleeding (32%). The OCSI demonstrated acceptable evidence of content validity index (CVI = 0.9) and interrater reliability for individual items (k = 0.71-1.0), as well as almost perfect agreement for total scores among raters (ICC = 0.991, P ≤ .001). Construct validity of the OCSI was supported by significant correlations among variables in the conceptual model (complications, risk factors, stoma care self-efficacy, and ostomy adjustment). OCSI demonstrated acceptable validity and
.... The more specific intent is to encourage reevaluation from a structured psychometric viewpoint. The end goal is to facilitate a uniformly higher standard of measurement quality in unidimensional scaling having complex scale step descriptors...
Bergmann Tiest, W.M.; Kappers, A.M.L.
This very brief report introduces a psychometric function, very suitable for psychophysical data that displays Weber-like behaviour, because it is antisymmetric on a logarithmic scale. © 2011 a Pion publication.
Wiberg, Marie; Culpepper, Steven; Douglas, Jeffrey; Wang, Wen-Chung
This proceedings volume compiles and expands on selected and peer reviewed presentations given at the 81st Annual Meeting of the Psychometric Society (IMPS), organized by the University of North Carolina at Greensboro, and held in Asheville, North Carolina, July 11th to 17th, 2016. IMPS is one of the largest international meetings focusing on quantitative measurement in psychology, education, and the social sciences, both in terms of participants and number of presentations. The meeting built on the Psychometric Society's mission to share quantitative methods relevant to psychology, addressing a diverse set of psychometric topics including item response theory, factor analysis, structural equation modeling, time series analysis, mediation analysis, cognitive diagnostic models, and multi-level models. Selected presenters were invited to revise and expand their contributions and to have them peer reviewed and published in this proceedings volume. Previous volumes to showcase work from the Psychometric Society�...
Matthews, Gerald; Reinerman-Jones, Lauren E; Barber, Daniel J; Abich, Julian
A study was run to test the sensitivity of multiple workload indices to the differing cognitive demands of four military monitoring task scenarios and to investigate relationships between indices. Various psychophysiological indices of mental workload exhibit sensitivity to task factors. However, the psychometric properties of multiple indices, including the extent to which they intercorrelate, have not been adequately investigated. One hundred fifty participants performed in four task scenarios based on a simulation of unmanned ground vehicle operation. Scenarios required threat detection and/or change detection. Both single- and dual-task scenarios were used. Workload metrics for each scenario were derived from the electroencephalogram (EEG), electrocardiogram, transcranial Doppler sonography, functional near infrared, and eye tracking. Subjective workload was also assessed. Several metrics showed sensitivity to the differing demands of the four scenarios. Eye fixation duration and the Task Load Index metric derived from EEG were diagnostic of single-versus dual-task performance. Several other metrics differentiated the two single tasks but were less effective in differentiating single- from dual-task performance. Psychometric analyses confirmed the reliability of individual metrics but failed to identify any general workload factor. An analysis of difference scores between low- and high-workload conditions suggested an effort factor defined by heart rate variability and frontal cortex oxygenation. General workload is not well defined psychometrically, although various individual metrics may satisfy conventional criteria for workload assessment. Practitioners should exercise caution in using multiple metrics that may not correspond well, especially at the level of the individual operator.
Caminha, Guilherme Pilla; Melo Junior, José Tavares de; Hopkins, Claire; Pizzichini, Emilio; Pizzichini, Marcia Margaret Menezes
Rhinosinusitis is a highly prevalent disease and a major cause of high medical costs. It has been proven to have an impact on the quality of life through generic health-related quality of life assessments. However, generic instruments may not be able to factor in the effects of interventions and treatments. SNOT-22 is a major disease-specific instrument to assess quality of life for patients with rhinosinusitis. Nevertheless, there is still no validated SNOT-22 version in our country. Cross-cultural adaptation of the SNOT-22 into Brazilian Portuguese and assessment of its psychometric properties. The Brazilian version of the SNOT-22 was developed according to international guidelines and was broken down into nine stages: 1) Preparation 2) Translation 3) Reconciliation 4) Back-translation 5) Comparison 6) Evaluation by the author of the SNOT-22 7) Revision by committee of experts 8) Cognitive debriefing 9) Final version. Second phase: prospective study consisting of a verification of the psychometric properties, by analyzing internal consistency and test-retest reliability. Cultural adaptation showed adequate understanding, acceptability and psychometric properties. We followed the recommended steps for the cultural adaptation of the SNOT-22 into Portuguese language, producing a tool for the assessment of patients with sinonasal disorders of clinical importance and for scientific studies.
Sköld, Annika; Janeslätt, Gunnel Kristina
Impaired ability to manage time has been shown in several diagnoses common in childhood. Impaired ability involves activities and participation domain (daily time management, DTM) and body function and structure domain (time-processing ability, TPA). DTM needs to be evaluated from an individual's own perspective. To date, there has been a lack of self-rating instruments for children that focus on DTM. The aim of this study is to describe psychometric properties of Time-S when used in children aged 10-17 years with a diagnosis of ADHD, Autism, CP or mild ID. Further, to test whether TPA correlates with self-rated DTM. Eighty-three children aged 10-17 years participated in the study. Rasch analysis was used to assess psychometric properties. Correlation analysis was performed between Time-S and a measure of TPA. The 21 items of the Time-S questionnaire fit into a unitary construct measuring self-perceived daily management of an individual's time. A non-significant, small correlation was found between TPA and DTM. The results indicate good psychometric properties for the questionnaire. The questionnaire is potentially useful in intervention planning and evaluation.
Waldréus, Nana; Jaarsma, Tiny; van der Wal, Martje Hl; Kato, Naoko P
Patients with heart failure can experience thirst distress. However, there is no instrument to measure this in patients with heart failure. The aim of the present study was to develop the Thirst Distress Scale for patients with Heart Failure (TDS-HF) and to evaluate psychometric properties of the scale. The TDS-HF was developed to measure thirst distress in patients with heart failure. Face and content validity was confirmed using expert panels including patients and healthcare professionals. Data on the TDS-HF was collected from patients with heart failure at outpatient heart failure clinics and hospitals in Sweden, the Netherlands and Japan. Psychometric properties were evaluated using data from 256 heart failure patients (age 72±11 years). Concurrent validity of the scale was assessed using a thirst intensity visual analogue scale. Patients did not have any difficulties answering the questions, and time taken to answer the questions was about five minutes. Factor analysis of the scale showed one factor. After psychometric testing, one item was deleted. For the eight item TDS-HF, a single factor explained 61% of the variance and Cronbach's alpha was 0.90. The eight item TDS-HF was significantly associated with the thirst intensity score ( r=0.55, pfailure.
McDermott, Orii; Orgeta, Vasiliki; Ridder, Hanne Mette; Orrell, Martin
Music in Dementia Assessment Scales (MiDAS), an observational outcome measure for music therapy with people with moderate to severe dementia, was developed from qualitative data of focus groups and interviews. Expert and peer consultations were conducted at each stage of the scale development to maximize its content validity. This study aimed to evaluate the psychometric properties of MiDAS. Care home residents with dementia attended weekly group music therapy for up to ten sessions. Music therapists and care home staff were requested to complete weekly MiDAS ratings. The Quality of Life Scale (QoL-AD) was completed at three time-points. A total of 629 (staff = 306, therapist = 323) MiDAS forms were completed. The statistical analysis revealed that MiDAS has high therapist inter-rater reliability, low staff inter-rater reliability, adequate staff test-retest reliability, adequate concurrent validity, and good construct validity. High factor loadings between the five MiDAS Visual Analogue Scale (VAS) items, levels of Interest, Response, Initiation, Involvement, and Enjoyment, were found. This study indicates that MiDAS has good psychometric properties despite the small sample size. Future research with a larger sample size could provide a more in-depth psychometric evaluation, including further exploration of the underlying factors. MiDAS provides a measure of engagement with musical experience and offers insight into who is likely to benefit on other outcomes such as quality of life or reduction in psychiatric symptoms.
Full Text Available Palmira Faraci,1 Michael Lock,2 Robert Wheeler2 1Faculty of Human and Social Sciences, University of Enna “Kore”, Enna, Italy; 2Formula 4 Leadership Limited, Nottingham, UK Abstract: This study aimed to validate the Italian version of the Leadership Judgement Indicator, an unconventional instrument devoted to measurement of leaders' judgments and preferred styles, ie, directive, consultative, consensual, or delegative, when dealing with a range of decision-making scenarios. After forward-translation and back-translation, its psychometric properties were estimated for 299 managers at various levels, who were asked to put themselves in the position of leader and to rate the appropriateness of certain ways of responding to challenge. Differences between several groups of managers, ranked in order of seniority, provided evidence for discriminant validity. Internal consistency was adequate. The findings show that the Italian adaptation of the Leadership Judgement Indicator has promising psychometric qualities, suggesting its suitability for use to improve outcomes in both organizational and selection settings. Keywords: Leadership Judgement Indicator, decision-making, situational test, scenarios, psychometric properties
Coluci, Marina Zambon Orpinelli; Alexandre, Neusa Maria Costa
The objectives of this study were to develop a questionnaire that evaluates the perception of nursing workers to job factors that may contribute to musculoskeletal symptoms, and to evaluate its psychometric properties. Internationally recommended methodology was followed: construction of domains, items and the instrument as a whole, content validity, and pre-test. Psychometric properties were evaluated among 370 nursing workers. Construct validity was analyzed by the factorial analysis, known-groups technique, and convergent validity. Reliability was assessed through internal consistency and stability. Results indicated satisfactory fit indices during confirmatory factor analysis, significant difference (p office workers, and moderate correlations between the new questionnaire and Numeric Pain Scale, SF-36 and WRFQ. Cronbach's alpha was close to 0.90 and ICC values ranged from 0.64 to 0.76. Therefore, results indicated that the new questionnaire had good psychometric properties for use in studies involving nursing workers. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Ge, Xiaohua; Zhang, Tingting; Zhou, Lingling
This study evaluated the psychometric properties of subjective sedation scales using one psychometric scoring system to identify the appropriate scale that is most suitable for clinical care practice. A number of published sedation assessment scales for paediatric patients are currently used to attempt to achieve a moderate depth of sedation to avoid the undesirable effects caused by over- or undersedation. However, there has been no systematic review of these scales. We searched the Cochrane Library, PubMed, EMBASE, the Cumulative Index to Nursing and Allied Health Literature, etc., to obtain relevant articles. The quality of the selected studies was evaluated according to the Consensus-based Standards for the Selection of Health Measurement Instruments checklist. Articles that had been published or were in press and discussed the psychometric properties of sedation scales were included. The population comprised critically ill infants and non-verbal children ranging in age from 0 to 18 years who underwent sedation in an intensive care unit. Data were independently extracted by two investigators using a standard data extraction checklist: 43 articles were included in this review, and 13 sedation scales were examined. The quality of the psychometric evidence for the Comfort Scale and Comfort Behaviour Scale was 'very good', with the Comfort Scale having a higher quality (total weighted scores, Comfort Scale = 17·3 and Comfort Behaviour Scale = 15·5). We suggest that the scales be systematically and comprehensively tested in terms of development method, reliability, validation, feasibility and correlation with clinical outcome. The Comfort Scale and Comfort Behaviour Scale are useful tools for measuring sedation in paediatric patients. Nursing staff should choose one subjective sedation scale that is suitable for assessing paediatric patients' depth of sedation. We recommend the Comfort Scale and Comfort Behaviour Scale as optimal choices if the clinical
Elholm, Bjarne; Larsen, Klaus; Hornnes, Nete
The study aimed to evaluate psychometrically a Danish translation of the Short Alcohol Withdrawal Scale (SAWS) in an outpatient setting in patients with Alcohol Dependence (AD) and Alcohol Withdrawal Symptoms/Syndrome (AWS).......The study aimed to evaluate psychometrically a Danish translation of the Short Alcohol Withdrawal Scale (SAWS) in an outpatient setting in patients with Alcohol Dependence (AD) and Alcohol Withdrawal Symptoms/Syndrome (AWS)....
Existing statistical tests for the fit of the Rasch model have been criticized, because they are only sensitive to specific violations of its assumptions. Contingency table methods using loglinear models have been used to test various psychometric models. In this paper, the assumptions of the Rasch
Hald, Søren Vester; Baker, Felicity A.; Ridder, Hanne Mette Ochsner
Primary objective: To evaluate the psychometric properties of two adapted versions of the interpersonal communication competence scale (ICCS) that were applied to people with acquired brain injury (ABI). Construct validity was tested for both new scales and a factor extraction was performed....... Participants with medium-to-severe ABI self-rated their interpersonal communication skills using the modified ICCS. Cronbach Alpha test was performed on both scales followed by a correlation analysis. Results: Seventeen participants with medium-to-severe ABI and staff and relatives (n¼37) were involved...... of the proxy-rating revealed six meaningful sub-groups of interpersonal communication competencies....
Full Text Available ′Omics′ developments in the form of genomics, proteomics and metabolomics have increased the impetus of traditional medicine research. Studies exploring the genomic, proteomic and metabolomic basis of human constitutional types based on Ayurveda and other systems of oriental medicine are becoming popular. Such studies remain important to developing better understanding of human variations and individual differences. Countries like India, Korea, China and Japan are investing in research on evidence-based traditional medicines and scientific validation of fundamental principles. This review provides an account of studies addressing relationships between traditional medicine and genomics.
Joshi, Kalpana; Ghodke, Yogita; Shintre, Pooja
'Omics' developments in the form of genomics, proteomics and metabolomics have increased the impetus of traditional medicine research. Studies exploring the genomic, proteomic and metabolomic basis of human constitutional types based on Ayurveda and other systems of oriental medicine are becoming popular. Such studies remain important to developing better understanding of human variations and individual differences. Countries like India, Korea, China and Japan are investing in research on evidence-based traditional medicines and scientific validation of fundamental principles. This review provides an account of studies addressing relationships between traditional medicine and genomics.
Wang, Tien-Ni; Liang, Kai-Jie; Liu, Yi-Chia; Shieh, Jeng-Yi; Chen, Hao-Ling
To examine the psychometric and clinimetric properties of the Melbourne Assessment 2 (MA2), an outcome measurement that is increasingly used in clinical studies. Psychometric and clinimetric study. Community. Seventeen children with cerebral palsy (CP) from 5 to 12 years were recruited for the estimation of the test-retest reliability and minimal detectable change (MDC). Thirty-five children with CP were recruited to receive an 8-week intensive neurorehabilitation intervention to estimate the validity, responsiveness, and minimal clinically important difference (MCID). Thirty-five children with CP received upper limb neurorehabilitation programs for 8 weeks. The MA2 and the criterion measures, including the Bruininks-Oseretsky Test of Motor Proficiency, 2nd edition (BOT-2), the Box and Blocks Test (BBT), and the Pediatric Motor Activity Log-Revised (PMAL-R), were evaluated at pretreatment and posttreatment. The MA2 has 4 subscales: range of motion, fluency, accuracy, and dexterity. The test-retest reliability of the MA2 is high (intraclass correlation coefficient, .92-.98). The significant relationships between the MA2 and BBT, BOT-2, and PMAL-R support its validity. The significance of paired t test results (PMA2. The MDC values of the 4 subscales of the MA2 are 2.85, 1.63, 1.97, and 1.84, respectively, and the suggested MCID values of these 4 subscales are 2.35, 3.20, 2.09, and 2.22, respectively, indicating the minimum scores of improvement to be interpreted as both statistically significant and clinically important. The study findings indicate that the MA2 has sound psychometric and clinimetric properties and is thus an adequate measurement for research and clinical applications. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Hill, Bridget; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea
To evaluate reproducibility (reliability and agreement) of the Brachial Assessment Tool (BrAT), a new patient-reported outcome measure for adults with traumatic brachial plexus injury (BPI). Prospective repeated-measure design. Outpatient clinics. Adults with confirmed traumatic BPI (N=43; age range, 19-82y). People with BPI completed the 31-item 4-response BrAT twice, 2 weeks apart. Results for the 3 subscales and summed score were compared at time 1 and time 2 to determine reliability, including systematic differences using paired t tests, test retest using intraclass correlation coefficient model 1,1 (ICC 1,1 ), and internal consistency using Cronbach α. Agreement parameters included standard error of measurement, minimal detectable change, and limits of agreement. BrAT. Test-retest reliability was excellent (ICC 1,1 =.90-.97). Internal consistency was high (Cronbach α=.90-.98). Measurement error was relatively low (standard error of measurement range, 3.1-8.8). A change of >4 for subscale 1, >6 for subscale 2, >4 for subscale 3, and >10 for the summed score is indicative of change over and above measurement error. Limits of agreement ranged from ±4.4 (subscale 3) to 11.61 (summed score). These findings support the use of the BrAT as a reproducible patient-reported outcome measure for adults with traumatic BPI with evidence of appropriate reliability and agreement for both individual and group comparisons. Further psychometric testing is required to establish the construct validity and responsiveness of the BrAT. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Lochhead, Lois E; MacMillan, Peter D
The Oswestry disability index (ODI) is the most widely used measure of perceived disability for low back conditions. It has been adopted without adaptation in functional capacity evaluation (FCE). Rigorous testing of the ODI with modern psychometric methods, in this setting, is warranted. To determine the psychometric properties of the ODI in FCE: unidimensionality; differential item functioning; item coverage and to identify poorly functioning items, allowing for improvement of these items and recalibration of the scale. Rasch analysis, specifically Masters' partial credit model, was conducted on data. 133 work-disabled individuals presenting for FCE in northern British Columbia, Canada. All items had one poorly functioning option. Items were rescaled from six categories to five, improving the psychometric properties of the ODI as a unidimensional (disability due to back pain) scale. Item difficulty range is sufficient for a population with mild to severe disability. Although two of the ten ODI items functioned marginally unsatisfactorily in the unrevised state, the 5-option revised ODI appears superior. Use in clinical settings across a broad spectrum of disability levels could help establish its psychometric properties. Health professionals should be aware that the ODI may perform differently depending on client population.
Moreira, Diana; Almeida, Fernando; Pinto, Marta; Segarra, Pilar; Barbosa, Fernando
The behavioral inhibition/behavioral activation (BIS/BAS) scales (Carver & White, 1994), which allow rating the Gray's motivational systems, were translated and adapted into Portuguese. In this study, the authors present the procedure and the psychometric analyses of the Portuguese version of the scales, which included basic item and scales psychometric characteristics, as well as confirmatory and exploratory factor analyses. After the psychometric analyses provided evidence for the quality of the Portuguese version of the scales, the normative data was provided by age and school grade. The confirmatory factor analysis of the BIS/BAS scales that the authors performed did not demonstrate satisfactory fit for the 2- or 4-factor solution. The authors also tested the more recent 5-factor model, but the fit indices remained inadequate. As fit indices were not satisfactory they proceeded with an exploratory factor analysis to examine the structure of the Portuguese scales. These psychometric analyses provided evidence of a successful translation of the original scales. Therefore these scales can now be used in future research with Portuguese or Brazilian population. (c) 2015 APA, all rights reserved.
Ringblom, Jenny; Wåhlin, Ingrid; Proczkowska, Marie
Emergence delirium and emergence agitation have been a subject of interest since the early 1960s. This behavior has been associated with increased risk of injury in children and dissatisfaction with anesthesia care in their parents. The Pediatric Anesthesia Emergence Delirium Scale is a commonly used instrument for codifying and recording this behavior. The aim of this study was to psychometrically evaluate the Pediatric Anesthesia Emergence Delirium scale, focusing on the factor structure, in a sample of children recovering from anesthesia after surgery or diagnostic procedures. The reliability of the Pediatric Anesthesia Emergence Delirium scale was also tested. One hundred and twenty-two children younger than seven years were observed at postoperative care units during recovery from anesthesia. Two or 3 observers independently assessed the children using the Pediatric Anesthesia Emergence Delirium scale. The factor analysis clearly revealed a one-factor solution, which accounted for 82% of the variation in the data. Internal consistency, calculated with Cronbach's alpha, was good (0.96). The Intraclass Correlation Coefficient, which was used to assess interrater reliability for the Pediatric Anesthesia Emergence Delirium scale sum score, was 0.97 (P Pediatric Anesthesia Emergence Delirium scale for assessing emergence delirium in children recovering from anesthesia after surgery or diagnostic procedures. The kappa statistics for the Pediatric Anesthesia Emergence Delirium scale items essentially indicated good agreement between independent raters, supporting interrater reliability. © 2018 John Wiley & Sons Ltd.
Endsley, Paige; Weobong, Benedict; Nadkarni, Abhijit
The Alcohol Use Disorders Identification Test (AUDIT) is a 10-item screening questionnaire used to detect alcohol use disorders. The AUDIT has been validated in only two studies in India and although it has been previously used in Goa, India, it has yet to be validated in that setting. In this paper, we aim to report data on the validity of the AUDIT for the screening of AUDs among men in Goa, India. Concurrent and convergent validity of the AUDIT were assessed against the Mini International Neuropsychiatric Interview (MINI) and World Health Organisation Disability Assessment Scale (WHODAS) for alcohol abuse, alcohol dependence, and functional status respectively through the secondary analysis of data from a community cohort of men from Goa, India. The AUDIT showed high internal reliability and acceptable criterion validity with adequate psychometric properties for the detection of alcohol abuse and dependence. However, all of the optimal cut-off points from ROC analyses were lower than the WHO recommended for identification of risk of all AUDs, with a score of 6-12 detecting alcohol abuse and 13 and higher alcohol dependence. In order to optimize the utility of the AUDIT, a lowered cut-off point for alcohol abuse and dependence is recommended for Goa, India. Further validation studies for the AUDIT should be conducted for continued validation of the tool in other parts of India. Copyright © 2017 The Authors. Published by Elsevier B.V. All rights reserved.
Recent public controversies, ranging from the 2014 Facebook 'emotional contagion' study to psychographic data profiling by Cambridge Analytica in the 2016 American presidential election, Brexit referendum and elsewhere, signal watershed moments in which the intersecting trajectories of psychology and computer science have become matters of public concern. The entangled history of these two fields grounds the application of applied psychological techniques to digital technologies, and an investment in applying calculability to human subjectivity. Today, a quantifiable psychological subject position has been translated, via 'big data' sets and algorithmic analysis, into a model subject amenable to classification through digital media platforms. I term this position the 'scalable subject', arguing it has been shaped and made legible by algorithmic psychometrics - a broad set of affordances in digital platforms shaped by psychology and the behavioral sciences. In describing the contours of this 'scalable subject', this paper highlights the urgent need for renewed attention from STS scholars on the psy sciences, and on a computational politics attentive to psychology, emotional expression, and sociality via digital media.
Rosneck, James S; Hughes, Joel; Gunstad, John; Josephson, Richard; Noe, Donald A; Waechter, Donna
This article describes the systematic construction and psychometric analysis of a knowledge assessment instrument for phase II cardiac rehabilitation (CR) patients measuring risk modification disease management knowledge and behavioral outcomes derived from national standards relevant to secondary prevention and management of cardiovascular disease. First, using adult curriculum based on disease-specific learning outcomes and competencies, a systematic test item development process was completed by clinical staff. Second, a panel of educational and clinical experts used an iterative process to identify test content domain and arrive at consensus in selecting items meeting criteria. Third, the resulting 31-question instrument, the Cardiac Knowledge Assessment Tool (CKAT), was piloted in CR patients to ensure use of application. Validity and reliability analyses were performed on 3638 adults before test administrations with additional focused analyses on 1999 individuals completing both pretreatment and posttreatment administrations within 6 months. Evidence of CKAT content validity was substantiated, with 85% agreement among content experts. Evidence of construct validity was demonstrated via factor analysis identifying key underlying factors. Estimates of internal consistency, for example, Cronbach's α = .852 and Spearman-Brown split-half reliability = 0.817 on pretesting, support test reliability. Item analysis, using point biserial correlation, measured relationships between performance on single items and total score (P knowledge instrument specifically designed for an adult CR population was systematically developed and tested in a large representative patient population, satisfying psychometric parameters, including validity and reliability.
McCauley, Rebecca J.; Strand, Edythe A.
Purpose: To review the content and psychometric characteristics of 6 published tests currently available to aid in the study, diagnosis, and treatment of motor speech disorders in children. Method: We compared the content of the 6 tests and critically evaluated the degree to which important psychometric characteristics support the tests' use for…
Akerman, Eva; Fridlund, Bengt; Samuelson, Karin; Baigi, Amir; Ersson, Anders
This is a further development of a specific questionnaire, the 3-set 4P, to be used for measuring former ICU patients' physical and psychosocial problems after intensive care and the need for follow-up. The aim was to psychometrically test and evaluate the 3-set 4P questionnaire in a larger population. The questionnaire consists of three sets: "physical", "psychosocial" and "follow-up". The questionnaires were sent by mail to all patients with more than 24-hour length of stay on four ICUs in Sweden. Construct validity was measured with exploratory factor analysis with Varimax rotation. This resulted in three factors for the "physical set", five factors for the "psychosocial set" and four factors for the "follow-up set" with strong factor loadings and a total explained variance of 62-77.5%. Thirteen questions in the SF-36 were used for concurrent validity showing Spearman's r(s) 0.3-0.6 in eight questions and less than 0.2 in five. Test-retest was used for stability reliability. In set follow-up the correlation was strong to moderate and in physical and psychosocial sets the correlations were moderate to fair. This may have been because the physical and psychosocial status changed rapidly during the test period. All three sets had good homogeneity. In conclusion, the 3-set 4P showed overall acceptable results, but it has to be further modified in different cultures before being considered a fully operational instrument for use in clinical practice. Copyright © 2012 Elsevier Ltd. All rights reserved.
Telch, Michael J; Pujols, Yasisca
Erectile dysfunction is a highly publicized and prevalent condition with marked adverse effects on men's social, emotional, and quality of life. Although several instruments have emerged for assessing erectile dysfunction and its impact on men's quality of life, none of the existing instruments provide a specific assessment of men's erectile performance anxiety. This article reports on the development and psychometric evaluation of the Erectile Performance Anxiety Index (EPAI)--a 10-item self-report scale designed to fill an important gap in the assessment of male erectile dysfunction. A total of 207 men ranging in age from 18 to 79 took part in the study. All subjects completed an online battery consisting of the EPAI, along with measures of related sexual functioning, social anxiety, state anxiety, and depressive symptoms. A small subset of study participants (N = 42) completed the EPAI a second time for determining test-retest reliability. Test-retest reliability was determined by Pearson's product-moment correlations. Internal reliability was assessed using Cronbach's alpha. Factor validity was evaluated by a maximum likelihood factor analysis with oblique rotation. Convergent and discriminant validity was assessed by comparing the strength of association between the EPAI and measures varying in their hypothesized shared variance with the construct of erectile performance anxiety. The EPAI demonstrated excellent internal consistency, with Cronbach's alpha = 0.93 and excellent test-retest reliability (r = 0.85) over an average period of 3.5 weeks. Results of an exploratory factor analysis revealed a one-factor solution that accounted for 63% of the total variance. Preliminary evidence supports the convergent and discriminant validity of the EPAI. Results support the use of the EPAI as a reliable, valid, and efficient instrument for the assessment of erectile performance anxiety. Potential research and clinical applications are discussed. © 2013 International
Full Text Available Clothing is a unique dressing style of a community, a period or a profession. In clothing there is social status and difference principle rather than fashion. In this context, the society created a clothing style in line with its own customs, traditions and social structure. One of the features separating societies from each other and indicating their cultural and social classes is the clothing style. As it is known, traditional Turkish clothes reflecting the characteristics of Turkish society is our most beautiful heritage from past to present. From this heritage there are several examples of women's clothes c arried to present. When these examples are examined, it is possible to see the taste, the way of understanding art, joy and the lifestyle of the history. These garments are also the documents outlining the taste and grace of Turkish people. In the present study, traditional Kastamonu women's clothing, that has an important place in traditional cultural clothes of Anatolia, is investigated . The method of the present research is primarily defined as the examination of the written sources. The study is complet ed with the observations and examinations made in Kastamonu. According to the findings of the study, traditional Kastamonu women's clothing are examined and adapted to todays’ clothing.
Xu, Yan; Wang, Dong; Fan, Wen Lai; Mu, Xiao Qing; Chen, Jian
The earliest industrial biotechnology originated in ancient China and developed into a vibrant industry in traditional Chinese liquor, rice wine, soy sauce, and vinegar. It is now a significant component of the Chinese economy valued annually at about 150 billion RMB. Although the production methods had existed and remained basically unchanged for centuries, modern developments in biotechnology and related fields in the last decades have greatly impacted on these industries and led to numerous technological innovations. In this chapter, the main biochemical processes and related technological innovations in traditional Chinese biotechnology are illustrated with recent advances in functional microbiology, microbial ecology, solid-state fermentation, enzymology, chemistry of impact flavor compounds, and improvements made to relevant traditional industrial facilities. Recent biotechnological advances in making Chinese liquor, rice wine, soy sauce, and vinegar are reviewed.
Silveira, Vladímir de Aquino; Souza, Givago da Silva; Gomes, Bruno Duarte; Rodrigues, Anderson Raiol; Silveira, Luiz Carlos de Lima
We used psychometric functions to estimate the joint entropy for space discrimination and spatial frequency discrimination. Space discrimination was taken as discrimination of spatial extent. Seven subjects were tested. Gábor functions comprising unidimensionalsinusoidal gratings (0.4, 2, and 10 cpd) and bidimensionalGaussian envelopes (1°) were used as reference stimuli. The experiment comprised the comparison between reference and test stimulithat differed in grating's spatial frequency or envelope's standard deviation. We tested 21 different envelope's standard deviations around the reference standard deviation to study spatial extent discrimination and 19 different grating's spatial frequencies around the reference spatial frequency to study spatial frequency discrimination. Two series of psychometric functions were obtained for 2%, 5%, 10%, and 100% stimulus contrast. The psychometric function data points for spatial extent discrimination or spatial frequency discrimination were fitted with Gaussian functions using the least square method, and the spatial extent and spatial frequency entropies were estimated from the standard deviation of these Gaussian functions. Then, joint entropy was obtained by multiplying the square root of space extent entropy times the spatial frequency entropy. We compared our results to the theoretical minimum for unidimensional Gábor functions, 1/4π or 0.0796. At low and intermediate spatial frequencies and high contrasts, joint entropy reached levels below the theoretical minimum, suggesting non-linear interactions between two or more visual mechanisms. We concluded that non-linear interactions of visual pathways, such as the M and P pathways, could explain joint entropy values below the theoretical minimum at low and intermediate spatial frequencies and high contrasts. These non-linear interactions might be at work at intermediate and high contrasts at all spatial frequencies once there was a substantial decrease in joint
Schwartz, Carolyn E; Michael, Wesley; Zhang, Jie; Rapkin, Bruce D; Sprangers, Mirjam A G
A growing body of research suggests that regularly engaging in stimulating activities across multiple domains-physical, cultural, intellectual, communal, and spiritual-builds resilience. This project investigated the psychometric characteristics of the DeltaQuest Reserve-Building Measure for use in prospective research. The study included Rare Patient Voice panel participants. The web-based survey included the Reserve-Building Measure with one-week re-test, measures of quality of life (QOL) and well-being (PROMIS General Health; NeuroQOL Cognitive Function and Positive Affect & Well-Being short-forms; Ryff Environmental Mastery subscale); and the Big Five Inventory-10 personality measure. Classical test theory and item response theory (IRT) analyses investigated psychometric characteristics of the Reserve-Building Measure. This North American sample (n = 592) included both patients and caregivers [mean age = 44, SD 19)]. Psychometric analyses revealed distinct subscales measuring current reserve-building activities (Active in the World, Games, Outdoors, Creative, Religious/Spiritual, Exercise, Inner Life, Shopping/Cooking, Passive Media Consumption,), past reserve-building activities (Childhood Activities, Achievement), and reserve-related person-factors (Perseverance, Current and Past Social Support, and Work Value). Test-retest stability (n = 101) was moderately high for 11 of 15 subscales (ICC range 0.78-0.99); four were below 0.59 indicating a need for further refinement. IRT analyses supported the item functioning of all subscales. Correlational analyses suggest the measure's subscales tap distinct constructs (range r = 0.11-0.46) which are not redundant with QOL, well-being, or personality (range r = 0.11-0.48). The Reserve-Building Measure provides a measure of activities and person-factors related to reserve that may potentially be useful in prospective research.
McCreary, Linda L.; Conrad, Karen M.; Conrad, Kendon J.; Scott, Christy K; Funk, Rodney R.; Dennis, Michael L.
Background Valid assessment of family functioning can play a vital role in optimizing client outcomes. Because family functioning is influenced by family structure, socioeconomic context, and culture, existing measures of family functioning--primarily developed with nuclear, middle class European American families--may not be valid assessments of families in diverse populations. The Family Effectiveness Measure was developed to address this limitation. Objectives To test the Family Effectiveness Measure with data from a primarily low-income African American convenience sample, using the Rasch measurement model. Method A sample of 607 adult women completed the measure. Rasch analysis was used to assess unidimensionality, response category functioning, item fit, person reliability, differential item functioning by race and parental status, and item hierarchy. Criterion-related validity was tested using correlations with five other variables related to family functioning. Results The Family Effectiveness Measure measures two separate constructs: The effective family functioning construct was a psychometrically sound measure of the target construct that was more efficient due to the deletion of 22 items. The ineffective family functioning construct consisted of 16 of those deleted items but was not as strong psychometrically. Items in both constructs evidenced no differential item functioning by race. Criterion-related validity was supported for both. Discussion In contrast to the prevailing conceptualization that family functioning is a single construct, assessed by positively and negatively worded items, use of the Rasch analysis suggested the existence of two constructs. While the effective family functioning is a strong and efficient measure of family functioning, the ineffective family functioning will require additional item development and psychometric testing. PMID:23636342
de Brouwer, Brigitte Johanna Maria; Kaljouw, Marian J; Schoonhoven, Lisette; van Achterberg, Theo
To develop and psychometrically test the Essentials of Magnetism II in nursing homes. Increasing numbers and complex needs of older people in nursing homes strain the nursing workforce. Fewer adequately trained staff and increased care complexity raise concerns about declining quality. Nurses' practice environment has been reported to affect quality of care and productivity. The Essentials of Magnetism II © measures processes and relationships of practice environments that contribute to productivity and quality of care and can therefore be useful in identifying processes requiring change to pursue excellent practice environments. However, this instrument was not explicitly evaluated for its use in nursing home settings so far. In a preparatory phase, a cross-sectional survey study focused on face validity of the essentials of magnetism in nursing homes. A second cross-sectional survey design was then used to further test the instrument's validity and reliability. Psychometric testing included evaluation of content and construct validity, and reliability. Nurses (N = 456) working at 44 units of three nursing homes were included. Respondent acceptance, relevance and clarity were adequate. Five of the eight subscales and 54 of the 58 items did meet preset psychometric criteria. All essentials of magnetism are considered relevant for nursing homes. The subscales Adequacy of Staffing, Clinically Competent Peers, Patient Centered Culture, Autonomy and Nurse Manager Support can be used in nursing homes without problems. The other subscales cannot be directly applied to this setting. The valid subscales of the Essentials of Magnetism II instrument can be used to design excellent nursing practice environments that support nurses' delivery of care. Before using the entire instrument, however, the other subscales have to be improved. © 2016 John Wiley & Sons Ltd.
Edward F. Millen
The study of traditional food and healthy eating habits has been one of the fast growing areas. All humans, both men and women, require food for their survival. However, both men and women indulge in food as if it were their sole purpose of existence. Hence, eating disorders are common among men and women. Then media has played an effective role not only in establishing faulty standards for traditional healthy food but also it has highlighted the importance of healthy eating. It has brought t...
Antifungal Susceptibility Testing of Aspergillus spp. by Using a Composite Correlation Index (CCI)-Based Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry Method Appears To Not Offer Benefit over Traditional Broth Microdilution Testing.
Gitman, Melissa R; McTaggart, Lisa; Spinato, Joanna; Poopalarajah, Rahgavi; Lister, Erin; Husain, Shahid; Kus, Julianne V
Aspergillus spp. cause serious invasive lung infections, and Aspergillus fumigatus is the most commonly encountered clinically significant species. Voriconazole is considered to be the drug of choice for treating A. fumigatus infections; however, rising resistance rates have been reported. We evaluated a matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS)-based method for the differentiation between wild-type and non-wild-type isolates of 20 Aspergillus spp. (including 2 isolates of Aspergillus ustus and 1 of Aspergillus calidoustus that were used as controls due their intrinsic low azole susceptibility with respect to the in vitro response to voriconazole). At 30 and 48 h of incubation, there was complete agreement between Cyp51A sequence analysis, broth microdilution, and MALDI-TOF MS classification of isolates as wild type or non-wild type. In this proof-of-concept study, we demonstrated that MALDI-TOF MS can be used to accurately detect A. fumigatus strains with reduced voriconazole susceptibility. However, rather than proving to be a rapid and simple method for antifungal susceptibility testing, this particular MS-based method showed no benefit over conventional testing methods. © Crown copyright 2017.
Antifungal Susceptibility Testing of Aspergillus spp. by Using a Composite Correlation Index (CCI)-Based Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry Method Appears To Not Offer Benefit over Traditional Broth Microdilution Testing
Gitman, Melissa R.; McTaggart, Lisa; Spinato, Joanna; Poopalarajah, Rahgavi; Lister, Erin; Husain, Shahid
ABSTRACT Aspergillus spp. cause serious invasive lung infections, and Aspergillus fumigatus is the most commonly encountered clinically significant species. Voriconazole is considered to be the drug of choice for treating A. fumigatus infections; however, rising resistance rates have been reported. We evaluated a matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS)-based method for the differentiation between wild-type and non-wild-type isolates of 20 Aspergillus spp. (including 2 isolates of Aspergillus ustus and 1 of Aspergillus calidoustus that were used as controls due their intrinsic low azole susceptibility with respect to the in vitro response to voriconazole). At 30 and 48 h of incubation, there was complete agreement between Cyp51A sequence analysis, broth microdilution, and MALDI-TOF MS classification of isolates as wild type or non-wild type. In this proof-of-concept study, we demonstrated that MALDI-TOF MS can be used to accurately detect A. fumigatus strains with reduced voriconazole susceptibility. However, rather than proving to be a rapid and simple method for antifungal susceptibility testing, this particular MS-based method showed no benefit over conventional testing methods. PMID:28404678
Carvalho, Lucas de Francisco; Pianowski, Giselle; Silveira, Fernando José; Bacciotti, Jonatha Tiago; Vieira, Philipe Gomes
Abstract We aimed to review of the Eccentricity dimension of the Dimensional Clinical Personality Inventory (IDCP), through two steps. The first one focused on developing new items and the second on testing the psychometric properties in a sample of 225 subjects (70.1% females), aging between 18 and 66 years, mostly undergraduate students (58.9%). The subjects answered the IDCP, and the Brazilian versions of the NEO-PI-R, PID-5 and MIS. The first step resulted in 42 items, which 22 were new. ...
Zarem, Cori; Kidokoro, Hiroyuki; Neil, Jeffrey; Wallendorf, Michael; Inder, Terrie; Pineda, Roberta
To establish the psychometrics of the Neonatal Oral Motor Assessment Scale (NOMAS). In this prospective cohort study of 75 preterm infants (39 females, 36 males) born at or before 30 weeks gestation (mean gestational age 26.56 wks, SD 1.90, range 23-30 wks; mean birthweight 967.33 g, SD 288.54, range 480-2240), oral feeding was videotaped before discharge from the neonatal intensive care unit (NICU). The NOMAS was used to classify feeding as normal, disorganized, or dysfunctional. Neurobehavior was assessed at term equivalent, and infants underwent magnetic resonance imaging. Children returned for developmental testing at 2 years corrected age. Associations between NOMAS scores and (1) neurobehavior; (2) cerebral injury and metrics; and (3) developmental outcome were investigated using χ(2) -analyses, t-tests, and linear regression. For reliability, six certified NOMAS evaluators rated five randomly selected NOMAS recordings and re-scored them 2 weeks later in a second randomized order. Reliability was calculated with Cohen's kappa statistics. Dysfunctional NOMAS scores were associated with lower Dubowitz scores [t=-2.14; mean difference -2.32 (95% confidence interval [CI] -0.157 to -4.49); p=0.036], higher stress on the NICU Network Neurobehavioral Scale (t=2.61; mean difference 0.073 [95% CI 0.017-0.129]; p=0.0110), and decreased transcerebellar diameter (t=-2.22; mean difference -2.04 [CI=-3.89 to -0.203]; p=0.03). No significant associations were found between NOMAS scores and 2-year outcome. Some concurrent validity was established with associations between NOMAS scores and measures of infant behavior and cerebral structure. The NOMAS did not show predictive validity in this study of preterm infants at high risk of developmental delay. Reliability was variable and suboptimal. © 2013 Mac Keith Press.
Full Text Available Chinese noodles originated in the Han dynasty, which has more than 4,000 years of history. There are many stories about the origin of noodles. To a certain extent, noodles also reflect the cultural traditions and customs of China, which essentially means “human nature” and “worldly common sense”. There are thousands of varieties of noodles in China, according to the classification of the shape of noodles, seasoning gravy, cooking craft, and so on. Many noodles have local characteristics. Noodles are accepted by people from all over the world. The industrial revolution and the development of the food industry realized the transition from a traditional handicraft industry to mass production using machinery. In addition, the invention of instant noodles and their mass production also greatly changed the noodle industry. In essence, noodles are a kind of cereal food, which is the main body of the traditional Chinese diet. It is the main source of energy for Chinese people and the most economical energy food. Adhering to the principle of “making cereal food the main food”, is to maintain our Chinese good diet tradition, which can avoid the disadvantages of a high energy, high fat, and low carbohydrate diet, and promote health. The importance of the status of noodles in the dietary structure of residents in our country and the health impact should not be ignored.
Hendrix, Janey B.
A collection for children and teachers of traditional Cherokee recipes emphasizes the art, rather than the science, of cooking. The hand-printed, illustrated format is designed to communicate the feeling of Cherokee history and culture and to encourage readers to collect and add family recipes. The cookbook could be used as a starting point for…
This article discusses traditional methods, such as the grammar-translation, and modern methods, the communicative approach, for teaching English-as-a-foreign-language in China. The relationship between linguistic accuracy and communicative competence, student-centered orientation, and the role of the teacher are highlighted. (Author/VWL)
This article presents a recipe for non-traditional wraps. In this article, the author describes how adults and children can help with the recipe and the skills involved with this recipe. The bigger role that children can play in the making of the item the more they are apt to try new things and appreciate the texture and taste.
In this podcast, a Latina nutrition educator shows how a community worked with local farmers to grow produce traditionally enjoyed by Hispanic/Latinos. Created: 11/1/2007 by National Diabetes Education Program (NDEP), a joint program of the Centers for Disease Control and Prevention and the National Institutes of Health. Date Released: 11/10/2007.