Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako
The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.
Background The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. Methods We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson’s correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach’s alpha coefficient; the intraclass (ICC) and Pearson’s correlation coefficients were used for test-retest reliability. Results One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson’s correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson’s correlation coefficient was 0.89 and ICC was 0.90. Conclusions The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality. PMID:23327598
Anderson Kathryn L
Full Text Available Abstract The Quality of Life Scale (QOLS, created originally by American psychologist John Flanagan in the 1970's, has been adapted for use in chronic illness groups. This paper reviews the development and psychometric testing of the QOLS. A descriptive review of the published literature was undertaken and findings summarized in the frequently asked questions format. Reliability, content and construct validity testing has been performed on the QOLS and a number of translations have been made. The QOLS has low to moderate correlations with physical health status and disease measures. However, content validity analysis indicates that the instrument measures domains that diverse patient groups with chronic illness define as quality of life. The QOLS is a valid instrument for measuring quality of life across patient groups and cultures and is conceptually distinct from health status or other causal indicators of quality of life.
Watt, Torquil; Hegedus, Laszlo; Grønvold, Mogens
Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test...
Watt, Torquil; Hegedüs, Laszlo; Groenvold, Mogens
Background Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated. Aim To investigate clinical ('known-groups') validity and test-retest reliability of the Danish version of the ThyPRO. Methods For each of the 13 ThyPRO scales, we defined groups expected to have high versus low scores ('known-groups'). The clinical validity (known......-groups validity) was evaluated by whether the ThyPRO scales could detect expected differences in a cross-sectional study of 907 thyroid patients. Test-retest reliability was evaluated by intra-class correlations of two responses to the ThyPRO 2 weeks apart in a subsample of 87 stable patients. Results On all 13...
Abay, Halime; Kaplan, Sena
There are a limited number of menopause-specific quality-of-life scales for the Turkish population. This study was conducted to evaluate the validity and reliability of the Turkish Utian Quality-of-Life Scale in postmenopausal women. The study group was comprised of 250 postmenopausal women who applied to a training and research hospital's menopause clinic in Turkey. A survey form and the Turkish Utian quality-of-Life Scale were used to collect data, and the Turkish version of Short Form-36 was used to evaluate reliability with an equivalent form. Language-validity, content-validity, and construct-validity methods were used to assess the validity of the scale, and Cronbach's α coefficient calculation and the equivalent-form reliability methods were used to assess the reliability of the scale. The Turkish Utian Quality-of-Life Scale was determined to be a valid and reliable instrument for measuring the quality of life of postmenopausal women. Confirmatory factor analysis demonstrates that the instrument fits well with 23 items and a four-factor model. The Cronbach's α coefficient for the quality-of-life domains were as follows: 0.88 overall, 0.79 health, 0.78 emotional, 0.76 sexual, and 0.75 occupational. Reliability of the instrument was confirmed through significant correlations between scores on the Turkish version of the Utian Quality-of-Life Scale and the Turkish version of the Short Form-36 (r = 0.745, P measuring quality of life during menopause.
Dawson, Andreas; Raphael, Karen G; Glaros, Alan; Axelsson, Susanna; Arima, Taro; Ernberg, Malin; Farella, Mauro; Lobbezoo, Frank; Manfredini, Daniele; Michelotti, Ambra; Svensson, Peter; List, Thomas
To combine empirical evidence and expert opinion in a formal consensus method in order to develop a quality-assessment tool for experimental bruxism studies in systematic reviews. Tool development comprised five steps: (1) preliminary decisions, (2) item generation, (3) face-validity assessment, (4) reliability and discriminitive validity assessment, and (5) instrument refinement. The kappa value and phi-coefficient were calculated to assess inter-observer reliability and discriminative ability, respectively. Following preliminary decisions and a literature review, a list of 52 items to be considered for inclusion in the tool was compiled. Eleven experts were invited to join a Delphi panel and 10 accepted. Four Delphi rounds reduced the preliminary tool-Quality-Assessment Tool for Experimental Bruxism Studies (Qu-ATEBS)- to 8 items: study aim, study sample, control condition or group, study design, experimental bruxism task, statistics, interpretation of results, and conflict of interest statement. Consensus among the Delphi panelists yielded good face validity. Inter-observer reliability was acceptable (k = 0.77). Discriminative validity was excellent (phi coefficient 1.0; P reviews of experimental bruxism studies, exhibits face validity, excellent discriminative validity, and acceptable inter-observer reliability. Development of quality assessment tools for many other topics in the orofacial pain literature is needed and may follow the described procedure.
Thompson, Bruce; Cook, Colleen
Research libraries are increasingly supplementing collection counts with perceptions of service quality as indices of status and productivity. The present study was undertaken to explore the reliability and validity of scores from the SERVQUAL measurement protocol (A. Parasuraman and others, 1991), which has previously been used in this type of…
DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M
The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no
Güler, Sibel; Turan, F Nesrin
Our aim was to translate the Quality of Life in Essential Tremor Questionnaire (QUEST) advanced by Troster (2005) and to analyse the validity and reliability of this questionnaire. Two hundred twelve consecutive patients with essential tremor (ET) and forty-three control subjects were included in the study. Permission for the translation and validation of the QUEST scale was obtained. The translation was performed according to the guidelines provided by the publisher. After the translation, the final version of the scale was administered to both groups to determine its reliability and validity. The QUEST Physical, Psychosocial, communication, Hobbies/leisure and Work/finance scores were 0.967, 0.968, 0.933, 0.964 and 0.925, respectively. There were good correlations between each of the QUEST scores that were indicative of good internal consistency. Additionally, we observed that all of the QUEST scores were most strongly related to the right and left arms (p=0.0001). However, we observed that all of the QUEST scores were weakly related to the voice, head and right leg (p=0.0001). These findings support the notion that the Turkish version of the Quality of Life in Essential Tremor (QUEST) questionnaire is a valid and reliable tool for the assessment of the quality of life of patients with ET.
Full Text Available Introduction: Many studies reported poorer quality of life (QoL in youth with diabetes compared to healthy peers. One of the tools used is the Diabetes Quality of Life for Youth (DQoLY questionnaire in English. A validated instrument in Malay is needed to assess the perception of QoL among youth with diabetes in Malaysia. Objective: To translate the modified version, i.e., the DQoLY questionnaire,into Malay and determine its reliability and validity.Methods: Translation and back-translation were used. An expert panel reviewed the translated version for conceptual and content equivalence. The final version was then administered to youths with type 1 diabetes mellitus from the universities and Ministry of Health hospitals between August 2006 and September 2007. Reliability was analysed using Cronbach’s alpha, while validity was confirmed using concurrent validity (HbA1c and self-rated health score.Results: A total of 82 youths with type 1 diabetes (38 males aged 10-18 years were enrolled from eight hospitals. The reliability of overall questionnaire was 0.917, and the reliabilities of the three domains ranged from 0.832 to 0.867. HbA1c was positively correlated with worry (p=0.03. The self-rated health score was found to have significant negative correlation with the “satisfaction” (p=0.013 and “impact” (p=0.007 domains.Conclusion: The Malay translated version of DQoLY questionnaire was reliable and valid to be used among youths with type 2 diabetes in Malaysia.
Tezcaner, Zahide Çiler; Aksoy, Songül
This study aims to test the validity and reliability of the Turkish version of the Voice-Related Quality of Life (V-RQOL) questionnaire. This is a nonrandomized, prospective study with control group. The questionnaire was administered to 249 individuals-130 with vocal complaint and 119 without-with a mean age of 37.8 ± 12.3 years. The Turkish version of the Voice Handicap Index (VHI) and perceptual voice evaluation measures were also administered at 2-14 days for retest reliability. The instrument was submitted to validity and reliability evaluation. The V-RQOL measure showed a strong internal consistency and test-retest reliability; the Cronbach's alpha coefficient for the overall V-RQOL was 0.969, the physical functioning domain was 0.949, and the social-emotional domain was 0.940. In the test-retest reliability test, the overall V-RQOL was found to be 0.989. The construct validity of the V-RQOL was determined based on the strength and direction of its relation to the VHI and the perceptual voice evaluation measure. The higher the VHI level, the lower the physical functioning, social-emotional, and overall score levels of the V-RQOL (r = -0.927, r = -0.912, r = -0.944, respectively; P reliability and validity and may play a crucial role in evaluating Turkish-speaking patients with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Assunção, Flávia Fernanda Oliveira; Dantas, Rosana Aparecida Spadoti; Ciol, Márcia Aparecida; Gonçalves, Natália; Farina, Jayme Adriano; Rossi, Lidia Aparecida
The aims of this study were to adapt the Body Image Quality of Life Inventory (BIQLI) into Brazilian Portuguese (BP) and to assess the psychometric properties of the adapted version. Construct validity was assessed by correlating the BIQLI-BP scores with the Rosenberg's Self-Esteem Scale, with Burns Specific Health Scale-Revised (BSHS-R), and with gender, total body surface area burned, and visibility of the scars. Participants were 77 adult burn patients. Cronbach's alpha for the adapted version was .90 and moderate linear correlations were found between body image and self-esteem and between BIQLI-BP scores and two domains of the BSHS-R: affect and body image and interpersonal relationships. The BIQLI-BP showed acceptable levels of reliability and validity for Brazilian burn patients. Copyright © 2013 Wiley Periodicals, Inc.
Herrera-Kiengelher, L; Zepeda-Zaragoza, J; Austria-Corrales, F; Vázquez-Zarate, V M
Patient Safety is a major public health problem worldwide and is responsibility of all those involved in health care. Establishing a Safety Culture has proved to be a factor that favors the integration of work teams, communication and construction of clear procedures in various organizations. Promote a culture of safety depends on several factors, such as organization, work unit and staff. Objective assessment of these factors will help to identify areas for improvement and establish strategic lines of action. [corrected] To adapt, validate and calibrate the questionnaire Culture of Quality in Health Services (CQHS) in Mexican population. A cross with a stratified representative sample of 522 health workers. The questionnaire was translated and adapted from Singer's. Content was validated by experts, internal consistency, confirmatory factorial validity and item calibration with Samejima's Graded Response Model. Convergent and divergent construct validity was confirmed from the CQHS, item calibration showed that the questionnaire is able to discriminate between patients and represent different levels of the hypothesized dimensions with greater accuracy and lower standard error. The CQHS is a valid and reliable instrument to assess patient safety culture in hospitals in Mexico. Copyright © 2013 SECA. Published by Elsevier Espana. All rights reserved.
Full Text Available Abstract Background This article describes the development and validation of a self-reported questionnaire, the KQoL-26, that is based on the views of patients with a suspected ligamentous or meniscal injury of the knee that assesses the impact of their knee problem on the quality of their lives. Methods Patient interviews and focus groups were used to derive questionnaire content. The instrument was assessed for data quality, reliability, validity, and responsiveness using data from a randomised trial and patient survey about general practitioners' use of Magnetic Resonance Imaging for patients with a suspected ligamentous or meniscal injury. Results Interview and focus group data produced a 40-item questionnaire designed for self-completion. 559 trial patients and 323 survey patients responded to the questionnaire. Following principal components analysis and Rasch analysis, 26 items were found to contribute to three scales of knee-related quality of life: physical functioning, activity limitations, and emotional functioning. Item-total correlations ranged from 0.60–0.82. Cronbach's alpha and test retest reliability estimates were 0.91–0.94 and 0.80–0.93 respectively. Hypothesised correlations with the Lysholm Knee Scale, EQ-5D, SF-36 and knee symptom questions were evidence for construct validity. The instrument produced highly significant change scores for 65 trial patients indicating that their knee was a little or somewhat better at six months. The new instrument had higher effect sizes (range 0.86–1.13 and responsiveness statistics (range 1.50–2.13 than the EQ-5D and SF-36. Conclusion The KQoL-26 has good evidence for internal reliability, test-retest reliability, validity and responsiveness, and is recommended for use in randomised trials and other evaluative studies of patients with a suspected ligamentous or meniscal injury.
Garratt, Andrew M; Brealey, Stephen; Robling, Michael; Atwell, Chris; Russell, Ian; Gillespie, William; King, David
This article describes the development and validation of a self-reported questionnaire, the KQoL-26, that is based on the views of patients with a suspected ligamentous or meniscal injury of the knee that assesses the impact of their knee problem on the quality of their lives. Patient interviews and focus groups were used to derive questionnaire content. The instrument was assessed for data quality, reliability, validity, and responsiveness using data from a randomised trial and patient survey about general practitioners' use of Magnetic Resonance Imaging for patients with a suspected ligamentous or meniscal injury. Interview and focus group data produced a 40-item questionnaire designed for self-completion. 559 trial patients and 323 survey patients responded to the questionnaire. Following principal components analysis and Rasch analysis, 26 items were found to contribute to three scales of knee-related quality of life: physical functioning, activity limitations, and emotional functioning. Item-total correlations ranged from 0.60-0.82. Cronbach's alpha and test retest reliability estimates were 0.91-0.94 and 0.80-0.93 respectively. Hypothesised correlations with the Lysholm Knee Scale, EQ-5D, SF-36 and knee symptom questions were evidence for construct validity. The instrument produced highly significant change scores for 65 trial patients indicating that their knee was a little or somewhat better at six months. The new instrument had higher effect sizes (range 0.86-1.13) and responsiveness statistics (range 1.50-2.13) than the EQ-5D and SF-36. The KQoL-26 has good evidence for internal reliability, test-retest reliability, validity and responsiveness, and is recommended for use in randomised trials and other evaluative studies of patients with a suspected ligamentous or meniscal injury.
Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P
The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Yi, Sang Won; Kim, Ji-Hae; Lim, Ki-Young; Bang, Dongsik; Lee, Sungnack
Purpose The Behçet's Disease Quality of Life (BD-QoL) is a BD-specific measure developed in the UK. The aim of this study was to adapt the BD-QoL for use in Korea. Patients and Methods The translation was based on the guidelines for cross-cultural adaptation. A total of 201 Korean patients with BD participated in this study. To evaluate the psychometric properties, internal consistency and test-retest reliability were used. Factor analysis was performed to examine the construct validity. To provide further evidence for validity, the correlation of BD-QoL with the Clinical Activity Form for Korean Patients with BD (BDCAF-K) and the Center for Epidemiologic Studies-Depression (CES-D) scales was assessed. Results The Korean version had high internal consistency (Cronbach's alpha, 0.93) and test-retest reliability (r = 0.835). Factor analysis of the questionnaire revealed one interpretable factor as a general health-related quality of life factor. The Korean version significantly correlated with scores of CES-D (r = 0.749, p= 0.000), self-rating scale of well-being over the past 28 days (r = 0.446, p= 0.000), and BDCAF-K score (r = 0.502, p = 0.000). Conclusion Adaptation of the BD-QoL for use in Korea was successful. Together with the BDCAF-K, it may be a valuable tool for assessing the influence of interventions in BD patients and outcome in clinical trials. PMID:18972588
van der Leeuw, Renée; Lombarts, Kiki; Heineman, Maas Jan; Arah, Onyebuchi
The importance of effective clinical teaching for the quality of future patient care is globally understood. Due to recent changes in graduate medical education, new tools are needed to provide faculty with reliable and individualized feedback on their teaching qualities. This study validates two
Hickman, Ronald L; Clochesy, John M; Hetland, Breanna; Alaamri, Marym
There are limited reliable and valid measures of the patient- provider interaction among adults with hypertension. Therefore, the purpose of this report is to describe the construct validity and reliability of the Questionnaire on the Quality of Physician-Patient Interaction (QQPPI), in community-dwelling adults with hypertension. A convenience sample of 109 participants with hypertension was recruited and administered the QQPPI at baseline and 8 weeks later. The exploratory factor analysis established a 12-item, 2-factor structure for the QQPPI was valid in this sample. The modified QQPPI proved to have sufficient internal consistency and test- retest reliability. The modified QQPPI is a valid and reliable measure of the provider-patient interaction, a construct posited to impact self-management, in adults with hypertension.
Renée van der Leeuw
Full Text Available BACKGROUND: The importance of effective clinical teaching for the quality of future patient care is globally understood. Due to recent changes in graduate medical education, new tools are needed to provide faculty with reliable and individualized feedback on their teaching qualities. This study validates two instruments underlying the System for Evaluation of Teaching Qualities (SETQ aimed at measuring and improving the teaching qualities of obstetrics and gynecology faculty. METHODS AND FINDINGS: This cross-sectional multi-center questionnaire study was set in seven general teaching hospitals and two academic medical centers in the Netherlands. Seventy-seven residents and 114 faculty were invited to complete the SETQ instruments in the duration of one month from September 2008 to September 2009. To assess reliability and validity of the instruments, we used exploratory factor analysis, inter-item correlation, reliability coefficient alpha and inter-scale correlations. We also compared composite scales from factor analysis to global ratings. Finally, the number of residents' evaluations needed per faculty for reliable assessments was calculated. A total of 613 evaluations were completed by 66 residents (85.7% response rate. 99 faculty (86.8% response rate participated in self-evaluation. Factor analysis yielded five scales with high reliability (Cronbach's alpha for residents' and faculty: learning climate (0.86 and 0.75, professional attitude (0.89 and 0.81, communication of learning goals (0.89 and 0.82, evaluation of residents (0.87 and 0.79 and feedback (0.87 and 0.86. Item-total, inter-scale and scale-global rating correlation coefficients were significant (P<0.01. Four to six residents' evaluations are needed per faculty (reliability coefficient 0.60-0.80. CONCLUSIONS: Both SETQ instruments were found reliable and valid for evaluating teaching qualities of obstetrics and gynecology faculty. Future research should examine improvement of
Azevedo, Auro Mauro; Alonso, Neide Barreira; Vidal-Dourado, Marcos; Noffs, Maria Helena da Silva; Pascalicchio, Tatiana Frascarelli; Caboclo, Luís Otávio Sales Ferreira; Ciconelli, Rozana Mesquita; Sakamoto, Américo Ceiki; Yacubian, Elza Márcia Targas
The purpose of this article was to report the translation of the Quality of Life in Epilepsy Inventory-89 (QOLIE-89) into a Portuguese-Brazilian version and evaluate its reliability and validity. This study involved 105 outpatients: 54 patients with refractory temporal lobe epilepsy (TLE) with mesial temporal sclerosis (MTS) and 51 with juvenile myoclonic epilepsy (JME). Reliability and test-retest reliability were assessed. Relationships between QOLIE-89 domains and other questionnaires (Nottingham Health Profile, Beck Depression Inventory, Adverse Event Profile, Neuropsychological Evaluation), and external measures such as demographic and clinical variables were analyzed to examine construct validity. Internal consistency (Cronbach's alpha=0.73-0.92) and test-retest reliability (intraclass correlation coefficient=0.60-0.84) for individual domains were acceptable. For construct validity, we verified high correlations between the QOLIE-89 and the Nottingham Health Profile, Beck Depression Inventory, Adverse Event Profile, and Neuropsychological Evaluation. For clinical characteristics, the patients with juvenile myoclonic epilepsy had better quality-of-life scores on 11 of 17 QOLIE-89 subscales compared with patients with temporal lobe epilepsy (P<0.05). These results support the reliability and validity of the Portuguese-Brazilian translation of QOLIE-89.
Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti
to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; pGERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Nikjooy, Afsaneh; Jafari, Hassan; Saba, Maryam A; Ebrahimi, Naghmeh; Mirzaei, Rezvan
The Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire is the most validated and the most specific tool for measuring the quality of life of patients with constipation. Over 120 million people live in countries whose official language is Persian. There is no reported Persian version of the PAC-QOL questionnaire yet. The aim of this study was to translate and culturally adapt the PAC-QOL questionnaire and to assess its reliability and validity among Persian patients with chronic constipation. Following the translation and cultural adaptation of the PAC-QOL questionnaire to Persian, 100 patients (mean±SD age=40.51±13.67) with constipation were recruited for validity measurement and 20 patients were re-examined for reliability. Content validity was assessed based on the opinions of an expert committee and the floor/ceiling effect. Construct validity was evaluated according to the hypothesis test. The SF-36 questionnaire was used for concurrent criterion validity, intra-class correlation coefficient for reliability, and Cronbach's alpha for internal consistency. The content validity of the PAC-QOL questionnaire was proven, and there was no floor/ceiling effect. Construct validity also was confirmed based on the hypothesis test. The overall Cronbach's alpha of the PAC-QOL questionnaire was 0.92 (range=0.72-0.92), and the overall intra-class correlation coefficient of the questionnaire was 0.88 (range=0.69-0.87). The correlation between the SF-36 and PAC-QOL questionnaires was moderate. The Persian version of the PAC-QOL questionnaire demonstrated good validity and reliability properties in chronic constipation. Accordingly, Persian researchers and clinicians can benefit from this questionnaire in further research and assessment of treatment outcomes.
Gao, Wenjun; Yuan, Changrong; Wang, Jichuan; Du, Jiarui; Wu, Huiqiao; Qian, Xiaojie; Hinds, Pamela S
The City of Hope Quality of Life-Ostomy Questionnaire is a widely accepted scale to assess quality of life in ostomy patients. However, the validity and reliability of the Chinese version (C-COH) have not been studied. The objective of the study was to assess the validity and reliability of the C-COH among ostomy patients sampled from Shanghai from August 2010 to June 2011. Content validity was examined based on the reviews of a panel of 10 experts; test-retest was conducted to assess the item reliabilities of the scale; a pilot sample (n = 274) was selected to explore the factorial structure of the C-COH using exploratory factor analysis; a validation sample (n = 370) was selected to confirm the findings from the exploratory study using confirmatory factor analysis (CFA). Statistical package SPSS version 16.0 was used for the exploratory factor analysis, and Amos 17.0 was used for the CFA. The C-COH was developed by modifying 1 item and excluding 11 items from the original scale. Four factors/subscales (physical well-being, psychological well-being, social well-being, and spiritual well-being) were identified and confirmed in the C-COH The scale reliabilities estimated from the CFA results for the 4 subscales were 0.860, 0.885, 0.864, and 0.686, respectively. Findings support the reliability and validity of the C-COH. The C-COH could be a useful measure of the level of quality of life among Chinese patients with a stoma and may provide important intervention implications for healthcare providers to help improve the life quality of patients with a stoma.
Rasmussen, Trine Bernholdt; Konradsen, Hanne; Dixon, Jane
been validated in this patient population. The purpose of this study was thus to assess the validity, reliability and responsiveness of the Danish Body Image Quality of Life Inventory (BIQLI-DA) on patients treated for IE. METHODS: We evaluated the psychometric properties of the BIQLI-DA on data......: The BIQLI-DA may be applicable in healthcare research as it seems to be valid, reliable and responsive; however, evidence should be strengthened through further exploration of instrument performance, particularly regarding responsiveness.......: Participants were seventy patients with a mean age of 58 years and of which 83% were men. Results indicated convergent construct validity by confirming hypothesised associations to potentially related constructs. The BIQLI-DA was found to be highly internally consistent with a Cronbach's alpha of 0...
Dawson, A.; Raphael, K.G.; Glaros, A.; Axelsson, S.; Arima, T.; Ernberg, M.; Farella, M.; Lobbezoo, F.; Manfredini, D.; Michelotti, A.; Svensson, P.; List, T.
AIMS: To combine empirical evidence and expert opinion in a formal consensus method in order to develop a quality-assessment tool for experimental bruxism studies in systematic reviews. METHODS: Tool development comprised five steps: (1) preliminary decisions, (2) item generation, (3) face-validity
Anderson, James D; Johnston, Dennis A; Haugh, Gil S; Kiat-Amnuay, Sudarat; Gettleman, Lawrence
The purpose of this study was to refine the Toronto Outcome Measure for Craniofacial Prosthetics (TOMCP), present evidence for its reliability and validity, and use the instrument to explore differences in quality of life between prostheses made with chlorinated polyethylene (CPE) (experimental) and silicone (control). As part of a multicenter prospective controlled randomized double-blind single-crossover clinical trial of the two materials, the TOMCP was administered at the start and end of two 4-month study arms, during which 42 patients wore prostheses made from one material then the other. Reliability was assessed at the crossover. To determine validity of the TOMCP, the Linear Analogue Self-Assessment (LASA-12) and the Short-Form 8 (SF-8) were also administered with the TOMCP. The TOMCP was reduced by removing items that were unreliable, had poorly distributed answers, showed increased internal consistency after their removal, or were too highly correlated with more than one other item. The tests of reliability and validity were then repeated. Finally, the reduced instrument was used to test for differences in quality of life between prostheses made of the two materials. The item reduction tactics pared the 52-item instrument down to 27 items. The correlations of both TOMCP versions with the LASA-12 and the SF-8 were found to be statistically significant, providing evidence of the validity of the TOMCP. The instrument revealed significantly better quality of life with silicone rather than CPE prostheses. Both versions of the TOMCP were found to be reliable and valid. The instrument was able to show differences in quality of life between two materials.
Duruöz, M T; Unal, C; Toprak, C Sanal; Sezer, I; Yilmaz, F; Ulutatar, F; Atagündüz, P; Baklacioglu, H S
Background Systemic lupus erythematosus (SLE) may have a profound impact on quality of life. There is increasing interest in measuring quality of life in lupus patients. The purpose of this study was to investigate the validity and reliability of SLE Quality of Life Questionnaire (L-QoL) in Turkish SLE patients. Methods SLE according to 2012 Systemic Lupus International Collaborating Clinics Classification Criteria were recruited into the study. Demographic data, clinical parameters and disease activity measured with the Systemic Lupus Erythematosus Disease Activity Index-2000 (SLEDAI-2K); were noted. Nottingham Health Profile and Health Assessment Questionnaire were filled out in addition to the Turkish L-QoL (LQoL-TR). Internal consistency, test-retest reliability, and convergent and discriminant validity were evaluated. Results The mean age of participants was 43.55 ± 14.33 years and the mean disease duration was 89.8 ± 92.1 months. The patients filled out LQoL-TR in 2.5 min. Strong correlation of LQoL-TR with all subgroups of the Nottingham Health Profile and the Health Assessment Questionnaire were established showing the convergent validity. The highest correlation was demonstrated with emotional reactions (rho = 0.72) and sleep component (rho = 0.65) of the Nottingham Health Profile scale ( p < 0.0001). Its poor and not significant correlation with nonfunctional parameters (age, disease duration, perceived general health, SLEDAI-2K) showed its discriminative properties. LQoL-TR demonstrated good internal reliability with a Cronbach's α of 0.93 and test-retest reliability with intraclass correlation coefficient of 0.87. Conclusion The LQoL-TR is a practical and useful tool which demonstrates good validity and reliability.
Hosseini, Mandana; Gotfredsen, Klaus
OBJECTIVES: To test the reliability and validity of six aesthetic parameters and to compare the professional- and patient-reported aesthetic outcomes. MATERIAL AND METHODS: Thirty-four patients with 66 implant-supported premolar crowns were included. Two prosthodontists and 11 dental students......,24) were found between patient and professional evaluations. CONCLUSIONS: The feasibility, reliability and validity of the CIS make the parameters useful for quality control of implant-supported restorations. The professional- and patient-reported aesthetic outcomes had no significant correlation....... and the internal consistency were analysed by Cohen's ¿ and Cronbach's a, respectively. The validity of CIS parameters was tested against the corresponding Visual Analogue Scales (VAS) scores. The Spearman correlation coefficients were used. Six aesthetic Oral Health Impact Profile (OHIP) questions were correlated...
van der Leeuw, Renée; Lombarts, Kiki; Heineman, Maas Jan; Arah, Onyebuchi
The importance of effective clinical teaching for the quality of future patient care is globally understood. Due to recent changes in graduate medical education, new tools are needed to provide faculty with reliable and individualized feedback on their teaching qualities. This study validates two instruments underlying the System for Evaluation of Teaching Qualities (SETQ) aimed at measuring and improving the teaching qualities of obstetrics and gynecology faculty. This cross-sectional multi-center questionnaire study was set in seven general teaching hospitals and two academic medical centers in the Netherlands. Seventy-seven residents and 114 faculty were invited to complete the SETQ instruments in the duration of one month from September 2008 to September 2009. To assess reliability and validity of the instruments, we used exploratory factor analysis, inter-item correlation, reliability coefficient alpha and inter-scale correlations. We also compared composite scales from factor analysis to global ratings. Finally, the number of residents' evaluations needed per faculty for reliable assessments was calculated. A total of 613 evaluations were completed by 66 residents (85.7% response rate). 99 faculty (86.8% response rate) participated in self-evaluation. Factor analysis yielded five scales with high reliability (Cronbach's alpha for residents' and faculty): learning climate (0.86 and 0.75), professional attitude (0.89 and 0.81), communication of learning goals (0.89 and 0.82), evaluation of residents (0.87 and 0.79) and feedback (0.87 and 0.86). Item-total, inter-scale and scale-global rating correlation coefficients were significant (Pteaching qualities of obstetrics and gynecology faculty. Future research should examine improvement of teaching qualities when using SETQ.
Lim, Lynette L-Y; Seubsman, Sam-Ang; Sleigh, Adrian
Since its translation to Thai in 2000, the SF-36 Health Survey has been used extensively in many different clinical settings in Thailand. Its popularity has increased despite the absence of published evidence that the translated instrument satisfies scoring assumptions, the psychometric properties required for valid interpretation of the SF-36 summated ratings scales. The purpose of this paper was to examine these properties and to report on the reliability and validity of the Thai SF-36 in a non-clinical general population. 1345 distance-education university students who live in all areas of Thailand completed a questionnaire comprising the Thai SF-36 (Version 1). Median age was 31 years. Psychometric tests recommended by the International Quality of Life Assessment Project were used. Data quality was satisfactory: questionnaire completion rate was high (97.5%) and missing data rates were low (Vitality scale correlated better with the Mental Health scale than with itself, possibly because a healthy mental state is central to the concept of vitality in Thailand. The summated ratings method can be used for scoring the Thai SF-36. The instrument was found to be reliable and valid for use in a general non-clinical population. Version 2 of the SF-36 could improve ceiling and floor effects in the role functioning scales. Further work is warranted to refine items that measure the concepts of social functioning, vitality and mental health to improve the reliability and discriminant validity of these scales.
Full Text Available Abstract Background Since its translation to Thai in 2000, the SF-36 Health Survey has been used extensively in many different clinical settings in Thailand. Its popularity has increased despite the absence of published evidence that the translated instrument satisfies scoring assumptions, the psychometric properties required for valid interpretation of the SF-36 summated ratings scales. The purpose of this paper was to examine these properties and to report on the reliability and validity of the Thai SF-36 in a non-clinical general population. Methods 1345 distance-education university students who live in all areas of Thailand completed a questionnaire comprising the Thai SF-36 (Version 1. Median age was 31 years. Psychometric tests recommended by the International Quality of Life Assessment Project were used. Results Data quality was satisfactory: questionnaire completion rate was high (97.5% and missing data rates were low ( Conclusion The summated ratings method can be used for scoring the Thai SF-36. The instrument was found to be reliable and valid for use in a general non-clinical population. Version 2 of the SF-36 could improve ceiling and floor effects in the role functioning scales. Further work is warranted to refine items that measure the concepts of social functioning, vitality and mental health to improve the reliability and discriminant validity of these scales.
Full Text Available Abstract Background Health-related quality of life is a very important issue in children with end-stage renal disease and their family. Moreover, this can be a lifelong problem. In this study, we performed a cross-sectional investigation of the health-related quality of life in Korean children, undergoing renal replacement therapies, such as dialysis and renal transplantation. Findings We validated the Korean version of the PedsQL 3.0 End-Stage Renal Disease Module by comparing with the PedsQL 4.0 Generic Core Scales. A total of 92 pediatric patients with end-stage renal disease, aged 2–18 year old, were enrolled in four teaching hospitals in Korea. The module was acceptable for both parent proxy-report and child self-report. The response rate was acceptable, since no reminders were delivered. A large proportion of the responders answered >90% of the items, which suggests a good face validity. The PedsQL 4.0 Generic Core Scales and the PedsQL 3.0 End-Stage Renal Disease Module showed minimal missing values in the current study, which supported feasibility. The validation analyses revealed acceptable floor and ceiling effects and an acceptable construct validity. Conclusions The PedsQL 3.0 End-stage Renal Disease Module may be useful as an end-stage renal disease -specific instrument in the evaluation of the health-related quality of life in Korean children; however, a larger, longitudinal prospective study is needed.
Bengi, Göksel; Yalçın, Mustafa; Akpınar, Hale; Keskinoğlu, Pembe; Ellidokuz, Hülya
There are few specific evaluation forms for evaluating the quality of life among patients with chronic constipation. Our study aimed to determine the validity and reliability of the translated Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire for the Turkish population because evidence of its reliability and validity is required to justify its use in other studies and clinical practice. This study included 154 patients with constipation who were treated at the Department of Gastroenterology, Dokuz Eylül University Hospital between January and June 2012. The translated PAC-QOL questionnaire was completed by patients at the clinic and also at a 2-week follow-up to test its reliability. Cronbach's alpha coefficient (internal consistency) was 0.91 (good) for the translated PAC-QOL questionnaire. Time validity was evaluated using the intraclass correlation coefficient (ICC) method, and the ICC value for all questions was confirmed as 0.68 at the 2-week follow-up. The validity of the tool in the study group was evaluated using factor analysis, and the results were highly significant (Kaiser-Meyer-Olkin value: 0.857; Bartlett's test: p=0.001). Questions were categorized according to six factors based on the factor analysis, and these factors explained 65.1% of the total variation. For hypothesis verification of the tool, the correlation coefficient for PAC-QOL and PAC Symptoms (PAC-SYM) was r=0.577 (p<0.001), whereas the correlation coefficient for PAC-QOL and constipation severity score was r=0.457 (p<0.001). The PAC-QOL questionnaire was reliable, although not valid because of the limited sample group.
Full Text Available Background and objectives : Identifying the readiness of hospital and its strengths and weaknesses can be useful in developing appropriate planning and situation analyses and management to getting effective in clinical audit programs. The aim of this study was to design and assess the validity of the Hospital Readiness Assessment Tools to conduct quality improvement and clinical audit programs. Material and Methods: In this study, based on the results of a systematic review of literature, an initial questionnaire with 77 items was designed. Questionnaire content validity was reviewed by experts in the field of hospital management and quality improvement in Tabriz University of Medical Sciences. For this purpose, 20 questionnaires were sent to experts. Finally, 15 participants returned completed questionnaire. Questionnaire validity was reviewed and confirmed based on Content Validity Index and Content Validity Ratio. Questionnaire reliability was confirmed based on Cronbach's alpha index (α = 0.96 in a pilot study by participation of 30 hospital managers. Results: The results showed that the final questionnaire contains 54 questions as nine category as: data and information (9 items, teamwork (12 questions, resources (5 questions, patient and education (5, intervention design and implementation (5 questions, clinical audit management (4 questions, human resources (6 questions, evidence and standard (4 items and evaluation and feedback (4 items. The final questionnaire content validity index was 0.91 and final questionnaire Cronbach's alpha coefficient was 0.96. Conclusion: Considering the relative good validity and reliability of the designed tool in this study, it appears that the questionnaire can be used to identify and assess the readiness of hospitals for quality improvement and clinical audit program implementation
da Silva, Tatiana Indelicato; Ciconelli, Rozana Mesquita; Alonso, Neide Barreira; Azevedo, Auro Mauro; Westphal-Guitti, Ana Carolina; Pascalicchio, Tatiana Frascarelli; Marques, Carolina Mattos; Caboclo, Luís Otávio Sales Ferreira; Cramer, Joyce A; Sakamoto, Américo Ceiki; Yacubian, Elza Márcia Targas
We report the cultural adaptation and psychometric properties of the Quality of Life in Epilepsy-31 Inventory (QOLIE-31) for the Portuguese language and Brazilian culture. This study involved 150 outpatients: 50 presurgical patients with refractory temporal lobe epilepsy (TLE) related to mesial temporal sclerosis (MTS), 50 patients with juvenile myoclonic epilepsy (JME), and 50 seizure-free patients with TLE. They completed the QOLIE-31, Nottingham Health Profile (NHP), Beck Depression Inventory (BDI), and Adverse Events Profile (AEP) and underwent a neuropsychological evaluation (NE). Internal consistency reliability, interrater and test-retest reliability, and construct validity were assessed. QOLIE-31 mean scores were 33.1 (Social Function), 68.9 (Overall Quality of Life), 56.5 (Seizure Worry), 64.1 (Emotional Well-Being), 63.7 (Energy/Fatigue), 38.9 (Cognitive Function), and 49.7 (Medication Effects). Internal consistency was high (Cronbach's alpha), as were the associations between QOLIE-31 and the BDI, NHP, AEP, and NE. The Portuguese/Brazilian version of the QOLIE-31 inventory showed good reliability, validity, and construct validity.
Naito, Yusuke; Tanaka, Yuu; Sasaoka, Noriyuki; Iwata, Toshio; Fujimoto, Yuko; Okamoto, Nozomi; Inoue, Satoki; Kawaguchi, Masahiko
The Postoperative Quality of Recovery Scale (PQRS) is a multi-domain tracking scale to assess recovery after surgery. The PQRS is used in seven countries and five languages; however, the Japanese version of the PQRS (PQRSj) has not been established. We therefore translated the PQRS into Japanese, and examined the feasibility, reliability and validity of the PQRSj. The time taken to complete the test was measured to assess feasibility. Cronbach's alpha was calculated to assess reliability. The Mini Mental State Examination (MMSE) and the Japanese version of the Quality of Recovery Scale 40 (QoR-40 J) were performed for comparison with the PQRSj (validity). Fifty-one patients were enrolled in the study. The mean completion time was 3.9 min for baseline (feasibility). Cronbach's alpha was between 0.40 and 0.94 in each domain (reliability). A relationship was shown between cognitive domain and MMSE at baseline (r = 0.65, P < 0.01); however, no relationship was found between the other domains and the MMSE and QoR-40 J. Ceiling effects were observed in 78% of the questions. These results indicate that the PQRSj can be used to assess recovery after surgery, although it may be better to revise some of the questions to improve the validity of the PQRSj.
Ohaeri, Jude U; Awadallab, Abdel W
There is rising interest in quality of life (QOL) research in Arabian countries. The aim of this study was to assess in a nationwide sample of Kuwaiti subjects the reliability and validity of the World Health Organization Quality of Life (WHOQOL-BREF), a shorter version of the widely used QOL assessment instrument that comprises 26 items in the domains of physical health, psychological health, social relationships, and the environment. A one-in-three systematic random proportionate sample of consenting Kuwaiti nationals attending large cooperative stores and municipal government offices in the six governorates completed the Arabic translation of the questionnaire. The indices assessed included test-retest reliability, internal consistency, item internal consistency (2C), item discriminant validity (IDV), known-groups and construct validity. There were 3303 participants (44.8% males, 55.2% females, mean age 35.4 years, range 16 to 87 years). The intra-class correlation for the test-retest statistic and the internal consistency values for the full questionnaire and the domains had a Cronbach's alpha > - 0.7. Of the 24 items that constitute the domains, 21 met the 2 C requirement of correlation > - 0.4 with the corresponding domain, while 16 met the IDV criterion of having a higher correlation with their corresponding domain than other domains. Domain scores discriminated significantly between well and sick groups. In the factor analysis, four strong factors emerged with the same construct as in the WHO report. The Arabic translation of the WHOQOL-BREF has impressive reliability and validity indices. The poor IDV findings are due to the multidimensional nature of the questionnaire. The highly significant validity indices should reassure researchers that the questionnaire represents the same constructs across cultures. Negatively worded items possibly need refinement. (author)
Rekleiti, Maria; Souliotis, Kyriakos; Sarafis, Pavlos; Kyriazis, Ioannis; Tsironi, Maria
The present study focuses on studying the validity and reliability of the Greek edition of DQOL-BCI. DQOL-BCI includes 15 questions-elements that are evaluated on a 5-grade scale like Likert and two general form-shapes. The translation process was conducted in conformity with the guidelines of EuroQol group. A non-random sample of 65 people-patients diagnosed with diabetes I and II was selected. The questionnaire that was used to collect the data was the translated version of DQOL-BCI, and included the demographic characteristics of the interviewees. The content validity of DQOL-BCI was re-examined from a team of five experts (expert panel) for qualitative and quantitative performance. The completion of the questionnaire was done via a personal interview. The sample consisted of 58 people (35 men and 23 women, 59.9 ± 10.9 years). The translation of the questionnaire was found appropriate in accordance to the peculiarities of the Greek language and culture. The largest deviation of values is observed in QOL1 (1.71) in comparison to QOL6 (2.98). The difference between the standard deviations is close to 0.6. The statistics results of the tests showed satisfactory content validity and high construct validity, while the high values for Cronbach alpha index (0.95) reveal high reliability and internal consistency. The Greek version of DQOL-BCI has acceptable psychometric properties and appears to demonstrate high internal reliability and satisfactory construct validity, which allows its use as an important tool in evaluating the quality of life of diabetic patients in relation to their health. Copyright © 2018. Published by Elsevier B.V.
Full Text Available OBJECTIVE: The Psoriatic Arthritis Quality of Life (PsAQoL questionnaire is a disease- specific instrument developed to measure quality of life (QoL in patients with psoriatic arthritis (PsA. The aim of this study was to translate the measure into Dutch and to determine its psychometric properties. METHOD: Translation of the original English PsAQoL into Dutch was performed by bilingual and lay panel. Ten field-test interviews with PsA patients were performed to assess face and content validity. In total, 211 PsA patients were included in a test-retest postal survey to investigate the reliability and construct validity of the Dutch adaptation of the PsAQoL. The PsAQoL, Health Assessment Questionnaire (HAQ and Skindex-17 were administered on two different occasions approximately two weeks apart. RESULTS: The Dutch version of the PsAQoL was found to be relevant, understandable and easy to complete in only a few minutes. It correlated as expected with the HAQ (Spearman's ρ = 0.72 and the 2 subscales of the Skindex-17 (ρ = 0.40 for the psychosocial and ρ = 0.46 for the symptom scale. Furthermore, the measure had good internal consistency (Cronbach's α = 0.92 and test-retest reliability (ρ = 0.89. The PsAQoL was able to define groups of patients based on self-reported general health status, self-reported severity of PsA and flare of arthritis. Duration of PsA did not influence PsAQoL scores. CONCLUSIONS: The Dutch version of the PsAQoL is a valid and reliable questionnaire suitable for use in clinical or research settings to asses PsA-specific QoL.
Full Text Available Abstract Background The Effective Musculoskeletal Consumer Scale (EC-17 is a self-administered questionnaire for evaluating self-management interventions that empower and educate people with rheumatic conditions. The aim of the study was to translate and evaluate the Norwegian version of EC-17 against the necessary criteria for a patient-reported outcome measure, including responsiveness to change. Methods Data quality, reliability, validity and responsiveness were assessed in two groups. One group comprising 103 patients received a questionnaire before and at the end of a self-management programme. The second group comprising 96 patients' received the questionnaire two weeks before and on arrival of the program. Internal consistency and test-retest reliability were assessed. Construct validity was assessed through comparisons with the Brief Approach/Avoidance Coping Questionnaire, (BACQ, the Emotional Approach Coping Scale (EAC and the General Health Questionnaire (GHQ-20. Responsiveness was assessed with the Standardised Response Mean (SRM. Results Respondents included 66 (64% and 52 (54% patients from the first and second groups respectively. Levels of missing data were low for all items. There was good evidence for unidimensionality, item-total correlations ranged from 0.59 to 0.82 and Cronbach's Alpha and test-retest correlations were over 0.90. As hypothesised EC-17 scores had statistically significant low to moderate correlations with the BACQ, EAC and GHQ-20 in the range 0.26 to 0.42. Following the self-management program, EC-17 scores showed a significant improvement with an SRM of 0.48. Conclusion The Norwegian version of the EC-17 has evidence for data quality, internal consistency and test-retest reliability, construct validity and responsiveness to change. The EC-17 seems promising as an outcome measure for evaluating self-management interventions for people with rheumatic conditions, but further studies are needed.
Hamnes, Bente; Garratt, Andrew; Kjeken, Ingvild; Kristjansson, Elizabeth; Hagen, Kåre B
The Effective Musculoskeletal Consumer Scale (EC-17) is a self-administered questionnaire for evaluating self-management interventions that empower and educate people with rheumatic conditions. The aim of the study was to translate and evaluate the Norwegian version of EC-17 against the necessary criteria for a patient-reported outcome measure, including responsiveness to change. Data quality, reliability, validity and responsiveness were assessed in two groups. One group comprising 103 patients received a questionnaire before and at the end of a self-management programme. The second group comprising 96 patients' received the questionnaire two weeks before and on arrival of the program. Internal consistency and test-retest reliability were assessed. Construct validity was assessed through comparisons with the Brief Approach/Avoidance Coping Questionnaire, (BACQ), the Emotional Approach Coping Scale (EAC) and the General Health Questionnaire (GHQ-20). Responsiveness was assessed with the Standardised Response Mean (SRM). Respondents included 66 (64%) and 52 (54%) patients from the first and second groups respectively. Levels of missing data were low for all items. There was good evidence for unidimensionality, item-total correlations ranged from 0.59 to 0.82 and Cronbach's Alpha and test-retest correlations were over 0.90. As hypothesised EC-17 scores had statistically significant low to moderate correlations with the BACQ, EAC and GHQ-20 in the range 0.26 to 0.42. Following the self-management program, EC-17 scores showed a significant improvement with an SRM of 0.48. The Norwegian version of the EC-17 has evidence for data quality, internal consistency and test-retest reliability, construct validity and responsiveness to change. The EC-17 seems promising as an outcome measure for evaluating self-management interventions for people with rheumatic conditions, but further studies are needed.
Muus, Ingrid; Williams, Linda S; Ringsberg, Karin C
To test the reliability and validity of the Danish version of the Stroke Specific Quality of Life Scale version 2.0 (SS-QOL-DK), an instrument for evaluation of health-related quality of life. A correlational study. A stroke unit that provides acute care and rehabilitation for stroke patients in Frederiksborg County, Denmark. One hundred and fifty-two stroke survivors participated; 24 of these performed test-retest. Questionnaires were sent out and returned by mail. A subsequent telephone interview assessed functional level and missing items. Test-retest was measured using Spearman's r, internal consistency was estimated using Cronbach's alpha, and evaluation of floor and ceiling values in proportion of minimum and maximum scores. Construct validity was assessed by comparing patients' scores on the SS-QOL-DK with those obtained by other test methods: Beck's Depression Index, the General Health Survey Short Form 36 (SF-36), the Barthel Index and the National Institutes of Health Stroke Scale, evaluating shared variance using coefficient of determination, r2. Comparing groups with known scores assessed known-group validity. Convergent and discriminant validity were assessed. Test-retest of SS-QOL-DK showed excellent stability, Spearman's r = 0.65-0.99. Internal consistency for all domains showed Cronbach's alpha = 0.81-0.94. Missing items rate was 1.0%. Most SS-QOL-DK domains showed moderately shared variance with similar domains of other test methods, r2 = 0.03-0.62. Groups with known differences showed statistically significant difference in scores. Item-to-scale correlation coefficients of 0.37-0.88 supported convergent validity. SS-QOL-DK is a reliable and valid instrument for measuring self-reported health-related quality of life on group level among people with mild to moderate stroke.
Baumstarck, Karine; Boyer, Laurent; Boucekine, Mohamed; Aghababian, Valérie; Parola, Nathalie; Lançon, Christophe; Auquier, Pascal
Impaired executive functions are among the most widely observed in patients suffering from schizophrenia. The use of self-reported outcomes for evaluating treatment and managing care of these patients has been questioned. The aim of this study was to provide new evidence about the suitability of self-reported outcome for use in this specific population by exploring the internal structure, reliability and external validity of a specific quality of life (QoL) instrument, the Schizophrenia Quality of Life questionnaire (SQoL18). cross-sectional study. age over 18 years, diagnosis of schizophrenia according to the DSM-IV criteria. sociodemographic (age, gender, and education level) and clinical data (duration of illness, Positive and Negative Syndrome Scale, Calgary Depression Scale for Schizophrenia); QoL (SQoL18); and executive performance (Stroop test, lexical and verbal fluency, and trail-making test). Non-impaired and impaired populations were defined for each of the three tests. For the six groups, psychometric properties were compared to those reported from the reference population assessed in the validation study. One hundred and thirteen consecutive patients were enrolled. The factor analysis performed in the impaired groups showed that the questionnaire structure adequately matched the initial structure of the SQoL18. The unidimensionality of the dimensions was preserved, and the internal/external validity indices were close to those of the non-impaired groups and the reference population. Our study suggests that executive dysfunction did not compromise the reliability or validity of self-reported disease-specific QoL questionnaire. Copyright © 2013 Elsevier B.V. All rights reserved.
Kasitanon, N; Wangkaew, S; Puntana, S; Sukitawut, W; Leong, K P; Louthrenoo, W
The English version of the Systemic Lupus Erythematosus Quality of Life Questionnaire (SLEQOL) is a validated disease-specific quality of life instrument. The aim of this study was to evaluate the psychometric properties of the Thai version of the SLEQOL (SLEQOL-TH). Two independent translators translated the SLEQOL into Thai. The back translation of this version was performed by two other independent translators. The final version, SLEQOL-TH, was completed after resolving the discrepancies revealed by the back translation. One hundred and nine patients with SLE were enrolled to test the reliability, construct validity, floor and ceiling effects, and sensitivity to the changes of the SLEQOL-TH at six months. The differential item functioning (DIF) between the Thai and English versions was analyzed using the partial gamma. The internal consistency of the SLEQOL-TH was satisfactory with the overall Cronbach's alpha of 0.86. The test-retest reliability of the SLEQOL-TH was acceptable with the intra-class correlation coefficient of 0.86. Low correlations between the SLEQOL-TH and SLEDAI were observed. The total score of the SLEQOL-TH was moderately responsive to changes in quality of life, with a standardized response mean of 0.50. When comparing the SLEQOL-TH from Thai SLE patients with the original SLEQOL version obtained from Singapore SLE patients, 11 out of 40 items showed a moderate to large DIF. The SLEQOL-TH has acceptable psychometric properties and shows construct validity. In comparison with the English version of SLEQOL, there are some items that showed DIF. The applicability of the SLEQOL-TH in real-life clinical practice and clinical trials needs to be determined.
Hedlund, Lena; Gyllensten, Amanda Lundvik; Waldegren, Tomas; Hansson, Lars
Motor disturbances and disturbed self-recognition are common features that affect mobility in persons with schizophrenia spectrum disorder and bipolar disorder. Physiotherapists in Scandinavia assess and treat movement difficulties in persons with severe mental illness. The Body Awareness Scale Movement Quality and Experience (BAS MQ-E) is a new and shortened version of the commonly used Body Awareness Scale-Health (BAS-H). The purpose of this study was to investigate the inter-rater reliability and the concurrent validity of BAS MQ-E in persons with severe mental illness. The concurrent validity was examined by investigating the relationships between neurological soft signs, alexithymia, fatigue, anxiety, and mastery. Sixty-two persons with severe mental illness participated in the study. The results showed a satisfactory inter-rater reliability (n = 53) and a concurrent validity (n = 62) with neurological soft signs, especially cognitive and perceptual based signs. There was also a concurrent validity linked to physical fatigue and aspects of alexithymia. The scores of BAS MQ-E were in general higher for persons with schizophrenia compared to persons with other diagnoses within the schizophrenia spectrum disorders and bipolar disorder. The clinical implications are presented in the discussion.
Jakupovic, Vedran; Solakovic, Suajb; Celebic, Nedim; Kulovic, Dzenan
Diabetes is progressive condition which requires various ways of treatment. Adequate therapy prescribed in the right time helps patient to postpone development of complications. Adherence to complicated therapy is challenge for both patients and HCPs and is subject of research in many disciplines. Improvement in communication between HCP and patients is very important in patient's adherence to therapy. Aim of this research was to explore validity and reliability of modified SERVQUAL instrument in attempt to explore ways of motivating diabetic patient to accept prescribed insulin therapy. We used modified SERVQUAL questionnaire as instrument in the research. It was necessary to check validity and reliability of the new modified instrument. Results show that modified Servqual instrument has excellent reliability (α=0.908), so we could say that it measures precisely Expectations, Perceptions and Motivation at patients. Factor analysis (EFA method) with Varimax rotation extracted 4 factors which together explain 52.902% variance of the results on this subscale. Bifactorial solution could be seen on Scree-plot diagram (break at second factor). Results in this research show that modified Servqual instrument which is created in order to measure expectations and perceptions of the patients is valid and reliable. Reliability and validity are proven indeed in additional dimension which was created originally for this research - motivation to accept insulin therapy.
Jakupovic, Vedran; Solakovic, Suajb; Celebic, Nedim; Kulovic, Dzenan
Introduction: Diabetes is progressive condition which requires various ways of treatment. Adequate therapy prescribed in the right time helps patient to postpone development of complications. Adherence to complicated therapy is challenge for both patients and HCPs and is subject of research in many disciplines. Improvement in communication between HCP and patients is very important in patient’s adherence to therapy. Aim: Aim of this research was to explore validity and reliability of modified SERVQUAL instrument in attempt to explore ways of motivating diabetic patient to accept prescribed insulin therapy. Material and Methods: We used modified SERVQUAL questionnaire as instrument in the research. It was necessary to check validity and reliability of the new modified instrument. Results: Results show that modified Servqual instrument has excellent reliability (α=0.908), so we could say that it measures precisely Expectations, Perceptions and Motivation at patients. Factor analysis (EFA method) with Varimax rotation extracted 4 factors which together explain 52.902% variance of the results on this subscale. Bifactorial solution could be seen on Scree-plot diagram (break at second factor). Conclusion: Results in this research show that modified Servqual instrument which is created in order to measure expectations and perceptions of the patients is valid and reliable. Reliability and validity are proven indeed in additional dimension which was created originally for this research - motivation to accept insulin therapy. PMID:29670478
Linde, L.; Sørensen, J.; Østergaard, Morten
OBJECTIVE: To compare validity, reliability, and responsiveness of generic and disease specific health-related quality of life (HRQOL) instruments in rheumatoid arthritis (RA). METHODS: Two samples of patients completed the Medical Outcomes Study Short Form-36 Health Survey (SF-36), EuroQol (EQ)-5D...... and VAS pain were responsive to both improvement and deterioration. CONCLUSION: All instruments were valid measures for HRQOL in RA. The RAQoL and HAQ displayed the best reliability, while the SF-36 bodily pain scale and VAS pain were the most responsive. The choice of instrument should depend......, 15D, Rheumatoid Arthritis Quality of Life Scale (RAQoL), Health Assessment Questionnaire (HAQ), and visual analog scales (VAS) for pain, fatigue, and global RA. Validity (convergent, discriminant, and known-groups) was evaluated in a cross-section of 200 patients. Reliability was evaluated...
Aven, Terje; Heide, Bjornar
In this paper we investigate to what extent risk analysis meets the scientific quality requirements of reliability and validity. We distinguish between two types of approaches within risk analysis, relative frequency-based approaches and Bayesian approaches. The former category includes both traditional statistical inference methods and the so-called probability of frequency approach. Depending on the risk analysis approach, the aim of the analysis is different, the results are presented in different ways and consequently the meaning of the concepts reliability and validity are not the same.
Pakpour, Amir; Yekaninejad, Mirsaeed; Mølsted, Stig
AIM: The aims of the study were to translate the Kidney Disease Quality of Life--Short Form version 1.3 (KDQOL-SF ver. 1.3) questionnaire into Iranian (Farsi), and to then assess it in terms of validity and reliability on Iranian patients. METHODS: The questionnaire was first translated into Farsi...... a larger group (212 patients with end-stage renal disease on haemodialysis). Afterwards, reliability was estimated by internal consistency, and validity was assessed using known group comparisons and constructs for the patient group as a whole. Finally, the factor structure of the questionnaire...... be summarized into an 11 factor structure that jointly accounted for 79.81% of the variance. CONCLUSION: The Iranian version of the KDQOL-SF questionnaire is both highly reliable and valid for use with Iranian patients on haemodialysis....
Normand, J.; Charon, M.
Concern for obtaining high-quality products which will function properly when required to do so is nothing new - it is one manifestation of a conscientious attitude to work. However, the complexity and cost of equipment and the consequences of even temporary immobilization are such that it has become necessary to make special arrangements for obtaining high-quality products and examining what one has obtained. Each unit within an enterprise must examine its own work or arrange for it to be examined; a unit whose specific task is quality assurance is responsible for overall checking, but does not relieve other units of their responsibility. Quality assurance is a form of mutual assistance within an enterprise, designed to remove the causes of faults as far as possible. It begins very early in a project and continues through the ordering stage, construction, start-up trials and operation. Quality and hence reliability are the direct result of what is done at all stages of a project. They depend on constant attention to detail, for even a minor piece of poor workmanship can, in the case of an essential item of equipment, give rise to serious operational difficulties
Fernanda Alessandra Silva Michels
Full Text Available Objective: To validate and assess reliability and understanding of the EORTC–C30 quality of life questionnaire and its breast cancer specific module, the EORTC-BR23. Methods: This study was conducted at the AC Camargo Cancer Hospital, São Paulo, Brazil. A total of 100 women diagnosed with breast cancer were interviewed. Internal consistency, confirmatory factorial analysis, convergent validity, construct validity and degree of understanding were examined. Reliability was assessed by comparison of means at times 1 and 2, inter-class coefficient and Bland-Altman graphics. Results: Cronbach’s alpha ranged from 0.72 to 0.86 for the EORTC-C30 and from 0.78 to 0.83 for the EORTC-BR23 questionnaire. Most questions were confirmed in the confirmatory factorial analysis. In the construct validity analysis, the questionnaires were capable of differentiating patients with or without lymphedema, apart from the symptom scales of both questionnaires. Both questionnaires presented a significant correlation in most domains of the SF-36, in the convergent validity analysis. Only a few criticisms were reported concerning questions, and the mean grade of understanding was high (C30 = 4.91 and BR23 = 4.89. The questionnaires presented good rates of reliability, with the exception of the functional scale of the C30 and the symptom scale of the BR23. Conclusions: The EORTC-C30 and EORTC-BR23 quality of life questionnaires were validated, presented good rates of reliability and are easily understood, allowing them to be used in Brazil to assess quality of life among women with breast cancer.
Xiao, Lin; Gao, Yulin; Zhang, Lili; Chen, Peiyun; Sun, Xiaojia; Tang, Siyuan
Previous literatures on quality of life (QoL) in bipolar disorder (BD) strongly suggested that a disease-specific QoL measure for patients with BD should be developed to evaluate QoL more specifically and reliably. To our knowledge, "Quality of Life in Bipolar Disorder" (QoL.BD) is the first and only questionnaire produced to specifically measure QoL in people with BD. In China, there is no disease-targeted measure available to specifically measure QoL in Chinese patients with BD. The aim of the study is to revise and validate the brief version of the QoL.BD (Bref QoL.BD ) into Chinese version. All the items of the Bref QoL.BD was translated into Chinese language, using the Brislin translation mode. The questionnaire was administered to a total sample of 231 subjects, including 101 BD patients and 130 healthy controls, to test the psychometric properties of Bref QoL.BD (e.g. internal consistency, retest reliability, content validity, item analysis, confirmatory factor analysis, criterion validity, convergent validity, discriminative validity and feasibility). The Chinese version of the Bref QoL.BD had very high internal consistency (Cronbach's alpha=0.815) and retest reliability (intraclass correlation coefficient (ICC )=0.808). Confirmatory factor analysis (CFA) validated the original one-factor structure. The direction and magnitude of correlations with 36-item Short-Form Health Survey (SF-36; rs= 0.313, Psize from only one tertiary care center. And BD patients enrolled were euthymic, excluding the acute BD patients. The Chinese version of the Bref QoL.BD is a feasible, reliable and valid tool for the assessment of QoL for Chinese BD patients. Copyright © 2015 Elsevier B.V. All rights reserved.
Oren, Besey; Zengin, Neriman; Yildiz, Nebahat
This study aimed to test the validity and reliability of a version of the tool developed in Sri Lanka in 2011 to assess patient perceptions of the quality of nursing care and related hospital services created for use with Turkish patients. This methodological study was conducted between November 2013 and November 2014 after obtaining ethical approval and organizational permission. Data was collected during discharge from 180 adult patients who were hospitalized for at least 3 days at a medical school hospital located in Istanbul. After language validation, validity and reliability analyses of the scale were conducted. Content validity, content validity index (CVI), construct validity, and exploratory factor analysis were assessed and examined, and reliability was tested using the Cronbach's alpha coefficient and item-total correlations. Mean CVI was found to be 0.95, which is above expected value. Exploratory factor analysis revealed 4 factors with eigenvalues above 1, which explained 82.4% of total variance in the Turkish version of the tool to measure patient perceptions of nursing care and other hospital services. Factor loading for each item was ≥.40. Cronbach's alpha coefficient of sub-dimensions and total scale were found to be 0.84-0.98 and 0.98, respectively. Item-total correlations ranged from 0.56 to 0.83 for the entire group, which was above expected values. The Turkish version of the scale to assess patient perceptions of the quality of nursing care and related hospital services, which comprised 4 sub-dimensions and 36 items, was found to be valid and reliable for use with the Turkish population.
Full Text Available Background and Objectives : Customer Quality refers to customers’ capabilities to be effectively participating in service delivery and correct care processes. The aim of this study was to evaluate the validity and reliability of customer quality questionnaire in services related to patients with Inflammatory Bowel Disease. Material and Methods : This is a qualitative study with the aim of assessment of customer quality questionnaire with 16 questions. This questionnaire was distributed among 11 experts to determine the validity of the questionnaire. Finally, 10 of them expressed their own opinions based on the criteria of validity and in the face validity section , they expressed their opinions qualitatively. Also, reliability of questionnaire was reviewed with 30 samples, according to the internal consistency. Results : In determining the content validity, all the questionnaire items were confirmed by CVR (1 and CVI (0.89 indicators. Also, reliability of the questionnaire was confirmed by Cronbach’s alpha (α=0.78.The final questionnaire was changed according to the experts’ recommendations and their qualitative comments on the 19 questions in four stages of customer quality, including: a the patient’s belief that his/her role in care process is important, b having knowledge, skill and confidence to take action in the care process, c taking action to protect and improve the health or active involvement, d continuity of care, even in critical situations and under stress. Conclusion : Confirmation of the questionnaire with statistical scientific methods showed that this questionnaire is a very strong tool that using it in research can be an effective step in order to improve the quality of health services.
Full Text Available Background: Heart failure due to changes in the lungs, circulation, and skeletal muscle adversely influences the quality of life. Objectives: The objective of the present study is to assess the reliability and validity of the Persian version of the MacNew in patients with heart failure. Patients and Methods: The 200 Iranian patients who referred to Shahid Beheshti hospital in Qom were recruited by convenience sampling. All the patients filled out the MacNew HRQL questionnaire, the hospital anxiety and depression scale (HADS, the Short Form- 36 and socio-demographic and clinical characteristics. The reliability of the MacNew was assessed by internal consistency and test-retest reliability. Construct validity was assessed by factor analysis, convergent validity and discriminant validity. Discriminant validity of MacNew was assessed by the known-groups approach. All analyses were done through SPSS, version 20, and level of significance was considered at P < 0.05. Results: The mean MacNew Global score was 3.6 ± 0.82. Our results demonstrated that internal consistency (α = 0.94 and reproducibility (ICC = 0.84 of the Persian version of the MacNew were confirmed. The Factor analysis confirmed three factors as the original MacNew. Convergent and divergent validity of the MacNew was confirmed by its correlation pattern with physical and mental components of the SF-36. Discriminative validity was confirmed statistically and clinically for the differences in the MacNew scores on the Global scale and each subscale between Iranian patients with and without anxiety and depression. Conclusions: The Persian version of the MacNew HRQL questionnaire can be applied as a reliable and valid tool in the clinical research for Iranian patients with heart failure.
Full Text Available The purpose of this cross-sectional study on 191 HIV/AIDS patient was to prepare the first Persian translation of complete WHOQOL-HIV instrument, evaluate its reliability and validity, and apply it to determine quality of life and its associated factors in Tehran, Iran. Student's t-test was used to compare quality of life between groups. Mean Cronbach’s α of facets in all six domains of instrument were more than 0.6 indicating good reliability. Item/total corrected correlations coefficients had a lower limit of more than 0.5 in all facets except for association between energy and fatigue facet and physical domain. Compared to younger participants, patients older than 35 years had significantly lower scores in overall quality of life (P = 0.003, social relationships (P = 0.021, and spirituality/religion/personal beliefs (P = 0.024. Unemployed patients had significantly lower scores in overall quality of life (P = 0.01, level of independence (P = 0.004, and environment (P = 0.001 compared to employed participants. This study demonstrated that the standard, complete WHOQOL-HIV 120 instrument translated into Farsi and evaluated among Iranian participants provides a reliable and valid basis for future research on quality of life for HIV and other patients in Iran.
Full Text Available Validation of land cover products is a fundamental task prior to data applications. Current validation schemes and methods are, however, suited only for assessing classification accuracy and disregard the reliability of land cover products. The reliability evaluation of land cover products should be undertaken to provide reliable land cover information. In addition, the lack of high-quality reference data often constrains validation and affects the reliability results of land cover products. This study proposes a validation schema to evaluate the reliability of land cover products, including two methods, namely, result reliability evaluation and process reliability evaluation. Result reliability evaluation computes the reliability of land cover products using seven reliability indicators. Process reliability evaluation analyzes the reliability propagation in the data production process to obtain the reliability of land cover products. Fuzzy fault tree analysis is introduced and improved in the reliability analysis of a data production process. Research results show that the proposed reliability evaluation scheme is reasonable and can be applied to validate land cover products. Through the analysis of the seven indicators of result reliability evaluation, more information on land cover can be obtained for strategic decision-making and planning, compared with traditional accuracy assessment methods. Process reliability evaluation without the need for reference data can facilitate the validation and reflect the change trends of reliabilities to some extent.
Speyer, Renée; Heijnen, Bas J; Baijens, Laura W; Vrijenhoef, Femke H; Otters, Elsemieke F; Roodenburg, Nel; Bogaardt, Hans C
Quality of life is an important outcome measurement in objectifying the current health status or therapy effects in patients with oropharyngeal dysphagia. In this study, the validity and reliability of the Dutch version of the Deglutition Handicap Index (DHI) and the MD Anderson Dysphagia Inventory (MDADI) have been determined for oncological patients with oropharyngeal dysphagia. At Maastricht University Medical Center, 76 consecutive patients were selected and asked to fill in three questionnaires on quality of life related to oropharyngeal dysphagia (the SWAL-QOL, the MDADI, and the DHI) as well as a simple one-item visual analog Dysphagia Severity Scale. None of the quality-of-life questionnaires showed any floor or ceiling effect. The test-retest reliability of the MDADI and the Dysphagia Severity Scale proved to be good. The test-retest reliability of the DHI could not be determined because of insufficient data, but the intraclass correlation coefficients were rather high. The internal consistency proved to be good. However, confirmatory factor analysis could not distinguish the underlying constructs as defined by the subscales per questionnaire. When assessing criterion validity, both the MDADI and the DHI showed satisfactory associations with the SWAL-QOL (reference or gold standard) after having removed the less relevant subscales of the SWAL-QOL. In conclusion, when assessing the validity and reliability of the Dutch version of the DHI or the MDADI, not all psychometric properties have been adequately met. In general, because of difficulties in the interpretation of study results when using questionnaires lacking sufficient psychometric quality, it is recommended that researchers strive to use questionnaires with the most optimal psychometric properties.
Ahmadi, Akram; Tohidast, Seyed Abolfazl; Mansuri, Banafshe; Kamali, Mohammad; Krishnan, Gopee
This systematic review aimed to explore the acceptability, reliability, and validity of the Stroke and Aphasia Quality of Life-39 (SAQOL-39) scale across languages. We employed a systematic search of the online databases including MEDLINE (Pubmed), Science direct, Web of science, Psychinfo, Scopus, ProQuest, Google Scholar, and Cochrane library published between 2003 and 2016. We used PRISMA guidelines for conducting and reporting this review. Subsequently, screening of the titles and abstracts, extraction of data as well as the appraisal of the quality of relevant studies were carried out. The initial search returned 8185 studies. Subsequent screening and study selection processes narrowed them to 20, needing detailed review. Forward-backward translation scheme was the preferred method for translation of the SAQOL-39 from English to other languages. Mainly, the socio-cultural and linguistic adaptations were performed in the translated versions. Most versions of the SAQOL-39 showed high test-retest reliability and internal consistency. However, several psychometric properties including the validity and responsiveness were seldom reported in these versions. The SAQOL-39 scale showed high acceptability, and reliability across the languages reviewed in this study. Future translations may additionally focus on reporting the validity and responsiveness of the instrument.
Cheung, Yin Bun; Yeo, Khung Keong; Chong, Kok Joon; Khoo, Eric Yh; Wee, Hwee Lin
The World Health Organization Quality of Life (WHOQOL-BREF) questionnaire is a 26-item questionnaire that evaluates 4 domains of quality of life (QoL), namely Physical, Psychological, Social Relationships and Environment. This study aimed to evaluate the validity and reliability of the WHOQOL-BREF among Singapore residents aged 21 and above. We recruited participants from the general population by using multistage cluster sampling and participants from 2 hospitals by using convenience sampling. Participants completed either English, Chinese or Malay versions of the WHOQOL-BREF and the EuroQoL 5 Dimension 5 Levels (EQ-5D-5L) questionnaires. Confirmatory factor analysis, known-group validity, internal consistency (Cronbach's alpha) and test-retest reliability using the intraclass correlation coefficient (ICC) were performed. Data from 1316 participants were analysed (Chinese: 46.9%, Malay: 41.0% and Indian: 11.7%; 57.5% mean, mean standard deviation [SD, range] age: 51.9 [15.68, 24 to 90] years); 154 participants took part in the retest in various languages (English: 60, Chinese: 49 and Malay: 45). Tucker-Lewis Index (TLI) was 0.919, 0.913 and 0.909 for the English, Chinese and Malay versions, respectively. Cronbach's alpha exceeded 0.7 and ICC exceeded 0.4 for all domains in all language versions. The WHOQOL-BREF is valid and reliable for assessing QoL in Singapore. Model fit is reasonable with room for improvement.
Wendel, Christopher S; Grant, Marcia; Herrinton, Lisa; Temple, Larissa K F; Hornbrook, Mark C; McMullen, Carmit K; Bulkley, Joanna E; Altschuler, Andrea; Krouse, Robert S
Examination of reliability and validity of a specialized health-related quality of life questionnaire for rectal cancer (RC) survivors (≥5 years post-diagnosis). We mailed 1,063 Kaiser Permanente (KP) RC survivors (313 ostomy and 750 anastomosis) a questionnaire containing the Modified City of Hope Quality of Life-Ostomy (mCOH-QOL-O), SF-12v2, Duke-UNC Functional Social Support Questionnaire (FSSQ), and Memorial Sloan-Kettering Cancer Center Bowel Function Index (BFI). We adapted certain BFI items for use by subjects with intestinal ostomies. We evaluated reliability for all instruments with inter-item correlations and Cronbach's alpha. We assessed construct validity only for the BFI in the ostomy group, because such use has not been reported. The overall response rate was 60.5 % (577 respondents/953 eligible). Compared with non-responders, participants were on average 2 years younger and more likely non-Hispanic white, resided in educationally non-deprived areas, and had KP membership through a group. The mCOH-QOL-O, SF-12, and FSSQ were found to be highly reliable for RC survivors. In the ostomy group, BFI Urgency/Soilage and Dietary subscales were found to be reliable, but Frequency was not. Factor analysis supported the construct of Urgency/Soilage and Dietary subscales in the ostomy group, although one item had a moderate correlation with all three factors. The BFI also demonstrated good concurrent validity with other instruments in the ostomy group. With possible exception of the BFI Frequency subscale in populations with ostomies, components of our survey can be used for the entire population of RC survivors, no matter whether they received anastomosis or ostomy.
Cordier, Reinie; Speyer, Renée; Schindler, Antonio; Michou, Emilia; Heijnen, Bas Joris; Baijens, Laura; Karaduman, Ayşe; Swan, Katina; Clavé, Pere; Joosten, Annette Veronica
The Swallowing Quality of Life questionnaire (SWAL-QOL) is widely used clinically and in research to evaluate quality of life related to swallowing difficulties. It has been described as a valid and reliable tool, but was developed and tested using classic test theory. This study describes the reliability and validity of the SWAL-QOL using item response theory (IRT; Rasch analysis). SWAL-QOL data were gathered from 507 participants at risk of oropharyngeal dysphagia (OD) across four European countries. OD was confirmed in 75.7% of participants via videofluoroscopy and/or fiberoptic endoscopic evaluation, or a clinical diagnosis based on meeting selected criteria. Patients with esophageal dysphagia were excluded. Data were analysed using Rasch analysis. Item and person reliability was good for all the items combined. However, person reliability was poor for 8 subscales and item reliability was poor for one subscale. Eight subscales exhibited poor person separation and two exhibited poor item separation. Overall item and person fit statistics were acceptable. However, at an individual item fit level results indicated unpredictable item responses for 28 items, and item redundancy for 10 items. The item-person dimensionality map confirmed these findings. Results from the overall Rasch model fit and Principal Component Analysis were suggestive of a second dimension. For all the items combined, none of the item categories were 'category', 'threshold' or 'step' disordered; however, all subscales demonstrated category disordered functioning. Findings suggest an urgent need to further investigate the underlying structure of the SWAL-QOL and its psychometric characteristics using IRT.
Kunaratnam, Kanita; Halaki, Mark; Wen, Li Ming; Baur, Louise A; Flood, Victoria M
To report on the reliability and validity of a Diet Quality Index (DQI) to assess preschoolers dietary patterns using a short food frequency questionnaire (sFFQ) and 3-day food records (3d-FR). Seventy-seven preschool carers/parents completed a telephone interview on preschoolers (2-5-year olds) dietary habits in metropolitan Sydney. Agreement in scores was assessed using intraclass correlation (ICC) and paired t-tests for repeated sFFQ-DQI scores and Bland-Altman methods and paired t-tests for sFFQ-DQI and 3d-FR-DQI scores. Mean-total sFFQ-DQI ICC scores was high = 0.89, 95% CI (0.81, 0.93). There was weak agreement between sFFQ-DQI and 3d-FR-DQI scores (r = 0.36, p < 0.01). The 3d-FR-DQI scores were positively associated with carbohydrate, folate, ß-carotene, magnesium, calcium, protein, total fat and negatively associated with sugar, starch, niacin, vitamin C, phosphorus, polyunsaturated fat, and monounsaturated fat. The sFFQ-DQI demonstrated good reliability but weak validity. Associations between nutrients and 3d-FR-DQI scores indicate promising usability and warrants further investigation. Further research is needed to establish its validity in accurately scoring children's diet quality using sFFQ compared to 3d-FR before the tool can be implemented for use in population settings.
Bannigan, Katrina; Watson, Roger
To explore and explain the different concepts of reliability and validity as they are related to measurement instruments in social science and health care. There are different concepts contained in the terms reliability and validity and these are often explained poorly and there is often confusion between them. To develop some clarity about reliability and validity a conceptual framework was built based on the existing literature. The concepts of reliability, validity and utility are explored and explained. Reliability contains the concepts of internal consistency and stability and equivalence. Validity contains the concepts of content, face, criterion, concurrent, predictive, construct, convergent (and divergent), factorial and discriminant. In addition, for clinical practice and research, it is essential to establish the utility of a measurement instrument. To use measurement instruments appropriately in clinical practice, the extent to which they are reliable, valid and usable must be established.
Full Text Available Abstract Background This study aimed to investigate the reliability and validity of the Iranian version of the Pediatric Quality of Life Inventory™ 4.0 (PedsQL™ 4.0 Generic Core Scales in children. Methods A standard forward and backward translation procedure was used to translate the US English version of the PedsQL™ 4.0 Generic Core Scales for children into the Iranian language (Persian. The Iranian version of the PedsQL™ 4.0 Generic Core Scales was completed by 503 healthy and 22 chronically ill children aged 8-12 years and their parents. The reliability was evaluated using internal consistency. Known-groups discriminant comparisons were made, and exploratory factor analysis (EFA and confirmatory factor analysis (CFA were conducted. Results The internal consistency, as measured by Cronbach's alpha coefficients, exceeded the minimum reliability standard of 0.70. All monotrait-multimethod correlations were higher than multitrait-multimethod correlations. The intraclass correlation coefficients (ICC between the children self-report and parent proxy-reports showed moderate to high agreement. Exploratory factor analysis extracted six factors from the PedsQL™ 4.0 for both self and proxy reports, accounting for 47.9% and 54.8% of total variance, respectively. The results of the confirmatory factor analysis for 6-factor models for both self-report and proxy-report indicated acceptable fit for the proposed models. Regarding health status, as hypothesized from previous studies, healthy children reported significantly higher health-related quality of life than those with chronic illnesses. Conclusions The findings support the initial reliability and validity of the Iranian version of the PedsQL™ 4.0 as a generic instrument to measure health-related quality of life of children in Iran.
Cafiero, Carlo; Melgar-Quiñonez, Hugo R; Ballard, Terri J; Kepple, Anne W
This paper reviews some of the existing food security indicators, discussing the validity of the underlying concept and the expected reliability of measures under reasonably feasible conditions. The main objective of the paper is to raise awareness on existing trade-offs between different qualities of possible food security measurement tools that must be taken into account when such tools are proposed for practical application, especially for use within an international monitoring framework. The hope is to provide a timely, useful contribution to the process leading to the definition of a food security goal and the associated monitoring framework within the post-2015 Development Agenda. © 2014 New York Academy of Sciences.
Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana
Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) was administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was carried out among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p validity of SS-QoL(Y) showed that items' r value ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha =0.61 to 0.82) for domains scores. These correlations were also significant for the domains and overall scores (p validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Tantilipikorn, Pinailug; Watter, Pauline; Prasertsukdee, Saipin
Quality of Life (QOL) and Health-related Quality of Life (HRQOL) are now considered as necessary outcome measures for children with cerebral palsy (CP). Various reliable and valid condition-specific HRQOL tools are available for these children. One of these is Pediatric Quality of Life Inventory (PedsQL) 3.0 CP module which has been widely used and was translated to many languages. As no Thai version is available, the authors have completed this translation. This study then aimed to investigate psychometric properties of the newly translated Thai PedsQL 3.0 CP module and to establish parent confidence in their ratings in the translated tool. Translation of the PedsQL 3.0 CP module was performed based on linguistic translation guidelines. Then, the psychometric properties of the Thai version were established. PedsQL 3.0 CP module was completed by children with CP and their parents or caregivers twice with 2-4 weeks. Respondents were 97 parents or caregivers and 54 children. Minimal missing data were found. Acceptable internal consistency was supported except for Movement and Balance Scale (self-report). Intraclass correlation coefficients for parent proxy and self-report were good to excellent (0.684-0.950). The feasibility, reliability and validity of the translated tool were supported.
Nockemann, C.; Tillack, G.R.; Schnitger, D.; Heidt, H.
A concept of the harmonic integration of the following three mainstays of the reliability of ndt is proposed: 1. Theoretical prediction of the reliability as a function of physical parameter by computer modelling of the test problem concerned and the ndt process; maximisation by variation of the parameters. 2. Experimental evaluation of the reliability of ndt processes by the application of statistical methods to test practice. 3. Increasing the reliability by the combination of several ndt methods in a standard DV environment and European interconnection and provision of a distributed databank system. International exchange of experience via telecommunication. (orig.) [de
Fayers, Tessa; Dolman, Peter J
To develop and test a user-friendly questionnaire for rapidly assessing quality of life (QOL) in thyroid eye disease (TED). A three-item questionnaire, the TED-QOL, was designed and compared to the 16-item Graves Ophthalmopathy (GO)-QOL and the nine-item GO-Quality of Life Scale (QLS). 100 patients with TED were administered all three questionnaires on two occasions. Results were compared to clinical severity scores (Vision, Inflammation, Strabismus, Appearance (VISA) classification). Main outcomes were construct and criterion validity, test-retest reliability, duration, comprehension and completion rates. TED-QOL correlated strongly with the other questionnaires for corresponding items (Pearson correlation: appearance 0.71, 0.62; functioning 0.69, 0.66; overall QOL 0.53). Test-retest analysis demonstrated good reliability for all three questionnaires (intraclass correlations: TED-QOL 0.81, 0.74, 0.87; GO-QOL 0.81, 0.82; GO-QLS 0.74, 0.86, 0.67). TED-QOL was significantly faster to complete (1.6 min vs GO-QOL 3.1 min, GO-QLS 2.7 min, p<0.0001) and had a higher completion rate (100% vs GO-QOL 78%, GO-QLS 94%). There was only moderate correlation between items on all three questionnaires and VISA scores. The TED-QOL is rapid and easy to complete and analyse and has similar validity and reliability to longer questionnaires. All questionnaires showed only moderate correlation with disease severity, emphasising the discrepancy between objective and subjective assessments and the importance of measuring both.
Moriguchi, Eri; Ito, Mikiko; Nagai, Toshisaburo
A Japanese version of the Quality of Life in Childhood Epilepsy Questionnaire (QOLCE-J) was developed using international guidelines as a QOL scale for childhood epilepsy; its reliability and validity were examined, focusing on Japanese pediatric epilepsy patients applicability. A pilot test questionnaire survey was conducted; involving parents of pediatric epilepsy patients aged 4-15 undergoing outpatient treatment. 278 responses were obtained and analyzed. Internal consistency for the 16 QOLCE-J subscales, except for , was sufficient, and a high overall coefficient α was obtained. The intraclass correlation coefficient was also high, supporting the test-retest reliability of this version. Associations among the subscales, high correlations of r>0.7 were observed among , , and , representing cognitive and behavioral aspects, and among these and . In contrast, correlations among others were moderate or weaker. Furthermore, correlations of r>0.35 were observed among the subscales of the SDQ (Strength and Difficulties Questionnaire) used as an external criterion and the QOLCE-J, confirming the criterion validity of the study version. Analysis of associations between the total QOLCE-J score and pathology of epilepsy, found significant correlation with age of onset and frequency of seizures, ADL, and antiepileptics side effects' symptoms. QOLCE has mostly been used in treatment resistant pediatric patients, the influence of interictal period presently observed, like antiepileptic side effects' symptoms; suggest usefulness for pediatric patients with seizures under control. The QOLCE-J with sufficient reliability and validity may be applicable as a QOL scale for Japanese children with epilepsy. Copyright © 2015 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.
Drost, Ellen A.
In this paper, the author aims to provide novice researchers with an understanding of the general problem of validity in social science research and to acquaint them with approaches to developing strong support for the validity of their research. She provides insight into these two important concepts, namely (1) validity; and (2) reliability, and…
Full Text Available Background: Quality of life (QOL improvement is the main objective of treating patients with irritable bowel syndrome (IBS. This study aimed to assess the validity, reliability, and factor analysis of IBS-QOL-34 questionnaire as a common transcultural instrument for Iranian IBS patients. Materials and Methods: Two hundred and forty patients with IBS (based on gastroenterologists′ diagnosis according to ROM III criteria were referred to Digestive Health Clinic in Psychosomatic Research Center have been selected in this study. Aside with IBS-QOL-34, MOS 36-item short-form health survey (SF-36 and IBS severity index (IBSSI questionnaires were completed by the cases for determination of correlation coefficients; the data were analyzed using descriptive statistics, factor analysis, Cronbach′s alpha, Pearson correlation coefficient by Statistical Package for Social Sciences (SPSS software, version 18. Results: Total reliability of the questionnaire was reported by using Cronbach′s alpha as 0.95, ranging from 0.65 to 0.90. Correlation coefficients of concurrent implementation of IBS-QOL with SF-36 and IBSSI resulted in −0.61 and 0.64, respectively. Exploratory factor analysis using varimax rotation identified eight principle components, which will determine QOL at 67% variance. Conclusion: According to the results, IBS-QOL-34 questionnaire has good psychometric properties in the research community and can be safely used as a valid tool to assess QOL of patients with IBS for healthcare and therapeutic purposes.
Wink, Freke; Arends, Suzanne; McKenna, Stephen P; Houtman, Pieternella M; Brouwer, Elisabeth; Spoorenberg, Anneke
Objective: The Psoriatic Arthritis Quality of Life (PsAQoL) questionnaire is a disease-specific instrument developed to measure quality of life (QoL) in patients with psoriatic arthritis (PsA). The aim of this study was to translate the measure into Dutch and to determine its psychometric
Kaasa, S.; Aaronson, N.
The EORTC Core Quality of Life questionnaire (EORTC QLQ-C30) is designed to measure cancer patients' physical, psychological and social functions. The questionnaire is composed of multi-item scales and single items. 247 patients completed the EORTC QLQ-C30 before palliative radiotherapy and 181 after palliative radiotherapy. The questionnaire was well accepted with a high completion rate in the present patient population consisting of advanced cancer patients with short life expectancy. In addition, the questionnaire was found to be useful to detect the effect of palliative radiotherapy over time. The scale reliability was excellent for all scales except the role functioning scale. Excellent criterion validity was found for the emotional functioning scale where it was correlated with GHQ-20. Performance of the questionnaire was improved after the second evaluation as compared with the first. The present study shows that the EORTC-QLQ-C30 is found to be practical and valid in measuring quality of life in patients with advanced disease. (author)
Kemerer Verne F
Full Text Available Abstract Background/Aim As Thai people living with HIV/AIDS gain increasing access to antiretroviral (ARV therapy, it is important to evaluate the impact this has not only on clinical outcomes, but also on patients' functional status and well-being. In this study, we translated, culturally adapted and tested the reliability and validity of two widely-used health-related quality of life questionnaires - the MOS-HIV Health Survey and the SF-12 - in people living with HIV/AIDS in Northern Thailand. Methods: Questionnaires were administered to 100 patients at community hospital outpatient ARV clinics in northern Thailand. Reliability was estimated using Cronbach's alpha, while evidence for validity was tested using known-groups comparison based on CD4 group, symptom distress score, bed days and days of reduced activity in the past three months. Results Patients' median age was 36, with 58% female, 58% working as laborers, and 60% completing at least primary education. Median CD4 count was 218 cells/mm3. There were no missing data. For the MOS-HIV and SF-12, mean physical summary scores were 53.1 and 49.0 respectively; mean mental summary scores were 53.4 and 45.6, respectively. Internal consistency coefficients were >0.7 for all but one scale, the PF scale (0.67. As hypothesized, scores were slightly to moderately correlated with CD4 count, symptom score, number of days in bed or with reduced activity. Correlations were higher with physical health scores than with mental health scales. The MOS-HIV discriminated clinical known groups slightly better than the SF-12. Conclusion Both the MOS-HIV and the shorter SF-12 were successfully adapted for people with HIV/AIDS in Northern Thailand, and showed encouraging evidence for reliability and validity. These patient reported questionnaires could be valuable tools in evaluating therapeutic interventions and other innovations in health and social services, and to estimate health needs and population
Full Text Available Abstract Background Health-related quality of life (HRQOL has been recognized as an important health outcome measurement for pediatric patients. One of the most promising instruments in measuring pediatric HRQOL emerged in recent years is the Pediatric Quality of Life Inventory (PedsQL™. The PedsQL™ 3.0 Asthma Module, one of the PedsQL™disease-specific scales, was designed to measure HRQOL dimensions specifically tailored for pediatric asthma. The present study is aimed to evaluate the psychometric properties of the Chinese version of the PedsQL™ 3.0 Asthma Module. Methods The PedsQL™ 3.0 Asthma Module was translated into Chinese following the PedsQL™ Measurement Model Translation Methodology. The Chinese version scale was administered to 204 children with asthma and 337 parents of children with asthma from four Triple A hospitals. The psychometric properties were then evaluated. Results The percentage of missing value for each item of the scale ranged from 0.00% to 8.31%. All child self-report subscales and parent proxy-report subscales approached or exceeded the minimum reliability standard of 0.70 for alpha coefficient, except 3 subscales of Young Child (aged 5-7 self-report (alphas ranging from 0.59 to 0.68. Test-retest reliability was satisfactory with intraclass correlation coefficients (ICCs which exceeded the recommended standard of 0.80 in all subscales. Correlation coefficients between items and their hypothesized subscales were higher than those with other subscales. The PedsQL™ 3.0 Asthma Module distinguished between outpatients and inpatients. Patients with mild asthma reported higher scores than those with moderate/severe asthma in majority of subscales. The intercorrelations among the PedsQL™ 3.0 Asthma Module subscales and the PedsQL™ 4.0 Generic Core Scales were in medium to large effect size. The child self-report scores were consistent with the parent proxy-report scores. Conclusions The Chinese version of
Feng, Lifen; Zhang, Yingfen; Chen, Ruoqing; Hao, Yuantao
Health-related quality of life (HRQOL) has been recognized as an important health outcome measurement for pediatric patients. One of the most promising instruments in measuring pediatric HRQOL emerged in recent years is the Pediatric Quality of Life Inventory (PedsQL™). The PedsQL™ 3.0 Asthma Module, one of the PedsQL™disease-specific scales, was designed to measure HRQOL dimensions specifically tailored for pediatric asthma. The present study is aimed to evaluate the psychometric properties of the Chinese version of the PedsQL™ 3.0 Asthma Module. The PedsQL™ 3.0 Asthma Module was translated into Chinese following the PedsQL™ Measurement Model Translation Methodology. The Chinese version scale was administered to 204 children with asthma and 337 parents of children with asthma from four Triple A hospitals. The psychometric properties were then evaluated. The percentage of missing value for each item of the scale ranged from 0.00% to 8.31%. All child self-report subscales and parent proxy-report subscales approached or exceeded the minimum reliability standard of 0.70 for alpha coefficient, except 3 subscales of Young Child (aged 5-7) self-report (alphas ranging from 0.59 to 0.68). Test-retest reliability was satisfactory with intraclass correlation coefficients (ICCs) which exceeded the recommended standard of 0.80 in all subscales. Correlation coefficients between items and their hypothesized subscales were higher than those with other subscales. The PedsQL™ 3.0 Asthma Module distinguished between outpatients and inpatients. Patients with mild asthma reported higher scores than those with moderate/severe asthma in majority of subscales. The intercorrelations among the PedsQL™ 3.0 Asthma Module subscales and the PedsQL™ 4.0 Generic Core Scales were in medium to large effect size. The child self-report scores were consistent with the parent proxy-report scores. The Chinese version of the PedsQL™ 3.0 Asthma Module has acceptable psychometric
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Doctor, S.R.; Deffenbaugh, J.D.; Good, M.S.; Green, E.R.; Heasler, P.G.; Hutton, P.H.; Reid, L.D.; Simonen, F.A.; Spanner, J.C.; Vo, T.V.
This paper reports on progress for three programs: (1) evaluation and improvement in nondestructive examination reliability for inservice inspection of light water reactors (LWR) (NDE Reliability Program), (2) field validation acceptance, and training for advanced NDE technology, and (3) evaluation of computer-based NDE techniques and regional support of inspection activities. The NDE Reliability Program objectives are to quantify the reliability of inservice inspection techniques for LWR primary system components through independent research and establish means for obtaining improvements in the reliability of inservice inspections. The areas of significant progress will be described concerning ASME Code activities, re-analysis of the PISC-II data, the equipment interaction matrix study, new inspection criteria, and PISC-III. The objectives of the second program are to develop field procedures for the AE and SAFT-UT techniques, perform field validation testing of these techniques, provide training in the techniques for NRC headquarters and regional staff, and work with the ASME Code for the use of these advanced technologies. The final program's objective is to evaluate the reliability and accuracy of interpretation of results from computer-based ultrasonic inservice inspection systems, and to develop guidelines for NRC staff to monitor and evaluate the effectiveness of inservice inspections conducted on nuclear power reactors. This program started in the last quarter of FY89, and the extent of the program was to prepare a work plan for presentation to and approval from a technical advisory group of NRC staff
Pigford, T.H.; Chambre, P.L.
The objective of predicting long-term performance should be to make reliable determinations of whether the prediction falls within the criteria for acceptable performance. Establishing reliable predictions of long-term performance of a waste repository requires emphasis on valid theories to predict performance. The validation process must establish the validity of the theory, the parameters used in applying the theory, the arithmetic of calculations, and the interpretation of results; but validation of such performance predictions is not possible unless there are clear criteria for acceptable performance. Validation programs should emphasize identification of the substantive issues of prediction that need to be resolved. Examples relevant to waste package performance are predicting the life of waste containers and the time distribution of container failures, establishing the criteria for defining container failure, validating theories for time-dependent waste dissolution that depend on details of the repository environment, and determining the extent of congruent dissolution of radionuclides in the UO 2 matrix of spent fuel. Prediction and validation should go hand in hand and should be done and reviewed frequently, as essential tools for the programs to design and develop repositories. 29 refs
Erhart, Michael; Rajmil, Luis; Herdman, Michael; Auquier, Pascal; Bruil, Jeanet; Power, Mick; Duer, Wolfgang; Abel, Thomas; Czemy, Ladislav; Mazur, Joanna; Czimbalmos, Agnes; Tountas, Yannis; Hagquist, Curt; Kilroe, Jean
Background To assess the criterion and construct validity of the KIDSCREEN-10 well-being and health-related quality of life (HRQoL) score, a short version of the KIDSCREEN-52 and KIDSCREEN-27 instruments. Methods The child self-report and parent report versions of the KIDSCREEN-10 were tested in a sample of 22,830 European children and adolescents aged 8–18 and their parents (n = 16,237). Correlation with the KIDSCREEN-52 and associations with other generic HRQoL measures, physical and mental health, and socioeconomic status were examined. Score differences by age, gender, and country were investigated. Results Correlations between the 10-item KIDSCREEN score and KIDSCREEN-52 scales ranged from r = 0.24 to 0.72 (r = 0.27–0.72) for the self-report version (proxy-report version). Coefficients below r = 0.5 were observed for the KIDSCREEN-52 dimensions Financial Resources and Being Bullied only. Cronbach alpha was 0.82 (0.78), test–retest reliability was ICC = 0.70 (0.67) for the self- (proxy-)report version. Correlations between other children self-completed HRQoL questionnaires and KIDSCREEN-10 ranged from r = 0.43 to r = 0.63 for the KIDSCREEN children self-report and r = 0.22–0.40 for the KIDSCREEN parent proxy report. Known group differences in HRQoL between physically/mentally healthy and ill children were observed in the KIDSCREEN-10 self and proxy scores. Associations with self-reported psychosomatic complaints were r = −0.52 (−0.36) for the KIDSCREEN-10 self-report (proxy-report). Statistically significant differences in KIDSCREEN-10 self and proxy scores were found by socioeconomic status, age, and gender. Conclusions Our results indicate that the KIDSCREEN-10 provides a valid measure of a general HRQoL factor in children and adolescents, but the instrument does not represent well most of the single dimensions of the original KIDSCREEN-52. Test–retest reliability was slightly below a priori defined thresholds. PMID:20668950
Minner, Daphne Diane
The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating
Full Text Available Abstract Background Wolfram syndrome (WFS is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS. Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease. WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age. Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93, moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91 and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, ps>.76, ps=-.86, p=.001. The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83. Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS.
Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt
physicians' OSAUS scores with diagnostic accuracy. RESULTS: The generalizability coefficient was high (0.81) and a D-study demonstrated that 1 assessor and 5 cases would result in similar reliability. The construct validity of the OSAUS scale was supported by a significant difference in the mean scores......OBJECTIVE: To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. BACKGROUND: POC US is increasingly used by clinicians and is an essential part of the management of acute surgical...... conditions. However, the quality of performance is highly operator-dependent. Therefore, reliable and valid assessment of trainees' ultrasonography competence is needed to ensure patient safety. METHODS: Twenty-four physicians, representing novices, intermediates, and experts in POC US, scanned 4 different...
Schlegel, Claudia; Woermann, Ulrich; Rethans, Jan-Joost; van der Vleuten, Cees
In the training of healthcare professionals, one of the advantages of communication training with simulated patients (SPs) is the SP's ability to provide direct feedback to students after a simulated clinical encounter. The quality of SP feedback must be monitored, especially because it is well known that feedback can have a profound effect on student performance. Due to the current lack of valid and reliable instruments to assess the quality of SP feedback, our study examined the validity and reliability of one potential instrument, the 'modified Quality of Simulated Patient Feedback Form' (mQSF). Content validity of the mQSF was assessed by inviting experts in the area of simulated clinical encounters to rate the importance of the mQSF items. Moreover, generalizability theory was used to examine the reliability of the mQSF. Our data came from videotapes of clinical encounters between six simulated patients and six students and the ensuing feedback from the SPs to the students. Ten faculty members judged the SP feedback according to the items on the mQSF. Three weeks later, this procedure was repeated with the same faculty members and recordings. All but two items of the mQSF received importance ratings of > 2.5 on a four-point rating scale. A generalizability coefficient of 0.77 was established with two judges observing one encounter. The findings for content validity and reliability with two judges suggest that the mQSF is a valid and reliable instrument to assess the quality of feedback provided by simulated patients.
Ravens-Sieberer, U.; Erhart, M.; Rajmil, L.; Herdman, M.; Auquier, P.; Bruil, J.; Power, M.; Duer, W.; Abel, T.; Czemy, L.; Mazur, J.; Czimbalmos, A.; Tountas, Y.; Hagquist, C.; Kilroe, J.
Background: To assess the criterion and construct validity of the KIDSCREEN-10 well-being and health-related quality of life (HRQoL) score, a short version of the KIDSCREEN-52 and KIDSCREEN-27 instruments. Methods: The child self-report and parent report versions of the KIDSCREEN-10 were tested in a
The Cambridge Otology Quality of Life Questionnaire: an otology-specific patient-recorded outcome measure. A paper describing the instrument design and a report of preliminary reliability and validity.
Martin, T P C; Moualed, D; Paul, A; Ronan, N; Tysome, J R; Donnelly, N P; Cook, R; Axon, P R
The Cambridge Otology Quality of Life Questionnaire (COQOL) is a patient-recorded outcome measurement (PROM) designed to quantify the quality of life of patients attending otology clinics. Item-reduction model. A systematically designed long-form version (74 items) was tested with patient focus groups before being presented to adult otology patients (n. 137). Preliminary item analysis tested reliability, reducing the COQOL to 24 questions. This was then presented in conjunction with the SF-36 (V1) questionnaire to a total of 203 patients. Subsequently, these were re-presented at T + 3 months, and patients recorded whether they felt their condition had improved, deteriorated or remained the same. Non-responders were contacted by post. A correlation between COQOL scores and patient perception of change was examined to analyse content validity. Teaching hospital and university psychology department. Adult patients attending otology clinics with a wide range of otological conditions. Item reliability measured by item–total correlation, internal consistency and test– retest reliability. Validity measured by correlation between COQOL scores and patient-reported symptom change. Reliability: the COQOL showed excellent internal consistency at both initial presentation (a = 0.90) and 3 months later (a = 0.93). Validity: One-way analysis of variance showed a significant difference between groups reporting change and those reporting no change in quality of life (F(2, 80) = 5.866, P < 0.01). The COQOL is the first otology-specific PROM. Initial studies demonstrate excellent reliability and encouraging preliminary criterion validity: further studies will allow a deeper validation of the instrument.
Buri, Hilary M; Daly, Jeanette M; Jogerst, Gerald J
(a) To identify reliable and valid questions that identify elder abuse, (b) to assess the reliability and validity of extant self-reported elder abuse screens in a high-risk elderly population, and (c) to describe difficulties of completing and interpreting screens in a high-need elderly population. All elders referred to research-trained social workers in a community service agency were asked to participate. Of the 70 elders asked, 49 participated, 44 completed the first questionnaire, and 32 completed the duplicate second questionnaire. A research assistant administered the telephone questionnaires. Twenty-nine (42%) persons were judged abused, 12 (17%) had abuse reported, and 4 (6%) had abuse substantiated. The elder abuse screen instruments were not found to be predictive of assessed abuse or as predictors of reported abuse; the measures tended toward being inversely predictive. Two questions regarding harm and taking of belongings were significantly different for the assessed abused group. In this small group of high-need community-dwelling elders, the screens were not effective in discriminating between abused and nonabused groups. Better instruments are needed to assess for elder abuse.
Health related quality of life in inflammatory bowel disease is influenced both by disease activity as well as by the psychosocial characteristics of the individual patient. The Short Health Scale (SHS) is a four-part visual analogue scale questionnaire using open-ended questions that are designed to assess the impact of inflammatory bowel disease on a health related quality of life. The four dimensions include bowel symptoms, activities of daily life, worry and general wellbeing. It has previously been validated in Swedish and Norwegian speaking patients.
Goodwin, Laura D.; Goodwin, William L.
The views of prominant qualitative methodologists on the appropriateness of validity and reliability estimation for the measurement strategies employed in qualitative evaluations are summarized. A case is made for the relevance of validity and reliability estimation. Definitions of validity and reliability for qualitative measurement are presented…
Tarakci, E; Baydogan, S N; Kasapcopur, O; Dirican, A
The aim of this study was to describe the cultural adaptation, validity, and reliability of a Turkish version of the pediatric quality-of-life inventory (PedsQL) 3.0 Arthritis Module in a population with juvenile idiopathic arthritis (JIA). A total of 169 patients with JIA and their parents were enrolled in the study. The Turkish version of the childhood health assessment questionnaire (CHAQ) was used to evaluate the validity of related domains in the PedsQL 3.0 Arthritis Module. Both the PedsQL 3.0 Arthritis Module and CHAQ were filled out by children over 8 years of age and by the parents of children 2-7 years of age. Internal reliability was poor to excellent (Cronbach's alpha coefficients 0.56-0.84 for self-reporting and 0.63-0.82 for parent reporting), and interobserver reliability varied from good to excellent (intraclass correlation coefficient (ICC) 0.79-0.91 for self-reporting and 0.80-0.88 for parent reporting) for the total scores of the PedsQL 3.0 Arthritis Module. Parent-child concordance for all scores was moderate to excellent (ICC 0.42-0.92). The PedsQL 3.0 Arthritis Module and CHAQ were highly positively correlated, with coefficients from 0.21 to 0.76, indicating concurrent validity. We demonstrated the reliability and validity of quality-of-life measurement using the Turkish version of the PedsQL 3.0 Arthritis Module in our sociocultural context. The PedsQL 3.0 Arthritis Module can be utilized as a tool for the evaluation of quality of life in patients with JIA aged 2-18 years.
Hill, C.; Robinson, L.
Mammographers currently score their own images according to criteria set out by Regional Quality Assurance. The criteria used are based on the ‘Perfect, Good, Moderate, Inadequate’ (PGMI) marking criteria established by the National Health Service Breast Screening Programme (NHSBSP) in their Quality Assurance Guidelines of 2006 1 . This document discusses the validity and reliability of the current mammography image assessment scheme. Commencing with a critical review of the literature this document sets out to highlight problems with the national approach to the use of marking schemes. The findings suggest that ‘PGMI’ scheme is flawed in terms of reliability and validity and is not universally applied across the UK. There also appear to be differences in schemes used by trainees and qualified mammographers. Initial recommendations are to be made in collaboration with colleagues within the National Health Service Breast Screening Programme (NHSBSP), Higher Education Centres, College of Radiographers and the Royal College of Radiologists in order to identify a mammography image appraisal scheme that is fit for purpose. - Highlights: • Currently no robust evidence based marking tools in use for the assessment of images in mammography. • Is current system valid, reliable and robust? • How can the current image assessment tool be improved? • Should students and qualified mammographers use the same tool? • What marking criteria are available for image assessment?
Linde, Louise; Sørensen, Jan; Ostergaard, Mikkel
.21-6.47). The longitudinal sample included 80% women, median age 60 years (22-82). Validity: all instruments discriminated between low, moderate, and high DAS28. Reliability: RAQoL and HAQ displayed good repeatability (ICC > 0.95) and internal consistency (Cronbach's alpha > 0.90). Responsiveness: SF-36 bodily pain scale......, 15D, Rheumatoid Arthritis Quality of Life Scale (RAQoL), Health Assessment Questionnaire (HAQ), and visual analog scales (VAS) for pain, fatigue, and global RA. Validity (convergent, discriminant, and known-groups) was evaluated in a cross-section of 200 patients. Reliability was evaluated...... questionnaires (at 2 weeks and 6 months) included questions about changes in health status since baseline. RESULTS: The cross-sectional sample included 77% women, median age 57 years (range 19-87), disease duration 6 years (0-58), with Disease Activity Score 28-joint count (DAS28) of 3.10 (1...
Post, Marcel W
Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability
This book presents the state-of-the-art in quality and reliability engineering from a product life cycle standpoint. Topics in reliability include reliability models, life data analysis and modeling, design for reliability and accelerated life testing, while topics in quality include design for quality, acceptance sampling and supplier selection, statistical process control, production tests such as screening and burn-in, warranty and maintenance. The book provides comprehensive insights into two closely related subjects, and includes a wealth of examples and problems to enhance reader comprehension and link theory and practice. All numerical examples can be easily solved using Microsoft Excel. The book is intended for senior undergraduate and post-graduate students in related engineering and management programs such as mechanical engineering, manufacturing engineering, industrial engineering and engineering management programs, as well as for researchers and engineers in the quality and reliability fields. D...
Taking as an example electronic assemblies in printed circuit board engineering, quality control during manufacture is dealt with. After giving a survey of four phases of quality and reliability control, some specific methods of quality control are dealt with by means of a flowchart, and by some examples the necessity and the success of these measures are shown. (RW) [de
Integrating development processes, policies, and reliability predictions from the beginning of the product development lifecycle to ensure high levels of product performance and safety, this book helps companies overcome the challenges posed by increasingly complex systems in today’s competitive marketplace. Examining both research on and practical aspects of product quality and reliability management with an emphasis on applications, the book features contributions written by active researchers and/or experienced practitioners in the field, so as to effectively bridge the gap between theory and practice and address new research challenges in reliability and quality management in practice. Postgraduates, researchers and practitioners in the areas of reliability engineering and management, amongst others, will find the book to offer a state-of-the-art survey of quality and reliability management and practices.
Sorenson, Shawn C.; Romano, Russell; Scholefield, Robin M.; Schroeder, E. Todd; Azen, Stanley P.; Salem, George J.
Context Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. Objective To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Design Descriptive laboratory study. Setting A large National Collegiate Athletic Association Division I university. Patients or Other Participants A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Intervention(s) Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Main Outcome Measure(s) Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Results Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent
Haradhan Kumar Mohajan
Full Text Available Reliability and validity are two most important and fundamental features in the evaluation of any measurement instrument or toll for a good research. The purpose of this research is to discuss the validity and reliability of measurement instruments that are used in research. Validity concerns what an instrument measures, and how well it does so. Reliability concerns the faith that one can have in the data obtained from use of an instrument, that is, the degree to which any measuring tool controls for random error. An attempt has been taken here to review the reliability and validity, and threat to them in some details.
Reliability and Concurrent Validity of the International Personality item Pool (IPIP) Big-five Factor Markers in Nigeria. ... Nigerian Journal of Psychiatry ... Aims: The aim of this study was to assess the internal consistency and concurrent validity ...
Kaartina, Sanker; Chin, Yit Siew; Fara Wahida, Rezali; Woon, Fui Chee; Hiew, Chu Chien; Zalilah, Mohd Shariff; Mohd Nasir, Mohd Taib
The Pediatric Quality of Life Inventory Generic Core Scales (PedsQL) 4.0 is a generalized assessment of health-related quality of life (HRQoL) based on adolescent self-report and parent proxy-report. This study aims to determine the construct validity and reliability of PedsQL 4.0 among a sample of Malaysian adolescents and parents. A cross-sectional study was carried out at three selected public schools in the state of Selangor. A total of 379 Malaysian adolescents completed the PedsQL 4.0 adolescent self-report and 218 (55.9%) parents completed the PedsQL 4.0 parent proxy-report. Weight and height of adolescents were measured and BMI-for-age by sex was used to determine their body weight status. There were 50.8% male and 49.2% female adolescents who participated in this study (14.25 ± 1.23 years). The prevalence of overweight and obesity (25.8%) was four times higher than the prevalence of severe thinness and thinness (6.1%). Construct validity was analyzed using Confirmatory Factor Analysis (CFA). Based on CFA, adolescent self-report and parent proxy-report met the criteria of convergent validity (factor loading > 0.5, Average Variance Extracted (AVE) > 0.5, Construct Reliability > 0.7) and showed good fit to the data. The adolescent self-report and parent proxy-report exhibited discriminant validity as the AVE values were larger than the R(2) values. Cronbach's alpha coefficients of the adolescent self-report (α = 0.862) and parent proxy-report (α = 0.922) showed these instruments are reliable. Parents perceived the HRQoL of adolescents was poorer compared to the perception of the adolescent themselves (t = 5.92, p 0.05). Parent proxy-report was negatively associated with the adolescents' BMI-for-age (r = -0.152, p 0.05). Adolescent self-report and parent proxy-report of the PedsQL 4.0 are valid and reliable to assess HRQoL of Malaysian adolescents. Future studies are recommended to use both adolescent self-report and parent-proxy report of HRQoL as
Barrett, Eva; McCreesh, Karen; Lewis, Jeremy
A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.
Badia, X; Mascaró, J M; Lozano, R
The aim of this study was to assess the feasibility, validity, reliability and sensitivity to change of a Spanish version of the Dermatology Life Quality Index (DLQI) in patients with mild to moderate eczema and psoriasis who were treated with topical corticosteroids. The final study sample comprised 237 patients (48% eczema). Discriminant validity was tested by comparing patients' scores with those of a random sample of the general population (n = 100), and convergent validity by analysing correlations between DLQI scores, measures of clinical severity, and domain scores on the Nottingham Health Profile (NHP). Internal consistency and test-retest reliability were tested in clinically stable patients (n = 94), and responsiveness in a clinically unstable group (n = 143) initiating treatment with topical corticosteroids. Patient scores were significantly higher than general population scores (4.3 vs. 0. 27, P effect size for the total group of de novo patients = 0.70), though the great majority of changes occurred in items 1 and 2. The NHP Emotional Reactions and Mobility domains were more responsive than some DLQI domains. In clinical trials of treatments for mild to moderate eczema and psoriasis, it is likely that only items 1 and 2 of the DLQI will be needed, and it is probably advisable to include generic instruments alongside the DLQI.
Stockbrugger, Barry A.; Haennel, Robert G.
Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…
Alkhamra, Rana A.; Al-Jazi, Aya B.
Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…
Badjadi, Nour El Imane
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Osadebe, P. U.
The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Measuring health-related quality of life in children with cancer living in mainland China: feasibility, reliability and validity of the Chinese mandarin version of PedsQL 4.0 Generic Core Scales and 3.0 Cancer Module
Full Text Available Abstract Background The Pediatric Quality of Life Inventory (PedsQL is widely used instrument to measure pediatric health-related quality of life (HRQOL for children aged 2 to 18 years. The purpose of the current study was to investigate the feasibility, reliability and validity of the Chinese mandarin version of the PedsQL 4.0 Generic Core Scales and 3.0 Cancer Module in a group of Chinese children with cancer. Methods The PedsQL 4.0 Genetic Core Scales and the PedsQL 3.0 Cancer Module were administered to children with cancer (aged 5-18 years and parents of such children (aged 2-18 years. For comparison, a survey on a demographically group-matched sample of the general population with children (aged 5-18 and parents of children (aged 2-18 years was conducted with the PedsQL 4.0 Genetic Core Scales. Result The minimal mean percentage of missing item responses (except the School Functioning scale supported the feasibility of the PedsQL 4.0 Generic Core Scales and 3.0 Cancer Module for Chinese children with cancer. Most of the scales showed satisfactory reliability with Cronbach's α of exceeding 0.70, and all scales demonstrated sufficient test-retest reliability. Assessing the clinical validity of the questionnaires, statistically significant difference was found between healthy children and children with cancer, and between children on-treatment versus off-treatment ≥12 months. Positive significant correlations were observed between the scores of the PedsQL 4.0 Generic Core Scale and the PedsQL 3.0 Cancer Module. Exploratory factor analysis demonstrated sufficient factorial validity. Moderate to good agreement was found between child self- and parent proxy-reports. Conclusion The findings support the feasibility, reliability and validity of the Chinese Mandarin version of PedsQL 4.0 Generic Core Scales and 3.0 Cancer Module in children with cancer living in mainland China.
CIE 2015 August 2-5, 2015, Boston, Massachusetts, USA [DRAFT] DETC2015-46982 DEVELOPMENT OF A CONSERVATIVE MODEL VALIDATION APPROACH FOR RELIABLE...obtain a conservative simulation model for reliable design even with limited experimental data. Very little research has taken into account the...3, the proposed conservative model validation is briefly compared to the conventional model validation approach. Section 4 describes how to account
McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio
We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807
Full Text Available Both qualitative and quantitative paradigms try to find the same result; the truth. Qualitative studies are tools used in understanding and describing the world of human experience. Since we maintain our humanity throughout the research process, it is largely impossible to escape the subjective experience, even for the most experienced of researchers. Reliability and Validity are the issue that has been described in great deal by advocates of quantitative researchers. The validity and the norms of rigor that are applied to quantitative research are not entirely applicable to qualitative research. Validity in qualitative research means the extent to which the data is plausible, credible and trustworthy; and thus can be defended when challenged. Reliability and validity remain appropriate concepts for attaining rigor in qualitative research. Qualitative researchers have to salvage responsibility for reliability and validity by implementing verification strategies integral and self-correcting during the conduct of inquiry itself. This ensures the attainment of rigor using strategies inherent within each qualitative design, and moves the responsibility for incorporating and maintaining reliability and validity from external reviewers’ judgments to the investigators themselves. There have different opinions on validity with some suggesting that the concepts of validity is incompatible with qualitative research and should be abandoned while others argue efforts should be made to ensure validity so as to lend credibility to the results. This paper is an attempt to clarify the meaning and use of reliability and validity in the qualitative research paradigm.
Escobar, Antonio; Trujillo-Martín, Maria del Mar; Rueda, Antonio; Pérez-Ruiz, Elisabeth; Avis, Nancy E; Bilbao, Amaia
The aim of this study was to validate the Quality of Life in Adult Cancer Survivors (QLACS) in short-term Spanish cancer survivor's patients. Patients with breast, colorectal or prostate cancer that had finished their initial cancer treatment 3 years before the beginning of this study completed QLACS, WHOQOL, Short Form-36, Hospital Anxiety and Depression Scale, EORTC-QLQ-BR23 and EQ-5D. Cultural adaptation was made based on established guidelines. Reliability was evaluated using internal consistency and test-retest. Convergent validity was studied by mean of Pearson's correlation coefficient. Structural validity was determined by a second-order confirmatory factor analysis (CFA) and Rasch analysis was used to assess the unidimensionality of the Generic and Cancer-specific scales. Cronbach's alpha were above 0.7 in all domains and summary scales. Test-retest coefficients were 0.88 for Generic and 0.82 for Cancer-specific summary scales. QLACS generic summary scale was correlated with other generic criterion measures, SF-36 MCS (r = - 0.74) and EQ-VAS (r = - 0.63). QLACS cancer-specific scale had lower values with the same constructs. CFA provided satisfactory fit indices in all cases. The RMSEA value was 0.061 and CFI and TLI values were 0.929 and 0.925, respectively. All factor loadings were higher than 0.40 and statistically significant (P validity and reliability of QLACS questionnaire to be used in short-term cancer survivors.
Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville
With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is
Garcia-Rea, Elizabeth A.; LePage, James P.
With the high number of homeless, there is a critical need for rapid and accurate assessment of quality of life to assess program outcomes. The World Health Organization's WHOQOL-100 has demonstrated promise in accurately assessing quality-of-life in this population. However, its length may make large scale use impractical for working with a…
Uysal, Hilal; Ozcan, Şeyda
Many new measuring devices have been developed so that broader psychometric measurements in the coronary artery disease, disease-specific health status measurements, and identification of the broader quality of life can be performed in the recent years. The study was intended to determine whether, and to what extent, MIDAS is a valid and reliable measurement to the patients suffering from myocardial infarction for the first time in Turkey. The research was conducted with the patients hospitalized and treated with myocardial infarction in the cardiology departments of 2 hospitals in Istanbul, Turkey, between 2007 and 2008. Psychometric evaluations of TR-MIDAS were used for validity studies; language validity, content validity, construct validity were examined. For reliability studies; the tool's internal consistency reliability, Cronbach's alpha reliability coefficient, and test-retest reliability were completed. The instrument's content validity index was determined to be "0.95". Principal component analysis revealed six factors with an eigenvalue >1.5. Cronbach's alpha was found to be 0.89 for total scale which was an acceptable value. The total's test-retest reliability was 0.51 (p<0.01). Data obtained at the end of the study supports that Turkish Myocardial Infarction Dimensional Assessment Scale is a valid and reliable instrument as a disease-specific scale to assess the patients' quality of life suffering from myocardial infarction in Turkey. Copyright © 2010 European Society of Cardiology. Published by Elsevier B.V. All rights reserved.
Admiraal, W.; Hoeksma, M.; van de Kamp, M.-T.; van Duin, G.
The richness and complexity of video portfolios endanger both the reliability and validity of the assessment of teacher competencies. In a post-graduate teacher education program, the assessment of video portfolios was evaluated for its reliability, construct validity, and consequential validity.
Full Text Available Objectives: The aim of this study was to evaluate the psychometric features of the Persian version of the Autism Behavior Checklist (ABC. Method:The International Quality of Life Assessment (IQOLA approach was used to translate the English ABC into Persian. A total sample of 184 parents of children including 114 children with autism disorder (mean age =7.21, SD =1.65 and 70 typically developing children (mean age = 6.82, SD =1.75 completed the ABC. Internal consistency, test-retest reliability, concurrent and discriminant validity, and cut-off score were assessed. Results: The results of this study revealed that the Persian version of the ABC has an acceptable degree of internal consistency (.73. Test–retest comparisons using interclass correlation confirmed the instrument’s time stability (.83. The instrument’s concurrent validity with Gilliam Autism Rating Scale (GARS was verified; the correlation between total scores was .94. In the discriminant validity, the autism group had significantly higher scores compared to the normal group. Receiver Operating Characteristic (ROC analysis revealed that individuals with total scores below 25 are less likely to be in the autism group. Conclusion:The Persian version of the ABC can be used as an initial screening tool in clinical contexts.
Akmaz, Hazel Ekin; Uyar, Meltem; Kuzeyli Yıldırım, Yasemin; Akın Korhan, Esra
Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment is done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. To validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. Methodological and cross sectional study. A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-test reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis and to examine factor structure for construct concept validity. The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance of chronic pain.
Hazel Ekin Akmaz
Full Text Available Background: Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment is done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. Aims: To validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. Study Design: Methodological and cross sectional study. Methods: A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-test reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis and to examine factor structure for construct concept validity. Results: The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. Conclusion: The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance
Robbins, Mandy; Francis, Leslie J; Bradford, Amanda
A sample of 16 male and 30 female undergraduates completed the Greer and Francis Scale of Rejection of Christianity. The data support the internal consistency reliability and construct validity of the scale for this sample.
Corty, E W; Althof, S E; Kurit, D M
The present study assessed the reliability and validity of a measure of sexual functioning, the CMSH-SFQ, for male patients and their partners. The CMSH-SFQ measures erectile and orgasmic functioning, sexual drive, frequency of sexual behavior, and sexual satisfaction. Test-retest reliability was assessed with 19 males and 19 females for the baseline CMSH-SFQ. Criterion validity was measured by comparing the answers of 25 male patients to those of their partners at baseline and follow-up. The majority of items had acceptable levels of reliability and validity. The CMSH-SFQ provides a reliable and valid device that can be used to measure global sexual functioning in men and their partners and may be used to evaluate the efficacy of treatments for sexual dysfunctions. Limitations and suggestions for use of the CMSH-SFQ are addressed.
McDonald, Ann E; Vigen, Cheryl
This study examined the ability of a two-part self-report instrument, the McDonald Play Inventory, to reliably and validly measure the play activities and play styles of 7- to 11-yr-old children and to discriminate between the play of neurotypical children and children with known learning and developmental disabilities. A total of 124 children ages 7-11 recruited from a sample of convenience and a subsample of 17 parents participated in this study. Reliability estimates yielded moderate correlations for internal consistency, total test intercorrelations, and test-retest reliability. Validity estimates were established for content and construct validity. The results suggest that a self-report instrument yields reliable and valid measures of a child's perceived play performance and discriminates between the play of children with and without disabilities. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Previous research funded by Florida Department of Transportation (FDOT) developed a method for estimating : travel time reliability for arterials. This method was not initially implemented or validated using field data. This : project evaluated and r...
Serafin, Ana Gil
This study sought to design and test a survey instrument which examined college faculty satisfaction with their roles of teaching, research, and service. A panel of experts reviewed the Spanish and English versions of the 39 item survey for quality of items and grammatical accuracy. Thirty randomly selected faculty members from a population of 234…
Bele, Irene Velsvik
This study focuses on speaking voice quality in male teachers (n = 35) and male actors (n = 36), who represent untrained and trained voice users, because we wanted to investigate normal and supranormal voices. In this study, both substantial and methodologic aspects were considered. It includes a method for perceptual voice evaluation, and a basic issue was rater reliability. A listening group of 10 listeners, 7 experienced speech-language therapists, and 3 speech-language therapist students evaluated the voices by 15 vocal characteristics using VA scales. Two sets of voice signals were investigated: text reading (2 loudness levels) and sustained vowel (3 levels). The results indicated a high interrater reliability for most perceptual characteristics. Connected speech was evaluated more reliably, especially at the normal level, but both types of voice signals were evaluated reliably, although the reliability for connected speech was somewhat higher than for vowels. Experienced listeners tended to be more consistent in their ratings than did the student raters. Some vocal characteristics achieved acceptable reliability even with a smaller panel of listeners. The perceptual characteristics grouped in 4 factors reflected perceptual dimensions.
Stice, Eric; Fisher, Melissa; Martinez, Erin
The authors conducted 4 studies investigating the reliability and validity of the Eating Disorder Diagnostic Scale (HDDS; E. Stice, C. F. Telch, & S. L. Rizvi, 2000), a brief self-report measure for diagnosing anorexia nervosa, bulimia nervosa, and binge eating disorder. Study 1 found that the HDDS showed criterion validity with interview-based…
Due, Ulla; Ottesen, Marianne
Objective. To revise, validate and test for reliability an anal sphincter rupture questionnaire in relation to construct, content and face validity. Setting and background. Since 1996 women with anal sphincter rupture (ASR) at one of the public university hospitals in Copenhagen, Denmark have bee...
Kara, Kerime C; Çıtak Karakaya, İlkim; Tunalı, Nur; Karakaya, Mehmet G
The aim of this study was to investigate the reliability and validity of the Turkish version of the Incontinence Quiz, which was developed by Branch et al. (1994), to assess women's knowledge of and attitudes toward urinary incontinence. Comprehensibility of the Turkish version of the 14-item Incontinence Quiz, which was prepared following translation-back translation procedures, was tested on a pilot group of eight women, and its internal reliability, test-retest reliability and construct validity were assessed in 150 women who attended the gynecology clinics of three hospitals in İçel, Turkey. Physical and sociodemographic characteristics and presence of incontinence complaints were also recorded. Data were analyzed at the 0.05 alpha level, using SPSS version 22. The scale had good reliability and validity. The internal reliability coefficient (Cronbach α) was 0.80, test-retest correlation coefficients were 0.83-0.94; and with regard to construct validity, Kaiser-Meyer-Olkin coefficient was 0.76 and Barlett sphericity test was 562.777 (P = 0.000). Turkish version of the Incontinence Quiz had a four-factor structure, with Eigenvalues ranging from 1.17 to 4.08. The Incontinence Quiz-Turkish version is a highly comprehensible, reliable and valid scale, which may be used to assess Turkish-speaking women's knowledge of and attitudes toward urinary incontinence. © 2017 Japan Society of Obstetrics and Gynecology.
Lund, Rikke; Nielsen, Lene Snabe; Henriksen, Pia Wichmann
OBJECTIVE: The aim of the present article is to describe the face and content validity as well as reliability of the Copenhagen Social Relations Questionnaire (CSRQ). METHOD: The face and content validity test was based on focus group discussions and individual interviews with 31 informants...... from the interviews. Two additional themes not covered by CSRQ on dynamics and reciprocity of social relations were identified. DISCUSSION: CSRQ holds satisfactory face and content validity as well as reliability, and is suitable for measuring structure and function of social relations including...
Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C
Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Due, Ulla; Ottesen, Marianne
Objective. To revise, validate and test for reliability an anal sphincter rupture questionnaire in relation to construct, content and face validity. Setting and background. Since 1996 women with anal sphincter rupture (ASR) at one of the public university hospitals in Copenhagen, Denmark have been...... main questions but one. Two questions needed further explanation. Seven women made minor errors. Conclusion. The validated Danish questionnaire has a good construct, content and face validity. It is a well accepted, reliable, simple and clinically relevant screening tool. It reveals physical problems...... offered pelvic floor muscle examination and instruction by a specialist physiotherapist. In relation to that, a non-validated questionnaire about anal and urinary incontinence was to be answered six months after childbirth. Method. The original questionnaire was revised and a pilot test was performed...
Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K
Purpose To evaluate validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise to investigate other types of validity, such as construct validity, in further research.
So, Hyang Sook; Chae, Myeong Jeong; Kim, Hye Young
In this study the reliability and validity of the Korean version of the Cancer Stigma Scale (KCSS) was evaluated. The KCSS was formed through translation and modification of Cataldo Lung Cancer Stigma Scale. The KCSS, Psychological Symptom Inventory (PSI), and European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 30 (EORTC QLQ-C30) were administered to 247 men and women diagnosed with one of the five major cancers. Construct validity, item convergent and discriminant validity, concurrent validity, known-group validity, and internal consistency reliability of the KCSS were evaluated. Exploratory factor analysis supported the construct validity with a six-factor solution; that explained 65.7% of the total variance. The six-factor model was validated by confirmatory factor analysis (Q (χ²/df)= 2.28, GFI=.84, AGFI=.81, NFI=.80, TLI=.86, RMR=.03, and RMSEA=.07). Concurrent validity was demonstrated with the QLQ-C30 (global: r=-.44; functional: r=-.19; symptom: r=.42). The KCSS had known-group validity. Cronbach's alpha coefficient for the 24 items was .89. The results of this study suggest that the 24-item KCSS has relatively acceptable reliability and validity and can be used in clinical research to assess cancer stigma and its impacts on health-related quality of life in Korean cancer patients. © 2017 Korean Society of Nursing Science
Konge, Lars; Lehnert, Per; Hansen, Henrik Jessen
BACKGROUND: As we move toward competency-based education in medicine, we have lagged in developing competency-based evaluation methods. In the era of minimally invasive surgery, there is a need for a reliable and valid tool dedicated to measure competence in video-assisted thoracoscopic surgery....... The purpose of this study is to create such an assessment tool, and to explore its reliability and validity. METHODS: An expert group of physicians created an assessment tool consisting of 10 items rated on a five-point rating scale. The following factors were included: economy and confidence of movement...
Full Text Available Objective: Discomfort Intolerance Scale was developed by Norman B. Schmidt et al. to assess the individual differences of capacity to withstand physical perturbations or uncomfortable bodily states (2006. The aim of this study is to investigate the validity and reliability of Discomfort Intolerance Scale-Turkish Version (RDÖ. Method: From two different universities, total of 225 students (male=167, female=58 were participated in this study. In order to determine the criterion validity, Beck Anxiety Inventory (BAI and State-Trait Anxiety Inventory (STAI were used. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability the scale was re-applied to 54 participants 6 weeks later. Results: To assess construct validity of DIS, factor analyses were performed using varimax principal components analysis with varimax rotation. The factor analysis resulted in two factors named “discomfort (in tolerance” and “discomfort avoidance”. The Cronbach’s alpha coefficient for the entire scale, discomfort-(intolerance subscale, discomfortavoidance subscale were, .592, .670, .600 respectively. Correlations between two factors of DIS, discomfort intolerance and discomfort avoidance, and Trait Anxiety Inventory of STAI (State-Trait Anxiety Inventory were statistically significant at the level of 0.05. Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DIS had a satisfactory level of reliability and validity in Turkish university students.
John, A. C.
The aim of the study was to examine the importance of reliability and validity as necessary foundation for fair assessment. The concepts of reliability, validity, fair assessment and their relationships were analysed. Qualities of fair assessment were discussed. A number of recommendations were made to make assessors be more cautious in award of…
Carlsen, C G; Lindorff Larsen, Karen; Funch-Jensen, P
PURPOSE: Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity...... of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. METHODS: Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia...... a significant difference between the three groups which indicates construct validity, p skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment...
Garnacho-Castaño, Manuel V.; López-Lastra, Silvia; Maté-Muñoz, José L.
The objectives of the study were to determine the validity and reliability of peak velocity (PV), average velocity (AV), peak power (PP) and average power (AP) measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain) during two resistance exercises, bench press (BP) and full back squat (BS), performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2). Intraclass correlation coefficients (ICCs) indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W). Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W). Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP) make this device a useful tool for monitoring resistance training. Key points This study determined the validity and reliability of peak velocity, average velocity, peak power and average power measurements made using a linear position transducer The Tendo Weight-lifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and power. PMID:25729300
Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa
Interventions to counter the influence of television food advertising amongst children are important. Thus, reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Monbaliu, E; Ortibus, E; Roelens, F; Desloovere, K; Deklerck, J; Prinzie, P; de Cock, P; Feys, H
This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System and six patients were classified at level V. Reliability was measured by (1) intraclass correlation coefficient (ICC) for interrater reliability, (2) standard error of measurement (SEM) and smallest detectable difference (SDD), and (3) Cronbach's alpha for internal consistency. Validity was assessed by Pearson's correlations among the three scales used and by content analysis. Moderate to good interrater reliability was found for total scores of the three scales (ICC: BADS=0.87; BFMMS=0.86; UDRS=0.79). However, many subitems showed low reliability, in particular for the UDRS. SEM and SDD were respectively 6.36% and 17.72% for the BADS, 9.88% and 27.39% for the BFMMS, and 8.89% and 24.63% for the UDRS. High internal consistency was found. Pearson's correlations were high. Content validity showed insufficient accordance with the new CP definition and classification. Our results support the internal consistency and concurrent validity of the scales; however, taking into consideration the limitations in reliability, including the large SDD values and the content validity, further research on methods of assessment of dystonia is warranted.
O’CONNOR, MELISSA; DAVITT, JOAN K.
The Outcome and Assessment Information Set (OASIS) is the patient-specific, standardized assessment used in Medicare home health care to plan care, determine reimbursement, and measure quality. Since its inception in 1999, there has been debate over the reliability and validity of the OASIS as a research tool and outcome measure. A systematic literature review of English-language articles identified 12 studies published in the last 10 years examining the validity and reliability of the OASIS. Empirical findings indicate the validity and reliability of the OASIS range from low to moderate but vary depending on the item studied. Limitations in the existing research include: nonrepresentative samples; inconsistencies in methods used, items tested, measurement, and statistical procedures; and the changes to the OASIS itself over time. The inconsistencies suggest that these results are tentative at best; additional research is needed to confirm the value of the OASIS for measuring patient outcomes, research, and quality improvement. PMID:23216513
Frank, Guido K W; Favaro, Angela; Marsh, Rachel; Ehrlich, Stefan; Lawson, Elizabeth A
Human brain imaging can help improve our understanding of mechanisms underlying brain function and how they drive behavior in health and disease. Such knowledge may eventually help us to devise better treatments for psychiatric disorders. However, the brain imaging literature in psychiatry and especially eating disorders has been inconsistent, and studies are often difficult to replicate. The extent or severity of extremes of eating and state of illness, which are often associated with differences in, for instance hormonal status, comorbidity, and medication use, commonly differ between studies and likely add to variation across study results. Those effects are in addition to the well-described problems arising from differences in task designs, data quality control procedures, image data preprocessing and analysis or statistical thresholds applied across studies. Which of those factors are most relevant to improve reproducibility is still a question for debate and further research. Here we propose guidelines for brain imaging research in eating disorders to acquire valid results that are more reliable and clinically useful. © 2018 Wiley Periodicals, Inc.
Marbach, G.; Beche, M.; Pajot, J.
The excellent behavior of PHENIX driver fuel and the burnup values currently reached suggest that the first SUPERPHENIX fuel load will meet the design lifetime. However, to ensure the reliability of the entire load, all the parameters affecting fuel behavior in reactor must be analyzed. For that purpose, we have taken into account all the results of the examination and verifications during the fabrication process of the first load subassemblies. These data concern geometrical parameters or oxide composition as well as the cladding tube and plug weld soundness tests. The objective is to determine the actual dispersion of all the parameters to ensure the absence of failure due to fabrication defects with very high statistical confidence limits. The influence of all the parameters has been investigated for the situations which can occur during power-up, steady-state operation and transients. The fabrication quality allows us to demonstrate that in all cases good behavior criteria for fuel and structure will be maintained. This demonstration is based on calculation code results as well as on validation by specific experiments
Hadley, Wendy; Stewart, Angela; Hunter, Heather L; Affleck, Katelyn; Donenberg, Geri; Diclemente, Ralph; Brown, Larry K
We evaluated the reliability and validity of the Dyadic Observed Communication Scale (DOCS) coding scheme, which was developed to capture a range of communication components between parents and adolescents. Adolescents and their caregivers were recruited from mental health facilities for participation in a large, multi-site family-based HIV prevention intervention study. Seventy-one dyads were randomly selected from the larger study sample and coded using the DOCS at baseline. Preliminary validity and reliability of the DOCS was examined using various methods, such as comparing results to self-report measures and examining interrater reliability. Results suggest that the DOCS is a reliable and valid measure of observed communication among parent-adolescent dyads that captures both verbal and nonverbal communication behaviors that are typical intervention targets. The DOCS is a viable coding scheme for use by researchers and clinicians examining parent-adolescent communication. Coders can be trained to reliably capture individual and dyadic components of communication for parents and adolescents and this complex information can be obtained relatively quickly.
Ahmed, Hussam; Chateauneuf, Alaa
The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces
Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara
This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Hop, M.; Moues, C.; Bogomolova, K.; Nieuwenhuis, M.; Oen, I.; Middelkoop, E.; Breederveld, R.; de Baar, M.
Objective: The aim of this study was to examine the reliability and validity of using photographs of burns to assess both burn size and depth. Method: Fifty randomly selected photographs taken on day 0-1 post burn were assessed by seven burn experts and eight referring physicians. Inter-rater
Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel
The purpose of this study was to investigate validity and reliability of the test of creative imagination. This study was conducted with the participation of 1000 children, aged between 9-14 and were studying in six primary schools in the city center of Denizli Province, chosen by cluster ratio sampling. In the study, it was revealed that the…
Young, Daniel Kim-Wan; Ng, Petrus Y. N.; Pan, Jia-Yan; Cheng, Daphne
Purpose: This study aims to translate and test the reliability and validity of the Internalized Stigma of Mental Illness-Cantonese (ISMI-C). Methods: The original English version of ISMI is translated into the ISMI-C by going through forward and backward translation procedure. A cross-sectional research design is adopted that involved 295…
Kerkhoffs, Gino M. M. J.; Blankevoort, Leendert; Sierevelt, Inger N.; Corvelein, Ruby; Janssen, Guido H. W.; van Dijk, C. Niek
Two test devices were manufactured to objectively measure ankle joint laxity: the dynamic anterior ankle tester (DAAT) and the quasi-static anterior ankle tester (QAAT). The primary aim was to analyse the reliability of both testers; The secondary aim was to assess validity in correlation with TELOS
Halpin, Glennelle; Halpin, Gerald
Research indicating that different cut-off points result from the use of different standard-setting techniques leaves decision makers with a disturbing dilemma: Which standard-setting method is best? This investigation of the reliability and validity of 10 different standard-setting approaches was designed to provide information that might help…
Yildiz, F. Ülkü; Çagdas, Aysel; Kayili, Gökhan
The purpose of this study is to perform the validity-reliability analysis of the three subtests of Basic School Skills Inventory 3--Mathematics, Classroom Behavior and Daily Life skills--and do its adaptation for four to six year-old Turkish children. The sample of the study included 595 four to six year-old Turkish children attending public and…
Tretter, Thomas R.; Brown, Sherri L.; Bush, William S.; Saderholm, Jon C.; Holmes, Vicki-Lynn
Science teachers' content knowledge is an important influence on student learning, highlighting an ongoing need for programs, and assessments of those programs, designed to support teacher learning of science. Valid and reliable assessments of teacher science knowledge are needed for direct measurement of this crucial variable. This paper…
Arevalo Romero, J.; Brinkkemper, T.; van der Heide, A.; Rietjens, J.A.; Ribbe, M.W.; Deliens, L.; Loer, S.A.; Zuurmond, W.W.A.; Perez, R.S.G.M.
Context: Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. Objectives: To study the reliability and validity of
van der Wulp, I.
Reliability and validity of triage systems is important because this can affect patient safety. In this thesis, these aspects of two emergency department (ED) triage systems were studied as well as methodological aspects in these types of studies. The consistency, reproducibility, and criterion
Putnam, Frank W.; And Others
Evaluation of the Child Dissociative Checklist found it to be a reliable and valid observer report measure of dissociation in children, including sexually abused girls and children with dissociative disorder and with multiple personality disorder. The checklist, which is appended, is intended as a clinical screening instrument and research measure…
Kooiman, Thea; Dontje, Manon L.; Sprenger, Siska; Krijnen, Wim; van der Schans, Cees; de Groot, Martijn
Background: Activity trackers can potentially stimulate users to increase their physical activity behavior. The aim of this study was to examine the reliability and validity of ten consumer activity trackers for measuring step count in both laboratory and free-living conditions. Method: Healthy
Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. Objective: To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. Design: A blinded, repeated-measures diagnostic test was carried ...
Automated Body Reaction Test (ABRT) is a new device for skills and physical assessment instrument to measure ability on react, move quickly and accurately in accordance with stimulus. A total of 474 subjects aged 7-17 years old were randomly selected for the construct validity (n=330) and reliability (n=144). The ABRT ...
The aim of this study is to develop a useful, valid and reliable measurement tool that will help teacher candidates determine their Turkish metalinguistic awareness. During the development of the scale, a pool of items was created by scanning the relevant literature and examining other awareness scales. The materials prepared were re-examined…
Results: Two valid factors emerged with items 1-3 and items 4, 5 & 7 loading on respectively, making the BFSS a twodimensional (multidimensional) scale which measures 2 aspects of brain fag [labeled burning sensation and crawling sensation respectively]. The reliability analysis yielded a Cronbach Alpha coefficient of ...
Fuchs, Lynn; And Others
A study was conducted to explore the reliability and validity of three prominent procedures used in informal reading inventories (IRIs): (1) choosing a 95% word recognition accuracy standard for determining student instructional level, (2) arbitrarily selecting a passage to represent the difficulty level of a basal reader, and (3) employing…
In facilitating cross-cultural study in the field of psychology and Logotherapy, the reliability and validity of the logotest which measures inner meaning fulfillment was carried out among 885 University of Ibadan students, 439 males and 434 females, aged between 15 and 60 years old with mean X age of 6.0. Data analyses ...
de Boer, J. B.; Sprangers, M. A.; Aaronson, N. K.; Lange, J. M.; van Dam, F. S.
The objective of this study was to evaluate the feasibility, reliability, validity and responsiveness of the HIV Overview of Problems Evaluation System (HOPES) in a Dutch sample. The HOPES was administered three times in a one-year period to a sample of 106 outpatients with a symptomatic
Shrestha, Bidhan; Niraula, Surya Raj; Parajuli, Prakash K; Suwal, Pramita; Singh, Raj Kumar
To assess the reliability and to validate the translated Nepalese version of the Oral Health Impact Profile (OHIP-EDENT-N) in Nepalese edentulous subjects. The international guidelines for translation and cross-cultural adaption of OHIP-EDENT were followed, and a Nepalese version of the questionnaire was adapted for this study. Eighty-eight completely edentulous subjects were then selected for the study and completed their responses for the questionnaire. The reliability of the OHIP-EDENT-N was evaluated using internal consistency. Validity was assessed as construct and convergent validity. Construct validity was determined using exploratory factor analysis (EFA). The correlation between OHIP-EDENT-N subscale scores and the global question was investigated to test the convergent validity. Cronbach's alpha for the total score of OHIP-EDENT-N was 0.78. Construct validity was assessed by factor analysis: 70.196% of the variance was accountable to five factors extracted from the factor analysis. Factor loadings above 0.40 were noted for all items. In terms of convergent validity, significant correlations could be established between OHIP-EDENT-N and global questions. This study has been able to establish the reliability and validity of the OHIP-EDENT-N, and OHIP-EDENT-N can be a considered a reliable tool to assess the oral health related quality of life in the Nepalese edentulous population. © 2016 by the American College of Prosthodontists.
Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto
Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Full Text Available Validity and Reliability of Agoraphobic Cognitions Questionnaire-Turkish Version Objective: The aim of this study is to investigate the validity and reliability of Agoraphobic Cognitions Questionnaire -Turkish Version (ACQ. Method: ACQ was administered to 92 patients with agoraphobia or panic disorder with agoraphobia. BSQ Turkish version completed by translation, back-translation and pilot assessment. Reliability of ACQ was analyzed by test-retest correlation, split-half technique, Cronbach’s alpha coefficient. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Bartlett test had been performed. Principal component analysis and varimax rotation used for factor analysis. Results: 64% of patients evaluated in the study were female and 36% were male. Age interval was between 18 and 58, mean age was 31.5±10.4. The Cronbach’s alpha coefficient was 0.91. Analysis of test-retest evaluations revealed that there were statistically significant correlations ranging between 24% and 84% concerning questionnaire components. In analysis performed by split-half method reliability coefficients of half questionnaires were found as 0.77 and 0.91. Again Spearmen-Brown coefficient was found as 0.87 by the same analysis. To assess construct validity of ACQ, factor analysis was performed and two basic factors found. These two factors explained 57.6% of the total variance. (Factor 1: 34.6%, Factor 2: 23% Conclusion: Our findings support that ACQ-Turkish version had a satisfactory level of reliability and validity
Feasibility, reliability, and validity of the Pediatric Quality of Life Inventory ™ generic core scales, cancer module, and multidimensional fatigue scale in long-term adult survivors of pediatric cancer.
Robert, Rhonda S; Paxton, Raheem J; Palla, Shana L; Yang, Grace; Askins, Martha A; Joy, Shaini E; Ater, Joann L
Most health-related quality of life assessments are designed for either children or adults and have not been evaluated for adolescent and young adult survivors of pediatric cancer. The objective of this study was to examine the feasibility, reliability, and validity of the Pediatric Quality of Life Inventory (PedsQL ™ Generic Core Scales, Cancer Module, and Multidimensional Fatigue Scale in adult survivors of pediatric cancer. Adult survivors (n = 64; Mean age 35 year old; >2 years after treatment) completed the PedsQL™ Generic Core Scales, Cancer Module, and Multidimensional Fatigue Scale. Feasibility was examined with floor and ceiling effects; and internal consistency was determined by Cronbach's coefficient alpha calculations. Inter-factor correlations were also assessed. Significant ceiling effects were observed for the scales of social function, nausea, procedural anxiety, treatment anxiety, and communication. Internal consistency for all subscales was within the recommended ranges (α ≥ 0.70). Moderate to strong correlations between most Cancer Module and Generic Core Scales (r = 0.25 to r = 0.76) and between the Multidimensional Fatigue Scale and Generic Core Scales (r = 0.37 to r = 0.73). The PedsQL™ Generic Core Scales, Cancer Module, and Multidimensional Fatigue Scale appear to be feasible for an older population of pediatric cancer survivors; however, some of the Cancer Module Scales (nausea, procedural/treatment anxiety, and communication) were deemed not relevant for long-term survivors. More information is needed to determine whether the issues addressed by these modules are meaningful to long-term adult survivors of pediatric cancers. Copyright © 2012 Wiley Periodicals, Inc.
Yoshizumi, Takahiro; Murase, Satomi; Murakami, Takashi; Takai, Jiro
The purposes of the present study were to develop a Parenting Scale of Inconsistency and to evaluate its initial reliability and validity. The 12 items assess the inconsistency among parents' moods, behaviors, and attitudes toward children. In the primary study, 517 participants completed three measures: the new Parenting Scale of Inconsistency, the Parental Bonding Instrument, and the Depression Scale of the General Health Questionnaire. The Parenting Scale of Inconsistency had good test-retest reliability of .85 and internal consistency of .88 (Cronbach coefficient alpha). Construct validity was good as Inconsistency scores were significantly correlated with the Care and Overprotection scores of the Parental Bonding Instrument and with the Depression scores. Moreover, Inconsistency scores' relation with a dimension of parenting style distinct from Care and Overprotection suggested that the Parenting Scale of Inconsistency had factorial validity. This scale seems a potential measure for examining the relationships between inconsistent parenting and the mental health of children.
Lagarde, Marloes L J; Kamalski, Digna M A; van den Engel-Hoek, Lenie
To systematically review the available evidence for the reliability and validity of cervical auscultation in diagnosing the several aspects of dysphagia in adults and children suffering from dysphagia. Medline (PubMed), Embase and the Cochrane Library databases. The systematic review was carried out applying the steps of the PRISMA-statement. The methodological quality of the included studies were evaluated using the Dutch 'Cochrane checklist for diagnostic accuracy studies'. A total of 90 articles were identified through the search strategy, and after applying the inclusion and exclusion criteria, six articles were included in this review. In the six studies, 197 patients were assessed with cervical auscultation. Two of the six articles were considered to be of 'good' quality and three studies were of 'moderate' quality. One article was excluded because of a 'poor' methodological quality. Sensitivity ranges from 23%-94% and specificity ranges from 50%-74%. Inter-rater reliability was 'poor' or 'fair' in all studies. The intra-rater reliability shows a wide variance among speech language therapists. In this systematic review, conflicting evidence is found for the validity of cervical auscultation. The reliability of cervical auscultation is insufficient when used as a stand-alone tool in the diagnosis of dysphagia in adults. There is no available evidence for the validity and reliability of cervical auscultation in children. Cervical auscultation should not be used as a stand-alone instrument to diagnose dysphagia. © The Author(s) 2015.
Understanding Health-related Quality of Life in Caregivers of Civilians and Service Members/Veterans with Traumatic Brain Injury: Establishing the Reliability and Validity of PROMIS Mental Health Measures.
Carlozzi, Noelle E; Hanks, Robin; Lange, Rael T; Brickell D Psych, Tracey A; Ianni, Phillip A; Miner, Jennifer A; French Psy D, Louis M; Kallen, Michael A; Sander, Angelle M
To provide important reliability and validity data to support the use of the PROMIS Mental Health measures in caregivers of civilians or service members/veterans with traumatic brain injury (TBI). Patient-reported outcomes surveys administered through an electronic data collection platform. Three TBI Model Systems rehabilitation hospitals, an academic medical center, and a military medical treatment facility. 560 caregivers of individuals with a documented TBI (344 civilians and 216 military) INTERVENTION: Not Applicable MAIN OUTCOME MEASURES: PROMIS Anxiety, Depression, and Anger Item Banks RESULTS: Internal consistency for all of the PROMIS Mental Health item banks was very good (all α > .86) and three-week test retest reliability was good to adequate (ranged from .65 to .85). Convergent validity and discriminant validity of the PROMIS measures was also supported. Caregivers of individuals that were low functioning had worse emotional HRQOL (as measured by the three PROMIS measures) than caregivers of high functioning individuals, supporting known groups validity. Finally, levels of distress, as measured by the PROMIS measures, were elevated for those caring for low-functioning individuals in both samples (rates ranged from 26.2% to 43.6% for caregivers of low-functioning individuals). Results support the reliability and validity of the PROMIS Anxiety, Depression, and Anger item banks in caregivers of civilians and service members/veterans with TBI. Ultimately, these measures can be used to provide a standardized assessment of HRQOL as it relates to mental health in these caregivers. Copyright © 2018. Published by Elsevier Inc.
La Monte, Michelle Evonne
This study focused on developing a valid and reliable instrument that can not only identify successful co-teaching, but also the professional development needs of co-teachers and their administrators in public schools. Two general questions about the quality of co-teaching were addressed in this study: (a) How well did descriptors within each of…
van Saane, N.; Sluiter, J. K.; Verbeek, J. H. A. M.; Frings-Dresen, M. H. W.
Background Although job satisfaction research has been carried out for decades, no recent overview of job satisfaction instruments and their quality is available. Aim The aim of this systematic review is to select job satisfaction instruments of adequate reliability and validity for use as
Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D
Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data was retrieved from two previous studies, including forced expiratory volume in 1-s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which have not been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
Claycamp, H Gregg; Rahaman, Faiad; Urban, Jason M
Engineering reliability typically refers to the probability that a system, or any of its components, will perform a required function for a stated period of time and under specified operating conditions. As such, reliability is inextricably linked with time-dependent quality concepts, such as maintaining a state of control and predicting the chances of losses from failures for quality risk management. Two popular current good manufacturing practice (cGMP) and quality risk management tools, failure mode and effects analysis (FMEA) and root cause analysis (RCA) are examples of engineering reliability evaluations that link reliability with quality and risk. Current concepts in pharmaceutical quality and quality management systems call for more predictive systems for maintaining quality; yet, the current pharmaceutical manufacturing literature and guidelines are curiously silent on engineering quality. This commentary discusses the meaning of engineering reliability while linking the concept to quality systems and quality risk management. The essay also discusses the difference between engineering reliability and statistical (assay) reliability. The assurance of quality in a pharmaceutical product is no longer measured only "after the fact" of manufacturing. Rather, concepts of quality systems and quality risk management call for designing quality assurance into all stages of the pharmaceutical product life cycle. Interestingly, most assays for quality are essentially static and inform product quality over the life cycle only by being repeated over time. Engineering process reliability is the fundamental concept that is meant to anticipate quality failures over the life cycle of the product. Reliability is a well-developed theory and practice for other types of manufactured products and manufacturing processes. Thus, it is well known to be an appropriate index of manufactured product quality. This essay discusses the meaning of reliability and its linkages with quality
Yoshii, Hatsumi; Mandai, Nozomu; Saito, Hidemitsu; Akazawa, Kouhei
Self-stigma, defined by a negative attitude toward oneself combined with the consciousness of being a target of prejudice, is a critical problem for psychiatric patients. Self-stigma studies among psychiatric patients have indicated that high stigma is predictive of detrimental effects such as the delay of treatment and decreases in social participation in patients, and levels of self-stigma should be statistically evaluated. In this study, we developed the Workplace Social Distance Scale (WSDS), rephrasing the eight items of the Japanese version of the Social Distance Scale (SDSJ) to apply to the work setting in Japan. We examined the reliability and validity of the WSDS among 83 psychiatric patients. Factor analysis extracted three factors from the scale items: "work relations," "shallow relationships," and "employment." These factors are similar to the assessment factors of the SDSJ. Cronbach's alpha coefficient for the WSDS was 0.753. The split-half reliability for the WSDS was 0.801, indicating significant correlations. In addition, the WSDS was significantly correlated with the SDSJ. These findings suggest that the WSDS represents an approximation of self-stigma in the workplace among psychiatric patients. Our study assessed the reliability and validity of the WSDS for measuring self-stigma in Japan. Future studies should investigate the reliability and validity of the scale in other countries.
Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin
The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P visual analog scale was r = 0.626 (P disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur
The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred-sixty one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: email@example.com.
Full Text Available Purpose: Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. Methods: A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF with oblimin rotation, a scree plot test, and parallel analysis (PA. The reliability of LSS was tested using Cronbach’s α, corrected item-total correlation, and test-retest. Results: Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach’s α was > 0.70 for all subscales in both study populations. The corrected item-total correlations were > 0.30 for the items in each component. Conclusion: The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L
Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups which indicates construct validity, p fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.
Abdollahimohammad, Abdolghani; Ja'afar, Rogayah
Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF) with oblimin rotation, a scree plot test, and parallel analysis (PA). The reliability of LSS was tested using Cronbach's α, corrected item-total correlation, and test-retest. Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach's α was >0.70 for all subscales in both study populations. The corrected item-total correlations were >0.30 for the items in each component. The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Arevalo, Jimmy J; Brinkkemper, Tijn; van der Heide, Agnes; Rietjens, Judith A; Ribbe, Miel; Deliens, Luc; Loer, Stephan A; Zuurmond, Wouter W A; Perez, Roberto S G M
Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. To study the reliability and validity of observer-based sedation scales in palliative sedation. A prospective evaluation of 54 patients under intermittent or continuous sedation with four sedation scales was performed by 52 nurses. Included scales were the Minnesota Sedation Assessment Tool (MSAT), Richmond Agitation-Sedation Scale (RASS), Vancouver Interaction and Calmness Scale (VICS), and a sedation score proposed in the Guideline for Palliative Sedation of the Royal Dutch Medical Association (KNMG). Inter-rater reliability was tested with the intraclass correlation coefficient (ICC) and Cohen's kappa coefficient. Correlations between the scales using Spearman's rho tested concurrent validity. We also examined construct, discriminative, and evaluative validity. In addition, nurses completed a user-friendliness survey. Overall moderate to high inter-rater reliability was found for the VICS interaction subscale (ICC = 0.85), RASS (ICC = 0.73), and KNMG (ICC = 0.71). The largest correlation between scales was found for the RASS and KNMG (rho = 0.836). All scales showed discriminative and evaluative validity, except for the MSAT motor subscale and VICS calmness subscale. Finally, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. The RASS and KNMG scales stand as the most reliable and valid among the evaluated scales. In addition, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. Further research is needed to evaluate the impact of the scales on better symptom control and patient comfort. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
Ahmet Emre SARGIN
Full Text Available Objective: Distress Tolerance Scale (DTS is developed by Simons and Gaher in order to measure individual differences in the capacity of distress tolerance.The aim of this study is to assess the reliability and validity of the Turkish version of DTS. Method: One hundred and sixty seven university students (male=66, female=101 participated in this study. Beck Anxiety Inventory (BAI, State-trait Anxiety Inventory (STAI and Discomfort Intolerance Scale (DIS were used to determine the criterion validity. Construct validity was evaluated with factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability, the scale was re-applied to 79 participants six weeks later. Results: To assess construct validity, factor analyses were performed using varimax principal components analysis with varimax rotation. While there were factors in the original study, our factor analysis resulted in three factors. Cronbach’s alpha coefficients for the entire scale and tolerance, regulation, self-efficacy subscales were .89, .90, .80 and .64 respectively. There were correlations at the level of 0.01 between the Trait Anxiety Inventory of STAI and BAI, and all the subscales of DTS and also between the State Anxiety Inventory and regulation subscale. Both of the subscales of DIS were correlated with the entire subscale and all the subscales except regulation at the level of 0.05.Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DTS had a satisfactory level of reliability and validity in Turkish university students.
Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl
To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P volleyball players.
Understanding health-related quality of life in caregivers of civilians and service members/veterans with traumatic brain injury: Establishing the reliability and validity of PROMIS Fatigue and Sleep Disturbance item banks.
Carlozzi, Noelle E; Ianni, Phillip A; Tulsky, David S; Brickell, Tracey A; Lange, Rael T; French, Louis M; Cella, David; Kallen, Michael A; Miner, Jennifer A; Kratz, Anna L
To examine the reliability and validity of Patient Reported Outcomes Measurement Information System (PROMIS) measures of sleep disturbance and fatigue in TBI caregivers and to determine the severity of fatigue and sleep disturbance in these caregivers. Cross-sectional survey data collected through an online data capture platform. Four rehabilitation hospitals and Walter Reed National Military Medical Center. Caregivers (N=560) of civilians (n=344) and service member/veterans (n=216) with TBI. Not Applicable MAIN OUTCOME MEASURES: PROMIS sleep and fatigue measures administered as both computerized adaptive tests (CATs) and 4-item short forms (SFs). For both samples, floor and ceiling effects for the PROMIS measures were low (internal consistency was very good (all alphas ≥0.80), and test-retest reliability was acceptable (all r≥0.70 except for the fatigue CAT in the service member/veteran sample r=0.63). Convergent validity was supported by moderate correlations between the PROMIS and related measures. Discriminant validity was supported by low correlations between PROMIS measures and measures of dissimilar constructs. PROMIS scores indicated significantly worse sleep and fatigue for those caring for someone with high levels versus low levels of impairment. Findings support the reliability and validity of the PROMIS CAT and SF measures of sleep disturbance and fatigue in caregivers of civilians and service members/veterans with TBI. Copyright © 2018. Published by Elsevier Inc.
Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob
study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately......The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present...... = .07). The limits of agreement were ±18.53. A strong correlation was found between test and retest (intercorrelation coefficient .908); the standard error of measurement was 6.7, and the minimal detectable change was 18.5. The Danish version of the ATRS showed moderately strong criterion validity...
Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina
We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
Ruhinda, E; Byanyima, R K; Mugerwa, H
Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented however there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. A blinded, repeated-measures diagnostic test was carried out on lumbar spine x-ray radiographs. Radiology Department at Joint Clinical Research Centre (JCRC), Mengo-Kampala-Uganda. Seventy (70) lateral lumbar x-ray films were used for this study and were obtained from the archive of JCRC radiology department at Butikiro house, Mengo-Kampala. Poor observer agreement, both inter- and intra-observer, with kappa values of 0.16 was found. Inter-observer agreement was poorer than intra-observer agreement. Kappa values significantly rose when the lumbar lordosis was clustered into four categories without grading each abnormality. The results confirm that subjective assessment of lumbar lordosis has low reliability and validity. Film quality has limited influence on the observer reliability. This study further shows that fewer scale categories of lordosis abnormalities produce better observer reliability.
Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo
Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit ( congruent with0.50). The individual subscales of the mVSS fell well below the acceptable level (0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars shows high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.
Collins, Cristiana Kahl; Johnson, Vicky Saliba; Godwin, Ellen M; Pappas, Evangelos
To determine the reliability and validity of the Saliba Postural Classification System (SPCS). Two physical therapists classified pictures of 100 volunteer participants standing in their habitual posture for inter and intra-tester reliability. For validity, 54 participants stood on a force plate in a habitual and a corrected posture, while a vertical force was applied through the shoulders until the clinician felt a postural give. Data were extracted at the time the give was felt and at a time in the corrected posture that matched the peak vertical ground reaction force (VGRF) in the habitual posture. Inter-tester reliability demonstrated 75% agreement with a Kappa = 0.64 (95% CI = 0.524-0.756, SE = 0.059). Intra-tester reliability demonstrated 87% agreement with a Kappa = 0.8, (95% CI = 0.702-0.898, SE = 0.05) and 80% agreement with a Kappa = 0.706, (95% CI = 0.594-0818, SE = 0.057). The examiner applied a significantly higher (p < 0.001) peak vertical force in the corrected posture prior to a postural give when compared to the habitual posture. Within the corrected posture, the %VGRF was higher when the test was ongoing vs. when a postural give was felt (p < 0.001). The %VGRF was not different between the two postures when comparing the peaks (p = 0.214). The SPCS has substantial agreement for inter- and intra-tester reliability and is largely a valid postural classification system as determined by the larger vertical forces in the corrected postures. Further studies on the correlation between the SPCS and diagnostic classifications are indicated.
Manuel V. Garnacho-Castaño
Full Text Available The objectives of the study were to determine the validity and reliability of peak velocity (PV, average velocity (AV, peak power (PP and average power (AP measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain during two resistance exercises, bench press (BP and full back squat (BS, performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2. Intraclass correlation coefficients (ICCs indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W. Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W. Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP make this device a useful tool for monitoring resistance training.
Erkan Alpsoy; Yeşim Şenol; Aslı Bilgiç Temel; G. Özge Baysal; Ayşe Akman Karakaş
Backround and design. Internalized stigma involves endorsing negative feelings and beliefs such as insignificance, shame and withdrawal triggered by applying these negative stereotypes to one self. Internalized Stigma Scale has not been applied to psoriasis patients. We aimed to evaluate the reliability and validity of Internalized Stigma Scale in psoriasis patients. Materials and Methods. 100 consecutive, volunteer psoriasis patients (48 female, 52 male; aged, 40.59±15.44 years) were enro...
Sayed Hadi Sayed Alitabar; Mojtaba Habibi; Maryam Falahatpisheh; Musa Arvin
Background and Objective: According to the increasing of substance use in the country, more researches about this phenomenon are necessary. This Study Investigates the Validity, Reliability and Confirmatory Factor Structure of the Drug Abuse Screening test (DAST). Materials and Methods: The Sample Consisted of 381 Patients (143 Women and 238 Men) with a Multi-Stage Cluster Sampling of Areas 2, 6 and 12 of Tehran Were Selected from Each Region, 6 Randomly Selected Drug Rehabilitation Center. T...
Cruz, Jonas P; Baldacchino, Donia R; Alquwez, Nahed
Patients often resort to religious and spiritual activities to cope with physical and mental challenges. The effect of spiritual coping on overall health, adaptation and health-related quality of life among patients undergoing haemodialysis (HD) is well documented. Thus, it is essential to establish a valid and reliable instrument that can assess both the religious and non-religious coping methods in patients undergoing HD. This study aimed to assess the validity and reliability of the Spiritual Coping Strategies Scale Arabic version (SCS-A) in Saudi patients undergoing HD. A convenience sample of 60 Saudi patients undergoing HD was recruited for this descriptive, cross-sectional study. Data were collected between May and June 2015. Forward-backward translation was used to formulate the SCS-A. The SCS-A, Muslim Religiosity Scale and the Quality of Life Index Dialysis Version III were used to procure the data. Internal consistency reliability, stability reliability, factor analysis and construct validity tests were performed. Analyses were set at the 0.05 level of significance. The SCS-A showed an acceptable internal consistency and strong stability reliability over time. The EFA produced two factors (non-religious and religious coping). Satisfactory construct validity was established by the convergent and divergent validity and known-groups method. The SCS-A is a reliable and valid tool that can be used to measure the religious and non-religious coping strategies of patients undergoing HD in Saudi Arabia and other Muslim and Arabic-speaking countries. © 2016 European Dialysis and Transplant Nurses Association/European Renal Care Association.
Park, Dae-Sung; Lee, GyuChang
A balance test provides important information such as the standard to judge an individual's functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment software using the Nintendo Wii Balance Board, investigated its reliability and validity, and compared it with a laboratory-grade force platform. Twenty healthy adults participated in our study. The participants participated in the test for inter-rater reliability, intra-rater reliability, and concurrent validity. The tests were performed with balance assessment software using the Nintendo Wii balance board and a laboratory-grade force platform. Data such as Center of Pressure (COP) path length and COP velocity were acquired from the assessment systems. The inter-rater reliability, the intra-rater reliability, and concurrent validity were analyzed by an intraclass correlation coefficient (ICC) value and a standard error of measurement (SEM). The inter-rater reliability (ICC: 0.89-0.79, SEM in path length: 7.14-1.90, SEM in velocity: 0.74-0.07), intra-rater reliability (ICC: 0.92-0.70, SEM in path length: 7.59-2.04, SEM in velocity: 0.80-0.07), and concurrent validity (ICC: 0.87-0.73, SEM in path length: 5.94-0.32, SEM in velocity: 0.62-0.08) were high in terms of COP path length and COP velocity. The balance assessment software incorporating the Nintendo Wii balance board was used in our study and was found to be a reliable assessment device. In clinical settings, the device can be remarkably inexpensive, portable, and convenient for the balance assessment.
Gore, Shweta; Blackwood, Jennifer; Guyette, Mary; Alsalaheen, Bara
Reduced physical activity is associated with poor prognosis in chronic obstructive pulmonary disease (COPD). Accelerometers have greatly improved quantification of physical activity by providing information on step counts, body positions, energy expenditure, and magnitude of force. The purpose of this systematic review was to compare the validity and reliability of accelerometers used in patients with COPD. An electronic database search of MEDLINE and CINAHL was performed. Study quality was assessed with the Strengthening the Reporting of Observational Studies in Epidemiology checklist while methodological quality was assessed using the modified Quality Appraisal Tool for Reliability Studies. The search yielded 5392 studies; 25 met inclusion criteria. The SenseWear Pro armband reported high criterion validity under controlled conditions (r = 0.75-0.93) and high reliability (ICC = 0.84-0.86) for step counts. The DynaPort MiniMod demonstrated highest concurrent validity for step count using both video and manual methods. Validity of the SenseWear Pro armband varied between studies especially in free-living conditions, slower walking speeds, and with addition of weights during gait. A high degree of variability was found in the outcomes used and statistical analyses performed between studies, indicating a need for further studies to measure reliability and validity of accelerometers in COPD. The SenseWear Pro armband is the most commonly used accelerometer in COPD, but measurement properties are limited by gait speed variability and assistive device use. DynaPort MiniMod and Stepwatch accelerometers demonstrated high validity in patients with COPD but lack reliability data.
Cetin, Fatma Cosar; Sezer, Ayse; Merih, Yeliz Dogan
OBJECTIVE: The objective of this study is to investigate the validity and the reliability of Birth Satisfaction Scale (BSS) and to adapt it into the Turkish language. This scale is used for measuring maternal satisfaction with birth in order to evaluate women’s birth perceptions. METHODS: In this study there were 150 women who attended to inpatient postpartum clinic. The participants filled in an information form and the BSS questionnaire forms. The properties of the scale were tested by conducting reliability and validation analyses. RESULTS: BSS entails 30 Likert-type questions. It was developed by Hollins Martin and Fleming. Total scale scores ranged between 30–150 points. Higher scores from the scale mean increases in birth satisfaction. Three overarching themes were identified in Scale: service provision (home assessment, birth environment, support, relationships with health care professionals); personal attributes (ability to cope during labour, feeling in control, childbirth preparation, relationship with baby); and stress experienced during labour (distress, obstetric injuries, receiving sufficient medical care, obstetric intervention, pain, prolonged labour and baby’s health). Cronbach’s alfa coefficient was 0.62. CONCLUSION: According to the present study, BSS entails 30 Likert-type questions and evaluates women’s birth perceptions. The Turkish version of BSS has been proven to be a valid and a reliable scale. PMID:28058355
Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob; Troelsen, Anders
The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately strong correlations with the physical subscores of the Medical Outcomes Study 36-item Short-Form Health Survey (r = .70 to .75; p questionnaire (r = .71; p validity. For study and follow-up purposes, the ATRS seems reliable for comparisons of groups of patients. Its usability is limited for repeated assessment of individual patients. The development of analysis guidelines would be desirable. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Curry, M A; Campbell, R A; Christian, M
Two studies of low-income pregnant women (N = 179) were done to examine the validity and reliability of the Prenatal Psychosocial Profile (PPP). The PPP, a composite of the Rosenberg Self-Esteem Scale, the Support Behaviors Inventory, and a newly developed measure of stress, is a brief, comprehensive clinical assessment of psychosocial risk during pregnancy. Construct validity of the stress scale was supported by theoretically predicted negative correlations with self-esteem, partner support, and support from others (N = 91). Convergent validity of the stress scale was demonstrated by a correlation of .71 with the Difficult Life Circumstances Scale. Adequate levels of internal consistency were found. Interrelationships between the four subscales were consistent with the underlying conceptualization, and there was beginning evidence of the factorial independence of the subscales.
Hestbaek, L; Leboeuf-Yde, C
OBJECTIVE: To systematically review the peer-reviewed literature about the reliability and validity of chiropractic tests used to determine the need for spinal manipulative therapy of the lumbo-pelvic spine, taking into account the quality of the studies. DATA SOURCES: The CHIROLARS database......-pelvic spine were included. DATA EXTRACTION: Data quality were assessed independently by the two reviewers, with a quality score based on predefined methodologic criteria. Results of the studies were then evaluated in relation to quality. DATA SYNTHESIS: None of the tests studied had been sufficiently...... evaluated in relation to reliability and validity. Only tests for palpation for pain had consistently acceptable results. Motion palpation of the lumbar spine might be valid but showed poor reliability, whereas motion palpation of the sacroiliac joints seemed to be slightly reliable but was not shown...
Condon, David; Revelle, William
Separating the signal in a test from the irrelevant noise is a challenge for all measurement. Low test reliability limits test validity, attenuates important relationships, and can lead to regression artifacts. Multiple approaches to the assessment and improvement of reliability are discussed. The advantages and disadvantages of several different approaches to reliability are considered. Practical advice on how to assess reliability using open source software is provided.
Full Text Available Objective: The Temperament and Character Inventory (TCI was developed to assess temperament including Novelty Seeking (NS, Harm Avoidance (HA, Reward Dependence (RD, Persistence (PS, and Character including Self-Directedness (SD, Cooperativeness (CO and Self Transcendence (ST dimensions of Cloninger's biopsychosocial model of personality in adults. The purpose of this study was to evaluate the reliability and validity of this inventory. Materials & Methods: In this validity test and standardization study, after translation of TCI into Farsi and back translation, the final form was prepared and administered to 220 students who were selected via simple sampling. Cronbach's alpha procedure and test-retest method was used to assess the reliability, and factor analysis of promax rotation was utilized to determine the validity of the inventory. Correlation of interscales and age with scales of TCI was calculated by Pearson correlation. A comparison of TCI scores between sex and also cross-cultural was down using independent t-test. Results: The alpha cofficients for the inventory ranged from 0.44 for the Persistence scale to 0.81 for the ST scale with a median 0f 0.68. The overall alpha cofficients for the whole inventory was 0.74. The Pearson correlation cofficient for the test-retest on 31 students after two months ranged from 0.53 for Novelty Seeking and Persistence to 0.82 for Harm Avoidance scales and from 0.24 for disorderliness vs regimentation (NS4 to 0.86 for fear of uncertainty vs self-confidene (HA2 subscales. The factor analysis showed six factors. Significant correlations were obtained between scales of Self–Directedness with Harm Avoidance (0.57, Self–Directedness with Cooperativeness (0.46. Conclusion: The current study confirms that Persian version of the Temperament and Character Inventory has satisfactory psychometric properties and acceptable reliability and validity for the use students of university population.
Full Text Available John C Sieverdes,1 Eric E Wickel,2 Gregory A Hand,3 Marco Bergamin,4 Robert R Moran,5 Steven N Blair3,51Medical University of South Carolina, College of Nursing and Medicine, Charleson, SC, 2University of Tulsa, Exercise and Sport Science, Tulsa, OK, 3University of South Carolina, Department of Exercise Science, Division of Health Aspects of Physical Activity, Arnold School of Public Health, Columbia, SC, USA; 4University of Padova, Department of Medicine, Sports Medicine Division, Padova, Italy; 5University of South Carolina, Department of Epidemiology and Biostatistics, Arnold School of Public Health, Columbia, SC, USABackground: This study evaluated the reliability and criterion validity of the Mywellness Key accelerometer (MWK using treadmill protocols and indirect calorimetry.Methods: Twenty-five participants completed two four-stage 20-minute treadmill protocols while wearing two MWK accelerometers. Reliability was assessed using raw counts. Validity was assessed by comparing the estimated VO2 calculated from the MWK with values from respiratory gas exchange.Results: Good overall and point estimates of reliability were found for the MWK (all intraclass correlations > 0.93. Generalizability theory coefficients showed lower values for running speed (0.70 versus walking speed (all > 0.84, with the majority of the overall percentage of variability derived from the participant (68%–88% of the total 100%. Acceptable validity was found overall (Pearson’s r = 0.895–0.902, P < 0.0001, with an overall mean absolute error of 16.22% and a coefficient of variance of 16.92%. Bland-Altman plots showed an overestimation of energy expenditure during the running speed, but total kilocalories were underestimated during the protocol by approximately 10%.Conclusion: Good validity was found during light and moderate walking, while running was slightly overestimated. The MWK may be useful for clinicians and researchers interested in promotion or assessment
Bühn, Stefanie; Mathes, Tim; Prengel, Peggy; Wegewitz, Uta; Ostermann, Thomas; Robens, Sibylle; Pieper, Dawid
There is a movement from generic quality checklists toward a more domain-based approach in critical appraisal tools. This study aimed to report on a first experience with the newly developed risk of bias in systematic reviews (ROBIS) tool and compare it with A Measurement Tool to Assess Systematic Reviews (AMSTAR), that is, the most common used tool to assess methodological quality of systematic reviews while assessing validity, reliability, and applicability. Validation study with four reviewers based on 16 systematic reviews in the field of occupational health. Interrater reliability (IRR) of all four raters was highest for domain 2 (Fleiss' kappa κ = 0.56) and lowest for domain 4 (κ = 0.04). For ROBIS, median IRR was κ = 0.52 (range 0.13-0.88) for the experienced pair of raters compared to κ = 0.32 (range 0.12-0.76) for the less experienced pair of raters. The percentage of "yes" scores of each review of ROBIS ratings was strongly correlated with the AMSTAR ratings (r s = 0.76; P = 0.01). ROBIS has fair reliability and good construct validity to assess the risk of bias in systematic reviews. More validation studies are needed to investigate reliability and applicability, in particular. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available Background: The Endometriosis Health Profile-30 (EHP-30 is a disease-specific questionnaire to measure the health-related quality of life in patients with endometriosis. The aim of this study was to evaluate the validity and reliability of the Persian version of Endometriosis Health Profile (EHP-30 in women with endometriosis referring to three Gynecology Clinics in Tehran, Iran. Methods: One hundred women (20 to 50 years old with surgically confirmed endometriosis recruited from three outpatient Gynecology Clinics affiliated to the Iran University of Medical Sciences. All 100 patients were asked to complete EHP-30 questionnaire while referring to the Clinics. The findings were analyzed using descriptive statistics, internal reliability consistency, construct validity (using short form-36, which had already been validated in Iran, factor analysis (with principle component analysis method, and item total correlation to assess the validity and reliability of the questionnaire. Results: The internal consistency reliability of the questionnaire was high (Cronbach’s α ranged between 0.80 and 0.93 for core, and 0.78 and 0.90 for modular parts. All items were loaded on their own factors except item 17 (feeling aggressive or violent and item 18 (feeling unwell, which were loaded on pain and social support domains, respectively. Construct validity of EHP-30, established by using SF-36, indicates good correlations in several similar scales of these two questionnaires. Conclusion: The findings of the study demonstrate that Persian version of EHP-30 is a valid and reliable measure to assess the quality of life in women with endometriosis
Long, S; Wang, C; Wang, L i; Yuan, J; Liu, H; Jiao, X
To clarify the relationship between human quality and reliability, 1925 experiments in 20 subjects were carried out to study the relationship between disposition character, digital memory, graphic memory, multi-reaction time and education level and simulated aircraft operation. Meanwhile, effects of task difficulty and enviromental factor on human reliability were also studied. The results showed that human quality can be predicted and evaluated through experimental methods. The better the human quality, the higher the human reliability.
Conclusion: The APA shows good internal reliability, test–retest reliability, discriminant validity, and construct validity. However, evidence of psychometric properties was limited by a small sample size. Psychometric properties such as interrater reliability as well as concurrent validity and construct validity need to be tested using a larger sample size with representative demographics.
Sayed Hadi Sayed Alitabar
Full Text Available Background and Objective: According to the increasing of substance use in the country, more researches about this phenomenon are necessary. This Study Investigates the Validity, Reliability and Confirmatory Factor Structure of the Drug Abuse Screening test (DAST. Materials and Methods: The Sample Consisted of 381 Patients (143 Women and 238 Men with a Multi-Stage Cluster Sampling of Areas 2, 6 and 12 of Tehran Were Selected from Each Region, 6 Randomly Selected Drug Rehabilitation Center. The DAST Was Used as Instrument. Divergent & Convergent Validity of this Scale Was Assessed with Problems Assessment for Substance Using Psychiatric Patients (PASUPP and Relapse Prediction Scale (RPS.Results: The DAST after the First Time Factor Structure of Using Confirmatory Factor Analysis Was Confirmed. The DAST Had a Good Internal Consistency (Cranach’s Alpha, and the Reliability of the Test Within a Week, 0.9, 0.8. Also this Scale Had a Positive Correlation with Problems Assessment for Substance Using Psychiatric Patients and Relapse Prediction Scale (P<0.01.Conclusion: The Overall Results Showed that the Drug Abuse Screening Test in Iranian Society Is Valid. It Can Be Said that Self-Report Scale Tool Is Useful for Research Purposes and Addiction.
Haggerty, Greg; Zodan, Jennifer; Mehra, Ashwin; Zubair, Ayyan; Ghosh, Krishnendu; Siefert, Caleb J; Sinclair, Samuel J; DeFife, Jared
The current study investigated the interrater reliability and validity of prototype ratings of 5 common adolescent psychiatric disorders: attention-deficit/hyperactivity disorder, conduct disorder, major depressive disorder, generalized anxiety disorder, and posttraumatic stress disorder. One hundred fifty-seven adolescent inpatient participants consented to participate in this study. We compared ratings from 2 inpatient clinicians, blinded to each other's ratings and patient measures, after their separate initial diagnostic interview to assess interrater reliability. Prototype ratings completed by clinicians after their initial diagnostic interview with adolescent inpatients and outpatients were compared with patient-reported behavior problems and parents' report of their child's behavioral problems. Prototype ratings demonstrated good interrater reliability. Clinicians' prototype ratings showed predicted relationships with patient-reported behavior problems and parent-reported behavior problems. Prototype matching seems to be a possible alternative for psychiatric diagnosis. Prototype ratings showed good interrater reliability based on clinicians unique experiences with the patient (as opposed to video-/audio-recorded material) with no training.
Full Text Available The aim of this research is to adapt the Workplace Bullying Scale (Tınaz, Gök & Karatuna, 2013 to Albanian language and to examine its psychometric properties. The research was conducted on 386 person from different sectors of Albania. Results of exploratory and confirmatory factor analysis demonstrated that Albanian scale yielded 2 factors different from original form because of cultural differences. Internal consistency coefficients are,890 -,801 and split-half test reliability coefficients, 864 -,808. Comfirmatory Factor Analysis results change from,40 to,73. Corrected item-total correlations ranged,339 to,672 and according to t-test results differences between each item’s means of upper 27% and lower 27% points were significant. Thus Workplace Bullying Scale can be use as a valid and reliable instrument in social sciences in Albania.
Bongers, Coen C W G; Daanen, Hein A M; Bogerd, Cornelis P; Hopman, Maria T E; Eijsvogels, Thijs M H
Telemetric temperature capsule systems are wireless, relatively noninvasive, and easily applicable in field conditions and have therefore great advantages for monitoring core body temperature. However, the accuracy and responsiveness of available capsule systems have not been compared previously. Therefore, the aim of this study was to examine the validity, reliability, and inertia characteristics of four ingestible temperature capsule systems (i.e., CorTemp, e-Celsius, myTemp, and VitalSense). Ten temperature capsules were examined for each system in a temperature-controlled water bath during three trials. The water bath temperature gradually increased from 33°C to 44°C in trials 1 and 2 to assess the validity and reliability, and from 36°C to 42°C in trial 3 to assess the inertia characteristics of the temperature capsules. A systematic difference between capsule and water bath temperature was found for CorTemp (0.077°C ± 0.040°C), e-Celsius (-0.081°C ± 0.055°C), myTemp (-0.003°C ± 0.006°C), and VitalSense (-0.017°C ± 0.023°C; P 0.05). Comparable inertia characteristics were found for CorTemp (25 ± 4 s), e-Celsius (21 ± 13 s), and myTemp (19 ± 2 s), whereas the VitalSense system responded more slowly (39 ± 6 s) to changes in water bath temperature (P inertia were observed between capsule systems, an excellent validity, test-retest reliability, and inertia was found for each system between 36°C and 44°C after removal of outliers.
Bonnet, Michael H; Doghramji, Karl; Roehrs, Timothy; Stepanski, Edward J; Sheldon, Stephen H; Walters, Arthur S; Wise, Merrill; Chesson, Andrew L
The reliability and validity of EEG arousals and other types of arousal are reviewed. Brief arousals during sleep had been observed for many years, but the evolution of sleep medicine in the 1980s directed new attention to these events. Early studies at that time in animals and humans linked brief EEG arousals and associated fragmentation of sleep to daytime sleepiness and degraded performance. Increasing interest in scoring of EEG arousals led the ASDA to publish a scoring manual in 1992. The current review summarizes numerous studies that have examined scoring reliability for these EEG arousals. Validity of EEG arousals was explored by review of studies that empirically varied arousals and found deficits similar to those found after total sleep deprivation depending upon the rate and extent of sleep fragmentation. Additional data from patients with clinical sleep disorders prior to and after effective treatment has also shown a continuing relationship between reduction in pathology-related arousals and improved sleep and daytime function. Finally, many suggestions have been made to refine arousal scoring to include additional elements (e.g., CAP), change the time frame, or focus on other physiological responses such as heart rate or blood pressure changes. Evidence to support the reliability and validity of these measures is presented. It was concluded that the scoring of EEG arousals has added much to our understanding of the sleep process but that significant work on the neurophysiology of arousal needs to be done. Additional refinement of arousal scoring will provide improved insight into sleep pathology and recovery.
Wikstrom, Erik A.
Context: Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. Objective: To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Design: Descriptive laboratory study. Setting: Sports medicine research laboratory. Patients or Other Participants: Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Intervention(s): Participants completed a single-limb–stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Main Outcome Measure(s): Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. Results: All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with
Wikstrom, Erik A
Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Descriptive laboratory study. Sports medicine research laboratory. Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Participants completed a single-limb-stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with scores ranging from fair (ICC = 0.74) to poor (ICC = 0.29). Wii Fit balance activity scores had poor concurrent validity relative to COP outcomes and SEBT
Monbaliu, Elegast; Ortibus, Els; Roelens, F; Desloovere, Kaat; Declerck, Jan; Prinzie, Peter; De Cock, Paul; Feys, Hilde
AIM: This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). METHOD: Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System a...
ATEŞ, Hatice KADIOĞLU; ADA, Sefer; BAYSAL, Z. Nurdan
Abstract The aim of this study is to develop visual presentation attitude rubric which is valid and reliable for the 4th grade students. 218 students took part in this study from Engin Can Güre which located in Istanbul, Esenler. While preparing this assessment tool with 34 criterias , 6 university lecturers view have been taken who are experts in their field. The answer key sheet has 4 (likert )type options. The rubric has been first tested by Kaiser-Meyer Olkin and Bartletts tests an...
Kurita, H; Miyake, Y
The Tokyo Autistic Behavior Scale (TABS) consisting of 39 items provisionally grouped in four areas--interpersonal-social relationship, language-communication, habit-mannerism and others--is an instrument used by a child's caretaker to rate the child's autistic behaviors on a 3-point scale. Test-retest reliability was satisfactory (i.e., an r for a total score was .94). Among six DSM-III diagnostic groups, infantile autism showed a significantly higher total TABS score than the other five groups, and a taxonomic validity coefficient was .54. An r between total scores of the TABS and the Childhood Autism Rating Scale--Tokyo Version was .59. The area scores showed a lower validity than the total score. The TABS appears to be a useful instrument to assess autistic behavior.
von Porat, Anette; Holmström, Eva; Roos, Ewa
BACKGROUND AND PURPOSE: In clinical practice, visual observation is often used to determine functional impairment and to evaluate treatment following a knee injury. The aim of this study was to evaluate the reliability and validity of observational assessments of knee movement pattern quality......, crossover hop on one leg and one-leg hop. The videos were observed by four physiotherapists, and the knee movement pattern quality, a feature of the loading strategy of the lower extremity, was scored on an 11-point rating scale. To assess the criterion validity, the observational rating was correlated...... obtained between the observers' assessment and knee flexion angle, r = 0.37-0.61. The crossover hop test or one-leg hop test was ranked as the most useful test in 172 of 192 occasions (90%) when assessing knee function. CONCLUSION: The moderate to good inter-observer reliability and the moderate criterion...
Burcu Ersöz Hüseyinsinoğlu
Full Text Available OBJECTIVE: The aim of this study was to adapt the Motor Activity Log-28 (MAL-28 into Turkish and probe the reliability and validity of this questionnaire in stroke patients. METHODS: Following the translation of the MAL-28 into Turkish, its reliability and construct validity was examined in 30 stroke patients. For the reliability study, patients were interviewed twice within a three day period, during which no rehabilitative activities were undertaken. The test-retest reliability was determined by using intra-class correlation coefficient (ICC and Spearman correlation coefficient (r; internal consistency was determined by Cronbach's alpha (α. The construct validity was examined by comparing MAL-28 Quality Of Movement (QOM scale and Amount Of Use (AOU scale with Wolf Motor Function Test (WMFT-Performance Time (PT and Functional Ability (FA scores. Furthermore, item-to-scale correlations of AOU and QOM scales were determined and correlation between totol scores of two scales was examined. RESULTS: Turkish version of MAL-28 AOU and QOM scales were reliable (ICC scores were 0.97 and 0.96, respectively and internally consistent (Cronbach’s α value was 0.96 for both scales. Test-retest reliability was supported (AOU, r=0.94; QOM, r=0.93. WMFT FA scores was correlated with both scales (r=0.63. Correlation between WMFT PT and AOU and QOM scales were -0.56 and -0.55. AOU and QOM scales were highly correlated (r=0.95. CONCLUSION: The findings indicate that Turkish version of MAL-28 is reliable and valid in individuals with stroke. Further investigation about its responsiveness is needed before using that version as a primary measurement in clinical trials
McEwan, Troy E; Shea, Daniel E; Daffern, Michael; MacKenzie, Rachel D; Ogloff, James R P; Mullen, Paul E
This study assessed the reliability and validity of the Stalking Risk Profile (SRP), a structured measure for assessing stalking risks. The SRP was administered at the point of assessment or retrospectively from file review for 241 adult stalkers (91% male) referred to a community-based forensic mental health service. Interrater reliability was high for stalker type, and moderate-to-substantial for risk judgments and domain scores. Evidence for predictive validity and discrimination between stalking recidivists and nonrecidivists for risk judgments depended on follow-up duration. Discrimination was moderate (area under the curve = 0.66-0.68) and positive and negative predictive values good over the full follow-up period ( Mdn = 170.43 weeks). At 6 months, discrimination was better than chance only for judgments related to stalking of new victims (area under the curve = 0.75); however, high-risk stalkers still reoffended against their original victim(s) 2 to 4 times as often as low-risk stalkers. Implications for the clinical utility and refinement of the SRP are discussed.
Maffini, Cara S; Wong, Y Joel
Although measures of cultural identity, values, and behavior exist in the multicultural psychological literature, there is currently no measure that explicitly assesses ethnic minority individuals' positive and negative affect toward culture. Therefore, we developed 2 new measures called the Feelings About Culture Scale--Ethnic Culture and Feelings About Culture Scale--Mainstream American Culture and tested their psychometric properties. In 6 studies, we piloted the measures, conducted factor analyses to clarify their factor structure, and examined reliability and validity. The factor structure revealed 2 dimensions reflecting positive and negative affect for each measure. Results provided evidence for convergent, discriminant, criterion-related, and incremental validity as well as the reliability of the scales. The Feelings About Culture Scales are the first known measures to examine both positive and negative affect toward an individual's ethnic culture and mainstream American culture. The focus on affect captures dimensions of psychological experiences that differ from cognitive and behavioral constructs often used to measure cultural orientation. These measures can serve as a valuable contribution to both research and counseling by providing insight into the nuanced affective experiences ethnic minority individuals have toward culture. (c) 2015 APA, all rights reserved).
Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U
The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p squat and FHS test performances (r squat and FHS test performance can be attributed to differences in the movement patterns of the tests
Full Text Available The aim of this research is to develop a measurement instrument that will determine the cultural responsive teaching readiness level of teacher candidates. The study group consisted of a total of 231 candidate teachers, of which 83 were males and 148 were females, who were attending their final year of class teacher education programs at various Turkish universities during the 2016-2017 education year. In the first phase, a 33-item draft form was presented to experts to be reviewed. Based on the feedback received, revisions were made and the final scale was applied to a group of 231 candidate teachers. In the analysis of the data obtained as the result of the application, Exploratory Factor Analysis (EFA was performed. The EFA produced 21 items within a two-factor structure as, “Personal Readiness” and “Professional Readiness.” It was observed that the sub-factors were components of the “cultural responsive teaching readiness” dimension, and that the goodness of fit measures obtained as a result of the First and Second Level Confirmatory Factor Analyzes (CFA were high. In addition, reliability coefficients were found to be high as a result of reliability measurements. With the help of these findings, this study concludes that the Cultural Responsive Teaching Readiness scale is both valid and reliable.
High product quality, service reliability, and management of operations are key factors in business growth and sustainability. Analyzing “The Starbucks Experience” is a pedagogical approach to reinforcing the concepts of control and management of quality, service reliability, and efficient operations in action. The objective ...
Harmanci Seren, Arzu Kader; Tuna, Rujnan; Eskin Bacaksiz, Feride
Objective measurement of the job performance of nursing staff using valid and reliable instruments is important in the evaluation of healthcare quality. A current, valid, and reliable instrument that specifically measures the performance of nurses is required for this purpose. The aim of this study was to determine the validity and reliability of the Turkish version of the Job Performance Instrument. This study used a methodological design and a sample of 240 nurses working at different units in four hospitals in Istanbul, Turkey. A descriptive data form, the Job Performance Scale, and the Employee Performance Scale were used to collect data. Data were analyzed using IBM SPSS Statistics Version 21.0 and LISREL Version 8.51. On the basis of the data analysis, the instrument was revised. Some items were deleted, and subscales were combined. The Turkish version of the Job Performance Instrument was determined to be valid and reliable to measure the performance of nurses. The instrument is suitable for evaluating current nursing roles.
Huber, Catherine; Mesbah, Mounir
Reliability and survival analysis are important applications of stochastic mathematics (probability, statistics and stochastic processes) that are usually covered separately in spite of the similarity of the involved mathematical theory. This title aims to redress this situation: it includes 21 chapters divided into four parts: Survival analysis, Reliability, Quality of life, and Related topics. Many of these chapters were presented at the European Seminar on Mathematical Methods for Survival Analysis, Reliability and Quality of Life in 2006.
Marc, Linda G; Henderson, Whitney R; Desrosiers, Astrid; Testa, Marcia A; Jean, Samuel E; Akom, Eniko Edit
There is limited information on depression in Haitians and this is partly attributable to the absence of culturally and linguistically adapted measures for depression. To perform a psychometric evaluation of the Haitian-Creole version of the PHQ-9 administered to men who have sex with men (MSM) in the Republic of Haiti. This study uses a cross-sectional design and data are from the Integrated Behavioral and Biological HIV Survey (IBBS) for MSM in Haiti. Inclusion criteria required that participants be male, ≥ 18 years, report sexual relations with a male partner in the last 12 months, and lived in Haiti during the past 3 months. Respondent Driven Sampling was used for participant recruitment. A structured questionnaire was verbally administered in Haitian-Creole capturing information on sociodemographics, sexual behaviors, human immunodeficiency virus (HIV) status and depressive symptomatology using the PHQ-9. Psychometric analyses of the translated PHQ-9 assessed unidimensionality, factor structure, reliability, construct validity, and differential item functioning (DIF) across subgroups (age, educational level, sexual orientation and HIV status). In a study population of 1,028 MSM, the Haitian-Creole version of the PHQ-9 is unidimensional, has moderately high internal consistency reliability (α = 0.78), and shows evidence of construct validity where HIV-positive subjects have greater depression (p = 0.002). There is no evidence of DIF across age, education, sexual orientation or HIV status. HIV-positive MSM are twice as likely to screen positive for moderately severe and severe depressive symptoms compared to their HIV-negative counterparts. There is strong evidence for the psychometric adequacy of the translated PHQ-9 screening tool as a measure of depression with MSM in Haiti. Future research is necessary to examine the predictive validity of depression for subsequent health behaviors or clinical outcomes among Haitian MSM.
Chung, Mi Ja; Park, Youngrye; Eun, Young
The aim of this study was to examine the validity and reliability of the Korean Version of the Spiritual Care Competence Scale (K-SCCS). A cross-sectional study design was used. The K-SCCS consisted of 26 questions to measure spiritual care competence of nurses. Participants, 228 nurses who had more than 3 years'experience as a nurse, completed the survey. Confirmatory factor analysis was used to examine the construct validity and correlations of K-SCCS and spiritual well-being (SWB) were used to examine the criterion validity of K-SCCS. Cronbach's alpha was used to test internal consistency. The construct and the criterion-related validity of K-SCCS were supported as measures of spiritual care competence. Cronbach's alpha was .95. Factor loadings of the 26 questions ranged from .60 to .96. Construct validity of K-SCCS was verified by confirmatory factor analysis (RMSEA=.08, CFI=.90, NFI=.85). Criterion validity compared to the SWB showed significant correlation (r=.44, pspiritual care competence with validity and reliability. However, further study is needed to retest the verification of the factor analysis related to factor 2 (professionalisation and improving the quality of spiritual care) and factor 3 (personal support and patient counseling). Therefore, we recommend using the total score without distinguishing subscales.
In educational research that calls itself empirical, the relationship between validity and reliability is that of trade-off: the stronger the bases for validity, the weaker the bases for reliability (and vice versa). Validity and reliability are widely regarded as basic criteria for evaluating research; however, there are ethical implications of…
Choi, Seong Uk; Lee, Hye Sun; Shin, Joon Ho; Ho, Seung Hee; Koo, Mi Jung; Park, Kyoung Hae; Yoon, Jeong Ah; Kim, Dong Min; Oh, Jung Eun; Yu, Se Hwa; Kim, Dong A
To establish the reliability and validity the Korean version of the Stroke Impact Scale (K-SIS) 3.0. A total of 70 post-stroke patients were enrolled. All subjects were evaluated for general characteristics, Mini-Mental State Examination (MMSE), the National Institutes of Health Stroke Scale (NIHSS), Modified Barthel Index, Hospital Anxiety and Depression Scale (HADS). The SF-36 and K-SIS 3.0 assessed their health-related quality of life. Statistical analysis after evaluation, determined the reliability and validity of the K-SIS 3.0. A total of 70 patients (mean age, 54.97 years) participated in this study. Internal consistency of the SIS 3.0 (Cronbach's alpha) was obtained, and all domains had good co-efficiency, with threshold above 0.70. Test-retest reliability of SIS 3.0 required correlation (Spearman's rho) of the same domain scores obtained on the first and second assessments. Results were above 0.5, with the exception of social participation and mobility. Concurrent validity of K-SIS 3.0 was assessed using the SF-36, and other scales with the same or similar domains. Each domain of K-SIS 3.0 had a positive correlation with corresponding similar domain of SF-36 and other scales (HADS, MMSE, and NIHSS). The newly developed K-SIS 3.0 showed high inter-intra reliability and test-retest reliabilities, together with high concurrent validity with the original and various other scales, for patients with stroke. K-SIS 3.0 can therefore be used for stroke patients, to assess their health-related quality of life and treatment efficacy.
Haidar, Rachid K; Kassak, Kassem; Masrouha, Karim; Ibrahim, Kamal; Mhaidli, Hani
Cross-sectional validation and reliability assessment study of Arabic version of Scoliosis Research Society-22 (SRS-22r) Questionnaire. To develop and validate the Arabic version of the SRS-22r questionnaire. The diagnosis and treatment of adolescent idiopathic scoliosis may influence patient quality of life. SRS-22r is an internationally validated questionnaire used to assess function/activity, pain, self-image, and mental health of patients with scoliosis. It has been translated into several languages but not into Arabic language. Therefore, a valid health-related quality-of-life outcome questionnaire for patients with spinal deformity is still lacking in Arabic language. The English version of SRS-22r questionnaire was translated, back-translated, and culturally adapted to Arabic language. Then, 81 patients with idiopathic adolescent scoliosis were allocated randomly into either the reliability testing group (group 1) or the validity testing group (group 2). Group 1 patients completed Arabic version of SRS-22r questionnaire twice with 1-week interval in-between. Cronbach α and intraclass correlation coefficient were measured to determine internal consistency and temporal reliability. Group 2 patients completed the Arabic version of SRS-22r questionnaire and the previously validated Arabic version of 36-Item Short Form Health Survey (Short Form-36) questionnaire concurrently, and Pearson correlation coefficient was obtained to assess validity. Content analysis, internal consistency reliability, test/retest reproducibility (intraclass correlation coefficient range: 0.82-0.90), and test of concurrent validity showed satisfactory results. Function/activity and satisfaction with management domains had a lower Cronbach α (0.58 and 0.44, respectively, vs. 0.71-0.85 range for others). Self-image/appearance and satisfaction with management had a lower correlation with domains of the 36-Item Short Form Health Survey. An Arabic version of the SRS-22r questionnaire has
Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje
This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters. Key points Study evaluated the reliability and construct validity of a newly developed inline skating skill test. Evaluated test is a first protocol designed to assess specific inline skating skill. Two groups of amateur skaters with different skating proficiency repeated the skill test in four separate occasions. The results suggest that evaluated test is reliable and valid to evaluate inline skating skill in amateur skaters. PMID:27803616
Conclusion: Only seven studies calculated validity coefficients within the study whereas 47 cited the validity coefficient. Twenty-six calculated a reliability coefficient whereas 47 cited the reliability of the ED measures. Four studies found validity evidence for the EAT, EDI, BULIT-R, QEDD, and EDE-Q in an athlete population. Few studies reviewed calculated validity and reliability coefficients of ED measures. Cross-validation of these measures in athlete populations is clearly needed.
Mraity, H.; England, A.; Hogg, P.
Purpose: Using AP pelvis as a catalyst, this paper explains how a psychometric scale for image quality assessment can be created using Bandura's theory for self-efficacy. Background: Establishing an accurate diagnosis is highly dependent upon the quality of the radiographic image. Image quality, as a construct (i.e. set of attributes that makes up the image quality), continues to play an essential role in the field of diagnostic radiography. The process of assessing image quality can be facilitated by using criteria, such as the European Commission (EC) guidelines for quality criteria as published in 1996. However, with the advent of new technology (Computed Radiography and Digital Radiography), some of the EC criteria may no longer be suitable for assessing the visual quality of a digital radiographic image. Moreover, a lack of validated visual image quality scales in the literature can also lead to significant variations in image quality evaluation. Creating and validating visual image quality scales, using a robust methodology, could reduce variability and improve the validity and reliability of perceptual image quality evaluations
Gusi, N; Perez-Sousa, M A; Gozalo-Delgado, M; Olivares, P R
A proxy version of the EQ-5D-Y, a questionnaire to evaluate the Health Related Quality of Life (HRQoL) in children and adolescents, has recently been developed. There are currently no data on the validity and reliability of this tool. The objective of this study was to analyze the validity and reliability of the EQ-5D-Y proxy version. A core set of self-report tools, including the Spanish version of the EQ-5D-Y were administered to a group of Spanish children and adolescents drawn from the general population. A similar core set of internationally standardized proxy tools, including the EQ-5D-Y proxy version were administered to their parents. Test-retest reliability was determined, and correlations with other generic measurements of HRQoL were calculated. Additionally, known group validity was examined by comparing groups with a priori expected differences in HRQoL. The agreement between the self-report and proxy version responses was also calculated. A total of 477 children and adolescents and their parents participated in the study. One week later, 158 participants completed the EQ-5D-Y/EQ-5D-Y proxy to facilitate reliability analysis. Agreement between the test-retest scores was higher than 88% for EQ-5D-Y self-report, and proxy version. Correlations with other health measurements showed similar convergent validity to that observed in the international EQ-5D-Y. Agreement between the self-report and proxy versions ranged from 72.9% to 97.1%. The results provide preliminary evidence of the reliability and validity of the EQ-5D-Y proxy version. Copyright © 2013 Asociación Española de Pediatría. Published by Elsevier Espana. All rights reserved.
Granier, Cyril; Hausswirth, Christophe; Dorel, Sylvain; Yann, Le Meur
This study aimed to determine the validity and the reliability of the Stages power meter crank system (Boulder, United States) during several laboratory cycling tasks. Eleven trained participants completed laboratory cycling trials on an indoor cycle fitted with SRM Professional and Stages systems. The trials consisted of an incremental test at 100W, 200W, 300W, 400W and four 7s sprints. The level of pedaling asymmetry was determined for each cycling intensity during a similar protocol completed on a Lode Excalibur Sport ergometer. The reliability of Stages and SRM power meters was compared by repeating the incremental test during a test-retest protocol on a Cyclus 2 ergometer. Over power ranges of 100-1250W the Stages system produced trivial to small differences compared to the SRM (standardized typical error values of 0.06, 0.24 and 0.08 for the incremental, sprint and combined trials, respectively). A large correlation was reported between the difference in power output (PO) between the two systems and the level of pedaling asymmetry (r=0.58, p system according to the level of pedaling asymmetry provided only marginal improvements in PO measures. The reliability of the Stages power meter at the sub-maximal intensities was similar to the SRM Professional model (coefficient of variation: 2.1 and 1.3% for Stages and SRM, respectively). The Stages system is a suitable device for PO measurements, except when a typical error of measurement power ranges of 100-1250W is expected.
Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo
We prepared a cross-cultural adaptation of the Behcet's Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Fifty patients with Behcet's disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet's Disease Current Activity Form (BDCAF) and a Behcet's Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). The Korean version of BSAS is a reliable and valid instrument to measure BD activity.
Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo
Background/Aims: We prepared a cross-cultural adaptation of the Behcet’s Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Methods: Fifty patients with Behcet’s disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet’s Disease Current Activity Form (BDCAF) and a Behcet’s Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Results: Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). Conclusions: The Korean version of BSAS is a reliable and valid instrument to measure BD activity. PMID:26767871
Pecht, Michael; Boulton, William R.
Quality and reliability are two attributes required for all Japanese products, although the JTEC panel found these attributes to be secondary to customer cost requirements. While our Japanese hosts gave presentations on the challenges of technology, cost, and miniaturization, quality and reliability were infrequently the focus of our discussions. Quality and reliability were assumed to be sufficient to meet customer needs. Fujitsu's slogan, 'quality built-in, with cost and performance as prime consideration,' illustrates this point. Sony's definition of a next-generation product is 'one that is going to be half the size and half the price at the same performance of the existing one'. Quality and reliability are so integral to Japan's electronics industry that they need no new emphasis.
This study describes the development and evaluation of the Nursery Teacher's Stress Scale (NTSS), which explores the relation between daily hassles at work and work-related stress. In Analysis 1, 29 items were chosen to construct the NTSS. Six factors were identified: I. Stress relating to child care; II. Stress from human relations at work; III. Stress from staff-parent relations; IV. Stress from lack of time; V. Stress relating to compensation; and VI. Stress from the difference between individual beliefs and school policy. All these factors had high degrees of internal consistency. In Analysis 2, the concurrent validity of the NTSS was examined. The results showed that the NTSS total scores were significantly correlated with the Job Stress Scale-Revised Version (job stressor scale, r = .68), the Pre-school Teacher-efficacy Scale (r = -.21), and the WHO-five Well-Being Index Japanese Version (r = -.40). Work stresses are affected by several daily hassles at work. The NTSS has acceptable reliability and validity, and can be used to improve nursery teacher's mental health.
Britt Karin Støen Utvær
Full Text Available Self-determination theory (SDT distinguishes types of motivation according to types of self-regulation along a continuum of internalisation. Types of motivation vary in quality and outcomes and are frequently used in research as predictors of educational outcomes such as learning, performance, engagement, and persistence. The Academic Motivation Scale (AMS, which is based on the SDT, has not previously been evaluated in Norway. In response, by using correlation and confirmatory factor analysis, we examined the dimensionality, reliability, and construct validity of the AMS among vocational health and social care students. Our hypothesised 7-factor model demonstrated the best fit, while the AMS demonstrated good reliability and construct validity in the sample of students. However, some improvements remain necessary. In predicting the rate of school completion among students on vocational tracks, amotivation and identified regulation appeared to be more powerful as intrinsic motivational variables.
Hulteen, Ryan M; Lander, Natalie J; Morgan, Philip J; Barnett, Lisa M; Robertson, Samuel J; Lubans, David R
It has been suggested that young people should develop competence in a variety of 'lifelong physical activities' to ensure that they can be active across the lifespan. The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71% reporting inter-rater reliability and 41% reporting intra-rater reliability. Only four studies in this review reported test-retest reliability. Ten studies reported validity results; content validity was cited in 41% of these studies. Construct validity was reported in 24% of studies, while criterion validity was only reported in 12% of studies. Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review
Bernard, Johnathan A; Dattilo, Jonathan R; Srikumaran, Uma; Zikria, Bashir A; Jain, Amit; LaPorte, Dawn M
Traditional measures for evaluating resident surgical technical skills (e.g., case logs) assess operative volume but not level of surgical proficiency. Our goal was to compare the reliability and validity of 3 tools for measuring surgical skill among orthopedic residents when performing 3 open surgical approaches to the shoulder. A total of 23 residents at different stages of their surgical training were tested for technical skill pertaining to 3 shoulder surgical approaches using the following measures: Objective Structured Assessment of Technical Skills (OSATS) checklists, the Global Rating Scale (GRS), and a final pass/fail assessment determined by 3 upper extremity surgeons. Adverse events were recorded. The Cronbach α coefficient was used to assess reliability of the OSATS checklists and GRS scores. Interrater reliability was calculated with intraclass correlation coefficients. Correlations among OSATS checklist scores, GRS scores, and pass/fail assessment were calculated with Spearman ρ. Validity of OSATS checklists was determined using analysis of variance with postgraduate year (PGY) as a between-subjects factor. Significance was set at p shoulder approaches. Checklist scores showed superior interrater reliability compared with GRS and subjective pass/fail measurements. GRS scores were positively correlated across training years. The incidence of adverse events was significantly higher among PGY-1 and PGY-2 residents compared with more experienced residents. OSATS checklists are a valid and reliable assessment of technical skills across 3 surgical shoulder approaches. However, checklist scores do not measure quality of technique. Documenting adverse events is necessary to assess quality of technique and ultimate pass/fail status. Multiple methods of assessing surgical skill should be considered when evaluating orthopedic resident surgical performance. Copyright Â© 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights
Ulusoy, Hatice; Güler, Güngör; Yıldırım, Gülay; Demir, Ecem
Developing professional values among nursing students is important because values are a significant predictor of the quality care that will be provided, the clients' recognition, and consequently the nurses' job satisfaction. The literature analysis showed that there is only one validated tool available in Turkish that examines both the personal and the professional values of nursing students. The aim of this study was to assess the reliability and validity of the Salford-Scott Nursing Values Questionnaire in Turkish. This study was a Turkish linguistic and cultural adaptation of a research tool. Participants and research context: The sample of this study consisted of 627 undergraduate nursing students from different geographical areas of Turkey. Two questionnaires were used for data collection: a socio-demographic form and the Salford-Scott Nursing Values Questionnaire. For the Salford-Scott Nursing Values Questionnaire, construct validity was examined using factor analyses. Ethical considerations: The study was approved by the Cumhuriyet University Faculty of Medicine Research Ethics Board. Students were informed that participation in the study was entirely voluntary and anonymous. Item content validity index ranged from 0.66 to 1.0, and the total content validity index was 0.94. The Kaiser-Meyer-Olkin measure of sampling was 0.870, and Bartlett's test of sphericity was statistically significant (x 2 = 3108.714, p < 0.001). Construct validity was examined using factor analyses and the six factors were identified. Cronbach's alpha was used to assess the internal consistency reliability and the value of 0.834 was obtained. Our analyses showed that the Turkish version of Salford-Scott Nursing Values Questionnaire has high validity and reliability.
Arian Nahad, Homa; Rouzbahani, Masomeh; Jarollahi, Farnoush; Jalaie, Shohreh; Pourbakht, Akram; Mokrian, Helnaz; Mahdi, Parvane; Amali, Amin; Nodin Zadeh, Abdolmajid
Tinnitus is a common otologic symptom that can seriously affect a patient's quality of life. The purpose of the present study was to translate and validate the Iowa Tinnitus Handicap Questionnaire (THQ) into the Persian language, and to make it applicable as a tool for determining the effects of tinnitus on a patient's life. The main version of the THQ was translated into the Persian language. The agreed Persian version was administered to 150 tinnitus patients. The validity of the Persian THQ was evaluated and internal reliability was confirmed using Cronbach's α-coefficient. Finally, the effect of independent variables such as age, mean patient threshold, gender, and duration of tinnitus were considered in order to determine the psychometric properties of tinnitus. After an exact translation process, the Persian THQ was found to exhibit face validity. In terms of content validity, content validity index in total questionnaire was 0.93. Further, in structural validity measurements, intermediate correlation with annoyance from tinnitus (r=0.49), low correlation with duration of tinnitus (r=0.34) and high correlation with the Tinnitus Handicap Inventory (THI) questionnaire (r=0.84) were demonstrated. Additionally, a negligible effect of gender and age was noted on degree of tinnitus handicap (P= 0.754, P= 0.573, respectively). In the internal reliability assessment for Factors 1, 2, 3, and the whole questionnaire, Cronbach`s α-coefficient was 0.95, 0.92, 0.25 and 0.88, respectively. The Persian version of the Iowa THQ demonstrates high validity and reliability and can be used for the determination of tinnitus handicap and for following-up in the intervention process in Persian tinnitus patients.
Polat, Cansu; Tülek, Zeliha; Kürtüncü, Murat; Eraksoy, Mefkure
This research was conducted to adapt the Monitoring My Multiple Sclerosis (MMMS) scale, which is a scale used for self-evaluation by multiple sclerosis (MS) patients of their own health and quality of life, to Turkish and to determine the psychometric properties of the scale. The methodological research was conducted in the outpatient MS clinic of a university hospital between January and September 2013. The sample in this study consisted of 140 patients aged above 18 who had a diagnosis of definite MS. Patients who experienced attacks in the previous month or had any serious medical problems other than MS were not included in the group. The linguistic validity of MMMS was tested by a backward-forward translation method and an expert panel. Reliability analysis was performed using test-retest correlations, item-total correlations, and internal consistency analysis. Confirmatory factor analysis and concurrent validity were used to determine the construct validity. The Multiple Sclerosis Quality of Life-54 instrument was used to determine concurrent validity and the Expanded Disability Status Scale, Hospital Anxiety and Depression Scale, and Mini Mental State Examination were used for further determination of the construct validity. We determined that the scale consisted of four factors with loadings ranging from 0.49 to 0.79. The correlation coefficients of the scale were determined to be between 0.47 and 0.76 for item-total score and between 0.60 and 0.81 for items and subscale scores. Cronbach's alpha coefficient was determined to be 0.94 for the entire scale and between 0.64 and 0.89 for the subscales. Test-retest correlations were significant. Correlations between MMMS and other scales were also found to be significant. The Turkish MMMS provides adequate validity and reliability for assessing the impact of MS on quality of life and health status in patients.
Chuang, Li-Ling; Chuang, Yu-Fen; Hsu, Miao-Ju; Huang, Ying-Zu; Wong, Alice M. K.
Background Fatigue is a common symptom in the general population and has a substantial effect on individuals’ quality of life. The Multidimensional Fatigue Inventory (MFI) has been widely used to quantify the impact of fatigue, but no Traditional Chinese translation has yet been validated. The goal of this study was to translate the MFI from English into Traditional Chinese (‘the MFI-TC’) and subsequently to examine its validity and reliability. Methods The study recruited a convenience sample of 123 people from various age groups in Taiwan. The MFI was examined using a two-step process: (1) translation and back-translation of the instrument; and (2) examination of construct validity, convergent validity, internal consistency, test-retest reliability, and measurement error. The validity and reliability of the MFI-TC were assessed by factor analysis, Spearman rho correlation coefficient, Cronbach’s alpha coefficient, intraclass correlation coefficient (ICC), minimal detectable change (MDC), and Bland-Altman analysis. All participants completed the Short-Form-36 Health Survey Taiwan Form (SF-36-T) and the Chinese version of the Pittsburgh Sleep Quality Index (PSQI) concurrently to test the convergent validity of the MFI-TC. Test-retest reliability was assessed by readministration of the MFI-TC after a 1-week interval. Results Factor analysis confirmed the four dimensions of fatigue: general/physical fatigue, reduced activity, reduced motivation, and mental fatigue. A four-factor model was extracted, combining general fatigue and physical fatigue as one factor. The results demonstrated moderate convergent validity when correlating fatigue (MFI-TC) with quality of life (SF-36-T) and sleep disturbances (PSQI) (Spearman's rho = 0.68 and 0.47, respectively). Cronbach’s alpha for the MFI-TC total scale and subscales ranged from 0.73 (mental fatigue subscale) to 0.92 (MFI-TC total scale). ICCs ranged from 0.85 (reduced motivation) to 0.94 (MFI-TC total scale), and
Mezzich, Juan E; Cohen, Neal L; Ruiperez, Maria A; Banzato, Claudio E M; Zapata-Vega, Maria I
Quality of life has emerged as a crucial concept for the assessment of health and the planning of health care. Desirable features for the evaluation of quality of life include comprehensiveness, self-ratedness, cultural sensitivity, practicality and psychometric soundness. An attempt to meet these challenges led to the development of a brief multicultural quality of life instrument and to the appraisal of its applicability, reliability and validity. The development of the proposed assessment instrument was based on a wide review of the literature and the engagement of a multicultural mental health scholarly team. Its validation was conducted on samples of psychiatric patients (n = 124) and hospital professionals (n = 53) in New York City. A new generic culture-informed and self-rate instrument, the Multicultural Quality of Life Index, has been developed. Its 10 items cover key aspects of the concept, from physical well-being to spiritual fulfilment. Concerning its applicability, mean time for completion was less than 3 minutes and 96% of raters found it easy to use. Test-retest reliability was high (r = 0.87). A Cronbach's α of 0.92 documented its internal consistency and a factor analysis revealed a strong structure. With regard to discriminant validity, a highly significant difference was found between the mean total scores of professionals (x = 8.41) and patients (x = 6.34) presumed to have different levels of quality of life. The Multicultural Quality of Life Index is a brief and culturally informed instrument that appears to be easy to complete, reliable, internally consistent and valid. © 2011 Blackwell Publishing Ltd.
Matejka, L.A. Jr.
Environmental data validation consists of an assessment of three major areas: analytical method validation; field procedures and documentation review; evaluation of the level of achievement of data quality objectives based in part on PARCC parameters analysis and expected applications of data. A program utilizing matrix association of required levels of validation effort and analytical levels versus applications of this environmental data was developed in conjunction with DOE-ID guidance documents to implement actions under the Federal Facilities Agreement and Consent Order in effect at the Idaho National Engineering Laboratory. This was an effort to bring consistent quality to the INEL-wide Environmental Restoration Program and database in an efficient and cost-effective manner. This program, documenting all phases of the review process, is described here
Nicholas A. Petrunoff
Full Text Available Background. The purpose of this study was to assess the (previously untested reliability and validity of survey questions commonly used to assess travel mode and travel time. Methods. Sixty-five respondents from a staff survey of travel behaviour conducted in a south-western Sydney hospital agreed to complete a travel diary for a week, wear an accelerometer over the same period, and twice complete an online travel survey an average of 21 days apart. The agreement in travel modes between the self-reported online survey and travel diary was examined with the kappa statistic. Spearman’s correlation coefficient was used to examine agreement of travel time from home to workplace measured between the self-reported online survey and four-day travel diary. Moderate-to-vigorous physical activity (MVPA time of active and nonactive travellers was compared by t-test. Results. There was substantial agreement between travel modes (K=0.62, P<0.0001 and a moderate correlation for travel time (ρ=0.75, P<0.0001 reported in the travel diary and online survey. There was a high level of agreement for travel mode (K=0.82, P<0.0001 and travel time (ρ=0.83, P<0.0001 between the two travel surveys. Accelerometer data indicated that for active travellers, 16% of the journey-to-work time is MVPA, compared with 6% for car drivers. Active travellers were significantly more active across the whole workday. Conclusions. The survey question “How did you travel to work this week? If you used more than one transport mode specify the one you used for the longest (distance portion of your journey” is reliable over 21 days and agrees well with a travel diary.
Full Text Available Backround and design. Internalized stigma involves endorsing negative feelings and beliefs such as insignificance, shame and withdrawal triggered by applying these negative stereotypes to one self. Internalized Stigma Scale has not been applied to psoriasis patients. We aimed to evaluate the reliability and validity of Internalized Stigma Scale in psoriasis patients. Materials and Methods. 100 consecutive, volunteer psoriasis patients (48 female, 52 male; aged, 40.59±15.44 years were enrolled in the study. PASI and BSA were evaluated by physician (A.B.. Patients responded contemporaneously to Psoriasis Internalized Stigma Scale (PISS, DQoL, and Perceived Health Status (PHS, single-item self-rated general health question, of which Likert scores 1, 2, and 3 were classified as “from fair to very poor”, and 4, 5 as “good”. Results. Cronbach's alpha coefficient of PISS subscales was 0.83 for alienation, 0.70 for stereotype endorsement, 0.70 for perceived discrimination, 0.84 for social withdrawal and 0.68 for stigma resistance. The same value was 0.89 for the total scale. PISS and DQoL scores mean values were 58.8±12.6 and 10.0±9.4, respectively. PISS was significantly correlated with the patients' DQoL scores (r=,726, p=0,001. PISS was also significantly correlated with disease duration (r=,209, p=0,047. There was no any significant relationship between PASI or BSA and PISS. Mean DQoL scores in patients reporting their PHS as “from fair to very poor” and “good” were 12.1±7.3 and 5.0±4.3, respectively. Mean values of PISS in patients reporting their PHS as “from fair to very poor” was significantly increased compared with patients reporting their PHS as “good” (p=0.001. Conclusion. PISS can be used as a reliable and valid tool in assesing internalized stigmatization in psoriasis patients. Our results indicate a high level of stigmatization in psoriasis patients. Low DQoL scores show a correlation with increased levels of
Ivan Radman, Lana Ruzic, Viktoria Padovan, Vjekoslav Cigrovski, Hrvoje Podnar
Full Text Available This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male were randomized into two groups (competitive level vs. recreational level. They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]. In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%] and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]. The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2 revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p < 0.01 to complete the test in comparison to recreational-level skaters. Moreover, moderate correlation (ρ = 0.80–0.82; all p < 0.01 was observed between the participant’s self-rating and achieved performance times. In conclusion, the proposed test is a reliable and valid method to evaluate inline skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K
To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Full Text Available Background: The agreement of new instruments or clinical tests with other instruments or tests defines the possibility of these being used interchangeably. Aim: To investigate the validity and reliability of the SW-100 autokeratometer using a Bausch & Lomb (B&L keratometer as the ‘gold standard’. Methods: Eighty subjects (80 right eyes aged between 21 and 38 years were recruited. For intra-test repeatability, two measurements of the corneal radius of curvature were taken with the SW-100 and B&L keratometers. Forty of the 80 subjects participated in the inter-test repeatability measurement. Results: Corneal radius of curvature was found to be statistically different between the two instruments (p < 0.001, with the SW-100 providing slightly flatter values of 0.11 mm and 0.05 mm for the horizontal and vertical meridians, respectively, than the B&L keratometer. The average corneal curvature was 0.07 mm flatter with the SW-100 autokeratometer than with the B&L device. Agreement between the SW-100 and B&L keratometers’ axes was 45% within ± 5°, 60.3% within ± 10°, 78.8% within ± 15°, 80.3% within ± 20°, and 88.7% within ± 40°. Intertest repeatability was better for the B&L device than the SW-100 and showed no significant difference between the two sessions. Both instruments demonstrated comparable intrasession repeatability. As such, both instruments were comparatively reliable (per coefficients of repeatability. The range of limits of agreement of ± 0.14 mm (horizontal meridian and ± 0.17 mm (vertical meridian between the SW-100 and B&L devices showed good agreement. Conclusion: The results suggest that the SW-100 autokeratometer is a reliable and objective instrument that, however, provides flatter radii of curvature measurements than the B&L keratometer. A compensating factor incorporated into the instrument could reduce the difference between the two instruments and make them more interchangeable.
Paulus, David C; Reynolds, Michael C; Schilling, Brian K
During the concentric portion of the free-weight squat exercise, accelerating the mass from rest results in a fluctuation in ground reaction force. It is characterized by an initial period of force greater than the load while accelerating from rest followed by a period of force lower than the external load during negative acceleration. During the deceleration phase, less force is exerted and muscles are loaded sub-optimally. Thus, using a reduced inertia form of resistance such as pneumatics has the capability to minimize these inertial effects as well as control the force in real time to maximize the force exerted over the exercise cycle. To improve the system response of a preliminary design, a squat device was designed with a reduced mass barbell and two smaller pneumatic cylinders. The resistance was controlled by regulating cylinder pressure such that it is capable of adjusting force within a repetition to maximize force exerted during the lift. The resistance force production of the machine was statically validated with the input voltage and output force R2 =0.9997 for at four increments of the range of motion, and the intraclass correlation coefficient (ICC) between trials at the different heights equaled 0.999. The slew rate at three forces was 749.3 N/s +/- 252.3. Dynamic human subject testing showed the desired input force correlated with average and peak ground reaction force with R2 = 0.9981 and R2 = 0.9315, respectively. The ICC between desired force and average and peak ground reaction force was 0.963. Thus, the system is able to deliver constant levels of static and dynamic force with validity and reliability. Future work will be required to develop the control strategy required for real-time control, and performance testing is required to determine its efficacy.
Anderson, R.T.; Lauffenburger, H.A.
The techniques and practices utilized in an allied industry (electric power generation) that might serve as a baseline for formulating Quality Assurance and Reliability (QA and R) procedures for photovoltaic solar energy systems were studied. The study results provide direct near-term input for establishing validation methods as part of the SERI performance criteria and test standards development task.
Full Text Available Objective: To translate the Perceived Stress Scale (versions PSS-4, -10 and -14 and to assess its psychometric properties in a sample of general Greek population. Methods: 941 individuals completed anonymously questionnaires comprising of PSS, the Depression Anxiety and Stress scale (DASS-21 version, and a list of stress-related symptoms. Psychometric properties of PSS were investigated by confirmatory factor analysis (construct validity, Cronbach’s alpha (reliability, and by investigating relations with the DASS-21 scores and the number of symptoms, across individuals’ characteristics. The two-factor structure of PSS-10 and PSS-14 was confirmed in our analysis. We found satisfactory Cronbach’s alpha values (0.82 for the full scale for PSS-14 and PSS-10 and marginal satisfactory values for PSS-4 (0.69. PSS score exhibited high correlation coefficients with DASS-21 subscales scores, meaning stress (r = 0.64, depression (r = 0.61, and anxiety (r = 0.54. Women reported significantly more stress compared to men and divorced or widows compared to married or singled only. A strong significant (p < 0.001 positive correlation between the stress score and the number of self-reported symptoms was also noted. Conclusions: The Greek versions of the PSS-14 and PSS-10 exhibited satisfactory psychometric properties and their use for research and health care practice is warranted.
Andreou, Eleni; Alexopoulos, Evangelos C; Lionis, Christos; Varvogli, Liza; Gnardellis, Charalambos; Chrousos, George P; Darviri, Christina
To translate the Perceived Stress Scale (versions PSS-4, -10 and -14) and to assess its psychometric properties in a sample of general Greek population. 941 individuals completed anonymously questionnaires comprising of PSS, the Depression Anxiety and Stress scale (DASS-21 version), and a list of stress-related symptoms. Psychometric properties of PSS were investigated by confirmatory factor analysis (construct validity), Cronbach's alpha (reliability), and by investigating relations with the DASS-21 scores and the number of symptoms, across individuals' characteristics. The two-factor structure of PSS-10 and PSS-14 was confirmed in our analysis. We found satisfactory Cronbach's alpha values (0.82 for the full scale) for PSS-14 and PSS-10 and marginal satisfactory values for PSS-4 (0.69). PSS score exhibited high correlation coefficients with DASS-21 subscales scores, meaning stress (r = 0.64), depression (r = 0.61), and anxiety (r = 0.54). Women reported significantly more stress compared to men and divorced or widows compared to married or singled only. A strong significant (p < 0.001) positive correlation between the stress score and the number of self-reported symptoms was also noted. The Greek versions of the PSS-14 and PSS-10 exhibited satisfactory psychometric properties and their use for research and health care practice is warranted.
Lillo-Bevia, José R; Pallarés, Jesús G
To validate the new drive indoor trainer Hammer designed by Cycleops®. Eleven cyclists performed 44 randomized and counterbalanced graded exercise tests (100-500W), at 70, 85 and 100 rev.min -1 cadences, in seated and standing positions, on 3 different Hammer units, while a scientific SRM system continuously recorded cadence and power output data. No significant differences were detected between the three Hammer devices and the SRM for any workload, cadence, or pedalling condition (P value between 1.00 and 0.350), except for some minor differences (P 0.03 and 0.04) found in the Hammer 1 at low workloads, and for Hammer 2 and 3 at high workloads, all in seated position. Strong ICCs were found between the power output values recorded by the Hammers and the SRM (≥0.996; P=0.001), independently from the cadence condition and seated position. Bland-Altman analysis revealed low Bias (-5.5-3.8) and low SD of Bias (2.5-5.3) for all testing conditions, except marginal values found for the Hammer 1 at high cadences and seated position (9.6±6.6). High absolute reliability values were detected for the 3 Hammers (150-500W; CVreliable device to drive and measure power output in cyclists, providing an alternative to larger and more expensive laboratory ergometers, and allowing cyclists to use their own bicycle.
Salacinski, Amanda J; Alford, Micah; Drevets, Kathryn; Hart, Sarah; Hunt, Brian E
As an appealing alternative to reference glucose analyzers, portable glucometers are recommended for self-monitoring at home, in the field, and in research settings. The purpose was to characterize the accuracy and precision, and bias of glucometers in biomedical research. Fifteen young (20-36 years; mean = 24.5), moderately to highly active men (n = 10) and women (n = 5), defined by exercising 2 to 3 times a week for the past 6 months, were given an oral glucose tolerance test (OGTT) after an overnight fast. Participants ingested 50, 75, or 150 grams of glucose over a 5-minute period. The glucometer was compared to a reference instrument. The glucometer had 39% of values within 15% of measurements made using the reference instrument ranging from 45.05 to 169.37 mg/dl. There was both a proportional (-0.45 to -0.39) and small fixed (5.06 and 0.90 mg/dl) bias. Results of the present study suggest that the glucometer provided poor validity and reliability results compared to the results provided by the reference laboratory analyzer. The portable glucometers should be used for patient management, but not for diagnosis, treatment, or research purposes. © 2014 Diabetes Technology Society.
Full Text Available Objectives: The present study aims to investigate the validity and reliability of measuring dementia scale in a sample of elderly women has been reviewed. Methods & Materials: This study is descriptive based on what has been designed in which 310 elderly women were participated in this study out of two ranges of Tehran Kahrisak Prison and Tehran Jahandidegan Institute after being received the consent letter to participate in. In this study, samples were selected as available sampling and four questionnaires were used to collect information which was completed as an interview with elderly women. Information obtained has been analyzed using SPSS software and related statistical methods. Results: Based on research findings, mean age of people under study was 76/86 years and the scale reliability were reviewed in convergence method (Quality of Life and Elderly's Depression Questionnaire and divergence (The informant Questionnaire on Cognition Decline in Elderly questionnaire and Mine Mental Status Examination (MMSE. Also, scale validity was 0.792 with Cronbach's alpha method and 0.759 with division method and 76.0 with test-retest method. In addition, factor analysis indicates the saturation of this scale of a factor. Conclusion: The results achieved from this above study indicate that this scale has appropriate reliability and validity among elderly women. Therefore, it seems that planning to do this research will be very effective in clinical and complex population.
Tsoucalas, Gregory; Bourelia, Stamati; Kalogirou, Vaso; Giatsiou, Styliani; Mavrogiannaki, Eirini; Gatos, Georgios; Galanos, Antonis; Repana, Olga; Iliadou, Eleni; Antoniou, Antonis; Sgantzos, Markos; Gatos, Konstantinos
Fl oor effects are present in most dementia assessment tools as dementia progresses and the in-depth assessment of patients considered more or less on vegetative state is questionable. To develop a questionnaire (the "Gatos Clinical Test-GCT") for the assessment of end-stage demented patients. Five hundred patients with dementia of various causes and an MMSE score between 0 and 2 were enrolled in the study. The GCT consists of 14 closed type questions rated on a Likert scale. The total score is used to evaluate patient's dementia. Various aspects of validity and reliability (including face, content and structural validity as well as test-retest reliability) were examined. Three subscales "Autonomy/Alertness", "Gnosias" and "Somatokinetic function" were defined, with a Cronbach equal to 0.851, 0.756 and 0.598 respectively. The GCT subscales and total score were statistically significant higher in patients with MMSE score 1 or 2 compared with those with MMSE score 0 (pGATOS" questionnaire is a valid and reliable test for patients with severe dementia, aiming at identification of those patients who could sustain some quality of life. It is a relatively short and easy to administer tool. As dementia prevalence is expected to rise further worldwide we believe that GCT could offer valuable services to health professionals, caregivers and patients.
Clark, Ross A; Mentiplay, Benjamin F; Pua, Yong-Hao; Bower, Kelly J
The use of force platform technologies to assess standing balance is common across a range of clinical areas. Numerous researchers have evaluated the low-cost Wii Balance Board (WBB) for its utility in assessing balance, with variable findings. This review aimed to systematically evaluate the reliability and concurrent validity of the WBB for assessment of static standing balance. Articles were retrieved from six databases (Medline, SCOPUS, EMBASE, CINAHL, Web of Science, Inspec) from 2007 to 2017. After independent screening by two reviewers, 25 articles were included. Two reviewers performed the data extraction and quality assessment. Test-retest reliability was investigated in 12 studies, with intraclass correlation coefficients or Pearson's correlation values showing a range from poor to excellent reliability (range: 0.27 to 0.99). Concurrent validity (i.e. comparison with another force platform) was examined in 21 studies, and was generally found to be excellent in studies examining the association between the same outcome measures collected on both devices. For studies reporting predominantly poor to moderate validity, potentially influential factors included the choice of 1) criterion reference (e.g. not a common force platform), 2) test duration (e.g. balance. Protocol registration number: PROSPERO 2017: CRD42017058122. Copyright © 2018 Elsevier B.V. All rights reserved.
Dogan, Tayfun; Cetin, Bayram
The purpose of the present study was to investigate the reliability and validity of the Turkish version of the Tromso Social Intelligence Scale (TSIS) developed by Silvera, Martinussen, and Dahl (2001). 719 students from Sakarya University participated in the study. Construct validity and criterion related validity and reliability were assessed.…
Leticia de Matos Malavasi
Full Text Available The lack of adherence to practice physical activities urges several researchers to ind answers for this matter. Among these researches, it is investigated how or what motivates people to perform any type of physical activity. Besides that, the environmental conditions are an important reason to establish a healthier lifestyle among individuals. In Brazil, the amount of validated scales about environmental barriers for physical activity in communities is restricted. The validation and the cultural adaptation of these instruments are important not only to compare with studies from other countries, but mainly for planning public politics to improve the adherence to practice physical activities. Thus, the present research aimed to analyze the validity and reliability of the Brazilian version of the Neighborhood EnvironmentWalkability Scale (NEWS. The methodological procedures were structured in three stages. The first stage had the following procedures: translation of NEWS and back-translation by bilingual specialists. The second stage was the adaptation of NEWS to the Brazilian reality through a pilot study and with reliability. The third stage, together with a professional urban panel indicating which neighborhoods had better or worse mobility, it was accomplished a application of the NEWS questionnaire to assure construct validation. The sample of this research were separated in two parts, 75persons for the reliability; and for the validity of the questionnaire 200 residents from the four neighborhoods pointed by the specialists of the city of Florianópolis (SC. Through the NEWS the subjects answered questions about the neighborhoods regarding: type of residences, stores and trade proximity, perception of access to these places, streets characteristics, facilities to walk and ride bicycle, and safety related to traffic and crimes. The statistical analysis was made in the SPSS 11.0 version for the intra-class correlation and reliability for the
Simony, Ane; Carreon, Leah Y; Hansen, Karen Højmark
Study Design Cross-sectional. Objective To develop a psychometrically reliable and valid Danish version of the Spinal Appearance Questionnaire (SAQ). Summary of Background Data The SAQ was developed as a disease-specific measure of quality of life in patients with adolescent idiopathic scoliosis...... (AIS), specifically for younger patients, as it has more visual cues than verbal questions. A reliable and valid Danish Version is not available. Methods A Danish version of the SAQ was developed using previously published and widely accepted guidelines. The final Danish SAQ and the Danish SRS22-R were...... effect for SAQ Expectations. There was good to excellent internal consistency within each domain. Conclusion This purpose of this study was to translate and validate a Danish version of the SAQ. Although problems were identified with items 7 and 8, the Danish SAQ is reliable and valid....
Dias, D.C.S.; Silva, N.C.; Bonifácio, R.L.
Quality control practices are indispensable to organizations aiming to reach analytical excellence. Method validation is an essential component to quality systems in laboratories, serving as a powerful tool for standardization and reliability of outcomes. This paper presents a study of robustness conducted over a SSNTD technique validation process, with the goal of developing indoor radon measurements at the highest level of quality. This quality parameter indicates how well a technique is able to provide reliable results in face of unexpected variations along the measurement. In this robustness study, based on the Youden method, 7 analytical conditions pertaining to different phases of the SSNTD technique (with focus on detector etching) were selected. Based on the ideal values for each condition as reference, extreme levels regarded as high and low were prescribed to each condition. A partial factorial design of 8 unique etching procedures was defined, where each presented their own set of high and low condition values. The Youden test provided 8 indoor radon concentration results, which allowed percentage estimations that indicate the potential influence of each analytical condition on the SSNTD technique. As expected, detector etching factors such as etching solution concentration, temperature and immersion time were identified as the most critical parameters to the technique. Detector etching is a critical step in the SSNTD method – one that must be carefully designed during validation and meticulously controlled throughout the entire process. (author)
Dias, D.C.S.; Silva, N.C.; Bonifácio, R.L., E-mail: firstname.lastname@example.org [Comissao Nacional de Energia Nuclear (LAPOC/CNEN), Pocos de Caldas, MG (Brazil). Laboratorio de Pocos de Caldas
Quality control practices are indispensable to organizations aiming to reach analytical excellence. Method validation is an essential component to quality systems in laboratories, serving as a powerful tool for standardization and reliability of outcomes. This paper presents a study of robustness conducted over a SSNTD technique validation process, with the goal of developing indoor radon measurements at the highest level of quality. This quality parameter indicates how well a technique is able to provide reliable results in face of unexpected variations along the measurement. In this robustness study, based on the Youden method, 7 analytical conditions pertaining to different phases of the SSNTD technique (with focus on detector etching) were selected. Based on the ideal values for each condition as reference, extreme levels regarded as high and low were prescribed to each condition. A partial factorial design of 8 unique etching procedures was defined, where each presented their own set of high and low condition values. The Youden test provided 8 indoor radon concentration results, which allowed percentage estimations that indicate the potential influence of each analytical condition on the SSNTD technique. As expected, detector etching factors such as etching solution concentration, temperature and immersion time were identified as the most critical parameters to the technique. Detector etching is a critical step in the SSNTD method – one that must be carefully designed during validation and meticulously controlled throughout the entire process. (author)
Negahban, Hossein; Mazaheri, Masood; Salavati, Mahyar; Sohani, Soheil Mansour; Askari, Marjan; Fanian, Hossein; Parnianpour, Mohamad
The aims of this study were to culturally adapt and validate the Persian version of Foot and Ankle Outcome Score (FAOS) and present data on its psychometric properties for patients with different foot and ankle problems. The Persian version of FAOS was developed after a standard forward-backward translation and cultural adaptation process. The sample included 93 patients with foot and ankle disorders who were asked to complete two questionnaires: FAOS and Short-Form 36 Health Survey (SF-36). To determine test-retest reliability, 60 randomly chosen patients completed the FAOS again 2 to 6 days after the first administration. Test-retest reliability and internal consistency were assessed using intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. To evaluate convergent and divergent validity of FAOS compared to similar and dissimilar concepts of SF-36, the Spearman's rank correlation was used. Dimensionality was determined by assessing item-subscale correlation corrected for overlap. The results of test-retest reliability show that all the FAOS subscales have a very high ICC, ranging from 0.92 to 0.96. The minimum Cronbach's alpha level of 0.70 was exceeded by most subscales. The Spearman's correlation coefficient for convergent construct validity fell within 0.32 to 0.58 for the main hypotheses presented a priori between FAOS and SF-36 subscales. For dimensionality, the minimum Spearman's correlation coefficient of 0.40 was exceeded by most items. In conclusion, the results of our study show that the Persian version of FAOS seems to be suitable for Iranian patients with various foot and ankle problems especially lateral ankle sprain. Future studies are needed to establish stronger psychometric properties for patients with different foot and ankle problems.
Disease-specific questionnaire for quality of life in patients with peripheral arterial occlusive disease in the stage of critical ischemia (FLeQKI) - methodical development of a specific measuring instrument and psychometric evaluation of its validity and reliability. Pt. 1
Wohlgemuth, W.A.; Bohndorf, K.; Kirchhof, K.; Olbricht, W.; Klarmann, S.; Engelhardt, M.; Freitag, M.H.; Woelfle, K.
Purpose: To develop a disease-specific measuring instrument for quality of life in German-speaking patients with peripheral arterial occlusive disease in the stage of critical ischemia and to test it in a prospective study for validity and reliability. Materials and methods: We developed a questionnaire compiling items representing subjective disease relevant health states. With 35 of these items, we designed the scales comorbidity (KO), physical pain (SZ), physical functioning (KF), physical state (KS), social functioning (SB), mental health (PB), and therapy-induced limitations (TE). Each item was to be valued as never, seldom, often or always. The scales were standardized with a control group of 40 individuals without peripheral arterial occlusive disease who were interviewed twice in an interval of 6 months using both the FLeQKI and the Medical Outcomes Study Group Short Form 36 (SF-36). Convergent and discriminative validity was determined in 65 consecutive in-patients with peripheral occlusive arterial disease in the stage of critical ischemia who were interviewed with FLeQKI and SF-36 prior to percutaneous transluminal angioplasty (PTA) or bypass operation and 1 month and 6 months after. The internal consistency and test-retest reliability of the FLeQKI were determined in the control group. For statistical analysis, Cronbach's α Test and Pearsons Product Moment Correlation were used. Results: The control group consisted of 21 men and 19 women with an age of 73.4 ± 7.8, and the treatment group was comprised of 35 men and 30 women with an age of 75.1 ± 7.0. In the treatment group, convergent validity reached high values in the scales SB, KF, PB, and SZ (r = 0.41 - 0.70). With their discriminative validity (r = -0.04 - 0.30), TE and KS were independent, specific dimensions of life quality. The control group showed good values for internal consistency (Cronbach's α = 0.54 - 0.93) and for test-retest reliability (r = 0.44 - 0.96). Conclusion: The FLeQKI is
Sardenberg, Fernanda; Oliveira, Ana Cristina; Paiva, Saul M; Auad, Sheyla Márcia; Vale, Miriam P
Oral health-related quality of life (OHRQoL) is an important aspect of health outcomes and its assessment should be made using validated instruments. The psychosocial impact of dental aesthetics questionnaire (PIDAQ) is an OHRQoL instrument that assesses the psychosocial impact of dental aesthetics was developed and validated for use on young adults. The aim of the present study was to assess the reliability, validity, and applicability of the PIDAQ for young adults in Brazil. After translation and cross-cultural adaptation, the questionnaire was completed by 245 individuals (124 males and 121 females) aged 18-30 years from the city of Belo Horizonte, Brazil. In order to test discriminant validity, the subjects were examined for the presence or absence of malocclusion based on the dental aesthetic index criteria. Dental examinations were carried out by a previously calibrated examiner [weighted kappa = 0.64-1.00, intraclass correlation coefficient (ICC) = 0.78-1.00]. Internal consistency measured by Cronbach's alpha of the subscales was between 0.75 and 0.91 and test-retest reliability was assessed using the ICC, which ranged from 0.89 to 0.99 for dental self-confidence and social impact, thereby revealing satisfactory reliability. Discriminant validity revealed that subjects without malocclusion had different PIDAQ scores when compared with those with malocclusion. The results suggest that the Brazilian version of the PIDAQ has satisfactory psychometric properties and is thus applicable to young adults in Brazil. Further research is needed to assess these properties in population studies.
Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E
Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.
Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje
This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8-2.6%] - 2.2% [95% CI: 0.0-4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2-2.4%] - 2.7% [95% CI: 2.1-4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92-0.99] - 0.99 [95% CI: 0.98-1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters' performances. Competitive-level skaters needed shorter time (24.4-26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
Shioda, Ai; Tadaka, Etsuko; Okochi, Ayako
Background Community integration is an essential right for people with schizophrenia that affects their well-being and quality of life, but no valid instrument exists to measure it in Japan. The aim of the present study is to develop and evaluate the reliability and validity of the Japanese version of the Community Integration Measure (CIM) for people with schizophrenia. Methods The Japanese version of the CIM was developed as a self-administered questionnaire based on the original version of...
Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J
To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; particles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
The challenge of maintaining power quality and reliability in Ontario's new competitive electricity market was the focus of this paper. The intention of Ontario's deregulation is to lower costs and give customers more power and choice. However, some fear that costs pressures may lead to poorer quality. This paper reviewed the causes, costs, and responsibilities, and made some suggestions concerning how customers can protect their interests as the market opens. It was suggested that in order to assure power quality, especially during the transition period, customers need to take care to design their systems and equipment to tolerate some quality variations. Conversely, utilities need to strive for at least minimum industry standards under most conditions. Contracts should be carefully crafted, with responsibilities for reliability and quality clearly stated. 2 refs
This monograph offers a thorough and updated guide to the theory and methods of progressive censoring, an area that has experienced tremendous growth in recent years. Progressive censoring, originally proposed in the 1950s, is an efficient method of handling samples from industrial experiments involving lifetimes of units that have either failed or censored in a progressive fashion during the life test, with many practical applications to reliability and quality. Key topics and features: Data sets from the literature as well as newly simulated data sets are used to illustrate concepts throughout the text Emphasis on real-life applications to life testing, reliability, and quality control Discussion of parametric and nonparametric inference Coverage of experimental design with optimal progressive censoring The Art of Progressive Censoring is a valuable reference for graduate students, researchers, and practitioners in applied statistics, quality control, life testing, and reliability. With its accessible style...
Bendo Cristiane B
Full Text Available Abstract Background Oral and orofacial problems may cause a profound impact on children’s oral health-related quality of life (OHRQoL because of symptoms associated with these conditions that may influence the physical, psychological and social aspects of their daily life. The OHRQoL questionnaires found in the literature are very specific and are not able to measure the impact of oral health on general health domains. Consequently, the objective of this study was to evaluate the psychometric properties of the Portuguese version for Brazilian translation of the Pediatric Quality of Life Inventory™ (PedsQL™ Oral Health Scale in combination with the PedsQL™ 4.0 Generic Core Scales. Methods The PedsQL™ Oral Health Scale was forward-backward translated and cross-culturally adapted for the Brazilian Portuguese language. In order to assess the feasibility, reliability and validity of the Brazilian version of the instrument, a study was carried out in Belo Horizonte with 208 children and adolescents between 2 and 18 years-of-age and their parents. Clinical evaluation of dental caries, socioeconomic information and the Brazilian versions of the PedsQL™ Oral Health Scale, PedsQL™ 4.0 Generic Core Scales, Child Perceptions Questionnaire (CPQ11-14 and CPQ8-10 and Parental-Caregiver Perception Questionnaire (P-CPQ were administered. Statistical analysis included feasibility (missing values, confirmatory factor analysis (CFA, internal consistency reliability, and test-retest intraclass correlation coefficients (ICC of the PedsQL™ Oral Health Scale. Results There were no missing data for both child self-report and parent proxy-report on the Brazilian version of the PedsQL™ Oral Health Scale. The CFA showed that the five items of child self-report and parent proxy-report loaded on a single construct. The Cronbach's alpha coefficients for child/adolescent and parent oral health instruments were 0.65 and 0.59, respectively. The test
Tander, Berna; Ulus, Yasemin; Terzi, Yüksel; Zahiroğlu, Yeliz; Kesmen, Hakan; Farisoğullari, Bayram; Akyol, Yeşim; Bilgici, Ayhan; Kuru, Ömer
This study aims to evaluate the reliability and validity of the Turkish language version of VITACORA-19 (psoriatic arthritis quality of life questionnaire) in patients with psoriatic arthritis. The Turkish version of VITACORA-19 questionnaire was obtained after a translation and back translation process. The study sample included 61 PsA patients (22 males, 39 females; mean age 46.5±12.2 years; range 19 to 71 years). To assess the test-retest reliability of the Turkish VITACORA-19, the questionnaire was reapplied 10 to 15 days after the first interview (interclass correlation coefficient). Cronbach's alpha (a) was used to evaluate the internal consistency. VITACORA-19 was compared with visual analog scale for physician and patient global assessments, the Health Assessment Questionnaire, and Nottingham Health Profile for construct validity. The internal structure of VITACORA-19 was examined by factor analysis. The individual item intraclass correlation coefficient ranged from 0.77 to 0.98 and Cronbach's alpha ranged from 0.77 to 0.98. The Cronbach's alpha value for whole scale was determined as 0.96. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.90, and Bartlett's test of sphericity had a p<0.001. Turkish VITACORA-19 total scores were correlated negatively with Health Assessment Questionnaire, visual analog scale for pain, and Nottingham Health Profile subgroups, and positively with physician and patient global assessments (p<0.01). Turkish version of VITACORA-19 questionnaire is a reliable and valid measure for health-related quality of life in Turkish patients with psoriatic arthritis.
Asadollahpour, Faezeh; Baghban, Kowsar; Asadi, Mozhgan
The Dysphagia Handicap Index (DHI) is one of the instruments used for measuring a dysphagic patient's self-assessment. In some ways, it reflects the patient's quality of life. Although it has been recognized and widely applied in English speaking populations, it has not been used in its present forms in Persian speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Some stages for cross-cultural adaptation were performed, which consisted in translation, synthesis, back translation, review by an expert committee, and final proof reading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. The Persian DHI showed good internal consistency (Cronbach's alpha coefficients range from 0.82 to 0.94). Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89). There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P‹0.001). The Persian version of the DHI achieved Face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
Full Text Available Introduction: The Dysphagia Handicap Index (DHI is one of the instruments used for measuring a dysphagic patient’s self-assessment. In some ways, it reflects the patient’s quality of life. Although it has been recognized and widely applied in English speaking populations, it has not been used in its present forms in Persian speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Materials and Methods: Some stages for cross-cultural adaptation were performed, which consisted in translation, synthesis, back translation, review by an expert committee, and final proof reading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. Results: The Persian DHI showed good internal consistency (Cronbach’s alpha coefficients range from 0.82 to 0.94. Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89. There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P‹0.001. Conclusion: The Persian version of the DHI achieved Face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
Anderson-Butcher, Dawn; Iachini, Aidyn L.; Amorose, Anthony J.
Objective: This study describes the development and validation of a perceived social competence scale that social workers can easily use to assess children's and youth's social competence. Method: Exploratory and confirmatory factor analyses were conducted on a calibration and a cross-validation sample of youth. Predictive validity was also…
Suzuki, Asuka; Choi, So Yung; Lim, Eunjung; Tauyan, Socorro; Banna, Jinan C
To examine factorial validity, test-retest reliability, and internal consistency of a Tagalog-language food behavior checklist (FBC) for a low-income Filipino population. Participants (n = 160) completed the FBC on 2 occasions 3 weeks apart. Factor structure was examined using principal component analysis. For internal consistency, Cronbach α was calculated. For test-retest reliability, Spearman correlation or intraclass correlation coefficient (ICC) was calculated between scores at the 2 points. All but 1 item loaded on 6 factors: fruit and vegetable quantity, fruit and vegetable variety, fast food, sweetened beverage, healthy fat, and diet quality. Cronbach α was .75 for the total scale (range, .39-.76 for subscales). Spearman correlation was 0.78 (ICC, 0.79) for the total scale (range, 0.66-0.80 [ICC, 0.68-0.80] for subscales). The FBC demonstrated adequate factorial validity, test-retest reliability, and internal consistency. With additional testing, the FBC may be used to evaluate the US Department of Agriculture's nutrition education programs for Tagalog speakers. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Tülek, Zeliha; Polat, Cansu; Kürtüncü, Murat; Eraksoy, Mefkure
Multiple sclerosis (MS) is a disease that causes different symptoms in each attack and has an individual-specific course. Detailed questioning and recording of MS symptoms is important for developing a management plan for individual-specific symptoms. The present study was planned to evaluate the validity and reliability of the Turkish version of "Multiple Sclerosis-Related Symptom Checklist" (MS-RS), which has been developed for patients to personally follow-up the symptoms they experience. The study was conducted in the outpatient MS clinic of the Istanbul University Istanbul Faculty of Medicine between January and October 2013 and included a sample group of 148 patients who were aged >18 years, could easily communicate, had a definite diagnosis of MS, and had no other medical problems besides MS. The data were collected using patient information forms, including sociodemographic and MS-RS forms. To assess the linguistic validity, the Likert-type scale with 26 items was first applied to a group of 30 patients. Confirmatory factor analysis was used to test the construct validity. Furthermore, the correlation of the scale with the Expanded Disability Status Scale (EDSS), Hospital Anxiety and Depression Scale (HADS), Mini-Mental Status Evaluation (MMSE) scale, and Multiple Sclerosis Quality of Life Scale-54 (MSQL-54) was evaluated. The scale comprised five factors with factor loading values between 0.39 and 0.86. The item-total correlation coefficients revealed values of 0.27-0.88. The Cronbach's alpha reliability coefficient for the whole scale was determined to be 0.89 and for the subscales to be 0.60-0.85. The test-retest analysis revealed no difference between the scale and its subscales in terms of invariance with time (p>0.05). Moreover, MS-RS was significantly correlated with EDSS, HADS, MMSE, and MSQL-54. The Turkish version of MS-RS is a valid and reliable scale that can be used in the Turkish population.
Ito, Yuko; Sagara, Junko
A marital love scale was created to study the marital quality of middle-aged and elderly couples, and the scale's reliability and validity were examined. In this study, 888 middle-aged and elderly married participants completed the marital love scale questionnaire as well as answering questions regarding marriage satisfaction and husband-wife communication. In all age groups, men scored higher than women on the marital love scale. The marital love score gradually increased from the middle-aged to the senior period, and like the marriage satisfaction score, the marital love score showed a U-shaped curve in the whole married life. The results also showed that the scale was highly correlated with marriage satisfaction and spousal self-disclosure. Thus, the validity and internal consistency of the marital love scale were confirmed.
Full Text Available The purpose of this study is to develop a scale unique to our culture, concerning individual instrument performance anxiety of the students who are getting instrument training in the Department of Music Education. In the study, the descriptive research model is used and qualitative research techniques are utilized. The study population consists of the students attending the 23 universities which has Music Education Department. The sample of the study consists of 438 girls and 312 boys, totally 750 students who are studying in the Department of Music Education of randomly selected 10 universities. As a result of the explanatory and confirmatory factor analyses that were performed, a one-dimensional structure consisting of 14 items was obtained. Also, t-scores and the coefficient scores of total item correlation concerning the distinguishing power of the items, the difference in the scores of the set of lower and upper 27% was calculated, and it was observed that the items are distinguishing as a result of both analyses. Of the scale, Cronbach's alpha coefficient of internal consistency was calculated as .94, and test-retest reliability coefficient was calculated as .93. As a result, a valid and reliable assessment and evaluation instrument that measures the exam performance anxiety of the students studying in the Department of Music Education, has been developed.Extended AbstractsIntroductionAnxiety is a universal phenomenon which people experience once or a few times during lives. It was accepted as concern for the future or as an unpleasant emotional experience regarding probable hitches of the events (Di Tomasso & Gosch, 2002.In general, the occasions on which negative feelings are experienced cause anxiety to arise (Baltaş and Baltaş, 2000. People also feel anxious in dangerous situations. Anxiety may lead a person to be creative, while it may have hindering characteristics. Anxiety is that an individual considers him
Erdem, D.; Hava, D.; Keskinoglu, P.
cord injury (SCI). The reliability of NBD score was assessed by test-retest reliability and internal consistency. Cronbach's alpha coefficient was calculated to determine internal consistency. The construct validity was evaluated by exploring correlations between the NBD score and SF-36 scales, patient...... assessment of impact of NBD on quality of life (QoL) and the physician global assessment (PGA). The Global Rating of Change (GRC) scale was used to assess the change of NBD to investigate the sensitivity of the score to change. Results: Cronbach's alpha coefficient was 0.547. In test-retest reliability...
Mousavi, Sayed Javad; Mobini, Bahram; Mehdian, Hossein; Akbarnia, Behrouz; Bouzari, Behshad; Askary-Ashtiani, Ahmadreza; Montazeri, Ali; Parnianpour, Mohamad
Cross-sectional validation study to investigate psychometric properties of adapted Persian version of the Scoliosis Research Society-22r (SRS-22r) questionnaire. OBJECTIVES.: To translate the SRS-22r into Persian and to evaluate the internal consistency, reliability, and validity of the Persian SRS-22r. The SRS-22r has not been translated and validated for Persian-speaking patients with idiopathic scoliosis. This was to provide a validated instrument to measure health-related quality of life in patients with idiopathic scoliosis in Iran. The translation and cultural adaptation of the original questionnaire were carried out in accordance with the published guidelines. About 84 patients with adolescent idiopathic scoliosis were participated in the study. The Short Form Health Survey (SF-36) was used to test convergent validity of the Persian SRS-22r. Moderate to high correlations were found between the SRS-22r domains and SF-36 subscales. The correlations ranged from 0.54 to 0.67 (function/activity domain), 0.48 to 0.74 (pain domain), 0.45 to 0.55 (self image domain), 0.66 to 0.85 (mental health domain), and 0.35 to 0.55 (satisfaction domain) (P adolescent patients with scoliosis in Iran. It is simple and easy to use and now can be applied in clinical settings and future outcome studies in Iran.
Seyyede Zohreh Ziatabar Ahmadi
Full Text Available Objective: Theory of mind (ToM or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children.Method: We searched MEDLINE (PubMed interface, Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library databases from 1990 to June 2015. Search strategy was Latin transcription of ‘Theory of Mind’ AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks for ToM assessment and/or had no description about structure, validity or reliability of their tests. Methodological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP.Result: In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. Conclusion: There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric
Erkin, Özüm; Göl, İlknur
This study aims to measure the validity and reliability of Turkish male breast self-examination (MBSE) instrument. The methodological study was performed in 2016 at Ege University, Faculty of Nursing, İzmir, Turkey. The MBSE includes ten steps. For validity studies, face validity, content validity, and construct validity (exploratory factor analysis) were done. For reliability study, Kuder Richardson was calculated. The content validity index was found to be 0.94. Kendall W coefficient was 0.80 (p=0.551). The total variance explained by the two factors was found to be 63.24%. Kuder Richardson 21 was done for reliability study and found to be 0.97 for the instrument. The final instrument included 10 steps and two stages. The Turkish version of MBSE is a valid and reliable instrument for early diagnose. The MBSE can be used in Turkish speaking countries and cultures with two stages and 10 steps.
Ayvat, Ender; Kilinç, Muhammed; Kirdi, Nuray
This study aimed to describe the cultural adaptation of the Turkish Physical Activity Scale for the Elderly (PASE) and to examine the reliability and validity of the scale in older Turkish adults. Eighty elderly people were recruited for the study. The assessments included the PASE, the International Physical Activity Questionnaire (IPAQ), the Short Physical Performance Battery and Short Form-36 Quality of Life Questionnaire (SF-36), and the Mini Mental State Test. Outcome measures were conducted twice within a week (test-retest) for reliability. Cronbach's α coefficient was 0.714 for the initial evaluation. The intraclass correlation coefficient for the test-retest reliability was 0.995 with a 95% confidence interval of 0.993-0.997. A high level of positive correlation (0.742, P reliable and valid scale for the fields of research and practice.
Khang, Dongwoo; Rim, Hyo-Deog; Woo, Jungmin
The Body Image Scale (BIS) developed in collaboration with the European Organization for Research and Treatment of Cancer (EORTC) Quality of Life Study Group is a brief questionnaire for measuring body image concerns in patients with cancer. This study sought to assess the reliability and validity of the Korean version of the Body Image Scale (K-BIS). The participants consisted of 155 postoperative breast cancer patients (56 breast conserving surgery, 56 mastectomy, and 43 oncoplastic surgery). Subjects were evaluated using the K-BIS, the Body-Esteem Scale for Adolescents and Adults (BESAA), the Rosenberg Self-Esteem Scale (RSES), the Hospital Anxiety and Depression Scale (HADS), and the World Health Organization Quality of Life Scale Abbreviated Version (WHOQOL-BREF). Test-retest reliability and internal consistency were examined as a measure of reliability and validity was evaluated by convergent validity, discriminant validity and factor analysis. Cronbach's α value was 0.943. The total score of the K-BIS was negatively correlated with the BESAA (r=0.301, p59% variance. The K-BIS showed good reliability and validity for assessment of body image in Korean breast cancer patients.
Arce-Ferrer, Alvaro J.; Castillo, Irene Borges
The use of face-to-face interviews is controversial for college admissions decisions in light of the lack of availability of validity and reliability evidence for most college admission processes. This study investigated reliability and incremental predictive validity of a face-to-face postgraduate college admission interview with a sample of…
de Groot, Sonja; Balvers, Inge J.M.; Kouwenhoven, Sanne M.; Janssen, Thomas W.J.
The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
Nederhof, Esther; Brink, Michel S.; Lemmink, Koen A. P. M.
The purpose of the present study was to investigate the cross-cultural validity of the Recovery Stress Questionnaire for Athletes (RESTQ-sport) by analysing reliability and validity of a Dutch translation. Two studies were performed to assess test-retest reliability with a one week interval,
De Groot, Sonja; Balvers, Inge J. M.; Kouwenhoven, Sanne M.; Janssen, Thomas W. J.
The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
The present study aims to determine the validity and reliability of the academic resilience scale in Turkish high school. The participances of the study includes 378 high school students in total (192 female and 186 male). A set of analyses were conducted in order to determine the validity and reliability of the study. Firstly, both exploratory…
This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Raykov, Tenko; Marcoulides, George A.
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
Bhat, Mehraj A.
This paper is based on the construction and evaluation of reliability and validity of reasoning ability test at secondary school students. In this paper an attempt was made to evaluate validity, reliability and to determine the appropriate standards to interpret the results of reasoning ability test. The test includes 45 items to measure six types…
Markon, Kristian E.; Chmielewski, Michael; Miller, Christopher J.
In 2 meta-analyses involving 58 studies and 59,575 participants, we quantitatively summarized the relative reliability and validity of continuous (i.e., dimensional) and discrete (i.e., categorical) measures of psychopathology. Overall, results suggest an expected 15% increase in reliability and 37% increase in validity through adoption of a…
Worrell, Frank C.; Mello, Zena R.
In this study, the authors examined the reliability, structural validity, and concurrent validity of Zimbardo Time Perspective Inventory (ZTPI) scores in a group of 815 academically talented adolescents. Reliability estimates of the purported factors' scores were in the low to moderate range. Exploratory factor analysis supported a five-factor…
Smith, Jack E.; Hakel, Milton D.
Examined are questions pertinent to the use of the Position Analysis Questionnaire: Who can use the PAQ reliably and validly? Must one rely on trained job analysts? Can people having no direct contact with the job use the PAQ reliably and validly? Do response biases influence PAQ responses? (Author/KC)
Boonstra, Anne M.; Schiphorst Preuper, Henrica R.; Reneman, Michiel F.; Posthumus, Jitze B.; Stewart, Roy E.
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional
Boonstra, Anne M.; Reneman, Michiel F.; Stewart, Roy E.; Balk, Gerlof A.
The aim of this study was to determine the reliability and discriminant validity of the Dutch version of the life satisfaction questionnaire (Lisat-9 DV) to assess patients with an acquired brain injury. The reliability study used a test-retest design, and the validity study used a cross-sectional design. The setting was the general rehabilitation…
Mehmet Emrah Karadere
Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that Reasoning with Inductive Argument Test supports reliability and validity in Turkish population. [JCBPR 2013; 2(3.000: 156-161
Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin
The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. The instrument is
Prevention strategies are effective only when there are epidemiological data for the targeted populations. The collection of such .... Proquest, Sport discuss and Cochrane as these are ... 0.74, test retest reliability 0.70; Diet: internal consistency:.
Travel time reliability (TTR) has been proposed as : a better measure of a facilitys performance than : a statistical measure like peak hour demand. TTR : is based on more information about average traffic : flows and longer time periods, thus inc...
Hendriks, A A Jolijn; Smith, Sarah C; Chrysanthaki, Theopisti; Black, Nick
This study aimed to investigate the reliability and validity of a self-administered version of DEMQOL-Proxy, a disease-specific instrument that measures health-related quality of life in people with dementia. The sample consisted of 173 informal carers of people with dementia, aged 29 to 89 years old. Carers were mostly female, White/White British and closely related to the patient. They completed DEMQOL-Proxy (self-administered), EQ-5D-3L (proxy reported about the person with dementia), EQ-5D-3L (self-reported about their own health) and the Zarit Burden Interview. Using well-established methods from classical test theory, we evaluated scale level acceptability, reliability and convergent, discriminant and known-groups validity of DEMQOL-Proxy. DEMQOL-Proxy (self-administered) showed high acceptability (3.5% missing data and 0% scores at floor or ceiling), high internal consistency reliability (α = 0.93) and good convergent and discriminant validity. Amongst others, we found a moderately high correlation with EQ-5D-3L proxy reported (r = 0.52) and low to essentially zero correlations with EQ-5D-3L self-reported (r = 0.20) and carer and patient background variables (r ≤ 0.20). As predicted, DEMQOL-Proxy (self-administered) showed a modest correlation with DEMQOL (r = 0.32). Known-groups differences on health-related quality of life (comparing people with versus people without cognitive impairment) were of moderate effect size (d = 0.38) and in the expected direction. DEMQOL-Proxy (self-administered) has comparable acceptability, reliability and validity with DEMQOL-Proxy (interviewer administered). DEMQOL-Proxy (self-administered) can be used in a wider variety of contexts than its interviewer-administered version, including routine use in busy clinics. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
The EuReDatA Working Group produced a basic document that addressed many of the problems associated with the design of a suitable data collection scheme to achieve pre-defined objectives. The book that resulted from this work describes the need for reliability data, data sources and collection procedures, component description and classification, form design, data management, updating and checking procedures, the estimation of failure rates, availability and utilisation factors, and uncertainties in reliability parameters. (DG)
Chen, Stephanie Y; Griffin, Brianna M; Mancuso, Dean; Shiau, Stephanie; DiMattia, Michelle; Cellum, Ilana; Harvey Boyd, Kelly; Prevoteau, Charlotte; Kohlberg, Gavriel D; Spitzer, Jaclyn B; Lalwani, Anil K
Although speech perception tests are available to evaluate hearing, there is no standardized validated tool to quantify speech quality. The objective of this study is to develop a validated tool to measure quality of speech heard. Prospective instrument validation study of 35 normal hearing adults recruited at a tertiary referral center. Participants listened to 44 speech clips of male/female voices reciting the Rainbow Passage. Speech clips included original and manipulated excerpts capturing goal qualities such as mechanical and garbled. Listeners rated clips on a 10-point visual analog scale (VAS) of 18 characteristics (e.g. cartoonish, garbled). Skewed distribution analysis identified mean ratings in the upper and lower 2-point limits of the VAS (ratings of 8-10, 0-2, respectively); items with inconsistent responses were eliminated. The test was pruned to a final instrument of nine speech clips that clearly define qualities of interest: speech-like, male/female, cartoonish, echo-y, garbled, tinny, mechanical, rough, breathy, soothing, hoarse, like, pleasant, natural. Mean ratings were highest for original female clips (8.8) and lowest for not-speech manipulation (2.1). Factor analysis identified two subsets of characteristics: internal consistency demonstrated Cronbach's alpha of 0.95 and 0.82 per subset. Test-retest reliability of total scores was high, with an intraclass correlation coefficient of 0.76. The Speech Quality Instrument (SQI) is a concise, valid tool for assessing speech quality as an indicator for hearing performance. SQI may be a valuable outcome measure for cochlear implant recipients who, despite achieving excellent speech perception, often experience poor speech quality. 2b. Laryngoscope, 2017. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
seyed abolfazl zakerian; Roya Azizi; Mehdi Rahgozar
The term usability refers to a special index for success of an operating system. This study aimed to determine the reliability and validity of the Software Usability Measurements Inventory (SUMI) questionnaire as one of the valid and common questionnaires about usability evaluation. The back translation method was used to translate the questionnaire from English to Persian back to English. Moreover, repeatability or test-retest reliability was practically used to determine the reliability of ...
Zhang, Tingting; Yin, Anchun; Sun, Xiaohong; Liu, Qigui; Song, Guirong; Li, Lianhong
To develop psychosocial adaptation scale for Parkinson's disease (PD) in Chinese population and evaluate its reliability and validity. The items were designed by literature review, expert consultation and semi-structured interview. The methods of corrected item-total correlation, discrimination analysis and exploratory factor analysis were used for items selection. 427 valid scales from PD patients were collected in the study to test the reliability and validity. The scale incorporated six dimensions: anxiety, self-esteem, attitude, self-acceptance, self-efficacy and social support, a total of 32 items. The scale possessed good internal consistency. The test-retest correlation coefficient was 0.99 and average content validation rate was 0.97. The Hoehn and Yahr stage were correlated with total score of the scale. The psychosocial adaptation scale in this study showed good reliability and validity, it can be used as a reliable and valid instrument to evaluate the psychosocial adaptation of PD objectively and effectively.
Full Text Available It is known that laboratory investigations secure safe and reliable results that provide a final confirmation of the quality of work. Ideas, planning, knowledge, skills, experience, and environment, along with good laboratory practice, quality control and reliability of quality, make the area of biological investigations very complex. In recent years, quality control, including the control of work in the laboratory, is based on international standards and is used at that level. The implementation of widely recognized international standards, such as the International Standard ISO/IEC 17025 (1 and the implementing of the quality system series ISO/IEC 9000 (2 have become the imperative on the grounds of which laboratories have a formal, visible and corresponding system of quality. The diagnostic methods that are used must constantly yield results which identify the animal as positive or negative, and the precise status of the animal is determined with a predefined degree of statistical significance. Methods applied on a selected population reduce the risk of obtaining falsely positive or falsely negative results. A condition for this are well conceived and documented methods, with the application of the corresponding reagents, and work with professional and skilled staff. This process requires also a consistent implementation of the most rigorous experimental plans, epidemiological and statistical data and estimations, with constant monitoring of the validity of the applied methods. Such an approach is necessary in order to cut down the number of misconceptions and accidental mistakes, for a referent population of animals on which the validity of a method is tested. Once a valid method is included in daily routine investigations, it is necessary to apply constant monitoring for the purpose of internal quality control, in order adequately to evaluate its reproducibility and reliability. Consequently, it is necessary at least twice yearly to conduct
Climie, Rachel E D; Schultz, Martin G; Nikolic, Sonja B; Ahuja, Kiran D K; Fell, James W; Sharman, James E
Noninvasive central blood pressure (BP) independently predicts mortality, but current methods are operator-dependent, requiring skill to obtain quality recordings. The aims of this study were first, to determine the validity of an automatic, upper arm oscillometric cuff method for estimating central BP (O(CBP)) by comparison with the noninvasive reference standard of radial tonometry (T(CBP)). Second, we determined the intratest and intertest reliability of O(CBP). To assess validity, central BP was estimated by O(CBP) (Pulsecor R6.5B monitor) and compared with T(CBP) (SphygmoCor) in 47 participants free from cardiovascular disease (aged 57 ± 9 years) in supine, seated, and standing positions. Brachial mean arterial pressure (MAP) and diastolic BP (DBP) from the O(CBP) device were used to calibrate in both devices. Duplicate measures were recorded in each position on the same day to assess intratest reliability, and participants returned within 10 ± 7 days for repeat measurements to assess intertest reliability. There was a strong intraclass correlation (ICC = 0.987, P difference (1.2 ± 2.2 mm Hg) for central systolic BP (SBP) determined by O(CBP) compared with T(CBP). Ninety-six percent of all comparisons (n = 495 acceptable recordings) were within 5 mm Hg. With respect to reliability, there were strong correlations but higher limits of agreement for the intratest (ICC = 0.975, P difference 0.6 ± 4.5 mm Hg) and intertest (ICC = 0.895, P difference 4.3 ± 8.0 mm Hg) comparisons. Estimation of central SBP using cuff oscillometry is comparable to radial tonometry and has good reproducibility. As a noninvasive, relatively operator-independent method, O(CBP) may be as useful as T(CBP) for estimating central BP in clinical practice.
Härdén, Marie; Nyström, Britta; Kulich, Károly; Carlsson, Jonas; Bengtson, Ann; Edvardsson, Nils
Symptoms related to atrial fibrillation and their impact on health-related quality of life (HRQoL) are often evaluated in clinical trials. However, there remains a need for a properly validated instrument. We aimed to develop and validate a short symptoms scale for patients with AF. One hundred and eleven patients with a variety of symptoms related to AF were scheduled for DC cardioversion. The mean age was 67.1 +/- 12.1 years, and 80% were men. The patients completed the new symptoms scale, the Toronto Symptoms Check List (SCL) and the generic Short Form 36 (SF-36) the day before the planned DC cardioversion. Compliance was excellent, with only 1 of 666 answers missing. One item, 'limitations in working capability', was deleted because of a low numerical response rate, as many of the patients were retired. The internal consistency reliability of the remaining six items was 0.81 (Cronbach's alpha). Patients scored highest in the items of 'dyspnoea on exertion', 'limitations in daily life due to AF' and 'fatigue due to AF', with scores of 4.5, 3.3 and 4.5, respectively. There was a good correlation to all relevant SF-36 domains and to the relevant questions of the SCL. The Rasch analyses showed that the items are unidimensional and that they are clearly separated and cover an adequate range. Test-retest reliability was performed in patients who failed DC and was adequate for three of six items, > 0.70. The psychometric characteristics of the new short symptoms scale were found to have satisfactory reliability and validity.
Full Text Available Abstract Background Symptoms related to atrial fibrillation and their impact on health-related quality of life (HRQoL are often evaluated in clinical trials. However, there remains a need for a properly validated instrument. We aimed to develop and validate a short symptoms scale for patients with AF. Methods One hundred and eleven patients with a variety of symptoms related to AF were scheduled for DC cardioversion. The mean age was 67.1 ± 12.1 years, and 80% were men. The patients completed the new symptoms scale, the Toronto Symptoms Check List (SCL and the generic Short Form 36 (SF-36 the day before the planned DC cardioversion. Compliance was excellent, with only 1 of 666 answers missing. Results One item, 'limitations in working capability', was deleted because of a low numerical response rate, as many of the patients were retired. The internal consistency reliability of the remaining six items was 0.81 (Cronbach's α. Patients scored highest in the items of 'dyspnoea on exertion', 'limitations in daily life due to AF' and 'fatigue due to AF', with scores of 4.5, 3.3 and 4.5, respectively. There was a good correlation to all relevant SF-36 domains and to the relevant questions of the SCL. The Rasch analyses showed that the items are unidimensional and that they are clearly separated and cover an adequate range. Test-retest reliability was performed in patients who failed DC and was adequate for three of six items, >0.70. Conclusion The psychometric characteristics of the new short symptoms scale were found to have satisfactory reliability and validity.
Mehrnoosh Pazargadi; Tahereh Ashktorab; Sharareh Khosravi; Hamid Alavi majd
Background: The necessity of a valid and reliable assessment tool is one of the most repeated issues in nursing students` clinical evaluation. But it is believed that present tools are not mostly valid and can not assess students` performance properly.Objectives: This study was conducted to design a valid and reliable assessment tool for evaluating nursing students` performance in clinical education.Methods: In this methodological study considering nursing students` performance definition; th...
Camacho Luiz Antonio Bastos
Full Text Available Medical audit of hospital records has been a major component of quality of care assessment, although physician judgment is known to have low reliability. We estimated interrater agreement of quality assessment in a sample of patients with cardiac conditions admitted to an American teaching hospital. Physician-reviewers used structured review methods designed to improve quality assessment based on judgment. Chance-corrected agreement for the items considered more relevant to process and outcome of care ranged from low to moderate (0.2 to 0.6, depending on the review item and the principal diagnoses and procedures the patients underwent. Results from several studies seem to converge on this point. Comparisons among different settings should be made with caution, given the sensitivity of agreement measurements to prevalence rates. Reliability of review methods in their current stage could be improved by combining the assessment of two or more reviewers, and by emphasizing outcome-oriented events.
1 University of Northern Iowa, Division of Athletic Training, 003C Human. Performance Center, Cedar ... concurrent validity of the fingertip-to-floor distance test (FFD) ... in these protocols are spinal and extremity range of motion, pelvic control ...
Li, Tianling; Winnel, Melissa; Lin, Hao; Panther, Jared; Liu, Chang; O'Halloran, Roger; Wang, Kewen; An, Taicheng; Wong, Po Keung; Zhang, Shanqing; Zhao, Huijun
With closing water loop through purified recycled water, wastewater becomes a part of source water, requiring reliable wastewater quality monitoring system (WQMS) to manage wastewater source and mitigate potential health risks. However, the development of reliable WQMS is fatally constrained by severe contamination and biofouling of sensors due to the hostile analytical environment of wastewaters, especially raw sewages, that challenges the limit of existing sensing technologies. In this work, we report a technological solution to enable the development of WQMS for real-time abnormal event detection with high reliability and practicality. A vectored high flow hydrodynamic self-cleaning approach and a dual-sensor self-diagnostic concept are adopted for WQMS to effectively encounter vital sensor failing issues caused by contamination and biofouling and ensure the integrity of sensing data. The performance of the WQMS has been evaluated over a 3-year trial period at different sewage catchment sites across three Australian states. It has demonstrated that the developed WQMS is capable of continuously operating in raw sewage for a prolonged period up to 24 months without maintenance and failure, signifying the high reliability and practicality. The demonstrated WQMS capability to reliably acquire real-time wastewater quality information leaps forward the development of effective wastewater source management system. The reported self-cleaning and self-diagnostic concepts should be applicable to other online water quality monitoring systems, opening a new way to encounter the common reliability and stability issues caused by sensor contamination and biofouling. Copyright © 2017 Elsevier Ltd. All rights reserved.
Dong, Aishu; Chen, Sisi; Zhu, Lianlian; Shi, Lingmin; Cai, Yueli; Zeng, Jingni; Guo, Wenjian
Chronic heart failure (CHF), a major public health problem worldwide, seriously limits health-related quality of life (HRQOL). How to evaluate HRQOL in older patients with CHF remains a problem. To evaluate the reliability and validity of the Chinese version of the Medical Outcomes Study Short Form version 2 (SF-36v2) in CHF patients. From September 2012 to June 2014, we assessed QOL using the SF-36v2 in 171 aging participants with CHF in four cardiology departments. Convergent and discriminant validity, factorial validity, sensitivity among different NYHA classes and between different age groups, and reliability were determined using standard measurement methods. A total of 150 participants completed a structured questionnaire including general information and the Chinese SF-36v2; 132 questionnaires were considered valid, while 21 patients refused to take part. 25 of the 50 participants invited to complete the 2-week test-retest questionnaires returned completed questionnaires. The internal consistency reliability (Cronbach's α) of the total SF-36v2 was 0.92 (range 0.74-0.93). All hypothesized item-subscale correlations showed satisfactory convergent and discriminant validity. Sensitivity was measured in different NYHA classes and age groups. Comparison of different NYHA classes showed statistical significance, but there was no significant difference between age groups. We confirmed the SF-36v2 as a valid instrument for evaluating HRQOL Chinese CHF patients. Both reliability and validity were strongly satisfactory, but there was divergence in understanding subscales such as "social functioning" because of differing cultural background. The reliability, validity, and sensitivity of SF-36v2 in aging patients with CHF were acceptable.
This article discusses the use of assessment by teachers to replace external marking. It shows how professional participation and moderation can provide reliability in summative assessment, even in public examinations for older students. It draws on historical experiences of assessment for A-level English literature.
All three of these instruments do not involve high costs, do not require high technical skills, mobile, save time, and are suitable for use in large populations. Because all three instruments can estimate the percentage of body fat, but it is important to identify the most appropriate instruments and have high reliability. Hence, this ...
Kern, Jeffrey M.; MacDonald, Marian L.
The reliability and meaning of assertiveness tests were explored using 120 female undergraduates. Several self-report inventories (the College Self-Expression Scale, Conflict Resolution Inventory, and a global rating from one to seven) were administered, as were three anxiety measures (Timed Behavior Checklist, response latency, and response…
Konge, Lars; Larsen, Klaus Richter; Clementsen, Paul
: The interrater reliability was high, with Cronbach's a = 0.86. Assessment of 3 bronchoscopies by a single rater had a generalizability coefficient of 0.84. The correlation between experience and performance was good (Pearson correlation = 0.76). There were significant differences between the groups for all...
Powers, Stephen; And Others
Spanish speaking first graders were administered the Artes de Lenguage (ADL)--a Spanish, criterion-referenced, language arts test. Reliability analyses indicated the adequacy of three of the four subscales (Phonetic Analysis, Vocabulary Development, Comprehension Skills, and General Skills). A principal factors analysis of the intercorrelation…
Cetin, Bayram; Yaman, Erkan; Peker, Adem
The purpose of this study is to develop a reliable and valid scale, which determines cyber victimization and bullying behaviors of high school students. Research group consisted of 404 students (250 male, 154 male) in Sakarya, in 2009-2010 academic years. In the study sample, mean age is 16.68. Content validity and face validity of the scale was…
Shek, Daniel T. L.; Lai, Kelly Y. C.
Reliability and validity of Chinese Self-Report Family Inventory (C-SFI) were examined in three studies. Study 1 showed C-SFI was temporally stable and internally consistent. Study 2 indicated C-SFI could discriminate between clinical and nonclinical groups. Study 3 gave support for internal consistency, concurrent validity and construct validity.…
Full Text Available The Chinese Healthy Eating Index (CHEI is a measuring instrument of diet quality in accordance with the Dietary Guidelines for Chinese (DGC-2016. The objective of the study was to evaluate the validity and reliability of the CHEI. Data from 12,473 adults from the China Health and Nutrition Survey (CHNS-2011, including 3-day–24-h dietary recalls were used in this study. The CHEI was assessed by four exemplary menus developed by the DGC-2016, the general linear models, the independent t-test and the Mann–Whitney U-test, the Spearman’s correlation analysis, the principal components analysis (PCA, the Cronbach’s coefficient, and the Pearson correlation with nutrient intakes. A higher CHEI score was linked with lower exposure to known risk factors of Chinese diets. The CHEI scored nearly perfect for exemplary menus for adult men (99.8, adult women (99.7, and the healthy elderly (99.1, but not for young children (91.2. The CHEI was able to distinguish the difference in diet quality between smokers and non-smokers (P < 0.0001, people with higher and lower education levels (P < 0.0001, and people living in urban and rural areas (P < 0.0001. Low correlations with energy intake for the CHEI total and component scores (|r| < 0.34, P < 0.01 supported the index assessed diet quality independently of diet quantity. The PCA indicated that underlying multiple dimensions compose the CHEI, and Cronbach’s coefficient α was 0.22. Components of dairy, fruits and cooking oils had the greatest impact on the total score. People with a higher CHEI score had not only a higher absolute intake of nutrients (P < 0.001, but also a more nutrient-dense diet (P < 0.001. Our findings support the validity and reliability of the CHEI when using the 3-day–24-h recalls.
Vuković Mira H.
Full Text Available Introduction: The structure of healthcare professionals' economic reasoning (HPER is still unknown. The aim: The structural validity and the reliability of the HPER questionnaire were evaluated. Methods: The psychometric study about evaluation of the HPER factors was conducted. The healthcare professionals (physicians, dentists, pharmacists that are employed in healthcare state sector of Republic of Serbia were interviewed. The HPER Questionnaire contained 29 preliminary items. Its structure was evaluated by Principal Component Analysis for categorial and ordinal data. Results: The 9 items that defined seven HPER factors were extracted: (1 the negligence of the cost of quality by the administration and financier; (2 the consideration of the market price and healthcare benefit; (3 market orientation considering patient's requests; (4 the recognition of inadequate resource alocations by non-medical administrative authority; (5 the attitude about redirecting the profit from state to private healthcare sector; (6 the recognition of the unjustified spending by the healthcare professionals and (7 the relationship physician/patient. Conclusion: The HPER-9 Questionnaire has high structural validity and reliability in the HPER measurement.
Patrícia Pinto Fonseca
Full Text Available INTRODUCTION: Patients' perception about their health condition, mainly involving chronic diseases, has been investigated in many studies and it has been associated to depression, compliance with the treatment, quality of life and prognosis. The Illness Effects Questionnaire (IEQ is a tool which makes the standardized evaluation of patients' perception about their illness possible, so that it is brief and accessible to the different clinical settings. This work aims to begin the transcultural adaptation of the IEQ to Brazil through the validated translation and the reliability study. METHODS: The back-translation method and the test-retest reliability study were used in a sample of 30 adult patients under chronic hemodialysis. The reliability indexes were estimated using the Pearson, Spearman, Weighted Kappa and Cronbach's alpha coefficients. RESULTS: The semantic equivalence was reached through the validated translation. In this study, the reliability indexes obtained were respectively: 0.85 and 0.75 (p < 0.001; 0.68 and 0.92 (p < 0.0001. DISCUSSION: The reliability indexes obtained attest to the stability of responses in both evaluations. Additional procedures are necessary for the transcultural adaptation of the IEQ to be complete. CONCLUSION: The results indicate the translation validity and the reliability of the Brazilian version of the IEQ for the sample studied.
Full Text Available The retail industry remains one of the largest sectors in the global economy. In South Africa, retailing is one of the toughest and most competitive industries. The South African retail business environment is becoming increasingly hostile and unforgiving, with intense competition from both domestic and foreign companies (Terblanche, 1998: 1. The findings of this preliminary study do provide basic support for a three-factor structure for supermarket service quality in terms of reliability and validity. The reliability analysis, which followed the factor analysis, reflected coefficient α values ranging from 0.85 to 0.90, indicating high internal consistency among variables within each dimension. In today's saturated retail markets, retailersface increasing hurdles to attract and maintain customers.
Skinner, T. C.; Howells, L.; Greene, S.
Aims: This article reports on the development and validity of a Diabetes-specific Illness Representations Questionnaire (DIRQ) to assess all five dimensions of an individual's perception of diabetes, for adolescents with Type 1 diabetes mellitus. Methods: There were two development studies. Study 1...... with a diabetes self-efficacy and barriers to adherence questionnaire. Subsequently there were two validation studies. Study 3: participants (n = 44 adolescents and 28 parents) completed the DIRQ and questionnaires assessing their self-care and psychological well-being. Glycaemic control was assessed through...... consist of two subscales, perceived threat and perceived impact, and provide further support for the distinction between treatment effectiveness to control diabetes and treatment effectiveness to prevent complications. Along with the validation studies, the results indicate that the questionnaire scales...
Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen
Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Cheung, Kenneth M C; Senkoylu, Alpaslan; Alanay, Ahmet; Genc, Yasemin; Lau, Sarah; Luk, Keith D
Validation study to define validity and reliability of an adapted and translated questionnaire. Assessment of the concurrent validity and reliability of a Chinese version of SRS-22 outcome instrument. No valid health-related quality of life (HRQL) outcome instrument exists for patients with spinal deformity in Chinese. The modified SRS-22 questionnaire was proven to be an appropriate outcome instrument in English, and has already been translated and validated in several other languages. The English version of the SRS-22 questionnaire was adapted to Chinese according to the International Quality of Life Assessment Project guidelines. To assess reliability, 48 subjects with adolescent idiopathic scoliosis (mean age, 16.5 years) filled the questionnaire on 2 separate occasions (Group 1). To assess concurrent validity, 50 subjects (mean age, 21 years) filled in the same questionnaire and a previously validated Chinese version of the Short Form-36 (SF36) questionnaire (Group 2). Internal consistency, reproducibility and concurrent validity were determined with Cronbach's alpha coefficient, interclass correlation coefficient and Pearson correlation coefficient, respectively. Cronbach's alpha coefficient for the 4 major domains (function/activity, pain, self-image/appearance and mental health) were high. Intraclass correlation was also excellent for all domains. For concurrent validity, excellent correlation was found in 1 domain, good in 12 domains, moderate in 3 domains, and poor in 1 domain of the 17 relevant domains. Both cultural adaptation and linguistic translation are essential in any attempt to use a HRQL questionnaire across cultures. The Chinese version of the SRS-22 outcome instrument has satisfactory internal consistency and excellent reproducibility. It is ready for use in clinical studies on idiopathic scoliosis in Chinese-speaking societies.
The complexity of magnetic confinement machines has been a matter of concern in developing fusion power plants as electricity generating stations because it might reduce plant availability. A comprehensive reliability and availability (R and A) programme to determine the availability of a next step fusion machine was performed during definition and conceptual design of the Next European Torus. In addition to giving an overview of the expected contributions to unavailability of the various components, this activity identified the basic approach to be taken to specify and to achieve necessary improvements. This paper, after giving some basic definitions, describes the essentials of the R and A programme, its results, and the guidelines derived for further work towards a sufficiently reliable fusion plant. These guidelines refer to improvement of the reliability database and the quality assurance to be performed at the design stage of a next step machine. (orig.)
Bech, B; Lönn, L; Falkenberg, M
Objectives To study the construct validity and reliability of a novel endovascular global rating scale, Structured Assessment of endoVascular Expertise (SAVE). Design A Clinical, experimental study. Materials Twenty physicians with endovascular experiences ranging from complete novices to highly....... Validity was analysed by correlating experience with performance results. Reliability was analysed according to generalisability theory. Results The mean score on the 29 items of the SAVE scale correlated well with clinical experience (R = 0.84, P ... with clinical experience (R = -0.53, P validity and reliability of assessment with the SAVE scale was high when applied to performances in a simulation setting with advanced realism. No ceiling effect...
Sharma, Sonia; Crow, Heidi C; McCall, W D; Gonzalez, Yoly M
To conduct a systematic review of papers reporting the reliability and diagnostic validity of the joint vibration analysis (JVA) for diagnosis of temporomandibular disorders (TMD). A search of Pubmed identified English-language publications of the reliability and diagnostic validity of the JVA. Guidelines were adapted from applied STAndards for the Reporting of Diagnostic accuracy studies (STARD) to evaluate the publications. Fifteen publications were included in this review, each of which presented methodological limitations. This literature is unable to provide evidence to support the reliability and diagnostic validity of the JVA for diagnosis of TMD.
Turel, Yalin Kilic
The interactive whiteboard (IWB) has become a popular technology for instructors over the last decade. Though research asserts that the IWBs facilitate learning in different ways, there is a lack of studies examining actual IWB use in classroom settings based on learners' perspectives by means of valid instruments. The purpose of this study is to…
They completed this 15 item self-rated instrument that assesses patient satisfaction with services using a 5 point response format. Results:The internal consistency for the scale was high ( a=0.91), and item total correlations ranged between 0.33 to 0.70. Its convergent validity was supported by significant correlations of all ...
Ilker, Gokce Erturan; Arslan, Yunus; Demirhan, Giyasettin
The Trichotomous Achievement Goal Scale was developed by Agbuga and Xiang (2008) by including selected items from the scales of Duda and Nicholls (1992), Elliot (1999), and Elliot and Church (1997) and adapting them into Turkish. The scale consists of 18 items, and students rated each item on a 7-point Likert scale. To ascertain the validity and…
Kane, Michael; Case, Susan
The scores on two distinct tests (e.g., essay and objective) are often combined into a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to a separate criterion. In cases where no criterion is available, the observed composite has generally been evaluated in terms of its…
Moore, Martha; Barker, Karen
The four square step test (FSST) was first validated in healthy older adults to provide a measure of dynamic standing balance and mobility. The FSST has since been used in a variety of patient populations. The purpose of this systematic review is to determine the validity and reliability of the FSST in these different adult patient populations. The literature search was conducted to highlight all the studies that measured validity and reliability of the FSST. Six electronic databases were searched including AMED, CINAHL, MEDLINE, PEDro, Web of Science and Google Scholar. Grey literature was also searched for any documents relevant to the review. Two independent reviewers carried out study selection and quality assessment. The methodological quality was assessed using the QUADAS-2 tool, which is a validated tool for the quality assessment of diagnostic accuracy studies, and the COSMIN four-point checklist, which contains standards for evaluating reliability studies on the measurement properties of health instruments. Fifteen studies were reviewed studying community-dwelling older adults, Parkinson's disease, Huntington's disease, multiple sclerosis, vestibular disorders, post stroke, post unilateral transtibial amputation, knee pain and hip osteoarthritis. Three of the studies were of moderate methodological quality scoring low in risk of bias and applicability for all domains in the QUADAS-2 tool. Three studies scored "fair" on the COSMIN four-point checklist for the reliability components. The concurrent validity of the FSST was measured in nine of the studies with moderate to strong correlations being found. Excellent Intraclass Correlation Coefficients were found between physiotherapists carrying out the tests (ICC = .99) with good to excellent test-retest reliability shown in nine of the studies (ICC = .73-.98). The FSST may be an effective and valid tool for measuring dynamic balance and a participants' falls risk. It has been shown to have strong
Childers, Frank M.
This Kennedy Historical Document (KHD) provides a unique historical perspective of the organizational and functional responsibilities for the manned and un-manned programs at Kennedy Space Center, Florida. As systems become more complex and hazardous, the attention to detailed planning and execution continues to be a challenge. The need for a robust reliability and quality assurance program will always be a necessity to ensure mission success. As new space missions are defined and technology allows for continued access to space, these programs cannot be compromised. The organizational structure that has provided the reliability and quality assurance functions for both the manned and unmanned programs has seen many changes since the first group came to Florida in the 1950's. The roles of government and contractor personnel have changed with each program and organizational alignment has changed based on that responsibility. The organizational alignment of the personnel performing these functions must ensure independent assessment of the processes.
A. Tugba Bahadir
Full Text Available Background: Pectus excavatum (PE is the most common chest wall deformity. The Nuss Questionnaire modified for Adults (NQmA is a disease-specific health-related quality of life assessment tool for patients with pectus deformities. The aim of this study is to adapt the NQmA into Turkish. Methods: Two hundred and sixty-five patients with PE were participated, with an age range of 14 to 29 years. All patients underwent a physical examination and had not undergone corrective surgery. The Turkish version of the NQmA was completed by patients and their parents. Results: The content validity index based on expert opinions was 91% for the patient questionnaire and 96% for the parent questionnaire. The Cronbach’s alpha value for the NQmA was found to be 0.805 for the patient questionnaire and 0.800 for the parent questionnaire. Exploratory factor analysis was used to assess construct validity. Two factors explained 51.1% of the total variance in the patient questionnaire (psychosocial: 31.145%, Cronbach’s alpha=0.818; physical: 19.955%, Cronbach’s alpha=0.862. In the parent questionnaire, two factors explained 51.422% of the total variance (psychosocial: 26.097%, Cronbach’s alpha=0.743; physical: 25.325%, Cronbach’s alpha=0.827. Construct validity was confirmed by confirmatory factor analysis. Conclusion: The Turkish version of the NQmA was found to be valid and reliable for the assessment of quality of life in patients with PE.
Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M
Background: Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. Aim: This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). Subjects and Methods: This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's c...
Yehya, Arij; Ghuloum, Suhaila; Mahfoud, Ziyad; Opler, Mark; Khan, Anzalee; Hammoudeh, Samer; Abdulhakam, Abdulmoneim; Al-Mujalli, Azza; Hani, Yahya; Elsherbiny, Reem; Al-Amin, Hassen
The Positive and Negative Syndrome Scale (PANSS) is widely used for patients with schizophrenia. This scale is reliable and valid. The PANSS was translated and validated in several languages. The aim of this study was to translate and validate the PANSS in the Arab population. The PANSS was translated into formal Arabic language using the back-translation method. 101 Arab patients with schizophrenia and 98 Arabs with no diagnosis of any mental disorder were recruited. The Arabic version of the Mini International Neuropsychiatric Interview (MINI-6) was used as a diagnostic tool to confirm the diagnosis of schizophrenia or rule out any diagnosis for the healthy control group. Reliability of the scale was assessed by calculating internal consistency, interrater reliability and test-retest reliability. Construct validity was assessed using the Arabic version of the MINI-6. PANSS total scores were correlated with the Clinical Global Impression-Severity scale. Our findings showed that the internal consistency was good (0.92). Scores on the PANSS of the patients were much higher than those of the healthy controls. The PANSS showed good interrater reliability and test-retest reliability (0.92 and 0.75, respectively). In comparison with the MINI-6, the PANSS showed good sensitivity and specificity, which implies good construct validity of this version. In conclusion, the Arabic version of the PANSS is a reliable and valid instrument for the assessment of patients with schizophrenia in the Arab population. © 2016 S. Karger AG, Basel.
In order to decrease costs and increase competitiveness, nuclear utilities use more challenging operational conditions, longer fuel cycles and higher burnups, which require modifications in fuel designs and materials. Different aspects of quality assurance and control, as well as analysis of fuel performance have been considered in a number of specialized publications. The present publication provides a concise but comprehensive overview of all interconnected quality and reliability issues in fuel fabrication, design and operation. It jointly tackles technical, safety and organizational aspects, and contains examples of state of the art developments and good practices of coordinated work of fuel designers, vendors and reactor operators
Jin, Elaine W.; Keelan, Brian W.; Chen, Junqing; Phillips, Jonathan B.; Chen, Ying
A softcopy quality ruler method was implemented for the International Imaging Industry Association (I3A) Camera Phone Image Quality (CPIQ) Initiative. This work extends ISO 20462 Part 3 by virtue of creating reference digital images of known subjective image quality, complimenting the hardcopy Standard Reference Stimuli (SRS). The softcopy ruler method was developed using images from a Canon EOS 1Ds Mark II D-SLR digital still camera (DSC) and a Kodak P880 point-and-shoot DSC. Images were viewed on an Apple 30in Cinema Display at a viewing distance of 34 inches. Ruler images were made for 16 scenes. Thirty ruler images were generated for each scene, representing ISO 20462 Standard Quality Scale (SQS) values of approximately 2 to 31 at an increment of one just noticeable difference (JND) by adjusting the system modulation transfer function (MTF). A Matlab GUI was developed to display the ruler and test images side-by-side with a user-adjustable ruler level controlled by a slider. A validation study was performed at Kodak, Vista Point Technology, and Aptina Imaging in which all three companies set up a similar viewing lab to run the softcopy ruler method. The results show that the three sets of data are in reasonable agreement with each other, with the differences within the range expected from observer variability. Compared to previous implementations of the quality ruler, the slider-based user interface allows approximately 2x faster assessments with 21.6% better precision.
Uggioni, Paula Lazzarin; Salay, Elisabete
Validated and reliable instruments for measuring consumer attitudes regarding food quality certifications are lacking, but the measurement of consumer attitude could be an important tool for understanding consumer behavior. Thus the objective of this study was to develop an instrument for measuring consumer attitudes regarding private food safety certifications for commercial restaurants. To this end, the following steps were carried out: development of the interview items; complete pilot testing; item analyses (influence of social desirability and total-item correlation); reliability test (internal consistency and test-retest); and validity assessment (content and discriminative validity and exploratory and confirmatory factor analysis). The subjects, all over the age of 18 and drawn from six non-probabilistic samples (n=7-350) in the city of Campinas, Brazil, were all subjected to an interview. The final scale included 24 items and had a Cronbach's alpha coefficient of 0.79 and a content validation coefficient of 0.99, both within acceptable limits. The confirmatory factor analysis validated a model with five factors and the final instrument discriminated reasonably well between the groups and showed satisfactory reproducibility (r=0.955). Furthermore, the scale validity and reliability were satisfactory, suggesting it could also be applied to future studies. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Rassin, E; Merckelbach, H; Muris, P; Schmidt, H
Thought-action fusion (TAF) refers to a set of cognitive biases that are thought to play a role in the development of obsessional phenomena. To measure these biases, R. Shafran, D. S. Thordarson, and S. Rachman (1996; Journal of Anxiety Disorders, 10, 379-391) developed the TAF-scale. They concluded that the TAF-scale possesses adequate psychometric qualities. The current study sought to further explore the reliability and validity of the TAF-scale. Results indicate that the TAF-scale has good internal consistency. TAF-scores correlated with self-reports of obsessional problems. Furthermore, mean scores in a mixed sample of anxiety disordered patients were higher than those in a normal sample. However, temporal consistency was somewhat disappointing. Also, the question remains whether TAF is specific to obsessive-compulsive disorder or taps more pervasive biases that play a role in a variety of disorders.
Oikonomidi, Theodora; Vikelis, Michail; Artemiadis, Artemios; Chrousos, George P; Darviri, Christina
The Migraine Disability Assessment (MIDAS) Questionnaire is a reliable and valid instrument for migraine-related disability. Such a tool is needed to quantify migraine-related disability in the Greek population. This validation study aims to assess the test-retest reliability, internal consistency, item discriminant and convergent validity of the Greek translation of the MIDAS. Adults diagnosed with migraine completed the MIDAS Questionnaire on two occasions 3 weeks apart to assess reliability, and completed the RAND-36 to assess validity. Participants (n = 152) had a median MIDAS score of 24 and mostly severe disability (58% were grade IV). The test-retest reliability analysis (N = 59) revealed excellent reliability for the total score. Internal consistency was α = 0.71 for initial and α = 0.82 for retest completion. For item discriminant validity, the correlations between each question and the total score were significant, with high correlations for questions 2-5 (range 0.67 ≤ r ≤ 0.79; p MIDAS score tended to have better wellbeing. Psychometric properties are comparable with those of other published validation studies of the MIDAS and the original. Findings on question 1 show that missing work/school days may be closely related with increased affect issues. The Greek version of the MIDAS Questionnaire has good reliability and validity. This study allowed for cross-cultural comparability of research findings.
Verdugo, M. A.; Cordoba, L.; Gomez, J.
Background: Assessing the quality of life (QOL) for families that include a person with a disability have recently become a major emphasis in cross-cultural QOL studies. The present study examined the reliability and validity of the Family Quality of Life Survey (FQOL) on a Spanish sample. Method and Results: The sample comprised 385 families who…
DunnGalvin, Audrey; Barnett, Julie; Begen, Fiona M; Ryan, Kathleen; Lucas, Jane S
Approximately 20% of children and adults avoid certain foods because of perceived food intolerance. Valid and reliable health-related quality of life instruments are needed to measure changes following clinical, therapeutic or policy interventions. However, there are no disease-specific quality of life instruments for adults with food intolerances. To develop the Food Intolerance Quality of Life Questionnaire FIQLQ. Then to conduct psychometric validation including reliability and construct validity. We adapted the existing Food Allergy Quality of Life questionnaire (FAQLQ) for interviews with 14 adults with food intolerance. For preliminary psychometric validation, 229 adults with food intolerances completed the online electronic version of FIQLQ. The resultant FIQLQ had 18 items which loaded onto 3 subscales-Emotional Impact, Social and Dietary Restrictions, Reactions and Avoidance. Each subscale had excellent internal consistency reliability (Cronbach's α 0.81-0.94). Content, convergent and construct validity was supported by significant correlations of FIQLQ subscale scores with hypothesised variables including age, numbers of symptoms and level of stress experienced due to intolerance. The FIQLQ has good reliability, construct validity and internal consistency. It is short and easy to use, providing a good tool for evaluating quality of life in the clinical research setting and to inform health and regulatory policies.
Maria C. Richaud
Full Text Available Empathy is a basic socio-emotional process of human development that involves the ability to perceive, share, and understand the emotional states of others. This process is essential to successful social functioning. However, despite its significance, empathy has been difficult to define and measure, particularly when incorporating both its emotional and cognitive aspects. The purpose of this study was to develop an Empathy Questionnaire for children aged 9–12 years based on a model of social cognitive neuroscience and to analyze its construct validity and reliability. This questionnaire aimed to integrate the following aspects: emotional contagion, self-other awareness, perspective-taking, emotional regulation, and empathic action. Three studies were conducted. Study 1 evaluated the discriminative power of the items and studied the underlying structure of the instrument using exploratory factor analysis. In Study 2, confirmatory factor analysis was performed to test the model obtained. Finally, the goal of Study 3 was to analyze the convergent and discriminant validity of the questionnaire and the internal consistency of its dimensions. The final version of the instrument contained 15 items that operationalized the previously listed dimensions. The results of the 3 studies indicated that the questionnaire had good validity and reliability. This study has important implications for research and clinical practice. Given its simplicity and brevity, this new self-report scale may work well as a screening method to evaluate the key psychological issues underlying numerous child behaviors that predict the success or failure of social relationships, individual quality of life, and mental well-being.
Ringsted, C; Lippert, F; Hesselfeldt, R
Cardiac Arrest Simulation Test (CASTest) scenarios for the assessments according to guidelines 2005. AIMS: To analyse the reliability and validity of the individual sub-tests provided by ERC and to find a combination of MCQ and CASTest that provides a reliable and valid single effect measure of ALS...... that possessed high reliability, equality of test sets, and ability to discriminate between the two groups of supposedly different ALS competence. CONCLUSIONS: ERC sub-tests of ALS competence possess sufficient reliability and validity. A combined ALS score with equal weighting of one MCQ and one CASTest can...... competence. METHODS: Two groups of participants were included in this randomised, controlled experimental study: a group of newly graduated doctors, who had not taken the ALS course (N=17) and a group of students, who had passed the ALS course 9 months before the study (N=16). Reliability in terms of inter...
Rikkert, Marcel G M Olde; Tona, Klodiana Daphne; Janssen, Lieneke
New staging systems of dementia require adaptation of disease management programs and adequate staging instruments. Therefore, we systematically reviewed the literature on validity and reliability of clinically applicable, multidomain, and dementia staging instruments. A total of 23 articles...
M. Reijman (Max); J.M.W. Hazes (Mieke); H.A.P. Pols (Huib); R.M.D. Bernsen (Roos); B.W. Koes (Bart); S.M. Bierma-Zeinstra (Sita)
textabstractOBJECTIVES: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the
Chorong Park, MSN, RN
Conclusion: The K-HES had acceptable validity and reliability. The brevity and ease of administration of the K-HES makes it a suitable tool for evaluating empowerment-based education programs targeted towards older populations.
Betül Tosun, RN, PhD
Conclusions: The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems.
Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe
The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis.
Boer, Y.A. de; Ende, C.H.M. van den; Eygendaal, D.; Jolie, I.M.M.; Hazes, J.M.W.; Rozing, P.M.
OBJECTIVES: (1) To investigate the measurement characteristics of the Hospital for Special Surgery (HSS) and Mayo Clinic elbow assessment instruments, utilizing methodological criteria including feasibility, reliability, validity, and discriminative ability; and (2) to develop an efficient and
knowledge-dietary behaviour relationship require use of valid and reliable knowledge .... Which of the following beverages has the lowest energy content per cup (250 ml)?b .... Diploma (ND): Consumer Science: Food and Nutrition together.
Gamze Sarikoc, PhD, RN
Conclusion: Results showed that the SNSI had a satisfactory level of reliability and validity in nursing students in Turkey. Multicenter studies including nursing students from different nursing schools are recommended for the SNSI to be generalized.
McMullen, Tara; Resnick, Barbara
To establish the reliability and validity of the Rosenberg Self-Esteem Scale (RSES) when used with nursing assistants (NAs). Testing the RSES used baseline data from a randomized controlled trial testing the Res-Care Intervention. Female NAs were recruited from nursing homes (n = 508). Validity testing for the positive and negative subscales of the RSES was based on confirmatory factor analysis (CFA) using structural equation modeling and Rasch analysis. Estimates of reliability were based on Rasch analysis and the person separation index. Evidence supports the reliability and validity of the RSES in NAs although we recommend minor revisions to the measure for subsequent use. Establishing reliable and valid measures of self-esteem in NAs will facilitate testing of interventions to strengthen workplace self-esteem, job satisfaction, and retention.
Ohrbach, R; Gale, E N
Four studies are presented testing the validity and reliability of pressure pain thresholds (PPTs) and of examination parameters believed to be important in the clinical assessment of sites commonly used for such measures in patient samples. Forty-five patients with a myogenous temporomandibular disorder were examined clinically prior to PPT measures. Criteria for history and examination included functional aspects of the pain, tissue quality of the pain site, and the type of pain elicited from palpation. Control sites within the same muscle and in the contralateral muscle were also examined. PPTs were measured as an index of tenderness using a strain gauge algometer at these sites. The data from the 5 male subjects were excluded from subsequent analyses due to the higher PPT in the males and to their unequal distribution among the various factorial conditions. The first study demonstrated strong validity in PPT measures between patients (using pain sites replicating the patients' pain) and matched controls (n = 11). The PPT was not significantly different between the primary pain site (referred pain and non-referred pain collapsed) and the no-pain control site in the same muscle (n = 16). The PPT was significantly lower at the pain site compared to the no-pain control site in the contralateral muscle (n = 13). The second study indicated adequate reliability in patient samples of the PPT measures. In the third study, the PPT was significantly lower at sites producing referred pain on palpation compared to sites producing localized pain on palpation. The PPT findings from the control sites were inconsistent on this factor. The fourth study presented preliminary evidence that palpable bands and nodular areas in muscle were most commonly associated with muscle regions that produce pain; such muscle findings were not specific, however, for regions that produce pain. Further, the intraexaminer reliability in reassessing these pain sites qualitatively was only fair
van de Grift, Wim
Instability in the school population between school entrance and school leaving is not just a problem of missing data but often the visible result of the educational problems in some schools and is, therefore, not merely to be treated as missing data but as indicator for the quality of educational
5D is a standardised instrument that measures health-related quality-of-life and explores cost-effectiveness of treatments. ... Fifty-three patients with various musculoskeletal problems were administered the Chichewa EQ-5D-3L and World.
Schlegel, C.; Woermann, U.; Rethans, J.J.; Vleuten, C.P.M. van der
BACKGROUND: In the training of healthcare professionals, one of the advantages of communication training with simulated patients (SPs) is the SP's ability to provide direct feedback to students after a simulated clinical encounter. The quality of SP feedback must be monitored, especially because it
Kahraman, Turhan; Özdoğar, Asiye Tuba; Honan, Cynthia Alison; Ertekin, Özge; Özakbaş, Serkan
To linguistically and culturally adapt the Multiple Sclerosis Work Difficulties Questionnaire-23 (MSWDQ-23) for use in Turkey, and to examine its reliability and validity. Following standard forward-back translation of the MSWDQ-23, it was administered to 124 people with multiple sclerosis (MS). Validity was evaluated using related outcome measures including those related to employment status and expectations, disability level, fatigue, walking, and quality of life. Randomly selected participants were asked to complete the MSWDQ-23 again to assess test-retest reliability. Confirmatory factor analysis on the MSWDQ-23 demonstrated a good fit for the data, and the internal consistency of each subscale was excellent. The test-retest reliability for the total score, psychological/cognitive barriers, physical barriers, and external barriers subscales were high. The MSWDQ-23 and its subscales were positively correlated with the employment, disability level, walking, and fatigue outcome measures. This study suggests that the Turkish version of MSWDQ-23 has high reliability and adequate validity, and it can be used to determine the difficulties faced by people with multiple sclerosis in workplace. Moreover, the study provides evidence about the test-retest reliability of the questionnaire. Implications for rehabilitation Multiple sclerosis affects young people of working age. Understanding work-related problems is crucial to enhance people with multiple sclerosis likelihood of maintaining their job. The Multiple Sclerosis Work Difficulties Questionnaire-23 (MSWDQ-23) is a valid and reliable measure of perceived workplace difficulties in people with multiple sclerosis: we presented its validation to Turkish. Professionals working in the field of vocational rehabilitation may benefit from using the MSWDQ-23 to predict the current work outcomes and future employment expectations.
Spiekermann, Christoph; Rudack, Claudia; Stenner, Markus
The outcome of aesthetic rhinoplasty is determined by the patient's subjective satisfaction with the nasal appearance which is difficult to assess. The Utrecht Questionnaire for Outcome Assessment in Aesthetic Rhinoplasty (OAR) is a brief and reliable instrument to assess the influence of the subjective nasal appearance on quality of life in patients undergoing aesthetic rhinoplasty. Preoperative application of this questionnaire reveals important aspects and possible disturbances of the body image which could be negative predictors concerning the result. On the other hand, it represents an appropriate tool to assess the postoperative outcome. The aim of this study was to determine the validity, reliability and responsiveness of the adapted German version of the OAR (D-OAR). The adaption of the OAR to German language was performed by a forward and backward translation process. Patients undergoing rhinoplasty were asked to complete the D-OAR preoperatively, 1, 3 and 12 months after procedure and healthy volunteers without any nasal complaints served as controls to test validity, reliability and responsiveness. An excellent internal consistency, a good test-retest reliability and good inter-item and item-total correlations demonstrated a good reliability of the D-OAR. The convincing validity of the adapted version was proven by an excellent discriminant and a sufficient content validity. Significant differences between pre- and postoperative D-OAR scores revealed a good responsiveness of the instrument. Hence, with a sufficient validity, reliability and sensitivity to changes, the D-OAR is a short and helpful instrument to assess the subjective perception of the nasal appearance in German patients.
Mazaheri, Maryam Amidi; Karbasi, Mojtaba
Background: With regard to large number of mobile users especially among college students in Iran, addiction to mobile phone is attracting increasing concern. There is an urgent need for reliable and valid instrument to measure this phenomenon. This study examines validity and reliability of the Persian version of mobile phone addiction scale (MPAIS) in college students. Materials and Methods: this methodological study was down in Isfahan University of Medical Sciences. One thousand one hundr...
Reijman, Max; Hazes, Mieke; Pols, Huib; Bernsen, Roos; Koes, Bart; Bierma-Zeinstra, Sita
textabstractOBJECTIVES: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the validity of the three definitions of hip OA is sex dependent. METHODS: SUBJECTS: from the Rotterdam study (aged > or= 55 years, n = 3585) were evaluated. The inter-rater reliability was tested in a random set of 148 x rays. ...
Li, Fengzhi; Li, Changji; Long, Yunfang; Zhan, Chenglie; Hennessy, Dwight
The present research was designed to examine the psychometric properties of Chinese versions of the Self Report Driver Behavior Aggression and Assertiveness subscales, the Driving Vengeance Questionnaire, and the Violent Driving Questionnaire. Study 1 found that the all scales demonstrated good internal consistency, with alphas ranging from .76 to .87 and that assertive driving was related to demerit points received over the past 12 months while driver aggression and violence were linked to collisions over the past 12 months. Study 2 found that the scales exhibited reasonable test-retest reliability, with correlations ranging from .82 to .89. Finally, Study 3 showed that each scale was predicted by other dangerous driving attitudes and behaviors, similar to the original versions. The consistency between the translated and original scales, the implications for use in a Chinese sample, and the uniformity of actions in the traffic environment across cultures are discussed.
Conclusion: Considering that Validity and Reliability factors of the questionnaire were be appropriate, it can be recommended that NIOSH Generic Job Stress Questionnaire (GJSQ can be used as a Valid and Reliable questionnaire for job stress evaluation in Iran.
Toohey, Liam Anthony; De Noronha, Marcos; Taylor, Carolyn; Thomas, James
Muscle strength measurement is a key component of physiotherapists' assessment and is frequently used as an outcome measure. A sphygmomanometer is an instrument commonly used to measure blood pressure that can be potentially used as a tool to assess isometric muscle strength. To systematically review the evidence on the reliability and validity of a sphygmomanometer for measuring isometric strength of hip muscles. A literature search was conducted across four databases. Studies were eligible if they presented data on reliability and/or validity, used a sphygmomanometer to measure isometric muscle strength of the hip region, and were peer reviewed. The individual studies were evaluated for quality using a standardized critical appraisal tool. A total of 644 articles were screened for eligibility, with five articles chosen for inclusion. The use of a sphygmomanometer to objectively assess isometric muscle strength of the hip muscles appears to be reliable with intraclass correlation coefficient values ranging from 0.66 to 0.94 in elderly and young populations. No studies were identified that have assessed the validity of a sphygmomanometer. The sphygmomanometer appears to be reliable for assessment of isometric muscle strength around the hip joint, but further research is warranted to establish its validity.
Schoppen, Tanneke; Boonstra, Antje; Groothoff, JW; de Vries, J; Goeken, LNH; Eisma, Willem
Objective: To determine the interrater and interrater reliability and the validity of the Timed "up and go" test as a measure for physical mobility in elderly patients with an amputation of the lower extremity. Design: To test interrater reliability, the test was performed for two observers at
Michailov, Michail Lubomirov; Baláš, Jirí; Tanev, Stoyan Kolev; Andonov, Hristo Stoyanov; Kodejška, Jan; Brown, Lee
Purpose: An advanced system for the assessment of climbing-specific performance was developed and used to: (a) investigate the effect of arm fixation (AF) on construct validity evidence and reliability of climbing-specific finger-strength measurement; (b) assess reliability of finger-strength and endurance measurements; and (c) evaluate the…
The Rey Visual Design Learning Test (Rey, 1964, in Spreen & Strauss, 1991) assesses immediate memory span, new learning and recognition for non-verbal material. Three studies are presented that focused on the reliability and validity of the RVDLT in primary school children. Test-retest reliability
Rae, James R.; Olson, Kristina R.
The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Brody, Michelle L.; And Others
Examined reliability and validity of binge eating disorder (BED), proposed for inclusion in Diagnostic and Statistical Manual of Mental Disorders (DSM), fourth edition. Interrater reliability of BED diagnosis compared favorably with that of most diagnoses in DSM revised third edition. Study comparing obese individuals with and without BED and…
Conclusion: The tool designed to assess bag-mask ventilation and tracheal intubation skills in anesthesia trainees demonstrated excellent inter-rater reliability, fair test-retest reliability, and good construct validity. The authors recommend its use for formative and summative assessment of junior anesthesia trainees.
Bartlett, Susan J; Barbic, Skye P; Bykerk, Vivian P
-FQ), and the voting results at OMERACT 2016. METHODS: Classic and modern psychometric methods were used to assess reliability, validity, sensitivity, factor structure, scoring, and thresholds. Interviews with patients and clinicians also assessed content validity, utility, and meaningfulness of RA-FQ scores. RESULTS......: People with RA in observational trials in Canada (n = 896) and France (n = 138), and an RCT in the Netherlands (n = 178) completed 5 items (11-point numerical rating scale) representing RA Flare core domains. There was moderate to high evidence of reliability, content and construct validity...... to identify and measure RA flares. Its review through OMERACT Filter 2.0 shows evidence of reliability, content and construct validity, and responsiveness. These properties merit its further validation as an outcome for clinical trials....
Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth
The aim of the study was to explore the validity of the social-moral awareness test (SMAT) a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor structure, reliability and discriminant validity were also assessed. Seventy-one participants with mild-moderate learning disabilities completed the two scales of the SMAT and two False Belief Tasks for Theory of Mind. Reliability of the SMAT was very good, and the scales were shown to be uni-dimensional in factor structure. There was a significant positive relationship between Theory of Mind and both SMAT scales. There is early evidence of the construct validity and reliability of the SMAT. Further assessment of the validity of the SMAT will be required. © 2012 Blackwell Publishing Ltd.
Salvetti, Maria Vittoria; Meyers, Johan; Sagaut, Pierre
The second Workshop on "Quality and Reliability of Large-Eddy Simulations", QLES2009, was held at the University of Pisa from September 9 to September 11, 2009. Its predecessor, QLES2007, was organized in 2007 in Leuven (Belgium). The focus of QLES2009 was on issues related to predicting, assessing and assuring the quality of LES. The main goal of QLES2009 was to enhance the knowledge on error sources and on their interaction in LES and to devise criteria for the prediction and optimization of simulation quality, by bringing together mathematicians, physicists and engineers and providing a platform specifically addressing these aspects for LES. Contributions were made by leading experts in the field. The present book contains the written contributions to QLES2009 and is divided into three parts, which reflect the main topics addressed at the workshop: (i) SGS modeling and discretization errors; (ii) Assessment and reduction of computational errors; (iii) Mathematical analysis and foundation for SGS modeling.
Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2 for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
Wickramasinghe, Nuwan Darshana; Dissanayake, Devani Sakunthala; Abeywardena, Gihan Sajiwa
The present study was aimed at assessing the validity and the reliability of the Sinhala version of the Utrecht Work Engagement Scale-Student Version (UWES-S) among collegiate cycle students in Sri Lanka. The 17-item UWES-S was translated to Sinhala and the judgmental validity was assessed by a multi-disciplinary panel of experts. Construct validity of the UWES-S was appraised by using multi-trait scaling analysis and exploratory factor analysis (EFA) on data obtained from a sample of 194 grade thirteen students in the Kurunegala district, Sri Lanka. Reliability of the UWES-S was assessed by using internal consistency and test-retest reliability. Except for item 13, all other items showed good psychometric properties in judgemental validity, item-convergent validity and item-discriminant validity. EFA using principal component analysis with Oblimin rotation, suggested a three-factor solution (including vigor, dedication and absorption subscales) explaining 65.4% of the total variance for the 16-item UWES-S (with item 13 deleted). All three subscales show high internal consistency with Cronbach's α coefficient values of 0.867, 0.819, and 0.903 and test-retest reliability was high (p valid and a reliable instrument to assess work engagement among collegiate cycle students in Sri Lanka.
Braun, Tobias; Marks, Detlef; Thiel, Christian; Grüneberg, Christian
To establish the validity and reliability of the de Morton Mobility Index (DEMMI) in patients with sub-acute stroke. This cross-sectional study was performed in a neurological rehabilitation hospital. We assessed unidimensionality, construct validity, internal consistency reliability, inter-rater reliability, minimal detectable change and possible floor and ceiling effects of the DEMMI in adult patients with sub-acute stroke. The study included a total sample of 121 patients with sub-acute stroke. We analysed validity (n = 109) and reliability (n = 51) in two sub-samples. Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 12.37, p = 0.577). All hypotheses on construct validity were confirmed. Internal consistency reliability (Cronbach's alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.95; 95% confidence interval: 0.92-0.97) were excellent. The minimal detectable change with 90% confidence was 13 points. No floor or ceiling effects were evident. These results indicate unidimensionality, sufficient internal consistency reliability, inter-rater reliability, and construct validity of the DEMMI in patients with a sub-acute stroke. Advantages of the DEMMI in clinical application are the short administration time, no need for special equipment and interval level data. The de Morton Mobility Index, therefore, may be a useful performance-based bedside test to measure mobility in individuals with a sub-acute stroke across the whole mobility spectrum. Implications for Rehabilitation The de Morton Mobility Index (DEMMI) is an unidimensional measurement instrument of mobility in individuals with sub-acute stroke. The DEMMI has excellent internal consistency and inter-rater reliability, and sufficient construct validity. The minimal detectable change of the DEMMI with 90% confidence in stroke rehabilitation is 13 points. The lack of any floor or ceiling effects on hospital admission indicates
D'Angelo, Daniela; Mastroianni, Chiara; Artico, Marco; Biagioli, Valentina; Latina, Roberto; Guarda, Michela; Piredda, Michela; De Marinis, Maria Grazia
Patients suffering from advanced disease face different care transitions. The transition from acute to palliative care is challenging and may lead to the discontinuity of care. Family caregivers become important sources of information, as patients begin to experience difficulties in coping with emotional transition events. The Care Transition Measure was developed to evaluate care transitions as experienced by the elderly. It has never been used in palliative care. The aim of this study was to test the validity and reliability of a modified version of the Palliative Care Transition Measure, specifically the Palliative Care Transition Measure for Caregivers (PCTM-C). The study included two main phases. Phase I focused on the construction of a modified version of the Palliative Care Transition Measure through two focus groups and by computing the content validity index. Phase II focused on testing the psychometric properties of the PCTM-C on 272 family caregivers through confirmatory factor analysis. Result The content validity index for each of the items was higher than 0.80, whereas that for the scale was 0.95. The model tested with confirmatory factor analysis fitted the data well and confirmed that the transition measures referred to communication, integrated care and a trusting-relationship, and therefore the core dimensions of continuity according to existing conceptual models. The internal consistency was high (Cronbach's alpha = 0.94). Significance of results The PCTM-C proved to be a suitable measure of the quality of such transitions. It may be used in clinical practice as a continuity quality indicator and has the potential to guide interventions to enhance family caregivers' experience of care continuity.
Qualidade de vida de crianças e adolescentes de São Paulo: confiabilidade e validade da versão brasileira do questionário genérico Pediatric Quality of Life InventoryTM versão 4.0 Quality of life of children and adolescents from São Paulo: reliability and validity of the Brazilian version of the Pediatric Quality of Life InventoryTM version 4.0 Generic Core Scales
Denise A. Klatchoian
Full Text Available OBJETIVOS: Avaliar a confiabilidade e a validade da versão brasileira do questionário genérico Pediatric Quality of Life InventoryTM (PedsQL TM 4.0 e mensurar a qualidade de vida de crianças e adolescentes saudáveis e de pacientes com doenças reumáticas. MÉTODOS: No processo de validação, seguimos a metodologia proposta pelos idealizadores da versão original em inglês do questionário PedsQL TM 4.0. O instrumento foi administrado por entrevista em dois grupos: 240 crianças e adolescentes aparentemente saudáveis de São Paulo (SP e 105 pacientes com doenças reumáticas crônicas, pareados por idade, e aos respectivos pais ou responsáveis. O questionário foi aplicado nos cuidadores e nas crianças separadamente e no mesmo dia. RESULTADOS: Os valores do teste alfa de Cronbach situaram-se entre 0,6 e 0,9 para todas as dimensões, demonstrando uma consistência interna adequada. Os pacientes com doenças reumáticas apresentaram uma redução significativa na qualidade de vida quando comparados com as crianças saudáveis (p OBJECTIVES: To evaluate the reliability and validity of the Brazilian version of the Pediatric Quality of Life InventoryTM (PedsQL TM 4.0 Generic Core Scales and measure the quality of life of healthy children and adolescents and patients with rheumatic diseases. METHODS: We followed the translation methodology proposed by the developer of the original English version of the PedsQL TM 4.0. The instrument was administered by interviews in two groups: 240 apparently healthy children and adolescents from São Paulo (SP, Brazil and 105 patients with chronic rheumatic diseases, matched by age, as well as to their respective parents or caregivers. The parent proxy-report was administered to the children's parents or caregivers separately on the same day. RESULTS: Cronbach's alpha values were between 0.6 and 0.9 for all dimensions, demonstrating adequate internal consistency. Patients with rheumatic diseases reported
Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh
The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at University Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts were identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regards to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass coefficient correlation range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
Suzuki, Eiko; Kanoya, Yuka; Katsuki, Takeshi; Sato, Chifumi
To verify the reliability and validity of a Japanese version of the Rathus Assertiveness Schedule in novice nurses to contribute to nursing management. An adequate scale is needed to measure the assertiveness and the effect of assertion training for Japanese nurses and to compare them with those in other countries. Rathus Assertiveness Schedule was adapted to Japanese with back-translation and its validity was examined in 989 novice nurses. The Japanese version showed a high coefficient of reliability in a split-half reliability test (r=0.76; PAssertiveness Schedule. The Japanese version of Rathus Assertiveness Schedule was verified.
Dere, Zeynep; Ömeroglu, Esra
This study, Creative Behavior Observation Form was developed to assess creativity of the children. While the study group on the reliability and validity of Creative Behavior Observation Form was being developed, 257 children in total who were at the ages of 5-6 were used as samples with stratified sampling method. Content Validity Index (CVI) and…
Yirci, Ramazan; Karakose, Turgut; Uygun, Harun; Ozdemir, Tuncay Yavuz
The purpose of this study is to adapt the Mentoring Relationship Effectiveness Scale to Turkish, and to conduct validity and reliability tests regarding the scale. The study group consisted of 156 university science students receiving graduate education. Construct validity and factor structure of the scale was analyzed first through exploratory…
Ng, Petrus; Su, Xiqing Susan; Chan, Vivien; Leung, Heidi; Cheung, Wendy; Tsun, Angela
This study validated a Perceived Campus Caring Scale with 1,520 university students. Using factor analysis, seven factors namely, Faculty Support, Nonfaculty Support, Peer Relationship, Sense of Detachment, Sense of Belonging, Caring Attitude, and Campus Involvement, are identified with high reliability, validity, and close correlation with the…
Biasutti, Michele; Frate, Sara
This article describes the development and validation of the Attitudes toward Sustainable Development scale, a quantitative 20-item scale that measures Italian university students' attitudes toward sustainable development. A total of 484 undergraduate students completed the questionnaire. The validity and reliability of the scale was statistically…
Vanbellingen, Tim; Nyffeler, Thomas; Nef, Tobias; Kwakkel, Gert; Bohlhalter, Stephan; van Wegen, Erwin E.H.
Background Patients with Parkinson's disease exhibit disturbed dexterity. Validated self-reported outcomes for dexterity in Parkinson's disease are lacking. The aim of this study was to investigate the reliability, content and construct validity of a new Dexterity Questionnaire 24. Methods One
M. Reijman (Max); J.M.W. Hazes (Mieke); H.A.P. Pols (Huib); R.M.D. Bernsen (Roos); B.W. Koes (Bart); S.M. Bierma-Zeinstra (Sita)
textabstractObjectives: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the validity of the three
Objective. We sought to determine the validity and reliability of a self-report physical activity questionnaire (PAQ) measuring physical activity/inactivity in South African schoolgirls of different ethnic origins. Methods. Construct validity of the PAQ was tested against physical activity energy expenditure estimated from an ...
The purpose of this study was to determine the test-retest reliability and concurrent validity of the short form (Form B) of the Coopersmith Self-Esteem Inventory. Criterion measures for validity included: (1) sociometric measures; (2) teacher's popularity ranking; and, (3) self-esteem rating. (Author/LMO)
Full Text Available Handwriting in Parkinson's disease (PD features specific abnormalities which are difficult to assess in clinical practice since no specific tool for evaluation of spontaneous movement is currently available.This study aims to validate the 'Systematic Screening of Handwriting Difficulties' (SOS-test in patients with PD.Handwriting performance of 87 patients and 26 healthy age-matched controls was examined using the SOS-test. Sixty-seven patients were tested a second time within a period of one month. Participants were asked to copy as much as possible of a text within 5 minutes with the instruction to write as neatly and quickly as in daily life. Writing speed (letters in 5 minutes, size (mm and quality of handwriting were compared. Correlation analysis was performed between SOS outcomes and other fine motor skill measurements and disease characteristics. Intrarater, interrater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC and Spearman correlation coefficient.Patients with PD had a smaller (p = 0.043 and slower (p 0.769 for both groups.The SOS-test is a short and effective tool to detect handwriting problems in PD with excellent reliability. It can therefore be recommended as a clinical instrument for standardized screening of handwriting deficits in PD.
Frandes, Mirela; Deiac, Anca V; Timar, Bogdan; Lungeanu, Diana
Nowadays, mobile technologies are part of everyday life, but the lack of instruments to assess their acceptability for the management of chronic diseases makes their actual adoption for this purpose slow. The objective of this study was to develop a survey instrument for assessing patients' attitude toward and intention to use mobile technology for diabetes mellitus (DM) self-management, as well as to identify sociodemographic characteristics and quality of life factors that affect them. We first conducted the documentation and instrument design phases, which were subsequently followed by the pilot study and instrument validation. Afterward, the instrument was administered 103 patients (median age: 37 years; range: 18-65 years) diagnosed with type 1 or type 2 DM, who accepted to participate in the study. The reliability and construct validity were assessed by computing Cronbach's alpha and using factor analysis, respectively. The instrument included statements about the actual use of electronic devices for DM management, interaction between patient and physician, attitude toward using mobile technology, and quality of life evaluation. Cronbach's alpha was 0.9 for attitude toward using mobile technology and 0.97 for attitude toward using mobile device applications for DM self-management. Younger patients (Spearman's ρ =-0.429; P higher education level (Kendall's τ =0.51; P mobile assistive applications for DM control. Moreover, patients with a higher quality of life presented a significantly more positive attitude toward using modern technology (Spearman's ρ =0.466; P mobile technology for DM self-management. Additionally, we found that even if most of the patients showed positive attitude toward mobile applications, only a moderate level of intention to indeed use them was observed. Moreover, the study indicated that barriers were truthfulness and easiness to use.
Zhang, C; Yang, G P; Li, Z; Li, X N; Li, Y; Hu, J; Zhang, F Y; Zhang, X J
Objective: To assess the reliability and validity of the Chinese version on Alcohol Use Disorders Identification Test (AUDIT) among medical students in China and to provide correct way of application on the recommended scales. Methods: An E-questionnaire was developed and sent to medical students in five different colleges. Students were all active volunteers to accept the testings. Cronbach's α and split-half reliability were calculated to evaluate the reliability of AUDIT while content, contract, discriminant and convergent validity were performed to measure the validity of the scales. Results: The overall Cronbach's α of AUDIT was 0.782 and the split-half reliability was 0.711. Data showed that the domain Cronbach's α and split-half reliability were 0.796 and 0.794 for hazardous alcohol use, 0.561 and 0.623 for dependence symptoms, and 0.647 and 0.640 for harmful alcohol use. Results also showed that the content validity index on the levels of items I-CVI) were from 0.83 to 1.00, the content validity index of scale level (S-CVI/UA) was 0.90, content validity index of average scale level (S-CVI/Ave) was 0.99 and the content validity ratios (CVR) were from 0.80 to 1.00. The simplified version of AUDIT supported a presupposed three-factor structure which could explain 61.175% of the total variance revealed through exploratory factor analysis. AUDIT semed to have good convergent and discriminant validity, with the success rate of calibration experiment as 100%. Conclusion: AUDIT showed good reliability and validity among medical students in China thus worth for promotion on its use.
Qolipour, Mohammad; Torabipour, Amin; Khiavi, Farzad Faraji; Malehi, Amal Saki
Assessing service quality is one of the basic requirements to develop the medical tourism industry. There is no valid and reliable tool to measure service quality of medical tourism. This study aimed to determine the reliability and validity of a Persian version of medical tourism service quality questionnaire for Iranian hospitals. To validate the medical tourism service quality questionnaire (MTSQQ), a cross-sectional study was conducted on 250 Iraqi patients referred to hospitals in Ahvaz (Iran) from 2015. To design a questionnaire and determine its content validity, the Delphi Technique (3 rounds) with the participation of 20 medical tourism experts was used. Construct validity of the questionnaire was assessed through exploratory and confirmatory factor analysis. Reliability was assessed using Cronbach's alpha coefficient. Data were analyzed by Excel 2007, SPSS version18, and Lisrel l8.0 software. The content validity of the questionnaire with CVI=0.775 was confirmed. According to exploratory factor analysis, the MTSQQ included 31 items and 8 dimensions (tangibility, reliability, responsiveness, assurance, empathy, exchange and travel facilities, technical and infrastructure facilities and safety and security). Construct validity of the questionnaire was confirmed, based on the goodness of fit quantities of model (RMSEA=0.032, CFI= 0.98, GFI=0.88). Cronbach's alpha coefficient was 0.837 and 0.919 for expectation and perception questionnaire. The results of the study showed that the medical tourism SERVQUAL questionnaire with 31 items and 8 dimensions was a valid and reliable tool to measure service quality of medical tourism in Iranian hospitals.
Køster, B; Søndergaard, J; Nielsen, J B; Olsen, A; Bentzen, J
An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high. The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed questionnaire for monitoring and evaluating population sun-related behavior. Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly in a questionnaire adapted to measure behavior throughout the summer. The response rates for questionnaire 1, 2 and 3 were high and the drop out was not dependent on demographic characteristic. There was at least 73% agreement between sunburns in the measurement week and the entire summer, and a possible sunburn underestimation in questionnaires summarizing the entire summer. The participants underestimated their outdoor exposure in the evaluation covering the entire summer as compared to the measurement week. The reliability of scales measuring attitude and knowledge was high for majority of scales, while consistency in protection behavior was low. To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability, while consistency of protection behavior in general and in a week's measurement was low.
Sejbæk, Tobias; Blaabjerg, Morten; Sprogøe, Pippi
. The Multiple Sclerosis Neuropsychological Screening Questionnaire (MSNQ) has previously shown good validity in American, Argentinean, and Dutch MS cohorts. We sought to test reliability and validity of a Danish translation of the MSNQ compared with formal neuropsychological testing, and measures of depression...... the Expanded Disability Status Scale and MS Impairment Scale. Results: The test-retest reliability of the MSNQ-P was significant (R2 = 0.79, P ... that the MSNQ-P measures these items more than the cognitive abilities of the patients. Conclusions: This study does not support use of the MSNQ as a sensitive or valid screening tool for cognitive impairment in Danish patients with MS....
Alanay, Ahmet; Cil, Akin; Berk, Haluk; Acaroglu, R Emre; Yazici, Muharrem; Akcali, Omer; Kosay, Can; Genc, Yasemin; Surat, Adil
Outcome study to determine the internal consistency, and validity of adapted Turkish version of Scoliosis Research Society-22 (SRS-22) Instrument. To evaluate the validity and reliability of adapted Turkish Version of SRS-22 questionnaire. The SRS-22 questionnaire is a widely accepted questionnaire to assess the health-related quality of life for scoliotic patients in the United States. However, its adaptation in languages other than the source language is necessary for its multinational use. Translation/retranslation of the English version of the SRS-22 was done, and all steps for cross-cultural adaptation process were performed properly by an expert committee. Later, SRS-22 questionnaires and previously validated Short Form-36 (SF-36) outcome instruments were mailed to 82 patients who had been surgically treated for idiopathic scoliosis. All patients had a minimum of 2 years follow-up. Fifty-four patients (66%) responded to the first set of questionnaires. Forty-seven of the first time respondents returned their second survey. The average age of the 47 patients (12 male, 35 female) was 19.8 years (range, 14-31 years). The two measures of reliability as internal consistency and reproducibility were determined by Cronbach alpha statistics and intraclass correlation coefficient, respectively. Concurrent validity was measured by comparing with an already validated questionnaire (SF-36). Measurement was made using the Pearson correlation coefficient (r). The study demonstrated satisfactory internal consistency with high Cronbach alpha values for the four of the corresponding domains (pain, 0.72; self-image, 0.80; mental health, 0.72; and satisfaction, 0.83). However, the Cronbach alpha value for function/activity domain (0.48) was considerably lower than the original questionnaire. The intraclass correlation coefficient for the same domains was 0.80, 0.82, 0.78, 0.81, and 0.76, respectively, demonstrating a satisfactory test/retest reproducibility. Considering
Jesús F. Salgado
Full Text Available There is criticism in the literature about the use of interrater coefficients to correct for criterion reliability in validity generalization (VG studies and disputing whether .52 is an accurate and non-dubious estimate of interrater reliability of overall job performance (OJP ratings. We present a second-order meta-analysis of three independent meta-analytic studies of the interrater reliability of job performance ratings and make a number of comments and reflections on LeBreton et al.s paper. The results of our meta-analysis indicate that the interrater reliability for a single rater is .52 (k = 66, N = 18,582, SD = .105. Our main conclusions are: (a the value of .52 is an accurate estimate of the interrater reliability of overall job performance for a single rater; (b it is not reasonable to conclude that past VG studies that used .52 as the criterion reliability value have a less than secure statistical foundation; (c based on interrater reliability, test-retest reliability, and coefficient alpha, supervisor ratings are a useful and appropriate measure of job performance and can be confidently used as a criterion; (d validity correction for criterion unreliability has been unanimously recommended by "classical" psychometricians and I/O psychologists as the proper way to estimate predictor validity, and is still recommended at present; (e the substantive contribution of VG procedures to inform HRM practices in organizations should not be lost in these technical points of debate.
Keessen, Paul; Maaskant, Jolanda; Visser, Bart
The standardized Mensendieck test (SMT) was developed to quantify posture, movement, gait, and respiration. In the hands of an experienced therapist, the SMT is proven to be a reliable tool. It is unclear whether posture, movement, gait, and respiration are related to the degree of functional disability in patients with chronic pain. The objective of this study was to assess the reliability and convergent validity of the SMT in a heterogeneous sample of 50 patients with chronic pain. Internal consistency was determined by Cronbach's α and interrater reliability by the intraclass correlation coefficient (ICC). Convergent validity was assessed by determining the Spearman rank correlation coefficient between the movement quality measured in the SMT and functional limitation measured on the disability rating index (DRI). The internal consistency was Cronbach's α 0.91. Substantial reliability was found for the items: movement (ICC = 0.68), gait (ICC = 0.69), sitting posture (ICC = 0.63), and respiration (ICC = 0.64). Insufficient reliability was found for standing posture (ICC = 0.23). A moderate correlation was found between average test score SMT and the DRI (r = -0.37) and respiration and DRI (r = -0.45). The SMT is a reasonably reliable tool to assess movement, gait, sitting posture, and respiration. None of the items in the domain standing posture has sufficient reliability. A thorough study of this domain should be considered. The results show little evidence for convergent validity. Several items of the SMT correlated moderately with functional limitation with the DRI. These items were global movement, hip flexion, pelvis rotation, and all respiration items.
Cousson-Gélie, Florence; de Chalvron, Stéphanie; Zozaya, Carole; Lafaye, Anaïs
Among psychosocial factors affecting emotional adjustment and quality of life, social support is one of the most important and widely studied in cancer patients, but little is known about the perception of support in specific significant relationships in patients with cancer. This study examined the psychometric properties of the Quality of Relationship Inventory (QRI) by evaluating its factor structure and its convergent and discriminant validity in a sample of cancer patients. A total of 388 patients completed the QRI. Convergent validity was evaluated by testing the correlations between the QRI subscales and measures of general social support, anxiety and depression symptoms. Discriminant validity was examined by testing group comparison. The QRI's longitudinal invariance across time was also tested. Principal axis factor analysis with promax rotation identified three factors accounting for 42.99% of variance: perceived social support, depth, and interpersonal conflict. Estimates of reliability with McDonald's ω coefficient were satisfactory for all the QRI subscales (ω ranging from 0.75 - 0.85). Satisfaction from general social support was negatively correlated with the interpersonal conflict subscale and positively with the depth subscale. The interpersonal conflict and social support scales were correlated with depression and anxiety scores. We also found a relative stability of QRI subscales (measured 3 months after the first evaluation) and differences between partner status and gender groups. The Quality of Relationship Inventory is a valid tool for assessing the quality of social support in a particular relationship with cancer patients.
Müller, Alessandra Bombarda; Valentini, Nadia Cristina; Bandeira, Paulo Felipe Ribeiro
The range of stimuli provided by physical space, toys and care practices contributes to the motor, cognitive and social development of children. However, assessing the quality of child education environments is a challenge, and can be considered a health promotion initiative. This study investigated the validity of the criterion, content, construct and reliability of the Affordances in the Home Environment for Motor Development - Infant Scale (AHEMD-IS), version 3-18 months, for the use in daycare settings. Content validation was conducted with the participation of seven motor development and health care experts; and, face validity by 20 specialists in health and education. The results indicate the suitability of the adapted AHEMD-IS, evidencing its validity for the daycare setting a potential tool to assess the opportunities that the collective context offers to child development. Copyright © 2017 Elsevier Inc. All rights reserved.
Rikkert, Marcel G M Olde; Tona, Klodiana Daphne; Janssen, Lieneke
New staging systems of dementia require adaptation of disease management programs and adequate staging instruments. Therefore, we systematically reviewed the literature on validity and reliability of clinically applicable, multidomain, and dementia staging instruments. A total of 23 articles...... describing 12 staging instruments were identified (N = 6109 participants, age 65-87). Reliability was studied in most (91%) of the articles and was judged moderate to good. Approximately 78% of the articles evaluated concurrent validity, which was good to very good, while discriminant validity was assessed...... in only 25%. The scales can be applied in ±15 minutes. Clinical Dementia Rating (CDR), Global Deterioration scale (GDS), and Functional Assessment Staging (FAST) have been monitored on reliability and validity, and the CDR currently is the best-evidenced scale, also studied in international perspective...
Jillian E. Frideres
Full Text Available The purpose of this study was to design and to test the validity and reliability of an instrument to evaluate coaches' knowledge about the female athlete triad syndrome and their confidence in this knowledge. The instrument collects information regarding: knowledge of the syndrome, components, prevention and intervention; confidence of the coaches in their answers; and coach's characteristics (gender, degree held, years of experience in coaching females, continuing education participation specific to the syndrome and its components, and sport coached. The process of designing the questionnaire and testing the validity and reliability of it was done in four phases: a design and development of the instrument, b content validity, c instrument reliability, and d concurrent validity. The results show that the instrument is suitable for measuring coaches' female athlete triad knowledge. The instrument can contribute to assessing the coaches' knowledge level in relation to this topic.
Pololi, Linda H; Evans, Arthur T; Civian, Janet T; Gibbs, Brian K; Gillum, Linda H; Brennan, Robert T
Despite the well-recognized benefits of mentoring in academic medicine, there is a lack of clarity regarding what constitutes effective mentoring. We developed a tool to assess mentoring activities experienced by faculty and evaluated evidence for its validity. The National Initiative on Gender, Culture, and Leadership in Medicine-"C-Change"-previously developed the C-Change Faculty Survey to assess the culture of academic medicine. After intensive review, we added six items representing six components of mentoring to the survey-receiving help with career and personal goals, learning skills, sponsorship, and resources. We tested the items in four academic health centers during 2013 to 2014. We estimated reliability of the new items and tested the correlation of the new items with a mentoring composite variable representing faculty mentoring experiences as positive, neutral, or inadequate and with other C-Change dimensions of culture. Among the 1520 responding faculty (response rate 61-63%), there was a positive association between each of the six mentoring activities and satisfaction with both the amount and quality of mentoring received. There was no difference by sex. Cronbach α coefficients ranged from 0.89 to 0.95 across subgroups of faculty (by sex, race, and principal roles). The mentoring responses were associated most closely with dimensions of Institutional Support (r = 0.58, P Mentoring scale is a valid instrument to assess mentoring. Survey results could facilitate mentoring program development and evaluation.
Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra
The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
Lonjon, Guillaume; Ilharreborde, Brice; Odent, Thierry; Moreau, Sébastien; Glorion, Christophe; Mazda, Keyvan
Outcome study to determine the internal consistency, reproducibility, and concurrent validity of the French-Canadian version of the Scoliosis Research Society 22 (SRS-22 fcv) patient questionnaire in France. To determine whether the SRS-22 fcv can be used in a population from France. The SRS-22 has been translated and validated in multiple countries, notably in the French-Canadian language in Quebec, Canada. Use of SRS-22 fcv seems appropriate for evaluating adolescent idiopathic scoliosis in France. However, French-Canadian French is noticeably different from the French spoken in France, and no study has investigated the use of a French-Canadian version of a health-quality questionnaire in another French population. The methods used for validating the SRS-22 fcv in Quebec were adopted for use with a group of 200 adolescents with idiopathic scoliosis and 60 healthy adolescents in France. Reliability and reproducibility were measured by the Cronbach α and intraclass correlation coefficient (ICC), construct validity by factorial analysis, concurrent validity by the Short-Form of the survey, and discriminant validity by analysis of variance and multivariate linear regression. In France, the SRS-22 fcv showed good global internal consistency (Cronbach α = 0.87, intraclass correlation coefficient = 0.92), a coherent factorial structure, and high correlation coefficients between the SRS-22 fcv and Short-Form of the survey (P < 0.001). However, reliability and validity were slightly less than that for the instrument's original validation and the validation of the SRS-22 fcv in Quebec. These differences could be explained by language and cultural differences. The SRS-22 fcv is relevant for use in France, but further development and validation of a specific French questionnaire remain necessary to improve the assessment of functional outcomes of adolescents with scoliosis in France. N/A.
Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Eric Swanson, MD
Full Text Available Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results.
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
Introduction Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. Objective To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. Method The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. Results The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Conclusion Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results. PMID:29736104
Casartelli, Nicola; Müller, Roland; Maffiuletti, Nicola A
The aim of the present study was to verify the validity and reliability of the Myotest accelerometric system (Myotest SA, Sion, Switzerland) for the assessment of vertical jump height. Forty-four male basketball players (age range: 9-25 years) performed series of squat, countermovement and repeated jumps during 2 identical test sessions separated by 2-15 days. Flight height was simultaneously quantified with the Myotest system and validated photoelectric cells (Optojump). Two calculation methods were used to estimate the jump height from Myotest recordings: flight time (Myotest-T) and vertical takeoff velocity (Myotest-V). Concurrent validity was investigated comparing Myotest-T and Myotest-V to the criterion method (Optojump), and test-retest reliability was also examined. As regards validity, Myotest-T overestimated jumping height compared to Optojump (p 0.98), that is, excellent validity. Myotest-V overestimated jumping height compared to Optojump (p 12 cm), high limits of agreement ratios (>36%), and low ICCs (9 cm). In conclusion, Myotest-T is a valid and reliable method for the assessment of vertical jump height, and its use is legitimate for field-based evaluations, whereas Myotest-V is neither valid nor reliable.
Rokotonarivo, Sarobidy; Schaafsma, Marije; Hockley, Neal
reliability measures. DCE results were generally consistent with those of other stated preference techniques (convergent validity), but hypothetical bias was common. Evidence supporting theoretical validity (consistency with assumptions of rational choice theory) was limited. In content validity tests, 2...
Ghaemi, Hamide; Khoddami, Seyyedeh Maryam; Soleymani, Zahra; Zandieh, Fariborz; Jalaie, Shohreh; Ahanchian, Hamid; Khadivi, Ehsan
The aim of this study was to develop, validate, and assess the reliability of the Persian version of Vocal Cord Dysfunction Questionnaire (VCDQ P ). The study design was cross-sectional or cultural survey. Forty-four patients with vocal fold dysfunction (VFD) and 40 healthy volunteers were recruited for the study. To assess the content validity, the prefinal questions were given to 15 experts to comment on its essential. Ten patients with VFD rated the importance of VCDQ P in detecting face validity. Eighteen of the patients with VFD completed the VCDQ 1 week later for test-retest reliability. To detect absolute reliability, standard error of measurement and smallest detected change were calculated. Concurrent validity was assessed by completing the Persian Chronic Obstructive Pulmonary Disease (COPD) Assessment Test (CAT) by 34 patients with VFD. Discriminant validity was measured from 34 participants. The VCDQ was further validated by administering the questionnaire to 40 healthy volunteers. Validation of the VCDQ as a treatment outcome tool was conducted in 18 patients with VFD using pre- and posttreatment scores. The internal consistency was confirmed (Cronbach α = 0.78). The test-retest reliability was excellent (intraclass correlation coefficient = 0.97). The standard error of measurement and smallest detected change values were acceptable (0.39 and 1.08, respectively). There was a significant correlation between the VCDQ P and the CAT total scores (P validity was significantly different. The VCDQ scores in patients with VFD before and after treatment was significantly different (P valid and reliable self-administered questionnaire in Persian-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Azam, Mohammad; Tu, Fang; Shlapak, Yuri; Kirubarajan, Thiagalingam; Pattipati, Krishna R.; Karanam, Rajaiah
The deregulation of energy markets, the ongoing advances in communication networks, the proliferation of intelligent metering and protective power devices, and the standardization of software/hardware interfaces are creating a dramatic shift in the way facilities acquire and utilize information about their power usage. The currently available power management systems gather a vast amount of information in the form of power usage, voltages, currents, and their time-dependent waveforms from a variety of devices (for example, circuit breakers, transformers, energy and power quality meters, protective relays, programmable logic controllers, motor control centers). What is lacking is an information processing and decision support infrastructure to harness this voluminous information into usable operational and management knowledge to handle the health of their equipment and power quality, minimize downtime and outages, and to optimize operations to improve productivity. This paper considers the problem of evaluating the capacity and reliability analyses of power systems with very high availability requirements (e.g., systems providing energy to data centers and communication networks with desired availability of up to 0.9999999). The real-time capacity and margin analysis helps operators to plan for additional loads and to schedule repair/replacement activities. The reliability analysis, based on computationally efficient sum of disjoint products, enables analysts to decide the optimum levels of redundancy, aids operators in prioritizing the maintenance options for a given budget and monitoring the system for capacity margin. The resulting analytical and software tool is demonstrated on a sample data center.
Pedrero-Pérez, Eduardo J; Díaz-Olalla, José Manuel
The COOP/WONCA test was initially proposed as a self-report in which the answers were supported by drawings illustrating the state investigated. Subsequent studies have confirmed its usefulness as a mere verbal self-report face-to-face administered. No data have been found about its useful when administered by telephone interview. The aim of this study was to determine the psychometric properties of the COOP / WONCA test to measure Related Quality of Life (HRQoL) administered by telephone and compare them with those obtained in other forms of prior administration. Cross-sectional study on a random. City of Madrid. Random sample of 802 adult subjects, representative of the adult population in Madrid, obtained by stratification from the population census. Questionnaire COOP/WONCA with 9 ítems included in a broader battery, administered by telephone interview. The unrestricted factor analysis points to the unifactoriality of the scale, which measures a single latent construct (HRQOL), showing high internal consistency, not significantly different from those found by face-to-face administration, ruling out the existence of biases in the phone modality. The COOP/WONCA test appears as a reliable and valid measure of HRQOL and telephonic administration allows to assume no changes in the results, which can reduce costs in population studies, increasing efficiency without loss of quality in the information collected. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.
Szucs, Kimberly A; Brown, Elena V Donoso
[Purpose] Measurement of posture is important for those with a clinical diagnosis as well as researchers aiming to understand the impact of faulty postures on the development of musculoskeletal disorders. A reliable, cost-effective and low tech posture measure may be beneficial for research and clinical applications. The purpose of this study was to determine rater reliability and construct validity of a posture screening mobile application in healthy young adults. [Subjects and Methods] Pictures of subjects were taken in three standing positions. Two raters independently digitized the static standing posture image twice. The app calculated posture variables, including sagittal and coronal plane translations and angulations. Intra- and inter-rater reliability were calculated using the appropriate ICC models for complete agreement. Construct validity was determined through comparison of known groups using repeated measures ANOVA. [Results] Intra-rater reliability ranged from 0.71 to 0.99. Inter-rater reliability was good to excellent for all translations. ICCs were stronger for translations versus angulations. The construct validity analysis found that the app was able to detect the change in the four variables selected. [Conclusion] The posture mobile application has demonstrated strong rater reliability and preliminary evidence of construct validity. This application may have utility in clinical and research settings.
Kolokotroni, Philippa; Anagnostopoulos, Fotios; Missitzis, Ioannis
The study and measurement of psychosocial adjustment is important for evaluating patients' well-being, and assessing the illness's course, treatment's success, and patients' recovery. In this study, internal consistency reliability and construct validity of the Greek version of the Psychosocial Adjustment to Illness Scale-Self-Report (PAIS-SR) were examined. Demographic and psychosocial data were collected from a sample of 243 women with breast cancer, recruited from September 2011 to December 2012. With some exceptions in specific items, the original conceptually-derived PAIS-SR subscales emerged in a seven-factor solution. Social Environment, Job and Household Duties, and Psychological Distress accounted for more of the total variance than other subscales. PAIS-SR showed good internal consistency reliability, with Cronbach's alpha coefficients >0.62. Correlations of PAIS-SR domains with measures of quality of life and posttraumatic stress symptoms supported the convergent validity of the PAIS-SR and its significance for cancer research. The Greek version of the PAIS-SR has acceptable internal consistency reliability and construct validity, as well as satisfactory convergent validity. Results provide some suggestions for the development of programs to evaluate adjustment status and implement psychosocial interventions among breast cancer survivors.
Lupi, Jaqueline Basilio; Carvalho de Abreu, Daniela Cristina; Ferreira, Mariana Candido; Oliveira, Renê Donizeti Ribeiro de; Chaves, Thais Cristina
This study aimed to culturally adapt and validate the Revised Fibromyalgia Impact Questionnaire (FIQR) to Brazilian Portuguese, by the use of analysis of internal consistency, reliability, and construct and structural validity. A total of 100 female patients with fibromyalgia participated in the validation process of the Brazilian Portuguese version of the FIQR (FIQR-Br).The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Pearson's rank correlation for construct validity, and confirmatory factor analysis (CFA) for structural validity. It was verified excellent levels of reliability, with ICC greater than 0.75 for all questions and domains of the FIQR-Br. For internal consistency, alpha values greater than 0.70 for the items and domains of the questionnaire were observed. Moderate (0.40 0.70) correlations were observed for the scores of domains and total score between the FIQR-Br and FIQ-Br. The structure of the three domains of the FIQR-Br was confirmed by CFA. The results of this study suggest that that the FIQR-Br is a reliable and valid instrument for assessing fibromyalgia-related impact, and supports its use in clinical settings and research. The structure of the three domains of the FIQR-Br was also confirmed. Implications for Rehabilitation Fibromyalgia is a chronic musculoskeletal disorder characterized by widespread and diffuse pain, fatigue, sleep disturbances, and depression. The disease significantly impairs patients' quality of life and can be highly disabling. To be used in multicenter research efforts, the Revised Fibromyalgia Impact Questionnaire (FIQR) must be cross-culturally validated and psychometrically tested. This paper will make available a new version of the FIQR-Br since another version already exists, but there are concerns about its measurement properties. The availability of an instrument adapted to and validated for Brazilian
Full Text Available An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high.The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed questionnaire for monitoring and evaluating population sun-related behavior.Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly in a questionnaire adapted to measure behavior throughout the summer.The response rates for questionnaire 1, 2 and 3 were high and the drop out was not dependent on demographic characteristic. There was at least 73% agreement between sunburns in the measurement week and the entire summer, and a possible sunburn underestimation in questionnaires summarizing the entire summer. The participants underestimated their outdoor exposure in the evaluation covering the entire summer as compared to the measurement week. The reliability of scales measuring attitude and knowledge was high for majority of scales, while consistency in protection behavior was low.To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability, while consistency of protection behavior in general and in a week's measurement was low. Keywords: Questionnaire, Validation, Reliability, Skin cancer, Prevention, Ultraviolet radiation
Huang, X N; Zhang, Y; Feng, W W; Wang, H S; Cao, B; Zhang, B; Yang, Y F; Wang, H M; Zheng, Y; Jin, X M; Jia, M X; Zou, X B; Zhao, C X; Robert, J; Jing, Jin
Objective: To evaluate the reliability and validity of warning signs checklist developed by the National Health and Family Planning Commission of the People's Republic of China (NHFPC), so as to determine the screening effectiveness of warning signs on developmental problems of early childhood. Method: Stratified random sampling method was used to assess the reliability and validity of checklist of warning sign and 2 110 children 0 to 6 years of age(1 513 low-risk subjects and 597 high-risk subjects) were recruited from 11 provinces of China. The reliability evaluation for the warning signs included the test-retest reliability and interrater reliability. With the use of Age and Stage Questionnaire (ASQ) and Gesell Development Diagnosis Scale (GESELL) as the criterion scales, criterion validity was assessed by determining the correlation and consistency between the screening results of warning signs and the criterion scales. Result: In terms of the warning signs, the screening positive rates at different ages ranged from 10.8%(21/141) to 26.2%(51/137). The median (interquartile) testing time for each subject was 1(0.6) minute. Both the test-retest reliability and interrater reliability of warning signs reached 0.7 or above, indicating that the stability was good. In terms of validity assessment, there was remarkable consistency between ASQ and warning signs, with the Kappa value of 0.63. With the use of GESELL as criterion, it was determined that the sensitivity of warning signs in children with suspected developmental delay was 82.2%, and the specificity was 77.7%. The overall Youden index was 0.6. Conclusion: The reliability and validity of warning signs checklist for screening early childhood developmental problems have met the basic requirements of psychological screening scales, with the characteristics of short testing time and easy operation. Thus, this warning signs checklist can be used for screening psychological and behavioral problems of early childhood
Li, Z; Yang, Y M; Zhang, C; Li, Y; Hu, J; Gao, L W; Zhou, Y X; Zhang, X J
Objective: To assess the reliability and validity of the Chinese version of Driving Anger Scale (DAS) in professional drivers in China and provide a scientific basis for the application of the scale in drivers in China. Methods: Professional drivers, including taxi drivers, bus drivers, truck drivers and school bus drivers, were selected to complete the questionnaire. Cronbach's α and split-half reliability were calculated to evaluate the reliability of DAS, and content, contract, discriminant and convergent validity were performed to measure the validity of the scale. Results: The overall Cronbach's α of DAS was 0.934 and the split-half reliability was 0.874. The correlation coefficient of each subscale with the total scale was 0.639-0.922. The simplified version of DAS supported a presupposed six-factor structure, explaining 56.371% of the total variance revealed by exploratory factor analysis. The DAS had good convergent and discriminant validity, with the success rate of calibration experiment of 100%. Conclusion: DAS has a good reliability and validity in professional drivers in China, and the use of DAS is worth promoting in divers.
Full Text Available Background: The purpose of this study was to evaluate the validity and reliability on the Persian translation of the Modifiable Activity Questionnaire (MAQ in a sample of Tehranian adolescents. Methods: Of a total of 52 subjects, a sub-sample of 40 participations (55.0% boys was used to assess the reliability and the validity of the physical activity questionnaire. The reliability of the two MAQs was calculated by intraclass correlation coefficients, and validation was evaluated using Pearson correlation coefficients to compare data between mean of the two MAQs and mean of four physical activity records. Results: Intraclass correlation coefficient was calculated to assess the reliability between two MAQs and the results of leisure time physical activity over the past year were 0.97. Pearson correlation coefficients between mean of two MAQs and mean of four physical activity records were 0.49 (P < 0.001, for leisure time physical activities. Conclusions: High reliability and relatively moderate validity were found for the Persian translation of the MAQ in a Tehranian adolescent population. Further studies with large sample size are suggested to assess the validity more precisely.
Vendrig, A A; Schaafsma, F G
Purpose The purpose of this study is to measure the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool that is used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods Over the years seven different samples of workers, patients and sick listed workers varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results The 13 scales displayed good internal consistency and test-retest reliability. The constructive validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmative factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore the work ability.
Ling, Samuel K K; Chan, Vincent; Ho, Karen; Ling, Fona; Lui, T H
Develop the first reliable and validated open-source outcome scoring system in the Chinese language for foot and ankle problems. Translation of the English FAOS into Chinese following regular protocols. First, two forward-translations were created separately, these were then combined into a preliminary version by an expert committee, and was subsequently back-translated into English. The process was repeated until the original and back translations were congruent. This version was then field tested on actual patients who provided feedback for modification. The final Chinese FAOS version was then tested for reliability and validity. Reliability analysis was performed on 20 subjects while validity analysis was performed on 50 subjects. Tools used to validate the Chinese FAOS were the SF36 and Pain Numeric Rating Scale (NRS). Internal consistency between the FAOS subgroups was measured using Cronbach's alpha. Spearman's correlation was calculated between each subgroup in the FAOS, SF36 and NRS. The Chinese FAOS passed both reliability and validity testing; meaning it is reliable, internally consistent and correlates positively with the SF36 and the NRS. The Chinese FAOS is a free, open-source scoring system that can be used to provide a relatively standardised outcome measure for foot and ankle studies. Copyright © 2017 Elsevier Ltd. All rights reserved.
Marco Aurelio Lumertz Saffi
Full Text Available Using a sample of patients with coronary artery disease, this methodological study aimed to conduct a cross-cultural adaptation and validation of a questionnaire on knowledge of cardiovascular risk factors (Q-FARCS, lifestyle changes, and treatment adherence for use in Brazil. The questionnaire has three scales: general knowledge of risk factors (RFs; specific knowledge of these RFs; and lifestyle changes achieved. Cross-cultural adaptation included translation, synthesis, back-translation, expert committee review, and pretesting. Face and content validity, reliability, and construct validity were measured. Cronbach’s alpha for the total sample (n = 240 was 0.75. Assessment of psychometric properties revealed adequate face and content validity, and the construct revealed seven components. It was concluded that the Brazilian version of Q-FARCS had adequate reliability and validity for the assessment of knowledge of cardiovascular RFs.
Hanif, Faisal; Read, Janet C; Goodacre, John A; Chaudhry, Afzal; Gibbs, Paul
The Internet has made it possible for patients and their families to access vast quantities of information that previously would have been difficult for anyone but a physician or librarian to obtain. Health information websites, however, are recognised to differ widely in quality and reliability of their content. This has led to the development of various codes of conduct or quality rating tools to assess the quality of health websites. However, the validity and reliability of these quality tools and their applicability to different health websites also varies. In principle, rating tools should be available to consumers, require a limited number of elements to be assessed, be assessable in all elements, be readable and be able to gauge the readability and consistency of information provided from a patient's view point. This article reviews the literature on the trends of the Internet use for health and analyses various codes of conduct/ethics or 'quality tools' available to monitor the quality of health websites from a patient perspective.
Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument. PMID:22938557
Helmerhorst Hendrik JF
Full Text Available Abstract Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA and in particular by physical activity questionnaires (PAQs remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument.
Campo-Arias, Adalberto; Lafaurie, María Mercedes; Gaitán-Duarte, Hernando G
There are several scales to quantify homophobia in different populations. However, the reliability and validity of these instruments among Colombian students are unknown. Consequently, this work is intended to assess reliability (inner consistency) as well as the validity of the Scale for Homophobia in Medicine students from a private university in Bogotá (Colombia). Methodological study with 199 Medicine students from 1st to 5th semester that filled out the Homophobia Scale form, the general welfare questionnaire, the Attitude Towards Gays and Lesbians Scale (ATGL), WHO-5 (divergent validity) and the Francis Scale of Attitude Toward Christianity (nomologic validity). Pearson's correlations were computed, the Cronbach's alfa coefficient, the omega coefficient (construct's reliability) and confirmatory factorial analysis. The Scale for Homophobia showed an alpha Cronbach coefficient of 0,785, an omega coefficient of 0,790 and a Pearson correlation with the ATGL of 0,844; with WHO-5, -0,059; and a Francis Scale of Attitude Toward Christianity, 0,187. The Scale toward Homophobia exhibited a relevant factor of 44,7% of the total variance. The Scale for Homophobia showed acceptable reliability and validity. New studies should investigate the stability of the scale and the nomologic validity regarding other constructs. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank M; Donnan, Peter T; Gágyor, Ildikó; Hobbelen, Hans J S M; Althabe, Fernando; Krishnan, Jerry A; Treweek, Shaun
PRagmatic Explanatory Continuum Indicator Summary (PRECIS)-2 is a tool that could improve design insight for trialists. Our aim was to validate the PRECIS-2 tool, unlike its predecessor, testing the discriminant validity and interrater reliability. Over 80 international trialists, methodologists, clinicians, and policymakers created PRECIS-2 helping to ensure face validity and content validity. The interrater reliability of PRECIS-2 was measured using 19 experienced trialists who used PRECIS-2 to score a diverse sample of 15 randomized controlled trial protocols. Discriminant validity was tested with two raters to independently determine if the trial protocols were more pragmatic or more explanatory, with scores from the 19 raters for the 15 trials as predictors of pragmatism. Interrater reliability was generally good, with seven of nine domains having an intraclass correlation coefficient over 0.65. Flexibility (adherence) and recruitment had wide confidence intervals, but raters found these difficult to rate and wanted more information. Each of the nine PRECIS-2 domains could be used to differentiate between trials taking more pragmatic or more explanatory approaches with better than chance discrimination for all domains. We have assessed the validity and reliability of PRECIS-2. An elaboration study and web site provide guidance to help future users of the tool which is continuing to be tested by trial teams, systematic reviewers, and funders. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available In this study it was aimed to make the studies of the translation of Perception of Organizational Politics Scale into Turkish and the validity and reliability of the scale. Perceptions of Organizational Politics Scale’s (POPS validities has been tested in terms of view, content and structure. The application is designed as a two-stage process. In the first stage, face and content validity was tested. In the second stage, it was sought evidences for the construct validity of the scale by making exploratory factor analysis (EFA and then the confirmatory factor analysis (CFA to the data obtained. In determining the reliability of the scale item-total score correlations and Cronbach alpha coefficient was used. The application made for the validity and reliability of the scale was conducted on the data collected from 277 faculty members working in universities’ education faculties. As a method of achieving those faculty members "Simple randomized (random sampling" is used. The psychometric properties of the Turkish version of Perception of Organizational Politics Scale showed that the scale has a satisfactory level of reliability and validity for the Turkish employee sample.
In a first part this lectures will present a survey on nuclear power production and plant performance in the Western World and discuss key parameters such as load factors and non-availability. Some main reasons for reliable performance of nuclear power plants are given. The second part of this lecture deals with the question how quality assurance and quality control measures do directly influence plant reliability, availability and, thus, economy. Derived from worldwide experience gained from operating nuclear power plants, it may be concluded that the implementation of an overall quality assurance programme does not only satisfy safety requirements set forth by the nuclear regulatory bodies, but has also a considerable impact on plant reliability and availability. A positive effect on these figures will be achieved if the established quality assurance programme provides for a coordinated approach to all activities affecting quality. It is discussed how the quality of a product should be controlled and what kind of quality assurance measures by performed examples are given to demonstrate that the expenditure for maintenance work on components will decrease if planned and systematic quality assurance actions have been implemented during all procurement stages. (orig./RW)
Brady, Karen; Cracknell, Nina; Zulch, Helen; Mills, Daniel Simon
Working dogs are selected based on predictions from tests that they will be able to perform specific tasks in often challenging environments. However, withdrawal from service in working dogs is still a big problem, bringing into question the reliability of the selection tests used to make these predictions. A systematic review was undertaken aimed at bringing together available information on the reliability and predictive validity of the assessment of behavioural characteristics used with working dogs to establish the quality of selection tests currently available for use to predict success in working dogs. The search procedures resulted in 16 papers meeting the criteria for inclusion. A large range of behaviour tests and parameters were used in the identified papers, and so behaviour tests and their underpinning constructs were grouped on the basis of their relationship with positive core affect (willingness to work, human-directed social behaviour, object-directed play tendencies) and negative core affect (human-directed aggression, approach withdrawal tendencies, sensitivity to aversives). We then examined the papers for reports of inter-rater reliability, within-session intra-rater reliability, test-retest validity and predictive validity. The review revealed a widespread lack of information relating to the reliability and validity of measures to assess behaviour and inconsistencies in terminologies, study parameters and indices of success. There is a need to standardise the reporting of these aspects of behavioural tests in order to improve the knowledge base of what characteristics are predictive of optimal performance in working dog roles, improving selection processes and reducing working dog redundancy. We suggest the use of a framework based on explaining the direct or indirect relationship of the test with core affect.
Aertssen, W F M; Steenbergen, B; Smits-Engelsman, B C M
There is lack of valid and reliable field-based tests for assessing functional strength in young children with mild intellectual disabilities (IDs). The aim of this study was to investigate the test-retest reliability and construct validity of the Functional Strength Measurement in children with ID (FSM-ID). Fifty-two children with mild ID (40 boys and 12 girls, mean age 8.48 years, SD = 1.48) were tested with the FSM. Test-retest reliability (n = 32) was examined by a two-way interclass correlation coefficient for agreement (ICC 2.1A). Standard error of measurement and smallest detectable change were calculated. Construct validity was determined by calculating correlations between the FSM-ID and handheld dynamometry (HHD) (convergent validity), FSM-ID, FSM-ID and subtest strength of the Bruininks-Oseretsky test of motor proficiency - second edition (BOT-2) (convergent validity) and the FSM-ID and balance subtest of the BOT-2 (discriminant validity). Test-retest reliability ICC ranged 0.89-0.98. Correlation between the items of the FSM-ID and HHD ranged 0.39-0.79 and between FSM-ID and BOT-2 (strength items) 0.41-0.80. Correlation between items of the FSM-ID and BOT-2 (balance items) ranged 0.41-0.70. The FSM-ID showed good test-retest reliability and good convergent validity with the HHD and BOT-2 subtest strength. The correlations assessing discriminant validity were higher than expected. Poor levels of postural control and core stability in children with mild IDs may be the underlying factor of those higher correlations. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Park, Sung Shin; Kwon, Tack-Kyun; Choi, Seong Hee; Lee, Won Yong; Hong, Young Hye; Jeong, Nyun Gi; Sung, Myung-Whun; Kim, Kwang Hyun
The aim of this study was to assess the reliability and validity of the Pediatric Voice Handicap Index (pVHI) for cross-cultural adaptation of the Korean version with school age children. The questionnaire was translated into Korean and was completed by 101 Korean parents who have children with or without disordered voice. The Korean version-pVHI scores were obtained with 60 parents of normal children and 41 parents who have children with voice problems. Content validity was verified by five experienced speech-language pathologists with clinical specialization in voice disorders. Internal consistency was calculated through Cronbach's α coefficient and test-retest reliability of the Korean version-pVHI score was determined using Pearson product-moment correlation coefficients. Mann-Whitney U test was used to compare GRBAS with the Korean version-pVHI scores between normal and dysphonia group. The relationship between the parent-reported the Korean version-pVHI total scores and perceptual ratings of voice quality from experts was investigated using Spearman correlation coefficients. The results showed that the Korean version-pVHI provided a high internal consistency (α=0.92) and test-retest reliability of its subscales: total (T) 0.97, functional (F) 0.90, physical (P) 0.95, emotional (E) 0.92. The Korean version-pVHI mean scores in normal group were 1.28 (T), 0.62 (F), 0.35 (P) and 0.32 (E), respectively whereas those of the Korean version-pVHI in children group with dysphonia were 23.13 (T), 8.90 (F), 9.54 (P) and 4.93 (E). Significant differences in the Korean version-pVHI (T, F, P, E) and perceptual evaluation (grade, rough, breathy) between normal and dysphonia group were revealed (PKorean version-pVHI parameters (T) and perceptual measures (G) was exhibited in children with dysphonia. The subjective Korean version-pVHI can be applicable and useful supplementary tool for evaluating parents' perception of their children's voice dysfunction, identifying
Ana Lúcia Araújo Gomes
Full Text Available ABSTRACT Objective: To evaluate the psychometric properties in terms of validity and reliability of the scale Self-efficacy and their child's level of asthma control: Brazilian version. Method: Methodological study in which 216 parents/guardians of children with asthma participated. A construct validation (factor analysis and test of hypothesis by comparison of contrasted groups and an analysis of reliability in terms of homogeneity (Cronbach's alpha and stability (test-retest were carried out. Results: Exploratory factor analysis proved suitable for the Brazilian version of the scale (Kaiser-Meyer-Olkim index of 0.879 and Bartlett's sphericity with p < 0.001. The correlation matrix in factor analysis suggested the removal of item 7 from the scale. Cronbach's alpha of the final scale, with 16 items, was 0.92. Conclusion: The Brazilian version of Self-efficacy and their child's level of asthma control presented psychometric properties that confirmed its validity and reliability.
Nygren, Björn; Randström, Kerstin Björkman; Lejonklou, Anna K; Lundman, Beril
The purpose of this study was to test the reliability and validity of the Swedish language version of the Resilience Scale (RS). Participants were 142 adults between 19-85 years of age. Internal consistency reliability, stability over time, and construct validity were evaluated using Cronbach's alpha, principal components analysis with varimax rotation and correlations with scores on the Sense of Coherence Scale (SOC) and the Rosenberg Self-Esteem Scale (RSE). The mean score on the RS was 142 (SD = 15). The possible scores on the RS range from 25 to 175, and scores higher than 146 are considered high. The test-retest correlation was .78. Correlations with the SOC and the RSE were .41 (p Self and Life emerged as components from the principal components analysis. These findings provide evidence for the reliability and validity of the Swedish language version of the RS.
Pakpour, Amir H.; Nourozi, Saeedeh; Mølsted, Stig
INTRODUCTION: The aim of the study was to assess the validity and reliability of the SF-12 questionnaire in a sample of Iranian patients undergoing hemodialysis. MATERIALS AND METHODS: One hundred and forty-four hemodialysis patients were included from dialysis centers in Zanjan, Iran, and were...... asked to complete the SF-12 and SF-36 questionnaires. An initial test-retest reliability evaluation was performed on a sample of 70 patients from the total group, with a retest interval of 14 days. Reliability was estimated by internal consistency and validity was assessed using known-group comparisons...... and construct validity on the patient group as a whole. A linear regression analysis was used to assess any variation in the physical component summary and mental component summary scores of the SF-36 with the respective component summary scores of the SF-12. In addition, the factor structure...
Strøyer, Jesper; Essendrop, Morten; Jensen, Lone Donbaek
To test the validity and reliability of self-assessed physical fitness samples included healthcare assistants working at a hospital (women=170, men=17), persons working with physically and mentally handicapped patients (women=530, men= 123), and two separate groups of healthcare students (a) women...... except for flexibility among men. The reliability was moderate to good (ICC = .62 - .80). Self-assessed aerobic fitness, muscle strength, and flexibility showed moderate construct validity and moderate to good reliability using visual analogues.......=91 and men=5 and (b) women=159 and men=10. Five components of physical fitness were self-assessed by Visual Analogue Scales with illustrations and verbal anchors for the extremes: aerobic fitness, muscle strength, endurance, flexibility, and balance. Convergent and divergent validity were evaluated...
Eshghi, Mohammad Ali; Kordi, Ramin; Memari, Amir Hossein; Ghaziasgar, Ahmad; Mansournia, Mohammad-Ali; Zamani Sani, Seyed Hojjat
The Youth Sport Environment Questionnaire (YSEQ) had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13-17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting.
Mohammad Ali Eshghi
Full Text Available The Youth Sport Environment Questionnaire (YSEQ had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13–17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting.
Full Text Available The main objective of this study is to develop a valid and reliable scale for identifying digital citizenship perceptions of young people in the most common age groups. The study was conducted as a survey study. The study group of this study is composed of 438 people in Turkey who are among 16-24 age group with the highest rate of internet use in Turkey. An exploratory factor analysis was performed to determine the validity of the scale and the item discrimination powers were calculated. The total variance of the scale was determined that the scale had 8-factor structure and was found to be 49,70%. The internal consistency level was also calculated to determine the reliability of the scale. As a result, it can be said that this scale is a valid and reliable scale that can be used to determine the digital citizenship perceptions of young people.
Valasek, Tamás; Varga, Peter Paul; Szövérfi, Zsolt; Kümin, Michelle; Fairbank, Jeremy; Lazary, Aron
Patient-reported outcome measurements (PROMs) are widely used in spine care. The development of reliable and valid National versions of spine-related disability questionnaires is strongly recommended from both the clinical and scientific points-of-view. The aims of this study were to adapt and validate the Oswestry Disability Index (ODI) and the Quebec back pain disability scale (QDS) for use with the Hungarian language. After translating and culturally adapting the ODI and QDS, 133 patients with lumbar degenerative spinal disorder filled in the questionnaire booklet twice within 2 weeks. Subjects completed the Hungarian versions of the two PROMs as well as the WHOQoL-BREF validated as a general life quality questionnaire and Visual Analogue Scale of pain. Internal consistency, reliability and construct validity of the questionnaires were determined, as were the standard error of measurement (SEM) and minimal detectable change (MDC) scores. The Hungarian ODI consisted of one factor that showed good internal consistency (Cronbach-α 0.890). The QDS showed a four-factor structure with Cronbach-α values between 0.788 and 0.917. No significant floor or ceiling effects were observed. The test-retest analysis showed excellent reliability of the Hungarian ODI and QDS. The intraclass correlation coefficients (ICC) were 0.927 and 0.923, respectively. SEM values of 4.8 and 5.2 resulted in a MDC of 13 and 14 points in the Hungarian ODI and QDS, respectively. The correlation coefficient (r) between pain and ODI was 0.680 (p 0.4, p disability measured by the Hungarian ODI and QDS was significantly higher in the surgical subgroup than in non-surgically treated patients (p < 0.001). Translation and cultural adaptation of the ODI and QDS were successful. Hungarian versions of the ODI and QDS proved to be reliable, valid PROMs confirming that they can be used in future clinical and scientific work with Hungarian-speaking spine patients.
Garg, Sanjay; Melcher, Kevin J.
For new access to space systems with challenging mission requirements, effective implementation of integrated system health management (ISHM) must be available early in the program to support the design of systems that are safe, reliable, highly autonomous. Early ISHM availability is also needed to promote design for affordable operations; increased knowledge of functional health provided by ISHM supports construction of more efficient operations infrastructure. Lack of early ISHM inclusion in the system design process could result in retrofitting health management systems to augment and expand operational and safety requirements; thereby increasing program cost and risk due to increased instrumentation and computational complexity. Having the right sensors generating the required data to perform condition assessment, such as fault detection and isolation, with a high degree of confidence is critical to reliable operation of ISHM. Also, the data being generated by the sensors needs to be qualified to ensure that the assessments made by the ISHM is not based on faulty data. NASA Glenn Research Center has been developing technologies for sensor selection and data validation as part of the FDDR (Fault Detection, Diagnosis, and Response) element of the Upper Stage project of the Ares 1 launch vehicle development. This presentation will provide an overview of the GRC approach to sensor selection and data quality validation and will present recent results from applications that are representative of the complexity of propulsion systems for access to space vehicles. A brief overview of the sensor selection and data quality validation approaches is provided below. The NASA GRC developed Systematic Sensor Selection Strategy (S4) is a model-based procedure for systematically and quantitatively selecting an optimal sensor suite to provide overall health assessment of a host system. S4 can be logically partitioned into three major subdivisions: the knowledge base, the down
Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan
To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.
Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio
OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurre...
Full Text Available Objective: Experiencing pleasant events during daily life has a significant positive role in the personal mental health and acts as a keystone for “behavioral activation” (BA interventions. There are serious differences in the pleasant event schedules in different cultures and countries. We aimed to develop a Persian checklist of pleasant events (PCPE to provide and validate a culturally compatible checklist for Iranians.Methods: To develop a checklist of pleasant events, inspired by Pleasant Events Schedule (PES (MacPhillamy & Lewinsohn, 1982, we held three focused group discussions with 24 normal healthy participants from both genders (female = 12 and asked them to mention as much pleasant events as possible. When the list reached saturation level, the inappropriate items with respect to legal, cultural and religious concerns were omitted. The final checklist of PCPE consists of two subscales: Frequency (frequency of events during last month and pleasantness (perceived pleasantness of events. The total score consists of frequency multiplied by pleasantness. To test the reliability and validity of the checklist, the PCPE, Depression, Anxiety and Stress Scale (DASS, the Persian version of WHO Quality of Life and the Demographic Questionnaire were administered in a sample of 104 participants (50 male and 54 female.Results: Frequency, pleasantness and the total scores of PCPE showed high levels of internal consistency (Cronbach’s alpha, .976, .976 & .974, respectively. Further support for the convergent validity of the PCPE was obtained via moderate negative correlations with depression, anxiety, stress scores in DASS and positive correlation with quality of life as well as respondent’s perceived happiness. There were negative correlations between
Boonstra, Anne M; Schiphorst Preuper, Henrica R; Reneman, Michiel F; Posthumus, Jitze B; Stewart, Roy E
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional design was used. A general rehabilitation centre and a university rehabilitation centre was the setting for the study. The study population consisted of patients over 18 years of age, suffering from chronic musculoskeletal pain; 52 patients in the reliability study, 344 patients in the validity study. Main outcome measures were as follows. Reliability study: Spearman's correlation coefficients (rho values) of the test and retest data of the VAS for disability; validity study: rho values of the VAS disability scores with the scores on four domains of the Short-Form Health Survey (SF-36) and VAS pain scores, and with Roland-Morris Disability Questionnaire scores in chronic low back pain patients. Results were as follows: in the reliability study rho values varied from 0.60 to 0.77; and in the validity study rho values of VAS disability scores with SF-36 domain scores varied from 0.16 to 0.51, with Roland-Morris Disability Questionnaire scores from 0.38 to 0.43 and with VAS pain scores from 0.76 to 0.84. The conclusion of the study was that the reliability of the VAS for disability is moderate to good. Because of a weak correlation with other disability instruments and a strong correlation with the VAS for pain, however, its validity is questionable.
Evenson Kelly R
Full Text Available Abstract Background The purpose of this study is to assess the reliability and validity of the U.S. National Center for Safe Routes to School's in-class student travel tallies and written parent surveys. Over 65,000 tallies and 374,000 parent surveys have been completed, but no published studies have examined their measurement properties. Methods Students and parents from two Charlotte, NC (USA elementary schools participated. Tallies were conducted on two consecutive days using a hand-raising protocol; on day two students were also asked to recall the previous days' travel. The recall from day two was compared with day one to assess 24-hour test-retest reliability. Convergent validity was assessed by comparing parent-reports of students' travel mode with student-reports of travel mode. Two-week test-retest reliability of the parent survey was assessed by comparing within-parent responses. Reliability and validity were assessed using kappa statistics. Results A total of 542 students participated in the in-class student travel tally reliability assessment and 262 parent-student dyads participated in the validity assessment. Reliability was high for travel to and from school (kappa > 0.8; convergent validity was lower but still high (kappa > 0.75. There were no differences by student grade level. Two-week test-retest reliability of the parent survey (n = 112 ranged from moderate to very high for objective questions on travel mode and travel times (kappa range: 0.62 - 0.97 but was substantially lower for subjective assessments of barriers to walking to school (kappa range: 0.31 - 0.76. Conclusions The student in-class student travel tally exhibited high reliability and validity at all elementary grades. The parent survey had high reliability on questions related to student travel mode, but lower reliability for attitudinal questions identifying barriers to walking to school. Parent survey design should be improved so that responses clearly indicate
McDonald, Noreen C; Dwelley, Amanda E; Combs, Tabitha S; Evenson, Kelly R; Winters, Richard H
The purpose of this study is to assess the reliability and validity of the U.S. National Center for Safe Routes to School's in-class student travel tallies and written parent surveys. Over 65,000 tallies and 374,000 parent surveys have been completed, but no published studies have examined their measurement properties. Students and parents from two Charlotte, NC (USA) elementary schools participated. Tallies were conducted on two consecutive days using a hand-raising protocol; on day two students were also asked to recall the previous days' travel. The recall from day two was compared with day one to assess 24-hour test-retest reliability. Convergent validity was assessed by comparing parent-reports of students' travel mode with student-reports of travel mode. Two-week test-retest reliability of the parent survey was assessed by comparing within-parent responses. Reliability and validity were assessed using kappa statistics. A total of 542 students participated in the in-class student travel tally reliability assessment and 262 parent-student dyads participated in the validity assessment. Reliability was high for travel to and from school (kappa > 0.8); convergent validity was lower but still high (kappa > 0.75). There were no differences by student grade level. Two-week test-retest reliability of the parent survey (n=112) ranged from moderate to very high for objective questions on travel mode and travel times (kappa range: 0.62-0.97) but was substantially lower for subjective assessments of barriers to walking to school (kappa range: 0.31-0.76). The student in-class student travel tally exhibited high reliability and validity at all elementary grades. The parent survey had high reliability on questions related to student travel mode, but lower reliability for attitudinal questions identifying barriers to walking to school. Parent survey design should be improved so that responses clearly indicate issues that influence parental decision making in regards to their
Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H
Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (preliability was (ICC3,3) = 0.953 (pvalidity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu
The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (pr=0.67 pr=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
Sahin Cankurtaran, Eylem; Danişman, Mustafa; Tutar, Hasan; Ulusoy Kaymak, Semra
The Neuropsychiatric Inventory-Clinician (NPI-C) scale is one of the best-known scales for evaluating the behavioral and psychological symptoms of dementia. This study aimed to assess the reliability and validity of the Turkish version of the NPI-C scale in patients with Alzheimer disease (AD). The NPI-C scale was administered to 125 patients with AD. For reliability, both Cronbach's α and interrater reliability were analyzed. The Behavioral Pathology in Alzheimer's Disease (BEHAVE-AD) scale was applied for validity and, in addition, the Mini Mental State Examination (MMSE), Instrumental Activities of Daily Living (IADL) scale, and Disability Assessment of Dementia (DAD) scale were completed. The Turkish version of the NPI-C scale showed high internal consistency (Cronbach's α = 0.75) and mostly good interrater reliability. Assessments of validity showed that the NPI-C and corresponding BEHAVE-AD domains were found to be significantly correlated, between 0.925 and 0.195. Moreover, the correlations between NPI-C and MMSE were significant for all domains except the dysphoria, anxiety, and elation/euphoria domains. When we conducted a correlation analysis of NPI-C with IADL, all domains were statistically significantly correlated except aggression, anxiety, elation/euphoria, and dysphoria. The Turkish version of the NPI-C scale was found to be a reliable and valid instrument to assess neuropsychiatric symptoms in Turkish elderly subjects with AD.
Full Text Available This study aims to develop a valid and reliable instrument for measuring students' social studies achievement goal. The research was conducted on a study group consisted of 374 middle school students studying in the central district of Diyarbakır in 2014-2015 school year fall semester. Expert opinion was consulted with regard to the scale's content and face validity. Exploratory Factor Analysis (EFA and Confirmatory Factor Analysis (CFA were performed in order to measure the scale's construct validity. As a result of EFA, a 29-item and a six-factor structure model which explains 50.82% of the total variance was obtained. The emerging factors were called as a self-approach, task-approach, other-approach, task-avoidance, other-avoidance and self-avoidance respectively. The findings acquired CFA indicated that the 29-item and six-factor structure related to social studies oriented achievement goal scale have acceptable goodness of fit indices. The scale's reliability coefficients were calculated by means of internal consistency method. As a result of reliability analysis, it was determined that the reliability coefficients were within admissible limits. The finding of the item correlation and 27% of upper and lower group comparisons demonstrated that all of the items in the scale should remain. In light of these results, it could be argued that the scale is reliable and valid instrument and can be used in order to test students' social studies achievement goals.
Full Text Available OBJECTIVE: Gülhane Aphasia Test-2 (GAT-2 has been developed to show the presence of a language disorder ‘aphasia’ and to give the clinician implications for the accompanying speech disorders such as apraxia and dysarthria. OBJECTIVE: The aim of the study was to report standardization, validity and reliability study of GAT-2. METHODS: : 10 healthy individuals were tested initially for the pilot study. 134 healthy individual was included to the standardization study and 30 individuals with aphasia and 11 individuals with right brain injury was included to the validation study. The inter group GAT-2 score differentiations and the effects of age, years of education, sex variances were observed. GAT-2 cut-off scores were calculated by the scores of healthy individuals. GAT-2 test-retest reliability and inter-observer reliability was calculated. RESULTS: Healthy individuals’ GAT-2 scores were significantly different from the GAT-2 scores of aphasic patients, but not from right brain injured patients’. Healthy individuals’ GAT-2 scores were not affected from the sex, age variances but from years of education, so cut-off scores were calculated by this variance. GAT-2 scores of aphasic patients were not affected from age, sex and years of education. Test-retest and inter-observer reliability and internal consistency results showed that GAT-2 is a highly reliable aphasia screening test. CONCLUSION: GAT-2 was found to be a standardized, highly reliable and a valid aphasia test for Turkish stroke patients with aphasia
Baker, Nancy A; Cook, James R; Redfern, Mark S
This paper describes the inter-rater and intra-rater reliability, and the concurrent validity of an observational instrument, the Keyboard Personal Computer Style instrument (K-PeCS), which assesses stereotypical postures and movements associated with computer keyboard use. Three trained raters independently rated the video clips of 45 computer keyboard users to ascertain inter-rater reliability, and then re-rated a sub-sample of 15 video clips to ascertain intra-rater reliability. Concurrent validity was assessed by comparing the ratings obtained using the K-PeCS to scores developed from a 3D motion analysis system. The overall K-PeCS had excellent reliability [inter-rater: intra-class correlation coefficients (ICC)=.90; intra-rater: ICC=.92]. Most individual items on the K-PeCS had from good to excellent reliability, although six items fell below ICC=.75. Those K-PeCS items that were assessed for concurrent validity compared favorably to the motion analysis data for all but two items. These results suggest that most items on the K-PeCS can be used to reliably document computer keyboarding style.
England-Joseph, Judy A.; Adams, Charles M.; Wood, David G.; Feehan, Daniel J.; Veal, Howard F.; Skeen, John H. III; Koenigs, Melvin J.; Lichtenfeld, David I.; Seretakis, Pauline J.
Life extension of fossil fuel plants is a relatively recent phenomenon; thus, utilities have little experience to demonstrate the longer-term operating reliability of plants with an extended service life. While utility industry officials and government and industry studies express optimism that these plants will continue to operate reliably, the officials and the studies also caution that it is too soon to determine how pursuing life extension will affect the reliability of the nation's electricity supply. According to DOE, the number of fossil fuel generating units' 30 years old or older is expected to increase from about 2,500 in 1989 to roughly 3,700 in 1998, increasing such plants' share of overall generating capacity from 13 percent in 1989 to 27 percent in 1998. EPA estimates that with existing air quality requirements, fossil fuel plant emissions will increase steadily during the coming decade. Proposed acid rain control legislation, which would affect many plants that may have their service life extended, would require utilities to significantly reduce emissions by the year 2000 but would allow utilities flexibility in deciding how and where to achieve the reductions. If such legislation is enacted, utilities generally are expected to find reducing emissions from existing plants more cost-effective than replacing them and to continue extending plants' service life. Officials of DOE and utility organizations expressed concern, however, that EPA could decide, as it did for one plant in 1988, that alterations made in extending the service life of plants exempted from the Clean Air Act would result in increased emissions and thus cause the altered plants to lose their exemption. According to the officials, the additional costs of achieving the Clean Air Act's standards could discourage some life extension projects. However, such decisions by EPA could also reduce the nation's total power plant emissions by eliminating an existing incentive to retain exempt
Zhao, M.; McDonald, A.; Dick, P.
The test rig for Validation and Reliability Testing of shutdown system software has been upgraded from the AECL Windows-based test rig previously used for CANDU6 stations. It includes a Virtual Trip Computer, which is a software simulation of the functional specification of the trip computer, and a real-time trip computer simulator in a separate chassis, which is used during the preparation of trip computer test cases before the actual trip computers are available. This allows preparation work for Validation and Reliability Testing to be performed in advance of delivery of actual trip computers to maintain a project schedule. (author)
Bayani, Ali Asghar
The internal consistency, test-retest reliability, and construct validity of the Farsi version of the Depression Anxiety Stress Scales were examined, with a sample of 306 undergraduate students (123 men, 183 women) ranging from 18 to 51 years of age (M age = 25.4, SD = 6.1). Participants completed the Satisfaction with Life Scale, Rosenberg Self-esteem Scale, and the Depression Anxiety Stress Scales. The findings confirmed the preliminary reliabilities and preliminary construct validity of the Farsi translation of the Depression Anxiety Stress Scales.
Jacobsen, Stine Lindahl
The paper will present a phd study concerning reliability and validity of music therapy assessment model “Assessment of Parenting Competences” (APC) in the area of families with emotionally neglected children. This study had a multiple strategy design with a philosophical base of critical realism...... and pragmatism. The fixed design for this study was a between and within groups design in testing the APCs reliability and validity. The two different groups were parents with neglected children and parents with non-neglected children. The flexible design had a multiple case study strategy specifically...
Jingying Liu; Jipeng Yang; Yanhui Liu; Yang Yang; Hongfu Zhang
Purpose: To test the validity and reliability of a modified Career Growth Scale (CGS) to assess nurse career growth. Method: A cross-sectional design was used to analyze the use of the CGS to survey 600 full-time registered nurses from Grade A hospitals in Tianjin. Results: A modified scale we called Career Growth of Nurse Scale (CGNS) is acceptable, valid, and reliable for the evaluation of nurse career growth in Chinese hospitals. This scale measured three main factors (career goal, c...
Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin
This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensio...
Iyigun, Gozde; Kirmizigil, Berkiye; Angin, Ender; Oksuz, Sevim; Can, Filiz; Eker, Levent; Rose, Debra J
The aim of this study was to evaluate the reliability and validity of the Turkish version of the FAB(FAB-T) scale in the older Turkish adults. The reliability and validity of the scale was tested on 200 community-dwelling older adults. FAB-T scale was scored by different physiotherapists on different days to evaluate inter-rater and intrarater reliability. The Berg Balance Scale (BBS) was used for the evaluation of convergent validity, and the content validity of the FAB-T scale was investigated. The FAB-T scale showed very high inter- and intra-rater reliability. For inter-rater agreement, on the individual test items and total score ICC values were 0.92 (95 %CI; 0.90-0.94) and 0.96 (95% CI; 0.95-0.97) respectively. The intra-rater agreement, on the individual test items and total score ICC values were 0.93 (95 %CI; 0.91- 0.95) and 0.96 (95% CI; 0.95- 0.97) respectively. There was a good agreement between the FAB-T and BBS scales. A high correlation was found between the BBS and FAB-T scales [rho = 0.70 (%95 CI; 0.62-0.76)] indicating good convergent validity. Considering the content validity of the FAB-T scale, no floor (floor score: 0%) or ceiling (ceiling score: 6.5%) effect was detected. The FAB-T scale was successfully translated from the original English version (FAB) and demonstrated strong psychometric features. It was found that the FAB-T scale has very high inter-rater and intra-rater reliability. Considering the convergent validity, the scale has high correlation with the BBS. The FAB-T has no floor and ceiling effect. Copyright © 2018 Elsevier B.V. All rights reserved.
Apolzan, John W; Myers, Candice A; Cowley, Amanda D; Brady, Heather; Hsia, Daniel S; Stewart, Tiffany M; Redman, Leanne M; Martin, Corby K
Mindfulness is theorized to affect the eating behavior and weight of pregnant women, yet no measure has been validated during pregnancy. This study qualitatively and quantitatively evaluated the reliability and validity of the Mindful Eating Questionnaire (MEQ) in overweight and obese pregnant women. Participants completed focus groups and cognitive interviews. The MEQ was administered twice to measure test-retest reliability. The Eating Inventory (EI) and Mindful Attention Awareness Scale (MAAS) were administered to assess convergent validity, and the Neighborhood Environment Walkability Scale (NEWS) assessed discriminant validity. Participants were 20 ± 8 weeks gestation (mean ± SD), 30 ± 2 years old, and 55% were obese. The MEQ total score had good test-retest reliability (r = .85). The total score internal consistency reliability was poor (Cronbach's α = .56). The external cues subscale (ECS) was not internally consistent (α = .31). Other subscales ranged from α = .59-.68. When the ECS was excluded, the MEQ total score internal consistency was acceptable (α = .62). Convergent validity was supported by the MEQ total score (with and without ECS) correlating significantly with the MAAS and the EI disinhibition and hunger subscales. Discriminant validity of the MEQ was supported by the MEQ and NEWS total scores and subscales not being significantly correlated. The quantitative results were supported by the qualitative context and content analysis. With the exception of the ECS, the MEQ's reliability and validity was supported in pregnant women, and most of the subscales were more robust in pregnant women than in the original sample of healthy adults. The MEQ's use with overweight and obese pregnant women is supported. Copyright © 2016 Elsevier Ltd. All rights reserved.
Magoutas, Babis; Mentzas, Gregoris
This paper presents the refinement and validation of a model for Quality of e-Government Services (QeGS). We built upon our previous work where a conceptualized model was identified and put focus on the confirmatory phase of the model development process, in order to come up with a valid and reliable QeGS model. The validated model, which was benchmarked with very positive results with similar models found in the literature, can be used for measuring the QeGS in a reliable and valid manner. This will form the basis for a continuous quality improvement process, unleashing the full potential of e-government services for both citizens and public administrations.
Isabelle Ottenvall Hammar
Full Text Available In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA-Older persons (IPA-O, showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation.
Sanders, James L; Williams, Robert J
Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Danielle Fabiana Cucolo
Full Text Available ABSTRACT Objectives: to verify the reliability and construct validity estimates of the "Assessment of nursing care product" scale (APROCENF and its applicability. Methods: this validation study included a sample of 40 (inter-rater reliability and 172 (construct validity assessments performed by nurses at the end of the work shift at nine inpatient services of a teaching hospital in the Brazilian Southeast. The data were collected between February and September/2014 with interruptions. Cronbach's alpha and Spearman's correlation coefficients were calculated, as well as the intraclass correlation and the weighted kappa index (inter-rater reliability. Exploratory factor analysis was used with principal component extraction and varimax rotation (construct validity. Results: the internal consistency revealed an alpha coefficient of 0.85, item-item correlation ranging between 0.13 and 0.61 and item-total correlation between 0.43 and 0.69. Inter-rater equivalence was obtained and all items evidenced significant factor loadings. Conclusion: this research evidenced the reliability and construct validity of the scale to assess the nursing care product. Its application in nursing practice permits identifying improvements needed in the production process, contributing to management and care decisions.
Fang, Hua; Farooq, Umer; Wang, Dimiao; Yu, Fangfang; Younus, Mohammad Imran; Guo, Xiong
Kashin-Beck Disease (KBD) is an endemic osteoarthropathy in areas which extend from the North-East to the South-West of China. Most of the patients with KBD suffer multiple dysfunctions in major joints causing decreased health status. However because of their low education level and unique living habits, it is hard to find tools to measure the health-related quality of life (HRQOL). European quality of life (EQ-5D-3L) patient-reported instrument is widely used to measure HRQOL. This study aimed to establish the validity and reliability of the Chinese version of the EQ-5D-3L for evaluating HRQOL of KBD individuals in rural area. 368 individuals who were suffering from KBD were recruited through stratified multistage random sampling from Shaanxi province, China. The EQ-5D-3L and the WHOQOL-BREF were administrated in each individual by face to face interview. Test-retest reliability was assessed at 10-14 days intervals. The test-retest reliability was measured by calculating the Kappa coefficients for EQ-5D-3L five dimensions. For the EQ VAS, the intraclass correlation coefficient (ICC) was computed. Convergent and divergent analysis, construct validity was established using Spearman's rank correlation between the EQ-5D-3L and the WHOQOL-BREF. Known groups' validity was examined by comparing groups with a priori expected differences in health-related quality of life (HRQOL). For 362 individuals (98%), comprehensive data of all the EQ-5D-3L dimensions were available. Kappa values of the EQ-5D-3L five items ranged from 0.324 to 0.554. ICC of the EQ VAS was 0.497. For convergent validity, the three items (self-care, usual activity, and mobility) of EQ-5D-3L, index scores, and VAS showed moderate correlations with the physical health domain of the WHOQOL-BREF (r absolute value ranged from 0.339 to 0.475). For divergent validity, the 5 items of EQ-5D-3L showed weak or no correlations with environment and social relationship domains of WHOQOL-BREF. The Chinese EQ-5D-3L