Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville
With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is
Full Text Available The aim of this research is to develop a measurement instrument that will determine the cultural responsive teaching readiness level of teacher candidates. The study group consisted of a total of 231 candidate teachers, of which 83 were males and 148 were females, who were attending their final year of class teacher education programs at various Turkish universities during the 2016-2017 education year. In the first phase, a 33-item draft form was presented to experts to be reviewed. Based on the feedback received, revisions were made and the final scale was applied to a group of 231 candidate teachers. In the analysis of the data obtained as the result of the application, Exploratory Factor Analysis (EFA was performed. The EFA produced 21 items within a two-factor structure as, “Personal Readiness” and “Professional Readiness.” It was observed that the sub-factors were components of the “cultural responsive teaching readiness” dimension, and that the goodness of fit measures obtained as a result of the First and Second Level Confirmatory Factor Analyzes (CFA were high. In addition, reliability coefficients were found to be high as a result of reliability measurements. With the help of these findings, this study concludes that the Cultural Responsive Teaching Readiness scale is both valid and reliable.
Smith, Jack E.; Hakel, Milton D.
Examined are questions pertinent to the use of the Position Analysis Questionnaire: Who can use the PAQ reliably and validly? Must one rely on trained job analysts? Can people having no direct contact with the job use the PAQ reliably and validly? Do response biases influence PAQ responses? (Author/KC)
Bartlett, Susan J; Barbic, Skye P; Bykerk, Vivian P
-FQ), and the voting results at OMERACT 2016. METHODS: Classic and modern psychometric methods were used to assess reliability, validity, sensitivity, factor structure, scoring, and thresholds. Interviews with patients and clinicians also assessed content validity, utility, and meaningfulness of RA-FQ scores. RESULTS......: People with RA in observational trials in Canada (n = 896) and France (n = 138), and an RCT in the Netherlands (n = 178) completed 5 items (11-point numerical rating scale) representing RA Flare core domains. There was moderate to high evidence of reliability, content and construct validity...... to identify and measure RA flares. Its review through OMERACT Filter 2.0 shows evidence of reliability, content and construct validity, and responsiveness. These properties merit its further validation as an outcome for clinical trials....
Ameneh S. Forouzan
Full Text Available Background: The Health System Responsiveness Questionnaire is an instrument designed by the World Health Organization (WHO in 2000 to assess the experience of patients when interacting with the health care system. This investigation aimed to adapt a Mental Health System Responsiveness Questionnaire (MHSRQ based on the WHO concept and evaluate its validity and reliability to the mental health care system in Iran. Design: In accordance with the WHO health system responsiveness questionnaire and the findings of a qualitative study, a Farsi version of the MHSRQ was tailored to suit the mental health system in Iran. This version was tested in a cross-sectional study at nine public mental health clinics in Tehran. A sample of 500 mental health services patients was recruited and subsequently completed the questionnaire. Item missing rate was used to check the feasibility while the reliability of the scale was determined by assessing the Cronbach's alpha and item total correlations. The factor structure of the questionnaire was investigated by performing confirmatory factor analysis (CFA. Results: The results showed a satisfactory feasibility since the item missing value was lower than 5.2%. With the exception of access domain, reliability of different domains of the questionnaire was within a desirable range. The factor loading showed an acceptable unidimentionality of the scale despite the fact that three items related to access did not perform well. The CFA also indicated good fit indices for the model (CFI=0.99, GFI=0.97, IFI=0.99, AGFI=0.97. Conclusions: In general, the findings suggest that the Farsi version of the MHSRQ is a feasible, reliable, and valid measure of the mental health system responsiveness in Iran. Changes to the questions related to the access domain should be considered in order to improve the psychometric properties of the measure.
Full Text Available Abstract Background The Oxford elbow score (OES is an English questionnaire that measures the patients' subjective experience of elbow surgery. The OES comprises three domains: elbow function, pain, and social-psychological effects. This questionnaire can be completed by the patient and used as an outcome measure after elbow surgery. The aim of this study was to develop and evaluate the Dutch version of the translated OES for reliability, validity and responsiveness with respect to patients after elbow trauma and surgery. Methods The 12 items of the English-language OES were translated into Dutch and then back-translated; the back-translated questionnaire was then compared to the original English version. The OES Dutch version was completed by 69 patients (group A, 60 of whom had an elbow luxation, four an elbow fracture and five an epicondylitis. QuickDASH, the visual analogue pain scale (VAS and the Mayo Elbow Performance Index (MEPI were also completed to examine the convergent validity of the OES in group A. To calculate the test-retest reliability and responsiveness of the OES, this questionnaire was completed three times by 43 different patients (group B. An average of 52 days elapsed between therapy and the administration of the third OES (SD = 24.1. Results The Cronbach's α coefficients for the function, pain and social-psychological domains were 0.90, 0.87 and 0.90, respectively. The intra-class correlation coefficients for the domains were 0.87 for function, 0.89 for pain and 0.87 for social-psychological. The standardised response means for the domains were 0.69, 0.46 and 0.60, respectively, and the minimal detectable changes were 27.6, 21.7 and 24.0, respectively. The convergent validity for the function, pain and social-psychological domains, which were measured as the Spearman's correlation of the OES domains with the MEPI, were 0.68, 0.77 and 0.77, respectively. The Spearman's correlations of the OES domains with QuickDASH were
Denteneer, Lenie; Van Daele, Ulrike; Truijen, Steven; De Hertogh, Willem; Meirte, Jill; Deckers, Kristiaan; Stassijns, Gaetane
Cross-sectional study. The goal of this study is to translate the English version of the Modified Low Back Pain Disability Questionnaire (MDQ) into a Dutch version and investigate its clinimetric properties for patients with nonspecific chronic low back pain (CLBP). Fritz et al (2001) developed a modified version of the Oswestry Disability Questionnaire (ODI) to assess functional status and named it the MDQ. In this version, a question regarding employment and homemaking ability was substituted for the question related to sex life. Good clinimetric properties for the MDQ were identified but up until now it is not clear whether the clinimetric properties of the MDQ would change if it was translated into a Dutch version. Translation of the MDQ into Dutch was done in 4 steps. Test-retest reliability was investigated using the intraclass correlation coefficient (ICC) model. Validity was calculated using Pearson correlations and a 2-way analysis of variance for repeated measures. Finally, responsiveness was calculated with the area under the curve (AUC), minimal detectable change (MDC), and the standardized response mean (SRM). A total of 80 completed questionnaires were collected in 3 different hospitals and a total of 43 patients finished a 9 weeks intervention period, completing the retest. Test-retest reliability was excellent with an ICC of 0.89 (95% confidence interval [CI], 0.74-0.95). To confirm the convergent validity, the MDQ answered all predefined hypothesises (r = -0.65-0.69/P = 0.01-0.00) and good results for construct validity were found (P = 0.02). The MDQ had an AUC of 0.64 (95% confidence interval [CI], 0.47-0.81), an MDC of 8.80 points, and a SRM of 0.65. The Dutch version of the MDQ shows good clinimetric properties and is shown to be usable in the assessment of the functional status of Dutch-speaking patients with nonspecific CLBP. 3.
Rasmussen, Trine Bernholdt; Konradsen, Hanne; Dixon, Jane
been validated in this patient population. The purpose of this study was thus to assess the validity, reliability and responsiveness of the Danish Body Image Quality of Life Inventory (BIQLI-DA) on patients treated for IE. METHODS: We evaluated the psychometric properties of the BIQLI-DA on data......: The BIQLI-DA may be applicable in healthcare research as it seems to be valid, reliable and responsive; however, evidence should be strengthened through further exploration of instrument performance, particularly regarding responsiveness.......: Participants were seventy patients with a mean age of 58 years and of which 83% were men. Results indicated convergent construct validity by confirming hypothesised associations to potentially related constructs. The BIQLI-DA was found to be highly internally consistent with a Cronbach's alpha of 0...
Thorborg, Kristian; Roos, Ewa; Bartels, Else Marie
disability based on a systematic review of evidence of validity, reliability and responsiveness of these instruments. Methods MEDLINE, EMBASE, CINAHL, Cochrane Central Register of Controlled Trials, PsycINFO, SportsDiscus and Web of Science were all searched up to January 2009. Two reviewers independently...
Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin
The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into the Persian and determine its clinimetric properties including reliability, validity, and responsiveness in patients with shoulder disorders. Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: SAS, shoulder pain and disability index (SPADI), and Short-Form 36 Health Survey (SF-36). The patients completed the SAS, 1 week after the first visit to evaluate the test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected in the first visit and then again after 4 weeks physiotherapy intervention. Test-retest reliability and internal consistency were assessed using Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problem or language difficulties were reported during translation process. Test-retest reliability of the SAS was excellent with an ICC of 0.98. Also, the marginal Cronbach's alpha level of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidences to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders. Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice
Full Text Available Abstract Background This article describes the development and validation of a self-reported questionnaire, the KQoL-26, that is based on the views of patients with a suspected ligamentous or meniscal injury of the knee that assesses the impact of their knee problem on the quality of their lives. Methods Patient interviews and focus groups were used to derive questionnaire content. The instrument was assessed for data quality, reliability, validity, and responsiveness using data from a randomised trial and patient survey about general practitioners' use of Magnetic Resonance Imaging for patients with a suspected ligamentous or meniscal injury. Results Interview and focus group data produced a 40-item questionnaire designed for self-completion. 559 trial patients and 323 survey patients responded to the questionnaire. Following principal components analysis and Rasch analysis, 26 items were found to contribute to three scales of knee-related quality of life: physical functioning, activity limitations, and emotional functioning. Item-total correlations ranged from 0.60–0.82. Cronbach's alpha and test retest reliability estimates were 0.91–0.94 and 0.80–0.93 respectively. Hypothesised correlations with the Lysholm Knee Scale, EQ-5D, SF-36 and knee symptom questions were evidence for construct validity. The instrument produced highly significant change scores for 65 trial patients indicating that their knee was a little or somewhat better at six months. The new instrument had higher effect sizes (range 0.86–1.13 and responsiveness statistics (range 1.50–2.13 than the EQ-5D and SF-36. Conclusion The KQoL-26 has good evidence for internal reliability, test-retest reliability, validity and responsiveness, and is recommended for use in randomised trials and other evaluative studies of patients with a suspected ligamentous or meniscal injury.
Garratt, Andrew M; Brealey, Stephen; Robling, Michael; Atwell, Chris; Russell, Ian; Gillespie, William; King, David
This article describes the development and validation of a self-reported questionnaire, the KQoL-26, that is based on the views of patients with a suspected ligamentous or meniscal injury of the knee that assesses the impact of their knee problem on the quality of their lives. Patient interviews and focus groups were used to derive questionnaire content. The instrument was assessed for data quality, reliability, validity, and responsiveness using data from a randomised trial and patient survey about general practitioners' use of Magnetic Resonance Imaging for patients with a suspected ligamentous or meniscal injury. Interview and focus group data produced a 40-item questionnaire designed for self-completion. 559 trial patients and 323 survey patients responded to the questionnaire. Following principal components analysis and Rasch analysis, 26 items were found to contribute to three scales of knee-related quality of life: physical functioning, activity limitations, and emotional functioning. Item-total correlations ranged from 0.60-0.82. Cronbach's alpha and test retest reliability estimates were 0.91-0.94 and 0.80-0.93 respectively. Hypothesised correlations with the Lysholm Knee Scale, EQ-5D, SF-36 and knee symptom questions were evidence for construct validity. The instrument produced highly significant change scores for 65 trial patients indicating that their knee was a little or somewhat better at six months. The new instrument had higher effect sizes (range 0.86-1.13) and responsiveness statistics (range 1.50-2.13) than the EQ-5D and SF-36. The KQoL-26 has good evidence for internal reliability, test-retest reliability, validity and responsiveness, and is recommended for use in randomised trials and other evaluative studies of patients with a suspected ligamentous or meniscal injury.
Full Text Available The current study aims to investigate factor structure and psychometric proporties of Turkish version of Responses to Job Dissatisfaction Scale. For this purpose, data were collected from 302 persons who work at public and private sector in Turkey. Both exploratory (principal and confirmatory analyses were used to test the factor structure of the scale. The Turkish version of the scale in a manner similar to the original scale was found to have a four-factor structure. In addition, there were significant corelations between the Turkish version of the scale and Rahim Organizational Conflict Inventory-II. Cronbach Alpha’s internal consistencies of the sub-scales ranged from .67 and .59, split half reliabilities of the items ranged from .48 and .45 and also test-retest relabilities of the scale ranged from .69 and .47. Finally results revealed that The Turkish version of Responses to Job Dissatisfaction Scale had sufficently psychometric properties for Turkish Researchers to use.
Full Text Available Abstract Background The Effective Musculoskeletal Consumer Scale (EC-17 is a self-administered questionnaire for evaluating self-management interventions that empower and educate people with rheumatic conditions. The aim of the study was to translate and evaluate the Norwegian version of EC-17 against the necessary criteria for a patient-reported outcome measure, including responsiveness to change. Methods Data quality, reliability, validity and responsiveness were assessed in two groups. One group comprising 103 patients received a questionnaire before and at the end of a self-management programme. The second group comprising 96 patients' received the questionnaire two weeks before and on arrival of the program. Internal consistency and test-retest reliability were assessed. Construct validity was assessed through comparisons with the Brief Approach/Avoidance Coping Questionnaire, (BACQ, the Emotional Approach Coping Scale (EAC and the General Health Questionnaire (GHQ-20. Responsiveness was assessed with the Standardised Response Mean (SRM. Results Respondents included 66 (64% and 52 (54% patients from the first and second groups respectively. Levels of missing data were low for all items. There was good evidence for unidimensionality, item-total correlations ranged from 0.59 to 0.82 and Cronbach's Alpha and test-retest correlations were over 0.90. As hypothesised EC-17 scores had statistically significant low to moderate correlations with the BACQ, EAC and GHQ-20 in the range 0.26 to 0.42. Following the self-management program, EC-17 scores showed a significant improvement with an SRM of 0.48. Conclusion The Norwegian version of the EC-17 has evidence for data quality, internal consistency and test-retest reliability, construct validity and responsiveness to change. The EC-17 seems promising as an outcome measure for evaluating self-management interventions for people with rheumatic conditions, but further studies are needed.
Hamnes, Bente; Garratt, Andrew; Kjeken, Ingvild; Kristjansson, Elizabeth; Hagen, Kåre B
The Effective Musculoskeletal Consumer Scale (EC-17) is a self-administered questionnaire for evaluating self-management interventions that empower and educate people with rheumatic conditions. The aim of the study was to translate and evaluate the Norwegian version of EC-17 against the necessary criteria for a patient-reported outcome measure, including responsiveness to change. Data quality, reliability, validity and responsiveness were assessed in two groups. One group comprising 103 patients received a questionnaire before and at the end of a self-management programme. The second group comprising 96 patients' received the questionnaire two weeks before and on arrival of the program. Internal consistency and test-retest reliability were assessed. Construct validity was assessed through comparisons with the Brief Approach/Avoidance Coping Questionnaire, (BACQ), the Emotional Approach Coping Scale (EAC) and the General Health Questionnaire (GHQ-20). Responsiveness was assessed with the Standardised Response Mean (SRM). Respondents included 66 (64%) and 52 (54%) patients from the first and second groups respectively. Levels of missing data were low for all items. There was good evidence for unidimensionality, item-total correlations ranged from 0.59 to 0.82 and Cronbach's Alpha and test-retest correlations were over 0.90. As hypothesised EC-17 scores had statistically significant low to moderate correlations with the BACQ, EAC and GHQ-20 in the range 0.26 to 0.42. Following the self-management program, EC-17 scores showed a significant improvement with an SRM of 0.48. The Norwegian version of the EC-17 has evidence for data quality, internal consistency and test-retest reliability, construct validity and responsiveness to change. The EC-17 seems promising as an outcome measure for evaluating self-management interventions for people with rheumatic conditions, but further studies are needed.
Cai, Weixiong; Zhang, Qingting; Huang, Fuyin; Guan, Wei; Tang, Tao; Liu, Chao
results suggested that 88.90% of the original grouped cases were correctly classified, and the discriminant value had high conformity with the experts' opinions. The data showed that the scale would be the best validated instrument for the criminal responsibility in China. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Kasitanon, N; Wangkaew, S; Puntana, S; Sukitawut, W; Leong, K P; Louthrenoo, W
The English version of the Systemic Lupus Erythematosus Quality of Life Questionnaire (SLEQOL) is a validated disease-specific quality of life instrument. The aim of this study was to evaluate the psychometric properties of the Thai version of the SLEQOL (SLEQOL-TH). Two independent translators translated the SLEQOL into Thai. The back translation of this version was performed by two other independent translators. The final version, SLEQOL-TH, was completed after resolving the discrepancies revealed by the back translation. One hundred and nine patients with SLE were enrolled to test the reliability, construct validity, floor and ceiling effects, and sensitivity to the changes of the SLEQOL-TH at six months. The differential item functioning (DIF) between the Thai and English versions was analyzed using the partial gamma. The internal consistency of the SLEQOL-TH was satisfactory with the overall Cronbach's alpha of 0.86. The test-retest reliability of the SLEQOL-TH was acceptable with the intra-class correlation coefficient of 0.86. Low correlations between the SLEQOL-TH and SLEDAI were observed. The total score of the SLEQOL-TH was moderately responsive to changes in quality of life, with a standardized response mean of 0.50. When comparing the SLEQOL-TH from Thai SLE patients with the original SLEQOL version obtained from Singapore SLE patients, 11 out of 40 items showed a moderate to large DIF. The SLEQOL-TH has acceptable psychometric properties and shows construct validity. In comparison with the English version of SLEQOL, there are some items that showed DIF. The applicability of the SLEQOL-TH in real-life clinical practice and clinical trials needs to be determined.
Linde, L.; Sørensen, J.; Østergaard, Morten
OBJECTIVE: To compare validity, reliability, and responsiveness of generic and disease specific health-related quality of life (HRQOL) instruments in rheumatoid arthritis (RA). METHODS: Two samples of patients completed the Medical Outcomes Study Short Form-36 Health Survey (SF-36), EuroQol (EQ)-5D...... and VAS pain were responsive to both improvement and deterioration. CONCLUSION: All instruments were valid measures for HRQOL in RA. The RAQoL and HAQ displayed the best reliability, while the SF-36 bodily pain scale and VAS pain were the most responsive. The choice of instrument should depend......, 15D, Rheumatoid Arthritis Quality of Life Scale (RAQoL), Health Assessment Questionnaire (HAQ), and visual analog scales (VAS) for pain, fatigue, and global RA. Validity (convergent, discriminant, and known-groups) was evaluated in a cross-section of 200 patients. Reliability was evaluated...
Garcés, Juan B Gerstner; Winson, Ian; Goldhahn, Sabine; Castro, Michael D; Swords, Michael P; Grujic, Leslie; Rammelt, Stefan; Sands, Andrew K
The Manchester-Oxford Foot Questionnaire (MOXFQ) has been validated in Spanish for use in patients undergoing foot and ankle surgery. 120 patients completed the MOXFQ and the SF-36 before surgery and 6 and 12 months postoperative. Surgeons completed the American Orthopaedic Foot and Ankle Society (AOFAS) Clinical Rating System. Psychometric properties were assessed for all three MOXFQ dimensions, and for the MOXFQ Index. The Spanish MOXFQ demonstrated consistency with Cronbach's alpha values between 0.65 and 0.90, and reliability ([ICCs] >0.95). It shows a moderate to strong correlation between the Walking/standing dimension and the related domains of the SF-36 (|r|>0.6), the AOFAS Ankle-Hindfoot Scale (|r|>0.47) and Hallux-MTP-IP Scale (|r|>0.64). Responsiveness was excellent, (effect sizes >2.1). The respective minimal detectable change (MDC90) was 14.18 for the MOXFQ Index. The Spanish version of the MOXFQ showed good psychometric properties in patients with foot and ankle disorders. Copyright © 2015 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.
Burgers, P.T.; Poolman, R.W.; Bakel, T.M. Van; Tuinebreijer, W.E.; Zielinski, S.M.; Bhandari, M.; Patka, P.; Lieshout, E.M. van; Kampen, A. van; Biert, J.; Vugt, A.B. van; Edwards, M.J.R.; Blokhuis, T.J.; Frolke, J.P.; Geeraedts, L.M.G.; Gardeniers, J.W.M.; Tan, E.C.T.H.; Poelhekke, L.M.S.J.; Waal Malefijt, M.C. de; Schreurs, B.W.; et al.,
BACKGROUND: The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) has been extensively evaluated in groups of patients with osteoarthritis, yet not in patients with a femoral neck fracture. This study aimed to determine the reliability, construct validity, and responsiveness of
Burgers, Paul T. P. W.; Poolman, Rudolf W.; van Bakel, Theodorus M. J.; Tuinebreijer, Wim E.; Zielinski, Stephanie M.; Bhandari, Mohit; Patka, Peter; van Lieshout, Esther M. M.; Devereaux, P. J.; Guyatt, Gordon H.; Einhorn, Thomas A.; Thabane, Lehana; Schemitsch, Emil H.; Koval, Kenneth J.; Frihagen, Frede; Tetsworth, Kevin; Guerra-Farfan, Ernesto; Walter, Stephen D.; Sprague, Sheila; Swinton, Marilyn; Scott, Taryn; McKay, Paula; Madden, Kim; Heels-Ansdell, Diane; Buckingham, Lisa; Duraikannan, Aravin; Silva, Heather; Heetveld, Martin J.; Burgers, T. P. W.; Zura, Robert D.; Avram, Victoria; Eygendaal, Denise; Krips, Rover; Raven, Eric E. J.; Haverlag, Robert; Mutsaerts, Eduard L. A. R.; Haverkamp, Daniel; van den Bekerom, Michel P. J.; Beimers, Lijkele; de Vries, Jasper; Zurcher, Arthur W.; Bulstra, Gythe H.; Campo, Martin M.; Somford, Mathijs P.; Schep, Niels W. L.; Festen, Sebastiaan; Geeraedts, Leo M. G.; Peters, Rolf; Goslings, J. Carel; Ponsen, Kees Jan
Background: The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) has been extensively evaluated in groups of patients with osteoarthritis, yet not in patients with a femoral neck fracture. This study aimed to determine the reliability, construct validity, and responsiveness of
Mercier, Catherine; Roche, Sylvain; Gaillard, Ségolène; Kassai, Behrouz; Arzimanoglou, Alexis; Herbillon, Vania; Roy, Pascal; Rheims, Sylvain
Attention deficit hyperactivity disorder (ADHD) is a well-known comorbidity in children with epilepsy. In English-speaking countries, the scores of the original ADHD-rating scale IV are currently used as main outcomes in various clinical trials in children with epilepsy. In French-speaking countries, several French versions are in use though none has been fully validated yet. We sought here for a partial validation of a French version of the ADHD-RS IV regarding construct validity, internal consistency (i.e., scale reliability), item reliability, and responsiveness in a group of French children with ADHD and epilepsy. The study involved 167 children aged 6-15years in 10 French neuropediatric units. The factorial structure and item reliability were assessed with a confirmatory factorial analysis for ordered categorical variables. The dimensions' internal consistency was assessed with Guttman's lambda 6 coefficient. The responsiveness was assessed by the change in score under methylphenidate and in comparison with a control group. The results confirmed the original two-dimensional factorial structure (inattention, hyperactivity/impulsivity) and showed a satisfactory reliability of most items, a good dimension internal consistency, and a good responsiveness of the total score and the two subscores. The studied French version of the ADHD-RS IV is thus validated regarding construct validity, reliability, and responsiveness. It can now be used in French-speaking countries in clinical trials of treatments involving children with ADHD and epilepsy. The full validation requires further investigations. Copyright © 2016 Elsevier Inc. All rights reserved.
Arbab, Dariusch; van Ochten, Johannes H M; Schnurr, Christoph; Bouillon, Bertil; König, Dietmar
Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures. The intention of this study was to evaluate reliability, validity, responsiveness and minimally important change of the German version of the Hip dysfunction and osteoarthritis outcome score (HOOS). The German HOOS was investigated in 251 consecutive patients before and 6 months after total hip arthroplasty. All patients completed HOOS, Oxford-Hip Score, Short-Form (SF-36) and numeric scales for pain and disability. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German HOOS demonstrated excellent test-retest reliability with intraclass correlation coefficient values > 0.7. Cronbach´s alpha values demonstrated strong internal consistency. As hypothesized, HOOS subscales strongly correlated with corresponding OHS and SF-36 domains. All subscales showed excellent (effect size/standardized response means > 0.8) responsiveness between preoperative assessment and postoperative follow-up. The HOOS and all subdomains showed higher changes than the minimal detectable change which indicates true changes. The German version of the HOOS demonstrated good psychometric properties. It proved to be valid, reliable and responsive to the changes instrument for use in patients with hip osteoarthritis undergoing total hip replacement.
Arbab, Dariusch; Kuhlmann, Katharina; Schnurr, Christoph; Bouillon, Bertil; Lüring, Christian; König, Dietmar
Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures and are increasingly used in clinical trials to assess outcomes of health care. The intention of this study was to develop and culturally adapt a German version of the Self-reported Foot and Ankle Score (SEFAS) and to evaluate reliability, validity and responsiveness. According to Cross Cultural Adaptation of Self-Reported Measure guidelines forward and backward translation has been performed. The German SEFAS was investigated in 177 consecutive patients. 177 Patients completed the German SEFAS, Foot and Ankle Outcome Score (FAOS), Short-Form 36 and numeric scales for pain and disability (NRS) before and 118 patients 6 months after foot or ankle surgery. Test-Retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German SEFAS demonstrated excellent test-retest reliability with ICC values of 0.97. Cronbach's alpha (α) value of 0.89 demonstrated strong internal consistency. No floor or ceiling effects were observed for the German version of the SEFAS. As hypothesized SEFAS correlated strongly with FAOS and SF-36 domains. It showed moderate (ES/SRM > 0.5) responsiveness between preoperative assessment and postoperative follow-up. The German version of the SEFAS demonstrated good psychometric properties. It proofed to be a valid and reliable instrument for use in foot and ankle patients. DRKS00007585.
Bajada, Stefan; Mohanty, Khitish
The Majeed scoring system is a disease-specific outcome measure that was originally designed to assess pelvic injuries. The aim of this study was to determine the psychometric properties of the Majeed scoring system for chronic sacroiliac joint pain. Internal consistency, content validity, criterion validity, construct validity and responsiveness to change was assessed prospectively for the Majeed scoring system in a cohort of 60 patients diagnosed with sacroiliac joint pain. This diagnosis was confirmed with CT-guided sacroiliac joint anaesthetic block. The overall Majeed score showed acceptable internal consistency (Cronbach alpha = 0.63). Similarly, it showed acceptable floor (0 %) and ceiling (0 %) effects. On the other hand, the domains of pain, work, sitting and sexual intercourse had high (>30 %) floor effects. Significant correlation with the physical component of the Short Form-36 (p = 0.005) and Oswestry disability index (p ≤ 0.001) was found indicating acceptable criterion validity. The overall Majeed score showed acceptable construct validity with all five developed hypotheses showing significance (p ≤ 0.05). The overall Majeed score showed acceptable responsiveness to change with a large (≥0.80) effect size and standardized response mean. Overall the Majeed scoring system demonstrated acceptable psychometric properties for outcome assessment in chronic sacroiliac joint pain. Thus, its use in this condition is adequate. However, some domains demonstrated suboptimal performance indicating that improvement might be achieved with the development of an outcome measure specific for sacroiliac joint dysfunction and degeneration.
Bannigan, Katrina; Watson, Roger
To explore and explain the different concepts of reliability and validity as they are related to measurement instruments in social science and health care. There are different concepts contained in the terms reliability and validity and these are often explained poorly and there is often confusion between them. To develop some clarity about reliability and validity a conceptual framework was built based on the existing literature. The concepts of reliability, validity and utility are explored and explained. Reliability contains the concepts of internal consistency and stability and equivalence. Validity contains the concepts of content, face, criterion, concurrent, predictive, construct, convergent (and divergent), factorial and discriminant. In addition, for clinical practice and research, it is essential to establish the utility of a measurement instrument. To use measurement instruments appropriately in clinical practice, the extent to which they are reliable, valid and usable must be established.
Monticone, Marco; Ambrosini, Emilia; Verheyden, Geert; Brivio, Flavia; Brunati, Roberto; Longoni, Luca; Mauri, Gaia; Molteni, Alessandro; Nava, Claudia; Rocca, Barbara; Ferrante, Simona
To cross-culturally adapt and psychometrically analyse the Italian version of the Trunk Impairment Scale on acute (cohort 1) and chronic stroke patients (cohort 2). The Trunk Impairment Scale was culturally adapted in accordance with international standards. The psychometric testing included: internal consistency (Cronbach's alpha), inter- and intra-rater reliability (intraclass correlation coefficient; standard error of measurement and minimal detectable change), construct validity by comparing Trunk Impairment Scale score with Barthel Index, motor subscale of Functional Independence Measure, and Trunk Control Test (Pearson's correlation), and responsiveness (Effect Size, Effect Size with Guyatt approach, standardized response mean, and Receiver Operating Characteristics curves). The Trunk Impairment Scale was administered to 125 and 116 acute and chronic stroke patients, respectively. Internal consistency was acceptable (α > 0.7), inter- and intra-rater reliability (ICC > 0.9, Minimal Detectable Change for total score 0.4) with all scales but the motor Functional Independence Measure in cohort 2. Distribution-based methods showed large effects in cohort 1 and moderate to large effects in cohort 2. The Minimal Important Difference was 3.5 both from patient's and therapist's perspective in cohort 1 and 2.5 and 1.5 from patient's and therapist's perspective, respectively, in cohort 2. The Trunk Impairment Scale was successfully translated into Italian and proved to be reliable, valid, and responsive. Its use is recommended for clinical and research purposes. Implications for Rehabilitation Trunk control is an essential part of balance and postural control, constituting an important prerequisite for daily activities and function. The TIS administered in subjects with subacute and chronic stroke was reliable, valid and responsive. The TIS is expected to help clinicians and researchers by identifying key functional processes related to disability in people
Powers, John H; Bacci, Elizabeth D; Guerrero, M Lourdes; Leidy, Nancy Kline; Stringer, Sonja; Kim, Katherine; Memoli, Matthew J; Han, Alison; Fairchok, Mary P; Chen, Wei-Ju; Arnold, John C; Danaher, Patrick J; Lalani, Tahaniyat; Ridoré, Michelande; Burgess, Timothy H; Millar, Eugene V; Hernández, Andrés; Rodríguez-Zulueta, Patricia; Smolskis, Mary C; Ortega-Gallegos, Hilda; Pett, Sarah; Fischer, William; Gillor, Daniel; Macias, Laura Moreno; DuVal, Anna; Rothman, Richard; Dugas, Andrea; Ruiz-Palacios, Guillermo M
To assess the reliability, validity, and responsiveness of InFLUenza Patient-Reported Outcome (FLU-PRO©) scores for quantifying the presence and severity of influenza symptoms. An observational prospective cohort study of adults (≥18 years) with influenza-like illness in the United States, the United Kingdom, Mexico, and South America was conducted. Participants completed the 37-item draft FLU-PRO daily for up to 14 days. Item-level and factor analyses were used to remove items and determine factor structure. Reliability of the final tool was estimated using Cronbach α and intraclass correlation coefficients (2-day reliability). Convergent and known-groups validity and responsiveness were assessed using global assessments of influenza severity and return to usual health. Of the 536 patients enrolled, 221 influenza-positive subjects comprised the analytical sample. The mean age of the patients was 40.7 years, 60.2% were women, and 59.7% were white. The final 32-item measure has six factors/domains (nose, throat, eyes, chest/respiratory, gastrointestinal, and body/systemic), with a higher order factor representing symptom severity overall (comparative fit index = 0.92; root mean square error of approximation = 0.06). Cronbach α was high (total = 0.92; domain range = 0.71-0.87); test-retest reliability (intraclass correlation coefficient, day 1-day 2) was 0.83 for total scores and 0.57 to 0.79 for domains. Day 1 FLU-PRO domain and total scores were moderately to highly correlated (≥0.30) with Patient Global Rating of Flu Severity (except nose and throat). Consistent with known-groups validity, scores differentiated severity groups on the basis of global rating (total: F = 57.2, P FLU-PRO score improvement by day 7 than did those who did not, suggesting score responsiveness. Results suggest that FLU-PRO scores are reliable, valid, and responsive to change in influenza-positive adults. Copyright © 2018 International Society for Pharmacoeconomics and Outcomes
Drost, Ellen A.
In this paper, the author aims to provide novice researchers with an understanding of the general problem of validity in social science research and to acquaint them with approaches to developing strong support for the validity of their research. She provides insight into these two important concepts, namely (1) validity; and (2) reliability, and…
Cordier, Reinie; Speyer, Renée; Schindler, Antonio; Michou, Emilia; Heijnen, Bas Joris; Baijens, Laura; Karaduman, Ayşe; Swan, Katina; Clavé, Pere; Joosten, Annette Veronica
The Swallowing Quality of Life questionnaire (SWAL-QOL) is widely used clinically and in research to evaluate quality of life related to swallowing difficulties. It has been described as a valid and reliable tool, but was developed and tested using classic test theory. This study describes the reliability and validity of the SWAL-QOL using item response theory (IRT; Rasch analysis). SWAL-QOL data were gathered from 507 participants at risk of oropharyngeal dysphagia (OD) across four European countries. OD was confirmed in 75.7% of participants via videofluoroscopy and/or fiberoptic endoscopic evaluation, or a clinical diagnosis based on meeting selected criteria. Patients with esophageal dysphagia were excluded. Data were analysed using Rasch analysis. Item and person reliability was good for all the items combined. However, person reliability was poor for 8 subscales and item reliability was poor for one subscale. Eight subscales exhibited poor person separation and two exhibited poor item separation. Overall item and person fit statistics were acceptable. However, at an individual item fit level results indicated unpredictable item responses for 28 items, and item redundancy for 10 items. The item-person dimensionality map confirmed these findings. Results from the overall Rasch model fit and Principal Component Analysis were suggestive of a second dimension. For all the items combined, none of the item categories were 'category', 'threshold' or 'step' disordered; however, all subscales demonstrated category disordered functioning. Findings suggest an urgent need to further investigate the underlying structure of the SWAL-QOL and its psychometric characteristics using IRT.
The Psychometric Properties of the Center for Epidemiologic Studies Depression Scale in Chinese Primary Care Patients: Factor Structure, Construct Validity, Reliability, Sensitivity and Responsiveness.
Chin, Weng Yee; Choi, Edmond P H; Chan, Kit T Y; Wong, Carlos K H
The Center for Epidemiologic Studies Depression Scale (CES-D) is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness. The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9) and the Short Form-12 Health Survey (version 2) Mental Component Summary (SF-12 v2 MCS). The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78) and SF-12 v2 MCS (coefficient: -0.75). Internal consistency was assessed by McDonald's omega hierarchical (ωH). The ωH value for the general depression factor was 0.855. The ωH values for "somatic", "depressed affect", "positive affect" and "interpersonal problems" were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value 0.2). The CES-D was externally responsive, with the AUC>0.7. The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original four
The Psychometric Properties of the Center for Epidemiologic Studies Depression Scale in Chinese Primary Care Patients: Factor Structure, Construct Validity, Reliability, Sensitivity and Responsiveness.
Weng Yee Chin
Full Text Available The Center for Epidemiologic Studies Depression Scale (CES-D is a commonly used instrument to measure depressive symptomatology. Despite this, the evidence for its psychometric properties remains poorly established in Chinese populations. The aim of this study was to validate the use of the CES-D in Chinese primary care patients by examining factor structure, construct validity, reliability, sensitivity and responsiveness.The psychometric properties were assessed amongst a sample of 3686 Chinese adult primary care patients in Hong Kong. Three competing factor structure models were examined using confirmatory factor analysis. The original CES-D four-structure model had adequate fit, however the data was better fit into a bi-factor model. For the internal construct validity, corrected item-total correlations were 0.4 for most items. The convergent validity was assessed by examining the correlations between the CES-D, the Patient Health Questionnaire 9 (PHQ-9 and the Short Form-12 Health Survey (version 2 Mental Component Summary (SF-12 v2 MCS. The CES-D had a strong correlation with the PHQ-9 (coefficient: 0.78 and SF-12 v2 MCS (coefficient: -0.75. Internal consistency was assessed by McDonald's omega hierarchical (ωH. The ωH value for the general depression factor was 0.855. The ωH values for "somatic", "depressed affect", "positive affect" and "interpersonal problems" were 0.434, 0.038, 0.738 and 0.730, respectively. For the two-week test-retest reliability, the intraclass correlation coefficient was 0.91. The CES-D was sensitive in detecting differences between known groups, with the AUC >0.7. Internal responsiveness of the CES-D to detect positive and negative changes was satisfactory (with p value 0.2. The CES-D was externally responsive, with the AUC>0.7.The CES-D appears to be a valid, reliable, sensitive and responsive instrument for screening and monitoring depressive symptoms in adult Chinese primary care patients. In its original
Full Text Available Both qualitative and quantitative paradigms try to find the same result; the truth. Qualitative studies are tools used in understanding and describing the world of human experience. Since we maintain our humanity throughout the research process, it is largely impossible to escape the subjective experience, even for the most experienced of researchers. Reliability and Validity are the issue that has been described in great deal by advocates of quantitative researchers. The validity and the norms of rigor that are applied to quantitative research are not entirely applicable to qualitative research. Validity in qualitative research means the extent to which the data is plausible, credible and trustworthy; and thus can be defended when challenged. Reliability and validity remain appropriate concepts for attaining rigor in qualitative research. Qualitative researchers have to salvage responsibility for reliability and validity by implementing verification strategies integral and self-correcting during the conduct of inquiry itself. This ensures the attainment of rigor using strategies inherent within each qualitative design, and moves the responsibility for incorporating and maintaining reliability and validity from external reviewers’ judgments to the investigators themselves. There have different opinions on validity with some suggesting that the concepts of validity is incompatible with qualitative research and should be abandoned while others argue efforts should be made to ensure validity so as to lend credibility to the results. This paper is an attempt to clarify the meaning and use of reliability and validity in the qualitative research paradigm.
Burgers, Paul T P W; Poolman, Rudolf W; Van Bakel, Theodorus M J; Tuinebreijer, Wim E; Zielinski, Stephanie M; Bhandari, Mohit; Patka, Peter; Van Lieshout, Esther M M
The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) has been extensively evaluated in groups of patients with osteoarthritis, yet not in patients with a femoral neck fracture. This study aimed to determine the reliability, construct validity, and responsiveness of the WOMAC compared with the Short Form-12 (SF-12) and the EuroQol 5D (EQ-5D) questionnaires for the assessment of elderly patients with a femoral neck fracture. Reliability was tested by assessing the Cronbach alpha. Construct validity was determined with the Pearson correlation coefficient. Change scores were calculated from ten weeks to twelve months of follow-up. Standardized response means and floor and ceiling effects were determined. Analyses were performed to compare the results for patients less than eighty years old with those for patients eighty years of age or older. The mean WOMAC total score was 89 points before the fracture in the younger patients and increased from 70 points at ten weeks to 81 points at two years postoperatively. In the older age group, these scores were 86, 75, and 78 points. The mean WOMAC pain scores before the fracture and at ten weeks and two years postoperatively were 92, 76, and 87 points, respectively, in the younger age group and 92, 84, and 93 points in the older age group. Function scores were 89, 68, and 79 points for the younger age group and 84, 71, and 73 points for the older age group. The Cronbach alpha for pain, stiffness, function, and the total scale ranged from 0.83 to 0.98 for the younger age group and from 0.79 to 0.97 for the older age group. Construct validity was good, with 82% and 79% of predefined hypotheses confirmed in the younger and older age groups, respectively. Responsiveness was moderate. No floor effects were found. Moderate to large ceiling effects were found for pain and stiffness scales at ten weeks and twelve months in younger patients (18% to 36%) and in the older age group (38% to 53%). The WOMAC showed good
Bartlett, S.J.; Barbic, S.P.; Bykerk, V.P.; Choy, E.H.; Alten, R.; Christensen, R.; Broeder, A. den; Fautrel, B.; Furst, D.E.; Guillemin, F.; Hewlett, S.; Leong, A.L.; Lyddiatt, A.; March, L.; Montie, P.; Pohl, C.; Voshaar, M.; Woodworth, T.G.; Bingham, C.O.
OBJECTIVE: The Outcome Measures in Rheumatology (OMERACT) Rheumatoid Arthritis (RA) Flare Group was established to develop a reliable way to identify and measure RA flares in randomized controlled trials (RCT). Here, we summarized the development and field testing of the RA Flare Questionnaire
Aven, Terje; Heide, Bjornar
In this paper we investigate to what extent risk analysis meets the scientific quality requirements of reliability and validity. We distinguish between two types of approaches within risk analysis, relative frequency-based approaches and Bayesian approaches. The former category includes both traditional statistical inference methods and the so-called probability of frequency approach. Depending on the risk analysis approach, the aim of the analysis is different, the results are presented in different ways and consequently the meaning of the concepts reliability and validity are not the same.
Introduction The novel arthritis-specific Work Productivity Survey (WPS) was developed to estimate patient productivity limitations associated with arthritis within and outside the home, which is an unmet need in psoriatic arthritis (PsA). The WPS has been validated in rheumatoid arthritis. This report assesses the discriminant validity, responsiveness and reliability of the WPS in adult-onset PsA. Methods Psychometric properties were assessed using data from the RAPID-PsA trial (NCT01087788) investigating certolizumab pegol (CZP) efficacy and safety in PsA. WPS was completed at baseline and every 4 weeks until Week 24. Validity was evaluated at baseline via known-groups defined using first and third quartiles of patients’ Disease Activity Score 28 based on C-reactive protein (DAS28(CRP)), Health Assessment Questionnaire-Disability Index (HAQ-DI), Short Form-36 (SF-36) items and PsA Quality of Life (PsAQoL) scores. Responsiveness and reliability were assessed by comparing WPS mean changes at Week 12 in American College of Rheumatology 20% improvement criteria (ACR20) or HAQ-DI Minimal Clinically Important Difference (MCID) 0.3 responders versus non-responders, as well as using standardized response means (SRM). All comparisons were conducted on the observed cases in the Randomized Set, regardless of the randomization group, using a non-parametric bootstrap-t method. Results Compared with patients with a better health state, patients with a worse health state had on average 2 to 6 times more household work days lost, more days with reduced household productivity, more days missed of family/social/leisure activities, more days with outside help hired and a significantly higher interference of arthritis per month. Among employed patients, those with a worse health state had 2 to 4 times more workplace days lost, more days with patient workplace productivity reduced, and a significantly higher interference of arthritis on patient workplace productivity versus
This article reviews three topics from test theory that continue to raise discussion and controversy and capture test theorists' and constructors' interest. The first topic concerns the discussion of the methodology of investigating and establishing construct validity; the second topic concerns reliability and its misuse, alternative definitions…
Doctor, S.R.; Deffenbaugh, J.D.; Good, M.S.; Green, E.R.; Heasler, P.G.; Hutton, P.H.; Reid, L.D.; Simonen, F.A.; Spanner, J.C.; Vo, T.V.
This paper reports on progress for three programs: (1) evaluation and improvement in nondestructive examination reliability for inservice inspection of light water reactors (LWR) (NDE Reliability Program), (2) field validation acceptance, and training for advanced NDE technology, and (3) evaluation of computer-based NDE techniques and regional support of inspection activities. The NDE Reliability Program objectives are to quantify the reliability of inservice inspection techniques for LWR primary system components through independent research and establish means for obtaining improvements in the reliability of inservice inspections. The areas of significant progress will be described concerning ASME Code activities, re-analysis of the PISC-II data, the equipment interaction matrix study, new inspection criteria, and PISC-III. The objectives of the second program are to develop field procedures for the AE and SAFT-UT techniques, perform field validation testing of these techniques, provide training in the techniques for NRC headquarters and regional staff, and work with the ASME Code for the use of these advanced technologies. The final program's objective is to evaluate the reliability and accuracy of interpretation of results from computer-based ultrasonic inservice inspection systems, and to develop guidelines for NRC staff to monitor and evaluate the effectiveness of inservice inspections conducted on nuclear power reactors. This program started in the last quarter of FY89, and the extent of the program was to prepare a work plan for presentation to and approval from a technical advisory group of NRC staff
Pigford, T.H.; Chambre, P.L.
The objective of predicting long-term performance should be to make reliable determinations of whether the prediction falls within the criteria for acceptable performance. Establishing reliable predictions of long-term performance of a waste repository requires emphasis on valid theories to predict performance. The validation process must establish the validity of the theory, the parameters used in applying the theory, the arithmetic of calculations, and the interpretation of results; but validation of such performance predictions is not possible unless there are clear criteria for acceptable performance. Validation programs should emphasize identification of the substantive issues of prediction that need to be resolved. Examples relevant to waste package performance are predicting the life of waste containers and the time distribution of container failures, establishing the criteria for defining container failure, validating theories for time-dependent waste dissolution that depend on details of the repository environment, and determining the extent of congruent dissolution of radionuclides in the UO 2 matrix of spent fuel. Prediction and validation should go hand in hand and should be done and reviewed frequently, as essential tools for the programs to design and develop repositories. 29 refs
Extremera, Natalio; Fernández-Berrocal, Pablo
This study investigated the construct validity and reliability of the Spanish Ruminative Responses Scale-Short From, and the Distraction Responses Scale of the Response Styles Questionnaire for a sample of 727 Spanish high school and college students who responded anonymously and voluntarily to a questionnaire (293 men, 434 women; ages 16 to 29 years, M=18.8, SD=3.0). In addition to the above scales, the questionnaire included the Spanish forms of the Beck Depression Inventory, the Trait Anxiety Scale from the State-Trait Anxiety Scale, the Satisfaction with Life Scale, and the Subjective Happiness Scale. The internal consistency of the scales was satisfactory (Cronbach alpha=.86 for the Ruminative Responses Scale and .78 for the Distraction Responses Scale). As expected, scores on the Spanish Ruminative Responses Scale showed positive correlations with those on the Beck Depression Inventory and the Trait Anxiety Scale and negative associations with the Satisfaction with Life Scale and the Subjective Happiness Scale. Conversely, the Spanish Distraction Responses Scale was negatively correlated with the Beck Depression Inventory and positively associated with the Satisfaction with Life Scale and the Subjective Happiness Scale. These results provide evidence of appropriate reliability for research purposes. Furthermore, the correlational analysis supported prior findings that ruminative response and distraction response styles are differentially associated with reported depressed and positive moods.
P.T.P.W. Burgers (Paul); R.W. Poolman (Rudolf); T.M. van Bakel (Theodorus); W.E. Tuinebreijer (Wim); S.M. Zielinski (Stephanie); M. Bhandari (Mohit); P. Patka (Peter); E.M.M. van Lieshout (Esther)
markdownabstractBackground: The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) has been extensively evaluated in groups of patients with osteoarthritis, yet not in patients with a femoral neck fracture. This study aimed to determine the reliability, construct validity, and
Cafiero, Carlo; Melgar-Quiñonez, Hugo R; Ballard, Terri J; Kepple, Anne W
This paper reviews some of the existing food security indicators, discussing the validity of the underlying concept and the expected reliability of measures under reasonably feasible conditions. The main objective of the paper is to raise awareness on existing trade-offs between different qualities of possible food security measurement tools that must be taken into account when such tools are proposed for practical application, especially for use within an international monitoring framework. The hope is to provide a timely, useful contribution to the process leading to the definition of a food security goal and the associated monitoring framework within the post-2015 Development Agenda. © 2014 New York Academy of Sciences.
Utrecht Work Engagement Scale-Student Forms' (UWES-SF) Adaptation to Turkish, Validity and Reliability Studies, and the Mediator Role of Work Engagement between Academic Procrastination and Academic Responsibility
Çapri, Burhan; Gündüz, Bülent; Akbay, Sinem Evin
The primary goal of this study is to complete the adaptation, validity and reliability studies of the long (17 items) and short (9 items) forms of UWES-SF. The secondary goal of this study is to study the mediating role of work engagement between academic procrastination and academic responsibility in high school students. The study group consists…
de Boer, J. B.; Sprangers, M. A.; Aaronson, N. K.; Lange, J. M.; van Dam, F. S.
The objective of this study was to evaluate the feasibility, reliability, validity and responsiveness of the HIV Overview of Problems Evaluation System (HOPES) in a Dutch sample. The HOPES was administered three times in a one-year period to a sample of 106 outpatients with a symptomatic
Buri, Hilary M; Daly, Jeanette M; Jogerst, Gerald J
(a) To identify reliable and valid questions that identify elder abuse, (b) to assess the reliability and validity of extant self-reported elder abuse screens in a high-risk elderly population, and (c) to describe difficulties of completing and interpreting screens in a high-need elderly population. All elders referred to research-trained social workers in a community service agency were asked to participate. Of the 70 elders asked, 49 participated, 44 completed the first questionnaire, and 32 completed the duplicate second questionnaire. A research assistant administered the telephone questionnaires. Twenty-nine (42%) persons were judged abused, 12 (17%) had abuse reported, and 4 (6%) had abuse substantiated. The elder abuse screen instruments were not found to be predictive of assessed abuse or as predictors of reported abuse; the measures tended toward being inversely predictive. Two questions regarding harm and taking of belongings were significantly different for the assessed abused group. In this small group of high-need community-dwelling elders, the screens were not effective in discriminating between abused and nonabused groups. Better instruments are needed to assess for elder abuse.
Goodwin, Laura D.; Goodwin, William L.
The views of prominant qualitative methodologists on the appropriateness of validity and reliability estimation for the measurement strategies employed in qualitative evaluations are summarized. A case is made for the relevance of validity and reliability estimation. Definitions of validity and reliability for qualitative measurement are presented…
Full Text Available Validation of land cover products is a fundamental task prior to data applications. Current validation schemes and methods are, however, suited only for assessing classification accuracy and disregard the reliability of land cover products. The reliability evaluation of land cover products should be undertaken to provide reliable land cover information. In addition, the lack of high-quality reference data often constrains validation and affects the reliability results of land cover products. This study proposes a validation schema to evaluate the reliability of land cover products, including two methods, namely, result reliability evaluation and process reliability evaluation. Result reliability evaluation computes the reliability of land cover products using seven reliability indicators. Process reliability evaluation analyzes the reliability propagation in the data production process to obtain the reliability of land cover products. Fuzzy fault tree analysis is introduced and improved in the reliability analysis of a data production process. Research results show that the proposed reliability evaluation scheme is reasonable and can be applied to validate land cover products. Through the analysis of the seven indicators of result reliability evaluation, more information on land cover can be obtained for strategic decision-making and planning, compared with traditional accuracy assessment methods. Process reliability evaluation without the need for reference data can facilitate the validation and reflect the change trends of reliabilities to some extent.
Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin
The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P visual analog scale was r = 0.626 (P disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
Post, Marcel W
Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability
Haradhan Kumar Mohajan
Full Text Available Reliability and validity are two most important and fundamental features in the evaluation of any measurement instrument or toll for a good research. The purpose of this research is to discuss the validity and reliability of measurement instruments that are used in research. Validity concerns what an instrument measures, and how well it does so. Reliability concerns the faith that one can have in the data obtained from use of an instrument, that is, the degree to which any measuring tool controls for random error. An attempt has been taken here to review the reliability and validity, and threat to them in some details.
Reliability and Concurrent Validity of the International Personality item Pool (IPIP) Big-five Factor Markers in Nigeria. ... Nigerian Journal of Psychiatry ... Aims: The aim of this study was to assess the internal consistency and concurrent validity ...
Linde, Louise; Sørensen, Jan; Ostergaard, Mikkel
.21-6.47). The longitudinal sample included 80% women, median age 60 years (22-82). Validity: all instruments discriminated between low, moderate, and high DAS28. Reliability: RAQoL and HAQ displayed good repeatability (ICC > 0.95) and internal consistency (Cronbach's alpha > 0.90). Responsiveness: SF-36 bodily pain scale......, 15D, Rheumatoid Arthritis Quality of Life Scale (RAQoL), Health Assessment Questionnaire (HAQ), and visual analog scales (VAS) for pain, fatigue, and global RA. Validity (convergent, discriminant, and known-groups) was evaluated in a cross-section of 200 patients. Reliability was evaluated...... questionnaires (at 2 weeks and 6 months) included questions about changes in health status since baseline. RESULTS: The cross-sectional sample included 77% women, median age 57 years (range 19-87), disease duration 6 years (0-58), with Disease Activity Score 28-joint count (DAS28) of 3.10 (1...
Stockbrugger, Barry A.; Haennel, Robert G.
Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…
Alkhamra, Rana A.; Al-Jazi, Aya B.
Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…
Badjadi, Nour El Imane
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Osadebe, P. U.
The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Azad, Akram; Hassani Mehraban, Afsoon; Mehrpour, Masoud; Mohammadi, Babak
Fear of falling may be related to falling during stroke onset. The Fall Efficacy ScaleInternational (FES-I) with excellent psychometric properties, is an instrument developed to assess patients' concerns about fallings. The aim of this study was to determine validation of this scale in Iranian patients with stroke. The "forward-backward" procedure was applied to translate the FES-I from English to Persian. One hundred-twenty patients who had suffered stroke, aged 40 to 80 years (55% male) completed the Persian FES-I, Geriatric Depression Scale-15 (GDS-15), General Health Questionnaire-28 (GHQ-28), Berg Balance Scale (BBS) and Timed up and Go (TUG) questionnaires. The interval time for the test-retest of the Persian scale was 7-14 days. The test-retest and inter-rater reliabilities of the Persian FES-I were excellent (ICC2,1=0.98, pPersian scale showed only one significant factor. The total Persian FES-I score had a significantly negative correlation (pPersian FES-I proved to be an effective and valuable measurement tool to assess stroke patients' fear of falling in practice and research setting.
CIE 2015 August 2-5, 2015, Boston, Massachusetts, USA [DRAFT] DETC2015-46982 DEVELOPMENT OF A CONSERVATIVE MODEL VALIDATION APPROACH FOR RELIABLE...obtain a conservative simulation model for reliable design even with limited experimental data. Very little research has taken into account the...3, the proposed conservative model validation is briefly compared to the conventional model validation approach. Section 4 describes how to account
Bongers, Coen C W G; Daanen, Hein A M; Bogerd, Cornelis P; Hopman, Maria T E; Eijsvogels, Thijs M H
Telemetric temperature capsule systems are wireless, relatively noninvasive, and easily applicable in field conditions and have therefore great advantages for monitoring core body temperature. However, the accuracy and responsiveness of available capsule systems have not been compared previously. Therefore, the aim of this study was to examine the validity, reliability, and inertia characteristics of four ingestible temperature capsule systems (i.e., CorTemp, e-Celsius, myTemp, and VitalSense). Ten temperature capsules were examined for each system in a temperature-controlled water bath during three trials. The water bath temperature gradually increased from 33°C to 44°C in trials 1 and 2 to assess the validity and reliability, and from 36°C to 42°C in trial 3 to assess the inertia characteristics of the temperature capsules. A systematic difference between capsule and water bath temperature was found for CorTemp (0.077°C ± 0.040°C), e-Celsius (-0.081°C ± 0.055°C), myTemp (-0.003°C ± 0.006°C), and VitalSense (-0.017°C ± 0.023°C; P 0.05). Comparable inertia characteristics were found for CorTemp (25 ± 4 s), e-Celsius (21 ± 13 s), and myTemp (19 ± 2 s), whereas the VitalSense system responded more slowly (39 ± 6 s) to changes in water bath temperature (P inertia were observed between capsule systems, an excellent validity, test-retest reliability, and inertia was found for each system between 36°C and 44°C after removal of outliers.
Bonnet, Michael H; Doghramji, Karl; Roehrs, Timothy; Stepanski, Edward J; Sheldon, Stephen H; Walters, Arthur S; Wise, Merrill; Chesson, Andrew L
The reliability and validity of EEG arousals and other types of arousal are reviewed. Brief arousals during sleep had been observed for many years, but the evolution of sleep medicine in the 1980s directed new attention to these events. Early studies at that time in animals and humans linked brief EEG arousals and associated fragmentation of sleep to daytime sleepiness and degraded performance. Increasing interest in scoring of EEG arousals led the ASDA to publish a scoring manual in 1992. The current review summarizes numerous studies that have examined scoring reliability for these EEG arousals. Validity of EEG arousals was explored by review of studies that empirically varied arousals and found deficits similar to those found after total sleep deprivation depending upon the rate and extent of sleep fragmentation. Additional data from patients with clinical sleep disorders prior to and after effective treatment has also shown a continuing relationship between reduction in pathology-related arousals and improved sleep and daytime function. Finally, many suggestions have been made to refine arousal scoring to include additional elements (e.g., CAP), change the time frame, or focus on other physiological responses such as heart rate or blood pressure changes. Evidence to support the reliability and validity of these measures is presented. It was concluded that the scoring of EEG arousals has added much to our understanding of the sleep process but that significant work on the neurophysiology of arousal needs to be done. Additional refinement of arousal scoring will provide improved insight into sleep pathology and recovery.
Admiraal, W.; Hoeksma, M.; van de Kamp, M.-T.; van Duin, G.
The richness and complexity of video portfolios endanger both the reliability and validity of the assessment of teacher competencies. In a post-graduate teacher education program, the assessment of video portfolios was evaluated for its reliability, construct validity, and consequential validity.
Akmaz, Hazel Ekin; Uyar, Meltem; Kuzeyli Yıldırım, Yasemin; Akın Korhan, Esra
Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment is done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. To validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. Methodological and cross sectional study. A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-test reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis and to examine factor structure for construct concept validity. The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance of chronic pain.
Hazel Ekin Akmaz
Full Text Available Background: Pain acceptance is the process of giving up the struggle with pain and learning to live a worthwhile life despite it. In assessing patients with chronic pain in Turkey, making a diagnosis and tracking the effectiveness of treatment is done with scales that have been translated into Turkish. However, there is as yet no valid and reliable scale in Turkish to assess the acceptance of pain. Aims: To validate a Turkish version of the Chronic Pain Acceptance Questionnaire developed by McCracken and colleagues. Study Design: Methodological and cross sectional study. Methods: A simple randomized sampling method was used in selecting the study sample. The sample was composed of 201 patients, more than 10 times the number of items examined for validity and reliability in the study, which totaled 20. A patient identification form, the Chronic Pain Acceptance Questionnaire, and the Brief Pain Inventory were used to collect data. Data were collected by face-to-face interviews. In the validity testing, the content validity index was used to evaluate linguistic equivalence, content validity, construct validity, and expert views. In reliability testing of the scale, Cronbach’s α coefficient was calculated, and item analysis and split-test reliability methods were used. Principal component analysis and varimax rotation were used in factor analysis and to examine factor structure for construct concept validity. Results: The item analysis established that the scale, all items, and item-total correlations were satisfactory. The mean total score of the scale was 21.78. The internal consistency coefficient was 0.94, and the correlation between the two halves of the scale was 0.89. Conclusion: The Chronic Pain Acceptance Questionnaire, which is intended to be used in Turkey upon confirmation of its validity and reliability, is an evaluation instrument with sufficient validity and reliability, and it can be reliably used to examine patients’ acceptance
Robbins, Mandy; Francis, Leslie J; Bradford, Amanda
A sample of 16 male and 30 female undergraduates completed the Greer and Francis Scale of Rejection of Christianity. The data support the internal consistency reliability and construct validity of the scale for this sample.
Corty, E W; Althof, S E; Kurit, D M
The present study assessed the reliability and validity of a measure of sexual functioning, the CMSH-SFQ, for male patients and their partners. The CMSH-SFQ measures erectile and orgasmic functioning, sexual drive, frequency of sexual behavior, and sexual satisfaction. Test-retest reliability was assessed with 19 males and 19 females for the baseline CMSH-SFQ. Criterion validity was measured by comparing the answers of 25 male patients to those of their partners at baseline and follow-up. The majority of items had acceptable levels of reliability and validity. The CMSH-SFQ provides a reliable and valid device that can be used to measure global sexual functioning in men and their partners and may be used to evaluate the efficacy of treatments for sexual dysfunctions. Limitations and suggestions for use of the CMSH-SFQ are addressed.
McDonald, Ann E; Vigen, Cheryl
This study examined the ability of a two-part self-report instrument, the McDonald Play Inventory, to reliably and validly measure the play activities and play styles of 7- to 11-yr-old children and to discriminate between the play of neurotypical children and children with known learning and developmental disabilities. A total of 124 children ages 7-11 recruited from a sample of convenience and a subsample of 17 parents participated in this study. Reliability estimates yielded moderate correlations for internal consistency, total test intercorrelations, and test-retest reliability. Validity estimates were established for content and construct validity. The results suggest that a self-report instrument yields reliable and valid measures of a child's perceived play performance and discriminates between the play of children with and without disabilities. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Previous research funded by Florida Department of Transportation (FDOT) developed a method for estimating : travel time reliability for arterials. This method was not initially implemented or validated using field data. This : project evaluated and r...
Stice, Eric; Fisher, Melissa; Martinez, Erin
The authors conducted 4 studies investigating the reliability and validity of the Eating Disorder Diagnostic Scale (HDDS; E. Stice, C. F. Telch, & S. L. Rizvi, 2000), a brief self-report measure for diagnosing anorexia nervosa, bulimia nervosa, and binge eating disorder. Study 1 found that the HDDS showed criterion validity with interview-based…
Due, Ulla; Ottesen, Marianne
Objective. To revise, validate and test for reliability an anal sphincter rupture questionnaire in relation to construct, content and face validity. Setting and background. Since 1996 women with anal sphincter rupture (ASR) at one of the public university hospitals in Copenhagen, Denmark have bee...
Kara, Kerime C; Çıtak Karakaya, İlkim; Tunalı, Nur; Karakaya, Mehmet G
The aim of this study was to investigate the reliability and validity of the Turkish version of the Incontinence Quiz, which was developed by Branch et al. (1994), to assess women's knowledge of and attitudes toward urinary incontinence. Comprehensibility of the Turkish version of the 14-item Incontinence Quiz, which was prepared following translation-back translation procedures, was tested on a pilot group of eight women, and its internal reliability, test-retest reliability and construct validity were assessed in 150 women who attended the gynecology clinics of three hospitals in İçel, Turkey. Physical and sociodemographic characteristics and presence of incontinence complaints were also recorded. Data were analyzed at the 0.05 alpha level, using SPSS version 22. The scale had good reliability and validity. The internal reliability coefficient (Cronbach α) was 0.80, test-retest correlation coefficients were 0.83-0.94; and with regard to construct validity, Kaiser-Meyer-Olkin coefficient was 0.76 and Barlett sphericity test was 562.777 (P = 0.000). Turkish version of the Incontinence Quiz had a four-factor structure, with Eigenvalues ranging from 1.17 to 4.08. The Incontinence Quiz-Turkish version is a highly comprehensible, reliable and valid scale, which may be used to assess Turkish-speaking women's knowledge of and attitudes toward urinary incontinence. © 2017 Japan Society of Obstetrics and Gynecology.
Watt, Torquil; Hegedüs, Laszlo; Groenvold, Mogens
Background Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated. Aim To investigate clinical ('known-groups') validity and test-retest reliability of the Danish version of the ThyPRO. Methods For each of the 13 ThyPRO scales, we defined groups expected to have high versus low scores ('known-groups'). The clinical validity (known......-groups validity) was evaluated by whether the ThyPRO scales could detect expected differences in a cross-sectional study of 907 thyroid patients. Test-retest reliability was evaluated by intra-class correlations of two responses to the ThyPRO 2 weeks apart in a subsample of 87 stable patients. Results On all 13...
Lund, Rikke; Nielsen, Lene Snabe; Henriksen, Pia Wichmann
OBJECTIVE: The aim of the present article is to describe the face and content validity as well as reliability of the Copenhagen Social Relations Questionnaire (CSRQ). METHOD: The face and content validity test was based on focus group discussions and individual interviews with 31 informants...... from the interviews. Two additional themes not covered by CSRQ on dynamics and reciprocity of social relations were identified. DISCUSSION: CSRQ holds satisfactory face and content validity as well as reliability, and is suitable for measuring structure and function of social relations including...
Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C
Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Full Text Available Abstract Background Wolfram syndrome (WFS is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS. Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease. WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age. Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93, moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91 and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, ps>.76, ps=-.86, p=.001. The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83. Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS.
'Responsibilist\\' approaches to epistemology link knowledge and justification with epistemically responsible belief management, where responsible management is understood to involve an essential element of guidance by recognized epistemic norms. By contrast, reliabilist approaches stress the de facto reliability of ...
Due, Ulla; Ottesen, Marianne
Objective. To revise, validate and test for reliability an anal sphincter rupture questionnaire in relation to construct, content and face validity. Setting and background. Since 1996 women with anal sphincter rupture (ASR) at one of the public university hospitals in Copenhagen, Denmark have been...... main questions but one. Two questions needed further explanation. Seven women made minor errors. Conclusion. The validated Danish questionnaire has a good construct, content and face validity. It is a well accepted, reliable, simple and clinically relevant screening tool. It reveals physical problems...... offered pelvic floor muscle examination and instruction by a specialist physiotherapist. In relation to that, a non-validated questionnaire about anal and urinary incontinence was to be answered six months after childbirth. Method. The original questionnaire was revised and a pilot test was performed...
Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K
Purpose To evaluate validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise to investigate other types of validity, such as construct validity, in further research.
Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen
Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Huang, Ting-Cheng; Zhang, Yong-Jun
Incentive-based demand response (IBDR) can guide customers to adjust their behaviour of electricity and curtail load actively. Meanwhile, distributed generation (DG) and energy storage system (ESS) can provide time for the implementation of IBDR. The paper focus on the reliability evaluation of microgrid considering IBDR. Firstly, the mechanism of IBDR and its impact on power supply reliability are analysed. Secondly, the IBDR dispatch model considering customer’s comprehensive assessment and the customer response model are developed. Thirdly, the reliability evaluation method considering IBDR based on Monte Carlo simulation is proposed. Finally, the validity of the above models and method is studied through numerical tests on modified RBTS Bus6 test system. Simulation results demonstrated that IBDR can improve the reliability of microgrid.
Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt
physicians' OSAUS scores with diagnostic accuracy. RESULTS: The generalizability coefficient was high (0.81) and a D-study demonstrated that 1 assessor and 5 cases would result in similar reliability. The construct validity of the OSAUS scale was supported by a significant difference in the mean scores......OBJECTIVE: To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. BACKGROUND: POC US is increasingly used by clinicians and is an essential part of the management of acute surgical...... conditions. However, the quality of performance is highly operator-dependent. Therefore, reliable and valid assessment of trainees' ultrasonography competence is needed to ensure patient safety. METHODS: Twenty-four physicians, representing novices, intermediates, and experts in POC US, scanned 4 different...
Konge, Lars; Lehnert, Per; Hansen, Henrik Jessen
BACKGROUND: As we move toward competency-based education in medicine, we have lagged in developing competency-based evaluation methods. In the era of minimally invasive surgery, there is a need for a reliable and valid tool dedicated to measure competence in video-assisted thoracoscopic surgery....... The purpose of this study is to create such an assessment tool, and to explore its reliability and validity. METHODS: An expert group of physicians created an assessment tool consisting of 10 items rated on a five-point rating scale. The following factors were included: economy and confidence of movement...
Full Text Available Objective: Discomfort Intolerance Scale was developed by Norman B. Schmidt et al. to assess the individual differences of capacity to withstand physical perturbations or uncomfortable bodily states (2006. The aim of this study is to investigate the validity and reliability of Discomfort Intolerance Scale-Turkish Version (RDÖ. Method: From two different universities, total of 225 students (male=167, female=58 were participated in this study. In order to determine the criterion validity, Beck Anxiety Inventory (BAI and State-Trait Anxiety Inventory (STAI were used. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability the scale was re-applied to 54 participants 6 weeks later. Results: To assess construct validity of DIS, factor analyses were performed using varimax principal components analysis with varimax rotation. The factor analysis resulted in two factors named “discomfort (in tolerance” and “discomfort avoidance”. The Cronbach’s alpha coefficient for the entire scale, discomfort-(intolerance subscale, discomfortavoidance subscale were, .592, .670, .600 respectively. Correlations between two factors of DIS, discomfort intolerance and discomfort avoidance, and Trait Anxiety Inventory of STAI (State-Trait Anxiety Inventory were statistically significant at the level of 0.05. Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DIS had a satisfactory level of reliability and validity in Turkish university students.
Duus, Nicolaj; Shogilev, Daniel J; Skibsted, Simon
PURPOSE: We investigated the reproducibility of passive leg raise (PLR) and fluid bolus (BOLUS) using the Non-Invasive Cardiac Output Monitor (NICOM; Cheetah Medical, Tel Aviv, Israel) for assessment of fluid responsiveness (FR) in spontaneously breathing emergency department (ED) patients. METHODS...
Carlsen, C G; Lindorff Larsen, Karen; Funch-Jensen, P
PURPOSE: Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity...... of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. METHODS: Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia...... a significant difference between the three groups which indicates construct validity, p skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment...
Minner, Daphne Diane
The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating
Garnacho-Castaño, Manuel V.; López-Lastra, Silvia; Maté-Muñoz, José L.
The objectives of the study were to determine the validity and reliability of peak velocity (PV), average velocity (AV), peak power (PP) and average power (AP) measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain) during two resistance exercises, bench press (BP) and full back squat (BS), performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2). Intraclass correlation coefficients (ICCs) indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W). Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W). Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP) make this device a useful tool for monitoring resistance training. Key points This study determined the validity and reliability of peak velocity, average velocity, peak power and average power measurements made using a linear position transducer The Tendo Weight-lifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and power. PMID:25729300
Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa
Interventions to counter the influence of television food advertising amongst children are important. Thus, reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: firstname.lastname@example.org.
Monbaliu, E; Ortibus, E; Roelens, F; Desloovere, K; Deklerck, J; Prinzie, P; de Cock, P; Feys, H
This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System and six patients were classified at level V. Reliability was measured by (1) intraclass correlation coefficient (ICC) for interrater reliability, (2) standard error of measurement (SEM) and smallest detectable difference (SDD), and (3) Cronbach's alpha for internal consistency. Validity was assessed by Pearson's correlations among the three scales used and by content analysis. Moderate to good interrater reliability was found for total scores of the three scales (ICC: BADS=0.87; BFMMS=0.86; UDRS=0.79). However, many subitems showed low reliability, in particular for the UDRS. SEM and SDD were respectively 6.36% and 17.72% for the BADS, 9.88% and 27.39% for the BFMMS, and 8.89% and 24.63% for the UDRS. High internal consistency was found. Pearson's correlations were high. Content validity showed insufficient accordance with the new CP definition and classification. Our results support the internal consistency and concurrent validity of the scales; however, taking into consideration the limitations in reliability, including the large SDD values and the content validity, further research on methods of assessment of dystonia is warranted.
Enevoldsen, I.; Faber, M. H.; Sørensen, John Dalsgaard
Problems in connection with estimation of the reliability of a component modelled by a limit state function including noise or first order discontinuitics are considered. A gradient free adaptive response surface algorithm is developed. The algorithm applies second order polynomial surfaces...
Hadley, Wendy; Stewart, Angela; Hunter, Heather L; Affleck, Katelyn; Donenberg, Geri; Diclemente, Ralph; Brown, Larry K
We evaluated the reliability and validity of the Dyadic Observed Communication Scale (DOCS) coding scheme, which was developed to capture a range of communication components between parents and adolescents. Adolescents and their caregivers were recruited from mental health facilities for participation in a large, multi-site family-based HIV prevention intervention study. Seventy-one dyads were randomly selected from the larger study sample and coded using the DOCS at baseline. Preliminary validity and reliability of the DOCS was examined using various methods, such as comparing results to self-report measures and examining interrater reliability. Results suggest that the DOCS is a reliable and valid measure of observed communication among parent-adolescent dyads that captures both verbal and nonverbal communication behaviors that are typical intervention targets. The DOCS is a viable coding scheme for use by researchers and clinicians examining parent-adolescent communication. Coders can be trained to reliably capture individual and dyadic components of communication for parents and adolescents and this complex information can be obtained relatively quickly.
Ahmed, Hussam; Chateauneuf, Alaa
The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces
Hop, M.; Moues, C.; Bogomolova, K.; Nieuwenhuis, M.; Oen, I.; Middelkoop, E.; Breederveld, R.; de Baar, M.
Objective: The aim of this study was to examine the reliability and validity of using photographs of burns to assess both burn size and depth. Method: Fifty randomly selected photographs taken on day 0-1 post burn were assessed by seven burn experts and eight referring physicians. Inter-rater
Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel
The purpose of this study was to investigate validity and reliability of the test of creative imagination. This study was conducted with the participation of 1000 children, aged between 9-14 and were studying in six primary schools in the city center of Denizli Province, chosen by cluster ratio sampling. In the study, it was revealed that the…
Young, Daniel Kim-Wan; Ng, Petrus Y. N.; Pan, Jia-Yan; Cheng, Daphne
Purpose: This study aims to translate and test the reliability and validity of the Internalized Stigma of Mental Illness-Cantonese (ISMI-C). Methods: The original English version of ISMI is translated into the ISMI-C by going through forward and backward translation procedure. A cross-sectional research design is adopted that involved 295…
Kerkhoffs, Gino M. M. J.; Blankevoort, Leendert; Sierevelt, Inger N.; Corvelein, Ruby; Janssen, Guido H. W.; van Dijk, C. Niek
Two test devices were manufactured to objectively measure ankle joint laxity: the dynamic anterior ankle tester (DAAT) and the quasi-static anterior ankle tester (QAAT). The primary aim was to analyse the reliability of both testers; The secondary aim was to assess validity in correlation with TELOS
Halpin, Glennelle; Halpin, Gerald
Research indicating that different cut-off points result from the use of different standard-setting techniques leaves decision makers with a disturbing dilemma: Which standard-setting method is best? This investigation of the reliability and validity of 10 different standard-setting approaches was designed to provide information that might help…
Yildiz, F. Ülkü; Çagdas, Aysel; Kayili, Gökhan
The purpose of this study is to perform the validity-reliability analysis of the three subtests of Basic School Skills Inventory 3--Mathematics, Classroom Behavior and Daily Life skills--and do its adaptation for four to six year-old Turkish children. The sample of the study included 595 four to six year-old Turkish children attending public and…
Tretter, Thomas R.; Brown, Sherri L.; Bush, William S.; Saderholm, Jon C.; Holmes, Vicki-Lynn
Science teachers' content knowledge is an important influence on student learning, highlighting an ongoing need for programs, and assessments of those programs, designed to support teacher learning of science. Valid and reliable assessments of teacher science knowledge are needed for direct measurement of this crucial variable. This paper…
Arevalo Romero, J.; Brinkkemper, T.; van der Heide, A.; Rietjens, J.A.; Ribbe, M.W.; Deliens, L.; Loer, S.A.; Zuurmond, W.W.A.; Perez, R.S.G.M.
Context: Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. Objectives: To study the reliability and validity of
van der Wulp, I.
Reliability and validity of triage systems is important because this can affect patient safety. In this thesis, these aspects of two emergency department (ED) triage systems were studied as well as methodological aspects in these types of studies. The consistency, reproducibility, and criterion
Putnam, Frank W.; And Others
Evaluation of the Child Dissociative Checklist found it to be a reliable and valid observer report measure of dissociation in children, including sexually abused girls and children with dissociative disorder and with multiple personality disorder. The checklist, which is appended, is intended as a clinical screening instrument and research measure…
Kooiman, Thea; Dontje, Manon L.; Sprenger, Siska; Krijnen, Wim; van der Schans, Cees; de Groot, Martijn
Background: Activity trackers can potentially stimulate users to increase their physical activity behavior. The aim of this study was to examine the reliability and validity of ten consumer activity trackers for measuring step count in both laboratory and free-living conditions. Method: Healthy
Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. Objective: To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. Design: A blinded, repeated-measures diagnostic test was carried ...
Automated Body Reaction Test (ABRT) is a new device for skills and physical assessment instrument to measure ability on react, move quickly and accurately in accordance with stimulus. A total of 474 subjects aged 7-17 years old were randomly selected for the construct validity (n=330) and reliability (n=144). The ABRT ...
The aim of this study is to develop a useful, valid and reliable measurement tool that will help teacher candidates determine their Turkish metalinguistic awareness. During the development of the scale, a pool of items was created by scanning the relevant literature and examining other awareness scales. The materials prepared were re-examined…
Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako
The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.
Results: Two valid factors emerged with items 1-3 and items 4, 5 & 7 loading on respectively, making the BFSS a twodimensional (multidimensional) scale which measures 2 aspects of brain fag [labeled burning sensation and crawling sensation respectively]. The reliability analysis yielded a Cronbach Alpha coefficient of ...
Fuchs, Lynn; And Others
A study was conducted to explore the reliability and validity of three prominent procedures used in informal reading inventories (IRIs): (1) choosing a 95% word recognition accuracy standard for determining student instructional level, (2) arbitrarily selecting a passage to represent the difficulty level of a basal reader, and (3) employing…
Background The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. Methods We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson’s correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach’s alpha coefficient; the intraclass (ICC) and Pearson’s correlation coefficients were used for test-retest reliability. Results One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson’s correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson’s correlation coefficient was 0.89 and ICC was 0.90. Conclusions The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality. PMID:23327598
In facilitating cross-cultural study in the field of psychology and Logotherapy, the reliability and validity of the logotest which measures inner meaning fulfillment was carried out among 885 University of Ibadan students, 439 males and 434 females, aged between 15 and 60 years old with mean X age of 6.0. Data analyses ...
Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto
Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Full Text Available Validity and Reliability of Agoraphobic Cognitions Questionnaire-Turkish Version Objective: The aim of this study is to investigate the validity and reliability of Agoraphobic Cognitions Questionnaire -Turkish Version (ACQ. Method: ACQ was administered to 92 patients with agoraphobia or panic disorder with agoraphobia. BSQ Turkish version completed by translation, back-translation and pilot assessment. Reliability of ACQ was analyzed by test-retest correlation, split-half technique, Cronbach’s alpha coefficient. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Bartlett test had been performed. Principal component analysis and varimax rotation used for factor analysis. Results: 64% of patients evaluated in the study were female and 36% were male. Age interval was between 18 and 58, mean age was 31.5±10.4. The Cronbach’s alpha coefficient was 0.91. Analysis of test-retest evaluations revealed that there were statistically significant correlations ranging between 24% and 84% concerning questionnaire components. In analysis performed by split-half method reliability coefficients of half questionnaires were found as 0.77 and 0.91. Again Spearmen-Brown coefficient was found as 0.87 by the same analysis. To assess construct validity of ACQ, factor analysis was performed and two basic factors found. These two factors explained 57.6% of the total variance. (Factor 1: 34.6%, Factor 2: 23% Conclusion: Our findings support that ACQ-Turkish version had a satisfactory level of reliability and validity
Schlegel, Claudia; Woermann, Ulrich; Rethans, Jan-Joost; van der Vleuten, Cees
In the training of healthcare professionals, one of the advantages of communication training with simulated patients (SPs) is the SP's ability to provide direct feedback to students after a simulated clinical encounter. The quality of SP feedback must be monitored, especially because it is well known that feedback can have a profound effect on student performance. Due to the current lack of valid and reliable instruments to assess the quality of SP feedback, our study examined the validity and reliability of one potential instrument, the 'modified Quality of Simulated Patient Feedback Form' (mQSF). Content validity of the mQSF was assessed by inviting experts in the area of simulated clinical encounters to rate the importance of the mQSF items. Moreover, generalizability theory was used to examine the reliability of the mQSF. Our data came from videotapes of clinical encounters between six simulated patients and six students and the ensuing feedback from the SPs to the students. Ten faculty members judged the SP feedback according to the items on the mQSF. Three weeks later, this procedure was repeated with the same faculty members and recordings. All but two items of the mQSF received importance ratings of > 2.5 on a four-point rating scale. A generalizability coefficient of 0.77 was established with two judges observing one encounter. The findings for content validity and reliability with two judges suggest that the mQSF is a valid and reliable instrument to assess the quality of feedback provided by simulated patients.
Yoshizumi, Takahiro; Murase, Satomi; Murakami, Takashi; Takai, Jiro
The purposes of the present study were to develop a Parenting Scale of Inconsistency and to evaluate its initial reliability and validity. The 12 items assess the inconsistency among parents' moods, behaviors, and attitudes toward children. In the primary study, 517 participants completed three measures: the new Parenting Scale of Inconsistency, the Parental Bonding Instrument, and the Depression Scale of the General Health Questionnaire. The Parenting Scale of Inconsistency had good test-retest reliability of .85 and internal consistency of .88 (Cronbach coefficient alpha). Construct validity was good as Inconsistency scores were significantly correlated with the Care and Overprotection scores of the Parental Bonding Instrument and with the Depression scores. Moreover, Inconsistency scores' relation with a dimension of parenting style distinct from Care and Overprotection suggested that the Parenting Scale of Inconsistency had factorial validity. This scale seems a potential measure for examining the relationships between inconsistent parenting and the mental health of children.
Paulus, David C; Reynolds, Michael C; Schilling, Brian K
During the concentric portion of the free-weight squat exercise, accelerating the mass from rest results in a fluctuation in ground reaction force. It is characterized by an initial period of force greater than the load while accelerating from rest followed by a period of force lower than the external load during negative acceleration. During the deceleration phase, less force is exerted and muscles are loaded sub-optimally. Thus, using a reduced inertia form of resistance such as pneumatics has the capability to minimize these inertial effects as well as control the force in real time to maximize the force exerted over the exercise cycle. To improve the system response of a preliminary design, a squat device was designed with a reduced mass barbell and two smaller pneumatic cylinders. The resistance was controlled by regulating cylinder pressure such that it is capable of adjusting force within a repetition to maximize force exerted during the lift. The resistance force production of the machine was statically validated with the input voltage and output force R2 =0.9997 for at four increments of the range of motion, and the intraclass correlation coefficient (ICC) between trials at the different heights equaled 0.999. The slew rate at three forces was 749.3 N/s +/- 252.3. Dynamic human subject testing showed the desired input force correlated with average and peak ground reaction force with R2 = 0.9981 and R2 = 0.9315, respectively. The ICC between desired force and average and peak ground reaction force was 0.963. Thus, the system is able to deliver constant levels of static and dynamic force with validity and reliability. Future work will be required to develop the control strategy required for real-time control, and performance testing is required to determine its efficacy.
Yoshii, Hatsumi; Mandai, Nozomu; Saito, Hidemitsu; Akazawa, Kouhei
Self-stigma, defined by a negative attitude toward oneself combined with the consciousness of being a target of prejudice, is a critical problem for psychiatric patients. Self-stigma studies among psychiatric patients have indicated that high stigma is predictive of detrimental effects such as the delay of treatment and decreases in social participation in patients, and levels of self-stigma should be statistically evaluated. In this study, we developed the Workplace Social Distance Scale (WSDS), rephrasing the eight items of the Japanese version of the Social Distance Scale (SDSJ) to apply to the work setting in Japan. We examined the reliability and validity of the WSDS among 83 psychiatric patients. Factor analysis extracted three factors from the scale items: "work relations," "shallow relationships," and "employment." These factors are similar to the assessment factors of the SDSJ. Cronbach's alpha coefficient for the WSDS was 0.753. The split-half reliability for the WSDS was 0.801, indicating significant correlations. In addition, the WSDS was significantly correlated with the SDSJ. These findings suggest that the WSDS represents an approximation of self-stigma in the workplace among psychiatric patients. Our study assessed the reliability and validity of the WSDS for measuring self-stigma in Japan. Future studies should investigate the reliability and validity of the scale in other countries.
Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur
The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred-sixty one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: email@example.com.
Full Text Available Purpose: Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. Methods: A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF with oblimin rotation, a scree plot test, and parallel analysis (PA. The reliability of LSS was tested using Cronbach’s α, corrected item-total correlation, and test-retest. Results: Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach’s α was > 0.70 for all subscales in both study populations. The corrected item-total correlations were > 0.30 for the items in each component. Conclusion: The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L
Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups which indicates construct validity, p fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.
Abdollahimohammad, Abdolghani; Ja'afar, Rogayah
Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF) with oblimin rotation, a scree plot test, and parallel analysis (PA). The reliability of LSS was tested using Cronbach's α, corrected item-total correlation, and test-retest. Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach's α was >0.70 for all subscales in both study populations. The corrected item-total correlations were >0.30 for the items in each component. The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Shrestha, Bidhan; Niraula, Surya Raj; Parajuli, Prakash K; Suwal, Pramita; Singh, Raj Kumar
To assess the reliability and to validate the translated Nepalese version of the Oral Health Impact Profile (OHIP-EDENT-N) in Nepalese edentulous subjects. The international guidelines for translation and cross-cultural adaption of OHIP-EDENT were followed, and a Nepalese version of the questionnaire was adapted for this study. Eighty-eight completely edentulous subjects were then selected for the study and completed their responses for the questionnaire. The reliability of the OHIP-EDENT-N was evaluated using internal consistency. Validity was assessed as construct and convergent validity. Construct validity was determined using exploratory factor analysis (EFA). The correlation between OHIP-EDENT-N subscale scores and the global question was investigated to test the convergent validity. Cronbach's alpha for the total score of OHIP-EDENT-N was 0.78. Construct validity was assessed by factor analysis: 70.196% of the variance was accountable to five factors extracted from the factor analysis. Factor loadings above 0.40 were noted for all items. In terms of convergent validity, significant correlations could be established between OHIP-EDENT-N and global questions. This study has been able to establish the reliability and validity of the OHIP-EDENT-N, and OHIP-EDENT-N can be a considered a reliable tool to assess the oral health related quality of life in the Nepalese edentulous population. © 2016 by the American College of Prosthodontists.
Arevalo, Jimmy J; Brinkkemper, Tijn; van der Heide, Agnes; Rietjens, Judith A; Ribbe, Miel; Deliens, Luc; Loer, Stephan A; Zuurmond, Wouter W A; Perez, Roberto S G M
Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. To study the reliability and validity of observer-based sedation scales in palliative sedation. A prospective evaluation of 54 patients under intermittent or continuous sedation with four sedation scales was performed by 52 nurses. Included scales were the Minnesota Sedation Assessment Tool (MSAT), Richmond Agitation-Sedation Scale (RASS), Vancouver Interaction and Calmness Scale (VICS), and a sedation score proposed in the Guideline for Palliative Sedation of the Royal Dutch Medical Association (KNMG). Inter-rater reliability was tested with the intraclass correlation coefficient (ICC) and Cohen's kappa coefficient. Correlations between the scales using Spearman's rho tested concurrent validity. We also examined construct, discriminative, and evaluative validity. In addition, nurses completed a user-friendliness survey. Overall moderate to high inter-rater reliability was found for the VICS interaction subscale (ICC = 0.85), RASS (ICC = 0.73), and KNMG (ICC = 0.71). The largest correlation between scales was found for the RASS and KNMG (rho = 0.836). All scales showed discriminative and evaluative validity, except for the MSAT motor subscale and VICS calmness subscale. Finally, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. The RASS and KNMG scales stand as the most reliable and valid among the evaluated scales. In addition, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. Further research is needed to evaluate the impact of the scales on better symptom control and patient comfort. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
Ahmet Emre SARGIN
Full Text Available Objective: Distress Tolerance Scale (DTS is developed by Simons and Gaher in order to measure individual differences in the capacity of distress tolerance.The aim of this study is to assess the reliability and validity of the Turkish version of DTS. Method: One hundred and sixty seven university students (male=66, female=101 participated in this study. Beck Anxiety Inventory (BAI, State-trait Anxiety Inventory (STAI and Discomfort Intolerance Scale (DIS were used to determine the criterion validity. Construct validity was evaluated with factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability, the scale was re-applied to 79 participants six weeks later. Results: To assess construct validity, factor analyses were performed using varimax principal components analysis with varimax rotation. While there were factors in the original study, our factor analysis resulted in three factors. Cronbach’s alpha coefficients for the entire scale and tolerance, regulation, self-efficacy subscales were .89, .90, .80 and .64 respectively. There were correlations at the level of 0.01 between the Trait Anxiety Inventory of STAI and BAI, and all the subscales of DTS and also between the State Anxiety Inventory and regulation subscale. Both of the subscales of DIS were correlated with the entire subscale and all the subscales except regulation at the level of 0.05.Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DTS had a satisfactory level of reliability and validity in Turkish university students.
Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl
To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P volleyball players.
Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob
study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately......The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present...... = .07). The limits of agreement were ±18.53. A strong correlation was found between test and retest (intercorrelation coefficient .908); the standard error of measurement was 6.7, and the minimal detectable change was 18.5. The Danish version of the ATRS showed moderately strong criterion validity...
Køster, B; Søndergaard, J; Nielsen, J B; Olsen, A; Bentzen, J
An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high. The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed questionnaire for monitoring and evaluating population sun-related behavior. Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly in a questionnaire adapted to measure behavior throughout the summer. The response rates for questionnaire 1, 2 and 3 were high and the drop out was not dependent on demographic characteristic. There was at least 73% agreement between sunburns in the measurement week and the entire summer, and a possible sunburn underestimation in questionnaires summarizing the entire summer. The participants underestimated their outdoor exposure in the evaluation covering the entire summer as compared to the measurement week. The reliability of scales measuring attitude and knowledge was high for majority of scales, while consistency in protection behavior was low. To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability, while consistency of protection behavior in general and in a week's measurement was low.
Sorenson, Shawn C.; Romano, Russell; Scholefield, Robin M.; Schroeder, E. Todd; Azen, Stanley P.; Salem, George J.
Context Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. Objective To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Design Descriptive laboratory study. Setting A large National Collegiate Athletic Association Division I university. Patients or Other Participants A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Intervention(s) Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Main Outcome Measure(s) Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Results Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent
Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo
Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit ( congruent with0.50). The individual subscales of the mVSS fell well below the acceptable level (0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars shows high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.
Collins, Cristiana Kahl; Johnson, Vicky Saliba; Godwin, Ellen M; Pappas, Evangelos
To determine the reliability and validity of the Saliba Postural Classification System (SPCS). Two physical therapists classified pictures of 100 volunteer participants standing in their habitual posture for inter and intra-tester reliability. For validity, 54 participants stood on a force plate in a habitual and a corrected posture, while a vertical force was applied through the shoulders until the clinician felt a postural give. Data were extracted at the time the give was felt and at a time in the corrected posture that matched the peak vertical ground reaction force (VGRF) in the habitual posture. Inter-tester reliability demonstrated 75% agreement with a Kappa = 0.64 (95% CI = 0.524-0.756, SE = 0.059). Intra-tester reliability demonstrated 87% agreement with a Kappa = 0.8, (95% CI = 0.702-0.898, SE = 0.05) and 80% agreement with a Kappa = 0.706, (95% CI = 0.594-0818, SE = 0.057). The examiner applied a significantly higher (p < 0.001) peak vertical force in the corrected posture prior to a postural give when compared to the habitual posture. Within the corrected posture, the %VGRF was higher when the test was ongoing vs. when a postural give was felt (p < 0.001). The %VGRF was not different between the two postures when comparing the peaks (p = 0.214). The SPCS has substantial agreement for inter- and intra-tester reliability and is largely a valid postural classification system as determined by the larger vertical forces in the corrected postures. Further studies on the correlation between the SPCS and diagnostic classifications are indicated.
Manuel V. Garnacho-Castaño
Full Text Available The objectives of the study were to determine the validity and reliability of peak velocity (PV, average velocity (AV, peak power (PP and average power (AP measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain during two resistance exercises, bench press (BP and full back squat (BS, performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2. Intraclass correlation coefficients (ICCs indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W. Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W. Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP make this device a useful tool for monitoring resistance training.
Erkan Alpsoy; Yeşim Şenol; Aslı Bilgiç Temel; G. Özge Baysal; Ayşe Akman Karakaş
Backround and design. Internalized stigma involves endorsing negative feelings and beliefs such as insignificance, shame and withdrawal triggered by applying these negative stereotypes to one self. Internalized Stigma Scale has not been applied to psoriasis patients. We aimed to evaluate the reliability and validity of Internalized Stigma Scale in psoriasis patients. Materials and Methods. 100 consecutive, volunteer psoriasis patients (48 female, 52 male; aged, 40.59±15.44 years) were enro...
Sayed Hadi Sayed Alitabar; Mojtaba Habibi; Maryam Falahatpisheh; Musa Arvin
Background and Objective: According to the increasing of substance use in the country, more researches about this phenomenon are necessary. This Study Investigates the Validity, Reliability and Confirmatory Factor Structure of the Drug Abuse Screening test (DAST). Materials and Methods: The Sample Consisted of 381 Patients (143 Women and 238 Men) with a Multi-Stage Cluster Sampling of Areas 2, 6 and 12 of Tehran Were Selected from Each Region, 6 Randomly Selected Drug Rehabilitation Center. T...
Park, Dae-Sung; Lee, GyuChang
A balance test provides important information such as the standard to judge an individual's functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment software using the Nintendo Wii Balance Board, investigated its reliability and validity, and compared it with a laboratory-grade force platform. Twenty healthy adults participated in our study. The participants participated in the test for inter-rater reliability, intra-rater reliability, and concurrent validity. The tests were performed with balance assessment software using the Nintendo Wii balance board and a laboratory-grade force platform. Data such as Center of Pressure (COP) path length and COP velocity were acquired from the assessment systems. The inter-rater reliability, the intra-rater reliability, and concurrent validity were analyzed by an intraclass correlation coefficient (ICC) value and a standard error of measurement (SEM). The inter-rater reliability (ICC: 0.89-0.79, SEM in path length: 7.14-1.90, SEM in velocity: 0.74-0.07), intra-rater reliability (ICC: 0.92-0.70, SEM in path length: 7.59-2.04, SEM in velocity: 0.80-0.07), and concurrent validity (ICC: 0.87-0.73, SEM in path length: 5.94-0.32, SEM in velocity: 0.62-0.08) were high in terms of COP path length and COP velocity. The balance assessment software incorporating the Nintendo Wii balance board was used in our study and was found to be a reliable assessment device. In clinical settings, the device can be remarkably inexpensive, portable, and convenient for the balance assessment.
Cetin, Fatma Cosar; Sezer, Ayse; Merih, Yeliz Dogan
OBJECTIVE: The objective of this study is to investigate the validity and the reliability of Birth Satisfaction Scale (BSS) and to adapt it into the Turkish language. This scale is used for measuring maternal satisfaction with birth in order to evaluate women’s birth perceptions. METHODS: In this study there were 150 women who attended to inpatient postpartum clinic. The participants filled in an information form and the BSS questionnaire forms. The properties of the scale were tested by conducting reliability and validation analyses. RESULTS: BSS entails 30 Likert-type questions. It was developed by Hollins Martin and Fleming. Total scale scores ranged between 30–150 points. Higher scores from the scale mean increases in birth satisfaction. Three overarching themes were identified in Scale: service provision (home assessment, birth environment, support, relationships with health care professionals); personal attributes (ability to cope during labour, feeling in control, childbirth preparation, relationship with baby); and stress experienced during labour (distress, obstetric injuries, receiving sufficient medical care, obstetric intervention, pain, prolonged labour and baby’s health). Cronbach’s alfa coefficient was 0.62. CONCLUSION: According to the present study, BSS entails 30 Likert-type questions and evaluates women’s birth perceptions. The Turkish version of BSS has been proven to be a valid and a reliable scale. PMID:28058355
Hill, C.; Robinson, L.
Mammographers currently score their own images according to criteria set out by Regional Quality Assurance. The criteria used are based on the ‘Perfect, Good, Moderate, Inadequate’ (PGMI) marking criteria established by the National Health Service Breast Screening Programme (NHSBSP) in their Quality Assurance Guidelines of 2006 1 . This document discusses the validity and reliability of the current mammography image assessment scheme. Commencing with a critical review of the literature this document sets out to highlight problems with the national approach to the use of marking schemes. The findings suggest that ‘PGMI’ scheme is flawed in terms of reliability and validity and is not universally applied across the UK. There also appear to be differences in schemes used by trainees and qualified mammographers. Initial recommendations are to be made in collaboration with colleagues within the National Health Service Breast Screening Programme (NHSBSP), Higher Education Centres, College of Radiographers and the Royal College of Radiologists in order to identify a mammography image appraisal scheme that is fit for purpose. - Highlights: • Currently no robust evidence based marking tools in use for the assessment of images in mammography. • Is current system valid, reliable and robust? • How can the current image assessment tool be improved? • Should students and qualified mammographers use the same tool? • What marking criteria are available for image assessment?
Visser, Martine; Leentjens, Albert F. G.; Marinus, Johan; Stiggelbout, Anne M.; van Hilten, Jacobus J.
We evaluated the validity, reliability, and potential responsiveness of the Beck Depression Inventory (BDI) in patients with Parkinson's disease (PD). In part 1 of the study, 92 patients with PD underwent a structured clinical interview for DSM major depression and based on this patients were
Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob; Troelsen, Anders
The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately strong correlations with the physical subscores of the Medical Outcomes Study 36-item Short-Form Health Survey (r = .70 to .75; p questionnaire (r = .71; p validity. For study and follow-up purposes, the ATRS seems reliable for comparisons of groups of patients. Its usability is limited for repeated assessment of individual patients. The development of analysis guidelines would be desirable. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E
Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.
Curry, M A; Campbell, R A; Christian, M
Two studies of low-income pregnant women (N = 179) were done to examine the validity and reliability of the Prenatal Psychosocial Profile (PPP). The PPP, a composite of the Rosenberg Self-Esteem Scale, the Support Behaviors Inventory, and a newly developed measure of stress, is a brief, comprehensive clinical assessment of psychosocial risk during pregnancy. Construct validity of the stress scale was supported by theoretically predicted negative correlations with self-esteem, partner support, and support from others (N = 91). Convergent validity of the stress scale was demonstrated by a correlation of .71 with the Difficult Life Circumstances Scale. Adequate levels of internal consistency were found. Interrelationships between the four subscales were consistent with the underlying conceptualization, and there was beginning evidence of the factorial independence of the subscales.
Anderson Kathryn L
Full Text Available Abstract The Quality of Life Scale (QOLS, created originally by American psychologist John Flanagan in the 1970's, has been adapted for use in chronic illness groups. This paper reviews the development and psychometric testing of the QOLS. A descriptive review of the published literature was undertaken and findings summarized in the frequently asked questions format. Reliability, content and construct validity testing has been performed on the QOLS and a number of translations have been made. The QOLS has low to moderate correlations with physical health status and disease measures. However, content validity analysis indicates that the instrument measures domains that diverse patient groups with chronic illness define as quality of life. The QOLS is a valid instrument for measuring quality of life across patient groups and cultures and is conceptually distinct from health status or other causal indicators of quality of life.
Kern, Jeffrey M.; MacDonald, Marian L.
The reliability and meaning of assertiveness tests were explored using 120 female undergraduates. Several self-report inventories (the College Self-Expression Scale, Conflict Resolution Inventory, and a global rating from one to seven) were administered, as were three anxiety measures (Timed Behavior Checklist, response latency, and response…
Eto, Joseph H. [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Energy Technologies Division; Lewis, Nancy Jo [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Energy Technologies Division; Watson, David [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Energy Technologies Division; Kiliccote, Sila [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Energy Technologies Division; Auslander, David [Univ. of California, Berkeley, CA (United States); Paprotny, Igor [Univ. of California, Berkeley, CA (United States); Makarov, Yuri [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
The Demand Response as a System Reliability Resource project consists of six technical tasks: • Task 2.1. Test Plan and Conduct Tests: Contingency Reserves Demand Response (DR) Demonstration—a pioneering demonstration of how existing utility load-management assets can provide an important electricity system reliability resource known as contingency reserve. • Task 2.2. Participation in Electric Power Research Institute (EPRI) IntelliGrid—technical assistance to the EPRI IntelliGrid team in developing use cases and other high-level requirements for the architecture. • Task 2.3. Research, Development, and Demonstration (RD&D) Planning for Demand Response Technology Development—technical support to the Public Interest Energy Research (PIER) Program on five topics: Sub-task 1. PIER Smart Grid RD&D Planning Document; Sub-task 2. System Dynamics of Programmable Controllable Thermostats; Sub-task 3. California Independent System Operator (California ISO) DR Use Cases; Sub-task 4. California ISO Telemetry Requirements; and Sub-task 5. Design of a Building Load Data Storage Platform. • Task 2.4. Time Value of Demand Response—research that will enable California ISO to take better account of the speed of the resources that it deploys to ensure compliance with reliability rules for frequency control. • Task 2.5. System Integration and Market Research: Southern California Edison (SCE)—research and technical support for efforts led by SCE to conduct demand response pilot demonstrations to provide a contingency reserve service (known as non-spinning reserve) through a targeted sub-population of aggregated residential and small commercial customers enrolled in SCE’s traditional air conditioning (AC) load cycling program, the Summer Discount Plan. • Task 2.6. Demonstrate Demand Response Technologies: Pacific Gas and Electric (PG&E)—research and technical support for efforts led by PG&E to conduct a demand response pilot demonstration to provide non
Condon, David; Revelle, William
Separating the signal in a test from the irrelevant noise is a challenge for all measurement. Low test reliability limits test validity, attenuates important relationships, and can lead to regression artifacts. Multiple approaches to the assessment and improvement of reliability are discussed. The advantages and disadvantages of several different approaches to reliability are considered. Practical advice on how to assess reliability using open source software is provided.
Full Text Available Objective: The Temperament and Character Inventory (TCI was developed to assess temperament including Novelty Seeking (NS, Harm Avoidance (HA, Reward Dependence (RD, Persistence (PS, and Character including Self-Directedness (SD, Cooperativeness (CO and Self Transcendence (ST dimensions of Cloninger's biopsychosocial model of personality in adults. The purpose of this study was to evaluate the reliability and validity of this inventory. Materials & Methods: In this validity test and standardization study, after translation of TCI into Farsi and back translation, the final form was prepared and administered to 220 students who were selected via simple sampling. Cronbach's alpha procedure and test-retest method was used to assess the reliability, and factor analysis of promax rotation was utilized to determine the validity of the inventory. Correlation of interscales and age with scales of TCI was calculated by Pearson correlation. A comparison of TCI scores between sex and also cross-cultural was down using independent t-test. Results: The alpha cofficients for the inventory ranged from 0.44 for the Persistence scale to 0.81 for the ST scale with a median 0f 0.68. The overall alpha cofficients for the whole inventory was 0.74. The Pearson correlation cofficient for the test-retest on 31 students after two months ranged from 0.53 for Novelty Seeking and Persistence to 0.82 for Harm Avoidance scales and from 0.24 for disorderliness vs regimentation (NS4 to 0.86 for fear of uncertainty vs self-confidene (HA2 subscales. The factor analysis showed six factors. Significant correlations were obtained between scales of Self–Directedness with Harm Avoidance (0.57, Self–Directedness with Cooperativeness (0.46. Conclusion: The current study confirms that Persian version of the Temperament and Character Inventory has satisfactory psychometric properties and acceptable reliability and validity for the use students of university population.
Full Text Available John C Sieverdes,1 Eric E Wickel,2 Gregory A Hand,3 Marco Bergamin,4 Robert R Moran,5 Steven N Blair3,51Medical University of South Carolina, College of Nursing and Medicine, Charleson, SC, 2University of Tulsa, Exercise and Sport Science, Tulsa, OK, 3University of South Carolina, Department of Exercise Science, Division of Health Aspects of Physical Activity, Arnold School of Public Health, Columbia, SC, USA; 4University of Padova, Department of Medicine, Sports Medicine Division, Padova, Italy; 5University of South Carolina, Department of Epidemiology and Biostatistics, Arnold School of Public Health, Columbia, SC, USABackground: This study evaluated the reliability and criterion validity of the Mywellness Key accelerometer (MWK using treadmill protocols and indirect calorimetry.Methods: Twenty-five participants completed two four-stage 20-minute treadmill protocols while wearing two MWK accelerometers. Reliability was assessed using raw counts. Validity was assessed by comparing the estimated VO2 calculated from the MWK with values from respiratory gas exchange.Results: Good overall and point estimates of reliability were found for the MWK (all intraclass correlations > 0.93. Generalizability theory coefficients showed lower values for running speed (0.70 versus walking speed (all > 0.84, with the majority of the overall percentage of variability derived from the participant (68%–88% of the total 100%. Acceptable validity was found overall (Pearson’s r = 0.895–0.902, P < 0.0001, with an overall mean absolute error of 16.22% and a coefficient of variance of 16.92%. Bland-Altman plots showed an overestimation of energy expenditure during the running speed, but total kilocalories were underestimated during the protocol by approximately 10%.Conclusion: Good validity was found during light and moderate walking, while running was slightly overestimated. The MWK may be useful for clinicians and researchers interested in promotion or assessment
Full Text Available An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high.The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed questionnaire for monitoring and evaluating population sun-related behavior.Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly in a questionnaire adapted to measure behavior throughout the summer.The response rates for questionnaire 1, 2 and 3 were high and the drop out was not dependent on demographic characteristic. There was at least 73% agreement between sunburns in the measurement week and the entire summer, and a possible sunburn underestimation in questionnaires summarizing the entire summer. The participants underestimated their outdoor exposure in the evaluation covering the entire summer as compared to the measurement week. The reliability of scales measuring attitude and knowledge was high for majority of scales, while consistency in protection behavior was low.To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability, while consistency of protection behavior in general and in a week's measurement was low. Keywords: Questionnaire, Validation, Reliability, Skin cancer, Prevention, Ultraviolet radiation
Conclusion: The APA shows good internal reliability, test–retest reliability, discriminant validity, and construct validity. However, evidence of psychometric properties was limited by a small sample size. Psychometric properties such as interrater reliability as well as concurrent validity and construct validity need to be tested using a larger sample size with representative demographics.
Sayed Hadi Sayed Alitabar
Full Text Available Background and Objective: According to the increasing of substance use in the country, more researches about this phenomenon are necessary. This Study Investigates the Validity, Reliability and Confirmatory Factor Structure of the Drug Abuse Screening test (DAST. Materials and Methods: The Sample Consisted of 381 Patients (143 Women and 238 Men with a Multi-Stage Cluster Sampling of Areas 2, 6 and 12 of Tehran Were Selected from Each Region, 6 Randomly Selected Drug Rehabilitation Center. The DAST Was Used as Instrument. Divergent & Convergent Validity of this Scale Was Assessed with Problems Assessment for Substance Using Psychiatric Patients (PASUPP and Relapse Prediction Scale (RPS.Results: The DAST after the First Time Factor Structure of Using Confirmatory Factor Analysis Was Confirmed. The DAST Had a Good Internal Consistency (Cranach’s Alpha, and the Reliability of the Test Within a Week, 0.9, 0.8. Also this Scale Had a Positive Correlation with Problems Assessment for Substance Using Psychiatric Patients and Relapse Prediction Scale (P<0.01.Conclusion: The Overall Results Showed that the Drug Abuse Screening Test in Iranian Society Is Valid. It Can Be Said that Self-Report Scale Tool Is Useful for Research Purposes and Addiction.
Haggerty, Greg; Zodan, Jennifer; Mehra, Ashwin; Zubair, Ayyan; Ghosh, Krishnendu; Siefert, Caleb J; Sinclair, Samuel J; DeFife, Jared
The current study investigated the interrater reliability and validity of prototype ratings of 5 common adolescent psychiatric disorders: attention-deficit/hyperactivity disorder, conduct disorder, major depressive disorder, generalized anxiety disorder, and posttraumatic stress disorder. One hundred fifty-seven adolescent inpatient participants consented to participate in this study. We compared ratings from 2 inpatient clinicians, blinded to each other's ratings and patient measures, after their separate initial diagnostic interview to assess interrater reliability. Prototype ratings completed by clinicians after their initial diagnostic interview with adolescent inpatients and outpatients were compared with patient-reported behavior problems and parents' report of their child's behavioral problems. Prototype ratings demonstrated good interrater reliability. Clinicians' prototype ratings showed predicted relationships with patient-reported behavior problems and parent-reported behavior problems. Prototype matching seems to be a possible alternative for psychiatric diagnosis. Prototype ratings showed good interrater reliability based on clinicians unique experiences with the patient (as opposed to video-/audio-recorded material) with no training.
Full Text Available The aim of this research is to adapt the Workplace Bullying Scale (Tınaz, Gök & Karatuna, 2013 to Albanian language and to examine its psychometric properties. The research was conducted on 386 person from different sectors of Albania. Results of exploratory and confirmatory factor analysis demonstrated that Albanian scale yielded 2 factors different from original form because of cultural differences. Internal consistency coefficients are,890 -,801 and split-half test reliability coefficients, 864 -,808. Comfirmatory Factor Analysis results change from,40 to,73. Corrected item-total correlations ranged,339 to,672 and according to t-test results differences between each item’s means of upper 27% and lower 27% points were significant. Thus Workplace Bullying Scale can be use as a valid and reliable instrument in social sciences in Albania.
Wikstrom, Erik A.
Context: Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. Objective: To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Design: Descriptive laboratory study. Setting: Sports medicine research laboratory. Patients or Other Participants: Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Intervention(s): Participants completed a single-limb–stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Main Outcome Measure(s): Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. Results: All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with
Wikstrom, Erik A
Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Descriptive laboratory study. Sports medicine research laboratory. Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Participants completed a single-limb-stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with scores ranging from fair (ICC = 0.74) to poor (ICC = 0.29). Wii Fit balance activity scores had poor concurrent validity relative to COP outcomes and SEBT
Evenson Kelly R
Full Text Available Abstract Background The purpose of this study is to assess the reliability and validity of the U.S. National Center for Safe Routes to School's in-class student travel tallies and written parent surveys. Over 65,000 tallies and 374,000 parent surveys have been completed, but no published studies have examined their measurement properties. Methods Students and parents from two Charlotte, NC (USA elementary schools participated. Tallies were conducted on two consecutive days using a hand-raising protocol; on day two students were also asked to recall the previous days' travel. The recall from day two was compared with day one to assess 24-hour test-retest reliability. Convergent validity was assessed by comparing parent-reports of students' travel mode with student-reports of travel mode. Two-week test-retest reliability of the parent survey was assessed by comparing within-parent responses. Reliability and validity were assessed using kappa statistics. Results A total of 542 students participated in the in-class student travel tally reliability assessment and 262 parent-student dyads participated in the validity assessment. Reliability was high for travel to and from school (kappa > 0.8; convergent validity was lower but still high (kappa > 0.75. There were no differences by student grade level. Two-week test-retest reliability of the parent survey (n = 112 ranged from moderate to very high for objective questions on travel mode and travel times (kappa range: 0.62 - 0.97 but was substantially lower for subjective assessments of barriers to walking to school (kappa range: 0.31 - 0.76. Conclusions The student in-class student travel tally exhibited high reliability and validity at all elementary grades. The parent survey had high reliability on questions related to student travel mode, but lower reliability for attitudinal questions identifying barriers to walking to school. Parent survey design should be improved so that responses clearly indicate
McDonald, Noreen C; Dwelley, Amanda E; Combs, Tabitha S; Evenson, Kelly R; Winters, Richard H
The purpose of this study is to assess the reliability and validity of the U.S. National Center for Safe Routes to School's in-class student travel tallies and written parent surveys. Over 65,000 tallies and 374,000 parent surveys have been completed, but no published studies have examined their measurement properties. Students and parents from two Charlotte, NC (USA) elementary schools participated. Tallies were conducted on two consecutive days using a hand-raising protocol; on day two students were also asked to recall the previous days' travel. The recall from day two was compared with day one to assess 24-hour test-retest reliability. Convergent validity was assessed by comparing parent-reports of students' travel mode with student-reports of travel mode. Two-week test-retest reliability of the parent survey was assessed by comparing within-parent responses. Reliability and validity were assessed using kappa statistics. A total of 542 students participated in the in-class student travel tally reliability assessment and 262 parent-student dyads participated in the validity assessment. Reliability was high for travel to and from school (kappa > 0.8); convergent validity was lower but still high (kappa > 0.75). There were no differences by student grade level. Two-week test-retest reliability of the parent survey (n=112) ranged from moderate to very high for objective questions on travel mode and travel times (kappa range: 0.62-0.97) but was substantially lower for subjective assessments of barriers to walking to school (kappa range: 0.31-0.76). The student in-class student travel tally exhibited high reliability and validity at all elementary grades. The parent survey had high reliability on questions related to student travel mode, but lower reliability for attitudinal questions identifying barriers to walking to school. Parent survey design should be improved so that responses clearly indicate issues that influence parental decision making in regards to their
Monbaliu, Elegast; Ortibus, Els; Roelens, F; Desloovere, Kaat; Declerck, Jan; Prinzie, Peter; De Cock, Paul; Feys, Hilde
AIM: This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). METHOD: Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System a...
ATEŞ, Hatice KADIOĞLU; ADA, Sefer; BAYSAL, Z. Nurdan
Abstract The aim of this study is to develop visual presentation attitude rubric which is valid and reliable for the 4th grade students. 218 students took part in this study from Engin Can Güre which located in Istanbul, Esenler. While preparing this assessment tool with 34 criterias , 6 university lecturers view have been taken who are experts in their field. The answer key sheet has 4 (likert )type options. The rubric has been first tested by Kaiser-Meyer Olkin and Bartletts tests an...
Kurita, H; Miyake, Y
The Tokyo Autistic Behavior Scale (TABS) consisting of 39 items provisionally grouped in four areas--interpersonal-social relationship, language-communication, habit-mannerism and others--is an instrument used by a child's caretaker to rate the child's autistic behaviors on a 3-point scale. Test-retest reliability was satisfactory (i.e., an r for a total score was .94). Among six DSM-III diagnostic groups, infantile autism showed a significantly higher total TABS score than the other five groups, and a taxonomic validity coefficient was .54. An r between total scores of the TABS and the Childhood Autism Rating Scale--Tokyo Version was .59. The area scores showed a lower validity than the total score. The TABS appears to be a useful instrument to assess autistic behavior.
They completed this 15 item self-rated instrument that assesses patient satisfaction with services using a 5 point response format. Results:The internal consistency for the scale was high ( a=0.91), and item total correlations ranged between 0.33 to 0.70. Its convergent validity was supported by significant correlations of all ...
McEwan, Troy E; Shea, Daniel E; Daffern, Michael; MacKenzie, Rachel D; Ogloff, James R P; Mullen, Paul E
This study assessed the reliability and validity of the Stalking Risk Profile (SRP), a structured measure for assessing stalking risks. The SRP was administered at the point of assessment or retrospectively from file review for 241 adult stalkers (91% male) referred to a community-based forensic mental health service. Interrater reliability was high for stalker type, and moderate-to-substantial for risk judgments and domain scores. Evidence for predictive validity and discrimination between stalking recidivists and nonrecidivists for risk judgments depended on follow-up duration. Discrimination was moderate (area under the curve = 0.66-0.68) and positive and negative predictive values good over the full follow-up period ( Mdn = 170.43 weeks). At 6 months, discrimination was better than chance only for judgments related to stalking of new victims (area under the curve = 0.75); however, high-risk stalkers still reoffended against their original victim(s) 2 to 4 times as often as low-risk stalkers. Implications for the clinical utility and refinement of the SRP are discussed.
Maffini, Cara S; Wong, Y Joel
Although measures of cultural identity, values, and behavior exist in the multicultural psychological literature, there is currently no measure that explicitly assesses ethnic minority individuals' positive and negative affect toward culture. Therefore, we developed 2 new measures called the Feelings About Culture Scale--Ethnic Culture and Feelings About Culture Scale--Mainstream American Culture and tested their psychometric properties. In 6 studies, we piloted the measures, conducted factor analyses to clarify their factor structure, and examined reliability and validity. The factor structure revealed 2 dimensions reflecting positive and negative affect for each measure. Results provided evidence for convergent, discriminant, criterion-related, and incremental validity as well as the reliability of the scales. The Feelings About Culture Scales are the first known measures to examine both positive and negative affect toward an individual's ethnic culture and mainstream American culture. The focus on affect captures dimensions of psychological experiences that differ from cognitive and behavioral constructs often used to measure cultural orientation. These measures can serve as a valuable contribution to both research and counseling by providing insight into the nuanced affective experiences ethnic minority individuals have toward culture. (c) 2015 APA, all rights reserved).
Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U
The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p squat and FHS test performances (r squat and FHS test performance can be attributed to differences in the movement patterns of the tests
Valente, Ana Rita S; Hall, Andreia; Alvelos, Helena; Leahy, Margaret; Jesus, Luis M T
The appropriate use of language in context depends on the speaker's pragmatic language competencies. A coding system was used to develop a specific and adult-focused self-administered questionnaire to adults who stutter and adults who do not stutter, The Assessment of Language Use in Social Contexts for Adults, with three categories: precursors, basic exchanges, and extended literal/non-literal discourse. This paper presents the content validity, item analysis, reliability coefficients and evidences of construct validity of the instrument. Content validity analysis was based on a two-stage process: first, 11 pragmatic questionnaires were assessed to identify items that probe each pragmatic competency and to create the first version of the instrument; second, items were assessed qualitatively by an expert panel composed by adults who stutter and controls, and quantitatively and qualitatively by an expert panel composed by clinicians. A pilot study was conducted with five adults who stutter and five controls to analyse items and calculate reliability. Construct validity evidences were obtained using the hypothesized relationships method and factor analysis with 28 adults who stutter and 28 controls. Concerning content validity, the questionnaires assessed up to 13 pragmatic competencies. Qualitative and quantitative analysis revealed ambiguities in items construction. Disagreement between experts was solved through item modification. The pilot study showed that the instrument presented internal consistency and temporal stability. Significant differences between adults who stutter and controls and different response profiles revealed the instrument's underlying construct. The instrument is reliable and presented evidences of construct validity.
Michael J. Lasee
Full Text Available Continuous Performance Tests (CPTs are commonly utilized clinical measures of attention and response inhibition. While there have been many studies of CPTs that utilize a visual format, there is considerably less research employing auditory CPTs. The current study provides initial reliability and validity evidence for the Auditory Vigilance Screening Measure (AVSM, a newly developed CPT. Participants included 105 five- to nine-year-old children selected from two rural Midwestern school districts. Reliability data for the AVSM was collected through retesting of 42 participants. Validity was evaluated through correlation of AVSM scales with subscales from the ADHD Rating Scale–IV. Test–retest reliability coefficients ranged from .62 to .74 for AVSM subscales. A significant (r = .31 correlation was obtained between the AVSM Impulsivity Scale and teacher ratings of inattention. Limitations and implications for future study are discussed.
Marc, Linda G; Henderson, Whitney R; Desrosiers, Astrid; Testa, Marcia A; Jean, Samuel E; Akom, Eniko Edit
There is limited information on depression in Haitians and this is partly attributable to the absence of culturally and linguistically adapted measures for depression. To perform a psychometric evaluation of the Haitian-Creole version of the PHQ-9 administered to men who have sex with men (MSM) in the Republic of Haiti. This study uses a cross-sectional design and data are from the Integrated Behavioral and Biological HIV Survey (IBBS) for MSM in Haiti. Inclusion criteria required that participants be male, ≥ 18 years, report sexual relations with a male partner in the last 12 months, and lived in Haiti during the past 3 months. Respondent Driven Sampling was used for participant recruitment. A structured questionnaire was verbally administered in Haitian-Creole capturing information on sociodemographics, sexual behaviors, human immunodeficiency virus (HIV) status and depressive symptomatology using the PHQ-9. Psychometric analyses of the translated PHQ-9 assessed unidimensionality, factor structure, reliability, construct validity, and differential item functioning (DIF) across subgroups (age, educational level, sexual orientation and HIV status). In a study population of 1,028 MSM, the Haitian-Creole version of the PHQ-9 is unidimensional, has moderately high internal consistency reliability (α = 0.78), and shows evidence of construct validity where HIV-positive subjects have greater depression (p = 0.002). There is no evidence of DIF across age, education, sexual orientation or HIV status. HIV-positive MSM are twice as likely to screen positive for moderately severe and severe depressive symptoms compared to their HIV-negative counterparts. There is strong evidence for the psychometric adequacy of the translated PHQ-9 screening tool as a measure of depression with MSM in Haiti. Future research is necessary to examine the predictive validity of depression for subsequent health behaviors or clinical outcomes among Haitian MSM.
Full Text Available Objectives: The aim of this study was to evaluate the psychometric features of the Persian version of the Autism Behavior Checklist (ABC. Method:The International Quality of Life Assessment (IQOLA approach was used to translate the English ABC into Persian. A total sample of 184 parents of children including 114 children with autism disorder (mean age =7.21, SD =1.65 and 70 typically developing children (mean age = 6.82, SD =1.75 completed the ABC. Internal consistency, test-retest reliability, concurrent and discriminant validity, and cut-off score were assessed. Results: The results of this study revealed that the Persian version of the ABC has an acceptable degree of internal consistency (.73. Test–retest comparisons using interclass correlation confirmed the instrument’s time stability (.83. The instrument’s concurrent validity with Gilliam Autism Rating Scale (GARS was verified; the correlation between total scores was .94. In the discriminant validity, the autism group had significantly higher scores compared to the normal group. Receiver Operating Characteristic (ROC analysis revealed that individuals with total scores below 25 are less likely to be in the autism group. Conclusion:The Persian version of the ABC can be used as an initial screening tool in clinical contexts.
In educational research that calls itself empirical, the relationship between validity and reliability is that of trade-off: the stronger the bases for validity, the weaker the bases for reliability (and vice versa). Validity and reliability are widely regarded as basic criteria for evaluating research; however, there are ethical implications of…
Burcu Ersöz Hüseyinsinoğlu
Full Text Available OBJECTIVE: The aim of this study was to adapt the Motor Activity Log-28 (MAL-28 into Turkish and probe the reliability and validity of this questionnaire in stroke patients. METHODS: Following the translation of the MAL-28 into Turkish, its reliability and construct validity was examined in 30 stroke patients. For the reliability study, patients were interviewed twice within a three day period, during which no rehabilitative activities were undertaken. The test-retest reliability was determined by using intra-class correlation coefficient (ICC and Spearman correlation coefficient (r; internal consistency was determined by Cronbach's alpha (α. The construct validity was examined by comparing MAL-28 Quality Of Movement (QOM scale and Amount Of Use (AOU scale with Wolf Motor Function Test (WMFT-Performance Time (PT and Functional Ability (FA scores. Furthermore, item-to-scale correlations of AOU and QOM scales were determined and correlation between totol scores of two scales was examined. RESULTS: Turkish version of MAL-28 AOU and QOM scales were reliable (ICC scores were 0.97 and 0.96, respectively and internally consistent (Cronbach’s α value was 0.96 for both scales. Test-retest reliability was supported (AOU, r=0.94; QOM, r=0.93. WMFT FA scores was correlated with both scales (r=0.63. Correlation between WMFT PT and AOU and QOM scales were -0.56 and -0.55. AOU and QOM scales were highly correlated (r=0.95. CONCLUSION: The findings indicate that Turkish version of MAL-28 is reliable and valid in individuals with stroke. Further investigation about its responsiveness is needed before using that version as a primary measurement in clinical trials
Erford, Bradley T.; Alsamadi, Silvana C.
Score reliability and validity of parent responses concerning their 10- to 17-year-old students were analyzed using the Screening Test for Emotional Problems-Parent Report (STEP-P), which assesses a variety of emotional problems classified under the Individuals with Disabilities Education Improvement Act. Score reliability, convergent, and…
Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje
This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters. Key points Study evaluated the reliability and construct validity of a newly developed inline skating skill test. Evaluated test is a first protocol designed to assess specific inline skating skill. Two groups of amateur skaters with different skating proficiency repeated the skill test in four separate occasions. The results suggest that evaluated test is reliable and valid to evaluate inline skating skill in amateur skaters. PMID:27803616
Conclusion: Only seven studies calculated validity coefficients within the study whereas 47 cited the validity coefficient. Twenty-six calculated a reliability coefficient whereas 47 cited the reliability of the ED measures. Four studies found validity evidence for the EAT, EDI, BULIT-R, QEDD, and EDE-Q in an athlete population. Few studies reviewed calculated validity and reliability coefficients of ED measures. Cross-validation of these measures in athlete populations is clearly needed.
Casey M. Hearing
Full Text Available Objectives: The aim of this study was to devise a reliable and valid survey to predict the intensity of someone’s gag reflex. Material and Methods: A 10-question Predictive Gagging Survey was created, refined, and tested on 59 undergraduate participants. The questions focused on risk factors and experiences that would indicate the presence and strength of someone’s gag reflex. Reliability was assessed by administering the survey to a group of 17 participants twice, with 3 weeks separating the two administrations. Finally, the survey was given to 25 dental patients. In these cases, patients completed an informed consent form, filled out the survey, and then had a maxillary impression taken while their gagging response was quantified from 1 to 5 on the Fiske and Dickinson Gagging Intensity Index. Results: There was a moderate positive correlation between the Predictive Gagging Survey and Fiske and Dickinson’s Gagging Severity Index, r = +0.64, demonstrating the survey’s validity. Furthermore, the test-retest reliability was r = +0.96, demonstrating the survey’s reliability. Conclusions: The Predictive Gagging Survey is a 10-question survey about gag-related experiences and behaviours. We established that it is a reliable and valid method to assess the strength of someone’s gag reflex.
DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M
The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no
Cypress, Brigitte S
Issues are still raised even now in the 21st century by the persistent concern with achieving rigor in qualitative research. There is also a continuing debate about the analogous terms reliability and validity in naturalistic inquiries as opposed to quantitative investigations. This article presents the concept of rigor in qualitative research using a phenomenological study as an exemplar to further illustrate the process. Elaborating on epistemological and theoretical conceptualizations by Lincoln and Guba, strategies congruent with qualitative perspective for ensuring validity to establish the credibility of the study are described. A synthesis of the historical development of validity criteria evident in the literature during the years is explored. Recommendations are made for use of the term rigor instead of trustworthiness and the reconceptualization and renewed use of the concept of reliability and validity in qualitative research, that strategies for ensuring rigor must be built into the qualitative research process rather than evaluated only after the inquiry, and that qualitative researchers and students alike must be proactive and take responsibility in ensuring the rigor of a research study. The insights garnered here will move novice researchers and doctoral students to a better conceptual grasp of the complexity of reliability and validity and its ramifications for qualitative inquiry.
Tangney, June Price; Stuewig, Jeffrey; Furukawa, Emi; Kopelovich, Sarah; Meyer, Patrick; Cosby, Brandon
Theory, research, and clinical reports suggest that moral cognitions play a role in initiating and sustaining criminal behavior. The 25 item Criminogenic Cognitions Scale (CCS) was designed to tap 5 dimensions: Notions of entitlement; Failure to Accept Responsibility; Short-Term Orientation; Insensitivity to Impact of Crime; and Negative Attitudes Toward Authority. Results from 552 jail inmates support the reliability, validity, and predictive utility of the measure. The CCS was linked to cri...
Granier, Cyril; Hausswirth, Christophe; Dorel, Sylvain; Yann, Le Meur
This study aimed to determine the validity and the reliability of the Stages power meter crank system (Boulder, United States) during several laboratory cycling tasks. Eleven trained participants completed laboratory cycling trials on an indoor cycle fitted with SRM Professional and Stages systems. The trials consisted of an incremental test at 100W, 200W, 300W, 400W and four 7s sprints. The level of pedaling asymmetry was determined for each cycling intensity during a similar protocol completed on a Lode Excalibur Sport ergometer. The reliability of Stages and SRM power meters was compared by repeating the incremental test during a test-retest protocol on a Cyclus 2 ergometer. Over power ranges of 100-1250W the Stages system produced trivial to small differences compared to the SRM (standardized typical error values of 0.06, 0.24 and 0.08 for the incremental, sprint and combined trials, respectively). A large correlation was reported between the difference in power output (PO) between the two systems and the level of pedaling asymmetry (r=0.58, p system according to the level of pedaling asymmetry provided only marginal improvements in PO measures. The reliability of the Stages power meter at the sub-maximal intensities was similar to the SRM Professional model (coefficient of variation: 2.1 and 1.3% for Stages and SRM, respectively). The Stages system is a suitable device for PO measurements, except when a typical error of measurement power ranges of 100-1250W is expected.
This study describes the development and evaluation of the Nursery Teacher's Stress Scale (NTSS), which explores the relation between daily hassles at work and work-related stress. In Analysis 1, 29 items were chosen to construct the NTSS. Six factors were identified: I. Stress relating to child care; II. Stress from human relations at work; III. Stress from staff-parent relations; IV. Stress from lack of time; V. Stress relating to compensation; and VI. Stress from the difference between individual beliefs and school policy. All these factors had high degrees of internal consistency. In Analysis 2, the concurrent validity of the NTSS was examined. The results showed that the NTSS total scores were significantly correlated with the Job Stress Scale-Revised Version (job stressor scale, r = .68), the Pre-school Teacher-efficacy Scale (r = -.21), and the WHO-five Well-Being Index Japanese Version (r = -.40). Work stresses are affected by several daily hassles at work. The NTSS has acceptable reliability and validity, and can be used to improve nursery teacher's mental health.
Coolidge, T; Heima, M; Heaton, L J; Nakai, Y; Höskuldsson, O; Smith, T A; Weinstein, P; Milgrom, P
The Child Dental Control Assessment (CDCA) measures children's preferred control strategies in the dental situation. Three studies are reported, assessing aspects of this instrument in youths from the USA, Japan and Australia. In particular, measurements were made as to the reliability and validity of this instrument in this age group in the three cultures, as well as comparing some results across cultures. These studies used a questionnaire design. Questionnaires (including the CDCA and other measures) were given to youths aged 11-15 in the three cultures. In one culture, youths received the questionnaire twice, to compute test-retest reliability. The measure's reliability and validity were similar to those of other measures. The CDCA behaves similarly to the Revised Iowa Dental Control Index (R-IDCI). Youths in all three cultures showed similar responses, although the Japanese were less likely to endorse items. Internal reliability of the scale ranged from 0.74 to 0.85. Test- retest reliability was 0.74. Participants in the High Desire/Low Predicted classification on the R-IDCI scored higher on the CDCA (t (73) = 2.9, p < .01). In the Japanese and Australian samples the correlation between CDCA and dental fear was 0.29-0.33 (p < .001). The Australian and USA samples scored significantly higher than the Japanese sample (overall F(2,1544) = 383.98, p < .001, followed by Tukey's HSD, p < .001). These results provide evidence for the reliability and validity of the CDCA in youth. It appears to measure the discrepancy between Desired and Predicted Control identified in the Revised Iowa Dental Control Index (R-IDCI). Responses of the youth in all three cultures were similar, indicating common dental control preferences for individuals of this age. However, consistent with cultural values, Japanese youth were less likely to endorse the control strategies. These results underline the need to develop culturally-specific, as well as situationally-specific control measures.
Bohannon, Richard W; Steffl, Michal; Glenney, Susan S; Green, Michelle; Cashwell, Leah; Prajerova, Kveta; Bunn, Jennifer
The prone bridge maneuver, or plank, has been viewed as a potential alternative to curl-ups for assessing trunk muscle performance. The purpose of this study was to assess prone bridge test performance, validity, and reliability among younger and older adults. Sixty younger (20-35 years old) and 60 older (60-79 years old) participants completed this study. Groups were evenly divided by sex. Participants completed surveys regarding physical activity and abdominal exercise participation. Height, weight, body mass index (BMI), and waist circumference were measured. On two occasions, 5-9 days apart, participants held a prone bridge until volitional exhaustion or until repeated technique failure. Validity was examined using data from the first session: convergent validity by calculating correlations between survey responses, anthropometrics, and prone bridge time, known groups validity by using an ANOVA comparing bridge times of younger and older adults and of men and women. Test-retest reliability was examined by using a paired t-test to compare prone bridge times for Session1 and Session 2. Furthermore, an intraclass correlation coefficient (ICC) was used to characterize relative reliability and minimal detectable change (MDC 95% ) was used to describe absolute reliability. The mean prone bridge time was 145.3 ± 71.5 s, and was positively correlated with physical activity participation (p ≤ 0.001) and negatively correlated with BMI and waist circumference (p ≤ 0.003). Younger participants had significantly longer plank times than older participants (p = 0.003). The ICC between testing sessions was 0.915. The prone bridge test is a valid and reliable measure for evaluating abdominal performance in both younger and older adults. Copyright © 2017 Elsevier Ltd. All rights reserved.
Gusi, N; Perez-Sousa, M A; Gozalo-Delgado, M; Olivares, P R
A proxy version of the EQ-5D-Y, a questionnaire to evaluate the Health Related Quality of Life (HRQoL) in children and adolescents, has recently been developed. There are currently no data on the validity and reliability of this tool. The objective of this study was to analyze the validity and reliability of the EQ-5D-Y proxy version. A core set of self-report tools, including the Spanish version of the EQ-5D-Y were administered to a group of Spanish children and adolescents drawn from the general population. A similar core set of internationally standardized proxy tools, including the EQ-5D-Y proxy version were administered to their parents. Test-retest reliability was determined, and correlations with other generic measurements of HRQoL were calculated. Additionally, known group validity was examined by comparing groups with a priori expected differences in HRQoL. The agreement between the self-report and proxy version responses was also calculated. A total of 477 children and adolescents and their parents participated in the study. One week later, 158 participants completed the EQ-5D-Y/EQ-5D-Y proxy to facilitate reliability analysis. Agreement between the test-retest scores was higher than 88% for EQ-5D-Y self-report, and proxy version. Correlations with other health measurements showed similar convergent validity to that observed in the international EQ-5D-Y. Agreement between the self-report and proxy versions ranged from 72.9% to 97.1%. The results provide preliminary evidence of the reliability and validity of the EQ-5D-Y proxy version. Copyright © 2013 Asociación Española de Pediatría. Published by Elsevier Espana. All rights reserved.
Togari, Taisuke; Yamazaki, Yoshihiko; Koide, Syotaro; Miyata, Ayako
In community and workplace health plans, the Perceived Health Competence Scale (PHCS) is employed as an index of health competency. The purpose of this research was to examine the reliability and validity of a modified Japanese PHCS. Interviews were sought with 3,000 randomly selected Japanese individuals using a two-step stratified method. Valid PHCS responses were obtained from 1,910 individuals, yielding a 63.7% response rate. Reliability was assessed using Cronbach's alpha coefficient (henceforth, alpha) to evaluate internal consistency, and by employing item-total correlation and alpha coefficient analyses to assess the effect of removal of variables from the model. To examine content validity, we assessed the correlation between the PHCS score and four respondent attribute characteristics, that is, sex, age, the presence of chronic disease, and the existence of chronic disease at age 18. The correlation between PHCS score and commonly employed healthy lifestyle indices was examined to assess construct validity. General linear model statistical analysis was employed. The modified Japanese PHCS demonstrated a satisfactory alpha coefficient of 0.869. Moreover, reliability was confirmed by item-total correlation and alpha coefficient analyses after removal of variables from the model. Differences in PHCS scores were seen between individuals 60 years and older, and younger individuals. These with current chronic disease, or who had had a chronic disease at age 18, tended to have lower PHCS scores. After controlling for the presence of current or age 18 chronic disease, age, and sex, significant correlations were seen between PHCS scores and tobacco use, dietary habits, and exercise, but not alcohol use or frequency of medical consultation. This study supports the reliability and validity, and hence supports the use, of the modified Japanese PHCS. Future longitudinal research is needed to evaluate the predictive power of modified Japanese PHCS scores, to examine
Nicholas A. Petrunoff
Full Text Available Background. The purpose of this study was to assess the (previously untested reliability and validity of survey questions commonly used to assess travel mode and travel time. Methods. Sixty-five respondents from a staff survey of travel behaviour conducted in a south-western Sydney hospital agreed to complete a travel diary for a week, wear an accelerometer over the same period, and twice complete an online travel survey an average of 21 days apart. The agreement in travel modes between the self-reported online survey and travel diary was examined with the kappa statistic. Spearman’s correlation coefficient was used to examine agreement of travel time from home to workplace measured between the self-reported online survey and four-day travel diary. Moderate-to-vigorous physical activity (MVPA time of active and nonactive travellers was compared by t-test. Results. There was substantial agreement between travel modes (K=0.62, P<0.0001 and a moderate correlation for travel time (ρ=0.75, P<0.0001 reported in the travel diary and online survey. There was a high level of agreement for travel mode (K=0.82, P<0.0001 and travel time (ρ=0.83, P<0.0001 between the two travel surveys. Accelerometer data indicated that for active travellers, 16% of the journey-to-work time is MVPA, compared with 6% for car drivers. Active travellers were significantly more active across the whole workday. Conclusions. The survey question “How did you travel to work this week? If you used more than one transport mode specify the one you used for the longest (distance portion of your journey” is reliable over 21 days and agrees well with a travel diary.
Full Text Available Backround and design. Internalized stigma involves endorsing negative feelings and beliefs such as insignificance, shame and withdrawal triggered by applying these negative stereotypes to one self. Internalized Stigma Scale has not been applied to psoriasis patients. We aimed to evaluate the reliability and validity of Internalized Stigma Scale in psoriasis patients. Materials and Methods. 100 consecutive, volunteer psoriasis patients (48 female, 52 male; aged, 40.59±15.44 years were enrolled in the study. PASI and BSA were evaluated by physician (A.B.. Patients responded contemporaneously to Psoriasis Internalized Stigma Scale (PISS, DQoL, and Perceived Health Status (PHS, single-item self-rated general health question, of which Likert scores 1, 2, and 3 were classified as “from fair to very poor”, and 4, 5 as “good”. Results. Cronbach's alpha coefficient of PISS subscales was 0.83 for alienation, 0.70 for stereotype endorsement, 0.70 for perceived discrimination, 0.84 for social withdrawal and 0.68 for stigma resistance. The same value was 0.89 for the total scale. PISS and DQoL scores mean values were 58.8±12.6 and 10.0±9.4, respectively. PISS was significantly correlated with the patients' DQoL scores (r=,726, p=0,001. PISS was also significantly correlated with disease duration (r=,209, p=0,047. There was no any significant relationship between PASI or BSA and PISS. Mean DQoL scores in patients reporting their PHS as “from fair to very poor” and “good” were 12.1±7.3 and 5.0±4.3, respectively. Mean values of PISS in patients reporting their PHS as “from fair to very poor” was significantly increased compared with patients reporting their PHS as “good” (p=0.001. Conclusion. PISS can be used as a reliable and valid tool in assesing internalized stigmatization in psoriasis patients. Our results indicate a high level of stigmatization in psoriasis patients. Low DQoL scores show a correlation with increased levels of
Ivan Radman, Lana Ruzic, Viktoria Padovan, Vjekoslav Cigrovski, Hrvoje Podnar
Full Text Available This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male were randomized into two groups (competitive level vs. recreational level. They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]. In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%] and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]. The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2 revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p < 0.01 to complete the test in comparison to recreational-level skaters. Moreover, moderate correlation (ρ = 0.80–0.82; all p < 0.01 was observed between the participant’s self-rating and achieved performance times. In conclusion, the proposed test is a reliable and valid method to evaluate inline skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
Full Text Available Background: The agreement of new instruments or clinical tests with other instruments or tests defines the possibility of these being used interchangeably. Aim: To investigate the validity and reliability of the SW-100 autokeratometer using a Bausch & Lomb (B&L keratometer as the ‘gold standard’. Methods: Eighty subjects (80 right eyes aged between 21 and 38 years were recruited. For intra-test repeatability, two measurements of the corneal radius of curvature were taken with the SW-100 and B&L keratometers. Forty of the 80 subjects participated in the inter-test repeatability measurement. Results: Corneal radius of curvature was found to be statistically different between the two instruments (p < 0.001, with the SW-100 providing slightly flatter values of 0.11 mm and 0.05 mm for the horizontal and vertical meridians, respectively, than the B&L keratometer. The average corneal curvature was 0.07 mm flatter with the SW-100 autokeratometer than with the B&L device. Agreement between the SW-100 and B&L keratometers’ axes was 45% within ± 5°, 60.3% within ± 10°, 78.8% within ± 15°, 80.3% within ± 20°, and 88.7% within ± 40°. Intertest repeatability was better for the B&L device than the SW-100 and showed no significant difference between the two sessions. Both instruments demonstrated comparable intrasession repeatability. As such, both instruments were comparatively reliable (per coefficients of repeatability. The range of limits of agreement of ± 0.14 mm (horizontal meridian and ± 0.17 mm (vertical meridian between the SW-100 and B&L devices showed good agreement. Conclusion: The results suggest that the SW-100 autokeratometer is a reliable and objective instrument that, however, provides flatter radii of curvature measurements than the B&L keratometer. A compensating factor incorporated into the instrument could reduce the difference between the two instruments and make them more interchangeable.
Britt Karin Støen Utvær
Full Text Available Self-determination theory (SDT distinguishes types of motivation according to types of self-regulation along a continuum of internalisation. Types of motivation vary in quality and outcomes and are frequently used in research as predictors of educational outcomes such as learning, performance, engagement, and persistence. The Academic Motivation Scale (AMS, which is based on the SDT, has not previously been evaluated in Norway. In response, by using correlation and confirmatory factor analysis, we examined the dimensionality, reliability, and construct validity of the AMS among vocational health and social care students. Our hypothesised 7-factor model demonstrated the best fit, while the AMS demonstrated good reliability and construct validity in the sample of students. However, some improvements remain necessary. In predicting the rate of school completion among students on vocational tracks, amotivation and identified regulation appeared to be more powerful as intrinsic motivational variables.
Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P
The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
Susan J Bartlett
Full Text Available To evaluate the reliability and validity of 11 PROMIS measures to assess symptoms and impacts identified as important by people with rheumatoid arthritis (RA.Consecutive patients (N = 177 in an observational study completed PROMIS computer adapted tests (CATs and a short form (SF assessing pain, fatigue, physical function, mood, sleep, and participation. We assessed test-test reliability and internal consistency using correlation and Cronbach's alpha. We assessed convergent validity by examining Pearson correlations between PROMIS measures and existing measures of similar domains and known groups validity by comparing scores across disease activity levels using ANOVA.Participants were mostly female (82% and white (83% with mean (SD age of 56 (13 years; 24% had ≤ high school, 29% had RA ≤ 5 years with 13% ≤ 2 years, and 22% were disabled. PROMIS Physical Function, Pain Interference and Fatigue instruments correlated moderately to strongly (rho's ≥ 0.68 with corresponding PROs. Test-retest reliability ranged from .725-.883, and Cronbach's alpha from .906-.991. A dose-response relationship with disease activity was evident in Physical Function with similar trends in other scales except Anger.These data provide preliminary evidence of reliability and construct validity of PROMIS CATs to assess RA symptoms and impacts, and feasibility of use in clinical care. PROMIS instruments captured the experiences of RA patients across the broad continuum of RA symptoms and function, especially at low disease activity levels. Future research is needed to evaluate performance in relevant subgroups, assess responsiveness and identify clinically meaningful changes.
Full Text Available Objective: To translate the Perceived Stress Scale (versions PSS-4, -10 and -14 and to assess its psychometric properties in a sample of general Greek population. Methods: 941 individuals completed anonymously questionnaires comprising of PSS, the Depression Anxiety and Stress scale (DASS-21 version, and a list of stress-related symptoms. Psychometric properties of PSS were investigated by confirmatory factor analysis (construct validity, Cronbach’s alpha (reliability, and by investigating relations with the DASS-21 scores and the number of symptoms, across individuals’ characteristics. The two-factor structure of PSS-10 and PSS-14 was confirmed in our analysis. We found satisfactory Cronbach’s alpha values (0.82 for the full scale for PSS-14 and PSS-10 and marginal satisfactory values for PSS-4 (0.69. PSS score exhibited high correlation coefficients with DASS-21 subscales scores, meaning stress (r = 0.64, depression (r = 0.61, and anxiety (r = 0.54. Women reported significantly more stress compared to men and divorced or widows compared to married or singled only. A strong significant (p < 0.001 positive correlation between the stress score and the number of self-reported symptoms was also noted. Conclusions: The Greek versions of the PSS-14 and PSS-10 exhibited satisfactory psychometric properties and their use for research and health care practice is warranted.
Andreou, Eleni; Alexopoulos, Evangelos C; Lionis, Christos; Varvogli, Liza; Gnardellis, Charalambos; Chrousos, George P; Darviri, Christina
To translate the Perceived Stress Scale (versions PSS-4, -10 and -14) and to assess its psychometric properties in a sample of general Greek population. 941 individuals completed anonymously questionnaires comprising of PSS, the Depression Anxiety and Stress scale (DASS-21 version), and a list of stress-related symptoms. Psychometric properties of PSS were investigated by confirmatory factor analysis (construct validity), Cronbach's alpha (reliability), and by investigating relations with the DASS-21 scores and the number of symptoms, across individuals' characteristics. The two-factor structure of PSS-10 and PSS-14 was confirmed in our analysis. We found satisfactory Cronbach's alpha values (0.82 for the full scale) for PSS-14 and PSS-10 and marginal satisfactory values for PSS-4 (0.69). PSS score exhibited high correlation coefficients with DASS-21 subscales scores, meaning stress (r = 0.64), depression (r = 0.61), and anxiety (r = 0.54). Women reported significantly more stress compared to men and divorced or widows compared to married or singled only. A strong significant (p < 0.001) positive correlation between the stress score and the number of self-reported symptoms was also noted. The Greek versions of the PSS-14 and PSS-10 exhibited satisfactory psychometric properties and their use for research and health care practice is warranted.
Frank, Guido K W; Favaro, Angela; Marsh, Rachel; Ehrlich, Stefan; Lawson, Elizabeth A
Human brain imaging can help improve our understanding of mechanisms underlying brain function and how they drive behavior in health and disease. Such knowledge may eventually help us to devise better treatments for psychiatric disorders. However, the brain imaging literature in psychiatry and especially eating disorders has been inconsistent, and studies are often difficult to replicate. The extent or severity of extremes of eating and state of illness, which are often associated with differences in, for instance hormonal status, comorbidity, and medication use, commonly differ between studies and likely add to variation across study results. Those effects are in addition to the well-described problems arising from differences in task designs, data quality control procedures, image data preprocessing and analysis or statistical thresholds applied across studies. Which of those factors are most relevant to improve reproducibility is still a question for debate and further research. Here we propose guidelines for brain imaging research in eating disorders to acquire valid results that are more reliable and clinically useful. © 2018 Wiley Periodicals, Inc.
Lillo-Bevia, José R; Pallarés, Jesús G
To validate the new drive indoor trainer Hammer designed by Cycleops®. Eleven cyclists performed 44 randomized and counterbalanced graded exercise tests (100-500W), at 70, 85 and 100 rev.min -1 cadences, in seated and standing positions, on 3 different Hammer units, while a scientific SRM system continuously recorded cadence and power output data. No significant differences were detected between the three Hammer devices and the SRM for any workload, cadence, or pedalling condition (P value between 1.00 and 0.350), except for some minor differences (P 0.03 and 0.04) found in the Hammer 1 at low workloads, and for Hammer 2 and 3 at high workloads, all in seated position. Strong ICCs were found between the power output values recorded by the Hammers and the SRM (≥0.996; P=0.001), independently from the cadence condition and seated position. Bland-Altman analysis revealed low Bias (-5.5-3.8) and low SD of Bias (2.5-5.3) for all testing conditions, except marginal values found for the Hammer 1 at high cadences and seated position (9.6±6.6). High absolute reliability values were detected for the 3 Hammers (150-500W; CVreliable device to drive and measure power output in cyclists, providing an alternative to larger and more expensive laboratory ergometers, and allowing cyclists to use their own bicycle.
Marbach, G.; Beche, M.; Pajot, J.
The excellent behavior of PHENIX driver fuel and the burnup values currently reached suggest that the first SUPERPHENIX fuel load will meet the design lifetime. However, to ensure the reliability of the entire load, all the parameters affecting fuel behavior in reactor must be analyzed. For that purpose, we have taken into account all the results of the examination and verifications during the fabrication process of the first load subassemblies. These data concern geometrical parameters or oxide composition as well as the cladding tube and plug weld soundness tests. The objective is to determine the actual dispersion of all the parameters to ensure the absence of failure due to fabrication defects with very high statistical confidence limits. The influence of all the parameters has been investigated for the situations which can occur during power-up, steady-state operation and transients. The fabrication quality allows us to demonstrate that in all cases good behavior criteria for fuel and structure will be maintained. This demonstration is based on calculation code results as well as on validation by specific experiments
Salacinski, Amanda J; Alford, Micah; Drevets, Kathryn; Hart, Sarah; Hunt, Brian E
As an appealing alternative to reference glucose analyzers, portable glucometers are recommended for self-monitoring at home, in the field, and in research settings. The purpose was to characterize the accuracy and precision, and bias of glucometers in biomedical research. Fifteen young (20-36 years; mean = 24.5), moderately to highly active men (n = 10) and women (n = 5), defined by exercising 2 to 3 times a week for the past 6 months, were given an oral glucose tolerance test (OGTT) after an overnight fast. Participants ingested 50, 75, or 150 grams of glucose over a 5-minute period. The glucometer was compared to a reference instrument. The glucometer had 39% of values within 15% of measurements made using the reference instrument ranging from 45.05 to 169.37 mg/dl. There was both a proportional (-0.45 to -0.39) and small fixed (5.06 and 0.90 mg/dl) bias. Results of the present study suggest that the glucometer provided poor validity and reliability results compared to the results provided by the reference laboratory analyzer. The portable glucometers should be used for patient management, but not for diagnosis, treatment, or research purposes. © 2014 Diabetes Technology Society.
Dogan, Tayfun; Cetin, Bayram
The purpose of the present study was to investigate the reliability and validity of the Turkish version of the Tromso Social Intelligence Scale (TSIS) developed by Silvera, Martinussen, and Dahl (2001). 719 students from Sakarya University participated in the study. Construct validity and criterion related validity and reliability were assessed.…
Leticia de Matos Malavasi
Full Text Available The lack of adherence to practice physical activities urges several researchers to ind answers for this matter. Among these researches, it is investigated how or what motivates people to perform any type of physical activity. Besides that, the environmental conditions are an important reason to establish a healthier lifestyle among individuals. In Brazil, the amount of validated scales about environmental barriers for physical activity in communities is restricted. The validation and the cultural adaptation of these instruments are important not only to compare with studies from other countries, but mainly for planning public politics to improve the adherence to practice physical activities. Thus, the present research aimed to analyze the validity and reliability of the Brazilian version of the Neighborhood EnvironmentWalkability Scale (NEWS. The methodological procedures were structured in three stages. The first stage had the following procedures: translation of NEWS and back-translation by bilingual specialists. The second stage was the adaptation of NEWS to the Brazilian reality through a pilot study and with reliability. The third stage, together with a professional urban panel indicating which neighborhoods had better or worse mobility, it was accomplished a application of the NEWS questionnaire to assure construct validation. The sample of this research were separated in two parts, 75persons for the reliability; and for the validity of the questionnaire 200 residents from the four neighborhoods pointed by the specialists of the city of Florianópolis (SC. Through the NEWS the subjects answered questions about the neighborhoods regarding: type of residences, stores and trade proximity, perception of access to these places, streets characteristics, facilities to walk and ride bicycle, and safety related to traffic and crimes. The statistical analysis was made in the SPSS 11.0 version for the intra-class correlation and reliability for the
Remelhe, Mafalda; Teixeira, Pedro M; Lopes, Irene; Silva, Luís; Correia de Sousa, Jaime
Enabling patients with asthma to obtain the knowledge, confidence and skills they need in order to assume a major role in the management of their disease is cost effective. It should be an integral part of any plan for long-term control of asthma. The modified Patient Enablement Instrument (mPEI) is an easily administered questionnaire that was adapted in the United Kingdom to measure patient enablement in asthma, but its applicability in Portugal is not known. Validity and reliability of questionnaires should be tested before use in settings different from those of the original version. The purpose of this study was to test the applicability of the mPEI to Portuguese asthma patients after translation and cross-cultural adaptation, and to verify the structural validity, internal consistency and reproducibility of the instrument. The mPEI was translated to Portuguese and back translated to English. Its content validity was assessed by a debriefing interview with 10 asthma patients. The translated instrument was then administered to a random sample of 142 patients with persistent asthma. Structural validity and internal consistency were assessed. For reproducibility analysis, 86 patients completed the instrument again 7 days later. Item-scale correlations and exploratory factor analysis were used to assess structural validity. Cronbach's alpha was used to test internal consistency, and the intra-class correlation coefficient was used for the analysis of reproducibility. All items of the Portuguese version of the mPEI were found to be equivalent to the original English version. There were strong item-scale correlations that confirmed construct validity, with a one component structure and good internal consistency (Cronbach's alpha >0.8) as well as high test-retest reliability (ICC=0.85). The mPEI showed sound psychometric properties for the evaluation of enablement in patients with asthma making it a reliable instrument for use in research and clinical practice in
Gironés Muriel, Alberto; Campos Segovia, Ana; Ríos Gómez, Patricia
The study of mediating variables and psychological responses to child surgery involves the evaluation of both the patient and the parents as regards different stressors. To have a reliable and reproducible valid evaluation tool that assesses the level of paternal involvement in relation to different stressors in the setting of surgery. A self-report questionnaire study was completed by 123 subjects of both sexes, subdivided into 2populations, due to their relationship with the hospital setting. The items were determined by a group of experts and analysed using the Lawshe validity index to determine a first validity of content. Subsequently, the reliability of the tool was determined by an item-re-item analysis of the 2sub-populations. A factorial analysis was performed to analyse the construct validity with the maximum likelihood and rotation of varimax type factors. A questionnaire of paternal concern was offered, consisting of 21 items with a Cronbach coefficient of 0.97, giving good precision and stability. The posterior factor analysis gives an adequate validity to the questionnaire, with the determination of 10 common stressors that cover 74.08% of the common and non-common variance of the questionnaire. The proposed questionnaire is reliable, valid and easy-to-apply and is developed to assess the level of paternal concern about the surgery of a child and to be able to apply measures and programs through the prior assessment of these elements. Copyright © 2016 Asociación Española de Pediatría. Publicado por Elsevier España, S.L.U. All rights reserved.
Negahban, Hossein; Mazaheri, Masood; Salavati, Mahyar; Sohani, Soheil Mansour; Askari, Marjan; Fanian, Hossein; Parnianpour, Mohamad
The aims of this study were to culturally adapt and validate the Persian version of Foot and Ankle Outcome Score (FAOS) and present data on its psychometric properties for patients with different foot and ankle problems. The Persian version of FAOS was developed after a standard forward-backward translation and cultural adaptation process. The sample included 93 patients with foot and ankle disorders who were asked to complete two questionnaires: FAOS and Short-Form 36 Health Survey (SF-36). To determine test-retest reliability, 60 randomly chosen patients completed the FAOS again 2 to 6 days after the first administration. Test-retest reliability and internal consistency were assessed using intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. To evaluate convergent and divergent validity of FAOS compared to similar and dissimilar concepts of SF-36, the Spearman's rank correlation was used. Dimensionality was determined by assessing item-subscale correlation corrected for overlap. The results of test-retest reliability show that all the FAOS subscales have a very high ICC, ranging from 0.92 to 0.96. The minimum Cronbach's alpha level of 0.70 was exceeded by most subscales. The Spearman's correlation coefficient for convergent construct validity fell within 0.32 to 0.58 for the main hypotheses presented a priori between FAOS and SF-36 subscales. For dimensionality, the minimum Spearman's correlation coefficient of 0.40 was exceeded by most items. In conclusion, the results of our study show that the Persian version of FAOS seems to be suitable for Iranian patients with various foot and ankle problems especially lateral ankle sprain. Future studies are needed to establish stronger psychometric properties for patients with different foot and ankle problems.
Lenderking, William R; Wyrwich, Kathleen W; Stolar, Marilyn; Howard, Kellee A; Leibman, Chris; Buchanan, Jacqui; Lacey, Loretto; Kopp, Zoe; Stern, Yaakov
The Dependence Scale (DS) was designed to measure dependence on others among patients with Alzheimer's disease (AD). The objectives of this research were primarily to strengthen the psychometric evidence for the use of the DS in AD studies. Patients with mild to moderately severe AD were examined in 3 study databases. Within each data set, internal consistency, validity, and responsiveness were examined, and structural equation models were fit. The DS has strong psychometric properties. The DS scores differed significantly across known groups and demonstrated moderate to strong correlations with measures hypothesized to be related to dependence (|r| ≥ .31). Structural equation modeling supported the validity of the DS concept. An anchor-based DS responder definition to interpret a treatment benefit over time was identified. The DS is a reliable, valid, and interpretable measure of dependence associated with AD and is shown to be related to--but provides information distinct from--cognition, functioning, and behavior.
Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje
This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8-2.6%] - 2.2% [95% CI: 0.0-4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2-2.4%] - 2.7% [95% CI: 2.1-4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92-0.99] - 0.99 [95% CI: 0.98-1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters' performances. Competitive-level skaters needed shorter time (24.4-26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
Grindem, Hege; Eitzen, Ingrid; Snyder-Mackler, Lynn; Risberg, May Arna
The current methods measuring sports activity after anterior cruciate ligament (ACL) injury are commonly restricted to the most knee-demanding sports, and do not consider participation in multiple sports. We therefore developed an online activity survey to prospectively record the monthly participation in all major sports relevant to our patient-group. To assess the reliability, content validity and concurrent validity of the survey and to evaluate if it provided more complete data on sports participation than a routine activity questionnaire. 145 consecutively included ACL-injured patients were eligible for the reliability study. The retest of the online activity survey was performed 2 days after the test response had been recorded. A subsample of 88 ACL-reconstructed patients was included in the validity study. The ACL-reconstructed patients completed the online activity survey from the first to the 12th postoperative month, and a routine activity questionnaire 6 and 12 months postoperatively. The online activity survey was highly reliable (κ ranging from 0.81 to 1). It contained all the common sports reported on the routine activity questionnaire. There was a substantial agreement between the two methods on return to preinjury main sport (κ=0.71 and 0.74 at 6 and 12 months postoperatively). The online activity survey revealed that a significantly higher number of patients reported to participate in running, cycling and strength training, and patients reported to participate in a greater number of sports. The online activity survey is a highly reliable way of recording detailed changes in sports participation after ACL injury. The findings of this study support the content and concurrent validity of the survey, and suggest that the online activity survey can provide more complete data on sports participation than a routine activity questionnaire.
Grindem, Hege; Eitzen, Ingrid; Snyder-Mackler, Lynn; Risberg, May Arna
Background Current methods measuring sports activity after anterior cruciate ligament (ACL) injury are commonly restricted to the most knee-demanding sport, and do not consider participation in multiple sports. We therefore developed an online activity survey to prospectively record monthly participation in all major sports relevant to our patient-group. Objective To assess the reliability, content validity, and concurrent validity of the survey, and evaluate if it provided more complete data on sports participation than a routine activity questionnaire. Methods One hundred and forty-five consecutively included ACL-injured patients were eligible for the reliability study. The retest of the online activity survey was performed two days after the test response had been recorded. A subsample of 88 ACL-reconstructed patients were included in the validity study. The ACL-reconstructed patients completed the online activity survey from the first to the twelfth postoperative month, and a routine activity questionnaire 6 and 12 months postoperatively. Results The online activity survey was highly reliable (κ ranging from 0.81 to 1). It contained all the common sports reported on the routine activity questionnaire. There was substantial agreement between the two methods on return to preinjury main sport (κ = 0.71 and 0.74 at 6 and 12 months postoperatively). The online activity survey revealed that a significantly higher number of patients reported to participate in running, cycling and strength training, and patients reported to participate in a greater number of sports. Conclusion The online activity survey is a highly reliable way of recording detailed changes in sports participation after ACL injury. The findings of this study support the content and concurrent validity of the survey, and suggest that the online activity survey can provide more complete data on sports participation than a routine activity questionnaire. PMID:23645830
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%-80.2%). Fifty-four workers were included in the analysis of test-retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach's α coefficients in men and women were 0.85 and 0.86, respectively) and test-retest reliability (intraclass correlation coefficient = 0.91). Significant (p workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J
To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; particles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
Full Text Available The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS, a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%–80.2%. Fifty-four workers were included in the analysis of test–retest reliability (response rate, 77.1%. The SCS showed high internal consistency (Cronbach’s α coefficients in men and women were 0.85 and 0.86, respectively and test–retest reliability (intraclass correlation coefficient = 0.91. Significant (p < 0.001, positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS’s convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05 and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort–reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Full Text Available Holland’s RIASEC types are being frequently utilized in commercial vocational profiling tools for various human resources purposes. On the other hand, the length of the RIASEC scale and the copyright restrictions put by the publishers, are important barriers to application. In the present study, a RIASEC scale consisting of 41 items and adapted to Turkish language and culture, was developed. Each RIASEC type was represented with 6 or 7 items. Responses were obtained from a sample of 364 business professionals. Survey results indicated a good reliability for the scale, with a Cronbach’s alpha of 0.889. However, reliability analysis pointed out to the need for revision of certain scale items when each RIASEC facet was separately analysed. Then, feedback regarding scale composition, wording and structure were gathered from 20 PhD students. Lastly, feedback of 7 HR professionals were sought, regarding scale items’ expression and application of the scale in regular HR processes of companies. Results from face and content validity have been that for some items of the scale, more descriptive and specific expressions in Turkish are required. Moreover, some of the items would need to be reallocated to another facet where they would be more relevant. In line with findings from face and content validity, construct validity through confirmatory factor analysis also indicated that the short version of RIASEC must be revised substantially in order to become a valid tool for vocational profiling in Turkish context.
Herrera-Kiengelher, L; Zepeda-Zaragoza, J; Austria-Corrales, F; Vázquez-Zarate, V M
Patient Safety is a major public health problem worldwide and is responsibility of all those involved in health care. Establishing a Safety Culture has proved to be a factor that favors the integration of work teams, communication and construction of clear procedures in various organizations. Promote a culture of safety depends on several factors, such as organization, work unit and staff. Objective assessment of these factors will help to identify areas for improvement and establish strategic lines of action. [corrected] To adapt, validate and calibrate the questionnaire Culture of Quality in Health Services (CQHS) in Mexican population. A cross with a stratified representative sample of 522 health workers. The questionnaire was translated and adapted from Singer's. Content was validated by experts, internal consistency, confirmatory factorial validity and item calibration with Samejima's Graded Response Model. Convergent and divergent construct validity was confirmed from the CQHS, item calibration showed that the questionnaire is able to discriminate between patients and represent different levels of the hypothesized dimensions with greater accuracy and lower standard error. The CQHS is a valid and reliable instrument to assess patient safety culture in hospitals in Mexico. Copyright © 2013 SECA. Published by Elsevier Espana. All rights reserved.
Anderson-Butcher, Dawn; Iachini, Aidyn L.; Amorose, Anthony J.
Objective: This study describes the development and validation of a perceived social competence scale that social workers can easily use to assess children's and youth's social competence. Method: Exploratory and confirmatory factor analyses were conducted on a calibration and a cross-validation sample of youth. Predictive validity was also…
Full Text Available The purpose of this study is to develop a scale unique to our culture, concerning individual instrument performance anxiety of the students who are getting instrument training in the Department of Music Education. In the study, the descriptive research model is used and qualitative research techniques are utilized. The study population consists of the students attending the 23 universities which has Music Education Department. The sample of the study consists of 438 girls and 312 boys, totally 750 students who are studying in the Department of Music Education of randomly selected 10 universities. As a result of the explanatory and confirmatory factor analyses that were performed, a one-dimensional structure consisting of 14 items was obtained. Also, t-scores and the coefficient scores of total item correlation concerning the distinguishing power of the items, the difference in the scores of the set of lower and upper 27% was calculated, and it was observed that the items are distinguishing as a result of both analyses. Of the scale, Cronbach's alpha coefficient of internal consistency was calculated as .94, and test-retest reliability coefficient was calculated as .93. As a result, a valid and reliable assessment and evaluation instrument that measures the exam performance anxiety of the students studying in the Department of Music Education, has been developed.Extended AbstractsIntroductionAnxiety is a universal phenomenon which people experience once or a few times during lives. It was accepted as concern for the future or as an unpleasant emotional experience regarding probable hitches of the events (Di Tomasso & Gosch, 2002.In general, the occasions on which negative feelings are experienced cause anxiety to arise (Baltaş and Baltaş, 2000. People also feel anxious in dangerous situations. Anxiety may lead a person to be creative, while it may have hindering characteristics. Anxiety is that an individual considers him
To improve the reliability of offshore wind turbines, accurate prediction of their response is required. Therefore, validation of models with site measurements is imperative. In the present thesis a 3.6MW pitch regulated-variable speed offshore wind turbine on a monopole foundation is built...... are used for the modification of the sub-structure/foundation design for possible material savings. First, the background of offshore wind engineering, including wind-wave conditions, support structure, blade loading and wind turbine dynamics are presented. Second, a detailed description of the site...
Erkin, Özüm; Göl, İlknur
This study aims to measure the validity and reliability of Turkish male breast self-examination (MBSE) instrument. The methodological study was performed in 2016 at Ege University, Faculty of Nursing, İzmir, Turkey. The MBSE includes ten steps. For validity studies, face validity, content validity, and construct validity (exploratory factor analysis) were done. For reliability study, Kuder Richardson was calculated. The content validity index was found to be 0.94. Kendall W coefficient was 0.80 (p=0.551). The total variance explained by the two factors was found to be 63.24%. Kuder Richardson 21 was done for reliability study and found to be 0.97 for the instrument. The final instrument included 10 steps and two stages. The Turkish version of MBSE is a valid and reliable instrument for early diagnose. The MBSE can be used in Turkish speaking countries and cultures with two stages and 10 steps.
Chiang, Hui-Ying; Hsiao, Ya-Chu; Lin, Shu-Yuan; Lee, Huan-Fang
To examine the psychometric validity and reliability of the incident reporting culture questionnaire (IRCQ; in Chinese) following an exploration of the reporting culture perceived by hospital nurses in Taiwan. Scale development with psychometric examination and a cross-sectional study. Ten teaching hospitals. A total of 1064 nurses participated with an average response rate of 83% between November 2008 and June 2009. The factorial construct, criterion-related validity, homogeneity and stability of the IRCQ were evaluated. The nurses' perceptions of the IRCQ were also explored. The four-factor structure of the 20-item IRCQ had satisfactory construct validity (explained variance: 49.37%), criterion-related validity (r = 0.42; P = 0.001), reliability (Cronbach's alpha: 0.83) and stability (3-week-interval correlation: r = 0.80; P = 0.001). These factors included 'application of learning from errors', 'readiness to provide feedback on incident reports', 'collegial atmospheres of unpleasantness and punishment' (CA) and 'incident management: confidential and system driven'. The nurses perceived a moderate overall reporting culture (mean positive response = 49.25%; range: 67.2-24.94%). They weakly agreed on the CA factor of five items (mean positive response = 24.94%; range: 33.0-17.2%). This study provides empirical evidence for the psychometric properties of the IRCQ and the reporting culture which nurses perceive in Taiwan. To Taiwanese nurses, the reporting culture within their work environments especially as it relates to coworker relations, inter-professional collaboration and non-punitive atmosphere is their major concern. Healthcare administrators should consider nurses' perceptions related to incident reporting when managing underreporting issues.
Aggio, Daniel; Fairclough, Stuart; Knowles, Zoe; Graves, Lee
Adaptation of physical activity self-report questionnaires is sometimes required to reflect the activity behaviours of diverse populations. The processes used to modify self-report questionnaires though are typically underreported. This two-phased study used a formative approach to investigate the validity and reliability of the Physical Activity Questionnaire for Adolescents (PAQ-A) in English youth. Phase one examined test content and response process validity and subsequently informed a modified version of the PAQ-A. Phase two assessed the validity and reliability of the modified PAQ-A. In phase one, focus groups (n = 5) were conducted with adolescents (n = 20) to investigate test content and response processes of the original PAQ-A. Based on evidence gathered in phase one, a modified version of the questionnaire was administered to participants (n = 169, 14.5 ± 1.7 years) in phase two. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and intra-class correlations, respectively. Spearman correlations were used to assess associations between modified PAQ-A scores and accelerometer-derived physical activity, self-reported fitness and physical activity self-efficacy. Phase one revealed that the original PAQ-A was unrepresentative for English youth and that item comprehension varied. Contextual and population/cultural-specific modifications were made to the PAQ-A for use in the subsequent phase. In phase two, modified PAQ-A scores had acceptable internal consistency (α = 0.72) and test-retest reliability (ICC = 0.78). Modified PAQ-A scores were significantly associated with objectively assessed moderate-to-vigorous physical activity (r = 0.39), total physical activity (r = 0.42), self-reported fitness (r = 0.35), and physical activity self-efficacy (r = 0.32) (p ≤ 0.01). The modified PAQ-A had acceptable internal consistency and test-retest reliability. Modified PAQ-A scores
Lin, Li-Chun; Lee, Sheuan; Ueng, Steve Wen-Neng; Tang, Woung-Ru
The objective of this study was to test the reliability and construct validity of the Nurse Practitioners' Roles and Competencies Scale. The role of nurse practitioners has attracted international attention. The advanced nursing role played by nurse practitioners varies with national conditions and medical environments. To date, no suitable measurement tool has been available for assessing the roles and competencies of nurse practitioners in Asian countries. Secondary analysis of data from three studies related to nurse practitioners' role competencies. We analysed data from 563 valid questionnaires completed in three studies to identify the factor structure of the Nurse Practitioners' Roles and Competencies Scale. To this end, we performed exploratory factor analysis using principal component analysis extraction with varimax orthogonal rotation. The internal consistency reliabilities of the overall scale and its subscales were examined using Cronbach's alpha coefficient. The scale had six factors: professionalism, direct care, clinical research, practical guidance, medical assistance, as well as leadership and reform. These factors explained 67·5% of the total variance in nurse practitioners' role competencies. Cronbach's alpha coefficient for the overall scale was 0·98, and those of its subscales ranged from 0·83-0·97. The internal consistency reliability and construct validity of the Nurse Practitioners' Roles and Competencies Scale were good. The high internal consistency reliabilities suggest item redundancy, which should be minimised by using item response theory to enhance the applicability of this questionnaire for future academic and clinical studies. The Nurse Practitioners' Roles and Competencies Scale can be used as a tool for assessing the roles and competencies of nurse practitioners in Taiwan. Our findings can also serve as a reference for other Asian countries to develop the nurse practitioner role. © 2015 John Wiley & Sons Ltd.
Arce-Ferrer, Alvaro J.; Castillo, Irene Borges
The use of face-to-face interviews is controversial for college admissions decisions in light of the lack of availability of validity and reliability evidence for most college admission processes. This study investigated reliability and incremental predictive validity of a face-to-face postgraduate college admission interview with a sample of…
de Groot, Sonja; Balvers, Inge J.M.; Kouwenhoven, Sanne M.; Janssen, Thomas W.J.
The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
Nederhof, Esther; Brink, Michel S.; Lemmink, Koen A. P. M.
The purpose of the present study was to investigate the cross-cultural validity of the Recovery Stress Questionnaire for Athletes (RESTQ-sport) by analysing reliability and validity of a Dutch translation. Two studies were performed to assess test-retest reliability with a one week interval,
De Groot, Sonja; Balvers, Inge J. M.; Kouwenhoven, Sanne M.; Janssen, Thomas W. J.
The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
The present study aims to determine the validity and reliability of the academic resilience scale in Turkish high school. The participances of the study includes 378 high school students in total (192 female and 186 male). A set of analyses were conducted in order to determine the validity and reliability of the study. Firstly, both exploratory…
This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Raykov, Tenko; Marcoulides, George A.
A latent variable modeling method is outlined, which accomplishes estimation of criterion validity and reliability for a multicomponent measuring instrument with hierarchical structure. The approach provides point and interval estimates for the scale criterion validity and reliability coefficients, and can also be used for testing composite or…
Bhat, Mehraj A.
This paper is based on the construction and evaluation of reliability and validity of reasoning ability test at secondary school students. In this paper an attempt was made to evaluate validity, reliability and to determine the appropriate standards to interpret the results of reasoning ability test. The test includes 45 items to measure six types…
Markon, Kristian E.; Chmielewski, Michael; Miller, Christopher J.
In 2 meta-analyses involving 58 studies and 59,575 participants, we quantitatively summarized the relative reliability and validity of continuous (i.e., dimensional) and discrete (i.e., categorical) measures of psychopathology. Overall, results suggest an expected 15% increase in reliability and 37% increase in validity through adoption of a…
Worrell, Frank C.; Mello, Zena R.
In this study, the authors examined the reliability, structural validity, and concurrent validity of Zimbardo Time Perspective Inventory (ZTPI) scores in a group of 815 academically talented adolescents. Reliability estimates of the purported factors' scores were in the low to moderate range. Exploratory factor analysis supported a five-factor…
Boonstra, Anne M.; Schiphorst Preuper, Henrica R.; Reneman, Michiel F.; Posthumus, Jitze B.; Stewart, Roy E.
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional
Boonstra, Anne M.; Reneman, Michiel F.; Stewart, Roy E.; Balk, Gerlof A.
The aim of this study was to determine the reliability and discriminant validity of the Dutch version of the life satisfaction questionnaire (Lisat-9 DV) to assess patients with an acquired brain injury. The reliability study used a test-retest design, and the validity study used a cross-sectional design. The setting was the general rehabilitation…
Mehmet Emrah Karadere
Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that Reasoning with Inductive Argument Test supports reliability and validity in Turkish population. [JCBPR 2013; 2(3.000: 156-161
Barrett, Eva; McCreesh, Karen; Lewis, Jeremy
A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.
Prevention strategies are effective only when there are epidemiological data for the targeted populations. The collection of such .... Proquest, Sport discuss and Cochrane as these are ... 0.74, test retest reliability 0.70; Diet: internal consistency:.
Travel time reliability (TTR) has been proposed as : a better measure of a facilitys performance than : a statistical measure like peak hour demand. TTR : is based on more information about average traffic : flows and longer time periods, thus inc...
The EuReDatA Working Group produced a basic document that addressed many of the problems associated with the design of a suitable data collection scheme to achieve pre-defined objectives. The book that resulted from this work describes the need for reliability data, data sources and collection procedures, component description and classification, form design, data management, updating and checking procedures, the estimation of failure rates, availability and utilisation factors, and uncertainties in reliability parameters. (DG)
Fang, C K; Li, P Y; Lai, M L; Lin, M H; Bridge, D T; Chen, H W
The purpose of this study was to develop a Physician's Spiritual Well-Being Scale (PSpWBS). The significance of a physician's spiritual well-being was explored through in-depth interviews with and qualitative data collection from focus groups. Based on the results of qualitative analysis and related literature, the PSpWBS consisting of 25 questions was established. Reliability and validity tests were performed on 177 subjects. Four domains of the PSpWBS were devised: physician's characteristics; medical practice challenges; response to changes; and overall well-being. The explainable total variance was 65.65%. Cronbach α was 0.864 when the internal consistency of the whole scale was calculated. Factor analysis showed that the internal consistency Cronbach α value for each factor was between 0.625 and 0.794 and the split-half reliability was 0.865. The scale has satisfactory reliability and validity and could serve as the basis for assessment of the spiritual well-being of a physician.
McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio
We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807
seyed abolfazl zakerian; Roya Azizi; Mehdi Rahgozar
The term usability refers to a special index for success of an operating system. This study aimed to determine the reliability and validity of the Software Usability Measurements Inventory (SUMI) questionnaire as one of the valid and common questionnaires about usability evaluation. The back translation method was used to translate the questionnaire from English to Persian back to English. Moreover, repeatability or test-retest reliability was practically used to determine the reliability of ...
Andersson, Björn; Xin, Tao
In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…
Seyyede Zohreh Ziatabar Ahmadi
Full Text Available Objective: Theory of mind (ToM or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children.Method: We searched MEDLINE (PubMed interface, Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library databases from 1990 to June 2015. Search strategy was Latin transcription of ‘Theory of Mind’ AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks for ToM assessment and/or had no description about structure, validity or reliability of their tests. Methodological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP.Result: In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. Conclusion: There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric
Keefe, Richard S E; Davis, Vicki G; Spagnola, Nathan B; Hilt, Dana; Dgetluck, Nancy; Ruse, Stacy; Patterson, Thomas D; Narasimhan, Meera; Harvey, Philip D
Cognitive functioning can be assessed with performance-based assessments such as neuropsychological tests and with interview-based assessments. Both assessment methods have the potential to assess whether treatments for schizophrenia improve clinically relevant aspects of cognitive impairment. However, little is known about the reliability, validity and treatment responsiveness of interview-based measures, especially in the context of clinical trials. Data from two studies were utilized to assess these features of the Schizophrenia Cognition Rating Scale (SCoRS). One of the studies was a validation study involving 79 patients with schizophrenia assessed at 3 academic research centers in the US. The other study was a 32-site clinical trial conducted in the US and Europe comparing the effects of encenicline, an alpha-7 nicotine agonist, to placebo in 319 patients with schizophrenia. The SCoRS interviewer ratings demonstrated excellent test-retest reliability in several different circumstances, including those that did not involve treatment (ICC> 0.90), and during treatment (ICC>0.80). SCoRS interviewer ratings were related to cognitive performance as measured by the MCCB (r=-0.35), and demonstrated significant sensitivity to treatment with encenicline compared to placebo (Pcognition in schizophrenia, and may be useful for clinical practice. The weaknesses of the SCoRS include its reliance on informant information, which is not available for some patients, and reduced validity when patient's self-report is the sole information source. Copyright © 2014 Elsevier B.V. and ECNP. All rights reserved.
Chiang, Peggy Pei-Chia; Fenwick, Eva; Marella, Manjula; Finger, Robert; Lamoureux, Ecosse
To evaluate the validity, reliability, and measurement characteristics of the Visual Function 14 (VF-14) in a German sample using Rasch analysis. This was a clinic-based, cross-sectional study with 184 patients with low vision recruited from an outpatient clinic at a German eye hospital. Participants underwent a clinical examination and completed the German VF-14 scale. The validity of the VF-14 scale was assessed using Rasch analysis. The main outcome measure was the overall functional score provided by the VF-14. After collapsing two response categories for items 13 and 14, the VF-14 scale satisfied fundamental criteria to achieve fit to the Rasch model, namely, ordered thresholds, the ability to distinguish between different strata of participant ability, absence of misfitting items, no evidence of unidimensionality, and no significant differential item functioning for key sociodemographic covariates. The VF-14 is able to discriminate between participants with different levels of vision impairment and across different cultural groups. The VF-14 is a valid, reliable, and unidimensional questionnaire for use in a German population. These findings contribute to the growing evidence base for second generation patient reported outcome measures in ophthalmology, and support the use of the German VF-14 in tertiary eye clinics in Germany to capture the impact of visual impairment on visual function from the patient's perspective and to inform low vision rehabilitation and interventions.
Full Text Available This study investigates the validity and reliability of the Turkish adaptation ofAcceptance of Couple Violence Scale (ACVS. The data of research has been attainedfrom 474 (M =243, F=231 high school students who were attending 1st, 2nd and 3thclass and coming from middle socio-economic levels in Malatya. Acceptance of CoupleViolence Scale has 11 items, Likert type and 4 point response format. The constructvalidity of ACVS was conducted by using exploratory factor analysis and varimaxrotation. Single independent factor with the eigenvalue over 1.00 has been found. Thisfactor explained 44% of total variance. To test concurrent validity, correlations betweenscores on ACVS and Aggressiveness Questionnaire were calculated. There was asignificant relationship between scores on the two scales (r= .61. Cronbach alphacoefficient of the scale was found “.87”; test-retest correlation coefficient was “r=.80”.Item-total correlation co-efficiencies vary between “.52” and “.71”. Findings show thatACVS can be used with acceptable level of validity and reliability for high schoolstudents.
Zhang, Tingting; Yin, Anchun; Sun, Xiaohong; Liu, Qigui; Song, Guirong; Li, Lianhong
To develop psychosocial adaptation scale for Parkinson's disease (PD) in Chinese population and evaluate its reliability and validity. The items were designed by literature review, expert consultation and semi-structured interview. The methods of corrected item-total correlation, discrimination analysis and exploratory factor analysis were used for items selection. 427 valid scales from PD patients were collected in the study to test the reliability and validity. The scale incorporated six dimensions: anxiety, self-esteem, attitude, self-acceptance, self-efficacy and social support, a total of 32 items. The scale possessed good internal consistency. The test-retest correlation coefficient was 0.99 and average content validation rate was 0.97. The Hoehn and Yahr stage were correlated with total score of the scale. The psychosocial adaptation scale in this study showed good reliability and validity, it can be used as a reliable and valid instrument to evaluate the psychosocial adaptation of PD objectively and effectively.
Mehrnoosh Pazargadi; Tahereh Ashktorab; Sharareh Khosravi; Hamid Alavi majd
Background: The necessity of a valid and reliable assessment tool is one of the most repeated issues in nursing students` clinical evaluation. But it is believed that present tools are not mostly valid and can not assess students` performance properly.Objectives: This study was conducted to design a valid and reliable assessment tool for evaluating nursing students` performance in clinical education.Methods: In this methodological study considering nursing students` performance definition; th...
Because of the lack of consistency in the associations of the socioeconomic status (SES) of prostate cancer (PC) patients from diverse racial and ethnic backgrounds with PC health outcomes, I created the Socioeconomic Status Instrument (SESI) from the Demographic and Health Access components of the Behavioral Risk Factor Surveillance System 2004 Questionnaires and the socioeconomic indices of the subjects' residential counties to better assess the SES of PC patients. The SESI was tested on 220 consecutive subjects with pathologically confirmed PC at the Veterans Affairs Medical Center in Houston, TX. A team that included an epidemiologist, a validation statistician/health services research scientist, and PC survivors assessed the content validity of the SESI. The construct validity of the SESI was assessed with factor analysis by extracting and analyzing 5 principal components based on the subjects' individual responses on the assessment: county socioeconomic characteristics, individual socioeconomic characteristics, financial distress, increased domestic burden with limited earnings, and affluence. The internal consistency reliability of the SESI was assessed with Cronbach's alpha coefficients. Based on the reviews of the SESI, all of the initial 10 items were retained. The correlations between individual responses on the SESI were similar to the results of previous studies. The 5 principal components that I assessed accounted for 71.5% of the variance. Factor loadings ranged from 0.66 to 0.98 and communalities ranged from 0.55 to 0.94. County socioeconomic characteristics accounted for 22.6% of the variance, whereas individual socioeconomic characteristics accounted for 14.6% of the variance. The overall Cronbach's alpha coefficient was 0.78. The SESI is valid and reliable. Accurate measurements of the SES of PC patients would provide better guidance for future research and care deliveries.
1 University of Northern Iowa, Division of Athletic Training, 003C Human. Performance Center, Cedar ... concurrent validity of the fingertip-to-floor distance test (FFD) ... in these protocols are spinal and extremity range of motion, pelvic control ...
This article discusses the use of assessment by teachers to replace external marking. It shows how professional participation and moderation can provide reliability in summative assessment, even in public examinations for older students. It draws on historical experiences of assessment for A-level English literature.
All three of these instruments do not involve high costs, do not require high technical skills, mobile, save time, and are suitable for use in large populations. Because all three instruments can estimate the percentage of body fat, but it is important to identify the most appropriate instruments and have high reliability. Hence, this ...
Konge, Lars; Larsen, Klaus Richter; Clementsen, Paul
: The interrater reliability was high, with Cronbach's a = 0.86. Assessment of 3 bronchoscopies by a single rater had a generalizability coefficient of 0.84. The correlation between experience and performance was good (Pearson correlation = 0.76). There were significant differences between the groups for all...
Powers, Stephen; And Others
Spanish speaking first graders were administered the Artes de Lenguage (ADL)--a Spanish, criterion-referenced, language arts test. Reliability analyses indicated the adequacy of three of the four subscales (Phonetic Analysis, Vocabulary Development, Comprehension Skills, and General Skills). A principal factors analysis of the intercorrelation…
Cetin, Bayram; Yaman, Erkan; Peker, Adem
The purpose of this study is to develop a reliable and valid scale, which determines cyber victimization and bullying behaviors of high school students. Research group consisted of 404 students (250 male, 154 male) in Sakarya, in 2009-2010 academic years. In the study sample, mean age is 16.68. Content validity and face validity of the scale was…
Shek, Daniel T. L.; Lai, Kelly Y. C.
Reliability and validity of Chinese Self-Report Family Inventory (C-SFI) were examined in three studies. Study 1 showed C-SFI was temporally stable and internally consistent. Study 2 indicated C-SFI could discriminate between clinical and nonclinical groups. Study 3 gave support for internal consistency, concurrent validity and construct validity.…
Yang, Lei; He, Chengqi; Pang, Marco Yiu Chung
Background The ability to perform a cognitive task while walking simultaneously (dual-tasking) is important in real life. However, the psychometric properties of dual-task walking tests have not been well established in stroke. Objective To assess the test-retest reliability, concurrent and known-groups validity of various dual-task walking tests in people with chronic stroke. Design Observational measurement study with a test-retest design. Methods Eighty-eight individuals with chronic stroke participated. The testing protocol involved four walking tasks (walking forward at self-selected and maximal speed, walking backward at self-selected speed, and crossing over obstacles) performed simultaneously with each of the three attention-demanding tasks (verbal fluency, serial 3 subtractions or carrying a cup of water). For each dual-task condition, the time taken to complete the walking task, the correct response rate (CRR) of the cognitive task, and the dual-task effect (DTE) for the walking time and CRR were calculated. Forty-six of the participants were tested twice within 3–4 days to establish test-retest reliability. Results The walking time in various dual-task assessments demonstrated good to excellent reliability [Intraclass correlation coefficient (ICC2,1) = 0.70–0.93; relative minimal detectable change at 95% confidence level (MDC95%) = 29%-45%]. The reliability of the CRR (ICC2,1 = 0.58–0.81) and the DTE in walking time (ICC2,1 = 0.11–0.80) was more varied. The reliability of the DTE in CRR (ICC2,1 = -0.31–0.40) was poor to fair. The walking time and CRR obtained in various dual-task walking tests were moderately to strongly correlated with those of the dual-task Timed-up-and-Go test, thus demonstrating good concurrent validity. None of the tests could discriminate fallers (those who had sustained at least one fall in the past year) from non-fallers. Limitation The results are generalizable to community-dwelling individuals with chronic stroke only
Spiekermann, Christoph; Rudack, Claudia; Stenner, Markus
The outcome of aesthetic rhinoplasty is determined by the patient's subjective satisfaction with the nasal appearance which is difficult to assess. The Utrecht Questionnaire for Outcome Assessment in Aesthetic Rhinoplasty (OAR) is a brief and reliable instrument to assess the influence of the subjective nasal appearance on quality of life in patients undergoing aesthetic rhinoplasty. Preoperative application of this questionnaire reveals important aspects and possible disturbances of the body image which could be negative predictors concerning the result. On the other hand, it represents an appropriate tool to assess the postoperative outcome. The aim of this study was to determine the validity, reliability and responsiveness of the adapted German version of the OAR (D-OAR). The adaption of the OAR to German language was performed by a forward and backward translation process. Patients undergoing rhinoplasty were asked to complete the D-OAR preoperatively, 1, 3 and 12 months after procedure and healthy volunteers without any nasal complaints served as controls to test validity, reliability and responsiveness. An excellent internal consistency, a good test-retest reliability and good inter-item and item-total correlations demonstrated a good reliability of the D-OAR. The convincing validity of the adapted version was proven by an excellent discriminant and a sufficient content validity. Significant differences between pre- and postoperative D-OAR scores revealed a good responsiveness of the instrument. Hence, with a sufficient validity, reliability and sensitivity to changes, the D-OAR is a short and helpful instrument to assess the subjective perception of the nasal appearance in German patients.
Uysal, Hilal; Ozcan, Şeyda
Many new measuring devices have been developed so that broader psychometric measurements in the coronary artery disease, disease-specific health status measurements, and identification of the broader quality of life can be performed in the recent years. The study was intended to determine whether, and to what extent, MIDAS is a valid and reliable measurement to the patients suffering from myocardial infarction for the first time in Turkey. The research was conducted with the patients hospitalized and treated with myocardial infarction in the cardiology departments of 2 hospitals in Istanbul, Turkey, between 2007 and 2008. Psychometric evaluations of TR-MIDAS were used for validity studies; language validity, content validity, construct validity were examined. For reliability studies; the tool's internal consistency reliability, Cronbach's alpha reliability coefficient, and test-retest reliability were completed. The instrument's content validity index was determined to be "0.95". Principal component analysis revealed six factors with an eigenvalue >1.5. Cronbach's alpha was found to be 0.89 for total scale which was an acceptable value. The total's test-retest reliability was 0.51 (p<0.01). Data obtained at the end of the study supports that Turkish Myocardial Infarction Dimensional Assessment Scale is a valid and reliable instrument as a disease-specific scale to assess the patients' quality of life suffering from myocardial infarction in Turkey. Copyright © 2010 European Society of Cardiology. Published by Elsevier B.V. All rights reserved.
Gilam, Gadi; Abend, Rany; Shani, Hagai; Ben-Zion, Ziv; Hendler, Talma
The Ultimatum Game (UG) is a canonical social decision-making task whereby a proposer divides a sum of money between himself and a responder who accepts or rejects the offer. Studies consistently demonstrate that unfair offers induce anger, and that rejecting such offers relates to aggression. Nevertheless, the UG is limited in interpersonal provocations common to real-life experiences of anger. Moreover, the psychometric properties of the UG as an anger-induction paradigm have yet to be evaluated. Here, to induce a more intense and genuine anger experience, we implemented a modified UG whereby short written provocations congruent with unfairness levels accompanied each offer. We aimed to test whether this anger-infused UG led to more anger and aggressive responses relative to the standard UG and to establish the reliability and validity of both versions. Participants performed either the anger-infused UG or a standard version, repeated twice, a week apart. They also performed the Taylor Aggression Paradigm, a reactive aggression paradigm, and completed emotion ratings and a trait anger inventory. Results indicate similar decreases in acceptance rates with increase in offer unfairness, and increases in reported anger, across both UG versions. Both versions demonstrated strong test-retest reliability. However, the anger-infused UG led to significantly stronger relations with reactive aggression and trait anger compared to the standard UG, providing evidence for better validity. The development of the anger-infused UG as a reliable and valid paradigm is pivotal for the induction and assessment of interpersonal anger and its aggressive expression in basic and clinical research settings. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Härdén, Marie; Nyström, Britta; Kulich, Károly; Carlsson, Jonas; Bengtson, Ann; Edvardsson, Nils
Symptoms related to atrial fibrillation and their impact on health-related quality of life (HRQoL) are often evaluated in clinical trials. However, there remains a need for a properly validated instrument. We aimed to develop and validate a short symptoms scale for patients with AF. One hundred and eleven patients with a variety of symptoms related to AF were scheduled for DC cardioversion. The mean age was 67.1 +/- 12.1 years, and 80% were men. The patients completed the new symptoms scale, the Toronto Symptoms Check List (SCL) and the generic Short Form 36 (SF-36) the day before the planned DC cardioversion. Compliance was excellent, with only 1 of 666 answers missing. One item, 'limitations in working capability', was deleted because of a low numerical response rate, as many of the patients were retired. The internal consistency reliability of the remaining six items was 0.81 (Cronbach's alpha). Patients scored highest in the items of 'dyspnoea on exertion', 'limitations in daily life due to AF' and 'fatigue due to AF', with scores of 4.5, 3.3 and 4.5, respectively. There was a good correlation to all relevant SF-36 domains and to the relevant questions of the SCL. The Rasch analyses showed that the items are unidimensional and that they are clearly separated and cover an adequate range. Test-retest reliability was performed in patients who failed DC and was adequate for three of six items, > 0.70. The psychometric characteristics of the new short symptoms scale were found to have satisfactory reliability and validity.
Full Text Available Abstract Background Symptoms related to atrial fibrillation and their impact on health-related quality of life (HRQoL are often evaluated in clinical trials. However, there remains a need for a properly validated instrument. We aimed to develop and validate a short symptoms scale for patients with AF. Methods One hundred and eleven patients with a variety of symptoms related to AF were scheduled for DC cardioversion. The mean age was 67.1 ± 12.1 years, and 80% were men. The patients completed the new symptoms scale, the Toronto Symptoms Check List (SCL and the generic Short Form 36 (SF-36 the day before the planned DC cardioversion. Compliance was excellent, with only 1 of 666 answers missing. Results One item, 'limitations in working capability', was deleted because of a low numerical response rate, as many of the patients were retired. The internal consistency reliability of the remaining six items was 0.81 (Cronbach's α. Patients scored highest in the items of 'dyspnoea on exertion', 'limitations in daily life due to AF' and 'fatigue due to AF', with scores of 4.5, 3.3 and 4.5, respectively. There was a good correlation to all relevant SF-36 domains and to the relevant questions of the SCL. The Rasch analyses showed that the items are unidimensional and that they are clearly separated and cover an adequate range. Test-retest reliability was performed in patients who failed DC and was adequate for three of six items, >0.70. Conclusion The psychometric characteristics of the new short symptoms scale were found to have satisfactory reliability and validity.
Mohan, Arjun; Sethi, Sanjay
Despite the increasing awareness of their pathogenesis and clinical consequences, research on and clinical management of acute exacerbations of chronic obstructive lung disease (AECOPDs) have been hindered by the lack of a consistent and reliable definition. Symptom-based definitions of exacerbations are sensitive to events and account for unreported exacerbations. Event (healthcare utilization)-based definitions are somewhat more definitive but miss unreported events. Objective quantification of symptoms in AECOPD is now possible with the development of the Exacerbations of Chronic Obstructive Pulmonary Disease Tool (EXACT-PRO), a patient-reported outcome (PRO) measure. Several studies have revealed that unreported AECOPDs are more frequent than reported events and are associated with long-term adverse consequences. New antibiotic development for AECOPD has been hampered by the lack of validated measures for resolution of exacerbations. As a result of these observations, a unique collaborative effort between academia, industry and regulatory agencies resulted in the development of the EXACT-PRO. It consists of 14 questions that generate a score between 0 and 100, and it has been shown to have excellent reliability and validity. In the absence of a reliable biomarker, the definition and measurement of exacerbations has been subjective and imprecise. PRO measures such as EXACT can provide much needed objectivity in assessing symptom-defined exacerbations, which may translate into a uniform outcome measure in clinical trials. With further development and validation, it may have a role in clinical practice in the earlier detection of exacerbations, stratification of an exacerbation severity and the assessment of clinical response to treatment.
Skinner, T. C.; Howells, L.; Greene, S.
Aims: This article reports on the development and validity of a Diabetes-specific Illness Representations Questionnaire (DIRQ) to assess all five dimensions of an individual's perception of diabetes, for adolescents with Type 1 diabetes mellitus. Methods: There were two development studies. Study 1...... with a diabetes self-efficacy and barriers to adherence questionnaire. Subsequently there were two validation studies. Study 3: participants (n = 44 adolescents and 28 parents) completed the DIRQ and questionnaires assessing their self-care and psychological well-being. Glycaemic control was assessed through...... consist of two subscales, perceived threat and perceived impact, and provide further support for the distinction between treatment effectiveness to control diabetes and treatment effectiveness to prevent complications. Along with the validation studies, the results indicate that the questionnaire scales...
Bech, B; Lönn, L; Falkenberg, M
Objectives To study the construct validity and reliability of a novel endovascular global rating scale, Structured Assessment of endoVascular Expertise (SAVE). Design A Clinical, experimental study. Materials Twenty physicians with endovascular experiences ranging from complete novices to highly....... Validity was analysed by correlating experience with performance results. Reliability was analysed according to generalisability theory. Results The mean score on the 29 items of the SAVE scale correlated well with clinical experience (R = 0.84, P ... with clinical experience (R = -0.53, P validity and reliability of assessment with the SAVE scale was high when applied to performances in a simulation setting with advanced realism. No ceiling effect...
Sharma, Sonia; Crow, Heidi C; McCall, W D; Gonzalez, Yoly M
To conduct a systematic review of papers reporting the reliability and diagnostic validity of the joint vibration analysis (JVA) for diagnosis of temporomandibular disorders (TMD). A search of Pubmed identified English-language publications of the reliability and diagnostic validity of the JVA. Guidelines were adapted from applied STAndards for the Reporting of Diagnostic accuracy studies (STARD) to evaluate the publications. Fifteen publications were included in this review, each of which presented methodological limitations. This literature is unable to provide evidence to support the reliability and diagnostic validity of the JVA for diagnosis of TMD.
Turel, Yalin Kilic
The interactive whiteboard (IWB) has become a popular technology for instructors over the last decade. Though research asserts that the IWBs facilitate learning in different ways, there is a lack of studies examining actual IWB use in classroom settings based on learners' perspectives by means of valid instruments. The purpose of this study is to…
Ilker, Gokce Erturan; Arslan, Yunus; Demirhan, Giyasettin
The Trichotomous Achievement Goal Scale was developed by Agbuga and Xiang (2008) by including selected items from the scales of Duda and Nicholls (1992), Elliot (1999), and Elliot and Church (1997) and adapting them into Turkish. The scale consists of 18 items, and students rated each item on a 7-point Likert scale. To ascertain the validity and…
Kane, Michael; Case, Susan
The scores on two distinct tests (e.g., essay and objective) are often combined into a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to a separate criterion. In cases where no criterion is available, the observed composite has generally been evaluated in terms of its…
Yehya, Arij; Ghuloum, Suhaila; Mahfoud, Ziyad; Opler, Mark; Khan, Anzalee; Hammoudeh, Samer; Abdulhakam, Abdulmoneim; Al-Mujalli, Azza; Hani, Yahya; Elsherbiny, Reem; Al-Amin, Hassen
The Positive and Negative Syndrome Scale (PANSS) is widely used for patients with schizophrenia. This scale is reliable and valid. The PANSS was translated and validated in several languages. The aim of this study was to translate and validate the PANSS in the Arab population. The PANSS was translated into formal Arabic language using the back-translation method. 101 Arab patients with schizophrenia and 98 Arabs with no diagnosis of any mental disorder were recruited. The Arabic version of the Mini International Neuropsychiatric Interview (MINI-6) was used as a diagnostic tool to confirm the diagnosis of schizophrenia or rule out any diagnosis for the healthy control group. Reliability of the scale was assessed by calculating internal consistency, interrater reliability and test-retest reliability. Construct validity was assessed using the Arabic version of the MINI-6. PANSS total scores were correlated with the Clinical Global Impression-Severity scale. Our findings showed that the internal consistency was good (0.92). Scores on the PANSS of the patients were much higher than those of the healthy controls. The PANSS showed good interrater reliability and test-retest reliability (0.92 and 0.75, respectively). In comparison with the MINI-6, the PANSS showed good sensitivity and specificity, which implies good construct validity of this version. In conclusion, the Arabic version of the PANSS is a reliable and valid instrument for the assessment of patients with schizophrenia in the Arab population. © 2016 S. Karger AG, Basel.
The 133-item Emotional Quotient Inventory (EQ-i designed by Bar-On (1997 and translated by Dehshiry (2003 was revised and modified by removing the response validity items, changing reverse indicators into positively worded statements and revising the remaining 117 Persian indicators on the basis of schema theory. It was administered to 669 instructors most of whom were teaching English as a foreign language (EFL in national branches of Iran Language Institute in 15 cities. The application of the Principal Axis Factoring to the data and rotating the extracted factors via Varimax with Kaiser Normalization yielded 15 latent variables (LVs called Humanistic, Self-Satisfying, Self-Confident, Self-Aware, Self-Controlled, Research-Oriented, Content, Sociable, Empathetic, Tolerant, Flexible, Realistic, Independent, Emotional and Happy in this study. Not only did the modified Persian EQ-I proved to be more reliable than its original version, but also its thirteen LVs reached very high and acceptable levels of reliability. With the exception of the last, all the LVs also correlated significantly with each other and thus established the EI as a multifactorial construct whose constituting LVs are closely related to each other. The findings question correlating the so-called competences of EI and offer employing the factorially valid LVs as the best factors to explore the relationship between EI and variables involved in teaching and learning EFL.
Oikonomidi, Theodora; Vikelis, Michail; Artemiadis, Artemios; Chrousos, George P; Darviri, Christina
The Migraine Disability Assessment (MIDAS) Questionnaire is a reliable and valid instrument for migraine-related disability. Such a tool is needed to quantify migraine-related disability in the Greek population. This validation study aims to assess the test-retest reliability, internal consistency, item discriminant and convergent validity of the Greek translation of the MIDAS. Adults diagnosed with migraine completed the MIDAS Questionnaire on two occasions 3 weeks apart to assess reliability, and completed the RAND-36 to assess validity. Participants (n = 152) had a median MIDAS score of 24 and mostly severe disability (58% were grade IV). The test-retest reliability analysis (N = 59) revealed excellent reliability for the total score. Internal consistency was α = 0.71 for initial and α = 0.82 for retest completion. For item discriminant validity, the correlations between each question and the total score were significant, with high correlations for questions 2-5 (range 0.67 ≤ r ≤ 0.79; p MIDAS score tended to have better wellbeing. Psychometric properties are comparable with those of other published validation studies of the MIDAS and the original. Findings on question 1 show that missing work/school days may be closely related with increased affect issues. The Greek version of the MIDAS Questionnaire has good reliability and validity. This study allowed for cross-cultural comparability of research findings.
Patrícia Pinto Fonseca
Full Text Available INTRODUCTION: Patients' perception about their health condition, mainly involving chronic diseases, has been investigated in many studies and it has been associated to depression, compliance with the treatment, quality of life and prognosis. The Illness Effects Questionnaire (IEQ is a tool which makes the standardized evaluation of patients' perception about their illness possible, so that it is brief and accessible to the different clinical settings. This work aims to begin the transcultural adaptation of the IEQ to Brazil through the validated translation and the reliability study. METHODS: The back-translation method and the test-retest reliability study were used in a sample of 30 adult patients under chronic hemodialysis. The reliability indexes were estimated using the Pearson, Spearman, Weighted Kappa and Cronbach's alpha coefficients. RESULTS: The semantic equivalence was reached through the validated translation. In this study, the reliability indexes obtained were respectively: 0.85 and 0.75 (p < 0.001; 0.68 and 0.92 (p < 0.0001. DISCUSSION: The reliability indexes obtained attest to the stability of responses in both evaluations. Additional procedures are necessary for the transcultural adaptation of the IEQ to be complete. CONCLUSION: The results indicate the translation validity and the reliability of the Brazilian version of the IEQ for the sample studied.
Dueñas, María; Mendonça, Liliane; Sampaio, Rute; Gouvinhas, Cláudia; Oliveira, Daniela; Castro-Lopes, José Manuel; Azevedo, Luís Filipe
The Bowel Function Index (BFI) is a simple and sound bowel function and opioid-induced constipation (OIC) screening tool. We aimed to develop the translation and cultural adaptation of this measure (BFI-P) and to assess its reliability and validity for the Portuguese language and a chronic pain population. The BFI-P was created after a process including translation, back translation and cultural adaptation. Participants (n = 226) were recruited in a chronic pain clinic and were assessed at baseline and after one week. Internal consistency, test-retest reliability, responsiveness, construct (convergent and known groups) and factorial validity were assessed. Test-retest reliability had an intra-class correlation of 0.605 for BFI mean score. Internal consistency of BFI had Cronbach's alpha of 0.865. The construct validity of BFI-P was shown to be excellent and the exploratory factor analysis confirmed its unidimensional structure. The responsiveness of BFI-P was excellent, with a suggested 17-19 point and 8-12 point change in score constituting a clinically relevant change in constipation for patients with and without previous constipation, respectively. This study had some limitations, namely, the criterion validity of BFI-P was not directly assessed; and the absence of a direct criterion for OIC precluded the assessment of the criterion based responsiveness of BFI-P. Nevertheless, BFI may importantly contribute to better OIC screening and its Portuguese version (BFI-P) has been shown to have excellent reliability, internal consistency, validity and responsiveness. Further suggestions regarding statistically and clinically important change cut-offs for this instrument are presented.
Lobban, Fiona; Solis-Trapala, Ivonne; Symes, Wendy; Morriss, Richard
Recognising early warning signs (EWS) of mood changes is a key part of many effective interventions for people with Bipolar Disorder (BD). This study describes the development of valid and reliable checklists required to assess these signs of depression and mania. Checklists of EWS based on previous research and participant feedback were designed for depression and mania and compared with spontaneous reporting of EWS. Psychometric properties and utility were examined in 96 participants with BD. The majority of participants did not spontaneously monitor EWS regularly prior to use of the checklists. The checklists identified most spontaneously generated EWS and led to a ten fold increase in the identification of EWS for depression and an eight fold increase for mania. The scales were generally reliable over time and responses were not associated with current mood. Frequency of monitoring for EWS correlated positively with social and occupational functioning for depression (beta=3.80, p=0.015) and mania (beta=3.92, p=0.008). The study is limited by a small sample size and the fact that raters were not blind to measures of mood and function. EWS checklists are useful and reliable clinical and research tools helping to generate enough EWS for an effective EWS intervention. Copyright © 2011 Elsevier B.V. All rights reserved.
Ringsted, C; Lippert, F; Hesselfeldt, R
Cardiac Arrest Simulation Test (CASTest) scenarios for the assessments according to guidelines 2005. AIMS: To analyse the reliability and validity of the individual sub-tests provided by ERC and to find a combination of MCQ and CASTest that provides a reliable and valid single effect measure of ALS...... that possessed high reliability, equality of test sets, and ability to discriminate between the two groups of supposedly different ALS competence. CONCLUSIONS: ERC sub-tests of ALS competence possess sufficient reliability and validity. A combined ALS score with equal weighting of one MCQ and one CASTest can...... competence. METHODS: Two groups of participants were included in this randomised, controlled experimental study: a group of newly graduated doctors, who had not taken the ALS course (N=17) and a group of students, who had passed the ALS course 9 months before the study (N=16). Reliability in terms of inter...
Rikkert, Marcel G M Olde; Tona, Klodiana Daphne; Janssen, Lieneke
New staging systems of dementia require adaptation of disease management programs and adequate staging instruments. Therefore, we systematically reviewed the literature on validity and reliability of clinically applicable, multidomain, and dementia staging instruments. A total of 23 articles...
M. Reijman (Max); J.M.W. Hazes (Mieke); H.A.P. Pols (Huib); R.M.D. Bernsen (Roos); B.W. Koes (Bart); S.M. Bierma-Zeinstra (Sita)
textabstractOBJECTIVES: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the
Chorong Park, MSN, RN
Conclusion: The K-HES had acceptable validity and reliability. The brevity and ease of administration of the K-HES makes it a suitable tool for evaluating empowerment-based education programs targeted towards older populations.
Betül Tosun, RN, PhD
Conclusions: The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems.
Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe
The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis.
Boer, Y.A. de; Ende, C.H.M. van den; Eygendaal, D.; Jolie, I.M.M.; Hazes, J.M.W.; Rozing, P.M.
OBJECTIVES: (1) To investigate the measurement characteristics of the Hospital for Special Surgery (HSS) and Mayo Clinic elbow assessment instruments, utilizing methodological criteria including feasibility, reliability, validity, and discriminative ability; and (2) to develop an efficient and
knowledge-dietary behaviour relationship require use of valid and reliable knowledge .... Which of the following beverages has the lowest energy content per cup (250 ml)?b .... Diploma (ND): Consumer Science: Food and Nutrition together.
Gamze Sarikoc, PhD, RN
Conclusion: Results showed that the SNSI had a satisfactory level of reliability and validity in nursing students in Turkey. Multicenter studies including nursing students from different nursing schools are recommended for the SNSI to be generalized.
McMullen, Tara; Resnick, Barbara
To establish the reliability and validity of the Rosenberg Self-Esteem Scale (RSES) when used with nursing assistants (NAs). Testing the RSES used baseline data from a randomized controlled trial testing the Res-Care Intervention. Female NAs were recruited from nursing homes (n = 508). Validity testing for the positive and negative subscales of the RSES was based on confirmatory factor analysis (CFA) using structural equation modeling and Rasch analysis. Estimates of reliability were based on Rasch analysis and the person separation index. Evidence supports the reliability and validity of the RSES in NAs although we recommend minor revisions to the measure for subsequent use. Establishing reliable and valid measures of self-esteem in NAs will facilitate testing of interventions to strengthen workplace self-esteem, job satisfaction, and retention.
Mazaheri, Maryam Amidi; Karbasi, Mojtaba
Background: With regard to large number of mobile users especially among college students in Iran, addiction to mobile phone is attracting increasing concern. There is an urgent need for reliable and valid instrument to measure this phenomenon. This study examines validity and reliability of the Persian version of mobile phone addiction scale (MPAIS) in college students. Materials and Methods: this methodological study was down in Isfahan University of Medical Sciences. One thousand one hundr...
Reijman, Max; Hazes, Mieke; Pols, Huib; Bernsen, Roos; Koes, Bart; Bierma-Zeinstra, Sita
textabstractOBJECTIVES: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the validity of the three definitions of hip OA is sex dependent. METHODS: SUBJECTS: from the Rotterdam study (aged > or= 55 years, n = 3585) were evaluated. The inter-rater reliability was tested in a random set of 148 x rays. ...
Li, Fengzhi; Li, Changji; Long, Yunfang; Zhan, Chenglie; Hennessy, Dwight
The present research was designed to examine the psychometric properties of Chinese versions of the Self Report Driver Behavior Aggression and Assertiveness subscales, the Driving Vengeance Questionnaire, and the Violent Driving Questionnaire. Study 1 found that the all scales demonstrated good internal consistency, with alphas ranging from .76 to .87 and that assertive driving was related to demerit points received over the past 12 months while driver aggression and violence were linked to collisions over the past 12 months. Study 2 found that the scales exhibited reasonable test-retest reliability, with correlations ranging from .82 to .89. Finally, Study 3 showed that each scale was predicted by other dangerous driving attitudes and behaviors, similar to the original versions. The consistency between the translated and original scales, the implications for use in a Chinese sample, and the uniformity of actions in the traffic environment across cultures are discussed.
Conclusion: Considering that Validity and Reliability factors of the questionnaire were be appropriate, it can be recommended that NIOSH Generic Job Stress Questionnaire (GJSQ can be used as a Valid and Reliable questionnaire for job stress evaluation in Iran.
Schoppen, Tanneke; Boonstra, Antje; Groothoff, JW; de Vries, J; Goeken, LNH; Eisma, Willem
Objective: To determine the interrater and interrater reliability and the validity of the Timed "up and go" test as a measure for physical mobility in elderly patients with an amputation of the lower extremity. Design: To test interrater reliability, the test was performed for two observers at
Michailov, Michail Lubomirov; Baláš, Jirí; Tanev, Stoyan Kolev; Andonov, Hristo Stoyanov; Kodejška, Jan; Brown, Lee
Purpose: An advanced system for the assessment of climbing-specific performance was developed and used to: (a) investigate the effect of arm fixation (AF) on construct validity evidence and reliability of climbing-specific finger-strength measurement; (b) assess reliability of finger-strength and endurance measurements; and (c) evaluate the…
The Rey Visual Design Learning Test (Rey, 1964, in Spreen & Strauss, 1991) assesses immediate memory span, new learning and recognition for non-verbal material. Three studies are presented that focused on the reliability and validity of the RVDLT in primary school children. Test-retest reliability
Rae, James R.; Olson, Kristina R.
The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Brody, Michelle L.; And Others
Examined reliability and validity of binge eating disorder (BED), proposed for inclusion in Diagnostic and Statistical Manual of Mental Disorders (DSM), fourth edition. Interrater reliability of BED diagnosis compared favorably with that of most diagnoses in DSM revised third edition. Study comparing obese individuals with and without BED and…
Conclusion: The tool designed to assess bag-mask ventilation and tracheal intubation skills in anesthesia trainees demonstrated excellent inter-rater reliability, fair test-retest reliability, and good construct validity. The authors recommend its use for formative and summative assessment of junior anesthesia trainees.
Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth
The aim of the study was to explore the validity of the social-moral awareness test (SMAT) a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor structure, reliability and discriminant validity were also assessed. Seventy-one participants with mild-moderate learning disabilities completed the two scales of the SMAT and two False Belief Tasks for Theory of Mind. Reliability of the SMAT was very good, and the scales were shown to be uni-dimensional in factor structure. There was a significant positive relationship between Theory of Mind and both SMAT scales. There is early evidence of the construct validity and reliability of the SMAT. Further assessment of the validity of the SMAT will be required. © 2012 Blackwell Publishing Ltd.
Francine Guimarães Gonçalves
Full Text Available Abstract The Revised Olweus Bully/Victim Questionnaire (OBVQ is among the few bullying assessment instruments with well-established psychometric properties in different countries. Nevertheless, the psychometric properties of the Brazilian version (Questionário de Bullying de Olweus - QBO have not been determined. We aimed at verifying the construct validity and reliability of the bully and victim scales of the QBO. To achieve that goal, the victim and bully scales were assessed using polytomous item response theory (IRT. The best fit was obtained with a generalized partial credit model that is capable of measuring the specific discriminating power for each item in these scales. The QBO was administered to 703 public school students (mean age: 13 years; standard deviation = 1.58. Based on IRT analysis, the number of response categories in each item was reduced from four to three. Cronbach reliability scores were satisfactory: α = 0.85 (victim scale and α = 0.87 (bully scale. In this study, hurtful comments, persecution, or threats had high power to discriminate victims and bullies. For both QBO scales, higher severity parameters were observed for direct bullying items. The results also show that the construct of both QBO scales measures the same construct proposed for the overall instrument. Thus, the QBO can be administered to different Brazilian populations to assess the main characteristics of bullying: repetition of behavior over time and intentionally acting to humiliate, threaten, or harm somebody.
Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2 for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
Wickramasinghe, Nuwan Darshana; Dissanayake, Devani Sakunthala; Abeywardena, Gihan Sajiwa
The present study was aimed at assessing the validity and the reliability of the Sinhala version of the Utrecht Work Engagement Scale-Student Version (UWES-S) among collegiate cycle students in Sri Lanka. The 17-item UWES-S was translated to Sinhala and the judgmental validity was assessed by a multi-disciplinary panel of experts. Construct validity of the UWES-S was appraised by using multi-trait scaling analysis and exploratory factor analysis (EFA) on data obtained from a sample of 194 grade thirteen students in the Kurunegala district, Sri Lanka. Reliability of the UWES-S was assessed by using internal consistency and test-retest reliability. Except for item 13, all other items showed good psychometric properties in judgemental validity, item-convergent validity and item-discriminant validity. EFA using principal component analysis with Oblimin rotation, suggested a three-factor solution (including vigor, dedication and absorption subscales) explaining 65.4% of the total variance for the 16-item UWES-S (with item 13 deleted). All three subscales show high internal consistency with Cronbach's α coefficient values of 0.867, 0.819, and 0.903 and test-retest reliability was high (p valid and a reliable instrument to assess work engagement among collegiate cycle students in Sri Lanka.
Braun, Tobias; Marks, Detlef; Thiel, Christian; Grüneberg, Christian
To establish the validity and reliability of the de Morton Mobility Index (DEMMI) in patients with sub-acute stroke. This cross-sectional study was performed in a neurological rehabilitation hospital. We assessed unidimensionality, construct validity, internal consistency reliability, inter-rater reliability, minimal detectable change and possible floor and ceiling effects of the DEMMI in adult patients with sub-acute stroke. The study included a total sample of 121 patients with sub-acute stroke. We analysed validity (n = 109) and reliability (n = 51) in two sub-samples. Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 12.37, p = 0.577). All hypotheses on construct validity were confirmed. Internal consistency reliability (Cronbach's alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.95; 95% confidence interval: 0.92-0.97) were excellent. The minimal detectable change with 90% confidence was 13 points. No floor or ceiling effects were evident. These results indicate unidimensionality, sufficient internal consistency reliability, inter-rater reliability, and construct validity of the DEMMI in patients with a sub-acute stroke. Advantages of the DEMMI in clinical application are the short administration time, no need for special equipment and interval level data. The de Morton Mobility Index, therefore, may be a useful performance-based bedside test to measure mobility in individuals with a sub-acute stroke across the whole mobility spectrum. Implications for Rehabilitation The de Morton Mobility Index (DEMMI) is an unidimensional measurement instrument of mobility in individuals with sub-acute stroke. The DEMMI has excellent internal consistency and inter-rater reliability, and sufficient construct validity. The minimal detectable change of the DEMMI with 90% confidence in stroke rehabilitation is 13 points. The lack of any floor or ceiling effects on hospital admission indicates
Tangney, June Price; Stuewig, Jeffrey; Furukawa, Emi; Kopelovich, Sarah; Meyer, Patrick; Cosby, Brandon
Theory, research, and clinical reports suggest that moral cognitions play a role in initiating and sustaining criminal behavior. The 25 item Criminogenic Cognitions Scale (CCS) was designed to tap 5 dimensions: Notions of entitlement; Failure to Accept Responsibility; Short-Term Orientation; Insensitivity to Impact of Crime; and Negative Attitudes Toward Authority. Results from 552 jail inmates support the reliability, validity, and predictive utility of the measure. The CCS was linked to criminal justice system involvement, self-report measures of aggression, impulsivity, and lack of empathy. Additionally, the CCS was associated with violent criminal history, antisocial personality, and clinicians' ratings of risk for future violence and psychopathy (PCL:SV). Furthermore, criminogenic thinking upon incarceration predicted subsequent official reports of inmate misconduct during incarceration. CCS scores varied somewhat by gender and race. Research and applied uses of CCS are discussed.
Rabbani, Alireza; Kargarfard, Mehdi; Twist, Craig
Rabbani, A, Kargarfard, M, and Twist, C. Reliability and validity of a submaximal warm-up test for monitoring training status in professional soccer players. J Strength Cond Res 32(2): 326-333, 2018-Two studies were conducted to assess the reliability and validity of a submaximal warm-up test (SWT) in professional soccer players. For the reliability study, 12 male players performed an SWT over 3 trials, with 1 week between trials. For the validity study, 14 players of the same team performed an SWT and a 30-15 intermittent fitness test (30-15IFT) 7 days apart. Week-to-week reliability in selected heart rate (HR) responses (exercise heart rate [HRex], heart rate recovery [HRR] expressed as the number of beats recovered within 1 minute [HRR60s], and HRR expressed as the mean HR during 1 minute [HRpost1]) was determined using the intraclass correlation coefficient (ICC) and typical error of measurement expressed as coefficient of variation (CV). The relationships between HR measures derived from the SWT and the maximal speed reached at the 30-15IFT (VIFT) were used to assess validity. The range for ICC and CV values was 0.83-0.95 and 1.4-7.0% in all HR measures, respectively, with the HRex as the most reliable HR measure of the SWT. Inverse large (r = -0.50 and 90% confidence limits [CLs] [-0.78 to -0.06]) and very large (r = -0.76 and CL, -0.90 to -0.45) relationships were observed between HRex and HRpost1 with VIFT in relative (expressed as the % of maximal HR) measures, respectively. The SWT is a reliable and valid submaximal test to monitor high-intensity intermittent running fitness in professional soccer players. In addition, the test's short duration (5 minutes) and simplicity mean that it can be used regularly to assess training status in high-level soccer players.
Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D
Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data was retrieved from two previous studies, including forced expiratory volume in 1-s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which have not been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh
The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at University Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts were identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regards to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass coefficient correlation range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
Suzuki, Eiko; Kanoya, Yuka; Katsuki, Takeshi; Sato, Chifumi
To verify the reliability and validity of a Japanese version of the Rathus Assertiveness Schedule in novice nurses to contribute to nursing management. An adequate scale is needed to measure the assertiveness and the effect of assertion training for Japanese nurses and to compare them with those in other countries. Rathus Assertiveness Schedule was adapted to Japanese with back-translation and its validity was examined in 989 novice nurses. The Japanese version showed a high coefficient of reliability in a split-half reliability test (r=0.76; PAssertiveness Schedule. The Japanese version of Rathus Assertiveness Schedule was verified.
So, Hyang Sook; Chae, Myeong Jeong; Kim, Hye Young
In this study the reliability and validity of the Korean version of the Cancer Stigma Scale (KCSS) was evaluated. The KCSS was formed through translation and modification of Cataldo Lung Cancer Stigma Scale. The KCSS, Psychological Symptom Inventory (PSI), and European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 30 (EORTC QLQ-C30) were administered to 247 men and women diagnosed with one of the five major cancers. Construct validity, item convergent and discriminant validity, concurrent validity, known-group validity, and internal consistency reliability of the KCSS were evaluated. Exploratory factor analysis supported the construct validity with a six-factor solution; that explained 65.7% of the total variance. The six-factor model was validated by confirmatory factor analysis (Q (χ²/df)= 2.28, GFI=.84, AGFI=.81, NFI=.80, TLI=.86, RMR=.03, and RMSEA=.07). Concurrent validity was demonstrated with the QLQ-C30 (global: r=-.44; functional: r=-.19; symptom: r=.42). The KCSS had known-group validity. Cronbach's alpha coefficient for the 24 items was .89. The results of this study suggest that the 24-item KCSS has relatively acceptable reliability and validity and can be used in clinical research to assess cancer stigma and its impacts on health-related quality of life in Korean cancer patients. © 2017 Korean Society of Nursing Science
Renée van der Leeuw
Full Text Available BACKGROUND: The importance of effective clinical teaching for the quality of future patient care is globally understood. Due to recent changes in graduate medical education, new tools are needed to provide faculty with reliable and individualized feedback on their teaching qualities. This study validates two instruments underlying the System for Evaluation of Teaching Qualities (SETQ aimed at measuring and improving the teaching qualities of obstetrics and gynecology faculty. METHODS AND FINDINGS: This cross-sectional multi-center questionnaire study was set in seven general teaching hospitals and two academic medical centers in the Netherlands. Seventy-seven residents and 114 faculty were invited to complete the SETQ instruments in the duration of one month from September 2008 to September 2009. To assess reliability and validity of the instruments, we used exploratory factor analysis, inter-item correlation, reliability coefficient alpha and inter-scale correlations. We also compared composite scales from factor analysis to global ratings. Finally, the number of residents' evaluations needed per faculty for reliable assessments was calculated. A total of 613 evaluations were completed by 66 residents (85.7% response rate. 99 faculty (86.8% response rate participated in self-evaluation. Factor analysis yielded five scales with high reliability (Cronbach's alpha for residents' and faculty: learning climate (0.86 and 0.75, professional attitude (0.89 and 0.81, communication of learning goals (0.89 and 0.82, evaluation of residents (0.87 and 0.79 and feedback (0.87 and 0.86. Item-total, inter-scale and scale-global rating correlation coefficients were significant (P<0.01. Four to six residents' evaluations are needed per faculty (reliability coefficient 0.60-0.80. CONCLUSIONS: Both SETQ instruments were found reliable and valid for evaluating teaching qualities of obstetrics and gynecology faculty. Future research should examine improvement of
Dere, Zeynep; Ömeroglu, Esra
This study, Creative Behavior Observation Form was developed to assess creativity of the children. While the study group on the reliability and validity of Creative Behavior Observation Form was being developed, 257 children in total who were at the ages of 5-6 were used as samples with stratified sampling method. Content Validity Index (CVI) and…
Yirci, Ramazan; Karakose, Turgut; Uygun, Harun; Ozdemir, Tuncay Yavuz
The purpose of this study is to adapt the Mentoring Relationship Effectiveness Scale to Turkish, and to conduct validity and reliability tests regarding the scale. The study group consisted of 156 university science students receiving graduate education. Construct validity and factor structure of the scale was analyzed first through exploratory…
Ng, Petrus; Su, Xiqing Susan; Chan, Vivien; Leung, Heidi; Cheung, Wendy; Tsun, Angela
This study validated a Perceived Campus Caring Scale with 1,520 university students. Using factor analysis, seven factors namely, Faculty Support, Nonfaculty Support, Peer Relationship, Sense of Detachment, Sense of Belonging, Caring Attitude, and Campus Involvement, are identified with high reliability, validity, and close correlation with the…
Biasutti, Michele; Frate, Sara
This article describes the development and validation of the Attitudes toward Sustainable Development scale, a quantitative 20-item scale that measures Italian university students' attitudes toward sustainable development. A total of 484 undergraduate students completed the questionnaire. The validity and reliability of the scale was statistically…
Vanbellingen, Tim; Nyffeler, Thomas; Nef, Tobias; Kwakkel, Gert; Bohlhalter, Stephan; van Wegen, Erwin E.H.
Background Patients with Parkinson's disease exhibit disturbed dexterity. Validated self-reported outcomes for dexterity in Parkinson's disease are lacking. The aim of this study was to investigate the reliability, content and construct validity of a new Dexterity Questionnaire 24. Methods One
M. Reijman (Max); J.M.W. Hazes (Mieke); H.A.P. Pols (Huib); R.M.D. Bernsen (Roos); B.W. Koes (Bart); S.M. Bierma-Zeinstra (Sita)
textabstractObjectives: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the validity of the three
Objective. We sought to determine the validity and reliability of a self-report physical activity questionnaire (PAQ) measuring physical activity/inactivity in South African schoolgirls of different ethnic origins. Methods. Construct validity of the PAQ was tested against physical activity energy expenditure estimated from an ...
Watt, Torquil; Hegedus, Laszlo; Grønvold, Mogens
Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test...
The purpose of this study was to determine the test-retest reliability and concurrent validity of the short form (Form B) of the Coopersmith Self-Esteem Inventory. Criterion measures for validity included: (1) sociometric measures; (2) teacher's popularity ranking; and, (3) self-esteem rating. (Author/LMO)
Pololi, Linda H; Evans, Arthur T; Civian, Janet T; Gibbs, Brian K; Gillum, Linda H; Brennan, Robert T
Despite the well-recognized benefits of mentoring in academic medicine, there is a lack of clarity regarding what constitutes effective mentoring. We developed a tool to assess mentoring activities experienced by faculty and evaluated evidence for its validity. The National Initiative on Gender, Culture, and Leadership in Medicine-"C-Change"-previously developed the C-Change Faculty Survey to assess the culture of academic medicine. After intensive review, we added six items representing six components of mentoring to the survey-receiving help with career and personal goals, learning skills, sponsorship, and resources. We tested the items in four academic health centers during 2013 to 2014. We estimated reliability of the new items and tested the correlation of the new items with a mentoring composite variable representing faculty mentoring experiences as positive, neutral, or inadequate and with other C-Change dimensions of culture. Among the 1520 responding faculty (response rate 61-63%), there was a positive association between each of the six mentoring activities and satisfaction with both the amount and quality of mentoring received. There was no difference by sex. Cronbach α coefficients ranged from 0.89 to 0.95 across subgroups of faculty (by sex, race, and principal roles). The mentoring responses were associated most closely with dimensions of Institutional Support (r = 0.58, P Mentoring scale is a valid instrument to assess mentoring. Survey results could facilitate mentoring program development and evaluation.
Endo, Arisa; Suzuki, Makoto; Akagi, Atsumi; Chiba, Naoyuki; Ishizaka, Ikuyo; Matsunaga, Atsuhiko; Fukuda, Michinari
The purpose of this study was to examine the reliability and validity of the Upper-body Dressing Scale (UBDS) for buttoned shirt dressing, which evaluates the learning process of new component actions of upper-body dressing in patients diagnosed with dementia and hemiparesis. This was a preliminary correlational study of concurrent validity and reliability in which 10 vascular dementia patients with hemiparesis were enrolled and assessed repeatedly by six occupational therapists by means of the UBDS and the dressing item of the Functional Independence Measure (FIM). Intraclass correlation coefficient was 0.97 for intra-rater reliability and 0.99 for inter-rater reliability. The level of correlation between UBDS score and FIM dressing item scores was -0.93. UBDS scores for paralytic hand passed into the sleeve and sleeve pulled up beyond the shoulder joint were worse than the scores for the other components of the task. The UBDS has good reliability and validity for vascular dementia patients with hemiparesis. Further research is needed to investigate the relation between UBDS score and the effect of intervention and to clarify sensitivity or responsiveness of the scale to clinical change. Copyright © 2014 John Wiley & Sons, Ltd.
Van Oyen, Herman; Bogaert, Petronille; Yokota, Renata T C; Berger, Nicolas
GALI or Global Activity Limitation Indicator is a global survey instrument measuring participation restriction. GALI is the measure underlying the European indicator Healthy Life Years (HLY). Gali has a substantial policy use within the EU and its Member States. The objective of current paper is to bring together what is known from published manuscripts on the validity and the reliability of GALI. Following the PRISMA guidelines, two search strategies (PUBMED, Google Scholar) were combined to identify manuscripts published in English with publication date 2000 or beyond. Articles were classified as reliability studies, concurrent or predictive validity studies, in national or international populations. Four cross-sectional studies (of which 2 international) studied how GALI relates to other health measures (concurrent validity). A dose-response effect by GALI severity level on the association with the other health status measures was observed in the national studies. The 2 international studies (SHARE, EHIS) concluded that the odds of reporting participation restriction was higher in subjects with self-reported or observed functional limitations. In SHARE, the size of the Odds Ratio's (ORs) in the different countries was homogeneous, while in EHIS the size of the ORs varied more strongly. For the predictive validity, subjects were followed over time (4 studies of which one international). GALI proved, both in national and international data, to be a consistent predictor of future health outcomes both in terms of mortality and health care expenditure. As predictors of mortality, the two distinct health concepts, self-rated health and GALI, acted independently and complementary of each other. The one reliability study identified reported a sufficient reliability of GALI. GALI as inclusive one question instrument fits all conceptual characteristics specified for a global measure on participation restriction. In none of the studies, included in the review, there was
Zhang, C; Yang, G P; Li, Z; Li, X N; Li, Y; Hu, J; Zhang, F Y; Zhang, X J
Objective: To assess the reliability and validity of the Chinese version on Alcohol Use Disorders Identification Test (AUDIT) among medical students in China and to provide correct way of application on the recommended scales. Methods: An E-questionnaire was developed and sent to medical students in five different colleges. Students were all active volunteers to accept the testings. Cronbach's α and split-half reliability were calculated to evaluate the reliability of AUDIT while content, contract, discriminant and convergent validity were performed to measure the validity of the scales. Results: The overall Cronbach's α of AUDIT was 0.782 and the split-half reliability was 0.711. Data showed that the domain Cronbach's α and split-half reliability were 0.796 and 0.794 for hazardous alcohol use, 0.561 and 0.623 for dependence symptoms, and 0.647 and 0.640 for harmful alcohol use. Results also showed that the content validity index on the levels of items I-CVI) were from 0.83 to 1.00, the content validity index of scale level (S-CVI/UA) was 0.90, content validity index of average scale level (S-CVI/Ave) was 0.99 and the content validity ratios (CVR) were from 0.80 to 1.00. The simplified version of AUDIT supported a presupposed three-factor structure which could explain 61.175% of the total variance revealed through exploratory factor analysis. AUDIT semed to have good convergent and discriminant validity, with the success rate of calibration experiment as 100%. Conclusion: AUDIT showed good reliability and validity among medical students in China thus worth for promotion on its use.
Sejbæk, Tobias; Blaabjerg, Morten; Sprogøe, Pippi
. The Multiple Sclerosis Neuropsychological Screening Questionnaire (MSNQ) has previously shown good validity in American, Argentinean, and Dutch MS cohorts. We sought to test reliability and validity of a Danish translation of the MSNQ compared with formal neuropsychological testing, and measures of depression...... the Expanded Disability Status Scale and MS Impairment Scale. Results: The test-retest reliability of the MSNQ-P was significant (R2 = 0.79, P ... that the MSNQ-P measures these items more than the cognitive abilities of the patients. Conclusions: This study does not support use of the MSNQ as a sensitive or valid screening tool for cognitive impairment in Danish patients with MS....
Jesús F. Salgado
Full Text Available There is criticism in the literature about the use of interrater coefficients to correct for criterion reliability in validity generalization (VG studies and disputing whether .52 is an accurate and non-dubious estimate of interrater reliability of overall job performance (OJP ratings. We present a second-order meta-analysis of three independent meta-analytic studies of the interrater reliability of job performance ratings and make a number of comments and reflections on LeBreton et al.s paper. The results of our meta-analysis indicate that the interrater reliability for a single rater is .52 (k = 66, N = 18,582, SD = .105. Our main conclusions are: (a the value of .52 is an accurate estimate of the interrater reliability of overall job performance for a single rater; (b it is not reasonable to conclude that past VG studies that used .52 as the criterion reliability value have a less than secure statistical foundation; (c based on interrater reliability, test-retest reliability, and coefficient alpha, supervisor ratings are a useful and appropriate measure of job performance and can be confidently used as a criterion; (d validity correction for criterion unreliability has been unanimously recommended by "classical" psychometricians and I/O psychologists as the proper way to estimate predictor validity, and is still recommended at present; (e the substantive contribution of VG procedures to inform HRM practices in organizations should not be lost in these technical points of debate.
Rikkert, Marcel G M Olde; Tona, Klodiana Daphne; Janssen, Lieneke
New staging systems of dementia require adaptation of disease management programs and adequate staging instruments. Therefore, we systematically reviewed the literature on validity and reliability of clinically applicable, multidomain, and dementia staging instruments. A total of 23 articles...... describing 12 staging instruments were identified (N = 6109 participants, age 65-87). Reliability was studied in most (91%) of the articles and was judged moderate to good. Approximately 78% of the articles evaluated concurrent validity, which was good to very good, while discriminant validity was assessed...... in only 25%. The scales can be applied in ±15 minutes. Clinical Dementia Rating (CDR), Global Deterioration scale (GDS), and Functional Assessment Staging (FAST) have been monitored on reliability and validity, and the CDR currently is the best-evidenced scale, also studied in international perspective...
Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara
This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Jillian E. Frideres
Full Text Available The purpose of this study was to design and to test the validity and reliability of an instrument to evaluate coaches' knowledge about the female athlete triad syndrome and their confidence in this knowledge. The instrument collects information regarding: knowledge of the syndrome, components, prevention and intervention; confidence of the coaches in their answers; and coach's characteristics (gender, degree held, years of experience in coaching females, continuing education participation specific to the syndrome and its components, and sport coached. The process of designing the questionnaire and testing the validity and reliability of it was done in four phases: a design and development of the instrument, b content validity, c instrument reliability, and d concurrent validity. The results show that the instrument is suitable for measuring coaches' female athlete triad knowledge. The instrument can contribute to assessing the coaches' knowledge level in relation to this topic.
Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra
The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Eric Swanson, MD
Full Text Available Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results.
Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe
Introduction Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. Objective To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. Method The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. Results The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Conclusion Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results. PMID:29736104
Casartelli, Nicola; Müller, Roland; Maffiuletti, Nicola A
The aim of the present study was to verify the validity and reliability of the Myotest accelerometric system (Myotest SA, Sion, Switzerland) for the assessment of vertical jump height. Forty-four male basketball players (age range: 9-25 years) performed series of squat, countermovement and repeated jumps during 2 identical test sessions separated by 2-15 days. Flight height was simultaneously quantified with the Myotest system and validated photoelectric cells (Optojump). Two calculation methods were used to estimate the jump height from Myotest recordings: flight time (Myotest-T) and vertical takeoff velocity (Myotest-V). Concurrent validity was investigated comparing Myotest-T and Myotest-V to the criterion method (Optojump), and test-retest reliability was also examined. As regards validity, Myotest-T overestimated jumping height compared to Optojump (p 0.98), that is, excellent validity. Myotest-V overestimated jumping height compared to Optojump (p 12 cm), high limits of agreement ratios (>36%), and low ICCs (9 cm). In conclusion, Myotest-T is a valid and reliable method for the assessment of vertical jump height, and its use is legitimate for field-based evaluations, whereas Myotest-V is neither valid nor reliable.
Goossens, Nina; Janssens, Lotte; Pijnenburg, Madelon; Caeyenberghs, Karen; Van Rompuy, Charlotte; Meugens, Paul; Sunaert, Stefan; Brumagne, Simon
Processing proprioceptive information in the brain is essential for optimal postural control and can be studied with proprioceptive stimulation, provided by muscle vibration, during functional magnetic resonance imaging (fMRI). Classic electromagnetic muscle vibrators, however, cannot be used in the high-strength magnetic field of the fMRI scanner. Pneumatic vibrators offer an fMRI-compatible alternative. However, whether these devices produce reliable and valid proprioceptive stimuli has not been investigated, although this is essential for these devices to be used in longitudinal research. Test–retest reliability and concurrent validity of the postural response to muscle vibration, provided by custom-made fMRI-compatible pneumatic vibrators, were assessed in a repeated-measures design. Mean center of pressure (CoP) displacements during, respectively, ankle muscle and back muscle vibration (45–60 Hz, 0.5 mm) provided by an electromagnetic and a pneumatic vibrator were measured in ten young healthy subjects. The test was repeated on the same day and again within one week. Intraclass correlation coefficients (ICC) were calculated to assess (a) intra- and interday reliability of the postural responses to, respectively, pneumatic and electromagnetic vibration, and (b) concurrent validity of the response to pneumatic compared to electromagnetic vibration. Test–retest reliability of mean CoP displacements during pneumatic vibration was good to excellent (ICCs = 0.64–0.90) and resembled that of responses to electromagnetic vibration (ICCs = 0.64–0.94). Concurrent validity of the postural effect of pneumatic vibration was good to excellent (ICCs = 0.63–0.95). In conclusion, the proposed fMRI-compatible pneumatic vibrator can be used with confidence to stimulate muscle spindles during fMRI to study central processing of proprioception.
Li, Hong Shuang
To reduce the computational effort of reliability-based design optimization (RBDO), the response surface method (RSM) has been widely used to evaluate reliability constraints. We propose an efficient methodology for solving RBDO problems based on an improved high order response surface method (HORSM) that takes advantage of an efficient sampling method, Hermite polynomials and uncertainty contribution concept to construct a high order response surface function with cross terms for reliability analysis. The sampling method generates supporting points from Gauss-Hermite quadrature points, which can be used to approximate response surface function without cross terms, to identify the highest order of each random variable and to determine the significant variables connected with point estimate method. The cross terms between two significant random variables are added to the response surface function to improve the approximation accuracy. Integrating the nested strategy, the improved HORSM is explored in solving RBDO problems. Additionally, a sampling based reliability sensitivity analysis method is employed to reduce the computational effort further when design variables are distributional parameters of input random variables. The proposed methodology is applied on two test problems to validate its accuracy and efficiency. The proposed methodology is more efficient than first order reliability method based RBDO and Monte Carlo simulation based RBDO, and enables the use of RBDO as a practical design tool.
Rokotonarivo, Sarobidy; Schaafsma, Marije; Hockley, Neal
reliability measures. DCE results were generally consistent with those of other stated preference techniques (convergent validity), but hypothetical bias was common. Evidence supporting theoretical validity (consistency with assumptions of rational choice theory) was limited. In content validity tests, 2...
Ghaemi, Hamide; Khoddami, Seyyedeh Maryam; Soleymani, Zahra; Zandieh, Fariborz; Jalaie, Shohreh; Ahanchian, Hamid; Khadivi, Ehsan
The aim of this study was to develop, validate, and assess the reliability of the Persian version of Vocal Cord Dysfunction Questionnaire (VCDQ P ). The study design was cross-sectional or cultural survey. Forty-four patients with vocal fold dysfunction (VFD) and 40 healthy volunteers were recruited for the study. To assess the content validity, the prefinal questions were given to 15 experts to comment on its essential. Ten patients with VFD rated the importance of VCDQ P in detecting face validity. Eighteen of the patients with VFD completed the VCDQ 1 week later for test-retest reliability. To detect absolute reliability, standard error of measurement and smallest detected change were calculated. Concurrent validity was assessed by completing the Persian Chronic Obstructive Pulmonary Disease (COPD) Assessment Test (CAT) by 34 patients with VFD. Discriminant validity was measured from 34 participants. The VCDQ was further validated by administering the questionnaire to 40 healthy volunteers. Validation of the VCDQ as a treatment outcome tool was conducted in 18 patients with VFD using pre- and posttreatment scores. The internal consistency was confirmed (Cronbach α = 0.78). The test-retest reliability was excellent (intraclass correlation coefficient = 0.97). The standard error of measurement and smallest detected change values were acceptable (0.39 and 1.08, respectively). There was a significant correlation between the VCDQ P and the CAT total scores (P validity was significantly different. The VCDQ scores in patients with VFD before and after treatment was significantly different (P valid and reliable self-administered questionnaire in Persian-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Szucs, Kimberly A; Brown, Elena V Donoso
[Purpose] Measurement of posture is important for those with a clinical diagnosis as well as researchers aiming to understand the impact of faulty postures on the development of musculoskeletal disorders. A reliable, cost-effective and low tech posture measure may be beneficial for research and clinical applications. The purpose of this study was to determine rater reliability and construct validity of a posture screening mobile application in healthy young adults. [Subjects and Methods] Pictures of subjects were taken in three standing positions. Two raters independently digitized the static standing posture image twice. The app calculated posture variables, including sagittal and coronal plane translations and angulations. Intra- and inter-rater reliability were calculated using the appropriate ICC models for complete agreement. Construct validity was determined through comparison of known groups using repeated measures ANOVA. [Results] Intra-rater reliability ranged from 0.71 to 0.99. Inter-rater reliability was good to excellent for all translations. ICCs were stronger for translations versus angulations. The construct validity analysis found that the app was able to detect the change in the four variables selected. [Conclusion] The posture mobile application has demonstrated strong rater reliability and preliminary evidence of construct validity. This application may have utility in clinical and research settings.
Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina
We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
Ruhinda, E; Byanyima, R K; Mugerwa, H
Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented however there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. A blinded, repeated-measures diagnostic test was carried out on lumbar spine x-ray radiographs. Radiology Department at Joint Clinical Research Centre (JCRC), Mengo-Kampala-Uganda. Seventy (70) lateral lumbar x-ray films were used for this study and were obtained from the archive of JCRC radiology department at Butikiro house, Mengo-Kampala. Poor observer agreement, both inter- and intra-observer, with kappa values of 0.16 was found. Inter-observer agreement was poorer than intra-observer agreement. Kappa values significantly rose when the lumbar lordosis was clustered into four categories without grading each abnormality. The results confirm that subjective assessment of lumbar lordosis has low reliability and validity. Film quality has limited influence on the observer reliability. This study further shows that fewer scale categories of lordosis abnormalities produce better observer reliability.
Huang, X N; Zhang, Y; Feng, W W; Wang, H S; Cao, B; Zhang, B; Yang, Y F; Wang, H M; Zheng, Y; Jin, X M; Jia, M X; Zou, X B; Zhao, C X; Robert, J; Jing, Jin
Objective: To evaluate the reliability and validity of warning signs checklist developed by the National Health and Family Planning Commission of the People's Republic of China (NHFPC), so as to determine the screening effectiveness of warning signs on developmental problems of early childhood. Method: Stratified random sampling method was used to assess the reliability and validity of checklist of warning sign and 2 110 children 0 to 6 years of age(1 513 low-risk subjects and 597 high-risk subjects) were recruited from 11 provinces of China. The reliability evaluation for the warning signs included the test-retest reliability and interrater reliability. With the use of Age and Stage Questionnaire (ASQ) and Gesell Development Diagnosis Scale (GESELL) as the criterion scales, criterion validity was assessed by determining the correlation and consistency between the screening results of warning signs and the criterion scales. Result: In terms of the warning signs, the screening positive rates at different ages ranged from 10.8%(21/141) to 26.2%(51/137). The median (interquartile) testing time for each subject was 1(0.6) minute. Both the test-retest reliability and interrater reliability of warning signs reached 0.7 or above, indicating that the stability was good. In terms of validity assessment, there was remarkable consistency between ASQ and warning signs, with the Kappa value of 0.63. With the use of GESELL as criterion, it was determined that the sensitivity of warning signs in children with suspected developmental delay was 82.2%, and the specificity was 77.7%. The overall Youden index was 0.6. Conclusion: The reliability and validity of warning signs checklist for screening early childhood developmental problems have met the basic requirements of psychological screening scales, with the characteristics of short testing time and easy operation. Thus, this warning signs checklist can be used for screening psychological and behavioral problems of early childhood
Li, Z; Yang, Y M; Zhang, C; Li, Y; Hu, J; Gao, L W; Zhou, Y X; Zhang, X J
Objective: To assess the reliability and validity of the Chinese version of Driving Anger Scale (DAS) in professional drivers in China and provide a scientific basis for the application of the scale in drivers in China. Methods: Professional drivers, including taxi drivers, bus drivers, truck drivers and school bus drivers, were selected to complete the questionnaire. Cronbach's α and split-half reliability were calculated to evaluate the reliability of DAS, and content, contract, discriminant and convergent validity were performed to measure the validity of the scale. Results: The overall Cronbach's α of DAS was 0.934 and the split-half reliability was 0.874. The correlation coefficient of each subscale with the total scale was 0.639-0.922. The simplified version of DAS supported a presupposed six-factor structure, explaining 56.371% of the total variance revealed by exploratory factor analysis. The DAS had good convergent and discriminant validity, with the success rate of calibration experiment of 100%. Conclusion: DAS has a good reliability and validity in professional drivers in China, and the use of DAS is worth promoting in divers.
Full Text Available Background: The purpose of this study was to evaluate the validity and reliability on the Persian translation of the Modifiable Activity Questionnaire (MAQ in a sample of Tehranian adolescents. Methods: Of a total of 52 subjects, a sub-sample of 40 participations (55.0% boys was used to assess the reliability and the validity of the physical activity questionnaire. The reliability of the two MAQs was calculated by intraclass correlation coefficients, and validation was evaluated using Pearson correlation coefficients to compare data between mean of the two MAQs and mean of four physical activity records. Results: Intraclass correlation coefficient was calculated to assess the reliability between two MAQs and the results of leisure time physical activity over the past year were 0.97. Pearson correlation coefficients between mean of two MAQs and mean of four physical activity records were 0.49 (P < 0.001, for leisure time physical activities. Conclusions: High reliability and relatively moderate validity were found for the Persian translation of the MAQ in a Tehranian adolescent population. Further studies with large sample size are suggested to assess the validity more precisely.
Vendrig, A A; Schaafsma, F G
Purpose The purpose of this study is to measure the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool that is used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods Over the years seven different samples of workers, patients and sick listed workers varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results The 13 scales displayed good internal consistency and test-retest reliability. The constructive validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmative factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore the work ability.
Ling, Samuel K K; Chan, Vincent; Ho, Karen; Ling, Fona; Lui, T H
Develop the first reliable and validated open-source outcome scoring system in the Chinese language for foot and ankle problems. Translation of the English FAOS into Chinese following regular protocols. First, two forward-translations were created separately, these were then combined into a preliminary version by an expert committee, and was subsequently back-translated into English. The process was repeated until the original and back translations were congruent. This version was then field tested on actual patients who provided feedback for modification. The final Chinese FAOS version was then tested for reliability and validity. Reliability analysis was performed on 20 subjects while validity analysis was performed on 50 subjects. Tools used to validate the Chinese FAOS were the SF36 and Pain Numeric Rating Scale (NRS). Internal consistency between the FAOS subgroups was measured using Cronbach's alpha. Spearman's correlation was calculated between each subgroup in the FAOS, SF36 and NRS. The Chinese FAOS passed both reliability and validity testing; meaning it is reliable, internally consistent and correlates positively with the SF36 and the NRS. The Chinese FAOS is a free, open-source scoring system that can be used to provide a relatively standardised outcome measure for foot and ankle studies. Copyright © 2017 Elsevier Ltd. All rights reserved.
Marco Aurelio Lumertz Saffi
Full Text Available Using a sample of patients with coronary artery disease, this methodological study aimed to conduct a cross-cultural adaptation and validation of a questionnaire on knowledge of cardiovascular risk factors (Q-FARCS, lifestyle changes, and treatment adherence for use in Brazil. The questionnaire has three scales: general knowledge of risk factors (RFs; specific knowledge of these RFs; and lifestyle changes achieved. Cross-cultural adaptation included translation, synthesis, back-translation, expert committee review, and pretesting. Face and content validity, reliability, and construct validity were measured. Cronbach’s alpha for the total sample (n = 240 was 0.75. Assessment of psychometric properties revealed adequate face and content validity, and the construct revealed seven components. It was concluded that the Brazilian version of Q-FARCS had adequate reliability and validity for the assessment of knowledge of cardiovascular RFs.
Lee, Jie Eun; Lee, Dong Hwa; Oh, Tae Jung; Kim, Kyoung Min; Choi, Sung Hee; Lim, Soo; Park, Young Joo; Park, Do Joon; Jang, Hak Chul; Moon, Jae Hoon
Thyrotoxicosis is a common disease resulting from an excess of thyroid hormones, which affects many organ systems. The clinical symptoms and signs are relatively nonspecific and can vary depending on age, sex, comorbidities, and the duration and cause of the disease. Several symptom rating scales have been developed in an attempt to assess these symptoms objectively and have been applied to diagnosis or to evaluation of the response to treatment. The aim of this study was to assess the reliability and validity of the Korean version of the hyperthyroidism symptom scale (K-HSS). Twenty-eight thyrotoxic patients and 10 healthy subjects completed the K-HSS at baseline and after follow-up at Seoul National University Bundang Hospital. The correlation between K-HSS scores and thyroid function was analyzed. K-HSS scores were compared between baseline and follow-up in patient and control groups. Cronbach's α coefficient was calculated to demonstrate the internal consistency of K-HSS. The mean age of the participants was 34.7±9.8 years and 13 (34.2%) were men. K-HSS scores demonstrated a significant positive correlation with serum free thyroxine concentration and decreased significantly with improved thyroid function. K-HSS scores were highest in subclinically thyrotoxic subjects, lower in patients who were euthyroid after treatment, and lowest in the control group at follow-up, but these differences were not significant. Cronbach's α coefficient for the K-HSS was 0.86. The K-HSS is a reliable and valid instrument for evaluating symptoms of thyrotoxicosis in Korean patients. Copyright © 2018 Korean Endocrine Society.
Harris, Sion Kim; Sherritt, Lon R; Holder, David W; Kulig, John; Shrier, Lydia A; Knight, John R
Developed for use in health research, the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) consists of brief measures of a broad range of religiousness and spirituality (R/S) dimensions. It has established psychometric properties among adults, but little is known about its appropriateness for use with adolescents. We assessed the psychometric properties of the BMMRS among adolescents. We recruited a racially diverse (85% non-White) sample of 305 adolescents aged 12-18 years (median 16 yrs, IQR 14-17) from 3 urban medical clinics; 93 completed a retest 1 week later. We assessed internal consistency and test-retest reliability. We assessed construct validity by examining how well the measures discriminated groups expected to differ based on self-reported religious preference, and how they related to a hypothesized correlate, depressive symptoms. Religious preference was categorized into "No religion/Atheist" (11%), "Don't know/Confused" (9%), or "Named a religion" (80%). Responses to multi-item measures were generally internally consistent (alpha > or = 0.70 for 12/16 measures) and stable over 1 week (intraclass correlation coefficients > or = 0.70 for 14/16). Forgiveness, Negative R/S Coping, and Commitment items showed lower internal cohesiveness. Scores on most measures were higher (p Atheist" group. Forgiveness, Commitment, and Anticipated Support from members of one's congregation were inversely correlated with depressive symptoms, while BMMRS measures assessing negative R/S experiences (Negative R/S Coping, Negative Interactions with others in congregation, Loss in Faith) were positively correlated with depressive symptoms. These findings suggest that most BMMRS measures are reliable and valid for use among adolescents.
Melguizo-Herrera, Estela; Álvarez-Romero, Yuleysi; Cabarcas-Mendoza, Mayerlin Vanessa; Calvo-Rodríguez, Rossy Stefanie; Flórez-Almanza, Jeomaidis; Moadie-Contreras, Olga Patricia; Campo-Arias, Adalberto
There are many stereotypes and prejudices about the sexual lives of the elderly. However, there are no validated and reliable tools for measuring these in the Latin-American context. To determine the internal consistency, dimensionality, differential item functioning (DIF) by gender and stability of the Attitudes towards Sexuality in the Elderly Questionnaire (ASEQ) in adults over 60 years-old in Cartagena, Colombia. A validation study was designed that included a sample of 130 participants without cognitive impairment attending a Life Center. The ages ranged between 60 and 90 years (mean, 73.7±8.0), and there were 61.5% females. Internal consistency was calculated using Cronbach alpha and McDonald omega, exploratory factor analysis (EFA) (dimensionality), DIF by gender (item response theory) with Kendall correlation, and stability (reproducibility) with Pearson correlation and intraclass correlation coefficient (ICC). The ASEQ showed high internal consistency on the first application (α=.83 and ω=.87) and in the second one (α=.85 and ω=.89). AFE showed two salient factors (prejudices and limitations) that explained 42.6% of the total variance. The IDF presented appropriate coefficients, with the exception of item 14 that showed a high value (τ=.37). ASEQ showed high stability (r=.82 and ICC=.89; 95% confidence interval, 0.83- 0.92; P<.001). ASEQ is a two-dimensional and reliable scale in older adults attending a Life Center in Cartagena, Colombia. New studies are required to evaluate the performance in a representative sample. Copyright © 2014 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Full Text Available BackgroundThyrotoxicosis is a common disease resulting from an excess of thyroid hormones, which affects many organ systems. The clinical symptoms and signs are relatively nonspecific and can vary depending on age, sex, comorbidities, and the duration and cause of the disease. Several symptom rating scales have been developed in an attempt to assess these symptoms objectively and have been applied to diagnosis or to evaluation of the response to treatment. The aim of this study was to assess the reliability and validity of the Korean version of the hyperthyroidism symptom scale (K-HSS.MethodsTwenty-eight thyrotoxic patients and 10 healthy subjects completed the K-HSS at baseline and after follow-up at Seoul National University Bundang Hospital. The correlation between K-HSS scores and thyroid function was analyzed. K-HSS scores were compared between baseline and follow-up in patient and control groups. Cronbach's α coefficient was calculated to demonstrate the internal consistency of K-HSS.ResultsThe mean age of the participants was 34.7±9.8 years and 13 (34.2% were men. K-HSS scores demonstrated a significant positive correlation with serum free thyroxine concentration and decreased significantly with improved thyroid function. K-HSS scores were highest in subclinically thyrotoxic subjects, lower in patients who were euthyroid after treatment, and lowest in the control group at follow-up, but these differences were not significant. Cronbach's α coefficient for the K-HSS was 0.86.ConclusionThe K-HSS is a reliable and valid instrument for evaluating symptoms of thyrotoxicosis in Korean patients.
Lee, Dong Hwa
Background Thyrotoxicosis is a common disease resulting from an excess of thyroid hormones, which affects many organ systems. The clinical symptoms and signs are relatively nonspecific and can vary depending on age, sex, comorbidities, and the duration and cause of the disease. Several symptom rating scales have been developed in an attempt to assess these symptoms objectively and have been applied to diagnosis or to evaluation of the response to treatment. The aim of this study was to assess the reliability and validity of the Korean version of the hyperthyroidism symptom scale (K-HSS). Methods Twenty-eight thyrotoxic patients and 10 healthy subjects completed the K-HSS at baseline and after follow-up at Seoul National University Bundang Hospital. The correlation between K-HSS scores and thyroid function was analyzed. K-HSS scores were compared between baseline and follow-up in patient and control groups. Cronbach's α coefficient was calculated to demonstrate the internal consistency of K-HSS. Results The mean age of the participants was 34.7±9.8 years and 13 (34.2%) were men. K-HSS scores demonstrated a significant positive correlation with serum free thyroxine concentration and decreased significantly with improved thyroid function. K-HSS scores were highest in subclinically thyrotoxic subjects, lower in patients who were euthyroid after treatment, and lowest in the control group at follow-up, but these differences were not significant. Cronbach's α coefficient for the K-HSS was 0.86. Conclusion The K-HSS is a reliable and valid instrument for evaluating symptoms of thyrotoxicosis in Korean patients. PMID:29589389
Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument. PMID:22938557
Helmerhorst Hendrik JF
Full Text Available Abstract Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA and in particular by physical activity questionnaires (PAQs remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument.
Full Text Available Abstract Background We evaluated the reliability and validity of the short form household food security scale in a different setting from the one in which it was developed. Methods The scale was interview administered to 531 subjects from 286 households in north central Trinidad in Trinidad and Tobago, West Indies. We evaluated the six items by fitting item response theory models to estimate item thresholds, estimating agreement among respondents in the same households and estimating the slope index of income-related inequality (SII after adjusting for age, sex and ethnicity. Results Item-score correlations ranged from 0.52 to 0.79 and Cronbach's alpha was 0.87. Item responses gave within-household correlation coefficients ranging from 0.70 to 0.78. Estimated item thresholds (standard errors from the Rasch model ranged from -2.027 (0.063 for the 'balanced meal' item to 2.251 (0.116 for the 'hungry' item. The 'balanced meal' item had the lowest threshold in each ethnic group even though there was evidence of differential functioning for this item by ethnicity. Relative thresholds of other items were generally consistent with US data. Estimation of the SII, comparing those at the bottom with those at the top of the income scale, gave relative odds for an affirmative response of 3.77 (95% confidence interval 1.40 to 10.2 for the lowest severity item, and 20.8 (2.67 to 162.5 for highest severity item. Food insecurity was associated with reduced consumption of green vegetables after additionally adjusting for income and education (0.52, 0.28 to 0.96. Conclusions The household food security scale gives reliable and valid responses in this setting. Differing relative item thresholds compared with US data do not require alteration to the cut-points for classification of 'food insecurity without hunger' or 'food insecurity with hunger'. The data provide further evidence that re-evaluation of the 'balanced meal' item is required.
Campo-Arias, Adalberto; Lafaurie, María Mercedes; Gaitán-Duarte, Hernando G
There are several scales to quantify homophobia in different populations. However, the reliability and validity of these instruments among Colombian students are unknown. Consequently, this work is intended to assess reliability (inner consistency) as well as the validity of the Scale for Homophobia in Medicine students from a private university in Bogotá (Colombia). Methodological study with 199 Medicine students from 1st to 5th semester that filled out the Homophobia Scale form, the general welfare questionnaire, the Attitude Towards Gays and Lesbians Scale (ATGL), WHO-5 (divergent validity) and the Francis Scale of Attitude Toward Christianity (nomologic validity). Pearson's correlations were computed, the Cronbach's alfa coefficient, the omega coefficient (construct's reliability) and confirmatory factorial analysis. The Scale for Homophobia showed an alpha Cronbach coefficient of 0,785, an omega coefficient of 0,790 and a Pearson correlation with the ATGL of 0,844; with WHO-5, -0,059; and a Francis Scale of Attitude Toward Christianity, 0,187. The Scale toward Homophobia exhibited a relevant factor of 44,7% of the total variance. The Scale for Homophobia showed acceptable reliability and validity. New studies should investigate the stability of the scale and the nomologic validity regarding other constructs. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank M; Donnan, Peter T; Gágyor, Ildikó; Hobbelen, Hans J S M; Althabe, Fernando; Krishnan, Jerry A; Treweek, Shaun
PRagmatic Explanatory Continuum Indicator Summary (PRECIS)-2 is a tool that could improve design insight for trialists. Our aim was to validate the PRECIS-2 tool, unlike its predecessor, testing the discriminant validity and interrater reliability. Over 80 international trialists, methodologists, clinicians, and policymakers created PRECIS-2 helping to ensure face validity and content validity. The interrater reliability of PRECIS-2 was measured using 19 experienced trialists who used PRECIS-2 to score a diverse sample of 15 randomized controlled trial protocols. Discriminant validity was tested with two raters to independently determine if the trial protocols were more pragmatic or more explanatory, with scores from the 19 raters for the 15 trials as predictors of pragmatism. Interrater reliability was generally good, with seven of nine domains having an intraclass correlation coefficient over 0.65. Flexibility (adherence) and recruitment had wide confidence intervals, but raters found these difficult to rate and wanted more information. Each of the nine PRECIS-2 domains could be used to differentiate between trials taking more pragmatic or more explanatory approaches with better than chance discrimination for all domains. We have assessed the validity and reliability of PRECIS-2. An elaboration study and web site provide guidance to help future users of the tool which is continuing to be tested by trial teams, systematic reviewers, and funders. Copyright © 2017 Elsevier Inc. All rights reserved.
Dawson, Andreas; Raphael, Karen G; Glaros, Alan; Axelsson, Susanna; Arima, Taro; Ernberg, Malin; Farella, Mauro; Lobbezoo, Frank; Manfredini, Daniele; Michelotti, Ambra; Svensson, Peter; List, Thomas
To combine empirical evidence and expert opinion in a formal consensus method in order to develop a quality-assessment tool for experimental bruxism studies in systematic reviews. Tool development comprised five steps: (1) preliminary decisions, (2) item generation, (3) face-validity assessment, (4) reliability and discriminitive validity assessment, and (5) instrument refinement. The kappa value and phi-coefficient were calculated to assess inter-observer reliability and discriminative ability, respectively. Following preliminary decisions and a literature review, a list of 52 items to be considered for inclusion in the tool was compiled. Eleven experts were invited to join a Delphi panel and 10 accepted. Four Delphi rounds reduced the preliminary tool-Quality-Assessment Tool for Experimental Bruxism Studies (Qu-ATEBS)- to 8 items: study aim, study sample, control condition or group, study design, experimental bruxism task, statistics, interpretation of results, and conflict of interest statement. Consensus among the Delphi panelists yielded good face validity. Inter-observer reliability was acceptable (k = 0.77). Discriminative validity was excellent (phi coefficient 1.0; P reviews of experimental bruxism studies, exhibits face validity, excellent discriminative validity, and acceptable inter-observer reliability. Development of quality assessment tools for many other topics in the orofacial pain literature is needed and may follow the described procedure.
Full Text Available In this study it was aimed to make the studies of the translation of Perception of Organizational Politics Scale into Turkish and the validity and reliability of the scale. Perceptions of Organizational Politics Scale’s (POPS validities has been tested in terms of view, content and structure. The application is designed as a two-stage process. In the first stage, face and content validity was tested. In the second stage, it was sought evidences for the construct validity of the scale by making exploratory factor analysis (EFA and then the confirmatory factor analysis (CFA to the data obtained. In determining the reliability of the scale item-total score correlations and Cronbach alpha coefficient was used. The application made for the validity and reliability of the scale was conducted on the data collected from 277 faculty members working in universities’ education faculties. As a method of achieving those faculty members "Simple randomized (random sampling" is used. The psychometric properties of the Turkish version of Perception of Organizational Politics Scale showed that the scale has a satisfactory level of reliability and validity for the Turkish employee sample.
Moussas, George; Dadouti, Georgia; Douzenis, Athanassios; Poulis, Evangelos; Tzelembis, Athanassios; Bratis, Dimitris; Christodoulou, Christos; Lykouras, Lefteris
Problems associated with alcohol abuse are recognised by the World Health Organization as a major health issue, which according to most recent estimations is responsible for 1.4% of the total world burden of morbidity and has been proven to increase mortality risk by 50%. Because of the size and severity of the problem, early detection is very important. This requires easy to use and specific tools. One of these is the Alcohol Use Disorders Identification Test (AUDIT). This study aims to standardise the questionnaire in a Greek population. AUDIT was translated and back-translated from its original language by two English-speaking psychiatrists. The tool contains 10 questions. A score >or= 11 is an indication of serious abuse/dependence. In the study, 218 subjects took part: 128 were males and 90 females. The average age was 40.71 years (+/- 11.34). From the 218 individuals, 109 (75 male, 34 female) fulfilled the criteria for alcohol dependence according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV), and presented requesting admission; 109 subjects (53 male, 56 female) were healthy controls. Internal reliability (Cronbach alpha) was 0.80 for the controls and 0.80 for the alcohol-dependent individuals. Controls had significantly lower average scores (t test P 8 was 0.98 and its specificity was 0.94 for the same score. For the alcohol-dependent sample 3% scored as false negatives and from the control group 1.8% scored false positives. In the alcohol-dependent sample there was no difference between males and females in their average scores (t test P > 0.05). The Greek version of AUDIT has increased internal reliability and validity. It detects 97% of the alcohol-dependent individuals and has a high sensitivity and specificity. AUDIT is easy to use, quick and reliable and can be very useful in detection alcohol problems in sensitive populations.
Niki, Hisateru; Tatsunami, Shinobu; Haraguchi, Naoki; Aoki, Takafumi; Okuda, Ryuzo; Suda, Yasunori; Takao, Masato; Tanaka, Yasuhito
The Japanese Society for Surgery of the Foot (JSSF) is developing a QOL questionnaire instrument for use in pathological conditions related to the foot and ankle. The main body of the outcome instrument (the Self-Administered Foot Evaluation Questionnaire, SAFE-Q version 2) consists of 34 questionnaire items, which provide five subscale scores (1: Pain and Pain-Related; 2: Physical Functioning and Daily Living; 3: Social Functioning; 4: Shoe-Related; and 5: General Health and Well-Being). In addition, the instrument has nine optional questionnaire items that provide a Sports Activity subscale score. The purpose of this study was to evaluate the test-retest reliability of the SAFE-Q. Version 2 of the SAFE-Q was administered to 876 patients and 491 non-patients, and the test-retest reliability was evaluated for 131 patients. In addition, the SF-36 questionnaire and the JSSF Scale scoring form were administered to all of the participants. Subscale scores were scaled such that the final sum of scores ranged between zero (least healthy) to 100 (healthiest). The intraclass correlation coefficients were larger than 0.7 for all of the scores. The means of the five subscale scores were between 60 and 75. The five subscales easily separated patients from non-patients. The coefficients for the correlations of the subscale scores with the scores on the JSSF Scale and the SF-36 subscales were all highly statistically significantly greater than zero (p valid and reliable. In the future, it will be beneficial to test the responsiveness of the SAFE-Q.
Echevarria-Guanilo, Maria E; Dantas, Rosana A S; Farina, Jayme A; Alonso, Jordi; Rajmil, Luis; Rossi, Lídia A
The aims of this study were to assess the internal reliability (internal consistency), construct validity, sensitivity and ceiling and floor effects of the Brazilian-Portuguese version of the Impact of Event Scale (IES). Methodological research design. The Brazilian-Portuguese version of the IES was applied to a group of 91 burned patients at three times: the first week after the burn injury (time one), between the fourth and the sixth months (time two) and between the ninth and the 12th months (time three). The internal consistency, construct validity (convergent and dimensionality), sensitivity and ceiling and floor effects were tested. Cronbach's alpha coefficients showed high internal consistency for the total scale (0·87) and for the domains intrusive thoughts (0·87) and avoidance responses (0·76). During the hospitalisation (time one), the scale showed low and positive correlations with pain measures immediately before (r=0·22; pnegative correlations with self-esteem (r=-0·52; plow and negative with the Bodily pain (r=-0·24; pimpact of the event in the group of patients under analysis. The Impact of Event Scale can be used in research and clinical practice to assess nursing interventions aimed at decreasing stress during rehabilitation. © 2011 Blackwell Publishing Ltd.
Aertssen, W F M; Steenbergen, B; Smits-Engelsman, B C M
There is lack of valid and reliable field-based tests for assessing functional strength in young children with mild intellectual disabilities (IDs). The aim of this study was to investigate the test-retest reliability and construct validity of the Functional Strength Measurement in children with ID (FSM-ID). Fifty-two children with mild ID (40 boys and 12 girls, mean age 8.48 years, SD = 1.48) were tested with the FSM. Test-retest reliability (n = 32) was examined by a two-way interclass correlation coefficient for agreement (ICC 2.1A). Standard error of measurement and smallest detectable change were calculated. Construct validity was determined by calculating correlations between the FSM-ID and handheld dynamometry (HHD) (convergent validity), FSM-ID, FSM-ID and subtest strength of the Bruininks-Oseretsky test of motor proficiency - second edition (BOT-2) (convergent validity) and the FSM-ID and balance subtest of the BOT-2 (discriminant validity). Test-retest reliability ICC ranged 0.89-0.98. Correlation between the items of the FSM-ID and HHD ranged 0.39-0.79 and between FSM-ID and BOT-2 (strength items) 0.41-0.80. Correlation between items of the FSM-ID and BOT-2 (balance items) ranged 0.41-0.70. The FSM-ID showed good test-retest reliability and good convergent validity with the HHD and BOT-2 subtest strength. The correlations assessing discriminant validity were higher than expected. Poor levels of postural control and core stability in children with mild IDs may be the underlying factor of those higher correlations. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Ana Lúcia Araújo Gomes
Full Text Available ABSTRACT Objective: To evaluate the psychometric properties in terms of validity and reliability of the scale Self-efficacy and their child's level of asthma control: Brazilian version. Method: Methodological study in which 216 parents/guardians of children with asthma participated. A construct validation (factor analysis and test of hypothesis by comparison of contrasted groups and an analysis of reliability in terms of homogeneity (Cronbach's alpha and stability (test-retest were carried out. Results: Exploratory factor analysis proved suitable for the Brazilian version of the scale (Kaiser-Meyer-Olkim index of 0.879 and Bartlett's sphericity with p < 0.001. The correlation matrix in factor analysis suggested the removal of item 7 from the scale. Cronbach's alpha of the final scale, with 16 items, was 0.92. Conclusion: The Brazilian version of Self-efficacy and their child's level of asthma control presented psychometric properties that confirmed its validity and reliability.
Nygren, Björn; Randström, Kerstin Björkman; Lejonklou, Anna K; Lundman, Beril
The purpose of this study was to test the reliability and validity of the Swedish language version of the Resilience Scale (RS). Participants were 142 adults between 19-85 years of age. Internal consistency reliability, stability over time, and construct validity were evaluated using Cronbach's alpha, principal components analysis with varimax rotation and correlations with scores on the Sense of Coherence Scale (SOC) and the Rosenberg Self-Esteem Scale (RSE). The mean score on the RS was 142 (SD = 15). The possible scores on the RS range from 25 to 175, and scores higher than 146 are considered high. The test-retest correlation was .78. Correlations with the SOC and the RSE were .41 (p Self and Life emerged as components from the principal components analysis. These findings provide evidence for the reliability and validity of the Swedish language version of the RS.
O’CONNOR, MELISSA; DAVITT, JOAN K.
The Outcome and Assessment Information Set (OASIS) is the patient-specific, standardized assessment used in Medicare home health care to plan care, determine reimbursement, and measure quality. Since its inception in 1999, there has been debate over the reliability and validity of the OASIS as a research tool and outcome measure. A systematic literature review of English-language articles identified 12 studies published in the last 10 years examining the validity and reliability of the OASIS. Empirical findings indicate the validity and reliability of the OASIS range from low to moderate but vary depending on the item studied. Limitations in the existing research include: nonrepresentative samples; inconsistencies in methods used, items tested, measurement, and statistical procedures; and the changes to the OASIS itself over time. The inconsistencies suggest that these results are tentative at best; additional research is needed to confirm the value of the OASIS for measuring patient outcomes, research, and quality improvement. PMID:23216513
Pakpour, Amir H.; Nourozi, Saeedeh; Mølsted, Stig
INTRODUCTION: The aim of the study was to assess the validity and reliability of the SF-12 questionnaire in a sample of Iranian patients undergoing hemodialysis. MATERIALS AND METHODS: One hundred and forty-four hemodialysis patients were included from dialysis centers in Zanjan, Iran, and were...... asked to complete the SF-12 and SF-36 questionnaires. An initial test-retest reliability evaluation was performed on a sample of 70 patients from the total group, with a retest interval of 14 days. Reliability was estimated by internal consistency and validity was assessed using known-group comparisons...... and construct validity on the patient group as a whole. A linear regression analysis was used to assess any variation in the physical component summary and mental component summary scores of the SF-36 with the respective component summary scores of the SF-12. In addition, the factor structure...
Strøyer, Jesper; Essendrop, Morten; Jensen, Lone Donbaek
To test the validity and reliability of self-assessed physical fitness samples included healthcare assistants working at a hospital (women=170, men=17), persons working with physically and mentally handicapped patients (women=530, men= 123), and two separate groups of healthcare students (a) women...... except for flexibility among men. The reliability was moderate to good (ICC = .62 - .80). Self-assessed aerobic fitness, muscle strength, and flexibility showed moderate construct validity and moderate to good reliability using visual analogues.......=91 and men=5 and (b) women=159 and men=10. Five components of physical fitness were self-assessed by Visual Analogue Scales with illustrations and verbal anchors for the extremes: aerobic fitness, muscle strength, endurance, flexibility, and balance. Convergent and divergent validity were evaluated...
Eshghi, Mohammad Ali; Kordi, Ramin; Memari, Amir Hossein; Ghaziasgar, Ahmad; Mansournia, Mohammad-Ali; Zamani Sani, Seyed Hojjat
The Youth Sport Environment Questionnaire (YSEQ) had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13-17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting.
Mohammad Ali Eshghi
Full Text Available The Youth Sport Environment Questionnaire (YSEQ had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13–17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting.
Full Text Available The main objective of this study is to develop a valid and reliable scale for identifying digital citizenship perceptions of young people in the most common age groups. The study was conducted as a survey study. The study group of this study is composed of 438 people in Turkey who are among 16-24 age group with the highest rate of internet use in Turkey. An exploratory factor analysis was performed to determine the validity of the scale and the item discrimination powers were calculated. The total variance of the scale was determined that the scale had 8-factor structure and was found to be 49,70%. The internal consistency level was also calculated to determine the reliability of the scale. As a result, it can be said that this scale is a valid and reliable scale that can be used to determine the digital citizenship perceptions of young people.
Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan
To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.
Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio
OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurre...
Abay, Halime; Kaplan, Sena
There are a limited number of menopause-specific quality-of-life scales for the Turkish population. This study was conducted to evaluate the validity and reliability of the Turkish Utian Quality-of-Life Scale in postmenopausal women. The study group was comprised of 250 postmenopausal women who applied to a training and research hospital's menopause clinic in Turkey. A survey form and the Turkish Utian quality-of-Life Scale were used to collect data, and the Turkish version of Short Form-36 was used to evaluate reliability with an equivalent form. Language-validity, content-validity, and construct-validity methods were used to assess the validity of the scale, and Cronbach's α coefficient calculation and the equivalent-form reliability methods were used to assess the reliability of the scale. The Turkish Utian Quality-of-Life Scale was determined to be a valid and reliable instrument for measuring the quality of life of postmenopausal women. Confirmatory factor analysis demonstrates that the instrument fits well with 23 items and a four-factor model. The Cronbach's α coefficient for the quality-of-life domains were as follows: 0.88 overall, 0.79 health, 0.78 emotional, 0.76 sexual, and 0.75 occupational. Reliability of the instrument was confirmed through significant correlations between scores on the Turkish version of the Utian Quality-of-Life Scale and the Turkish version of the Short Form-36 (r = 0.745, P measuring quality of life during menopause.
Cruz, Jonas P; Baldacchino, Donia R; Alquwez, Nahed
Patients often resort to religious and spiritual activities to cope with physical and mental challenges. The effect of spiritual coping on overall health, adaptation and health-related quality of life among patients undergoing haemodialysis (HD) is well documented. Thus, it is essential to establish a valid and reliable instrument that can assess both the religious and non-religious coping methods in patients undergoing HD. This study aimed to assess the validity and reliability of the Spiritual Coping Strategies Scale Arabic version (SCS-A) in Saudi patients undergoing HD. A convenience sample of 60 Saudi patients undergoing HD was recruited for this descriptive, cross-sectional study. Data were collected between May and June 2015. Forward-backward translation was used to formulate the SCS-A. The SCS-A, Muslim Religiosity Scale and the Quality of Life Index Dialysis Version III were used to procure the data. Internal consistency reliability, stability reliability, factor analysis and construct validity tests were performed. Analyses were set at the 0.05 level of significance. The SCS-A showed an acceptable internal consistency and strong stability reliability over time. The EFA produced two factors (non-religious and religious coping). Satisfactory construct validity was established by the convergent and divergent validity and known-groups method. The SCS-A is a reliable and valid tool that can be used to measure the religious and non-religious coping strategies of patients undergoing HD in Saudi Arabia and other Muslim and Arabic-speaking countries. © 2016 European Dialysis and Transplant Nurses Association/European Renal Care Association.
Boonstra, Anne M; Schiphorst Preuper, Henrica R; Reneman, Michiel F; Posthumus, Jitze B; Stewart, Roy E
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional design was used. A general rehabilitation centre and a university rehabilitation centre was the setting for the study. The study population consisted of patients over 18 years of age, suffering from chronic musculoskeletal pain; 52 patients in the reliability study, 344 patients in the validity study. Main outcome measures were as follows. Reliability study: Spearman's correlation coefficients (rho values) of the test and retest data of the VAS for disability; validity study: rho values of the VAS disability scores with the scores on four domains of the Short-Form Health Survey (SF-36) and VAS pain scores, and with Roland-Morris Disability Questionnaire scores in chronic low back pain patients. Results were as follows: in the reliability study rho values varied from 0.60 to 0.77; and in the validity study rho values of VAS disability scores with SF-36 domain scores varied from 0.16 to 0.51, with Roland-Morris Disability Questionnaire scores from 0.38 to 0.43 and with VAS pain scores from 0.76 to 0.84. The conclusion of the study was that the reliability of the VAS for disability is moderate to good. Because of a weak correlation with other disability instruments and a strong correlation with the VAS for pain, however, its validity is questionable.
Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H
Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (preliability was (ICC3,3) = 0.953 (pvalidity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu
The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (pr=0.67 pr=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
Sahin Cankurtaran, Eylem; Danişman, Mustafa; Tutar, Hasan; Ulusoy Kaymak, Semra
The Neuropsychiatric Inventory-Clinician (NPI-C) scale is one of the best-known scales for evaluating the behavioral and psychological symptoms of dementia. This study aimed to assess the reliability and validity of the Turkish version of the NPI-C scale in patients with Alzheimer disease (AD). The NPI-C scale was administered to 125 patients with AD. For reliability, both Cronbach's α and interrater reliability were analyzed. The Behavioral Pathology in Alzheimer's Disease (BEHAVE-AD) scale was applied for validity and, in addition, the Mini Mental State Examination (MMSE), Instrumental Activities of Daily Living (IADL) scale, and Disability Assessment of Dementia (DAD) scale were completed. The Turkish version of the NPI-C scale showed high internal consistency (Cronbach's α = 0.75) and mostly good interrater reliability. Assessments of validity showed that the NPI-C and corresponding BEHAVE-AD domains were found to be significantly correlated, between 0.925 and 0.195. Moreover, the correlations between NPI-C and MMSE were significant for all domains except the dysphoria, anxiety, and elation/euphoria domains. When we conducted a correlation analysis of NPI-C with IADL, all domains were statistically significantly correlated except aggression, anxiety, elation/euphoria, and dysphoria. The Turkish version of the NPI-C scale was found to be a reliable and valid instrument to assess neuropsychiatric symptoms in Turkish elderly subjects with AD.
Full Text Available This study aims to develop a valid and reliable instrument for measuring students' social studies achievement goal. The research was conducted on a study group consisted of 374 middle school students studying in the central district of Diyarbakır in 2014-2015 school year fall semester. Expert opinion was consulted with regard to the scale's content and face validity. Exploratory Factor Analysis (EFA and Confirmatory Factor Analysis (CFA were performed in order to measure the scale's construct validity. As a result of EFA, a 29-item and a six-factor structure model which explains 50.82% of the total variance was obtained. The emerging factors were called as a self-approach, task-approach, other-approach, task-avoidance, other-avoidance and self-avoidance respectively. The findings acquired CFA indicated that the 29-item and six-factor structure related to social studies oriented achievement goal scale have acceptable goodness of fit indices. The scale's reliability coefficients were calculated by means of internal consistency method. As a result of reliability analysis, it was determined that the reliability coefficients were within admissible limits. The finding of the item correlation and 27% of upper and lower group comparisons demonstrated that all of the items in the scale should remain. In light of these results, it could be argued that the scale is reliable and valid instrument and can be used in order to test students' social studies achievement goals.
Czaikowski, Brianna L; Liang, Hong; Stewart, C Todd
The Full Outline of UnResponsiveness (FOUR) Score is a coma scale that consists of four components (eye and motor response, brainstem reflexes, and respiration). It was originally validated among the adult population and recently in a pediatric population. To enhance clinical assessment of pediatric intensive care unit patients, including those intubated and/or sedated, at our children's hospital, we modified the FOUR Score Scale for this population. This modified scale would provide many of the same advantages as the original, such as interrater reliability, simplicity, and elimination of the verbal component that is not compatible with the Glasgow Coma Scale (GCS), creating a more valuable neurological assessment tool for the nursing community. Our goal was to potentially provide greater information than the formally used GCS when assessing critically ill, neurologically impaired patients, including those sedated and/or intubated. Experienced pediatric intensive care unit nurses were trained as "expert raters." Two different nurses assessed each subject using the Pediatric FOUR Score Scale (PFSS), GCS, and Richmond Agitation Sedation Scale at three different time points. Data were compared with the Pediatric Cerebral Performance Category (PCPC) assessed by another nurse. Our hypothesis was that the PFSS and PCPC should highly correlate and the GCS and PCPC should correlate lower. Study results show that the PFSS is excellent for interrater reliability for trained nurse-rater pairs and prediction of poor outcome and in-hospital mortality, under various situations, but there were no statistically significant differences between the PFSS and the GCS. However, the PFSS does have the potential to provide greater neurological assessment in the intubated and/or sedated patient based on the outcomes of our study.
Full Text Available OBJECTIVE: Gülhane Aphasia Test-2 (GAT-2 has been developed to show the presence of a language disorder ‘aphasia’ and to give the clinician implications for the accompanying speech disorders such as apraxia and dysarthria. OBJECTIVE: The aim of the study was to report standardization, validity and reliability study of GAT-2. METHODS: : 10 healthy individuals were tested initially for the pilot study. 134 healthy individual was included to the standardization study and 30 individuals with aphasia and 11 individuals with right brain injury was included to the validation study. The inter group GAT-2 score differentiations and the effects of age, years of education, sex variances were observed. GAT-2 cut-off scores were calculated by the scores of healthy individuals. GAT-2 test-retest reliability and inter-observer reliability was calculated. RESULTS: Healthy individuals’ GAT-2 scores were significantly different from the GAT-2 scores of aphasic patients, but not from right brain injured patients’. Healthy individuals’ GAT-2 scores were not affected from the sex, age variances but from years of education, so cut-off scores were calculated by this variance. GAT-2 scores of aphasic patients were not affected from age, sex and years of education. Test-retest and inter-observer reliability and internal consistency results showed that GAT-2 is a highly reliable aphasia screening test. CONCLUSION: GAT-2 was found to be a standardized, highly reliable and a valid aphasia test for Turkish stroke patients with aphasia
Baker, Nancy A; Cook, James R; Redfern, Mark S
This paper describes the inter-rater and intra-rater reliability, and the concurrent validity of an observational instrument, the Keyboard Personal Computer Style instrument (K-PeCS), which assesses stereotypical postures and movements associated with computer keyboard use. Three trained raters independently rated the video clips of 45 computer keyboard users to ascertain inter-rater reliability, and then re-rated a sub-sample of 15 video clips to ascertain intra-rater reliability. Concurrent validity was assessed by comparing the ratings obtained using the K-PeCS to scores developed from a 3D motion analysis system. The overall K-PeCS had excellent reliability [inter-rater: intra-class correlation coefficients (ICC)=.90; intra-rater: ICC=.92]. Most individual items on the K-PeCS had from good to excellent reliability, although six items fell below ICC=.75. Those K-PeCS items that were assessed for concurrent validity compared favorably to the motion analysis data for all but two items. These results suggest that most items on the K-PeCS can be used to reliably document computer keyboarding style.
Zhao, M.; McDonald, A.; Dick, P.
The test rig for Validation and Reliability Testing of shutdown system software has been upgraded from the AECL Windows-based test rig previously used for CANDU6 stations. It includes a Virtual Trip Computer, which is a software simulation of the functional specification of the trip computer, and a real-time trip computer simulator in a separate chassis, which is used during the preparation of trip computer test cases before the actual trip computers are available. This allows preparation work for Validation and Reliability Testing to be performed in advance of delivery of actual trip computers to maintain a project schedule. (author)
Bayani, Ali Asghar
The internal consistency, test-retest reliability, and construct validity of the Farsi version of the Depression Anxiety Stress Scales were examined, with a sample of 306 undergraduate students (123 men, 183 women) ranging from 18 to 51 years of age (M age = 25.4, SD = 6.1). Participants completed the Satisfaction with Life Scale, Rosenberg Self-esteem Scale, and the Depression Anxiety Stress Scales. The findings confirmed the preliminary reliabilities and preliminary construct validity of the Farsi translation of the Depression Anxiety Stress Scales.
Jacobsen, Stine Lindahl
The paper will present a phd study concerning reliability and validity of music therapy assessment model “Assessment of Parenting Competences” (APC) in the area of families with emotionally neglected children. This study had a multiple strategy design with a philosophical base of critical realism...... and pragmatism. The fixed design for this study was a between and within groups design in testing the APCs reliability and validity. The two different groups were parents with neglected children and parents with non-neglected children. The flexible design had a multiple case study strategy specifically...
Jingying Liu; Jipeng Yang; Yanhui Liu; Yang Yang; Hongfu Zhang
Purpose: To test the validity and reliability of a modified Career Growth Scale (CGS) to assess nurse career growth. Method: A cross-sectional design was used to analyze the use of the CGS to survey 600 full-time registered nurses from Grade A hospitals in Tianjin. Results: A modified scale we called Career Growth of Nurse Scale (CGNS) is acceptable, valid, and reliable for the evaluation of nurse career growth in Chinese hospitals. This scale measured three main factors (career goal, c...
Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin
This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensio...
Iyigun, Gozde; Kirmizigil, Berkiye; Angin, Ender; Oksuz, Sevim; Can, Filiz; Eker, Levent; Rose, Debra J
The aim of this study was to evaluate the reliability and validity of the Turkish version of the FAB(FAB-T) scale in the older Turkish adults. The reliability and validity of the scale was tested on 200 community-dwelling older adults. FAB-T scale was scored by different physiotherapists on different days to evaluate inter-rater and intrarater reliability. The Berg Balance Scale (BBS) was used for the evaluation of convergent validity, and the content validity of the FAB-T scale was investigated. The FAB-T scale showed very high inter- and intra-rater reliability. For inter-rater agreement, on the individual test items and total score ICC values were 0.92 (95 %CI; 0.90-0.94) and 0.96 (95% CI; 0.95-0.97) respectively. The intra-rater agreement, on the individual test items and total score ICC values were 0.93 (95 %CI; 0.91- 0.95) and 0.96 (95% CI; 0.95- 0.97) respectively. There was a good agreement between the FAB-T and BBS scales. A high correlation was found between the BBS and FAB-T scales [rho = 0.70 (%95 CI; 0.62-0.76)] indicating good convergent validity. Considering the content validity of the FAB-T scale, no floor (floor score: 0%) or ceiling (ceiling score: 6.5%) effect was detected. The FAB-T scale was successfully translated from the original English version (FAB) and demonstrated strong psychometric features. It was found that the FAB-T scale has very high inter-rater and intra-rater reliability. Considering the convergent validity, the scale has high correlation with the BBS. The FAB-T has no floor and ceiling effect. Copyright © 2018 Elsevier B.V. All rights reserved.
Apolzan, John W; Myers, Candice A; Cowley, Amanda D; Brady, Heather; Hsia, Daniel S; Stewart, Tiffany M; Redman, Leanne M; Martin, Corby K
Mindfulness is theorized to affect the eating behavior and weight of pregnant women, yet no measure has been validated during pregnancy. This study qualitatively and quantitatively evaluated the reliability and validity of the Mindful Eating Questionnaire (MEQ) in overweight and obese pregnant women. Participants completed focus groups and cognitive interviews. The MEQ was administered twice to measure test-retest reliability. The Eating Inventory (EI) and Mindful Attention Awareness Scale (MAAS) were administered to assess convergent validity, and the Neighborhood Environment Walkability Scale (NEWS) assessed discriminant validity. Participants were 20 ± 8 weeks gestation (mean ± SD), 30 ± 2 years old, and 55% were obese. The MEQ total score had good test-retest reliability (r = .85). The total score internal consistency reliability was poor (Cronbach's α = .56). The external cues subscale (ECS) was not internally consistent (α = .31). Other subscales ranged from α = .59-.68. When the ECS was excluded, the MEQ total score internal consistency was acceptable (α = .62). Convergent validity was supported by the MEQ total score (with and without ECS) correlating significantly with the MAAS and the EI disinhibition and hunger subscales. Discriminant validity of the MEQ was supported by the MEQ and NEWS total scores and subscales not being significantly correlated. The quantitative results were supported by the qualitative context and content analysis. With the exception of the ECS, the MEQ's reliability and validity was supported in pregnant women, and most of the subscales were more robust in pregnant women than in the original sample of healthy adults. The MEQ's use with overweight and obese pregnant women is supported. Copyright © 2016 Elsevier Ltd. All rights reserved.
Stoner, Lee; Bonner, Chantel; Credeur, Daniel; Lambrick, Danielle; Faulkner, James; Wadsworth, Daniel; Williams, Michelle A.
BackgroundMonitoring central hemodynamic responses to an orthostatic challenge may provide important insight into autonomic nervous system function. Oscillometric pulse wave analysis devices have recently emerged, presenting clinically viable options for investigating central hemodynamic properties. The purpose of the current study was to determine whether oscillometric pulse wave analysis can be used to reliably (between-day) assess central blood pressure and central pressure augmentation (a...
Laflamme, Patrick; Seli, Paul; Smilek, Daniel
The metronome response task (MRT)-a sustained-attention task that requires participants to produce a response in synchrony with an audible metronome-was recently developed to index response variability in the context of studies on mind wandering. In the present studies, we report on the development and validation of a visual version of the MRT (the visual metronome response task; vMRT), which uses the rhythmic presentation of visual, rather than auditory, stimuli. Participants completed the vMRT (Studies 1 and 2) and the original (auditory-based) MRT (Study 2) while also responding to intermittent thought probes asking them to report the depth of their mind wandering. The results showed that (1) individual differences in response variability during the vMRT are highly reliable; (2) prior to thought probes, response variability increases with increasing depth of mind wandering; (3) response variability is highly consistent between the vMRT and the original MRT; and (4) both response variability and depth of mind wandering increase with increasing time on task. Our results indicate that the original MRT findings are consistent across the visual and auditory modalities, and that the response variability measured in both tasks indexes a non-modality-specific tendency toward behavioral variability. The vMRT will be useful in the place of the MRT in experimental contexts in which researchers' designs require a visual-based primary task.
Isabelle Ottenvall Hammar
Full Text Available In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA-Older persons (IPA-O, showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation.
Sanders, James L; Williams, Robert J
Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Danielle Fabiana Cucolo
Full Text Available ABSTRACT Objectives: to verify the reliability and construct validity estimates of the "Assessment of nursing care product" scale (APROCENF and its applicability. Methods: this validation study included a sample of 40 (inter-rater reliability and 172 (construct validity assessments performed by nurses at the end of the work shift at nine inpatient services of a teaching hospital in the Brazilian Southeast. The data were collected between February and September/2014 with interruptions. Cronbach's alpha and Spearman's correlation coefficients were calculated, as well as the intraclass correlation and the weighted kappa index (inter-rater reliability. Exploratory factor analysis was used with principal component extraction and varimax rotation (construct validity. Results: the internal consistency revealed an alpha coefficient of 0.85, item-item correlation ranging between 0.13 and 0.61 and item-total correlation between 0.43 and 0.69. Inter-rater equivalence was obtained and all items evidenced significant factor loadings. Conclusion: this research evidenced the reliability and construct validity of the scale to assess the nursing care product. Its application in nursing practice permits identifying improvements needed in the production process, contributing to management and care decisions.
Full Text Available Magno F Formiga,1,2 Kathryn E Roach,1 Isabel Vital,3 Gisel Urdaneta,3 Kira Balestrini,3 Rafael A Calderon-Candelario,3,4 Michael A Campos,3,4,* Lawrence P Cahalin1,* 1Department of Physical Therapy, University of Miami Miller School of Medicine, Coral Gables, FL, USA; 2CAPES Foundation, Ministry of Education of Brazil, Brasilia, Brazil; 3Pulmonary Section, Miami Veterans Administration Medical Center, Miami, FL, USA; 4Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, University of Miami Miller School of Medicine, Miami, FL, USA *These authors contributed equally to this work Purpose: The Test of Incremental Respiratory Endurance (TIRE provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP over time. The integration of MIP over inspiratory duration (ID provides the sustained maximal inspiratory pressure (SMIP. Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods: Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results: All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP
Gadbury-Amyot, Cynthia C.
This study examined validity and reliability of portfolio assessment using Messick's (1996, 1995) unified framework of construct validity. Theoretical and empirical evidence was sought for six aspects of construct validity. The sample included twenty student portfolios. Each portfolio were evaluated by seven faculty raters using a primary trait analysis scoring rubric. There was a significant relationship (r = .81--.95; p Dental Hygiene Board Examination (r = .60; p Dental Testing Service examination was both weak and nonsignificant (r = .19; p > .05). An open-ended survey was used to elicit student feedback on portfolio development. A majority of the students (76%) perceived value in the development of programmatic portfolios. In conclusion, the pattern of findings from this study suggest that portfolios can serve as a valid and reliable measure for assessing student competency.
Bornstein, P H; Hamilton, S B; Miller, R K; Quevillon, R P; Spitzform, M
This study investigated the effects of reliability and validity "enhancers" on fidelity of self-report data in an analogue therapy situation. Under the guise of a Concentration Skills Training Program, 57 Ss were assigned randomly to one of the following conditions: (a) Reliability Enhancement; (b) Truth Talk; (c) No Comment Control. Results indicated significant differences among groups (p less than .05). In addition, tests of multiple comparisons revealed that Reliability Enhancement was significantly different from Truth Talk in occurrences of unreliability (p less than .05). These findings are discussed in light of the increased reliance on self-report data in behavioral intervention, and recommendations are made for future research.
Jørgensen, René; Ris Hansen, Inge; Falla, Deborah
-retest reliability in people with and without chronic neck pain. Moreover, construct and between-group discriminative validity of the tests were examined. METHODS: Twenty-one participants with chronic neck pain and 21 asymptomatic participants were included. Intra- and inter-reliability were evaluated for the Cranio-Cervical...... Flexion Test (CCFT), Range of Movement (ROM), Joint Position Error (JPE), Gaze Stability (GS), Smooth Pursuit Neck Torsion Test (SPNTT), and neuromuscular control of the Deep Cervical Extensors (DCE). Test-retest reliability was assessed for Postural Control (SWAY) and Pressure Pain Threshold (PPT) over......BACKGROUND: The reliability of clinical tests for the cervical spine has not been adequately evaluated. Six cervical clinical tests, which are low cost and easy to perform in clinical settings, were tested for intra- and inter-examiner reliability, and two performance tests were assessed for test...
The purpose of this study is to make Turkish adaptation the Writing Attitude Scale (WAS) that In order to measure writing anxiety developed by Marcia et al (1984). For this purpose was carried out the Validation of a Writing Attitude Scale and to examine its reliability and validity. Writing Attitude Scale (WAS) was first translated into Turkish and, equivalence analysis of forms English / Turkish language of the scale were carried out by the reading of three English teachers / lecturers. The...
Nualnong Wongtongkam; Paul Russell Ward; Andrew Day; Anthony Harold Winefield
In Thailand physical violence among male adolescents is considered a significant public health issue, although there has been little published research into the aetiology and functions of violence in Thai youth. Research in this area has been hampered by a lack of psychometrically sound tools that have been validated to assess problem behaviours in Asian youth. The purpose of this paper is to provide validity and reliability data on an instrument to measure violence in Thai youth. In this stu...
Ausserhofer, Dietmar; Anderson, Ruth A; Colón-Emeric, Cathleen; Schwendimann, René
The Safety Organizing Scale is a valid and reliable measure on safety behaviors and practices in hospitals. This study aimed to explore the psychometric properties of the Safety Organizing Scale-Nursing Home version (SOS-NH). In a cross-sectional analysis of staff survey data, we examined validity and reliability of the 9-item Safety SOS-NH using American Educational Research Association guidelines. This substudy of a larger trial used baseline survey data collected from staff members (n = 627) in a variety of work roles in 13 nursing homes (NHs) in North Carolina and Virginia. Psychometric evaluation of the SOS-NH revealed good response patterns with low average of missing values across all items (3.05%). Analyses of the SOS-NH's internal structure (eg, comparative fit indices = 0.929, standardized root mean square error of approximation = 0.045) and consistency (composite reliability = 0.94) suggested its 1-dimensionality. Significant between-facility variability, intraclass correlations, within-group agreement, and design effect confirmed appropriateness of the SOS-NH for measurement at the NH level, justifying data aggregation. The SOS-NH showed discriminate validity from one related concept: communication openness. Initial evidence regarding validity and reliability of the SOS-NH supports its utility in measuring safety behaviors and practices among a wide range of NH staff members, including those with low literacy. Further psychometric evaluation should focus on testing concurrent and criterion validity, using resident outcome measures (eg, patient fall rates). Copyright © 2013 American Medical Directors Association, Inc. All rights reserved.
Murray, Nicholas; Salvatore, Anthony; Powell, Douglas; Reed-Jones, Rebecca
Context: An estimated 300 000 sport-related concussion injuries occur in the United States annually. Approximately 30% of individuals with concussions experience balance disturbances. Common methods of balance assessment include the Clinical Test of Sensory Organization and Balance (CTSIB), the Sensory Organization Test (SOT), the Balance Error Scoring System (BESS), and the Romberg test; however, the National Collegiate Athletic Association recommended the Wii Fit as an alternative measure of balance in athletes with a concussion. A central concern regarding the implementation of the Wii Fit is whether it is reliable and valid for measuring balance disturbance in athletes with concussion. Objective: To examine the reliability and validity evidence for the CTSIB, SOT, BESS, Romberg test, and Wii Fit for detecting balance disturbance in athletes with a concussion. Data Sources: Literature considered for review included publications with reliability and validity data for the assessments of balance (CTSIB, SOT, BESS, Romberg test, and Wii Fit) from PubMed, PsycINFO, and CINAHL. Data Extraction: We identified 63 relevant articles for consideration in the review. Of the 63 articles, 28 were considered appropriate for inclusion and 35 were excluded. Data Synthesis: No current reliability or validity information supports the use of the CTSIB, SOT, Romberg test, or Wii Fit for balance assessment in athletes with a concussion. The BESS demonstrated moderate to high reliability (interclass correlation coefficient = 0.87) and low to moderate validity (sensitivity = 34%, specificity = 87%). However, the Romberg test and Wii Fit have been shown to be reliable tools in the assessment of balance in Parkinson patients. Conclusions: The BESS can evaluate balance problems after a concussion. However, it lacks the ability to detect balance problems after the third day of recovery. Further investigation is needed to establish the use of the CTSIB, SOT, Romberg test, and Wii Fit for
Murray, Nicholas; Salvatore, Anthony; Powell, Douglas; Reed-Jones, Rebecca
An estimated 300 000 sport-related concussion injuries occur in the United States annually. Approximately 30% of individuals with concussions experience balance disturbances. Common methods of balance assessment include the Clinical Test of Sensory Organization and Balance (CTSIB), the Sensory Organization Test (SOT), the Balance Error Scoring System (BESS), and the Romberg test; however, the National Collegiate Athletic Association recommended the Wii Fit as an alternative measure of balance in athletes with a concussion. A central concern regarding the implementation of the Wii Fit is whether it is reliable and valid for measuring balance disturbance in athletes with concussion. To examine the reliability and validity evidence for the CTSIB, SOT, BESS, Romberg test, and Wii Fit for detecting balance disturbance in athletes with a concussion. Literature considered for review included publications with reliability and validity data for the assessments of balance (CTSIB, SOT, BESS, Romberg test, and Wii Fit) from PubMed, PsycINFO, and CINAHL. We identified 63 relevant articles for consideration in the review. Of the 63 articles, 28 were considered appropriate for inclusion and 35 were excluded. No current reliability or validity information supports the use of the CTSIB, SOT, Romberg test, or Wii Fit for balance assessment in athletes with a concussion. The BESS demonstrated moderate to high reliability (interclass correlation coefficient = 0.87) and low to moderate validity (sensitivity = 34%, specificity = 87%). However, the Romberg test and Wii Fit have been shown to be reliable tools in the assessment of balance in Parkinson patients. The BESS can evaluate balance problems after a concussion. However, it lacks the ability to detect balance problems after the third day of recovery. Further investigation is needed to establish the use of the CTSIB, SOT, Romberg test, and Wii Fit for assessing balance in athletes with concussions.
Mills, Tamara L; Holm, Margo B; Schmeler, Mark
The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.
Full Text Available This study aimed to translate MIDAS questionnaire from English into Persian and determine its content validity and reliability. MIDAS was translated and validated on a sample (N = 110 of Iranian adult population. The participants were both male and female with the age range of 17-57. They were at different educational levels and from different ethnic groups in Iran. A translating team, consisting of five members, bilingual in English and Persian and familiar with multiple intelligences (MI theory and practice, were involved in translating and determining content validity, which included the processes of forward translation, back-translation, review, final proof-reading, and testing. The statistical analyses of inter-scale correlation were performed using the Cronbach's alpha coefficient. In an intra-class correlation, the Cronbach's alpha was high for all of the questions. Translation and content validity of MIDAS questionnaire was completed by a proper process leading to high reliability and validity. The results suggest that Persian MIDAS (P-MIDAS could serve as a valid and reliable instrument for measuring Iranian adults MIs.
Guspatni, G.; Kurniawati, Y.
The aim of this paper is to examine validity and reliability of a questionnaire used to evaluate e-learning implementation in chemistry instruction. 48 questionnaires were filled in by students who had studied chemistry through e-learning system. The questionnaire consisted of 20 indicators evaluating students’ perception on using e-learning. Parametric testing was done as data were assumed to follow normal distribution. Item validity of the questionnaire was examined through item-total correlation using Pearson’s formula while its reliability was assessed with Cronbach’s alpha formula. Moreover, convergent validity was assessed to see whether indicators building a factor had theoretically the same underlying construct. The result of validity testing revealed 19 valid indicators while the result of reliability testing revealed Cronbach’s alpha value of .886. The result of factor analysis showed that questionnaire consisted of five factors, and each of them had indicators building the same construct. This article shows the importance of factor analysis to get a construct valid questionnaire before it is used as research instrument.
Ammerman Alice S
Full Text Available Abstract Background Few assessment instruments have examined the nutrition and physical activity environments in child care, and none are self-administered. Given the emerging focus on child care settings as a target for intervention, a valid and reliable measure of the nutrition and physical activity environment is needed. Methods To measure inter-rater reliability, 59 child care center directors and 109 staff completed the self-assessment concurrently, but independently. Three weeks later, a repeat self-assessment was completed by a sub-sample of 38 directors to assess test-retest reliability. To assess criterion validity, a researcher-administered environmental assessment was conducted at 69 centers and was compared to a self-assessment completed by the director. A weighted kappa test statistic and percent agreement were calculated to assess agreement for each question on the self-assessment. Results For inter-rater reliability, kappa statistics ranged from 0.20 to 1.00 across all questions. Test-retest reliability of the self-assessment yielded kappa statistics that ranged from 0.07 to 1.00. The inter-quartile kappa statistic ranges for inter-rater and test-retest reliability were 0.45 to 0.63 and 0.27 to 0.45, respectively. When percent agreement was calculated, questions ranged from 52.6% to 100% for inter-rater reliability and 34.3% to 100% for test-retest reliability. Kappa statistics for validity ranged from -0.01 to 0.79, with an inter-quartile range of 0.08 to 0.34. Percent agreement for validity ranged from 12.9% to 93.7%. Conclusion This study provides estimates of criterion validity, inter-rater reliability and test-retest reliability for an environmental nutrition and physical activity self-assessment instrument for child care. Results indicate that the self-assessment is a stable and reasonably accurate instrument for use with child care interventions. We therefore recommend the Nutrition and Physical Activity Self-Assessment for
Short- and long-term aspects of measuring structural response parameters are addressed. Two specific examples of such measurements are considered for the purpose of illustration and in order to focus the discussion. These examples are taken from the petroleum industry (monitoring of riser response) and from the shipping industry (monitoring of ice-induced strains in a ship hull). Similarities and differences between the two cases are elaborated with respect to which are the most relevant mechanical limit states. Furthermore, main concerns related to reliability levels within a short-term versus long-term time horizon are highlighted. Quantifying the economic benefits of applying monitoring systems is also addressed. - Highlights: • Two examples of structural response monitoring are described. • Application of measurements is discussed in relation to updating of load and structural parameters. • Quantification of the value of response monitoring is made for both of the examples.
...] Food and Drug Administration/American Glaucoma Society Workshop on the Validity, Reliability, and... entitled ``FDA/American Glaucoma Society (AGS) Workshop on the Validity, Reliability, and Usability of... research. The purpose of this public workshop is to provide a forum for discussing the validity...
Gore, Shweta; Blackwood, Jennifer; Guyette, Mary; Alsalaheen, Bara
Reduced physical activity is associated with poor prognosis in chronic obstructive pulmonary disease (COPD). Accelerometers have greatly improved quantification of physical activity by providing information on step counts, body positions, energy expenditure, and magnitude of force. The purpose of this systematic review was to compare the validity and reliability of accelerometers used in patients with COPD. An electronic database search of MEDLINE and CINAHL was performed. Study quality was assessed with the Strengthening the Reporting of Observational Studies in Epidemiology checklist while methodological quality was assessed using the modified Quality Appraisal Tool for Reliability Studies. The search yielded 5392 studies; 25 met inclusion criteria. The SenseWear Pro armband reported high criterion validity under controlled conditions (r = 0.75-0.93) and high reliability (ICC = 0.84-0.86) for step counts. The DynaPort MiniMod demonstrated highest concurrent validity for step count using both video and manual methods. Validity of the SenseWear Pro armband varied between studies especially in free-living conditions, slower walking speeds, and with addition of weights during gait. A high degree of variability was found in the outcomes used and statistical analyses performed between studies, indicating a need for further studies to measure reliability and validity of accelerometers in COPD. The SenseWear Pro armband is the most commonly used accelerometer in COPD, but measurement properties are limited by gait speed variability and assistive device use. DynaPort MiniMod and Stepwatch accelerometers demonstrated high validity in patients with COPD but lack reliability data.
Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne
To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube
Adams, Emma J; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C; Cooper, Ashley R; Ogilvie, David
No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, ptravel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires.
Emmanuel, Andy; Clow, Sheila E
Validating a questionnaire/instrument (whether developed or adapted) before proceeding to the field for data collection is important. This article presents the modification of an Irish questionnaire for a Nigerian setting. The validation process and reliability testing of this questionnaire (which was used in assessing previous breastfeeding practices and breastfeeding intentions of pregnant women in English and Hausa languages) were also presented. Five experts in the field of breastfeeding and infant feeding voluntarily and independently evaluated the instrument. The experts evaluated the various items of the questionnaire based on relevance, clarity, simplicity and ambiguity on a Likert scale of 4. The analysis was performed to determine the content validity index (CVI).Two language experts performed the translation and back-translation. Ten pregnant women completed questionnaires which were evaluated for internal consistency. Two other pregnant women completed the questionnaire twice at an interval of two weeks to test the reliability. SPSS version 21 was used to calculate the coefficient of reliability. The content validity index was high (0.94 for relevance, clarity and ambiguity and 0.96 for simplicity). The analysis suggested that four of the seventy one items should be removed. Cronbach's Alpha was 0.81, while the reliability coefficient was 0.76. The emerged validated questionnaire was translated from English to Hausa, then, back-translated into English and compared for accuracy. The final instrument is reliable and valid for data collection on breastfeeding in Nigeria among English and Hausa speakers. Therefore, the instrument is recommended for use in assessing breastfeeding intention and practices in Nigeria.
Full Text Available Background: The Endometriosis Health Profile-30 (EHP-30 is a disease-specific questionnaire to measure the health-related quality of life in patients with endometriosis. The aim of this study was to evaluate the validity and reliability of the Persian version of Endometriosis Health Profile (EHP-30 in women with endometriosis referring to three Gynecology Clinics in Tehran, Iran. Methods: One hundred women (20 to 50 years old with surgically confirmed endometriosis recruited from three outpatient Gynecology Clinics affiliated to the Iran University of Medical Sciences. All 100 patients were asked to complete EHP-30 questionnaire while referring to the Clinics. The findings were analyzed using descriptive statistics, internal reliability consistency, construct validity (using short form-36, which had already been validated in Iran, factor analysis (with principle component analysis method, and item total correlation to assess the validity and reliability of the questionnaire. Results: The internal consistency reliability of the questionnaire was high (Cronbach’s α ranged between 0.80 and 0.93 for core, and 0.78 and 0.90 for modular parts. All items were loaded on their own factors except item 17 (feeling aggressive or violent and item 18 (feeling unwell, which were loaded on pain and social support domains, respectively. Construct validity of EHP-30, established by using SF-36, indicates good correlations in several similar scales of these two questionnaires. Conclusion: The findings of the study demonstrate that Persian version of EHP-30 is a valid and reliable measure to assess the quality of life in women with endometriosis
Park, Jinse; Koh, Seong-Beom; Kim, Hee Jin; Oh, Eungseok; Kim, Joong-Seok; Yun, Ji Young; Kwon, Do-Young; Kim, Younsoo; Kim, Ji Seon; Kwon, Kyum-Yil; Park, Jeong-Ho; Youn, Jinyoung; Jang, Wooyoung
Postural instability and gait disturbance are the cardinal symptoms associated with falling among patients with Parkinson's disease (PD). The Tinetti mobility test (TMT) is a well-established measurement tool used to predict falls among elderly people. However, the TMT has not been established or widely used among PD patients in Korea. The purpose of this study was to evaluate the reliability and validity of the Korean version of the TMT for PD patients. Twenty-four patients diagnosed with PD were enrolled in this study. For the interrater reliability test, thirteen clinicians scored the TMT after watching a video clip. We also used the test-retest method to determine intrarater reliability. For concurrent validation, the unified Parkinson's disease rating scale, Hoehn and Yahr staging, Berg Balance Scale, Timed-Up and Go test, 10-m walk test, and gait analysis by three-dimensional motion capture were also used. We analyzed receiver operating characteristic curve to predict falling. The interrater reliability and intrarater reliability of the Korean Tinetti balance scale were 0.97 and 0.98, respectively. The interrater reliability and intra-rater reliability of the Korean Tinetti gait scale were 0.94 and 0.96, respectively. The Korean TMT scores were significantly correlated with the other clinical scales and three-dimensional motion capture. The cutoff values for predicting falling were 14 points (balance subscale) and 10 points (gait subscale). We found that the Korean version of the TMT showed excellent validity and reliability for gait and balance and had high sensitivity and specificity for predicting falls among patients with PD.
Lagarde, Marloes L J; Kamalski, Digna M A; van den Engel-Hoek, Lenie
To systematically review the available evidence for the reliability and validity of cervical auscultation in diagnosing the several aspects of dysphagia in adults and children suffering from dysphagia. Medline (PubMed), Embase and the Cochrane Library databases. The systematic review was carried out applying the steps of the PRISMA-statement. The methodological quality of the included studies were evaluated using the Dutch 'Cochrane checklist for diagnostic accuracy studies'. A total of 90 articles were identified through the search strategy, and after applying the inclusion and exclusion criteria, six articles were included in this review. In the six studies, 197 patients were assessed with cervical auscultation. Two of the six articles were considered to be of 'good' quality and three studies were of 'moderate' quality. One article was excluded because of a 'poor' methodological quality. Sensitivity ranges from 23%-94% and specificity ranges from 50%-74%. Inter-rater reliability was 'poor' or 'fair' in all studies. The intra-rater reliability shows a wide variance among speech language therapists. In this systematic review, conflicting evidence is found for the validity of cervical auscultation. The reliability of cervical auscultation is insufficient when used as a stand-alone tool in the diagnosis of dysphagia in adults. There is no available evidence for the validity and reliability of cervical auscultation in children. Cervical auscultation should not be used as a stand-alone instrument to diagnose dysphagia. © The Author(s) 2015.
Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael
Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
Dong, Lijuan; Liu, Na; Tian, Xiaoyu; Qiao, Xiaoxia; Gobbens, Robbert J J; Kane, Robert L; Wang, Cuili
To translate the Tilburg Frailty Indicator (TFI) into Chinese and assess its reliability and validity. A sample of 917 community-dwelling older people, aged ≥60 years, in a Chinese city was included between August 2015 and March 2016. Construct validity was assessed using alternative measures corresponding to the TFI items, including self-rated health status (SRH), unintentional weight loss, walking speed, timed-up-and-go tests (TUGT), making telephone calls, grip strength, exhaustion, Short Portable Mental Status Questionnaire (SPMSQ), Geriatric Depression scale (GDS-15), emotional role, Adaptability Partnership Growth Affection and Resolve scale (APGAR) and Social Support Rating Scale (SSRS). Fried's phenotype and frailty index were measured to evaluate criterion validity. Adverse health outcomes (ADL and IADL disability, healthcare utilization, GDS-15, SSRS) were used to assess predictive (concurrent) validity. The internal consistency reliability was good (Cronbach's α=0.71). The test-retest reliability was strong (r=0.88). Kappa coefficients showed agreements between the TFI items and corresponding alternative measures. Alternative measures correlated as expected with the three domains of TFI, with an exclusion that alternative psychological measures had similar correlations with psychological and physical domains of the TFI. The Chinese TFI had excellent criterion validity with the AUCs regarding physical phenotype and frailty index of 0.87 and 0.86, respectively. The predictive (concurrent) validities of the adverse health outcomes and healthcare utilization were acceptable (AUCs: 0.65-0.83). The Chinese TFI has good validity and reliability as an integral instrument to measure frailty of older people living in the community in China. Copyright © 2017 Elsevier B.V. All rights reserved.
O'Sullivan, Elizabeth J; Rasmussen, Kathleen M
The breastfeeding surveillance tool in the United States, the National Immunization Survey, considers the maternal-infant dyad to be breastfeeding for as long as the infant consumes human milk (HM). However, many infants consume at least some HM from a bottle, which can lead to health outcomes different from those for at-the-breast feeding. Our aim was to develop a construct-valid questionnaire that categorizes infants by nutrition source, that is, own mother's HM, another mother's HM, infant formula, or other and feeding mode, that is, at the breast or from a bottle, and test the reliability of this questionnaire. The Questionnaire on Infant Feeding was developed through a literature review and modified based on qualitative research. Construct validity was assessed through cognitive interviews and a test-retest reliability study was conducted among mothers who completed the questionnaire twice, 1 month apart. Cognitive interviews were conducted with ten mothers from upstate New York between September and December 2014. A test-retest reliability study was conducted among 44 mothers from across the United States between March and May 2015. Equivalence of questions with continuous responses about the timing of starting and stopping various behaviors and the agreement between responses to questions with categorical responses on the two questionnaires completed 1 month apart. Reliability was assessed using paired-equivalence tests for questions about the timing of starting and stopping behaviors and weighted Cohen's κ for questions about the frequency and intensity of behaviors. Reliability of the Questionnaire on Infant Feeding was moderately high among mothers of infants aged 19 to 35 months, with most questions about the timing of starting and stopping behaviors equivalent to within 1 month. Weighted Cohen's κ for categorical questions indicated substantial agreement. The Questionnaire on Infant Feeding is a construct-valid tool to measure duration, intensity
Erlich, Richard J.; Russ-Eft, Darlene F.
The validity and reliability of three instruments, the "Counselor Rubric for Gauging Student Understanding of Academic Planning," micro-analytic questions, and the "Student Survey for Understanding Academic Planning," all based on social cognitive theory, were tested as means to assess self-efficacy and self-regulated learning in college academic…
Bühn, Stefanie; Mathes, Tim; Prengel, Peggy; Wegewitz, Uta; Ostermann, Thomas; Robens, Sibylle; Pieper, Dawid
There is a movement from generic quality checklists toward a more domain-based approach in critical appraisal tools. This study aimed to report on a first experience with the newly developed risk of bias in systematic reviews (ROBIS) tool and compare it with A Measurement Tool to Assess Systematic Reviews (AMSTAR), that is, the most common used tool to assess methodological quality of systematic reviews while assessing validity, reliability, and applicability. Validation study with four reviewers based on 16 systematic reviews in the field of occupational health. Interrater reliability (IRR) of all four raters was highest for domain 2 (Fleiss' kappa κ = 0.56) and lowest for domain 4 (κ = 0.04). For ROBIS, median IRR was κ = 0.52 (range 0.13-0.88) for the experienced pair of raters compared to κ = 0.32 (range 0.12-0.76) for the less experienced pair of raters. The percentage of "yes" scores of each review of ROBIS ratings was strongly correlated with the AMSTAR ratings (r s = 0.76; P = 0.01). ROBIS has fair reliability and good construct validity to assess the risk of bias in systematic reviews. More validation studies are needed to investigate reliability and applicability, in particular. Copyright © 2017 Elsevier Inc. All rights reserved.
Coenen, P.; Formanoy, M.; Douwes, M.; Bosch, T.; Kraker, H. de
Exposure to mechanical vibrations at work (e.g., due to handling powered tools) is a potential occupational risk as it may cause upper extremity complaints. However, reliable and valid assessment methods for vibration exposure at work are lacking. Measuring hand-arm vibration objectively is often
Markovina, J.; Stewart-Knox, B.J.; Rankin, A.; Gibney, M.; Almeida, M.D.V.; Fischer, A.R.H.; Kuznesof, S.A.; Poínhos, R.; Panzone, L.; Frewer, L.J.
This analysis has been conducted to explore the validity and reliability of the Food Choice Questionnaire (FCQ) across 9 European countries. Variation in the factor structure and the perceived importance of food choice motives have been compared cross-nationally. Volunteers (N = 9381) were recruited
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen B.; Murphy, Joseph; Elliott, Stephen N.; May, Henry
The Vanderbilt Assessment of Leadership in Education (VAL-ED) is a multirater assessment of principals' learning-centered leadership. The instrument was developed based on the Standards for Educational and Psychological Testing. In this article, we report on the validity and reliability evidence for the VAL-ED accumulated in a national field…
Perkins, Rose J. Merlino
"Women's Mental Health Questionnaire" (W-MHQ) assesses females' adult mental health concerns, and examines their associations with specified father-daughter childhood relationships. Presented are W-MHQ item and scale development, and psychometric findings drawn from factor analyses, reliability assessments, and validation processes. For…
Pekdogan, Serpil; Ulutas, Ilkay
The purpose of this study is to develop a valid and reliable data collection tool to assess the decision-making skills of children at the age of 5 to 6. The study group is composed of 300 children attending independent pre-schools located in the central district of Amasya province and their parents. In the study, four-factor and 29-item…
Shaik, Munvar Miya; Hassan, Norul Badriah; Tan, Huay Lin; Bhaskar, Shalini; Gan, Siew Hua
The study was designed to determine the validity and reliability of the Bahasa Melayu version (MIDAS-M) of the Migraine Disability Assessment (MIDAS) questionnaire. Patients having migraine for more than six months attending the Neurology Clinic, Hospital Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia, were recruited. Standard forward and back translation procedures were used to translate and adapt the MIDAS questionnaire to produce the Bahasa Melayu version. The translated Malay version was tested for face and content validity. Validity and reliability testing were further conducted with 100 migraine patients (1st administration) followed by a retesting session 21 days later (2nd administration). A total of 100 patients between 15 and 60 years of age were recruited. The majority of the patients were single (66%) and students (46%). Cronbach's alpha values were 0.84 (1st administration) and 0.80 (2nd administration). The test-retest reliability for the total MIDAS score was 0.73, indicating that the MIDAS-M questionnaire is stable; for the five disability questions, the test-retest values ranged from 0.77 to 0.87. The MIDAS-M questionnaire is comparable with the original English version in terms of validity and reliability and may be used for the assessment of migraine in clinical settings.
Munvar Miya Shaik
Full Text Available Background. The study was designed to determine the validity and reliability of the Bahasa Melayu version (MIDAS-M of the Migraine Disability Assessment (MIDAS questionnaire. Methods. Patients having migraine for more than six months attending the Neurology Clinic, Hospital Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia, were recruited. Standard forward and back translation procedures were used to translate and adapt the MIDAS questionnaire to produce the Bahasa Melayu version. The translated Malay version was tested for face and content validity. Validity and reliability testing were further conducted with 100 migraine patients (1st administration followed by a retesting session 21 days later (2nd administration. Results. A total of 100 patients between 15 and 60 years of age were recruited. The majority of the patients were single (66% and students (46%. Cronbach’s alpha values were 0.84 (1st administration and 0.80 (2nd administration. The test-retest reliability for the total MIDAS score was 0.73, indicating that the MIDAS-M questionnaire is stable; for the five disability questions, the test-retest values ranged from 0.77 to 0.87. Conclusion. The MIDAS-M questionnaire is comparable with the original English version in terms of validity and reliability and may be used for the assessment of migraine in clinical settings.
To facilitate cross-cultural research in the psychology of religion, the reliability and validity of the 7-item short form of the Francis Scale of Attitude toward Christianity was examined among a sample of 453 young people aged between 12-19 years old from standards six, seven, eight, nine and ten attending a secondary ...
Ersoy, Mehmet Akif; Varan, Azmi
The aim of this study was to evaluate the reliability and validity of the Turkish version of the Internalized Stigma of Mental Illness Scale (ISMI) in patients with psychiatric disorders. The study included 203 patients diagnosed with various psychiatric disorders in a psychiatry outpatient clinic of a university hospital. The reliability of the scale was assessed by investigation of its internal consistency and split-half reliability. The convergent validity of the scale was demonstrated by the relationship between the Turkish form of the ISMI and various criteria scales. Cronbach's alpha value was 0.93 for the entire scale and ranged between 0.63 and 0.87 for the 5 subscales of the ISMI. In terms of convergent validity, the total score of the Turkish ISMI significantly correlated with the Beck Depression Inventory, Rosenberg Self-Esteem Scale, Sociotropy-Autonomy Scale, Brief Symptom Inventory, Multidimensional Scale of Perceived Social Support, Clinical Global Impression Scale, and Global Assessment of Functioning Scale scores. All values were in the expected direction. In the light of the findings, it was concluded that the Turkish version of ISMI could be used as a reliable and valid tool in assessing internalized stigma of the Turkish psychiatric patients.
Baek, Hyun-Sook; Lee, Kyoung-Uk; Joo, Eun-Jeong; Lee, Mi-Young; Choi, Kyeong-Sook
The Connor-Davidson Resilience Scale (CD-RISC) measures various aspects of psychological resilience in patients with posttraumatic stress disorder (PTSD) and other psychiatric ailments. This study sought to assess the reliability and validity of the Korean version of the Connor-Davidson Resilience Scale (K-CD-RISC). In total, 576 participants were enrolled (497 females and 79 males), including hospital nurses, university students, and firefighters. Subjects were evaluated using the K-CD-RISC, the Beck Depression Inventory (BDI), the Impact of Event Scale-Revised (IES-R), the Rosenberg Self-Esteem Scale (RSES), and the Perceived Stress Scale (PSS). Test-retest reliability and internal consistency were examined as a measure of reliability, and convergent validity and factor analysis were also performed to evaluate validity. Cronbach's alpha coefficient and test-retest reliability were 0.93 and 0.93, respectively. The total score on the K-CD-RISC was positively correlated with the RSES (r=0.56, preliability and validity for measurement of resilience among Korean subjects.
Hager, Erin R.; Treuth, Margarita S.; Gormely, Candice; Epps, LaShawna; Snitker, Soren; Black, Maureen M.
Purpose: Ankle accelerometry allows for 24-hr data collection and improves data volume/integrity versus hip accelerometry. Using Actical ankle accelerometry, the purpose of this study was to (a) develop sensitive/specific thresholds, (b) examine validity/reliability, (c) compare new thresholds with those of the manufacturer, and (d) examine…
Arseven, Zeynep; Kiliç, Abdurrahman; Sahin, Seyma
In the present study, it is aimed to develop a valid and reliable scale for determining value-eroding behaviors of teachers, hence their values of judgment. The items of the "Value-eroding Teacher Behaviors Scale" were designed in the form of 5-point likert type rating scale. The exploratory factor analysis (EFA) was conducted to…
Tezer, Murat; Ozcan, Deniz
Attitudes of the students towards mathematics lessons are very important in terms of their success and motivation. The purpose of this study is to develop a scale for the assessment of primary school students' attitudes towards mathematics courses in the 2nd and 3rd grades, to analyse its validity-reliability structure and to determine the…
Cenkseven-Önder, Fulya; Avci, Rasit; Çolakkadioglu, Oguzhan
The aim of this study was to adapt the Reactive-Proactive Aggression Questionnaire (RPQ), developed to measure two dimensions of aggression which are reactive and proactive, to Turkish and test the validity and reliability of the Turkish form. The study group consisted of 278 students in four junior high schools in Adana, Turkey, and 485 students…
Yalçin, Mehmet Tufan; Eres, Figen
The aim of this study is to develop a valid and reliable measurement tool that can determine the instructional capacity, according to teacher opinions. In the academic year of 2016-2017, 1011 teachers working in the public high schools and vocational technical schools in Ankara participated in the study. The total number of items on the scale was…
Hanson, E. K.; Schaufeli, W.; Vrijkotte, T.; Plomp, N. H.; Godaert, G. L.
The reliability and validity of the Effort-Reward Imbalance Questionnaire were tested in 775 blue- and white-collar workers in the Netherlands. Cronbach's alpha revealed sufficient internal consistency of all subscales except Need for Control. With exploratory probabilistic scaling (Mokken)
Kuçukosmanoglu, Hayrettin Onur
The main purpose of this study is to develop a scale to determine students' attitude levels on individual instruments and individual instrument courses in instrument training, which is an important dimension of music education, and to conduct a validity-reliability research of the scale that has been developed. The scale consists of 16 items. The…
Fettahlioglu, Pinar; Timur, Serkan; Timur, Betül
The aim of this study is to conduct a research under circumstances of Turkey about the validity and reliability of the Affective Tendencies towards Environmental Scale prepared by Yavetz, Goldman and Pe'er (2009). The translation of this scale to Turkish was done by the researchers and language specialists. And then, the scale was evaluated by the…
Basha, Ertan; Kaya, Mehmet
The purpose of this study is to examine validity and reliability of the Albanian version of the Depression, Anxiety and Stress Scale (DASS), which is developed by Lovibond and Lovibond (1995). The sample of this study is consisted of 555 subjects who were living in Kosovo. The results of confirmatory factor analysis indicated 42 items loaded on…
Olpak, Yusuf Ziya; Kiliç Çakmak, Ebru
The aim of this study was to describe the validity and reliability of a Turkish language version of the CoI survey developed by Arbaugh et al. (2008). Data were obtained from 1150 students enrolled in online courses in various departments in three Turkish state universities. The data were randomly divided into two parts: the first part was…
Sno, H. N.; Schalken, H. F.; de Jonghe, F.; Koeter, M. W.
In this article the development, utility, reliability, and validity of the Inventory for Déjà vu Experiences Assessment (IDEA) are described. The IDEA is a 23-item self-administered questionnaire consisting of a general section of nine questions and qualitative section of 14 questions. The latter
Kerimova, Melek; Gunuc, Selim
The purpose of the present paper was to adapt Gunuc and Kayri's (2010) "Internet Addiction Scale," with show validity and reliability for many various sampling groups, into the Azerbaijani language. Another objective of the study is to determine the prevalence of Internet addiction among Azerbaijani adolescents and youth, which…
Koonce, Glenn L.; Kelly, Michael D.
In this study, researchers analyzed the reliability and validity of the mentor's assessment for principal internships at a university in the Southeast region of the United States. The results of the study yielded how trustworthy and dependable the instrument is and the effectiveness of the instrument in the current principal preparation program.…
Nolan, Meaghan M.; Beran, Tanya; Hecker, Kent G.
Students with positive attitudes toward statistics are likely to show strong academic performance in statistics courses. Multiple surveys measuring students' attitudes toward statistics exist; however, a comparison of the validity and reliability of interpretations based on their scores is needed. A systematic review of relevant electronic…
Bear, George G.; Holst, Bruna; Lisboa, Carolina; Chen, Dandan; Yang, Chunyan; Chen, Fang Fang
This study presents evidence of the validity and reliability of scores for the newly developed Brazilian Portuguese version of the Delaware School Climate Survey-Student (Brazilian DSCS-S). The sample consisted of 378 students, grades 5 through 9, attending four private and three public schools in southern Brazil. Confirmatory factor analyses…
Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G.
Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Freeresponse (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…
Solomon, Benjamin G.; Tobin, Kevin G.; Schutte, Gregory M.
The Effective Behavior Support Self-Assessment Survey (SAS; Sugai, Horner, & Todd, 2003) is designed to measure perceived Positive Behavior Interventions and Supports (PBIS) implementation and identify priorities for improvement. Despite its longevity, little published research exists documenting its reliability or validity for these purposes.…
Tekin, Ahmet; Polat, Ebru
Problem Statement: For an effective teaching and learning process it is critical to provide support for teachers in the development of e-content, and teachers should play an active role in this development. Purpose of the Study: The purpose of this study is to develop a valid and reliable Likert-type scale that will determine pre-service teachers'…
Cicero Luciano Alves Costa
Full Text Available This study aims to investigate the construct validity and reliability of the checklist for qualitative analysis of the overhand serve in Volleyball. Fifty-five male subjects aged 13-17 years participated in the study. The overhand serve was analyzed using the checklist proposed by Meira Junior (2003, which analyzes the pattern of serve movement in four phases: (I initial position, (II ball lifting, (III ball attacking, and (IV finalization. Construct validity was analyzed using confirmatory factorial analysis and reliability through the Cronbach’s alpha coefficient. The construct validity was supported by confirmatory factor analysis with the RMSEA results (0.037 [confidence interval 90% = 0.020-0.040], CFI (0.970 and TLI (0.950 indicating good fit of the model. In relation to reliability, Cronbach’s alpha coefficient was 0.661, being this value considered acceptable. Among the items on the checklist, ball lifting and attacking showed higher factor loadings, 0.69 and 0.99, respectively. In summary, the checklist for the qualitative analysis of the overhand serve of Meira Junior (2003 can be considered a valid and reliable instrument for use in research in the field of Sports Sciences.
The purpose of this study is to develop a valid and reliable assessment tool for use in determining the competency beliefs of school administrators about innovation management. The scale applied to a study group of 216 school administrators, after work Centered on assessing intelligibility and specialized opinion. Exploratory and confirmatory…
The purpose of this study is to develop, and test the validity and reliability of a scale for the use of researchers to determine the accreditation standards of open and distance education based on the views of administrators, teachers, staff and students. This research was designed according to the general descriptive survey model since it aims…
Francis, L J; Katz, Y J
The Hebrew translation of the Oxford Happiness Inventory and the short form Revised Eysenck Personality Questionnaire were completed by 298 undergraduate women in Israel. The findings confirm the internal reliability of the Hebrew translation of the Oxford Happiness Inventory and support the construct validity according to which "happiness is a thing called stable extraversion."
Full Text Available OBJECTIVE: Aphasia assessment is the first step towards a well- founded language therapy. Language tests need to consider cultural as well as typological linguistic aspects of a given language. This study was designed to determine the standardization, validity and reliability of Language Assessment Test for Aphasia, which consists of eight subtests including spontaneous speech and language, auditory comprehension, repetition, naming, reading, grammar, speech acts, and writing. METHODS: The test was administered to 282 healthy participants and 92 aphasic participants in age, education and gender matched groups. The validity study of the test was investigated with analysis of content, structure and criterion-related validity. For reliability of the test, the analysis of internal consistency, stability and equivalence reliability was conducted. The influence of variables on healhty participants’ sub-test scores, test score and language score was examined. According to significant differences, norms and cut-off scores based on language score were determined. RESULTS: The group with aphasia performed highly lower than healthy participants on subtest, test and language scores. The test scores of healthy group were mostly affected by age and educational level but not affected by gender. According to significant differences, age and educational level for both groups were determined. Considering age and educational levels, the reference values for the cut-off scores were presented. CONCLUSION: The test was found to be a highly reliable and valid aphasia test for Turkish- speaking aphasic patients either in Turkey or other Turkish communities around the world
Dilmac, Bulent; Aricak, Osman Tolga; Cesur, Sevim
The purpose of the present study is to examine the initial psychometric properties of the Values Scale for adults. While developing the first stage of the Values Scale, open-ended data on the values held by 216 university students were obtained. During the second stage, the validity and reliability studies of the 60-item Values Scale obtained by…
Ersanli, Ercümend; Mameghani, Shiva Saeighi
In the present study, the Tolerance Scale developed by Ersanli (2014) was adapted to the Iranian culture, and its validity and reliability were investigated in the case of Iranian college students. The participants consisted of 552 Iranian college students (62% male, M = 20.84, S.D.: 1.53) selected using the convenience sampling method. The sample…
Akin, Ahmet; Cetin, Bayram
This study investigated the validity and reliability of the Turkish version of the Depression Anxiety Stress Scale (DASS). The sample of the study consisted of 590 university students, 121 English teachers and 136 emotionally disturbed individuals who sought treatment in various clinics and counseling centers. Factor loadings of the scale ranged…
Alkhateeb, Haitham M
The Arabic translation of the Mathematics Teaching Efficacy Beliefs was completed by 144 undergraduate students (M age=20.6) in Jordan. The findings support the internal reliability of the Arabic translation of the Mathematics Teaching Efficacy Beliefs as well as its construct validity.
Tsuchiya, Kenji J.; Matsumoto, Kaori; Yagi, Atsuko; Inada, Naoko; Kuroda, Miho; Inokuchi, Eiko; Koyama, Tomonori; Kamio, Yoko; Tsujii, Masatsugu; Sakai, Saeko; Mohri, Ikuko; Taniike, Masako; Iwanaga, Ryoichiro; Ogasahara, Kei; Miyachi, Taishi; Nakajima, Shunji; Tani, Iori; Ohnishi, Masafumi; Inoue, Masahiko; Nomura, Kazuyo; Hagiwara, Taku; Uchiyama, Tokio; Ichikawa, Hironobu; Kobayashi, Shuji; Miyamoto, Ken; Nakamura, Kazuhiko; Suzuki, Katsuaki; Mori, Norio; Takei, Nori
To examine the inter-rater reliability of Autism Diagnostic Interview-Revised, Japanese Version (ADI-R-JV), the authors recruited 51 individuals aged 3-19 years, interviewed by two independent raters. Subsequently, to assess the discriminant and diagnostic validity of ADI-R-JV, the authors investigated 317 individuals aged 2-19 years, who were…
Fokkema, Tryntsje; Kooiman, Thea J. M.; Krijnen, Wim P.; Van der Schans, Cees P.; De Groot, Martijn
Purpose: To examine the test-retest reliability and validity of ten activity trackers for step counting at three different walking speeds. Methods: Thirty-one healthy participants walked twice on a treadmill for 30 min while wearing 10 activity trackers (Polar Loop, Garmin Vivosmart, Fitbit Charge
Fokkema, Tryntsje; Kooiman, Thea; Krijnen, Wim; van der Schans, Cees; de Groot, Martijn
Purpose: To examine the test–retest reliability and validity of ten activity trackers for step counting at three different walking speeds. Methods: Thirty-one healthy participants walked twice on a treadmill for 30 min while wearing 10 activity trackers (Polar Loop, Garmin Vivosmart, Fitbit Charge
Nakano, Hideki; Kodama, Takayuki; Ukai, Kazumasa; Kawahara, Satoru; Horikawa, Shiori; Murata, Shin
In this study, we aimed to (1) translate the English version of the Kinesthetic and Visual Imagery Questionnaire (KVIQ), which assesses motor imagery ability, into Japanese, and (2) investigate the reliability and validity of the Japanese KVIQ. We enrolled 28 healthy adults in this study. We used Cronbach’s alpha coefficients to assess reliability reflected by the internal consistency. Additionally, we assessed validity reflected by the criterion-related validity between the Japanese KVIQ and the Japanese version of the Movement Imagery Questionnaire-Revised (MIQ-R) with Spearman’s rank correlation coefficients. The Cronbach’s alpha coefficients for the KVIQ-20 were 0.88 (Visual) and 0.91 (Kinesthetic), which indicates high reliability. There was a significant positive correlation between the Japanese KVIQ-20 (Total) and the Japanese MIQ-R (Total) (r = 0.86, p < 0.01). Our results suggest that the Japanese KVIQ is an assessment that is a reliable and valid index of motor imagery ability.
Lagarde, Marloes L J; Kamalski, DMA; Van Den Engel-Hoek, Lenie
Objective: To systematically review the available evidence for the reliability and validity of cervical auscultation in diagnosing the several aspects of dysphagia in adults and children suffering from dysphagia. Data sources: Medline (PubMed), Embase and the Cochrane Library databases. Review
Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam
Several sophisticated methods of footprint analysis currently exist. However, it is sometimes useful to apply standard measurement methods of recognized evidence with an easy and quick application. We sought to assess the reliability and validity of a new method of footprint assessment in a healthy population using Photoshop CS5 software (Adobe Systems Inc, San Jose, California). Forty-two footprints, corresponding to 21 healthy individuals (11 men with a mean ± SD age of 20.45 ± 2.16 years and 10 women with a mean ± SD age of 20.00 ± 1.70 years) were analyzed. Footprints were recorded in static bipedal standing position using optical podography and digital photography. Three trials for each participant were performed. The Hernández-Corvo, Chippaux-Smirak, and Staheli indices and the Clarke angle were calculated by manual method and by computerized method using Photoshop CS5 software. Test-retest was used to determine reliability. Validity was obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed high values (ICC, 0.98-0.99). Moreover, the validity test clearly showed no difference between techniques (ICC, 0.99-1). The reliability and validity of a method to measure, assess, and record the podometric indices using Photoshop CS5 software has been demonstrated. This provides a quick and accurate tool useful for the digital recording of morphostatic foot study parameters and their control.
Beery, Thomas H.
The purpose of this preliminary study is to establish a reliable and valid measure of environmental connectedness (EC) to allow for further exploration of the Swedish Outdoor Recreation in Change national survey data. The Nordic concept of friluftsliv (nature-based outdoor recreation) and the environmental psychology concept of EC are explored to…
Levpušcek, Melita Puklek; Inglés, Candido J.; Marzo, Juan C.; García-Fernández, Jose M.
The purpose of this study was to examine the reliability and validity of the School Anxiety Inventory (SAI) using a sample of 646 Slovenian adolescents (48% boys), ranging in age from 12 to 19 years. Single confirmatory factor analyses replicated the correlated four-factor structure of scores on the SAI for anxiety-provoking school situations…
Palmer, F; Davis, M C
Interventions for childhood overweight and obesity that target parents as the agents of change by increasing parent self-efficacy for facilitating their child's healthy weight behaviours require a reliable and valid tool to measure parent self-efficacy before and after interventions. Nelson and Davis developed the Parent Efficacy for Child Healthy Weight Behaviour (PECHWB) scale with good preliminary evidence of reliability and validity. The aim of this research was to provide further psychometric evidence from an independent Australian sample. Data were provided by a convenience sample of 261 primary caregivers of children aged 4-17 years via an online survey. PECHWB scores were correlated with scores on other self-report measures of parenting efficacy and 2- to 4-week test-retest reliability of the PECHWB was assessed. The results of the study confirmed the four-factor structure of the PECHWB (Fat and Sugar, Sedentary Behaviours, Physical Activity, and Fruit and Vegetables) and provided strong evidence of internal consistency and test-retest reliability, as well as good evidence of convergent validity. Future research should investigate the properties of the PECHWB in a sample of parents of overweight or obese children, including measures of child weight and actual child healthy weight behaviours to provide evidence of the concurrent and predictive validity of PECHWB scores. © 2013 John Wiley & Sons Ltd.
Conclusion: The adaptation of translated and ldquo;Hand Hygiene Belief Scale and Hand Hygiene Practices Inventory and rdquo; in Turkey is found to be reliable and valid to evaluate hand hygiene belief and practices. [Cukurova Med J 2016; 41(2.000: 271-284
La Monte, Michelle Evonne
This study focused on developing a valid and reliable instrument that can not only identify successful co-teaching, but also the professional development needs of co-teachers and their administrators in public schools. Two general questions about the quality of co-teaching were addressed in this study: (a) How well did descriptors within each of…
Rae, James R; Olson, Kristina R
The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research-lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Woodburn, Jim; Sutcliffe, Nick
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Harmanci Seren, Arzu Kader; Tuna, Rujnan; Eskin Bacaksiz, Feride
Objective measurement of the job performance of nursing staff using valid and reliable instruments is important in the evaluation of healthcare quality. A current, valid, and reliable instrument that specifically measures the performance of nurses is required for this purpose. The aim of this study was to determine the validity and reliability of the Turkish version of the Job Performance Instrument. This study used a methodological design and a sample of 240 nurses working at different units in four hospitals in Istanbul, Turkey. A descriptive data form, the Job Performance Scale, and the Employee Performance Scale were used to collect data. Data were analyzed using IBM SPSS Statistics Version 21.0 and LISREL Version 8.51. On the basis of the data analysis, the instrument was revised. Some items were deleted, and subscales were combined. The Turkish version of the Job Performance Instrument was determined to be valid and reliable to measure the performance of nurses. The instrument is suitable for evaluating current nursing roles.
Full Text Available This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin
This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥ 18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Berenschot, L.; Grift, Y.K.
This study evaluates the reliability and validity of the Impact on Autonomy and Participation instrument (IPA) for heterogeneous populations of social support clients. Decentralisation of social support and accompanying budget cuts spurred interest in outcome-related payment systems to foster
van Saane, N.; Sluiter, J. K.; Verbeek, J. H. A. M.; Frings-Dresen, M. H. W.
Background Although job satisfaction research has been carried out for decades, no recent overview of job satisfaction instruments and their quality is available. Aim The aim of this systematic review is to select job satisfaction instruments of adequate reliability and validity for use as
Benítez-Porres, Javier; López-Fernández, Iván; Raya, Juan Francisco; Álvarez Carnero, Sabrina; Alvero-Cruz, José Ramón; Álvarez Carnero, Elvis
Background: Physical activity (PA) assessment by questionnaire is a cornerstone in the field of sport epidemiology studies. The Physical Activity Questionnaire for Children (PAQ-C) has been used widely to assess PA in healthy school populations. The aim of this study was to evaluate the reliability and validity of the PAQ-C questionnaire in…
Erturan Ilker, Gökçe; Arslan, Yunus; Demirhan, Giyasettin
The aim of this study is to determine the validity and reliability of the Motivated Strategies for Learning Questionnaire (MSLQ) for high school students. In total, 1605 students (829 girls, 776 boys, average age = 15.67 ± 1.19) from three different high schools in the central district of Ankara voluntarily participated in the study. The MSLQ was…
Kim, Soo Jin; Yang, You-Na; Lee, Jong Won; Lee, Jin-Youn; Jeong, Eunhwa; Kim, Bo-Ram; Lee, Jongmin
To evaluate the reliability and validity of Korean version of AST (K-AST) as a bedside screening test of apraxia in patients with stroke for early and reliable detection. AST was translated into Korean, and the translated version received authorization from the author of AST. The performances of K-AST in 26 patients (21 males, 5 females; mean age 65.42±17.31 years) with stroke (23 ischemic, 3 hemorrhagic) were videotaped. To test the reliability and validity of K-AST, the recorded performances were assessed by two physiatrists and two occupational therapists twice at a 1-week interval. The patient performances at admission in Korean version of Mini-Mental State Examination (K-MMSE), self-care and transfer categories of Functional Independence Measure (FIM), and motor praxis area of Loewenstein Occupational Therapy Cognitive Assessment, the second edition (LOTCA-II) were also evaluated. Scores of motor praxis area of LOTCA-II was used to assess the validity of K-AST. Inter-rater reliabilities were 0.983 (preliable and valid test for bedside screening of apraxia.
Galetzka, Mirjam; Verhoeven, J.W.M.; Pruyn, Adriaan T.H.
The purpose of this research is to add to our understanding of the antecedents of customer satisfaction by examining the effects of service reliability (Is the service “correctly” produced?) and service validity (Is the “correct” service produced?) of search, experience and credence services.
Brady, Michael P.; Heiser, Lawrence A.; McCormick, Jazarae K.; Forgan, James
High-stakes standardized student assessments are increasingly used in value-added evaluation models to connect teacher performance to P-12 student learning. These assessments are also being used to evaluate teacher preparation programs, despite validity and reliability threats. A more rational model linking student performance to candidates who…
Stellmack, Mark A.; Konheim-Kalkstein, Yasmine L.; Manor, Julia E.; Massey, Abigail R.; Schmitz, Julie Ann P.
This article describes the empirical evaluation of the reliability and validity of a grading rubric for grading APA-style introductions of undergraduate students. Levels of interrater agreement and intrarater agreement were not extremely high but were similar to values reported in the literature for comparably structured rubrics. Rank-order…
Valero-Aguayo, Luis; Ferro-Garcia, Rafael; Lopez-Bermudez, Miguel Angel; de Huralde, Ma. Angeles Selva-Lopez
The Experiencing of Self Scale (EOSS) was created for the evaluation of Functional Analytic Psychotherapy (Kohlenberg & Tsai, 1991, 2001, 2008) in relation to the concept of the experience of personal self as socially and verbally constructed. This paper presents a reliability and validity study of the EOSS with a Spanish sample (582…
Strand, Edythe A.; McCauley, Rebecca J.; Weigand, Stephen D.; Stoeckel, Ruth E.; Baas, Becky S.
Purpose: In this article, the authors report reliability and validity evidence for the Dynamic Evaluation of Motor Speech Skill (DEMSS), a new test that uses dynamic assessment to aid in the differential diagnosis of childhood apraxia of speech (CAS). Method: Participants were 81 children between 36 and 79 months of age who were referred to the…
van Baar, M. E.; Essink-Bot, M. L.; Oen, I. M. M. H.; Dokter, J.; Boxma, H.; Hinson, M. I.; van Loey, N. E. E.; Faber, A. W.; van Beeck, E. F.
The Health Outcomes Burn Questionnaire (HOBQ) is a self-administered questionnaire to monitor outcome after burns in young children. This study aimed to assess feasibility, reliability and validity of the Dutch version of the HOBQ. The HOBQ was adapted into Dutch and tested in a population of
Adler, Lenard A.; Faraone, Stephen V.; Spencer, Thomas J.; Michelson, David; Reimherr, Frederick W.; Glatt, Stephen J.; Marchant, Barrie K.; Biederman, Joseph
Objective: Little information is available comparing self- versus investigator ratings of symptoms in adult ADHD. The authors compared the reliability, validity, and utility in a sample of adults with ADHD and also as an index of clinical improvement during treatment of self- and investigator ratings of ADHD symptoms via the Conners Adult ADHD…
Peterson, Anne C.; And Others
The Self-Image Questionnaire for Young Adolescents (SIQYA), an adaptation of the Offer Self-Image Questionnaire (OSIQ), designed to measure aspects of self-image among young adolescents, was administered to two groups of sixth graders. The development of the SIQYA is described and reliability and validity results are presented. (EGS)
Sayin, Ayfer; Sahin, Mustafa Yasar
The present study aimed to provide a Turkish adaptation of the Organizational Justice in Sport Scale and perform reliability and validity studies. Answers provided by 260 participants who work as football, male basketball and female basketball coaches in National Collegiate Athletic Association (NCAA) were analysed using the original scale that…
Thompson, Bruce; Cook, Colleen
Research libraries are increasingly supplementing collection counts with perceptions of service quality as indices of status and productivity. The present study was undertaken to explore the reliability and validity of scores from the SERVQUAL measurement protocol (A. Parasuraman and others, 1991), which has previously been used in this type of…
The eButton takes frontal images at 4 second intervals throughout the day. A three-dimensional (3D) manually administered wire mesh procedure has been developed to quantify portion sizes from the two-dimensional (2D) images. This paper reports a test of the interrater reliability and validity of use...
The purpose of this study is to develop a valid and reliable measurement tool to determine the social media addictions of secondary school, high school and university students. 998 students participated in the study. 476 students from secondary schools, high schools and universities participated in the first application during which the…
Cintas, Holly Lea; Parks, Rebecca; Don, Sarah; Gerber, Lynn
Content validity and reliability of the Brief Assessment of Motor Function (BAMF) Upper Extremity Gross Motor Scale (UEGMS) were evaluated in this prospective, descriptive study. The UEGMS is one of five BAMF ordinal scales designed for quick documentation of gross, fine, and oral motor skill levels. Designed to be independent of age and…
Ramamoorthy, C.V.; Mok, Y.R.; Bastani, F.B.; Chin, G.
The necessity of a good methodology for the development of reliable software, especially with respect to the final software validation and testing activities, is discussed. A formal specification development and validation methodology is proposed. This methodology has been applied to the development and validation of a pilot software, incorporating typical features of critical software for nuclear power plants safety protection. The main features of the approach include the use of a formal specification language and the independent development of two sets of specifications. 1 ref
Stoner, Lee; Bonner, Chantel; Credeur, Daniel; Lambrick, Danielle; Faulkner, James; Wadsworth, Daniel; Williams, Michelle A
Monitoring central hemodynamic responses to an orthostatic challenge may provide important insight into autonomic nervous system function. Oscillometric pulse wave analysis devices have recently emerged, presenting clinically viable options for investigating central hemodynamic properties. The purpose of the current study was to determine whether oscillometric pulse wave analysis can be used to reliably (between-day) assess central blood pressure and central pressure augmentation (augmentation index) responses to a 5 min orthostatic challenge (modified tilt-table). Twenty healthy adults (26.4 y (SD 5.2), 55% F, 24.7 kg/m(2) (SD 3.8)) were tested on 3 different mornings in the fasted state, separated by a maximum of 7 days. Central hemodynamic variables were assessed on the left arm using an oscillometric device. Repeated measures analysis of variance indicated a significant main effect of the modified tilt-table for all central hemodynamic variables (P response to the tilt, central diastolic pressure increased by 4.5 mmHg (CI: 2.6, 6.4), central systolic blood pressure increased by 2.3 (CI: 4.4, 0.16) mmHg, and augmentation index decreased by an absolute - 5.3%, (CI: -2.7, -7.9%). The intra-class correlation coefficient values for central diastolic pressure (0.83-0.86), central systolic blood pressure (0.80-0.87) and AIx (0.79-0.82) were above the 0.75 criterion in both the supine and tilted positions, indicating excellent between-day reliability. Central hemodynamic responses to an orthostatic challenge can be assessed with acceptable between-day reliability using oscillometric pulse wave analysis. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Kneebone, Ian I.; Dewar, Sophie J.
Background: The current study aimed to examine the psychometric properties of an attributional style measure that can be administered remotely, to people who have multiple sclerosis (MS). Methods: A total of 495 participants with MS were recruited. Participants completed the Attributional Style Questionnaire-Survey (ASQ-S) and two comparison measures of cognitive variables via postal survey on three occasions, each 12 months apart. Internal reliability, test-retest reliability and congruent validity were considered. Results: The internal reliability of the ASQ-S was good (α > 0.7). The test-retest correlations were significant, but failed to reach the 0.7 set. The congruent validity of the ASQ-S was established relative to the comparisons. Conclusions: The psychometric properties of the ASQ-S indicate that it shows promise as a tool for researchers investigating depression in people with MS and is likely sound to use clinically in this population. PMID:28450893
Ricardo Franco de Lima
Full Text Available Abstract This paper aimed to verify evidences of validity and reliability of Luria-Nebraska Test for Children (TLN-C, in Portuguese. Three hundred eighty-seven students aged 6–13 years old, with learning difficulties, comprised the study. They were assessed with the Wechsler Intelligence Scale for Children (WISC-III and TLN-C; and effect of age differences, as well as accuracy rating by internal consistency were investigated. Age effects were found for all subtests and in the general score, except for receptive speech subtest, even when total IQ effect was controlled. Reliability analysis had satisfactory results (0.79. The TLN-C showed evidences of validity and reliability. Receptive speech subtest requires revision.
Reliable Multicast Protocol (RMP) is a communication protocol that provides an atomic, totally ordered, reliable multicast service on top of unreliable IP multicasting. In this report, we develop formal models for RMP using existing automated verification systems, and perform validation on the formal RMP specifications. The validation analysis help identifies some minor specification and design problems. We also use the formal models of RMP to generate a test suite for conformance testing of the implementation. Throughout the process of RMP development, we follow an iterative, interactive approach that emphasizes concurrent and parallel progress of implementation and verification processes. Through this approach, we incorporate formal techniques into our development process, promote a common understanding for the protocol, increase the reliability of our software, and maintain high fidelity between the specifications of RMP and its implementation.
Chung, Mi Ja; Park, Youngrye; Eun, Young
The aim of this study was to examine the validity and reliability of the Korean Version of the Spiritual Care Competence Scale (K-SCCS). A cross-sectional study design was used. The K-SCCS consisted of 26 questions to measure spiritual care competence of nurses. Participants, 228 nurses who had more than 3 years'experience as a nurse, completed the survey. Confirmatory factor analysis was used to examine the construct validity and correlations of K-SCCS and spiritual well-being (SWB) were used to examine the criterion validity of K-SCCS. Cronbach's alpha was used to test internal consistency. The construct and the criterion-related validity of K-SCCS were supported as measures of spiritual care competence. Cronbach's alpha was .95. Factor loadings of the 26 questions ranged from .60 to .96. Construct validity of K-SCCS was verified by confirmatory factor analysis (RMSEA=.08, CFI=.90, NFI=.85). Criterion validity compared to the SWB showed significant correlation (r=.44, pspiritual care competence with validity and reliability. However, further study is needed to retest the verification of the factor analysis related to factor 2 (professionalisation and improving the quality of spiritual care) and factor 3 (personal support and patient counseling). Therefore, we recommend using the total score without distinguishing subscales.
Tezcaner, Zahide Çiler; Aksoy, Songül
This study aims to test the validity and reliability of the Turkish version of the Voice-Related Quality of Life (V-RQOL) questionnaire. This is a nonrandomized, prospective study with control group. The questionnaire was administered to 249 individuals-130 with vocal complaint and 119 without-with a mean age of 37.8 ± 12.3 years. The Turkish version of the Voice Handicap Index (VHI) and perceptual voice evaluation measures were also administered at 2-14 days for retest reliability. The instrument was submitted to validity and reliability evaluation. The V-RQOL measure showed a strong internal consistency and test-retest reliability; the Cronbach's alpha coefficient for the overall V-RQOL was 0.969, the physical functioning domain was 0.949, and the social-emotional domain was 0.940. In the test-retest reliability test, the overall V-RQOL was found to be 0.989. The construct validity of the V-RQOL was determined based on the strength and direction of its relation to the VHI and the perceptual voice evaluation measure. The higher the VHI level, the lower the physical functioning, social-emotional, and overall score levels of the V-RQOL (r = -0.927, r = -0.912, r = -0.944, respectively; P reliability and validity and may play a crucial role in evaluating Turkish-speaking patients with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Mehmet Emrah Karadere
Full Text Available Reasoning with Inductive Argument Test:A Study of Validity and Reliability Objective: The aim of our study is to research reliability and validity and to evaluate the usability of Turkish version of Reasoning with Inductive Argument Test (RIAT in Turkish healty population. Method: 51 healty volunteers who work in Ankara Dıskapi Yildirim Beyazit Research and Training Hospital participated in this study. Reasoning with Inductive Argument Test (RIAT was translated into Turkish by three clinical good knowledge of English. Participants were given a sociodemographic data form, and RIAT were performed by clinicians. To test the reliability of the Turkish version of RIAT, Cronbach’s alpha coefficient was calculated and the halving method was used for the test. Results: The internal consistency of the Reasoning with Inductive Argument Test (RIAT items, Cronbach’s alpha internal consistency coefficient measurements of 0.73 was found to be statistically significant. Spearman-Brown coefficient that determines the reliability of the whole test r=0.74 was found. Kurtosis values of all the items was below 1.5 and the percentages in the second evaluation were mainly lower. At the same time, both change in belief between self produced RIAT options and given RIAT options (p=0.02, z=-2296 as well as changes in beliefs between related and unrelated items for Obsessive Compulsive Disorder (OCD difference (p=0.03, z=-2.199 were significant. Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that ‘Reasoning with Inductive Argument Test’ supports reliability and validity in Turkish population.
Cabanas-Sánchez, Verónica; Martínez-Gómez, David; Esteban-Cornejo, Irene; Castro-Piñero, José; Conde-Caveda, Julio; Veiga, Óscar L
To develop a questionnaire able to assess time spent by youth in a wide range of leisure-time sedentary behaviors (SB) and evaluate its test-retest reliability and criterion validity. Cross-sectional observational. The reliability sample included 194 youth, aged 10-18 years, who completed the questionnaire twice, separated by one-week interval. The validity study comprised 1207 participants aged 8-18 years. Participants wore an accelerometer for 7 consecutive days. The questionnaire was designed to assess the amount of time spent in twelve different SB during weekdays and weekends, separately. In order to avoid usual phenomenon of time over reporting, values were adjusted to real available leisure-time (LT) for each participant. Reliability was assessed by using Intraclass Correlation Coefficients (ICC) and weighted (quadratic) kappa (k), and validity was assessed by using Pearson correlation and Bland-Altman plots. The reliability of questionnaire showed a moderate-to-substantial agreement for the most (91%) of items (k=0.43-0.74; ICC=0.41-0.79) with three items (4%) reaching an almost perfect agreement (ICC=0.82-0.83). Only 'sitting and talking' evidenced fair-to-moderate reliability (k=0.27-0.39; ICC=0.34-0.46). The relationship between average sedentary time assessed by the questionnaire and accelerometry was moderate (r=0.36; pquestionnaire and accelerometer sedentary time for average day (r=0.05; p=0.11) but Bland-Altman plots suggest moderate discrepancies between both methods of SB measurement (mean=19.86; limits of agreement=-280.04 to 319.76). The questionnaire showed moderate to good test-retest reliability and a moderate level of validity for assessing SB in youth, similar or slightly better to previously published in this population. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Full Text Available Abstract: Introduction: The eHEALS is an 8-item measure of eHealth literacy developed to measure consumers’ combined knowledge, comfort, and perceived skills at finding, evaluating, and applying electronic health information to health problems. The current study aims to measure validity and reliability of the Iranian version of eHEALS questionnaire in a population context. Materials & Methods: A cross-sectional study was done on 525 youths people who has been chosen randomly in Iran, Yazd. We determined content validity, construct validity and predictive validity of the translated questionnaire. Principal components factor analysis was used to determine the theoretical fit of the measures with the data. The internal consistency of the translated questionnaire was evaluated using Cronbach α coefficient. The results were analyzed in SPSSv16. Results: The principal component analysis (PCA produced a single factor solution (70.48% of variance with factor loading ranging from 0.723 to 0.862. The internal consistency of the scale was sufficient (alpha=0.88 , P<0.001 and the test-retest coefficients for the items were reliable (r= 0.96, P<0.001. Discussion: The results of the study showed that the items in the translated questionnaire were equivalent to the original scale .The version of the eHEALS questionnaire showed both good reliability and validity for the screening of eHealth literacy of Iranian people.
Reijman, M; Hazes, J M W; Pols, H A P; Bernsen, R M D; Koes, B W; Bierma-Zeinstra, S M A
To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the validity of the three definitions of hip OA is sex dependent. from the Rotterdam study (aged > or= 55 years, n = 3585) were evaluated. The inter-rater reliability was tested in a random set of 148 x rays. The validity was expressed as the ability to identify patients who show clinical symptoms of hip OA (construct validity) and as the ability to predict total hip replacement (THR) at follow up (predictive validity). Inter-rater reliability was similar for the Kellgren and Lawrence grade and MJS (kappa statistics 0.68 and 0.62, respectively) but lower for Croft's grade (kappa statistic, 0.51). The Kellgren and Lawrence grade and MJS showed the strongest associations with clinical symptoms of hip OA. Sex appeared to be an effect modifier for Kellgren and Lawrence and MJS definitions, women showing a stronger association between grading and symptoms than men. However, the sex dependency was attributed to differences in height between women and men. The Kellgren and Lawrence grade showed the highest predictive value for THR at follow up. Based on these findings, Kellgren and Lawrence still appears to be a useful OA definition for epidemiological studies focusing on the presence of hip OA.
Brindle, Richard A; Ebaugh, D David; Milner, Clare E
Side-lying hip abductor strength tests are commonly used to evaluate muscle strength. In a 'break' test the tester applies sufficient force to lower the limb to the table while the patient resists. The peak force is postulated to occur while the leg is lowering, thus representing the participant's eccentric muscle strength. However, it is unclear whether peak force occurs before or after the leg begins to lower. To determine intra-rater reliability and construct validity of a hip abductor eccentric strength test. Intra-rater reliability and construct validity study. Twenty healthy adults (26 ±6 years; 1.66 ±0.06 m; 62.2 ±8.0 kg) made two visits to the laboratory at least one week apart. During the hip abductor eccentric strength test, a hand-held dynamometer recorded peak force and time to peak force and limb position was recorded via a motion capture system. Intra-rater reliability was determined using intra-class correlation (ICC), standard error of measurement (SEM), and minimal detectable difference (MDD). Construct validity was assessed by determining if peak force occurred after the start of the lowering phase using a one-sample t-test. The hip abductor eccentric strength test had substantial intra-rater reliability (ICC( 3,3 ) = 0.88; 95% confidence interval: 0.65-0.95), SEM of 0.9%BWh, and a MDD of 2.5%BWh. Construct validity was established as peak force occurred 2.1s (±0.6s; range 0.7s to 3.7s) after the start of the lowering phase of the test (p ≤ 0.001). The hip abductor eccentric strength test is a valid and reliable measure of eccentric muscle strength. This test may be used clinically to assess changes in eccentric muscle strength over time.
Tim, Carla Roberta; Martignago, Cintia Cristina Santi; da Silva, Viviane Ribeiro; Dos Santos, Estefany Camila Bonfim; Vieira, Fabiana Nascimento; Parizotto, Nivaldo Antonio; Liebano, Richard Eloin
Objective: Technological advances have provided new alternatives to the analysis of skin flap viability in animal models; however, the interrater validity and reliability of these techniques have yet to be analyzed. The present study aimed to evaluate the interrater validity and reliability of three different methods: weight of paper template (WPT), paper template area (PTA), and photographic analysis. Approach: Sixteen male Wistar rats had their cranially based dorsal skin flap elevated. On the seventh postoperative day, the viable tissue area and the necrotic area of the skin flap were recorded using the paper template method and photo image. The evaluation of the percentage of viable tissue was performed using three methods, simultaneously and independently by two raters. The analysis of interrater reliability and viability was performed using the intraclass correlation coefficient and Bland Altman Plot Analysis was used to visualize the presence or absence of systematic bias in the evaluations of data validity. Results: The results showed that interrater reliability for WPT, measurement of PTA, and photographic analysis were 0.995, 0.990, and 0.982, respectively. For data validity, a correlation >0.90 was observed for all comparisons made between the three methods. In addition, Bland Altman Plot Analysis showed agreement between the comparisons of the methods and the presence of systematic bias was not observed. Innovation: Digital methods are an excellent choice for assessing skin flap viability; moreover, they make data use and storage easier. Conclusion: Independently from the method used, the interrater reliability and validity proved to be excellent for the analysis of skin flaps' viability.
Multanen, Juhani; Honkanen, Mikko; Häkkinen, Arja; Kiviranta, Ilkka
The Knee Injury and Osteoarthritis Outcome Score (KOOS) is a commonly used knee assessment and outcome tool in both clinical work and research. However, it has not been formally translated and validated in Finnish. The purpose of this study was to translate and culturally adapt the KOOS questionnaire into Finnish and to determine its validity and reliability among Finnish middle-aged patients with knee injuries. KOOS was translated and culturally adapted from English into Finnish. Subsequently, 59 patients with knee injuries completed the Finnish version of KOOS, Western Ontario and McMaster Osteoarthritis Index (WOMAC), Short-Form 36 Health Survey (SF-36) and Numeric Pain Rating Scale (Pain-NRS). The same KOOS questionnaire was re-administered 2 weeks later. Psychometric assessment of the Finnish KOOS was performed by testing its construct validity and reliability by using internal consistency, test-retest reliability and measurement error. The floor and ceiling effects were also examined. The cross-cultural adaptation revealed only minor cultural differences and was well received by the patients. For construct validity, high to moderate Spearman's Correlation Coefficients were found between the KOOS subscales and the WOMAC, SF-36, and Pain-NRS subscales. The Cronbach's alpha was from 0.79 to 0.96 for all subscales indicating acceptable internal consistency. The test-retest reliability was good to excellent, with Intraclass Correlation Coefficients ranging from 0.73 to 0.86 for all KOOS subscales. The minimal detectable change ranged from 17 to 34 on an individual level and from 2 to 4 on a group level. No floor or ceiling effects were observed. This study yielded an appropriately translated and culturally adapted Finnish version of KOOS which demonstrated good validity and reliability. Our data indicate that the Finnish version of KOOS is suitable for assessment of the knee status of Finnish patients with different knee complaints. Further studies are needed to
Full Text Available Introduction: Many studies reported poorer quality of life (QoL in youth with diabetes compared to healthy peers. One of the tools used is the Diabetes Quality of Life for Youth (DQoLY questionnaire in English. A validated instrument in Malay is needed to assess the perception of QoL among youth with diabetes in Malaysia. Objective: To translate the modified version, i.e., the DQoLY questionnaire,into Malay and determine its reliability and validity.Methods: Translation and back-translation were used. An expert panel reviewed the translated version for conceptual and content equivalence. The final version was then administered to youths with type 1 diabetes mellitus from the universities and Ministry of Health hospitals between August 2006 and September 2007. Reliability was analysed using Cronbach’s alpha, while validity was confirmed using concurrent validity (HbA1c and self-rated health score.Results: A total of 82 youths with type 1 diabetes (38 males aged 10-18 years were enrolled from eight hospitals. The reliability of overall questionnaire was 0.917, and the reliabilities of the three domains ranged from 0.832 to 0.867. HbA1c was positively correlated with worry (p=0.03. The self-rated health score was found to have significant negative correlation with the “satisfaction” (p=0.013 and “impact” (p=0.007 domains.Conclusion: The Malay translated version of DQoLY questionnaire was reliable and valid to be used among youths with type 2 diabetes in Malaysia.