validity test-retest reliability: Topics by WorldWideScience.org

Sample records for validity test-retest reliability

Test-retest reliability and predictive validity of the Implicit Association Test in children.

Science.gov (United States)

Rae, James R; Olson, Kristina R

2018-02-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research-lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Test re-test reliability and construct validity of the star-track test of manual dexterity

DEFF Research Database (Denmark)

Kildebro, Niels; Amirian, Ilda; Gögenur, Ismail

2015-01-01

Objectives. We wished to determine test re-test reliability and construct validity of the star-track test of manual dexterity. Design. Test re-test reliability was examined in a controlled study. Construct validity was tested in a blinded randomized crossover study. Setting. The study was performed...... at a university hospital in Denmark. Participants. A total of 11 subjects for test re-test and 20 subjects for the construct validity study were included. All subjects were healthy volunteers. Intervention. The test re-test trial had two measurements with 2 days pause in between. The interventions...... in the construct validity study included baseline measurement, intervention 1: fatigue, intervention 2: stress, and intervention 3: fatigue and stress. There was a 2 day pause between each intervention. Main outcome measure. An integrated measure of completion time and number of errors was used. Results. All...
Construct Validity and Test-Retest Reliability of the Climbing Stairs Questionnaire in Lower-Limb Amputees

NARCIS (Netherlands)

de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

de Laat FA, Rommers GM, Geertzen JH, Roorda LD. Construct validity and test-retest reliability of the Climbing Stairs Questionnaire in lower-limb amputees. Arch Phys Med Rehabil 2010;91:1396-401. Objective: To investigate the construct validity and test-retest reliability of the Climbing Stairs
Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

Science.gov (United States)

Rae, James R.; Olson, Kristina R.

2018-01-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

Science.gov (United States)

Mills, Tamara L; Holm, Margo B; Schmeler, Mark

2007-01-01

The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.
Development, test-retest reliability, and construct validity of the resistance training skills battery.

Science.gov (United States)

Lubans, David R; Smith, Jordan J; Harries, Simon K; Barnett, Lisa M; Faigenbaum, Avery D

2014-05-01

The aim of this study was to describe the development and assess test-retest reliability and construct validity of the Resistance Training Skills Battery (RTSB) for adolescents. The RTSB provides an assessment of resistance training skill competency and includes 6 exercises (i.e., body weight squat, push-up, lunge, suspended row, standing overhead press, and front support with chest touches). Scoring for each skill is based on the number of performance criteria successfully demonstrated. An overall resistance training skill quotient (RTSQ) is created by adding participants' scores for the 6 skills. Participants (44 boys and 19 girls, mean age = 14.5 ± 1.2 years) completed the RTSB on 2 occasions separated by 7 days. Participants also completed the following fitness tests, which were used to create a muscular fitness score (MFS): handgrip strength, timed push-up, and standing long jump tests. Intraclass correlation (ICC), paired samples t-tests, and typical error were used to assess test-retest reliability. To assess construct validity, gender and RTSQ were entered into a regression model predicting MFS. The rank order repeatability of the RTSQ was high (ICC = 0.88). The model explained 39% of the variance in MFS (p ≤ 0.001) and RTSQ (r = 0.40, p ≤ 0.001) was a significant predictor. This study has demonstrated the construct validity and test-retest reliability of the RTSB in a sample of adolescents. The RTSB can reliably rank participants in regards to their resistance training competency and has the necessary sensitivity to detect small changes in resistance training skill proficiency.
Test-retest reliability of the Work Ability Index questionnaire

NARCIS (Netherlands)

de Zwart, B. C. H.; Frings-Dresen, M. H. W.; Van Duivenbooden, J. C.

2002-01-01

The goal of the study was to assess the test-retest reliability of the Work Ability Index (WAI) questionnaire. Reliability was tested using a test-retest design with a 4 week interval between measurements. Valid data were collected among 97 elderly construction workers aged 40 years and older. We
Test-retest reliability and validity of the Sniffin' TOM odor memory test.

Science.gov (United States)

Croy, Ilona; Zehner, Cora; Larsson, Maria; Zucco, Gesualdo M; Hummel, Thomas

2015-03-01

Few attempts have been made to develop an olfactory test that captures episodic retention of olfactory information. Assessment of episodic odor memory is of particular interest in aging and in the cognitively impaired as both episodic memory deficits and olfactory loss have been targeted as reliable hallmarks of cognitive decline and impending dementia. Here, 96 healthy participants (18-92 years) and an additional 19 older people with mild cognitive impairment were tested (73-82 years). Participants were presented with 8 common odors with intentional encoding instructions that were followed by a yes-no recognition test. After recognition completion, participants were asked to identify all odors by means of free or cued identification. A retest of the odor memory test (Sniffin' TOM = test of odor memory) took place 17 days later. The results revealed satisfactory test-retest reliability (0.70) of odor recognition memory. Both recognition and identification performance were negatively affected by age and more pronounced among the cognitively impaired. In conclusion, the present work presents a reliable, valid, and simple test of episodic odor recognition memory that may be used in clinical groups where both episodic memory deficits and olfactory loss are prevalent preclinically such as Alzheimer's disease. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

Science.gov (United States)

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (Test possessed low CV and significant (pTest possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Development of an Agility Test for Badminton Players and Assessment of Its Validity and Test-Retest Reliability.

Science.gov (United States)

Loureiro, Luiz de França Bahia; de Freitas, Paulo Barbosa

2016-04-01

Badminton requires open and fast actions toward the shuttlecock, but there is no specific agility test for badminton players with specific movements. To develop an agility test that simultaneously assesses perception and motor capacity and examine the test's concurrent and construct validity and its test-retest reliability. The Badcamp agility test consists of running as fast as possible to 6 targets placed on the corners and middle points of a rectangular area (5.6 × 4.2 m) from the start position located in the center of it, following visual stimuli presented in a luminous panel. The authors recruited 43 badminton players (17-32 y old) to evaluate concurrent (with shuttle-run agility test--SRAT) and construct validity and test-retest reliability. Results revealed that Badcamp presents concurrent and construct validity, as its performance is strongly related to SRAT (ρ = 0.83, P < .001), with performance of experts being better than nonexpert players (P < .01). In addition, Badcamp is reliable, as no difference (P = .07) and a high intraclass correlation (ICC = .93) were found in the performance of the players on 2 different occasions. The findings indicate that Badcamp is an effective, valid, and reliable tool to measure agility, allowing coaches and athletic trainers to evaluate players' athletic condition and training effectiveness and possibly detect talented individuals in this sport.
Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.

Science.gov (United States)

Taylor, Karen; Bulsara, Max; Monterosso, Leanne

2018-01-01

Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors ( n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors ( n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
Adaptation, test-retest reliability, and construct validity of the Physical Activity Neighborhood Environment Scale in Nigeria (PANES-N).

Science.gov (United States)

Oyeyemi, Adewale L; Sallis, James F; Oyeyemi, Adetoyeje Y; Amin, Mariam M; De Bourdeaudhuij, Ilse; Deforche, Benedicte

2013-11-01

This study adapted the Physical Activity Neighborhood Environment Scale (PANES) to the Nigerian context and assessed the test-retest reliability and construct validity of the Nigerian version (PANESN). A multidisciplinary panel of experts adapted the original PANES to reflect the built and social environment of Nigeria. The adapted PANES was subjected to cognitive testing and test retest reliability in a diverse sample of Nigerian adults (N = 132) from different neighborhood types. Intraclass Correlation Coefficients (ICC) was used to assess test-retest reliability, and construct validity was investigated with Analysis of Covariance for differences in environmental attributes between neighborhoods. Four of the 17 items on the original PANES were significantly modified, 3 were removed and 2 new items were incorporated into the final version of adapted PANES-N. Test-retest reliability was substantial to almost perfect (ICC = 0.62-1.00) for all items on the PANES-N, and residents of neighborhoods in the inner city reported higher residential density, land use mix and safety, but lower pedestrian facilities and aesthetics than did residents of government reserved area/new layout neighborhoods. The PANES-N appears promising for assessing environmental perceptions related to physical activity in Nigeria, but further testing is required to assess its applicability across Africa.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

Directory of Open Access Journals (Sweden)

Singh Amika S

2011-12-01

Full Text Available Abstract Background Insight in children's energy balance-related behaviours (EBRBs and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77% showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23% and poor for one item. Construct validity appeared to be good to excellent for 70 (47% of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26% and poor for 41 items (27%. Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

Science.gov (United States)

2011-01-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Establishing survey validity and reliability for American Indians through "think aloud" and test-retest methods.

Science.gov (United States)

Hauge, Cindy Horst; Jacobs-Knight, Jacque; Jensen, Jamie L; Burgess, Katherine M; Puumala, Susan E; Wilton, Georgiana; Hanson, Jessica D

2015-06-01

The purpose of this study was to use a mixed-methods approach to determine the validity and reliability of measurements used within an alcohol-exposed pregnancy prevention program for American Indian women. To develop validity, content experts provided input into the survey measures, and a "think aloud" methodology was conducted with 23 American Indian women. After revising the measurements based on this input, a test-retest was conducted with 79 American Indian women who were randomized to complete either the original measurements or the new, modified measurements. The test-retest revealed that some of the questions performed better for the modified version, whereas others appeared to be more reliable for the original version. The mixed-methods approach was a useful methodology for gathering feedback on survey measurements from American Indian participants and in indicating specific survey questions that needed to be modified for this population. © The Author(s) 2015.
Establishing the Test-Retest Reliability & Concurrent Validity for the Repeat Ice Skating Test (RIST) in Adolescent Male Ice Hockey Players

Science.gov (United States)

Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William

2012-01-01

In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…
Test-retest and interrater reliability of the functional lower extremity evaluation.

Science.gov (United States)

Haitz, Karyn; Shultz, Rebecca; Hodgins, Melissa; Matheson, Gordon O

2014-12-01

Repeated-measures clinical measurement reliability study. To establish the reliability and face validity of the Functional Lower Extremity Evaluation (FLEE). The FLEE is a 45-minute battery of 8 standardized functional performance tests that measures 3 components of lower extremity function: control, power, and endurance. The reliability and normative values for the FLEE in healthy athletes are unknown. A face validity survey for the FLEE was sent to sports medicine personnel to evaluate the level of importance and frequency of clinical usage of each test included in the FLEE. The FLEE was then administered and rated for 40 uninjured athletes. To assess test-retest reliability, each athlete was tested twice, 1 week apart, by the same rater. To assess interrater reliability, 3 raters scored each athlete during 1 of the testing sessions. Intraclass correlation coefficients were used to assess the test-retest and interrater reliability of each of the FLEE tests. In the face validity survey, the FLEE tests were rated as highly important by 58% to 71% of respondents but frequently used by only 26% to 45% of respondents. Interrater reliability intraclass correlation coefficients ranged from 0.83 to 1.00, and test-retest reliability ranged from 0.71 to 0.95. The FLEE tests are considered clinically important for assessing lower extremity function by sports medicine personnel but are underused. The FLEE also is a reliable assessment tool. Future studies are required to determine if use of the FLEE to make return-to-play decisions may reduce reinjury rates.
Test-retest reliability of cognitive EEG

Science.gov (United States)

McEvoy, L. K.; Smith, M. E.; Gevins, A.

2000-01-01

OBJECTIVE: Task-related EEG is sensitive to changes in cognitive state produced by increased task difficulty and by transient impairment. If task-related EEG has high test-retest reliability, it could be used as part of a clinical test to assess changes in cognitive function. The aim of this study was to determine the reliability of the EEG recorded during the performance of a working memory (WM) task and a psychomotor vigilance task (PVT). METHODS: EEG was recorded while subjects rested quietly and while they performed the tasks. Within session (test-retest interval of approximately 1 h) and between session (test-retest interval of approximately 7 days) reliability was calculated for four EEG components: frontal midline theta at Fz, posterior theta at Pz, and slow and fast alpha at Pz. RESULTS: Task-related EEG was highly reliable within and between sessions (r0.9 for all components in WM task, and r0.8 for all components in the PVT). Resting EEG also showed high reliability, although the magnitude of the correlation was somewhat smaller than that of the task-related EEG (r0.7 for all 4 components). CONCLUSIONS: These results suggest that under appropriate conditions, task-related EEG has sufficient retest reliability for use in assessing clinical changes in cognitive status.
Internal Consistency, Retest Reliability, and their Implications For Personality Scale Validity

Science.gov (United States)

McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio

2010-01-01

We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807
Test-retest reliability and construct validity of the Helplessness, Hopelessness, and Haplessness Scale in patients with anxiety disorders.

Science.gov (United States)

Vatan, Sevginar; Ertaş, Sedar; Lester, David

2011-04-01

In a sample of 100 Turkish psychiatric patients with diagnoses of anxiety disorders, Lester's Helplessness, Hopelessness, and Haplessness inventory had moderate estimates of internal consistency, test-retest reliability, and construct validity.

Test-retest reliability and construct validity of the DOiT (Dutch Obesity Intervention in Teenagers) questionnaire: measuring energy balance-related behaviours in Dutch adolescents.

Science.gov (United States)

Janssen, Evelien H C; Singh, Amika S; van Nassau, Femke; Brug, Johannes; van Mechelen, Willem; Chinapaw, Mai J M

2014-02-01

Adequate assessment of energy balance-related behaviours in adolescents is essential to develop and evaluate effective obesity prevention programmes. The present study examined the test-retest reliability and construct validity of a questionnaire assessing energy balance-related behaviours in adolescents during the evaluation of the DOiT (Dutch Obesity Intervention in Teenagers) intervention. To assess test-retest reliability, adolescents filled in the questionnaire twice (n 111). To assess construct validity, the results from the first test were compared with data collected in a personal cognitive interview (n 20, independent from the reliability study). For both reliability and validity, intraclass correlation coefficients for continuous data or Cohen's kappa coefficients for categorical data were calculated as well as percentage agreement. Data were collected during school time from February to May 2010. Study participants were Dutch adolescents aged 12-14 years attending pre-vocational secondary schools. In more than three-quarters of the ninety-five questionnaire items the test-retest reliability appeared to be good to excellent. Moderate reliability was found for all other twenty-one items. Fifty-one items (of ninety-five items) showed good to excellent construct validity. Construct validity appeared moderate in twenty-three items and poor in twenty-one items. Most items with poor construct validity concerned consumption of sugar-containing beverages and high-energy snacks/sweets. Our study showed good test-retest reliability and largely moderate to good construct validity for the majority of items of the DOiT questionnaire. Items with poor construct validity (most of them found for items concerning energy intake-related behaviours) should be revised and tested again to improve the questionnaire for future use.
Construct Validity and Test-Retest Reliability of the Walking Questionnaire in People With a Lower Limb Amputation

NARCIS (Netherlands)

de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

Objective: To investigate the construct validity and test-retest reliability of the Walking Questionnaire, a patient-reported measure of activity limitations in walking in people with a lower limb amputation. Design: Cross-sectional study. Setting: Outpatient department of a rehabilitation center.
Validity and test-retest reliability of a novel simple back extensor muscle strength test.

Science.gov (United States)

Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

2017-01-01

To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r = 0.824, p strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p strength ( p strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.
The role of test-retest reliability in measuring individual and group differences in executive functioning.

Science.gov (United States)

Paap, Kenneth R; Sawi, Oliver

2016-12-01

Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.
Test-retest reliability of infant event related potentials evoked by faces.

Science.gov (United States)

Munsters, N M; van Ravenswaaij, H; van den Boomen, C; Kemner, C

2017-04-05

Reliable measures are required to draw meaningful conclusions regarding developmental changes in longitudinal studies. Little is known, however, about the test-retest reliability of face-sensitive event related potentials (ERPs), a frequently used neural measure in infants. The aim of the current study is to investigate the test-retest reliability of ERPs typically evoked by faces in 9-10 month-old infants. The infants (N=31) were presented with neutral, fearful and happy faces that contained only the lower or higher spatial frequency information. They were tested twice within two weeks. The present results show that the test-retest reliability of the face-sensitive ERP components is moderate (P400 and Nc) to substantial (N290). However, there is low test-retest reliability for the effects of the specific experimental manipulations (i.e. emotion and spatial frequency) on the face-sensitive ERPs. To conclude, in infants the face-sensitive ERP components (i.e. N290, P400 and Nc) show adequate test-retest reliability, but not the effects of emotion and spatial frequency on these ERP components. We propose that further research focuses on investigating elements that might increase the test-retest reliability, as adequate test-retest reliability is necessary to draw meaningful conclusions on individual developmental trajectories of the face-sensitive ERPs in infants. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Test-retest reliability of the Military Pre-training Questionnaire.

Science.gov (United States)

Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

2010-09-01

Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
The Physical Activity Scale for Individuals with Physical Disabilities: test-retest reliability and comparison with an accelerometer.

Science.gov (United States)

van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem

2007-01-01

The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects' diagnoses were: stroke, spinal cord injury, whiplash, and neurological-, orthopedic- or back disorders. The PASIPD is a 7-d recall physical activity questionnaire that was completed twice, 1 wk apart. During this week, physical activity was also measured with an Actigraph accelerometer. The test-retest reliability Spearman correlation of the PASIPD was 0.77. The criterion validity Spearman correlation was 0.30 when compared to the accelerometer. The PASIPD had test-retest reliability and criterion validity that is comparable to well established self-report physical activity questionnaires from the general population.
Re-test reliability of gustatory testing and introduction of the sensitive Taste-Drop-Test

DEFF Research Database (Denmark)

Fjaeldstad, A; Niklassen, A; Fernandes, H

2018-01-01

. Testing gustatory function can be important for diagnostics and assessment of treatment effects. However, the gustatory tests applied are required to be both sensitive and reliable.In this study, we investigate the re-test validity of popular Taste Strips gustatory test for gustatory screening....... Furthermore, we introduce a new sensitive Taste-Drop-Test, which was found to be superior for detecting a more accurate measure of tastant sensitivity....
Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project

Directory of Open Access Journals (Sweden)

Singh Amika S

2012-08-01

Full Text Available Abstract Background Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10–12 year old children. Findings We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study of 10–12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement. All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. Conclusions The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
Test-retest reliability of the multifocal photopic negative response.

Science.gov (United States)

Van Alstine, Anthony W; Viswanathan, Suresh

2017-02-01

To assess the test-retest reliability of the multifocal photopic negative response (mfPhNR) of normal human subjects. Multifocal electroretinograms were recorded from one eye of 61 healthy adult subjects on two separate days using a Visual Evoked Response Imaging System software version 4.3 (EDI, San Mateo, California). The visual stimulus delivered on a 75-Hz monitor consisted of seven equal-sized hexagons each subtending 12° of visual angle. The m-step exponent was 9, and the m-sequence was slowed to include at least 30 blank frames after each flash. Only the first slice of the first-order kernel was analyzed. The mfPhNR amplitude was measured at a fixed time in the trough from baseline (BT) as well as at the same fixed time in the trough from the preceding b-wave peak (PT). Additionally, we also analyzed BT normalized either to PT (BT/PT) or to the b-wave amplitude (BT/b-wave). The relative reliability of test-retest differences for each test location was estimated by the Wilcoxon matched-pair signed-rank test and intraclass correlation coefficients (ICC). Absolute test-retest reliability was estimated by Bland-Altman analysis. The test-retest amplitude differences for neither of the two measurement techniques were statistically significant as determined by Wilcoxon matched-pair signed-rank test. PT measurements showed greater ICC values than BT amplitude measurements for all test locations. For each measurement technique, the ICC value of the macular response was greater than that of the surrounding locations. The mean test-retest difference was close to zero for both techniques at each of the test locations, and while the coefficient of reliability (COR-1.96 times the standard deviation of the test-retest difference) was comparable for the two techniques at each test location when expressed in nanovolts, the %COR (COR normalized to the mean test and retest amplitudes) was superior for PT than BT measurements. The ICC and COR were comparable for the BT/PT and
Interrater and test-retest reliability and validity of the Norwegian version of the BESTest and mini-BESTest in people with increased risk of falling.

Science.gov (United States)

Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L

2017-04-20

The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Fall Efficacy Scale-International (FES-I), and it was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or Multiple Sclerosis) were assessed twice by two raters. Relative reliability was analysed with Intraclass Correlation Coefficient (ICC), and absolute reliability with standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC 95 = 5.0) and test-retest reliability (rater A/rater B = ICC = 0.89/0.89, SEM = 3.9/4.3, SDC 95 = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC 95 = 3.3) and test-retest reliability (rater A/rater B = ICC = 0.85/0.84, SEM = 1.8/1.9, SDC 95 = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho -0.51 and-0.50, p test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinic and in research.
Test-retest reliability of the Progressive Isoinertial Lifting Evaluation (PILE).

Science.gov (United States)

Lygren, Hildegunn; Dragesund, Tove; Joensen, Jón; Ask, Tove; Moe-Nilssen, Rolf

2005-05-01

A repeated measures single group design. To investigate test-retest reliability of Progressive Isoinertial Lifting Evaluation on patients with long lasting musculoskeletal problems related to the lumbar spine. Test-retest reliability has been satisfactory in healthy men. Test-retest reliability for clinical populations has not been reported. A total of 31 patients (17 women and 14 men) with long lasting low back pain participated in the study. The patients were tested twice at an interval of 2 days and at the same time of the day. The heaviest load that the patient could lift 4 times was used as outcome measure. The error of measurement indicates that the true result in 95% of cases will be within +/-4.5 kg from the measured value, while the difference between 2 measurements in 95% of cases will be less than 6.4 kg. Intra-class correlation (1,1) was 0.91. Relative test-retest reliability was high assessed by intra-class correlation, but absolute measurement variability reported as the smallest detectable difference has relevance for the interpretation of clinical test results and should also be considered.
Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

Science.gov (United States)

Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

2016-04-01

The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
Balance Assessment in Sports-Related Concussion: Evaluating Test-Retest Reliability of the Equilibrate System.

Science.gov (United States)

Odom, Mitchell J; Lee, Young M; Zuckerman, Scott L; Apple, Rachel P; Germanos, Theodore; Solomon, Gary S; Sills, Allen K

2016-01-01

This study evaluated the test-retest reliability of a novel computer-based, portable balance assessment tool, the Equilibrate System (ES), used to diagnose sports-related concussion. Twenty-seven students participated in ES testing consisting of three sessions over 4 weeks. The modified Balance Error Scoring System was performed. For each participant, test-retest reliability was established using the intraclass correlation coefficient (ICC). The ES test-retest reliability from baseline to week 2 produced an ICC value of 0.495 (95% CI, 0.123-0.745). Week 2 testing produced ICC values of 0.602 (95% CI, 0.279-0.803) and 0.610 (95% CI, 0.299-0.804), respectively. All other single measures test-retest reliability values produced poor ICC values. Same-day ES testing showed fair to good test-retest reliability while interweek measures displayed poor to fair test-retest reliability. Testing conditions should be controlled when using computerized balance assessment methods. ES testing should only be used as a part of a comprehensive assessment.
Multilevel Factor Structure, Concurrent Validity, and Test-Retest Reliability of the High School Teacher Version of the Authoritative School Climate Survey

Science.gov (United States)

Huang, Francis L.; Cornell, Dewey G.

2016-01-01

Although school climate has long been recognized as an important factor in the school improvement process, there are few psychometrically supported measures based on teacher perspectives. The current study replicated and extended the factor structure, concurrent validity, and test-retest reliability of the teacher version of the Authoritative…
Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

Science.gov (United States)

Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

2014-09-04

There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scales-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. The study findings suggest the IADCQ has appropriate psychometric characteristics as a
The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

Science.gov (United States)

Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

2017-10-23

Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (preliability was (ICC3,3) = 0.953 (pvalidity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Evaluating the reliability of an injury prevention screening tool: Test-retest study.

Science.gov (United States)

Gittelman, Michael A; Kincaid, Madeline; Denny, Sarah; Wervey Arnold, Melissa; FitzGerald, Michael; Carle, Adam C; Mara, Constance A

2016-10-01

A standardized injury prevention (IP) screening tool can identify family risks and allow pediatricians to address behaviors. To assess behavior changes on later screens, the tool must be reliable for an individual and ideally between household members. Little research has examined the reliability of safety screening tool questions. This study utilized test-retest reliability of parent responses on an existing IP questionnaire and also compared responses between household parents. Investigators recruited parents of children 0 to 1 year of age during admission to a tertiary care children's hospital. When both parents were present, one was chosen as the "primary" respondent. Primary respondents completed the 30-question IP screening tool after consent, and they were re-screened approximately 4 hours later to test individual reliability. The "second" parent, when present, only completed the tool once. All participants received a 10-dollar gift card. Cohen's Kappa was used to estimate test-retest reliability and inter-rater agreement. Standard test-retest criteria consider Kappa values: 0.0 to 0.40 poor to fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, and 0.81 to 1.00 as almost perfect reliability. One hundred five families participated, with five lost to follow-up. Thirty-two (30.5%) parent dyads completed the tool. Primary respondents were generally mothers (88%) and Caucasian (72%). Test-retest of the primary respondents showed their responses to be almost perfect; average 0.82 (SD = 0.13, range 0.49-1.00). Seventeen questions had almost perfect test-retest reliability and 11 had substantial reliability. However, inter-rater agreement between household members for 12 objective questions showed little agreement between responses; inter-rater agreement averaged 0.35 (SD = 0.34, range -0.19-1.00). One question had almost perfect inter-rater agreement and two had substantial inter-rater agreement. The IP screening tool used by a single individual had excellent
Validity, Reliability, and Sensitivity of a Volleyball Intermittent Endurance Test.

Science.gov (United States)

Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl

2017-03-01

To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P volleyball players.
Resting-state test-retest reliability of a priori defined canonical networks over different preprocessing steps.

Science.gov (United States)

Varikuti, Deepthi P; Hoffstaedter, Felix; Genon, Sarah; Schwender, Holger; Reid, Andrew T; Eickhoff, Simon B

2017-04-01

Resting-state functional connectivity analysis has become a widely used method for the investigation of human brain connectivity and pathology. The measurement of neuronal activity by functional MRI, however, is impeded by various nuisance signals that reduce the stability of functional connectivity. Several methods exist to address this predicament, but little consensus has yet been reached on the most appropriate approach. Given the crucial importance of reliability for the development of clinical applications, we here investigated the effect of various confound removal approaches on the test-retest reliability of functional-connectivity estimates in two previously defined functional brain networks. Our results showed that gray matter masking improved the reliability of connectivity estimates, whereas denoising based on principal components analysis reduced it. We additionally observed that refraining from using any correction for global signals provided the best test-retest reliability, but failed to reproduce anti-correlations between what have been previously described as antagonistic networks. This suggests that improved reliability can come at the expense of potentially poorer biological validity. Consistent with this, we observed that reliability was proportional to the retained variance, which presumably included structured noise, such as reliable nuisance signals (for instance, noise induced by cardiac processes). We conclude that compromises are necessary between maximizing test-retest reliability and removing variance that may be attributable to non-neuronal sources.

Resting-state test-retest reliability of a priori defined canonical networks over different preprocessing steps

Science.gov (United States)

Varikuti, Deepthi P.; Hoffstaedter, Felix; Genon, Sarah; Schwender, Holger; Reid, Andrew T.; Eickhoff, Simon B.

2016-01-01

Resting-state functional connectivity analysis has become a widely used method for the investigation of human brain connectivity and pathology. The measurement of neuronal activity by functional MRI, however, is impeded by various nuisance signals that reduce the stability of functional connectivity. Several methods exist to address this predicament, but little consensus has yet been reached on the most appropriate approach. Given the crucial importance of reliability for the development of clinical applications, we here investigated the effect of various confound removal approaches on the test-retest reliability of functional-connectivity estimates in two previously defined functional brain networks. Our results showed that grey matter masking improved the reliability of connectivity estimates, whereas de-noising based on principal components analysis reduced it. We additionally observed that refraining from using any correction for global signals provided the best test-retest reliability, but failed to reproduce anti-correlations between what have been previously described as antagonistic networks. This suggests that improved reliability can come at the expense of potentially poorer biological validity. Consistent with this, we observed that reliability was proportional to the retained variance, which presumably included structured noise, such as reliable nuisance signals (for instance, noise induced by cardiac processes). We conclude that compromises are necessary between maximizing test-retest reliability and removing variance that may be attributable to non-neuronal sources. PMID:27550015
Development, content validity and test-retest reliability of the Lifelong Physical Activity Skills Battery in adolescents.

Science.gov (United States)

Hulteen, Ryan M; Barnett, Lisa M; Morgan, Philip J; Robinson, Leah E; Barton, Christian J; Wrotniak, Brian H; Lubans, David R

2018-03-28

Numerous skill batteries assess fundamental motor skill (e.g., kick, hop) competence. Few skill batteries examine lifelong physical activity skill competence (e.g., resistance training). This study aimed to develop and assess the content validity, test-retest and inter-rater reliability of the "Lifelong Physical Activity Skills Battery". Development of the skill battery occurred in three stages: i) systematic reviews of lifelong physical activity participation rates and existing motor skill assessment tools, ii) practitioner consultation and iii) research expert consultation. The final battery included eight skills: grapevine, golf swing, jog, push-up, squat, tennis forehand, upward dog and warrior I. Adolescents (28 boys, 29 girls; M = 15.8 years, SD = 0.4 years) completed the Lifelong Physical Activity Skills Battery on two occasions two weeks apart. The skill battery was highly reliable (ICC = 0.84, 95% CI = 0.72-0.90) with individual skill reliability scores ranging from moderate (warrior I; ICC = 0.56) to high (tennis forehand; ICC = 0.82). Typical error (4.0; 95% CI 3.4-5.0) and proportional bias (r = -0.21, p = .323) were low. This study has provided preliminary evidence for the content validity and reliability of the Lifelong Physical Activity Skills Battery in an adolescent population.
Acoustic stapedial reflexes in healthy neonates: normative data and test-retest reliability.

Science.gov (United States)

Kei, Joseph

2012-01-01

The acoustic stapedial reflex (ASR) test provides useful information about the function of the auditory system. While it is frequently used with adults and children in a clinical setting, its use with young infants is limited. Presently, there are few data for neonates and inadequate research into the test-retest reliability of the ASR test. This study aimed to establish normative data and evaluate the test-retest reliability of the ASR test in healthy neonates. A cross-sectional experimental design was used to establish ASR normative data and assess the test-retest reliability of ASR thresholds obtained from healthy neonates. Sixty-eight full-term neonates with mean chronological age of 2.5 days (SD = 1.8 day), who passed the automated auditory brainstem response, transient evoked otoacoustic emission, and high frequency (1 kHz) tympanometry (HFT) tests. One randomly selected ear from each neonate was tested using TEOAE (transient evoked otoacoustic emission), HFT, and ASR tests using a 1 kHz probe tone. ASR thresholds were elicited by presenting pure tones of 0.5, 2, and 4 kHz and broadband noise (BBN) separately to the test ear in an ipsilateral stimulation mode. The ASR procedure was repeated to acquire retest data within the same testing session. Descriptive statistics, χ2, and analysis of variance with repeated measures tests were used to analyze ASR data. All neonates exhibited ASR when stimulated by tonal stimuli or BBN. The mean ASRTs (acoustic stapedial reflex thresholds) for the 0.5, 2, and 4 kHz tones were 81.6 ± 7.9, 71.3 ± 7.9, and 65.4 ± 8.7 dB HL, respectively. The mean ASRT for the BBN was estimated to be smaller than 57.2 dB HL, given the limitation of the equipment. The 95th percentiles of the ASRT were 95, 85, 80, and 75 dB HL for the 0.5, 2, and 4 kHz and BBN, respectively. The test-retest reliability of the ASR test for all stimuli was high, with no significant difference in mean ASRTs across the test and retest conditions. Test-retest
Test-retest reliability of a balance testing protocol with external perturbations in young healthy adults.

Science.gov (United States)

Robbins, Shawn M; Caplan, Ryan M; Aponte, Daniel I; St-Onge, Nancy

2017-10-01

External perturbations are utilized to challenge balance and mimic realistic balance threats in patient populations. The reliability of such protocols has not been established. The purpose was to examine test-retest reliability of balance testing with external perturbations. Healthy adults (n=34; mean age 23 years) underwent balance testing over two visits. Participants completed ten balance conditions in which the following parameters were combined: perturbation or non-perturbation, single or double leg, and eyes open or closed. Three trials were collected for each condition. Data were collected on a force plate and external perturbations were applied by translating the plate. Force plate center of pressure (CoP) data were summarized using 13 different CoP measures. Test-retest reliability was examined using intraclass correlation coefficients (ICC) and Bland-Altman plots. CoP measures of total speed and excursion in both anterior-posterior and medial-lateral directions generally had acceptable ICC values for perturbation conditions (ICC=0.46 to 0.87); however, many other CoP measures (e.g. range, area of ellipse) had unacceptable test-retest reliability (ICCbalance testing protocols that include external perturbations should be made to improve test-retest reliability and diminish learning including more extensive participant training and increasing the number of trials. CoP measures that consider all data points (e.g. total speed) are more reliable than those that only consider a few data points. Copyright © 2017 Elsevier B.V. All rights reserved.
The Comprehensive Snack Parenting Questionnaire (CSPQ: Development and Test-Retest Reliability

Directory of Open Access Journals (Sweden)

Dorus W. M. Gevers

2018-04-01

Full Text Available The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ covers 21 constructs. Test-retest reliability was assessed by calculating intra class correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intra class correlation coefficients (≥0.41 or agreement scores (≥0.60 for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive, but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children’s snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.
Test-retest reliability, smallest real difference and concurrent validity of six different balance tests on young people with mild to moderate intellectual disability.

Science.gov (United States)

Blomqvist, Sven; Wester, Anita; Sundelin, Gunnevi; Rehn, Börje

2012-12-01

Some studies have reported that people with intellectual disability may have reduced balance ability compared with the population in general. However, none of these studies involved adolescents, and the reliability and validity of balance tests in this population are not known. The purpose of this study was to examine the reliability of six different balance tests and to investigate their concurrent validity. Test-retest reliability assessment. All subjects were recruited from a special school for people with intellectual disability in Bollnäs, Sweden. Eighty-nine adolescents (35 females and 54 males) with mild to moderate intellectual disability with a mean age of 18 years (range 16 to 20 years). All subjects followed the same test protocol on two occasions within an 11-day period. Balance test performances. Intraclass correlation coefficients greater than 0.80 were achieved for four of the balance tests: Extended Timed Up and Go Test, Modified Functional Reach Test, One-leg Stance Test and Force Platform Test. The smallest real differences ranged from 12% to 40%; less than 20% is considered to be low. Concurrent validity among these balance tests varied between no and low correlation. The results indicate that these tests could be used to evaluate changes in balance ability over time in people with mild to moderate intellectual disability. The low concurrent validity illustrates the importance of knowing more about the influence of various sensory subsystems that are significant for balance among adolescents with intellectual disability. Copyright © 2011 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

Science.gov (United States)

Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

2014-01-01

This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Test-retest reliability for aerodynamic measures of voice.

Science.gov (United States)

Awan, Shaheen N; Novaleski, Carolyn K; Yingling, Julie R

2013-11-01

The purpose of this study was to investigate the intrasubject reliability of aerodynamic characteristics of the voice within typical/normal speakers across testing sessions using the Phonatory Aerodynamic System (PAS 6600; KayPENTAX, Montvale, NJ). Participants were 60 healthy young adults (30 males and 30 females) between the ages 18 and 31 years with perceptually typical voice. Participants were tested using the PAS 6600 (Phonatory Aerodynamic System) on two separate days with approximately 1 week between each session at approximately the same time of day. Four PAS protocols were conducted (vital capacity, maximum sustained phonation, comfortable sustained phonation, and voicing efficiency) and measures of expiratory volume, maximum phonation time, mean expiratory airflow (during vowel production) and target airflow (obtained via syllable repetition), peak air pressure, aerodynamic power, aerodynamic resistance, and aerodynamic efficiency were obtained during each testing session. Associated acoustic measures of vocal intensity and frequency were also collected. All phonations were elicited at comfortable pitch and loudness. All aerodynamic and associated variables evaluated in this study showed useable test-retest reliability (ie, intraclass correlation coefficients [ICCs] ≥ 0.60). A high degree of mean test-retest reliability was found across all subjects for aerodynamic and associated acoustic measurements of vital capacity, maximum sustained phonation, glottal resistance, and vocal intensity (all with ICCs > 0.75). Although strong ICCs were observed for measures of glottal power and mean expiratory airflow in males, weaker overall results for these measures (ICC range: 0.60-0.67) were observed in females subjects and sizable coefficients of variation were observed for measures of power, resistance, and efficiency in both men and women. Differences in degree of reliability from measure to measure were revealed in greater detail using methods such as ICCs and
Test-retest reliability and smallest detectable change of the Bristol Impact of Hypermobility (BIoH) questionnaire.

Science.gov (United States)

Palmer, S; Manns, S; Cramp, F; Lewis, R; Clark, E M

2017-12-01

The Bristol Impact of Hypermobility (BIoH) questionnaire is a patient-reported outcome measure developed in conjunction with adults with Joint Hypermobility Syndrome (JHS). It has demonstrated strong concurrent validity with the Short Form-36 (SF-36) physical component score but other psychometric properties have yet to be established. This study aimed to determine its test-retest reliability and smallest detectable change (SDC). A test-retest reliability study. Participants were recruited from the Hypermobility Syndromes Association, a patient organisation in the United Kingdom. Recruitment packs were sent to 1080 adults who had given permission to be contacted about research. BIoH and SF-36 questionnaires were administered at baseline and repeated two weeks later. An 11-point global rating of change scale (-5 to +5) was also administered at two weeks. Test-retest analysis and calculation of the SDC was conducted on 'stable' patients (defined as global rating of change -1 to +1). 462 responses were received. 233 patients reported a 'stable' condition and were included in analysis (95% women; mean (SD) age 44.5 (13.9) years; BIoH score 223.6 (54.0)). The BIoH questionnaire demonstrated excellent test-retest reliability (ICC 0.923, 95% CI 0.900-0.940). The SDC was 42 points (equivalent to 19% of the mean baseline score). The SF-36 physical and mental component scores demonstrated poorer test-retest reliability and larger SDCs (as a proportion of the mean baseline scores). The results provide further evidence of the potential of the BIoH questionnaire to underpin research and clinical practice for people with JHS. Copyright © 2017 Elsevier Ltd. All rights reserved.
Test-retest reliability and stability of N400 effects in a word-pair semantic priming paradigm.

Science.gov (United States)

Kiang, Michael; Patriciu, Iulia; Roy, Carolyn; Christensen, Bruce K; Zipursky, Robert B

2013-04-01

Elicited by any meaningful stimulus, the N400 event-related potential (ERP) component is reduced when the stimulus is related to a preceding one. This N400 semantic priming effect has been used to probe abnormal semantic relationship processing in clinical disorders, and suggested as a possible biomarker for treatment studies. Validating N400 semantic priming effects as a clinical biomarker requires characterizing their test-retest reliability. We assessed test-retest reliability of N400 semantic priming in 16 healthy adults who viewed the same related and unrelated prime-target word pairs in two sessions one week apart. As expected, N400 amplitudes were smaller for related versus unrelated targets across sessions. N400 priming effects (amplitude differences between unrelated and related targets) were highly correlated across sessions (r=0.85, Pmotivational changes. Use of N400 priming effects in treatment studies should account for possible magnitude decreases with repeat testing. Further research is needed to delineate N400 priming effects' test-retest reliability and stability in different age and clinical groups, and with different stimulus types. Copyright © 2012 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Test-Retest Reliability of a Survey to Measure Transport-Related Physical Activity in Adults

Science.gov (United States)

Badland, Hannah; Schofield, Grant

2006-01-01

The present research details test-retest reliability of a newly developed, telephone-administered TPA survey for adults. This instrument examines barriers, perceptions, and current travel behaviors to place of work/study and local convenience shops. Demonstrated test-retest reliability of the Active Friendly Environments-Transport-Related Physical…
Reliability and criterion-related validity testing (construct) of the Endotracheal Suction Assessment Tool (ESAT©).

Science.gov (United States)

Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne

2018-05-01

To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube
The Physical Activity Scale for Individuals with Physical Disabilities : test-retest reliability and comparison with an accelerometer

NARCIS (Netherlands)

van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem; van der Woude, Lucas

BACKGROUND: The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). METHODS: Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects'
Test-Retest Reliability of Computerized, Everyday Memory Measures and Traditional Memory Tests.

Science.gov (United States)

Youngjohn, James R.; And Others

Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Test-retest reliability of selected items of Health Behaviour in School-aged Children (HBSC survey questionnaire in Beijing, China

Directory of Open Access Journals (Sweden)

Liu Yang

2010-08-01

Full Text Available Abstract Background Children's health and health behaviour are essential for their development and it is important to obtain abundant and accurate information to understand young people's health and health behaviour. The Health Behaviour in School-aged Children (HBSC study is among the first large-scale international surveys on adolescent health through self-report questionnaires. So far, more than 40 countries in Europe and North America have been involved in the HBSC study. The purpose of this study is to assess the test-retest reliability of selected items in the Chinese version of the HBSC survey questionnaire in a sample of adolescents in Beijing, China. Methods A sample of 95 male and female students aged 11 or 15 years old participated in a test and retest with a three weeks interval. Student Identity numbers of respondents were utilized to permit matching of test-retest questionnaires. 23 items concerning physical activity, sedentary behaviour, sleep and substance use were evaluated by using the percentage of response shifts and the single measure Intraclass Correlation Coefficients (ICC with 95% confidence interval (CI for all respondents and stratified by gender and age. Items on substance use were only evaluated for school children aged 15 years old. Results The percentage of no response shift between test and retest varied from 32% for the item on computer use at weekends to 92% for the three items on smoking. Of all the 23 items evaluated, 6 items (26% showed a moderate reliability, 12 items (52% displayed a substantial reliability and 4 items (17% indicated almost perfect reliability. No gender and age group difference of the test-retest reliability was found except for a few items on sedentary behaviour. Conclusions The overall findings of this study suggest that most selected indicators in the HBSC survey questionnaire have satisfactory test-retest reliability for the students in Beijing. Further test-retest studies in a large
Construct validity, test-retest reliability and internal consistency of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) in patients with carpal tunnel syndrome.

Science.gov (United States)

Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan

2018-03-27

This study evaluated additional psychometric properties of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) which included, test-retest reliability, construct validity, internal consistency of in patients with carpal tunnel syndrome. As for determining construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution while the confirmatory factor analysis denoted that the hypothesized model adequately fit the data with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales between the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency construct validity, and multidimensionality, in assessing the upper extremity function in carpal tunnel syndrome patients.
Test-retest reliability of trunk accelerometric gait analysis

DEFF Research Database (Denmark)

Henriksen, Marius; Lund, Hans; Moe-Nilssen, R

2004-01-01

The purpose of this study was to determine the test-retest reliability of a trunk accelerometric gait analysis in healthy subjects. Accelerations were measured during walking using a triaxial accelerometer mounted on the lumbar spine of the subjects. Six men and 14 women (mean age 35.2; range 18...... a definite potential in clinical gait analysis....
Long term test-retest reliability of Oswestry Disability Index in male office workers.

Science.gov (United States)

Irmak, Rafet; Baltaci, Gul; Ergun, Nevin

2015-01-01

The Oswestry Disability Index (ODI) is one of the most common condition specific outcome measures used in the management of spinal disorders. But there is insufficient study on healthy populations and long term test-retest reliability. This is important because healthy populations are often used for control groups in low back pain interventions, and knowing the reliability of the controls affects the interpretation of the findings of these studies. The purpose of this study is to determine the long term test-retest reliability of ODI in office workers. Participants who have no chronic low back pain history were included in study. Subjects were assessed by the Turkish-ODI 2.0 (e-forms) on 1st, 2nd, 4th, 8th, 15th, 30th days to determine the stability of ODI scores over time. The study began with 58 (12 female, 46 male) participants. 36 (3 female, 33 male) participated for the full 30 days. Kolmogorov-Smirnov and Friedman tests were used. Test-retest reliability was evaluated by using nonparametric statistics. All tests were done by using SPSS-11. There was no statistically significant difference among the median scores of each day. (χ= 6.482, p > 0.05). The difference between median score of the days with 1st day was neither statistically nor clinically significant. ODI has long term test re-test reliability in healthy subjects over a 1 month time interval.
Impact on participation and autonomy: test of validity and reliability for older persons

Directory of Open Access Journals (Sweden)

Isabelle Ottenvall Hammar

2014-10-01

Full Text Available In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA-Older persons (IPA-O, showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation.
Test-Retest Reliability, Convergent Validity, and Internal Consistency of the Persian Version of Fullerton Advanced Balance Scale in Iranian Community-Dwelling Older Adults

OpenAIRE

Azar Sabet; Akram Azad; Ghorban Taghizadeh

2016-01-01

Objectives: This study was performed to evaluate convergent validity, test-retest reliability and internal consistency of the Persian translation of the Fullerton advanced balance (FAB) for use in Iranian community- dwelling older adults and improve the quality of their functional balance assessment. Methods & Materials: The original scale was translated with forward-backward protocol. In the next step, using convenience sampling and inclusion criteria, 88 functionally indep...

Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

Science.gov (United States)

Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

2018-05-01

Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80º knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.
Test-Retest Reliability of the Salutogenic Wellness Promotion Scale (SWPS)

Science.gov (United States)

Anderson, L. M.; Moore, J. B.; Hayden, B. M.; Becker, C. M.

2014-01-01

Objective: This study examined the temporal stability (i.e. test-retest reliability) of the Salutogenic Wellness Promotion Scale (SWPS) using intraclass correlation coefficients (ICC). Current intraclass results were also compared to previously published interclass correlations to support the use of the intraclass method for test-retest…
Test-Retest Reliability of a Serious Game for Delirium Screening in the Emergency Department.

Science.gov (United States)

Tong, Tiffany; Chignell, Mark; Tierney, Mary C; Lee, Jacques S

2016-01-01

Introduction: Cognitive screening in settings such as emergency departments (ED) is frequently carried out using paper-and-pencil tests that require administration by trained staff. These assessments often compete with other clinical duties and thus may not be routinely administered in these busy settings. Literature has shown that the presence of cognitive impairments such as dementia and delirium are often missed in older ED patients. Failure to recognize delirium can have devastating consequences including increased mortality (Kakuma et al., 2003). Given the demands on emergency staff, an automated cognitive test to screen for delirium onset could be a valuable tool to support delirium prevention and management. In earlier research we examined the concurrent validity of a serious game, and carried out an initial assessment of its potential as a delirium screening tool (Tong et al., 2016). In this paper, we examine the test-retest reliability of the game, as it is an important criterion in a cognitive test for detecting risk of delirium onset. Objective: To demonstrate the test-retest reliability of the screening tool over time in a clinical sample of older emergency patients. A secondary objective is to assess whether there are practice effects that might make game performance unstable over repeated presentations. Materials and Methods: Adults over the age of 70 were recruited from a hospital ED. Each patient played our serious game in an initial session soon after they arrived in the ED, and in follow up sessions conducted at 8-h intervals (for each participant there were up to five follow up sessions, depending on how long the person stayed in the ED). Results: A total of 114 adults (61 females, 53 males) between the ages of 70 and 104 years ( M = 81 years, SD = 7) participated in our study after screening out delirious patients. We observed a test-retest reliability of the serious game (as assessed by correlation r -values) between 0.5 and 0.8 across adjacent
Test-Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study.

Science.gov (United States)

Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina

2017-01-01

Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability, however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan.

Science.gov (United States)

Moore, Amy Lawson; Miller, Terissa M

2018-01-01

The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
Test-retest reliability of the Danish Adult Reading Test in patients with comorbid psychosis and cannabis-use disorder

DEFF Research Database (Denmark)

Hjorthøj, Carsten Rygaard; Vesterager, Lone; Nordentoft, Merete

2013-01-01

Background: The New Adult Reading Test is a common instrument for assessing pre-morbid IQ for patients with, for instance, schizophrenia. However, test-retest reliability has not been established for patients dually diagnosed with psychosis and substance use disorder. Furthermore, test......-retest reliability of the Danish adaptation has never been established in any population. Aims: To determine the test-retest reliability of the Danish Adult Reading Test (DART) (adapted from the National Adult Reading Test, NART) for patients dually diagnosed with psychosis and cannabis-use disorder. Methods......: This was a secondary analysis of the CapOpus randomized trial. As part of the trial, 103 patients were randomized, and completed the DART up to three times. Pearson's r and pairwise t-tests were calculated. Results: DART score was independent of randomization, cannabis-use frequency and psychopathology. Scores...
Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep.

Science.gov (United States)

Wang, Jiahui; Han, Junwei; Nguyen, Vinh T; Guo, Lei; Guo, Christine C

2017-01-01

Resting state functional magnetic resonance imaging (rs-fMRI) provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to its low demand and high tolerance, resting state paradigms can be easily acquired from clinical population. However, due to the unconstrained nature, resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of widespread use in the clinic. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV) derived from simultaneous electrocardiogram (ECG) recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI is significantly improved, and the improvement appears to be general across brain networks. The amount of improvement is robust with the removal of as much as 60% volumes of sleepiness. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.
Evaluation of the Relative Validity and Test-Retest Reliability of a 15-Item Beverage Intake Questionnaire in Children and Adolescents.

Science.gov (United States)

Hill, Catelyn E; MacDougall, Carly R; Riebl, Shaun K; Savla, Jyoti; Hedrick, Valisa E; Davy, Brenda M

2017-11-01

Added sugar intake, in the form of sugar-sweetened beverages (SSBs), may contribute to weight gain and obesity development in children and adolescents. A valid and reliable brief beverage intake assessment tool for children and adolescents could facilitate research in this area. The purpose of this investigation was to evaluate the relative validity and test-retest reliability of a 15-item beverage intake questionnaire (BEVQ) for assessing usual beverage intake in children and adolescents. This cross-sectional investigation included four study visits within a 2- to 3-week time period. Participants (333 enrolled; 98% completion rate) were children aged 6 to 11 years and adolescents aged 12 to18 years recruited from the New River Valley, VA, region from January 2014 to September 2015. Study visits included assessment of height/weight, health history, and four 24-hour dietary recalls (24HRs). The BEVQ was completed at two visits (BEVQ 1, BEVQ 2). To evaluate relative validity, BEVQ 1 was compared with habitual beverage intake determined by the averaged 24HR. To evaluate test-retest reliability, BEVQ 1 was compared with BEVQ 2. Analyses included descriptive statistics, independent sample t tests, χ 2 tests, one-way analysis of variance, paired sample t tests, and correlational analyses. In the full sample, self-reported water and total SSB intake were not different between BEVQ 1 and 24HR (mean differences 0±1 fl oz and 0±1 fl oz, respectively; both P values >0.05). Reported intake across all beverage categories was significantly correlated between BEVQ 1 and BEVQ 2 (Pbeverages was not different (all P values >0.05) between BEVQ 1 and 24HR (mean differences: whole milk=3±4 kcal, reduced-fat milk=9±5 kcal, and fat-free milk=7±6 kcal, which is 7±15 total beverage kilocalories). In adolescents (n=200), water and SSB kilocalories were not different (both P values >0.05) between BEVQ 1 and 24HR (mean differences: -1±1 fl oz and 12±9 kcal, respectively). A 15
Validation and Test-Retest Reliability of New Thermographic Technique Called Thermovision Technique of Dry Needling for Gluteus Minimus Trigger Points in Sciatica Subjects and TrPs-Negative Healthy Volunteers

Science.gov (United States)

Rychlik, Michał; Samborski, Włodzimierz

2015-01-01

The aim of this study was to assess the validity and test-retest reliability of Thermovision Technique of Dry Needling (TTDN) for the gluteus minimus muscle. TTDN is a new thermography approach used to support trigger points (TrPs) diagnostic criteria by presence of short-term vasomotor reactions occurring in the area where TrPs refer pain. Method. Thirty chronic sciatica patients (n=15 TrP-positive and n=15 TrPs-negative) and 15 healthy volunteers were evaluated by TTDN three times during two consecutive days based on TrPs of the gluteus minimus muscle confirmed additionally by referred pain presence. TTDN employs average temperature (T avr), maximum temperature (T max), low/high isothermal-area, and autonomic referred pain phenomenon (AURP) that reflects vasodilatation/vasoconstriction. Validity and test-retest reliability were assessed concurrently. Results. Two components of TTDN validity and reliability, T avr and AURP, had almost perfect agreement according to κ (e.g., thigh: 0.880 and 0.938; calf: 0.902 and 0.956, resp.). The sensitivity for T avr, T max, AURP, and high isothermal-area was 100% for everyone, but specificity of 100% was for T avr and AURP only. Conclusion. TTDN is a valid and reliable method for T avr and AURP measurement to support TrPs diagnostic criteria for the gluteus minimus muscle when digitally evoked referred pain pattern is present. PMID:26137486
Reliability and validity of the rey visual design learning test in primary school children

NARCIS (Netherlands)

Wilhelm, P.

2004-01-01

The Rey Visual Design Learning Test (Rey, 1964, in Spreen & Strauss, 1991) assesses immediate memory span, new learning and recognition for non-verbal material. Three studies are presented that focused on the reliability and validity of the RVDLT in primary school children. Test-retest reliability
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

Science.gov (United States)

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
A reliability generalization meta-analysis of coefficient alpha and test-retest coefficient for the aging males' symptoms (AMS) scale.

Science.gov (United States)

Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen

2016-12-01

The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.
Test - retest reliability of two instruments for measuring public attitudes towards persons with mental illness

Directory of Open Access Journals (Sweden)

Leufstadius Christel

2011-01-01

Full Text Available Abstract Background Research has identified stigmatization as a major threat to successful treatment of individuals with mental illness. As a consequence several anti-stigma campaigns have been carried out. The results have been discouraging and the field suffers from lack of evidence about interventions that work. There are few reports on psychometric data for instruments used to assess stigma, which thus complicates research efforts. The aim of the present study was to investigate test-retest reliability of the Swedish versions of the questionnaires: FABI and "Changing Minds" and to examine the internal consistency of the two instruments. Method Two instruments, fear and behavioural intentions (FABI and "Changing Minds", used in earlier studies on public attitudes towards persons with mental illness were translated into Swedish and completed by 51 nursing students on two occasions, with an interval of three weeks. Test-retest reliability was calculated by using weighted kappa coefficient and internal consistency using the Cronbach's alpha coefficient. Results Both instruments attain at best moderate test-retest reliability. For the Changing Minds questionnaire almost one fifth (17.9% of the items present poor test-retest reliability and the alpha coefficient for the subscales ranges between 0.19 - 0.46. All of the items in the FABI reach a fair or a moderate agreement between the test and retest, and the questionnaire displays a high internal consistency, alpha 0.80. Conclusions There is a need for development of psychometrically tested instruments within this field of research.
Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep

Directory of Open Access Journals (Sweden)

Jiahui Wang

2017-05-01

Full Text Available Resting state functional magnetic resonance imaging (rs-fMRI provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to its low demand and high tolerance, resting state paradigms can be easily acquired from clinical population. However, due to the unconstrained nature, resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of widespread use in the clinic. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV derived from simultaneous electrocardiogram (ECG recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI is significantly improved, and the improvement appears to be general across brain networks. The amount of improvement is robust with the removal of as much as 60% volumes of sleepiness. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.
Evaluating test-retest reliability in patient-reported outcome measures for older people: A systematic review.

Science.gov (United States)

Park, Myung Sook; Kang, Kyung Ja; Jang, Sun Joo; Lee, Joo Yun; Chang, Sun Ju

2018-03-01

This study aimed to evaluate the components of test-retest reliability including time interval, sample size, and statistical methods used in patient-reported outcome measures in older people and to provide suggestions on the methodology for calculating test-retest reliability for patient-reported outcomes in older people. This was a systematic literature review. MEDLINE, Embase, CINAHL, and PsycINFO were searched from January 1, 2000 to August 10, 2017 by an information specialist. This systematic review was guided by both the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and the guideline for systematic review published by the National Evidence-based Healthcare Collaborating Agency in Korea. The methodological quality was assessed by the Consensus-based Standards for the selection of health Measurement Instruments checklist box B. Ninety-five out of 12,641 studies were selected for the analysis. The median time interval for test-retest reliability was 14days, and the ratio of sample size for test-retest reliability to the number of items in each measure ranged from 1:1 to 1:4. The most frequently used statistical methods for continuous scores was intraclass correlation coefficients (ICCs). Among the 63 studies that used ICCs, 21 studies presented models for ICC calculations and 30 studies reported 95% confidence intervals of the ICCs. Additional analyses using 17 studies that reported a strong ICC (>0.09) showed that the mean time interval was 12.88days and the mean ratio of the number of items to sample size was 1:5.37. When researchers plan to assess the test-retest reliability of patient-reported outcome measures for older people, they need to consider an adequate time interval of approximately 13days and the sample size of about 5 times the number of items. Particularly, statistical methods should not only be selected based on the types of scores of the patient-reported outcome measures, but should also be described clearly in
A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders

DEFF Research Database (Denmark)

Stupar, Maja; Côté, Pierre; Beaton, Dorcas E

2015-01-01

OBJECTIVE: The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). METHODS: We performed a test-retest reliability study. We includ...
Test-retest reliability and responsiveness of the Barthel Index-based Supplementary Scales in patients with stroke.

Science.gov (United States)

Lee, Ya-Chen; Yu, Wan-Hui; Hsueh, I-Ping; Chen, Sheng-Shiung; Hsieh, Ching-Lin

2017-10-01

A lack of evidence on the test-retest reliability and responsiveness limits the utility of the BI-based Supplementary Scales (BI-SS) in both clinical and research settings. To examine the test-retest reliability and responsiveness of the BI-based Supplementary Scales (BI-SS) in patients with stroke. A repeated-assessments design (1 week apart) was used to examine the test-retest reliability of the BI-SS. For the responsiveness study, the participants were assessed with the BI-SS and BI (treated as an external criterion) at admission to and discharge from rehabilitation wards. Seven outpatient rehabilitation units and one inpatient rehabilitation unit. Outpatients with chronic stroke. Eighty-four outpatients with chronic stroke participated in the test-retest reliability study. Fifty-seven inpatients completed baseline and follow-up assessments in the responsiveness study. For the test-retest reliability study, the values of the intra-class correlation coefficient and the overall percentage of minimal detectable change for the Ability Scale and Self-perceived Difficulty Scale were 0.97, 12.8%, and 0.78, 35.8%, respectively. For the responsiveness study, the standardized effect size and standardized response mean (representing internal responsiveness) of the Ability Scale and Self-perceived Difficulty Scale were 1.17 and 1.56, and 0.78 and 0.89, respectively. Regarding external responsiveness, the change in score of the Ability Scale had significant and moderate association with that of the BI (r=0.61, Ptest-retest reliability and sufficient responsiveness for patients with stroke. However, the Self-perceived Difficulty Scale of the BI-SS has substantial random measurement error and insufficient external responsiveness, which may affect its utility in clinical settings. The findings of this study provide empirical evidence of psychometric properties of the BI-SS for assessing ability and self-perceived difficulty of ADL in patients with stroke.
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

Science.gov (United States)

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
Evaluating the test-retest reliability of symptom indices associated with the ImPACT post-concussion symptom scale (PCSS).

Science.gov (United States)

Merritt, Victoria C; Bradson, Megan L; Meyer, Jessica E; Arnett, Peter A

2018-05-01

The Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) is a commonly used tool in sports concussion assessment. While test-retest reliabilities have been established for the ImPACT cognitive composites, few studies have evaluated the psychometric properties of the ImPACT's Post-Concussion Symptom Scale (PCSS). The purpose of this study was to establish the test-retest reliability of symptom indices associated with the PCSS. Participants included 38 undergraduate students (50.0% male) who underwent neuropsychological testing as part of their participation in their psychology department's research subject pool. The majority of the participants were Caucasian (94.7%) and had no history of concussion (73.7%). All participants completed the ImPACT at two time points, approximately 6 weeks apart. The PCSS was the main outcome measure, and eight symptom indices were calculated (a total symptom score, three symptom summary indices, and four symptom clusters). Pearson correlations (r) and intraclass correlation coefficients (ICCs) were computed as measures of test-retest reliability. Overall, reliabilities ranged from low to high (r = .44 to .80; ICC = .44 to .77). The cognitive symptom cluster exhibited the highest test-retest reliability (r = .80, ICC = .77), followed by the positive symptom total (PST) index, an indicator of the total number of symptoms endorsed (r = .71, ICC = .69). In contrast, the commonly used total symptom score showed lower test-retest reliability (r = .67, ICC = .62). Paired-samples t tests revealed no significant differences between test and retest for any of the symptom variables (all p > .01). Finally, reliable change indices (RCI) were computed to determine whether differences observed between test and retest represented clinically significant change. RCI values were provided for each symptom index at the 80%, 90%, and 95% confidence intervals. These results suggest that evaluating additional symptom
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan

Directory of Open Access Journals (Sweden)

Moore AL

2018-02-01

Full Text Available Amy Lawson Moore, Terissa M Miller Gibson Institute of Cognitive Research, Colorado Springs, CO, USA Purpose: The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills.Methods: This study included 2,737 participants aged 5–85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test–retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement.Results: Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test–retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93.Conclusion: The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan. Keywords: testing, cognitive skills, memory, processing speed, visual processing, auditory processing

Test-Retest Reliability of Diffusion Tensor Imaging in Huntington's Disease.

Science.gov (United States)

Cole, James H; Farmer, Ruth E; Rees, Elin M; Johnson, Hans J; Frost, Chris; Scahill, Rachael I; Hobbs, Nicola Z

2014-03-21

Diffusion tensor imaging (DTI) has shown microstructural abnormalities in patients with Huntington's Disease (HD) and work is underway to characterise how these abnormalities change with disease progression. Using methods that will be applied in longitudinal research, we sought to establish the reliability of DTI in early HD patients and controls. Test-retest reliability, quantified using the intraclass correlation coefficient (ICC), was assessed using region-of-interest (ROI)-based white matter atlas and voxelwise approaches on repeat scan data from 22 participants (10 early HD, 12 controls). T1 data was used to generate further ROIs for analysis in a reduced sample of 18 participants. The results suggest that fractional anisotropy (FA) and other diffusivity metrics are generally highly reliable, with ICCs indicating considerably lower within-subject compared to between-subject variability in both HD patients and controls. Where ICC was low, particularly for the diffusivity measures in the caudate and putamen, this was partly influenced by outliers. The analysis suggests that the specific DTI methods used here are appropriate for cross-sectional research in HD, and give confidence that they can also be applied longitudinally, although this requires further investigation. An important caveat for DTI studies is that test-retest reliability may not be evenly distributed throughout the brain whereby highly anisotropic white matter regions tended to show lower relative within-subject variability than other white or grey matter regions.
The Ostomy Adjustment Scale: translation into Norwegian language with validation and reliability testing.

Science.gov (United States)

Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin

2014-01-01

The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. The instrument is
Test-retest reliability of the eating disorder examination-questionnaire (EDE-Q) in a college sample

OpenAIRE

Rose, Jennifer S; Vaewsorn, Adin; Rosselli-Navarra, Francine; Wilson, G Terence; Weissman, Ruth Striegel

2013-01-01

Background The Eating Disorder Examination-Questionnaire (EDE-Q), a widely used self-report instrument, is often used for measuring change in eating disorder symptoms over the course of treatment. However, limited data exist about test-retest reliability, particularly for men. The current study evaluated EDE-Q 7-day test-retest reliability in male (n = 47) and female (n = 44) undergraduate students together and separately by gender. Results Internal consistency was consistently higher for wom...
Test-Retest Reliability and Minimal Detectable Change of the D2 Test of Attention in Patients with Schizophrenia.

Science.gov (United States)

Lee, Posen; Lu, Wen-Shian; Liu, Chin-Hsuan; Lin, Hung-Yu; Hsieh, Ching-Lin

2017-12-08

The d2 Test of Attention (D2) is a commonly used measure of selective attention for patients with schizophrenia. However, its test-retest reliability and minimal detectable change (MDC) are unknown in patients with schizophrenia, limiting its utility in both clinical and research settings. The aim of the present study was to examine the test-retest reliability and MDC of the D2 in patients with schizophrenia. A rater administered the D2 on 108 patients with schizophrenia twice at a 1-month interval. Test-retest reliability was determined through the calculation of the intra-class correlation coefficient (ICC). We also carried out Bland-Altman analysis, which included a scatter plot of the differences between test and retest against their mean. Systematic biases were evaluated by use of a paired t-test. The ICCs for the D2 ranged from 0.78 to 0.94. The MDCs (MDC%) of the seven subscores were 102.3 (29.7), 19.4 (85.0), 7.2 (94.6), 21.0 (69.0), 104.0 (33.1), 105.0 (35.8), and 7.8 (47.8), which represented limited-to-acceptable random measurement error. Trends in the Bland-Altman plots of the omissions (E1), commissions (E2), and errors (E) were noted, presenting that the data had heteroscedasticity. According to the results, the D2 had good test-retest reliability, especially in the scores of TN, TN-E, and CP. For the further research, finding a way to improve the administration procedure to reduce random measurement error would be important for the E1, E2, E, and FR subscores. © The Author(s) 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Test-Retest Reliability of the Preschool Age Psychiatric Assessment (PAPA)

Science.gov (United States)

Egger, Helen Link; Erkanli, Alaattin; Keeler, Gordon; Potts, Edward; Walter, Barbara Keith; Angold, Adrian

2006-01-01

Objective: To examine the test-retest reliability of a new interviewer-based psychiatric diagnostic measure (the Preschool Age Psychiatric Assessment) for use with parents of preschoolers 2 to 5 years old. Method: A total of 1,073 parents of children attending a large pediatric clinic completed the Child Behavior Checklist 1 1/2-5. For 18 months,…
Test–Retest Reliability and Concurrent Validity of an fMRI-Compatible Pneumatic Vibrator to Stimulate Muscle Proprioceptors.

Science.gov (United States)

Goossens, Nina; Janssens, Lotte; Pijnenburg, Madelon; Caeyenberghs, Karen; Van Rompuy, Charlotte; Meugens, Paul; Sunaert, Stefan; Brumagne, Simon

Processing proprioceptive information in the brain is essential for optimal postural control and can be studied with proprioceptive stimulation, provided by muscle vibration, during functional magnetic resonance imaging (fMRI). Classic electromagnetic muscle vibrators, however, cannot be used in the high-strength magnetic field of the fMRI scanner. Pneumatic vibrators offer an fMRI-compatible alternative. However, whether these devices produce reliable and valid proprioceptive stimuli has not been investigated, although this is essential for these devices to be used in longitudinal research. Test–retest reliability and concurrent validity of the postural response to muscle vibration, provided by custom-made fMRI-compatible pneumatic vibrators, were assessed in a repeated-measures design. Mean center of pressure (CoP) displacements during, respectively, ankle muscle and back muscle vibration (45–60 Hz, 0.5 mm) provided by an electromagnetic and a pneumatic vibrator were measured in ten young healthy subjects. The test was repeated on the same day and again within one week. Intraclass correlation coefficients (ICC) were calculated to assess (a) intra- and interday reliability of the postural responses to, respectively, pneumatic and electromagnetic vibration, and (b) concurrent validity of the response to pneumatic compared to electromagnetic vibration. Test–retest reliability of mean CoP displacements during pneumatic vibration was good to excellent (ICCs = 0.64–0.90) and resembled that of responses to electromagnetic vibration (ICCs = 0.64–0.94). Concurrent validity of the postural effect of pneumatic vibration was good to excellent (ICCs = 0.63–0.95). In conclusion, the proposed fMRI-compatible pneumatic vibrator can be used with confidence to stimulate muscle spindles during fMRI to study central processing of proprioception.
Test-retest reliability and validity of a web-based food-frequency questionnaire for adolescents aged 13-14 to be used in the Norwegian Mother and Child Cohort Study (MoBa).

Science.gov (United States)

Overby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha

2014-01-01

The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. In total, 58 students (aged 13-14) from four different schools in the southern part of Norway participated in the reproducibility study of filling out the FFQ 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. The median Spearman correlation coefficient for all nutrients in the test-retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlations coefficients were low in the absolute validity study with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). The test-retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents.
Test your memory-Turkish version (TYM-TR): reliability and validity study of a cognitive screening test.

Science.gov (United States)

Maviş, Ilknur; Özbabalik Adapinar, Belgin Demet; Yenilmez, Çinar; Aydin, Ayşe; Olgun, Engin; Bal, Cengiz

2015-01-01

The test your memory (TYM) is reported to be a sensitive cognitive function assessment scale for people with dementia. The aim of the present study was to investigate the reliability and validity of an adapted Turkish version of the TYM (TYM-TR) among Turkish dementia patients. The TYM-TR was given to 59 patients with dementia aged 60+ and 336 normal controls aged 23-75+. The diagnostic utility of the TYM-TR was compared with that of the mini-mental state examination (MMSE) to validate it. The internal consistency of the TYM-TR was a = 0.85. The test-retest reliability was 0.97 (P reliability and validity to distinguish dementia in the Turkish population.
Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease

NARCIS (Netherlands)

Strouwen, C.; Molenaar, E.A.; Keus, S.H.; Munks, L.; Bloem, B.R.; Nieuwboer, A.

2016-01-01

BACKGROUND: Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains
Test-retest reliability of computer-based video analysis of general movements in healthy term-born infants.

Science.gov (United States)

Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars

2015-10-01

A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (ptest-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Validity and test-retest reliability of manual goniometers for measuring passive hip range of motion in femoroacetabular impingement patients.

Directory of Open Access Journals (Sweden)

Nussbaumer Silvio

2010-08-01

Full Text Available Abstract Background The aims of this study were to evaluate the construct validity (known group, concurrent validity (criterion based and test-retest (intra-rater reliability of manual goniometers to measure passive hip range of motion (ROM in femoroacetabular impingement patients and healthy controls. Methods Passive hip flexion, abduction, adduction, internal and external rotation ROMs were simultaneously measured with a conventional goniometer and an electromagnetic tracking system (ETS on two different testing sessions. A total of 15 patients and 15 sex- and age-matched healthy controls participated in the study. Results The goniometer provided greater hip ROM values compared to the ETS (range 2.0-18.9 degrees; P P Conclusions The present study suggests that goniometer-based assessments considerably overestimate hip joint ROM by measuring intersegmental angles (e.g., thigh flexion on trunk for hip flexion rather than true hip ROM. It is likely that uncontrolled pelvic rotation and tilt due to difficulties in placing the goniometer properly and in performing the anatomically correct ROM contribute to the overrating of the arc of these motions. Nevertheless, conventional manual goniometers can be used with confidence for longitudinal assessments in the clinic.
Reliability of Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory in a test-retest design.

Science.gov (United States)

Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian

2014-02-01

The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of > or = .60. Cohen's kappa indi- cated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.
Standardization, Validity and Reliability Study of Gülhane Aphasia Test-2 (GAT-2

Directory of Open Access Journals (Sweden)

İlknur Maviş

2007-04-01

Full Text Available OBJECTIVE: Gülhane Aphasia Test-2 (GAT-2 has been developed to show the presence of a language disorder ‘aphasia’ and to give the clinician implications for the accompanying speech disorders such as apraxia and dysarthria. OBJECTIVE: The aim of the study was to report standardization, validity and reliability study of GAT-2. METHODS: : 10 healthy individuals were tested initially for the pilot study. 134 healthy individual was included to the standardization study and 30 individuals with aphasia and 11 individuals with right brain injury was included to the validation study. The inter group GAT-2 score differentiations and the effects of age, years of education, sex variances were observed. GAT-2 cut-off scores were calculated by the scores of healthy individuals. GAT-2 test-retest reliability and inter-observer reliability was calculated. RESULTS: Healthy individuals’ GAT-2 scores were significantly different from the GAT-2 scores of aphasic patients, but not from right brain injured patients’. Healthy individuals’ GAT-2 scores were not affected from the sex, age variances but from years of education, so cut-off scores were calculated by this variance. GAT-2 scores of aphasic patients were not affected from age, sex and years of education. Test-retest and inter-observer reliability and internal consistency results showed that GAT-2 is a highly reliable aphasia screening test. CONCLUSION: GAT-2 was found to be a standardized, highly reliable and a valid aphasia test for Turkish stroke patients with aphasia
Evidence of Reliability and Validity for a Children’s Auditory Continuous Performance Test

Directory of Open Access Journals (Sweden)

Michael J. Lasee

2013-11-01

Full Text Available Continuous Performance Tests (CPTs are commonly utilized clinical measures of attention and response inhibition. While there have been many studies of CPTs that utilize a visual format, there is considerably less research employing auditory CPTs. The current study provides initial reliability and validity evidence for the Auditory Vigilance Screening Measure (AVSM, a newly developed CPT. Participants included 105 five- to nine-year-old children selected from two rural Midwestern school districts. Reliability data for the AVSM was collected through retesting of 42 participants. Validity was evaluated through correlation of AVSM scales with subscales from the ADHD Rating Scale–IV. Test–retest reliability coefficients ranged from .62 to .74 for AVSM subscales. A significant (r = .31 correlation was obtained between the AVSM Impulsivity Scale and teacher ratings of inattention. Limitations and implications for future study are discussed.
Reliability, construct and discriminative validity of clinical testing in subjects with and without chronic neck pain

DEFF Research Database (Denmark)

Jørgensen, René; Ris Hansen, Inge; Falla, Deborah

2014-01-01

-retest reliability in people with and without chronic neck pain. Moreover, construct and between-group discriminative validity of the tests were examined. METHODS: Twenty-one participants with chronic neck pain and 21 asymptomatic participants were included. Intra- and inter-reliability were evaluated for the Cranio-Cervical...... Flexion Test (CCFT), Range of Movement (ROM), Joint Position Error (JPE), Gaze Stability (GS), Smooth Pursuit Neck Torsion Test (SPNTT), and neuromuscular control of the Deep Cervical Extensors (DCE). Test-retest reliability was assessed for Postural Control (SWAY) and Pressure Pain Threshold (PPT) over......BACKGROUND: The reliability of clinical tests for the cervical spine has not been adequately evaluated. Six cervical clinical tests, which are low cost and easy to perform in clinical settings, were tested for intra- and inter-examiner reliability, and two performance tests were assessed for test...
Test-retest and between-site reliability in a multicenter fMRI study.

Science.gov (United States)

Friedman, Lee; Stern, Hal; Brown, Gregory G; Mathalon, Daniel H; Turner, Jessica; Glover, Gary H; Gollub, Randy L; Lauriello, John; Lim, Kelvin O; Cannon, Tyrone; Greve, Douglas N; Bockholt, Henry Jeremy; Belger, Aysenil; Mueller, Bryon; Doty, Michael J; He, Jianchun; Wells, William; Smyth, Padhraic; Pieper, Steve; Kim, Seyoung; Kubicki, Marek; Vangel, Mark; Potkin, Steven G

2008-08-01

In the present report, estimates of test-retest and between-site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR-deconvolution analysis with FMRISTAT. Six functionally-derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast-to-noise-ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test-retest reliability was high, but initially, between-site reliability was low, indicating a strong contribution from site and site-by-subject variance. However, a number of factors that can markedly improve between-site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between-site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance toothers on the best practices for future multicenter studies.
Influences on the Test-Retest Reliability of Functional Connectivity MRI and its Relationship with Behavioral Utility.

Science.gov (United States)

Noble, Stephanie; Spann, Marisa N; Tokoglu, Fuyuze; Shen, Xilin; Constable, R Todd; Scheinost, Dustin

2017-11-01

Best practices are currently being developed for the acquisition and processing of resting-state magnetic resonance imaging data used to estimate brain functional organization-or "functional connectivity." Standards have been proposed based on test-retest reliability, but open questions remain. These include how amount of data per subject influences whole-brain reliability, the influence of increasing runs versus sessions, the spatial distribution of reliability, the reliability of multivariate methods, and, crucially, how reliability maps onto prediction of behavior. We collected a dataset of 12 extensively sampled individuals (144 min data each across 2 identically configured scanners) to assess test-retest reliability of whole-brain connectivity within the generalizability theory framework. We used Human Connectome Project data to replicate these analyses and relate reliability to behavioral prediction. Overall, the historical 5-min scan produced poor reliability averaged across connections. Increasing the number of sessions was more beneficial than increasing runs. Reliability was lowest for subcortical connections and highest for within-network cortical connections. Multivariate reliability was greater than univariate. Finally, reliability could not be used to improve prediction; these findings are among the first to underscore this distinction for functional connectivity. A comprehensive understanding of test-retest reliability, including its limitations, supports the development of best practices in the field. © The Author 2017. Published by Oxford University Press.
Validity and test–retest reliability of the Persian version of the Montgomery–Asberg Depression Rating Scale

Science.gov (United States)

Ahmadpanah, Mohammad; Sheikhbabaei, Meisam; Haghighi, Mohammad; Roham, Fatemeh; Jahangard, Leila; Akhondi, Amineh; Sadeghi Bahmani, Dena; Bajoghli, Hafez; Holsboer-Trachsler, Edith; Brand, Serge

2016-01-01

Background and aims The Montgomery–Asberg Depression Rating Scale (MADRS) is an expert’s rating tool to assess the severity and symptoms of depression. The aim of the present two studies was to validate the Persian version of the MADRS and determine its test–retest reliability in patients diagnosed with major depressive disorders (MDD). Methods In study 1, the translated MADRS and the Hamilton Depression Rating Scale (HDRS) were applied to 210 patients diagnosed with MDD and 100 healthy adults. In study 2, 200 patients diagnosed with MDD were assessed with the MADRS in face-to-face interviews. Thereafter, 100 patients were assessed 3–14 days later, again via face-to-face-interviews, while the other 100 patients were assessed 3–14 days later via a telephone interview. Results Study 1: The MADRS and HDRS scores between patients with MDD and healthy controls differed significantly. Agreement between scoring of the MADRS and HDRS was high (r=0.95). Study 2: The intraclass correlation coefficient (test–retest reliability) was r=0.944 for the face-to-face interviews, and r=0.959 for the telephone interviews. Conclusion The present data suggest that the Persian MADRS has high validity and excellent test–retest reliability over a time interval of 3–14 days, irrespective of whether the second assessment was carried out face-to-face or via a telephone interview. PMID:27022265
CPM Test-Retest Reliability: "Standard" vs "Single Test-Stimulus" Protocols.

Science.gov (United States)

Granovsky, Yelena; Miller-Barmak, Adi; Goldstein, Oren; Sprecher, Elliot; Yarnitsky, David

2016-03-01

Assessment of pain inhibitory mechanisms using conditioned pain modulation (CPM) is relevant clinically in prediction of pain and analgesic efficacy. Our objective is to provide necessary estimates of intersession CPM reliability, to enable transformation of the CPM paradigm into a clinical tool. Two cohorts of young healthy subjects (N = 65) participated in two dual-session studies. In Study I, a Bath-Thermode CPM protocol was used, with hot water immersion and contact heat as conditioning- and test-stimuli, respectively, in a classical parallel CPM design introducing test-stimulus first, and then the conditioning- and repeated test-stimuli in parallel. Study II consisted of two CPM protocols: 1) Two-Thermodes, one for each of the stimuli, in the same parallel design as above, and 2) single test-stimulus (STS) protocol with a single administration of a contact heat test-stimulus, partially overlapped in time by a remote shorter contact heat as conditioning stimulus. Test-retest reliability was assessed within 3-7 days. The STS-CPM had superior reliability intraclass correlation (ICC 2 ,: 1 = 0.59) over Bath-Thermode (ICC 2 ,: 1 = 0.34) or Two-Thermodes (ICC 2 ,: 1 = 0.21) protocols. The hand immersion conditioning pain had higher reliability than thermode pain (ICC 2 ,: 1 = 0.76 vs ICC 2 ,: 1 = 0.16). Conditioned test-stimulus pain scores were of good (ICC 2 ,: 1 = 0.62) or fair (ICC 2 ,: 1 = 0.43) reliability for the Bath-Thermode and the STS, respectively, but not for the Two-Thermodes protocol (ICC 2 ,: 1 = 0.20). The newly developed STS-CPM paradigm was more reliable than other CPM protocols tested here, and should be further investigated for its clinical relevance. It appears that large contact size of the conditioning-stimulus and use of single rather than dual test-stimulus pain contribute to augmentation of CPM reliability. © 2015 American Academy of Pain Medicine. All rights reserved. For permissions, please e
Test-Retest Reliability of Rating of Perceived Exertion and Agreement With 1-Repetition Maximum in Adults.

Science.gov (United States)

Bove, Allyn M; Lynch, Andrew D; DePaul, Samantha M; Terhorst, Lauren; Irrgang, James J; Fitzgerald, G Kelley

2016-09-01

Study Design Clinical measurement. Background It has been suggested that rating of perceived exertion (RPE) may be a useful alternative to 1-repetition maximum (1RM) to determine proper resistance exercise dosage. However, the test-retest reliability of RPE for resistance exercise has not been determined. Additionally, prior research regarding the relationship between 1RM and RPE is conflicting. Objectives The purpose of this study was to (1) determine test-retest reliability of RPE related to resistance exercise and (2) assess agreement between percentages of 1RM and RPE during quadriceps resistance exercise. Methods A sample of participants with and without knee pathology completed a series of knee extension exercises and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale, then repeated the procedure 1 to 2 weeks later for test-retest reliability. To determine agreement between RPE and 1RM, participants completed knee extension exercises at various percentages of their 1RM (10% to 130% of predicted 1RM) and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale. Percent agreement was calculated between the 1RM and RPE at each resistance interval. Results The intraclass correlation coefficient indicated excellent test-retest reliability of RPE for quadriceps resistance exercises (intraclass correlation coefficient = 0.895; 95% confidence interval: 0.866, 0.918). Overall percent agreement between RPE and 1RM was 60%, but agreement was poor within the ranges that would typically be used for training (50% 1RM for muscle endurance, 70% 1RM and greater for strength). Conclusion Test-retest reliability of perceived exertion during quadriceps resistance exercise was excellent. However, agreement between the RPE and 1RM was poor, especially in common training zones for knee extensor strengthening. J Orthop Sports Phys Ther 2016;46(9):768-774. Epub 5 Aug 2016. doi:10.2519/jospt.2016.6498.

Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

Science.gov (United States)

Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

2015-01-01

Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
Test-retest, inter-assessor and intra-assessor reliability of the modified Touwen examination

NARCIS (Netherlands)

Peters, Lieke H. J.; Maathuis, Karel G. B.; Kouw, Eva; Hamming, Marjolein; Hadders-Algra, Mijna

Interest in the Touwen examination (1979) for the assessment of minor neurological dysfunction (MND) is growing. However, information on psychometric properties of this assessment is scarce. Therefore the present study aimed at assessing the test's test-retest, inter- and intra-assessor reliability.
Test Re-Test Reliability of Four Versions of the 3-Cone Test in Non-Athletic Men

Directory of Open Access Journals (Sweden)

Jason G. Langley, Robert D. Chetlin

2017-03-01

Full Text Available Until recently, measurement and evaluation in sport science, especially agility testing, has not always included key elements of proper test construction. Often tests are published without reporting reliability and validity analysis for a specific population. The purpose of the present study was to examine the test re-test reliability of four versions of the 3-Cone Test (3CT, and provide guidance on proper test construction for testing agility in athletic populations. Forty male students enrolled in classes in the Department of Physical Education at a mid-Atlantic university participated. On each of test day participants performed 10 trials. In random order, they performed three trials to the right (3CTR, standard test, three to the left (3CTL, and two modified trials (3CTAR and 3CTAL, which included a reactive component in which a visual cue was given to indicate direction. Intra-class correlation coefficients (ICC indicated a moderate to high reliability for the four tests, 3CTR 0.79 (0.64-0.88, 95%CI, 3CTL 0.73 (0.55-0.85, 3CTAR 0.85(0.74-0.92, and 3CTAL 0.79 (0.64-0.88. Small standard error of the measurement (SEM was found; range 0.09 to 0.10. Pearson correlations between tests were high (0.82-0.92 on day one as well as day two (0.72-0.85. These results indicate each version of the 3-Cone Test is reliable; however, further tests are needed with specific athletic populations. Only the 3CTAR and 3CTAL are tests of agility due to the inclusion of a reactive component. Future studies examining agility testing and training should incorporate technological elements, including automated timing systems and motion capture analysis. Such instrumentation will allow for optimal design of tests that simulate sport-specific game conditions.
Reliability of two social cognition tests: The combined stories test and the social knowledge test.

Science.gov (United States)

Thibaudeau, Élisabeth; Cellard, Caroline; Legendre, Maxime; Villeneuve, Karèle; Achim, Amélie M

2018-04-01

Deficits in social cognition are common in psychiatric disorders. Validated social cognition measures with good psychometric properties are necessary to assess and target social cognitive deficits. Two recent social cognition tests, the Combined Stories Test (COST) and the Social Knowledge Test (SKT), respectively assess theory of mind and social knowledge. Previous studies have shown good psychometric properties for these tests, but the test-retest reliability has never been documented. The aim of this study was to evaluate the test-retest reliability and the inter-rater reliability of the COST and the SKT. The COST and the SKT were administered twice to a group of forty-two healthy adults, with a delay of approximately four weeks between the assessments. Excellent test-retest reliability was observed for the COST, and a good test-retest reliability was observed for the SKT. There was no evidence of practice effect. Furthermore, an excellent inter-rater reliability was observed for both tests. This study shows a good reliability of the COST and the SKT that adds to the good validity previously reported for these two tests. These good psychometrics properties thus support that the COST and the SKT are adequate measures for the assessment of social cognition. Copyright © 2018. Published by Elsevier B.V.
A review of culturally adapted versions of the Oswestry Disability Index: the adaptation process, construct validity, test-retest reliability and internal consistency.

Science.gov (United States)

Sheahan, Peter J; Nelson-Wong, Erika J; Fischer, Steven L

2015-01-01

The Oswestry Disability Index (ODI) is a self-report-based outcome measure used to quantify the extent of disability related to low back pain (LBP), a substantial contributor to workplace absenteeism. The ODI tool has been adapted for use by patients in several non-English speaking nations. It is unclear, however, if these adapted versions of the ODI are as credible as the original ODI developed for English-speaking nations. The objective of this study was to conduct a review of the literature to identify culturally adapted versions of the ODI and to report on the adaptation process, construct validity, test-retest reliability and internal consistency of these ODIs. Following a pragmatic review process, data were extracted from each study with regard to these four outcomes. While most studies applied adaptation processes in accordance with best-practice guidelines, there were some deviations. However, all studies reported high-quality psychometric properties: group mean construct validity was 0.734 ± 0.094 (indicated via a correlation coefficient), test-retest reliability was 0.937 ± 0.032 (indicated via an intraclass correlation coefficient) and internal consistency was 0.876 ± 0.047 (indicated via Cronbach's alpha). Researchers can be confident when using any of these culturally adapted ODIs, or when comparing and contrasting results between cultures where these versions were employed. Implications for Rehabilitation Low back pain is the second leading cause of disability in the world, behind only cancer. The Oswestry Disability Index (ODI) has been developed as a self-report outcome measure of low back pain for administration to patients. An understanding of the various cross-cultural adaptations of the ODI is important for more concerted multi-national research efforts. This review examines 16 cross-cultural adaptations of the ODI and should inform the work of health care and rehabilitation professionals.
Development of a Saudi Food Frequency Questionnaire and testing its reliability and validity.

Science.gov (United States)

Gosadi, Ibrahim M; Alatar, Abdullah A; Otayf, Mojahed M; AlJahani, Dhaherah M; Ghabbani, Hisham M; AlRajban, Waleed A; Alrsheed, Abdullah M; Al-Nasser, Khalid A

2017-06-01

To create a food frequency questionnaire specifically designed to capture the dietary habits of Saudis and test its validity and reliability. Methods: This investigation is a longitudinal, test-retest study conducted in King Saud University, Riyadh, Kingdom of Saudi Arabia between December 2015 and March 2016. A list of 140 food items was included in the questionnaire where a closed-ended and open-ended approach was used. Regarding past year food frequency consumption and 24 hours dietary recall, body weight and height were collected. Internal consistency, test-retest reliability, completeness of the food list, and criterion validity were assessed. Results: One-hundred and thirty eight participants were interviewed to complete the 24 hours dietary recall and the constructed questionnaire. Approximately 85% of the food items reported in the dietary recall were covered in the food frequency questionnaire. The association of body mass index with meats (regression coefficients: 2.28) and dairy products consumption frequency was statistically significant (regression coefficients: 2.31). A high overall reproducibility rate of the questionnaire was detected (Pearsons' correlation coefficient: 0.78 p less than 0.001). Conclusion: The developed questionnaire has a high reliability and reasonable validity, and suitable for use in nutritional epidemiological investigations in Saudi Arabia.
The Validity and Reliability Test of the Indonesian Version of Gastroesophageal Reflux Disease Quality of Life (GERD-QOL) Questionnaire.

Science.gov (United States)

Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti

2017-01-01

to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; pGERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Validity and Reliability of the 8-Item Work Limitations Questionnaire.

Science.gov (United States)

Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C

2017-12-01

Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Test-retest reliability of the driving habits questionnaire in older self-driving adults.

Science.gov (United States)

Song, Chiang-Soon; Chun, Byung-Yoon; Chung, Hyun-Sook

2015-11-01

[Purpose] The purpose of this study was to investigate the test-retest reliability of the Driving Habits Questionnaire in community-dwelling older self-drivers. [Subjects and Methods] Seventy-four participants were recruited by convenience sampling from local rehabilitation centers. This was a cross-sectional study design that used two clinical measures: the Driving Habits Questionnaire and Mini-mental State Examination. To examine the test-retest reliability of the Driving Habits Questionnaire, the clinical tool was measured twice, five days apart. [Results] The Driving Habits Questionnaire showed good reliability for older community-dwelling self-drivers. The Cronbach's alpha coefficients for the four domains of dependence (0.572), difficulty (0.871), crashes and citations (0.689), and driving space (0.961) of the Driving Habits Questionnaire indicated good or high internal consistency. Driving difficulty correlated significantly with self-reported crashes and citations and driving space. [Conclusion] The results of this study suggest that the Driving Habits Questionnaire is a reliable measure of self-reported interview-based driving behavior in the community-dwelling elderly.
Isokinetic Strength and Endurance Tests used Pre- and Post-Spaceflight: Test-Retest Reliability

Science.gov (United States)

Laughlin, Mitzi S.; Lee, Stuart M. C.; Loehr, James A.; Amonette, William E.

2009-01-01

To assess changes in muscular strength and endurance after microgravity exposure, NASA measures isokinetic strength and endurance across multiple sessions before and after long-duration space flight. Accurate interpretation of pre- and post-flight measures depends upon the reliability of each measure. The purpose of this study was to evaluate the test-retest reliability of the NASA International Space Station (ISS) isokinetic protocol. Twenty-four healthy subjects (12 M/12 F, 32.0 +/- 5.6 years) volunteered to participate. Isokinetic knee, ankle, and trunk flexion and extension strength as well as endurance of the knee flexors and extensors were measured using a Cybex NORM isokinetic dynamometer. The first weekly session was considered a familiarization session. Data were collected and analyzed for weeks 2-4. Repeated measures analysis of variance (alpha=0.05) was used to identify weekly differences in isokinetic measures. Test-retest reliability was evaluated by intraclass correlation coefficients (ICC) (3,1). No significant differences were found between weeks in any of the strength measures and the reliability of the strength measures were all considered excellent (ICC greater than 0.9), except for concentric ankle dorsi-flexion (ICC=0.67). Although a significant difference was noted in weekly endurance measures of knee extension (p less than 0.01), the reliability of endurance measure by week were considered excellent for knee flexion (ICC=0.97) and knee extension (ICC=0.96). Except for concentric ankle dorsi-flexion, the isokinetic strength and endurance measures are highly reliable when following the NASA ISS protocol. This protocol should allow accurate interpretation isokinetic data even with a small number of crew members.
Forward lunge as a functional performance test in ACL deficient subjects: test-retest reliability

DEFF Research Database (Denmark)

Alkjaer, Tine; Henriksen, Marius; Dyhre-Poulsen, Poul

2009-01-01

The forward lunge movement may be used as a functional performance test of anterior cruciate ligament (ACL) deficient and reconstructed subjects. The purposes were 1) to determine the test-retest reliability of a forward lunge in healthy subjects and 2) to determine the required numbers...... of repetitions necessary to yield satisfactory reliability. Nineteen healthy subjects performed four trials of a forward lunge on two different days. The movement time, impulses of the ground reaction forces (IFz, IFy), knee joint kinematics and dynamics during the forward lunge were calculated. The relative...... reliability was determined by calculation of Intraclass Correlation Coefficients (ICC). The IFz, IFy and the positive work of the knee extensors showed excellent reliability (ICC >0.75). All other variables demonstrated acceptable reliability (0.4>ICCreliability increased when more than...
What to Do With "Moderate" Reliability and Validity Coefficients?

NARCIS (Netherlands)

Post, Marcel W

Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability
Test-retest reliability of Eurofit Physical Fitness items for children with visual impairments

NARCIS (Netherlands)

Houwen, Suzanne; Visscher, Chris; Hartman, Esther; Lemmink, Koen A. P. M.

The purpose of this study was to examine the test-retest reliability of physical fitness items from the European Test of Physical Fitness (Eurofit) for children with visual impairments. A sample of 21 children, ages 6-12 years, that were recruited from a special school for children with visual
Test-retest reliability of the 40 Hz EEG auditory steady-state response.

Directory of Open Access Journals (Sweden)

Kristina L McFadden

Full Text Available Auditory evoked steady-state responses are increasingly being used as a marker of brain function and dysfunction in various neuropsychiatric disorders, but research investigating the test-retest reliability of this response is lacking. The purpose of this study was to assess the consistency of the auditory steady-state response (ASSR across sessions. Furthermore, the current study aimed to investigate how the reliability of the ASSR is impacted by stimulus parameters and analysis method employed. The consistency of this response across two sessions spaced approximately 1 week apart was measured in nineteen healthy adults using electroencephalography (EEG. The ASSR was entrained by both 40 Hz amplitude-modulated white noise and click train stimuli. Correlations between sessions were assessed with two separate analytical techniques: a channel-level analysis across the whole-head array and b signal-space projection from auditory dipoles. Overall, the ASSR was significantly correlated between sessions 1 and 2 (p<0.05, multiple comparison corrected, suggesting adequate test-retest reliability of this response. The current study also suggests that measures of inter-trial phase coherence may be more reliable between sessions than measures of evoked power. Results were similar between the two analysis methods, but reliability varied depending on the presented stimulus, with click train stimuli producing more consistent responses than white noise stimuli.
Test-retest reliability of the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA).

Science.gov (United States)

Bégel, Valentin; Verga, Laura; Benoit, Charles-Etienne; Kotz, Sonja A; Bella, Simone Dalla

2018-04-27

Perceptual and sensorimotor timing skills can be comprehensively assessed with the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA). The battery has been used for testing rhythmic skills in healthy adults and patient populations (e.g., with Parkinson disease), showing sensitivity to timing and rhythm deficits. Here we assessed the test-retest reliability of the BAASTA in 20 healthy adults. Participants were tested twice with the BAASTA, implemented on a tablet interface, with a 2-week interval. They completed 4 perceptual tasks, namely, duration discrimination, anisochrony detection with tones and music, and the Beat Alignment Test (BAT). Moreover, they completed motor tasks via finger tapping, including unpaced and paced tapping with tones and music, synchronization-continuation, and adaptive tapping to a sequence with a tempo change. Despite high variability among individuals, the results showed stable test-retest reliability in most tasks. A slight but significant improvement from test to retest was found in tapping with music, which may reflect a learning effect. In general, the BAASTA was found a reliable tool for evaluating timing and rhythm skills. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
The test-retest reliability of the latent construct of executive function depends on whether tasks are represented as formative or reflective indicators.

Science.gov (United States)

Willoughby, Michael T; Kuhn, Laura J; Blair, Clancy B; Samek, Anya; List, John A

2017-10-01

This study investigates the test-retest reliability of a battery of executive function (EF) tasks with a specific interest in testing whether the method that is used to create a battery-wide score would result in differences in the apparent test-retest reliability of children's performance. A total of 188 4-year-olds completed a battery of computerized EF tasks twice across a period of approximately two weeks. Two different approaches were used to create a score that indexed children's overall performance on the battery-i.e., (1) the mean score of all completed tasks and (2) a factor score estimate which used confirmatory factor analysis (CFA). Pearson and intra-class correlations were used to investigate the test-retest reliability of individual EF tasks, as well as an overall battery score. Consistent with previous studies, the test-retest reliability of individual tasks was modest (rs ≈ .60). The test-retest reliability of the overall battery scores differed depending on the scoring approach (r mean = .72; r factor_ score = .99). It is concluded that the children's performance on individual EF tasks exhibit modest levels of test-retest reliability. This underscores the importance of administering multiple tasks and aggregating performance across these tasks in order to improve precision of measurement. However, the specific strategy that is used has a large impact on the apparent test-retest reliability of the overall score. These results replicate our earlier findings and provide additional cautionary evidence against the routine use of factor analytic approaches for representing individual performance across a battery of EF tasks.
Publishing nutrition research: validity, reliability, and diagnostic test assessment in nutrition-related research.

Science.gov (United States)

Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara

2010-03-01

This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Confiabilidade teste-reteste de aspectos da rede social no Estudo Pró-Saúde Test-retest reliability of measures of social network in the "Pró-Saúde" Study

Directory of Open Access Journals (Sweden)

Rosane Harter Griep

2003-06-01

Full Text Available OBJETIVO: Avaliar os níveis de confiabilidade teste-reteste de informações relativas à rede social no Estudo Pró-saúde. MÉTODOS: Foi estimada a confiabilidade pelo estudo teste-reteste por meio de questionário multidimensional aplicado a uma coorte de trabalhadores de uma universidade. O mesmo questionário foi preenchido duas vezes por 192 funcionários não efetivos da universidade, com duas semanas de intervalo entre as aplicações. A concordância foi estimada pela estatística Kappa (variáveis categóricas, estatística Kappa ponderado e modelos log-lineares (variáveis ordinais, e coeficiente de correlação intraclasse (variáveis discretas. RESULTADOS: As medidas de concordância situaram-se acima de 0,70 para a maioria das variáveis. Estratificando-se as informações segundo gênero, idade e escolaridade, observou-se que a confiabilidade não apresentou padrão consistente de variabilidade. A aplicação de modelos log-lineares indicou que, para as variáveis ordinais do estudo, o modelo de melhor ajuste foi o de "concordância diagonal mais associação linear por linear". CONCLUSÕES: Os altos níveis de confiabilidade estimados permitem concluir que o processo de aferição dos itens sobre rede social foi adequado para as características investigadas. Estudos de validação em andamento complementarão a avaliação da qualidade dessas informações.OBJECTIVE: To evaluate test-retest reliability of social network-related information of the" Pró-Saúde" study. METHODS: A test-retest reliability study was conducted using a multidimensional questionnaire applied to a cohort of university employees. The same questionnaire was filled out twice by 192 non-permanent employees with two weeks apart. Agreement was estimated using kappa statistics (categorical variables, weighted kappa statistics, log-linear models (ordinal variables, and intraclass correlation coefficient (discrete variables. RESULTS: Estimates of reliability
Validity and reliability of the novel thyroid-specific quality of life questionnaire, ThyPRO

DEFF Research Database (Denmark)

Watt, Torquil; Hegedüs, Laszlo; Groenvold, Mogens

2010-01-01

Background Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated. Aim To investigate clinical ('known-groups') validity and test-retest reliability of the Danish version of the ThyPRO. Methods For each of the 13 ThyPRO scales, we defined groups expected to have high versus low scores ('known-groups'). The clinical validity (known......-groups validity) was evaluated by whether the ThyPRO scales could detect expected differences in a cross-sectional study of 907 thyroid patients. Test-retest reliability was evaluated by intra-class correlations of two responses to the ThyPRO 2 weeks apart in a subsample of 87 stable patients. Results On all 13...
Test-retest reliability of the proposed DSM-5 eating disorder diagnostic criteria

Science.gov (United States)

Sysko, Robyn; Roberto, Christina A.; Barnes, Rachel D.; Grilo, Carlos M.; Attia, Evelyn; Walsh, B. Timothy

2012-01-01

The proposed DSM-5 classification scheme for eating disorders includes both major and minor changes to the existing DSM-IV diagnostic criteria. It is not known what effect these modifications will have on the ability to make reliable diagnoses. Two studies were conducted to evaluate the short-term test-retest reliability of the proposed DSM-5 eating disorder diagnoses: anorexia nervosa, bulimia nervosa, binge eating disorder, and feeding and eating conditions not elsewhere classified. Participants completed two independent telephone interviews with research assessors (n=70 Study 1; n=55 Study 2). Fair to substantial agreements (κ= 0.80 and 0.54) were observed across eating disorder diagnoses in Study 1 and Study 2, respectively. Acceptable rates of agreement were identified for the individual eating disorder diagnoses, including DSM-5 anorexia nervosa (κ’s of 0.81 to 0.97), bulimia nervosa (κ=0.84), binge eating disorder (κ’s of 0.75 and 0.61), and feeding and eating disorders not elsewhere classified (κ’s of 0.70 and 0.46). Further, improved short-term test-retest reliability was noted when using the DSM-5, in comparison to DSM-IV, criteria for binge eating disorder. Thus, these studies found that trained interviewers can reliably diagnose eating disorders using the proposed DSM-5 criteria; however, additional data from general practice settings and community samples are needed. PMID:22401974

Test-retest reliability of the isernhagen work systems functional capacity evaluation in healthy adults

NARCIS (Netherlands)

Reneman, MF; Brouwer, S; Meinema, A; Dijkstra, PU; Geertzen, JHB; Groothoff, JW

2004-01-01

Aim of this study was to investigate test-retest reliability of the Isernhagen Work System Functional Capacity Evaluation (IWS FCE) in healthy subjects. The IWS FCE consists of 28 tests that reflect work-related activities such as lifting, carrying, bending, etc. A convenience sample of 26 healthy
The Test-Retest Reliability of New Generation Power Indices of Wingate All-Out Test

Directory of Open Access Journals (Sweden)

Ozgur Ozkaya

2018-04-01

Full Text Available Although reliability correlations of traditional power indices of the Wingate test have been well documented, no study has analyzed new generation power indices based on milliseconds obtained from a Peak Bike. The purpose of this study was to investigate the retest reliability of new generation power indices. Thirty-two well-trained male athletes who were specialized in basketball, football, tennis, or track and field volunteered to take part in the study (age: 24.3 ± 2.2 years; body mass: 77 ± 8.3 kg; height: 180.3 ± 6.3 cm. Participants performed two Wingate all-out sessions on two separate days. Intra-class correlation coefficient (ICC, standard error measurement (SEM, smallest real differences (SRD and coefficient of variation (CV scores were analyzed based on the test and retest data. Reliability results of traditional power indices calculated based on 5-s means such as peak power, average power, power drop, and fatigue index ratio were similar with the previous findings in literature (ICC ≥ 0.94; CV ≤ 2.8%; SEM ≤ 12.28; SRD% ≤ 7.7%. New generation power indices such as peak power, average power, lowest power, power drop, fatigue index, power decline, maximum speed as rpm, and amount of total energy expenditure demonstrated high reliability (ICC ≥ 0.94; CV ≤ 4.3%; SEM ≤ 10.36; SRD% ≤ 8.8%. Time to peak power, time at maximum speed, and power at maximum speed showed a moderate level of reliability (ICC ≥ 0.73; CV ≤ 8.9%; SEM ≤ 63.01; SRD% ≤ 22.4%. The results of this study indicate that reliability correlations and SRD% of new generation power and fatigue-related indices are similar with traditional 5-s means. However, new time-related indices are very sensitive and moderately reliable.
Test–Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study

Directory of Open Access Journals (Sweden)

Clare E. Palmer

2018-01-01

Full Text Available Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington’s disease (HD and Parkinson’s disease (PD across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test–retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test–retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test–retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT from the combined-Simon-Stop-Signal task showed moderate test–retest reliability, however, the combined SSRT interference effect showed poor test–retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Interrater and Test-Retest Reliability and Minimal Detectable Change of the Balance Evaluation Systems Test (BESTest) and Subsystems With Community-Dwelling Older Adults.

Science.gov (United States)

Wang-Hsu, Elizabeth; Smith, Susan S

2017-01-10

Falls are a common cause of injuries and hospital admissions in older adults. Balance limitation is a potentially modifiable factor contributing to falls. The Balance Evaluation Systems Test (BESTest), a clinical balance measure, categorizes balance into 6 underlying subsystems. Each of the subsystems is scored individually and summed to obtain a total score. The reliability of the BESTest and its individual subsystems has been reported in patients with various neurological disorders and cancer survivors. However, the reliability and minimal detectable change (MDC) of the BESTest with community-dwelling older adults have not been reported. The purposes of our study were to (1) determine the interrater and test-retest reliability of the BESTest total and subsystem scores; and (2) estimate the MDC of the BESTest and its individual subsystem scores with community-dwelling older adults. We used a prospective cohort methodological design. Community-dwelling older adults (N = 70; aged 70-94 years; mean = 85.0 [5.5] years) were recruited from a senior independent living community. Trained testers (N = 3) administered the BESTest. All participants were tested with the BESTest by the same tester initially and then retested 7 to 14 days later. With 32 of the participants, a second tester concurrently scored the retest for interrater reliability. Testers were blinded to each other's scores. Intraclass correlation coefficients [ICC(2,1)] were used to determine the interrater and test-retest reliability. Test-retest reliability was also analyzed using method error and the associated coefficients of variation (CVME). MDC was calculated using standard error of measurement. Interrater reliability (N = 32) of the BESTest total score was ICC(2, 1) = 0.97 (95% confidence interval [CI], 0.94-0.99). The ICCs for the individual subsystem scores ranged from 0.85 to 0.94. Test-retest reliability (N = 70) of the BESTest total score was ICC(2,1) = 0.93 (95% CI, 0.89-0.96). ICCs for the
Test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy.

Science.gov (United States)

Savva, Christos; Giakas, Giannis; Efstathiou, Michalis; Karagiannis, Christos

2014-01-01

The purpose of this study was to evaluate the test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy (CR). A convenience sample of 19 participants (14 men and 5 women; mean ± SD age, 50.5 ± 12 years) with CR was measured using a Jamar hydraulic hand dynamometer by the same rater on 2 different testing sessions with an interval of 7 days between sessions. Data collection procedures followed standardized grip strength testing guidelines established by the American Society of Hand Therapists. During the repeated measures, patients were advised to rest their upper limb in the standardized arm position and encouraged to exert 3 maximum gripping efforts. The mean value of the 3 efforts (measured in kilogram force [Kgf]) was used for data analysis. The intraclass correlation coefficient, SEM, and the Bland-Altman plot were used to estimate test-retest reliability and measurement precision. Grip strength measurement in CR demonstrated an intraclass correlation coefficient of 0.976, suggesting excellent test-retest reliability. The small SEM in both testing sessions (SEM1, 2.41 Kgf; SEM2, 2.51 Kgf) as well as the narrow width of the 95% limits of agreements (95% limits of agreement, -4.9 to 4.4 Kgf) in the Bland-Altman plot reflected precise measurements of grip strength in both occasions. Excellent test-retest reliability for grip strength measurement was measured in patients with CR, demonstrating that a hydraulic hand dynamometer could be used as an outcome measure for these patients. Copyright © 2014 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
Test-retest reliability of trunk motor variability measured by large-array surface electromyography.

Science.gov (United States)

Abboud, Jacques; Nougarou, François; Loranger, Michel; Descarreaux, Martin

2015-01-01

The objective of this study was to evaluate the test-retest reliability of the trunk muscle activity distribution in asymptomatic participants during muscle fatigue using large-array surface electromyography (EMG). Trunk muscle activity distribution was evaluated twice, with 3 to 4 days between them, in 27 asymptomatic volunteers using large-array surface EMG. Motor variability, assessed with 2 different variables (the centroid coordinates of the root mean square map and the dispersion variable), was evaluated during a low back muscle fatigue task. Test-retest reliability of muscle activity distribution was obtained using Pearson correlation coefficients. A shift in the distribution of EMG amplitude toward the lateral-caudal region of the lumbar erector spinae induced by muscle fatigue was observed. Moderate to very strong correlations were found between both sessions in the last 3 phases of the fatigue task for both motor variability variables, whereas weak to moderate correlations were found in the first phases of the fatigue task only for the dispersion variable. These findings show that, in asymptomatic participants, patterns of EMG activity are less reliable in initial stages of muscle fatigue, whereas later stages are characterized by highly reliable patterns of EMG activity. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.
Validity and test–retest reliability of the Persian version of the Montgomery–Asberg Depression Rating Scale

Directory of Open Access Journals (Sweden)

Ahmadpanah M

2016-03-01

Full Text Available Mohammad Ahmadpanah,1 Meisam Sheikhbabaei,1 Mohammad Haghighi,1 Fatemeh Roham,1 Leila Jahangard,1 Amineh Akhondi,2 Dena Sadeghi Bahmani,3 Hafez Bajoghli,4 Edith Holsboer-Trachsler,3 Serge Brand3,5 1Behavioral Disorders and Substances Abuse Research Center, Hamadan University of Medical Sciences, Hamadan, Iran; 2Hamadan Educational Organization, Ministry of Education, Hamadan, Iran; 3Center for Affective, Stress, and Sleep Disorders, Psychiatric Clinics of the University of Basel, Basel, Switzerland; 4Iranian National Center for Addiction Studies (INCAS, Tehran University of Medical Sciences, Tehran, Iran; 5Department of Sport, Exercise and Health Science, Sport Science Section, University of Basel, Basel, Switzerland Background and aims: The Montgomery–Asberg Depression Rating Scale (MADRS is an expert’s rating tool to assess the severity and symptoms of depression. The aim of the present two studies was to validate the Persian version of the MADRS and determine its test–retest reliability in patients diagnosed with major depressive disorders (MDD. Methods: In study 1, the translated MADRS and the Hamilton Depression Rating Scale (HDRS were applied to 210 patients diagnosed with MDD and 100 healthy adults. In study 2,200 patients diagnosed with MDD were assessed with the MADRS in face-to-face interviews. Thereafter, 100 patients were assessed 3–14 days later, again via face-to-face-interviews, while the other 100 patients were assessed 3–14 days later via a telephone interview. Results: Study 1: The MADRS and HDRS scores between patients with MDD and healthy controls differed significantly. Agreement between scoring of the MADRS and HDRS was high (r=0.95. Study 2: The intraclass correlation coefficient (test–retest reliability was r=0.944 for the face-to-face interviews, and r=0.959 for the telephone interviews. Conclusion: The present data suggest that the Persian MADRS has high validity and excellent test–retest reliability over
Content validity and reliability of test of gross motor development in Chilean children

Directory of Open Access Journals (Sweden)

Marcelo Cano-Cappellacci

2015-01-01

Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2 for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
Test-retest reliability and minimal detectable change of two simplified 3-point balance measures in patients with stroke.

Science.gov (United States)

Chen, Yi-Miau; Huang, Yi-Jing; Huang, Chien-Yu; Lin, Gong-Hong; Liaw, Lih-Jiun; Lee, Shih-Chieh; Hsieh, Ching-Lin

2017-10-01

The 3-point Berg Balance Scale (BBS-3P) and 3-point Postural Assessment Scale for Stroke Patients (PASS-3P) were simplified from the BBS and PASS to overcome the complex scoring systems. The BBS-3P and PASS-3P were more feasible in busy clinical practice and showed similarly sound validity and responsiveness to the original measures. However, the reliability of the BBS-3P and PASS-3P is unknown limiting their utility and the interpretability of scores. We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BBS-3P and PASS-3P in patients with stroke. Cross-sectional study. The rehabilitation departments of a medical center and a community hospital. A total of 51 chronic stroke patients (64.7% male). Both balance measures were administered twice 7 days apart. The test-retest reliability of both the BBS-3P and PASS-3P were examined by intraclass correlation coefficients (ICC). The MDC and its percentage over the total score (MDC%) of each measure was calculated for examining the random measurement errors. The ICC values of the BBS-3P and PASS-3P were 0.99 and 0.97, respectively. The MDC% (MDC) of the BBS-3P and PASS-3P were 9.1% (5.1 points) and 8.4% (3.0 points), respectively, indicating that both measures had small and acceptable random measurement errors. Our results showed that both the BBS-3P and the PASS-3P had good test-retest reliability, with small and acceptable random measurement error. These two simplified 3-level balance measures can provide reliable results over time. Our findings support the repeated administration of the BBS-3P and PASS-3P to monitor the balance of patients with stroke. The MDC values can help clinicians and researchers interpret the change scores more precisely.
Test-Retest Reliability and Practice Effects of the Stability Evaluation Test.

Science.gov (United States)

Williams, Richelle M; Corvo, Matthew A; Lam, Kenneth C; Williams, Travis A; Gilmer, Lesley K; McLeod, Tamara C Valovich

2017-01-17

Postural control plays an essential role in concussion evaluation. The Stability Evaluation Test (SET) aims to objectively analyze postural control by measuring sway velocity on the NeuroCom's VSR portable force platform (Natus, San Carlos, CA). To assess the test-retest reliability and practice effects of the SET protocol. Cohort. Research Laboratory. Fifty healthy adults (males=20, females=30, age=25.30±3.60 years, height=166.60±12.80 cm, mass=68.80±13.90 kg). All participants completed four trials of the SET. Each trial consisted of six 20-second balance tests with eyes closed, under the following conditions: double-leg firm (DFi), single-leg firm (SFi), tandem firm (TFi), double-leg foam (DFo), single-leg foam (SFo), and tandem foam (TFo). Each trial was separated by a 5-minute seated rest period. The dependent variable was sway velocity (deg/sec), with lower values indicating better balance. Sway velocity was recorded for each of the six conditions as well as a composite score for each trial. Test-retest reliability was analyzed across four trials with Intraclass Correlation Coefficients. Practice effects analyzed with repeated measures analysis of variance, followed by Tukey post-hoc comparisons for any significant main effects (preliability values were good to excellent: DFi (ICC=0.88;95%CI:0.81,0.92), SFi (ICC=0.75;95%CI:0.61,0.85), TFi (ICC=0.84;95%CI:0.75,0.90), DFo (ICC=0.83;95%CI:0.74,0.90), SFo (ICC=0.82;95%CI:0.72,0.89), TFo (ICC=0.81;95%CI:0.69,0.88), and composite score (ICC=0.93;95%CI:0.88,0.95). Significant practice effects (preliability for the assessment of postural control in healthy adults. Due to the practice effects noted, a familiarization session is recommended (i.e., all 6 conditions) prior to recording the data. Future studies should evaluate injured patients to determine meaningful change scores during various injuries.
Hypertension Knowledge-Level Scale (HK-LS): A Study on Development, Validity and Reliability

OpenAIRE

Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin

2012-01-01

This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensio...
Test-Retest Reliability of Self-Reported Sexual Health Measures among US Hispanic Adolescents

Science.gov (United States)

Jerman, Petra; Berglas, Nancy F.; Rohrbach, Louise A.; Constantine, Norman A.

2016-01-01

Objective: Although Hispanic adolescents in the USA are often the focus of sexual health interventions, their response to survey measures has rarely been assessed within evaluation studies. This study documents the test-retest reliability of a wide range of self-reported sexual health values, attitudes, knowledge and behaviours among Hispanic…
Test-retest reliability of the 20-sec Wingate test to assess anaerobic power in children with cerebral palsy

NARCIS (Netherlands)

Dallmeijer, A.J.; Scholtes, V.A.B.; Brehm, M.A.; Becher, J.G.

2013-01-01

OBJECTIVE: The aim of this study was to determine the test-retest reliability of the 20-sec Wingate anaerobic test in children with cerebral palsy. DESIGN: Participants were 22 ambulant children with cerebral palsy, with Gross Motor Function Classification System levels I (limitations in advanced
Test-Retest Reliability of the 20-sec Wingate Test to Assess Anaerobic Power in Children with Cerebral Palsy

NARCIS (Netherlands)

Dallmeijer, Annet J.; Scholtes, Vanessa A. B.; Brehm, Merel-Anne; Becher, Jules G.

2013-01-01

Objective: The aim of this study was to determine the test-retest reliability of the 20-sec Wingate anaerobic test in children with cerebral palsy. Design: Participants were 22 ambulant children with cerebral palsy, with Gross Motor Function Classification System levels I (limitations in advanced
Test-retest reliability and practice effects of the Wechsler Memory Scale-III.

Science.gov (United States)

Lo, Ada H Y; Humphreys, Michael; Byrne, Gerard J; Pachana, Nancy A

2012-09-01

Although serial administration of cognitive tests is increasingly common, there is a paucity of research on test-retest reliabilities and practice effects, both of which are important for evaluating changes in functioning. Reliability is generally conceptualized as involving short-lasting changes in performance. However, when repeated testing occurs over a period of years, there will be some longer lasting effects. The implications of these longer lasting effects and practice effects on reliability were examined in the context of repeated administrations of the Wechsler Memory Scale-III in 339 community-dwelling women aged 40-79 years over 2 to 7 years. The results showed that Logical Memory and Verbal Paired Associates subtests were consistently the most reliable subtests across the age cohorts. The magnitude of practice effects varied as a function of subtests and age. The largest practice effects were found in the youngest age cohort, especially on the Faces, Logical Memory, and Verbal Paired Associates subtests. ©2012 The British Psychological Society.
Reliability and construct validity of Yo-Yo tests in untrained and soccer-trained school-girls aged 9-16

DEFF Research Database (Denmark)

Póvoas, Susana C A; Castagna, Carlo; Soares, José Manuel da Costa

2016-01-01

Purpose: The reliability and construct validity of three age-adapted-intensity Yo-Yo tests were evaluated in untrained (n=67) vs. soccer-trained (n=65) 9-16-year-old school-girls. Methods: Tests were performed 7 days apart for reliability (9-11-year-old: Yo-Yo intermittent recovery level 1 children...... during test and retest. Conclusion: The Yo-Yo tests are reliable for determining intermittent-exercise capacity and %HRpeak for soccer players and untrained 9-16-year-old girls. They also possess construct validity with better performances for soccer players compared to untrained age-matched girls...
Maximal cardiorespiratory fitness testing in individuals with chronic stroke with cognitive impairment: practice test effects and test-retest reliability.

Science.gov (United States)

Olivier, Charles; Doré, Jean; Blanchet, Sophie; Brooks, Dina; Richards, Carol L; Martel, Guy; Robitaille, Nancy-Michelle; Maltais, Désirée B

2013-11-01

To evaluate, for individuals with chronic stroke with cognitive impairment, (1) the effects of a practice test on peak cardiorespiratory fitness test results; (2) cardiorespiratory fitness test-retest reliability; and (3) the relationship between individual practice test effects and cognitive impairment. Cross-sectional. Rehabilitation center. A convenience sample of 21 persons (men [n=12] and women [n=9]; age range, 48-81y; 44.9±36.2mo poststroke) with cognitive impairments who had sufficient lower limb function to perform the test. Not applicable. Peak oxygen consumption (Vo(2)peak, ml·kg(-1)·min(-1)). Test-retest reliability of Vo(2)peak was excellent (intraclass correlation coefficient model 2,1 [ICC2,1]=.94; 95% confidence interval [CI], .86-.98). A paired t test showed that there was no significant difference for the group for Vo(2)peak obtained from 2 symptom-limited cardiorespiratory fitness tests performed 1 week apart on a semirecumbent cycle ergometer (test 2-test 1 difference, -.32ml·kg(-1)·min(-1); 95% CI, -.69 to 1.33ml·kg(-1)·min(-1); P=.512). Individual test-retest differences in Vo(2)peak were, however, positively related to general cognitive function as measured by the Mini-Mental State Examination (ρ=.485; Preliably measured in this group without a practice test. General cognitive function, however, may influence the effect of a practice test in that those with lower general cognitive function appear to respond differently to a practice test than those with higher cognitive function. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
We need more replication research - A case for test-retest reliability.

Science.gov (United States)

Leppink, Jimmie; Pérez-Fuster, Patricia

2017-06-01

Following debates in psychology on the importance of replication research, we have also started to see pleas for a more prominent role for replication research in medical education. To enable replication research, it is of paramount importance to carefully study the reliability of the instruments we use. Cronbach's alpha has been the most widely used estimator of reliability in the field of medical education, notably as some kind of quality label of test or questionnaire scores based on multiple items or of the reliability of assessment across exam stations. However, as this narrative review outlines, Cronbach's alpha or alternative reliability statistics may complement but not replace psychometric methods such as factor analysis. Moreover, multiple-item measurements should be preferred above single-item measurements, and when using single-item measurements, coefficients as Cronbach's alpha should not be interpreted as indicators of the reliability of a single item when that item is administered after fundamentally different activities, such as learning tasks that differ in content. Finally, if we want to follow up on recent pleas for more replication research, we have to start studying the test-retest reliability of the instruments we use.
Reliability and Validity of Colored Progressive Matrices for 4-6 Age Children

Directory of Open Access Journals (Sweden)

Ahmet Bildiren

2017-06-01

Full Text Available In this research, it was aimed to test the reliability and validity of Colored Progressive Matrices for children between the ages of 4 to 6 from 15 schools. The sample of the study consisted of 640 kindergarten children. Test-retest and parallel form were used for reliability analyses. For the validity analysis, the relations between the Colored Progressive Matrices Test and Bender Gestalt Visual Motor Sensitivity Test, WISC-R and TONI-3 tests were examined. The results showed that there was a significant relation between the test-retest results and the parallel forms in all the age groups. Validity analyses showed strong correlations between the Colored Progressive Matrices and all the other measures.
Reliability and validity of the Incontinence Quiz-Turkish version.

Science.gov (United States)

Kara, Kerime C; Çıtak Karakaya, İlkim; Tunalı, Nur; Karakaya, Mehmet G

2018-01-01

The aim of this study was to investigate the reliability and validity of the Turkish version of the Incontinence Quiz, which was developed by Branch et al. (1994), to assess women's knowledge of and attitudes toward urinary incontinence. Comprehensibility of the Turkish version of the 14-item Incontinence Quiz, which was prepared following translation-back translation procedures, was tested on a pilot group of eight women, and its internal reliability, test-retest reliability and construct validity were assessed in 150 women who attended the gynecology clinics of three hospitals in İçel, Turkey. Physical and sociodemographic characteristics and presence of incontinence complaints were also recorded. Data were analyzed at the 0.05 alpha level, using SPSS version 22. The scale had good reliability and validity. The internal reliability coefficient (Cronbach α) was 0.80, test-retest correlation coefficients were 0.83-0.94; and with regard to construct validity, Kaiser-Meyer-Olkin coefficient was 0.76 and Barlett sphericity test was 562.777 (P = 0.000). Turkish version of the Incontinence Quiz had a four-factor structure, with Eigenvalues ranging from 1.17 to 4.08. The Incontinence Quiz-Turkish version is a highly comprehensible, reliable and valid scale, which may be used to assess Turkish-speaking women's knowledge of and attitudes toward urinary incontinence. © 2017 Japan Society of Obstetrics and Gynecology.

Reliability and concurrent validity of a motor skill competence test among 4- to 12-year old children

NARCIS (Netherlands)

Hoeboer, Joris; Krijger-Hombergen, Michiel; Savelsbergh, Geert; De Vries, Sanne

2017-01-01

The purpose of this study was to examine the test-retest reliability, internal consistency and concurrent validity of the Athletic Skills Track (AST). During a regular PE lesson, 930 4- to 12-year old children (448 girls, 482 boys) completed two motor skill competence tests: (1) the
Intensity response function of the photopic negative response (PhNR): effect of age and test-retest reliability.

Science.gov (United States)

Joshi, Nabin R; Ly, Emma; Viswanathan, Suresh

2017-08-01

To assess the effect of age and test-retest reliability of the intensity response function of the full-field photopic negative response (PhNR) in normal healthy human subjects. Full-field electroretinograms (ERGs) were recorded from one eye of 45 subjects, and 39 of these subjects were tested on two separate days with a Diagnosys Espion System (Lowell, MA, USA). The visual stimuli consisted of brief (test-retest reliability was assessed with the Wilcoxon signed-rank test and Bland-Altman analysis. Holm's correction was applied to account for multiple comparisons. V max of BT was significantly smaller than that of PT and b-wave, and the V max of PT and b-wave was not significantly different from each other. The slope parameter n was smallest for BT and the largest for b-wave and the difference between the slopes of all three measures were statistically significant. Small differences observed in the mean values of K for the different measures did not reach statistical significance. The Wilcoxon signed-rank test indicated no significant differences between the two test visits for any of the Naka-Rushton parameters for the three ERG measures, and the Bland-Altman plots indicated that the mean difference between test and retest measurements of the different fit parameters was close to zero and within 6% of the average of the test and retest values of the respective parameters for all three ERG measurements, indicating minimal bias. While the coefficient of reliability (COR, defined as 1.96 times the standard deviation of the test and retest difference) of each fit parameter was more or less comparable across the three ERG measurements, the %COR (COR normalized to the mean test and retest measures) was generally larger for BT compared to both PT and b-wave for each fit parameter. The Naka-Rushton fit parameters did not show statistically significant changes with age for any of the ERG measures when corrections were applied for multiple comparisons. However, the V max of
The Screening Test for Emotional Problems--Teacher-Report Version (Step-T): Studies of Reliability and Validity

Science.gov (United States)

Erford, Bradley T.; Butler, Caitlin; Peacock, Elizabeth

2015-01-01

The Screening Test for Emotional Problems-Teacher Version (STEP-T) was designed to identify students aged 7-17 years with wide-ranging emotional disturbances. Coefficients alpha and test-retest reliability were adequate for all subscales except Anxiety. The hypothesized five-factor model fit the data very well and external aspects of validity were…
Comprehension of Written Grammar Test: Reliability and Known-Groups Validity Study With Hearing and Deaf and Hard-of-Hearing Students.

Science.gov (United States)

Cannon, Joanna E; Hubley, Anita M; Millhoff, Courtney; Mazlouman, Shahla

2016-01-01

The aim of the current study was to gather validation evidence for the Comprehension of Written Grammar (CWG; Easterbrooks, 2010) receptive test of 26 grammatical structures of English print for use with children who are deaf and hard of hearing (DHH). Reliability and validity data were collected for 98 participants (49 DHH and 49 hearing) in Grades 2-6. The objectives were to: (a) examine 4-week test-retest reliability data; and (b) provide evidence of known-groups validity by examining expected differences between the groups on the CWG vocabulary pretest and main test, as well as selected structures. Results indicated excellent test-retest reliability estimates for CWG test scores. DHH participants performed statistically significantly lower on the CWG vocabulary pretest and main test than the hearing participants. Significantly lower performance by DHH participants on most expected grammatical structures (e.g., basic sentence patterns, auxiliary "be" singular/plural forms, tense, comparatives, and complementation) also provided known groups evidence. Overall, the findings of this study showed strong evidence of the reliability of scores and known group-based validity of inferences made from the CWG. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Development, construct validity and test-retest reliability of a field-based wheelchair mobility performance test for wheelchair basketball

NARCIS (Netherlands)

de Witte, Annemarie M. H.; Hoozemans, Marco J. M.; Berger, Monique A. M.; van der Slikke, Rienk M. A.; van der Woude, Lucas H. V.; Veeger, Dirkjan (H. E. J)

2018-01-01

The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of
Reliability and Validity of the Greek Migraine Disability Assessment (MIDAS) Questionnaire.

Science.gov (United States)

Oikonomidi, Theodora; Vikelis, Michail; Artemiadis, Artemios; Chrousos, George P; Darviri, Christina

2018-03-01

The Migraine Disability Assessment (MIDAS) Questionnaire is a reliable and valid instrument for migraine-related disability. Such a tool is needed to quantify migraine-related disability in the Greek population. This validation study aims to assess the test-retest reliability, internal consistency, item discriminant and convergent validity of the Greek translation of the MIDAS. Adults diagnosed with migraine completed the MIDAS Questionnaire on two occasions 3 weeks apart to assess reliability, and completed the RAND-36 to assess validity. Participants (n = 152) had a median MIDAS score of 24 and mostly severe disability (58% were grade IV). The test-retest reliability analysis (N = 59) revealed excellent reliability for the total score. Internal consistency was α = 0.71 for initial and α = 0.82 for retest completion. For item discriminant validity, the correlations between each question and the total score were significant, with high correlations for questions 2-5 (range 0.67 ≤ r ≤ 0.79; p MIDAS score tended to have better wellbeing. Psychometric properties are comparable with those of other published validation studies of the MIDAS and the original. Findings on question 1 show that missing work/school days may be closely related with increased affect issues. The Greek version of the MIDAS Questionnaire has good reliability and validity. This study allowed for cross-cultural comparability of research findings.
Test-Retest Reliability of Handgrip Strength as an Outcome Measure in Patients With Symptoms of Shoulder Impingement Syndrome.

Science.gov (United States)

Savva, Christos; Mougiaris, Paraskevas; Xadjimichael, Christoforos; Karagiannis, Christos; Efstathiou, Michalis

The purpose of this study was to investigate the degree of test-retest reliability of grip strength measurement using a hand dynamometer in patients with shoulder impingement syndrome. A total of 19 patients (10 women and 9 men; mean ± standard deviation age, 33.2 ± 12.9 years; range 18-59 years) with shoulder impingement syndrome were measured using a hand dynamometer by the same data collector in 2 different testing sessions with a 7-day interval. During each session, patients were encouraged to exert 3 maximal isometric contractions on the affected hand and the mean value of the 3 efforts (measured in kilogram-force [Kgf]) was used for data analysis. The intraclass correlation coefficient (ICC 2,1 ) as well as the standard error of measurement (SEM) and Bland-Altman plot were used to estimate the degree of test-retest reliability and the measurement error, respectively. Grip strength data analysis revealed an ICC 2,1 score of 0.94, which, based on the Shrout classification, is considered as excellent test-retest reliability of grip strength measurement. The small values of SEMs reported in both sessions (SEM 1 , 2.55 Kgf; SEM 2 , 2.39 Kgf) and the small width of the 95% limits of agreement in the Bland-Altman plot (ranging from -7.39 Kgf to 7.03 Kgf) reflected the measurement precision and the narrow variation of the differences during the 2 testing sessions. Results from this study identified excellent test-retest reliability of grip strength measurement in shoulder impingement syndrome, indicating its potential use as an outcome measure in clinical practice. Copyright © 2018. Published by Elsevier Inc.
Temporal Stability of Strength-Based Assessments: Test-Retest Reliability of Student and Teacher Reports

Science.gov (United States)

Romer, Natalie; Merrell, Kenneth W.

2013-01-01

This study focused on evaluating the temporal stability of self-reported and teacher-reported perceptions of students' social and emotional skills and assets. We used a test-retest reliability procedure over repeated administrations of the child, adolescent, and teacher versions of the "Social-Emotional Assets and Resilience Scales".…
A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing.

Science.gov (United States)

DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M

2017-10-27

The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no
The Validity and Reliability of the Mobbing Scale (MS)

Science.gov (United States)

Yaman, Erkan

2009-01-01

The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Test-retest reliability of stride time variability while dual tasking in healthy and demented adults with frontotemporal degeneration

Directory of Open Access Journals (Sweden)

Herrmann Francois R

2011-07-01

Full Text Available Abstract Background Although test-retest reliability of mean values of spatio-temporal gait parameters has been assessed for reliability while walking alone (i.e., single tasking, little is known about the test-retest reliability of stride time variability (STV while performing an attention demanding-task (i.e., dual tasking. The objective of this study was to examine immediate test-retest reliability of STV while single and dual tasking in cognitively healthy older individuals (CHI and in demented patients with frontotemporal degeneration (FTD. Methods Based on a cross-sectional design, 69 community-dwelling CHI (mean age 75.5 ± 4.3; 43.5% women and 14 demented patients with FTD (mean age 65.7 ± 9.8 years; 6.7% women walked alone (without performing an additional task; i.e., single tasking and while counting backward (CB aloud starting from 50 (i.e., dual tasking. Each subject completed two trials for all the testing conditions. The mean value and the coefficient of variation (CoV of stride time while walking alone and while CB at self-selected walking speed were measured using GAITRite® and SMTEC® footswitch systems. Results ICC of mean value in CHI under both walking conditions were higher than ICC of demented patients with FTD and indicated perfect reliability (ICC > 0.80. Reliability of mean value was better while single tasking than dual tasking in CHI (ICC = 0.96 under single-task and ICC = 0.86 under dual-task, whereas it was the opposite in demented patients (ICC = 0.65 under single-task and ICC = 0.81 under dual-task. ICC of CoV was slight to poor whatever the group of participants and the walking condition (ICC Conclusions The immediate test-retest reliability of the mean value of stride time in single and dual tasking was good in older CHI as well as in demented patients with FTD. In contrast, the variability of stride time was low in both groups of participants.
Validity and reliability of the single-trial line drill test of anaerobic power in basketball players.

Science.gov (United States)

Fatouros, I G; Laparidis, K; Kambas, A; Chatzinikolaou, A; Techlikidou, E; Katrabasas, I; Douroudos, I; Leontsini, D; Berberidou, F; Draganidis, D; Christoforidis, C; Tsoukas, D; Kelis, S; Taxildaris, K

2011-03-01

This study evaluated the validity, reliability, and sensitivity of the single-trial line drill test (SLDT) for anaerobic power assessment. Twenty-four volunteers were assigned to either a control (C, N.=12) or an experimental (BP, N.=12 basketball players) group. SLDT's (time-to-complete) concurrent validity was evaluated against the Wingate testing (WAnT: mean [MP] and peak power [PP]) and a 30-sec vertical jump testing test (VJT: mean height and MP). Blood lactate concentration was measured at rest and immediately post-test. SLDT's reliability [test-retest intraclass correlation coefficients (ICC), coefficient of variation (CV), Bland-Altman plots] and sensitivity were determined (one-way ANOVA). Kendall's tau correlation analysis revealed correlations (Pbasketball players.
Reliability and validity of the Dutch Recovery Stress Questionnaire for athletes

NARCIS (Netherlands)

Nederhof, Esther; Brink, Michel S.; Lemmink, Koen A. P. M.

2008-01-01

The purpose of the present study was to investigate the cross-cultural validity of the Recovery Stress Questionnaire for Athletes (RESTQ-sport) by analysing reliability and validity of a Dutch translation. Two studies were performed to assess test-retest reliability with a one week interval,
Development of a Saudi Food Frequency Questionnaire and testing its reliability and validity

OpenAIRE

Gosadi, Ibrahim M.; Alatar, Abdullah A.; Otayf, Mojahed M.; AlJahani, Dhaherah M.; Ghabbani, Hisham M.; AlRajban, Waleed A.; Alrsheed, Abdullah M.; Al-Nasser, Khalid A.

2017-01-01

Objectives: To create a food frequency questionnaire specifically designed to capture the dietary habits of Saudis and test its validity and reliability. Methods: This investigation is a longitudinal, test-retest study conducted in King Saud University, Riyadh, Kingdom of Saudi Arabia between December 2015 and March 2016. A list of 140 food items was included in the questionnaire where a closed-ended and open-ended approach was used. Regarding past year food frequency consumption and 24 hours...
Validation of the Stroke Specific Quality of Life Scale (SS-QOL): test of reliability and validity of the Danish version (SS-QOL-DK).

Science.gov (United States)

Muus, Ingrid; Williams, Linda S; Ringsberg, Karin C

2007-07-01

To test the reliability and validity of the Danish version of the Stroke Specific Quality of Life Scale version 2.0 (SS-QOL-DK), an instrument for evaluation of health-related quality of life. A correlational study. A stroke unit that provides acute care and rehabilitation for stroke patients in Frederiksborg County, Denmark. One hundred and fifty-two stroke survivors participated; 24 of these performed test-retest. Questionnaires were sent out and returned by mail. A subsequent telephone interview assessed functional level and missing items. Test-retest was measured using Spearman's r, internal consistency was estimated using Cronbach's alpha, and evaluation of floor and ceiling values in proportion of minimum and maximum scores. Construct validity was assessed by comparing patients' scores on the SS-QOL-DK with those obtained by other test methods: Beck's Depression Index, the General Health Survey Short Form 36 (SF-36), the Barthel Index and the National Institutes of Health Stroke Scale, evaluating shared variance using coefficient of determination, r2. Comparing groups with known scores assessed known-group validity. Convergent and discriminant validity were assessed. Test-retest of SS-QOL-DK showed excellent stability, Spearman's r = 0.65-0.99. Internal consistency for all domains showed Cronbach's alpha = 0.81-0.94. Missing items rate was 1.0%. Most SS-QOL-DK domains showed moderately shared variance with similar domains of other test methods, r2 = 0.03-0.62. Groups with known differences showed statistically significant difference in scores. Item-to-scale correlation coefficients of 0.37-0.88 supported convergent validity. SS-QOL-DK is a reliable and valid instrument for measuring self-reported health-related quality of life on group level among people with mild to moderate stroke.
Reliability and validity of migraine disability assessment questionnaire-Thai version (Thai-MIDAS).

Science.gov (United States)

Seethong, Piman; Nimmannit, Akarin; Chaisewikul, Rungsan; Prayoonwiwat, Naraporn; Chotinaiwattarakul, Wattanachai

2013-02-01

To assess the validity and test-retest reliability of a Thai translation of the Migraine Disability Assessment (MIDAS) Questionnaire in Thai patients with migraine. Migraineurs from the Headache Clinic in Siriraj Hospital were recruited and asked to complete a 13-weeks diary and answered the Thai-MIDAS at once. Some participants were asked to provide the 2nd Thai-MIDAS in the next 2 weeks for test-retest reliability. Ninety-three patients had completed the 13-weeks diaries. Age range was 18-58 years with mean 37.69 +/- 9.60 years. All 5 items and the total score of Thai-MIDAS were moderately correlated with data from 13-weeks diary (Spearman's correlation coefficient = 0.32-0.62). The test-retest reliability of the total score of Thai-MIDAS in 30 patients demonstrated a highly reliable degree of intraclass correlation (ICC = 0.76, 95% CI 0.49-0.88). The present study reveals that the Thai-MIDAS has satisfactory validity and reliability in comparison with the original English MIDAS version.
Test-retest reliability of barbell velocity during the free-weight bench-press exercise.

Science.gov (United States)

Stock, Matt S; Beck, Travis W; DeFreitas, Jason M; Dillon, Michael A

2011-01-01

The purpose of this study was to calculate test-retest reliability statistics for peak barbell velocity during the free-weight bench-press exercise for loads corresponding to 10-90% of the 1-repetition maximum (1RM). Twenty-one healthy, resistance-trained men (mean ± SD age = 23.5 ± 2.7 years; body mass = 90.5 ± 14.6 kg; 1RM bench press = 125.4 ± 18.4 kg) volunteered for this study. A minimum of 48 hours after a maximal strength testing and familiarization session, the subjects performed single repetitions of the free-weight bench-press exercise at each tenth percentile (10-90%) of the 1RM on 2 separate occasions. For each repetition, the subjects were instructed to press the barbell as rapidly as possible, and peak barbell velocity was measured with a Tendo Weightlifting Analyzer. The test-retest intraclass correlation coefficients (model 2,1) and corresponding standard errors of measurement (expressed as percentages of the mean barbell velocity values) were 0.717 (4.2%), 0.572 (5.0%), 0.805 (3.1%), 0.669 (4.7%), 0.790 (4.6%), 0.785 (4.8%), 0.811 (5.8%), 0.714 (10.3%), and 0.594 (12.6%) for the weights corresponding to 10-90% 1RM. There were no mean differences between the barbell velocity values from trials 1 and 2. These results indicated moderate to high test-retest reliability for barbell velocity from 10 to 70% 1RM but decreased consistency at 80 and 90% 1RM. When examining barbell velocity during the free-weight bench-press exercise, greater measurement error must be overcome at 80 and 90% 1RM to be confident that an observed change is meaningful.
The validity and reliability of the Functional Strength Measurement (FSM) in children with intellectual disabilities.

Science.gov (United States)

Aertssen, W F M; Steenbergen, B; Smits-Engelsman, B C M

2018-06-07

There is lack of valid and reliable field-based tests for assessing functional strength in young children with mild intellectual disabilities (IDs). The aim of this study was to investigate the test-retest reliability and construct validity of the Functional Strength Measurement in children with ID (FSM-ID). Fifty-two children with mild ID (40 boys and 12 girls, mean age 8.48 years, SD = 1.48) were tested with the FSM. Test-retest reliability (n = 32) was examined by a two-way interclass correlation coefficient for agreement (ICC 2.1A). Standard error of measurement and smallest detectable change were calculated. Construct validity was determined by calculating correlations between the FSM-ID and handheld dynamometry (HHD) (convergent validity), FSM-ID, FSM-ID and subtest strength of the Bruininks-Oseretsky test of motor proficiency - second edition (BOT-2) (convergent validity) and the FSM-ID and balance subtest of the BOT-2 (discriminant validity). Test-retest reliability ICC ranged 0.89-0.98. Correlation between the items of the FSM-ID and HHD ranged 0.39-0.79 and between FSM-ID and BOT-2 (strength items) 0.41-0.80. Correlation between items of the FSM-ID and BOT-2 (balance items) ranged 0.41-0.70. The FSM-ID showed good test-retest reliability and good convergent validity with the HHD and BOT-2 subtest strength. The correlations assessing discriminant validity were higher than expected. Poor levels of postural control and core stability in children with mild IDs may be the underlying factor of those higher correlations. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Reliability and Validity of the Activity Participation Assessment for School-age Children in Korea

Directory of Open Access Journals (Sweden)

Se-Yun Kim

2016-12-01

Conclusion: The APA shows good internal reliability, test–retest reliability, discriminant validity, and construct validity. However, evidence of psychometric properties was limited by a small sample size. Psychometric properties such as interrater reliability as well as concurrent validity and construct validity need to be tested using a larger sample size with representative demographics.
Test-retest reliability at the item level and total score level of the Norwegian version of the Spinal Cord Injury Falls Concern Scale (SCI-FCS).

Science.gov (United States)

Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik

2016-05-01

Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability on item-level and total-score-level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. For relative reliability, we analyzed the total-score-level test-retest reliability with intraclass correlation coefficients (ICC2.1), the standard error of measurement (SEM), and the smallest detectable change (SDC) for absolute reliability/measurement-error assessment and Cronbach's alpha for internal consistency. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we recovered an 11-13% higher chance for a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.

Validity and Reliability Study of the Korean Tinetti Mobility Test for Parkinson's Disease.

Science.gov (United States)

Park, Jinse; Koh, Seong-Beom; Kim, Hee Jin; Oh, Eungseok; Kim, Joong-Seok; Yun, Ji Young; Kwon, Do-Young; Kim, Younsoo; Kim, Ji Seon; Kwon, Kyum-Yil; Park, Jeong-Ho; Youn, Jinyoung; Jang, Wooyoung

2018-01-01

Postural instability and gait disturbance are the cardinal symptoms associated with falling among patients with Parkinson's disease (PD). The Tinetti mobility test (TMT) is a well-established measurement tool used to predict falls among elderly people. However, the TMT has not been established or widely used among PD patients in Korea. The purpose of this study was to evaluate the reliability and validity of the Korean version of the TMT for PD patients. Twenty-four patients diagnosed with PD were enrolled in this study. For the interrater reliability test, thirteen clinicians scored the TMT after watching a video clip. We also used the test-retest method to determine intrarater reliability. For concurrent validation, the unified Parkinson's disease rating scale, Hoehn and Yahr staging, Berg Balance Scale, Timed-Up and Go test, 10-m walk test, and gait analysis by three-dimensional motion capture were also used. We analyzed receiver operating characteristic curve to predict falling. The interrater reliability and intrarater reliability of the Korean Tinetti balance scale were 0.97 and 0.98, respectively. The interrater reliability and intra-rater reliability of the Korean Tinetti gait scale were 0.94 and 0.96, respectively. The Korean TMT scores were significantly correlated with the other clinical scales and three-dimensional motion capture. The cutoff values for predicting falling were 14 points (balance subscale) and 10 points (gait subscale). We found that the Korean version of the TMT showed excellent validity and reliability for gait and balance and had high sensitivity and specificity for predicting falls among patients with PD.
Test-retest reliability of the Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale.

Science.gov (United States)

Gustafsson, Margareta; Blomberg, Karin; Holmefur, Marie

2015-07-01

The Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale evaluates the student nurses' perception of the learning environment and supervision within the clinical placement. It has never been tested in a replication study. The aim of the present study was to evaluate the test-retest reliability of the CLES + T scale. The CLES + T scale was administered twice to a group of 42 student nurses, with a one-week interval. Test-retest reliability was determined by calculations of Intraclass Correlation Coefficients (ICCs) and weighted Kappa coefficients. Standard Error of Measurements (SEM) and Smallest Detectable Difference (SDD) determined the precision of individual scores. Bland-Altman plots were created for analyses of systematic differences between the test occasions. The results of the study showed that the stability over time was good to excellent (ICC 0.88-0.96) in the sub-dimensions "Supervisory relationship", "Pedagogical atmosphere on the ward" and "Role of the nurse teacher". Measurements of "Premises of nursing on the ward" and "Leadership style of the manager" had lower but still acceptable stability (ICC 0.70-0.75). No systematic differences occurred between the test occasions. This study supports the usefulness of the CLES + T scale as a reliable measure of the student nurses' perception of the learning environment within the clinical placement at a hospital. Copyright © 2015 Elsevier Ltd. All rights reserved.
Investigating univariate temporal patterns for intrinsic connectivity networks based on complexity and low-frequency oscillation: a test-retest reliability study.

Science.gov (United States)

Wang, X; Jiao, Y; Tang, T; Wang, H; Lu, Z

2013-12-19

Intrinsic connectivity networks (ICNs) are composed of spatial components and time courses. The spatial components of ICNs were discovered with moderate-to-high reliability. So far as we know, few studies focused on the reliability of the temporal patterns for ICNs based their individual time courses. The goals of this study were twofold: to investigate the test-retest reliability of temporal patterns for ICNs, and to analyze these informative univariate metrics. Additionally, a correlation analysis was performed to enhance interpretability. Our study included three datasets: (a) short- and long-term scans, (b) multi-band echo-planar imaging (mEPI), and (c) eyes open or closed. Using dual regression, we obtained the time courses of ICNs for each subject. To produce temporal patterns for ICNs, we applied two categories of univariate metrics: network-wise complexity and network-wise low-frequency oscillation. Furthermore, we validated the test-retest reliability for each metric. The network-wise temporal patterns for most ICNs (especially for default mode network, DMN) exhibited moderate-to-high reliability and reproducibility under different scan conditions. Network-wise complexity for DMN exhibited fair reliability (ICC<0.5) based on eyes-closed sessions. Specially, our results supported that mEPI could be a useful method with high reliability and reproducibility. In addition, these temporal patterns were with physiological meanings, and certain temporal patterns were correlated to the node strength of the corresponding ICN. Overall, network-wise temporal patterns of ICNs were reliable and informative and could be complementary to spatial patterns of ICNs for further study. Copyright © 2013 IBRO. Published by Elsevier Ltd. All rights reserved.
Test-retest reliability of Brazilian version of Memorial Symptom Assessment Scale for assessing symptoms in cancer patients.

Science.gov (United States)

Menezes, Josiane Roberta de; Luvisaro, Bianca Maria Oliveira; Rodrigues, Claudia Fernandes; Muzi, Camila Drumond; Guimarães, Raphael Mendonça

2017-01-01

To assess the test-retest reliability of the Memorial Symptom Assessment Scale translated and culturally adapted into Brazilian Portuguese. The scale was applied in an interview format for 190 patients with various cancers type hospitalized in clinical and surgical sectors of the Instituto Nacional de Câncer José de Alencar Gomes da Silva and reapplied in 58 patients. Data from the test-retest were double typed into a Microsoft Excel spreadsheet and analyzed by the weighted Kappa. The reliability of the scale was satisfactory in test-retest. The weighted Kappa values obtained for each scale item had to be adequate, the largest item was 0.96 and the lowest was 0.69. The Kappa subscale was also evaluated and values were 0.84 for high frequency physic symptoms, 0.81 for low frequency physical symptoms, 0.81 for psychological symptoms, and 0.78 for Global Distress Index. High level of reliability estimated suggests that the process of measurement of Memorial Symptom Assessment Scale aspects was adequate. Avaliar a confiabilidade teste-reteste da versão traduzida e adaptada culturalmente para o português do Brasil do Memorial Symptom Assessment Scale. A escala foi aplicada em forma de entrevista em 190 pacientes com diversos tipos de câncer internados nos setores clínicos e cirúrgicos do Instituto Nacional de Câncer José de Alencar Gomes da Silva e reaplicada em 58 pacientes. Os dados dos testes-retestes foram inseridos num banco de dados por dupla digitação independente em Excel e analisados pelo Kappa ponderado. A confiabilidade da escala mostrou-se satisfatória nos testes-retestes. Os valores do Kappa ponderado obtidos para cada item da escala apresentaram-se adequados, sendo o maior item de 0,96 e o menor de 0,69. Também se avaliou o Kappa das subescalas, sendo de 0,84 para sintomas físicos de alta frequência, de 0,81 para sintomas físicos de baixa frequência, de 0,81 também para sintomas psicológicos, e de 0,78 para Índice Geral de Sofrimento
Laterality judgments in people with low back pain--A cross-sectional observational and test-retest reliability study.

Science.gov (United States)

Linder, Martin; Michaelson, Peter; Röijezon, Ulrik

2016-02-01

Disruption of cortical representation, or body schema, has been indicated as a factor in the persistence and recurrence of low back pain (LBP). This has been observed through impaired laterality judgment ability and it has been suggested that this ability is affected in a spatial rather than anatomical manner. We compared laterality judgment performance of foot and trunk movements between people with LBP with or without leg pain and healthy controls, and investigated associations between test performance and pain. We also assessed the test-retest reliability of the Recognise Online™ software when used in a clinical and a home setting. Cross-sectional observational and test-retest study. Thirty individuals with LBP and 30 healthy controls performed judgment tests of foot and trunk laterality once supervised in a clinic and twice at home. No statistically significant group differences were found. LBP intensity was negatively related to trunk laterality accuracy (p = 0.019). Intraclass correlation values ranged from 0.51 to 0.91. Reaction time improved significantly between test occasions while accuracy did not. Laterality judgments were not impaired in subjects with LBP compared to controls. Further research may clarify the relationship between pain mechanisms in LBP and laterality judgment ability. Reliability values were mostly acceptable, with wide and low confidence intervals, suggesting test-retest reliability for Recognise Online™ could be questioned in this trial. A significant learning effect was observed which should be considered in clinical and research application of the test. Copyright © 2015 Elsevier Ltd. All rights reserved.
Reliability and validity of a nutrition and physical activity environmental self-assessment for child care

Directory of Open Access Journals (Sweden)

Ammerman Alice S

2007-07-01

Full Text Available Abstract Background Few assessment instruments have examined the nutrition and physical activity environments in child care, and none are self-administered. Given the emerging focus on child care settings as a target for intervention, a valid and reliable measure of the nutrition and physical activity environment is needed. Methods To measure inter-rater reliability, 59 child care center directors and 109 staff completed the self-assessment concurrently, but independently. Three weeks later, a repeat self-assessment was completed by a sub-sample of 38 directors to assess test-retest reliability. To assess criterion validity, a researcher-administered environmental assessment was conducted at 69 centers and was compared to a self-assessment completed by the director. A weighted kappa test statistic and percent agreement were calculated to assess agreement for each question on the self-assessment. Results For inter-rater reliability, kappa statistics ranged from 0.20 to 1.00 across all questions. Test-retest reliability of the self-assessment yielded kappa statistics that ranged from 0.07 to 1.00. The inter-quartile kappa statistic ranges for inter-rater and test-retest reliability were 0.45 to 0.63 and 0.27 to 0.45, respectively. When percent agreement was calculated, questions ranged from 52.6% to 100% for inter-rater reliability and 34.3% to 100% for test-retest reliability. Kappa statistics for validity ranged from -0.01 to 0.79, with an inter-quartile range of 0.08 to 0.34. Percent agreement for validity ranged from 12.9% to 93.7%. Conclusion This study provides estimates of criterion validity, inter-rater reliability and test-retest reliability for an environmental nutrition and physical activity self-assessment instrument for child care. Results indicate that the self-assessment is a stable and reasonably accurate instrument for use with child care interventions. We therefore recommend the Nutrition and Physical Activity Self-Assessment for
Test-retest Agreement and Reliability of Quantitative Sensory Testing 1 Year After Breast Cancer Surgery

DEFF Research Database (Denmark)

Andersen, Kenneth Geving; Kehlet, Henrik; Aasvang, Eske Kvanner

2015-01-01

.5 SD) than within-patient variation (0.23 to 3.55 SD). There were no significant differences between pain and pain-free patients. The individual test-retest variability was higher on the operated side compared with the nonoperated side. DISCUSSION: The QST protocol reliability allows for group......OBJECTIVES: Quantitative sensory testing (QST) is used to assess sensory dysfunction and nerve damage by examining psychophysical responses to controlled, graded stimuli such as mechanical and thermal detection and pain thresholds. In the breast cancer population, 4 studies have used QST to examine...... persistent pain after breast cancer treatment, suggesting neuropathic pain being a prominent pain mechanism. However, the agreement and reliability of QST has not been described in the postsurgical breast cancer population, hindering exact interpretation of QST studies in this population. The aim...
Reliability and Validity of Ten Consumer Activity Trackers Depend on Walking Speed.

Science.gov (United States)

Fokkema, Tryntsje; Kooiman, Thea J M; Krijnen, Wim P; VAN DER Schans, Cees P; DE Groot, Martijn

2017-04-01

To examine the test-retest reliability and validity of ten activity trackers for step counting at three different walking speeds. Thirty-one healthy participants walked twice on a treadmill for 30 min while wearing 10 activity trackers (Polar Loop, Garmin Vivosmart, Fitbit Charge HR, Apple Watch Sport, Pebble Smartwatch, Samsung Gear S, Misfit Flash, Jawbone Up Move, Flyfit, and Moves). Participants walked three walking speeds for 10 min each; slow (3.2 km·h), average (4.8 km·h), and vigorous (6.4 km·h). To measure test-retest reliability, intraclass correlations (ICC) were determined between the first and second treadmill test. Validity was determined by comparing the trackers with the gold standard (hand counting), using mean differences, mean absolute percentage errors, and ICC. Statistical differences were calculated by paired-sample t tests, Wilcoxon signed-rank tests, and by constructing Bland-Altman plots. Test-retest reliability varied with ICC ranging from -0.02 to 0.97. Validity varied between trackers and different walking speeds with mean differences between the gold standard and activity trackers ranging from 0.0 to 26.4%. Most trackers showed relatively low ICC and broad limits of agreement of the Bland-Altman plots at the different speeds. For the slow walking speed, the Garmin Vivosmart and Fitbit Charge HR showed the most accurate results. The Garmin Vivosmart and Apple Watch Sport demonstrated the best accuracy at an average walking speed. For vigorous walking, the Apple Watch Sport, Pebble Smartwatch, and Samsung Gear S exhibited the most accurate results. Test-retest reliability and validity of activity trackers depends on walking speed. In general, consumer activity trackers perform better at an average and vigorous walking speed than at a slower walking speed.
Test-retest reliability and four-week changes in cardiopulmonary fitness in stroke patients: evaluation using a robotics-assisted tilt table.

Science.gov (United States)

Saengsuwan, Jittima; Berger, Lucia; Schuster-Amft, Corina; Nef, Tobias; Hunt, Kenneth J

2016-09-06

Exercise testing devices for evaluating cardiopulmonary fitness in patients with severe disability after stroke are lacking, but we have adapted a robotics-assisted tilt table (RATT) for cardiopulmonary exercise testing (CPET). Using the RATT in a sample of patients after stroke, this study aimed to investigate test-retest reliability and repeatability of CPET and to prospectively investigate changes in cardiopulmonary outcomes over a period of four weeks. Stroke patients with all degrees of disability underwent 3 separate CPET sessions: 2 tests at baseline (TB1 and TB2) and 1 test at follow up (TF). TB1 and TB2 were at least 24 h apart. TB2 and TF were 4 weeks apart. A RATT equipped with force sensors in the thigh cuffs, a work rate estimation algorithm and a real-time visual feedback system was used to guide the patients' exercise work rate during CPET. Test-retest reliability and repeatability of CPET variables were analysed using paired t-tests, the intraclass correlation coefficient (ICC), the coefficient of variation (CoV), and Bland and Altman limits of agreement. Changes in cardiopulmonary fitness during four weeks were analysed using paired t-tests. Seventeen sub-acute and chronic stroke patients (age 62.7 ± 10.4 years [mean ± SD]; 8 females) completed the test sessions. The median time post stroke was 350 days. There were 4 severely disabled, 1 moderately disabled and 12 mildly disabled patients. For test-retest, there were no statistically significant differences between TB1 and TB2 for most CPET variables. Peak oxygen uptake, peak heart rate, peak work rate and oxygen uptake at the ventilatory anaerobic threshold (VAT) and respiratory compensation point (RCP) showed good to excellent test-retest reliability (ICC 0.65-0.94). For all CPET variables, CoV was 4.1-14.5 %. The mean difference was close to zero in most of the CPET variables. There were no significant changes in most cardiopulmonary performance parameters during the 4-week period
A Structured Clinical Interview for Kleptomania (SCI-K): preliminary validity and reliability testing.

Science.gov (United States)

Grant, Jon E; Kim, Suck Won; McCabe, James S

2006-06-01

Kleptomania presents difficulties in diagnosis for clinicians. This study aimed to develop and test a DSM-IV-based diagnostic instrument for kleptomania. To assess for current kleptomania the Structured Clinical Interview for Kleptomania (SCI-K) was administered to 112 consecutive subjects requesting psychiatric outpatient treatment for a variety of disorders. Reliability and validity were determined. Classification accuracy was examined using the longitudinal course of illness. The SCI-K demonstrated excellent test-retest (Phi coefficient = 0.956 (95% CI = 0.937, 0.970)) and inter-rater reliability (phi coefficient = 0.718 (95% CI = 0.506, 0.848)) in the diagnosis of kleptomania. Concurrent validity was observed with a self-report measure using DSM-IV kleptomania criteria (phi coefficient = 0.769 (95% CI = 0.653, 0.850)). Discriminant validity was observed with a measure of depression (point biserial coefficient = -0.020 (95% CI = -0.205, 0.166)). The SCI-K demonstrated both high sensitivity and specificity based on longitudinal assessment. The SCI-K demonstrated excellent reliability and validity in diagnosing kleptomania in subjects presenting with various psychiatric problems. These findings require replication in larger groups, including non-psychiatric populations, to examine their generalizability. Copyright (c) 2006 John Wiley & Sons, Ltd.
Validity and Reliability of Agoraphobic Cognitions Questionnaire-Turkish Version

Directory of Open Access Journals (Sweden)

Ayşegül KART

2013-11-01

Full Text Available Validity and Reliability of Agoraphobic Cognitions Questionnaire-Turkish Version Objective: The aim of this study is to investigate the validity and reliability of Agoraphobic Cognitions Questionnaire -Turkish Version (ACQ. Method: ACQ was administered to 92 patients with agoraphobia or panic disorder with agoraphobia. BSQ Turkish version completed by translation, back-translation and pilot assessment. Reliability of ACQ was analyzed by test-retest correlation, split-half technique, Cronbach’s alpha coefficient. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Bartlett test had been performed. Principal component analysis and varimax rotation used for factor analysis. Results: 64% of patients evaluated in the study were female and 36% were male. Age interval was between 18 and 58, mean age was 31.5±10.4. The Cronbach’s alpha coefficient was 0.91. Analysis of test-retest evaluations revealed that there were statistically significant correlations ranging between 24% and 84% concerning questionnaire components. In analysis performed by split-half method reliability coefficients of half questionnaires were found as 0.77 and 0.91. Again Spearmen-Brown coefficient was found as 0.87 by the same analysis. To assess construct validity of ACQ, factor analysis was performed and two basic factors found. These two factors explained 57.6% of the total variance. (Factor 1: 34.6%, Factor 2: 23% Conclusion: Our findings support that ACQ-Turkish version had a satisfactory level of reliability and validity
Validity and reliability of the Bahasa Melayu version of the Migraine Disability Assessment questionnaire.

Science.gov (United States)

Shaik, Munvar Miya; Hassan, Norul Badriah; Tan, Huay Lin; Bhaskar, Shalini; Gan, Siew Hua

2014-01-01

The study was designed to determine the validity and reliability of the Bahasa Melayu version (MIDAS-M) of the Migraine Disability Assessment (MIDAS) questionnaire. Patients having migraine for more than six months attending the Neurology Clinic, Hospital Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia, were recruited. Standard forward and back translation procedures were used to translate and adapt the MIDAS questionnaire to produce the Bahasa Melayu version. The translated Malay version was tested for face and content validity. Validity and reliability testing were further conducted with 100 migraine patients (1st administration) followed by a retesting session 21 days later (2nd administration). A total of 100 patients between 15 and 60 years of age were recruited. The majority of the patients were single (66%) and students (46%). Cronbach's alpha values were 0.84 (1st administration) and 0.80 (2nd administration). The test-retest reliability for the total MIDAS score was 0.73, indicating that the MIDAS-M questionnaire is stable; for the five disability questions, the test-retest values ranged from 0.77 to 0.87. The MIDAS-M questionnaire is comparable with the original English version in terms of validity and reliability and may be used for the assessment of migraine in clinical settings.
Calf-raise senior: a new test for assessment of plantar flexor muscle strength in older adults: protocol, validity, and reliability.

Science.gov (United States)

André, Helô-Isa; Carnide, Filomena; Borja, Edgar; Ramalho, Fátima; Santos-Rocha, Rita; Veloso, António P

2016-01-01

This study aimed to develop a new field test protocol with a standardized measurement of strength and power in plantar flexor muscles targeted to functionally independent older adults, the calf-raise senior (CRS) test, and also evaluate its reliability and validity. Forty-one subjects aged 65 years and older of both sexes participated in five different cross-sectional studies: 1) pilot (n=12); 2) inter- and intrarater agreement (n=12); 3) construct (n=41); 4) criterion validity (n=33); and 5) test-retest reliability (n=41). Different motion parameters were compared in order to define a specifically designed protocol for seniors. Two raters evaluated each participant twice, and the results of the same individual were compared between raters and participants to assess the interrater and intrarater agreement. The validity and reliability studies involved three testing sessions that lasted 2 weeks, including a battery of functional fitness tests, CRS test in two occasions, accelerometry, and strength assessments in an isokinetic dynamometer. The CRS test presented an excellent test-retest reliability (intraclass correlation coefficient [ICC] =0.90, standard error of measurement =2.0) and interrater reliability (ICC =0.93-0.96), as well as a good intrarater agreement (ICC =0.79-0.84). Participants with better results in the CRS test were younger and presented higher levels of physical activity and functional fitness. A significant association between test results and all strength parameters (isometric, r =0.87, r 2 =0.75; isokinetic, r =0.86, r 2 =0.74; and rate of force development, r =0.77, r 2 =0.59) was shown. This study was successful in demonstrating that the CRS test can meet the scientific criteria of validity and reliability. The test can be a good indicator of ankle strength in older adults and proved to discriminate significantly between individuals with improved functionality and levels of physical activity.
TEST-RETEST RELIABILITY OF THE CLOSED KINETIC CHAIN UPPER EXTREMITY STABILITY TEST (CKCUEST) IN ADOLESCENTS: RELIABILITY OF CKCUEST IN ADOLESCENTS.

Science.gov (United States)

de Oliveira, Valéria M A; Pitangui, Ana C R; Nascimento, Vinícius Y S; da Silva, Hítalo A; Dos Passos, Muana H P; de Araújo, Rodrigo C

2017-02-01

The Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) has been proposed as an option to assess upper limb function and stability; however, there are few studies that support the use of this test in adolescents. The purpose of the present study was to investigate the intersession reliability and agreement of three CKCUEST scores in adolescents and establish clinimetric values for this test. Test-retest reliability. Twenty-five healthy adolescents of both sexes were evaluated. The subjects performed two CKCUEST with an interval of one week between the tests. An intraclass correlation coefficient (ICC 3,3 ) two-way mixed model with a 95% interval of confidence was utilized to determine intersession reliability. A Bland-Altman graph was plotted to analyze the agreement between assessments. The presence of systematic error was evaluated by a one-sample t test. The difference between the evaluation and reevaluation was observed using a paired-sample t test. The level of significance was set at 0.05. Standard error of measurements and minimum detectable changes were calculated. The intersession reliability of the average touches score, normalized score, and power score were 0.68, 0.68 and 0.87, the standard error of measurement were 2.17, 1.35 and 6.49, and the minimal detectable change was 6.01, 3.74 and 17.98, respectively. The presence of systematic error (p test with moderate to excellent reliability when used with adolescents. The CKCUEST is a measurement with moderate to excellent reliability for adolescents. 2b.
Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

Science.gov (United States)

Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

2018-03-02

There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.
Reliability of the Swedish version of the Exercise Self-Efficacy Scale (S-ESES): a test-retest study in adults with neurological disease.

Science.gov (United States)

Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth

2015-03-01

To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including: Parkinson's disease; Multiple Sclerosis; Cervical Dystonia; and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. Completion interval mean was 16 days apart. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and a very high-reliability ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low-absolute reliability, however, indicates a limited ability to measure changes on an individual level.
Validity and Reliability of Curl-Up Test on Assessing the Core Endurance for Kindergarten Children in Hong Kong

OpenAIRE

Lai, CY; Lee, KY; Lams, MHS; Wu, CF; Peake, R; Flint, SW; Li, WHC; Ho, E

2017-01-01

Objective: The purpose of this study was to examine the test-retest reliability and the criterion validity of a curlup\\ud test (CUT) as a measure of core stability, core endurance and dynamic stability in kindergarten children. CUT\\ud performance was also compared to half hold lying test (HHLT) and walking time on course (WTC) among without\\ud obstacle, with low obstacle and high obstacle measures of core stability, core endurance and dynamic stability.\\ud Methods: To estimate reliability, 33...
Test-retest reliability and comparability of paper and computer questionnaires for the Finnish version of the Tampa Scale of Kinesiophobia.

Science.gov (United States)

Koho, P; Aho, S; Kautiainen, H; Pohjolainen, T; Hurri, H

2014-12-01

To estimate the internal consistency, test-retest reliability and comparability of paper and computer versions of the Finnish version of the Tampa Scale of Kinesiophobia (TSK-FIN) among patients with chronic pain. In addition, patients' personal experiences of completing both versions of the TSK-FIN and preferences between these two methods of data collection were studied. Test-retest reliability study. Paper and computer versions of the TSK-FIN were completed twice on two consecutive days. The sample comprised 94 consecutive patients with chronic musculoskeletal pain participating in a pain management or individual rehabilitation programme. The group rehabilitation design consisted of physical and functional exercises, evaluation of the social situation, psychological assessment of pain-related stress factors, and personal pain management training in order to regain overall function and mitigate the inconvenience of pain and fear-avoidance behaviour. The mean TSK-FIN score was 37.1 [standard deviation (SD) 8.1] for the computer version and 35.3 (SD 7.9) for the paper version. The mean difference between the two versions was 1.9 (95% confidence interval 0.8 to 2.9). Test-retest reliability was 0.89 for the paper version and 0.88 for the computer version. Internal consistency was considered to be good for both versions. The intraclass correlation coefficient for comparability was 0.77 (95% confidence interval 0.66 to 0.85), indicating substantial reliability between the two methods. Both versions of the TSK-FIN demonstrated substantial intertest reliability, good test-retest reliability, good internal consistency and acceptable limits of agreement, suggesting their suitability for clinical use. However, subjects tended to score higher when using the computer version. As such, in an ideal situation, data should be collected in a similar manner throughout the course of rehabilitation or clinical research. Copyright © 2014 Chartered Society of Physiotherapy. Published
Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease.

Science.gov (United States)

Strouwen, Carolien; Molenaar, Esther A L M; Keus, Samyra H J; Münks, Liesbeth; Bloem, Bastiaan R; Nieuwboer, Alice

2016-08-01

Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains largely unknown. The purpose of this study was to assess the reliability of DT outcome measures in patients with PD. A repeated-measures design was used. Patients with PD ("on" medication, Mini-Mental State Examination score ≥24) performed 2 cognitive tasks (ie, backward digit span task and auditory Stroop task) and 1 functional task (ie, mobile phone task) in combination with walking. Tasks were assessed at 2 time points (same hour) with an interval of 6 weeks. Test-retest reliability was assessed for gait while performing each secondary task (DT gait) for both cognitive tasks while walking (DT cognitive) and for the functional task while walking (DT functional). Sixty-two patients with PD (age=39-89 years, Hoehn and Yahr stages II-III) were included in the study. Intraclass correlation coefficients (ICCs) showed excellent reliability for DT gait measures, ranging between .86 and .95 when combined with the digit span task, between .86 and .95 when combined with the auditory Stroop task, and between .72 and .90 when combined with the mobile phone task. The standard error of measurements for DT gait speed varied between 0.06 and 0.08 m/s, leading to minimal detectable changes between 0.16 and 0.22 m/s. With regard to DT cognitive measures, reaction times showed good-to-excellent reliability (digit span task: ICC=.75; auditory Stroop task: ICC=.82). The results cannot be generalized to patients with advanced disease or to other DT measures. In people with PD, DT measures proved to be reliable for use in clinical studies and look promising for use in clinical practice to assess improvements after DT training. Large effects, however, are needed to obtain meaningful effect sizes. �
Reliability and convergent validity of the five-step test in people with chronic stroke.

Science.gov (United States)

Ng, Shamay S M; Tse, Mimi M Y; Tam, Eric W C; Lai, Cynthia Y Y

2018-01-10

(i) To estimate the intra-rater, inter-rater and test-retest reliabilities of the Five-Step Test (FST), as well as the minimum detectable change in FST completion times in people with stroke. (ii) To estimate the convergent validity of the FST with other measures of stroke-specific impairments. (iii) To identify the best cut-off times for distinguishing FST performance in people with stroke from that of healthy older adults. A cross-sectional study. University-based rehabilitation centre. Forty-eight people with stroke and 39 healthy controls. None. The FST, along with (for the stroke survivors only) scores on the Fugl-Meyer Lower Extremity Assessment (FMA-LE), the Berg Balance Scale (BBS), Limits of Stability (LOS) tests, and Activities-specific Balance Confidence (ABC) scale were tested. The FST showed excellent intra-rater (intra-class correlation coefficient; ICC = 0.866-0.905), inter-rater (ICC = 0.998), and test-retest (ICC = 0.838-0.842) reliabilities. A minimum detectable change of 9.16 s was found for the FST in people with stroke. The FST correlated significantly with the FMA-LE, BBS, and LOS results in the forward and sideways directions (r = -0.411 to -0.716, p people with stroke and healthy older adults. The FST is a reliable, easy-to-administer clinical test for assessing stroke survivors' ability to negotiate steps and stairs.

Reliability and validity of the korean version of the connor-davidson resilience scale.

Science.gov (United States)

Baek, Hyun-Sook; Lee, Kyoung-Uk; Joo, Eun-Jeong; Lee, Mi-Young; Choi, Kyeong-Sook

2010-06-01

The Connor-Davidson Resilience Scale (CD-RISC) measures various aspects of psychological resilience in patients with posttraumatic stress disorder (PTSD) and other psychiatric ailments. This study sought to assess the reliability and validity of the Korean version of the Connor-Davidson Resilience Scale (K-CD-RISC). In total, 576 participants were enrolled (497 females and 79 males), including hospital nurses, university students, and firefighters. Subjects were evaluated using the K-CD-RISC, the Beck Depression Inventory (BDI), the Impact of Event Scale-Revised (IES-R), the Rosenberg Self-Esteem Scale (RSES), and the Perceived Stress Scale (PSS). Test-retest reliability and internal consistency were examined as a measure of reliability, and convergent validity and factor analysis were also performed to evaluate validity. Cronbach's alpha coefficient and test-retest reliability were 0.93 and 0.93, respectively. The total score on the K-CD-RISC was positively correlated with the RSES (r=0.56, preliability and validity for measurement of resilience among Korean subjects.
Validity and Reliability of the Clock Drawing Test in Older People

Directory of Open Access Journals (Sweden)

Massoumeh Sadeghipour Roodsari

2013-07-01

Full Text Available Objectives: Early diagnosis of cognitive disorders in order to initiate new efficient treatments in time is an important task which cannot be fulfilled without proper cognitive screening tools. The Clock Drawing Test (CDT is a simple inexpensive cognitive screening tool which can be used in primary care settings delivering health services to older people. The aim of this study was to assess validity and reliability of the CDT in Iranian older population. Methods & Materials: In this study the CDT and Mini Mental State Examination (MMSE were concurrently performed on 74 literate participants aged 60 and over. Participants were recruited from the clients of Iran Alzheimer’s Association (dementia patients and non-demented clients, including other patients or care givers during a 5 month period. The CDT was performed by two trained raters using Shulman’s six points scoring method. Using SPSS version 20, reliability was assessed measuring kappa statistics as well as ICC. Concurrent validity between CDT and MMSE were statistically analyzed by spearman’s rank correlation coefficient. Results: Mean age of the participants was 72 years in a range of 60 to 90 years with equal numbers 0f male and female participants. Kappa statistics for test retest reliability was 0.554 (P<0.001. ICC for inter rater reliability was 0.964 (P<0.001. Spearman’s rank correlation coefficient for MMSE and CDT scores was 0.782, statistically significant at P<0.001. Conclusion: CDT is a valid and reliable test in literate older people that can be used as a cognitive screening tool in Iranian older population.
Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

Science.gov (United States)

Sanders, James L; Williams, Robert J

2016-01-01

Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Environmental education curriculum evaluation questionnaire: A reliability and validity study

Science.gov (United States)

Minner, Daphne Diane

The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating
Reliability, Validity, and Sensitivity of a Novel Smartphone-Based Eccentric Hamstring Strength Test in Professional Football Players.

Science.gov (United States)

Lee, Justin W Y; Cai, Ming-Jing; Yung, Patrick S H; Chan, Kai-Ming

2018-05-01

To evaluate the test-retest reliability, sensitivity, and concurrent validity of a smartphone-based method for assessing eccentric hamstring strength among male professional football players. A total of 25 healthy male professional football players performed the Chinese University of Hong Kong (CUHK) Nordic break-point test, hamstring fatigue protocol, and isokinetic hamstring strength test. The CUHK Nordic break-point test is based on a Nordic hamstring exercise. The Nordic break-point angle was defined as the maximum point where the participant could no longer support the weight of his body against gravity. The criterion for the sensitivity test was the presprinting and postsprinting difference of the Nordic break-point angle with a hamstring fatigue protocol. The hamstring fatigue protocol consists of 12 repetitions of the 30-m sprint with 30-s recoveries between sprints. Hamstring peak torque of the isokinetic hamstring strength test was used as the criterion for validity. A high test-retest reliability (intraclass correlation coefficient = .94; 95% confidence interval, .82-.98) was found in the Nordic break-point angle measurements. The Nordic break-point angle significantly correlated with isokinetic hamstring peak torques at eccentric action of 30°/s (r = .88, r 2 = .77, P hamstring strength measures among male professional football players.
Discomfort Intolerance Scale: A Study of Reliability and Validity

Directory of Open Access Journals (Sweden)

Kadir ÖZDEL

2012-03-01

Full Text Available Objective: Discomfort Intolerance Scale was developed by Norman B. Schmidt et al. to assess the individual differences of capacity to withstand physical perturbations or uncomfortable bodily states (2006. The aim of this study is to investigate the validity and reliability of Discomfort Intolerance Scale-Turkish Version (RDÖ. Method: From two different universities, total of 225 students (male=167, female=58 were participated in this study. In order to determine the criterion validity, Beck Anxiety Inventory (BAI and State-Trait Anxiety Inventory (STAI were used. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability the scale was re-applied to 54 participants 6 weeks later. Results: To assess construct validity of DIS, factor analyses were performed using varimax principal components analysis with varimax rotation. The factor analysis resulted in two factors named “discomfort (in tolerance” and “discomfort avoidance”. The Cronbach’s alpha coefficient for the entire scale, discomfort-(intolerance subscale, discomfortavoidance subscale were, .592, .670, .600 respectively. Correlations between two factors of DIS, discomfort intolerance and discomfort avoidance, and Trait Anxiety Inventory of STAI (State-Trait Anxiety Inventory were statistically significant at the level of 0.05. Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DIS had a satisfactory level of reliability and validity in Turkish university students.
Test-retest reliability of the Middlesex Assessment of Mental State (MEAMS): a preliminary investigation in people with probable dementia.

Science.gov (United States)

Powell, T; Brooker, D J; Papadopolous, A

1993-05-01

Relative and absolute test-retest reliability of the MEAMS was examined in 12 subjects with probable dementia and 12 matched controls. Relative reliability was good. Measures of absolute reliability showed scores changing by up to 3 points over an interval of a week. A version effect was found to be in evidence.
Test-retest reliability of an interactive voice response (IVR) version of the EORTC QLQ-C30

NARCIS (Netherlands)

Lundy, J.J.; Coons, S.J.; Aaronson, N.K.

2015-01-01

Objective: The objective of this study was to assess the test-retest reliability of an interactive voice response (IVR) version of the European Organisation for Research and Treatment of Cancer (EORTC) QLQ-C30. Methods: A convenience sample of outpatient cancer clinic patients (n = 127) was asked to
The prone bridge test: Performance, validity, and reliability among older and younger adults.

Science.gov (United States)

Bohannon, Richard W; Steffl, Michal; Glenney, Susan S; Green, Michelle; Cashwell, Leah; Prajerova, Kveta; Bunn, Jennifer

2018-04-01

The prone bridge maneuver, or plank, has been viewed as a potential alternative to curl-ups for assessing trunk muscle performance. The purpose of this study was to assess prone bridge test performance, validity, and reliability among younger and older adults. Sixty younger (20-35 years old) and 60 older (60-79 years old) participants completed this study. Groups were evenly divided by sex. Participants completed surveys regarding physical activity and abdominal exercise participation. Height, weight, body mass index (BMI), and waist circumference were measured. On two occasions, 5-9 days apart, participants held a prone bridge until volitional exhaustion or until repeated technique failure. Validity was examined using data from the first session: convergent validity by calculating correlations between survey responses, anthropometrics, and prone bridge time, known groups validity by using an ANOVA comparing bridge times of younger and older adults and of men and women. Test-retest reliability was examined by using a paired t-test to compare prone bridge times for Session1 and Session 2. Furthermore, an intraclass correlation coefficient (ICC) was used to characterize relative reliability and minimal detectable change (MDC 95% ) was used to describe absolute reliability. The mean prone bridge time was 145.3 ± 71.5 s, and was positively correlated with physical activity participation (p ≤ 0.001) and negatively correlated with BMI and waist circumference (p ≤ 0.003). Younger participants had significantly longer plank times than older participants (p = 0.003). The ICC between testing sessions was 0.915. The prone bridge test is a valid and reliable measure for evaluating abdominal performance in both younger and older adults. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

Directory of Open Access Journals (Sweden)

Formiga MF

2018-05-01

Full Text Available Magno F Formiga,1,2 Kathryn E Roach,1 Isabel Vital,3 Gisel Urdaneta,3 Kira Balestrini,3 Rafael A Calderon-Candelario,3,4 Michael A Campos,3,4,* Lawrence P Cahalin1,* 1Department of Physical Therapy, University of Miami Miller School of Medicine, Coral Gables, FL, USA; 2CAPES Foundation, Ministry of Education of Brazil, Brasilia, Brazil; 3Pulmonary Section, Miami Veterans Administration Medical Center, Miami, FL, USA; 4Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, University of Miami Miller School of Medicine, Miami, FL, USA *These authors contributed equally to this work Purpose: The Test of Incremental Respiratory Endurance (TIRE provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP over time. The integration of MIP over inspiratory duration (ID provides the sustained maximal inspiratory pressure (SMIP. Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods: Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results: All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP
Reliability and validity of Yo-Yo tests in 9- to 16-year-old football players and matched non-sports active schoolboys

DEFF Research Database (Denmark)

Póvoas, Susana C A; Castagna, Carlo; Soares, José M C

2016-01-01

The purpose of this study was to examine the test-retest reliability and construct validity of three age-adapted Yo-Yo intermittent tests in football players aged 9-16 years (n = 70) and in age-matched non-sports active boys (n = 72). Within 7 days, each participant performed two repetitions...... performances and HRpeak are reliable for 9- to 16-year-old footballers and non-sports active boys. Additionally, performances of the three Yo-Yo tests were seemingly better for football-trained than for non-sports active boys, providing evidence of construct validity....
Validity and Reliability of the Bahasa Melayu Version of the Migraine Disability Assessment Questionnaire

Directory of Open Access Journals (Sweden)

Munvar Miya Shaik

2014-01-01

Full Text Available Background. The study was designed to determine the validity and reliability of the Bahasa Melayu version (MIDAS-M of the Migraine Disability Assessment (MIDAS questionnaire. Methods. Patients having migraine for more than six months attending the Neurology Clinic, Hospital Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia, were recruited. Standard forward and back translation procedures were used to translate and adapt the MIDAS questionnaire to produce the Bahasa Melayu version. The translated Malay version was tested for face and content validity. Validity and reliability testing were further conducted with 100 migraine patients (1st administration followed by a retesting session 21 days later (2nd administration. Results. A total of 100 patients between 15 and 60 years of age were recruited. The majority of the patients were single (66% and students (46%. Cronbach’s alpha values were 0.84 (1st administration and 0.80 (2nd administration. The test-retest reliability for the total MIDAS score was 0.73, indicating that the MIDAS-M questionnaire is stable; for the five disability questions, the test-retest values ranged from 0.77 to 0.87. Conclusion. The MIDAS-M questionnaire is comparable with the original English version in terms of validity and reliability and may be used for the assessment of migraine in clinical settings.
Reliability and concurrent validity of the Dutch hip and knee replacement expectations surveys.

Science.gov (United States)

van den Akker-Scheek, Inge; van Raay, Jos J A M; Reininga, Inge H F; Bulstra, Sjoerd K; Zijlstra, Wiebren; Stevens, Martin

2010-10-19

Preoperative expectations of outcome of total hip and knee arthroplasty are important determinants of patients' satisfaction and functional outcome. Aims of the study were (1) to translate the Hospital for Special Surgery Hip Replacement Expectations Survey and Knee Replacement Expectations Survey into Dutch and (2) to study test-retest reliability and concurrent validity. Patients scheduled for total hip (N = 112) or knee replacement (N = 101) were sent the Dutch Expectations Surveys twice with a 2 week interval to determine test-retest reliability. To determine concurrent validity, the Expectation WOMAC was sent. The results for the Dutch Hip Replacement Expectations Survey revealed good test-retest reliability (ICC 0.87), no bias and good internal consistency (alpha 0.86) (N = 72). The correlation between the Hip Expectations Score and the Expectation WOMAC score was 0.59 (N = 86). The results for the Dutch Knee Replacement Expectations Survey revealed good test-retest reliability (ICC 0.79), no bias and good internal consistency (alpha 0.91) (N = 46). The correlation with the Expectation WOMAC score was 0.52 (N = 57). Both Dutch Expectations Surveys are reliable instruments to determine patients' expectations before total hip or knee arthroplasty. As for concurrent validity, the correlation between both surveys and the Expectation WOMAC was moderate confirming that the same construct was determined. However, patients scored systematically lower on the Expectation WOMAC compared to the Dutch Expectation Surveys. Research on patients' expectations before total hip and knee replacement has only been performed in a limited amount of countries. With the Dutch Expectations Surveys it is now possible to determine patients' expectations in another culture and healthcare setting.
Test-retest reliability and predictors of unreliable reporting for a sexual behavior questionnaire for U.S. men.

Science.gov (United States)

Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R

2010-12-01

Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.
Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.

Science.gov (United States)

Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J

2012-07-01

We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.
Demonstration of the test-retest reliability and sensitivity of the Lower Limb Functional Index-10 as a measure of functional recovery post burn injury: a cross-sectional repeated measures study design.

Science.gov (United States)

Ryland, Margaret E; Grisbrook, Tiffany L; Wood, Fiona M; Phillips, Michael; Edgar, Dale W

2016-01-01

Lower limb burns can significantly delay recovery of function. Measuring lower limb functional outcomes is challenging in the unique burn patient population and necessitates the use of reliable and valid tools. The aims of this study were to examine the test-retest reliability, sensitivity, and internal consistency of Sections 1 and 3 of the Lower Limb Functional Index-10 (LLFI-10) questionnaire for measuring functional ability in patients with lower limb burns over time. Twenty-nine adult patients who had sustained a lower limb burn injury in the previous 12 months completed the test-retest procedure of the study. In addition, the minimal detectable change (MDC) was calculated for Section 1 and 3 of the LLFI-10. Section 1 is focused on the activity limitations experienced by patients with a lower limb disorder whereas Section 3 involves patients indicating their current percentage of pre-injury duties. Section 1 of the LLFI-10 demonstrated excellent test-retest reliability (intra-class correlation coefficient (ICC) 0.98, 95 % CI 0.96-0.99) whilst Section 3 demonstrated high test-retest reliability (ICC 0.88, 95 % CI 0.79-0.94). MDC scores for Sections 1 and 3 were 1.27 points and 30.22 %, respectively. Internal consistency was demonstrated with a significant negative association (r s = -0.83) between Sections 1 and 3 of the LLFI-10 (p reliable for measuring functional ability in patients who have sustained lower limb burns in the previous 12 months, and furthermore, Section 1 is sensitive to changes in patient function over time.
Rorschach e pedofilia: a fidedignidade no teste-reteste = Rorschach and pedophilia: a reliability at test-retest

Directory of Open Access Journals (Sweden)

Scortegagna, Silvana Alba

2013-01-01

Full Text Available Esse estudo buscou investigar as características de personalidade de um indivíduo pedófilo, e evidenciar a fidedignidade do Rorschach no teste-reteste. O participante, com 38 anos de idade, masculino, respondeu a entrevista e ao método de Rorschach, em duas etapas. Os principais achados revelam: a uma tendência à fragmentação na percepção de si e dos outros; b autoimagem negativa e desfavorável em relação ao corpo e suas funções; c problemas nas relações interpessoais, falhas na capacidade de empatia; d déficit no ajustamento perceptivo da realidade; e vulnerabilidade a pressões subjetivas e impulsividade. Esses resultados mantiveram-se estáveis comparando-se as duas aplicações, permitindo ampliar a compreensão dos elementos psicológicos envolvidos na pedofilia, que se mantem, e apoiam a fidedignidade do Rorschach no teste-reteste
Reliability and Validity of the Turkish Version of the Voice-Related Quality of Life Measure.

Science.gov (United States)

Tezcaner, Zahide Çiler; Aksoy, Songül

2017-03-01

This study aims to test the validity and reliability of the Turkish version of the Voice-Related Quality of Life (V-RQOL) questionnaire. This is a nonrandomized, prospective study with control group. The questionnaire was administered to 249 individuals-130 with vocal complaint and 119 without-with a mean age of 37.8 ± 12.3 years. The Turkish version of the Voice Handicap Index (VHI) and perceptual voice evaluation measures were also administered at 2-14 days for retest reliability. The instrument was submitted to validity and reliability evaluation. The V-RQOL measure showed a strong internal consistency and test-retest reliability; the Cronbach's alpha coefficient for the overall V-RQOL was 0.969, the physical functioning domain was 0.949, and the social-emotional domain was 0.940. In the test-retest reliability test, the overall V-RQOL was found to be 0.989. The construct validity of the V-RQOL was determined based on the strength and direction of its relation to the VHI and the perceptual voice evaluation measure. The higher the VHI level, the lower the physical functioning, social-emotional, and overall score levels of the V-RQOL (r = -0.927, r = -0.912, r = -0.944, respectively; P reliability and validity and may play a crucial role in evaluating Turkish-speaking patients with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reliability and validity of the McDonald Play Inventory.

Science.gov (United States)

McDonald, Ann E; Vigen, Cheryl

2012-01-01

This study examined the ability of a two-part self-report instrument, the McDonald Play Inventory, to reliably and validly measure the play activities and play styles of 7- to 11-yr-old children and to discriminate between the play of neurotypical children and children with known learning and developmental disabilities. A total of 124 children ages 7-11 recruited from a sample of convenience and a subsample of 17 parents participated in this study. Reliability estimates yielded moderate correlations for internal consistency, total test intercorrelations, and test-retest reliability. Validity estimates were established for content and construct validity. The results suggest that a self-report instrument yields reliable and valid measures of a child's perceived play performance and discriminates between the play of children with and without disabilities. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Health Service Quality Scale: Brazilian Portuguese translation, reliability and validity.

Science.gov (United States)

Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako

2013-01-17

The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.

Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

Science.gov (United States)

van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M.

2018-01-01

In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…
Assessment of lower urinary tract symptoms in women by a self-administered questionnaire: test-retest reliability

DEFF Research Database (Denmark)

Bernstein, Inge Thomsen; Sejr, T; Able, I

1996-01-01

A self-administered questionnaire assessing female lower urinary tract symptoms and their impact on quality of life is described and validated, on 56 females in six participating departments. The patients answered two identical questionnaires on separate occasions before treatment. Test-retest re...
Test-retest reliability of the diagnosis of schizoaffective disorder in childhood and adolescence - A systematic review and meta-analysis.

Science.gov (United States)

Salamon, Sarah; Santelmann, Hanno; Franklin, Jeremy; Baethge, Christopher

2018-04-01

Reliability of schizoaffective disorder (SAD) diagnoses is low in adults but unclear in children and adolescents (CAD). We estimate the test-retest reliability of SAD and its key differential diagnoses (schizophrenia, bipolar disorder, and unipolar depression). Systematic literature search of Medline, Embase, and PsycInfo for studies on test-retest reliability of SAD, in CAD. Cohen's kappa was extracted from studies. We performed meta-analysis for kappa, including subgroup and sensitivity analysis (PROSPERO protocol: CRD42013006713). Out of > 4000 records screened, seven studies were included. We estimated kappa values of 0.27 [95%-CI: 0.07 0.47] for SAD, 0.56 [0.29; 0.83] for schizophrenia, 0.64 [0.55; 0.74] for bipolar disorder, and 0.66 [0.52; 0.81] for unipolar depression. In 5/7 studies kappa of SAD was lower than that of schizophrenia; similar trends emerged for bipolar disorder (4/5) and unipolar depression (2/3). Estimates of positive agreement of SAD diagnoses supported these results. The number of studies and patients included is low. The point-estimate of the test-retest reliability of schizoaffective disorder is only fair, and lower than that of its main differential diagnoses. All kappa values under study were lower in children and adolescents samples than those reported for adults. Clinically, schizoaffective disorder should be diagnosed in strict adherence to the operationalized criteria and ought to be re-evaluated regularly. Should larger studies confirm the insufficient reliability of schizoaffective disorder in children and adolescents, the clinical value of the diagnosis is highly doubtful. Copyright © 2017. Published by Elsevier B.V.
Test-retest reliability of a questionnaire to assess physical environmental factors pertaining to physical activity

Directory of Open Access Journals (Sweden)

McGinn Aileen P

2005-06-01

Full Text Available Abstract Background Despite the documented benefits of physical activity, many adults do not obtain the recommended amounts. Barriers to physical activity occur at multiple levels, including at the individual, interpersonal, and environmental levels. Only until more recently has there been a concerted focus on how the physical environment might affect physical activity behavior. With this new area of study, self-report measures should be psychometrically tested before use in research studies. Therefore the objective of this study was to document the test-retest reliability of a questionnaire designed to assess physical environmental factors that might be associated with physical activity in a diverse adult population. Methods Test and retest surveys were conducted over the telephone with 106 African American and White women and men living in either Forsyth County, North Carolina or Jackson, Mississippi. Reliability of self-reported environmental factors across four domains (e.g., access to facilities and destinations, functionality and safety, aesthetics, natural environment was determined using intraclass correlation coefficients (ICC overall and separately by gender and race. Results Generally items displayed moderate and sometimes substantial reliability (ICC between 0.4 to 0.8, with a few differences by gender or race, across each of the domains. Conclusion This study provides some psychometric evidence for the use of many of these questions in studies examining the effect of self-reported physical environmental measures on physical activity behaviors, among African American and White women and men.
Work-related measures of physical and behavioral health function: Test-retest reliability.

Science.gov (United States)

Marino, Molly Elizabeth; Meterko, Mark; Marfeo, Elizabeth E; McDonough, Christine M; Jette, Alan M; Ni, Pengsheng; Bogusz, Kara; Rasch, Elizabeth K; Brandt, Diane E; Chan, Leighton

2015-10-01

The Work Disability Functional Assessment Battery (WD-FAB), developed for potential use by the US Social Security Administration to assess work-related function, currently consists of five multi-item scales assessing physical function and four multi-item scales assessing behavioral health function; the WD-FAB scales are administered as Computerized Adaptive Tests (CATs). The goal of this study was to evaluate the test-retest reliability of the WD-FAB Physical Function and Behavioral Health CATs. We administered the WD-FAB scales twice, 7-10 days apart, to a sample of 376 working age adults and 316 adults with work-disability. Intraclass correlation coefficients were calculated to measure the consistency of the scores between the two administrations. Standard error of measurement (SEM) and minimal detectable change (MDC90) were also calculated to measure the scales precision and sensitivity. For the Physical Function CAT scales, the ICCs ranged from 0.76 to 0.89 in the working age adult sample, and 0.77-0.86 in the sample of adults with work-disability. ICCs for the Behavioral Health CAT scales ranged from 0.66 to 0.70 in the working age adult sample, and 0.77-0.80 in the adults with work-disability. The SEM ranged from 3.25 to 4.55 for the Physical Function scales and 5.27-6.97 for the Behavioral Health function scales. For all scales in both samples, the MDC90 ranged from 7.58 to 16.27. Both the Physical Function and Behavioral Health CATs of the WD-FAB demonstrated good test-retest reliability in adults with work-disability and general adult samples, a critical requirement for assessing work related functioning in disability applicants and in other contexts. Copyright © 2015 Elsevier Inc. All rights reserved.
Validity and Reliability of a New Device (WIMU®) for Measuring Hamstring Muscle Extensibility.

Science.gov (United States)

Muyor, José M

2017-09-01

The aims of the current study were 1) to evaluate the validity of the WIMU ® system for measuring hamstring muscle extensibility in the passive straight leg raise (PSLR) test using an inclinometer for the criterion and 2) to determine the test-retest reliability of the WIMU ® system to measure hamstring muscle extensibility during the PSLR test. 55 subjects were evaluated on 2 separate occasions. Data from a Unilever inclinometer and WIMU ® system were collected simultaneously. Intraclass correlation coefficients (ICCs) for the validity were very high (0.983-1); a very low systematic bias (-0.21°--0.42°), random error (0.05°-0.04°) and standard error of the estimate (0.43°-0.34°) were observed (left-right leg, respectively) between the 2 devices (inclinometer and the WIMU ® system). The R 2 between the devices was 0.999 (p<0.001) in both the left and right legs. The test-retest reliability of the WIMU ® system was excellent, with ICCs ranging from 0.972-0.995, low coefficients of variation (0.01%), and a low standard error of the estimate (0.19-0.31°). The WIMU ® system showed strong concurrent validity and excellent test-retest reliability for the evaluation of hamstring muscle extensibility in the PSLR test. © Georg Thieme Verlag KG Stuttgart · New York.
Distress Tolerance Scale: A Study of Reliability and Validity

Directory of Open Access Journals (Sweden)

Ahmet Emre SARGIN

2012-11-01

Full Text Available Objective: Distress Tolerance Scale (DTS is developed by Simons and Gaher in order to measure individual differences in the capacity of distress tolerance.The aim of this study is to assess the reliability and validity of the Turkish version of DTS. Method: One hundred and sixty seven university students (male=66, female=101 participated in this study. Beck Anxiety Inventory (BAI, State-trait Anxiety Inventory (STAI and Discomfort Intolerance Scale (DIS were used to determine the criterion validity. Construct validity was evaluated with factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability, the scale was re-applied to 79 participants six weeks later. Results: To assess construct validity, factor analyses were performed using varimax principal components analysis with varimax rotation. While there were factors in the original study, our factor analysis resulted in three factors. Cronbach’s alpha coefficients for the entire scale and tolerance, regulation, self-efficacy subscales were .89, .90, .80 and .64 respectively. There were correlations at the level of 0.01 between the Trait Anxiety Inventory of STAI and BAI, and all the subscales of DTS and also between the State Anxiety Inventory and regulation subscale. Both of the subscales of DIS were correlated with the entire subscale and all the subscales except regulation at the level of 0.05.Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DTS had a satisfactory level of reliability and validity in Turkish university students.
Reliability and validity of the Safe Routes to school parent and student surveys

Directory of Open Access Journals (Sweden)

Evenson Kelly R

2011-06-01

Full Text Available Abstract Background The purpose of this study is to assess the reliability and validity of the U.S. National Center for Safe Routes to School's in-class student travel tallies and written parent surveys. Over 65,000 tallies and 374,000 parent surveys have been completed, but no published studies have examined their measurement properties. Methods Students and parents from two Charlotte, NC (USA elementary schools participated. Tallies were conducted on two consecutive days using a hand-raising protocol; on day two students were also asked to recall the previous days' travel. The recall from day two was compared with day one to assess 24-hour test-retest reliability. Convergent validity was assessed by comparing parent-reports of students' travel mode with student-reports of travel mode. Two-week test-retest reliability of the parent survey was assessed by comparing within-parent responses. Reliability and validity were assessed using kappa statistics. Results A total of 542 students participated in the in-class student travel tally reliability assessment and 262 parent-student dyads participated in the validity assessment. Reliability was high for travel to and from school (kappa > 0.8; convergent validity was lower but still high (kappa > 0.75. There were no differences by student grade level. Two-week test-retest reliability of the parent survey (n = 112 ranged from moderate to very high for objective questions on travel mode and travel times (kappa range: 0.62 - 0.97 but was substantially lower for subjective assessments of barriers to walking to school (kappa range: 0.31 - 0.76. Conclusions The student in-class student travel tally exhibited high reliability and validity at all elementary grades. The parent survey had high reliability on questions related to student travel mode, but lower reliability for attitudinal questions identifying barriers to walking to school. Parent survey design should be improved so that responses clearly indicate
Reliability and validity of the Safe Routes to school parent and student surveys.

Science.gov (United States)

McDonald, Noreen C; Dwelley, Amanda E; Combs, Tabitha S; Evenson, Kelly R; Winters, Richard H

2011-06-08

The purpose of this study is to assess the reliability and validity of the U.S. National Center for Safe Routes to School's in-class student travel tallies and written parent surveys. Over 65,000 tallies and 374,000 parent surveys have been completed, but no published studies have examined their measurement properties. Students and parents from two Charlotte, NC (USA) elementary schools participated. Tallies were conducted on two consecutive days using a hand-raising protocol; on day two students were also asked to recall the previous days' travel. The recall from day two was compared with day one to assess 24-hour test-retest reliability. Convergent validity was assessed by comparing parent-reports of students' travel mode with student-reports of travel mode. Two-week test-retest reliability of the parent survey was assessed by comparing within-parent responses. Reliability and validity were assessed using kappa statistics. A total of 542 students participated in the in-class student travel tally reliability assessment and 262 parent-student dyads participated in the validity assessment. Reliability was high for travel to and from school (kappa > 0.8); convergent validity was lower but still high (kappa > 0.75). There were no differences by student grade level. Two-week test-retest reliability of the parent survey (n=112) ranged from moderate to very high for objective questions on travel mode and travel times (kappa range: 0.62-0.97) but was substantially lower for subjective assessments of barriers to walking to school (kappa range: 0.31-0.76). The student in-class student travel tally exhibited high reliability and validity at all elementary grades. The parent survey had high reliability on questions related to student travel mode, but lower reliability for attitudinal questions identifying barriers to walking to school. Parent survey design should be improved so that responses clearly indicate issues that influence parental decision making in regards to their
Test-retest reliability of behavioral measures of impulsive choice, impulsive action, and inattention.

Science.gov (United States)

Weafer, Jessica; Baggott, Matthew J; de Wit, Harriet

2013-12-01

Behavioral measures of impulsivity are widely used in substance abuse research, yet relatively little attention has been devoted to establishing their psychometric properties, especially their reliability over repeated administration. The current study examined the test-retest reliability of a battery of standardized behavioral impulsivity tasks, including measures of impulsive choice (i.e., delay discounting, probability discounting, and the Balloon Analogue Risk Task), impulsive action (i.e., the stop signal task, the go/no-go task, and commission errors on the continuous performance task), and inattention (i.e., attention lapses on a simple reaction time task and omission errors on the continuous performance task). Healthy adults (n = 128) performed the battery on two separate occasions. Reliability estimates for the individual tasks ranged from moderate to high, with Pearson correlations within the specific impulsivity domains as follows: impulsive choice (r range: .76-.89, ps reliable measures and thus can be confidently used to assess various facets of impulsivity as intermediate phenotypes for drug abuse.
Development, construct validity and test–retest reliability of a field-based wheelchair mobility performance test for wheelchair basketball

NARCIS (Netherlands)

de Witte, AMH; Hoozemans, MJM; Berger, MAM; van der Slikke, R.M.A.; van der Woude, LHV; Veeger, H.E.J.

2018-01-01

The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation
Development, construct validity and test–retest reliability of a field-based wheelchair mobility performance test for wheelchair basketball

NARCIS (Netherlands)

de Witte, Annemarie M H; Hoozemans, Marco J M; Berger, Monique A M; van der Slikke, Rienk M A; van der Woude, Lucas H V; Veeger, Dirkjan (H E J)

The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of
The Persian version of phonological test of diagnostic evaluation articulation and phonology for Persian speaking children and investigating its validity and reliability

Directory of Open Access Journals (Sweden)

Talieh Zarifian

2014-10-01

Full Text Available Background and Aim: Speech and language pathologists (SLP often refer to phonological data as part of their assessment protocols in evaluating the communication skills of children. The aim of this study was to develop the Persian version of the phonological test in evaluating and diagnosing communication skills in Persian speaking children and to evaluate its validity and reliability.Methods: The Persian phonological test (PPT was conducted on 387 monolingual Persian speaking boys and girls (3-6 years of age who were selected from 12 nurseries in the northwest region of Tehran. Content validity ratio (CVR and content validity index (CVI were assessed by speechtherapists and linguists. Correlation between speech and language pathologists experts' opinions and Persian phonological test results in children with and without phonological disorders was evaluated to investigate the Persian phonological test validity. In addition, the Persian phonological test test-retest reliability was investigated.Results: Both content validity ratio and content validity index were found to be acceptable (CVR≥94.71 and CVI=97.35. The PPT validity was confirmed by finding a good correlation between s peech and language pathologists experts' opinions and Persian phonological test results ( r Kappa =0.73 and r Spearman =0.76. The percent of agreement between transcription and analyzing error patterns in test-retest (ranging from 86.27%-100% and score-rescore (ranging from 94.28%-100% showed that Persian phonological test had a very high reliability.Conclusion: The results of this study show that the Persian phonological test seems to be a suitable tool in evaluating phonological skills of Persian speaking children in clinical settings and research projects.
The Neck Disability Index-Russian Language Version (NDI-RU): A Study of Validity and Reliability.

Science.gov (United States)

Bakhtadze, Maxim A; Vernon, Howard; Zakharova, Olga B; Kuzminov, Kirill O; Bolotov, Dmitry A

2015-07-15

Cross-cultural adaptation and psychometric testing. To perform a validated Russian translation and then to evaluate the validity and reliability of the Russian language version of the Neck Disability Index (NDI-RU). Neck pain is highly prevalent and can greatly affect daily activity. The Neck Disability Index (NDI) is the most frequently used scale for self-rating of disability due to neck pain. Its translated versions are applied in many countries. However, the Russian language version of the NDI has not been developed yet. Cross-cultural adaptation of the NDI-RU was performed according to established guidelines. Then, the NDI-RU was evaluated for content validity, concurrent criterion validity, internal consistency, test-retest reliability, factor structure, and minimum detectable change. Two hundred thirty-two patients took part in the study in total: 109 in validity (39.5 ± 10 yr), 123 in reliability (38.4 ± 11 yr; 80 in the test-retest phase). A culturally valid translation was achieved. NDI-RU total scores were distributed normally. Floor/ceiling effects were absent. Good values of Cronbach α were obtained for each item (from 0.80 to 0.84) and for the total NDI-RU (0.83). A 2-factor solution was found for the NDI-RU. The average interitem correlation coefficient was 0.53. Intraclass correlation coefficients for test-retest reliability coefficients ranged from 0.65 to 0.92 for different items and 0.91 for the total NDI-RU. Moderate correlation (Spearman rs = 0.62; P Russian language version of the Neck Disability Index resulted in a valid, reliable instrument that can be used both in clinical practice and scientific investigations. 1.
Reliability and validity of the Brief Pain Inventory in individuals with chronic obstructive pulmonary disease.

Science.gov (United States)

Chen, Y-W; HajGhanbari, B; Road, J D; Coxson, H O; Camp, P G; Reid, W D

2018-06-08

Pain is prevalent in chronic obstructive pulmonary disease (COPD) and the Brief Pain Inventory (BPI) appears to be a feasible questionnaire to assess this symptom. However, the reliability and validity of the BPI have not been determined in individuals with COPD. This study aimed to determine the internal consistency, test-retest reliability and validity (construct, convergent, divergent and discriminant) of the BPI in individuals with COPD. In order to examine the test-retest reliability, individuals with COPD were recruited from pulmonary rehabilitation programmes to complete the BPI twice 1 week apart. In order to investigate validity, de-identified data was retrieved from two previous studies, including forced expiratory volume in 1-s, age, sex and data from four questionnaires: the BPI, short-form McGill Pain Questionnaire (SF-MPQ), 36-Item Short Form Survey (SF-36) and Community Health Activities Model Program for Seniors (CHAMPS) questionnaire. In total, 123 participants were included in the analyses (eligible data were retrieved from 86 participants and additional 37 participants were recruited). The BPI demonstrated excellent internal consistency and test-retest reliability. It also showed convergent validity with the SF-MPQ and divergent validity with the SF-36. The factor analysis yielded two factors of the BPI, which demonstrated that the two domains of the BPI measure the intended constructs. The BPI can also discriminate pain levels among COPD patients with varied levels of quality of life (SF-36) and physical activity (CHAMPS). The BPI is a reliable and valid pain questionnaire that can be used to evaluate pain in COPD. This study formally established the reliability and validity of the BPI in individuals with COPD, which have not been determined in this patient group. The results of this study provide strong evidence that assessment results from this pain questionnaire are reliable and valid. © 2018 European Pain Federation - EFIC®.
Test-retest reliability and task order effects of emotional cognitive tests in healthy subjects.

Science.gov (United States)

Adams, Thomas; Pounder, Zoe; Preston, Sally; Hanson, Andy; Gallagher, Peter; Harmer, Catherine J; McAllister-Williams, R Hamish

2016-11-01

Little is known of the retest reliability of emotional cognitive tasks or the impact of using different tasks employing similar emotional stimuli within a battery. We investigated this in healthy subjects. We found improved overall performance in an emotional attentional blink task (EABT) with repeat testing at one hour and one week compared to baseline, but the impact of an emotional stimulus on performance was unchanged. Similarly, performance on a facial expression recognition task (FERT) was better one week after a baseline test, though the relative effect of specific emotions was unaltered. There was no effect of repeat testing on an emotional word categorising, recall and recognition task. We found no difference in performance in the FERT and EABT irrespective of task order. We concluded that it is possible to use emotional cognitive tasks in longitudinal studies and combine tasks using emotional facial stimuli in a single battery.
The eye-complaint questionnaire in a visual display unit work environment: Internal consistency and test-retest reliability

NARCIS (Netherlands)

Steenstra, Ivan A.; Sluiter, Judith K.; Frings-Dresen, Monique H. W.

2009-01-01

The internal consistency and test-retest reliability of a 10-item eye-complaint questionnaire (ECQ) were examined within a sample of office workers. Repeated within-subjects measures were performed within a single day and over intervals of 1 and 7 d. Questionnaires were completed by 96 workers (70%
Hypertension Knowledge-Level Scale (HK-LS: A Study on Development, Validity and Reliability

Directory of Open Access Journals (Sweden)

Cemalettin Kalyoncu

2012-03-01

Full Text Available This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Hypertension Knowledge-Level Scale (HK-LS): a study on development, validity and reliability.

Science.gov (United States)

Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin

2012-03-01

This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥ 18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Intra-Rater, Inter-Rater and Test-Retest Reliability of an Instrumented Timed Up and Go (iTUG Test in Patients with Parkinson's Disease.

Directory of Open Access Journals (Sweden)

Rob C van Lummel

Full Text Available The "Timed Up and Go" (TUG is a widely used measure of physical functioning in older people and in neurological populations, including Parkinson's Disease. When using an inertial sensor measurement system (instrumented TUG [iTUG], the individual components of the iTUG and the trunk kinematics can be measured separately, which may provide relevant additional information.The aim of this study was to determine intra-rater, inter-rater and test-retest reliability of the iTUG in patients with Parkinson's Disease.Twenty eight PD patients, aged 50 years or older, were included. For the iTUG the DynaPort Hybrid (McRoberts, The Hague, The Netherlands was worn at the lower back. The device measured acceleration and angular velocity in three directions at a rate of 100 samples/s. Patients performed the iTUG five times on two consecutive days. Repeated measurements by the same rater on the same day were used to calculate intra-rater reliability. Repeated measurements by different raters on the same day were used to calculate intra-rater and inter-rater reliability. Repeated measurements by the same rater on different days were used to calculate test-retest reliability.Nineteen ICC values (15% were ≥ 0.9 which is considered as excellent reliability. Sixty four ICC values (49% were ≥ 0.70 and < 0.90 which is considered as good reliability. Thirty one ICC values (24% were ≥ 0.50 and < 0.70, indicating moderate reliability. Sixteen ICC values (12% were ≥ 0.30 and < 0.50 indicating poor reliability. Two ICT values (2% were < 0.30 indicating very poor reliability.In conclusion, in patients with Parkinson's disease the intra-rater, inter-rater, and test-retest reliability of the individual components of the instrumented TUG (iTUG was excellent to good for total duration and for turning durations, and good to low for the sub durations and for the kinematics of the SiSt and StSi. The results of this fully automated analysis of instrumented TUG movements

Criterion validity and reliability of a smartphone delivered sub-maximal fitness test for people with type 2 diabetes

DEFF Research Database (Denmark)

Brinklov, Cecilie Fau; Thorsen, Ida Kær; Karstoft, Kristian

2016-01-01

Background: Prevention of multi-morbidities following non-communicable diseases requires a systematic registration of adverse modifiable risk factors, including low physical fitness. The aim of the study was to establish criterion validity and reliability of a smartphone app (InterWalk) delivered....... The algorithm was validated using leave-one-out cross validation. Test-retest reliability was tested in a subset of participants (N = 10). Results: The overall VO2peak prediction of the algorithm (R2) was 0.60 and 0.45 when the smartphone was placed in the pockets of the pants and jacket, respectively (p ... calorimetry and the acceleration (vector magnitude) from the smartphone was obtained. The vector magnitude was used to predict VO2peak along with the co-variates weight, height and sex. The validity of the algorithm was tested when the smartphone was placed in the right pocket of the pants or jacket...
Validity and Reliability of the Upper Extremity Work Demands Scale.

Science.gov (United States)

Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K

2017-12-01

Purpose To evaluate validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise to investigate other types of validity, such as construct validity, in further research.
Test-Retest Reliability of the Parent Behavior Importance Questionnaire-Revised and the Parent Behavior Frequency Questionnaire-Revised

Science.gov (United States)

Mowder, Barbara A.; Shamah, Renee

2011-01-01

This study evaluated the test-retest reliability of two parenting measures: the Parent Behavior Importance Questionnaire-Revised (PBIQ-R) and Parent Behavior Frequency Questionnaire-Revised (PBFQ-R). These self-report parenting behavior assessment measures may be utilized as pre- and post-parent education program measures, with parents as well as…
The Reliability and Validity of the Coopersmith Self-Esteem Inventory-Form B.

Science.gov (United States)

Chiu, Lian-Hwang

1985-01-01

The purpose of this study was to determine the test-retest reliability and concurrent validity of the short form (Form B) of the Coopersmith Self-Esteem Inventory. Criterion measures for validity included: (1) sociometric measures; (2) teacher's popularity ranking; and, (3) self-esteem rating. (Author/LMO)
Reliability and validity of the Japanese version of the Resilience Scale and its short version

Directory of Open Access Journals (Sweden)

Kondo Maki

2010-11-01

Full Text Available Abstract Background The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS and short version of the RS (RS-14. Findings The original English version of RS was translated to Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D, Rosenberg Self-Esteem Scale (RSES, Social Support Questionnaire (SSQ, Perceived Stress Scale (PSS, and Sheehan Disability Scale (SDS were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p Conclusions This study demonstrates that the Japanese version of RS has psychometric properties with high degrees of internal consistency, high test-retest reliability, and relatively low concurrent validity. RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
Cultural Adaptation of the Portuguese Version of the "Sniffin' Sticks" Smell Test: Reliability, Validity, and Normative Data.

Science.gov (United States)

Ribeiro, João Carlos; Simões, João; Silva, Filipe; Silva, Eduardo D; Hummel, Cornelia; Hummel, Thomas; Paiva, António

2016-01-01

The cross-cultural adaptation and validation of the Sniffin`Sticks test for the Portuguese population is described. Over 270 people participated in four experiments. In Experiment 1, 67 participants rated the familiarity of presented odors and seven descriptors of the original test were adapted to a Portuguese context. In Experiment 2, the Portuguese version of Sniffin`Sticks test was administered to 203 healthy participants. Older age, male gender and active smoking status were confirmed as confounding factors. The third experiment showed the validity of the Portuguese version of Sniffin`Sticks test in discriminating healthy controls from patients with olfactory dysfunction. In Experiment 4, the test-retest reliability for both the composite score (r71 = 0.86) and the identification test (r71 = 0.62) was established (pPortuguese version of Sniffin`Sticks test is provided, showing good validity and reliability and effectively distinguishing patients from healthy controls with high sensitivity and specificity. The Portuguese version of Sniffin`Sticks test identification test is a clinically suitable screening tool in routine outpatient Portuguese settings.
Health service quality scale: Brazilian Portuguese translation, reliability and validity

Science.gov (United States)

2013-01-01

Background The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. Methods We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson’s correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach’s alpha coefficient; the intraclass (ICC) and Pearson’s correlation coefficients were used for test-retest reliability. Results One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson’s correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson’s correlation coefficient was 0.89 and ICC was 0.90. Conclusions The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality. PMID:23327598
Measuring older adults' sedentary time: reliability, validity, and responsiveness.

Science.gov (United States)

Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville

2011-11-01

With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is
Test-retest reliability of Antonovsky's 13-item sense of coherence scale in patients with hand-related disorders

DEFF Research Database (Denmark)

Hansen, Alice Ørts; Kristensen, Hanne Kaae; Cederlund, Ragnhild

2017-01-01

to be a powerful tool to measure the ICF component personal factors, which could have an impact on patients' rehabilitation outcomes. Implications for rehabilitation Antonovsky's SOC-13 scale showed test-retest reliability for patients with hand-related disorders. The SOC-13 scale could be a suitable tool to help...... measure personal factors....
Validity and reliability of the Utrecht Work Engagement Scale-Student Version in Sri Lanka.

Science.gov (United States)

Wickramasinghe, Nuwan Darshana; Dissanayake, Devani Sakunthala; Abeywardena, Gihan Sajiwa

2018-05-04

The present study was aimed at assessing the validity and the reliability of the Sinhala version of the Utrecht Work Engagement Scale-Student Version (UWES-S) among collegiate cycle students in Sri Lanka. The 17-item UWES-S was translated to Sinhala and the judgmental validity was assessed by a multi-disciplinary panel of experts. Construct validity of the UWES-S was appraised by using multi-trait scaling analysis and exploratory factor analysis (EFA) on data obtained from a sample of 194 grade thirteen students in the Kurunegala district, Sri Lanka. Reliability of the UWES-S was assessed by using internal consistency and test-retest reliability. Except for item 13, all other items showed good psychometric properties in judgemental validity, item-convergent validity and item-discriminant validity. EFA using principal component analysis with Oblimin rotation, suggested a three-factor solution (including vigor, dedication and absorption subscales) explaining 65.4% of the total variance for the 16-item UWES-S (with item 13 deleted). All three subscales show high internal consistency with Cronbach's α coefficient values of 0.867, 0.819, and 0.903 and test-retest reliability was high (p valid and a reliable instrument to assess work engagement among collegiate cycle students in Sri Lanka.
The Trojan Lifetime Champions Health Survey: Development, Validity, and Reliability

Science.gov (United States)

Sorenson, Shawn C.; Romano, Russell; Scholefield, Robin M.; Schroeder, E. Todd; Azen, Stanley P.; Salem, George J.

2015-01-01

Context Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. Objective To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Design Descriptive laboratory study. Setting A large National Collegiate Athletic Association Division I university. Patients or Other Participants A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Intervention(s) Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Main Outcome Measure(s) Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Results Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent
Validity and reliability of short form-12 questionnaire in Iranian hemodialysis patients

DEFF Research Database (Denmark)

Pakpour, Amir H.; Nourozi, Saeedeh; Mølsted, Stig

2011-01-01

INTRODUCTION: The aim of the study was to assess the validity and reliability of the SF-12 questionnaire in a sample of Iranian patients undergoing hemodialysis. MATERIALS AND METHODS: One hundred and forty-four hemodialysis patients were included from dialysis centers in Zanjan, Iran, and were...... asked to complete the SF-12 and SF-36 questionnaires. An initial test-retest reliability evaluation was performed on a sample of 70 patients from the total group, with a retest interval of 14 days. Reliability was estimated by internal consistency and validity was assessed using known-group comparisons...... and construct validity on the patient group as a whole. A linear regression analysis was used to assess any variation in the physical component summary and mental component summary scores of the SF-36 with the respective component summary scores of the SF-12. In addition, the factor structure...
The Persian version of auditory word discrimination test (P-AWDT) for children: Development, validity, and reliability.

Science.gov (United States)

Hashemi, Nassim; Ghorbani, Ali; Soleymani, Zahra; Kamali, Mohmmad; Ahmadi, Zohreh Ziatabar; Mahmoudian, Saeid

2018-07-01

Auditory discrimination of speech sounds is an important perceptual ability and a precursor to the acquisition of language. Auditory information is at least partially necessary for the acquisition and organization of phonological rules. There are few standardized behavioral tests to evaluate phonemic distinctive features in children with or without speech and language disorders. The main objective of the present study was the development, validity, and reliability of the Persian version of auditory word discrimination test (P-AWDT) for 4-8-year-old children. A total of 120 typical children and 40 children with speech sound disorder (SSD) participated in the present study. The test comprised of 160 monosyllabic paired-words distributed in the Forms A-1 and the Form A-2 for the initial consonants (80 words) and the Forms B-1 and the Form B-2 for the final consonants (80 words). Moreover, the discrimination of vowels was randomly included in all forms. Content validity was calculated and 50 children repeated the test twice with two weeks of interval (test-retest reliability). Further analysis was also implemented including validity, intraclass correlation coefficient (ICC), Cronbach's alpha (internal consistency), age groups, and gender. The content validity index (CVI) and the test-retest reliability of the P-AWDT were achieved 63%-86% and 81%-96%, respectively. Moreover, the total Cronbach's alpha for the internal consistency was estimated relatively high (0.93). Comparison of the mean scores of the P-AWDT in the typical children and the children with SSD revealed a significant difference. The results revealed that the group with SSD had greater severity of deficit than the typical group in auditory word discrimination. In addition, the difference between the age groups was statistically significant, especially in 4-4.11-year-old children. The performance of the two gender groups was relatively same. The comparison of the P-AWDT scores between the typical children
Life Satisfaction Questionnaire (Lisat-9): Reliability and Validity for Patients with Acquired Brain Injury

Science.gov (United States)

Boonstra, Anne M.; Reneman, Michiel F.; Stewart, Roy E.; Balk, Gerlof A.

2012-01-01

The aim of this study was to determine the reliability and discriminant validity of the Dutch version of the life satisfaction questionnaire (Lisat-9 DV) to assess patients with an acquired brain injury. The reliability study used a test-retest design, and the validity study used a cross-sectional design. The setting was the general rehabilitation…
Test-retest reliability of Physical Activity Neighborhood Environment Scale among urban men and women in Nanjing, China.

Science.gov (United States)

Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F

2018-03-01

The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. Intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. Totally, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Reliability and validity of the foot and ankle outcome score: a validation study from Iran.

Science.gov (United States)

Negahban, Hossein; Mazaheri, Masood; Salavati, Mahyar; Sohani, Soheil Mansour; Askari, Marjan; Fanian, Hossein; Parnianpour, Mohamad

2010-05-01

The aims of this study were to culturally adapt and validate the Persian version of Foot and Ankle Outcome Score (FAOS) and present data on its psychometric properties for patients with different foot and ankle problems. The Persian version of FAOS was developed after a standard forward-backward translation and cultural adaptation process. The sample included 93 patients with foot and ankle disorders who were asked to complete two questionnaires: FAOS and Short-Form 36 Health Survey (SF-36). To determine test-retest reliability, 60 randomly chosen patients completed the FAOS again 2 to 6 days after the first administration. Test-retest reliability and internal consistency were assessed using intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. To evaluate convergent and divergent validity of FAOS compared to similar and dissimilar concepts of SF-36, the Spearman's rank correlation was used. Dimensionality was determined by assessing item-subscale correlation corrected for overlap. The results of test-retest reliability show that all the FAOS subscales have a very high ICC, ranging from 0.92 to 0.96. The minimum Cronbach's alpha level of 0.70 was exceeded by most subscales. The Spearman's correlation coefficient for convergent construct validity fell within 0.32 to 0.58 for the main hypotheses presented a priori between FAOS and SF-36 subscales. For dimensionality, the minimum Spearman's correlation coefficient of 0.40 was exceeded by most items. In conclusion, the results of our study show that the Persian version of FAOS seems to be suitable for Iranian patients with various foot and ankle problems especially lateral ankle sprain. Future studies are needed to establish stronger psychometric properties for patients with different foot and ankle problems.
Short-term test-retest-reliability of conditioned pain modulation using the cold-heat-pain method in healthy subjects and its correlation to parameters of standardized quantitative sensory testing.

Science.gov (United States)

Gehling, Julia; Mainka, Tina; Vollert, Jan; Pogatzki-Zahn, Esther M; Maier, Christoph; Enax-Krumova, Elena K

2016-08-05

Conditioned Pain Modulation (CPM) is often used to assess human descending pain inhibition. Nine different studies on the test-retest-reliability of different CPM paradigms have been published, but none of them has investigated the commonly used heat-cold-pain method. The results vary widely and therefore, reliability measures cannot be extrapolated from one CPM paradigm to another. Aim of the present study was to analyse the test-retest-reliability of the common heat-cold-pain method and its correlation to pain thresholds. We tested the short-term test-retest-reliability within 40 ± 19.9 h using a cold-water immersion (10 °C, left hand) as conditioning stimulus (CS) and heat pain (43-49 °C, pain intensity 60 ± 5 on the 101-point numeric rating scale, right forearm) as test stimulus (TS) in 25 healthy right-handed subjects (12females, 31.6 ± 14.1 years). The TS was applied 30s before (TSbefore), during (TSduring) and after (TSafter) the 60s CS. The difference between the pain ratings for TSbefore and TSduring represents the early CPM-effect, between TSbefore and TSafter the late CPM-effect. Quantitative sensory testing (QST, DFNS protocol) was performed on both sessions before the CPM assessment. paired t-tests, Intraclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Pearson's correlation, Bland-Altman analysis, significance level p Pain ratings during CPM correlated significantly (ICC: 0.411…0.962) between both days, though ratings for TSafter were lower on day 2 (p pain thresholds. The short-term test-retest-reliability of the early CPM-effect using the heat-cold-pain method in healthy subjects achieved satisfying results in terms of the ICC. The SRD of the early CPM effect showed that an individual change of > 20 NRS can be attributed to a real change rather than chance. The late CPM-effect was weaker and not reliable.
A Test-Retest Analysis of the Vanderbilt Assessment for Leadership in Education in the USA

Science.gov (United States)

Minor, Elizabeth Covay; Porter, Andrew C.; Murphy, Joseph; Goldring, Ellen; Elliott, Stephen N.

2017-01-01

The Vanderbilt Assessment for Leadership in Education (VAL-ED) is a 360-degree learning-centered behaviors principal evaluation tool that includes ratings from the principal, supervisors, and teachers. The current study assesses the test-retest reliability of the VAL-ED for a sample of seven school districts as part of multiple validity and…
Reliability and validity of a tool to assess airway management skills in anesthesia trainees

Directory of Open Access Journals (Sweden)

Aliya Ahmed

2016-01-01

Conclusion: The tool designed to assess bag-mask ventilation and tracheal intubation skills in anesthesia trainees demonstrated excellent inter-rater reliability, fair test-retest reliability, and good construct validity. The authors recommend its use for formative and summative assessment of junior anesthesia trainees.
The Vocal Cord Dysfunction Questionnaire: Validity and Reliability of the Persian Version.

Science.gov (United States)

Ghaemi, Hamide; Khoddami, Seyyedeh Maryam; Soleymani, Zahra; Zandieh, Fariborz; Jalaie, Shohreh; Ahanchian, Hamid; Khadivi, Ehsan

2017-12-25

The aim of this study was to develop, validate, and assess the reliability of the Persian version of Vocal Cord Dysfunction Questionnaire (VCDQ P ). The study design was cross-sectional or cultural survey. Forty-four patients with vocal fold dysfunction (VFD) and 40 healthy volunteers were recruited for the study. To assess the content validity, the prefinal questions were given to 15 experts to comment on its essential. Ten patients with VFD rated the importance of VCDQ P in detecting face validity. Eighteen of the patients with VFD completed the VCDQ 1 week later for test-retest reliability. To detect absolute reliability, standard error of measurement and smallest detected change were calculated. Concurrent validity was assessed by completing the Persian Chronic Obstructive Pulmonary Disease (COPD) Assessment Test (CAT) by 34 patients with VFD. Discriminant validity was measured from 34 participants. The VCDQ was further validated by administering the questionnaire to 40 healthy volunteers. Validation of the VCDQ as a treatment outcome tool was conducted in 18 patients with VFD using pre- and posttreatment scores. The internal consistency was confirmed (Cronbach α = 0.78). The test-retest reliability was excellent (intraclass correlation coefficient = 0.97). The standard error of measurement and smallest detected change values were acceptable (0.39 and 1.08, respectively). There was a significant correlation between the VCDQ P and the CAT total scores (P validity was significantly different. The VCDQ scores in patients with VFD before and after treatment was significantly different (P valid and reliable self-administered questionnaire in Persian-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

Reliability and validity of the visual analogue scale for disability in patients with chronic musculoskeletal pain

NARCIS (Netherlands)

Boonstra, Anne M.; Schiphorst Preuper, Henrica R.; Reneman, Michiel F.; Posthumus, Jitze B.; Stewart, Roy E.

To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional
Reliability, validity and description of timed performance of the Jebsen-Taylor Test in patients with muscular dystrophies.

Science.gov (United States)

Artilheiro, Mariana Cunha; Fávero, Francis Meire; Caromano, Fátima Aparecida; Oliveira, Acary de Souza Bulle; Carvas, Nelson; Voos, Mariana Callil; Sá, Cristina Dos Santos Cardoso de

2017-12-08

The Jebsen-Taylor Test evaluates upper limb function by measuring timed performance on everyday activities. The test is used to assess and monitor the progression of patients with Parkinson disease, cerebral palsy, stroke and brain injury. To analyze the reliability, internal consistency and validity of the Jebsen-Taylor Test in people with Muscular Dystrophy and to describe and classify upper limb timed performance of people with Muscular Dystrophy. Fifty patients with Muscular Dystrophy were assessed. Non-dominant and dominant upper limb performances on the Jebsen-Taylor Test were filmed. Two raters evaluated timed performance for inter-rater reliability analysis. Test-retest reliability was investigated by using intraclass correlation coefficients. Internal consistency was assessed using the Cronbach alpha. Construct validity was conducted by comparing the Jebsen-Taylor Test with the Performance of Upper Limb. The internal consistency of Jebsen-Taylor Test was good (Cronbach's α=0.98). A very high inter-rater reliability (0.903-0.999), except for writing with an Intraclass correlation coefficient of 0.772-1.000. Strong correlations between the Jebsen-Taylor Test and the Performance of Upper Limb Module were found (rho=-0.712). The Jebsen-Taylor Test is a reliable and valid measure of timed performance for people with Muscular Dystrophy. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.

Science.gov (United States)

Oremus, Mark; Oremus, Carolina; Hall, Geoffrey B C; McKinnon, Margaret C

2012-01-01

Quality assessment of included studies is an important component of systematic reviews. The authors investigated inter-rater and test-retest reliability for quality assessments conducted by inexperienced student raters. Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle-Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13-20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. McMaster Integrative Neuroscience Discovery and Study Program. 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test-retest reliability using ICC(2,1). Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI -0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were -0.14 (95% CI -0.28 to 0.00) to 0.39 (95% CI -0.02 to 0.81) for the NOS cohort and -0.20 (95% CI -0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case-control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test-retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were -0.19 (95% CI -0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case-control, the ICC(2,1)s were 0.46 (95% CI -0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). Inter-rater reliability was generally poor
The reliability and validity of a sexual functioning questionnaire.

Science.gov (United States)

Corty, E W; Althof, S E; Kurit, D M

1996-01-01

The present study assessed the reliability and validity of a measure of sexual functioning, the CMSH-SFQ, for male patients and their partners. The CMSH-SFQ measures erectile and orgasmic functioning, sexual drive, frequency of sexual behavior, and sexual satisfaction. Test-retest reliability was assessed with 19 males and 19 females for the baseline CMSH-SFQ. Criterion validity was measured by comparing the answers of 25 male patients to those of their partners at baseline and follow-up. The majority of items had acceptable levels of reliability and validity. The CMSH-SFQ provides a reliable and valid device that can be used to measure global sexual functioning in men and their partners and may be used to evaluate the efficacy of treatments for sexual dysfunctions. Limitations and suggestions for use of the CMSH-SFQ are addressed.
Identification of conductive hearing loss using air conduction tests alone: reliability and validity of an automatic test battery.

Science.gov (United States)

Convery, Elizabeth; Keidser, Gitte; Seeto, Mark; Freeston, Katrina; Zhou, Dan; Dillon, Harvey

2014-01-01

The primary objective of this study was to determine whether a combination of automatically administered pure-tone audiometry and a tone-in-noise detection task, both delivered via an air conduction (AC) pathway, could reliably and validly predict the presence of a conductive component to the hearing loss. The authors hypothesized that performance on the battery of tests would vary according to hearing loss type. A secondary objective was to evaluate the reliability and validity of a novel automatic audiometry algorithm to assess its suitability for inclusion in the test battery. Participants underwent a series of hearing assessments that were conducted in a randomized order: manual pure-tone air conduction audiometry and bone conduction audiometry; automatic pure-tone air conduction audiometry; and an automatic tone-in-noise detection task. The automatic tests were each administered twice. The ability of the automatic test battery to: (a) predict the presence of an air-bone gap (ABG); and (b) accurately measure AC hearing thresholds was assessed against the results of manual audiometry. Test-retest conditions were compared to determine the reliability of each component of the automatic test battery. Data were collected on 120 ears from normal-hearing and conductive, sensorineural, and mixed hearing-loss subgroups. Performance differences between different types of hearing loss were observed. Ears with a conductive component (conductive and mixed ears) tended to have normal signal to noise ratios (SNR) despite impaired thresholds in quiet, while ears without a conductive component (normal and sensorineural ears) demonstrated, on average, an increasing relationship between their thresholds in quiet and their achieved SNR. Using the relationship between these two measures among ears with no conductive component as a benchmark, the likelihood that an ear has a conductive component can be estimated based on the deviation from this benchmark. The sensitivity and
Test-retest reliability and factor structures of organizational citizenship behavior for Hong Kong workers.

Science.gov (United States)

Lam, S S

2001-02-01

In 1990 Podsakoff, MacKenzie, Moorman, and Fetter developed a scale to measure the five dimensions of organizational citizenship behavior. Test-retest data over 15 weeks are reported for this scale for a sample of 82 female and 32 male Chinese tellers (ages 18 to 54 years) from a large international bank in Hong Kong. Stability was .83, and there was no significant change between Times 1 and 2. Analysis indicated the five-factor structure and showed it to be a reliable measure when used with a nonwestern sample.
Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

Science.gov (United States)

Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

2016-01-01

To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
The Dutch language anterior cruciate ligament return to sport after injury scale (ACL-RSI) - validity and reliability.

Science.gov (United States)

Slagers, Anton J; Reininga, Inge H F; van den Akker-Scheek, Inge

2017-02-01

The ACL-Return to Sport after Injury scale (ACL-RSI) measures athletes' emotions, confidence in performance, and risk appraisal in relation to return to sport after ACL reconstruction. Aim of this study was to study the validity and reliability of the Dutch version of the ACL-RSI (ACL-RSI (NL)). Total 150 patients, who were 3-16 months postoperative, completed the ACL-RSI(NL) and 5 other questionnaires regarding psychological readiness to return to sports, knee-specific physical functioning, kinesiophobia, and health-specific locus of control. Construct validity of the ACL-RSI(NL) was determined with factor analysis and by exploring 10 hypotheses regarding correlations between ACL-RSI(NL) and the other questionnaires. For test-retest reliability, 107 patients (5-16 months postoperative) completed the ACL-RSI(NL) again 2 weeks after the first administration. Cronbach's alpha, Intraclass Correlation Coefficient (ICC), SEM, and SDC, were calculated. Bland-Altman analysis was conducted to assess bias between test and retest. Nine hypotheses (90%) were confirmed, indicating good construct validity. The ACL-RSI(NL) showed good internal consistency (Cronbach's alpha 0.94) and test-retest reliability (ICC 0.93). SEM was 5.5 and SDC was 15. A significant bias of 3.2 points between test and retest was found. Therefore, the ACL-RSI(NL) can be used to investigate psychological factors relevant to returning to sport after ACL reconstruction.
Evaluation of Factorial Validity and Reliability of a Food Behavior Checklist for Low-Income Filipinos.

Science.gov (United States)

Suzuki, Asuka; Choi, So Yung; Lim, Eunjung; Tauyan, Socorro; Banna, Jinan C

To examine factorial validity, test-retest reliability, and internal consistency of a Tagalog-language food behavior checklist (FBC) for a low-income Filipino population. Participants (n = 160) completed the FBC on 2 occasions 3 weeks apart. Factor structure was examined using principal component analysis. For internal consistency, Cronbach α was calculated. For test-retest reliability, Spearman correlation or intraclass correlation coefficient (ICC) was calculated between scores at the 2 points. All but 1 item loaded on 6 factors: fruit and vegetable quantity, fruit and vegetable variety, fast food, sweetened beverage, healthy fat, and diet quality. Cronbach α was .75 for the total scale (range, .39-.76 for subscales). Spearman correlation was 0.78 (ICC, 0.79) for the total scale (range, 0.66-0.80 [ICC, 0.68-0.80] for subscales). The FBC demonstrated adequate factorial validity, test-retest reliability, and internal consistency. With additional testing, the FBC may be used to evaluate the US Department of Agriculture's nutrition education programs for Tagalog speakers. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Reliability And Validity Of Turkish Version Of Motor Activity Log-28

Directory of Open Access Journals (Sweden)

Burcu Ersöz Hüseyinsinoğlu

2011-06-01

Full Text Available OBJECTIVE: The aim of this study was to adapt the Motor Activity Log-28 (MAL-28 into Turkish and probe the reliability and validity of this questionnaire in stroke patients. METHODS: Following the translation of the MAL-28 into Turkish, its reliability and construct validity was examined in 30 stroke patients. For the reliability study, patients were interviewed twice within a three day period, during which no rehabilitative activities were undertaken. The test-retest reliability was determined by using intra-class correlation coefficient (ICC and Spearman correlation coefficient (r; internal consistency was determined by Cronbach's alpha (α. The construct validity was examined by comparing MAL-28 Quality Of Movement (QOM scale and Amount Of Use (AOU scale with Wolf Motor Function Test (WMFT-Performance Time (PT and Functional Ability (FA scores. Furthermore, item-to-scale correlations of AOU and QOM scales were determined and correlation between totol scores of two scales was examined. RESULTS: Turkish version of MAL-28 AOU and QOM scales were reliable (ICC scores were 0.97 and 0.96, respectively and internally consistent (Cronbach’s α value was 0.96 for both scales. Test-retest reliability was supported (AOU, r=0.94; QOM, r=0.93. WMFT FA scores was correlated with both scales (r=0.63. Correlation between WMFT PT and AOU and QOM scales were -0.56 and -0.55. AOU and QOM scales were highly correlated (r=0.95. CONCLUSION: The findings indicate that Turkish version of MAL-28 is reliable and valid in individuals with stroke. Further investigation about its responsiveness is needed before using that version as a primary measurement in clinical trials
Test-Retest Reliability of fMRI During Nonverbal Semantic Decisions in Moderate-Severe Nonfluent Aphasia Patients

Directory of Open Access Journals (Sweden)

Jacquie Kurland

2004-01-01

Full Text Available Cortical reorganization in poststroke aphasia is not well understood. Few studies have investigated neural mechanisms underlying language recovery in severe aphasia patients, who are typically viewed as having a poor prognosis for language recovery. Although test-retest reliability is routinely demonstrated during collection of language data in single-subject aphasia research, this is rarely examined in fMRI studies investigating the underlying neural mechanisms in aphasia recovery.
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

Science.gov (United States)

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions.Psycometric properties were analyzed including internal consistency, measured with Cronbach´s Alpha, test-retest reliability at 15 days with the interclass correlation coefficient. Results: The internal consistency, validation, was an Alpha of 0,808, evaluated as good. The test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and it´s cultural adaptation to Argentinian-Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.
Reliability, validity and sensitivity to change of neurogenic bowel dysfunction score in patients with spinal cord injury

DEFF Research Database (Denmark)

Erdem, D.; Hava, D.; Keskinoglu, P.

2017-01-01

cord injury (SCI). The reliability of NBD score was assessed by test-retest reliability and internal consistency. Cronbach's alpha coefficient was calculated to determine internal consistency. The construct validity was evaluated by exploring correlations between the NBD score and SF-36 scales, patient...... assessment of impact of NBD on quality of life (QoL) and the physician global assessment (PGA). The Global Rating of Change (GRC) scale was used to assess the change of NBD to investigate the sensitivity of the score to change. Results: Cronbach's alpha coefficient was 0.547. In test-retest reliability...
A Turkish version of myocardial infarction dimensional assessment scale (TR-MIDAS): reliability-validity assesment.

Science.gov (United States)

Uysal, Hilal; Ozcan, Şeyda

2011-06-01

Many new measuring devices have been developed so that broader psychometric measurements in the coronary artery disease, disease-specific health status measurements, and identification of the broader quality of life can be performed in the recent years. The study was intended to determine whether, and to what extent, MIDAS is a valid and reliable measurement to the patients suffering from myocardial infarction for the first time in Turkey. The research was conducted with the patients hospitalized and treated with myocardial infarction in the cardiology departments of 2 hospitals in Istanbul, Turkey, between 2007 and 2008. Psychometric evaluations of TR-MIDAS were used for validity studies; language validity, content validity, construct validity were examined. For reliability studies; the tool's internal consistency reliability, Cronbach's alpha reliability coefficient, and test-retest reliability were completed. The instrument's content validity index was determined to be "0.95". Principal component analysis revealed six factors with an eigenvalue >1.5. Cronbach's alpha was found to be 0.89 for total scale which was an acceptable value. The total's test-retest reliability was 0.51 (p<0.01). Data obtained at the end of the study supports that Turkish Myocardial Infarction Dimensional Assessment Scale is a valid and reliable instrument as a disease-specific scale to assess the patients' quality of life suffering from myocardial infarction in Turkey. Copyright © 2010 European Society of Cardiology. Published by Elsevier B.V. All rights reserved.
The test-retest reliability of anatomical co-ordinate axes definition for the quantification of lower extremity kinematics during running.

Science.gov (United States)

Sinclair, Jonathan; Taylor, Paul John; Greenhalgh, Andrew; Edmundson, Christopher James; Brooks, Darrell; Hobbs, Sarah Jane

2012-12-01

Three-dimensional (3-D) kinematic analyses are used widely in both sport and clinical examinations. However, this procedure depends on reliable palpation of anatomical landmarks and mal-positioning of markers between sessions may result in improperly defined segment co-ordinate system axes which will produce in-consistent joint rotations. This had led some to question the efficacy of this technique. The aim of the current investigation was to assess the reliability of the anatomical frame definition when quantifying 3-D kinematics of the lower extremities during running. Ten participants completed five successful running trials at 4.0 m·s(-1) ± 5%. 3-D angular joint kinematics parameters from the hip, knee and ankle were collected using an eight camera motion analysis system. Two static calibration trials were captured. The first (test) was conducted prior to the running trials following which anatomical landmarks were removed. The second was obtained following completion of the running trials where anatomical landmarks were re-positioned (retest). Paired samples t-tests were used to compare 3-D kinematic parameters quantified using the two static trials, and intraclass correlations were employed to examine the similarities between the sagittal, coronal and transverse plane waveforms. The results indicate that no significant (p>0.05) differences were found between test and retest 3-D kinematic parameters and strong (R(2)≥0.87) correlations were observed between test and retest waveforms. Based on the results obtained from this investigation, it appears that the anatomical co-ordinate axes of the lower extremities can be defined reliably thus confirming the efficacy of studies using this technique.
Hip abduction-adduction strength and one-leg hop tests: test-retest reliability and relationship to function in elite ice hockey players.

Science.gov (United States)

Kea, J; Kramer, J; Forwell, L; Birmingham, T

2001-08-01

Single group, test-retest. To determine: (1) hip abduction and adduction torques during concentric and eccentric muscle actions, (2) medial and lateral one-leg hop distances, (3) the test-retest reliability of these measurements, and (4) the relationship between isokinetic measures of hip muscle strength and hop distances in elite ice hockey players. The skating motion used in ice hockey requires strong contractions of the hip and knee musculature. However, baseline scores for hip strength and hop distances, their test-retest reliability, and measures of the extent to which these tests are related for this population are not available. The dominant leg of 27 men (mean age 20 +/- 3 yrs) was tested on 2 occasions. Hip abduction and adduction movements were completed at 60 degrees.s(-1) angular velocity, with the subject lying on the non-test side and the test leg moving vertically in the subject's coronal plane. One-leg hops requiring jumping from and landing on the same leg without losing balance were completed in the medial and lateral directions. Hip adduction torques were significantly greater than abduction torques during both concentric and eccentric muscle actions, while no significant difference was observed between medial and lateral hop distances. Although hop test scores produced excellent ICCs (> 0.75) when determined using scores on 1 occasion, torques needed to be averaged over 2 test occasions to reach this level. Correlations between the strength and hop tests ranged from slight to low (r = -0.26 to 0.27) and were characterized by wide 95% confidence intervals (-0.54 to 0.61). Isokinetic tests of hip abduction and adduction did not provide a strong indication of performance during sideways hop tests. Although isokinetic tests can provide a measure of muscular strength under specific test conditions, they should not be relied upon as a primary indicator of functional abilities or readiness to return to activity.
Reliability and Validity of the Work and Well-Being Inventory (WBI) for Employees.

Science.gov (United States)

Vendrig, A A; Schaafsma, F G

2018-06-01

Purpose The purpose of this study is to measure the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool that is used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods Over the years seven different samples of workers, patients and sick listed workers varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results The 13 scales displayed good internal consistency and test-retest reliability. The constructive validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmative factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore the work ability.
Test-retest reliability of speech-evoked auditory brainstem response in healthy children at a low sensation level.

Science.gov (United States)

Zakaria, Mohd Normani; Jalaei, Bahram

2017-11-01

Auditory brainstem responses evoked by complex stimuli such as speech syllables have been studied in normal subjects and subjects with compromised auditory functions. The stability of speech-evoked auditory brainstem response (speech-ABR) when tested over time has been reported but the literature is limited. The present study was carried out to determine the test-retest reliability of speech-ABR in healthy children at a low sensation level. Seventeen healthy children (6 boys, 11 girls) aged from 5 to 9 years (mean = 6.8 ± 3.3 years) were tested in two sessions separated by a 3-month period. The stimulus used was a 40-ms syllable /da/ presented at 30 dB sensation level. As revealed by pair t-test and intra-class correlation (ICC) analyses, peak latencies, peak amplitudes and composite onset measures of speech-ABR were found to be highly replicable. Compared to other parameters, higher ICC values were noted for peak latencies of speech-ABR. The present study was the first to report the test-retest reliability of speech-ABR recorded at low stimulation levels in healthy children. Due to its good stability, it can be used as an objective indicator for assessing the effectiveness of auditory rehabilitation in hearing-impaired children in future studies. Copyright © 2017 Elsevier B.V. All rights reserved.
Validity and Reliability of Farsi Version of Youth Sport Environment Questionnaire.

Science.gov (United States)

Eshghi, Mohammad Ali; Kordi, Ramin; Memari, Amir Hossein; Ghaziasgar, Ahmad; Mansournia, Mohammad-Ali; Zamani Sani, Seyed Hojjat

2015-01-01

The Youth Sport Environment Questionnaire (YSEQ) had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13-17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting.
Good validity and reliability of the forgotten joint score in evaluating the outcome of total knee arthroplasty

DEFF Research Database (Denmark)

Thomsen, Morten G; Latifi, Roshan; Kallemose, Thomas

2016-01-01

. We investigated the validity and reliability of the FJS. Patients and methods - A Danish version of the FJS questionnaire was created according to internationally accepted standards. 360 participants who underwent primary TKA were invited to participate in the study. Of these, 315 were included...... in a validity study and 150 in a reliability study. Correlation between the Oxford knee score (OKS) and the FJS was examined and test-retest evaluation was performed. A ceiling effect was defined as participants reaching a score within 15% of the maximum achievable score. Results - The validity study revealed...... of the FJS (ICC? 0.79). We found a high level of internal consistency (Cronbach's? = 0.96). The ceiling effect for the FJS was 16%, as compared to 37% for the OKS. Interpretation - The FJS showed good construct validity and test-retest reliability. It had a lower ceiling effect than the OKS. The FJS appears...

Validity and Reliability of the Arabic Version of the Positive and Negative Syndrome Scale.

Science.gov (United States)

Yehya, Arij; Ghuloum, Suhaila; Mahfoud, Ziyad; Opler, Mark; Khan, Anzalee; Hammoudeh, Samer; Abdulhakam, Abdulmoneim; Al-Mujalli, Azza; Hani, Yahya; Elsherbiny, Reem; Al-Amin, Hassen

The Positive and Negative Syndrome Scale (PANSS) is widely used for patients with schizophrenia. This scale is reliable and valid. The PANSS was translated and validated in several languages. The aim of this study was to translate and validate the PANSS in the Arab population. The PANSS was translated into formal Arabic language using the back-translation method. 101 Arab patients with schizophrenia and 98 Arabs with no diagnosis of any mental disorder were recruited. The Arabic version of the Mini International Neuropsychiatric Interview (MINI-6) was used as a diagnostic tool to confirm the diagnosis of schizophrenia or rule out any diagnosis for the healthy control group. Reliability of the scale was assessed by calculating internal consistency, interrater reliability and test-retest reliability. Construct validity was assessed using the Arabic version of the MINI-6. PANSS total scores were correlated with the Clinical Global Impression-Severity scale. Our findings showed that the internal consistency was good (0.92). Scores on the PANSS of the patients were much higher than those of the healthy controls. The PANSS showed good interrater reliability and test-retest reliability (0.92 and 0.75, respectively). In comparison with the MINI-6, the PANSS showed good sensitivity and specificity, which implies good construct validity of this version. In conclusion, the Arabic version of the PANSS is a reliable and valid instrument for the assessment of patients with schizophrenia in the Arab population. © 2016 S. Karger AG, Basel.
Reliability and validity of the Attributional Style Questionnaire- Survey in people with multiple sclerosis

Science.gov (United States)

Kneebone, Ian I.; Dewar, Sophie J.

2016-01-01

Background: The current study aimed to examine the psychometric properties of an attributional style measure that can be administered remotely, to people who have multiple sclerosis (MS). Methods: A total of 495 participants with MS were recruited. Participants completed the Attributional Style Questionnaire-Survey (ASQ-S) and two comparison measures of cognitive variables via postal survey on three occasions, each 12 months apart. Internal reliability, test-retest reliability and congruent validity were considered. Results: The internal reliability of the ASQ-S was good (α > 0.7). The test-retest correlations were significant, but failed to reach the 0.7 set. The congruent validity of the ASQ-S was established relative to the comparisons. Conclusions: The psychometric properties of the ASQ-S indicate that it shows promise as a tool for researchers investigating depression in people with MS and is likely sound to use clinically in this population. PMID:28450893
Reliability, validity and minimal detectable change of the Mini-BESTest in Greek participants with chronic stroke.

Science.gov (United States)

Lampropoulou, Sofia I; Billis, Evdokia; Gedikoglou, Ingrid A; Michailidou, Christina; Nowicky, Alexander V; Skrinou, Dimitra; Michailidi, Fotini; Chandrinou, Danae; Meligkoni, Margarita

2018-02-23

This study aimed to investigate the psychometric characteristics of reliability, validity and ability to detect change of a newly developed balance assessment tool, the Mini-BESTest, in Greek patients with stroke. A prospective, observational design study with test-retest measures was conducted. A convenience sample of 21 Greek patients with chronic stroke (14 male, 7 female; age of 63 ± 16 years) was recruited. Two independent examiners administered the scale, for the inter-rater reliability, twice within 10 days for the test-retest reliability. Bland Altman Analysis for repeated measures assessed the absolute reliability and the Standard Error of Measurement (SEM) and the Minimum Detectable Change at 95% confidence interval (MDC 95% ) were established. The Greek Mini-BESTest (Mini-BESTest GR ) was correlated with the Greek Berg Balance Scale (BBS GR ) for assessing the concurrent validity and with the Timed Up and Go (TUG), the Functional Reach Test (FRT) and the Greek Falls Efficacy Scale-International (FES-I GR ) for the convergent validity. The Mini-BESTestGR demonstrated excellent inter-rater reliability (ICC (95%CI) = 0.997 (0.995-0.999, SEM = 0.46) with the scores of two raters within the limits of agreement (mean dif = -0.143 ± 0.727, p > 0.05) and test-retest reliability (ICC (95%CI) = 0.966 (0.926-0.988), SEM = 1.53). Additionally, the Mini-BESTest GR yielded very strong to moderate correlations with BBS GR (r = 0.924, p reliability and the equally good validity of the Mini-BESTest GR , strongly support its utility in Greek people with chronic stroke. Its ability to identify clinically meaningful changes and falls risk need further investigation.
The interrater and test-retest reliability of the Home Falls and Accidents Screening Tool (HOME FAST) in Malaysia: Using raters with a range of professional backgrounds.

Science.gov (United States)

Romli, Muhammad Hibatullah; Mackenzie, Lynette; Lovarini, Meryl; Tan, Maw Pin; Clemson, Lindy

2017-06-01

Falls can be a devastating issue for older people living in the community, including those living in Malaysia. Health professionals and community members have a responsibility to ensure that older people have a safe home environment to reduce the risk of falls. Using a standardised screening tool is beneficial to intervene early with this group. The Home Falls and Accidents Screening Tool (HOME FAST) should be considered for this purpose; however, its use in Malaysia has not been studied. Therefore, the aim of this study was to evaluate the interrater and test-retest reliability of the HOME FAST with multiple professionals in the Malaysian context. A cross-sectional design was used to evaluate interrater reliability where the HOME FAST was used simultaneously in the homes of older people by 2 raters and a prospective design was used to evaluate test-retest reliability with a separate group of older people at different times in their homes. Both studies took place in an urban area of Kuala Lumpur. Professionals from 9 professional backgrounds participated as raters in this study, and a group of 51 community older people were recruited for the interrater reliability study and another group of 30 for the test-retest reliability study. The overall agreement was moderate for interrater reliability and good for test-retest reliability. The HOME FAST was consistently rated by different professionals, and no bias was found among the multiple raters. The HOME FAST can be used with confidence by a variety of professionals across different settings. The HOME FAST can become a universal tool to screen for home hazards related to falls. © 2017 John Wiley & Sons, Ltd.
The Turkish version of the Physical Activity Scale for the Elderly (PASE): its cultural adaptation, validation, and reliability.

Science.gov (United States)

Ayvat, Ender; Kilinç, Muhammed; Kirdi, Nuray

2017-06-12

This study aimed to describe the cultural adaptation of the Turkish Physical Activity Scale for the Elderly (PASE) and to examine the reliability and validity of the scale in older Turkish adults. Eighty elderly people were recruited for the study. The assessments included the PASE, the International Physical Activity Questionnaire (IPAQ), the Short Physical Performance Battery and Short Form-36 Quality of Life Questionnaire (SF-36), and the Mini Mental State Test. Outcome measures were conducted twice within a week (test-retest) for reliability. Cronbach's α coefficient was 0.714 for the initial evaluation. The intraclass correlation coefficient for the test-retest reliability was 0.995 with a 95% confidence interval of 0.993-0.997. A high level of positive correlation (0.742, P reliable and valid scale for the fields of research and practice.
Internal consistency, reliability, and temporal stability of the Oxford Happiness Questionnaire short-form: Test-retest data over two weeks

OpenAIRE

MCGUCKIN, CONOR

2006-01-01

PUBLISHED The Oxford Happiness Questionnaire short-form is a recently developed eight-item measure of happiness. This study evaluated the internal consistency reliability and test-retest reliability of the Oxford Happiness Questionnaire short-form among 55 Northern Irish undergraduate university students who completed the measure on two occasions separated by two weeks. Internal consistency of the measure on both occasions was satisfactory at both Time 1 (alpha = .62) and Time 2 (alpha = ....
Reliability and Validity of Ten Consumer Activity Trackers Depend on Walking Speed

NARCIS (Netherlands)

Fokkema, Tryntsje; Kooiman, Thea J. M.; Krijnen, Wim P.; Van der Schans, Cees P.; De Groot, Martijn

Purpose: To examine the test-retest reliability and validity of ten activity trackers for step counting at three different walking speeds. Methods: Thirty-one healthy participants walked twice on a treadmill for 30 min while wearing 10 activity trackers (Polar Loop, Garmin Vivosmart, Fitbit Charge
Reliability and Validity of the Beijing Version of the Montreal Cognitive Assessment in the Evaluation of Cognitive Function of Adult Patients with OSAHS.

Science.gov (United States)

Chen, Xiong; Zhang, Rui; Xiao, Ying; Dong, Jiaqi; Niu, Xun; Kong, Weijia

2015-01-01

The patients with obstructive sleep apnea hypopnea syndrome (OSAHS) tend to develop cognitive deficits, which usually go unrecognized, and can affect their daily life. The Beijing version of the Montreal cognitive assessment (MoCA-BJ), a Chinese version of MoCA, has been used for the assessment of cognitive functions of OSAHS patients in clinical practice. So far, its reliability and validity have not been tested. This study examined the reliability and validity of MoCA-BJ in a cohort of adult OSAHS patients. 152 OSAHS patients, ranging from mild, moderate to severe, 49 primary snoring subjects and 40 normal controls were evaluated for cognitive functions by employing both MoCA-BJ and the Mini Mental State Examination (MMSE). Forty of them were re-tested by MoCA-BJ 14 days after the first test. Internal consistency, test-retest reliability, discriminate and concurrent validity of MoCA-BJ were analyzed. Internal consistency reliability by Cronbach's alpha was adequate (0.73). Intra-class correlation coefficient (ICC), an measure of test-retest reliability, was 0.87 (Preliable and stable. The MoCA-BJ was capable of detecting cognitive dysfunction by visuospatial and total MoCA-BJ score.
Reliability and validity of television food advertising questionnaire in Malaysia.

Science.gov (United States)

Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa

2015-09-01

Interventions to counter the influence of television food advertising amongst children are important. Thus, reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
The Brief Multidimensional Students' Life Satisfaction Scale (BMSLSS): Reliability, validity, and gender invariance in an Indian adolescent sample.

Science.gov (United States)

Hashim, Jayana; Areepattamannil, Shaljan

2017-06-01

This study examined the internal consistency reliability, factorial, convergent, discriminant, and predictive validity, as well as gender invariance of the Brief Multidimensional Students' Life Satisfaction Scale (BMSLSS; Seligson, Huebner, & Valois, 2003) in a sample of 445 adolescents (M age = 16.04 years) hailing from the southernmost state of India, Kerala. The study also examined the test-retest reliability (n = 392) of the BMSLSS. The Cronbach's alpha coefficient suggested that the BMSLSS was reliable. Confirmatory factor analyses demonstrated the factorial validity of the BMSLSS. Bivariate correlational analyses provided support for the convergent, discriminant, and predictive validity of the BMSLSS. The test-retest reliability coefficient indicated the temporal stability of the BMSLSS. Finally, multi-group confirmatory factor analysis provided support for the gender invariance of the BMSLSS. Copyright © 2017 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.
Validation and reliability of a Behcet's Syndrome Activity Scale in Korea.

Science.gov (United States)

Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo

2016-01-01

We prepared a cross-cultural adaptation of the Behcet's Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Fifty patients with Behcet's disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet's Disease Current Activity Form (BDCAF) and a Behcet's Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). The Korean version of BSAS is a reliable and valid instrument to measure BD activity.
Reliability and validity of the Children's Fear Survey Schedule-Dental Subscale for Arabic-speaking children: a cross-sectional study.

Science.gov (United States)

El-Housseiny, Azza A; Alsadat, Farah A; Alamoudi, Najlaa M; El Derwi, Douaa A; Farsi, Najat M; Attar, Moaz H; Andijani, Basil M

2016-04-14

Early recognition of dental fear is essential for the effective delivery of dental care. This study aimed to test the reliability and validity of the Arabic version of the Children's Fear Survey Schedule-Dental Subscale (CFSS-DS). A school-based sample of 1546 children was randomly recruited. The Arabic version of the CFSS-DS was completed by children during class time. The scale was tested for internal consistency and test-retest reliability. To test criterion validity, children's behavior was assessed using the Frankl scale during dental examination, and results were compared with children's CFSS-DS scores. To test the scale's construct validity, scores on "fear of going to the dentist soon" were correlated with CFSS-DS scores. Factor analysis was also used. The Arabic version of the CFSS-DS showed high reliability regarding both test-retest reliability (intraclass correlation = 0.83, p children with negative behavior had significantly higher fear scores (t = 13.67, p fear of invasive dental procedures," "fear of less invasive dental procedures" and "fear of strangers." The Arabic version of the CFSS-DS is a reliable and valid measure of dental fear in Arabic-speaking children. Pediatric dentists and researchers may use this validated version of the CFSS-DS to measure dental fear in Arabic-speaking children.
TEST-RETEST RELIABILITY OF HAND GRIP STRENGTH MEASUREMENT USING A JAMAR HAND DYNAMOMETER IN PATIENTS WITH ACUTE AND CHRONIC CERVICAL RADICULOPATHY

Directory of Open Access Journals (Sweden)

Ejazi G

2017-12-01

Full Text Available Background: To evaluate the test-retest reliability of Jamar hand held dynamometer for measuring handgrip strength (HGS in patients with acute and chronic cervical radiculopathy and to find out the difference in measurement of the handgrip strength between acute and chronic cervical radiculopathy. Methods: A prospective, observational and non-experimental, the comparative study design was used. A sample of 72 subjects (37 women and 35 men suffering from cervical radiculopathy were divided into two groups i.e., Group A(acute and Group B(chronic, handgrip strength was measured using Jamar hand held dynamometer on two occasions by the same rater with an interval of 7-days. Data collection was based on standard guidelines of American Society of Hand Therapists. Three gripping trials (measured in Kg with patient’s arm in standardized arm position were recorded. The data was analyzed from the mean score obtained from the sample. Result: One-way Analysis of Variance(ANOVA was used to evaluate test-retest reliability and Tukey-Kramer Multiple Comparison Test used to find the difference between handgrip strength among acute and chronic Cervical radiculopathy cases. Greater P-value (>0.05 in both testing session, as well as 95% of the confidence interval, shows the reliability of the instrument and lesser p-value (0.05 in female subjects shows no significant difference in handgrip strength between the two groups. Conclusion: Excellent test-retest reliability for hand grip strength measurement was measured in patients with acute and chronic cervical radiculopathy shows that the equipment could be used as an assessment tool for this patient and significant difference exists among male handgrip strength between acute and chronic cervical radiculopathy cases whereas no difference exists among female handgrip strength between acute and chronic cervical radiculopathy cases.
Validity and reliability of the South African health promoting schools monitoring questionnaire.

Science.gov (United States)

Struthers, Patricia; Wegner, Lisa; de Koker, Petra; Lerebo, Wondwossen; Blignaut, Renette J

2017-04-01

Health promoting schools, as conceptualised by the World Health Organisation, have been developed in many countries to facilitate the health-education link. In 1994, the concept of health promoting schools was introduced in South Africa. In the process of becoming a health promoting school, it is important for schools to monitor and evaluate changes and developments taking place. The Health Promoting Schools (HPS) Monitoring Questionnaire was developed to obtain opinions of students about their school as a health promoting school. It comprises 138 questions in seven sections: socio-demographic information; General health promotion programmes; health related Skills and knowledge; Policies; Environment; Community-school links; and support Services. This paper reports on the reliability and face validity of the HPS Monitoring Questionnaire. Seven experts reviewed the questionnaire and agreed that it has satisfactory face validity. A test-retest reliability study was conducted with 83 students in three high schools in Cape Town, South Africa. The kappa-coefficients demonstrate mostly fair (κ-scores between 0.21 and 0.4) to moderate (κ-scores between 0.41 and 0.6) agreement between test-retest General and Environment items; poor (κ-scores up to 0.2) agreement between Skills and Community test-retest items, fair agreement between Policies items, and for most of the questions focussing on Services a fair agreement was found. The study is a first effort at providing a tool that may be used to monitor and evaluate students' opinions about changes in health promoting schools. Although the HPS Monitoring Questionnaire has face validity, the results of the reliability testing were inconclusive. Further research is warranted. © The Author 2016. Published by Oxford University Press.
Reliability and validity of the Japanese version of the Resilience Scale and its short version.

Science.gov (United States)

Nishi, Daisuke; Uehara, Ritei; Kondo, Maki; Matsuoka, Yutaka

2010-11-17

The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS) and short version of the RS (RS-14). The original English version of RS was translated to Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D), Rosenberg Self-Esteem Scale (RSES), Social Support Questionnaire (SSQ), Perceived Stress Scale (PSS), and Sheehan Disability Scale (SDS) were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p reliability, and relatively low concurrent validity. RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
Reliability and validity of a Chinese version of the Diagnostic Interview for Borderlines-Revised.

Science.gov (United States)

Wang, Lanlan; Yuan, Chenmei; Qiu, Jianying; Gunderson, John; Zhang, Min; Jiang, Kaida; Leung, Freedom; Zhong, Jie; Xiao, Zeping

2014-09-01

Borderline personality disorder (BPD) is the most studied of the axis II disorders. One of the most widely used diagnostic instruments is the Diagnostic Interview for Borderline Patients-Revised (DIB-R). The aim of this study was to test the reliability and validity of DIB-R for use in the Chinese culture. The reliability and validity of the DIB-R Chinese version were assessed in a sample of 236 outpatients with a probable BPD diagnosis. The Structured Clinical Interview for DSM-IV Personality Disorders (SCID-II) was used as a standard. Test-retest reliability was tested six months later with 20 patients, and inter-rater reliability was tested on 32 patients. The Chinese version of the DIB-R showed good internal global consistency (Cronbach's α of 0.916), good test-retest reliability (Pearson correlation of 0.704), good inter-rater reliability (intra-class correlation coefficient of 0.892 and kappa of 0.861). When compared with the DSM-IV diagnosis as measured by the SCID-II, the DIB-R showed relatively good sensitivity (0.768) and specificity (0.891) at the cutoff of 7, moderate diagnostic convergence (kappa of 0.631), as well as good discriminating validity. The Chinese version of the DIB-R has good psychometric properties, which renders it a valuable method for examining the presence, the severity, and component phenotypes of BPD in Chinese samples. © 2013 Wiley Publishing Asia Pty Ltd.
Test-retest paradigm of the forced swimming test in female mice is not valid for predicting antidepressant-like activity: participation of acetylcholine and sigma-1 receptors.

Science.gov (United States)

Su, Jing; Hato-Yamada, Noriko; Araki, Hiroaki; Yoshimura, Hiroyuki

2013-01-01

The forced swimming test (FST) in mice is widely used to predict the antidepressant activity of a drug, but information describing the immobility of female mice is limited. We investigated whether a prior swimming experience affects the immobility duration in a second FST in female mice and whether the test-retest paradigm is a valid screening tool for antidepressants. Female ICR mice were exposed to the FST using two experimental paradigms: a single FST and a double FST in which mice had experienced FST once 24 h prior to the second trail. The initial FST experience reliably prolonged immobility duration in the second FST. The antidepressants imipramine and paroxetine significantly reduced immobility duration in the single FST, but not in the double FST. Scopolamine and the sigma-1 (σ1) antagonist NE-100 administered before the second trial significantly prevented the prolongation of immobility. Neither a 5-HT1A nor a 5-HT2A receptor agonist affected immobility duration. We suggest that the test-retest paradigm in female mice is not adequate for predicting antidepressant-like activity of a drug; the prolongation of immobility in the double FST is modulated through acetylcholine and σ1 receptors.
Determining Reliability and Validity of the Persian Version of Software Usability Measurements Inventory (SUMI) Questionnaire

OpenAIRE

seyed abolfazl zakerian; Roya Azizi; Mehdi Rahgozar

2013-01-01

The term usability refers to a special index for success of an operating system. This study aimed to determine the reliability and validity of the Software Usability Measurements Inventory (SUMI) questionnaire as one of the valid and common questionnaires about usability evaluation. The back translation method was used to translate the questionnaire from English to Persian back to English. Moreover, repeatability or test-retest reliability was practically used to determine the reliability of ...
Validity and Reliability of Farsi Version of Youth Sport Environment Questionnaire

Directory of Open Access Journals (Sweden)

Mohammad Ali Eshghi

2015-01-01

Full Text Available The Youth Sport Environment Questionnaire (YSEQ had been developed from Group Environment Questionnaire, a well-known measure of team cohesion. The aim of this study was to adapt and examine the reliability and validity of the Farsi version of the YSEQ. This version was completed by 455 athletes aged 13–17 years. Results of confirmatory factor analysis indicated that two-factor solution showed a good fit to the data. The results also revealed that the Farsi YSEQ showed high internal consistency, test-retest reliability, and good concurrent validity. This study indicated that the Farsi version of the YSEQ is a valid and reliable measure to assess team cohesion in sport setting.
Test-retest reliability of schizoaffective disorder compared with schizophrenia, bipolar disorder, and unipolar depression--a systematic review and meta-analysis.

Science.gov (United States)

Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher

2015-11-01

Schizoaffective disorder is a frequent diagnosis, and its reliability is subject to ongoing discussion. We compared the diagnostic reliability of schizoaffective disorder with its main differential diagnoses. We systematically searched Medline, Embase, and PsycInfo for all studies on the test-retest reliability of the diagnosis of schizoaffective disorder as compared with schizophrenia, bipolar disorder, and unipolar depression. We used meta-analytic methods to describe and compare Cohen's kappa as well as positive and negative agreement. In addition, multiple pre-specified and post hoc subgroup and sensitivity analyses were carried out. Out of 4,415 studies screened, 49 studies were included. Test-retest reliability of schizoaffective disorder was consistently lower than that of schizophrenia (in 39 out of 42 studies), bipolar disorder (27/33), and unipolar depression (29/35). The mean difference in kappa between schizoaffective disorder and the other diagnoses was approximately 0.2, and mean Cohen's kappa for schizoaffective disorder was 0.50 (95% confidence interval: 0.40-0.59). While findings were unequivocal and homogeneous for schizoaffective disorder's diagnostic reliability relative to its three main differential diagnoses (dichotomous: smaller versus larger), heterogeneity was substantial for continuous measures, even after subgroup and sensitivity analyses. In clinical practice and research, schizoaffective disorder's comparatively low diagnostic reliability should lead to increased efforts to correctly diagnose the disorder. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

Reliability and Validity of the Temperament and Character Inventory

Directory of Open Access Journals (Sweden)

Mahboubeh Dadfar

2010-10-01

Full Text Available Objective: The Temperament and Character Inventory (TCI was developed to assess temperament including Novelty Seeking (NS, Harm Avoidance (HA, Reward Dependence (RD, Persistence (PS, and Character including Self-Directedness (SD, Cooperativeness (CO and Self Transcendence (ST dimensions of Cloninger's biopsychosocial model of personality in adults. The purpose of this study was to evaluate the reliability and validity of this inventory. Materials & Methods: In this validity test and standardization study, after translation of TCI into Farsi and back translation, the final form was prepared and administered to 220 students who were selected via simple sampling. Cronbach's alpha procedure and test-retest method was used to assess the reliability, and factor analysis of promax rotation was utilized to determine the validity of the inventory. Correlation of interscales and age with scales of TCI was calculated by Pearson correlation. A comparison of TCI scores between sex and also cross-cultural was down using independent t-test. Results: The alpha cofficients for the inventory ranged from 0.44 for the Persistence scale to 0.81 for the ST scale with a median 0f 0.68. The overall alpha cofficients for the whole inventory was 0.74. The Pearson correlation cofficient for the test-retest on 31 students after two months ranged from 0.53 for Novelty Seeking and Persistence to 0.82 for Harm Avoidance scales and from 0.24 for disorderliness vs regimentation (NS4 to 0.86 for fear of uncertainty vs self-confidene (HA2 subscales. The factor analysis showed six factors. Significant correlations were obtained between scales of Self–Directedness with Harm Avoidance (0.57, Self–Directedness with Cooperativeness (0.46. Conclusion: The current study confirms that Persian version of the Temperament and Character Inventory has satisfactory psychometric properties and acceptable reliability and validity for the use students of university population.
VALIDITY OF THE MODIFIED CONCONI TEST FOR DETERMINING VENTILATORY THRESHOLD DURING ON-WATER ROWING

Directory of Open Access Journals (Sweden)

Jorge Villamil Cabo

2011-12-01

Full Text Available The objectives of this study were to design a field test based on the Conconi protocol to determine the ventilatory threshold of rowers and to test its reliability and validity. A group of sixteen oarsmen completed a modified Conconi test for on-water rowing. The reliability of the detection of the heart rate threshold was evaluated using heart rate breaking point in the Conconi test and retest. Heart rate threshold was detected in 88.8% of cases in the test-retest. The validity of the modified Conconi test was evaluated by comparing the heart rate threshold data acquired with that obtained in a ventilatory threshold test (VT2. No significant differences were found for the values of different intensity parameters i.e. heart rate (HR, oxygen consumption (VO2, stroke rate (SR and speed (S between the heart rate threshold and the ventilatory threshold, (170.9 ± 6.8 vs. 169.3 ± 6.4 beats·min-1; 42.0 ± 8.6 vs. 43.5 ± 8.3 ml·kg-1·min-1; 25.8 ± 3.3 vs. 27.0 ± 3.2 strokes·min-1 and 14.4 ± 0.8 vs. 14.6 ± 0.8 km·h-1. The differences in averages obtained in the Conconi test-retest were small with a low standard error of the mean. The reliability data between the Conconi test-retest showed low coefficients of variations (CV and high intraclass correlation coefficients (ICC. The total errors for the Conconi test-retest are low for the measured variables (1.31 HR, 0.87 VO2, 0.65 SR, and 0.1 S. The Bland- Altman's method for analysis validity showed a strong concordance according to the analyzed variables. We conclude that the modified Conconi test for on-water rowing is a valid and reliable method for the determination of the second ventilatory threshold (VT2.
The reliability and validity of the Danish Draft Board Cognitive Ability Test: Børge Prien's Prøve.

Science.gov (United States)

Teasdale, Thomas W; Hartmann, Peter V W; Pedersen, Christoffer H; Bertelsen, Mette

2011-04-01

The Danish Draft Board has used the same test for assessing general cognitive ability, the Børge Prien's Prøve (BPP), for over 50 years during which time all men on reaching the age of 18 become liable for conscription. Data from the test has, over the decades, been used in numerous and wide-ranging research studies. Nonetheless, owing to the special circumstances of its administration, some psychometric properties, which are generally assessed for psychological tests, have not previously been investigated for the BPP. First, since the test is only used at the assessment phase, retesting with the BPP occurs only rarely and under exceptional circumstances. Therefore, its Test-Retest reliability has hitherto not been documented. Second, questions have often been raised as to whether the validity of the BPP is undermined by either a lack of motivation and under-performing among some of the men taking the test, being, as they are, compelled to do so, and/or by gradual obsolescence of the test over the decades of its use. We here present findings from three new studies to show that (a) the BPP has a satisfactory Test-Retest reliability, r=0.77, (b) BPP test scores are not positively associated with expressed attitude to being called upon to serve conscription and (c) the correlation between the BPP and a measure of educational level has remained stable (at about 0.5) through the last two decades. Taken together these three findings further support the continuing value of the BPP in research relating to cognitive ability. © 2010 The Authors. Scandinavian Journal of Psychology © 2010 The Scandinavian Psychological Associations.
Safety, reliability, and validity of a physiologic definition of bronchopulmonary dysplasia.

Science.gov (United States)

Walsh, Michele C; Wilson-Costello, Deanna; Zadell, Arlene; Newman, Nancy; Fanaroff, Avroy

2003-09-01

Bronchopulmonary dysplasia (BPD) is the focus of many intervention trials, yet the outcome measure when based solely on oxygen administration may be confounded by differing criteria for oxygen administration between physicians. Thus, we wished to define BPD by a standardized oxygen saturation monitoring at 36 weeks corrected age, and compare this physiologic definition with the standard clinical definition of BPD based solely on oxygen administration. A total of 199 consecutive very low birthweight infants (VLBW, 501 to 1500 g birthweight) were assessed prospectively at 36+/-1 weeks corrected age. Neonates on positive pressure support or receiving >30% supplemental oxygen were assigned the outcome BPD. Those receiving or =88% for 60 minutes) or "BPD" (saturation reliability, test-retest reliability, and validity of the physiologic definition vs the clinical definition were assessed. A total of 199 VLBW were assessed, of whom 45 (36%) were diagnosed with BPD by the clinical definition of oxygen use at 36 weeks corrected age. The physiologic definition identified 15 infants treated with oxygen who successfully passed the saturation monitoring test in room air. The physiologic definition diagnosed BPD in 30 (24%) of the cohort. All infants were safely studied. The test was highly reliable (inter-rater reliability, kappa=1.0; test-retest reliability, kappa=0.83) and highly correlated with discharge home in oxygen, length of hospital stay, and hospital readmissions in the first year of life. The physiologic definition of BPD is safe, feasible, reliable, and valid and improves the precision of the diagnosis of BPD. This may be of benefit in future multicenter clinical trials.
The Unsupported Upper Limb Exercise Test in People Without Disabilities: Assessing the Within-Day Test-Retest Reliability and the Effects of Age and Gender.

Science.gov (United States)

Oliveira, Ana; Cruz, Joana; Jácome, Cristina; Marques, Alda

2018-01-01

Purpose: To estimate the within-day test-retest reliability and standard error of measurement (SEM) of the unsupported upper limb exercise test (UULEX) in adults without disabilities and to determine the effects of age and gender on performance of the UULEX. Method: A cross-sectional study was conducted with 100 adults without disabilities (44 men, mean age 44.2 [SD 26] y; 56 women, mean age 38.1 [SD 24.1] y). Participants performed three UULEX tests to establish within-day reliability, measured using an intra-class correlation coefficient (ICC) model 2 (two-way random effects) with a single rater (ICC[2,1]) and SEM. The effects of age and gender were examined using two-factor mixed-design analysis of variance (ANOVA) and one-way repeated-measures ANOVA. For analysis purposes, four sub-groups were created: younger adults, older adults, men, and women. Results: Excellent within-day reliability and a small SEM were found in the four sub-groups (younger adults: ICC[2,1]=0.88; 95% CI: 0.82, 0.92; SEM∼40 s; older adults: ICC[2,1]=0.82; 95% CI: 0.72, 0.90; SEM∼50 s; men: ICC[2,1]=0.93; 95% CI: 0.88, 0.96; SEM∼30 s; women: ICC[2,1]=0.85; 95% CI: 0.78, 0.91; SEM∼45 s). Younger adults took, on average, 308.24 seconds longer than older adults to perform the test; older adults performed significantly better on the third test ( p 0.05). Conclusion: The within-day test-retest reliability and SEM values of the UULEX may be used to define the magnitude of the error obtained with repeated measures. One UULEX test seems to be adequate for younger adults to achieve reliable results, whereas three tests seem to be needed for older adults.
Validity and reliability of a Nigerian-Yoruba version of the stroke-specific quality of life scale 2.0.

Science.gov (United States)

Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana

2017-10-19

Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) was administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was carried out among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p validity of SS-QoL(Y) showed that items' r value ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha =0.61 to 0.82) for domains scores. These correlations were also significant for the domains and overall scores (p validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Validity and reliability of the Portuguese-Brazilian version of the Quality of Life in Epilepsy Inventory-89.

Science.gov (United States)

Azevedo, Auro Mauro; Alonso, Neide Barreira; Vidal-Dourado, Marcos; Noffs, Maria Helena da Silva; Pascalicchio, Tatiana Frascarelli; Caboclo, Luís Otávio Sales Ferreira; Ciconelli, Rozana Mesquita; Sakamoto, Américo Ceiki; Yacubian, Elza Márcia Targas

2009-03-01

The purpose of this article was to report the translation of the Quality of Life in Epilepsy Inventory-89 (QOLIE-89) into a Portuguese-Brazilian version and evaluate its reliability and validity. This study involved 105 outpatients: 54 patients with refractory temporal lobe epilepsy (TLE) with mesial temporal sclerosis (MTS) and 51 with juvenile myoclonic epilepsy (JME). Reliability and test-retest reliability were assessed. Relationships between QOLIE-89 domains and other questionnaires (Nottingham Health Profile, Beck Depression Inventory, Adverse Event Profile, Neuropsychological Evaluation), and external measures such as demographic and clinical variables were analyzed to examine construct validity. Internal consistency (Cronbach's alpha=0.73-0.92) and test-retest reliability (intraclass correlation coefficient=0.60-0.84) for individual domains were acceptable. For construct validity, we verified high correlations between the QOLIE-89 and the Nottingham Health Profile, Beck Depression Inventory, Adverse Event Profile, and Neuropsychological Evaluation. For clinical characteristics, the patients with juvenile myoclonic epilepsy had better quality-of-life scores on 11 of 17 QOLIE-89 subscales compared with patients with temporal lobe epilepsy (P<0.05). These results support the reliability and validity of the Portuguese-Brazilian translation of QOLIE-89.
[Attempt for development of rapid word reading test for children--evaluation of reliability and validity].

Science.gov (United States)

Hashimoto, Ryusaku; Kashiwagi, Mitsuru; Suzuki, Shuhei

2008-09-01

We developed a rapid word reading test for examining the phonological processing ability of Japanese children. We prepared two versions of the test, version A and B. Each test has word and non-word tasks. Twenty-two healthy boys of third grade in primary schools participated in this validation study. For criterion related validity, we performed the serial Hiragana reading test, the sentence reading test, Raven's coloured progressive matrices (RCPM), the Token test for children, the Kana word dictation test, the standardized comprehension test of abstract words (SCTAW), and Trail Circle test. The reading times of the newly developed test correlated moderately or highly with those of the serial Hiragana reading test and the sentence reading test. However, the scores of the other tests (RCPM, Token test for children, Kana word dictation test, SCTAW, Trail Circle test) did not correlated with the reading time of the rapid word reading test. Test-retest reliabilities in the word tasks were more than moderate: 0.52 and 0.76 in versions A and B, while those in the non-word tasks were high: 0.91 and 0.88 in versions A and B. The correlation coefficient between versions A and B was 0.7 for the word tasks and 0.92 for the non-word tasks. This study showed that the rapid word reading test has substantial validity and reliability for testing the phonological processing ability of Japanese children. In addition, the non-word tasks were more suitable for selectively examining the speed of the grapheme to phoneme conversion process.
Reliability and validity of a Danish version of the multiple sclerosis neuropsychological screening Questionnaire

DEFF Research Database (Denmark)

Sejbæk, Tobias; Blaabjerg, Morten; Sprogøe, Pippi

2018-01-01

. The Multiple Sclerosis Neuropsychological Screening Questionnaire (MSNQ) has previously shown good validity in American, Argentinean, and Dutch MS cohorts. We sought to test reliability and validity of a Danish translation of the MSNQ compared with formal neuropsychological testing, and measures of depression...... the Expanded Disability Status Scale and MS Impairment Scale. Results: The test-retest reliability of the MSNQ-P was significant (R2 = 0.79, P ... that the MSNQ-P measures these items more than the cognitive abilities of the patients. Conclusions: This study does not support use of the MSNQ as a sensitive or valid screening tool for cognitive impairment in Danish patients with MS....
The Dichotic Digits difference Test (DDdT): Development, Normative Data, and Test-Retest Reliability Studies Part 1.

Science.gov (United States)

Cameron, Sharon; Glyde, Helen; Dillon, Harvey; Whitfield, Jessica; Seymour, John

2016-06-01

The dichotic digits test is one of the most widely used assessment tools for central auditory processing disorder. However, questions remain concerning the impact of cognitive factors on test results. To develop the Dichotic Digits difference Test (DDdT), an assessment tool that could differentiate children with cognitive deficits from children with genuine dichotic deficits based on differential test results. The DDdT consists of four subtests: dichotic free recall (FR), dichotic directed left ear (DLE), dichotic directed right ear (DRE), and diotic. Scores for six conditions are calculated (FR left ear [LE], FR right ear [RE], and FR total, as well as DLE, DRE, and diotic). Scores for four difference measures are also calculated: dichotic advantage, right-ear advantage (REA) FR, REA directed, and attention advantage. Experiment 1 involved development of the DDdT, including error rate analysis. Experiment 2 involved collection of normative and test-retest reliability data. Twenty adults (aged 25 yr 10 mo to 50 yr 7 mo, mean 36 yr 4 mo) took part in the development study; 62 normal-hearing, typically developing, primary-school children (aged 7 yr 1 mo to 11 yr 11 mo, mean 9 yr 4 mo) and 10 adults (aged 25 yr 0 mo to 51 yr 6 mo, mean 34 yr 10 mo) took part in the normative and test-retest reliability study. In Experiment 1, error rate analysis was conducted on the 36 digit-pair combinations of the DDdT. Normative data collected in Experiment 2 were arcsine transformed to achieve a distribution that was closer to a normal distribution and z-scores calculated. Pearson product-moment correlations were used to determine the strength of relationships between DDdT conditions. The development study revealed no significant differences in the adult population between test and retest on any DDdT condition. Error rates on 36 digit pairs ranged from 1.5% to 16.7%. The most and the least error-prone digits were removed before commencement of the normative data study, leaving 25
Harmony in Life Scale - Turkish version: Studies of validity and reliability

Directory of Open Access Journals (Sweden)

Seydi Ahmet Satici

2017-11-01

Full Text Available Abstract This article presents the adaptation and psychometric evaluation of the Turkish version of Harmony in Life Scale (Turkish-HiL. The present paper investigates (study 1; N 1 = 253 confirmatory factor analysis, measurement invariance; (study 2; N 2 = 231 concurrent validity; (study 3; N 3 = 260 convergent and known-group validities; (study 4; N t − t = 50 test-retest, Cronbach alpha, and composite reliabilities of the Turkish-HiL. In study 1, based on a confirmatory factor analysis, results confirmed that unidimensional-factor structure. The results suggested that the model demonstrated a configural and metric invariance across the gender groups. In study 2, Turkish-HiL significantly correlated with measures of satisfaction with life, subjective happiness, positive affect, and negative affect. In study 3, Turkish-HiL was predicted positively by flourishing, conversely, negatively predicted by depression, anxiety, and stress. Finally, in study 4, alpha, composite and test-retest reliabilities are acceptable. Overall, the scale presented here may prove useful for satisfactorily assessing, in Turkish, the harmony in life of the university students.
Test-retest reliability of tibiofemoral joint space width measurements made using a low-dose standing CT scanner

Energy Technology Data Exchange (ETDEWEB)

Segal, Neil A. [University of Kansas Medical Center, Department of Rehabilitation Medicine, 3901 Rainbow Boulevard, Mailstop 1046, Kansas City, KS (United States); The University of Iowa, Iowa City, IA (United States); Bergin, John; Kern, Andrew; Findlay, Christian [The University of Iowa, Iowa City, IA (United States); Anderson, Donald D. [The University of Iowa, Department of Orthopaedics and Rehabilitation, Iowa City, IA (United States)

2017-02-15

To determine the test-retest reliability of knee joint space width (JSW) measurements made using standing CT (SCT) imaging. This prospective two-visit study included 50 knees from 30 subjects (66% female; mean ± SD age 58.2 ± 11.3 years; BMI 29.1 ± 5.6 kg/m{sup 2}; 38% KL grade 0-1). Tibiofemoral geometry was obtained from bilateral, approximately 20 fixed-flexed SCT images acquired at visits 2 weeks apart. For each compartment, the total joint area was defined as the area with a JSW <10 mm. The summary measurements of interest were the percentage of the total joint area with a JSW less than 0.5-mm thresholds between 2.0 and 5.0 mm in each tibiofemoral compartment. Test-retest reliability of the summary JSW measurements was assessed by intraclass correlation coefficients (ICC 2,1) for the percentage area engaged at each threshold of JSW and root-mean-square errors (RMSE) were calculated to assess reproducibility. The ICCs were excellent for each threshold assessed, ranging from 0.95 to 0.97 for the lateral and 0.90 to 0.97 for the medial compartment. RMSE ranged from 1.1 to 7.2% for the lateral and from 3.1 to 9.1% for the medial compartment, with better reproducibility at smaller JSW thresholds. The knee joint positioning protocol used demonstrated high day-to-day reliability for SCT 3D tibiofemoral JSW summary measurements repeated 2 weeks apart. Low-dose SCT provides a great deal of information about the joint while maintaining high reliability, making it a suitable alternative to plain radiographs for evaluating JSW in people with knee OA. (orig.)
Test-retest reliability of tibiofemoral joint space width measurements made using a low-dose standing CT scanner

International Nuclear Information System (INIS)

Segal, Neil A.; Bergin, John; Kern, Andrew; Findlay, Christian; Anderson, Donald D.

2017-01-01

To determine the test-retest reliability of knee joint space width (JSW) measurements made using standing CT (SCT) imaging. This prospective two-visit study included 50 knees from 30 subjects (66% female; mean ± SD age 58.2 ± 11.3 years; BMI 29.1 ± 5.6 kg/m 2 ; 38% KL grade 0-1). Tibiofemoral geometry was obtained from bilateral, approximately 20 fixed-flexed SCT images acquired at visits 2 weeks apart. For each compartment, the total joint area was defined as the area with a JSW <10 mm. The summary measurements of interest were the percentage of the total joint area with a JSW less than 0.5-mm thresholds between 2.0 and 5.0 mm in each tibiofemoral compartment. Test-retest reliability of the summary JSW measurements was assessed by intraclass correlation coefficients (ICC 2,1) for the percentage area engaged at each threshold of JSW and root-mean-square errors (RMSE) were calculated to assess reproducibility. The ICCs were excellent for each threshold assessed, ranging from 0.95 to 0.97 for the lateral and 0.90 to 0.97 for the medial compartment. RMSE ranged from 1.1 to 7.2% for the lateral and from 3.1 to 9.1% for the medial compartment, with better reproducibility at smaller JSW thresholds. The knee joint positioning protocol used demonstrated high day-to-day reliability for SCT 3D tibiofemoral JSW summary measurements repeated 2 weeks apart. Low-dose SCT provides a great deal of information about the joint while maintaining high reliability, making it a suitable alternative to plain radiographs for evaluating JSW in people with knee OA. (orig.)
Reliability, validity, and significance of assessment of sense of contribution in the workplace.

Science.gov (United States)

Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito

2014-01-29

The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%-80.2%). Fifty-four workers were included in the analysis of test-retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach's α coefficients in men and women were 0.85 and 0.86, respectively) and test-retest reliability (intraclass correlation coefficient = 0.91). Significant (p workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Learning Style Scales: a valid and reliable questionnaire

Directory of Open Access Journals (Sweden)

Abdolghani Abdollahimohammad

2014-08-01

Full Text Available Purpose: Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. Methods: A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF with oblimin rotation, a scree plot test, and parallel analysis (PA. The reliability of LSS was tested using Cronbach’s α, corrected item-total correlation, and test-retest. Results: Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach’s α was > 0.70 for all subscales in both study populations. The corrected item-total correlations were > 0.30 for the items in each component. Conclusion: The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Learning Style Scales: a valid and reliable questionnaire.

Science.gov (United States)

Abdollahimohammad, Abdolghani; Ja'afar, Rogayah

2014-01-01

Learning-style instruments assist students in developing their own learning strategies and outcomes, in eliminating learning barriers, and in acknowledging peer diversity. Only a few psychometrically validated learning-style instruments are available. This study aimed to develop a valid and reliable learning-style instrument for nursing students. A cross-sectional survey study was conducted in two nursing schools in two countries. A purposive sample of 156 undergraduate nursing students participated in the study. Face and content validity was obtained from an expert panel. The LSS construct was established using principal axis factoring (PAF) with oblimin rotation, a scree plot test, and parallel analysis (PA). The reliability of LSS was tested using Cronbach's α, corrected item-total correlation, and test-retest. Factor analysis revealed five components, confirmed by PA and a relatively clear curve on the scree plot. Component strength and interpretability were also confirmed. The factors were labeled as perceptive, solitary, analytic, competitive, and imaginative learning styles. Cronbach's α was >0.70 for all subscales in both study populations. The corrected item-total correlations were >0.30 for the items in each component. The LSS is a valid and reliable inventory for evaluating learning style preferences in nursing students in various multicultural environments.
Validity and reliability of the NAB Naming Test.

Science.gov (United States)

Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto

2016-05-01

Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Validation of the German version of the Ford Insomnia Response to Stress Test.

Science.gov (United States)

Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

2018-06-01

The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.
Validation of EncephalApp, Smartphone-Based Stroop Test, for the Diagnosis of Covert Hepatic Encephalopathy.

Science.gov (United States)

Bajaj, Jasmohan S; Heuman, Douglas M; Sterling, Richard K; Sanyal, Arun J; Siddiqui, Muhammad; Matherly, Scott; Luketic, Velimir; Stravitz, R Todd; Fuchs, Michael; Thacker, Leroy R; Gilles, HoChong; White, Melanie B; Unser, Ariel; Hovermale, James; Gavis, Edith; Noble, Nicole A; Wade, James B

2015-10-01

Detection of covert hepatic encephalopathy (CHE) is difficult, but point-of-care testing could increase rates of diagnosis. We aimed to validate the ability of the smartphone app EncephalApp, a streamlined version of Stroop App, to detect CHE. We evaluated face validity, test-retest reliability, and external validity. Patients with cirrhosis (n = 167; 38% with overt HE [OHE]; mean age, 55 years; mean Model for End-Stage Liver Disease score, 12) and controls (n = 114) were each given a paper and pencil cognitive battery (standard) along with EncephalApp. EncephalApp has Off and On states; results measured were OffTime, OnTime, OffTime+OnTime, and number of runs required to complete 5 off and on runs. Thirty-six patients with cirrhosis underwent driving simulation tests, and EncephalApp results were correlated with results. Test-retest reliability was analyzed in a subgroup of patients. The test was performed before and after transjugular intrahepatic portosystemic shunt placement, and before and after correction for hyponatremia, to determine external validity. All patients with cirrhosis performed worse on paper and pencil and EncephalApp tests than controls. Patients with cirrhosis and OHE performed worse than those without OHE. Age-dependent EncephalApp cutoffs (younger or older than 45 years) were set. An OffTime+OnTime value of >190 seconds identified all patients with CHE with an area under the receiver operator characteristic value of 0.91; the area under the receiver operator characteristic value was 0.88 for diagnosis of CHE in those without OHE. EncephalApp times correlated with crashes and illegal turns in driving simulation tests. Test-retest reliability was high (intraclass coefficient, 0.83) among 30 patients retested 1-3 months apart. OffTime+OnTime increased significantly (206 vs 255 seconds, P = .007) among 10 patients retested 33 ± 7 days after transjugular intrahepatic portosystemic shunt placement. OffTime+OnTime decreased significantly (242 vs
Test-retest and interobserver reliability of quantitative sensory testing according to the protocol of the German Research Network on Neuropathic Pain (DFNS): a multi-centre study.

Science.gov (United States)

Geber, Christian; Klein, Thomas; Azad, Shahnaz; Birklein, Frank; Gierthmühlen, Janne; Huge, Volker; Lauchart, Meike; Nitzsche, Dorothee; Stengel, Maike; Valet, Michael; Baron, Ralf; Maier, Christoph; Tölle, Thomas; Treede, Rolf-Detlef

2011-03-01

Quantitative sensory testing (QST) is an instrument to assess positive and negative sensory signs, helping to identify mechanisms underlying pathologic pain conditions. In this study, we evaluated the test-retest reliability (TR-R) and the interobserver reliability (IO-R) of QST in patients with sensory disturbances of different etiologies. In 4 centres, 60 patients (37 male and 23 female, 56.4±1.9years) with lesions or diseases of the somatosensory system were included. QST comprised 13 parameters including detection and pain thresholds for thermal and mechanical stimuli. QST was performed in the clinically most affected test area and a less or unaffected control area in a morning and an afternoon session on 2 consecutive days by examiner pairs (4 QSTs/patient). For both, TR-R and IO-R, there were high correlations (r=0.80-0.93) at the affected test area, except for wind-up ratio (TR-R: r=0.67; IO-R: r=0.56) and paradoxical heat sensations (TR-R: r=0.35; IO-R: r=0.44). Mean IO-R (r=0.83, 31% unexplained variance) was slightly lower than TR-R (r=0.86, 26% unexplained variance, Ptest area (TR-R: r=0.86; IO-R: r=0.83) than in the control area (TR-R: r=0.79; IO-R: r=0.71, each Preliability of QST. We conclude that standardized QST performed by trained examiners is a valuable diagnostic instrument with good test-retest and interobserver reliability within 2days. With standardized training, observer bias is much lower than random variance. Quantitative sensory testing performed by trained examiners is a valuable diagnostic instrument with good interobserver and test-retest reliability for use in patients with sensory disturbances of different etiologies to help identify mechanisms of neuropathic and non-neuropathic pain. Copyright © 2010 International Association for the Study of Pain. Published by Elsevier B.V. All rights reserved.

Adaptation of the Godin Leisure-Time Exercise Questionnaire into Turkish: The Validity and Reliability Study

Directory of Open Access Journals (Sweden)

Emine Sari

2016-01-01

Full Text Available This study was conducted with the aim of determining whether the Turkish form of the “Leisure-Time Exercise Questionnaire” developed by Godin is a valid and reliable tool for diabetic patients in Turkey. The study was conducted as a methodological research on 300 diabetic patients in Turkey. The linguistic equivalence of the questionnaire was assessed through the back-translation method, while its content validity was assessed through obtaining expert opinions. Cronbach’s alpha value was found to assess the reliability of the questionnaire. The test-retest analysis and the correlation between independent observers were examined. The content validity index (CVI was found to be .82 according to the expert assessments, and no statistical difference was found between them (Kendall’s W=.17, p=.235. Cronbach’s alpha was found to be α=.64, the result of the test-retest analysis was r=.97, and the correlation between independent observers (ICC was .98. This study found that the Turkish form of the Leisure-Time Exercise Questionnaire is a valid and reliable tool that can be used to define and assess the exercise behaviors of Turkish diabetic patients.
Which is the most useful patient-reported outcome in femoroacetabular impingement? Test-retest reliability of six questionnaires.

Science.gov (United States)

Hinman, Rana S; Dobson, Fiona; Takla, Amir; O'Donnell, John; Bennell, Kim L

2014-03-01

The most reliable patient-reported outcomes (PROs) for people with femoroacetabular impingement (FAI) is unknown because there have been no direct comparisons of questionnaires. Thus, the aim was to evaluate the test-retest reliability of six existing PROs in a single cohort of young active people with hip/groin pain consistent with a clinical diagnosis of FAI. Young adults with clinical FAI completed six PRO questionnaires on two occasions, 1-2 weeks apart. The PROs were modified Harris Hip Score, Hip dysfunction and Osteoarthritis Score, Hip Outcome Score, Non-Arthritic Hip Score, International Hip Outcome Tool, Copenhagen Hip and Groin Outcome Score. 30 young adults (mean age 24 years, SD 4 years, range 18-30 years; 15 men) with stable symptoms participated. Intraclass correlation coefficient(3,1) values ranged from 0.73 to 0.93 (95% CI 0.38 to 0.98) indicating that most questionnaires reached minimal reliability benchmarks. Measurement error at the individual level was quite large for most questionnaires (minimal detectable change (MDC95) 12.4-35.6, 95% CI 8.7 to 54.0). In contrast, measurement error at the group level was quite small for most questionnaires (MDC95 2.2-7.3, 95% CI 1.6 to 11). The majority of the questionnaires were reliable and precise enough for use at the group level. Samples of only 23-30 individuals were required to achieve acceptable measurement variation at the group level. Further direct comparisons of these questionnaires are required to assess other measurement properties such as validity, responsiveness and meaningful change in young people with FAI.
The multiple sclerosis work difficulties questionnaire: translation and cross-cultural adaptation to Turkish and assessment of validity and reliability.

Science.gov (United States)

Kahraman, Turhan; Özdoğar, Asiye Tuba; Honan, Cynthia Alison; Ertekin, Özge; Özakbaş, Serkan

2018-05-09

To linguistically and culturally adapt the Multiple Sclerosis Work Difficulties Questionnaire-23 (MSWDQ-23) for use in Turkey, and to examine its reliability and validity. Following standard forward-back translation of the MSWDQ-23, it was administered to 124 people with multiple sclerosis (MS). Validity was evaluated using related outcome measures including those related to employment status and expectations, disability level, fatigue, walking, and quality of life. Randomly selected participants were asked to complete the MSWDQ-23 again to assess test-retest reliability. Confirmatory factor analysis on the MSWDQ-23 demonstrated a good fit for the data, and the internal consistency of each subscale was excellent. The test-retest reliability for the total score, psychological/cognitive barriers, physical barriers, and external barriers subscales were high. The MSWDQ-23 and its subscales were positively correlated with the employment, disability level, walking, and fatigue outcome measures. This study suggests that the Turkish version of MSWDQ-23 has high reliability and adequate validity, and it can be used to determine the difficulties faced by people with multiple sclerosis in workplace. Moreover, the study provides evidence about the test-retest reliability of the questionnaire. Implications for rehabilitation Multiple sclerosis affects young people of working age. Understanding work-related problems is crucial to enhance people with multiple sclerosis likelihood of maintaining their job. The Multiple Sclerosis Work Difficulties Questionnaire-23 (MSWDQ-23) is a valid and reliable measure of perceived workplace difficulties in people with multiple sclerosis: we presented its validation to Turkish. Professionals working in the field of vocational rehabilitation may benefit from using the MSWDQ-23 to predict the current work outcomes and future employment expectations.
Validity and Reliability of the Persian Version of the Dysphagia Handicap Index (DHI).

Science.gov (United States)

Asadollahpour, Faezeh; Baghban, Kowsar; Asadi, Mozhgan

2015-05-01

The Dysphagia Handicap Index (DHI) is one of the instruments used for measuring a dysphagic patient's self-assessment. In some ways, it reflects the patient's quality of life. Although it has been recognized and widely applied in English speaking populations, it has not been used in its present forms in Persian speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Some stages for cross-cultural adaptation were performed, which consisted in translation, synthesis, back translation, review by an expert committee, and final proof reading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. The Persian DHI showed good internal consistency (Cronbach's alpha coefficients range from 0.82 to 0.94). Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89). There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P‹0.001). The Persian version of the DHI achieved Face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
Validity and Reliability of the Persian Version of the Dysphagia Handicap Index (DHI

Directory of Open Access Journals (Sweden)

faezeh asadollahpour

2015-05-01

Full Text Available Introduction: The Dysphagia Handicap Index (DHI is one of the instruments used for measuring a dysphagic patient’s self-assessment. In some ways, it reflects the patient’s quality of life. Although it has been recognized and widely applied in English speaking populations, it has not been used in its present forms in Persian speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Materials and Methods: Some stages for cross-cultural adaptation were performed, which consisted in translation, synthesis, back translation, review by an expert committee, and final proof reading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. Results: The Persian DHI showed good internal consistency (Cronbach’s alpha coefficients range from 0.82 to 0.94. Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89. There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P‹0.001. Conclusion: The Persian version of the DHI achieved Face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
Validity and reliability of the European portuguese version of neuropsychiatric inventory in an institutionalized sample.

Science.gov (United States)

Ferreira, Ana Rita; Martins, Sonia; Ribeiro, Orquidea; Fernandes, Lia

2015-01-01

Neuropsychiatric symptoms are very common in dementia and have been associated with patient and caregiver distress, increased risk of institutionalization and higher costs of care. In this context, the neuropsychiatric inventory (NPI) is the most widely used comprehensive tool designed to measure neuropsychiatric Symptoms in geriatric patients with dementia. The aim of this study was to present the validity and reliability of the European Portuguese version of NPI. A cross-sectional study was carried out with a convenience sample of institutionalized patients (≥ 50 years old) in three nursing homes in Portugal. All patients were also assessed with mini-mental state examination (MMSE) (cognition), geriatric depression scale (GDS) (depression) and adults and older adults functional assessment inventory (IAFAI) (functionality). NPI was administered to a formal caregiver, usually from the clinical staff. Inter-rater and test-retest reliability were assessed in a subsample of 25 randomly selected subjects. The sample included 166 elderly, with a mean age of 80.9 (standard deviation: 10.2) years. Three out of the NPI behavioral items had negative correlations with MMSE: delusions (rs = -0.177, P = 0.024), disinhibition (rs = -0.174, P = 0.026) and aberrant motor activity (rs = -0.182, P = 0.020). The NPI subsection of depression/dysphoria correlated positively with GDS total score (rs = 0.166, P = 0.038). NPI showed good internal consistency (overall α = 0.766; frequency α = 0.737; severity α = 0.734). The inter-rater reliability was excellent (intraclass correlation coefficient (ICC): 1.00, 95% confidence interval (CI) 1.00 - 1.00), as well as test-retest reliability (ICC: 0.91, 95% CI 0.80 - 0.96). The results found for convergent validity, inter-rater and test-retest reliability, showed that this version appears to be a valid and reliable instrument for evaluation of neuropsychiatric symptoms in institutionalized elderly.
Validity and Reliability of Dynamic Visual Acuity (DVA) Measurement During Walking

Science.gov (United States)

Deshpande, Nandini; Peters, Brian T.; Bloomberg, Jacob J.

2014-01-01

DVA is primarily subserved by the vestibulo-ocular reflex mechanism. Individuals with vestibular hypofunction commonly experience highly debilitating illusory movement or blurring of visual images during daily activities possibly, due to impaired DVA. Even without pathologies, gradual age-related morphological deterioration is evident in all components of the vestibular system. We examined the construct validity to detect age-related differences and test-retest reliability of DVA measurements performed during walking. METHODS: Healthy adults were recruited into 3 groups: 1. young (20-39years, n=18), 2. middle-aged (40-59years, n=14), and 3. older adults (60-80years, n=15). Randomly selected seven participants from each group (n=21) participated in retesting. Participants were excluded if they had a history of vestibular or neuromuscular pathologies, dizziness/vertigo or >1 falls in the past year. Older persons with MMSE scores reliability. RESULTS: The three age groups were not different in their height, weight and normal walking speed (p>0.05). The post hoc analyses for DVA measurements demonstrated that each group was significantly different from the other two groups for Near as well as FarDVA (preliability. FarDVA at 0.8 m/s and 1.0 m/s demonstrated good test-retest reliability (ICCs 0.71 and 0.77, respectively).
Modified sphygmomanometer test for the assessment of strength of the trunk, upper and lower limbs muscles in subjects with subacute stroke: reliability and validity.

Science.gov (United States)

Aguiar, Larissa T; Lara, Eliza M; Martins, Julia C; Teixeira-Salmela, Luci F; Quintino, Ludmylla F; Christo, Paulo P; DE Morais Fairaa, Christina

2016-10-01

Limitations in activities have been related to weakness of the upper limbs (UL), lower limbs (LL) and trunk muscles after stroke. Therefore, the measurement of strength after stroke becomes essential. The Modified Sphygmomanometer Test (MST) is an alternative method for the measurement of strength, since it is cheap and provides objective values. However, no studies have investigated the measurement properties of the MST in sub-acute stroke. To investigate the test-retest and inter-rater reliabilities and criterion-related validity of the MST for the measurement of strength of the UL, LL, and trunk muscles in subjects with sub-acute stroke, and verify whether the number of trials would affect the results. Diagnostic accuracy. Local community, out-patient clinics, and university laboratory. Sixty- five subjects with sub-acute stroke (62±14 years) participated of the present study. The strength of 36 muscular groups was measured with the MST and dynamometers (criterion standard). To investigate whether the number of trials would affect the results, analysis of variance was applied. For the test-retest and inter-rater reliabilities and criterion-related validity of the MST, intra-class correlation coefficients (ICC), Pearson correlation coefficients, and coefficients of determination were calculated. Similar results were found for all muscular groups and number of trials (0.01≤F≤0.14; 0.87≤p≤0.99) with significant and adequate values of test-retest (0.57≤ICC≥0.98) (exception: first trial of the non-paretic ankle dorsiflexors) and inter-rater (0.50≤ICC≥0.99) (exception: non-paretic ankle plantar flexors) reliabilities and validity (0.70≤r≥0.95; p≤0.001). The values obtained with the MST were good predictors of those obtained with the dynamometers (0.54≤r2≤0.90). In general, the MST showed adequate reliabilities and criterion-related validity for measuring strength of subjects with sub-acute stroke, and only one trial, after familiarization
[Reliability and validity studies of Turkish translation of Eysenck Personality Questionnaire Revised-Abbreviated].

Science.gov (United States)

Karanci, A Nuray; Dirik, Gülay; Yorulmaz, Orçun

2007-01-01

The aim of the present study was to examine the reliability and the validity of the Turkish translation of the Eysneck Personality Questionnaire Revised-abbreviated Form (EPQR-A) (Francis et al., 1992), which consists of 24 items that assess neuroticism, extraversion, psychoticism, and lying. The questionnaire was first translated into Turkish and then back translated. Subsequently, it was administered to 756 students from 4 different universities. The Fear Survey Inventory-III (FSI-III), Rosenberg Self-Esteem Scales (RSES), and Egna Minnen Betraffande Uppfostran (EMBU-C) were also administered in order to assess the questionnaire's validity. The internal consistency, test-retest reliability, and validity were subsequently evaluated. Factor analysis, similar to the original scale, yielded 4 factors; the neuroticism, extraversion, psychoticism, and lie scales. Kuder-Richardson alpha coefficients for the extraversion, neuroticism, psychoticism, and lie scales were 0.78, 0.65, 0.42, and 0.64, respectively, and the test-retest reliability of the scales was 0.84, 0.82, 0.69, and 0.69, respectively. The relationships between EPQR-A-48, FSI-III, EMBU-C, and RSES were examined in order to evaluate the construct validity of the scale. Our findings support the construct validity of the questionnaire. To investigate gender differences in scores on the subscales, MANOVA was conducted. The results indicated that there was a gender difference only in the lie scale scores. Our findings largely supported the reliability and validity of the questionnaire in a Turkish student sample. The psychometric characteristics of the Turkish version of the EPQR-A were discussed in light of the relevant literature.
Reliability, validity, and responsiveness of the Persian version of Shoulder Activity Scale in a group of patients with shoulder disorders.

Science.gov (United States)

Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin

2015-01-01

The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into the Persian and determine its clinimetric properties including reliability, validity, and responsiveness in patients with shoulder disorders. Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: SAS, shoulder pain and disability index (SPADI), and Short-Form 36 Health Survey (SF-36). The patients completed the SAS, 1 week after the first visit to evaluate the test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected in the first visit and then again after 4 weeks physiotherapy intervention. Test-retest reliability and internal consistency were assessed using Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problem or language difficulties were reported during translation process. Test-retest reliability of the SAS was excellent with an ICC of 0.98. Also, the marginal Cronbach's alpha level of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidences to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders. Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice
[The validity and reliability of the general self-efficacy scale-Turkish form].

Science.gov (United States)

Yildirim, Fatma; Ilhan, Inci Ozgür

2010-01-01

Self-efficacy, which is a basic construct in social cognitive theory, has been defined as one's belief in his/her ability to start, continue, and complete an action in a manner that has an impact on his/her environment. This study aimed to investigate the psychometric properties of the General Self-Efficacy Scale-Turkish Form. The General Self-Efficacy Scale-Turkish Form was administered to 895 individuals ?18 years of age that had at least 5 years of education. Exploratory factor analysis, criterion validity testing (using the Beck Depression Scale, Spielberger Trait Anxiety Inventory, Locus of Control Scale, Learned Resourcefulness Scale, and Coopersmith Self Esteem Inventory), internal consistency analysis, and test-retest reliability analysis were performed. The 3-factor structure of the scale explained 41.5% of the observed variance. Correlations between the General Self-Efficacy Scale-Turkish Form and the other measures were statistically significant. The Cronbach's alpha coefficient for the entire scale was 0.80 and the test-retest reliability coefficient estimated from data for 236 individuals that were contacted for follow-up was 0.69. The General Self-Efficacy Scale-Turkish Form is a valid and reliable instrument for the assessment of general self-efficacy in individuals ?18 years of age with at least 5 years of education.
Reliability and validity of the parent efficacy for child healthy weight behaviour (PECHWB) scale.

Science.gov (United States)

Palmer, F; Davis, M C

2014-05-01

Interventions for childhood overweight and obesity that target parents as the agents of change by increasing parent self-efficacy for facilitating their child's healthy weight behaviours require a reliable and valid tool to measure parent self-efficacy before and after interventions. Nelson and Davis developed the Parent Efficacy for Child Healthy Weight Behaviour (PECHWB) scale with good preliminary evidence of reliability and validity. The aim of this research was to provide further psychometric evidence from an independent Australian sample. Data were provided by a convenience sample of 261 primary caregivers of children aged 4-17 years via an online survey. PECHWB scores were correlated with scores on other self-report measures of parenting efficacy and 2- to 4-week test-retest reliability of the PECHWB was assessed. The results of the study confirmed the four-factor structure of the PECHWB (Fat and Sugar, Sedentary Behaviours, Physical Activity, and Fruit and Vegetables) and provided strong evidence of internal consistency and test-retest reliability, as well as good evidence of convergent validity. Future research should investigate the properties of the PECHWB in a sample of parents of overweight or obese children, including measures of child weight and actual child healthy weight behaviours to provide evidence of the concurrent and predictive validity of PECHWB scores. © 2013 John Wiley & Sons Ltd.
Reliability, validity and usefulness of 30-15 Intermittent Fitness Test in Female Soccer Players

Directory of Open Access Journals (Sweden)

Nedim Čović

2016-11-01

Full Text Available PURPOSE: The aim of this study was to examine the reliability, validity and usefulness of the 30-15IFT in competitive female soccer players. METHODS: Seventeen elite female soccer players participated in the study. A within subject test-retest study design was utilized to assess the reliability of the 30-15 intermittent fitness test (IFT. Seven days prior to 30-15IFT, subjects performed a continuous aerobic running test (CT under laboratory conditions to assess the criterion validity of the 30-15IFT. End running velocity (VCT and VIFT, peak heart rate (HRpeak and maximal oxygen consumption (VO2max were collected and/or estimated for both tests. RESULTS: VIFT (ICC = 0.91; CV = 1.8%, HRpeak (ICC = 0.94; CV = 1.2%, and VO2max (ICC = 0.94; CV = 1.6% obtained from the 30-15IFT were all deemed highly reliable (p>0.05. Pearson product moment correlations between the CT and 30-15IFT for VO2max, HRpeak and end running velocity were large (r = 0.67, p=0.013, very large (r = 0.77, p=0.02 and large (r = 0.57, p=0.042, respectively. CONCLUSION: Current findings suggest that the 30 -15IFT is a valid and reliable intermittent aerobic fitness test of elite female soccer players. The findings have also provided practitioners with evidence to support the accurate detection of meaningful individual changes in VIFT of 0.5 km/h (1 stage and HRpeak of 2 bpm. This information may assist coaches in monitoring ‘real’ aerobic fitness changes to better inform training of female intermittent team sport athletes. Lastly, coaches could use the 30-15IFT as a practical alternative to laboratory based assessments to assess and monitor intermittent aerobic fitness changes in their athletes. Keywords: 30-15 intermittent fitness test, aerobic, cardiorespiratory fitness, intermittent activity, soccer, high intensity interval training.
Test-retest reliability and agreement of the Satisfaction with the Assistive Technology Services (SATS) instrument in two Nordic countries.

Science.gov (United States)

Sund, Terje; Iwarsson, Susanne; Anttila, Heidi; Helle, Tina; Brandt, Ase

2014-07-01

The purpose of this study was to investigate test-retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (PWCs) or powered scooters (scooters). Test-retest design, two telephone interviews 7-18 days apart of 40 informants, with mean age of 67.5 (SD 13.09) years in the Danish; and 54 informants with mean age of 55.6 (SD 12.09) years in the Finnish sample. The intra-class correlation coefficient varied between 0.57 and 0.93 for items in the Danish and between 0.41 and 0.93 in the Finnish sample. The percentage agreement varied between 54.2 and 79.5 for items in the Danish and between 69.2 and 81.1 in the Finnish sample, while the Cronbach's alpha values varied between 0.87 and 0.96 in the two samples. A ceiling effect was found in all items of both samples. This study indicates that the SATS may be reliably administered for telephone interviews among adult PWC and scooter users, and give information about aspects of the service delivery process for quality development improvement purposes. Further psychometric testing of the SATS is required.
Test-Retest Reliability of Measurements of Hand-Grip Strength Obtained by Dynamometry from Older Adults: A Systematic Review of Research in the PubMed Database.

Science.gov (United States)

Bohannon, R W

2017-01-01

A systematic review was performed to summarize literature describing the test-retest reliability of grip strength measures obtained from older adults. Relevant literature was identified via a PubMed search. Seventeen articles were deemed appropriate based on inclusion and exclusion criteria. The relative test-retest reliability of grip strength measures obtained by dynamometry was good to excellent (intra-class correlation coefficients > 0.80) in all but 3 studies, which involved older adults with severe dementia. Absolute reliability, as indicated by summary statistics such as the minimum detectable change (95%), was more variable. As a percentage, that change ranged from 14.5% to 98.5%. Consequently, clinicians can be confident in the relative reliability of grip strength measures obtained from at risk older adults. However, relatively large percentage changes in grip strength may be necessary to conclude with confidence that a real change has occurred over time in some populations.
[Reliability and validity of the PAQ-A questionnaire to assess physical activity in Spanish adolescents].

Science.gov (United States)

Martínez-Gómez, David; Martínez-de-Haro, Vicente; Pozo, Tamara; Welk, Gregory J; Villagra, Ariel; Calle, Marisa E; Marcos, Ascensión; Veiga, Oscar L

2009-01-01

Questionnaires are feasible instruments to assess physical activity (PA) in large samples. The aim of the current study was to evaluate the reliability and validity of the PAQ-A questionnaire in Spanish adolescents using the measurement of PA by accelerometer as criterion. In a sample of 82 adolescents, aged 12 to 17 years, 1-week PAQ-A test-retest was administered. Reliability was analyzed by the Intraclass Correlation Coefficient (ICC) and the internal consistency by the Cronbach's alpha Coefficient. Two hundred thirty-two adolescents, aged 13-17 years, completed the PAQ-A and wore the ActiGraph GT1M accelerometer during 7-days. The PAQ-A was compared against total PA and moderate to vigorous PA (MVPA) obtained by the accelerometer. Test-retest reliability showed ICC = 0.71 for the final score of PAQ-A. Internal consistency was alpha = 0.65 in the first self-report, alpha = 0.67 in the retest in 82 adolescents sample, and alpha = 0.74 in the 232 adolescents sample. The PAQ-A was moderately correlated with total PA (rho = 0.39) and MVPA (rho= 0.34) assessed by the accelerometer. The PAQ-A obtained significantly moderate correlations in boys but not in girls against the accelerometer. The PAQ-A questionnaire shows an adequate reliability and a reasonable validity for assessing PA in Spanish adolescents.
Reliability and validity of a Swedish language version of the Resilience Scale.

Science.gov (United States)

Nygren, Björn; Randström, Kerstin Björkman; Lejonklou, Anna K; Lundman, Beril

2004-01-01

The purpose of this study was to test the reliability and validity of the Swedish language version of the Resilience Scale (RS). Participants were 142 adults between 19-85 years of age. Internal consistency reliability, stability over time, and construct validity were evaluated using Cronbach's alpha, principal components analysis with varimax rotation and correlations with scores on the Sense of Coherence Scale (SOC) and the Rosenberg Self-Esteem Scale (RSE). The mean score on the RS was 142 (SD = 15). The possible scores on the RS range from 25 to 175, and scores higher than 146 are considered high. The test-retest correlation was .78. Correlations with the SOC and the RSE were .41 (p Self and Life emerged as components from the principal components analysis. These findings provide evidence for the reliability and validity of the Swedish language version of the RS.
Reliability, Validity, and Minimal Detectable Change of Balance Evaluation Systems Test and Its Short Versions in Older Cancer Survivors: A Pilot Study.

Science.gov (United States)

Huang, Min H; Miller, Kara; Smith, Kristin; Fredrickson, Kayle; Shilling, Tracy

2016-01-01

Cancer is primarily a disease of older adults. About 77% of all cancers are diagnosed in persons aged 55 years and older. Cancer and its treatment can cause diverse sequelae impacting body systems underlying balance control. No study has examined the psychometric properties of balance assessment tools in older cancer survivors, presenting a significant challenge in the selection of outcome measures for clinicians treating this fast-growing population. This study aimed to determine the reliability, validity, and minimal detectable change (MDC) of the Balance Evaluation System Test (BESTest), Mini-Balance Evaluation Systems Test (Mini-BESTest), and Brief-Balance Evaluation Systems Test (Brief-BESTest) in community-dwelling older cancer survivors. This study was a cross-sectional design. Twenty breast and 8 prostate cancer survivors participated [age (SD) = 68.4 (8.13) years]. The BESTest and Activity-specific Balance Confidence (ABC) Scale were administered during the first session. Scores of Mini-BESTest and Brief-BESTest were extracted on the basis of the scores of BESTest. The BESTest was repeated within 1 to 2 weeks by the same rater to determine the test-retest reliability. For the analysis of the inter-rater reliability, 21 participants were randomly selected to be evaluated by 2 raters. A primary rater administered the test. The 2 raters independently and concurrently scored the performance of the participants. Each rater recorded the ratings separately on the scoring sheet. No discussion among the raters was allowed throughout the testing. Intraclass correlation coefficients (ICCs), standard error of measurement, minimal detectable change (MDC), and Bland-Altman plots were calculated. Concurrent validity of these balance tests with the ABC Scale was examined using the Spearman correlation. The BESTest, Mini-BESTest, and Brief-BESTest had high test-retest (ICC = 0.90-0.94) and interrater reliability (ICC = 0.86-0.96), small standard error of measurement (0
Is the Bayley Scales of Infant and Toddler Developmental Screening Test, Valid and Reliable for Persian Speaking Children?

Science.gov (United States)

Soleimani, Farin; Azari, Nadia; Vameghi, Roshanak; Sajedi, Firoozeh; Shahshahani, Soheila; Karimi, Hossein; Kraskian, Adis; Shahrokhi, Amin; Teymouri, Robab; Gharib, Masoud

2016-10-01

Advances in perinatal and neonatal care have substantially improved the survival of at-risk infants over the past two decades. The purpose of this study was to assess the reliability and validity of the Bayley Scales of infant and toddler developmental Screening test in Persian-speaking children. This was a cross-sectional prospective study of 403 children aged 1 - 42-months. The Bayley scales screening instrument, which consists of five domains (cognitive, receptive, and expressive communication and fine and gross motor items), was used to measure infants' and toddlers' development. The psychometric properties examined included the face and content validity of the scale, in addition to cultural and linguistic modifications to the scale and its test-retest and inter-rater reliability. An expert team changed some of the test items relating to cultural and linguistic issues. In almost all the age groups, cultural or linguistic changes were made to items in the communication domains. According to Cronbach's alpha for internal consistency, the reliability of the cognitive scale was r = 0.79, and the reliability of the receptive scale was r = 0.76. The reliability for expressive communication, fine motor, and gross motor scales was r = 0.81, r = 0.80, and r = 0.81, respectively. The construct validity of the tests was confirmed using a factor analysis and comparison of the mean scores of the age groups. The intra- and inter-rater reliabilities of the Bayley Scales were good-to-excellent. The results indicated that the Bayley Scales had a high level of reliability in the present study. Thus, the scale can be used in a Persian population.
The reliability and validity of the Alcohol Use Disorders Identification Test (AUDIT) in a German general practice population sample.

Science.gov (United States)

Dybek, Inga; Bischof, Gallus; Grothues, Janina; Reinhardt, Susa; Meyer, Christian; Hapke, Ulfert; John, Ulrich; Broocks, Andreas; Hohagen, Fritz; Rumpf, Hans-Jürgen

2006-05-01

Our goal was to analyze the retest reliability and validity of the Alcohol Use Disorders Identification Test (AUDIT) in a primary-care setting and recommend a cut-off value for the different alcohol-related diagnoses. Participants recruited from general practices (GPs) in two northern German cities received the AUDIT, which was embedded in a health-risk questionnaire. In total, 10,803 screenings were conducted. The retest reliability was tested on a subsample of 99 patients, with an intertest interval of 30 days. Sensitivity and specificity at a number of different cut-off values were estimated for the sample of alcohol consumers (n=8237). For this study, 1109 screen-positive patients received a diagnostic interview. Individuals who scored less than five points in the AUDIT and also tested negative in a second alcohol-related screen were defined as "negative" (n=6003). This definition was supported by diagnostic interviews of 99 screen-negative patients from which no false negatives could be detected. As the gold standard for detection of an alcohol-use disorder (AUD), we used the Munich-Composite International Diagnostic Interview (MCIDI), which is based on Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, criteria. On the item level, the reliability, measured by the intraclass correlation coefficient (ICC), ranged between .39 (Item 9) and .98 (Item 10). For the total score, the ICC was .95. For cut-off values of eight points and five points, 87.5% and 88.9%, respectively, of the AUDIT-positives, and 98.9% and 95.1%, respectively, of the AUDIT-negatives were identically identified at retest, with kappa = .86 and kappa = .81. At the cut-off value of five points, we determined good combinations of sensitivity and specificity for the following diagnoses: alcohol dependence (sensitivity and specificity of .97 and .88, respectively), AUD (.97 and .92), and AUD and/or at-risk consumption (.97 and .91). Embedded in a health-risk questionnaire in

The Cognitive Telephone Screening Instrument (COGTEL: A Brief, Reliable, and Valid Tool for Capturing Interindividual Differences in Cognitive Functioning in Epidemiological and Aging Studies

Directory of Open Access Journals (Sweden)

Andreas Ihle

2017-10-01

Full Text Available Aims: The present study set out to evaluate the psychometric properties of the Cognitive Telephone Screening Instrument (COGTEL in 2 different samples of older adults. Methods: We assessed COGTEL in 116 older adults, with retest after 7 days to evaluate the test-retest reliability. Moreover, we assessed COGTEL in 868 older adults to evaluate convergent validity to the Mini-Mental State Examination (MMSE. Results: Test-retest reliability of the COGTEL total score was good at 0.85 (p < 0.001. Latent variable analyses revealed that COGTEL and MMSE correlated by 0.93 (p < 0.001, indicating convergent validity of the COGTEL. Conclusion: The present analyses suggest COGTEL as a brief, reliable, and valid instrument for capturing interindividual differences in cognitive functioning in epidemiological and aging studies, with the advantage of covering more cognitive domains than traditional screening tools such as the MMSE, as well as differentiating between individual performance levels, in healthy older adults.
Analysis of the reliability and validity of the Turkish version of the intermittent and constant osteoarthritis pain questionnaire.

Science.gov (United States)

Erel, Suat; Şimşek, İbrahim Engin; Özkan, Hüseyin

2015-01-01

The aim of this study was to analyze the validity and reliability of the Turkish version (ICOAP-TR) of the intermittent and constant osteoarthritis pain (ICOAP) questionnaire in patients with knee osteoarthritis (OA). Thirty-eight volunteer patients diagnosed with knee OA answered the questionnaire twice with an interval of 2-4 days. The reliability of the measurement was assessed using Cronbach's alpha coefficient and intraclass correlation (ICC) for test-retest reliability. Criterion validity was tested against the Western Ontario and McMaster Universities Arthritis Index (WOMAC) pain score and visual analog scale (VAS) designed to assess the perceived discomfort rated by the patient. Test-retest reliability was found to be ICC=0.942 for total score, 0.902 for constant pain subscale, and 0.945 for intermittent pain subscale. Internal consistency was tested using Cronbach's alpha and was found to be 0.970 for total score, 0.948 for constant pain subscale, and 0.972 for intermittent pain subscale. For criterion validity, the correlation between the total score of ICOAP-TR and WOMAC pain subscale was r=0.779 (p<0.05), and correlation between total score of ICOAP-TR and VAS was r=0.570 (p<0.05). The ICOAP-TR is a reliable and valid instrument to be used with patients with knee OA.
Reliability and validity of a modified MEDFICTS dietary fat screener in South African schoolchildren are determined by use and outcome measures.

Science.gov (United States)

Wenhold, Friedeburg Anna Maria; MacIntyre, Una Elizabeth; Rheeder, Paul

2014-06-01

In South Africa, noncommunicable diseases and obesity are increasing and also affect children. No validated assessment tools for fat intake are available. To determine test-retest reliability and relative validity of a pictorial modified meats, eggs, dairy, fried foods, fats in baked goods, convenience foods, table fats, and snacks (MEDFICTS) dietary fat screener. We determined test-retest reliability and diagnostic accuracy with the modified MEDFICTS as the index test and a 3-day weighed food record and parental completion of the screener as primary and secondary reference methods, respectively. Grade-six learners (aged 12 years, 4 months) in an urban, middle-class school (n=93) and their parents (n=72). Portion size, frequency of intake, final score, and classification of fat intake of the modified MEDFICTS, and percent energy from fat, saturated fatty acids, and cholesterol of the food record. For categorical data agreement was based on kappa statistics, McNemar's test for symmetry, and diagnostic performance parameters. Continuous data were analyzed with correlations, mean differences, the Bland-Altman method, and receiver operating characteristics. The classification of fat intake by the modified MEDFICTS was test-retest reliable. Final scores of the group did not differ between administrations (P=0.86). The correlation of final scores between administrations was significant for girls only (r=0.58; P=0.01). Reliability of portion size and frequency of intake scores depended on the food category. For girls the screener final score was significantly (P90%), but chance corrected agreement between the classifications was poor. Parents did not agree with their children. Test-retest reliability and relative validity of a modified MEDFICTS dietary fat screener in South African schoolchildren depended on the use and outcome measures applied. Copyright © 2014 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Validity and reliability of an instrumented leg-extension machine for measuring isometric muscle strength of the knee extensors.

Science.gov (United States)

Ruschel, Caroline; Haupenthal, Alessandro; Jacomel, Gabriel Fernandes; Fontana, Heiliane de Brito; Santos, Daniela Pacheco dos; Scoz, Robson Dias; Roesler, Helio

2015-05-20

Isometric muscle strength of knee extensors has been assessed for estimating performance, evaluating progress during physical training, and investigating the relationship between isometric and dynamic/functional performance. To assess the validity and reliability of an adapted leg-extension machine for measuring isometric knee extensor force. Validity (concurrent approach) and reliability (test and test-retest approach) study. University laboratory. 70 healthy men and women aged between 20 and 30 y (39 in the validity study and 31 in the reliability study). Intraclass correlation coefficient (ICC) values calculated for the maximum voluntary isometric torque of knee extensors at 30°, 60°, and 90°, measured with the prototype and with an isokinetic dynamometer (ICC2,1, validity study) and measured with the prototype in test and retest sessions, scheduled from 48 h to 72 h apart (ICC1,1, reliability study). In the validity analysis, the prototype showed good agreement for measurements at 30° (ICC2,1 = .75, SEM = 18.2 Nm) and excellent agreement for measurements at 60° (ICC2,1 = .93, SEM = 9.6 Nm) and at 90° (ICC2,1 = .94, SEM = 8.9 Nm). Regarding the reliability analysis, between-days' ICC1,1 were good to excellent, ranging from .88 to .93. Standard error of measurement and minimal detectable difference based on test-retest ranged from 11.7 Nm to 18.1 Nm and 32.5 Nm to 50.1 Nm, respectively, for the 3 analyzed knee angles. The analysis of validity and repeatability of the prototype for measuring isometric muscle strength has shown to be good or excellent, depending on the knee joint angle analyzed. The new instrument, which presents a relative low cost and easiness of transportation when compared with an isokinetic dynamometer, is valid and provides consistent data concerning isometric strength of knee extensors and, for this reason, can be used for practical, clinical, and research purposes.
Examination of the reliability and validity of the Mindful Eating Questionnaire in pregnant women.

Science.gov (United States)

Apolzan, John W; Myers, Candice A; Cowley, Amanda D; Brady, Heather; Hsia, Daniel S; Stewart, Tiffany M; Redman, Leanne M; Martin, Corby K

2016-05-01

Mindfulness is theorized to affect the eating behavior and weight of pregnant women, yet no measure has been validated during pregnancy. This study qualitatively and quantitatively evaluated the reliability and validity of the Mindful Eating Questionnaire (MEQ) in overweight and obese pregnant women. Participants completed focus groups and cognitive interviews. The MEQ was administered twice to measure test-retest reliability. The Eating Inventory (EI) and Mindful Attention Awareness Scale (MAAS) were administered to assess convergent validity, and the Neighborhood Environment Walkability Scale (NEWS) assessed discriminant validity. Participants were 20 ± 8 weeks gestation (mean ± SD), 30 ± 2 years old, and 55% were obese. The MEQ total score had good test-retest reliability (r = .85). The total score internal consistency reliability was poor (Cronbach's α = .56). The external cues subscale (ECS) was not internally consistent (α = .31). Other subscales ranged from α = .59-.68. When the ECS was excluded, the MEQ total score internal consistency was acceptable (α = .62). Convergent validity was supported by the MEQ total score (with and without ECS) correlating significantly with the MAAS and the EI disinhibition and hunger subscales. Discriminant validity of the MEQ was supported by the MEQ and NEWS total scores and subscales not being significantly correlated. The quantitative results were supported by the qualitative context and content analysis. With the exception of the ECS, the MEQ's reliability and validity was supported in pregnant women, and most of the subscales were more robust in pregnant women than in the original sample of healthy adults. The MEQ's use with overweight and obese pregnant women is supported. Copyright © 2016 Elsevier Ltd. All rights reserved.
Stroke Impact Scale 3.0: Reliability and Validity Evaluation of the Korean Version.

Science.gov (United States)

Choi, Seong Uk; Lee, Hye Sun; Shin, Joon Ho; Ho, Seung Hee; Koo, Mi Jung; Park, Kyoung Hae; Yoon, Jeong Ah; Kim, Dong Min; Oh, Jung Eun; Yu, Se Hwa; Kim, Dong A

2017-06-01

To establish the reliability and validity the Korean version of the Stroke Impact Scale (K-SIS) 3.0. A total of 70 post-stroke patients were enrolled. All subjects were evaluated for general characteristics, Mini-Mental State Examination (MMSE), the National Institutes of Health Stroke Scale (NIHSS), Modified Barthel Index, Hospital Anxiety and Depression Scale (HADS). The SF-36 and K-SIS 3.0 assessed their health-related quality of life. Statistical analysis after evaluation, determined the reliability and validity of the K-SIS 3.0. A total of 70 patients (mean age, 54.97 years) participated in this study. Internal consistency of the SIS 3.0 (Cronbach's alpha) was obtained, and all domains had good co-efficiency, with threshold above 0.70. Test-retest reliability of SIS 3.0 required correlation (Spearman's rho) of the same domain scores obtained on the first and second assessments. Results were above 0.5, with the exception of social participation and mobility. Concurrent validity of K-SIS 3.0 was assessed using the SF-36, and other scales with the same or similar domains. Each domain of K-SIS 3.0 had a positive correlation with corresponding similar domain of SF-36 and other scales (HADS, MMSE, and NIHSS). The newly developed K-SIS 3.0 showed high inter-intra reliability and test-retest reliabilities, together with high concurrent validity with the original and various other scales, for patients with stroke. K-SIS 3.0 can therefore be used for stroke patients, to assess their health-related quality of life and treatment efficacy.
Assessment of the severity of dementia: validity and reliability of the Chinese (Cantonese) version of the Hierarchic Dementia Scale (CV-HDS).

Science.gov (United States)

Poon, Vickie Wan-kei; Lam, Linda Chiu-wa; Wong, Samuel Yeung-shan

2008-09-01

With the rapid growth of the older population, early detection of cognitive deficits is crucial in slowing down functional deterioration of the elderly persons. To examine the validity and reliability of the Chinese (Cantonese) version of the Hierarchic Dementia Scale (CV-HDS) for Chinese older persons in Hong Kong. The HDS was translated into Cantonese Chinese. The content and cultural validity were evaluated by six expert panel members. Sixty-two participants with diagnosis of dementia were recruited for evaluation. Inter-rater reliability, test-retest reliability, internal consistency and concurrent validity were examined. The CV-HDS demonstrated satisfactory psychometric properties. inter-rater reliability and test-retest reliability were high (alpha=0.89 and alpha=0.94 respectively). High value of Cronbach's alpha (alpha=0.94) demonstrated good internal consistency. The concurrent validity of CV-HDS, through correlation with its scores with that of the Chinese version of Mini Mental Status Examination, was established (ranged from r=0.58 to r=0.78, pCantonese speaking Chinese people with dementia. It facilitates treatment planning to optimize the effects of functional training and rehabilitation.
Reliability and validity of a questionnaire to measure personal, social and environmental correlates of fruit and vegetable intake in 10-11-year-old children in five European countries

DEFF Research Database (Denmark)

De Bourdeaudhuij, I; Klepp, K-I; Due, P

2005-01-01

To investigate the internal consistency of the scales and the test-retest reliability and predictive validity of behaviour theory-based constructs measuring personal, social and environmental correlates of fruit and vegetable intake in 10-11-year-old children.......To investigate the internal consistency of the scales and the test-retest reliability and predictive validity of behaviour theory-based constructs measuring personal, social and environmental correlates of fruit and vegetable intake in 10-11-year-old children....
Validation and reliability of a Behcet’s Syndrome Activity Scale in Korea

Science.gov (United States)

Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo

2016-01-01

Background/Aims: We prepared a cross-cultural adaptation of the Behcet’s Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Methods: Fifty patients with Behcet’s disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet’s Disease Current Activity Form (BDCAF) and a Behcet’s Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Results: Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). Conclusions: The Korean version of BSAS is a reliable and valid instrument to measure BD activity. PMID:26767871
A Systematic Review of the Reliability and Validity of Behavioural Tests Used to Assess Behavioural Characteristics Important in Working Dogs.

Science.gov (United States)

Brady, Karen; Cracknell, Nina; Zulch, Helen; Mills, Daniel Simon

2018-01-01

Working dogs are selected based on predictions from tests that they will be able to perform specific tasks in often challenging environments. However, withdrawal from service in working dogs is still a big problem, bringing into question the reliability of the selection tests used to make these predictions. A systematic review was undertaken aimed at bringing together available information on the reliability and predictive validity of the assessment of behavioural characteristics used with working dogs to establish the quality of selection tests currently available for use to predict success in working dogs. The search procedures resulted in 16 papers meeting the criteria for inclusion. A large range of behaviour tests and parameters were used in the identified papers, and so behaviour tests and their underpinning constructs were grouped on the basis of their relationship with positive core affect (willingness to work, human-directed social behaviour, object-directed play tendencies) and negative core affect (human-directed aggression, approach withdrawal tendencies, sensitivity to aversives). We then examined the papers for reports of inter-rater reliability, within-session intra-rater reliability, test-retest validity and predictive validity. The review revealed a widespread lack of information relating to the reliability and validity of measures to assess behaviour and inconsistencies in terminologies, study parameters and indices of success. There is a need to standardise the reporting of these aspects of behavioural tests in order to improve the knowledge base of what characteristics are predictive of optimal performance in working dog roles, improving selection processes and reducing working dog redundancy. We suggest the use of a framework based on explaining the direct or indirect relationship of the test with core affect.
Validity and reliability of a pictorial instrument for assessing perceived motor competence in Portuguese children.

Science.gov (United States)

Lopes, V P; Barnett, L M; Saraiva, L; Gonçalves, C; Bowe, S J; Abbott, G; Rodrigues, L P

2016-09-01

It is important to assess young children's perceived Fundamental Movement Skill (FMS) competence in order to examine the role of perceived FMS competence in motivation toward physical activity. Children's perceptions of motor competence may vary according to the culture/country of origin; therefore, it is also important to measure perceptions in different cultural contexts. The purpose was to assess the face validity, internal consistency, test-retest reliability and construct validity of the 12 FMS items in the Pictorial Scale for Perceived Movement Skill Competence for Young Children (PMSC) in a Portuguese sample. Two hundred one Portuguese children (girls, n = 112), 5 to 10 years of age (7.6 ± 1.4), participated. All children completed the PMSC once. Ordinal alpha assessed internal consistency. A random subsamples (n = 47) were reassessed one week later to determine test-retest reliability with Bland-Altman method. Children were asked questions after the second administration to determine face validity. Construct validity was assessed on the whole sample with a Bayesian Structural Equation Modelling (BSEM) approach. The hypothesized theoretical model used the 12 items and two hypothesized factors: object control and locomotor skills. The majority of children correctly identified the skills and could understand most of the pictures. Test-retest reliability analysis was good, with an agreement ration between 0.99 and 1.02. Ordinal alpha values ranged from acceptable (object control 0.73, locomotor 0.68) to good (all FMS 0.81). The hypothesized BSEM model had an adequate fit. The PMSC can be used to investigate perceptions of children's FMS competence. This instrument can also be satisfactorily used among Portuguese children. © 2016 John Wiley & Sons Ltd.
Feasibility and test-retest reliability of measuring lower‑limb strength in young children with cerebral palsy.

Science.gov (United States)

Van Vulpen, L F; De Groot, S; Becher, J G; De Wolf, G S; Dallmeijer, A J

2013-12-01

Quantifying leg muscle strength in young children with cerebral palsy (CP) is essential for identifying muscle groups for treatment and for monitoring progress. To study the feasibility, intratester reliability and the optimal test design (number of test occasions and repetitions) of measuring lower-limb strength with handheld dynamometry (HHD) and dynamic ankle plantar flexor strength with the standing heel-rise (SH) test in 3-10 year aged children with CP. Test-retest design. Rehabilitation centre, special needs school for children with disabilities, and university medical centre. Knee extensor, hip abductor and calf muscle strength was assessed in 20 ambulatory children with spastic CP (3-5 years [N.=10] and 6-10 years [N.=10]) on two test occasions. Intraclass correlation coefficients (ICC) and Smallest Detectable Differences (SDD) were calculated to determine the optimal test design for detecting changes in strength. All isometric strength tests had acceptable SDDs (9-30%), when taking the mean values of 2-3 test occasions (separate days) and 2-3 repetitions. The one-leg SH test had large SDDs (40-128% for younger group, 23-48% for older group). Isometric strength (improvements) can only be measured reliably with HHD in young children with CP when the average values over at least 2 test occasions are taken. Reliability of the SH test is not sufficient for measuring individual changes in dynamic muscle strength in the younger children. Results of this study can be used to determine the optimal number of test occasions and repetitions for reliable HHD measurements depending on expected changes, muscle group and age in 3-10 year old children with CP.
[Validity and reliability of the spanish EQ-5D-Y proxy version].

Science.gov (United States)

Gusi, N; Perez-Sousa, M A; Gozalo-Delgado, M; Olivares, P R

2014-10-01

A proxy version of the EQ-5D-Y, a questionnaire to evaluate the Health Related Quality of Life (HRQoL) in children and adolescents, has recently been developed. There are currently no data on the validity and reliability of this tool. The objective of this study was to analyze the validity and reliability of the EQ-5D-Y proxy version. A core set of self-report tools, including the Spanish version of the EQ-5D-Y were administered to a group of Spanish children and adolescents drawn from the general population. A similar core set of internationally standardized proxy tools, including the EQ-5D-Y proxy version were administered to their parents. Test-retest reliability was determined, and correlations with other generic measurements of HRQoL were calculated. Additionally, known group validity was examined by comparing groups with a priori expected differences in HRQoL. The agreement between the self-report and proxy version responses was also calculated. A total of 477 children and adolescents and their parents participated in the study. One week later, 158 participants completed the EQ-5D-Y/EQ-5D-Y proxy to facilitate reliability analysis. Agreement between the test-retest scores was higher than 88% for EQ-5D-Y self-report, and proxy version. Correlations with other health measurements showed similar convergent validity to that observed in the international EQ-5D-Y. Agreement between the self-report and proxy versions ranged from 72.9% to 97.1%. The results provide preliminary evidence of the reliability and validity of the EQ-5D-Y proxy version. Copyright © 2013 Asociación Española de Pediatría. Published by Elsevier Espana. All rights reserved.
Development, reliability, and validity testing of Toddler NutriSTEP: a nutrition risk screening questionnaire for children 18-35 months of age.

Science.gov (United States)

Randall Simpson, Janis; Gumbley, Jillian; Whyte, Kylie; Lac, Jane; Morra, Crystal; Rysdale, Lee; Turfryer, Mary; McGibbon, Kim; Beyers, Joanne; Keller, Heather

2015-09-01

Nutrition is vital for optimal growth and development of young children. Nutrition risk screening can facilitate early intervention when followed by nutritional assessment and treatment. NutriSTEP (Nutrition Screening Tool for Every Preschooler) is a valid and reliable nutrition risk screening questionnaire for preschoolers (aged 3-5 years). A need was identified for a similar questionnaire for toddlers (aged 18-35 months). The purpose was to develop a reliable and valid Toddler NutriSTEP. Toddler NutriSTEP was developed in 4 phases. Content and face validity were determined with a literature review, parent focus groups (n = 6; 48 participants), and experts (n = 13) (phase A). A draft questionnaire was refined with key intercept interviews of 107 parents/caregivers (phase B). Test-retest reliability (phase C), based on intra-class correlations (ICC), Kappa (κ) statistics, and Wilcoxon tests was assessed with 133 parents/caregivers. Criterion validity (phase D) was assessed using Receiver Operating Characteristic (ROC) curves by comparing scores on the Toddler NutriSTEP to a comprehensive nutritional assessment of 200 toddlers with a registered dietitian (RD). The Toddler NutriSTEP was reliable between 2 administrations (ICC = 0.951, F = 20.53, p Toddler NutriSTEP were correlated (r = 0.67, p Toddler NutriSTEP questionnaire is both reliable and valid for screening for nutritional risk in toddlers.
Reliability and validity of a smartphone pulse rate application for the assessment of resting and elevated pulse rate.

Science.gov (United States)

Mitchell, Katy; Graff, Megan; Hedt, Corbin; Simmons, James

2016-08-01

Purpose/hypothesis: This study was designed to investigate the test-retest reliability, concurrent validity, and the standard error of measurement (SEm) of a pulse rate assessment application (Azumio®'s Instant Heart Rate) on both Android® and iOS® (iphone operating system) smartphones as compared to a FT7 Polar® Heart Rate monitor. Number of subjects: 111. Resting (sitting) pulse rate was assessed twice and then the participants were asked to complete a 1-min standing step test and then immediately re-assessed. The smartphone assessors were blinded to their measurements. Test-retest reliability (intraclass correlation coefficient [ICC 2,1] and 95% confidence interval) for the three tools at rest (time 1/time 2): iOS® (0.76 [0.67-0.83]); Polar® (0.84 [0.78-0.89]); and Android® (0.82 [0.75-0.88]). Concurrent validity at rest time 2 (ICC 2,1) with the Polar® device: IOS® (0.92 [0.88-0.94]) and Android® (0.95 [0.92-0.96]). Concurrent validity post-exercise (time 3) (ICC) with the Polar® device: iOS® (0.90 [0.86-0.93]) and Android® (0.94 [0.91-0.96]). The SEm values for the three devices at rest: iOS® (5.77 beats per minute [BPM]), Polar® (4.56 BPM) and Android® (4.96 BPM). The Android®, iOS®, and Polar® devices showed acceptable test-retest reliability at rest and post-exercise. Both the smartphone platforms demonstrated concurrent validity with the Polar® at rest and post-exercise. The Azumio® Instant Heart Rate application when used by either platform appears to be a reliable and valid tool to assess pulse rate in healthy individuals.
Reliability and validity of the Youth Leisure-time Sedentary Behavior Questionnaire (YLSBQ).

Science.gov (United States)

Cabanas-Sánchez, Verónica; Martínez-Gómez, David; Esteban-Cornejo, Irene; Castro-Piñero, José; Conde-Caveda, Julio; Veiga, Óscar L

2018-01-01

To develop a questionnaire able to assess time spent by youth in a wide range of leisure-time sedentary behaviors (SB) and evaluate its test-retest reliability and criterion validity. Cross-sectional observational. The reliability sample included 194 youth, aged 10-18 years, who completed the questionnaire twice, separated by one-week interval. The validity study comprised 1207 participants aged 8-18 years. Participants wore an accelerometer for 7 consecutive days. The questionnaire was designed to assess the amount of time spent in twelve different SB during weekdays and weekends, separately. In order to avoid usual phenomenon of time over reporting, values were adjusted to real available leisure-time (LT) for each participant. Reliability was assessed by using Intraclass Correlation Coefficients (ICC) and weighted (quadratic) kappa (k), and validity was assessed by using Pearson correlation and Bland-Altman plots. The reliability of questionnaire showed a moderate-to-substantial agreement for the most (91%) of items (k=0.43-0.74; ICC=0.41-0.79) with three items (4%) reaching an almost perfect agreement (ICC=0.82-0.83). Only 'sitting and talking' evidenced fair-to-moderate reliability (k=0.27-0.39; ICC=0.34-0.46). The relationship between average sedentary time assessed by the questionnaire and accelerometry was moderate (r=0.36; pquestionnaire and accelerometer sedentary time for average day (r=0.05; p=0.11) but Bland-Altman plots suggest moderate discrepancies between both methods of SB measurement (mean=19.86; limits of agreement=-280.04 to 319.76). The questionnaire showed moderate to good test-retest reliability and a moderate level of validity for assessing SB in youth, similar or slightly better to previously published in this population. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Short-interval test-retest interrater reliability of the Dutch version of the structured clinical interview for DSM-IV personality disorders (SCID-II)

NARCIS (Netherlands)

Weertman, A; ArntZ, A; Dreessen, L; van Velzen, C; Vertommen, S

2003-01-01

This study examined the short-interval test-retest reliability of the Structured Clinical Interview (SCID-II: First, Spitzer, Gibbon, & Williams, 1995) for DSM-IV personality disorders (PDs). The SCID-II was administered to 69 in- and outpatients on two occasions separated by 1 to 6 weeks. The
Reliability and validity of the visual analogue scale for disability in patients with chronic musculoskeletal pain.

Science.gov (United States)

Boonstra, Anne M; Schiphorst Preuper, Henrica R; Reneman, Michiel F; Posthumus, Jitze B; Stewart, Roy E

2008-06-01

To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional design was used. A general rehabilitation centre and a university rehabilitation centre was the setting for the study. The study population consisted of patients over 18 years of age, suffering from chronic musculoskeletal pain; 52 patients in the reliability study, 344 patients in the validity study. Main outcome measures were as follows. Reliability study: Spearman's correlation coefficients (rho values) of the test and retest data of the VAS for disability; validity study: rho values of the VAS disability scores with the scores on four domains of the Short-Form Health Survey (SF-36) and VAS pain scores, and with Roland-Morris Disability Questionnaire scores in chronic low back pain patients. Results were as follows: in the reliability study rho values varied from 0.60 to 0.77; and in the validity study rho values of VAS disability scores with SF-36 domain scores varied from 0.16 to 0.51, with Roland-Morris Disability Questionnaire scores from 0.38 to 0.43 and with VAS pain scores from 0.76 to 0.84. The conclusion of the study was that the reliability of the VAS for disability is moderate to good. Because of a weak correlation with other disability instruments and a strong correlation with the VAS for pain, however, its validity is questionable.
Validity and Reliability of the Abbreviated Barratt Impulsiveness Scale in Spanish (BIS-15S)*

Science.gov (United States)

Orozco-Cabal, Luis; Rodríguez, Maritza; Herin, David V.; Gempeler, Juanita; Uribe, Miguel

2010-01-01

Objective This study determined the validity and reliability of a new, abbreviated version of the Spanish Barratt Impulsiveness Scale (BIS-15S) in Colombian subjects. Method The BIS-15S was tested in non-clinical (n=283) and clinical (n=164) native Spanish-speakers. Intra-scale reliability was calculated using Cronbach’s α, and test-retest reliability was measured with Pearson correlations. Psychometric properties were determined using standard statistics. A factor analysis was performed to determine BIS-15S factor structure. Results 447 subjects participated in the study. Clinical subjects were older and more educated compared to non-clinical subjects. Impulsivity scores were normally distributed in each group. BIS-15S total, motor, non-planning and attention scores were significantly lower in non-clinical vs. clinical subjects. Subjects with substance-related disorders had the highest BIS-15S total scores, followed by subjects with bipolar disorders and bulimia nervosa/binge eating. Internal consistency was 0.793 and test-retest reliability was 0.80. Factor analysis confirmed a three-factor structure (attention, motor, non-planning) accounting for 47.87% of the total variance in BIS-15S total scores. Conclusions The BIS-15S is a valid and reliable self-report measure of impulsivity in this population. Further research is needed to determine additional components of impulsivity not investigated by this measure. PMID:21152412
Validity and reliability of a modified english version of the physical activity questionnaire for adolescents.

Science.gov (United States)

Aggio, Daniel; Fairclough, Stuart; Knowles, Zoe; Graves, Lee

2016-01-01

Adaptation of physical activity self-report questionnaires is sometimes required to reflect the activity behaviours of diverse populations. The processes used to modify self-report questionnaires though are typically underreported. This two-phased study used a formative approach to investigate the validity and reliability of the Physical Activity Questionnaire for Adolescents (PAQ-A) in English youth. Phase one examined test content and response process validity and subsequently informed a modified version of the PAQ-A. Phase two assessed the validity and reliability of the modified PAQ-A. In phase one, focus groups (n = 5) were conducted with adolescents (n = 20) to investigate test content and response processes of the original PAQ-A. Based on evidence gathered in phase one, a modified version of the questionnaire was administered to participants (n = 169, 14.5 ± 1.7 years) in phase two. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and intra-class correlations, respectively. Spearman correlations were used to assess associations between modified PAQ-A scores and accelerometer-derived physical activity, self-reported fitness and physical activity self-efficacy. Phase one revealed that the original PAQ-A was unrepresentative for English youth and that item comprehension varied. Contextual and population/cultural-specific modifications were made to the PAQ-A for use in the subsequent phase. In phase two, modified PAQ-A scores had acceptable internal consistency (α = 0.72) and test-retest reliability (ICC = 0.78). Modified PAQ-A scores were significantly associated with objectively assessed moderate-to-vigorous physical activity (r = 0.39), total physical activity (r = 0.42), self-reported fitness (r = 0.35), and physical activity self-efficacy (r = 0.32) (p ≤ 0.01). The modified PAQ-A had acceptable internal consistency and test-retest reliability. Modified PAQ-A scores

Validity and reliability of a nutrition knowledge survey for assessment in elementary school children.

Science.gov (United States)

Gower, Jared R; Moyer-Mileur, Laurie J; Wilkinson, Robert D; Slater, Hillarie; Jordan, Kristine C

2010-03-01

Limited surveys are available to assess the nutrition knowledge of children. The goals of this study were to test the validity and reliability of a computer nutrition knowledge survey for elementary school students and to evaluate the impact of the "Fit Kids 'r' Healthy Kids" nutrition intervention via the knowledge survey. During survey development, a sample (n=12) of health educators, elementary school teachers, and registered dietitians assessed the survey. The target population consisted of first- through fourth-grade students from Salt Lake City, UT, metropolitan area schools. Participants were divided into reliability (n=68), intervention (n=74), and control groups (n=59). The reliability group took the survey twice (2 weeks apart); the intervention and control groups also took the survey twice, but at pre- and post-intervention (4 weeks later). Only students from the intervention group participated in four weekly nutrition classes. Reliability was assessed by Pearson's correlation coefficients for knowledge scores. Results demonstrated appropriate content validity, as indicated by expert peer ratings. Test-retest reliability correlations were found to be significant for the overall survey (r=0.54; PNutrition knowledge was assessed upon program completion with paired samples t tests. Students from the intervention group demonstrated improvement in nutrition knowledge (12.2+/-1.9 to 13.5+/-1.6; Pnutrition survey demonstrated content validity and test-retest reliability for first- through fourth-grade elementary school children. Also, the study results imply that the Fit Kids 'r' Healthy Kids intervention promoted gains in nutrition knowledge. Overall, the computer survey shows promise as an appealing medium for assessing nutrition knowledge in children. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

Directory of Open Access Journals (Sweden)

Penny Moss

Full Text Available Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot. Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%. Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56 years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%. Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add
Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

Science.gov (United States)

Moss, Penny; Whitnell, Jasmine; Wright, Anthony

2016-01-01

Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot). Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%). Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56) years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%). Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add validity and
Research Review: Test-retest reliability of standardized diagnostic interviews to assess child and adolescent psychiatric disorders: a systematic review and meta-analysis.

Science.gov (United States)

Duncan, Laura; Comeau, Jinette; Wang, Li; Vitoroulis, Irene; Boyle, Michael H; Bennett, Kathryn

2018-02-19

A better understanding of factors contributing to the observed variability in estimates of test-retest reliability in published studies on standardized diagnostic interviews (SDI) is needed. The objectives of this systematic review and meta-analysis were to estimate the pooled test-retest reliability for parent and youth assessments of seven common disorders, and to examine sources of between-study heterogeneity in reliability. Following a systematic review of the literature, multilevel random effects meta-analyses were used to analyse 202 reliability estimates (Cohen's kappa = ҡ) from 31 eligible studies and 5,369 assessments of 3,344 children and youth. Pooled reliability was moderate at ҡ = .58 (CI 95% 0.53-0.63) and between-study heterogeneity was substantial (Q = 2,063 (df = 201), p reliability varied across informants for specific types of psychiatric disorder (ҡ = .53-.69 for parent vs. ҡ = .39-.68 for youth) with estimates significantly higher for parents on attention deficit hyperactivity disorder, oppositional defiant disorder and the broad groupings of externalizing and any disorder. Reliability was also significantly higher in studies with indicators of poor or fair study methodology quality (sample size reliability of SDIs and the usefulness of these tools in both clinical and research contexts. Potential remedies include the introduction of standardized study and reporting requirements for reliability studies, and exploration of other approaches to assessing and classifying child and adolescent psychiatric disorder. © 2018 Association for Child and Adolescent Mental Health.
Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

Science.gov (United States)

Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh

2018-03-01

Evaluation of dichotic listening to digits is a common part of many studies for diagnosis and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. To compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. A repeated measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered on the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1 ) coefficients and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87). SEM and SEM% of the ear scores in TA
Validity and Reliability of the Turkish Version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form.

Science.gov (United States)

Yalin Sapmaz, Şermin; Ergin, Dilek; Özek Erkuran, Handan; Şen Celasin, Nesrin; Öztürk, Masum; Karaarslan, Duygu; Köroğlu, Ertuğrul; Aydemir, Ömer

2017-09-01

This study assessed the validity and reliability of the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form for use among the Turkish population. The study group consisted of 30 patients that had been treated in a child psychiatry unit and diagnosed with posttraumatic stress disorder and 83 healthy volunteers that were attending middle or high school during the study period. For reliability analyses, the internal consistency coefficient and the test-retest correlation coefficient were measured. For validity analyses, the exploratory factor analysis and correlation analysis with the Child Posttraumatic Stress Reaction Index for concurrent validity were measured. The Cronbach's alpha (the internal consistency coefficient) of the scale was 0.909, and the test-retest correlation coefficient was 0.663. One factor that could explain 58.5% of the variance was obtained and was congruent with the original construct of the scale. As for concurrent validity, the scale showed high correlation with the Child Posttraumatic Stress Reaction Index. It was concluded that the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form can be used as a valid and reliable tool.
The Locomotor Capabilities Index; validity and reliability of the Swedish version in adults with lower limb amputation

Directory of Open Access Journals (Sweden)

Andersson Ingemar H

2009-05-01

Full Text Available Abstract Background The Locomotor Capabilities Index (LCI is a validated measure of lower-limb amputees' ability to perform activities with prosthesis. We have developed the LCI Swedish version and evaluated its validity and reliability. Methods Cross-cultural adaptation to Swedish included forward/backward translations and field testing. The Swedish LCI was then administered to 144 amputees (55 women, mean age 74 (40–93 years, attending post-rehabilitation prosthetic training. Construct validity was assessed by examining the relationship between the LCI and Timed "Up-and-Go" (TUG test and between the LCI and EQ-5D health utility index in 2 subgroups of 40 and 20 amputees, respectively. Discriminative validity was assessed by comparing scores in different age groups and in unilateral and bilateral amputees. Test-retest reliability (1–2 weeks was evaluated in 20 amputees (14 unilateral. Results The Swedish LCI showed good construct convergent validity, with high correlation with the TUG (r = -0.75 and the EQ-5D (r = 0.84, and discriminative validity, with significantly worse mean scores for older than younger and for bilateral than unilateral amputees (p Conclusion The Swedish version of the LCI demonstrated good validity and internal consistency in adult amputees. Test-retest reliability in a small subsample appears to be acceptable. The high ceiling effect of the LCI may imply that it would be most useful in assessing amputees with low to moderate functional abilities.
Development, reliability and validity of the psychosocial adaptation scale for Parkinson's disease in Chinese population.

Science.gov (United States)

Zhang, Tingting; Yin, Anchun; Sun, Xiaohong; Liu, Qigui; Song, Guirong; Li, Lianhong

2015-01-01

To develop psychosocial adaptation scale for Parkinson's disease (PD) in Chinese population and evaluate its reliability and validity. The items were designed by literature review, expert consultation and semi-structured interview. The methods of corrected item-total correlation, discrimination analysis and exploratory factor analysis were used for items selection. 427 valid scales from PD patients were collected in the study to test the reliability and validity. The scale incorporated six dimensions: anxiety, self-esteem, attitude, self-acceptance, self-efficacy and social support, a total of 32 items. The scale possessed good internal consistency. The test-retest correlation coefficient was 0.99 and average content validation rate was 0.97. The Hoehn and Yahr stage were correlated with total score of the scale. The psychosocial adaptation scale in this study showed good reliability and validity, it can be used as a reliable and valid instrument to evaluate the psychosocial adaptation of PD objectively and effectively.
The Modified Reasons for Smoking Scale: factorial structure, validity and reliability in pregnant smokers.

Science.gov (United States)

De Wilde, Katrien Sophie; Tency, Inge; Boudrez, Hedwig; Temmerman, Marleen; Maes, Lea; Clays, Els

2016-06-01

Smoking during pregnancy can cause several maternal and neonatal health risks, yet a considerable number of pregnant women continue to smoke. The objectives of this study were to test the factorial structure, validity and reliability of the Dutch version of the Modified Reasons for Smoking Scale (MRSS) in a sample of smoking pregnant women and to understand reasons for continued smoking during pregnancy. A longitudinal design was performed. Data of 97 pregnant smokers were collected during prenatal consultation. Structural equation modelling was performed to assess the construct validity of the MRSS: an exploratory factor analysis was conducted, followed by a confirmatory factor analysis.Test-retest reliability (addiction, pleasure, habit and social function. Results for internal consistency and test-retest reliability were good to acceptable. There were significant associations of nicotine dependence with tension reduction and addiction and of daily consumption with addiction and habit. Validity and reliability of the MRSS were shown in a sample of pregnant smokers. Tension reduction was the most important reason for continued smoking, followed by pleasure and addiction. Although the score for nicotine dependence was low, addiction was an important reason for continued smoking during pregnancy; therefore, nicotine replacement therapy could be considered. Half of the respondents experienced depressive symptoms. Hence, it is important to identify those women who need more specialized care, which can include not only smoking cessation counselling but also treatment for depression. © 2016 John Wiley & Sons, Ltd.
Development of a Tablet-based symbol digit modalities test for reliably assessing information processing speed in patients with stroke.

Science.gov (United States)

Tung, Li-Chen; Yu, Wan-Hui; Lin, Gong-Hong; Yu, Tzu-Ying; Wu, Chien-Te; Tsai, Chia-Yin; Chou, Willy; Chen, Mei-Hsiang; Hsieh, Ching-Lin

2016-09-01

To develop a Tablet-based Symbol Digit Modalities Test (T-SDMT) and to examine the test-retest reliability and concurrent validity of the T-SDMT in patients with stroke. The study had two phases. In the first phase, six experts, nine college students and five outpatients participated in the development and testing of the T-SDMT. In the second phase, 52 outpatients were evaluated twice (2 weeks apart) with the T-SDMT and SDMT to examine the test-retest reliability and concurrent validity of the T-SDMT. The T-SDMT was developed via expert input and college student/patient feedback. Regarding test-retest reliability, the practise effects of the T-SDMT and SDMT were both trivial (d=0.12) but significant (p≦0.015). The improvement in the T-SDMT (4.7%) was smaller than that in the SDMT (5.6%). The minimal detectable changes (MDC%) of the T-SDMT and SDMT were 6.7 (22.8%) and 10.3 (32.8%), respectively. The T-SDMT and SDMT were highly correlated with each other at the two time points (Pearson's r=0.90-0.91). The T-SDMT demonstrated good concurrent validity with the SDMT. Because the T-SDMT had a smaller practise effect and less random measurement error (superior test-retest reliability), it is recommended over the SDMT for assessing information processing speed in patients with stroke. Implications for Rehabilitation The Symbol Digit Modalities Test (SDMT), a common measure of information processing speed, showed a substantial practise effect and considerable random measurement error in patients with stroke. The Tablet-based SDMT (T-SDMT) has been developed to reduce the practise effect and random measurement error of the SDMT in patients with stroke. The T-SDMT had smaller practise effect and random measurement error than the SDMT, which can provide more reliable assessments of information processing speed.
Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

Science.gov (United States)

Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

2016-03-03

The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria, completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two way random effect model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SRMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r 0.69) CONCLUSIONS: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
Educational testing validity and reliability in pharmacy and medical education literature.

Science.gov (United States)

Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J

2013-12-16

To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; particles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
The validity and reliability of the four square step test in different adult populations: a systematic review.

Science.gov (United States)

Moore, Martha; Barker, Karen

2017-09-11

The four square step test (FSST) was first validated in healthy older adults to provide a measure of dynamic standing balance and mobility. The FSST has since been used in a variety of patient populations. The purpose of this systematic review is to determine the validity and reliability of the FSST in these different adult patient populations. The literature search was conducted to highlight all the studies that measured validity and reliability of the FSST. Six electronic databases were searched including AMED, CINAHL, MEDLINE, PEDro, Web of Science and Google Scholar. Grey literature was also searched for any documents relevant to the review. Two independent reviewers carried out study selection and quality assessment. The methodological quality was assessed using the QUADAS-2 tool, which is a validated tool for the quality assessment of diagnostic accuracy studies, and the COSMIN four-point checklist, which contains standards for evaluating reliability studies on the measurement properties of health instruments. Fifteen studies were reviewed studying community-dwelling older adults, Parkinson's disease, Huntington's disease, multiple sclerosis, vestibular disorders, post stroke, post unilateral transtibial amputation, knee pain and hip osteoarthritis. Three of the studies were of moderate methodological quality scoring low in risk of bias and applicability for all domains in the QUADAS-2 tool. Three studies scored "fair" on the COSMIN four-point checklist for the reliability components. The concurrent validity of the FSST was measured in nine of the studies with moderate to strong correlations being found. Excellent Intraclass Correlation Coefficients were found between physiotherapists carrying out the tests (ICC = .99) with good to excellent test-retest reliability shown in nine of the studies (ICC = .73-.98). The FSST may be an effective and valid tool for measuring dynamic balance and a participants' falls risk. It has been shown to have strong
Assessment of test-retest reliability and internal consistency of the Wisconsin Gait Scale in hemiparetic post-stroke patients

Directory of Open Access Journals (Sweden)

Guzik Agnieszka

2016-09-01

Full Text Available Introduction: A proper assessment of gait pattern is a significant aspect in planning the process of teaching gait in hemiparetic post-stroke patients. The Wisconsin Gait Scale (WGS is an observational tool for assessing post-stroke patients’ gait. The aim of the study was to assess test-retest reliability and internal consistency of the WGS and examine correlations between gait assessment made with the WGS and gait speed, Brunnström scale, Ashworth’s scale and the Barthel Index.
Validity and Reliability of the Clinical Competency Evaluation Instrument for Use among Physiotherapy Students: Pilot study.

Science.gov (United States)

Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh

2015-05-01

The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at University Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts were identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regards to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass coefficient correlation range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
Test-retest reliability of the assessment of postural stability in typically developing children and in hearing impaired children.

Science.gov (United States)

De Kegel, A; Dhooge, I; Cambier, D; Baetens, T; Palmans, T; Van Waelvelde, H

2011-04-01

The purpose of this study was to establish test-retest reliability of centre of pressure (COP) measurements obtained by an AccuGait portable forceplate (ACG), mean COG sway velocity measured by a Basic Balance Master (BBM) and clinical balance tests in children with and without balance difficulties. 49 typically developing children and 23 hearing impaired children, with a higher risk for stability problems, between 6 and 12 years of age participated. Each child performed the modified Clinical Test of Sensory Interaction on Balance (mCTSIB), Unilateral Stance (US) and Tandem Stance on ACG, mCTSIB and US on BBM and clinical balance tests: one-leg standing, balance beam walking and one-leg hopping. All subjects completed 2 test sessions on 2 different days in the same week assessed by the same examiner. Among COP measurements obtained by the ACG, mean sway velocity was the most reliable parameter with all ICCs higher than 0.72. The standard deviation (SD) of sway velocity, sway area, SD of anterior-posterior and SD of medio-lateral COP data showed moderate to excellent reliability with ICCs between 0.55 and 0.96 but some caution must be taken into account in some conditions. BBM is less reliable but clinical balance tests are as reliable as ACG. Hearing impaired children exhibited better relative reliability (ICC) and comparable absolute reliability (SEM) for most balance parameters compared to typically developing children. Reliable information regarding postural stability of typically developing children and hearing impaired children may be obtained utilizing COP measurements generated by an AccuGait system and clinical balance tests. Copyright © 2011 Elsevier B.V. All rights reserved.
Brain GABA Detection in vivo with the J-editing 1H MRS Technique: A Comprehensive Methodological Evaluation of Sensitivity Enhancement, Macromolecule Contamination and Test-Retest Reliability

Science.gov (United States)

Shungu, Dikoma C.; Mao, Xiangling; Gonzales, Robyn; Soones, Tacara N.; Dyke, Jonathan P.; van der Veen, Jan Willem; Kegeles, Lawrence S.

2016-01-01

Abnormalities in brain γ-aminobutyric acid (GABA) have been implicated in various neuropsychiatric and neurological disorders. However, in vivo GABA detection by proton magnetic resonance spectroscopy (1H MRS) presents significant challenges arising from low brain concentration, overlap by much stronger resonances, and contamination by mobile macromolecule (MM) signals. This study addresses these impediments to reliable brain GABA detection with the J-editing difference technique on a 3T MR system in healthy human subjects by (a) assessing the sensitivity gains attainable with an 8-channel phased-array head coil, (b) determining the magnitude and anatomic variation of the contamination of GABA by MM, and (c) estimating the test-retest reliability of measuring GABA with this method. Sensitivity gains and test-retest reliability were examined in the dorsolateral prefrontal cortex (DLPFC), while MM levels were compared across three cortical regions: the DLPFC, the medial prefrontal cortex (MPFC) and the occipital cortex (OCC). A 3-fold higher GABA detection sensitivity was attained with the 8-channel head coil compared to the standard single-channel head coil in DLPFC. Despite significant anatomic variation in GABA+MM and MM across the three brain regions (p GABA+MM was relatively stable across the three voxels, ranging from 41% to 49%, a non-significant regional variation (p = 0.58). The test-retest reliability of GABA measurement, expressed either as ratios to voxel tissue water (W) or total creatine, was found to be very high for both the single-channel coil and the 8-channel phased-array coil. For the 8-channel coil, for example, Pearson’s correlation coefficient of test vs. retest for GABA/W was 0.98 (R2 = 0.96, p = 0.0007), the percent coefficient of variation (CV) was 1.25%, and the intraclass correlation coefficient (ICC) was 0.98. Similar reliability was also found for the co-edited resonance of combined glutamate and glutamine (Glx) for both coils. PMID
Validation of a clinical critical thinking skills test in nursing

OpenAIRE

Shin, Sujin; Jung, Dukyoo; Kim, Sungeun

2015-01-01

Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school stud...
Test-retest reliabilty of exercise-induced hypoalgesia after aerobic exercise

DEFF Research Database (Denmark)

Vaegter, Henrik Bjarke; Dørge, Daniel Bandholtz; Schmidt, Kristian Sonne

2018-01-01

Objective: Exercise increases pressure pain thresholds (PPTs) in exercising and nonexercising muscles, known as exercise-induced hypoalgesia (EIH). No studies have investigated the test-retest reliability of change in PPTs after aerobic exercise. Primary objectives were to compare the effect...
A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders.

Science.gov (United States)

Stupar, Maja; Côté, Pierre; Beaton, Dorcas E; Boyle, Eleanor; Cassidy, J David

2015-01-01

The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). We performed a test-retest reliability study. We included insurance claimants from Ontario who were at least 18 years of age, within 21 days of their motor vehicle collision and diagnosed as having acute WAD grades I to III. The WDQ, a 13-item questionnaire scored from 0 (no disability) to 130 (complete disability), was administered to all participants at baseline and by telephone 3 days later. We computed the intraclass correlation coefficient (model 2,1) and the MDC with 95% confidence intervals (CIs; MDC95). The mean (SD) age of the 66 participants was 41.6 (12.7) years and 71.2% were female. Twenty-nine percent had WAD I and 71.2% had WAD II. Time since injury ranged from 0 to 19 days. The mean (SD) baseline WDQ score was 49.3 (28.8) and 46.5 (29.8) 3 days later. The intraclass correlation coefficient for the WDQ total score was 0.89 (95% CI, 0.85-0.92) in the entire sample and 0.83 (95% CI, 0.69-0.93) for the 15 participants reporting no change in neck pain. The MDC95 of the WDQ was 21.4 (SD = 14.9) for participants reporting no change. The WDQ was reliable in individuals with acute WAD. There is 95% confidence that a change of approximately one-sixth of the total score is beyond the daily variation of a stable condition. This level of measurement error must be taken into consideration when interpreting change in WDQ scores. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.

Test of Gross Motor Development : Expert Validity, confirmatory validity and internal consistence

Directory of Open Access Journals (Sweden)

Nadia Cristina Valentini

2008-12-01

Full Text Available The Test of Gross Motor Development (TGMD-2 is an instrument used to evaluate children’s level of motordevelopment. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by expertsand the confirmatory factorial validity and the internal consistence by means of test-retest of the Portuguese TGMD-2. Across-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionalsand 587 children, from 27 schools (kindergarten and elementary from 3 to 10 years old (51.1% boys and 48.9% girls.Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated thatthe Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices ofconfirmatory factorial validity (χ2/gl = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tuckerand Lewis’s Index of Fit = 0.83 and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88. ThePortuguese TGMD-2 demonstrated validity and reliability for the sample investigated.
Test of Gross Motor Development: expert validity, confirmatory validity and internal consistence

Directory of Open Access Journals (Sweden)

Nadia Cristina Valentini

2008-01-01

The Test of Gross Motor Development (TGMD-2 is an instrument used to evaluate children’s level of motor development. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by experts and the confirmatory factorial validity and the internal consistence by means of test-retest of the Portuguese TGMD-2. A cross-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionals and 587 children, from 27 schools (kindergarten and elementary from 3 to 10 years old (51.1% boys and 48.9% girls. Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated that the Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices of confirmatory factorial validity (÷2/gl = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tucker and Lewis’s Index of Fit = 0.83 and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88. The Portuguese TGMD-2 demonstrated validity and reliability for the sample investigated.
Sense of competence in dementia care staff (SCIDS) scale: development, reliability, and validity.

Science.gov (United States)

Schepers, Astrid Kristine; Orrell, Martin; Shanahan, Niamh; Spector, Aimee

2012-07-01

Sense of competence in dementia care staff (SCIDS) may be associated with more positive attitudes to dementia among care staff and better outcomes for those being cared for. There is a need for a reliable and valid measure of sense of competence specific to dementia care staff. This study describes the development and evaluation of a measure to assess "sense of competence" in dementia care staff and reports on its psychometric properties. The systematic measure development process involved care staff and experts. For item selection and assessment of psychometric properties, a pilot study (N = 37) and a large-scale study (N = 211) with a test-retest reliability (N = 58) sub-study were undertaken. The final measure consists of 17 items across four subscales with acceptable to good internal consistency and moderate to substantial test-retest reliability. As predicted, the measure was positively associated with work experience, job satisfaction, and person-centered approaches to dementia care, giving a first indication for its validity. The SCIDS scale provides a useful and user-friendly means of measuring sense of competence in care staff. It has been developed using a robust process and has adequate psychometric properties. Further exploration of the construct and the scale's validity is warranted. It may be useful to assess the impact of training and perceived abilities and skills in dementia care.
Reliability and Validity of Dual-Task Mobility Assessments in People with Chronic Stroke

Science.gov (United States)

Yang, Lei; He, Chengqi; Pang, Marco Yiu Chung

2016-01-01

Background The ability to perform a cognitive task while walking simultaneously (dual-tasking) is important in real life. However, the psychometric properties of dual-task walking tests have not been well established in stroke. Objective To assess the test-retest reliability, concurrent and known-groups validity of various dual-task walking tests in people with chronic stroke. Design Observational measurement study with a test-retest design. Methods Eighty-eight individuals with chronic stroke participated. The testing protocol involved four walking tasks (walking forward at self-selected and maximal speed, walking backward at self-selected speed, and crossing over obstacles) performed simultaneously with each of the three attention-demanding tasks (verbal fluency, serial 3 subtractions or carrying a cup of water). For each dual-task condition, the time taken to complete the walking task, the correct response rate (CRR) of the cognitive task, and the dual-task effect (DTE) for the walking time and CRR were calculated. Forty-six of the participants were tested twice within 3–4 days to establish test-retest reliability. Results The walking time in various dual-task assessments demonstrated good to excellent reliability [Intraclass correlation coefficient (ICC2,1) = 0.70–0.93; relative minimal detectable change at 95% confidence level (MDC95%) = 29%-45%]. The reliability of the CRR (ICC2,1 = 0.58–0.81) and the DTE in walking time (ICC2,1 = 0.11–0.80) was more varied. The reliability of the DTE in CRR (ICC2,1 = -0.31–0.40) was poor to fair. The walking time and CRR obtained in various dual-task walking tests were moderately to strongly correlated with those of the dual-task Timed-up-and-Go test, thus demonstrating good concurrent validity. None of the tests could discriminate fallers (those who had sustained at least one fall in the past year) from non-fallers. Limitation The results are generalizable to community-dwelling individuals with chronic stroke only
Reliability and validity of the Turkish version of the Berg Balance Scale.

Science.gov (United States)

Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu

2008-01-01

The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (pr=0.67 pr=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace.

Science.gov (United States)

Seguí, María del Mar; Cabrero-García, Julio; Crespo, Ana; Verdú, José; Ronda, Elena

2015-06-01

To design and validate a questionnaire to measure visual symptoms related to exposure to computers in the workplace. Our computer vision syndrome questionnaire (CVS-Q) was based on a literature review and validated through discussion with experts and performance of a pretest, pilot test, and retest. Content validity was evaluated by occupational health, optometry, and ophthalmology experts. Rasch analysis was used in the psychometric evaluation of the questionnaire. Criterion validity was determined by calculating the sensitivity and specificity, receiver operator characteristic curve, and cutoff point. Test-retest repeatability was tested using the intraclass correlation coefficient (ICC) and concordance by Cohen's kappa (κ). The CVS-Q was developed with wide consensus among experts and was well accepted by the target group. It assesses the frequency and intensity of 16 symptoms using a single rating scale (symptom severity) that fits the Rasch rating scale model well. The questionnaire has sensitivity and specificity over 70% and achieved good test-retest repeatability both for the scores obtained [ICC = 0.802; 95% confidence interval (CI): 0.673, 0.884] and CVS classification (κ = 0.612; 95% CI: 0.384, 0.839). The CVS-Q has acceptable psychometric properties, making it a valid and reliable tool to control the visual health of computer workers, and can potentially be used in clinical trials and outcome research. Copyright © 2015 Elsevier Inc. All rights reserved.
Development of Chinese Military Personnel Social Support Scale and tests for its reliability and validity

Directory of Open Access Journals (Sweden)

Kai-hong TANG

2013-01-01

Full Text Available Objective 　To develop Chinese Military Personnel Social Support Scaleand verify its reliability and validity. Methods 　The Chinese Military Personnel Social Support Scalewas initiated, organized and compiled based upon open-ended questionnaire survey done in a systematic manner, and previous researches were taken as references. A total of 630 military personnel were chosen by random cluster sampling and tested with the Scale, among them 50 were tested with Social Support Rating Scale(SSRS and Chinese Military Psychosomatic Health Scale(CMPHS simultaneously, and the test was done solely a second time with CMPHS 2 weeks later. The reliability and validity were assessed and verified by exploratory factor analysis, confirmatory factor analysis and correlation analysis. Results 　The Chinese Military Personnel Social Support Scalecomprised three factors, namely subjective support, objective support and utility of social support. Eighteen items were left in official scale after amendment by factor analysis, and one lying subscale was added. The correlation coefficients between the public factors ranged from 0.477 to 0.589 (P<0.01, and the correlation coefficients between factors and total scale ranged from 0.721 to 0.823 (P<0.01. The test-retest correlation coefficients of total scale and subscales ranged from 0.622 to 0.803 (P<0.01, the Cronbach α coefficients ranged from 0.624 to 0.874, and the split-half correlation coefficients ranged from 0.551 to 0.828. Significant correlation existed between this Scale and two criterion scales, namely SSRS and CMPHS. Conclusion 　It is verified that the Chinese Military Personnel Social Support Scalehas excellent reliability and validity, and complying with psychometric standards, it may be used to evaluate the social support level of Chinese military personnel.
German validation of the Conners Adult ADHD Rating Scales (CAARS) II: reliability, validity, diagnostic sensitivity and specificity.

Science.gov (United States)

Christiansen, H; Kis, B; Hirsch, O; Matthies, S; Hebebrand, J; Uekermann, J; Abdel-Hamid, M; Kraemer, M; Wiltfang, J; Graf, E; Colla, M; Sobanski, E; Alm, B; Rösler, M; Jacob, C; Jans, T; Huss, M; Schimmelmann, B G; Philipsen, A

2012-07-01

The German version of the Conners Adult ADHD Rating Scales (CAARS) has proven to show very high model fit in confirmative factor analyses with the established factors inattention/memory problems, hyperactivity/restlessness, impulsivity/emotional lability, and problems with self-concept in both large healthy control and ADHD patient samples. This study now presents data on the psychometric properties of the German CAARS-self-report (CAARS-S) and observer-report (CAARS-O) questionnaires. CAARS-S/O and questions on sociodemographic variables were filled out by 466 patients with ADHD, 847 healthy control subjects that already participated in two prior studies, and a total of 896 observer data sets were available. Cronbach's-alpha was calculated to obtain internal reliability coefficients. Pearson correlations were performed to assess test-retest reliability, and concurrent, criterion, and discriminant validity. Receiver Operating Characteristics (ROC-analyses) were used to establish sensitivity and specificity for all subscales. Coefficient alphas ranged from .74 to .95, and test-retest reliability from .85 to .92 for the CAARS-S, and from .65 to .85 for the CAARS-O. All CAARS subscales, except problems with self-concept correlated significantly with the Barrett Impulsiveness Scale (BIS), but not with the Wender Utah Rating Scale (WURS). Criterion validity was established with ADHD subtype and diagnosis based on DSM-IV criteria. Sensitivity and specificity were high for all four subscales. The reported results confirm our previous study and show that the German CAARS-S/O do indeed represent a reliable and cross-culturally valid measure of current ADHD symptoms in adults. Copyright © 2011 Elsevier Masson SAS. All rights reserved.
The Nutrition Literacy Assessment Instrument is a Valid and Reliable Measure of Nutrition Literacy in Adults with Chronic Disease.

Science.gov (United States)

Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K

2018-03-01

To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
The revised Generalized Expectancy for Success Scale: a validity and reliability study.

Science.gov (United States)

Hale, W D; Fiedler, L R; Cochran, C D

1992-07-01

The Generalized Expectancy for Success Scale (GESS; Fibel & Hale, 1978) was revised and assessed for reliability and validity. The revised version was administered to 199 college students along with other conceptually related measures, including the Rosenberg Self-Esteem Scale, the Life Orientation Test, and Rotter's Internal-External Locus of Control Scale. One subsample of students also completed the Eysenck Personality Inventory, while another subsample performed a criterion-related task that involved risk taking. Item analysis yielded 25 items with correlations of .45 or higher with the total score. Results indicated high internal consistency and test-retest reliability.
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah's Dental Anxiety Scale.

Science.gov (United States)

Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha

2018-01-01

Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah's Dental Anxiety Scale (CDAS) in Hindi speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach's alpha and intra class correlation 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach's alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study.
Translation of oswestry disability index into Tamil with cross cultural adaptation and evaluation of reliability and validity(§).

Science.gov (United States)

Vincent, Joshua Israel; Macdermid, Joy Christine; Grewal, Ruby; Sekar, Vincent Prabhakaran; Balachandran, Dinesh

2014-01-01

Prospective longitudinal validation study. To translate and cross-culturally adapt the Oswestry Disability Index (ODI) to the Tamil language (ODI-T), and to evaluate its reliability and construct validity. ODI is widely used as a disease specific questionnaire in back pain patients to evaluate pain and disability. A thorough literature search revealed that the Tamil version of the ODI has not been previously published. The ODI was translated and cross-culturally adapted to the Tamil language according to established guidelines. 30 subjects (16 women and 14 men) with a mean age of 42.7 years (S.D. 13.6; Range 22 - 69) with low back pain were recruited to assess the psychometric properties of the ODI-T Questionnaire. Patients completed the ODI-T, Roland-Morris disability questionnaire (RMDQ), VAS-pain and VAS-disability at baseline and 24-72 hours from the baseline visit. The ODI-T displayed a high degree of internal consistency, with a Cronbach's alpha of 0.92. The test-retest reliability was high (n=30) with an ICC of 0.92 (95% CI, 0.84 to 0.96) and a mean re-test difference of 2.6 points lower on re-test. The ODI-T scores exhibited a strong correlation with the RMDQ scores (r = 0.82) pTamil version of the ODI Questionnaire is a valid and reliable tool that can be used to measure subjective outcomes of pain and disability in Tamil speaking patients with low back pain.
Reliability and validity of the Physical Activity Scale for the Elderly (PASE in patients with hip osteoarthritis

Directory of Open Access Journals (Sweden)

Svege Ida

2012-02-01

Full Text Available Abstract Background Physical activity (PA is beneficial in reducing pain and improving function in lower limb osteoarthritis (OA, and is recommended as a first line treatment. Self-administered questionnaires are used to assess PA, but knowledge about reliability and validity of these PA questionnaires are limited, in particular for patients with OA. The purpose of this study was to evaluate the reliability and validity of the Physical Activity Scale for the Elderly (PASE in patients with hip OA. Methods Forty patients with hip OA (20 men and 20 women, mean age 61.3 ± 10 years were included. For test-retest reliability PASE was administered twice with a mean time between tests of 9 ± 4 days. Intraclass correlation coefficient (ICC, standard error of measurement (SEM and minimal detectable change (MDC were calculated for the total score and for the particular items assessing different PA intensity levels. In addition a Bland-Altman analysis for the total PASE score was performed. Construct validity was evaluated by comparing the PASE results with the Actigraph GT1M accelerometer and the International Physical Activity Questionnaire (IPAQ, using the Spearman rank correlation coefficient. Results ICC for the total PASE score was 0.78, with relatively large error of measurement; SEM = 31 and MDC = 87. ICC for the intensity items was 0.20 for moderate PA intensity, 0.46 for light PA intensity and to 0.68 for vigorous PA intensity. The Spearman rank correlation coefficient between the Actigraph GT1M total counts per minute and the total PASE score was 0.30 (p = 0.089, and ranging from 0.20-0.38 for the different PA intensity categories. The Spearman rank correlation between IPAQ and PASE was 0.61 (p = 0.001 for the total scores. Conclusions In patients with hip OA the test-retest reliability of the total PASE score was moderate, with acceptable ICC, but with large measurement errors. The construct validity of the PASE was poor when compared to the
Development, reliability and validity of the psychosocial adaptation scale for Parkinson’s disease in Chinese population

Science.gov (United States)

Zhang, Tingting; Yin, Anchun; Sun, Xiaohong; Liu, Qigui; Song, Guirong; Li, Lianhong

2015-01-01

Objective: To develop psychosocial adaptation scale for Parkinson’s disease (PD) in Chinese population and evaluate its reliability and validity. Methods: The items were designed by literature review, expert consultation and semi-structured interview. The methods of corrected item-total correlation, discrimination analysis and exploratory factor analysis were used for items selection. 427 valid scales from PD patients were collected in the study to test the reliability and validity. Results: The scale incorporated six dimensions: anxiety, self-esteem, attitude, self-acceptance, self-efficacy and social support, a total of 32 items. The scale possessed good internal consistency. The test-retest correlation coefficient was 0.99 and average content validation rate was 0.97. The Hoehn and Yahr stage were correlated with total score of the scale. Conclusions: The psychosocial adaptation scale in this study showed good reliability and validity, it can be used as a reliable and valid instrument to evaluate the psychosocial adaptation of PD objectively and effectively. PMID:26770638
Reliability and validity of neurobehavioral function on the Psychology Experimental Building Language test battery in young adults.

Science.gov (United States)

Piper, Brian J; Mueller, Shane T; Geerken, Alexander R; Dixon, Kyle L; Kroliczak, Gregory; Olsen, Reid H J; Miller, Jeremy K

2015-01-01

Background. The Psychology Experiment Building Language (PEBL) software consists of over one-hundred computerized tests based on classic and novel cognitive neuropsychology and behavioral neurology measures. Although the PEBL tests are becoming more widely utilized, there is currently very limited information about the psychometric properties of these measures. Methods. Study I examined inter-relationships among nine PEBL tests including indices of motor-function (Pursuit Rotor and Dexterity), attention (Test of Attentional Vigilance and Time-Wall), working memory (Digit Span Forward), and executive-function (PEBL Trail Making Test, Berg/Wisconsin Card Sorting Test, Iowa Gambling Test, and Mental Rotation) in a normative sample (N = 189, ages 18-22). Study II evaluated test-retest reliability with a two-week interest interval between administrations in a separate sample (N = 79, ages 18-22). Results. Moderate intra-test, but low inter-test, correlations were observed and ceiling/floor effects were uncommon. Sex differences were identified on the Pursuit Rotor (Cohen's d = 0.89) and Mental Rotation (d = 0.31) tests. The correlation between the test and retest was high for tests of motor learning (Pursuit Rotor time on target r = .86) and attention (Test of Attentional Vigilance response time r = .79), intermediate for memory (digit span r = .63) but lower for the executive function indices (Wisconsin/Berg Card Sorting Test perseverative errors = .45, Tower of London moves = .15). Significant practice effects were identified on several indices of executive function. Conclusions. These results are broadly supportive of the reliability and validity of individual PEBL tests in this sample. These findings indicate that the freely downloadable, open-source PEBL battery (http://pebl.sourceforge.net) is a versatile research tool to study individual differences in neurocognitive performance.
Reliability and validity of neurobehavioral function on the Psychology Experimental Building Language test battery in young adults

Directory of Open Access Journals (Sweden)

Brian J. Piper

2015-12-01

Full Text Available Background. The Psychology Experiment Building Language (PEBL software consists of over one-hundred computerized tests based on classic and novel cognitive neuropsychology and behavioral neurology measures. Although the PEBL tests are becoming more widely utilized, there is currently very limited information about the psychometric properties of these measures.Methods. Study I examined inter-relationships among nine PEBL tests including indices of motor-function (Pursuit Rotor and Dexterity, attention (Test of Attentional Vigilance and Time-Wall, working memory (Digit Span Forward, and executive-function (PEBL Trail Making Test, Berg/Wisconsin Card Sorting Test, Iowa Gambling Test, and Mental Rotation in a normative sample (N = 189, ages 18–22. Study II evaluated test–retest reliability with a two-week interest interval between administrations in a separate sample (N = 79, ages 18–22.Results. Moderate intra-test, but low inter-test, correlations were observed and ceiling/floor effects were uncommon. Sex differences were identified on the Pursuit Rotor (Cohen’s d = 0.89 and Mental Rotation (d = 0.31 tests. The correlation between the test and retest was high for tests of motor learning (Pursuit Rotor time on target r = .86 and attention (Test of Attentional Vigilance response time r = .79, intermediate for memory (digit span r = .63 but lower for the executive function indices (Wisconsin/Berg Card Sorting Test perseverative errors = .45, Tower of London moves = .15. Significant practice effects were identified on several indices of executive function.Conclusions. These results are broadly supportive of the reliability and validity of individual PEBL tests in this sample. These findings indicate that the freely downloadable, open-source PEBL battery (http://pebl.sourceforge.net is a versatile research tool to study individual differences in neurocognitive performance.
Optimal number of tests to achieve and validate product reliability

International Nuclear Information System (INIS)

Ahmed, Hussam; Chateauneuf, Alaa

2014-01-01

The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces
Using personality item characteristics to predict single-item reliability, retest reliability, and self-other agreement

NARCIS (Netherlands)

de Vries, Reinout Everhard; Realo, Anu; Allik, Jüri

2016-01-01

The use of reliability estimates is increasingly scrutinized as scholars become more aware that test–retest stability and self–other agreement provide a better approximation of the theoretical and practical usefulness of an instrument than its internal reliability. In this study, we investigate item
Validity and Reliability of the Arabic Token Test for Children

Science.gov (United States)

Alkhamra, Rana A.; Al-Jazi, Aya B.

2016-01-01

Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…
Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

Science.gov (United States)

Badjadi, Nour El Imane

2013-01-01

The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…

Construction of Valid and Reliable Test for Assessment of Students

Science.gov (United States)

Osadebe, P. U.

2015-01-01

The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
Cross-Cultural Adaptation, Validation, and Reliability Testing of the Modified Oswestry Disability Questionnaire in Persian Population with Low Back Pain.

Science.gov (United States)

Baradaran, Aslan; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza

2016-04-01

Prospective study. We aimed to validate the Persian version of the modified Oswestry disability questionnaire (MODQ) in patients with low back pain. Modified Oswestry low back pain disability questionnaire is a well-known condition-specific outcome measure that helps quantify disability in patients with lumbar syndromes. To test the validity in a pilot study, the Persian MODQ was administered to 25 individuals with low back pain. We then enrolled 200 consecutive patients with low back pain to fill the Persian MODQ as well as the short form 36 (SF-36) questionnaire. Convergent validity of the MODQ was tested using the Spearman's correlation coefficient between the MODQ and SF-36 subscales. Intraclass correlation coefficient (ICC) and Cronbach's α coefficient were measured to test the reliability between test and retest and internal consistency of all items, respectively. ICC for individual items ranged from 0.43 to 0.80 showing good reliability and reproducibility of each individual item. Cronbach's α coefficient was 0.69 showing good internal consistency across all 10 items of the Persian MODQ. Total MODQ score showed moderate to strong correlation with the eight subscales and the two domains of the SF-36. The highest correlation was between the MODQ and the physical functioning subscale of the SF-36 (r=-0.54, pPersian version of the MODQ is a valid and reliable tool for the assessment of the disability following low back pain.
Construct validity and reliability of the Music Attentiveness Screening Assessment (MASA).

Science.gov (United States)

Waldon, Eric G; Broadhurst, Emily

2014-01-01

Music as alternate engagement (MAE) can be used effectively to distract children during painful or anxiety-provoking medical procedures. For such interventions to be successful, it would seem important to assess the degree to which a child can attend to musical stimuli. The purposes of this study were as follows: (a) To establish construct validity by determining the extent to which the Music Attentiveness Screening Assessment (MASA) measures auditory attention; and (b) to gather evidence regarding MASA test-retest and inter-observer reliability. The Auditory Attention (AA) subtest from the NEPSY-II (NEPSY, Second Edition) and the two items from MASA were administered to a nonclinical sample of children (N = 50) aged 5 to 9 years. There was a statistically significant proportion of AA score variance shared with MASA (both items), R (2) = .21, F(2, 47) = 6.34, p = .004. Test-retest reliability on the first MASA item was moderately high (Pearson r = .84) while on the second item it was lower (r = .63). Similarly, interobserver agreement was high for Item I (intraclass correlation coefficient [ICC] = .95) and lower for Item II (ICC = .71). Evidence suggests that MASA measures, at least in part, auditory attention. Despite this finding, a large proportion of unexplained variance remains. Furthermore, reliability estimates (test-retest and interobserver agreement) differ between both items. These findings are discussed with particular attention paid to the ways in which MASA should be revised and further study conducted. © the American Music Therapy Association 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
[Reliability and validity of warning signs checklist for screening psychological, behavioral and developmental problems of children].

Science.gov (United States)

Huang, X N; Zhang, Y; Feng, W W; Wang, H S; Cao, B; Zhang, B; Yang, Y F; Wang, H M; Zheng, Y; Jin, X M; Jia, M X; Zou, X B; Zhao, C X; Robert, J; Jing, Jin

2017-06-02

Objective: To evaluate the reliability and validity of warning signs checklist developed by the National Health and Family Planning Commission of the People's Republic of China (NHFPC), so as to determine the screening effectiveness of warning signs on developmental problems of early childhood. Method: Stratified random sampling method was used to assess the reliability and validity of checklist of warning sign and 2 110 children 0 to 6 years of age(1 513 low-risk subjects and 597 high-risk subjects) were recruited from 11 provinces of China. The reliability evaluation for the warning signs included the test-retest reliability and interrater reliability. With the use of Age and Stage Questionnaire (ASQ) and Gesell Development Diagnosis Scale (GESELL) as the criterion scales, criterion validity was assessed by determining the correlation and consistency between the screening results of warning signs and the criterion scales. Result: In terms of the warning signs, the screening positive rates at different ages ranged from 10.8%(21/141) to 26.2%(51/137). The median (interquartile) testing time for each subject was 1(0.6) minute. Both the test-retest reliability and interrater reliability of warning signs reached 0.7 or above, indicating that the stability was good. In terms of validity assessment, there was remarkable consistency between ASQ and warning signs, with the Kappa value of 0.63. With the use of GESELL as criterion, it was determined that the sensitivity of warning signs in children with suspected developmental delay was 82.2%, and the specificity was 77.7%. The overall Youden index was 0.6. Conclusion: The reliability and validity of warning signs checklist for screening early childhood developmental problems have met the basic requirements of psychological screening scales, with the characteristics of short testing time and easy operation. Thus, this warning signs checklist can be used for screening psychological and behavioral problems of early childhood
Reliability and validity of a brief method to assess nociceptive flexion reflex (NFR) threshold.

Science.gov (United States)

Rhudy, Jamie L; France, Christopher R

2011-07-01

The nociceptive flexion reflex (NFR) is a physiological tool to study spinal nociception. However, NFR assessment can take several minutes and expose participants to repeated suprathreshold stimulations. The 4 studies reported here assessed the reliability and validity of a brief method to assess NFR threshold that uses a single ascending series of stimulations (Peak 1 NFR), by comparing it to a well-validated method that uses 3 ascending/descending staircases of stimulations (Staircase NFR). Correlations between the NFR definitions were high, were on par with test-retest correlations of Staircase NFR, and were not affected by participant sex or chronic pain status. Results also indicated the test-retest reliabilities for the 2 definitions were similar. Using larger stimulus increments (4 mAs) to assess Peak 1 NFR tended to result in higher NFR threshold estimates than using the Staircase NFR definition, whereas smaller stimulus increments (2 mAs) tended to result in lower NFR threshold estimates than the Staircase NFR definition. Neither NFR definition was correlated with anxiety, pain catastrophizing, or anxiety sensitivity. In sum, a single ascending series of electrical stimulations results in a reliable and valid estimate of NFR threshold. However, caution may be warranted when comparing NFR thresholds across studies that differ in the ascending stimulus increments. This brief method to assess NFR threshold is reliable and valid; therefore, it should be useful to clinical pain researchers interested in quickly assessing inter- and intra-individual differences in spinal nociceptive processes. Copyright © 2011 American Pain Society. Published by Elsevier Inc. All rights reserved.
Reliability and validity of a novel Kinect-based software program for measuring posture, balance and side-bending.

Science.gov (United States)

Grooten, Wilhelmus Johannes Andreas; Sandberg, Lisa; Ressman, John; Diamantoglou, Nicolas; Johansson, Elin; Rasmussen-Barr, Eva

2018-01-08

Clinical examinations are subjective and often show a low validity and reliability. Objective and highly reliable quantitative assessments are available in laboratory settings using 3D motion analysis, but these systems are too expensive to use for simple clinical examinations. Qinematic™ is an interactive movement analyses system based on the Kinect camera and is an easy-to-use clinical measurement system for assessing posture, balance and side-bending. The aim of the study was to test the test-retest the reliability and construct validity of Qinematic™ in a healthy population, and to calculate the minimal clinical differences for the variables of interest. A further aim was to identify the discriminative validity of Qinematic™ in people with low-back pain (LBP). We performed a test-retest reliability study (n = 37) with around 1 week between the occasions, a construct validity study (n = 30) in which Qinematic™ was tested against a 3D motion capture system, and a discriminative validity study, in which a group of people with LBP (n = 20) was compared to healthy controls (n = 17). We tested a large range of psychometric properties of 18 variables in three sections: posture (head and pelvic position, weight distribution), balance (sway area and velocity in single- and double-leg stance), and side-bending. The majority of the variables in the posture and balance sections, showed poor/fair reliability (ICC validity (Spearman reliability (ICC =0.898), excellent validity (r = 0.943), and Qinematic™ could differentiate between LPB and healthy individuals (p = 0.012). This paper shows that a novel software program (Qinematic™) based on the Kinect camera for measuring balance, posture and side-bending has poor psychometric properties, indicating that the variables on balance and posture should not be used for monitoring individual changes over time or in research. Future research on the dynamic tasks of Qinematic™ is warranted.
Scale for positive aspects of caregiving experience: development, reliability, and factor structure.

Science.gov (United States)

Kate, N; Grover, S; Kulhara, P; Nehra, R

2012-06-01

OBJECTIVE. To develop an instrument (Scale for Positive Aspects of Caregiving Experience [SPACE]) that evaluates positive caregiving experience and assess its psychometric properties. METHODS. Available scales which assess some aspects of positive caregiving experience were reviewed and a 50-item questionnaire with a 5-point rating was constructed. In all, 203 primary caregivers of patients with severe mental disorders were asked to complete the questionnaire. Internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity were evaluated. Principal component factor analysis was run to assess the factorial validity of the scale. RESULTS. The scale developed as part of the study was found to have good internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity. Principal component factor analysis yielded a 4-factor structure, which also had good test-retest reliability and cross-language reliability. There was a strong correlation between the 4 factors obtained. CONCLUSION. The SPACE developed as part of this study has good psychometric properties.
The validity and reliability of the Moroccan version of the Revised Fibromyalgia Impact Questionnaire.

Science.gov (United States)

Srifi, Najlaa; Bahiri, Rachid; Rostom, Samira; Bendeddouche, Imad; Lazrek, Noufissa; Hajjaj-Hassouni, Najia

2013-01-01

The Revised Fibromyalgia Impact Questionnaire (FIQ-R) is an updated version of the FIQ attempts to address the limitations of the Fibromyalgia Impact Questionnaire (FIQ). As there is no Moroccan version of the FIQ-R available, we aimed to investigate the validity and reliability of a Moroccan translation of the FIQR in Moroccan fibromyalgia (FM) patients. After translating the FIQR into Moroccan, it was administered to 80 patients with FM. All of the patients filled out the questionnaire together with Arabic version of short form-36 (SF-36). The tender-point count was calculated from tender points identified by thumb palpation. Three days later, FM patients filled out the Moroccan FIQR at their second visit. The test-retest reliability of the Moroccan FIQR questions ranged from 0.72 to 0.87. The test and retest reliability of total FIQR score was 0.84. Cronbach's alpha was 0.91 for FIQR visit 1 (the first assessment) and 0.92 for FIQR visit 2 (the second assessment), indicating acceptable levels of internal consistency for both assessments. Significant correlations for construct validity were obtained between the Moroccan FIQ-R total and domain scores and the subscales of the SF-36 (FIQR total versus SF-36 physical component score and mental component score were r = -0.69, P FIQ-R showed adequate reliability and validity. This instrument can be used in the clinical evaluation of Moroccan and Arabic-speaking patients with FM.
Cross-cultural Adaptation, Reliability, and Validity of the Yoruba Version of the Roland-Morris Disability Questionnaire.

Science.gov (United States)

Mbada, Chidozie Emmanuel; Idowu, Opeyemi Ayodiipo; Ogunjimi, Olawale Richard; Ayanniyi, Olusola; Orimolade, Elkanah Ayodele; Oladiran, Ajibola Babatunde; Johnson, Olubusola Esther; Akinsulore, Adesanmi; Oni, Temitope Olawale

2017-04-01

A translation, cross-cultural adaptation, and psychometric analysis. The aim of this study was to translate, cross-culturally adapt, and validate the Yoruba version of the RMDQ. The Roland-Morris Disability Questionnaire (RMDQ) is a valid outcome tool for low back pain (LBP) in clinical and research settings. There seems to be no valid and reliable version of the RMDQ in the Nigerian languages. Following the Guillemin criteria, the English version of the RMDQ was forward and back translated. Two Yoruba translated versions of the RMDQ were assessed for clarity, common language usage, and conceptual equivalence. Consequently, a harmonized Yoruba version was produced and was pilot-tested among 20 patients with nonspecific long-term LBP (NSLBP) for cognitive debriefing. The final version of the Yoruba RMDQ was tested for its construct validity and re-retest reliability among 120 and 87 patients with NSLBP, respectively. Pearson product moment correlation coefficient (r) of 0.82 was obtained for reliability of the Yoruba version of the RMDQ. The test-retest reliability of the Yoruba RMDQ yielded Cronbach alpha 0.932, while the intraclass correlation (ICC) ranged between 0.896 and 0.956. The analysis of the global scores of both the English and Yoruba versions of the RMDQ yielded ICC value of between 0.995 (95% confidence interval 0.996-0.997), with the item-by-item Kappa agreement ranging between 0.824 and 1.000. The external validity of RMDQ using Quadruple Visual Analogue Scale was r = -0.596 (P = 0.001). The Yoruba version of the RMDQ had no floor/ceiling effects, as no patient achieved either of the maximum or the minimum possible scores. The Yoruba version of the RMDQ has excellent reliability and validity and may be an appropriate outcome tool for clinical and research purposes among Yoruba-speaking patients with LBP. 3.
Cross-cultural adaptation, reliability and validity of the Arabic version of the reduced Western Ontario and McMaster Universities Osteoarthritis index in patients with knee osteoarthritis.

Science.gov (United States)

Alghadir, Ahmad; Anwer, Shahnawaz; Iqbal, Zaheen Ahmed; Alsanawi, Hisham Abdulaziz

2016-01-01

We adapted the reduced Western Ontario and McMaster Universities Osteoarthritis (WOMAC) index for the Arabic language and tested its metric properties in patients with knee osteoarthritis (OA). One hundred and twenty-one consecutive patients who were referred for physiotherapy to the outpatient department were asked to answer the Arabic version of the reduced WOMAC index (ArWOMAC). After the completion of the ArWOMAC, the intensity of knee pain and general health status were assessed using the visual analog scale (VAS) and the 12-item short form health survey (SF-12), respectively. A second assessment was performed at least 48 h after the first session to assess test-retest reliability. The test-retest reliability was quantified using the intra-class correlation coefficient (ICC), and Cronbach's alpha was calculated to assess the internal consistency of the Arabic questionnaire. The construct validity was assessed using Spearman rank correlation coefficients. The total ArWOMAC scale and pain and function subscales were internally consistent with Cronbach's coefficient alpha of 0.91, 0.89 and 0.90, respectively. Test-retest reliability was good to excellent with ICC of 0.91, 0.89 and 0.90, respectively. SF-12 and VAS score significantly correlated with ArWOMAC index (p < 0.01), which support the construct validity. The standard error of measurement (SEM) of the total scale was 2.94, based on repeated measurements for test-retest. The minimum detectable change based on the SEM for test-retest was 8.15. The ArWOMAC index is a reliable and valid instrument for evaluating the severity of knee OA, with metric properties in agreement with the original version. Although, the reduced WOMAC index has been clinically utilized within the Saudi population, the Arabic version of this instrument is not validated for an Arab population to measure lower limb functional disability caused by OA. The Arabic version of reduced WOMAC (ArWOMAC) index is a reliable and valid scale
Reliability and Validity of the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2) in Adults with Non-Cancer Pain

Science.gov (United States)

Hayes, Corey J.; Bhandari, Naleen Raj; Kathe, Niranjan; Payakachat, Nalin

2017-01-01

Limited evidence exists on how non-cancer pain (NCP) affects an individual’s health-related quality of life (HRQoL). This study aimed to validate the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2), a generic measure of HRQoL, in a NCP cohort using the Medical Expenditure Panel Survey Longitudinal Files. The SF Mental Component Summary (MCS12) and SF Physical Component Summary (PCS12) were tested for reliability (internal consistency and test-retest reliability) and validity (construct: convergent and discriminant; criterion: concurrent and predictive). A total of 15,716 patients with NCP were included in the final analysis. The MCS12 and PCS12 demonstrated high internal consistency (Cronbach’s alpha and Mosier’s alpha > 0.8), and moderate and high test-retest reliability, respectively (MCS12 intraclass correlation coefficient (ICC): 0.64; PCS12 ICC: 0.73). Both scales were significantly associated with a number of chronic conditions (p reliable and valid measure of HRQoL for patients with NCP. PMID:28445438
Validity and reliability of a Malay version of the Lawton instrumental activities of daily living scale among the Malay speaking elderly in Malaysia.

Science.gov (United States)

Kadar, Masne; Ibrahim, Suhaili; Razaob, Nor Afifi; Chai, Siaw Chui; Harun, Dzalani

2018-02-01

The Lawton Instrumental Activities of Daily Living Scale is a tool often used to assess independence among elderly at home. Its suitability to be used with the elderly population in Malaysia has not been validated. This current study aimed to assess the validity and reliability of the Lawton Instrumental Activities of Daily Living Scale - Malay Version to Malay speaking elderly in Malaysia. This study was divided into three phases: (1) translation and linguistic validity involving both forward and backward translations; (2) establishment of face validity and content validity; and (3) establishment of reliability involving inter-rater, test-retest and internal consistency analyses. Data used for these analyses were obtained by interviewing 65 elderly respondents. Percentages of Content Validity Index for 4 criteria were from 88.89 to 100.0. The Cronbach α coefficient for internal consistency was 0.838. Intra-class Correlation Coefficient of inter-rater reliability and test-retest reliability was 0.957 and 0.950 respectively. The result shows that the Lawton Instrumental Activities of Daily Living Scale - Malay Version has excellent reliability and validity for use with the Malay speaking elderly people in Malaysia. This scale could be used by professionals to assess functional ability of elderly who live independently in community. © 2018 Occupational Therapy Australia.
Reliability and validity of psychosocial and environmental correlates measures of physical activity and screen-based behaviors among Chinese children in Hong Kong

Directory of Open Access Journals (Sweden)

Salmon Jo

2011-03-01

Full Text Available Abstract Background Insufficient participation in physical activity and excessive screen time have been observed among Chinese children. The role of social and environmental factors in shaping physical activity and sedentary behaviors among Chinese children is under-investigated. The purpose of the present study was to assess the reliability and validity of a questionnaire to measure child- and parent-reported psychosocial and environmental correlates of physical activity and screen-based behaviors among Chinese children in Hong Kong. Methods A total of 303 schoolchildren aged 9-14 years and their parents volunteered to participate in this study and 160 of them completed the questionnaire twice within an interval of 10 days. Intraclass correlation coefficients (ICCs, kappa statistics, and percent agreement were performed to evaluate test-retest reliability of the continuous and categorical variables, respectively. Exploratory factor analyses (EFAs were conducted to assess convergent validity of the emergent scales. Cronbach's alpha and ICCs were performed to assess internal and test-retest reliability of the emergent scales. Criterion validity was assessed by correlating psychosocial and environmental measures with self-reported physical activity and screen-based behaviors, measured by a validated questionnaire. Results Reliability statistics for both child- and parent-reported continuous variables showed acceptable consistency for all of the ICC values greater than 0.70. Kappa statistics showed fair to perfect test-retest reliability for the categorical items. Adequate internal consistency and test-retest reliability were observed in most of the emergent scales. Criterion validity assessed by correlating psychosocial and environmental measures with child-reported physical activity found associations with physical activity in the self-efficacy scale (r = 0.25, P r = 0.25, P r = 0.14, P r = -0.22, P r = 0.12, P = 0.053. Conclusions The findings
The Perceived Efficacy and Goal Setting System (PEGS), part II: evaluation of test-retest reliability and differences between child and parental reports in the Swedish version.

Science.gov (United States)

Vroland-Nordstrand, Kristina; Krumlinde-Sundholm, Lena

2012-11-01

to evaluate the test-retest reliability of children's perceptions of their own competence in performing daily tasks and of their choice of goals for intervention using the Swedish version of the perceived efficacy and goal setting system (PEGS). A second aim was to evaluate agreement between children's and parents' perceptions of the child's competence and choices of intervention goals. Forty-four children with disabilities and their parents completed the Swedish version of the PEGS. Thirty-six of the children completed a retest session allocated into one of two groups: (A) for evaluation of perceived competence and (B) for evaluation of choice of goals. Cohen's kappa, weighted kappa and absolute agreement were calculated. Test-retest reliability for children's perceived competence showed good agreement for the dichotomized scale of competent/non-competent performance; however, using the four-point scale the agreement varied. The children's own goals were relatively stable over time; 78% had an absolute agreement ranging from 50% to 100%. There was poor agreement between the children's and their parents' ratings. Goals identified by the children differed from those identified by their parents, with 48% of the children having no goals identical to those chosen by their parents. These results indicate that the Swedish version of the PEGS produces reliable outcomes comparable to the original version.
Reliability and Validity of Athletes Disability Index Questionnaire.

Science.gov (United States)

Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin

2018-03-01

The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P visual analog scale was r = 0.626 (P disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
Validity and reliability of Verbal Online Subjective Opinion (VOSO and Modified Cooper-Harper scales in measuring of mental workload

Directory of Open Access Journals (Sweden)

Reza Charkhandaz Yeganeh

2016-12-01

Full Text Available Introduction: High mental workload is one of the important factors that results in errors in safety and occupational health scope and its measurement has high importance. So, this study aimed to determine validity and reliability of Verbal Online Subjective Opinion (VOSO and Modified Cooper-Harper (MCH scales in measuring mental workload. Methods: This study was conducted on 90 male students of Iran University of Medical Sciences. In this study, the Forward-Backward translation was used for translation of scales. Moreover, Content Validity Ratio (CVR and Content Validity Index (CVI were calculated by having suggestion of 6 Ergonomics and Occupational health experts. The Hybrid Memory Search Task software was used to create mental workload. Convergent validity of scales was calculated using correlation of scales with reaction time and then Test-Retest method was used to determine the reliability of scales. Results: Content and convergent validity of scales were confirmed and correlation of both scales with reaction time were higher than 0.8. Moreover for determination of scales reliabilities, Pearson correlation coefficient between scales values in test and retest trials were 0.86 and 0.91 for VOSO and MCH respectively. Conclusion: It seems that in regard to confirmation of validity and reliability of VOSO and MCH in this study and their high correlation with reaction time, it can use these scales in measurement of mental workload.
Reliability and validity of the Parenting Scale of Inconsistency.

Science.gov (United States)

Yoshizumi, Takahiro; Murase, Satomi; Murakami, Takashi; Takai, Jiro

2006-08-01

The purposes of the present study were to develop a Parenting Scale of Inconsistency and to evaluate its initial reliability and validity. The 12 items assess the inconsistency among parents' moods, behaviors, and attitudes toward children. In the primary study, 517 participants completed three measures: the new Parenting Scale of Inconsistency, the Parental Bonding Instrument, and the Depression Scale of the General Health Questionnaire. The Parenting Scale of Inconsistency had good test-retest reliability of .85 and internal consistency of .88 (Cronbach coefficient alpha). Construct validity was good as Inconsistency scores were significantly correlated with the Care and Overprotection scores of the Parental Bonding Instrument and with the Depression scores. Moreover, Inconsistency scores' relation with a dimension of parenting style distinct from Care and Overprotection suggested that the Parenting Scale of Inconsistency had factorial validity. This scale seems a potential measure for examining the relationships between inconsistent parenting and the mental health of children.
Reliability and preliminary evidence of validity of a Farsi version of the depression anxiety stress scales.

Science.gov (United States)

Bayani, Ali Asghar

2010-08-01

The internal consistency, test-retest reliability, and construct validity of the Farsi version of the Depression Anxiety Stress Scales were examined, with a sample of 306 undergraduate students (123 men, 183 women) ranging from 18 to 51 years of age (M age = 25.4, SD = 6.1). Participants completed the Satisfaction with Life Scale, Rosenberg Self-esteem Scale, and the Depression Anxiety Stress Scales. The findings confirmed the preliminary reliabilities and preliminary construct validity of the Farsi translation of the Depression Anxiety Stress Scales.
Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

Science.gov (United States)

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149
Sleep and sleep disturbance in children: Reliability and validity of the Dutch version of the Child Sleep Habits Questionnaire.

Science.gov (United States)

Waumans, Ruth C; Terwee, Caroline B; Van den Berg, Gerrit; Knol, Dirk L; Van Litsenburg, Raphaële R L; Gemke, Reinoud J B J

2010-06-01

The Child Sleep Habits Questionnaire (CSHQ) was developed in the US for measuring medical and behavioral sleep disorders in school-aged children. This study was conducted to assess the reliability and structural validity of the Dutch version of the CSHQ. Population-based study. Questionnaires (n = 2385) were distributed to children in primary schools and daycare centers to be completed by the parent/guardian. An identical second questionnaire was distributed for test-retest and interobserver reliability, which were assessed using intraclass correlation, and compared with published data. Internal consistency was assessed by Cronbach alpha (per subscale). Validity was analyzed by confirmatory and exploratory factor analysis. School-aged children. None. The questionnaire was returned by 1502 (63%) parents, 47% returned the questionnaire for test-retest, and 32% for interobserver reliability. Test-retest reliability was moderate to good, ranging from 0.47 to 0.93. Interobserver reliability was moderate to good, ranging from 0.53 to 0.87, with the exception of Sleep duration. Cronbach alpha ranged from 0.47 to 0.68. In confirmatory factor analysis the domain structure of the original American CSHQ could not be confirmed. Exploratory factor analysis suggested a 4-factor structure rather than the original 8 domains. The CSHQ seems to have an adequate reliability and moderate internal consistency in a Dutch population with different sociocultural characteristics than the US population in which it was devised. Factor analysis suggests that translation, cultural background, or subscales of the original instrument may affect the performance of the CSHQ.

Toward a Common Language for Measuring Patient Mobility in the Hospital: Reliability and Construct Validity of Interprofessional Mobility Measures.

Science.gov (United States)

Hoyer, Erik H; Young, Daniel L; Klein, Lisa M; Kreif, Julie; Shumock, Kara; Hiser, Stephanie; Friedman, Michael; Lavezza, Annette; Jette, Alan; Chan, Kitty S; Needham, Dale M

2018-02-01

The lack of common language among interprofessional inpatient clinical teams is an important barrier to achieving inpatient mobilization. In The Johns Hopkins Hospital, the Activity Measure for Post-Acute Care (AM-PAC) Inpatient Mobility Short Form (IMSF), also called "6-Clicks," and the Johns Hopkins Highest Level of Mobility (JH-HLM) are part of routine clinical practice. The measurement characteristics of these tools when used by both nurses and physical therapists for interprofessional communication or assessment are unknown. The purposes of this study were to evaluate the reliability and minimal detectable change of AM-PAC IMSF and JH-HLM when completed by nurses and physical therapists and to evaluate the construct validity of both measures when used by nurses. A prospective evaluation of a convenience sample was used. The test-retest reliability and the interrater reliability of AM-PAC IMSF and JH-HLM for inpatients in the neuroscience department (n = 118) of an academic medical center were evaluated. Each participant was independently scored twice by a team of 2 nurses and 1 physical therapist; a total of 4 physical therapists and 8 nurses participated in reliability testing. In a separate inpatient study protocol (n = 69), construct validity was evaluated via an assessment of convergent validity with other measures of function (grip strength, Katz Activities of Daily Living Scale, 2-minute walk test, 5-times sit-to-stand test) used by 5 nurses. The test-retest reliability values (intraclass correlation coefficients) for physical therapists and nurses were 0.91 and 0.97, respectively, for AM-PAC IMSF and 0.94 and 0.95, respectively, for JH-HLM. The interrater reliability values (intraclass correlation coefficients) between physical therapists and nurses were 0.96 for AM-PAC IMSF and 0.99 for JH-HLM. Construct validity (Spearman correlations) ranged from 0.25 between JH-HLM and right-hand grip strength to 0.80 between AM-PAC IMSF and the Katz Activities of
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah’s Dental Anxiety Scale

Directory of Open Access Journals (Sweden)

Meena Jain

2018-04-01

Full Text Available Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah’s Dental Anxiety Scale (CDAS in Hindi speaking Indian adults.Methods: A total of 348 subjects from the outpatient department of a dental hospital in Indiaparticipated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS was used to measure the convergent validity.Reliability was assessed through calculations of Cronbach’s alpha and intra class correlation 48 forms were completed for test-retest.Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO test value was 0.776. After factor analysis, a single factor (dental anxiety was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach’s alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554].Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study.
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah’s Dental Anxiety Scale

Science.gov (United States)

Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha

2018-01-01

Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah’s Dental Anxiety Scale (CDAS) in Hindi speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach’s alpha and intra class correlation 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach’s alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study. PMID:29744307
Adaptation and Assessment of Reliability and Validity of the Greek Version of the Ohkuma Questionnaire for Dysphagia Screening

Science.gov (United States)

Papadopoulou, Soultana L.; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam

2016-01-01

Introduction The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and One-Way Anova test. Results According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The One-Way Anova test in the two measurement times showed statistically significant correlation in both measurements (p = 0.02 and p = 0.016). Conclusion The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in the Greek-speaking patients. PMID:28050209
Test–retest reliability and validity of a web-based food-frequency questionnaire for adolescents aged 13–14 to be used in the Norwegian Mother and Child Cohort Study (MoBa)

Science.gov (United States)

Øverby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha

2014-01-01

Background The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. Objective The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. Design In total, 58 students (aged 13–14) from four different schools in the southern part of Norway participated in the reproducibility study of filling out the FFQ 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. Results The median Spearman correlation coefficient for all nutrients in the test–retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlations coefficients were low in the absolute validity study with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). Conclusion The test–retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents. PMID:25371661
Test–retest reliability and validity of a web-based food-frequency questionnaire for adolescents aged 13–14 to be used in the Norwegian Mother and Child Cohort Study (MoBa

Directory of Open Access Journals (Sweden)

Nina Cecilie Øverby

2014-10-01

Full Text Available Background: The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. Objective: The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ for use among adolescents. Design: In total, 58 students (aged 13–14 from four different schools in the southern part of Norway participated in the reproducibility study of filling out the FFQ 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. Results: The median Spearman correlation coefficient for all nutrients in the test–retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlations coefficients were low in the absolute validity study with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000. Conclusion: The test–retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents.
The Validity and Reliability of Autism Behavior Checklist

Directory of Open Access Journals (Sweden)

Negin Yousefi

2015-11-01

Full Text Available Objectives: The aim of this study was to evaluate the psychometric features of the Persian version of the Autism Behavior Checklist (ABC. Method:The International Quality of Life Assessment (IQOLA approach was used to translate the English ABC into Persian. A total sample of 184 parents of children including 114 children with autism disorder (mean age =7.21, SD =1.65 and 70 typically developing children (mean age = 6.82, SD =1.75 completed the ABC. Internal consistency, test-retest reliability, concurrent and discriminant validity, and cut-off score were assessed. Results: The results of this study revealed that the Persian version of the ABC has an acceptable degree of internal consistency (.73. Test–retest comparisons using interclass correlation confirmed the instrument’s time stability (.83. The instrument’s concurrent validity with Gilliam Autism Rating Scale (GARS was verified; the correlation between total scores was .94. In the discriminant validity, the autism group had significantly higher scores compared to the normal group. Receiver Operating Characteristic (ROC analysis revealed that individuals with total scores below 25 are less likely to be in the autism group. Conclusion:The Persian version of the ABC can be used as an initial screening tool in clinical contexts.
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

Science.gov (United States)

Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

2015-12-01

To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
Reliability of attitude and knowledge items and behavioral consistency in the validated sun exposure questionnaire in a Danish population based sample

DEFF Research Database (Denmark)

Køster, Brian; Søndergaard, Jens; Nielsen, Jesper Bo

2018-01-01

in protection behavior was low. To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability......An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high. The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed...... questionnaire for monitoring and evaluating population sun-related behavior. Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly...
The Vocal Tract Discomfort Scale: Validity and Reliability of the Persian Version in the Assessment of Patients With Muscle Tension Dysphonia.

Science.gov (United States)

Torabi, Hadi; Khoddami, Seyyedeh Maryam; Ansari, Noureddin Nakhostin; Dabirmoghaddam, Payman

2016-11-01

To cross-culturally adapt of Persian Vocal Tract Discomfort (VTDp) scale and evaluate its validity and reliability in the assessment of patients with muscle tension dysphonia (MTD). A cross-sectional and prospective cohort design was used to psychometrically test the VTDp. The VTD scale was cross-culturally adapted into Persian language following standard forward-backward translations. The VTDp scale was administrated to 100 patients with MTD (54 men and 46 women; mean age: 38.05 ± 10.02 years) and 50 healthy volunteers (26 men and 24 women; mean age: 36.50 ± 12.27 years). Forty-five patients with MTD completed the VTDp 7 days later for test-retest reliability. Patients also completed the Persian Voice Handicap Index (VHIp) to assess construct validity. The results of discriminative validity demonstrated that the VTDp was able to discriminate between patients with MTD and healthy participants. The internal consistency was confirmed with Cronbach α .77 and 0.73 for VTDp frequency and severity subscales, respectively. The test-retest reliability was excellent with an intraclass correlation coefficient (ICC agreement ) of 0.93 for the frequency subscale and 0.91 for the severity subscale. Construct validity of the VTDp was shown with significant correlations between the VTDp frequency and severity subscales and the VHIp total scores (0.36 and 0.37, respectively). The standard error of measurement and smallest detectable change values for VTDp frequency (2.11 and 5.85, respectively) and severity (2.25 and 6.23, respectively) were acceptable. The Bland-Altman analysis for assessing the agreement between test and retest measurements showed no systematic bias. The VTDp is a valid and reliable self-administered scale to measure patient's vocal tract sensations in Persian-speaking population. Copyright Â© 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Validation and reliability of the Baecke questionnaire for the evaluation of habitual physical activity in adult men

Directory of Open Access Journals (Sweden)

Alex Antonio Florindo

2003-06-01

Full Text Available The aim of this study was to verify validity and reliability of the scores for physical exercise in leisure (PEL, leisure and locomotion activities (LLA, and total score (TS of the Baecke habitual physical activity questionnaire in adult males. Twenty-one students of Physical Education were evaluated. For validation, the maximum oxygen uptake (O2max and the decrease of the heart rate in percentile (%DHR were measured through the Cooper's 12-minute walk or run test, and an annual index of physical exercise (IPE, and a week index of locomotion activities (ILA. The reliability was verified through test-retest with interval of 45 days. The Pearson correlation coefficient, and partial correlation adjusted for age and body mass index were used for validation. The intraclass correlation and paired t-test were used for reliability. The results indicated that %DHR was correlated with LLA and TS (r = 0.47 and p = 0.030; r = 0.48 and p = 0.027, respectively. IPE was correlated with PEL and TS (r = 0.56 and p = 0.008; r = 0.46 and p = 0.036, respectively. ILA was correlated with LLA and TS (r = 0.64 and p = 0.002 and r = 0.51 and p = 0.017, respectively. There was no significant difference in PEL, LLA and TS means in test-retest. The intraclass correlations were r = 0.69; r = 0.80 and r = 0.77, respectively for PEL, LLA and TS. In conclusion, the Baecke questionnaire is valid and reliable to measure habitual physical activity in Brazilian adult men.
Validity and Reliability of a Medicine Ball Explosive Power Test.

Science.gov (United States)

Stockbrugger, Barry A.; Haennel, Robert G.

2001-01-01

Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…
Cross-Cultural Adaptation, Validity, and Reliability of the Persian Version of the Orebro Musculoskeletal Pain Screening Questionnaire.

Science.gov (United States)

Shafeei, Asrin; Mokhtarinia, Hamid Reza; Maleki-Ghahfarokhi, Azam; Piri, Leila

2017-08-01

Observational study. To cross-culturally translate the Orebro Musculoskeletal Pain Screening Questionnaire (OMPQ) into Persian and then evaluate its psychometric properties (reliability, validity, ceiling, and flooring effects). To the authors' knowledge, prior to this study there has been no validated instrument to screen the risk of chronicity in Persian-speaking patients with low back pain (LBP) in Iran. The OMPQ was specifically developed as a self-administered screening tool for assessing the risk of LBP chronicity. The forward-backward translation method was used for the translation and cross-cultural adaptation of the original questionnaire. In total, 202 patients with subacute LBP completed the OMPQ and the pain disability questionnaire (PDQ), which was used to assess convergent validity. 62 patients completed the OMPQ a week later as a retest. Slight changes were made to the OMPQ during the translation/cultural adaptation process; face validity of the Persian version was obtained. The Persian OMPQ showed excellent test-retest reliability (intraclass correlation coefficient=0.89). Its internal consistency was 0.71, and its convergent validity was confirmed by good correlation coefficient between the OMPQ and PDQ total scores ( r =0.72, p validity, construct validity, reliability, and consistency. It is therefore considered a useful instrument for screening Iranian patients with LBP.
Validity and reliability of English and Marathi Oswestry Disability Index (version 2.1a) in Indian population.

Science.gov (United States)

Joshi, Veena D; Raiturker, Pradyumna P Pai; Kulkarni, Aditi A

2013-05-15

A total of 200 patients with low back pain (LBP) completed an English and Marathi Oswestry Disability Index (ODI) questionnaires (100 each), visual analogue scale, and Roland-Morris Disability Questionnaire. To validate the English and Marathi versions of ODI (version 2.1a). Patient-orientated assessment methods are important in the evaluation of treatment outcome. The ODI is one of the condition-specific questionnaires recommended for the use of patients with LBP. An adaptation of the ODI (version 2.1a) for Marathi language was carried out according to established guidelines. Average age of patients who answered the English ODI was 42 ± 15, whereas that of Marathi-speaking patients was 52 ± 15 years. About 40% were males. The Cronbach α reliability score was 0.877 for English and 0.943 for Marathi. Forty-seven and 53 of these patients were retested with English and Marathi ODI within 2 weeks (to assess test-retest reliability). The intraclass correlation coefficient (ICC) for the test-retest reliability of the questionnaire was 0.877 and 0.943 for English and Marathi respectively. The ODI scores correlated with visual analogue scale pain intensity (r = 0.67, P Disability Questionnaire score (r = 0.71, P Disability Questionnaire scores (r = 0.503, P Oswestry questionnaire is reliable and valid, and shows psychometric characteristics as good as the English version. It should represent a valuable tool for use in future patient-orientated outcome studies for population with LBP in India.
Reliability and validity of the Turkish version of ABILHAND-Kids' questionnaire in a group of patients with neuromuscular disorders.

Science.gov (United States)

Öksüz, Çigdem; Alemdaroglu, Ipek; Kilinç, Muhammed; Abaoğlu, Hatice; Demirci, Cevher; Karahan, Sevilay; Yilmaz, Oznur; Yildirim, Sibel Aksu

2017-10-01

This study was performed to examine the reliability and validity of the Turkish version of ABILHAND-Kids questionnaire which assesses manual functions of children with neuromuscular diseases (NMDs). A cross sectional survey study design and Rasch analysis were used to assess the reliability and validity of the Turkish version of scale. Ninety-three children with different neuromuscular disorders and their parents were included in the study. The scale was applied to the parents with face-to-face interview twice; on their first visit and after an interval of 15 days. The test-retest reliability was assessed with intraclass correlation coefficient (ICC), and internal consistency of the multi-item subscales by calculating Cronbach alpha values. Brooke Upper Extremity Functional Classification (BUEFC) and Wee-Functional Independency Measurement (Wee-FIM) were correlated to determine the construct validity. The ICC value for the test/retest reliability was 0.94. The internal consistency was 0.81. Floor (1.1%) and ceiling (11.8%) effects were not significant. There were moderate correlations between the Turkish version of ABILHAND-Kids and Wee-FIM (0.67) and BUEFC (-0.37). Rasch analysis indicated good item ﬁt, unidimensionality, and model ﬁt. The Turkish version of ABILHAND-Kids questionnaire was found to be a reliable and valid scale for the assessment of the manual ability of children with NMDs.
Test rig overview for validation and reliability testing of shutdown system software

International Nuclear Information System (INIS)

Zhao, M.; McDonald, A.; Dick, P.

2007-01-01

The test rig for Validation and Reliability Testing of shutdown system software has been upgraded from the AECL Windows-based test rig previously used for CANDU6 stations. It includes a Virtual Trip Computer, which is a software simulation of the functional specification of the trip computer, and a real-time trip computer simulator in a separate chassis, which is used during the preparation of trip computer test cases before the actual trip computers are available. This allows preparation work for Validation and Reliability Testing to be performed in advance of delivery of actual trip computers to maintain a project schedule. (author)
Validity and Reliability of Field-Based Measures for Assessing Movement Skill Competency in Lifelong Physical Activities: A Systematic Review.

Science.gov (United States)

Hulteen, Ryan M; Lander, Natalie J; Morgan, Philip J; Barnett, Lisa M; Robertson, Samuel J; Lubans, David R

2015-10-01

It has been suggested that young people should develop competence in a variety of 'lifelong physical activities' to ensure that they can be active across the lifespan. The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71% reporting inter-rater reliability and 41% reporting intra-rater reliability. Only four studies in this review reported test-retest reliability. Ten studies reported validity results; content validity was cited in 41% of these studies. Construct validity was reported in 24% of studies, while criterion validity was only reported in 12% of studies. Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review
Validity and Reliability of a Portable Balance Tracking System, BTrackS, in Older Adults.

Science.gov (United States)

Levy, Susan S; Thralls, Katie J; Kviatkovsky, Shiloah A

Falls are the leading cause of disability, injury, hospital admission, and injury-related death among older adults. Balance limitations have consistently been identified as predictors of falls and increased fall risk. Field measures of balance are limited by issues of subjectivity, ceiling effects, and low sensitivity to change. The gold standard for measuring balance is the force plate; however, its field use is untenable due to high cost and lack of portability. Thus, a critical need is observed for valid objective field measures of balance to accurately assess balance and identify limitations over time. The purpose of this study was to examine the concurrent validity and 3-day test-retest reliability of Balance Tracking System (BTrackS) in community-dwelling older adults. Minimal detectable change values were also calculated to reflect changes in balance beyond measurement error. Postural sway data were collected from community-dwelling older adults (N = 49, mean [SD] age = 71.3 [7.3] years) with a force plate and BTrackS in multitrial eyes open (EO) and eyes closed (EC) static balance conditions. Force sensors transmitted BTrackS data via a USB to a computer running custom software. Three approaches to concurrent validity were taken including calculation of Pearson product moment correlation coefficients, repeated-measures ANOVAs, and Bland-Altman plots. Three-day test-retest reliability of BTrackS was examined in a second sample of 47 community-dwelling older adults (mean [SD] age = 75.8 [7.7] years) using intraclass correlation coefficients and MDC values at 95% CI (MDC95) were calculated. BTrackS demonstrated good validity using Pearson product moment correlations (r > 0.90). Repeated-measures ANOVA and Bland-Altman plots indicated some BTrackS bias with center of pressure (COP) values higher than FP COP values in the EO (mean [SD] bias = 4.0 [6.8]) and EC (mean [SD] bias = 9.6 [12.3]) conditions. Test-retest reliability using intraclass correlation
Reliability and validity of selected measures associated with increased fall risk in females over the age of 45 years with distal radius fracture - A pilot study.

Science.gov (United States)

Mehta, Saurabh P; MacDermid, Joy C; Richardson, Julie; MacIntyre, Norma J; Grewal, Ruby

2015-01-01

Clinical measurement. This study examined test-retest reliability and convergent/divergent construct validity of selected tests and measures that assess balance impairment, fear of falling (FOF), impaired physical activity (PA), and lower extremity muscle strength (LEMS) in females >45 years of age after the distal radius fracture (DRF) population. Twenty one female participants with DRF were assessed on two occasions. Timed Up and Go, Functional Reach, and One Leg Standing tests assessed balance impairment. Shortened Falls Efficacy Scale, Activity-specific Balance Confidence scale, and Fall Risk Perception Questionnaire assessed FOF. International Physical Activity Questionnaire and Rapid Assessment of Physical Activity were administered to assess PA level. Chair stand test and isometric muscle strength testing for hip and knee assessed LEMS. Intraclass correlation coefficients (ICC) examined the test-retest reliability of the measures. Pearson correlation coefficients (r) examined concurrent relationships between the measures. The results demonstrated fair to excellent test-retest reliability (ICC between 0.50 and 0.96) and low to moderate concordance between the measures (low if r ≤ 0.4; moderate if r = 0.4-0.7). The results provide preliminary estimates of test-retest reliability and convergent/divergent construct validity of selected measures associated with increased risk for falling in the females >45 years of age after DRF. Further research directions to advance knowledge regarding fall risk assessment in DRF population have been identified. Copyright © 2015 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Reliability and validity of the multimedia activity recall in children and adults (MARCA) in people with chronic obstructive pulmonary disease.

Science.gov (United States)

Hunt, Toby; Williams, Marie T; Olds, Tim S

2013-01-01

To determine the reliability and validity of the Multimedia Activity Recall for Children and Adults (MARCA) in people with chronic obstructive pulmonary disease (COPD). People with COPD and their carers completed the Multimedia Activity Recall for Children and Adults (MARCA) for four, 24-hour periods (including test-retest of 2 days) while wearing a triaxial accelerometer (Actigraph GT3X+®), a multi-sensor armband (Sensewear Pro3®) and a pedometer (New Lifestyles 1000®). Self reported activity recalls (MARCA) and objective activity monitoring (Accelerometry) were recorded under free-living conditions. 24 couples were included in the analysis (COPD; age 74.4 ± 7.9 yrs, FEV1 54 ± 13% Carer; age 69.6 ± 10.9 yrs, FEV1 99 ± 24%). Not applicable. Test-retest reliability was compared for MARCA activity domains and different energy expenditure zones. Validity was assessed between MARCA-derived physical activity level (in metabolic equivalent of task (MET) per minute), duration of moderate to vigorous physical activity (min) and related data from the objective measurement devices. Analysis included intra-class correlation coefficients (ICC), Bland-Altman analyses, paired t-tests (p) and Spearman's rank correlation coefficients (rs). Reliability between occasions of recall for all activity domains was uniformly high, with test-retest correlations consistently >0.9. Validity correlations were moderate to strong (rs = 0.43-0.80) across all comparisons. The MARCA yields comparable PAL estimates and slightly higher moderate to vigorous physical activity (MVPA) estimates. In older adults with chronic illness, the MARCA is a valid and reliable tool for capturing not only the time and energy expenditure associated with physical and sedentary activities but also information on the types of activities.

Reliability and Validity of a New Method for Isometric Back Extensor Strength Evaluation Using A Hand-Held Dynamometer.

Science.gov (United States)

Park, Hee-Won; Baek, Sora; Kim, Hong Young; Park, Jung-Gyoo; Kang, Eun Kyoung

2017-10-01

To investigate the reliability and validity of a new method for isometric back extensor strength measurement using a portable dynamometer. A chair equipped with a small portable dynamometer was designed (Power Track II Commander Muscle Tester). A total of 15 men (mean age, 34.8±7.5 years) and 15 women (mean age, 33.1±5.5 years) with no current back problems or previous history of back surgery were recruited. Subjects were asked to push the back of the chair while seated, and their isometric back extensor strength was measured by the portable dynamometer. Test-retest reliability was assessed with intraclass correlation coefficient (ICC). For the validity assessment, isometric back extensor strength of all subjects was measured by a widely used physical performance evaluation instrument, BTE PrimusRS system. The limit of agreement (LoA) from the Bland-Altman plot was evaluated between two methods. The test-retest reliability was excellent (ICC=0.82; 95% confidence interval, 0.65-0.91). The Bland-Altman plots demonstrated acceptable agreement between the two methods: the lower 95% LoA was -63.1 N and the upper 95% LoA was 61.1 N. This study shows that isometric back extensor strength measurement using a portable dynamometer has good reliability and validity.
Reliability and validity of the Bowel Function Index for evaluating opioid-induced constipation: translation, cultural adaptation and validation of the Portuguese version (BFI-P).

Science.gov (United States)

Dueñas, María; Mendonça, Liliane; Sampaio, Rute; Gouvinhas, Cláudia; Oliveira, Daniela; Castro-Lopes, José Manuel; Azevedo, Luís Filipe

2017-03-01

The Bowel Function Index (BFI) is a simple and sound bowel function and opioid-induced constipation (OIC) screening tool. We aimed to develop the translation and cultural adaptation of this measure (BFI-P) and to assess its reliability and validity for the Portuguese language and a chronic pain population. The BFI-P was created after a process including translation, back translation and cultural adaptation. Participants (n = 226) were recruited in a chronic pain clinic and were assessed at baseline and after one week. Internal consistency, test-retest reliability, responsiveness, construct (convergent and known groups) and factorial validity were assessed. Test-retest reliability had an intra-class correlation of 0.605 for BFI mean score. Internal consistency of BFI had Cronbach's alpha of 0.865. The construct validity of BFI-P was shown to be excellent and the exploratory factor analysis confirmed its unidimensional structure. The responsiveness of BFI-P was excellent, with a suggested 17-19 point and 8-12 point change in score constituting a clinically relevant change in constipation for patients with and without previous constipation, respectively. This study had some limitations, namely, the criterion validity of BFI-P was not directly assessed; and the absence of a direct criterion for OIC precluded the assessment of the criterion based responsiveness of BFI-P. Nevertheless, BFI may importantly contribute to better OIC screening and its Portuguese version (BFI-P) has been shown to have excellent reliability, internal consistency, validity and responsiveness. Further suggestions regarding statistically and clinically important change cut-offs for this instrument are presented.
Test-Retest Reliability of Isokinetic Knee Strength Measurements in Children Aged 8 to 10 Years.

Science.gov (United States)

Fagher, Kristina; Fritzson, Annelie; Drake, Anna Maria

Isokinetic dynamometry is a useful tool to objectively assess muscle strength of children and adults in athletic and rehabilitative settings. This study examined test-retest reliability of isokinetic knee strength measurements in children aged 8 to 10 years and defined limits for the minimum difference (MD) in strength that indicates a clinically important change. Isokinetic knee strength measurements (using the Biodex System 4) in children will provide reliable results. Descriptive laboratory study. In 22 healthy children, 5 maximal concentric (CON) knee extensor (KE) and knee flexor (KF) contractions at 2 angular velocities (60 deg/s and 180 deg/s) and 5 maximal eccentric (ECC) KE/KF contractions at 60 deg/s were assessed 7 days apart. The intraclass correlation coefficient (ICC 2.1 ) was used to examine relative reliability, and the MD was calculated on the basis of standard error of measurement. ICCs for CON KE/KF peak torque measurements were fair to excellent (range, 0.49-0.81). The MD% values for CON KE and KF ranged from 31% to 37% at 60 deg/s and from 34% to 39% at 180 deg/s. ICCs in the ECC mode were good (range, 0.60-0.70), but associated MD% values were high (>50%). There was no systematic error for CON KE/KF and ECC KE strength measurements at 60 deg/s, but systematic error was found for all other measurements. The dynamometer provides a reliable analysis of isokinetic CON knee strength measurements at 60 deg/s in children aged 8 to 10 years. Measurements at 180 deg/s and in the ECC mode were not reliable, indicating a need for more familiarization prior to testing. The MD values may help clinicians to determine whether a change in knee strength is due to error or intervention.
Reliability, validity, and sensitivity to change of the lower extremity functional scale in individuals affected by stroke.

Science.gov (United States)

Verheijde, Joseph L; White, Fred; Tompkins, James; Dahl, Peder; Hentz, Joseph G; Lebec, Michael T; Cornwall, Mark

2013-12-01

To investigate reliability, validity, and sensitivity to change of the Lower Extremity Functional Scale (LEFS) in individuals affected by stroke. The secondary objective was to test the validity and sensitivity of a single-item linear analog scale (LAS) of function. Prospective cohort reliability and validation study. A single rehabilitation department in an academic medical center. Forty-three individuals receiving neurorehabilitation for lower extremity dysfunction after stroke were studied. Their ages ranged from 32 to 95 years, with a mean of 70 years; 77% were men. Test-retest reliability was assessed by calculating the classical intraclass correlation coefficient, and the Bland-Altman limits of agreement. Validity was assessed by calculating the Pearson correlation coefficient between the instruments. Sensitivity to change was assessed by comparing baseline scores with end of treatment scores. Measurements were taken at baseline, after 1-3 days, and at 4 and 8 weeks. The LEFS, Short-Form-36 Physical Function Scale, Berg Balance Scale, Six-Minute Walk Test, Five-Meter Walk Test, Timed Up-and-Go test, and the LAS of function were used. The test-retest reliability of the LEFS was found to be excellent (ICC = 0.96). Correlated with the 6 other measures of function studied, the validity of the LEFS was found to be moderate to high (r = 0.40-0.71). Regarding the sensitivity to change, the mean LEFS scores from baseline to study end increased 1.2 SD and for LAS 1.1 SD. LEFS exhibits good reliability, validity, and sensitivity to change in patients with lower extremity impairments secondary to stroke. Therefore, the LEFS can be a clinically efficient outcome measure in the rehabilitation of patients with subacute stroke. The LAS is shown to be a time-saving and reasonable option to track changes in a patient's functional status. Copyright © 2013 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
An Integrated Approach to Establish Validity and Reliability of Reading Tests

Science.gov (United States)

Razi, Salim

2012-01-01

This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Validity and Reliability of the Korean Version of the Utrecht Scale for Evaluation of Rehabilitation-Participation

Directory of Open Access Journals (Sweden)

Joo-Hyun Lee

2017-01-01

Full Text Available This study investigated the reliability and validity of the Korean version of the Utrecht Scale for Evaluation of Rehabilitation-Participation (K-USER-P in patients with stroke. Stroke patients participated in this study. The Utrecht Scale for Evaluation of Rehabilitation-Participation was translated from English into Korean. A total of 120 questionnaires involving the K-USER-P were distributed to rehabilitation hospitals and centers by mail. Of those, 100 questionnaires were returned and 67 were included in the final analysis after exclusion of questionnaires with insufficient responses. We analyzed the questionnaires for internal consistency, test-retest reliability, and construct validity. The results indicated that internal consistency coefficients of the frequency, restriction, and satisfaction domains were 0.69, 0.66, and 0.67, respectively. Test-retest reliability was 0.63, 0.45, and 0.71 for the three domains, respectively. Intercorrelations between the SF-12 and the London Handicap Scale were generally moderate to good. The Korean version of the Utrecht Scale for Evaluation of Rehabilitation-Participation can be used as a measure of the participation level of stroke patients in clinical practice and the local community.
Test-retest reliability of evoked BOLD signals from a cognitive-emotive fMRI test battery.

Science.gov (United States)

Plichta, Michael M; Schwarz, Adam J; Grimm, Oliver; Morgen, Katrin; Mier, Daniela; Haddad, Leila; Gerdes, Antje B M; Sauer, Carina; Tost, Heike; Esslinger, Christine; Colman, Peter; Wilson, Frederick; Kirsch, Peter; Meyer-Lindenberg, Andreas

2012-04-15

Even more than in cognitive research applications, moving fMRI to the clinic and the drug development process requires the generation of stable and reliable signal changes. The performance characteristics of the fMRI paradigm constrain experimental power and may require different study designs (e.g., crossover vs. parallel groups), yet fMRI reliability characteristics can be strongly dependent on the nature of the fMRI task. The present study investigated both within-subject and group-level reliability of a combined three-task fMRI battery targeting three systems of wide applicability in clinical and cognitive neuroscience: an emotional (face matching), a motivational (monetary reward anticipation) and a cognitive (n-back working memory) task. A group of 25 young, healthy volunteers were scanned twice on a 3T MRI scanner with a mean test-retest interval of 14.6 days. FMRI reliability was quantified using the intraclass correlation coefficient (ICC) applied at three different levels ranging from a global to a localized and fine spatial scale: (1) reliability of group-level activation maps over the whole brain and within targeted regions of interest (ROIs); (2) within-subject reliability of ROI-mean amplitudes and (3) within-subject reliability of individual voxels in the target ROIs. Results showed robust evoked activation of all three tasks in their respective target regions (emotional task=amygdala; motivational task=ventral striatum; cognitive task=right dorsolateral prefrontal cortex and parietal cortices) with high effect sizes (ES) of ROI-mean summary values (ES=1.11-1.44 for the faces task, 0.96-1.43 for the reward task, 0.83-2.58 for the n-back task). Reliability of group level activation was excellent for all three tasks with ICCs of 0.89-0.98 at the whole brain level and 0.66-0.97 within target ROIs. Within-subject reliability of ROI-mean amplitudes across sessions was fair to good for the reward task (ICCs=0.56-0.62) and, dependent on the particular ROI
Development of the Modified Four Square Step Test and its reliability and validity in people with stroke.

Science.gov (United States)

Roos, Margaret A; Reisman, Darcy S; Hicks, Gregory; Rose, William; Rudolph, Katherine S

2016-01-01

Adults with stroke have difficulty avoiding obstacles when walking, especially when a time constraint is imposed. The Four Square Step Test (FSST) evaluates dynamic balance by requiring individuals to step over canes in multiple directions while being timed, but many people with stroke are unable to complete it. The purposes of this study were to (1) modify the FSST by replacing the canes with tape so that more persons with stroke could successfully complete the test and (2) examine the reliability and validity of the modified version. Fifty-five subjects completed the Modified FSST (mFSST) by stepping over tape in all four directions while being timed. The mFSST resulted in significantly greater numbers of subjects completing the test than the FSST (39/55 [71%] and 33/55 [60%], respectively) (p < 0.04). The test-retest, intrarater, and interrater reliability of the mFSST were excellent (intraclass correlation coefficient ranges: 0.81-0.99). Construct and concurrent validity of the mFSST were also established. The minimal detectable change was 6.73 s. The mFSST, an ideal measure of dynamic balance, can identify progress in people with stroke in varied settings and can be completed by a wide range of people with stroke in approximately 5 min with the use of minimal equipment (tape, stop watch).
Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system

Energy Technology Data Exchange (ETDEWEB)

Thomas, Marianna S.; Newman, David; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N. [Norfolk and Norwich University Hospital, Department of Radiology, Norwich (United Kingdom); Leinhard, Olof Dahlqvist [Linkoeping University, Center for Medical Image Science and Visualization, Linkoeping (Sweden); Linkoeping University, Department of Medical and Health Sciences, Linkoeping (Sweden); Karlsson, Anette; Borga, Magnus [Linkoeping University, Center for Medical Image Science and Visualization, Linkoeping (Sweden); Linkoeping University, Department of Biomedical Engineering, Linkoeping (Sweden); Rosander, Johannes [Advanced MR Analytics AB, Linkoeping (Sweden); Toms, Andoni P. [Norfolk and Norwich University Hospital, Department of Radiology, Norwich (United Kingdom); Radiology Academy, Cotman Centre, Norwich, Norfolk (United Kingdom)

2014-09-15

To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19.32 L (SD9.1) and 19.28 L (SD9.12) for first and second acquisitions (Intraclass correlation coefficient (ICC) = 1.0, 95 % level of agreement -0.32-0.2 L). ICC for all automated test-retest muscle volumes were almost perfect (0.99-1.0) with 95 % levels of agreement 1.8-6.6 % of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1.68 L (2SD0.6) compared to automated 1.64 L (2SD 0.6), left lower leg: manual 1.69 L (2SD 0.64) compared to automated 1.63 L (SD0.61), correlation coefficients for automated and manual segmentation were 0.94-0.96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. (orig.)
Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system

International Nuclear Information System (INIS)

Thomas, Marianna S.; Newman, David; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N.; Leinhard, Olof Dahlqvist; Karlsson, Anette; Borga, Magnus; Rosander, Johannes; Toms, Andoni P.

2014-01-01

To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19.32 L (SD9.1) and 19.28 L (SD9.12) for first and second acquisitions (Intraclass correlation coefficient (ICC) = 1.0, 95 % level of agreement -0.32-0.2 L). ICC for all automated test-retest muscle volumes were almost perfect (0.99-1.0) with 95 % levels of agreement 1.8-6.6 % of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1.68 L (2SD0.6) compared to automated 1.64 L (2SD 0.6), left lower leg: manual 1.69 L (2SD 0.64) compared to automated 1.63 L (SD0.61), correlation coefficients for automated and manual segmentation were 0.94-0.96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. (orig.)
The reliability, validity, and applicability of an English language version of the Mini-ICF-APP.

Science.gov (United States)

Molodynski, Andrew; Linden, Michael; Juckel, George; Yeeles, Ksenija; Anderson, Catriona; Vazquez-Montes, Maria; Burns, Tom

2013-08-01

This study aimed at establishing the validity and reliability of an English language version of the Mini-ICF-APP. One hundred and five patients under the care of secondary mental health care services were assessed using the Mini-ICF-APP and several well-established measures of functioning and symptom severity. 47 (45 %) patients were interviewed on two occasions to ascertain test-retest reliability and 50 (48 %) were interviewed by two researchers simultaneously to determine the instrument's inter-rater reliability. Occupational and sick leave status were also recorded to assess construct validity. The Mini-ICF-APP was found to have substantial internal consistency (Chronbach's α 0.869-0.912) and all 13 items correlated highly with the total score. Analysis also showed that the Mini-ICF-APP had good test-retest (ICC 0.832) and inter-rater (ICC 0.886) reliability. No statistically significant association with length of sick leave was found, but the unemployed scored higher on the Mini ICF-APP than those in employment (mean 18.4, SD 9.1 vs. 9.4, SD 6.4, p Mini-ICF-APP correlated highly with the other measures of illness severity and functioning considered in the study. The English version of the Mini-ICF-APP is a reliable and valid measure of disorders of capacity as defined by the International Classification of Functioning. Further work is necessary to establish whether the scale could be divided into sub scales which would allow the instrument to more sensitively measure an individual's specific impairments.
Validity and reliability of the Myotest accelerometric system for the assessment of vertical jump height.

Science.gov (United States)

Casartelli, Nicola; Müller, Roland; Maffiuletti, Nicola A

2010-11-01

The aim of the present study was to verify the validity and reliability of the Myotest accelerometric system (Myotest SA, Sion, Switzerland) for the assessment of vertical jump height. Forty-four male basketball players (age range: 9-25 years) performed series of squat, countermovement and repeated jumps during 2 identical test sessions separated by 2-15 days. Flight height was simultaneously quantified with the Myotest system and validated photoelectric cells (Optojump). Two calculation methods were used to estimate the jump height from Myotest recordings: flight time (Myotest-T) and vertical takeoff velocity (Myotest-V). Concurrent validity was investigated comparing Myotest-T and Myotest-V to the criterion method (Optojump), and test-retest reliability was also examined. As regards validity, Myotest-T overestimated jumping height compared to Optojump (p 0.98), that is, excellent validity. Myotest-V overestimated jumping height compared to Optojump (p 12 cm), high limits of agreement ratios (>36%), and low ICCs (9 cm). In conclusion, Myotest-T is a valid and reliable method for the assessment of vertical jump height, and its use is legitimate for field-based evaluations, whereas Myotest-V is neither valid nor reliable.
Construct Validity and Reliability of the Questionnaire on the Quality of Physician-Patient Interaction in Adults With Hypertension.

Science.gov (United States)

Hickman, Ronald L; Clochesy, John M; Hetland, Breanna; Alaamri, Marym

2017-04-01

There are limited reliable and valid measures of the patient- provider interaction among adults with hypertension. Therefore, the purpose of this report is to describe the construct validity and reliability of the Questionnaire on the Quality of Physician-Patient Interaction (QQPPI), in community-dwelling adults with hypertension. A convenience sample of 109 participants with hypertension was recruited and administered the QQPPI at baseline and 8 weeks later. The exploratory factor analysis established a 12-item, 2-factor structure for the QQPPI was valid in this sample. The modified QQPPI proved to have sufficient internal consistency and test- retest reliability. The modified QQPPI is a valid and reliable measure of the provider-patient interaction, a construct posited to impact self-management, in adults with hypertension.
Validity and Reliability of Baseline Testing in a Standardized Environment.

Science.gov (United States)

Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur

2017-08-11

The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred-sixty one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

Science.gov (United States)

Bhat, Mehraj A.

2014-01-01

This paper is based on the construction and evaluation of reliability and validity of reasoning ability test at secondary school students. In this paper an attempt was made to evaluate validity, reliability and to determine the appropriate standards to interpret the results of reasoning ability test. The test includes 45 items to measure six types…
Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

Science.gov (United States)

Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

2018-01-01

The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.
The Communication Function Classification System: cultural adaptation, validity, and reliability of the Farsi version for patients with cerebral palsy.

Science.gov (United States)

Soleymani, Zahra; Joveini, Ghodsiye; Baghestani, Ahmad Reza

2015-03-01

This study developed a Farsi language Communication Function Classification System and then tested its reliability and validity. Communication Function Classification System is designed to classify the communication functions of individuals with cerebral palsy. Up until now, there has been no instrument for assessment of this communication function in Iran. The English Communication Function Classification System was translated into Farsi and cross-culturally modified by a panel of experts. Professionals and parents then assessed the content validity of the modified version. A backtranslation of the Farsi version was confirmed by the developer of the English Communication Function Classification System. Face validity was assessed by therapists and parents of 10 patients. The Farsi Communication Function Classification System was administered to 152 individuals with cerebral palsy (age, 2 to 18 years; median age, 10 years; mean age, 9.9 years; standard deviation, 4.3 years). Inter-rater reliability was analyzed between parents, occupational therapists, and speech and language pathologists. The test-retest reliability was assessed for 75 patients with a 14 day interval between tests. The inter-rater reliability of the Communication Function Classification System was 0.81 between speech and language pathologists and occupational therapists, 0.74 between parents and occupational therapists, and 0.88 between parents and speech and language pathologists. The test-retest reliability was 0.96 for occupational therapists, 0.98 for speech and language pathologists, and 0.94 for parents. The findings suggest that the Farsi version of Communication Function Classification System is a reliable and valid measure that can be used in clinical settings to assess communication function in patients with cerebral palsy. Copyright © 2015 Elsevier Inc. All rights reserved.
Reliability and validity of the transport and physical activity questionnaire (TPAQ) for assessing physical activity behaviour.

Science.gov (United States)

Adams, Emma J; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C; Cooper, Ashley R; Ogilvie, David

2014-01-01

No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, ptravel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires.
Validity and reliability of a brief self-reported questionnaire assessing fruit and vegetable consumption among pregnant women

Directory of Open Access Journals (Sweden)

Lydi-Anne Vézina-Im

2016-09-01

Full Text Available Abstract Background Short instruments measuring frequency of specific foods, such as fruit and vegetable (FV, are increasingly used in interventions. The objective of the study was to verify the validity and test-retest reliability of such an instrument among pregnant women. Methods Pregnant women from the region of Quebec City, Quebec, Canada, were recruited through e-mails sent to female students and employees of the local university from October 2014 to April 2015. To assess the validity of the fruit and vegetable questionnaire (FVQ developed by Godin et al. (Can J Public Health 99: 494-498, 2008, pregnant women were asked in a first mailing to complete the FVQ assessing FV intake over the past 7 days and a 3-day estimated food record. A subsample (n = 33 also gave a fasting blood sample and completed a validated semi-quantitative FFQ administered by a trained registered dietitian during a visit at the research center. FV intakes for all instruments were calculated in terms of servings of FV based on Canada’s Food Guide definition of a serving of fruit or vegetable. In order to assess its test-retest reliability, respondents were asked to complete the FVQ 14 days later in a second mailing. Results Forty-eight pregnant women from all three trimesters completed the questionnaires in the first mailing. FV intake assessed using the FVQ was correlated to FV consumption measured using the food record (r = 0.34, p = 0.0180 and the FFQ (r = 0.61, p = 0.0002. Results were similar when controlling for energy intake and the experience of nausea in the past month. Only β-cryptoxanthin was significantly correlated to FV intake assessed by the FFQ when adjusted for the presence of nausea (r = 0.35, p = 0.0471. Data on the test-retest reliability was available for 44 women and the intra-class coefficient for the FVQ was 0.72 at a mean 28-day interval. Conclusions The FVQ has acceptable validity and test-retest reliability
Validity and reliability of a brief self-reported questionnaire assessing fruit and vegetable consumption among pregnant women.

Science.gov (United States)

Vézina-Im, Lydi-Anne; Godin, Gaston; Couillard, Charles; Perron, Julie; Lemieux, Simone; Robitaille, Julie

2016-09-15

Short instruments measuring frequency of specific foods, such as fruit and vegetable (FV), are increasingly used in interventions. The objective of the study was to verify the validity and test-retest reliability of such an instrument among pregnant women. Pregnant women from the region of Quebec City, Quebec, Canada, were recruited through e-mails sent to female students and employees of the local university from October 2014 to April 2015. To assess the validity of the fruit and vegetable questionnaire (FVQ) developed by Godin et al. (Can J Public Health 99: 494-498, 2008), pregnant women were asked in a first mailing to complete the FVQ assessing FV intake over the past 7 days and a 3-day estimated food record. A subsample (n = 33) also gave a fasting blood sample and completed a validated semi-quantitative FFQ administered by a trained registered dietitian during a visit at the research center. FV intakes for all instruments were calculated in terms of servings of FV based on Canada's Food Guide definition of a serving of fruit or vegetable. In order to assess its test-retest reliability, respondents were asked to complete the FVQ 14 days later in a second mailing. Forty-eight pregnant women from all three trimesters completed the questionnaires in the first mailing. FV intake assessed using the FVQ was correlated to FV consumption measured using the food record (r = 0.34, p = 0.0180) and the FFQ (r = 0.61, p = 0.0002). Results were similar when controlling for energy intake and the experience of nausea in the past month. Only β-cryptoxanthin was significantly correlated to FV intake assessed by the FFQ when adjusted for the presence of nausea (r = 0.35, p = 0.0471). Data on the test-retest reliability was available for 44 women and the intra-class coefficient for the FVQ was 0.72 at a mean 28-day interval. The FVQ has acceptable validity and test-retest reliability values, but seems to underestimate FV servings in pregnant women

Test-retest reliability of knee extensor rate of velocity and power development in older adults using the isotonic mode on a Biodex System 3 dynamometer.

Science.gov (United States)

Van Driessche, Stijn; Van Roie, Evelien; Vanwanseele, Benedicte; Delecluse, Christophe

2018-01-01

Isotonic testing and measures of rapid power production are emerging as functionally relevant test methods for detection of muscle aging. Our objective was to assess reliability of rapid velocity and power measures in older adults using the isotonic mode of an isokinetic dynamometer. Sixty-three participants (aged 65 to 82 years) underwent a test-retest protocol with one week time interval. Isotonic knee extension tests were performed at four different loads: 0%, 25%, 50% and 75% of maximal isometric strength. Peak velocity (pV) and power (pP) were determined as the highest values of the velocity and power curve. Rate of velocity (RVD) and power development (RPD) were calculated as the linear slopes of the velocity- and power-time curve. Relative and absolute measures of test-retest reliability were analyzed using intraclass correlation coefficients (ICC), standard error of measurement (SEM) and Bland-Altman analyses. Overall, reliability was high for pV, pP, RVD and RPD at 0%, 25% and 50% load (ICC: .85 - .98, SEM: 3% - 10%). A trend for increased reliability at lower loads seemed apparent. The tests at 75% load led to range of motion failure and should be avoided. In addition, results demonstrated that caution is advised when interpreting early phase results (first 50ms). To conclude, our results support the use of the isotonic mode of an isokinetic dynamometer for testing rapid power and velocity characteristics in older adults, which is of high clinical relevance given that these muscle characteristics are emerging as the primary outcomes for preventive and rehabilitative interventions in aging research.
The reliability and validity of fatigue measures during multiple-sprint work: an issue revisited.

Science.gov (United States)

Glaister, Mark; Howatson, Glyn; Pattison, John R; McInnes, Gill

2008-09-01

The ability to repeatedly produce a high-power output or sprint speed is a key fitness component of most field and court sports. The aim of this study was to evaluate the validity and reliability of eight different approaches to quantify this parameter in tests of multiple-sprint performance. Ten physically active men completed two trials of each of two multiple-sprint running protocols with contrasting recovery periods. Protocol 1 consisted of 12 x 30-m sprints repeated every 35 seconds; protocol 2 consisted of 12 x 30-m sprints repeated every 65 seconds. All testing was performed in an indoor sports facility, and sprint times were recorded using twin-beam photocells. All but one of the formulae showed good construct validity, as evidenced by similar within-protocol fatigue scores. However, the assumptions on which many of the formulae were based, combined with poor or inconsistent test-retest reliability (coefficient of variation range: 0.8-145.7%; intraclass correlation coefficient range: 0.09-0.75), suggested many problems regarding logical validity. In line with previous research, the results support the percentage decrement calculation as the most valid and reliable method of quantifying fatigue in tests of multiple-sprint performance.
Assessment of Technical Skills in Young Soccer Goalkeepers: Reliability and Validity of Two Goalkeeper-Specific Tests

Directory of Open Access Journals (Sweden)

Ricardo Rebelo-Gonçalves, António J. Figueiredo, Manuel J. Coelho-e-Silva, Antonio Tessitore

2016-09-01

Full Text Available The purpose of this study was to evaluate the reproducibility and validity of two new tests designed to examine goalkeeper-specific technique. Twenty-six goalkeepers (14.49 ± 2.52 years old completed two trial sessions, each separated by one week, to evaluate the reproducibility of the Sprint-Keeper Test (S-Keeper and the Lateral Shuffle-Keeper Test (LS-Keeper. Construct validity was assessed among forty goalkeepers (14.49 ± 1.71 years old by competitive level (elite versus non-elite, after controlling for chronological age. All participants were examined in vertical jump (CMJ and CMJ-free arms, acceleration (5-m and 10-m sprint and goalkeeper-specific technique. The S-Keeper requires the goalkeeper to accelerate during 3 m and dive over a stationary ball after performing a change of direction in a total distance of 10 m. The LS-Keeper involves three changes of direction and a diving save over a stationary ball, in a total distance of 12.55 m. Performance was respectively measured as total time for the right and left sides in each protocol. Bivariate correlations between repeated measures were high and significant (r = 0.835 – 0.912. Test-retest results for the S-Keeper and LS-Keeper showed good reliability (reliability coefficients > 0.88, intra-class correlation coefficient > 0.908 and coefficients of variation < 4.37%, even though participants tended to improve performance when diving to their right side (p < 0.05. Both tests were able to detect significant differences between elite and non-elite goalkeepers, particularly to the left side (p < 0.05. These findings suggest that the S-Keeper and LS-Keeper are reliable and valid tests for assessing goalkeeper-specific technique. Both protocols can be used as a practical tool to provide relevant information about the influence of several components of performance in the overall execution of a diving save, particularly movement patterns, take-off movements and possible asymmetries.
Test-retest studies of cerebral glucose metabolism using fluorine-18 deoxyglucose: validation of method

International Nuclear Information System (INIS)

Brooks, R.A.; Di Chiro, G.; Zukerberg, B.W.; Bairamian, D.; Larson, S.M.

1987-01-01

In studies using [ 18 F]deoxyglucose (FDG), one often wants to compare metabolic rates following stimulation (drug or motor-sensory) with the baseline values. However, because of reproducibility problems with baseline variations of 25% in the same individual not uncommon, the global effect of the stimulation may be difficult to see. One approach to this problem is to perform the two studies sequentially. This means that, with the 110-min half-life of 18 F, one must take into account the residual activity from the first study when calculating metabolic rates for the second. We performed TEST-RETEST baseline studies on four subjects, with a 1-hr interval between injections. These studies were done without stimulation, in order to validate the repeatability of the method. To reduce the amount of residual activity from the first study, the first injection was only 2 mCi in three cases, and only 1 mCi in one case, out of a total injected dose of 5 mCi. A correction for residual activity was included in the RETEST calculation of metabolic rate. The results showed a global metabolic shift between the two studies of 2% to 9%. An error analysis shows that the shift could be further reduced if anatomically comparable scans are done at comparable postinjection times
Reliability and Validity Assessment of a Linear Position Transducer

Science.gov (United States)

Garnacho-Castaño, Manuel V.; López-Lastra, Silvia; Maté-Muñoz, José L.

2015-01-01

The objectives of the study were to determine the validity and reliability of peak velocity (PV), average velocity (AV), peak power (PP) and average power (AP) measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain) during two resistance exercises, bench press (BP) and full back squat (BS), performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2). Intraclass correlation coefficients (ICCs) indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W). Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W). Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP) make this device a useful tool for monitoring resistance training. Key points This study determined the validity and reliability of peak velocity, average velocity, peak power and average power measurements made using a linear position transducer The Tendo Weight-lifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and power. PMID:25729300
The reliability and validity study of the Kinesthetic and Visual Imagery Questionnaire in individuals with Multiple Sclerosis

Directory of Open Access Journals (Sweden)

Yousef Moghadas Tabrizi

2013-12-01

Full Text Available OBJECTIVE: Motor imagery (MI has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS. It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. METHOD: Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14days apart by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R as the gold standard. Intra-class correlation coefficients (ICCs were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha and factorial structure of the KVIQ were studied. RESULTS: The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93, and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79. The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84. Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. CONCLUSIONS: The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients.
The reliability and validity study of the Kinesthetic and Visual Imagery Questionnaire in individuals with multiple sclerosis.

Science.gov (United States)

Tabrizi, Yousef Moghadas; Zangiabadi, Nasser; Mazhari, Shahrzad; Zolala, Farzaneh

2013-01-01

Motor imagery (MI) has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS). It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ) was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14 days apart) by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R) as the gold standard. Intra-class correlation coefficients (ICCs) were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha) and factorial structure of the KVIQ were studied. The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93), and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79). The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84). Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients.
Construct validity and test–retest reliability of the International Fitness Scale (IFIS in Colombian children and adolescents aged 9–17.9 years: the FUPRECOL study

Directory of Open Access Journals (Sweden)

Robinson Ramírez-Vélez

2017-05-01

Full Text Available Background There is a lack of instruments and studies written in Spanish evaluating physical fitness, impeding the determination of the current status of this important health indicator in the Latin population, especially in Colombia. The aim of the study was two-fold: to examine the validity of the International Fitness Scale (IFIS with a population-based sample of schoolchildren from Bogota, Colombia and to examine the reliability of the IFIS with children and adolescents from Engativa, Colombia. Methods The sample comprised 1,873 Colombian youths (54.5% girls aged 9–17.9 years. We measured their adiposity markers (waist-to-height ratio, skinfold thickness, percentage of body fat and body mass index, blood pressure, lipids profile, fasting glucose, and physical fitness level (self-reported and measured. A validated cardiometabolic risk index score was also used. An age- and sex-matched subsample of 229 schoolchildren who were not originally included in the sample completed the IFIS twice for reliability purposes. Results Our data suggest that both measured and self-reported overall physical fitness levels were inversely associated with percentage of body fat indicators and the cardiometabolic risk index score. Overall, schoolchildren who self-reported “good” or “very good” fitness had better measured fitness levels than those who reported “very poor/poor” fitness (all p < 0.001. The test-retest reliability of the IFIS items was also good, with an average weighted kappa of 0.811. Discussion Our findings suggest that self-reported fitness, as assessed by the IFIS, is a valid, reliable, and health-related measure. Furthermore, it can be a good alternative for future use in large studies with Latin schoolchildren from Colombia.
Accuracy and Feasibility of Video Analysis for Assessing Hamstring Flexibility and Validity of the Sit-and-Reach Test

Science.gov (United States)

Mier, Constance M.

2011-01-01

The accuracy of video analysis of the passive straight-leg raise test (PSLR) and the validity of the sit-and-reach test (SR) were tested in 60 men and women. Computer software measured static hip-joint flexion accurately. High within-session reliability of the PSLR was demonstrated (R greater than 0.97). Test-retest (separate days) reliability for…
Cross-cultural adaptation, reliability, internal consistency and validation of the Hand Function Sort (HFS©) for French speaking patients with upper limb complaints.

Science.gov (United States)

Konzelmann, M; Burrus, C; Hilfiker, R; Rivier, G; Deriaz, O; Luthi, F

2015-03-01

Functional evaluation of upper limb is not only based on clinical findings but requires self-administered questionnaires to address patients' perspective. The Hand Function Sort (HFS©) was only validated in English. The aim of this study was the French cross cultural adaptation and validation of the HFS© (HFS-F). 150 patients with various upper limbs impairments were recruited in a rehabilitation center. Translation and cross-cultural adaptation were made according to international guidelines. Construct validity was estimated through correlations with Disabilities Arm Shoulder and Hand (DASH) questionnaire, SF-36 mental component summary (MCS),SF-36 physical component summary (PCS) and pain intensity. Internal consistency was assessed by Cronbach's α and test-retest reliability by intraclass correlation. Cronbach's α was 0.98, test-retest reliability was excellent at 0.921 (95 % CI 0.871-0.971) same as original HFS©. Correlations with DASH were-0.779 (95 % CI -0.847 to -0.685); with SF 36 PCS 0.452 (95 % CI 0.276-0.599); with pain -0.247 (95 % CI -0.429 to -0.041); with SF 36 MCS 0.242 (95 % CI 0.042-0.422). There were no floor or ceiling effects. The HFS-F has the same good psychometric properties as the original HFS© (internal consistency, test retest reliability, convergent validity with DASH, divergent validity with SF-36 MCS, and no floor or ceiling effects). The convergent validity with SF-36 PCS was poor; we found no correlation with pain. The HFS-F could be used with confidence in a population of working patients. Other studies are necessary to study its psychometric properties in other populations.
Reliability and validity of the adapted Resistance Training Skills Battery for Children.

Science.gov (United States)

Furzer, Bonnie J; Bebich-Philip, Marc D; Wright, Kemi E; Reid, Siobhan L; Thornton, Ashleigh L

2017-12-29

Resistance training (RT) is emerging as a training modality to improve motor function and facilitate physical activity participation in children across the motor proficiency spectrum. Although RT competency assessments have been established and validated among adolescent cohorts, the extent to which these methods are suitable for assessing children's RT skills is unknown. This project aimed to assess the psychometric properties of the adapted Resistance Training Skills Battery for Children (RTSBc), in children with varying motor proficiency. Repeated measures design with 40 participants (M age=8.2±1.7years) displaying varying levels of motor proficiency. Participants performed the adapted RTSBc on two occasions, receiving a score for their execution of each component, in addition to an overall RT skill quotient child (RTSQc). Cronbach's alpha, intra-class correlation (ICC), Bland-Altman analysis, and typical error were used to assess test-retest reliability. To examine construct validity, exploratory factor analysis was performed alongside computing correlations between participants' muscle strength, motor proficiency, age, lean muscle mass, and RTSQc. The RTSBc displayed an acceptable level of internal consistency (alpha=0.86) and test-retest reliability (ICC range=0.86-0.99). Exploratory factor analysis supported internal test structure, with all six RT skills loading strongly on a single factor (range 0.56-0.89). Analyses of structural validity revealed positive correlations for RTSQc in relation to motor proficiency (r=0.52, preliability of the RTSBc, providing preliminary evidence that the RTSBc is appropriate for use in the assessment of children's RT competency. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 5. Technical Report #1220

Science.gov (United States)

Lai, Cheng-Fei; Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 2. Technical Report #1217

Science.gov (United States)

Anderson, Daniel; Lai, Cheg-Fei; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest an alternate form) and G-Theory/D-Study on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from the convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest. Due to…
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 1. Technical Report #1216

Science.gov (United States)

Anderson, Daniel; Park, Jasmine, Bitnara; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest/and alternate form) and G-Theory/D-Study research on the easy CBM reading measures, grades 1-5. Data were gathered in the spring 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest. Due…
A Reliable and Valid Survey to Predict a Patient’s Gagging Intensity

Directory of Open Access Journals (Sweden)

Casey M. Hearing

2014-07-01

Full Text Available Objectives: The aim of this study was to devise a reliable and valid survey to predict the intensity of someone’s gag reflex. Material and Methods: A 10-question Predictive Gagging Survey was created, refined, and tested on 59 undergraduate participants. The questions focused on risk factors and experiences that would indicate the presence and strength of someone’s gag reflex. Reliability was assessed by administering the survey to a group of 17 participants twice, with 3 weeks separating the two administrations. Finally, the survey was given to 25 dental patients. In these cases, patients completed an informed consent form, filled out the survey, and then had a maxillary impression taken while their gagging response was quantified from 1 to 5 on the Fiske and Dickinson Gagging Intensity Index. Results: There was a moderate positive correlation between the Predictive Gagging Survey and Fiske and Dickinson’s Gagging Severity Index, r = +0.64, demonstrating the survey’s validity. Furthermore, the test-retest reliability was r = +0.96, demonstrating the survey’s reliability. Conclusions: The Predictive Gagging Survey is a 10-question survey about gag-related experiences and behaviours. We established that it is a reliable and valid method to assess the strength of someone’s gag reflex.
Preliminary findings on the reliability and validity of the Cantonese Birmingham Cognitive Screen in patients with acute ischemic stroke.

Science.gov (United States)

Pan, Xiaoping; Chen, Haobo; Bickerton, Wai-Ling; Lau, Johnny King Lam; Kong, Anthony Pak Hin; Rotshtein, Pia; Guo, Aihua; Hu, Jianxi; Humphreys, Glyn W

2015-01-01

There are no currently effective cognitive assessment tools for patients who have suffered stroke in the People's Republic of China. The Birmingham Cognitive Screen (BCoS) has been shown to be a promising tool for revealing patients' poststroke cognitive deficits in specific domains, which facilitates more individually designed rehabilitation in the long run. Hence we examined the reliability and validity of a Cantonese version BCoS in patients with acute ischemic stroke, in Guangzhou. A total of 98 patients with acute ischemic stroke were assessed with the Cantonese version of the BCoS, and an additional 133 healthy individuals were recruited as controls. Apart from the BCoS, the patients also completed a number of external cognitive tests, including the Montreal Cognitive Assessment Test (MoCA), Mini Mental State Examination (MMSE), Albert's cancellation test, the Rey-Osterrieth Complex Figure Test, and six gesture matching tasks. Cutoff scores for failing each subtest, ie, deficits, were computed based on the performance of the controls. The validity and reliability of the Cantonese BCoS were examined, as well as interrater and test-retest reliability. We also compared the proportions of cases being classified as deficits in controlled attention, memory, character writing, and praxis, between patients with and without spoken language impairment. Analyses showed high test-retest reliability and agreement across independent raters on the qualitative aspects of measurement. Significant correlations were observed between the subtests of the Cantonese BCoS and the other external cognitive tests, providing evidence for convergent validity of the Cantonese BCoS. The screen was also able to generate measures of cognitive functions that were relatively uncontaminated by the presence of aphasia. This study suggests good reliability and validity of the Cantonese version of the BCoS. The Cantonese BCoS is a very promising tool for the detection of cognitive problems in
A comparison between the original and Tablet-based Symbol Digit Modalities Test in patients with schizophrenia: Test-retest agreement, random measurement error, practice effect, and ecological validity.

Science.gov (United States)

Tang, Shih-Fen; Chen, I-Hui; Chiang, Hsin-Yu; Wu, Chien-Te; Hsueh, I-Ping; Yu, Wan-Hui; Hsieh, Ching-Lin

2017-11-27

We aimed to compare the test-retest agreement, random measurement error, practice effect, and ecological validity of the original and Tablet-based Symbol Digit Modalities Test (T-SDMT) over five serial assessments, and to examine the concurrent validity of the T-SDMT in patients with schizophrenia. Sixty patients with chronic schizophrenia completed five serial assessments (one week apart) of the SDMT and T-SDMT and one assessment of the Activities of Daily Living Rating Scale III at the first time point. Both measures showed high test-retest agreement, similar levels of random measurement error over five serial assessments. Moreover, the practice effects of the two measures did not reach a plateau phase after five serial assessments in young and middle-aged participants. Nevertheless, only the practice effect of the T-SDMT became trivial after the first assessment. Like the SDMT, the T-SDMT had good ecological validity. The T-SDMT also had good concurrent validity with the SDMT. In addition, only the T-SDMT had discriminative validity to discriminate processing speed in young and middle-aged participants. Compared to the SDMT, the T-SDMT had overall slightly better psychometric properties, so it can be an alternative measure to the SDMT for assessing processing speed in patients with schizophrenia. Copyright © 2017 Elsevier B.V. All rights reserved.
Portuguese validation of the children's eating attitudes test

Directory of Open Access Journals (Sweden)

Maria Del Carmen Bento Teixeira

2012-01-01

Full Text Available BACKGROUND: The Eating Attitudes Test (EAT is the most widely used instrument for evaluating eating disorders in adults and adolescents in a variety of cultures and samples. OBJECTIVE: The aim of this study was to analyse the psychometric properties of the Portuguese version of the Children's Eating Attitudes Test (ChEAT. METHOD: Nine hundred and fifty-six Portuguese secondary students (565 girls and 391 boys answered the ChEAT. The test-retest reliability was obtained with data from 206 participants from the total sample who re-answered the questionnaire after 4-6 weeks. Psychometric analyses were carried out for the total sample and separately for girls and boys. RESULTS: Internal consistency and test-retest reliability were satisfactory. Principal components factorial analysis yielded four factors in the total sample, accounting for 42.35% of the total variance. Factor structure was similar in the total sample and in both genders. Factors were labelled: F1 "Fear of Getting Fat", F2 "Restrictive and Purgative Behaviours", F3 "Food Preoccupation" and F4 "Social Pressure to Eat". The concurrent validity, explored using the Contour Drawing Figure Rating Scale (CDRS was high. DISCUSSION: The Portuguese version of the ChEAT is a valid and useful instrument for the evaluation of abnormal eating attitudes and behaviours among Portuguese adolescents.
Validity and Reliability of Surface Electromyography in the Assessment of Primary Muscle Tension Dysphonia.

Science.gov (United States)

Khoddami, Seyyedeh Maryam; Talebian, Saeed; Izadi, Farzad; Ansari, Noureddin Nakhostin

2017-05-01

The study aims to evaluate the reliability and the discriminative validity of surface electromyography (sEMG) in the assessment of patients with primary muscle tension dysphonia (MTD). The study design is cross-sectional. Fifteen patients with primary MTD (mean age: 34.07 ± 10.99 years) and 15 healthy volunteers (mean age: 34.53 ± 10.63 years) were included. All participants underwent evaluation of sEMG to record the electrical activity of the thyrohyoid and cricothyroid muscles. The outcome measures were the root mean square (RMS), activity peak, duration, and time to the peak activity, which were obtained during /a/ and /i/ prolongation for test-retest reliability. The test-retest reliability was good to excellent for the RMS and peak activity measures (intraclass correlation coefficient [agreement] [ICC agreement ] = 0.49-0.98). The reliability for the activity duration was poor to excellent (ICC agreement = 0.19-0.9). Poor test-retest reliability was found for the time to peak measure (ICC agreement = 0.15-0.37). The standard error of measurement for all sEMG measures was between 0.41 and 2.05. The smallest detectable change (SDC) was calculated between 1.13 and 5.66. The highest SDC values were obtained for the peak and the lowest SDCs were documented for the duration (5.66 and 1.13, respectively). All sEMG measures were not able to discriminate between the MTD patients and healthy subjects (P > 0.05). The sEMG is a reliable tool to measure the RMS, the peak activity, and the activity duration in primary MTD. However, it is not able to discriminate the patients with primary MTD from healthy subjects. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Morpho-Functional 1H-MRI of the Lung in COPD: Short-Term Test-Retest Reliability.

Directory of Open Access Journals (Sweden)

Bertram J Jobst

Full Text Available Non-invasive end-points for interventional trials and tailored treatment regimes in chronic obstructive pulmonary disease (COPD for monitoring regionally different manifestations of lung disease instead of global assessment of lung function with spirometry would be valuable. Proton nuclear magnetic resonance imaging (1H-MRI allows for a radiation-free assessment of regional structure and function. The aim of this study was to evaluate the short-term reproducibility of a comprehensive morpho-functional lung MRI protocol in COPD.20 prospectively enrolled COPD patients (GOLD I-IV underwent 1H-MRI of the lung at 1.5T on two consecutive days, including sequences for morphology, 4D contrast-enhanced perfusion, and respiratory mechanics. Image quality and COPD-related morphological and functional changes were evaluated in consensus by three chest radiologists using a dedicated MRI-based visual scoring system. Test-retest reliability was calculated per each individual lung lobe for the extent of large airway (bronchiectasis, wall thickening, mucus plugging and small airway abnormalities (tree in bud, peripheral bronchiectasis, mucus plugging, consolidations, nodules, parenchymal defects and perfusion defects. The presence of tracheal narrowing, dystelectasis, pleural effusion, pulmonary trunk ectasia, right ventricular enlargement and, finally, motion patterns of diaphragma and chest wall were addressed.Median global scores [10(Q1:8.00;Q3:16.00 vs.11(Q1:6.00;Q3:15.00] as well as category subscores were similar between both timepoints, and kappa statistics indicated "almost perfect" global agreement (ĸ = 0.86, 95%CI = 0.81-0.91. Most subscores showed at least "substantial" agreement of MRI1 and MRI2 (ĸ = 0.64-1.00, whereas the agreement for the diagnosis of dystelectasis/effusion (ĸ = 0.42, 95%CI = 0.00-0.93 was "moderate" and of tracheal abnormalities (ĸ = 0.21, 95%CI = 0.00-0.75 "fair". Most MRI acquisitions showed at least diagnostic quality at

Validity and reliability of tests determining performance-related components of wheelchair basketball

NARCIS (Netherlands)

De Groot, Sonja; Balvers, Inge J. M.; Kouwenhoven, Sanne M.; Janssen, Thomas W. J.

2012-01-01

The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
Validity and reliability of tests determining performance-related components of wheelchair basketball

NARCIS (Netherlands)

de Groot, Sonja; Balvers, Inge J.M.; Kouwenhoven, Sanne M.; Janssen, Thomas W.J.

The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
Reliability of provocative tests of motion sickness susceptibility

Science.gov (United States)

Calkins, D. S.; Reschke, M. F.; Kennedy, R. S.; Dunlop, W. P.

1987-01-01

Test-retest reliability values were derived from motion sickness susceptibility scores obtained from two successive exposures to each of three tests: (1) Coriolis sickness sensitivity test; (2) staircase velocity movement test; and (3) parabolic flight static chair test. The reliability of the three tests ranged from 0.70 to 0.88. Normalizing values from predictors with skewed distributions improved the reliability.
Reliability and Validity Assessment of a Linear Position Transducer

Directory of Open Access Journals (Sweden)

Manuel V. Garnacho-Castaño

2015-03-01

Full Text Available The objectives of the study were to determine the validity and reliability of peak velocity (PV, average velocity (AV, peak power (PP and average power (AP measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain during two resistance exercises, bench press (BP and full back squat (BS, performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2. Intraclass correlation coefficients (ICCs indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W. Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W. Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP make this device a useful tool for monitoring resistance training.
Validity and reliability of the Malay version multidimensional scale of perceived social support (MSPSS-M) among teachers.

Science.gov (United States)

Lee, Soo Cheng; Moy, Foong Ming; Hairi, Noran Naqiah

2017-01-01

The multidimensional scale of perceived social support (MSPSS) was developed to measure perceived social support. It has been translated and culturally adapted among natives literate in the Malay language. However, its psychometric properties for teachers who are majority females and married have not been assessed. This was a cross-sectional study conducted among the public secondary school teachers in the central region of Peninsular Malaysia from May to July 2013. A total of 150 and 203 teachers were recruited to perform exploratory factor analysis and confirmatory factor analysis (CFA), respectively. Reliability testing was evaluated on 141 teachers via internal consistency and two-week interval test-retest. The 12-item three-factor structure of MSPSS-M was revised to 8-item two-factor structure. The revised MSPSS-M demonstrated excellent fit in CFA with adequate divergent and convergent validity and good factor loadings (0.80-0.90). The revised MSPSS-M also displayed good internal consistency with Cronbach's alpha of 0.91, 0.93 and 0.92 and good test-retest reliability with intraclass correlation of 0.89, 0.88 and 0.88 in the total scale, family and friends factors, respectively. The revised 8-item MSPSS-M is a reliable and valid tool for assessment of perceived social support among teachers.
Retest effects in working memory capacity tests: A meta-analysis.

Science.gov (United States)

Scharfen, Jana; Jansen, Katrin; Holling, Heinz

2018-06-15

The repeated administration of working memory capacity tests is common in clinical and research settings. For cognitive ability tests and different neuropsychological tests, meta-analyses have shown that they are prone to retest effects, which have to be accounted for when interpreting retest scores. Using a multilevel approach, this meta-analysis aims at showing the reproducibility of retest effects in working memory capacity tests for up to seven test administrations, and examines the impact of the length of the test-retest interval, test modality, equivalence of test forms and participant age on the size of retest effects. Furthermore, it is assessed whether the size of retest effects depends on the test paradigm. An extensive literature search revealed 234 effect sizes from 95 samples and 68 studies, in which healthy participants between 12 and 70 years repeatedly performed a working memory capacity test. Results yield a weighted average of g = 0.28 for retest effects from the first to the second test administration, and a significant increase in effect sizes was observed up to the fourth test administration. The length of the test-retest interval and publication year were found to moderate the size of retest effects. Retest effects differed between the paradigms of working memory capacity tests. These findings call for the development and use of appropriate experimental or statistical methods to address retest effects in working memory capacity tests.
Chinese version of the Constant-Murley questionnaire for shoulder pain and disability: a reliability and validation study.

Science.gov (United States)

Yao, Min; Yang, Long; Cao, Zuo-Yuan; Cheng, Shao-Dan; Tian, Shuang-Lin; Sun, Yue-Li; Wang, Jing; Xu, Bao-Ping; Hu, Xiao-Chun; Wang, Yong-Jun; Zhang, Ying; Cui, Xue-Jun

2017-09-18

Shoulder pain is a common musculoskeletal disorder in Chinese population, which affects more than 1,3 billion individuals. To the best of our knowledge, there has been no available Chinese-language version of measurements of shoulder pain and disability so far. Moreover, the Constant-Murley score (CMS) questionnaire is a universally recognized patient-reported questionnaire for clinical practice and research. The present study was designed to evaluate a Chinese translational version of CMS and subsequently assess its reliability and validity. The Chinese translational version of CMS was formulated by means of forward-backward translation. Meanwhile, a final review was carried out by an expert committee, followed by conducting a test of the pre-final version. Therefore, the reliability and validity of the Chinese translational version of CMS could be assessed using the internal consistency, construct validity, factor analysis, reliability and floor and ceiling effects. Specifically, the reliability was assessed by testing the internal consistency (Cronbach's α) and test-retest reliability (intraclass coefficient correlation [ICC]), while the construct validity was evaluated via comparison between the Chinese translational version of CMS with visual analog scale (VAS) score and the 36-Item Short Form Health Survey (SF-36, Spearman correlation). The questionnaire was verified to be acceptable after distribution among 120 subjects with unilateral shoulder pain. Factor analysis had revealed a two-factor and 10-item solution. Moreover, the assessment results indicated that the Chinese translational version of CMS questionnaire harbored good internal consistency (Cronbach's α = 0.739) and test-retest reliability (ICC = 0.827). In addition, the Chinese translational version of CMS was moderately correlated with VAS score (r = 0.497) and SF-36 (r = 0.135). No obvious floor and ceiling effects were observed in the Chinese translational version of CMS questionnaire
Validation and reliability of the scale Self-efficacy and their child's level of asthma control

Directory of Open Access Journals (Sweden)

Ana Lúcia Araújo Gomes

Full Text Available ABSTRACT Objective: To evaluate the psychometric properties in terms of validity and reliability of the scale Self-efficacy and their child's level of asthma control: Brazilian version. Method: Methodological study in which 216 parents/guardians of children with asthma participated. A construct validation (factor analysis and test of hypothesis by comparison of contrasted groups and an analysis of reliability in terms of homogeneity (Cronbach's alpha and stability (test-retest were carried out. Results: Exploratory factor analysis proved suitable for the Brazilian version of the scale (Kaiser-Meyer-Olkim index of 0.879 and Bartlett's sphericity with p < 0.001. The correlation matrix in factor analysis suggested the removal of item 7 from the scale. Cronbach's alpha of the final scale, with 16 items, was 0.92. Conclusion: The Brazilian version of Self-efficacy and their child's level of asthma control presented psychometric properties that confirmed its validity and reliability.
An adaptive semantic matching paradigm for reliable and valid language mapping in individuals with aphasia.

Science.gov (United States)

Wilson, Stephen M; Yen, Melodie; Eriksson, Dana K

2018-04-17

Research on neuroplasticity in recovery from aphasia depends on the ability to identify language areas of the brain in individuals with aphasia. However, tasks commonly used to engage language processing in people with aphasia, such as narrative comprehension and picture naming, are limited in terms of reliability (test-retest reproducibility) and validity (identification of language regions, and not other regions). On the other hand, paradigms such as semantic decision that are effective in identifying language regions in people without aphasia can be prohibitively challenging for people with aphasia. This paper describes a new semantic matching paradigm that uses an adaptive staircase procedure to present individuals with stimuli that are challenging yet within their competence, so that language processing can be fully engaged in people with and without language impairments. The feasibility, reliability and validity of the adaptive semantic matching paradigm were investigated in sixteen individuals with chronic post-stroke aphasia and fourteen neurologically normal participants, in comparison to narrative comprehension and picture naming paradigms. All participants succeeded in learning and performing the semantic paradigm. Test-retest reproducibility of the semantic paradigm in people with aphasia was good (Dice coefficient = 0.66), and was superior to the other two paradigms. The semantic paradigm revealed known features of typical language organization (lateralization; frontal and temporal regions) more consistently in neurologically normal individuals than the other two paradigms, constituting evidence for validity. In sum, the adaptive semantic matching paradigm is a feasible, reliable and valid method for mapping language regions in people with aphasia. © 2018 Wiley Periodicals, Inc.
Development and psychometric validation of the verbal affective memory test

DEFF Research Database (Denmark)

Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S

2015-01-01

. Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory...... psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.......We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non...
Reasoning with Inductive Argument Test: A Study of Validity and Reliability

Directory of Open Access Journals (Sweden)

Mehmet Emrah Karadere

2013-11-01

Full Text Available Reasoning with Inductive Argument Test:A Study of Validity and Reliability Objective: The aim of our study is to research reliability and validity and to evaluate the usability of Turkish version of Reasoning with Inductive Argument Test (RIAT in Turkish healty population. Method: 51 healty volunteers who work in Ankara Dıskapi Yildirim Beyazit Research and Training Hospital participated in this study. Reasoning with Inductive Argument Test (RIAT was translated into Turkish by three clinical good knowledge of English. Participants were given a sociodemographic data form, and RIAT were performed by clinicians. To test the reliability of the Turkish version of RIAT, Cronbach’s alpha coefficient was calculated and the halving method was used for the test. Results: The internal consistency of the Reasoning with Inductive Argument Test (RIAT items, Cronbach’s alpha internal consistency coefficient measurements of 0.73 was found to be statistically significant. Spearman-Brown coefficient that determines the reliability of the whole test r=0.74 was found. Kurtosis values of all the items was below 1.5 and the percentages in the second evaluation were mainly lower. At the same time, both change in belief between self produced RIAT options and given RIAT options (p=0.02, z=-2296 as well as changes in beliefs between related and unrelated items for Obsessive Compulsive Disorder (OCD difference (p=0.03, z=-2.199 were significant. Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that ‘Reasoning with Inductive Argument Test’ supports reliability and validity in Turkish population.
Reasoning with Inductive Argument Test: A Study of Validity and Reliability

Directory of Open Access Journals (Sweden)

Mehmet Emrah Karadere

2013-12-01

Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that Reasoning with Inductive Argument Test supports reliability and validity in Turkish population. [JCBPR 2013; 2(3.000: 156-161
Fundamentals of endoscopic surgery: creation and validation of the hands-on test.

Science.gov (United States)

Vassiliou, Melina C; Dunkin, Brian J; Fried, Gerald M; Mellinger, John D; Trus, Thadeus; Kaneva, Pepa; Lyons, Calvin; Korndorffer, James R; Ujiki, Michael; Velanovich, Vic; Kochman, Michael L; Tsuda, Shawn; Martinez, Jose; Scott, Daniel J; Korus, Gary; Park, Adrian; Marks, Jeffrey M

2014-03-01

The Fundamentals of Endoscopic Surgery™ (FES) program consists of online materials and didactic and skills-based tests. All components were designed to measure the skills and knowledge required to perform safe flexible endoscopy. The purpose of this multicenter study was to evaluate the reliability and validity of the hands-on component of the FES examination, and to establish the pass score. Expert endoscopists identified the critical skill set required for flexible endoscopy. They were then modeled in a virtual reality simulator (GI Mentor™ II, Simbionix™ Ltd., Airport City, Israel) to create five tasks and metrics. Scores were designed to measure both speed and precision. Validity evidence was assessed by correlating performance with self-reported endoscopic experience (surgeons and gastroenterologists [GIs]). Internal consistency of each test task was assessed using Cronbach's alpha. Test-retest reliability was determined by having the same participant perform the test a second time and comparing their scores. Passing scores were determined by a contrasting groups methodology and use of receiver operating characteristic curves. A total of 160 participants (17 % GIs) performed the simulator test. Scores on the five tasks showed good internal consistency reliability and all had significant correlations with endoscopic experience. Total FES scores correlated 0.73, with participants' level of endoscopic experience providing evidence of their validity, and their internal consistency reliability (Cronbach's alpha) was 0.82. Test-retest reliability was assessed in 11 participants, and the intraclass correlation was 0.85. The passing score was determined and is estimated to have a sensitivity (true positive rate) of 0.81 and a 1-specificity (false positive rate) of 0.21. The FES hands-on skills test examines the basic procedural components required to perform safe flexible endoscopy. It meets rigorous standards of reliability and validity required for high
Validity and Reliability of the Achilles Tendon Total Rupture Score

DEFF Research Database (Denmark)

Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob

2013-01-01

study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately......The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present...... = .07). The limits of agreement were ±18.53. A strong correlation was found between test and retest (intercorrelation coefficient .908); the standard error of measurement was 6.7, and the minimal detectable change was 18.5. The Danish version of the ATRS showed moderately strong criterion validity...
Translation of Oswestry Disability Index into Tamil with Cross Cultural Adaptation and Evaluation of Reliability and Validity§

Science.gov (United States)

Vincent, Joshua Israel; MacDermid, Joy Christine; Grewal, Ruby; Sekar, Vincent Prabhakaran; Balachandran, Dinesh

2014-01-01

Study Design: Prospective longitudinal validation study Objective: To translate and cross-culturally adapt the Oswestry Disability Index (ODI) to the Tamil language (ODI-T), and to evaluate its reliability and construct validity. Summary of Background Data: ODI is widely used as a disease specific questionnaire in back pain patients to evaluate pain and disability. A thorough literature search revealed that the Tamil version of the ODI has not been previously published. Methods: The ODI was translated and cross-culturally adapted to the Tamil language according to established guidelines. 30 subjects (16 women and 14 men) with a mean age of 42.7 years (S.D. 13.6; Range 22 - 69) with low back pain were recruited to assess the psychometric properties of the ODI-T Questionnaire. Patients completed the ODI-T, Roland-Morris disability questionnaire (RMDQ), VAS-pain and VAS-disability at baseline and 24-72 hours from the baseline visit. Results: The ODI-T displayed a high degree of internal consistency, with a Cronbach's alpha of 0.92. The test-retest reliability was high (n=30) with an ICC of 0.92 (95% CI, 0.84 to 0.96) and a mean re-test difference of 2.6 points lower on re-test. The ODI-T scores exhibited a strong correlation with the RMDQ scores (r = 0.82) pdisability in Tamil speaking patients with low back pain. PMID:24563681
Test-retest reliability of the novel 5-HT1B receptor PET radioligand [11C]P943

International Nuclear Information System (INIS)

Saricicek, Aybala; Chen, Jason; Ruf, Barbara; Planeta, Beata; Labaree, David; Gallezot, Jean-Dominique; Huang, Yiyun; Subramanyam, Kalyani; Maloney, Kathleen; Matuskey, David; Deserno, Lorenz; Neumeister, Alexander; Krystal, John H.; Carson, Richard E.; Bhagwagar, Zubin

2015-01-01

[ 11 C]P943 is a novel, highly selective 5-HT 1B PET radioligand. The aim of this study was to determine the test-retest reliability of [ 11 C]P943 using two different modeling methods and to perform a power analysis with each quantification technique. Seven healthy volunteers underwent two PET scans on the same day. Regions of interest (ROIs) were the amygdala, hippocampus, pallidum, putamen, insula, frontal, anterior cingulate, parietal, temporal and occipital cortices, and cerebellum. Two multilinear radioligand quantification techniques were used to estimate binding potential: MA1, using arterial input function data, and the second version of the multilinear reference tissue model analysis (MRTM2), using the cerebellum as the reference region. Between-scan percent variability and intraclass correlation coefficients (ICC) were used to assess test-retest reliability. We also performed power analyses to determine the method that would allow the least number of subjects using within-subject or between-subject study designs. A voxel-wise ICC analysis for MRTM2 BP ND was performed for the whole brain and all the ROIs studied. Mean percent variability between two scans across regions ranged between 0.4 % and 12.4 % for MA1 BP ND , 0.5 % and 11.5 % for MA1 BP P , 16.7 % and 28.3 % for MA1 BP F , and between 0.2 % and 5.4 % for MRTM2 BP ND . The power analyses showed a greater number of subjects were required using MA1 BP F compared with other outcome measures for both within-subject and between-subject study designs. ICC values were the highest using MRTM2 BP ND and the lowest with MA1 BP F in ten ROIs. Small regions and regions with low binding had lower ICC values than large regions and regions with high binding. Reliable measures of 5-HT 1B receptor binding can be obtained using the novel PET radioligand [ 11 C]P943. Quantification of 5-HT 1B receptor binding with MRTM2 BP ND and with MA1 BP P provided the least variability and optimal power for within-subject and
Reliability and validity of the EQ-5D-3L for Kashin-Beck disease in China.

Science.gov (United States)

Fang, Hua; Farooq, Umer; Wang, Dimiao; Yu, Fangfang; Younus, Mohammad Imran; Guo, Xiong

2016-01-01

Kashin-Beck Disease (KBD) is an endemic osteoarthropathy in areas which extend from the North-East to the South-West of China. Most of the patients with KBD suffer multiple dysfunctions in major joints causing decreased health status. However because of their low education level and unique living habits, it is hard to find tools to measure the health-related quality of life (HRQOL). European quality of life (EQ-5D-3L) patient-reported instrument is widely used to measure HRQOL. This study aimed to establish the validity and reliability of the Chinese version of the EQ-5D-3L for evaluating HRQOL of KBD individuals in rural area. 368 individuals who were suffering from KBD were recruited through stratified multistage random sampling from Shaanxi province, China. The EQ-5D-3L and the WHOQOL-BREF were administrated in each individual by face to face interview. Test-retest reliability was assessed at 10-14 days intervals. The test-retest reliability was measured by calculating the Kappa coefficients for EQ-5D-3L five dimensions. For the EQ VAS, the intraclass correlation coefficient (ICC) was computed. Convergent and divergent analysis, construct validity was established using Spearman's rank correlation between the EQ-5D-3L and the WHOQOL-BREF. Known groups' validity was examined by comparing groups with a priori expected differences in health-related quality of life (HRQOL). For 362 individuals (98%), comprehensive data of all the EQ-5D-3L dimensions were available. Kappa values of the EQ-5D-3L five items ranged from 0.324 to 0.554. ICC of the EQ VAS was 0.497. For convergent validity, the three items (self-care, usual activity, and mobility) of EQ-5D-3L, index scores, and VAS showed moderate correlations with the physical health domain of the WHOQOL-BREF (r absolute value ranged from 0.339 to 0.475). For divergent validity, the 5 items of EQ-5D-3L showed weak or no correlations with environment and social relationship domains of WHOQOL-BREF. The Chinese EQ-5D-3L
Reliability and Validity of a Novel Internet-Based Battery to Assess Mood and Cognitive Function in the Elderly.

Science.gov (United States)

Myers, Candice A; Keller, Jeffrey N; Allen, H Raymond; Brouillette, Robert M; Foil, Heather; Davis, Allison B; Greenway, Frank L; Johnson, William D; Martin, Corby K

2016-10-18

Dementia is a chronic condition in the elderly and depression is often a concurrent symptom. As populations continue to age, accessible and useful tools to screen for cognitive function and its associated symptoms in elderly populations are needed. The aim of this study was to test the reliability and validity of a new internet-based assessment battery for screening mood and cognitive function in an elderly population. Specifically, the Helping Hand Technology (HHT) assessments for depression (HHT-D) and global cognitive function (HHT-G) were evaluated in a sample of 57 elderly participants (22 male, 35 female) aged 59-85 years. The study sample was categorized into three groups: 1) dementia (n = 8; Mini-Mental State Exam (MMSE) score 10-24), 2) mild cognitive impairment (n = 24; MMSE score 25-28), and 3) control (n = 25; MMSE score 29-30). Test-retest reliability (Pearson correlation coefficient, r) and internal consistency reliability (Cronbach's alpha, α) of the HHT-D and HHT-G were assessed. Validity of the HHT-D and HHT-G was tested via comparison (Pearson r) to commonly used pencil-and-paper based assessments: HHT-D versus the Geriatric Depression Scale (GDS) and HHT-G versus the MMSE. Good test-retest (r = 0.80; p validity of the HHT-D was obtained (r = 0.60 between the HHT-D and GDS; p Validity of the HHT-G was supported (r = 0.71 between the HHT-G and MMSE; p valid computerized assessments to screen for depression and cognitive status, respectively, in an elderly sample.
Validity and Reliability of the Questionnaire for Assessing Women’s Reproductive History in Azar Cohort Study

Directory of Open Access Journals (Sweden)

Mohammad Zakaria Pezeshki

2017-06-01

Full Text Available This study was done to evaluate the validity and reliability of women’s reproductive history questionnaire which will be used in Azar Cohort study; a cohort that is conducted by Tabriz University of Medical Science in Shabestar county for identifying risk factors of no communicable diseases. Content and face validity were evaluated by ten experts in the field and quantified as content validity index (CVI and content validity ratio (CVR. To assess the reliability, using test-retest approach, kappa statistic was calculated for categorical variables and intra-class correlation coefficient (ICC was used for the quantitative items. The calculated CVI and CVR were 0.91and 0.94, respectively. Reliability for all items was high. The ICC was 0.99 and kappa statistic was equal to 1. The final version of questionnaire was redesigned in 26 items with 7 subscales.
Assessing Households Preparedness for Earthquakes: An Exploratory Study in the Development of a Valid and Reliable Persian-version Tool.

Science.gov (United States)

Ardalan, Ali; Sohrabizadeh, Sanaz

2016-02-25

Iran is placed among countries suffering from the highest number of earthquake casualties. Household preparedness, as one component of risk reduction efforts, is often supported in quake-prone areas. In Iran, lack of a valid and reliable household preparedness tool was reported by previous disaster studies. This study is aimed to fill this gap by developing a valid and reliable tool for assessing household preparedness in the event of an earthquake. This survey was conducted through three phases including literature review and focus group discussions with the participation of eight key informants, validity measurements and reliability measurements. Field investigation was completed with the participation of 450 households within three provinces of Iran. Content validity, construct validity, the use of factor analysis; internal consistency using Cronbach's alpha coefficient, and test-retest reliability were carried out to develop the tool. Based on the CVIs, ranging from 0.80 to 0.100, and exploratory factor analysis with factor loading of more than 0.5, all items were valid. The amount of Cronbach's alpha (0.7) and test-retest examination by Spearman correlations indicated that the scale was also reliable. The final instrument consisted of six categories and 18 questions including actions at the time of earthquakes, nonstructural safety, structural safety, hazard map, communications, drill, and safety skills. Using a Persian-version tool that is adjusted to the socio-cultural determinants and native language may result in more trustful information on earthquake preparedness. It is suggested that disaster managers and researchers apply this tool in their future household preparedness projects. Further research is needed to make effective policies and plans for transforming preparedness knowledge into behavior.

Reliability and Validity of a Measure of Sexual and Physical Abuse Histories among Women with Serious Mental Illness.

Science.gov (United States)

Meyer, Ilan H.; And Others

1996-01-01

Structured clinical interviews concerning childhood histories of physical and sexual abuse with 70 mentally ill women at 2 times found test-retest reliability of .63 for physical abuse and .82 for sexual abuse. Validity, assessed as consistency with an independent clinical assessment, showed 75% agreement for physical abuse and 93% agreement for…
The Reliability and Validity of the Coopersmith Self-Esteem Inventory for a Sample of Filipino High School Girls.

Science.gov (United States)

Watkins, David; Astilla, Estela

1980-01-01

Evidence is presented partially supporting the reliability and construct validity of the Coopersmith Self-Esteem Inventory with Filipino adolescent girls. A test-retest coefficient of 0.61 was found over a nine-month period. Self-esteem scores were significantly associated with IQ scores and teacher ratings of pupils' self-esteem. (Author/BW)
The reliability and validity of fatigue measures during short-duration maximal-intensity intermittent cycling.

Science.gov (United States)

Glaister, Mark; Stone, Michael H; Stewart, Andrew M; Hughes, Michael; Moir, Gavin L

2004-08-01

The purpose of the present study was to assess the reliability and validity of fatigue measures, as derived from 4 separate formulae, during tests of repeat sprint ability. On separate days over a 3-week period, 2 groups of 7 recreationally active men completed 6 trials of 1 of 2 maximal (20 x 5 seconds) intermittent cycling tests with contrasting recovery periods (10 or 30 seconds). All trials were conducted on a friction-braked cycle ergometer, and fatigue scores were derived from measures of mean power output for each sprint. Apart from formula 1, which calculated fatigue from the percentage difference in mean power output between the first and last sprint, all remaining formulae produced fatigue scores that showed a reasonably good level of test-retest reliability in both intermittent test protocols (intraclass correlation range: 0.78-0.86; 95% likely range of true values: 0.54-0.97). Although between-protocol differences in the magnitude of the fatigue scores suggested good construct validity, within-protocol differences highlighted limitations with each formula. Overall, the results support the use of the percentage decrement score as the most valid and reliable measure of fatigue during brief maximal intermittent work.
Intertester reliability of the talk test in a cardiac rehabilitation population

DEFF Research Database (Denmark)

Petersen, Annemette Krintel; Maribo, Thomas; Hjortdal, Vibeke Elisabeth

2013-01-01

PURPOSE: The validity of the Talk Test (TT) is well documented, but the reliability of the test is not clear. The aim of this study was to assess the absolute and relative intertester reliability of the TT in cardiac patients. METHODS: Cardiac patients (n = 64) who had completed an exercise...... randomized to tests. Workload in watts at the first negative stage of the TT was registered as the test result. Patients and physiotherapists were blinded to test results of the first test. Absolute reliability of the TT was assessed with Bland-Altman plot, standard error of measurement, and minimal...... detectable change. Relative reliability was assessed using the intraclass correlation coefficient (ICC). RESULTS: Mean difference in peak workload between test and retest was 0.8 W (95% CI: -4.8 to 3.3). Limit of agreement was estimated to be +31/-32 W. Standard error of measurement was 11 W (95% CI: 10...
Construct validity and reliability of the Finnish version of the Knee Injury and Osteoarthritis Outcome Score.

Science.gov (United States)

Multanen, Juhani; Honkanen, Mikko; Häkkinen, Arja; Kiviranta, Ilkka

2018-05-22

The Knee Injury and Osteoarthritis Outcome Score (KOOS) is a commonly used knee assessment and outcome tool in both clinical work and research. However, it has not been formally translated and validated in Finnish. The purpose of this study was to translate and culturally adapt the KOOS questionnaire into Finnish and to determine its validity and reliability among Finnish middle-aged patients with knee injuries. KOOS was translated and culturally adapted from English into Finnish. Subsequently, 59 patients with knee injuries completed the Finnish version of KOOS, Western Ontario and McMaster Osteoarthritis Index (WOMAC), Short-Form 36 Health Survey (SF-36) and Numeric Pain Rating Scale (Pain-NRS). The same KOOS questionnaire was re-administered 2 weeks later. Psychometric assessment of the Finnish KOOS was performed by testing its construct validity and reliability by using internal consistency, test-retest reliability and measurement error. The floor and ceiling effects were also examined. The cross-cultural adaptation revealed only minor cultural differences and was well received by the patients. For construct validity, high to moderate Spearman's Correlation Coefficients were found between the KOOS subscales and the WOMAC, SF-36, and Pain-NRS subscales. The Cronbach's alpha was from 0.79 to 0.96 for all subscales indicating acceptable internal consistency. The test-retest reliability was good to excellent, with Intraclass Correlation Coefficients ranging from 0.73 to 0.86 for all KOOS subscales. The minimal detectable change ranged from 17 to 34 on an individual level and from 2 to 4 on a group level. No floor or ceiling effects were observed. This study yielded an appropriately translated and culturally adapted Finnish version of KOOS which demonstrated good validity and reliability. Our data indicate that the Finnish version of KOOS is suitable for assessment of the knee status of Finnish patients with different knee complaints. Further studies are needed to
The development and validation of a test of science critical thinking for fifth graders.

Science.gov (United States)

Mapeala, Ruslan; Siew, Nyet Moi

2015-01-01

The paper described the development and validation of the Test of Science Critical Thinking (TSCT) to measure the three critical thinking skill constructs: comparing and contrasting, sequencing, and identifying cause and effect. The initial TSCT consisted of 55 multiple choice test items, each of which required participants to select a correct response and a correct choice of critical thinking used for their response. Data were obtained from a purposive sampling of 30 fifth graders in a pilot study carried out in a primary school in Sabah, Malaysia. Students underwent the sessions of teaching and learning activities for 9 weeks using the Thinking Maps-aided Problem-Based Learning Module before they answered the TSCT test. Analyses were conducted to check on difficulty index (p) and discrimination index (d), internal consistency reliability, content validity, and face validity. Analysis of the test-retest reliability data was conducted separately for a group of fifth graders with similar ability. Findings of the pilot study showed that out of initial 55 administered items, only 30 items with relatively good difficulty index (p) ranged from 0.40 to 0.60 and with good discrimination index (d) ranged within 0.20-1.00 were selected. The Kuder-Richardson reliability value was found to be appropriate and relatively high with 0.70, 0.73 and 0.92 for identifying cause and effect, sequencing, and comparing and contrasting respectively. The content validity index obtained from three expert judgments equalled or exceeded 0.95. In addition, test-retest reliability showed good, statistically significant correlations ([Formula: see text]). From the above results, the selected 30-item TSCT was found to have sufficient reliability and validity and would therefore represent a useful tool for measuring critical thinking ability among fifth graders in primary science.
Validity and reliability of the Nintendo Wii Balance Board for assessment of standing balance.

Science.gov (United States)

Clark, Ross A; Bryant, Adam L; Pua, Yonghao; McCrory, Paul; Bennell, Kim; Hunt, Michael

2010-03-01

Impaired standing balance has a detrimental effect on a person's functional ability and increases their risk of falling. There is currently no validated system which can precisely quantify center of pressure (COP), an important component of standing balance, while being inexpensive, portable and widely available. The Wii Balance Board (WBB) fits these criteria, and we examined its validity in comparison with the 'gold standard'-a laboratory-grade force platform (FP). Thirty subjects without lower limb pathology performed a combination of single and double leg standing balance tests with eyes open or closed on two separate occasions. Data from the WBB were acquired using a laptop computer. The test-retest reliability for COP path length for each of the testing devices, including a comparison of the WBB and FP data, was examined using intraclass correlation coefficients (ICC), Bland-Altman plots (BAP) and minimum detectable change (MDC). Both devices exhibited good to excellent COP path length test-retest reliability within-device (ICC=0.66-0.94) and between-device (ICC=0.77-0.89) on all testing protocols. Examination of the BAP revealed no relationship between the difference and the mean in any test, however the MDC values for the WBB did exceed those of the FP in three of the four tests. These findings suggest that the WBB is a valid tool for assessing standing balance. Given that the WBB is portable, widely available and a fraction of the cost of a FP, it could provide the average clinician with a standing balance assessment tool suitable for the clinical setting. Copyright 2009 Elsevier B.V. All rights reserved.
Temporal stability of the Francis Scale of Attitude toward Christianity short-form: test-retest data over one week.

Science.gov (United States)

Lewis, Christopher Alan; Cruise, Sharon Mary; McGuckin, Conor

2005-04-01

This study evaluated the test-retest reliability of the Francis Scale of Attitude toward Christianity short-form. 39 Northern Irish undergraduate students completed the measure on two occasions separated by one week. Stability across the two administrations was high, r = .92, and there was no significant change between Time 1(M = 25.2, SD = 5.4) and Time 2 (M = 25.7, SD = 6.2). These data support the short-term test-retest reliability of the Francis Scale of Attitude toward Christianity short-form.
Reliability and validity of two isometric squat tests.

Science.gov (United States)

Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U

2002-05-01

The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p squat and FHS test performances (r squat and FHS test performance can be attributed to differences in the movement patterns of the tests
Test-retest Reliability and Agreement of the Satisfaction with the Assistive Technology Services (SATS) Instrument in Two Nordic Countries

DEFF Research Database (Denmark)

Sund, Terje; Anttila, Heidi; Iwarsson, Susanne

2014-01-01

Purpose: The purpose of this study was to investigate test–retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (...
Reliability and validity of the Performance Recorder 1 for measuring isometric knee flexor and extensor strength.

Science.gov (United States)

Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L

2013-11-01

Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n = 15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC = 0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC = 0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC = 0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC = 0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC = 0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.
Validity and Reliability of the Catastrophic Cognitions Questionnaire-Turkish Version

Directory of Open Access Journals (Sweden)

Ayse Kart

2016-01-01

Full Text Available Aim: Importance of catastrophic cognitions is well known for the development and maintance of panic disorder. Catastrophic Cognitions Questionnaire (CCQ measures thoughts associated with danger and was originally developed by Khawaja (1992. In this study, it is aimed to evaluate the validity and reliability of CCQ- Turkish version. Material and Method: CCQ was administered to 250 patients with panic disorder. Turkish version of CCQ was created by translation, back-translation and pilot assessment. Socio-demographic Data Form and CCQ Turkish version were administered to participants. Reliability of CCQ was analyzed by test-retest correlation, split-half technique, Cronbach%u2019s alpha coefficient. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Bartlett test had been performed. Principal component analysis and varimax rotation were used for factor analysis. Results: Fifty-five point six percent (n=139 of the participants were female and fourty-four point four percent (n=111 were male. Internal consistency of the questionnaire was calculated 0.920 by Cronbach alpha. In analysis performed by split-half method reliability coefficients of half questionnaire were found as 0.917 and 0.832. Again spearmen-brown coefficient was found as 0.875 by the same analysis. Factor analysis revealed five basic factors. These five factors explained %66.2 of the total variance. Discussion: The results of this study show that the Turkish version of CCQ is a reliable and valid scale.
Development, initial content validation and reliability of Nigerian ...

African Journals Online (AJOL)

Prevention strategies are effective only when there are epidemiological data for the targeted populations. The collection of such .... Proquest, Sport discuss and Cochrane as these are ... 0.74, test retest reliability 0.70; Diet: internal consistency:.
Validity and Reliability of the Turkish Version of the Monitoring My Multiple Sclerosis Scale.

Science.gov (United States)

Polat, Cansu; Tülek, Zeliha; Kürtüncü, Murat; Eraksoy, Mefkure

2017-06-01

This research was conducted to adapt the Monitoring My Multiple Sclerosis (MMMS) scale, which is a scale used for self-evaluation by multiple sclerosis (MS) patients of their own health and quality of life, to Turkish and to determine the psychometric properties of the scale. The methodological research was conducted in the outpatient MS clinic of a university hospital between January and September 2013. The sample in this study consisted of 140 patients aged above 18 who had a diagnosis of definite MS. Patients who experienced attacks in the previous month or had any serious medical problems other than MS were not included in the group. The linguistic validity of MMMS was tested by a backward-forward translation method and an expert panel. Reliability analysis was performed using test-retest correlations, item-total correlations, and internal consistency analysis. Confirmatory factor analysis and concurrent validity were used to determine the construct validity. The Multiple Sclerosis Quality of Life-54 instrument was used to determine concurrent validity and the Expanded Disability Status Scale, Hospital Anxiety and Depression Scale, and Mini Mental State Examination were used for further determination of the construct validity. We determined that the scale consisted of four factors with loadings ranging from 0.49 to 0.79. The correlation coefficients of the scale were determined to be between 0.47 and 0.76 for item-total score and between 0.60 and 0.81 for items and subscale scores. Cronbach's alpha coefficient was determined to be 0.94 for the entire scale and between 0.64 and 0.89 for the subscales. Test-retest correlations were significant. Correlations between MMMS and other scales were also found to be significant. The Turkish MMMS provides adequate validity and reliability for assessing the impact of MS on quality of life and health status in patients.
The reliability and validity of the Tokyo Autistic Behaviour Scale.

Science.gov (United States)

Kurita, H; Miyake, Y

1990-03-01

The Tokyo Autistic Behavior Scale (TABS) consisting of 39 items provisionally grouped in four areas--interpersonal-social relationship, language-communication, habit-mannerism and others--is an instrument used by a child's caretaker to rate the child's autistic behaviors on a 3-point scale. Test-retest reliability was satisfactory (i.e., an r for a total score was .94). Among six DSM-III diagnostic groups, infantile autism showed a significantly higher total TABS score than the other five groups, and a taxonomic validity coefficient was .54. An r between total scores of the TABS and the Childhood Autism Rating Scale--Tokyo Version was .59. The area scores showed a lower validity than the total score. The TABS appears to be a useful instrument to assess autistic behavior.
Reliability and Validity of Selected PROMIS Measures in People with Rheumatoid Arthritis.

Directory of Open Access Journals (Sweden)

Susan J Bartlett

Full Text Available To evaluate the reliability and validity of 11 PROMIS measures to assess symptoms and impacts identified as important by people with rheumatoid arthritis (RA.Consecutive patients (N = 177 in an observational study completed PROMIS computer adapted tests (CATs and a short form (SF assessing pain, fatigue, physical function, mood, sleep, and participation. We assessed test-test reliability and internal consistency using correlation and Cronbach's alpha. We assessed convergent validity by examining Pearson correlations between PROMIS measures and existing measures of similar domains and known groups validity by comparing scores across disease activity levels using ANOVA.Participants were mostly female (82% and white (83% with mean (SD age of 56 (13 years; 24% had ≤ high school, 29% had RA ≤ 5 years with 13% ≤ 2 years, and 22% were disabled. PROMIS Physical Function, Pain Interference and Fatigue instruments correlated moderately to strongly (rho's ≥ 0.68 with corresponding PROs. Test-retest reliability ranged from .725-.883, and Cronbach's alpha from .906-.991. A dose-response relationship with disease activity was evident in Physical Function with similar trends in other scales except Anger.These data provide preliminary evidence of reliability and construct validity of PROMIS CATs to assess RA symptoms and impacts, and feasibility of use in clinical care. PROMIS instruments captured the experiences of RA patients across the broad continuum of RA symptoms and function, especially at low disease activity levels. Future research is needed to evaluate performance in relevant subgroups, assess responsiveness and identify clinically meaningful changes.
Evaluation of the reliability and construct validity of test of gross motor development-2 (Ulrich 2 in children of Semnan province

Directory of Open Access Journals (Sweden)

Mohammad Ali Soltanian

2012-12-01

Full Text Available Introduction: The purpose of this study was to assess the validity and reliability of the second edition of test of gross motor development (TGMD-2; Ulrich in 7-11 aged children of Semnan province, Iran.Materials and Methods: TGMD-2 measures 12 fundamental movement skills divided evenly into locomotor and object control subtests. 1277 children (651 girls and 626 boys aged from seven to eleven years were participated.Results: Cronbach's alpha coefficients for the two subtests were ranged from 0.60 to 0.78, and test-retest reliability was from 0.86 to 0.89. Two-factor structure of TGMD-2 and proper assignment of skills to locomotor and object control factors were supported for our population.Conclusion: Based on our findings, we conclude that the TGMD-2 is an appropriate tool to examine the gross motor skills in this population
Reliability and consistency of a validated sun exposure questionnaire in a population-based Danish sample

Directory of Open Access Journals (Sweden)

B. Køster

2018-06-01

Full Text Available An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high.The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed questionnaire for monitoring and evaluating population sun-related behavior.Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly in a questionnaire adapted to measure behavior throughout the summer.The response rates for questionnaire 1, 2 and 3 were high and the drop out was not dependent on demographic characteristic. There was at least 73% agreement between sunburns in the measurement week and the entire summer, and a possible sunburn underestimation in questionnaires summarizing the entire summer. The participants underestimated their outdoor exposure in the evaluation covering the entire summer as compared to the measurement week. The reliability of scales measuring attitude and knowledge was high for majority of scales, while consistency in protection behavior was low.To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability, while consistency of protection behavior in general and in a week's measurement was low. Keywords: Questionnaire, Validation, Reliability, Skin cancer, Prevention, Ultraviolet radiation
External Validation and Evaluation of Reliability and Validity of the Modified Seoul National University Renal Stone Complexity Scoring System to Predict Stone-Free Status After Retrograde Intrarenal Surgery.

Science.gov (United States)

Park, Juhyun; Kang, Minyong; Jeong, Chang Wook; Oh, Sohee; Lee, Jeong Woo; Lee, Seung Bae; Son, Hwancheol; Jeong, Hyeon; Cho, Sung Yong

2015-08-01

The modified Seoul National University Renal Stone Complexity scoring system (S-ReSC-R) for retrograde intrarenal surgery (RIRS) was developed as a tool to predict stone-free rate (SFR) after RIRS. We externally validated the S-ReSC-R. We retrospectively reviewed 159 patients who underwent RIRS. The S-ReSC-R was assigned from 1 to 12 according to the location and number of sites involved. The stone-free status was defined as no evidence of a stone or with clinically insignificant residual fragment stones less than 2 mm. Interobserver and test-retest reliabilities were evaluated. Statistical performance of the prediction model was assessed by its predictive accuracy, predictive probability, and clinical usefulness. Overall SFR was 73.0%. The SFRs were 86.7%, 70.2%, and 48.6% in low-score (1-2), intermediate-score (3-4), and high-score (5-12) groups, respectively (pR revealed an area under the curve (AUC) of 0.731 (95% CI 0.650-0.813). The AUC of the three-titered S-ReSC-R was 0.701 (95% CI 0.609-0.794). The calibration plot showed that the predicted probability of SFR had a concordance comparable to that of observed frequency. The Hosmer-Lemeshow goodness of fit test revealed a p-value of 0.01 for the S-ReSC-R and 0.90 for the three-titered S-ReSC-R. Interobserver and test-retest reliabilities revealed an almost perfect level of agreement. The present study proved the predictive value of S-ReSC-R to predict SFR following RIRS in an independent cohort. Interobserver and test-retest reliabilities confirmed that S-ReSC-R was reliable and valid.
Cross-cultural adaptation, reliability and validity of the Turkish version of the Lower Limb Functional Index.

Science.gov (United States)

Duruturk, Neslihan; Tonga, Eda; Gabel, Charles Philip; Acar, Manolya; Tekindal, Agah

2015-07-26

This study aims to adapt culturally a Turkish version of the Lower Limb Functional Index (LLFI) and to determine its validity, reliability, internal consistency, measurement sensitivity and factor structure in lower limb problems. The LLFI was translated into Turkish and cross-culturally adapted with a double forward-backward protocol that determined face and content validity. Individuals (n = 120) with lower limb musculoskeletal disorders completed the LLFI and Short Form-36 questionnaires and the Timed Up and Go physical test. The psychometric properties were evaluated for the all participants from patient-reported outcome measures made at baseline and repeated at day 3 to determine criterion between scores (Pearson's r), internal consistency (Cronbachs α) and test-retest reliability (intraclass correlation coefficient - ICC 2.1 ). Error was determined using standard error of the measurement (SEM) and minimal detectable change at the 90% level (MDC 90 ), while factor structure was determined using exploratory factor analysis with maximum likelihood extraction and Varimax rotation. The psychometric characteristics showed strong criterion validity (r = 0.74-0.76), high internal consistency (α = 0.82) and high test-retest reability (ICC 2.1 = 0.97). The SEM of 3.2% gave an MDC 90 = 5.8%. The factor structure was uni-dimensional. Turkish version of LLFI was found to be valid and reliable for the measurement of lower limb function in a Turkish population. Implications for Rehabilitation Lower extremity musculoskeletal disorders are common and greatly impact activities among the affected individuals pertaining to daily living, work, leisure and quality of life. Patient-reported outcome (PRO) measures have advantages as they are practical, cost-effective and clinically convenient for use in patient-centered care. The Lower Limb Functional Index is a recently validated PRO measure shown to have strong clinimetric properties.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.