internal consistency test-retest: Topics by WorldWideScience.org

Sample records for internal consistency test-retest

The eye-complaint questionnaire in a visual display unit work environment: Internal consistency and test-retest reliability

NARCIS (Netherlands)

Steenstra, Ivan A.; Sluiter, Judith K.; Frings-Dresen, Monique H. W.

2009-01-01

The internal consistency and test-retest reliability of a 10-item eye-complaint questionnaire (ECQ) were examined within a sample of office workers. Repeated within-subjects measures were performed within a single day and over intervals of 1 and 7 d. Questionnaires were completed by 96 workers (70%
Internal consistency, reliability, and temporal stability of the Oxford Happiness Questionnaire short-form: Test-retest data over two weeks

OpenAIRE

MCGUCKIN, CONOR

2006-01-01

The Oxford Happiness Questionnaire short-form is a recently developed eight-item measure of happiness. This study evaluated the internal consistency reliability and test-retest reliability of the Oxford Happiness Questionnaire short-form among 55 Northern Irish undergraduate university students who completed the measure on two occasions separated by two weeks. Internal consistency of the measure was satisfactory at both Time 1 (alpha = .62) and Time 2 (alpha = ....
Internal Consistency, Retest Reliability, and their Implications For Personality Scale Validity

Science.gov (United States)

McCrae, Robert R.; Kurtz, John E.; Yamagata, Shinji; Terracciano, Antonio

2010-01-01

We examined data (N = 34,108) on the differential reliability and validity of facet scales from the NEO Inventories. We evaluated the extent to which (a) psychometric properties of facet scales are generalizable across ages, cultures, and methods of measurement; and (b) validity criteria are associated with different forms of reliability. Composite estimates of facet scale stability, heritability, and cross-observer validity were broadly generalizable. Two estimates of retest reliability were independent predictors of the three validity criteria; none of three estimates of internal consistency was. Available evidence suggests the same pattern of results for other personality inventories. Internal consistency of scales can be useful as a check on data quality, but appears to be of limited utility for evaluating the potential validity of developed scales, and it should not be used as a substitute for retest reliability. Further research on the nature and determinants of retest reliability is needed. PMID:20435807
Assessment of test-retest reliability and internal consistency of the Wisconsin Gait Scale in hemiparetic post-stroke patients

Directory of Open Access Journals (Sweden)

Guzik Agnieszka

2016-09-01

Introduction: A proper assessment of gait pattern is a significant aspect in planning the process of teaching gait in hemiparetic post-stroke patients. The Wisconsin Gait Scale (WGS) is an observational tool for assessing post-stroke patients’ gait. The aim of the study was to assess test-retest reliability and internal consistency of the WGS and examine correlations between gait assessment made with the WGS and gait speed, Brunnström scale, Ashworth’s scale and the Barthel Index.
Test-Retest Reliability, Convergent Validity, and Internal Consistency of the Persian Version of Fullerton Advanced Balance Scale in Iranian Community-Dwelling Older Adults

OpenAIRE

Azar Sabet; Akram Azad; Ghorban Taghizadeh

2016-01-01

Objectives: This study was performed to evaluate convergent validity, test-retest reliability and internal consistency of the Persian translation of the Fullerton Advanced Balance (FAB) scale for use in Iranian community-dwelling older adults and improve the quality of their functional balance assessment. Methods & Materials: The original scale was translated with forward-backward protocol. In the next step, using convenience sampling and inclusion criteria, 88 functionally indep...
Construct validity, test-retest reliability and internal consistency of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) in patients with carpal tunnel syndrome.

Science.gov (United States)

Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan

2018-03-27

This study evaluated additional psychometric properties of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH), namely test-retest reliability, construct validity, and internal consistency, in patients with carpal tunnel syndrome. To determine construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution while the confirmatory factor analysis denoted that the hypothesized model adequately fit the data with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales between the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency, construct validity, and multidimensionality in assessing the upper extremity function in carpal tunnel syndrome patients.
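Several records in this listing, including the one above, report internal consistency as Cronbach's alpha. As a rough illustration of what that coefficient measures, here is a minimal Python sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the `demo` scores are hypothetical and are not taken from any of the listed studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total (summed) score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 5 respondents answering a 4-item scale.
demo = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [4, 4, 4, 3],
]
alpha = cronbach_alpha(demo)
print(round(alpha, 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is why it says nothing about stability over time; that is what the test-retest coefficients in these records address.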
Internal consistency, test-retest reliability and measurement error of the self-report version of the social skills rating system in a sample of Australian adolescents.

Directory of Open Access Journals (Sweden)

Sharmila Vaz

The social skills rating system (SSRS) is used to assess social skills and competence in children and adolescents. While its characteristics based on United States samples (US) are published, corresponding Australian figures are unavailable. Using a 4-week retest design, we examined the internal consistency, retest reliability and measurement error (ME) of the SSRS secondary student form (SSF) in a sample of Year 7 students (N = 187), from five randomly selected public schools in Perth, western Australia. Internal consistency (IC) of the total scale and most subscale scores (except empathy) on the frequency rating scale was adequate to permit independent use. On the importance rating scale, most IC estimates for girls fell below the benchmark. Test-retest estimates of the total scale and subscales were insufficient to permit reliable use. ME of the total scale score (frequency rating) for boys was equivalent to the US estimate, while that for girls was lower than the US error. ME of the total scale score (importance rating) was larger than the error using the frequency rating scale. The study finding supports the idea of using multiple informants (e.g. teacher and parent reports), not just student as recommended in the manual. Future research needs to substantiate the clinical meaningfulness of the MEs calculated in this study by corroborating them against the respective Minimum Clinically Important Difference (MCID).
Internal consistency, test-retest reliability and measurement error of the self-report version of the social skills rating system in a sample of Australian adolescents.

Science.gov (United States)

Vaz, Sharmila; Parsons, Richard; Passmore, Anne Elizabeth; Andreou, Pantelis; Falkmer, Torbjörn

2013-01-01

The social skills rating system (SSRS) is used to assess social skills and competence in children and adolescents. While its characteristics based on United States samples (US) are published, corresponding Australian figures are unavailable. Using a 4-week retest design, we examined the internal consistency, retest reliability and measurement error (ME) of the SSRS secondary student form (SSF) in a sample of Year 7 students (N = 187), from five randomly selected public schools in Perth, western Australia. Internal consistency (IC) of the total scale and most subscale scores (except empathy) on the frequency rating scale was adequate to permit independent use. On the importance rating scale, most IC estimates for girls fell below the benchmark. Test-retest estimates of the total scale and subscales were insufficient to permit reliable use. ME of the total scale score (frequency rating) for boys was equivalent to the US estimate, while that for girls was lower than the US error. ME of the total scale score (importance rating) was larger than the error using the frequency rating scale. The study finding supports the idea of using multiple informants (e.g. teacher and parent reports), not just student as recommended in the manual. Future research needs to substantiate the clinical meaningfulness of the MEs calculated in this study by corroborating them against the respective Minimum Clinically Important Difference (MCID).
A review of culturally adapted versions of the Oswestry Disability Index: the adaptation process, construct validity, test-retest reliability and internal consistency.

Science.gov (United States)

Sheahan, Peter J; Nelson-Wong, Erika J; Fischer, Steven L

2015-01-01

The Oswestry Disability Index (ODI) is a self-report-based outcome measure used to quantify the extent of disability related to low back pain (LBP), a substantial contributor to workplace absenteeism. The ODI tool has been adapted for use by patients in several non-English speaking nations. It is unclear, however, if these adapted versions of the ODI are as credible as the original ODI developed for English-speaking nations. The objective of this study was to conduct a review of the literature to identify culturally adapted versions of the ODI and to report on the adaptation process, construct validity, test-retest reliability and internal consistency of these ODIs. Following a pragmatic review process, data were extracted from each study with regard to these four outcomes. While most studies applied adaptation processes in accordance with best-practice guidelines, there were some deviations. However, all studies reported high-quality psychometric properties: group mean construct validity was 0.734 ± 0.094 (indicated via a correlation coefficient), test-retest reliability was 0.937 ± 0.032 (indicated via an intraclass correlation coefficient) and internal consistency was 0.876 ± 0.047 (indicated via Cronbach's alpha). Researchers can be confident when using any of these culturally adapted ODIs, or when comparing and contrasting results between cultures where these versions were employed. Implications for Rehabilitation: Low back pain is the second leading cause of disability in the world, behind only cancer. The Oswestry Disability Index (ODI) has been developed as a self-report outcome measure of low back pain for administration to patients. An understanding of the various cross-cultural adaptations of the ODI is important for more concerted multi-national research efforts. This review examines 16 cross-cultural adaptations of the ODI and should inform the work of health care and rehabilitation professionals.
Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

Science.gov (United States)

Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

2014-09-04

unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.
Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.

Science.gov (United States)

Taylor, Karen; Bulsara, Max; Monterosso, Leanne

2018-01-01

Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors (n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors (n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
Test of Gross Motor Development : Expert Validity, confirmatory validity and internal consistence

Directory of Open Access Journals (Sweden)

Nadia Cristina Valentini

2008-12-01

The Test of Gross Motor Development (TGMD-2) is an instrument used to evaluate children’s level of motor development. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by experts and the confirmatory factorial validity and the internal consistency by means of test-retest of the Portuguese TGMD-2. A cross-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionals and 587 children, from 27 schools (kindergarten and elementary), from 3 to 10 years old (51.1% boys and 48.9% girls). Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated that the Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices of confirmatory factorial validity (χ²/df = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tucker and Lewis’s Index of Fit = 0.83) and test-retest internal consistency (locomotion: r = 0.82; control of object: r = 0.88). The Portuguese TGMD-2 demonstrated validity and reliability for the sample investigated.
Test of Gross Motor Development: expert validity, confirmatory validity and internal consistence

Directory of Open Access Journals (Sweden)

Nadia Cristina Valentini

2008-01-01

The Test of Gross Motor Development (TGMD-2) is an instrument used to evaluate children’s level of motor development. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by experts and the confirmatory factorial validity and the internal consistency by means of test-retest of the Portuguese TGMD-2. A cross-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionals and 587 children, from 27 schools (kindergarten and elementary), from 3 to 10 years old (51.1% boys and 48.9% girls). Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated that the Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices of confirmatory factorial validity (χ²/df = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tucker and Lewis’s Index of Fit = 0.83) and test-retest internal consistency (locomotion: r = 0.82; control of object: r = 0.88). The Portuguese TGMD-2 demonstrated validity and reliability for the sample investigated.
A reliability generalization meta-analysis of coefficient alpha and test-retest coefficient for the aging males' symptoms (AMS) scale.

Science.gov (United States)

Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen

2016-12-01

The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88)]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.
Test - retest reliability of two instruments for measuring public attitudes towards persons with mental illness

Directory of Open Access Journals (Sweden)

Leufstadius Christel

2011-01-01

Background: Research has identified stigmatization as a major threat to successful treatment of individuals with mental illness. As a consequence several anti-stigma campaigns have been carried out. The results have been discouraging and the field suffers from lack of evidence about interventions that work. There are few reports on psychometric data for instruments used to assess stigma, which thus complicates research efforts. The aim of the present study was to investigate test-retest reliability of the Swedish versions of the questionnaires FABI and "Changing Minds" and to examine the internal consistency of the two instruments. Method: Two instruments, fear and behavioural intentions (FABI) and "Changing Minds", used in earlier studies on public attitudes towards persons with mental illness, were translated into Swedish and completed by 51 nursing students on two occasions, with an interval of three weeks. Test-retest reliability was calculated by using the weighted kappa coefficient and internal consistency using the Cronbach's alpha coefficient. Results: Both instruments attain at best moderate test-retest reliability. For the Changing Minds questionnaire almost one fifth (17.9%) of the items present poor test-retest reliability and the alpha coefficient for the subscales ranges between 0.19 and 0.46. All of the items in the FABI reach a fair or a moderate agreement between the test and retest, and the questionnaire displays a high internal consistency, alpha 0.80. Conclusions: There is a need for development of psychometrically tested instruments within this field of research.
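The record above assesses item-level test-retest agreement with a weighted kappa coefficient. The abstract does not say which weighting scheme was used, so the following Python sketch is only an illustration under assumptions: it implements Cohen's weighted kappa with disagreement weights (|i - j| / (c - 1)) raised to `power` (1 for linear, 2 for quadratic). The function name and the example ratings are hypothetical.

```python
def weighted_kappa(test, retest, n_categories, power=1):
    """Cohen's weighted kappa for two ordinal ratings coded 0..n_categories-1.

    power=1 gives linear weights, power=2 quadratic weights (assumed options;
    the listed study does not state which it used).
    """
    n = len(test)
    if n == 0 or n != len(retest):
        raise ValueError("test and retest must be non-empty and equal length")
    c = n_categories
    # Observed joint proportions of (test, retest) category pairs.
    observed = [[0.0] * c for _ in range(c)]
    for a, b in zip(test, retest):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(c)]
    col = [sum(observed[i][j] for i in range(c)) for j in range(c)]
    # Weighted observed vs. chance-expected disagreement.
    obs_dis = 0.0
    exp_dis = 0.0
    for i in range(c):
        for j in range(c):
            w = (abs(i - j) / (c - 1)) ** power
            obs_dis += w * observed[i][j]
            exp_dis += w * row[i] * col[j]
    # Undefined (division by zero) if every rating falls in a single category.
    return 1.0 - obs_dis / exp_dis


# Hypothetical item answered by 4 people on two occasions (3 ordered categories).
perfect = weighted_kappa([0, 1, 2, 1], [0, 1, 2, 1], n_categories=3)
# Hypothetical binary item where agreement is exactly at chance level.
chance = weighted_kappa([0, 0, 1, 1], [0, 1, 0, 1], n_categories=2)
print(perfect, chance)
```

Unlike a plain correlation, kappa discounts the agreement expected by chance from the marginal distributions, and the weights give partial credit to near-misses on an ordinal scale.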
Test-retest reliability of the eating disorder examination-questionnaire (EDE-Q) in a college sample

OpenAIRE

Rose, Jennifer S; Vaewsorn, Adin; Rosselli-Navarra, Francine; Wilson, G Terence; Weissman, Ruth Striegel

2013-01-01

Background: The Eating Disorder Examination-Questionnaire (EDE-Q), a widely used self-report instrument, is often used for measuring change in eating disorder symptoms over the course of treatment. However, limited data exist about test-retest reliability, particularly for men. The current study evaluated EDE-Q 7-day test-retest reliability in male (n = 47) and female (n = 44) undergraduate students together and separately by gender. Results: Internal consistency was consistently higher for wom...
Test-retest reliability and construct validity of the Helplessness, Hopelessness, and Haplessness Scale in patients with anxiety disorders.

Science.gov (United States)

Vatan, Sevginar; Ertaş, Sedar; Lester, David

2011-04-01

In a sample of 100 Turkish psychiatric patients with diagnoses of anxiety disorders, Lester's Helplessness, Hopelessness, and Haplessness inventory had moderate estimates of internal consistency, test-retest reliability, and construct validity.
Test-retest reliability of the Military Pre-training Questionnaire.

Science.gov (United States)

Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

2010-09-01

Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Test-retest reliability and predictive validity of the Implicit Association Test in children.

Science.gov (United States)

Rae, James R; Olson, Kristina R

2018-02-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research: lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Test-retest reliability and comparability of paper and computer questionnaires for the Finnish version of the Tampa Scale of Kinesiophobia.

Science.gov (United States)

Koho, P; Aho, S; Kautiainen, H; Pohjolainen, T; Hurri, H

2014-12-01

To estimate the internal consistency, test-retest reliability and comparability of paper and computer versions of the Finnish version of the Tampa Scale of Kinesiophobia (TSK-FIN) among patients with chronic pain. In addition, patients' personal experiences of completing both versions of the TSK-FIN and preferences between these two methods of data collection were studied. Test-retest reliability study. Paper and computer versions of the TSK-FIN were completed twice on two consecutive days. The sample comprised 94 consecutive patients with chronic musculoskeletal pain participating in a pain management or individual rehabilitation programme. The group rehabilitation design consisted of physical and functional exercises, evaluation of the social situation, psychological assessment of pain-related stress factors, and personal pain management training in order to regain overall function and mitigate the inconvenience of pain and fear-avoidance behaviour. The mean TSK-FIN score was 37.1 [standard deviation (SD) 8.1] for the computer version and 35.3 (SD 7.9) for the paper version. The mean difference between the two versions was 1.9 (95% confidence interval 0.8 to 2.9). Test-retest reliability was 0.89 for the paper version and 0.88 for the computer version. Internal consistency was considered to be good for both versions. The intraclass correlation coefficient for comparability was 0.77 (95% confidence interval 0.66 to 0.85), indicating substantial reliability between the two methods. Both versions of the TSK-FIN demonstrated substantial intertest reliability, good test-retest reliability, good internal consistency and acceptable limits of agreement, suggesting their suitability for clinical use. However, subjects tended to score higher when using the computer version. As such, in an ideal situation, data should be collected in a similar manner throughout the course of rehabilitation or clinical research. Copyright © 2014 Chartered Society of Physiotherapy. Published

Test-retest reliability of the multifocal photopic negative response.

Science.gov (United States)

Van Alstine, Anthony W; Viswanathan, Suresh

2017-02-01

To assess the test-retest reliability of the multifocal photopic negative response (mfPhNR) of normal human subjects. Multifocal electroretinograms were recorded from one eye of 61 healthy adult subjects on two separate days using a Visual Evoked Response Imaging System software version 4.3 (EDI, San Mateo, California). The visual stimulus delivered on a 75-Hz monitor consisted of seven equal-sized hexagons each subtending 12° of visual angle. The m-step exponent was 9, and the m-sequence was slowed to include at least 30 blank frames after each flash. Only the first slice of the first-order kernel was analyzed. The mfPhNR amplitude was measured at a fixed time in the trough from baseline (BT) as well as at the same fixed time in the trough from the preceding b-wave peak (PT). Additionally, we also analyzed BT normalized either to PT (BT/PT) or to the b-wave amplitude (BT/b-wave). The relative reliability of test-retest differences for each test location was estimated by the Wilcoxon matched-pair signed-rank test and intraclass correlation coefficients (ICC). Absolute test-retest reliability was estimated by Bland-Altman analysis. The test-retest amplitude differences for neither of the two measurement techniques were statistically significant as determined by Wilcoxon matched-pair signed-rank test. PT measurements showed greater ICC values than BT amplitude measurements for all test locations. For each measurement technique, the ICC value of the macular response was greater than that of the surrounding locations. The mean test-retest difference was close to zero for both techniques at each of the test locations, and while the coefficient of reliability (COR; 1.96 times the standard deviation of the test-retest difference) was comparable for the two techniques at each test location when expressed in nanovolts, the %COR (COR normalized to the mean test and retest amplitudes) was superior for PT than BT measurements. The ICC and COR were comparable for the BT/PT and
Balance Assessment in Sports-Related Concussion: Evaluating Test-Retest Reliability of the Equilibrate System.

Science.gov (United States)

Odom, Mitchell J; Lee, Young M; Zuckerman, Scott L; Apple, Rachel P; Germanos, Theodore; Solomon, Gary S; Sills, Allen K

2016-01-01

This study evaluated the test-retest reliability of a novel computer-based, portable balance assessment tool, the Equilibrate System (ES), used to diagnose sports-related concussion. Twenty-seven students participated in ES testing consisting of three sessions over 4 weeks. The modified Balance Error Scoring System was performed. For each participant, test-retest reliability was established using the intraclass correlation coefficient (ICC). The ES test-retest reliability from baseline to week 2 produced an ICC value of 0.495 (95% CI, 0.123-0.745). Week 2 testing produced ICC values of 0.602 (95% CI, 0.279-0.803) and 0.610 (95% CI, 0.299-0.804), respectively. All other single measures test-retest reliability values produced poor ICC values. Same-day ES testing showed fair to good test-retest reliability while interweek measures displayed poor to fair test-retest reliability. Testing conditions should be controlled when using computerized balance assessment methods. ES testing should only be used as a part of a comprehensive assessment.
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

Science.gov (United States)

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (Test possessed low CV and significant (pTest possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Reliability of the Swedish version of the Exercise Self-Efficacy Scale (S-ESES): a test-retest study in adults with neurological disease.

Science.gov (United States)

Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth

2015-03-01

To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including Parkinson's disease, multiple sclerosis, cervical dystonia, and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. The mean interval between completions was 16 days. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and very high reliability measured using ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low absolute reliability, however, indicates a limited ability to measure changes on an individual level.
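The link between relative reliability (ICC) and absolute reliability (SEM) reported in this record can be illustrated with a minimal sketch. It assumes the commonly used definitions SEM = SD × sqrt(1 − ICC) and SEM% = 100 × SEM / mean; the abstract does not say which formulas the authors applied, and the function names and input values below are hypothetical, chosen only for illustration.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement, assuming SEM = SD * sqrt(1 - ICC)."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("ICC must lie in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def sem_percent(sem: float, mean_score: float) -> float:
    """SEM expressed as a percentage of the mean score (SEM%)."""
    return 100.0 * sem / mean_score


# Hypothetical inputs (not taken from the study): between-subject SD,
# mean scale score, and an ICC equal to the 0.92 quoted above.
sd, mean_score, icc = 10.0, 40.0, 0.92
sem = sem_from_icc(sd, icc)
print(f"SEM = {sem:.2f}, SEM% = {sem_percent(sem, mean_score):.1f}")
```

Under these definitions a high ICC can coexist with a large SEM% when the between-subject spread is wide relative to the mean, which is one way to read the contrast the abstract draws between relative and absolute reliability.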
Retest effects in working memory capacity tests: A meta-analysis.

Science.gov (United States)

Scharfen, Jana; Jansen, Katrin; Holling, Heinz

2018-06-15

The repeated administration of working memory capacity tests is common in clinical and research settings. For cognitive ability tests and different neuropsychological tests, meta-analyses have shown that they are prone to retest effects, which have to be accounted for when interpreting retest scores. Using a multilevel approach, this meta-analysis aims at showing the reproducibility of retest effects in working memory capacity tests for up to seven test administrations, and examines the impact of the length of the test-retest interval, test modality, equivalence of test forms and participant age on the size of retest effects. Furthermore, it is assessed whether the size of retest effects depends on the test paradigm. An extensive literature search revealed 234 effect sizes from 95 samples and 68 studies, in which healthy participants between 12 and 70 years repeatedly performed a working memory capacity test. Results yield a weighted average of g = 0.28 for retest effects from the first to the second test administration, and a significant increase in effect sizes was observed up to the fourth test administration. The length of the test-retest interval and publication year were found to moderate the size of retest effects. Retest effects differed between the paradigms of working memory capacity tests. These findings call for the development and use of appropriate experimental or statistical methods to address retest effects in working memory capacity tests.
Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

Science.gov (United States)

Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

2016-04-01

The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
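Internal consistency in this record is summarised by Cronbach's alpha. A minimal sketch of the standard formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores), is given below. The function name and the tiny response matrix are hypothetical and are not SOADC data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses sample variances (n - 1 denominator) for both the items and the totals.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    item_columns = list(zip(*scores))  # transpose: one tuple per item
    sum_item_vars = sum(variance(col) for col in item_columns)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum_item_vars / total_var)


# Hypothetical responses: 4 respondents x 3 items on a 1-5 scale.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 4, 5],
]
print(f"alpha = {cronbach_alpha(responses):.3f}")
```

Alpha rises when items covary strongly relative to their individual variances, so it describes how coherently the items of a subscale move together within one administration rather than how stable scores are across administrations.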
Test-retest reliability of cognitive EEG

Science.gov (United States)

McEvoy, L. K.; Smith, M. E.; Gevins, A.

2000-01-01

OBJECTIVE: Task-related EEG is sensitive to changes in cognitive state produced by increased task difficulty and by transient impairment. If task-related EEG has high test-retest reliability, it could be used as part of a clinical test to assess changes in cognitive function. The aim of this study was to determine the reliability of the EEG recorded during the performance of a working memory (WM) task and a psychomotor vigilance task (PVT). METHODS: EEG was recorded while subjects rested quietly and while they performed the tasks. Within-session (test-retest interval of approximately 1 h) and between-session (test-retest interval of approximately 7 days) reliability was calculated for four EEG components: frontal midline theta at Fz, posterior theta at Pz, and slow and fast alpha at Pz. RESULTS: Task-related EEG was highly reliable within and between sessions (r > 0.9 for all components in the WM task, and r > 0.8 for all components in the PVT). Resting EEG also showed high reliability, although the magnitude of the correlation was somewhat smaller than that of the task-related EEG (r > 0.7 for all 4 components). CONCLUSIONS: These results suggest that under appropriate conditions, task-related EEG has sufficient retest reliability for use in assessing clinical changes in cognitive status.
Test-retest reliability of the driving habits questionnaire in older self-driving adults.

Science.gov (United States)

Song, Chiang-Soon; Chun, Byung-Yoon; Chung, Hyun-Sook

2015-11-01

[Purpose] The purpose of this study was to investigate the test-retest reliability of the Driving Habits Questionnaire in community-dwelling older self-drivers. [Subjects and Methods] Seventy-four participants were recruited by convenience sampling from local rehabilitation centers. This was a cross-sectional study design that used two clinical measures: the Driving Habits Questionnaire and Mini-mental State Examination. To examine the test-retest reliability of the Driving Habits Questionnaire, the clinical tool was measured twice, five days apart. [Results] The Driving Habits Questionnaire showed good reliability for older community-dwelling self-drivers. The Cronbach's alpha coefficients for the four domains of dependence (0.572), difficulty (0.871), crashes and citations (0.689), and driving space (0.961) of the Driving Habits Questionnaire indicated good or high internal consistency. Driving difficulty correlated significantly with self-reported crashes and citations and driving space. [Conclusion] The results of this study suggest that the Driving Habits Questionnaire is a reliable measure of self-reported interview-based driving behavior in the community-dwelling elderly.
Test re-test reliability and construct validity of the star-track test of manual dexterity

DEFF Research Database (Denmark)

Kildebro, Niels; Amirian, Ilda; Gögenur, Ismail

2015-01-01

Objectives. We wished to determine test re-test reliability and construct validity of the star-track test of manual dexterity. Design. Test re-test reliability was examined in a controlled study. Construct validity was tested in a blinded randomized crossover study. Setting. The study was performed...... at a university hospital in Denmark. Participants. A total of 11 subjects for test re-test and 20 subjects for the construct validity study were included. All subjects were healthy volunteers. Intervention. The test re-test trial had two measurements with 2 days pause in between. The interventions...... in the construct validity study included baseline measurement, intervention 1: fatigue, intervention 2: stress, and intervention 3: fatigue and stress. There was a 2 day pause between each intervention. Main outcome measure. An integrated measure of completion time and number of errors was used. Results. All...
Test-retest reliability of the Work Ability Index questionnaire

NARCIS (Netherlands)

de Zwart, B. C. H.; Frings-Dresen, M. H. W.; Van Duivenbooden, J. C.

2002-01-01

The goal of the study was to assess the test-retest reliability of the Work Ability Index (WAI) questionnaire. Reliability was tested using a test-retest design with a 4 week interval between measurements. Valid data were collected among 97 elderly construction workers aged 40 years and older. We
Test-retest reliability of infant event related potentials evoked by faces.

Science.gov (United States)

Munsters, N M; van Ravenswaaij, H; van den Boomen, C; Kemner, C

2017-04-05

Reliable measures are required to draw meaningful conclusions regarding developmental changes in longitudinal studies. Little is known, however, about the test-retest reliability of face-sensitive event related potentials (ERPs), a frequently used neural measure in infants. The aim of the current study is to investigate the test-retest reliability of ERPs typically evoked by faces in 9-10 month-old infants. The infants (N=31) were presented with neutral, fearful and happy faces that contained only the lower or higher spatial frequency information. They were tested twice within two weeks. The present results show that the test-retest reliability of the face-sensitive ERP components is moderate (P400 and Nc) to substantial (N290). However, there is low test-retest reliability for the effects of the specific experimental manipulations (i.e. emotion and spatial frequency) on the face-sensitive ERPs. To conclude, in infants the face-sensitive ERP components (i.e. N290, P400 and Nc) show adequate test-retest reliability, but not the effects of emotion and spatial frequency on these ERP components. We propose that further research focuses on investigating elements that might increase the test-retest reliability, as adequate test-retest reliability is necessary to draw meaningful conclusions on individual developmental trajectories of the face-sensitive ERPs in infants. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
Test-retest reliability at the item level and total score level of the Norwegian version of the Spinal Cord Injury Falls Concern Scale (SCI-FCS).

Science.gov (United States)

Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik

2016-05-01

Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability on item-level and total-score-level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. For relative reliability, we analyzed the total-score-level test-retest reliability with intraclass correlation coefficients (ICC2.1), the standard error of measurement (SEM), and the smallest detectable change (SDC) for absolute reliability/measurement-error assessment and Cronbach's alpha for internal consistency. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we recovered an 11-13% higher chance for a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.
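Several records in this list report ICC(2,1) for total scores. As an illustration of what that coefficient measures, the sketch below computes the Shrout and Fleiss two-way random-effects, absolute-agreement, single-measure ICC from ANOVA mean squares. The function name and the toy scores are hypothetical; nothing here reproduces the SCI-FCS data or the authors' software.

```python
def icc_2_1(data):
    """ICC(2,1) for an n-subjects x k-sessions table of scores.

    ICC(2,1) = (MSR - MSE) / (MSR + (k - 1) * MSE + k * (MSC - MSE) / n),
    where MSR, MSC and MSE are the between-subjects, between-sessions and
    residual mean squares of a two-way ANOVA without replication.
    """
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical test/retest totals for three respondents: the retest is
# uniformly one point higher, i.e. perfect ranking but a systematic shift.
shifted = [[1, 2], [2, 3], [3, 4]]
print(f"ICC(2,1) = {icc_2_1(shifted):.3f}")
```

Because ICC(2,1) is an absolute-agreement coefficient, a systematic shift between test and retest lowers it even when respondents keep exactly the same rank order, which is why it is often reported alongside an analysis of systematic disagreement.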
Construct validity and internal consistency in the Leisure Practices Scale (EPL) for adults.

Science.gov (United States)

Andrade, Rubian Diego; Schwartz, Gisele Maria; Tavares, Giselle Helena; Pelegrini, Andreia; Teixeira, Clarissa Stefani; Felden, Érico Pereira Gomes

2018-02-01

This study proposes and analyzes the construct validity and internal consistency of the Leisure Practices Scale (EPL). This survey seeks to identify the preferences and involvement in different leisure practices in adults. The instrument was formed based on the cultural leisure content (artistic, manual, physical, sports, intellectual, social, tourist, virtual and contemplation/leisure). The validation process was conducted with: a) content analysis by leisure experts, who evaluated the instrument for clarity of language and practical relevance, which allowed the calculation of the content validity coefficient (CVC); b) reproducibility test-retest with 51 subjects to calculate the temporal variation coefficient; c) internal consistency analysis with 885 participants. The evaluation presented appropriate coefficients, both with respect to language clarity (CVCt = 0.883) and practical relevance (CVCt = 0.879). The reproducibility coefficients were moderate to excellent. The scale showed adequate internal consistency (0.72). The EPL has psychometric quality and acceptable values in its structure, and can be used to investigate adult involvement in leisure activities.
Test-retest reliability of the isernhagen work systems functional capacity evaluation in healthy adults

NARCIS (Netherlands)

Reneman, MF; Brouwer, S; Meinema, A; Dijkstra, PU; Geertzen, JHB; Groothoff, JW

2004-01-01

Aim of this study was to investigate test-retest reliability of the Isernhagen Work System Functional Capacity Evaluation (IWS FCE) in healthy subjects. The IWS FCE consists of 28 tests that reflect work-related activities such as lifting, carrying, bending, etc. A convenience sample of 26 healthy
Rorschach e pedofilia: a fidedignidade no teste-reteste = Rorschach and pedophilia: a reliability at test-retest

Directory of Open Access Journals (Sweden)

Scortegagna, Silvana Alba

2013-01-01

This study sought to investigate the personality characteristics of a pedophilic individual and to provide evidence of the test-retest reliability of the Rorschach. The participant, a 38-year-old man, completed an interview and the Rorschach method in two stages. The main findings reveal: (a) a tendency toward fragmentation in the perception of self and others; (b) a negative and unfavorable self-image regarding the body and its functions; (c) problems in interpersonal relationships and failures in the capacity for empathy; (d) a deficit in the perceptual adjustment to reality; and (e) vulnerability to subjective pressures and impulsivity. These results remained stable across the two administrations, broadening the understanding of the psychological elements involved in pedophilia, which persist, and supporting the test-retest reliability of the Rorschach.
Test-retest reliability of the Progressive Isoinertial Lifting Evaluation (PILE).

Science.gov (United States)

Lygren, Hildegunn; Dragesund, Tove; Joensen, Jón; Ask, Tove; Moe-Nilssen, Rolf

2005-05-01

A repeated measures single group design. To investigate test-retest reliability of Progressive Isoinertial Lifting Evaluation on patients with long lasting musculoskeletal problems related to the lumbar spine. Test-retest reliability has been satisfactory in healthy men. Test-retest reliability for clinical populations has not been reported. A total of 31 patients (17 women and 14 men) with long lasting low back pain participated in the study. The patients were tested twice at an interval of 2 days and at the same time of the day. The heaviest load that the patient could lift 4 times was used as outcome measure. The error of measurement indicates that the true result in 95% of cases will be within +/-4.5 kg from the measured value, while the difference between 2 measurements in 95% of cases will be less than 6.4 kg. Intra-class correlation (1,1) was 0.91. Relative test-retest reliability was high assessed by intra-class correlation, but absolute measurement variability reported as the smallest detectable difference has relevance for the interpretation of clinical test results and should also be considered.
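The two absolute-reliability figures in this record (a ±4.5 kg band around a single measurement and a 6.4 kg limit for the difference between two measurements) are consistent with a common convention, sketched below. The sketch assumes the band equals 1.96 × SEM and the smallest detectable difference equals 1.96 × sqrt(2) × SEM; the abstract does not state the formulas used, and the function name is hypothetical.

```python
import math

Z_95 = 1.96  # two-sided 95% normal quantile


def smallest_detectable_difference(half_width_95: float) -> float:
    """Smallest detectable difference from a 95% single-measurement band.

    Assumes half_width_95 = 1.96 * SEM, so that
    SDD = 1.96 * sqrt(2) * SEM = sqrt(2) * half_width_95.
    """
    sem = half_width_95 / Z_95
    return Z_95 * math.sqrt(2.0) * sem


# Band reported in the abstract: +/- 4.5 kg around the measured value.
print(f"SDD = {smallest_detectable_difference(4.5):.1f} kg")
```

The factor sqrt(2) appears because the difference between two measurements carries the error of both, so a change has to exceed a wider limit than the uncertainty of a single test before it can be attributed to the patient rather than to measurement noise.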
Assessing motivation for work environment improvements: internal consistency, reliability and factorial structure.

Science.gov (United States)

Hedlund, Ann; Ateg, Mattias; Andersson, Ing-Marie; Rosén, Gunnar

2010-04-01

Workers' motivation to actively take part in improvements to the work environment is assumed to be important for the efficiency of investments for that purpose. That gives rise to the need for a tool to measure this motivation. A questionnaire to measure motivation for improvements to the work environment has been designed. Internal consistency and test-retest reliability of the domains of the questionnaire have been measured, and the factorial structure has been explored, from the answers of 113 employees. The internal consistency is high (0.94), as well as the correlation for the total score (0.84). Three factors are identified accounting for 61.6% of the total variance. The questionnaire can be a useful tool in improving intervention methods. The expectation is that the tool can be useful, particularly with the aim of improving efficiency of companies' investments for work environment improvements. Copyright 2010 Elsevier Ltd. All rights reserved.
Test of cure, retesting and extragenital testing practices for Chlamydia trachomatis and Neisseria gonorrhoeae among general practitioners in different socioeconomic status areas: A retrospective cohort study, 2011-2016

Science.gov (United States)

van Liere, Geneviève A. F. S.; Cals, Jochen W. L.; Dukers-Muijrers, Nicole H. T. M.

2018-01-01

Background: For Chlamydia trachomatis (CT), a test of cure (TOC) within 3–5 weeks is not recommended. International guidelines differ in advising a Neisseria gonorrhoeae (NG) TOC. Retesting CT and NG positives within 3–12 months is recommended in international guidelines. We assessed TOC and retesting practices including extragenital testing in general practitioner (GP) practices located in different socioeconomic status (SES) areas to inform and optimize local test practices. Methods: Laboratory data of 48 Dutch GP practices between January 2011 and July 2016 were used. Based on a patient’s first positive CT or NG test, the proportion of TOC (... TOC and 24% had a retest at the GP practice. GP practices in low SES areas were more likely to perform a CT TOC (OR: 1.8; 95% CI: 1.1–3.1). Younger patients (... TOC (OR: 1.6; 95% CI: 1.0–2.4). For CT (n = 622), 2.4% had a TOC and 6.1% had a retest at another STI care provider. For NG (n = 73), 25% had a TOC and 15% had a retest at the GP practice. For NG (n = 73), 2.7% had a TOC and 12.3% had a retest at another STI care provider. In only 0.3% of the consultations patients were tested on extragenital sites. Conclusion: Almost 20% of the patients returned for a CT TOC, especially at GP practices in low SES areas. For NG, 1 out of 4 patients returned for a TOC. Retesting rates were low for both CT (24%) and NG (15%); (re)infections, including extragenital infections, may be missed. Efforts are required to focus TOC and increase retesting practices of GPs in order to improve CT/NG control. PMID:29538469
Test-retest reliability and responsiveness of the Barthel Index-based Supplementary Scales in patients with stroke.

Science.gov (United States)

Lee, Ya-Chen; Yu, Wan-Hui; Hsueh, I-Ping; Chen, Sheng-Shiung; Hsieh, Ching-Lin

2017-10-01

A lack of evidence on the test-retest reliability and responsiveness limits the utility of the BI-based Supplementary Scales (BI-SS) in both clinical and research settings. To examine the test-retest reliability and responsiveness of the BI-based Supplementary Scales (BI-SS) in patients with stroke. A repeated-assessments design (1 week apart) was used to examine the test-retest reliability of the BI-SS. For the responsiveness study, the participants were assessed with the BI-SS and BI (treated as an external criterion) at admission to and discharge from rehabilitation wards. Seven outpatient rehabilitation units and one inpatient rehabilitation unit. Outpatients with chronic stroke. Eighty-four outpatients with chronic stroke participated in the test-retest reliability study. Fifty-seven inpatients completed baseline and follow-up assessments in the responsiveness study. For the test-retest reliability study, the values of the intra-class correlation coefficient and the overall percentage of minimal detectable change for the Ability Scale and Self-perceived Difficulty Scale were 0.97, 12.8%, and 0.78, 35.8%, respectively. For the responsiveness study, the standardized effect size and standardized response mean (representing internal responsiveness) of the Ability Scale and Self-perceived Difficulty Scale were 1.17 and 1.56, and 0.78 and 0.89, respectively. Regarding external responsiveness, the change in score of the Ability Scale had significant and moderate association with that of the BI (r=0.61, P ... test-retest reliability and sufficient responsiveness for patients with stroke. However, the Self-perceived Difficulty Scale of the BI-SS has substantial random measurement error and insufficient external responsiveness, which may affect its utility in clinical settings. The findings of this study provide empirical evidence of psychometric properties of the BI-SS for assessing ability and self-perceived difficulty of ADL in patients with stroke.
Demonstration of the test-retest reliability and sensitivity of the Lower Limb Functional Index-10 as a measure of functional recovery post burn injury: a cross-sectional repeated measures study design.

Science.gov (United States)

Ryland, Margaret E; Grisbrook, Tiffany L; Wood, Fiona M; Phillips, Michael; Edgar, Dale W

2016-01-01

Lower limb burns can significantly delay recovery of function. Measuring lower limb functional outcomes is challenging in the unique burn patient population and necessitates the use of reliable and valid tools. The aims of this study were to examine the test-retest reliability, sensitivity, and internal consistency of Sections 1 and 3 of the Lower Limb Functional Index-10 (LLFI-10) questionnaire for measuring functional ability in patients with lower limb burns over time. Twenty-nine adult patients who had sustained a lower limb burn injury in the previous 12 months completed the test-retest procedure of the study. In addition, the minimal detectable change (MDC) was calculated for Sections 1 and 3 of the LLFI-10. Section 1 is focused on the activity limitations experienced by patients with a lower limb disorder whereas Section 3 involves patients indicating their current percentage of pre-injury duties. Section 1 of the LLFI-10 demonstrated excellent test-retest reliability (intra-class correlation coefficient (ICC) 0.98, 95% CI 0.96-0.99) whilst Section 3 demonstrated high test-retest reliability (ICC 0.88, 95% CI 0.79-0.94). MDC scores for Sections 1 and 3 were 1.27 points and 30.22%, respectively. Internal consistency was demonstrated with a significant negative association (r_s = -0.83) between Sections 1 and 3 of the LLFI-10 (p ... reliable for measuring functional ability in patients who have sustained lower limb burns in the previous 12 months, and furthermore, Section 1 is sensitive to changes in patient function over time.

Acoustic stapedial reflexes in healthy neonates: normative data and test-retest reliability.

Science.gov (United States)

Kei, Joseph

2012-01-01

The acoustic stapedial reflex (ASR) test provides useful information about the function of the auditory system. While it is frequently used with adults and children in a clinical setting, its use with young infants is limited. Presently, there are few data for neonates and inadequate research into the test-retest reliability of the ASR test. This study aimed to establish normative data and evaluate the test-retest reliability of the ASR test in healthy neonates. A cross-sectional experimental design was used to establish ASR normative data and assess the test-retest reliability of ASR thresholds obtained from healthy neonates. Sixty-eight full-term neonates with mean chronological age of 2.5 days (SD = 1.8 day), who passed the automated auditory brainstem response, transient evoked otoacoustic emission, and high frequency (1 kHz) tympanometry (HFT) tests. One randomly selected ear from each neonate was tested using TEOAE (transient evoked otoacoustic emission), HFT, and ASR tests using a 1 kHz probe tone. ASR thresholds were elicited by presenting pure tones of 0.5, 2, and 4 kHz and broadband noise (BBN) separately to the test ear in an ipsilateral stimulation mode. The ASR procedure was repeated to acquire retest data within the same testing session. Descriptive statistics, χ2, and analysis of variance with repeated measures tests were used to analyze ASR data. All neonates exhibited ASR when stimulated by tonal stimuli or BBN. The mean ASRTs (acoustic stapedial reflex thresholds) for the 0.5, 2, and 4 kHz tones were 81.6 ± 7.9, 71.3 ± 7.9, and 65.4 ± 8.7 dB HL, respectively. The mean ASRT for the BBN was estimated to be smaller than 57.2 dB HL, given the limitation of the equipment. The 95th percentiles of the ASRT were 95, 85, 80, and 75 dB HL for the 0.5, 2, and 4 kHz and BBN, respectively. The test-retest reliability of the ASR test for all stimuli was high, with no significant difference in mean ASRTs across the test and retest conditions. Test-retest
Test-retest reliability of the 40 Hz EEG auditory steady-state response.

Directory of Open Access Journals (Sweden)

Kristina L McFadden

Auditory evoked steady-state responses are increasingly being used as a marker of brain function and dysfunction in various neuropsychiatric disorders, but research investigating the test-retest reliability of this response is lacking. The purpose of this study was to assess the consistency of the auditory steady-state response (ASSR) across sessions. Furthermore, the current study aimed to investigate how the reliability of the ASSR is impacted by stimulus parameters and analysis method employed. The consistency of this response across two sessions spaced approximately 1 week apart was measured in nineteen healthy adults using electroencephalography (EEG). The ASSR was entrained by both 40 Hz amplitude-modulated white noise and click train stimuli. Correlations between sessions were assessed with two separate analytical techniques: (a) channel-level analysis across the whole-head array and (b) signal-space projection from auditory dipoles. Overall, the ASSR was significantly correlated between sessions 1 and 2 (p<0.05, multiple comparison corrected), suggesting adequate test-retest reliability of this response. The current study also suggests that measures of inter-trial phase coherence may be more reliable between sessions than measures of evoked power. Results were similar between the two analysis methods, but reliability varied depending on the presented stimulus, with click train stimuli producing more consistent responses than white noise stimuli.
Test-retest Reliability and Agreement of the Satisfaction with the Assistive Technology Services (SATS) Instrument in Two Nordic Countries

DEFF Research Database (Denmark)

Sund, Terje; Anttila, Heidi; Iwarsson, Susanne

2014-01-01

Purpose: The purpose of this study was to investigate test–retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (...
Test-retest reliability and stability of N400 effects in a word-pair semantic priming paradigm.

Science.gov (United States)

Kiang, Michael; Patriciu, Iulia; Roy, Carolyn; Christensen, Bruce K; Zipursky, Robert B

2013-04-01

Elicited by any meaningful stimulus, the N400 event-related potential (ERP) component is reduced when the stimulus is related to a preceding one. This N400 semantic priming effect has been used to probe abnormal semantic relationship processing in clinical disorders, and suggested as a possible biomarker for treatment studies. Validating N400 semantic priming effects as a clinical biomarker requires characterizing their test-retest reliability. We assessed test-retest reliability of N400 semantic priming in 16 healthy adults who viewed the same related and unrelated prime-target word pairs in two sessions one week apart. As expected, N400 amplitudes were smaller for related versus unrelated targets across sessions. N400 priming effects (amplitude differences between unrelated and related targets) were highly correlated across sessions (r=0.85, P ... motivational changes. Use of N400 priming effects in treatment studies should account for possible magnitude decreases with repeat testing. Further research is needed to delineate N400 priming effects' test-retest reliability and stability in different age and clinical groups, and with different stimulus types. Copyright © 2012 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Cross-cultural adaptation, reliability, internal consistency and validation of the Hand Function Sort (HFS©) for French speaking patients with upper limb complaints.

Science.gov (United States)

Konzelmann, M; Burrus, C; Hilfiker, R; Rivier, G; Deriaz, O; Luthi, F

2015-03-01

Functional evaluation of the upper limb is not only based on clinical findings but requires self-administered questionnaires to address the patient's perspective. The Hand Function Sort (HFS©) was only validated in English. The aim of this study was the French cross-cultural adaptation and validation of the HFS© (HFS-F). 150 patients with various upper limb impairments were recruited in a rehabilitation center. Translation and cross-cultural adaptation were made according to international guidelines. Construct validity was estimated through correlations with the Disabilities of the Arm, Shoulder and Hand (DASH) questionnaire, SF-36 mental component summary (MCS), SF-36 physical component summary (PCS) and pain intensity. Internal consistency was assessed by Cronbach's α and test-retest reliability by intraclass correlation. Cronbach's α was 0.98, and test-retest reliability was excellent at 0.921 (95% CI 0.871-0.971), the same as for the original HFS©. Correlations with DASH were -0.779 (95% CI -0.847 to -0.685); with SF-36 PCS 0.452 (95% CI 0.276-0.599); with pain -0.247 (95% CI -0.429 to -0.041); with SF-36 MCS 0.242 (95% CI 0.042-0.422). There were no floor or ceiling effects. The HFS-F has the same good psychometric properties as the original HFS© (internal consistency, test-retest reliability, convergent validity with DASH, divergent validity with SF-36 MCS, and no floor or ceiling effects). The convergent validity with SF-36 PCS was poor; we found no correlation with pain. The HFS-F could be used with confidence in a population of working patients. Other studies are necessary to study its psychometric properties in other populations.
Test-Retest Reliability of Computerized, Everyday Memory Measures and Traditional Memory Tests.

Science.gov (United States)

Youngjohn, James R.; And Others

Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Test-retest reliability of selected items of the Health Behaviour in School-aged Children (HBSC) survey questionnaire in Beijing, China

Directory of Open Access Journals (Sweden)

Liu Yang

2010-08-01

Background: Children's health and health behaviour are essential for their development and it is important to obtain abundant and accurate information to understand young people's health and health behaviour. The Health Behaviour in School-aged Children (HBSC) study is among the first large-scale international surveys on adolescent health through self-report questionnaires. So far, more than 40 countries in Europe and North America have been involved in the HBSC study. The purpose of this study is to assess the test-retest reliability of selected items in the Chinese version of the HBSC survey questionnaire in a sample of adolescents in Beijing, China. Methods: A sample of 95 male and female students aged 11 or 15 years old participated in a test and retest with a three-week interval. Student identity numbers of respondents were utilized to permit matching of test-retest questionnaires. 23 items concerning physical activity, sedentary behaviour, sleep and substance use were evaluated by using the percentage of response shifts and the single measure Intraclass Correlation Coefficients (ICC) with 95% confidence interval (CI) for all respondents and stratified by gender and age. Items on substance use were only evaluated for school children aged 15 years old. Results: The percentage of no response shift between test and retest varied from 32% for the item on computer use at weekends to 92% for the three items on smoking. Of all the 23 items evaluated, 6 items (26%) showed a moderate reliability, 12 items (52%) displayed a substantial reliability and 4 items (17%) indicated almost perfect reliability. No gender and age group difference of the test-retest reliability was found except for a few items on sedentary behaviour. Conclusions: The overall findings of this study suggest that most selected indicators in the HBSC survey questionnaire have satisfactory test-retest reliability for the students in Beijing. Further test-retest studies in a large
Test-retest reliability of a balance testing protocol with external perturbations in young healthy adults.

Science.gov (United States)

Robbins, Shawn M; Caplan, Ryan M; Aponte, Daniel I; St-Onge, Nancy

2017-10-01

External perturbations are utilized to challenge balance and mimic realistic balance threats in patient populations. The reliability of such protocols has not been established. The purpose was to examine test-retest reliability of balance testing with external perturbations. Healthy adults (n=34; mean age 23 years) underwent balance testing over two visits. Participants completed ten balance conditions in which the following parameters were combined: perturbation or non-perturbation, single or double leg, and eyes open or closed. Three trials were collected for each condition. Data were collected on a force plate and external perturbations were applied by translating the plate. Force plate center of pressure (CoP) data were summarized using 13 different CoP measures. Test-retest reliability was examined using intraclass correlation coefficients (ICC) and Bland-Altman plots. CoP measures of total speed and excursion in both anterior-posterior and medial-lateral directions generally had acceptable ICC values for perturbation conditions (ICC=0.46 to 0.87); however, many other CoP measures (e.g. range, area of ellipse) had unacceptable test-retest reliability (ICCbalance testing protocols that include external perturbations should be made to improve test-retest reliability and diminish learning including more extensive participant training and increasing the number of trials. CoP measures that consider all data points (e.g. total speed) are more reliable than those that only consider a few data points. Copyright © 2017 Elsevier B.V. All rights reserved.
Test-retest reliability and agreement of the Satisfaction with the Assistive Technology Services (SATS) instrument in two Nordic countries.

Science.gov (United States)

Sund, Terje; Iwarsson, Susanne; Anttila, Heidi; Helle, Tina; Brandt, Ase

2014-07-01

The purpose of this study was to investigate test-retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (PWCs) or powered scooters (scooters). Test-retest design, two telephone interviews 7-18 days apart of 40 informants, with mean age of 67.5 (SD 13.09) years in the Danish; and 54 informants with mean age of 55.6 (SD 12.09) years in the Finnish sample. The intra-class correlation coefficient varied between 0.57 and 0.93 for items in the Danish and between 0.41 and 0.93 in the Finnish sample. The percentage agreement varied between 54.2 and 79.5 for items in the Danish and between 69.2 and 81.1 in the Finnish sample, while the Cronbach's alpha values varied between 0.87 and 0.96 in the two samples. A ceiling effect was found in all items of both samples. This study indicates that the SATS may be reliably administered for telephone interviews among adult PWC and scooter users, and give information about aspects of the service delivery process for quality development improvement purposes. Further psychometric testing of the SATS is required.
The memory failures of everyday questionnaire (MFE): internal consistency and reliability.

Science.gov (United States)

Montejo Carrasco, Pedro; Montenegro, Peña Mercedes; Sueiro, Manuel J

2012-07-01

The Memory Failures of Everyday Questionnaire (MFE) is one of the most widely-used instruments to assess memory failures in daily life. The original scale has nine response options, making it difficult to apply; we created a three-point scale (0-1-2) with response choices that make it easier to administer. We examined the two versions' equivalence in a sample of 193 participants between 19 and 64 years of age. The test-retest reliability and internal consistency of the version we propose were also computed in a sample of 113 people. Several indicators attest to the two forms' equivalence: the correlation between the items' means (r = .94; p MFE 1-9. The MFE 0-2 provides a brief, simple evaluation, so we recommend it for use in clinical practice as well as research.
The test-retest reliability of the latent construct of executive function depends on whether tasks are represented as formative or reflective indicators.

Science.gov (United States)

Willoughby, Michael T; Kuhn, Laura J; Blair, Clancy B; Samek, Anya; List, John A

2017-10-01

This study investigates the test-retest reliability of a battery of executive function (EF) tasks with a specific interest in testing whether the method that is used to create a battery-wide score would result in differences in the apparent test-retest reliability of children's performance. A total of 188 4-year-olds completed a battery of computerized EF tasks twice across a period of approximately two weeks. Two different approaches were used to create a score that indexed children's overall performance on the battery, i.e., (1) the mean score of all completed tasks and (2) a factor score estimate which used confirmatory factor analysis (CFA). Pearson and intra-class correlations were used to investigate the test-retest reliability of individual EF tasks, as well as an overall battery score. Consistent with previous studies, the test-retest reliability of individual tasks was modest (rs ≈ .60). The test-retest reliability of the overall battery scores differed depending on the scoring approach (r for the mean score = .72; r for the factor score = .99). It is concluded that the children's performance on individual EF tasks exhibits modest levels of test-retest reliability. This underscores the importance of administering multiple tasks and aggregating performance across these tasks in order to improve precision of measurement. However, the specific strategy that is used has a large impact on the apparent test-retest reliability of the overall score. These results replicate our earlier findings and provide additional cautionary evidence against the routine use of factor analytic approaches for representing individual performance across a battery of EF tasks.
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

Science.gov (United States)

Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

2014-01-01

This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, whereas test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Evaluating the reliability of an injury prevention screening tool: Test-retest study.

Science.gov (United States)

Gittelman, Michael A; Kincaid, Madeline; Denny, Sarah; Wervey Arnold, Melissa; FitzGerald, Michael; Carle, Adam C; Mara, Constance A

2016-10-01

A standardized injury prevention (IP) screening tool can identify family risks and allow pediatricians to address behaviors. To assess behavior changes on later screens, the tool must be reliable for an individual and ideally between household members. Little research has examined the reliability of safety screening tool questions. This study utilized test-retest reliability of parent responses on an existing IP questionnaire and also compared responses between household parents. Investigators recruited parents of children 0 to 1 year of age during admission to a tertiary care children's hospital. When both parents were present, one was chosen as the "primary" respondent. Primary respondents completed the 30-question IP screening tool after consent, and they were re-screened approximately 4 hours later to test individual reliability. The "second" parent, when present, only completed the tool once. All participants received a 10-dollar gift card. Cohen's Kappa was used to estimate test-retest reliability and inter-rater agreement. Standard test-retest criteria consider Kappa values: 0.0 to 0.40 poor to fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, and 0.81 to 1.00 as almost perfect reliability. One hundred five families participated, with five lost to follow-up. Thirty-two (30.5%) parent dyads completed the tool. Primary respondents were generally mothers (88%) and Caucasian (72%). Test-retest of the primary respondents showed their responses to be almost perfect; average 0.82 (SD = 0.13, range 0.49-1.00). Seventeen questions had almost perfect test-retest reliability and 11 had substantial reliability. However, inter-rater agreement between household members for 12 objective questions showed little agreement between responses; inter-rater agreement averaged 0.35 (SD = 0.34, range -0.19-1.00). One question had almost perfect inter-rater agreement and two had substantial inter-rater agreement. The IP screening tool used by a single individual had excellent
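The agreement statistic and the interpretation bands quoted in this abstract can be sketched as follows. This is an illustrative example only: the function names, the sample answers, and the handling of values that fall between the quoted bands (for instance 0.405) or below zero are assumptions, not details taken from the study.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's kappa for two equally long lists of categorical answers."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty lists of equal length")
    n = len(first)
    # Observed proportion of identical answers.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected by chance from the marginal answer frequencies.
    c1, c2 = Counter(first), Counter(second)
    expected = sum(c1[cat] * c2[cat] for cat in c1) / n ** 2
    if expected == 1:
        raise ValueError("kappa is undefined when both lists are constant")
    return (observed - expected) / (1 - expected)


def kappa_band(kappa):
    """Map kappa to the bands quoted in the abstract.

    Assumption: cut points are applied as upper bounds, so negative values
    and values between bands are assigned to the lower band.
    """
    if kappa <= 0.40:
        return "poor to fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


if __name__ == "__main__":
    # Made-up yes/no answers to one question on two occasions.
    test = ["y", "y", "n", "n"]
    retest = ["y", "y", "n", "y"]
    k = cohens_kappa(test, retest)
    print(round(k, 2), kappa_band(k))
```

The same function serves both uses described in the abstract: passing one parent's test and retest answers gives test-retest reliability, while passing the answers of two parents from the same household gives inter-rater agreement.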
Test-Retest Reliability and Practice Effects of the Stability Evaluation Test.

Science.gov (United States)

Williams, Richelle M; Corvo, Matthew A; Lam, Kenneth C; Williams, Travis A; Gilmer, Lesley K; McLeod, Tamara C Valovich

2017-01-17

Postural control plays an essential role in concussion evaluation. The Stability Evaluation Test (SET) aims to objectively analyze postural control by measuring sway velocity on the NeuroCom's VSR portable force platform (Natus, San Carlos, CA). To assess the test-retest reliability and practice effects of the SET protocol. Cohort. Research Laboratory. Fifty healthy adults (males=20, females=30, age=25.30±3.60 years, height=166.60±12.80 cm, mass=68.80±13.90 kg). All participants completed four trials of the SET. Each trial consisted of six 20-second balance tests with eyes closed, under the following conditions: double-leg firm (DFi), single-leg firm (SFi), tandem firm (TFi), double-leg foam (DFo), single-leg foam (SFo), and tandem foam (TFo). Each trial was separated by a 5-minute seated rest period. The dependent variable was sway velocity (deg/sec), with lower values indicating better balance. Sway velocity was recorded for each of the six conditions as well as a composite score for each trial. Test-retest reliability was analyzed across four trials with Intraclass Correlation Coefficients. Practice effects analyzed with repeated measures analysis of variance, followed by Tukey post-hoc comparisons for any significant main effects (preliability values were good to excellent: DFi (ICC=0.88;95%CI:0.81,0.92), SFi (ICC=0.75;95%CI:0.61,0.85), TFi (ICC=0.84;95%CI:0.75,0.90), DFo (ICC=0.83;95%CI:0.74,0.90), SFo (ICC=0.82;95%CI:0.72,0.89), TFo (ICC=0.81;95%CI:0.69,0.88), and composite score (ICC=0.93;95%CI:0.88,0.95). Significant practice effects (preliability for the assessment of postural control in healthy adults. Due to the practice effects noted, a familiarization session is recommended (i.e., all 6 conditions) prior to recording the data. Future studies should evaluate injured patients to determine meaningful change scores during various injuries.
The Comprehensive Snack Parenting Questionnaire (CSPQ): Development and Test-Retest Reliability

Directory of Open Access Journals (Sweden)

Dorus W. M. Gevers

2018-04-01

The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ) covers 21 constructs. Test-retest reliability was assessed by calculating intraclass correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intraclass correlation coefficients (≥0.41) or agreement scores (≥0.60) for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children's snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.
Development of an Agility Test for Badminton Players and Assessment of Its Validity and Test-Retest Reliability.

Science.gov (United States)

Loureiro, Luiz de França Bahia; de Freitas, Paulo Barbosa

2016-04-01

Badminton requires open and fast actions toward the shuttlecock, but there is no specific agility test for badminton players with specific movements. To develop an agility test that simultaneously assesses perception and motor capacity and examine the test's concurrent and construct validity and its test-retest reliability. The Badcamp agility test consists of running as fast as possible to 6 targets placed on the corners and middle points of a rectangular area (5.6 × 4.2 m) from the start position located in the center of it, following visual stimuli presented in a luminous panel. The authors recruited 43 badminton players (17-32 y old) to evaluate concurrent (with shuttle-run agility test--SRAT) and construct validity and test-retest reliability. Results revealed that Badcamp presents concurrent and construct validity, as its performance is strongly related to SRAT (ρ = 0.83, P < .001), with performance of experts being better than nonexpert players (P < .01). In addition, Badcamp is reliable, as no difference (P = .07) and a high intraclass correlation (ICC = .93) were found in the performance of the players on 2 different occasions. The findings indicate that Badcamp is an effective, valid, and reliable tool to measure agility, allowing coaches and athletic trainers to evaluate players' athletic condition and training effectiveness and possibly detect talented individuals in this sport.
Re-test reliability of gustatory testing and introduction of the sensitive Taste-Drop-Test

DEFF Research Database (Denmark)

Fjaeldstad, A; Niklassen, A; Fernandes, H

2018-01-01

Testing gustatory function can be important for diagnostics and assessment of treatment effects. However, the gustatory tests applied are required to be both sensitive and reliable. In this study, we investigate the re-test validity of the popular Taste Strips gustatory test for gustatory screening... Furthermore, we introduce a new sensitive Taste-Drop-Test, which was found to be superior for detecting a more accurate measure of tastant sensitivity...
A diagnostic test for apraxia in stroke patients: internal consistency and diagnostic value.

NARCIS (Netherlands)

Heugten, C.M. van; Dekker, J.; Deelman, B.G.; Stehmann-Saris, F.C.; Kinebanian, A.

1999-01-01

The internal consistency and the diagnostic value of a test for apraxia in patients having had a stroke are presented. Results indicate that the items of the test form a strong and consistent scale: Cronbach's alpha as well as the results of a Mokken scale analysis present good reliability and good
Retesting with the TRUE Test in a population-based twin cohort with hand eczema

DEFF Research Database (Denmark)

Lerbaek, Anne; Kyvik, Kirsten Ohm; Menné, Torkil

2007-01-01

Population-based studies on contact allergy with retesting of individuals are infrequently performed. Variable degrees of persistence are reported when individuals with contact allergy are retested with years in between. The patch test results of 270 individuals tested in 2005-2006 are presented ...
Test-retest reliability of the Danish Adult Reading Test in patients with comorbid psychosis and cannabis-use disorder

DEFF Research Database (Denmark)

Hjorthøj, Carsten Rygaard; Vesterager, Lone; Nordentoft, Merete

2013-01-01

Background: The New Adult Reading Test is a common instrument for assessing pre-morbid IQ for patients with, for instance, schizophrenia. However, test-retest reliability has not been established for patients dually diagnosed with psychosis and substance use disorder. Furthermore, test-retest reliability of the Danish adaptation has never been established in any population. Aims: To determine the test-retest reliability of the Danish Adult Reading Test (DART) (adapted from the National Adult Reading Test, NART) for patients dually diagnosed with psychosis and cannabis-use disorder. Methods: This was a secondary analysis of the CapOpus randomized trial. As part of the trial, 103 patients were randomized, and completed the DART up to three times. Pearson's r and pairwise t-tests were calculated. Results: DART score was independent of randomization, cannabis-use frequency and psychopathology. Scores...

Resting-state test-retest reliability of a priori defined canonical networks over different preprocessing steps.

Science.gov (United States)

Varikuti, Deepthi P; Hoffstaedter, Felix; Genon, Sarah; Schwender, Holger; Reid, Andrew T; Eickhoff, Simon B

2017-04-01

Resting-state functional connectivity analysis has become a widely used method for the investigation of human brain connectivity and pathology. The measurement of neuronal activity by functional MRI, however, is impeded by various nuisance signals that reduce the stability of functional connectivity. Several methods exist to address this predicament, but little consensus has yet been reached on the most appropriate approach. Given the crucial importance of reliability for the development of clinical applications, we here investigated the effect of various confound removal approaches on the test-retest reliability of functional-connectivity estimates in two previously defined functional brain networks. Our results showed that gray matter masking improved the reliability of connectivity estimates, whereas denoising based on principal components analysis reduced it. We additionally observed that refraining from using any correction for global signals provided the best test-retest reliability, but failed to reproduce anti-correlations between what have been previously described as antagonistic networks. This suggests that improved reliability can come at the expense of potentially poorer biological validity. Consistent with this, we observed that reliability was proportional to the retained variance, which presumably included structured noise, such as reliable nuisance signals (for instance, noise induced by cardiac processes). We conclude that compromises are necessary between maximizing test-retest reliability and removing variance that may be attributable to non-neuronal sources.
Resting-state test-retest reliability of a priori defined canonical networks over different preprocessing steps

Science.gov (United States)

Varikuti, Deepthi P.; Hoffstaedter, Felix; Genon, Sarah; Schwender, Holger; Reid, Andrew T.; Eickhoff, Simon B.

2016-01-01

Resting-state functional connectivity analysis has become a widely used method for the investigation of human brain connectivity and pathology. The measurement of neuronal activity by functional MRI, however, is impeded by various nuisance signals that reduce the stability of functional connectivity. Several methods exist to address this predicament, but little consensus has yet been reached on the most appropriate approach. Given the crucial importance of reliability for the development of clinical applications, we here investigated the effect of various confound removal approaches on the test-retest reliability of functional-connectivity estimates in two previously defined functional brain networks. Our results showed that grey matter masking improved the reliability of connectivity estimates, whereas de-noising based on principal components analysis reduced it. We additionally observed that refraining from using any correction for global signals provided the best test-retest reliability, but failed to reproduce anti-correlations between what have been previously described as antagonistic networks. This suggests that improved reliability can come at the expense of potentially poorer biological validity. Consistent with this, we observed that reliability was proportional to the retained variance, which presumably included structured noise, such as reliable nuisance signals (for instance, noise induced by cardiac processes). We conclude that compromises are necessary between maximizing test-retest reliability and removing variance that may be attributable to non-neuronal sources. PMID:27550015
Test-Retest Reliability of a Survey to Measure Transport-Related Physical Activity in Adults

Science.gov (United States)

Badland, Hannah; Schofield, Grant

2006-01-01

The present research details test-retest reliability of a newly developed, telephone-administered TPA survey for adults. This instrument examines barriers, perceptions, and current travel behaviors to place of work/study and local convenience shops. Demonstrated test-retest reliability of the Active Friendly Environments-Transport-Related Physical…
Test-Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study.

Science.gov (United States)

Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina

2017-01-01

Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction; however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability; however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Adaptation and testing of psychosocial assessment instruments for cross-cultural use: an example from the Thailand Burma border.

Science.gov (United States)

Haroz, Emily E; Bass, Judith K; Lee, Catherine; Murray, Laura K; Robinson, Courtland; Bolton, Paul

2014-01-01

The purpose of this study was to develop valid and reliable instruments to assess priority psychosocial problems and functioning among adult survivors of systematic violence from Burma living in Thailand. The process involved four steps: 1) instrument drafting and piloting; 2) reliability and validity testing; 3) instrument revision; and 4) retesting the revised instrument. A total of N = 158 interviews were completed. Overall, subscales showed good internal consistency (0.73-0.92) and satisfactory combined test-retest/inter-rater reliability (0.63-0.84). Criterion validity was not demonstrated for any scale. The alcohol and functioning scales underperformed and were revised (step 3) and retested (step 4). Upon retesting, the function scale showed good internal consistency reliability (0.91-0.92), and the alcohol scale showed acceptable internal consistency (0.79) and strong test-retest/inter-rater reliability (0.86-0.89). This paper describes the importance and process of adaptation and testing, illustrated by the experiences and results for selected instruments in this population.
The role of test-retest reliability in measuring individual and group differences in executive functioning.

Science.gov (United States)

Paap, Kenneth R; Sawi, Oliver

2016-12-01

Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.
Test-retest reliability of barbell velocity during the free-weight bench-press exercise.

Science.gov (United States)

Stock, Matt S; Beck, Travis W; DeFreitas, Jason M; Dillon, Michael A

2011-01-01

The purpose of this study was to calculate test-retest reliability statistics for peak barbell velocity during the free-weight bench-press exercise for loads corresponding to 10-90% of the 1-repetition maximum (1RM). Twenty-one healthy, resistance-trained men (mean ± SD age = 23.5 ± 2.7 years; body mass = 90.5 ± 14.6 kg; 1RM bench press = 125.4 ± 18.4 kg) volunteered for this study. A minimum of 48 hours after a maximal strength testing and familiarization session, the subjects performed single repetitions of the free-weight bench-press exercise at each tenth percentile (10-90%) of the 1RM on 2 separate occasions. For each repetition, the subjects were instructed to press the barbell as rapidly as possible, and peak barbell velocity was measured with a Tendo Weightlifting Analyzer. The test-retest intraclass correlation coefficients (model 2,1) and corresponding standard errors of measurement (expressed as percentages of the mean barbell velocity values) were 0.717 (4.2%), 0.572 (5.0%), 0.805 (3.1%), 0.669 (4.7%), 0.790 (4.6%), 0.785 (4.8%), 0.811 (5.8%), 0.714 (10.3%), and 0.594 (12.6%) for the weights corresponding to 10-90% 1RM. There were no mean differences between the barbell velocity values from trials 1 and 2. These results indicated moderate to high test-retest reliability for barbell velocity from 10 to 70% 1RM but decreased consistency at 80 and 90% 1RM. When examining barbell velocity during the free-weight bench-press exercise, greater measurement error must be overcome at 80 and 90% 1RM to be confident that an observed change is meaningful.
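The two statistics named in this abstract, the intraclass correlation coefficient (model 2,1) and the standard error of measurement expressed as a percentage of the mean, can be sketched from a subjects-by-sessions table as shown below. The ICC follows the standard two-way random-effects, absolute-agreement, single-measure formula. The SEM is taken here as the square root of the error mean square, which is an assumption: the abstract does not state which SEM formula the authors used. The function name and the sample numbers are hypothetical.

```python
from math import sqrt


def icc_2_1(scores):
    """ICC(2,1) and SEM (% of grand mean) for n subjects x k sessions.

    scores: list of n rows, each holding one value per session.
    Assumption: SEM = sqrt(error mean square).
    """
    n, k = len(scores), len(scores[0])
    if n < 2 or k < 2:
        raise ValueError("need at least two subjects and two sessions")
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    icc = (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )
    sem_pct = 100 * sqrt(max(ms_error, 0.0)) / grand
    return icc, sem_pct


if __name__ == "__main__":
    # Made-up peak velocities (m/s) for 4 lifters on 2 occasions.
    velocities = [[1.10, 1.14], [0.95, 0.93], [1.30, 1.25], [1.02, 1.08]]
    icc, sem_pct = icc_2_1(velocities)
    print(round(icc, 3), round(sem_pct, 1))
```

Because model 2,1 penalizes systematic shifts between sessions as well as random error, a low ICC at heavy loads can reflect either source; the abstract's report of no mean differences between trials points to random error in this case.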
Intensity response function of the photopic negative response (PhNR): effect of age and test-retest reliability.

Science.gov (United States)

Joshi, Nabin R; Ly, Emma; Viswanathan, Suresh

2017-08-01

To assess the effect of age and test-retest reliability of the intensity response function of the full-field photopic negative response (PhNR) in normal healthy human subjects. Full-field electroretinograms (ERGs) were recorded from one eye of 45 subjects, and 39 of these subjects were tested on two separate days with a Diagnosys Espion System (Lowell, MA, USA). The visual stimuli consisted of brief (test-retest reliability was assessed with the Wilcoxon signed-rank test and Bland-Altman analysis. Holm's correction was applied to account for multiple comparisons. V max of BT was significantly smaller than that of PT and b-wave, and the V max values of PT and b-wave were not significantly different from each other. The slope parameter n was smallest for BT and largest for b-wave, and the differences between the slopes of all three measures were statistically significant. Small differences observed in the mean values of K for the different measures did not reach statistical significance. The Wilcoxon signed-rank test indicated no significant differences between the two test visits for any of the Naka-Rushton parameters for the three ERG measures, and the Bland-Altman plots indicated that the mean difference between test and retest measurements of the different fit parameters was close to zero and within 6% of the average of the test and retest values of the respective parameters for all three ERG measurements, indicating minimal bias. While the coefficient of reliability (COR, defined as 1.96 times the standard deviation of the test and retest difference) of each fit parameter was more or less comparable across the three ERG measurements, the %COR (COR normalized to the mean test and retest measures) was generally larger for BT compared to both PT and b-wave for each fit parameter. The Naka-Rushton fit parameters did not show statistically significant changes with age for any of the ERG measures when corrections were applied for multiple comparisons. However, the V max of
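The Bland-Altman quantities defined in this abstract (bias, COR as 1.96 times the standard deviation of the test-retest difference, and %COR as COR normalized to the mean of the test and retest values) can be sketched as follows. This is an illustration only: the function name and amplitudes are hypothetical, and the use of the sample (n - 1) standard deviation is an assumption, since the abstract does not say which estimator was used.

```python
from statistics import mean, stdev


def bland_altman(test, retest):
    """Return bias, coefficient of reliability (COR), %COR and limits of agreement."""
    diffs = [b - a for a, b in zip(test, retest)]
    bias = mean(diffs)                    # mean retest-minus-test difference
    cor = 1.96 * stdev(diffs)             # assumption: sample SD (n - 1)
    overall_mean = mean(list(test) + list(retest))
    return {
        "bias": bias,
        "cor": cor,
        "pct_cor": 100.0 * cor / overall_mean,
        "limits": (bias - cor, bias + cor),
    }


# Hypothetical fitted V max values (microvolts) on two visits.
visit_1 = [28.0, 31.5, 25.2, 30.1, 27.4]
visit_2 = [27.1, 32.0, 26.0, 29.5, 28.3]
result = bland_altman(visit_1, visit_2)
print({key: value for key, value in result.items() if key != "limits"})
```

Reporting %COR alongside COR makes parameters with different magnitudes comparable, which is how the abstract contrasts BT with PT and the b-wave.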
Isokinetic Strength and Endurance Tests used Pre- and Post-Spaceflight: Test-Retest Reliability

Science.gov (United States)

Laughlin, Mitzi S.; Lee, Stuart M. C.; Loehr, James A.; Amonette, William E.

2009-01-01

To assess changes in muscular strength and endurance after microgravity exposure, NASA measures isokinetic strength and endurance across multiple sessions before and after long-duration space flight. Accurate interpretation of pre- and post-flight measures depends upon the reliability of each measure. The purpose of this study was to evaluate the test-retest reliability of the NASA International Space Station (ISS) isokinetic protocol. Twenty-four healthy subjects (12 M/12 F, 32.0 +/- 5.6 years) volunteered to participate. Isokinetic knee, ankle, and trunk flexion and extension strength as well as endurance of the knee flexors and extensors were measured using a Cybex NORM isokinetic dynamometer. The first weekly session was considered a familiarization session. Data were collected and analyzed for weeks 2-4. Repeated measures analysis of variance (alpha=0.05) was used to identify weekly differences in isokinetic measures. Test-retest reliability was evaluated by intraclass correlation coefficients (ICC) (3,1). No significant differences were found between weeks in any of the strength measures, and the reliability of the strength measures was considered excellent in every case (ICC greater than 0.9) except concentric ankle dorsi-flexion (ICC=0.67). Although a significant difference was noted in weekly endurance measures of knee extension (p less than 0.01), the reliability of the endurance measures by week was considered excellent for knee flexion (ICC=0.97) and knee extension (ICC=0.96). Except for concentric ankle dorsi-flexion, the isokinetic strength and endurance measures are highly reliable when following the NASA ISS protocol. This protocol should allow accurate interpretation of isokinetic data even with a small number of crew members.
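The abstract reports both a significant weekly difference in knee-extension endurance and an excellent ICC(3,1) for the same measure. One way to see why these can coexist is that ICC(3,1), the two-way mixed, consistency form, ignores a constant shift between sessions. The sketch below illustrates that property; the function name and torque values are hypothetical and do not reproduce the study's data or software.

```python
def icc_3_1(scores):
    """ICC(3,1): two-way mixed effects, consistency, single measurement."""
    n = len(scores)        # subjects
    k = len(scores[0])     # sessions
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    # The session mean square does not enter the formula at all.
    return (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)


# Hypothetical peak torque values (newton-metres) for four subjects over weeks 2-4.
weeks = [[152.0, 155.0, 154.0], [120.0, 118.0, 121.0],
         [181.0, 184.0, 180.0], [140.0, 143.0, 141.0]]
# Same data with a uniform 10-unit gain in the final week (a systematic effect).
shifted = [[w2, w3, w4 + 10.0] for w2, w3, w4 in weeks]

print(round(icc_3_1(weeks), 4), round(icc_3_1(shifted), 4))
```

The two printed values agree, which is why a repeated-measures test for mean differences is a useful companion to a consistency-type ICC.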
Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

Science.gov (United States)

Rae, James R.; Olson, Kristina R.

2018-01-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Test-retest reliability of trunk accelerometric gait analysis

DEFF Research Database (Denmark)

Henriksen, Marius; Lund, Hans; Moe-Nilssen, R

2004-01-01

The purpose of this study was to determine the test-retest reliability of a trunk accelerometric gait analysis in healthy subjects. Accelerations were measured during walking using a triaxial accelerometer mounted on the lumbar spine of the subjects. Six men and 14 women (mean age 35.2; range 18...... a definite potential in clinical gait analysis....
Test-Retest Reliability of the Salutogenic Wellness Promotion Scale (SWPS)

Science.gov (United States)

Anderson, L. M.; Moore, J. B.; Hayden, B. M.; Becker, C. M.

2014-01-01

Objective: This study examined the temporal stability (i.e. test-retest reliability) of the Salutogenic Wellness Promotion Scale (SWPS) using intraclass correlation coefficients (ICC). Current intraclass results were also compared to previously published interclass correlations to support the use of the intraclass method for test-retest…
Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

Science.gov (United States)

Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

2018-05-01

Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80° knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.
Long term test-retest reliability of Oswestry Disability Index in male office workers.

Science.gov (United States)

Irmak, Rafet; Baltaci, Gul; Ergun, Nevin

2015-01-01

The Oswestry Disability Index (ODI) is one of the most common condition-specific outcome measures used in the management of spinal disorders. However, few studies have examined healthy populations or long-term test-retest reliability. This is important because healthy populations are often used for control groups in low back pain interventions, and knowing the reliability of the controls affects the interpretation of the findings of these studies. The purpose of this study is to determine the long-term test-retest reliability of the ODI in office workers. Participants with no history of chronic low back pain were included in the study. Subjects were assessed by the Turkish-ODI 2.0 (e-forms) on the 1st, 2nd, 4th, 8th, 15th, and 30th days to determine the stability of ODI scores over time. The study began with 58 (12 female, 46 male) participants; 36 (3 female, 33 male) participated for the full 30 days. Kolmogorov-Smirnov and Friedman tests were used. Test-retest reliability was evaluated by using nonparametric statistics. All tests were done by using SPSS-11. There was no statistically significant difference among the median scores of each day (χ² = 6.482, p > 0.05). The difference between the median score of each day and that of the 1st day was neither statistically nor clinically significant. The ODI has long-term test-retest reliability in healthy subjects over a 1-month interval.
Test-retest and interrater reliability of the functional lower extremity evaluation.

Science.gov (United States)

Haitz, Karyn; Shultz, Rebecca; Hodgins, Melissa; Matheson, Gordon O

2014-12-01

Repeated-measures clinical measurement reliability study. To establish the reliability and face validity of the Functional Lower Extremity Evaluation (FLEE). The FLEE is a 45-minute battery of 8 standardized functional performance tests that measures 3 components of lower extremity function: control, power, and endurance. The reliability and normative values for the FLEE in healthy athletes are unknown. A face validity survey for the FLEE was sent to sports medicine personnel to evaluate the level of importance and frequency of clinical usage of each test included in the FLEE. The FLEE was then administered and rated for 40 uninjured athletes. To assess test-retest reliability, each athlete was tested twice, 1 week apart, by the same rater. To assess interrater reliability, 3 raters scored each athlete during 1 of the testing sessions. Intraclass correlation coefficients were used to assess the test-retest and interrater reliability of each of the FLEE tests. In the face validity survey, the FLEE tests were rated as highly important by 58% to 71% of respondents but frequently used by only 26% to 45% of respondents. Interrater reliability intraclass correlation coefficients ranged from 0.83 to 1.00, and test-retest reliability ranged from 0.71 to 0.95. The FLEE tests are considered clinically important for assessing lower extremity function by sports medicine personnel but are underused. The FLEE also is a reliable assessment tool. Future studies are required to determine if use of the FLEE to make return-to-play decisions may reduce reinjury rates.
Test-retest reliability and validity of the Sniffin' TOM odor memory test.

Science.gov (United States)

Croy, Ilona; Zehner, Cora; Larsson, Maria; Zucco, Gesualdo M; Hummel, Thomas

2015-03-01

Few attempts have been made to develop an olfactory test that captures episodic retention of olfactory information. Assessment of episodic odor memory is of particular interest in aging and in the cognitively impaired as both episodic memory deficits and olfactory loss have been targeted as reliable hallmarks of cognitive decline and impending dementia. Here, 96 healthy participants (18-92 years) and an additional 19 older people with mild cognitive impairment were tested (73-82 years). Participants were presented with 8 common odors with intentional encoding instructions that were followed by a yes-no recognition test. After recognition completion, participants were asked to identify all odors by means of free or cued identification. A retest of the odor memory test (Sniffin' TOM = test of odor memory) took place 17 days later. The results revealed satisfactory test-retest reliability (0.70) of odor recognition memory. Both recognition and identification performance were negatively affected by age and more pronounced among the cognitively impaired. In conclusion, the present work presents a reliable, valid, and simple test of episodic odor recognition memory that may be used in clinical groups where both episodic memory deficits and olfactory loss are prevalent preclinically such as Alzheimer's disease. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Construct Validity and Test-Retest Reliability of the Climbing Stairs Questionnaire in Lower-Limb Amputees

NARCIS (Netherlands)

de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

de Laat FA, Rommers GM, Geertzen JH, Roorda LD. Construct validity and test-retest reliability of the Climbing Stairs Questionnaire in lower-limb amputees. Arch Phys Med Rehabil 2010;91:1396-401. Objective: To investigate the construct validity and test-retest reliability of the Climbing Stairs
Work-related measures of physical and behavioral health function: Test-retest reliability.

Science.gov (United States)

Marino, Molly Elizabeth; Meterko, Mark; Marfeo, Elizabeth E; McDonough, Christine M; Jette, Alan M; Ni, Pengsheng; Bogusz, Kara; Rasch, Elizabeth K; Brandt, Diane E; Chan, Leighton

2015-10-01

The Work Disability Functional Assessment Battery (WD-FAB), developed for potential use by the US Social Security Administration to assess work-related function, currently consists of five multi-item scales assessing physical function and four multi-item scales assessing behavioral health function; the WD-FAB scales are administered as Computerized Adaptive Tests (CATs). The goal of this study was to evaluate the test-retest reliability of the WD-FAB Physical Function and Behavioral Health CATs. We administered the WD-FAB scales twice, 7-10 days apart, to a sample of 376 working age adults and 316 adults with work-disability. Intraclass correlation coefficients were calculated to measure the consistency of the scores between the two administrations. Standard error of measurement (SEM) and minimal detectable change (MDC90) were also calculated to measure the scales' precision and sensitivity. For the Physical Function CAT scales, the ICCs ranged from 0.76 to 0.89 in the working age adult sample, and 0.77-0.86 in the sample of adults with work-disability. ICCs for the Behavioral Health CAT scales ranged from 0.66 to 0.70 in the working age adult sample, and 0.77-0.80 in the adults with work-disability. The SEM ranged from 3.25 to 4.55 for the Physical Function scales and 5.27-6.97 for the Behavioral Health function scales. For all scales in both samples, the MDC90 ranged from 7.58 to 16.27. Both the Physical Function and Behavioral Health CATs of the WD-FAB demonstrated good test-retest reliability in adults with work-disability and general adult samples, a critical requirement for assessing work related functioning in disability applicants and in other contexts. Copyright © 2015 Elsevier Inc. All rights reserved.
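The link between the SEM and MDC90 values reported above can be sketched with the conventional formulas SEM = SD × sqrt(1 − ICC) and MDC = z × sqrt(2) × SEM. The abstract does not state which formulas or multiplier the authors used, so the function names, the default z, and the illustrative SD are assumptions.

```python
import math


def sem_from_icc(sd: float, icc: float) -> float:
    """Standard error of measurement from a between-subject SD and an ICC."""
    return sd * math.sqrt(1.0 - icc)


def mdc(sem: float, z: float = 1.645) -> float:
    """Minimal detectable change; z = 1.645 gives a 90% level, 1.96 a 95% level."""
    return z * math.sqrt(2.0) * sem


# Hypothetical scale with SD = 10 score units and ICC = 0.84.
sem = sem_from_icc(10.0, 0.84)
print(round(sem, 2), round(mdc(sem), 2))

# Smallest and largest SEM values quoted in the abstract, with z rounded to 1.65.
print(round(mdc(3.25, z=1.65), 2), round(mdc(6.97, z=1.65), 2))
```

With z rounded to 1.65, the quoted SEM range maps closely onto the quoted MDC90 range, which suggests, but does not confirm, that a formula of this form was used.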
Test-Retest Reliability and Minimal Detectable Change of the D2 Test of Attention in Patients with Schizophrenia.

Science.gov (United States)

Lee, Posen; Lu, Wen-Shian; Liu, Chin-Hsuan; Lin, Hung-Yu; Hsieh, Ching-Lin

2017-12-08

The d2 Test of Attention (D2) is a commonly used measure of selective attention for patients with schizophrenia. However, its test-retest reliability and minimal detectable change (MDC) are unknown in patients with schizophrenia, limiting its utility in both clinical and research settings. The aim of the present study was to examine the test-retest reliability and MDC of the D2 in patients with schizophrenia. A rater administered the D2 to 108 patients with schizophrenia twice at a 1-month interval. Test-retest reliability was determined through the calculation of the intra-class correlation coefficient (ICC). We also carried out Bland-Altman analysis, which included a scatter plot of the differences between test and retest against their mean. Systematic biases were evaluated by use of a paired t-test. The ICCs for the D2 ranged from 0.78 to 0.94. The MDCs (MDC%) of the seven subscores were 102.3 (29.7), 19.4 (85.0), 7.2 (94.6), 21.0 (69.0), 104.0 (33.1), 105.0 (35.8), and 7.8 (47.8), which represented limited-to-acceptable random measurement error. Trends in the Bland-Altman plots of the omissions (E1), commissions (E2), and errors (E) were noted, indicating that the data were heteroscedastic. According to the results, the D2 had good test-retest reliability, especially in the scores of TN, TN-E, and CP. For further research, finding a way to improve the administration procedure to reduce random measurement error would be important for the E1, E2, E, and FR subscores. © The Author(s) 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Test-retest reliability for aerodynamic measures of voice.

Science.gov (United States)

Awan, Shaheen N; Novaleski, Carolyn K; Yingling, Julie R

2013-11-01

The purpose of this study was to investigate the intrasubject reliability of aerodynamic characteristics of the voice within typical/normal speakers across testing sessions using the Phonatory Aerodynamic System (PAS 6600; KayPENTAX, Montvale, NJ). Participants were 60 healthy young adults (30 males and 30 females) between the ages 18 and 31 years with perceptually typical voice. Participants were tested using the PAS 6600 (Phonatory Aerodynamic System) on two separate days with approximately 1 week between each session at approximately the same time of day. Four PAS protocols were conducted (vital capacity, maximum sustained phonation, comfortable sustained phonation, and voicing efficiency) and measures of expiratory volume, maximum phonation time, mean expiratory airflow (during vowel production) and target airflow (obtained via syllable repetition), peak air pressure, aerodynamic power, aerodynamic resistance, and aerodynamic efficiency were obtained during each testing session. Associated acoustic measures of vocal intensity and frequency were also collected. All phonations were elicited at comfortable pitch and loudness. All aerodynamic and associated variables evaluated in this study showed useable test-retest reliability (ie, intraclass correlation coefficients [ICCs] ≥ 0.60). A high degree of mean test-retest reliability was found across all subjects for aerodynamic and associated acoustic measurements of vital capacity, maximum sustained phonation, glottal resistance, and vocal intensity (all with ICCs > 0.75). Although strong ICCs were observed for measures of glottal power and mean expiratory airflow in males, weaker overall results for these measures (ICC range: 0.60-0.67) were observed in female subjects and sizable coefficients of variation were observed for measures of power, resistance, and efficiency in both men and women. Differences in degree of reliability from measure to measure were revealed in greater detail using methods such as ICCs and

Test-retest reliability and factor structures of organizational citizenship behavior for Hong Kong workers.

Science.gov (United States)

Lam, S S

2001-02-01

In 1990 Podsakoff, MacKenzie, Moorman, and Fetter developed a scale to measure the five dimensions of organizational citizenship behavior. Test-retest data over 15 weeks are reported for this scale for a sample of 82 female and 32 male Chinese tellers (ages 18 to 54 years) from a large international bank in Hong Kong. Stability was .83, and there was no significant change between Times 1 and 2. Analysis indicated the five-factor structure and showed it to be a reliable measure when used with a nonwestern sample.
Internal consistency and retest reliability of the Croatian version of the PAQ-C questionnaire

OpenAIRE

Podnar, Hrvoje; Kunješić, Mateja; Radman, Ivan

2017-01-01

The aim of the study was to determine the internal consistency and retest reliability of the Croatian version of the PAQ-C in a sample of 6- to 10-year-old children and to report physical activity levels of elementary school pupils. The same set of questions was administered to the pupils on two different occasions, three weeks apart. Both testing rounds for 8- to 10-year-old pupils were conducted at school in the presence of an experienced researcher. In contrast, the 6- to 8-year-old pupils took the questio...
Reliability of the 6-minute walk test in people with sequelae of paralytic poliomyelitis using a 12-week test-retest

Directory of Open Access Journals (Sweden)

Francisco Javier Domínguez-Muñoz

2013-01-01

The reliability of the 6-minute walk test in people with sequelae of paralytic poliomyelitis, assessed by a 12-week test-retest, has not been studied. People with sequelae of paralytic poliomyelitis participated (n = 18; 48.72 ± 7.69 years; 65.8 ± 11.6 kg). They completed a 12-week test-retest of the 6-minute walk test, in which subjects walked the greatest possible distance, without running, in a period of 6 minutes. The relative reliability of the test was excellent (ICC = 0.99). As for absolute reliability, the standard error of measurement (SEM) was 1.7% and the smallest real difference (SRD) was 4.7%. Analysis of the 6-minute walk test using the Bland-Altman method showed a systematic error (mean difference between test and retest) of 2.72 (bias). In conclusion, the results obtained in the 6-minute walk test were highly reliable, and we affirm that the 6-minute walk test can be used as an assessment in a population with sequelae of paralytic poliomyelitis, with an interval of 12 weeks between the two measurements, to detect the changes produced by a physical activity program.
Test-retest reliability of measures of social network in the "Pró-Saúde" Study

Directory of Open Access Journals (Sweden)

Rosane Harter Griep

2003-06-01

OBJECTIVE: To evaluate the test-retest reliability of social network-related information in the "Pró-Saúde" study. METHODS: A test-retest reliability study was conducted using a multidimensional questionnaire applied to a cohort of university employees. The same questionnaire was filled out twice by 192 non-permanent employees, two weeks apart. Agreement was estimated using kappa statistics (categorical variables), weighted kappa statistics and log-linear models (ordinal variables), and the intraclass correlation coefficient (discrete variables). RESULTS: Measures of agreement were above 0.70 for most variables. When the information was stratified by gender, age, and education, reliability showed no consistent pattern of variability. Log-linear modeling indicated that, for the ordinal variables in the study, the best-fitting model was "diagonal agreement plus linear-by-linear association". CONCLUSIONS: The high estimated levels of reliability indicate that the measurement process for the social network items was adequate for the characteristics investigated. Ongoing validation studies will complement the assessment of the quality of this information.
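The kappa statistic used above for categorical variables corrects raw agreement between two administrations for the agreement expected by chance. The following is a minimal sketch of unweighted Cohen's kappa; the function name and the yes/no answers are hypothetical, and the weighted kappa and log-linear models mentioned in the abstract are not covered.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Unweighted Cohen's kappa for two equally long sequences of category labels."""
    if len(first) != len(second) or not first:
        raise ValueError("both administrations must be non-empty and equally long")
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    counts_1, counts_2 = Counter(first), Counter(second)
    # Chance agreement: product of the marginal proportions, summed over labels.
    expected = sum(counts_1[label] * counts_2[label] for label in counts_1) / n ** 2
    # Undefined (division by zero) when chance agreement is exactly 1.
    return (observed - expected) / (1.0 - expected)


# Hypothetical yes/no answers from 50 respondents at test and retest.
test_answers = ["yes"] * 25 + ["no"] * 25
retest_answers = ["yes"] * 20 + ["no"] * 5 + ["yes"] * 10 + ["no"] * 15
print(round(cohen_kappa(test_answers, retest_answers), 2))
```

In this made-up example 70% of answers match, but half of that agreement is expected by chance, so kappa is considerably lower than the raw agreement rate.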
Test-retest reliability of exercise-induced hypoalgesia after aerobic exercise

DEFF Research Database (Denmark)

Vaegter, Henrik Bjarke; Dørge, Daniel Bandholtz; Schmidt, Kristian Sonne

2018-01-01

Objective: Exercise increases pressure pain thresholds (PPTs) in exercising and nonexercising muscles, known as exercise-induced hypoalgesia (EIH). No studies have investigated the test-retest reliability of change in PPTs after aerobic exercise. Primary objectives were to compare the effect...
Maximal cardiorespiratory fitness testing in individuals with chronic stroke with cognitive impairment: practice test effects and test-retest reliability.

Science.gov (United States)

Olivier, Charles; Doré, Jean; Blanchet, Sophie; Brooks, Dina; Richards, Carol L; Martel, Guy; Robitaille, Nancy-Michelle; Maltais, Désirée B

2013-11-01

To evaluate, for individuals with chronic stroke with cognitive impairment, (1) the effects of a practice test on peak cardiorespiratory fitness test results; (2) cardiorespiratory fitness test-retest reliability; and (3) the relationship between individual practice test effects and cognitive impairment. Cross-sectional. Rehabilitation center. A convenience sample of 21 persons (men [n=12] and women [n=9]; age range, 48-81y; 44.9±36.2mo poststroke) with cognitive impairments who had sufficient lower limb function to perform the test. Not applicable. Peak oxygen consumption (Vo(2)peak, ml·kg(-1)·min(-1)). Test-retest reliability of Vo(2)peak was excellent (intraclass correlation coefficient model 2,1 [ICC2,1]=.94; 95% confidence interval [CI], .86-.98). A paired t test showed that there was no significant difference for the group for Vo(2)peak obtained from 2 symptom-limited cardiorespiratory fitness tests performed 1 week apart on a semirecumbent cycle ergometer (test 2-test 1 difference, -.32ml·kg(-1)·min(-1); 95% CI, -.69 to 1.33ml·kg(-1)·min(-1); P=.512). Individual test-retest differences in Vo(2)peak were, however, positively related to general cognitive function as measured by the Mini-Mental State Examination (ρ=.485; Preliably measured in this group without a practice test. General cognitive function, however, may influence the effect of a practice test in that those with lower general cognitive function appear to respond differently to a practice test than those with higher cognitive function. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep.

Science.gov (United States)

Wang, Jiahui; Han, Junwei; Nguyen, Vinh T; Guo, Lei; Guo, Christine C

2017-01-01

Resting state functional magnetic resonance imaging (rs-fMRI) provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to their low demand and high tolerance, resting state paradigms can be easily acquired from clinical populations. However, due to its unconstrained nature, the resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of what widespread clinical use requires. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV) derived from simultaneous electrocardiogram (ECG) recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI was significantly improved, and the improvement appeared to be general across brain networks. The amount of improvement was robust to the removal of as much as 60% of volumes as sleepy. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.
Assessing the test-retest repeatability of the Vietnamese version of the National Eye Institute 25-item Visual Function Questionnaire among bilateral cataract patients for a Vietnamese population.

Science.gov (United States)

To, Kien Gia; Meuleners, Lynn; Chen, Huei-Yang; Lee, Andy; Do, Dung Van; Duong, Dat Van; Phi, Tien Duy; Tran, Hoang Huy; Nguyen, Nguyen Do

2014-06-01

To determine the test-retest repeatability of the National Eye Institute 25-item Visual Function Questionnaire (NEI VFQ-25) for use with older Vietnamese adults with bilateral cataract. The questionnaire was translated into Vietnamese and back-translated into English by two independent translators. Patients with bilateral cataract aged 50 and older completed the questionnaire on two separate occasions, one to two weeks after first administration of the questionnaire. Test-retest repeatability was assessed using Cronbach's α and intraclass correlation coefficients. The average age of participants was 67 ± 8 years and most participants were female (73%). Internal consistency was acceptable, with the α coefficient above 0.7 for all subscales, and intraclass correlation coefficients were 0.6 or greater in all subscales. The Vietnamese NEI VFQ-25 is reliable for use in studies assessing vision-related quality of life in older adults with bilateral cataract in Vietnam. We propose some modifications to the NEI-VFQ questions to reflect activities of older people in Vietnam. © 2013 ACOTA.
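Cronbach's α, reported above alongside the ICCs, summarizes internal consistency within a single administration, whereas the ICC compares scores across administrations. The following is a minimal sketch of the standard α formula; the function name and item scores are hypothetical, and sample (n - 1) variances are assumed.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha; item_scores holds one list of k item scores per respondent."""
    k = len(item_scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item across respondents, and variance of the summed score.
    item_vars = [variance([row[j] for row in item_scores]) for j in range(k)]
    total_var = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


# Hypothetical responses of five patients to a three-item subscale (1-5 scale).
subscale = [[4, 5, 4], [2, 2, 3], [5, 4, 5], [3, 3, 2], [1, 2, 1]]
print(round(cronbach_alpha(subscale), 3))
```

A high α with a modest ICC would indicate that items agree with one another on a given day even though scores drift between days, so the two statistics answer different questions.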
Evaluating test-retest reliability in patient-reported outcome measures for older people: A systematic review.

Science.gov (United States)

Park, Myung Sook; Kang, Kyung Ja; Jang, Sun Joo; Lee, Joo Yun; Chang, Sun Ju

2018-03-01

This study aimed to evaluate the components of test-retest reliability including time interval, sample size, and statistical methods used in patient-reported outcome measures in older people and to provide suggestions on the methodology for calculating test-retest reliability for patient-reported outcomes in older people. This was a systematic literature review. MEDLINE, Embase, CINAHL, and PsycINFO were searched from January 1, 2000 to August 10, 2017 by an information specialist. This systematic review was guided by both the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and the guideline for systematic review published by the National Evidence-based Healthcare Collaborating Agency in Korea. The methodological quality was assessed by the Consensus-based Standards for the selection of health Measurement Instruments checklist box B. Ninety-five out of 12,641 studies were selected for the analysis. The median time interval for test-retest reliability was 14 days, and the ratio of sample size for test-retest reliability to the number of items in each measure ranged from 1:1 to 1:4. The most frequently used statistical method for continuous scores was the intraclass correlation coefficient (ICC). Among the 63 studies that used ICCs, 21 studies presented models for ICC calculations and 30 studies reported 95% confidence intervals of the ICCs. Additional analyses using 17 studies that reported a strong ICC (>0.09) showed that the mean time interval was 12.88 days and the mean ratio of the number of items to sample size was 1:5.37. When researchers plan to assess the test-retest reliability of patient-reported outcome measures for older people, they need to consider an adequate time interval of approximately 13 days and a sample size of about 5 times the number of items. Particularly, statistical methods should not only be selected based on the types of scores of the patient-reported outcome measures, but should also be described clearly in
A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders

DEFF Research Database (Denmark)

Stupar, Maja; Côté, Pierre; Beaton, Dorcas E

2015-01-01

OBJECTIVE: The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). METHODS: We performed a test-retest reliability study. We includ...
7 CFR 201.55 - Retests.

Science.gov (United States)

2010-01-01

... Germination Tests in the Administration of the Act § 201.55 Retests. Retests shall be made as follows: (a) When the range of 100-seed replicates of a given test exceeds the maximum tolerated range in the table... replicates of a given test, rounding off the result to the nearest whole number. The germination is found in...
CPM Test-Retest Reliability: "Standard" vs "Single Test-Stimulus" Protocols.

Science.gov (United States)

Granovsky, Yelena; Miller-Barmak, Adi; Goldstein, Oren; Sprecher, Elliot; Yarnitsky, David

2016-03-01

Assessment of pain inhibitory mechanisms using conditioned pain modulation (CPM) is relevant clinically in prediction of pain and analgesic efficacy. Our objective is to provide necessary estimates of intersession CPM reliability, to enable transformation of the CPM paradigm into a clinical tool. Two cohorts of young healthy subjects (N = 65) participated in two dual-session studies. In Study I, a Bath-Thermode CPM protocol was used, with hot water immersion and contact heat as conditioning- and test-stimuli, respectively, in a classical parallel CPM design introducing test-stimulus first, and then the conditioning- and repeated test-stimuli in parallel. Study II consisted of two CPM protocols: 1) Two-Thermodes, one for each of the stimuli, in the same parallel design as above, and 2) single test-stimulus (STS) protocol with a single administration of a contact heat test-stimulus, partially overlapped in time by a remote shorter contact heat as conditioning stimulus. Test-retest reliability was assessed within 3-7 days. The STS-CPM had superior reliability intraclass correlation (ICC(2,1) = 0.59) over Bath-Thermode (ICC(2,1) = 0.34) or Two-Thermodes (ICC(2,1) = 0.21) protocols. The hand immersion conditioning pain had higher reliability than thermode pain (ICC(2,1) = 0.76 vs ICC(2,1) = 0.16). Conditioned test-stimulus pain scores were of good (ICC(2,1) = 0.62) or fair (ICC(2,1) = 0.43) reliability for the Bath-Thermode and the STS, respectively, but not for the Two-Thermodes protocol (ICC(2,1) = 0.20). The newly developed STS-CPM paradigm was more reliable than other CPM protocols tested here, and should be further investigated for its clinical relevance. It appears that large contact size of the conditioning-stimulus and use of single rather than dual test-stimulus pain contribute to augmentation of CPM reliability. © 2015 American Academy of Pain Medicine. All rights reserved.
Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep

Directory of Open Access Journals (Sweden)

Jiahui Wang

2017-05-01

Resting state functional magnetic resonance imaging (rs-fMRI) provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to its low demand and high tolerance, resting state paradigms can be easily acquired from clinical populations. However, due to its unconstrained nature, the resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of widespread use in the clinic. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV) derived from simultaneous electrocardiogram (ECG) recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI was significantly improved, and the improvement appears to be general across brain networks. The amount of improvement is robust with the removal of as much as 60% volumes of sleepiness. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.
Test-retest reliability and practice effects of the Wechsler Memory Scale-III.

Science.gov (United States)

Lo, Ada H Y; Humphreys, Michael; Byrne, Gerard J; Pachana, Nancy A

2012-09-01

Although serial administration of cognitive tests is increasingly common, there is a paucity of research on test-retest reliabilities and practice effects, both of which are important for evaluating changes in functioning. Reliability is generally conceptualized as involving short-lasting changes in performance. However, when repeated testing occurs over a period of years, there will be some longer lasting effects. The implications of these longer lasting effects and practice effects on reliability were examined in the context of repeated administrations of the Wechsler Memory Scale-III in 339 community-dwelling women aged 40-79 years over 2 to 7 years. The results showed that Logical Memory and Verbal Paired Associates subtests were consistently the most reliable subtests across the age cohorts. The magnitude of practice effects varied as a function of subtests and age. The largest practice effects were found in the youngest age cohort, especially on the Faces, Logical Memory, and Verbal Paired Associates subtests. ©2012 The British Psychological Society.
The internal consistency of the standard gamble: tests after adjusting for prospect theory.

Science.gov (United States)

Oliver, Adam

2003-07-01

This article reports a study that tests whether the internal consistency of the standard gamble can be improved upon by incorporating loss weighting and probability transformation parameters in the standard gamble valuation procedure. Five alternatives to the standard EU formulation are considered: (1) probability transformation within an EU framework; and, within a prospect theory framework, (2) loss weighting and full probability transformation, (3) no loss weighting and full probability transformation, (4) loss weighting and no probability transformation, and (5) loss weighting and partial probability transformation. Of the five alternatives, only the prospect theory formulation with loss weighting and no probability transformation offers an improvement in internal consistency over the standard EU valuation procedure.
Evaluating the test-retest reliability of symptom indices associated with the ImPACT post-concussion symptom scale (PCSS).

Science.gov (United States)

Merritt, Victoria C; Bradson, Megan L; Meyer, Jessica E; Arnett, Peter A

2018-05-01

The Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) is a commonly used tool in sports concussion assessment. While test-retest reliabilities have been established for the ImPACT cognitive composites, few studies have evaluated the psychometric properties of the ImPACT's Post-Concussion Symptom Scale (PCSS). The purpose of this study was to establish the test-retest reliability of symptom indices associated with the PCSS. Participants included 38 undergraduate students (50.0% male) who underwent neuropsychological testing as part of their participation in their psychology department's research subject pool. The majority of the participants were Caucasian (94.7%) and had no history of concussion (73.7%). All participants completed the ImPACT at two time points, approximately 6 weeks apart. The PCSS was the main outcome measure, and eight symptom indices were calculated (a total symptom score, three symptom summary indices, and four symptom clusters). Pearson correlations (r) and intraclass correlation coefficients (ICCs) were computed as measures of test-retest reliability. Overall, reliabilities ranged from low to high (r = .44 to .80; ICC = .44 to .77). The cognitive symptom cluster exhibited the highest test-retest reliability (r = .80, ICC = .77), followed by the positive symptom total (PST) index, an indicator of the total number of symptoms endorsed (r = .71, ICC = .69). In contrast, the commonly used total symptom score showed lower test-retest reliability (r = .67, ICC = .62). Paired-samples t tests revealed no significant differences between test and retest for any of the symptom variables (all p > .01). Finally, reliable change indices (RCI) were computed to determine whether differences observed between test and retest represented clinically significant change. RCI values were provided for each symptom index at the 80%, 90%, and 95% confidence intervals. These results suggest that evaluating additional symptom
Learning effect and test-retest variability of pulsar perimetry.

Science.gov (United States)

Salvetat, Maria Letizia; Zeppieri, Marco; Parisi, Lucia; Johnson, Chris A; Sampaolesi, Roberto; Brusini, Paolo

2013-03-01

To assess Pulsar Perimetry learning effect and test-retest variability (TRV) in normal (NORM), ocular hypertension (OHT), glaucomatous optic neuropathy (GON), and primary open-angle glaucoma (POAG) eyes. This multicenter prospective study included 43 NORM, 38 OHT, 33 GON, and 36 POAG patients. All patients underwent standard automated perimetry and Pulsar Contrast Perimetry using white stimuli modulated in phase and counterphase at 30 Hz (CP-T30W test). The learning effect and TRV for Pulsar Perimetry were assessed for 3 consecutive visual fields (VFs). The learning effect was evaluated by comparing results from the first session with the other 2. TRV was assessed by calculating the mean of the differences (in absolute value) between retests for each combination of single tests. TRV was calculated for Mean Sensitivity, Mean Defect, and single Mean Sensitivity for each of the 66 test locations. Influence of age, VF eccentricity, and loss severity on TRV was assessed using linear regression analysis and analysis of variance. The learning effect was not significant in any group (analysis of variance, P>0.05). TRV for Mean Sensitivity and Mean Defect was significantly lower in NORM and OHT (0.6 ± 0.5 spatial resolution contrast units) than in GON and POAG (0.9 ± 0.5 and 1.0 ± 0.8 spatial resolution contrast units, respectively) (Kruskal-Wallis test, P=0.04); however, the differences in NORM among age groups were not significant (Kruskal-Wallis test, P>0.05). Slight significant differences were found for the single Mean Sensitivity TRV among single locations (Duncan test, P [...]) [...] Pulsar Perimetry CP-T30W test did not show significant learning effect in patients with standard automated perimetry experience. TRV for global indices was generally low, and was not related to patient age; it was only slightly affected by VF defect eccentricity, and significantly influenced by VF loss severity.
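The TRV definition in the abstract above (mean absolute difference between retests over every pairwise combination of single tests) can be sketched as follows. This is a minimal illustration only: the function name and the three sensitivity values are hypothetical and are not taken from the study.

```python
from itertools import combinations

def mean_abs_retest_difference(values):
    """Mean absolute difference over all pairs of repeated tests.

    `values` holds one index (e.g. Mean Sensitivity) from each repeated
    visual field of a single eye. Hypothetical helper, not the authors' code.
    """
    pairs = list(combinations(values, 2))
    return sum(abs(a - b) for a, b in pairs) / len(pairs)

# Hypothetical Mean Sensitivity values from 3 consecutive visual fields.
sessions = [20.0, 21.0, 20.5]
trv = mean_abs_retest_difference(sessions)
print(trv)
```

With three sessions there are three pairs (1-2, 1-3, 2-3), so the result is the average of three absolute differences.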
Test-Retest Reliability of the Preschool Age Psychiatric Assessment (PAPA)

Science.gov (United States)

Egger, Helen Link; Erkanli, Alaattin; Keeler, Gordon; Potts, Edward; Walter, Barbara Keith; Angold, Adrian

2006-01-01

Objective: To examine the test-retest reliability of a new interviewer-based psychiatric diagnostic measure (the Preschool Age Psychiatric Assessment) for use with parents of preschoolers 2 to 5 years old. Method: A total of 1,073 parents of children attending a large pediatric clinic completed the Child Behavior Checklist 1 1/2-5. For 18 months,…
Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

Science.gov (United States)

Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

2018-03-02

There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.
Short message service reminder intervention doubles sexually transmitted infection/HIV re-testing rates among men who have sex with men.

Science.gov (United States)

Bourne, C; Knight, V; Guy, R; Wand, H; Lu, H; McNulty, A

2011-04-01

To evaluate the impact of a short message service (SMS) reminder system on HIV/sexually transmitted infection (STI) re-testing rates among men who have sex with men (MSM). The SMS reminder programme started in late 2008 at a large Australian sexual health clinic. SMS reminders were recommended 3-6 monthly for MSM considered high-risk based on self-reported sexual behaviour. The evaluation compared HIV negative MSM who had a HIV/STI test between 1 January and 31 August 2010 and received a SMS reminder (SMS group) with those tested in the same time period (comparison group) and pre-SMS period (pre-SMS group, 1 January 2008 and 31 August 2008) who did not receive the SMS. HIV/STI re-testing rates were measured within 9 months for each group. Baseline characteristics were compared between study groups and multivariate logistic regression used to assess the association between SMS and re-testing and control for any imbalances in the study groups. There were 714 HIV negative MSM in the SMS group, 1084 in the comparison group and 1753 in the pre-SMS group. In the SMS group, 64% were re-tested within 9 months compared to 30% in the comparison group (p [...]) [...] reminders increased HIV/STI re-testing among HIV negative MSM. SMS offers a cheap, efficient system to increase HIV/STI re-testing in a busy clinical setting.

Test-retest reliability and smallest detectable change of the Bristol Impact of Hypermobility (BIoH) questionnaire.

Science.gov (United States)

Palmer, S; Manns, S; Cramp, F; Lewis, R; Clark, E M

2017-12-01

The Bristol Impact of Hypermobility (BIoH) questionnaire is a patient-reported outcome measure developed in conjunction with adults with Joint Hypermobility Syndrome (JHS). It has demonstrated strong concurrent validity with the Short Form-36 (SF-36) physical component score but other psychometric properties have yet to be established. This study aimed to determine its test-retest reliability and smallest detectable change (SDC). A test-retest reliability study. Participants were recruited from the Hypermobility Syndromes Association, a patient organisation in the United Kingdom. Recruitment packs were sent to 1080 adults who had given permission to be contacted about research. BIoH and SF-36 questionnaires were administered at baseline and repeated two weeks later. An 11-point global rating of change scale (-5 to +5) was also administered at two weeks. Test-retest analysis and calculation of the SDC was conducted on 'stable' patients (defined as global rating of change -1 to +1). 462 responses were received. 233 patients reported a 'stable' condition and were included in analysis (95% women; mean (SD) age 44.5 (13.9) years; BIoH score 223.6 (54.0)). The BIoH questionnaire demonstrated excellent test-retest reliability (ICC 0.923, 95% CI 0.900-0.940). The SDC was 42 points (equivalent to 19% of the mean baseline score). The SF-36 physical and mental component scores demonstrated poorer test-retest reliability and larger SDCs (as a proportion of the mean baseline scores). The results provide further evidence of the potential of the BIoH questionnaire to underpin research and clinical practice for people with JHS. Copyright © 2017 Elsevier Ltd. All rights reserved.
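The abstract above does not state how the SDC was computed. A minimal sketch, assuming the commonly used formula SDC = 1.96 × √2 × SEM with SEM = SD × √(1 − ICC), and assuming the reported baseline SD (54.0) is the SD that enters the SEM, gives values in line with the reported 42 points and 19%. The function name is hypothetical.

```python
import math

def smallest_detectable_change(sd, icc, z=1.96):
    """SDC = z * sqrt(2) * SEM, where SEM = SD * sqrt(1 - ICC).

    Assumed formula; the abstract does not say which one the authors used.
    """
    sem = sd * math.sqrt(1.0 - icc)
    return z * math.sqrt(2.0) * sem

# Values reported in the abstract: baseline SD 54.0, ICC 0.923, mean 223.6.
sdc = smallest_detectable_change(sd=54.0, icc=0.923)
percent_of_mean = 100.0 * sdc / 223.6
print(round(sdc, 1), round(percent_of_mean, 1))
```

Under these assumptions the SDC comes out at roughly 41.5 points, which rounds to the 42 points quoted in the abstract.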
Test-retest reliability of computer-based video analysis of general movements in healthy term-born infants.

Science.gov (United States)

Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars

2015-10-01

A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (p [...]) [...] test-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
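To illustrate the difference between the two coefficients reported above, here is a minimal sketch of ICC(1,1) and ICC(3,1) using the standard Shrout and Fleiss mean-square formulas. The function name and the toy data are hypothetical; this is not the software used in the study.

```python
def icc_1_1_and_3_1(scores):
    """Return (ICC(1,1), ICC(3,1)) for a subjects-by-sessions table.

    `scores` is a list of rows, one per subject, each with k repeated
    measurements. Uses one-way and two-way ANOVA mean squares.
    """
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)  # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)  # between sessions

    bms = ss_rows / (n - 1)                                 # between-subject MS
    wms = (ss_total - ss_rows) / (n * (k - 1))              # within-subject MS
    ems = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))  # residual MS

    icc11 = (bms - wms) / (bms + (k - 1) * wms)
    icc31 = (bms - ems) / (bms + (k - 1) * ems)
    return icc11, icc31

# Hypothetical data: every retest is exactly 1 unit above the first test.
data = [[1.0, 2.0], [2.0, 3.0], [3.0, 4.0], [4.0, 5.0]]
icc11, icc31 = icc_1_1_and_3_1(data)
print(round(icc11, 3), round(icc31, 3))
```

In this toy case the constant shift between sessions lowers ICC(1,1) but leaves ICC(3,1) at 1, because the two-way model removes the session effect from the error term. That is one reason the two coefficients can differ for the same data.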
Test-retest, inter-assessor and intra-assessor reliability of the modified Touwen examination

NARCIS (Netherlands)

Peters, Lieke H. J.; Maathuis, Karel G. B.; Kouw, Eva; Hamming, Marjolein; Hadders-Algra, Mijna

Interest in the Touwen examination (1979) for the assessment of minor neurological dysfunction (MND) is growing. However, information on psychometric properties of this assessment is scarce. Therefore the present study aimed at assessing the test's test-retest, inter- and intra-assessor reliability.
Development, test-retest reliability, and construct validity of the resistance training skills battery.

Science.gov (United States)

Lubans, David R; Smith, Jordan J; Harries, Simon K; Barnett, Lisa M; Faigenbaum, Avery D

2014-05-01

The aim of this study was to describe the development and assess test-retest reliability and construct validity of the Resistance Training Skills Battery (RTSB) for adolescents. The RTSB provides an assessment of resistance training skill competency and includes 6 exercises (i.e., body weight squat, push-up, lunge, suspended row, standing overhead press, and front support with chest touches). Scoring for each skill is based on the number of performance criteria successfully demonstrated. An overall resistance training skill quotient (RTSQ) is created by adding participants' scores for the 6 skills. Participants (44 boys and 19 girls, mean age = 14.5 ± 1.2 years) completed the RTSB on 2 occasions separated by 7 days. Participants also completed the following fitness tests, which were used to create a muscular fitness score (MFS): handgrip strength, timed push-up, and standing long jump tests. Intraclass correlation (ICC), paired samples t-tests, and typical error were used to assess test-retest reliability. To assess construct validity, gender and RTSQ were entered into a regression model predicting MFS. The rank order repeatability of the RTSQ was high (ICC = 0.88). The model explained 39% of the variance in MFS (p ≤ 0.001) and RTSQ (r = 0.40, p ≤ 0.001) was a significant predictor. This study has demonstrated the construct validity and test-retest reliability of the RTSB in a sample of adolescents. The RTSB can reliably rank participants in regards to their resistance training competency and has the necessary sensitivity to detect small changes in resistance training skill proficiency.
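The abstract above lists typical error among its reliability statistics without giving a formula. A minimal sketch, assuming the usual definition (standard deviation of the test-retest difference scores divided by √2), is shown below. The function name and the scores are hypothetical.

```python
import math
from statistics import stdev

def typical_error(test, retest):
    """Typical error = SD of difference scores / sqrt(2) (assumed definition)."""
    diffs = [b - a for a, b in zip(test, retest)]
    return stdev(diffs) / math.sqrt(2.0)

# Hypothetical skill quotients for four participants on two occasions.
test_scores = [30, 35, 40, 45]
retest_scores = [32, 34, 43, 45]
te = typical_error(test_scores, retest_scores)
print(round(te, 2))
```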
[11C]Harmine Binding to Brain Monoamine Oxidase A: Test-Retest Properties and Noninvasive Quantification.

Science.gov (United States)

Zanderigo, Francesca; D'Agostino, Alexandra E; Joshi, Nandita; Schain, Martin; Kumar, Dileep; Parsey, Ramin V; DeLorenzo, Christine; Mann, J John

2018-02-08

Inhibition of the isoform A of monoamine oxidase (MAO-A), a mitochondrial enzyme catalyzing deamination of monoamine neurotransmitters, is useful in treatment of depression and anxiety disorders. [11C]harmine, a MAO-A PET radioligand, has been used to study mood disorders and antidepressant treatment. However, [11C]harmine binding test-retest characteristics have to date only been partially investigated. Furthermore, since MAO-A is ubiquitously expressed, no reference region is available, thus requiring arterial blood sampling during PET scanning. Here, we investigate [11C]harmine binding measurements test-retest properties; assess effects of using a minimally invasive input function estimation on binding quantification and repeatability; and explore binding potentials estimation using a reference region-free approach. Quantification of [11C]harmine distribution volume (VT) via kinetic models and graphical analyses was compared based on absolute test-retest percent difference (TRPD), intraclass correlation coefficient (ICC), and identifiability. The optimal procedure was also used with a simultaneously estimated input function in place of the measured curve. Lastly, an approach for binding potentials quantification in absence of a reference region was evaluated. [11C]harmine VT estimates quantified using arterial blood and kinetic modeling showed average absolute TRPD values of 7.7 to 15.6%, and ICC values between 0.56 and 0.86, across brain regions. Using simultaneous estimation (SIME) of input function resulted in VT estimates close to those obtained using arterial input function (r = 0.951, slope = 1.073, intercept = -1.037), with numerically but not statistically higher test-retest difference (range 16.6 to 22.0%), but with overall poor ICC values, between 0.30 and 0.57. Prospective studies using [11C]harmine are possible given its test-retest repeatability when binding is quantified using arterial blood. Results with SIME of
A Computer-Based Sustained Visual Attention Test for Pre-School Children: Design, Development and Psychometric Properties

Directory of Open Access Journals (Sweden)

Roohollah Zahedian Nasb

2016-06-01

Background: Sustained visual attention is a prerequisite for learning and memory. The early evaluation of attention in childhood is essential for children's future school and career success. The aim of this study was to design and develop the computer-based sustained visual attention test (SuVAT) for healthy preschool children aged 4-6 with their special needs, and to investigate its psychometric properties (content, face and convergent validity, and test-retest and internal consistency reliability). Methods: This study was carried out in two stages. In the first stage, the computer-based SuVAT was developed in two versions, original and parallel. Then test-retest and internal consistency reliability were examined using intra-class correlation and Cronbach's alpha coefficients, respectively; face validity was assessed by gathering opinions from 10 preschool children, content validity was evaluated using the CVI and CVR methods, and convergent validity of the SuVAT with the CPT was assessed using Pearson correlation. Results: The developed test showed good content and face validity, and also had excellent test-retest reliability. In addition, the assessment of internal consistency indicated high internal consistency of the test (Cronbach's alpha=0.869). The SuVAT and the CPT demonstrated a positive correlation in convergent validity testing. Conclusion: The SuVAT, with good reliability and validity, could be used as an acceptable sustained attention assessment in preschool children.
Test-Retest Reliability of Rating of Perceived Exertion and Agreement With 1-Repetition Maximum in Adults.

Science.gov (United States)

Bove, Allyn M; Lynch, Andrew D; DePaul, Samantha M; Terhorst, Lauren; Irrgang, James J; Fitzgerald, G Kelley

2016-09-01

Study Design: Clinical measurement. Background: It has been suggested that rating of perceived exertion (RPE) may be a useful alternative to 1-repetition maximum (1RM) to determine proper resistance exercise dosage. However, the test-retest reliability of RPE for resistance exercise has not been determined. Additionally, prior research regarding the relationship between 1RM and RPE is conflicting. Objectives: The purpose of this study was to (1) determine test-retest reliability of RPE related to resistance exercise and (2) assess agreement between percentages of 1RM and RPE during quadriceps resistance exercise. Methods: A sample of participants with and without knee pathology completed a series of knee extension exercises and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale, then repeated the procedure 1 to 2 weeks later for test-retest reliability. To determine agreement between RPE and 1RM, participants completed knee extension exercises at various percentages of their 1RM (10% to 130% of predicted 1RM) and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale. Percent agreement was calculated between the 1RM and RPE at each resistance interval. Results: The intraclass correlation coefficient indicated excellent test-retest reliability of RPE for quadriceps resistance exercises (intraclass correlation coefficient = 0.895; 95% confidence interval: 0.866, 0.918). Overall percent agreement between RPE and 1RM was 60%, but agreement was poor within the ranges that would typically be used for training (50% 1RM for muscle endurance, 70% 1RM and greater for strength). Conclusion: Test-retest reliability of perceived exertion during quadriceps resistance exercise was excellent. However, agreement between the RPE and 1RM was poor, especially in common training zones for knee extensor strengthening. J Orthop Sports Phys Ther 2016;46(9):768-774. Epub 5 Aug 2016. doi:10.2519/jospt.2016.6498.
The Physical Activity Scale for Individuals with Physical Disabilities: test-retest reliability and comparison with an accelerometer.

Science.gov (United States)

van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem

2007-01-01

The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects' diagnoses were: stroke, spinal cord injury, whiplash, and neurological-, orthopedic- or back disorders. The PASIPD is a 7-d recall physical activity questionnaire that was completed twice, 1 wk apart. During this week, physical activity was also measured with an Actigraph accelerometer. The test-retest reliability Spearman correlation of the PASIPD was 0.77. The criterion validity Spearman correlation was 0.30 when compared to the accelerometer. The PASIPD had test-retest reliability and criterion validity that is comparable to well established self-report physical activity questionnaires from the general population.
The Dichotic Digits difference Test (DDdT): Development, Normative Data, and Test-Retest Reliability Studies Part 1.

Science.gov (United States)

Cameron, Sharon; Glyde, Helen; Dillon, Harvey; Whitfield, Jessica; Seymour, John

2016-06-01

The dichotic digits test is one of the most widely used assessment tools for central auditory processing disorder. However, questions remain concerning the impact of cognitive factors on test results. To develop the Dichotic Digits difference Test (DDdT), an assessment tool that could differentiate children with cognitive deficits from children with genuine dichotic deficits based on differential test results. The DDdT consists of four subtests: dichotic free recall (FR), dichotic directed left ear (DLE), dichotic directed right ear (DRE), and diotic. Scores for six conditions are calculated (FR left ear [LE], FR right ear [RE], and FR total, as well as DLE, DRE, and diotic). Scores for four difference measures are also calculated: dichotic advantage, right-ear advantage (REA) FR, REA directed, and attention advantage. Experiment 1 involved development of the DDdT, including error rate analysis. Experiment 2 involved collection of normative and test-retest reliability data. Twenty adults (aged 25 yr 10 mo to 50 yr 7 mo, mean 36 yr 4 mo) took part in the development study; 62 normal-hearing, typically developing, primary-school children (aged 7 yr 1 mo to 11 yr 11 mo, mean 9 yr 4 mo) and 10 adults (aged 25 yr 0 mo to 51 yr 6 mo, mean 34 yr 10 mo) took part in the normative and test-retest reliability study. In Experiment 1, error rate analysis was conducted on the 36 digit-pair combinations of the DDdT. Normative data collected in Experiment 2 were arcsine transformed to achieve a distribution that was closer to a normal distribution and z-scores calculated. Pearson product-moment correlations were used to determine the strength of relationships between DDdT conditions. The development study revealed no significant differences in the adult population between test and retest on any DDdT condition. Error rates on 36 digit pairs ranged from 1.5% to 16.7%. The most and the least error-prone digits were removed before commencement of the normative data study, leaving 25
Dual conception of risk in the Iowa Gambling Task: Effects of sleep deprivation and test-retest gap

Directory of Open Access Journals (Sweden)

Varsha Singh

2013-09-01

Risk in the Iowa Gambling Task (IGT) is often understood in terms of intertemporal choices, i.e., preference for immediate outcomes in favor of delayed outcomes is considered risky. According to behavioral economics, decision makers refrain from choosing the short-sighted immediate gain because, over time (10 trials), the immediate gains result in a net loss. Instead decision makers are expected to maximize their gains by choosing options that, over time (10 trials), result in net gain. However, task choices are sometimes made on the basis of the frequency of reward and punishment such that infrequent punishments are favored over frequent punishments. The presence of these two attributes (intertemporality and frequency) may correspond to the emotion-cognition dichotomy and reflect a dual conception of risk. Decision making on the basis of the two attributes was tested under two conditions: test-retest gap and sleep deprivation. An interaction between these two was expected to attenuate the difference between the two attributes (n=40 male). Analysis of the effects of IGT attribute type (intertemporal vs. frequency), sleep deprivation (sleep deprivation vs. no sleep deprivation), and test-retest gap (short vs. long) showed a significant effect of IGT attribute type, thus confirming the difference between the two attributes. Sleep deprivation had no effect on the attributes, but test-retest gap and the three-way interaction between attribute type, test-retest gap, and sleep deprivation were significant. Post-hoc tests showed sleep deprivation and short test-retest gap to attenuate the difference between the two attributes. As expected, intertemporal decision making benefited from repeated task exposure. The findings add to understanding of the emotion-cognition dichotomy and show a time-dependent effect of a universally experienced constraint (sleep deprivation).
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

Science.gov (United States)

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct possible discrepancies. It was then administered to 50 patients with different shoulder conditions. Psychometric properties were analyzed, including internal consistency, measured with Cronbach's alpha, and test-retest reliability at 15 days, measured with the intraclass correlation coefficient. Results: Internal consistency was a Cronbach's alpha of 0.808, evaluated as good. The test-retest reliability index as measured by the intraclass correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The translation of the Simple Shoulder Test and its cultural adaptation to Argentinian Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in comparisons with international patient samples.
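As an illustration of the internal-consistency statistic reported above, the following is a minimal sketch of Cronbach's alpha computed from raw item scores. It is not the authors' analysis code; the function name `cronbach_alpha` and the toy data are hypothetical, and the standard formula alpha = k/(k-1) × (1 − Σ item variances / variance of the sum score) is assumed.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Hypothetical helper for illustration; uses sample variances (n - 1).
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of each respondent's total (sum) score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data (made up): three respondents, two perfectly parallel items.
toy = [[1, 1], [2, 2], [3, 3]]
alpha = cronbach_alpha(toy)
```

With perfectly parallel items, as in the toy data, alpha reaches its maximum of 1; real questionnaires such as the one above yield lower values.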
Test-retest reliability of Eurofit Physical Fitness items for children with visual impairments

NARCIS (Netherlands)

Houwen, Suzanne; Visscher, Chris; Hartman, Esther; Lemmink, Koen A. P. M.

The purpose of this study was to examine the test-retest reliability of physical fitness items from the European Test of Physical Fitness (Eurofit) for children with visual impairments. A sample of 21 children, ages 6-12 years, that were recruited from a special school for children with visual
Characteristics and international comparability of the Finnish matrix sentence test in cochlear implant recipients.

Science.gov (United States)

Dietz, Aarno; Buschermöhle, Michael; Sivonen, Ville; Willberg, Tytti; Aarnisalo, Antti A; Lenarz, Thomas; Kollmeier, Birger

2015-01-01

The first Finnish sentence-based speech test in noise--the Finnish matrix sentence test--was recently developed. The aim of this study was to determine the characteristics of the new test with respect to test-retest reliability, speech recognition curve, and international comparability in Finnish cochlear implant (CI) recipients. The speech reception thresholds (SRT) were measured by means of an adaptive test procedure and compared with the results of the traditional Finnish word test. Additional measurements for concurrent slope and SRT estimation were conducted to determine the speech recognition curve and to check the test-retest reliability. The measurements were performed on 78 Finnish CI recipients. In a subset of 25 patients, additional measurements for test-retest reliability and slope determination were performed. The mean SRT was -3.5 ± 1.7 dB SNR, with only a weak correlation with the Finnish word test. Test-retest reliability was within ± 1 dB and the mean slope of the speech recognition curve was 14.6 ± 3.6 %/dB. The rehabilitation results were similar to the results published for the German matrix test. The Finnish matrix test was found to be suitable and efficient in CI recipients with similar characteristics as the German matrix test.
Test-retest reliability of the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA).

Science.gov (United States)

Bégel, Valentin; Verga, Laura; Benoit, Charles-Etienne; Kotz, Sonja A; Bella, Simone Dalla

2018-04-27

Perceptual and sensorimotor timing skills can be comprehensively assessed with the Battery for the Assessment of Auditory Sensorimotor and Timing Abilities (BAASTA). The battery has been used for testing rhythmic skills in healthy adults and patient populations (e.g., with Parkinson disease), showing sensitivity to timing and rhythm deficits. Here we assessed the test-retest reliability of the BAASTA in 20 healthy adults. Participants were tested twice with the BAASTA, implemented on a tablet interface, with a 2-week interval. They completed 4 perceptual tasks, namely, duration discrimination, anisochrony detection with tones and music, and the Beat Alignment Test (BAT). Moreover, they completed motor tasks via finger tapping, including unpaced and paced tapping with tones and music, synchronization-continuation, and adaptive tapping to a sequence with a tempo change. Despite high variability among individuals, the results showed stable test-retest reliability in most tasks. A slight but significant improvement from test to retest was found in tapping with music, which may reflect a learning effect. In general, the BAASTA was found to be a reliable tool for evaluating timing and rhythm skills. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Cross-Cultural Adaptation of the Profile Fitness Mapping Neck Questionnaire to Brazilian Portuguese: Internal Consistency, Reliability, and Construct and Structural Validity.

Science.gov (United States)

Ferreira, Mariana Cândido; Björklund, Martin; Dach, Fabiola; Chaves, Thais Cristina

The purpose of this study was to adapt and evaluate the psychometric properties of the ProFitMap-neck to Brazilian Portuguese. The cross-cultural adaptation consisted of 5 stages, and 180 female patients with chronic neck pain participated in the study. A subsample (n = 30) answered the pretest, and another subsample (n = 100) answered the questionnaire a second time. Internal consistency, test-retest reliability, and construct validity (hypothesis testing and structural validity) were estimated. For construct validity, the scores of the questionnaire were correlated with the Neck Disability Index (NDI), and the Hospital Anxiety and Depression Scale (HADS), the Tampa Scale of Kinesiophobia (TSK), and the 36-item Short-Form Health Survey (SF-36). Internal consistency was determined by adequate Cronbach's α values (α > 0.70). Strong reliability was identified by high intraclass correlation coefficients (ICC > 0.75). Construct validity was identified by moderate and strong correlations of the Br-ProFitMap-neck with total NDI score (-0.56 50%, Kaiser-Meyer-Olkin index > 0.50, eigenvalue > 1, and factor loadings > 0.2. Br-ProFitMap-neck had adequate psychometric properties and can be used in clinical settings, as well as research, in patients with chronic neck pain. Copyright © 2017. Published by Elsevier Inc.
Test-retest reliability of the 20-sec Wingate test to assess anaerobic power in children with cerebral palsy

NARCIS (Netherlands)

Dallmeijer, A.J.; Scholtes, V.A.B.; Brehm, M.A.; Becher, J.G.

2013-01-01

OBJECTIVE: The aim of this study was to determine the test-retest reliability of the 20-sec Wingate anaerobic test in children with cerebral palsy. DESIGN: Participants were 22 ambulant children with cerebral palsy, with Gross Motor Function Classification System levels I (limitations in advanced
Test-Retest Reliability of the 20-sec Wingate Test to Assess Anaerobic Power in Children with Cerebral Palsy

NARCIS (Netherlands)

Dallmeijer, Annet J.; Scholtes, Vanessa A. B.; Brehm, Merel-Anne; Becher, Jules G.

2013-01-01

Objective: The aim of this study was to determine the test-retest reliability of the 20-sec Wingate anaerobic test in children with cerebral palsy. Design: Participants were 22 ambulant children with cerebral palsy, with Gross Motor Function Classification System levels I (limitations in advanced
Dual conception of risk in the Iowa Gambling Task: effects of sleep deprivation and test-retest gap.

Science.gov (United States)

Singh, Varsha

2013-01-01

Risk in the Iowa Gambling Task (IGT) is often understood in terms of intertemporal choices, i.e., preference for immediate outcomes in favor of delayed outcomes is considered risky decision making. According to behavioral economics, healthy decision makers are expected to refrain from choosing the short-sighted immediate gain because, over time (10 trials of the IGT), the immediate gains result in a long term loss (net loss). Instead decision makers are expected to maximize their gains by choosing options that, over time (10 trials), result in delayed or long term gains (net gain). However, task choices are sometimes made on the basis of the frequency of reward and punishment such that frequent rewards/infrequent punishments are favored over infrequent rewards/frequent punishments. The presence of these two attributes (intertemporality and frequency of reward) in IGT decision making may correspond to the emotion-cognition dichotomy and reflect a dual conception of risk. Decision making on the basis of the two attributes was tested under two conditions: delay in retest and sleep deprivation. An interaction between sleep deprivation and time delay was expected to attenuate the difference between the two attributes. Participants were 40 male university students. Analysis of the effects of IGT attribute type (intertemporal vs. frequency of reinforcement), sleep deprivation (sleep deprivation vs. no sleep deprivation), and test-retest gap (short vs. long delay) showed a significant within-subjects effect of IGT attribute type thus confirming the difference between the two attributes. Sleep deprivation had no effect on the attributes, but test-retest gap and the three-way interaction between attribute type, test-retest gap, and sleep deprivation were significantly different. Post-hoc tests revealed that sleep deprivation and short test-retest gap attenuated the difference between the two attributes. Furthermore, the results showed an expected trend of increase in
Adaptation, test-retest reliability, and construct validity of the Physical Activity Neighborhood Environment Scale in Nigeria (PANES-N).

Science.gov (United States)

Oyeyemi, Adewale L; Sallis, James F; Oyeyemi, Adetoyeje Y; Amin, Mariam M; De Bourdeaudhuij, Ilse; Deforche, Benedicte

2013-11-01

This study adapted the Physical Activity Neighborhood Environment Scale (PANES) to the Nigerian context and assessed the test-retest reliability and construct validity of the Nigerian version (PANES-N). A multidisciplinary panel of experts adapted the original PANES to reflect the built and social environment of Nigeria. The adapted PANES was subjected to cognitive testing and test-retest reliability assessment in a diverse sample of Nigerian adults (N = 132) from different neighborhood types. Intraclass correlation coefficients (ICC) were used to assess test-retest reliability, and construct validity was investigated with analysis of covariance for differences in environmental attributes between neighborhoods. Four of the 17 items on the original PANES were significantly modified, 3 were removed and 2 new items were incorporated into the final version of the adapted PANES-N. Test-retest reliability was substantial to almost perfect (ICC = 0.62-1.00) for all items on the PANES-N, and residents of neighborhoods in the inner city reported higher residential density, land use mix and safety, but lower pedestrian facilities and aesthetics than did residents of government reserved area/new layout neighborhoods. The PANES-N appears promising for assessing environmental perceptions related to physical activity in Nigeria, but further testing is required to assess its applicability across Africa.
Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

Science.gov (United States)

Mills, Tamara L; Holm, Margo B; Schmeler, Mark

2007-01-01

The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.

How Well Does the Sum Score Summarize the Test? Summability as a Measure of Internal Consistency

NARCIS (Netherlands)

Goeman, J.J.; De Jong, N.H.

2018-01-01

Many researchers use Cronbach's alpha to demonstrate internal consistency, even though it has been shown numerous times that Cronbach's alpha is not suitable for this. Because the intention of questionnaire and test constructers is to summarize the test by its overall sum score, we advocate
The test-retest reliability of anatomical co-ordinate axes definition for the quantification of lower extremity kinematics during running.

Science.gov (United States)

Sinclair, Jonathan; Taylor, Paul John; Greenhalgh, Andrew; Edmundson, Christopher James; Brooks, Darrell; Hobbs, Sarah Jane

2012-12-01

Three-dimensional (3-D) kinematic analyses are used widely in both sport and clinical examinations. However, this procedure depends on reliable palpation of anatomical landmarks, and mal-positioning of markers between sessions may result in improperly defined segment co-ordinate system axes, which will produce inconsistent joint rotations. This has led some to question the efficacy of this technique. The aim of the current investigation was to assess the reliability of the anatomical frame definition when quantifying 3-D kinematics of the lower extremities during running. Ten participants completed five successful running trials at 4.0 m·s⁻¹ ± 5%. 3-D angular joint kinematics parameters from the hip, knee and ankle were collected using an eight-camera motion analysis system. Two static calibration trials were captured. The first (test) was conducted prior to the running trials, following which anatomical landmarks were removed. The second was obtained following completion of the running trials, where anatomical landmarks were re-positioned (retest). Paired samples t-tests were used to compare 3-D kinematic parameters quantified using the two static trials, and intraclass correlations were employed to examine the similarities between the sagittal, coronal and transverse plane waveforms. The results indicate that no significant (p > 0.05) differences were found between test and retest 3-D kinematic parameters and strong (R² ≥ 0.87) correlations were observed between test and retest waveforms. Based on the results obtained from this investigation, it appears that the anatomical co-ordinate axes of the lower extremities can be defined reliably, thus confirming the efficacy of studies using this technique.
Test-Retest Reliability of a Serious Game for Delirium Screening in the Emergency Department.

Science.gov (United States)

Tong, Tiffany; Chignell, Mark; Tierney, Mary C; Lee, Jacques S

2016-01-01

Introduction: Cognitive screening in settings such as emergency departments (ED) is frequently carried out using paper-and-pencil tests that require administration by trained staff. These assessments often compete with other clinical duties and thus may not be routinely administered in these busy settings. Literature has shown that cognitive impairments such as dementia and delirium are often missed in older ED patients. Failure to recognize delirium can have devastating consequences including increased mortality (Kakuma et al., 2003). Given the demands on emergency staff, an automated cognitive test to screen for delirium onset could be a valuable tool to support delirium prevention and management. In earlier research we examined the concurrent validity of a serious game, and carried out an initial assessment of its potential as a delirium screening tool (Tong et al., 2016). In this paper, we examine the test-retest reliability of the game, as it is an important criterion in a cognitive test for detecting risk of delirium onset. Objective: To demonstrate the test-retest reliability of the screening tool over time in a clinical sample of older emergency patients. A secondary objective is to assess whether there are practice effects that might make game performance unstable over repeated presentations. Materials and Methods: Adults over the age of 70 were recruited from a hospital ED. Each patient played our serious game in an initial session soon after they arrived in the ED, and in follow-up sessions conducted at 8-h intervals (for each participant there were up to five follow-up sessions, depending on how long the person stayed in the ED). Results: A total of 114 adults (61 females, 53 males) between the ages of 70 and 104 years (M = 81 years, SD = 7) participated in our study after screening out delirious patients. We observed a test-retest reliability of the serious game (as assessed by correlation r-values) between 0.5 and 0.8 across adjacent
Stability of FDG-PET Radiomics features - An integrated analysis of test-retest and inter-observer variability

Energy Technology Data Exchange (ETDEWEB)

Leijenaar, Ralph T. H.; Carvalho, Sara; Rios Velazquez, Emmanuel [Dept. of Radiation Oncology (MAASTRO), GROW-School for Oncology and Developmental Biology, Maastricht Univ. Medical Center, Maastricht (Netherlands)] [and others

2013-10-15

Purpose: Besides basic measurements such as maximum standardized uptake value (SUVmax) or SUVmean derived from 18F-FDG positron emission tomography (PET) scans, more advanced quantitative imaging features (i.e. 'Radiomics' features) are increasingly investigated for treatment monitoring, outcome prediction, or as potential biomarkers. With these prospected applications of Radiomics features, it is a requisite that they provide robust and reliable measurements. The aim of our study was therefore to perform an integrated stability analysis of a large number of PET-derived features in non-small cell lung carcinoma (NSCLC), based on both a test-retest and an inter-observer setup. Methods: Eleven NSCLC patients were included in the test-retest cohort. Patients underwent repeated PET imaging within a one-day interval, before any treatment was delivered. Lesions were delineated by applying a threshold of 50% of the maximum uptake value within the tumor. Twenty-three NSCLC patients were included in the inter-observer cohort. Patients underwent a diagnostic whole-body PET-computed tomography (CT). Lesions were manually delineated based on fused PET-CT, using a standardized clinical delineation protocol. Delineation was performed independently by five observers, blinded to each other. Fifteen first-order statistics, 39 descriptors of intensity volume histograms, eight geometric features and 44 textural features were extracted. For every feature, test-retest and inter-observer stability was assessed with the intra-class correlation coefficient (ICC) and the coefficient of variability, normalized to mean and range. Similarity between test-retest and inter-observer stability rankings of features was assessed with Spearman's rank correlation coefficient. Results: Results showed that the majority of assessed features had both a high test-retest (71%) and inter-observer (91%) stability in terms of their ICC. Overall, features more stable in repeated PET
Expanding the Reach of Participatory Risk Management: Testing an Online Decision-Aiding Framework for Informing Internally Consistent Choices.

Science.gov (United States)

Bessette, Douglas L; Campbell-Arvai, Victoria; Arvai, Joseph

2016-05-01

This article presents research aimed at developing and testing an online, multistakeholder decision-aiding framework for informing multiattribute risk management choices associated with energy development and climate change. The framework was designed to provide necessary background information and facilitate internally consistent choices, or choices that are in line with users' prioritized objectives. In order to test different components of the decision-aiding framework, a six-part, 2 × 2 × 2 factorial experiment was conducted, yielding eight treatment scenarios. The three factors included: (1) whether or not users could construct their own alternatives; (2) the level of detail regarding the composition of alternatives users would evaluate; and (3) the way in which a final choice between users' own constructed (or highest-ranked) portfolio and an internally consistent portfolio was presented. Participants' self-reports revealed the framework was easy to use and providing an opportunity to develop one's own risk-management alternatives (Factor 1) led to the highest knowledge gains. Empirical measures showed the internal consistency of users' decisions across all treatments to be lower than expected and confirmed that providing information about alternatives' composition (Factor 2) resulted in the least internally consistent choices. At the same time, those users who did not develop their own alternatives and were not shown detailed information about the composition of alternatives believed their choices to be the most internally consistent. These results raise concerns about how the amount of information provided and the ability to construct alternatives may inversely affect users' real and perceived internal consistency. © 2015 Society for Risk Analysis.
Interrater and Test-Retest Reliability and Minimal Detectable Change of the Balance Evaluation Systems Test (BESTest) and Subsystems With Community-Dwelling Older Adults.

Science.gov (United States)

Wang-Hsu, Elizabeth; Smith, Susan S

2017-01-10

Falls are a common cause of injuries and hospital admissions in older adults. Balance limitation is a potentially modifiable factor contributing to falls. The Balance Evaluation Systems Test (BESTest), a clinical balance measure, categorizes balance into 6 underlying subsystems. Each of the subsystems is scored individually and summed to obtain a total score. The reliability of the BESTest and its individual subsystems has been reported in patients with various neurological disorders and cancer survivors. However, the reliability and minimal detectable change (MDC) of the BESTest with community-dwelling older adults have not been reported. The purposes of our study were to (1) determine the interrater and test-retest reliability of the BESTest total and subsystem scores; and (2) estimate the MDC of the BESTest and its individual subsystem scores with community-dwelling older adults. We used a prospective cohort methodological design. Community-dwelling older adults (N = 70; aged 70-94 years; mean = 85.0 [5.5] years) were recruited from a senior independent living community. Trained testers (N = 3) administered the BESTest. All participants were tested with the BESTest by the same tester initially and then retested 7 to 14 days later. With 32 of the participants, a second tester concurrently scored the retest for interrater reliability. Testers were blinded to each other's scores. Intraclass correlation coefficients [ICC(2,1)] were used to determine the interrater and test-retest reliability. Test-retest reliability was also analyzed using method error and the associated coefficients of variation (CVME). MDC was calculated using standard error of measurement. Interrater reliability (N = 32) of the BESTest total score was ICC(2, 1) = 0.97 (95% confidence interval [CI], 0.94-0.99). The ICCs for the individual subsystem scores ranged from 0.85 to 0.94. Test-retest reliability (N = 70) of the BESTest total score was ICC(2,1) = 0.93 (95% CI, 0.89-0.96). ICCs for the
Test-retest reliability and predictors of unreliable reporting for a sexual behavior questionnaire for U.S. men.

Science.gov (United States)

Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R

2010-12-01

Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.
Reliability of Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory in a test-retest design.

Science.gov (United States)

Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian

2014-02-01

The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of ≥ .60. Cohen's kappa indicated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.
Test-Retest Reliability of Diffusion Tensor Imaging in Huntington's Disease.

Science.gov (United States)

Cole, James H; Farmer, Ruth E; Rees, Elin M; Johnson, Hans J; Frost, Chris; Scahill, Rachael I; Hobbs, Nicola Z

2014-03-21

Diffusion tensor imaging (DTI) has shown microstructural abnormalities in patients with Huntington's Disease (HD) and work is underway to characterise how these abnormalities change with disease progression. Using methods that will be applied in longitudinal research, we sought to establish the reliability of DTI in early HD patients and controls. Test-retest reliability, quantified using the intraclass correlation coefficient (ICC), was assessed using region-of-interest (ROI)-based white matter atlas and voxelwise approaches on repeat scan data from 22 participants (10 early HD, 12 controls). T1 data was used to generate further ROIs for analysis in a reduced sample of 18 participants. The results suggest that fractional anisotropy (FA) and other diffusivity metrics are generally highly reliable, with ICCs indicating considerably lower within-subject compared to between-subject variability in both HD patients and controls. Where ICC was low, particularly for the diffusivity measures in the caudate and putamen, this was partly influenced by outliers. The analysis suggests that the specific DTI methods used here are appropriate for cross-sectional research in HD, and give confidence that they can also be applied longitudinally, although this requires further investigation. An important caveat for DTI studies is that test-retest reliability may not be evenly distributed throughout the brain whereby highly anisotropic white matter regions tended to show lower relative within-subject variability than other white or grey matter regions.
Test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy.

Science.gov (United States)

Savva, Christos; Giakas, Giannis; Efstathiou, Michalis; Karagiannis, Christos

2014-01-01

The purpose of this study was to evaluate the test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy (CR). A convenience sample of 19 participants (14 men and 5 women; mean ± SD age, 50.5 ± 12 years) with CR was measured using a Jamar hydraulic hand dynamometer by the same rater in 2 different testing sessions with an interval of 7 days between sessions. Data collection procedures followed standardized grip strength testing guidelines established by the American Society of Hand Therapists. During the repeated measures, patients were advised to rest their upper limb in the standardized arm position and encouraged to exert 3 maximum gripping efforts. The mean value of the 3 efforts (measured in kilogram force [Kgf]) was used for data analysis. The intraclass correlation coefficient, SEM, and the Bland-Altman plot were used to estimate test-retest reliability and measurement precision. Grip strength measurement in CR demonstrated an intraclass correlation coefficient of 0.976, suggesting excellent test-retest reliability. The small SEM in both testing sessions (SEM1, 2.41 Kgf; SEM2, 2.51 Kgf) as well as the narrow width of the 95% limits of agreement (-4.9 to 4.4 Kgf) in the Bland-Altman plot reflected precise measurements of grip strength on both occasions. Excellent test-retest reliability for grip strength measurement was found in patients with CR, demonstrating that a hydraulic hand dynamometer could be used as an outcome measure for these patients. Copyright © 2014 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
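The 95% limits of agreement mentioned above can be sketched as follows. This is a minimal illustration, not the study's analysis: the function name `bland_altman_limits` and the sample values are hypothetical, and the conventional definition (mean difference ± 1.96 × SD of the paired differences) is assumed.

```python
from statistics import mean, stdev


def bland_altman_limits(test, retest):
    """Return (bias, lower, upper) for paired test and retest measurements.

    Hypothetical helper; bias is the mean retest-minus-test difference and
    the limits are bias +/- 1.96 * sample SD of the differences.
    """
    diffs = [b - a for a, b in zip(test, retest)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Made-up grip strength values (Kgf) for four participants, two sessions.
session1 = [10.0, 20.0, 30.0, 40.0]
session2 = [11.0, 19.0, 31.0, 39.0]
bias, lower, upper = bland_altman_limits(session1, session2)
```

A bias near zero with narrow limits, as reported in the abstract, indicates that repeated measurements rarely differ by more than a small, known amount.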
Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease

NARCIS (Netherlands)

Strouwen, C.; Molenaar, E.A.; Keus, S.H.; Munks, L.; Bloem, B.R.; Nieuwboer, A.

2016-01-01

BACKGROUND: Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains
Establishing the Test-Retest Reliability & Concurrent Validity for the Repeat Ice Skating Test (RIST) in Adolescent Male Ice Hockey Players

Science.gov (United States)

Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William

2012-01-01

In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…
Test-retest reliability of Brazilian version of Memorial Symptom Assessment Scale for assessing symptoms in cancer patients.

Science.gov (United States)

Menezes, Josiane Roberta de; Luvisaro, Bianca Maria Oliveira; Rodrigues, Claudia Fernandes; Muzi, Camila Drumond; Guimarães, Raphael Mendonça

2017-01-01

To assess the test-retest reliability of the Memorial Symptom Assessment Scale translated and culturally adapted into Brazilian Portuguese. The scale was administered in interview format to 190 patients with various types of cancer hospitalized in the clinical and surgical sectors of the Instituto Nacional de Câncer José de Alencar Gomes da Silva and readministered to 58 patients. Test-retest data were entered into a Microsoft Excel spreadsheet by independent double data entry and analyzed using the weighted Kappa. The test-retest reliability of the scale was satisfactory. The weighted Kappa values obtained for each scale item were adequate, the highest being 0.96 and the lowest 0.69. Kappa was also evaluated for the subscales, with values of 0.84 for high-frequency physical symptoms, 0.81 for low-frequency physical symptoms, 0.81 for psychological symptoms, and 0.78 for the Global Distress Index. The high level of reliability suggests that the measurement process for the Memorial Symptom Assessment Scale was adequate.
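For readers unfamiliar with the statistic used in this record, the following is a minimal sketch of a weighted Kappa for ordinal test-retest ratings. It assumes linear disagreement weights and ratings coded as integers from 0; the abstract does not say which weighting scheme was applied, and the function name is illustrative only.

```python
def weighted_kappa(ratings1, ratings2, n_categories):
    """Linearly weighted Cohen's kappa for ratings coded 0..n_categories-1."""
    n = len(ratings1)
    # Observed joint proportions of (first rating, second rating).
    observed = [[0.0] * n_categories for _ in range(n_categories)]
    for a, b in zip(ratings1, ratings2):
        observed[a][b] += 1.0 / n
    row = [sum(observed[i]) for i in range(n_categories)]
    col = [sum(observed[i][j] for i in range(n_categories))
           for j in range(n_categories)]
    # Weighted observed and chance-expected disagreement.
    num = 0.0
    den = 0.0
    for i in range(n_categories):
        for j in range(n_categories):
            w = abs(i - j) / (n_categories - 1)  # linear disagreement weight
            num += w * observed[i][j]
            den += w * row[i] * col[j]
    return 1.0 - num / den

# Hypothetical test and retest ratings on a 4-point item.
test = [0, 1, 2, 3, 1, 2]
retest = [0, 1, 2, 3, 2, 2]
print(round(weighted_kappa(test, retest, 4), 2))
```

With weights of this kind, a retest answer one category away from the first answer is penalised less than an answer three categories away, which is why the weighted form suits ordinal symptom scales.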
Test-retest and interobserver reliability of quantitative sensory testing according to the protocol of the German Research Network on Neuropathic Pain (DFNS): a multi-centre study.

Science.gov (United States)

Geber, Christian; Klein, Thomas; Azad, Shahnaz; Birklein, Frank; Gierthmühlen, Janne; Huge, Volker; Lauchart, Meike; Nitzsche, Dorothee; Stengel, Maike; Valet, Michael; Baron, Ralf; Maier, Christoph; Tölle, Thomas; Treede, Rolf-Detlef

2011-03-01

Quantitative sensory testing (QST) is an instrument to assess positive and negative sensory signs, helping to identify mechanisms underlying pathologic pain conditions. In this study, we evaluated the test-retest reliability (TR-R) and the interobserver reliability (IO-R) of QST in patients with sensory disturbances of different etiologies. In 4 centres, 60 patients (37 male and 23 female, 56.4 ± 1.9 years) with lesions or diseases of the somatosensory system were included. QST comprised 13 parameters including detection and pain thresholds for thermal and mechanical stimuli. QST was performed in the clinically most affected test area and a less or unaffected control area in a morning and an afternoon session on 2 consecutive days by examiner pairs (4 QSTs/patient). For both TR-R and IO-R, there were high correlations (r = 0.80-0.93) at the affected test area, except for wind-up ratio (TR-R: r = 0.67; IO-R: r = 0.56) and paradoxical heat sensations (TR-R: r = 0.35; IO-R: r = 0.44). Mean IO-R (r = 0.83, 31% unexplained variance) was slightly lower than TR-R (r = 0.86, 26% unexplained variance, P [...] test area (TR-R: r = 0.86; IO-R: r = 0.83) than in the control area (TR-R: r = 0.79; IO-R: r = 0.71, each P [...] reliability of QST. We conclude that standardized QST performed by trained examiners is a valuable diagnostic instrument with good test-retest and interobserver reliability within 2 days. With standardized training, observer bias is much lower than random variance. Quantitative sensory testing performed by trained examiners is a valuable diagnostic instrument with good interobserver and test-retest reliability for use in patients with sensory disturbances of different etiologies to help identify mechanisms of neuropathic and non-neuropathic pain. Copyright © 2010 International Association for the Study of Pain. Published by Elsevier B.V. All rights reserved.
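The "unexplained variance" percentages quoted alongside the mean correlations in this abstract are consistent with the usual relation 1 − r². The short sketch below reproduces that arithmetic; the helper name is hypothetical, and the reading of the figures as 1 − r² is an inference from the numbers rather than something the abstract states.

```python
def unexplained_variance(r):
    """Share of variance not shared between two measurements: 1 - r squared."""
    return 1.0 - r ** 2

for label, r in [("IO-R", 0.83), ("TR-R", 0.86)]:
    print(f"{label}: r = {r:.2f}, unexplained variance = {unexplained_variance(r):.0%}")
```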
Test–Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study

Directory of Open Access Journals (Sweden)

Clare E. Palmer

2018-01-01

Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington’s disease (HD) and Parkinson’s disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction; however, it is imperative that these show high test–retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test–retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test–retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test–retest reliability; however, the combined SSRT interference effect showed poor test–retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Test-Retest Reliability of Self-Reported Sexual Health Measures among US Hispanic Adolescents

Science.gov (United States)

Jerman, Petra; Berglas, Nancy F.; Rohrbach, Louise A.; Constantine, Norman A.

2016-01-01

Objective: Although Hispanic adolescents in the USA are often the focus of sexual health interventions, their response to survey measures has rarely been assessed within evaluation studies. This study documents the test-retest reliability of a wide range of self-reported sexual health values, attitudes, knowledge and behaviours among Hispanic…
Temporal stability of the Francis Scale of Attitude toward Christianity short-form: test-retest data over one week.

Science.gov (United States)

Lewis, Christopher Alan; Cruise, Sharon Mary; McGuckin, Conor

2005-04-01

This study evaluated the test-retest reliability of the Francis Scale of Attitude toward Christianity short-form. Thirty-nine Northern Irish undergraduate students completed the measure on two occasions separated by one week. Stability across the two administrations was high, r = .92, and there was no significant change between Time 1 (M = 25.2, SD = 5.4) and Time 2 (M = 25.7, SD = 6.2). These data support the short-term test-retest reliability of the Francis Scale of Attitude toward Christianity short-form.
Interrater and test-retest reliability and validity of the Norwegian version of the BESTest and mini-BESTest in people with increased risk of falling.

Science.gov (United States)

Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L

2017-04-20

The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Fall Efficacy Scale-International (FES-I). It was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or multiple sclerosis) were assessed twice by two raters. Relative reliability was analysed with the Intraclass Correlation Coefficient (ICC), and absolute reliability with the standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC95 = 5.0) and test-retest reliability (rater A/rater B: ICC = 0.89/0.89, SEM = 3.9/4.3, SDC95 = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC95 = 3.3) and test-retest reliability (rater A/rater B: ICC = 0.85/0.84, SEM = 1.8/1.9, SDC95 = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho -0.51 and -0.50, p [...] test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinical practice and in research.
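The smallest detectable change figures in this abstract can be approximated from the reported SEM values with the conventional formula SDC95 = 1.96 × sqrt(2) × SEM. The sketch below assumes that formula (the abstract does not spell it out) and uses a hypothetical function name; because the published SEMs are rounded, some results may differ from the reported SDC95 in the last digit.

```python
import math

def sdc95(sem):
    """Smallest detectable change at 95% confidence, assuming 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * sem

# SEM values as reported in the abstract above.
for label, sem in [("BESTest interrater", 1.79),
                   ("Mini-BESTest interrater", 1.19),
                   ("BESTest test-retest, rater A", 3.9)]:
    print(f"{label}: SEM = {sem}, SDC95 = {sdc95(sem):.1f}")
```

A change score smaller than the SDC95 cannot be distinguished from measurement error for an individual, which is why the statistic is reported next to the ICC.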
Test-Retest Reliability of Handgrip Strength as an Outcome Measure in Patients With Symptoms of Shoulder Impingement Syndrome.

Science.gov (United States)

Savva, Christos; Mougiaris, Paraskevas; Xadjimichael, Christoforos; Karagiannis, Christos; Efstathiou, Michalis

The purpose of this study was to investigate the degree of test-retest reliability of grip strength measurement using a hand dynamometer in patients with shoulder impingement syndrome. A total of 19 patients (10 women and 9 men; mean ± standard deviation age, 33.2 ± 12.9 years; range 18-59 years) with shoulder impingement syndrome were measured using a hand dynamometer by the same data collector in 2 different testing sessions with a 7-day interval. During each session, patients were encouraged to exert 3 maximal isometric contractions on the affected hand and the mean value of the 3 efforts (measured in kilogram-force [Kgf]) was used for data analysis. The intraclass correlation coefficient (ICC(2,1)) as well as the standard error of measurement (SEM) and Bland-Altman plot were used to estimate the degree of test-retest reliability and the measurement error, respectively. Grip strength data analysis revealed an ICC(2,1) score of 0.94, which, based on the Shrout classification, is considered as excellent test-retest reliability of grip strength measurement. The small values of SEMs reported in both sessions (SEM1, 2.55 Kgf; SEM2, 2.39 Kgf) and the small width of the 95% limits of agreement in the Bland-Altman plot (ranging from -7.39 Kgf to 7.03 Kgf) reflected the measurement precision and the narrow variation of the differences during the 2 testing sessions. Results from this study identified excellent test-retest reliability of grip strength measurement in shoulder impingement syndrome, indicating its potential use as an outcome measure in clinical practice. Copyright © 2018. Published by Elsevier Inc.
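The two-way random-effects, absolute-agreement, single-measurement ICC named in this abstract can be computed from the mean squares of a subjects × sessions table. The following is a sketch of that calculation in the Shrout and Fleiss formulation; the function name is hypothetical, and the check data are the small 6 × 4 ratings table from Shrout and Fleiss (1979), not data from this study.

```python
def icc_2_1(scores):
    """ICC(2,1) from a table with one row per subject, one column per session or rater."""
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of sessions or raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    # Sums of squares for subjects, sessions, and the total.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    # Mean squares: between subjects, between sessions, residual error.
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )

# Worked-example table from Shrout and Fleiss (1979): 6 targets, 4 judges.
example = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(example), 2))
```

Because the session mean square enters the denominator, a systematic shift between sessions lowers this coefficient, which makes it a stricter choice for test-retest designs than a consistency-type ICC.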
Temporal Stability of Strength-Based Assessments: Test-Retest Reliability of Student and Teacher Reports

Science.gov (United States)

Romer, Natalie; Merrell, Kenneth W.

2013-01-01

This study focused on evaluating the temporal stability of self-reported and teacher-reported perceptions of students' social and emotional skills and assets. We used a test-retest reliability procedure over repeated administrations of the child, adolescent, and teacher versions of the "Social-Emotional Assets and Resilience Scales".…

Influences on and Limitations of Classical Test Theory Reliability Estimates.

Science.gov (United States)

Arnold, Margery E.

It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…
The interrater and test-retest reliability of the Home Falls and Accidents Screening Tool (HOME FAST) in Malaysia: Using raters with a range of professional backgrounds.

Science.gov (United States)

Romli, Muhammad Hibatullah; Mackenzie, Lynette; Lovarini, Meryl; Tan, Maw Pin; Clemson, Lindy

2017-06-01

Falls can be a devastating issue for older people living in the community, including those living in Malaysia. Health professionals and community members have a responsibility to ensure that older people have a safe home environment to reduce the risk of falls. Using a standardised screening tool is beneficial to intervene early with this group. The Home Falls and Accidents Screening Tool (HOME FAST) should be considered for this purpose; however, its use in Malaysia has not been studied. Therefore, the aim of this study was to evaluate the interrater and test-retest reliability of the HOME FAST with multiple professionals in the Malaysian context. A cross-sectional design was used to evaluate interrater reliability where the HOME FAST was used simultaneously in the homes of older people by 2 raters and a prospective design was used to evaluate test-retest reliability with a separate group of older people at different times in their homes. Both studies took place in an urban area of Kuala Lumpur. Professionals from 9 professional backgrounds participated as raters in this study, and a group of 51 community older people were recruited for the interrater reliability study and another group of 30 for the test-retest reliability study. The overall agreement was moderate for interrater reliability and good for test-retest reliability. The HOME FAST was consistently rated by different professionals, and no bias was found among the multiple raters. The HOME FAST can be used with confidence by a variety of professionals across different settings. The HOME FAST can become a universal tool to screen for home hazards related to falls. © 2017 John Wiley & Sons, Ltd.
Influences on the Test-Retest Reliability of Functional Connectivity MRI and its Relationship with Behavioral Utility.

Science.gov (United States)

Noble, Stephanie; Spann, Marisa N; Tokoglu, Fuyuze; Shen, Xilin; Constable, R Todd; Scheinost, Dustin

2017-11-01

Best practices are currently being developed for the acquisition and processing of resting-state magnetic resonance imaging data used to estimate brain functional organization-or "functional connectivity." Standards have been proposed based on test-retest reliability, but open questions remain. These include how amount of data per subject influences whole-brain reliability, the influence of increasing runs versus sessions, the spatial distribution of reliability, the reliability of multivariate methods, and, crucially, how reliability maps onto prediction of behavior. We collected a dataset of 12 extensively sampled individuals (144 min data each across 2 identically configured scanners) to assess test-retest reliability of whole-brain connectivity within the generalizability theory framework. We used Human Connectome Project data to replicate these analyses and relate reliability to behavioral prediction. Overall, the historical 5-min scan produced poor reliability averaged across connections. Increasing the number of sessions was more beneficial than increasing runs. Reliability was lowest for subcortical connections and highest for within-network cortical connections. Multivariate reliability was greater than univariate. Finally, reliability could not be used to improve prediction; these findings are among the first to underscore this distinction for functional connectivity. A comprehensive understanding of test-retest reliability, including its limitations, supports the development of best practices in the field. © The Author 2017. Published by Oxford University Press.
Hip abduction-adduction strength and one-leg hop tests: test-retest reliability and relationship to function in elite ice hockey players.

Science.gov (United States)

Kea, J; Kramer, J; Forwell, L; Birmingham, T

2001-08-01

Single group, test-retest. To determine: (1) hip abduction and adduction torques during concentric and eccentric muscle actions, (2) medial and lateral one-leg hop distances, (3) the test-retest reliability of these measurements, and (4) the relationship between isokinetic measures of hip muscle strength and hop distances in elite ice hockey players. The skating motion used in ice hockey requires strong contractions of the hip and knee musculature. However, baseline scores for hip strength and hop distances, their test-retest reliability, and measures of the extent to which these tests are related for this population are not available. The dominant leg of 27 men (mean age 20 ± 3 years) was tested on 2 occasions. Hip abduction and adduction movements were completed at an angular velocity of 60°/s, with the subject lying on the non-test side and the test leg moving vertically in the subject's coronal plane. One-leg hops requiring jumping from and landing on the same leg without losing balance were completed in the medial and lateral directions. Hip adduction torques were significantly greater than abduction torques during both concentric and eccentric muscle actions, while no significant difference was observed between medial and lateral hop distances. Although hop test scores produced excellent ICCs (> 0.75) when determined using scores on 1 occasion, torques needed to be averaged over 2 test occasions to reach this level. Correlations between the strength and hop tests ranged from slight to low (r = -0.26 to 0.27) and were characterized by wide 95% confidence intervals (-0.54 to 0.61). Isokinetic tests of hip abduction and adduction did not provide a strong indication of performance during sideways hop tests. Although isokinetic tests can provide a measure of muscular strength under specific test conditions, they should not be relied upon as a primary indicator of functional abilities or readiness to return to activity.
Establishing survey validity and reliability for American Indians through "think aloud" and test-retest methods.

Science.gov (United States)

Hauge, Cindy Horst; Jacobs-Knight, Jacque; Jensen, Jamie L; Burgess, Katherine M; Puumala, Susan E; Wilton, Georgiana; Hanson, Jessica D

2015-06-01

The purpose of this study was to use a mixed-methods approach to determine the validity and reliability of measurements used within an alcohol-exposed pregnancy prevention program for American Indian women. To develop validity, content experts provided input into the survey measures, and a "think aloud" methodology was conducted with 23 American Indian women. After revising the measurements based on this input, a test-retest was conducted with 79 American Indian women who were randomized to complete either the original measurements or the new, modified measurements. The test-retest revealed that some of the questions performed better for the modified version, whereas others appeared to be more reliable for the original version. The mixed-methods approach was a useful methodology for gathering feedback on survey measurements from American Indian participants and in indicating specific survey questions that needed to be modified for this population. © The Author(s) 2015.
Laterality judgments in people with low back pain--A cross-sectional observational and test-retest reliability study.

Science.gov (United States)

Linder, Martin; Michaelson, Peter; Röijezon, Ulrik

2016-02-01

Disruption of cortical representation, or body schema, has been indicated as a factor in the persistence and recurrence of low back pain (LBP). This has been observed through impaired laterality judgment ability and it has been suggested that this ability is affected in a spatial rather than anatomical manner. We compared laterality judgment performance of foot and trunk movements between people with LBP with or without leg pain and healthy controls, and investigated associations between test performance and pain. We also assessed the test-retest reliability of the Recognise Online™ software when used in a clinical and a home setting. Cross-sectional observational and test-retest study. Thirty individuals with LBP and 30 healthy controls performed judgment tests of foot and trunk laterality once supervised in a clinic and twice at home. No statistically significant group differences were found. LBP intensity was negatively related to trunk laterality accuracy (p = 0.019). Intraclass correlation values ranged from 0.51 to 0.91. Reaction time improved significantly between test occasions while accuracy did not. Laterality judgments were not impaired in subjects with LBP compared to controls. Further research may clarify the relationship between pain mechanisms in LBP and laterality judgment ability. Reliability values were mostly acceptable, with wide and low confidence intervals, suggesting test-retest reliability for Recognise Online™ could be questioned in this trial. A significant learning effect was observed which should be considered in clinical and research application of the test. Copyright © 2015 Elsevier Ltd. All rights reserved.
Test-retest reliability and four-week changes in cardiopulmonary fitness in stroke patients: evaluation using a robotics-assisted tilt table.

Science.gov (United States)

Saengsuwan, Jittima; Berger, Lucia; Schuster-Amft, Corina; Nef, Tobias; Hunt, Kenneth J

2016-09-06

Exercise testing devices for evaluating cardiopulmonary fitness in patients with severe disability after stroke are lacking, but we have adapted a robotics-assisted tilt table (RATT) for cardiopulmonary exercise testing (CPET). Using the RATT in a sample of patients after stroke, this study aimed to investigate test-retest reliability and repeatability of CPET and to prospectively investigate changes in cardiopulmonary outcomes over a period of four weeks. Stroke patients with all degrees of disability underwent 3 separate CPET sessions: 2 tests at baseline (TB1 and TB2) and 1 test at follow up (TF). TB1 and TB2 were at least 24 h apart. TB2 and TF were 4 weeks apart. A RATT equipped with force sensors in the thigh cuffs, a work rate estimation algorithm and a real-time visual feedback system was used to guide the patients' exercise work rate during CPET. Test-retest reliability and repeatability of CPET variables were analysed using paired t-tests, the intraclass correlation coefficient (ICC), the coefficient of variation (CoV), and Bland and Altman limits of agreement. Changes in cardiopulmonary fitness during four weeks were analysed using paired t-tests. Seventeen sub-acute and chronic stroke patients (age 62.7 ± 10.4 years [mean ± SD]; 8 females) completed the test sessions. The median time post stroke was 350 days. There were 4 severely disabled, 1 moderately disabled and 12 mildly disabled patients. For test-retest, there were no statistically significant differences between TB1 and TB2 for most CPET variables. Peak oxygen uptake, peak heart rate, peak work rate and oxygen uptake at the ventilatory anaerobic threshold (VAT) and respiratory compensation point (RCP) showed good to excellent test-retest reliability (ICC 0.65-0.94). For all CPET variables, CoV was 4.1-14.5 %. The mean difference was close to zero in most of the CPET variables. There were no significant changes in most cardiopulmonary performance parameters during the 4-week period
On the internal consistency of the term structure of forecasts of housing starts

DEFF Research Database (Denmark)

Pierdzioch, C.; Rulke, J. C.; Stadtmann, G.

2013-01-01

We use the term structure of forecasts of housing starts to test for rationality of forecasts. Our test is based on the idea that short-term and long-term forecasts should be internally consistent. We test the internal consistency of forecasts using data for Australia, Canada, Japan and the United...
Test-retest reliability and task order effects of emotional cognitive tests in healthy subjects.

Science.gov (United States)

Adams, Thomas; Pounder, Zoe; Preston, Sally; Hanson, Andy; Gallagher, Peter; Harmer, Catherine J; McAllister-Williams, R Hamish

2016-11-01

Little is known of the retest reliability of emotional cognitive tasks or the impact of using different tasks employing similar emotional stimuli within a battery. We investigated this in healthy subjects. We found improved overall performance in an emotional attentional blink task (EABT) with repeat testing at one hour and one week compared to baseline, but the impact of an emotional stimulus on performance was unchanged. Similarly, performance on a facial expression recognition task (FERT) was better one week after a baseline test, though the relative effect of specific emotions was unaltered. There was no effect of repeat testing on an emotional word categorising, recall and recognition task. We found no difference in performance in the FERT and EABT irrespective of task order. We concluded that it is possible to use emotional cognitive tasks in longitudinal studies and combine tasks using emotional facial stimuli in a single battery.
Forward lunge as a functional performance test in ACL deficient subjects: test-retest reliability

DEFF Research Database (Denmark)

Alkjaer, Tine; Henriksen, Marius; Dyhre-Poulsen, Poul

2009-01-01

The forward lunge movement may be used as a functional performance test of anterior cruciate ligament (ACL) deficient and reconstructed subjects. The purposes were 1) to determine the test-retest reliability of a forward lunge in healthy subjects and 2) to determine the required number of repetitions necessary to yield satisfactory reliability. Nineteen healthy subjects performed four trials of a forward lunge on two different days. The movement time, impulses of the ground reaction forces (IFz, IFy), knee joint kinematics and dynamics during the forward lunge were calculated. The relative reliability was determined by calculation of Intraclass Correlation Coefficients (ICC). The IFz, IFy and the positive work of the knee extensors showed excellent reliability (ICC > 0.75). All other variables demonstrated acceptable reliability (0.4 > ICC [...] reliability increased when more than...
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

Directory of Open Access Journals (Sweden)

Singh Amika S

2011-12-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

Science.gov (United States)

2011-01-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Test-retest and between-site reliability in a multicenter fMRI study.

Science.gov (United States)

Friedman, Lee; Stern, Hal; Brown, Gregory G; Mathalon, Daniel H; Turner, Jessica; Glover, Gary H; Gollub, Randy L; Lauriello, John; Lim, Kelvin O; Cannon, Tyrone; Greve, Douglas N; Bockholt, Henry Jeremy; Belger, Aysenil; Mueller, Bryon; Doty, Michael J; He, Jianchun; Wells, William; Smyth, Padhraic; Pieper, Steve; Kim, Seyoung; Kubicki, Marek; Vangel, Mark; Potkin, Steven G

2008-08-01

In the present report, estimates of test-retest and between-site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR-deconvolution analysis with FMRISTAT. Six functionally-derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast-to-noise-ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test-retest reliability was high, but initially, between-site reliability was low, indicating a strong contribution from site and site-by-subject variance. However, a number of factors that can markedly improve between-site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between-site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance to others on the best practices for future multicenter studies.
Test-retest reliability of the proposed DSM-5 eating disorder diagnostic criteria

Science.gov (United States)

Sysko, Robyn; Roberto, Christina A.; Barnes, Rachel D.; Grilo, Carlos M.; Attia, Evelyn; Walsh, B. Timothy

2012-01-01

The proposed DSM-5 classification scheme for eating disorders includes both major and minor changes to the existing DSM-IV diagnostic criteria. It is not known what effect these modifications will have on the ability to make reliable diagnoses. Two studies were conducted to evaluate the short-term test-retest reliability of the proposed DSM-5 eating disorder diagnoses: anorexia nervosa, bulimia nervosa, binge eating disorder, and feeding and eating conditions not elsewhere classified. Participants completed two independent telephone interviews with research assessors (n=70 Study 1; n=55 Study 2). Fair to substantial agreements (κ= 0.80 and 0.54) were observed across eating disorder diagnoses in Study 1 and Study 2, respectively. Acceptable rates of agreement were identified for the individual eating disorder diagnoses, including DSM-5 anorexia nervosa (κ’s of 0.81 to 0.97), bulimia nervosa (κ=0.84), binge eating disorder (κ’s of 0.75 and 0.61), and feeding and eating disorders not elsewhere classified (κ’s of 0.70 and 0.46). Further, improved short-term test-retest reliability was noted when using the DSM-5, in comparison to DSM-IV, criteria for binge eating disorder. Thus, these studies found that trained interviewers can reliably diagnose eating disorders using the proposed DSM-5 criteria; however, additional data from general practice settings and community samples are needed. PMID:22401974
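For readers unfamiliar with the κ statistic reported above, the following is a minimal sketch of Cohen's kappa for two interview occasions. The function name and the toy diagnosis labels are illustrative assumptions, not data or code from the study.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's kappa for two equal-length sequences of categorical ratings."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(first)
    # Observed agreement: share of cases given the same label both times.
    p_observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement: product of marginal proportions, summed over labels.
    counts_1, counts_2 = Counter(first), Counter(second)
    p_chance = sum(counts_1[label] * counts_2[label] for label in counts_1) / (n * n)
    if p_chance == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical diagnoses from two interviews of the same six participants.
interview_1 = ["AN", "BN", "BED", "BN", "AN", "BED"]
interview_2 = ["AN", "BN", "BED", "AN", "AN", "BED"]
kappa = cohens_kappa(interview_1, interview_2)
```

Here five of six diagnoses match (observed agreement 0.83) and chance agreement is 0.33, which gives κ = 0.75. Kappa is lower than raw agreement because it discounts matches expected by chance alone.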
Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

Science.gov (United States)

Tepe, Rodger; Tepe, Chabha

2015-03-01

To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
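As an illustration of the internal consistency coefficient reported above, here is a minimal sketch of Cronbach's α computed from a respondents-by-items table. The function name and toy responses are assumptions made for the example. KR-20 has the same form when every item is scored 0 or 1.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of item scores per respondent."""
    if len(rows) < 2 or len(rows[0]) < 2:
        raise ValueError("need at least two respondents and two items")
    k = len(rows[0])
    # Variance of each item across respondents.
    item_variances = [variance(column) for column in zip(*rows)]
    # Variance of the summed scale score across respondents.
    total_variance = variance([sum(row) for row in rows])
    if total_variance == 0:
        raise ValueError("total scores have no variance")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: three respondents, two items.
responses = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(responses)
```

With these toy responses each item has variance 1 and the total score has variance 3, so α = 2 × (1 − 2/3) ≈ 0.67.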
Delineating a Retesting Zone Using Receiver Operating Characteristic Analysis on Serial QuantiFERON Tuberculosis Test Results in US Healthcare Workers

Directory of Open Access Journals (Sweden)

Wendy Thanassi

2012-01-01

Objective. To find a statistically significant separation point for the QuantiFERON Gold In-Tube (QFT) interferon gamma release assay that could define an optimal “retesting zone” for use in serially tested low-risk populations who have test “reversions” from initially positive to subsequently negative results. Method. Using receiver operating characteristic (ROC) analysis to analyze retrospective data collected from 3 major hospitals, we searched for predictors of reversion until statistically significant separation points were revealed. A confirmatory regression analysis was performed on an additional sample. Results. In 575 initially positive US healthcare workers (HCWs), 300 (52.2%) had reversions, while 275 (47.8%) had two sequential positive tests. The most statistically significant (Kappa = 0.48, chi-square = 131.0, P<0.001) separation point identified by the ROC for predicting reversion was the tuberculosis antigen minus-nil (TBag-nil) value at 1.11 International Units per milliliter (IU/mL). The second separation point was found at TBag-nil at 0.72 IU/mL (Kappa = 0.16, chi-square = 8.2, P<0.01). The model was validated by the regression analysis of 287 HCWs. Conclusion. Reversion likelihood increases as the TBag-nil approaches the manufacturer's cut-point of 0.35 IU/mL. The most statistically significant separation point between those who test repeatedly positive and those who revert is 1.11 IU/mL. Clinicians should retest low-risk individuals with initial QFT results < 1.11 IU/mL.
Reliability, factor analysis and internal consistency calculation of the Insomnia Severity Index (ISI) in French and in English among Lebanese adolescents.

Science.gov (United States)

Chahoud, M; Chahine, R; Salameh, P; Sauleau, E A

2017-06-01

Our goal is to validate and to verify the reliability of the French and English versions of the Insomnia Severity Index (ISI) in Lebanese adolescents. A cross-sectional study was implemented. 104 Lebanese students aged between 14 and 19 years participated in the study. The English version of the questionnaire was distributed to English-speaking students and the French version was administered to French-speaking students. A scale (1 to 7 with 1 = very well understood and 7 = not at all) was used to identify the level of the students' understanding of each instruction, question and answer of the ISI. The scale's structural validity was assessed. The factor structure of ISI was evaluated by principal component analysis. The internal consistency of this scale was evaluated by Cronbach's alpha. To assess test-retest reliability the intraclass correlation coefficient (ICC) was used. The principal component analysis confirmed the presence of a two-component factor structure in the English version and a three-component factor structure in the French version with eigenvalues > 1. The English version of the ISI had an excellent internal consistency (α = 0.90), while the French version had a good internal consistency (α = 0.70). The ICC presented an excellent agreement in the French version (ICC = 0.914, CI = 0.856-0.949) and a good agreement in the English one (ICC = 0.762, CI = 0.481-0.890). The Bland-Altman plots of the two versions of the ISI showed that the responses over two weeks were comparable and very few outliers were detected. The results of our analyses reveal that both English and French versions of the ISI scale have good internal consistency and are reproducible and reliable. Therefore, the scale can be used to assess the prevalence of insomnia in Lebanese adolescents.
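The quantities usually drawn on a Bland-Altman plot can be sketched as follows: the mean test-retest difference (bias) and the 95% limits of agreement. This is a generic illustration, not the authors' analysis; the function name and scores are hypothetical.

```python
from statistics import mean, stdev


def bland_altman_limits(test, retest):
    """Return (bias, lower_limit, upper_limit) for paired test-retest scores."""
    if len(test) != len(retest) or len(test) < 2:
        raise ValueError("need two equal-length sequences with at least 2 pairs")
    differences = [b - a for a, b in zip(test, retest)]
    bias = mean(differences)
    # 95% limits of agreement: bias plus or minus 1.96 SD of the differences.
    half_width = 1.96 * stdev(differences)
    return bias, bias - half_width, bias + half_width


# Hypothetical total scores for four respondents on two occasions.
first_scores = [10, 12, 14, 16]
second_scores = [11, 12, 15, 16]
bias, lower, upper = bland_altman_limits(first_scores, second_scores)
```

Points falling outside the interval from `lower` to `upper` are the outliers that such a plot is typically used to spot.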
Test-retest reliability of trunk motor variability measured by large-array surface electromyography.

Science.gov (United States)

Abboud, Jacques; Nougarou, François; Loranger, Michel; Descarreaux, Martin

2015-01-01

The objective of this study was to evaluate the test-retest reliability of the trunk muscle activity distribution in asymptomatic participants during muscle fatigue using large-array surface electromyography (EMG). Trunk muscle activity distribution was evaluated twice, with 3 to 4 days between them, in 27 asymptomatic volunteers using large-array surface EMG. Motor variability, assessed with 2 different variables (the centroid coordinates of the root mean square map and the dispersion variable), was evaluated during a low back muscle fatigue task. Test-retest reliability of muscle activity distribution was obtained using Pearson correlation coefficients. A shift in the distribution of EMG amplitude toward the lateral-caudal region of the lumbar erector spinae induced by muscle fatigue was observed. Moderate to very strong correlations were found between both sessions in the last 3 phases of the fatigue task for both motor variability variables, whereas weak to moderate correlations were found in the first phases of the fatigue task only for the dispersion variable. These findings show that, in asymptomatic participants, patterns of EMG activity are less reliable in initial stages of muscle fatigue, whereas later stages are characterized by highly reliable patterns of EMG activity. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.
Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

Science.gov (United States)

Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh

2018-03-01

Evaluation of dichotic listening to digits is a common part of many studies for diagnosis and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. To compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. A repeated measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered on the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1 ) coefficients and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87). SEM and SEM% of the ear scores in TA
Assessment of lower urinary tract symptoms in women by a self-administered questionnaire: test-retest reliability

DEFF Research Database (Denmark)

Bernstein, Inge Thomsen; Sejr, T; Able, I

1996-01-01

A self-administered questionnaire assessing female lower urinary tract symptoms and their impact on quality of life is described and validated, on 56 females in six participating departments. The patients answered two identical questionnaires on separate occasions before treatment. Test-retest re...

A Test-Retest Analysis of the Vanderbilt Assessment for Leadership in Education in the USA

Science.gov (United States)

Minor, Elizabeth Covay; Porter, Andrew C.; Murphy, Joseph; Goldring, Ellen; Elliott, Stephen N.

2017-01-01

The Vanderbilt Assessment for Leadership in Education (VAL-ED) is a 360-degree learning-centered behaviors principal evaluation tool that includes ratings from the principal, supervisors, and teachers. The current study assesses the test-retest reliability of the VAL-ED for a sample of seven school districts as part of multiple validity and…
The Test-Retest Reliability of New Generation Power Indices of Wingate All-Out Test

Directory of Open Access Journals (Sweden)

Ozgur Ozkaya

2018-04-01

Although reliability correlations of traditional power indices of the Wingate test have been well documented, no study has analyzed new generation power indices based on milliseconds obtained from a Peak Bike. The purpose of this study was to investigate the retest reliability of new generation power indices. Thirty-two well-trained male athletes who were specialized in basketball, football, tennis, or track and field volunteered to take part in the study (age: 24.3 ± 2.2 years; body mass: 77 ± 8.3 kg; height: 180.3 ± 6.3 cm). Participants performed two Wingate all-out sessions on two separate days. Intra-class correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD) and coefficient of variation (CV) scores were analyzed based on the test and retest data. Reliability results of traditional power indices calculated based on 5-s means such as peak power, average power, power drop, and fatigue index ratio were similar to previous findings in the literature (ICC ≥ 0.94; CV ≤ 2.8%; SEM ≤ 12.28; SRD% ≤ 7.7%). New generation power indices such as peak power, average power, lowest power, power drop, fatigue index, power decline, maximum speed as rpm, and amount of total energy expenditure demonstrated high reliability (ICC ≥ 0.94; CV ≤ 4.3%; SEM ≤ 10.36; SRD% ≤ 8.8%). Time to peak power, time at maximum speed, and power at maximum speed showed a moderate level of reliability (ICC ≥ 0.73; CV ≤ 8.9%; SEM ≤ 63.01; SRD% ≤ 22.4%). The results of this study indicate that reliability correlations and SRD% of new generation power and fatigue-related indices are similar to those of traditional 5-s means. However, new time-related indices are very sensitive and moderately reliable.
Test-retest reliability of stride time variability while dual tasking in healthy and demented adults with frontotemporal degeneration

Directory of Open Access Journals (Sweden)

Herrmann Francois R

2011-07-01

Background: Although test-retest reliability of mean values of spatio-temporal gait parameters has been assessed while walking alone (i.e., single tasking), little is known about the test-retest reliability of stride time variability (STV) while performing an attention-demanding task (i.e., dual tasking). The objective of this study was to examine immediate test-retest reliability of STV while single and dual tasking in cognitively healthy older individuals (CHI) and in demented patients with frontotemporal degeneration (FTD). Methods: Based on a cross-sectional design, 69 community-dwelling CHI (mean age 75.5 ± 4.3; 43.5% women) and 14 demented patients with FTD (mean age 65.7 ± 9.8 years; 6.7% women) walked alone (without performing an additional task; i.e., single tasking) and while counting backward (CB) aloud starting from 50 (i.e., dual tasking). Each subject completed two trials for all the testing conditions. The mean value and the coefficient of variation (CoV) of stride time while walking alone and while CB at self-selected walking speed were measured using GAITRite® and SMTEC® footswitch systems. Results: ICC of mean value in CHI under both walking conditions were higher than ICC of demented patients with FTD and indicated perfect reliability (ICC > 0.80). Reliability of mean value was better while single tasking than dual tasking in CHI (ICC = 0.96 under single-task and ICC = 0.86 under dual-task), whereas it was the opposite in demented patients (ICC = 0.65 under single-task and ICC = 0.81 under dual-task). ICC of CoV was slight to poor whatever the group of participants and the walking condition (ICC ...). Conclusions: The immediate test-retest reliability of the mean value of stride time in single and dual tasking was good in older CHI as well as in demented patients with FTD. In contrast, the variability of stride time was low in both groups of participants.
Temporal stability of preferences and willingness to pay for natural areas in choice experiments: A test-retest

NARCIS (Netherlands)

Schaafsma, M.; Brouwer, R.; Liekens, I.; de Nocker, L.

2014-01-01

The main objective of this paper is to test the temporal stability of stated preferences and willingness to pay (WTP) values from a Choice Experiment (CE) in a test-retest. The same group of participants was asked the same choice tasks in an internet-based CE, conducted twice with a time interval of
Test-retest reliability of the Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale.

Science.gov (United States)

Gustafsson, Margareta; Blomberg, Karin; Holmefur, Marie

2015-07-01

The Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale evaluates the student nurses' perception of the learning environment and supervision within the clinical placement. It has never been tested in a replication study. The aim of the present study was to evaluate the test-retest reliability of the CLES + T scale. The CLES + T scale was administered twice to a group of 42 student nurses, with a one-week interval. Test-retest reliability was determined by calculations of Intraclass Correlation Coefficients (ICCs) and weighted Kappa coefficients. Standard Error of Measurements (SEM) and Smallest Detectable Difference (SDD) determined the precision of individual scores. Bland-Altman plots were created for analyses of systematic differences between the test occasions. The results of the study showed that the stability over time was good to excellent (ICC 0.88-0.96) in the sub-dimensions "Supervisory relationship", "Pedagogical atmosphere on the ward" and "Role of the nurse teacher". Measurements of "Premises of nursing on the ward" and "Leadership style of the manager" had lower but still acceptable stability (ICC 0.70-0.75). No systematic differences occurred between the test occasions. This study supports the usefulness of the CLES + T scale as a reliable measure of the student nurses' perception of the learning environment within the clinical placement at a hospital. Copyright © 2015 Elsevier Ltd. All rights reserved.
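The relationship between the ICC, the standard error of measurement and the smallest detectable difference can be sketched with the commonly used formulas SEM = SD × √(1 − ICC) and SDD = 1.96 × √2 × SEM. The abstract does not state the exact formulas the authors applied, so these, along with the function names and input values, are assumptions for illustration.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC."""
    if sd < 0 or not 0.0 <= icc <= 1.0:
        raise ValueError("sd must be non-negative and icc within [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_difference(sem, z=1.96):
    """Smallest change in one person's score that exceeds measurement error."""
    # sqrt(2) accounts for error being present in both measurements.
    return z * math.sqrt(2.0) * sem


# Hypothetical subscale: SD of 10 points and an ICC of 0.91.
sem = sem_from_icc(10.0, 0.91)
sdd = smallest_detectable_difference(sem)
```

With these inputs the SEM is about 3 points and the SDD about 8.3 points, meaning a change smaller than that in an individual score could not be distinguished from measurement noise at the 95% level.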
Factor structure, internal consistency and reliability of the Posttraumatic Stress Disorder Checklist (PCL): an exploratory study

Directory of Open Access Journals (Sweden)

Eduardo de Paula Lima

2012-01-01

INTRODUCTION: Posttraumatic stress disorder (PTSD) is an anxiety disorder resulting from exposure to traumatic events. The Posttraumatic Stress Disorder Checklist (PCL) is a self-report measure widely used to evaluate the presence of PTSD. OBJECTIVE: To investigate the internal consistency, temporal reliability and factor validity of the Portuguese language version of the PCL used in Brazil. METHODS: A total of 186 participants were recruited. The sample was heterogeneous with regard to occupation, sociodemographic data, mental health history, and exposure to traumatic events. Subjects answered the PCL on two occasions within a 15-day interval (range: 5-15 days). RESULTS: Cronbach’s alpha coefficients indicated high internal consistency for the total scale (0.91) and for the theoretical dimensions of the Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV) (0.83, 0.81, and 0.80). Temporal reliability (test-retest) was high and consistent for different cutoffs. Maximum likelihood exploratory factor analysis (EFA) was conducted and oblique rotation (Promax) was applied. The Kaiser-Meyer-Olkin (KMO) index (0.911) and Bartlett’s test of sphericity (χ² = 1,381.34, p...
Test-retest Agreement and Reliability of Quantitative Sensory Testing 1 Year After Breast Cancer Surgery

DEFF Research Database (Denmark)

Andersen, Kenneth Geving; Kehlet, Henrik; Aasvang, Eske Kvanner

2015-01-01

.5 SD) than within-patient variation (0.23 to 3.55 SD). There were no significant differences between pain and pain-free patients. The individual test-retest variability was higher on the operated side compared with the nonoperated side. DISCUSSION: The QST protocol reliability allows for group......OBJECTIVES: Quantitative sensory testing (QST) is used to assess sensory dysfunction and nerve damage by examining psychophysical responses to controlled, graded stimuli such as mechanical and thermal detection and pain thresholds. In the breast cancer population, 4 studies have used QST to examine...... persistent pain after breast cancer treatment, suggesting neuropathic pain being a prominent pain mechanism. However, the agreement and reliability of QST has not been described in the postsurgical breast cancer population, hindering exact interpretation of QST studies in this population. The aim...
Test-retest reliability of schizoaffective disorder compared with schizophrenia, bipolar disorder, and unipolar depression--a systematic review and meta-analysis.

Science.gov (United States)

Santelmann, Hanno; Franklin, Jeremy; Bußhoff, Jana; Baethge, Christopher

2015-11-01

Schizoaffective disorder is a frequent diagnosis, and its reliability is subject to ongoing discussion. We compared the diagnostic reliability of schizoaffective disorder with its main differential diagnoses. We systematically searched Medline, Embase, and PsycInfo for all studies on the test-retest reliability of the diagnosis of schizoaffective disorder as compared with schizophrenia, bipolar disorder, and unipolar depression. We used meta-analytic methods to describe and compare Cohen's kappa as well as positive and negative agreement. In addition, multiple pre-specified and post hoc subgroup and sensitivity analyses were carried out. Out of 4,415 studies screened, 49 studies were included. Test-retest reliability of schizoaffective disorder was consistently lower than that of schizophrenia (in 39 out of 42 studies), bipolar disorder (27/33), and unipolar depression (29/35). The mean difference in kappa between schizoaffective disorder and the other diagnoses was approximately 0.2, and mean Cohen's kappa for schizoaffective disorder was 0.50 (95% confidence interval: 0.40-0.59). While findings were unequivocal and homogeneous for schizoaffective disorder's diagnostic reliability relative to its three main differential diagnoses (dichotomous: smaller versus larger), heterogeneity was substantial for continuous measures, even after subgroup and sensitivity analyses. In clinical practice and research, schizoaffective disorder's comparatively low diagnostic reliability should lead to increased efforts to correctly diagnose the disorder. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
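Positive and negative agreement, mentioned above alongside Cohen's kappa, can be computed from a 2×2 table of two diagnostic occasions. The sketch below uses the usual specific-agreement formulas. The function name and counts are hypothetical and are not taken from the meta-analysis.

```python
def specific_agreement(both_positive, only_first, only_second, both_negative):
    """Return (positive_agreement, negative_agreement) for a 2x2 table.

    both_positive: diagnosis present on both occasions
    only_first / only_second: diagnosis present on one occasion only
    both_negative: diagnosis absent on both occasions
    """
    discordant = only_first + only_second
    if 2 * both_positive + discordant == 0 or 2 * both_negative + discordant == 0:
        raise ValueError("agreement is undefined for an empty margin")
    positive = 2 * both_positive / (2 * both_positive + discordant)
    negative = 2 * both_negative / (2 * both_negative + discordant)
    return positive, negative


# Hypothetical counts for 100 patients assessed twice.
positive_agreement, negative_agreement = specific_agreement(20, 5, 5, 70)
```

Reporting the two values separately is useful for a comparatively rare diagnosis, where high negative agreement can mask poor agreement on the positive cases.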
The Physical Activity Scale for Individuals with Physical Disabilities : test-retest reliability and comparison with an accelerometer

NARCIS (Netherlands)

van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem; van der Woude, Lucas

BACKGROUND: The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). METHODS: Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects'
Test-retest reliability of an interactive voice response (IVR) version of the EORTC QLQ-C30

NARCIS (Netherlands)

Lundy, J.J.; Coons, S.J.; Aaronson, N.K.

2015-01-01

Objective: The objective of this study was to assess the test-retest reliability of an interactive voice response (IVR) version of the European Organisation for Research and Treatment of Cancer (EORTC) QLQ-C30. Methods: A convenience sample of outpatient cancer clinic patients (n = 127) was asked to
Choice, internal consistency, and rationality

OpenAIRE

Aditi Bhattacharyya; Prasanta K. Pattanaik; Yongsheng Xu

2010-01-01

The classical theory of rational choice is built on several important internal consistency conditions. In recent years, the reasonableness of those internal consistency conditions has been questioned and criticized, and several responses to accommodate such criticisms have been proposed in the literature. This paper develops a general framework to accommodate the issues raised by the criticisms of classical rational choice theory, and examines the broad impact of these criticisms from both no...
Construct Validity and Test-Retest Reliability of the Walking Questionnaire in People With a Lower Limb Amputation

NARCIS (Netherlands)

de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

Objective: To investigate the construct validity and test-retest reliability of the Walking Questionnaire, a patient-reported measure of activity limitations in walking in people with a lower limb amputation. Design: Cross-sectional study. Setting: Outpatient department of a rehabilitation center.
Reliability and concurrent validity of a motor skill competence test among 4- to 12-year old children

NARCIS (Netherlands)

Hoeboer, Joris; Krijger-Hombergen, Michiel; Savelsbergh, Geert; De Vries, Sanne

2017-01-01

The purpose of this study was to examine the test-retest reliability, internal consistency and concurrent validity of the Athletic Skills Track (AST). During a regular PE lesson, 930 4- to 12-year old children (448 girls, 482 boys) completed two motor skill competence tests: (1) the
Development and initial validation of the internalization of Asian American stereotypes scale.

Science.gov (United States)

Shen, Frances C; Wang, Yu-Wei; Swanson, Jane L

2011-07-01

This research consists of four studies on the initial reliability and validity of the Internalization of Asian American Stereotypes Scale (IAASS), a self-report instrument that measures the degree Asian Americans have internalized racial stereotypes about their own group. The results from the exploratory and confirmatory factor analyses support a stable four-factor structure of the IAASS: Difficulties with English Language Communication, Pursuit of Prestigious Careers, Emotional Reservation, and Expected Academic Success. Evidence for concurrent and discriminant validity is presented. High internal-consistency and test-retest reliability estimates are reported. A discussion of how this scale can contribute to research and practice regarding internalized stereotyping among Asian Americans is provided.
Translation and Adaptation of Knee Injury and Osteoarthritis Outcome Score (KOOS) into Persian and Testing Persian Version Reliability Among Iranians with Osteoarthritis

Directory of Open Access Journals (Sweden)

Solaleh Saraei-Pour

2007-04-01

Objective: To obtain a reliable tool for measuring health-related quality of life among Iranians with knee osteoarthritis by translating and culturally adapting the Knee injury and Osteoarthritis Outcome Score (KOOS) to Persian and testing the reliability and internal consistency of the Iranian version. Materials & Methods: This was a non-experimental methodological study using convenience (non-probability) sampling. The KOOS was translated and culturally adapted to the Persian language and culture in three phases, following the IQOLA project. To examine test-retest reliability, the Iranian version of the KOOS was completed twice, at an interval of at least two days and at most one week, by 30 Iranian people with knee OA who had been referred with a physician's physiotherapy order to Municipality and 110 physiotherapy clinics of Tehran. Psychometric evaluation: the questionnaire data were scored and analyzed with SPSS software for test-retest reliability, absolute reliability, and subscale and item internal consistency. Results: Internal consistency, calculated with Cronbach's α, was high for all subscales (at least 0.76) except the "symptom" subscale, for which it was moderate, showing that the items of each subscale measured the same construct. Item internal consistency after correction for overlap was higher than the optimal value (0.4), demonstrating good item internal consistency, except for the items of the "symptom" subscale. SEM and ICC, used to evaluate absolute and test-retest reliability respectively, showed that all subscales had good test-retest reliability (0.7), and absolute reliability was also very good: the highest SEM calculated for the Persian version was 7.44, which is less than the Minimal Perceptible Clinical Improvement (MPCI), estimated at 8 to 10 for the KOOS questionnaire. Conclusion: With the Persian
Test--retest variability of Randot stereoacuity measures gathered in an unselected sample of UK primary school children.

Science.gov (United States)

Adler, Paul; Scally, Andrew J; Barrett, Brendan T

2012-05-01

To determine the test-retest reliability of the Randot stereoacuity test when used as part of vision screening in schools. Randot stereoacuity (graded-circles) and logMAR visual acuity measures were gathered in an unselected sample of 139 children (aged 4-12, mean 8.1±2.1 years) in two schools. Randot testing was repeated on two occasions (average interval between successive tests 8 days, range: 1-21 days). Three Randot scores were obtained in 97.8% of children. Randot stereoacuity improved by an average of one plate (ie, one test level) on repeat testing but was little changed when tested on the third occasion. Within-subject variability was up to three test levels on repeat testing. When stereoacuity was categorised as 'fine', 'intermediate' or 'coarse', the greatest variability was found among younger children who exhibited 'intermediate' or 'coarse'/nil stereopsis on initial testing. Whereas 90.8% of children with 'fine' stereopsis (≤50 arc-seconds) on the first test exhibited 'fine' stereopsis on both subsequent tests, only ∼16% of children with 'intermediate' (>50 but ≤140 arc-seconds) or 'coarse'/nil (≥200 arc-seconds) stereoacuity on initial testing exhibited stable test results on repeat testing. Children exhibiting abnormal stereoacuity on initial testing are very likely to exhibit a normal result when retested. The value of a single, abnormal Randot graded-circles stereoacuity measure from school screening is therefore questionable.
14 CFR 61.49 - Retesting after failure.

Science.gov (United States)

2010-01-01

... 14 Aeronautics and Space 2 2010-01-01 2010-01-01 false Retesting after failure. 61.49 Section 61.49 Aeronautics and Space FEDERAL AVIATION ADMINISTRATION, DEPARTMENT OF TRANSPORTATION (CONTINUED... failure. (a) An applicant for a knowledge or practical test who fails that test may reapply for the test...
Psychometric qualities of the Independent Living Skills Survey for psychiatric patients (ILSS-BR): test-retest reliability

Directory of Open Access Journals (Sweden)

Bandeira Marina

2003-01-01

This study aimed to carry out a psychometric analysis of the test-retest reliability of the Brazilian version of the Independent Living Skills Survey (ILSS-BR), which assesses the autonomy of chronic patients in several areas of social functioning. A previous study had shown that the scale has adequate psychometric qualities in terms of the internal consistency of its subscales, as well as discriminant validity and construct validity, but had not evaluated its temporal stability. The results of the present study showed that the ILSS-BR had significant correlation coefficients between test and retest scores for all of its subscales and for the global score. These results indicate that the ILSS-BR is temporally stable and is therefore a reliable instrument for planning and evaluating programs related to the psychosocial rehabilitation of psychiatric patients in the Brazilian context.
A Review and Comparison of the Reliabilities of the MMPI-2, MCMI-III, and PAI Presented in Their Respective Test Manuals

Science.gov (United States)

Wise, Edward A.; Streiner, David L.; Walfish, Steven

2010-01-01

This article provides a review of the literature to determine the most frequently used personality tests. Based on this review, internal consistency and test-retest reliability coefficients from the test manuals for the Minnesota Multiphasic Personality Inventory-2 (MMPI-2), Millon Clinical Multiaxial Inventory-III (MCMI-III), and Personality…
The Internalized Stigma of Mental Illness (ISMI) scale: validation of the Japanese version.

Science.gov (United States)

Tanabe, Yosuke; Hayashi, Kunihiko; Ideno, Yuki

2016-04-29

The present study investigated the reliability and validity of a Japanese version of the Internalized Stigma of Mental Illness (ISMI) scale, designed to assess internalized stigma experienced by people with mental illness. A survey was conducted with 173 outpatients with mental illness who attended psychiatric clinics on a regular basis. A retest was conducted with 51 participants to evaluate the scale's psychometric properties. The alpha coefficient for the overall internal consistency was 0.91, and the coefficients of the individual ISMI subscales ranged from 0.57 to 0.81. The test-retest reliability was r = 0.85 (n = 51, P stigma resistance items excluded. The Japanese version of the ISMI scale demonstrated similar reliability and validity to the original English version. Therefore, the Japanese version of the ISMI scale may be an effective and valid tool to measure internalized stigma among Japanese people who have a mental illness.

Test-retest reliability of the Food Allergy Quality of Life Questionnaires (FAQLQ) for children, adolescents and adults

NARCIS (Netherlands)

van der Velde, Jantina L.; Flokstra-de Blok, Bertine M. J.; Vlieg-Boerstra, Berber J.; Oude Elberink, Joanne N. G.; Schouten, Jan P.; DunnGalvin, Audrey; Hourihane, Jonathan O.'B.; Duiverman, Eric J.; Dubois, Anthony E. J.

2009-01-01

The self-administered Food Allergy Quality of Life Questionnaire-Child Form (FAQLQ-CF), -Teenager Form (FAQLQ-TF) and -Adult Form (FAQLQ-AF) were recently developed within EuroPrevall, a multi-centred study of food allergy in Europe. The primary aim of this study was to evaluate the test-retest
Rationale and design of REACT: a randomised controlled trial assessing the effectiveness of home-collection to increase chlamydia retesting and detect repeat positive tests.

Science.gov (United States)

Smith, Kirsty S; Hocking, Jane S; Chen, Marcus; Fairley, Christopher K; McNulty, Anna; Read, Phillip; Bradshaw, Catriona S; Tabrizi, Sepehr N; Wand, Handan; Saville, Marion; Rawlinson, William; Garland, Suzanne M; Donovan, Basil; Kaldor, John M; Guy, Rebecca

2014-04-24

Repeat infection with Chlamydia trachomatis is common and increases the risk of sequelae in women and HIV seroconversion in men who have sex with men (MSM). Despite guidelines recommending chlamydia retesting three months after treatment, retesting rates are low. We are conducting the first randomised controlled trial to assess the effectiveness of home collection combined with short message service (SMS) reminders on chlamydia retesting and reinfection rates in three risk groups. The REACT (retest after Chlamydia trachomatis) trial involves 600 patients diagnosed with chlamydia: 200 MSM, 200 women and 200 heterosexual men recruited from two Australian sexual health clinics where SMS reminders for retesting are routine practice. Participants will be randomised to the home group (3-month SMS reminder and home-collection) or the clinic group (3-month SMS reminder to return to the clinic). Participants in the home group will be given the choice of attending the clinic if they prefer. The mailed home-collection kit includes a self-collected vaginal swab (women), UriSWAB (Copan) for urine collection (heterosexual men), and UriSWAB plus rectal swab (MSM). The primary outcome is the retest rate at 1-4 months after a chlamydia diagnosis, and the secondary outcomes are: the repeat positive test rate; the reinfection rate; the acceptability of home testing with SMS reminders; and the cost effectiveness of home testing. Sexual behaviour data collected via an online survey at 4-5 months, and genotyping of repeat infections, will be used to discriminate reinfections from treatment failures. The trial will be conducted over two years. An intention to treat analysis will be conducted. This study will provide evidence about the effectiveness of home-collection combined with SMS reminders on chlamydia retesting, repeat infection and reinfection rates in three risk groups. The trial will determine client acceptability and cost effectiveness of this strategy. Australian and New
14 CFR 63.41 - Retesting after failure.

Science.gov (United States)

2010-01-01

... 14 Aeronautics and Space 2 2010-01-01 2010-01-01 false Retesting after failure. 63.41 Section 63.41 Aeronautics and Space FEDERAL AVIATION ADMINISTRATION, DEPARTMENT OF TRANSPORTATION (CONTINUED... failure. An applicant for a flight engineer certificate who fails a written test or practical test for...
Short-term test-retest-reliability of conditioned pain modulation using the cold-heat-pain method in healthy subjects and its correlation to parameters of standardized quantitative sensory testing.

Science.gov (United States)

Gehling, Julia; Mainka, Tina; Vollert, Jan; Pogatzki-Zahn, Esther M; Maier, Christoph; Enax-Krumova, Elena K

2016-08-05

Conditioned Pain Modulation (CPM) is often used to assess human descending pain inhibition. Nine different studies on the test-retest-reliability of different CPM paradigms have been published, but none of them has investigated the commonly used heat-cold-pain method. The results vary widely, and therefore reliability measures cannot be extrapolated from one CPM paradigm to another. The aim of the present study was to analyse the test-retest-reliability of the common heat-cold-pain method and its correlation to pain thresholds. We tested the short-term test-retest-reliability within 40 ± 19.9 h using a cold-water immersion (10 °C, left hand) as conditioning stimulus (CS) and heat pain (43-49 °C, pain intensity 60 ± 5 on the 101-point numeric rating scale, right forearm) as test stimulus (TS) in 25 healthy right-handed subjects (12 females, 31.6 ± 14.1 years). The TS was applied 30 s before (TSbefore), during (TSduring) and after (TSafter) the 60 s CS. The difference between the pain ratings for TSbefore and TSduring represents the early CPM-effect, between TSbefore and TSafter the late CPM-effect. Quantitative sensory testing (QST, DFNS protocol) was performed on both sessions before the CPM assessment. Paired t-tests, intraclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Pearson's correlation, Bland-Altman analysis, significance level p Pain ratings during CPM correlated significantly (ICC: 0.411…0.962) between both days, though ratings for TSafter were lower on day 2 (p pain thresholds. The short-term test-retest-reliability of the early CPM-effect using the heat-cold-pain method in healthy subjects achieved satisfying results in terms of the ICC. The SRD of the early CPM effect showed that an individual change of > 20 NRS can be attributed to a real change rather than chance. The late CPM-effect was weaker and not reliable.
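The abstract above lists SEM, SRD and Bland-Altman analysis without giving formulas. As a minimal sketch, assuming the conventional definitions (SEM taken as the SD of the test-retest differences divided by sqrt(2), SRD = 1.96 * sqrt(2) * SEM, and 95% limits of agreement = bias ± 1.96 * SD of the differences), the quantities could be computed as follows. The function names and the example ratings are hypothetical and are not taken from the study.

```python
import math
import statistics


def bland_altman(day1, day2):
    """Return (bias, lower limit, upper limit) of agreement for paired scores."""
    diffs = [b - a for a, b in zip(day1, day2)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample SD of the paired differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


def sem_from_differences(day1, day2):
    """SEM estimated as SD of the test-retest differences divided by sqrt(2)."""
    diffs = [b - a for a, b in zip(day1, day2)]
    return statistics.stdev(diffs) / math.sqrt(2)


def smallest_real_difference(sem):
    """SRD at the 95% level: 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem


# Hypothetical pain ratings on two test days (not data from the study).
day1 = [10, 20, 30, 40]
day2 = [12, 18, 33, 41]
bias, lower, upper = bland_altman(day1, day2)
srd = smallest_real_difference(sem_from_differences(day1, day2))
```

Under these definitions the SRD equals the half-width of the Bland-Altman limits of agreement, which is why both are often reported together.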
Test-Retest Reliability of the Parent Behavior Importance Questionnaire-Revised and the Parent Behavior Frequency Questionnaire-Revised

Science.gov (United States)

Mowder, Barbara A.; Shamah, Renee

2011-01-01

This study evaluated the test-retest reliability of two parenting measures: the Parent Behavior Importance Questionnaire-Revised (PBIQ-R) and Parent Behavior Frequency Questionnaire-Revised (PBFQ-R). These self-report parenting behavior assessment measures may be utilized as pre- and post-parent education program measures, with parents as well as…
Which is the most useful patient-reported outcome in femoroacetabular impingement? Test-retest reliability of six questionnaires.

Science.gov (United States)

Hinman, Rana S; Dobson, Fiona; Takla, Amir; O'Donnell, John; Bennell, Kim L

2014-03-01

The most reliable patient-reported outcomes (PROs) for people with femoroacetabular impingement (FAI) is unknown because there have been no direct comparisons of questionnaires. Thus, the aim was to evaluate the test-retest reliability of six existing PROs in a single cohort of young active people with hip/groin pain consistent with a clinical diagnosis of FAI. Young adults with clinical FAI completed six PRO questionnaires on two occasions, 1-2 weeks apart. The PROs were modified Harris Hip Score, Hip dysfunction and Osteoarthritis Score, Hip Outcome Score, Non-Arthritic Hip Score, International Hip Outcome Tool, Copenhagen Hip and Groin Outcome Score. 30 young adults (mean age 24 years, SD 4 years, range 18-30 years; 15 men) with stable symptoms participated. Intraclass correlation coefficient(3,1) values ranged from 0.73 to 0.93 (95% CI 0.38 to 0.98) indicating that most questionnaires reached minimal reliability benchmarks. Measurement error at the individual level was quite large for most questionnaires (minimal detectable change (MDC95) 12.4-35.6, 95% CI 8.7 to 54.0). In contrast, measurement error at the group level was quite small for most questionnaires (MDC95 2.2-7.3, 95% CI 1.6 to 11). The majority of the questionnaires were reliable and precise enough for use at the group level. Samples of only 23-30 individuals were required to achieve acceptable measurement variation at the group level. Further direct comparisons of these questionnaires are required to assess other measurement properties such as validity, responsiveness and meaningful change in young people with FAI.
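The abstract above reports ICC(3,1) and minimal detectable change at the individual and group level. The following is a sketch under stated assumptions: ICC(3,1) is computed from two-way ANOVA mean squares in the usual Shrout and Fleiss form, SEM = SD * sqrt(1 - ICC), MDC95 = 1.96 * sqrt(2) * SEM, and the group-level value is assumed to be the individual MDC95 divided by sqrt(n). The abstract does not state which formulas the authors used, and the function names and numbers are illustrative only.

```python
import math


def icc_3_1(scores):
    """ICC(3,1), consistency form; scores has one row per subject, k columns."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)                # between-subjects mean square
    ms_error = ss_error / ((n - 1) * (k - 1))  # residual mean square
    return (ms_rows - ms_error) / (ms_rows + (k - 1) * ms_error)


def mdc95(sd, icc, n=1):
    """MDC95 for one person (n=1) or, by assumption, for a group of n."""
    sem = sd * math.sqrt(1 - icc)
    return 1.96 * math.sqrt(2) * sem / math.sqrt(n)


# Hypothetical scores: every retest is exactly 2 points higher than the test.
scores = [[1, 3], [2, 4], [3, 5], [4, 6]]
icc = icc_3_1(scores)
individual = mdc95(sd=10.0, icc=0.91)
group = mdc95(sd=10.0, icc=0.91, n=30)
```

Because ICC(3,1) measures consistency, the constant 2-point shift in the example does not lower it. The division by sqrt(n) illustrates why a questionnaire can be too noisy for tracking one patient yet precise enough for comparing group means.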
Test-retest repeatability of child's respiratory symptoms and perceived indoor air quality - comparing self- and parent-administered questionnaires.

Science.gov (United States)

Lampi, Jussi; Ung-Lanki, Sari; Santalahti, Päivi; Pekkanen, Juha

2018-02-09

Questionnaires can be used to assess perceived indoor air quality and symptoms in schools. Questionnaires for primary school aged children have traditionally been parent-administered, but self-administered questionnaires would be easier to administer and may yield as good, if not better, information. Our aim was to compare the repeatability of self- and parent-administered indoor air questionnaires designed for primary school aged pupils. An indoor air questionnaire with questions on the child's symptoms and perceived indoor air quality in schools was sent to parents of pupils aged 7-12 years in two schools and again after two weeks. A slightly modified version of the questionnaire was administered to pupils aged 9-12 years in another two schools and repeated after a week. 351 (52%) parents and 319 pupils (86%) answered both the first and the second questionnaire. Test-retest repeatability was assessed with intra-class correlation (ICC) and Cohen's kappa coefficients (k). Test-retest repeatability was generally between 0.4 and 0.7 (ICC; k) in both self- and parent-administered questionnaires. In the majority of the questions on symptoms and perceived indoor air quality, test-retest repeatability was at the same level or slightly better in the self-administered compared to the parent-administered questionnaire. Agreement of self- and parent administered questionnaires was generally indoor air quality. Children aged 9-12 years can give as, or even more, repeatable information about their respiratory symptoms and perceived indoor air quality than their parents. Therefore, it may be possible to use self-administered questionnaires in future studies also with children.
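For categorical items, the abstract above uses Cohen's kappa as the repeatability measure. A minimal sketch of the unweighted statistic, kappa = (p_o - p_e) / (1 - p_e), is given here. The answers are invented for illustration, and the sketch does not handle the degenerate case where expected agreement equals 1.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Unweighted Cohen's kappa for two sets of answers from the same people."""
    n = len(first)
    # Observed agreement: share of respondents giving the same answer twice.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement from the marginal answer frequencies on each occasion.
    c1, c2 = Counter(first), Counter(second)
    expected = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / n ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical answers to one symptom question on two occasions.
test = ["yes", "yes", "no", "no", "yes", "no", "no", "no", "yes", "no"]
retest = ["yes", "no", "no", "no", "yes", "no", "yes", "no", "yes", "no"]
kappa = cohens_kappa(test, retest)
```

In this example 8 of 10 answers agree, but about half of that agreement is expected by chance, so kappa falls within the 0.4 to 0.7 band that the abstract describes as typical.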
Interobserver and test-retest reproducibility of T1ρ and T2 measurements of lumbar intervertebral discs by 3T magnetic resonance imaging

Energy Technology Data Exchange (ETDEWEB)

Yoo, Yeon Hwa; Yoon, Choon Sik; Eun, Na Lae; Kim, Sung Jin; Chung, Tae Sub [Dept. of Radiology, Gangnam Severance Hospital, Yonsei University College of Medicine, Seoul (Korea, Republic of); Hwang, Moon Jung [GE Health Care, Seoul (Korea, Republic of); Yoo, Hanna [Biostatistics Collaboration Lab, Yonsei University College of Medicine, Seoul (Korea, Republic of); Peter, Robert D. [GE Health Care, Milwaukee (United States); Lee, Young Han; Suh, Jin Suck [Dept. of Radiology, Severance Hospital, Yonsei University College of Medicine, Seoul (Korea, Republic of)

2016-11-15

To investigate the interobserver and test-retest reproducibility of T1ρ and T2 measurements of lumbar intervertebral discs using 3T magnetic resonance imaging (MRI). This study included a total of 51 volunteers (female, 26; male, 25; mean age, 54 ± 16.3 years) who underwent lumbar spine MRI with a 3.0 T scanner. Amongst these subjects, 40 underwent repeat T1ρ and T2 measurement acquisitions with identical image protocol. Two observers independently performed the region of interest measurements in the nuclei pulposi of the discs from L1-2 through L5-S1 levels. Statistical analysis was performed using intraclass correlation coefficient (ICC) with a two-way random model of absolute agreement. Comparison of the ICC values was done after acquisition of ICC values using Z test. Statistical significance was defined as p value < 0.05. The ICCs of interobserver reproducibility were 0.951 and 0.672 for T1ρ and T2 mapping, respectively. The ICCs of test-retest reproducibility (40 subjects) for T1ρ and T2 measurements were 0.922 and 0.617 for observer A and 0.914 and 0.628 for observer B, respectively. In the comparison of the aforementioned ICCs, ICCs of interobserver and test-retest reproducibility for T1ρ mapping were significantly higher than T2 mapping (p < 0.001). The interobserver and test-retest reproducibility of T1ρ mapping were significantly higher than those of T2 mapping for the quantitative assessment of nuclei pulposi of lumbar intervertebral discs.
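The abstract above specifies a two-way random model of absolute agreement. As a sketch, assuming the single-measurement form usually written ICC(2,1) = (MSR - MSE) / (MSR + (k - 1) MSE + k (MSC - MSE) / n), the coefficient could be computed as shown. Whether the authors used the single or the average-measures form is not stated, and the input values are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1), absolute agreement; one row per subject, k measurements each."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ms_rows = ss_rows / (n - 1)   # between-subjects mean square (MSR)
    ms_cols = ss_cols / (k - 1)   # between-sessions mean square (MSC)
    ms_error = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))  # MSE
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical values: the repeat scan reads 2 units higher for every subject.
shifted = [[1, 3], [2, 4], [3, 5], [4, 6]]
identical = [[1, 1], [2, 2], [3, 3], [4, 4]]
icc_shifted = icc_2_1(shifted)
icc_identical = icc_2_1(identical)
```

Unlike a consistency coefficient, the absolute-agreement form penalises a systematic offset between sessions, so the shifted example scores well below 1 even though the subjects keep the same rank order.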
Test-retest reliability of Antonovsky's 13-item sense of coherence scale in patients with hand-related disorders

DEFF Research Database (Denmark)

Hansen, Alice Ørts; Kristensen, Hanne Kaae; Cederlund, Ragnhild

2017-01-01

to be a powerful tool to measure the ICF component personal factors, which could have an impact on patients' rehabilitation outcomes. Implications for rehabilitation Antonovsky's SOC-13 scale showed test-retest reliability for patients with hand-related disorders. The SOC-13 scale could be a suitable tool to help...... measure personal factors....
Test Re-Test Reliability of Four Versions of the 3-Cone Test in Non-Athletic Men

Directory of Open Access Journals (Sweden)

Jason G. Langley, Robert D. Chetlin

2017-03-01

Full Text Available Until recently, measurement and evaluation in sport science, especially agility testing, has not always included key elements of proper test construction. Often tests are published without reporting reliability and validity analysis for a specific population. The purpose of the present study was to examine the test re-test reliability of four versions of the 3-Cone Test (3CT), and provide guidance on proper test construction for testing agility in athletic populations. Forty male students enrolled in classes in the Department of Physical Education at a mid-Atlantic university participated. On each test day, participants performed 10 trials. In random order, they performed three trials to the right (3CTR, standard test), three to the left (3CTL), and two modified trials (3CTAR and 3CTAL), which included a reactive component in which a visual cue was given to indicate direction. Intra-class correlation coefficients (ICC) indicated a moderate to high reliability for the four tests: 3CTR 0.79 (95% CI 0.64-0.88), 3CTL 0.73 (0.55-0.85), 3CTAR 0.85 (0.74-0.92), and 3CTAL 0.79 (0.64-0.88). A small standard error of the measurement (SEM) was found; range 0.09 to 0.10. Pearson correlations between tests were high on day one (0.82-0.92) as well as on day two (0.72-0.85). These results indicate each version of the 3-Cone Test is reliable; however, further tests are needed with specific athletic populations. Only the 3CTAR and 3CTAL are tests of agility due to the inclusion of a reactive component. Future studies examining agility testing and training should incorporate technological elements, including automated timing systems and motion capture analysis. Such instrumentation will allow for optimal design of tests that simulate sport-specific game conditions.
Development and reliability testing of a self-report instrument to measure the office layout as a correlate of occupational sitting

Directory of Open Access Journals (Sweden)

Duncan Mitch J

2013-02-01

Full Text Available Abstract Background Spatial configurations of office environments assessed by Space Syntax methodologies are related to employee movement patterns. These methods require analysis of floor plans, which are not readily available in large population-based studies or are otherwise unavailable. Therefore a self-report instrument to assess spatial configurations of office environments using four scales was developed. Methods The scales are: local connectivity (16 items), overall connectivity (11 items), visibility of co-workers (10 items), and proximity of co-workers (5 items). A panel cohort (N = 1154) completed an online survey; only data from individuals employed in office-based occupations (n = 307) were used to assess scale measurement properties. To assess test-retest reliability a separate sample of 37 office-based workers completed the survey on two occasions 7.7 (±3.2) days apart. Redundant scale items were eliminated using factor analysis; Cronbach's α was used to evaluate internal consistency and test re-test reliability (retest-ICC). ANOVA was employed to examine differences between office types (Private, Shared, Open) as a measure of construct validity. Generalized Linear Models were used to examine relationships between spatial configuration scales and the duration of and frequency of breaks in occupational sitting. Results The number of items on all scales was reduced; Cronbach's α and ICCs indicated good scale internal consistency and test re-test reliability: local connectivity (5 items; α = 0.70; retest-ICC = 0.84), overall connectivity (6 items; α = 0.86; retest-ICC = 0.87), visibility of co-workers (4 items; α = 0.78; retest-ICC = 0.86), and proximity of co-workers (3 items; α = 0.85; retest-ICC = 0.70). Significant (p ≤ 0.001) differences, in theoretically expected directions, were observed for all scales between office types, except overall connectivity. Significant associations were
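The record above reports Cronbach's α for each scale. As a minimal sketch of the standard formula, α = k / (k - 1) * (1 - sum of item variances / variance of the total score), the coefficient could be computed as below. The responses are invented, and the function name is hypothetical.

```python
import statistics


def cronbach_alpha(responses):
    """Cronbach's alpha; responses has one row per respondent, k item scores."""
    k = len(responses[0])
    # Sample variance of each item across respondents.
    item_variances = [
        statistics.variance([row[j] for row in responses]) for j in range(k)
    ]
    # Sample variance of each respondent's summed scale score.
    total_variance = statistics.variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical answers of four respondents to a three-item scale.
responses = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 4, 5]]
alpha = cronbach_alpha(responses)
```

Alpha rises when items covary strongly relative to their individual variances, which is one reason removing redundant or unrelated items, as the study did with factor analysis, can change the reported value.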
Test-retest reliability of speech-evoked auditory brainstem response in healthy children at a low sensation level.

Science.gov (United States)

Zakaria, Mohd Normani; Jalaei, Bahram

2017-11-01

Auditory brainstem responses evoked by complex stimuli such as speech syllables have been studied in normal subjects and subjects with compromised auditory functions. The stability of speech-evoked auditory brainstem response (speech-ABR) when tested over time has been reported but the literature is limited. The present study was carried out to determine the test-retest reliability of speech-ABR in healthy children at a low sensation level. Seventeen healthy children (6 boys, 11 girls) aged from 5 to 9 years (mean = 6.8 ± 3.3 years) were tested in two sessions separated by a 3-month period. The stimulus used was a 40-ms syllable /da/ presented at 30 dB sensation level. As revealed by paired t-test and intra-class correlation (ICC) analyses, peak latencies, peak amplitudes and composite onset measures of speech-ABR were found to be highly replicable. Compared to other parameters, higher ICC values were noted for peak latencies of speech-ABR. The present study was the first to report the test-retest reliability of speech-ABR recorded at low stimulation levels in healthy children. Due to its good stability, it can be used as an objective indicator for assessing the effectiveness of auditory rehabilitation in hearing-impaired children in future studies. Copyright © 2017 Elsevier B.V. All rights reserved.
Test-retest reliability of a questionnaire to assess physical environmental factors pertaining to physical activity

Directory of Open Access Journals (Sweden)

McGinn Aileen P

2005-06-01

Full Text Available Abstract Background Despite the documented benefits of physical activity, many adults do not obtain the recommended amounts. Barriers to physical activity occur at multiple levels, including at the individual, interpersonal, and environmental levels. Only recently has there been a concerted focus on how the physical environment might affect physical activity behavior. With this new area of study, self-report measures should be psychometrically tested before use in research studies. Therefore the objective of this study was to document the test-retest reliability of a questionnaire designed to assess physical environmental factors that might be associated with physical activity in a diverse adult population. Methods Test and retest surveys were conducted over the telephone with 106 African American and White women and men living in either Forsyth County, North Carolina or Jackson, Mississippi. Reliability of self-reported environmental factors across four domains (e.g., access to facilities and destinations, functionality and safety, aesthetics, natural environment) was determined using intraclass correlation coefficients (ICC) overall and separately by gender and race. Results Generally items displayed moderate and sometimes substantial reliability (ICC between 0.4 and 0.8), with a few differences by gender or race, across each of the domains. Conclusion This study provides some psychometric evidence for the use of many of these questions in studies examining the effect of self-reported physical environmental measures on physical activity behaviors, among African American and White women and men.
Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

Science.gov (United States)

Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

2016-01-01

To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
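The abstract above mentions a standardized difference Z-test on paired Rasch person measures but gives no formula. A minimal sketch, assuming the commonly used form z = (m1 - m2) / sqrt(se1^2 + se2^2) with a measure treated as stable when |z| <= 1.96, is shown here. The cut-off, the function names and the logit values are assumptions for illustration, not details confirmed by the source.

```python
import math


def standardized_difference(m1, se1, m2, se2):
    """z statistic for two person measures with their standard errors."""
    return (m1 - m2) / math.sqrt(se1 ** 2 + se2 ** 2)


def is_stable(m1, se1, m2, se2, critical=1.96):
    """True if the two measures do not differ beyond the assumed cut-off."""
    return abs(standardized_difference(m1, se1, m2, se2)) <= critical


# Hypothetical person ability measures (logits) from test and retest.
z_small = standardized_difference(1.5, 0.3, 0.9, 0.4)
stable = is_stable(1.5, 0.3, 0.9, 0.4)
unstable = is_stable(3.0, 0.3, 1.0, 0.4)
```

Counting how many participants pass such a check gives a per-person view of stability, which complements a single sample-level coefficient such as the ICC.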
Test-retest studies in quantitative sensory testing

DEFF Research Database (Denmark)

Werner, M U; Petersen, M A; Bischoff, J M

2013-01-01

Quantitative sensory testing (QST) investigates the graded psychophysical response to controlled thermal, mechanical, electrical or chemical stimuli, allowing quantification of clinically relevant perception and pain thresholds. The methods are ubiquitously used in experimental and clinical pain...... research, and therefore, the need for uniform assessment procedures has been emphasised. However, varying consistency and transparency in the statistical methodology seem to occur in the QST literature. Sixteen publications, evaluating aspects of QST variability, from 2010 to 2012, were critically reviewed...
Test-retest reliability and construct validity of the DOiT (Dutch Obesity Intervention in Teenagers) questionnaire: measuring energy balance-related behaviours in Dutch adolescents.

Science.gov (United States)

Janssen, Evelien H C; Singh, Amika S; van Nassau, Femke; Brug, Johannes; van Mechelen, Willem; Chinapaw, Mai J M

2014-02-01

Adequate assessment of energy balance-related behaviours in adolescents is essential to develop and evaluate effective obesity prevention programmes. The present study examined the test-retest reliability and construct validity of a questionnaire assessing energy balance-related behaviours in adolescents during the evaluation of the DOiT (Dutch Obesity Intervention in Teenagers) intervention. To assess test-retest reliability, adolescents filled in the questionnaire twice (n 111). To assess construct validity, the results from the first test were compared with data collected in a personal cognitive interview (n 20, independent from the reliability study). For both reliability and validity, intraclass correlation coefficients for continuous data or Cohen's kappa coefficients for categorical data were calculated as well as percentage agreement. Data were collected during school time from February to May 2010. Study participants were Dutch adolescents aged 12-14 years attending pre-vocational secondary schools. In more than three-quarters of the ninety-five questionnaire items the test-retest reliability appeared to be good to excellent. Moderate reliability was found for all other twenty-one items. Fifty-one items (of ninety-five items) showed good to excellent construct validity. Construct validity appeared moderate in twenty-three items and poor in twenty-one items. Most items with poor construct validity concerned consumption of sugar-containing beverages and high-energy snacks/sweets. Our study showed good test-retest reliability and largely moderate to good construct validity for the majority of items of the DOiT questionnaire. Items with poor construct validity (most of them found for items concerning energy intake-related behaviours) should be revised and tested again to improve the questionnaire for future use.
Test-retest reliability of the diagnosis of schizoaffective disorder in childhood and adolescence - A systematic review and meta-analysis.

Science.gov (United States)

Salamon, Sarah; Santelmann, Hanno; Franklin, Jeremy; Baethge, Christopher

2018-04-01

Reliability of schizoaffective disorder (SAD) diagnoses is low in adults but unclear in children and adolescents (CAD). We estimate the test-retest reliability of SAD and its key differential diagnoses (schizophrenia, bipolar disorder, and unipolar depression). Systematic literature search of Medline, Embase, and PsycInfo for studies on test-retest reliability of SAD, in CAD. Cohen's kappa was extracted from studies. We performed meta-analysis for kappa, including subgroup and sensitivity analysis (PROSPERO protocol: CRD42013006713). Out of > 4000 records screened, seven studies were included. We estimated kappa values of 0.27 [95%-CI: 0.07; 0.47] for SAD, 0.56 [0.29; 0.83] for schizophrenia, 0.64 [0.55; 0.74] for bipolar disorder, and 0.66 [0.52; 0.81] for unipolar depression. In 5/7 studies kappa of SAD was lower than that of schizophrenia; similar trends emerged for bipolar disorder (4/5) and unipolar depression (2/3). Estimates of positive agreement of SAD diagnoses supported these results. The number of studies and patients included is low. The point-estimate of the test-retest reliability of schizoaffective disorder is only fair, and lower than that of its main differential diagnoses. All kappa values under study were lower in children and adolescents samples than those reported for adults. Clinically, schizoaffective disorder should be diagnosed in strict adherence to the operationalized criteria and ought to be re-evaluated regularly. Should larger studies confirm the insufficient reliability of schizoaffective disorder in children and adolescents, the clinical value of the diagnosis is highly doubtful. Copyright © 2017. Published by Elsevier B.V.
Test-Retest Reliability of fMRI During Nonverbal Semantic Decisions in Moderate-Severe Nonfluent Aphasia Patients

Directory of Open Access Journals (Sweden)

Jacquie Kurland

2004-01-01

Full Text Available Cortical reorganization in poststroke aphasia is not well understood. Few studies have investigated neural mechanisms underlying language recovery in severe aphasia patients, who are typically viewed as having a poor prognosis for language recovery. Although test-retest reliability is routinely demonstrated during collection of language data in single-subject aphasia research, this is rarely examined in fMRI studies investigating the underlying neural mechanisms in aphasia recovery.
Herth hope index: psychometric testing of the Chinese version.

Science.gov (United States)

Chan, Keung Sum; Li, Ho Cheung William; Chan, Sally Wai-Chi; Lopez, Violeta

2012-09-01

This article is a report on psychometric testing of the Chinese version of the Herth Hope Index. The availability of a valid and reliable instrument that accurately measures the level of hope in patients with heart failure is crucial before any hope-enhancing interventions can be appropriately planned and evaluated. There is no such instrument for Chinese people. A test-retest, within-subjects design was used. A purposive sample of 120 Hong Kong Chinese patients with heart failure between the ages of 60 and 80 years admitted to two medical wards was recruited during an 8-month period in 2009. Participants were asked to respond to the Chinese version of the Herth Hope Index, the Hamilton Depression Rating Scale and Rosenberg's Self-Esteem Scale. The internal consistency, content validity, construct validity and test-retest reliability of the Chinese version of the Herth Hope Index were assessed. The newly translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly translated scale can be used as a self-report assessment tool in assessing the level of hope in Hong Kong Chinese patients with heart failure. © 2011 Blackwell Publishing Ltd.

Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system

Energy Technology Data Exchange (ETDEWEB)

Thomas, Marianna S.; Newman, David; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N. [Norfolk and Norwich University Hospital, Department of Radiology, Norwich (United Kingdom); Leinhard, Olof Dahlqvist [Linkoeping University, Center for Medical Image Science and Visualization, Linkoeping (Sweden); Linkoeping University, Department of Medical and Health Sciences, Linkoeping (Sweden); Karlsson, Anette; Borga, Magnus [Linkoeping University, Center for Medical Image Science and Visualization, Linkoeping (Sweden); Linkoeping University, Department of Biomedical Engineering, Linkoeping (Sweden); Rosander, Johannes [Advanced MR Analytics AB, Linkoeping (Sweden); Toms, Andoni P. [Norfolk and Norwich University Hospital, Department of Radiology, Norwich (United Kingdom); Radiology Academy, Cotman Centre, Norwich, Norfolk (United Kingdom)

2014-09-15

To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19.32 L (SD 9.1) and 19.28 L (SD 9.12) for first and second acquisitions (intraclass correlation coefficient (ICC) = 1.0, 95% level of agreement -0.32 to 0.2 L). ICCs for all automated test-retest muscle volumes were almost perfect (0.99-1.0) with 95% levels of agreement 1.8-6.6% of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1.68 L (2SD 0.6) compared to automated 1.64 L (2SD 0.6), left lower leg: manual 1.69 L (2SD 0.64) compared to automated 1.63 L (SD 0.61); correlation coefficients for automated and manual segmentation were 0.94-0.96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. (orig.)
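The "95% level of agreement" quoted in this abstract is usually obtained with the Bland-Altman approach: the mean of the paired differences plus or minus 1.96 times their standard deviation. The abstract does not say how its limits were computed, so the following is only a minimal sketch under that assumption. The function name `bland_altman` and the four volume pairs are hypothetical and are not data from the study.

```python
from statistics import mean, stdev

def bland_altman(first, second):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    bias is the mean of (second - first); the limits are
    bias -/+ 1.96 * sample SD of the differences.
    """
    diffs = [b - a for a, b in zip(first, second)]
    bias = mean(diffs)
    half_width = 1.96 * stdev(diffs)
    return bias, bias - half_width, bias + half_width

# Hypothetical muscle volumes in litres for four volunteers (illustration only).
first_scan = [19.0, 20.5, 18.2, 21.1]
second_scan = [19.1, 20.3, 18.4, 21.0]

bias, lower, upper = bland_altman(first_scan, second_scan)
print(f"bias = {bias:.3f} L, limits of agreement = {lower:.3f} to {upper:.3f} L")
```

Narrow limits relative to the mean volume indicate that a repeated scan is unlikely to differ much from the first one, which is the sense in which the abstract reports "very low margins of error".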
Test-retest studies of cerebral glucose metabolism using fluorine-18 deoxyglucose: validation of method

International Nuclear Information System (INIS)

Brooks, R.A.; Di Chiro, G.; Zukerberg, B.W.; Bairamian, D.; Larson, S.M.

1987-01-01

In studies using [18F]deoxyglucose (FDG), one often wants to compare metabolic rates following stimulation (drug or motor-sensory) with the baseline values. However, because of reproducibility problems, with baseline variations of 25% in the same individual not uncommon, the global effect of the stimulation may be difficult to see. One approach to this problem is to perform the two studies sequentially. This means that, with the 110-min half-life of 18F, one must take into account the residual activity from the first study when calculating metabolic rates for the second. We performed test-retest baseline studies on four subjects, with a 1-hr interval between injections. These studies were done without stimulation, in order to validate the repeatability of the method. To reduce the amount of residual activity from the first study, the first injection was only 2 mCi in three cases, and only 1 mCi in one case, out of a total injected dose of 5 mCi. A correction for residual activity was included in the retest calculation of metabolic rate. The results showed a global metabolic shift between the two studies of 2% to 9%. An error analysis shows that the shift could be further reduced if anatomically comparable scans are done at comparable postinjection times.
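To see why residual activity matters with a 1-hr interval, the physical decay of 18F can be worked out from the 110-min half-life quoted in the abstract. The sketch below covers physical decay only. It is not the correction used by the authors, which the abstract does not describe, and it ignores biological uptake and clearance. The function name `fraction_remaining` is hypothetical.

```python
HALF_LIFE_MIN = 110.0  # half-life of 18F as quoted in the abstract

def fraction_remaining(elapsed_min, half_life_min=HALF_LIFE_MIN):
    """Fraction of the initial activity left after elapsed_min minutes."""
    return 0.5 ** (elapsed_min / half_life_min)

# One hour after a 2 mCi first injection (physical decay only).
frac = fraction_remaining(60.0)
residual_mci = 2.0 * frac
print(f"fraction remaining after 60 min: {frac:.3f}")
print(f"residual activity from a 2 mCi injection: {residual_mci:.2f} mCi")
```

Under these assumptions roughly two thirds of the first dose is still decaying when the second injection is given, which is consistent with the authors keeping the first injection small.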
Test-retest variability of multifocal electroretinography in normal volunteers and short-term variability in hydroxychloroquine users

Directory of Open Access Journals (Sweden)

Browning DJ

2014-08-01

David J Browning (1), Chong Lee (2). (1) Charlotte Eye, Ear, Nose and Throat Associates; (2) University of North Carolina – Charlotte, Charlotte, NC, USA. Purpose: To determine measurement variability of N1P1 amplitudes and the R1/R2 ratio in normal subjects and hydroxychloroquine users without retinopathy. Design: Retrospective, observational study. Subjects: Normal subjects (n=21) and patients taking hydroxychloroquine (n=44) without retinopathy. Methods: Multifocal electroretinography (mfERG) was performed twice in one session in the 21 normal subjects and twice within 1 year in the hydroxychloroquine users, during which time no clinical change in macular status occurred. Main outcome measures: N1P1 amplitudes of rings R1–R5, the R1/R2 ratio, and coefficients of repeatability (COR) for these measurements. Results: Values for N1P1 amplitudes in hydroxychloroquine users were reduced compared with normal subjects by the known effect of age, but R1/R2 was not affected by age. The COR for R1–R5 ranged from 43% to 52% for normal subjects and from 43% to 59% for hydroxychloroquine users; for R1/R2 the COR was 29% in normal subjects and 45% in hydroxychloroquine users. Conclusion: mfERG measurements show high test-retest variability, limiting the ability of a single mfERG test to influence a decision to stop hydroxychloroquine; corroborative evidence with a different ancillary test is recommended in a suspicious case. Keywords: multifocal electroretinography, hydroxychloroquine, test-retest variability
Inter-Rater and Test-Retest (Between-Sessions) Reliability of the 4-Skills Scan for Dutch Elementary School Children

Science.gov (United States)

van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M.

2018-01-01

In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…
Test-Retest Reproducibility of the Microperimeter MP3 With Fundus Image Tracking in Healthy Subjects and Patients With Macular Disease.

Science.gov (United States)

Palkovits, Stefan; Hirnschall, Nino; Georgiev, Stefan; Leisser, Christoph; Findl, Oliver

2018-02-01

To evaluate the test-retest reproducibility of a novel microperimeter with fundus image tracking (MP3, Nidek Co, Japan) in healthy subjects and patients with macular disease. Ten healthy subjects and 20 patients suffering from a range of macular diseases were included. After training measurements, two additional microperimetry measurements were scheduled. Test-retest reproducibility was assessed for mean retinal sensitivity, pointwise sensitivity, and deep scotoma size using the coefficient of repeatability and Bland-Altman diagrams. In addition, in a subgroup of patients microperimetry was compared with conventional perimetry. Average differences in mean retinal sensitivity between the two study measurements were 0.26 ± 1.7 dB (median 0 dB; interquartile range [IQR] -1 to 1) for the healthy and 0.36 ± 2.5 dB (median 0 dB; IQR -1 to 2) for the macular patient group. Coefficients of repeatability for mean retinal sensitivity and pointwise retinal sensitivity were 1.2 and 3.3 dB for the healthy subjects and 1.6 and 5.0 dB for the macular disease patients, respectively. Absolute agreement in deep scotoma size between both study days was found in 79.9% of the test loci. The microperimeter MP3 shows adequate test-retest reproducibility for mean retinal sensitivity, pointwise retinal sensitivity, and deep scotoma size in healthy subjects and patients suffering from macular disease. Furthermore, the reproducibility of microperimetry is higher than that of conventional perimetry. Reproducibility is an important measure for each diagnostic device. Especially in a clinical setting, high reproducibility sets the basis for achieving reliable results with the specific device. Therefore, assessment of reproducibility is of eminent importance for interpreting the findings of future studies.
International field testing of the psychometric properties of an EORTC quality of life module for oral health: the EORTC QLQ-OH15.

Science.gov (United States)

Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B

2016-09-01

This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients with mixed cancers, in different treatment phases from 10 countries completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54% females) were analyzed. Completion took issues were addressed. Analyses suggested a revision of the phase III hypothesized scale structure. Two items were deleted based on a high degree of item misfit, together with negative patient feedback. The remaining 15 items formed one eight-item scale named OH-QoL score, a two-item information scale, a two-item scale regarding dentures, and three single items (sticky saliva/mouth soreness/sensitivity to food/drink). Face and convergent validity and internal consistency were confirmed. Test-retest reliability (n = 60) was demonstrated as was RCA for patients undergoing chemotherapy (n = 117; p = 0.06). The resulting QLQ-OH15 discriminated between clinically distinct patient groups, e.g., low performance status vs. higher (p < 0.001), and head-and-neck cancer versus other cancers (p < 0.03). The EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.
14 CFR 63.59 - Retesting after failure.

Science.gov (United States)

2010-01-01

... 14 Aeronautics and Space 2 2010-01-01 2010-01-01 false Retesting after failure. 63.59 Section 63.59 Aeronautics and Space FEDERAL AVIATION ADMINISTRATION, DEPARTMENT OF TRANSPORTATION (CONTINUED... failure. (a) An applicant for a flight navigator certificate who fails a written or practical test for...
Activity in prelimbic cortex is required for adjusting the anxiety response level during the elevated plus-maze retest.

Science.gov (United States)

Stern, C A J; Do Monte, F H M; Gazarini, L; Carobrez, A P; Bertoglio, L J

2010-09-29

The prelimbic (PL) subregion of medial prefrontal cortex has been implicated in anxiety regulation. It is unknown, however, whether PL cortex also serves to fine-tune the level of anxiety-related behavior exhibited on the next exposure to the same potentially threatening situation. To address this, we infused cobalt (1.0 mM) to temporarily inactivate the PL cortex during testing, post-testing or retesting in the elevated plus-maze (EPM). This protocol was chosen because it allowed us to concurrently investigate anxiety and the process of aversive learning and memory. PL cortex inactivation during the EPM testing increased the exploration of open arms, substantiating its role in anxiety. PL cortex inactivation during the EPM retesting counteracted the further avoidance of open arms exhibited by rats. Interestingly, as evidenced by min-by-min analysis, the cobalt-treated group behaved on EPM retesting as did the vehicle-treated group on EPM testing. This result may imply that activity in PL cortex is necessary for retrieving previously learned information that adjusts the anxiety response level on EPM retesting. Alternatively, a simple reduction in anxiety could explain the cobalt-induced increase in retest open-arm exploration. Neither test nor post-test PL cortex inactivation affected the further avoidance of open arms observed on EPM retesting. To extend the investigation of the role of PL cortex in the regulation of open-arm avoidance, we infused other drugs prior to testing or retesting in the EPM. Antagonism of PL cortex adrenergic beta-1 receptors with atenolol (10 nmol), cholinergic muscarinic receptors with scopolamine (20 nmol) or glutamatergic N-methyl-d-aspartic acid (NMDA) receptors with AP5 (6.0 nmol) interfered with the level of open-arm exploration on testing, but not on retesting. Copyright 2010 IBRO. Published by Elsevier Ltd. All rights reserved.
The Validity and Reliability Test of the Indonesian Version of Gastroesophageal Reflux Disease Quality of Life (GERD-QOL) Questionnaire.

Science.gov (United States)

Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti

2017-01-01

To obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. At the initial stage, the GERD-QOL questionnaire was translated into Indonesian, and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the research team, and an Indonesian version of the GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of the GERD-QOL questionnaire and the SF-36 questionnaire. Validity was evaluated using construct validity and external validity, and reliability was tested using internal consistency and test-retest methods. The Indonesian version of the GERD-QOL questionnaire had good internal consistency reliability, with a Cronbach's alpha of 0.687-0.842, and good test-retest reliability, with an intra-class correlation coefficient of 0.756-0.936; pGERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
We need more replication research - A case for test-retest reliability.

Science.gov (United States)

Leppink, Jimmie; Pérez-Fuster, Patricia

2017-06-01

Following debates in psychology on the importance of replication research, we have also started to see pleas for a more prominent role for replication research in medical education. To enable replication research, it is of paramount importance to carefully study the reliability of the instruments we use. Cronbach's alpha has been the most widely used estimator of reliability in the field of medical education, notably as some kind of quality label of test or questionnaire scores based on multiple items or of the reliability of assessment across exam stations. However, as this narrative review outlines, Cronbach's alpha or alternative reliability statistics may complement but not replace psychometric methods such as factor analysis. Moreover, multiple-item measurements should be preferred over single-item measurements, and when using single-item measurements, coefficients such as Cronbach's alpha should not be interpreted as indicators of the reliability of a single item when that item is administered after fundamentally different activities, such as learning tasks that differ in content. Finally, if we want to follow up on recent pleas for more replication research, we have to start studying the test-retest reliability of the instruments we use.
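Because Cronbach's alpha is the statistic at the centre of this review, it may help to recall how it is computed: alpha = k/(k-1) * (1 - sum of item variances / variance of the total score), where k is the number of items. The sketch below implements that standard formula with the Python standard library. The function name `cronbach_alpha` and the five-respondent, three-item score matrix are hypothetical and are not taken from the review.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical data: 5 respondents x 3 items (illustration only).
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 4, 4],
    [3, 3, 4],
]
print(f"alpha = {cronbach_alpha(responses):.3f}")
```

The formula uses only a single administration of the items, which is why, as the review argues, it cannot stand in for a test-retest design that measures stability over time.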
Test-retest reliability of the Middlesex Assessment of Mental State (MEAMS): a preliminary investigation in people with probable dementia.

Science.gov (United States)

Powell, T; Brooker, D J; Papadopolous, A

1993-05-01

Relative and absolute test-retest reliability of the MEAMS was examined in 12 subjects with probable dementia and 12 matched controls. Relative reliability was good. Measures of absolute reliability showed scores changing by up to 3 points over an interval of a week. A version effect was found to be in evidence.
Sino-Nasal Outcome Test-22: Translation, Cross-cultural Adaptation, and Validation in Hebrew-Speaking Patients.

Science.gov (United States)

Shapira Galitz, Yael; Halperin, Doron; Bavnik, Yosef; Warman, Meir

2016-05-01

To perform the translation, cross-cultural adaptation, and validation of the Sino-Nasal Outcome Test-22 (SNOT-22) questionnaire to the Hebrew language. A single-center prospective cross-sectional study. Seventy-three chronic rhinosinusitis (CRS) patients and 73 patients without sinonasal disease filled the Hebrew version of the SNOT-22 questionnaire. Fifty-one CRS patients underwent endoscopic sinus surgery, out of which 28 filled a postoperative questionnaire. Seventy-three healthy volunteers without sinonasal disease also answered the questionnaire. Internal consistency, test-retest reproducibility, validity, and responsiveness of the questionnaire were evaluated. Questionnaire reliability was excellent, with a high internal consistency (Cronbach's alpha coefficient, 0.91-0.936) and test-retest reproducibility (Spearman's coefficient, 0.962). Mean scores for the preoperative, postoperative, and control groups were 50.44, 29.64, and 13.15, respectively (P < .0001 for CRS vs controls, P < .001 for preoperative vs postoperative), showing validity and responsiveness of the questionnaire. The Hebrew version of SNOT-22 questionnaire is a valid outcome measure for patients with CRS with or without nasal polyps. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Validity and test-retest reliability of manual goniometers for measuring passive hip range of motion in femoroacetabular impingement patients.

Directory of Open Access Journals (Sweden)

Nussbaumer Silvio

2010-08-01

Background: The aims of this study were to evaluate the construct validity (known group), concurrent validity (criterion based) and test-retest (intra-rater) reliability of manual goniometers to measure passive hip range of motion (ROM) in femoroacetabular impingement patients and healthy controls. Methods: Passive hip flexion, abduction, adduction, internal and external rotation ROMs were simultaneously measured with a conventional goniometer and an electromagnetic tracking system (ETS) on two different testing sessions. A total of 15 patients and 15 sex- and age-matched healthy controls participated in the study. Results: The goniometer provided greater hip ROM values compared to the ETS (range 2.0-18.9 degrees; P P Conclusions: The present study suggests that goniometer-based assessments considerably overestimate hip joint ROM by measuring intersegmental angles (e.g., thigh flexion on trunk for hip flexion) rather than true hip ROM. It is likely that uncontrolled pelvic rotation and tilt due to difficulties in placing the goniometer properly and in performing the anatomically correct ROM contribute to the overrating of the arc of these motions. Nevertheless, conventional manual goniometers can be used with confidence for longitudinal assessments in the clinic.
The test of variables of attention (TOVA): Internal consistency (Q1 vs. Q2 and Q3 vs. Q4) in children with Attention Deficit/Hyperactivity Disorder (ADHD)

Science.gov (United States)

The internal consistency of the Test of Variables of Attention (TOVA) was examined in a cohort of 6- to 12-year-old children (N = 63) strictly diagnosed with ADHD. The internal consistency of errors of omission (OMM), errors of commission (COM), response time (RT), and response time variability (RTV...
Estimation of macular pigment optical density in the elderly: test-retest variability and effect of optical blur in pseudophakic subjects

NARCIS (Netherlands)

Gallaher, Kevin T.; Mura, Marco; Todd, Wm Andrew; Harris, Tarsha L.; Kenyon, Emily; Harris, Tamara; Johnson, Karen C.; Satterfield, Suzanne; Kritchevsky, Stephen B.; Iannaccone, Alessandro

2007-01-01

The reproducibility of macular pigment optical density (MPOD) estimates in the elderly was assessed in 40 subjects (age: 79.1 ± 3.5). Test-retest variability was good (Pearson's r coefficient: 0.734), with an average coefficient of variation (CV) of 18.4% and an intraclass correlation coefficient
Validity and test-retest reliability of a novel simple back extensor muscle strength test.

Science.gov (United States)

Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

2017-01-01

To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r = 0.824, p strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p strength ( p strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.
Validity and internal consistency of a whiplash-specific disability measure.

Science.gov (United States)

Pinfold, Melanie; Niere, Ken R; O'Leary, Elizabeth F; Hoving, Jan Lucas; Green, Sally; Buchbinder, Rachelle

2004-02-01

Cross-sectional study of patients with whiplash-associated disorders investigating the internal consistency, factor structure, response rates, and presence of floor and ceiling effects of the Whiplash Disability Questionnaire (WDQ). The aim of this study was to confirm the appropriateness of the proposed WDQ items. Whiplash injuries are a common cause of pain and disability after motor vehicle accidents. Neck disability questionnaires are often used in whiplash studies to assess neck pain but lack content validity for patients with whiplash-associated disorders. The newly developed WDQ measures functional limitations associated with whiplash injury and was designed after interviews with 83 patients with whiplash in a previous study. Researchers sought expert opinion on items of the WDQ, and items were then tested on a clinical whiplash population. Data were inspected to determine floor and ceiling effects, response rates, factor structure, and internal consistency. Packages of questionnaires were distributed to 55 clinicians, whose patients with whiplash completed and returned 101 questionnaires to researchers. No substantial floor or ceiling effects were identified on inspection of data. The overall floor effect was 12%, and the overall ceiling effect was 4%. Principal component analysis identified one broad factor that accounted for 65% of the variance in responses. Internal consistency was high; Cronbach's alpha = 0.96. Results of the study supported the retention of the 13 proposed items in a whiplash-specific disability questionnaire. Dependent on the results of further psychometric testing, the WDQ is likely to be an appropriate outcome measure for patients with whiplash.
Cross-cultural validation of the Falls Efficacy Scale-International (FES-I) in Portuguese community-dwelling older adults.

Science.gov (United States)

Figueiredo, Daniela; Santos, Sónia

The Falls Efficacy Scale-International (FES-I) is a highly reliable instrument to assess fear of falling among the older population. This study aimed to develop a European Portuguese version of the FES-I (FES-I (P)) and analyse its psychometric properties in terms of internal consistency, test-retest reliability, concurrent and convergent validity. A cross-sectional study was conducted. Data collection integrated a socio-demographic questionnaire which included falls history and presence/absence of fear of falling, the Activities-specific Balance Confidence Scale (ABC), the Hospital Anxiety and Depression Scale (HADS), the Timed Up and Go (TUG) and the Five Times Sit to Stand Test (FTSST). Descriptive and inferential statistical analyses were performed. A total of 100 Portuguese community-dwelling older people (74.27 ± 8.7 years old) participated in the study. Of these, 82 participated in the reliability study. The FES-I (P) had excellent internal consistency (α = 0.978) and test-retest reliability (ICC(2,1) = 0.999). A significant negative correlation was found between the FES-I (P) and the ABC (rs = -0.85; pPortuguese community-living older people. Future studies should explore the FES-I (P) responsiveness to change over time and analyse its psychometric properties in samples of both non-community-dwelling and community-dwelling older adults with different health conditions. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
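The "ICC 2,1" reported here follows the Shrout and Fleiss notation for a two-way random-effects, absolute-agreement, single-measurement intraclass correlation. It can be computed from two-way ANOVA mean squares as (MSR - MSE) / (MSR + (k-1)MSE + k(MSC - MSE)/n), with n subjects and k sessions. The sketch below is a plain-Python illustration of that formula, not the software used in the study. The function name `icc_2_1` and the small score matrices are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1): rows are subjects, columns are sessions (or raters)."""
    n = len(scores)          # number of subjects
    k = len(scores[0])       # number of sessions
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)                 # between-subjects mean square
    ms_cols = ss_cols / (k - 1)                 # between-sessions mean square
    ms_error = ss_error / ((n - 1) * (k - 1))   # residual mean square
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )

# Hypothetical test and retest scores for three subjects (illustration only).
identical = [[1, 1], [2, 2], [3, 3]]   # perfect agreement
shifted = [[1, 2], [2, 3], [3, 4]]     # every retest score is one point higher
print(f"identical: {icc_2_1(identical):.3f}")
print(f"shifted:   {icc_2_1(shifted):.3f}")
```

The shifted example keeps the rank order of subjects intact yet yields a lower coefficient, because an absolute-agreement ICC penalises systematic change between sessions as well as random error.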
Intra-Rater, Inter-Rater and Test-Retest Reliability of an Instrumented Timed Up and Go (iTUG) Test in Patients with Parkinson's Disease.

Directory of Open Access Journals (Sweden)

Rob C van Lummel

The "Timed Up and Go" (TUG) is a widely used measure of physical functioning in older people and in neurological populations, including Parkinson's Disease. When using an inertial sensor measurement system (instrumented TUG [iTUG]), the individual components of the iTUG and the trunk kinematics can be measured separately, which may provide relevant additional information. The aim of this study was to determine intra-rater, inter-rater and test-retest reliability of the iTUG in patients with Parkinson's Disease. Twenty-eight PD patients, aged 50 years or older, were included. For the iTUG the DynaPort Hybrid (McRoberts, The Hague, The Netherlands) was worn at the lower back. The device measured acceleration and angular velocity in three directions at a rate of 100 samples/s. Patients performed the iTUG five times on two consecutive days. Repeated measurements by the same rater on the same day were used to calculate intra-rater reliability. Repeated measurements by different raters on the same day were used to calculate intra-rater and inter-rater reliability. Repeated measurements by the same rater on different days were used to calculate test-retest reliability. Nineteen ICC values (15%) were ≥ 0.9, which is considered excellent reliability. Sixty-four ICC values (49%) were ≥ 0.70 and < 0.90, which is considered good reliability. Thirty-one ICC values (24%) were ≥ 0.50 and < 0.70, indicating moderate reliability. Sixteen ICC values (12%) were ≥ 0.30 and < 0.50, indicating poor reliability. Two ICC values (2%) were < 0.30, indicating very poor reliability. In conclusion, in patients with Parkinson's disease the intra-rater, inter-rater, and test-retest reliability of the individual components of the instrumented TUG (iTUG) was excellent to good for total duration and for turning durations, and good to low for the sub durations and for the kinematics of the SiSt and StSi. The results of this fully automated analysis of instrumented TUG movements
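The reliability bands used in this abstract can be written down as a simple lookup, which makes the boundary cases explicit. The sketch below encodes only the cut-offs stated in the abstract (0.90, 0.70, 0.50, 0.30). The function name `classify_icc` is hypothetical, and other authors use different thresholds.

```python
def classify_icc(icc):
    """Map an ICC value to the reliability label used in the abstract above."""
    if icc >= 0.90:
        return "excellent"
    if icc >= 0.70:
        return "good"
    if icc >= 0.50:
        return "moderate"
    if icc >= 0.30:
        return "poor"
    return "very poor"

# Hypothetical ICC values (illustration only).
for value in (0.95, 0.90, 0.72, 0.55, 0.31, 0.10):
    print(f"ICC {value:.2f}: {classify_icc(value)}")
```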

[The reliability of a questionnaire regarding Colombian children's physical activity].

Science.gov (United States)

Herazo-Beltrán, Aliz Y; Domínguez-Anaya, Regina

2012-10-01

Reporting the test-retest reliability and internal consistency of the Physical Activity Questionnaire for school children (PAQ-C). This was a descriptive study of 100 school-aged children aged 9 to 11 years attending a school in Cartagena, Colombia. The sample was randomly selected. The PAQ-C was given twice, one week apart, after the informed consent forms had been signed by the children's parents and school officials. Cronbach's alpha coefficient was used for assessing internal consistency and an intra-class correlation coefficient for test-retest reliability. SPSS (version 17.0) was used for statistical analysis. Internal consistency was 0.73 at the first measurement and 0.78 at the second; the intra-class correlation coefficient was 0.60. There were differences between boys and girls at both measurements. The PAQ-C had acceptable internal consistency and test-retest reliability, thereby making it useful for measuring children's self-reported physical activity and a valuable tool for population studies in Colombia.
Delimiting Coefficient α from Internal Consistency and Unidimensionality

Science.gov (United States)

Sijtsma, Klaas

2015-01-01

I discuss the contribution by Davenport, Davison, Liou, & Love (2015) in which they relate reliability represented by coefficient α to formal definitions of internal consistency and unidimensionality, both proposed by Cronbach (1951). I argue that coefficient α is a lower bound to reliability and that concepts of internal consistency and…
TEST-RETEST RELIABILITY OF HAND GRIP STRENGTH MEASUREMENT USING A JAMAR HAND DYNAMOMETER IN PATIENTS WITH ACUTE AND CHRONIC CERVICAL RADICULOPATHY

Directory of Open Access Journals (Sweden)

Ejazi G

2017-12-01

Background: To evaluate the test-retest reliability of the Jamar hand-held dynamometer for measuring handgrip strength (HGS) in patients with acute and chronic cervical radiculopathy, and to determine the difference in handgrip strength between acute and chronic cervical radiculopathy. Methods: A prospective, observational, non-experimental, comparative study design was used. A sample of 72 subjects (37 women and 35 men) with cervical radiculopathy was divided into two groups, Group A (acute) and Group B (chronic). Handgrip strength was measured using the Jamar hand-held dynamometer on two occasions by the same rater with an interval of 7 days. Data collection was based on the standard guidelines of the American Society of Hand Therapists. Three gripping trials (measured in kg) with the patient's arm in a standardized position were recorded. The data were analyzed using the mean score obtained from the sample. Result: One-way analysis of variance (ANOVA) was used to evaluate test-retest reliability, and the Tukey-Kramer multiple comparison test was used to find the difference in handgrip strength between acute and chronic cervical radiculopathy cases. Greater P-value (>0.05) in both testing sessions, as well as 95% of the confidence interval, shows the reliability of the instrument and lesser p-value (0.05 in female subjects shows no significant difference in handgrip strength between the two groups. Conclusion: Excellent test-retest reliability for handgrip strength measurement was found in patients with acute and chronic cervical radiculopathy, showing that the equipment could be used as an assessment tool for these patients. A significant difference exists in male handgrip strength between acute and chronic cervical radiculopathy cases, whereas no difference exists in female handgrip strength between acute and chronic cervical radiculopathy cases.
Test-retest reliability and minimal detectable change of two simplified 3-point balance measures in patients with stroke.

Science.gov (United States)

Chen, Yi-Miau; Huang, Yi-Jing; Huang, Chien-Yu; Lin, Gong-Hong; Liaw, Lih-Jiun; Lee, Shih-Chieh; Hsieh, Ching-Lin

2017-10-01

The 3-point Berg Balance Scale (BBS-3P) and 3-point Postural Assessment Scale for Stroke Patients (PASS-3P) were simplified from the BBS and PASS to overcome the complex scoring systems. The BBS-3P and PASS-3P were more feasible in busy clinical practice and showed similarly sound validity and responsiveness to the original measures. However, the reliability of the BBS-3P and PASS-3P is unknown, limiting their utility and the interpretability of scores. We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BBS-3P and PASS-3P in patients with stroke. Cross-sectional study. The rehabilitation departments of a medical center and a community hospital. A total of 51 chronic stroke patients (64.7% male). Both balance measures were administered twice 7 days apart. The test-retest reliability of both the BBS-3P and PASS-3P was examined by intraclass correlation coefficients (ICC). The MDC and its percentage over the total score (MDC%) of each measure were calculated for examining the random measurement errors. The ICC values of the BBS-3P and PASS-3P were 0.99 and 0.97, respectively. The MDC% (MDC) of the BBS-3P and PASS-3P were 9.1% (5.1 points) and 8.4% (3.0 points), respectively, indicating that both measures had small and acceptable random measurement errors. Our results showed that both the BBS-3P and the PASS-3P had good test-retest reliability, with small and acceptable random measurement error. These two simplified 3-level balance measures can provide reliable results over time. Our findings support the repeated administration of the BBS-3P and PASS-3P to monitor the balance of patients with stroke. The MDC values can help clinicians and researchers interpret the change scores more precisely.
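The abstract above does not state how the MDC was computed. As a rough sketch, a commonly used convention derives the standard error of measurement (SEM) from the baseline standard deviation and the ICC, then scales it to a 95% MDC. The function names, the sample SD of 10, and the 56-point total score are illustrative assumptions, not values taken from the study.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement under the common convention SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimal detectable change at 95% confidence: 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * sem


def mdc_percent(mdc, max_score):
    """MDC expressed as a percentage of the scale's total score."""
    return 100.0 * mdc / max_score


if __name__ == "__main__":
    # Hypothetical inputs: SD of 10 points and an ICC of 0.99.
    sem = sem_from_icc(10.0, 0.99)
    print(round(sem, 2), round(mdc95(sem), 2))
    # Assumed 56-point total score, used only to illustrate the percentage form.
    print(round(mdc_percent(5.1, 56), 1))
```

The factor sqrt(2) reflects that a change score combines the measurement error of two administrations; other conventions (for example, a different confidence level) would change the multiplier.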
Translation and Validation of the Korean Version of the International Knee Documentation Committee Subjective Knee Form

Science.gov (United States)

Kim, Jin Goo; Lee, Joong Yub; Seo, Seung Suk; Choi, Choong Hyeok; Lee, Myung Chul

2013-01-01

Purpose: To perform a cross-cultural adaptation and to test the measurement properties of the Korean version of the International Knee Documentation Committee (K-IKDC) Subjective Knee Form. Materials and Methods: According to the guidelines for cross-cultural adaptation, translation and backward translation of the English version of the IKDC Subjective Knee Form were performed. After translation into the Korean version, 150 patients who had knee-related problems were asked to complete the K-IKDC, Lysholm score, and Short Form-36 (SF-36). Of these patients, 126 were retested 2 weeks later to evaluate test-retest reliability, and 104 were recruited 3 months later to evaluate responsiveness. Construct validity was analyzed by investigating the correlation with Lysholm score and SF-36; content validity was also evaluated. Standardized mean response was calculated for evaluating responsiveness. Results: The test-retest reliability proved excellent with a high value for the intraclass correlation coefficient (r=0.94). The internal consistency was strong (Cronbach's α=0.91). Good content validity with absence of floor or ceiling effects and good convergent and divergent validity were observed. Moderate responsiveness was shown (standardized mean response=0.689). Conclusions: The K-IKDC demonstrated good measurement properties. We suggest that this instrument is an excellent evaluation instrument that can be used for Korean patients with knee-related injuries. PMID:24032098
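For readers unfamiliar with the internal consistency statistic reported above, the following is a minimal sketch of Cronbach's α computed from a respondents-by-items score table. The function name and the toy data are illustrative assumptions; the study's own data and software are not described in the abstract.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a table with one row per respondent and one column per item."""
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical two-item questionnaire answered by four respondents.
    toy = [[1, 2], [2, 1], [3, 4], [4, 3]]
    print(cronbach_alpha(toy))
```

Alpha rises as items covary more strongly relative to their individual variances, which is why it is read as a measure of how consistently the items track a common score.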
Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.

Science.gov (United States)

Oremus, Mark; Oremus, Carolina; Hall, Geoffrey B C; McKinnon, Margaret C

2012-01-01

Quality assessment of included studies is an important component of systematic reviews. The authors investigated inter-rater and test-retest reliability for quality assessments conducted by inexperienced student raters. Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle-Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13-20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. McMaster Integrative Neuroscience Discovery and Study Program. 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test-retest reliability using ICC(2,1). Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI -0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were -0.14 (95% CI -0.28 to 0.00) to 0.39 (95% CI -0.02 to 0.81) for the NOS cohort and -0.20 (95% CI -0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case-control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test-retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were -0.19 (95% CI -0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case-control, the ICC(2,1)s were 0.46 (95% CI -0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). Inter-rater reliability was generally poor
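The κ statistic used above corrects raw agreement between two raters for the agreement expected by chance. The sketch below assumes the unweighted Cohen's κ for two raters; the abstract does not say which variant the authors used, and the function name and ratings are hypothetical.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters scoring the same items."""
    n = len(rater_a)
    # Observed proportion of items on which the raters agree.
    observed = sum(x == y for x, y in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Agreement expected by chance from each rater's marginal frequencies.
    expected = sum(
        counts_a[c] * counts_b[c] for c in set(rater_a) | set(rater_b)
    ) / (n * n)
    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Hypothetical yes/no (1/0) answers to one scale question on four articles.
    print(cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0]))
```

Negative values, like some of those reported in the abstract, arise when observed agreement falls below the chance-expected level.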
Test-retest paradigm of the forced swimming test in female mice is not valid for predicting antidepressant-like activity: participation of acetylcholine and sigma-1 receptors.

Science.gov (United States)

Su, Jing; Hato-Yamada, Noriko; Araki, Hiroaki; Yoshimura, Hiroyuki

2013-01-01

The forced swimming test (FST) in mice is widely used to predict the antidepressant activity of a drug, but information describing the immobility of female mice is limited. We investigated whether a prior swimming experience affects the immobility duration in a second FST in female mice and whether the test-retest paradigm is a valid screening tool for antidepressants. Female ICR mice were exposed to the FST using two experimental paradigms: a single FST and a double FST in which mice had experienced FST once 24 h prior to the second trial. The initial FST experience reliably prolonged immobility duration in the second FST. The antidepressants imipramine and paroxetine significantly reduced immobility duration in the single FST, but not in the double FST. Scopolamine and the sigma-1 (σ1) antagonist NE-100 administered before the second trial significantly prevented the prolongation of immobility. Neither a 5-HT1A nor a 5-HT2A receptor agonist affected immobility duration. We suggest that the test-retest paradigm in female mice is not adequate for predicting antidepressant-like activity of a drug; the prolongation of immobility in the double FST is modulated through acetylcholine and σ1 receptors.
The Ostomy Adjustment Scale: translation into Norwegian language with validation and reliability testing.

Science.gov (United States)

Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin

2014-01-01

The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. The instrument is
Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease.

Science.gov (United States)

Strouwen, Carolien; Molenaar, Esther A L M; Keus, Samyra H J; Münks, Liesbeth; Bloem, Bastiaan R; Nieuwboer, Alice

2016-08-01

Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains largely unknown. The purpose of this study was to assess the reliability of DT outcome measures in patients with PD. A repeated-measures design was used. Patients with PD ("on" medication, Mini-Mental State Examination score ≥24) performed 2 cognitive tasks (ie, backward digit span task and auditory Stroop task) and 1 functional task (ie, mobile phone task) in combination with walking. Tasks were assessed at 2 time points (same hour) with an interval of 6 weeks. Test-retest reliability was assessed for gait while performing each secondary task (DT gait), for both cognitive tasks while walking (DT cognitive), and for the functional task while walking (DT functional). Sixty-two patients with PD (age=39-89 years, Hoehn and Yahr stages II-III) were included in the study. Intraclass correlation coefficients (ICCs) showed excellent reliability for DT gait measures, ranging between .86 and .95 when combined with the digit span task, between .86 and .95 when combined with the auditory Stroop task, and between .72 and .90 when combined with the mobile phone task. The standard error of measurements for DT gait speed varied between 0.06 and 0.08 m/s, leading to minimal detectable changes between 0.16 and 0.22 m/s. With regard to DT cognitive measures, reaction times showed good-to-excellent reliability (digit span task: ICC=.75; auditory Stroop task: ICC=.82). The results cannot be generalized to patients with advanced disease or to other DT measures. In people with PD, DT measures proved to be reliable for use in clinical studies and look promising for use in clinical practice to assess improvements after DT training. Large effects, however, are needed to obtain meaningful effect sizes.
Assessment of behavioral mechanisms maintaining encopresis: Virginia Encopresis-Constipation Apperception Test.

Science.gov (United States)

Cox, Daniel J; Ritterband, Lee M; Quillian, Warren; Kovatchev, Boris; Morris, James; Sutphen, James; Borowitz, Stephen

2003-09-01

To develop and test a scale for parent and child, evaluating theoretical and clinical parameters relevant to children with encopresis. Encopretic children were hypothesized to have more bowel-specific, but not more generic, psychological problems, as compared with nonsymptomatic control children. In addition, mothers were also believed to be more discerning than children. The Virginia Encopresis-Constipation Apperception Test (VECAT) consists of 9 pairs of bowel-specific and 9 parallel generic drawings. Respondents selected the picture in each pair that best described them/their child. It was administered to encopretic children (N = 87), nonsymptomatic siblings (N = 27), and nonsymptomatic nonsiblings (N = 35). The mothers of all the participants also completed the VECAT. Encopretic children were retested 6 and 12 months posttreatment with Enhanced Toilet Training. The VECAT demonstrated good test-retest reliability and internal consistency. Encopretic children and their mothers reported more bowel-specific, but not more generic, problems. Bowel-specific scores improved significantly posttreatment only for those patients who demonstrated significant symptom improvement. Mothers were significantly more discerning than children. The VECAT is a reliable, valid, discriminating, and sensitive test. Bowel-specific problems appear to best differentiate children with and without encopresis.
Test-retest reliability of Physical Activity Neighborhood Environment Scale among urban men and women in Nanjing, China.

Science.gov (United States)

Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F

2018-03-01

The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. Intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. In total, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Center for Epidemiologic Studies Depression Scale for Children: psychometric testing of the Chinese version.

Science.gov (United States)

Li, Ho Cheung William; Chung, Oi Kwan Joyce; Ho, Ka Yan

2010-11-01

This paper is a report of psychometric testing of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children. The availability of a valid and reliable instrument that accurately detects depressive symptoms in children is crucial before any psychological intervention can be appropriately planned and evaluated. There is no such instrument for Chinese children. A test-retest, within-subjects design was used. A total of 313 primary school students between the ages of 8 and 12 years were invited to participate in the study in 2009. Participants were asked to respond to the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children, the short form of the State Anxiety Scale for Children and Rosenberg's Self-Esteem Scale. The internal consistency, content validity, construct validity and test-retest reliability of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children were assessed. The newly-translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly-translated scale can be used as a self-report assessment tool in detecting depressive symptoms of Chinese children aged between 8 and 12 years. © 2010 Blackwell Publishing Ltd.
Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system.

Science.gov (United States)

Thomas, Marianna S; Newman, David; Leinhard, Olof Dahlqvist; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N; Karlsson, Anette; Rosander, Johannes; Borga, Magnus; Toms, Andoni P

2014-09-01

To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19.32 L (SD 9.1) and 19.28 L (SD 9.12) for first and second acquisitions (intraclass correlation coefficient (ICC) = 1.0, 95% level of agreement -0.32 to 0.2 L). ICC for all automated test-retest muscle volumes were almost perfect (0.99-1.0) with 95% levels of agreement 1.8-6.6% of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1.68 L (2SD 0.6) compared to automated 1.64 L (2SD 0.6), left lower leg: manual 1.69 L (2SD 0.64) compared to automated 1.63 L (SD 0.61), correlation coefficients for automated and manual segmentation were 0.94-0.96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. Sarcopaenia is an important reversible complication of a number of diseases. Manual quantification of muscle volume is time-consuming and expensive. Muscles can be imaged using in and out of phase MRI. Automated atlas-based segmentation can identify muscle groups. Automated muscle volume segmentation is reproducible and can replace manual measurements.
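The "95% level of agreement" reported above is assumed here to correspond to Bland-Altman limits of agreement, that is, the mean test-retest difference plus or minus 1.96 standard deviations of the differences. The abstract does not confirm this, so the sketch below, with its hypothetical function name and volumes, is only one plausible reading.

```python
from statistics import mean, stdev


def limits_of_agreement(first, second):
    """Return (lower limit, bias, upper limit) for paired repeat measurements."""
    # Per-subject difference between the second and first acquisition.
    diffs = [b - a for a, b in zip(first, second)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias - spread, bias, bias + spread


if __name__ == "__main__":
    # Hypothetical muscle volumes in litres for four volunteers on two occasions.
    scan_1 = [10.0, 12.0, 14.0, 16.0]
    scan_2 = [11.0, 11.0, 15.0, 15.0]
    print(limits_of_agreement(scan_1, scan_2))
```

Unlike the ICC, these limits are expressed in the measurement's own units, which makes it easier to judge whether the repeat-scan error matters clinically.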
Consistency of Flashbulb Memories of September 11 over Long Delays: Implications for Consolidation and Wrong Time Slice Hypotheses

Science.gov (United States)

Kvavilashvili, Lia; Mirani, Jennifer; Schlagman, Simone; Foley, Kerry; Kornbrot, Diana E.

2009-01-01

The consistency of flashbulb memories over long delays provides a test of theories of memory for highly emotional events. This study used September 11, 2001 as the target event, with test-retest delays of 2 and 3 years. The nature and consistency of flashbulb memories were examined as a function of delay between the target event and an initial…
Multilevel Factor Structure, Concurrent Validity, and Test-Retest Reliability of the High School Teacher Version of the Authoritative School Climate Survey

Science.gov (United States)

Huang, Francis L.; Cornell, Dewey G.

2016-01-01

Although school climate has long been recognized as an important factor in the school improvement process, there are few psychometrically supported measures based on teacher perspectives. The current study replicated and extended the factor structure, concurrent validity, and test-retest reliability of the teacher version of the Authoritative…
Psychometric evaluation of the internalized stigma of mental illness scale for patients with mental illnesses: measurement invariance across time.

Directory of Open Access Journals (Sweden)

Chih-Cheng Chang

BACKGROUND: The current investigation examined the psychometric properties of the Internalized Stigma of Mental Illness (ISMI) scale in a sample of patients with mental illness. In addition to the internal consistency, test-retest reliability, and concurrent validity that previous studies have tested for the ISMI, we extended the evaluation to its construct validity and measurement invariance using confirmatory factor analysis (CFA). METHODS: Three hundred forty-seven participants completed two questionnaires (i.e., the ISMI and the Depression and Somatic Symptoms Scale [DSSS]), and 162 filled out the ISMI again after 50.23±31.18 days. RESULTS: The results of this study confirmed the frame structure of the ISMI; however, the Stigma Resistance subscale in the ISMI seemed weak. In addition, internal consistency, test-retest reliability, and concurrent validity were all satisfactory for all subscales and the total score of the ISMI, except for Stigma Resistance (α = 0.66; ICC = 0.52; and r = 0.02 to 0.06 with DSSS). Therefore, we hypothesize that Stigma Resistance is a new concept rather than a concept in internalized stigma. The acceptable fit indices supported the measurement invariance of the ISMI across time, and suggested that people with mental illness interpret the ISMI items the same at different times. CONCLUSION: The clinical implication of our finding is that clinicians, when they design interventions, may want to use the valid and reliable ISMI without the Stigma Resistance subscale to evaluate the internalized stigma of people with mental illness.
The Unsupported Upper Limb Exercise Test in People Without Disabilities: Assessing the Within-Day Test-Retest Reliability and the Effects of Age and Gender.

Science.gov (United States)

Oliveira, Ana; Cruz, Joana; Jácome, Cristina; Marques, Alda

2018-01-01

Purpose: To estimate the within-day test-retest reliability and standard error of measurement (SEM) of the unsupported upper limb exercise test (UULEX) in adults without disabilities and to determine the effects of age and gender on performance of the UULEX. Method: A cross-sectional study was conducted with 100 adults without disabilities (44 men, mean age 44.2 [SD 26] y; 56 women, mean age 38.1 [SD 24.1] y). Participants performed three UULEX tests to establish within-day reliability, measured using an intra-class correlation coefficient (ICC) model 2 (two-way random effects) with a single rater (ICC[2,1]) and SEM. The effects of age and gender were examined using two-factor mixed-design analysis of variance (ANOVA) and one-way repeated-measures ANOVA. For analysis purposes, four sub-groups were created: younger adults, older adults, men, and women. Results: Excellent within-day reliability and a small SEM were found in the four sub-groups (younger adults: ICC[2,1]=0.88; 95% CI: 0.82, 0.92; SEM∼40 s; older adults: ICC[2,1]=0.82; 95% CI: 0.72, 0.90; SEM∼50 s; men: ICC[2,1]=0.93; 95% CI: 0.88, 0.96; SEM∼30 s; women: ICC[2,1]=0.85; 95% CI: 0.78, 0.91; SEM∼45 s). Younger adults took, on average, 308.24 seconds longer than older adults to perform the test; older adults performed significantly better on the third test ( p 0.05). Conclusion: The within-day test-retest reliability and SEM values of the UULEX may be used to define the magnitude of the error obtained with repeated measures. One UULEX test seems to be adequate for younger adults to achieve reliable results, whereas three tests seem to be needed for older adults.
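The ICC(2,1) named in the abstract above is the two-way random-effects, single-measurement, absolute-agreement coefficient in the Shrout and Fleiss scheme. As a minimal sketch, it can be computed from the mean squares of a subjects-by-trials table; the function name is an assumption, and the example table is the six-target, four-judge data set widely reproduced from Shrout and Fleiss, not data from this study.

```python
def icc_2_1(scores):
    """ICC(2,1) for a table with one row per subject and one column per trial or rater."""
    n = len(scores)        # number of subjects
    k = len(scores[0])     # number of trials or raters
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


if __name__ == "__main__":
    table = [
        [9, 2, 5, 8],
        [6, 1, 3, 2],
        [8, 4, 6, 8],
        [7, 1, 2, 6],
        [10, 5, 6, 9],
        [6, 2, 4, 7],
    ]
    print(round(icc_2_1(table), 2))
```

Because the column (trial) mean square enters the denominator, systematic shifts between trials, such as a learning effect across repeated tests, lower this coefficient even when subjects keep the same rank order.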
Fundamentals of endoscopic surgery: creation and validation of the hands-on test.

Science.gov (United States)

Vassiliou, Melina C; Dunkin, Brian J; Fried, Gerald M; Mellinger, John D; Trus, Thadeus; Kaneva, Pepa; Lyons, Calvin; Korndorffer, James R; Ujiki, Michael; Velanovich, Vic; Kochman, Michael L; Tsuda, Shawn; Martinez, Jose; Scott, Daniel J; Korus, Gary; Park, Adrian; Marks, Jeffrey M

2014-03-01

The Fundamentals of Endoscopic Surgery™ (FES) program consists of online materials and didactic and skills-based tests. All components were designed to measure the skills and knowledge required to perform safe flexible endoscopy. The purpose of this multicenter study was to evaluate the reliability and validity of the hands-on component of the FES examination, and to establish the pass score. Expert endoscopists identified the critical skill set required for flexible endoscopy. These skills were then modeled in a virtual reality simulator (GI Mentor™ II, Simbionix™ Ltd., Airport City, Israel) to create five tasks and metrics. Scores were designed to measure both speed and precision. Validity evidence was assessed by correlating performance with self-reported endoscopic experience (surgeons and gastroenterologists [GIs]). Internal consistency of each test task was assessed using Cronbach's alpha. Test-retest reliability was determined by having the same participant perform the test a second time and comparing their scores. Passing scores were determined by a contrasting groups methodology and use of receiver operating characteristic curves. A total of 160 participants (17% GIs) performed the simulator test. Scores on the five tasks showed good internal consistency reliability and all had significant correlations with endoscopic experience. Total FES scores correlated 0.73 with participants' level of endoscopic experience, providing evidence of their validity, and their internal consistency reliability (Cronbach's alpha) was 0.82. Test-retest reliability was assessed in 11 participants, and the intraclass correlation was 0.85. The passing score was determined and is estimated to have a sensitivity (true positive rate) of 0.81 and a 1-specificity (false positive rate) of 0.21. The FES hands-on skills test examines the basic procedural components required to perform safe flexible endoscopy. It meets rigorous standards of reliability and validity required for high
Validation of the Spanish version of the Test for Respiratory and Asthma Control in Kids (TRACK) in a population of Hispanic preschoolers.

Science.gov (United States)

Rodríguez-Martínez, Carlos E; Nino, Gustavo; Castro-Rodriguez, Jose A

2014-01-01

There is a critical need for validation studies of questionnaires designed to assess the level of control of asthma in children younger than 5 years old. To validate the Spanish version of the Test for Respiratory and Asthma Control in Kids (TRACK) questionnaire in children younger than age 5 years with symptoms consistent with asthma. In a prospective cohort validation study, parents and/or caregivers of children younger than age 5 years and with symptoms consistent with asthma, during a baseline and a follow-up visit 2 to 6 weeks later, completed the information required to assess the content validity, criterion validity, construct validity, test-retest reliability, sensitivity to change, internal consistency reliability, and usability of the TRACK questionnaire. Median (interquartile range) of the TRACK scores were significantly different between patients with well-controlled asthma, patients with not well-controlled asthma, and patients with very poorly controlled asthma (90.0 [75.0-95.0], 75.0 [55.0-85.0], and 35.0 [25.0-55.0], respectively, P Spanish version of the TRACK questionnaire has excellent sensitivity to change and usability; adequate criterion validity, construct validity, and test-retest reliability; and an acceptable internal consistency, when used in children younger than age 5 years with symptoms consistent with asthma. Copyright © 2014 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Preclinical evaluation and test-retest studies of [18F]PSS232, a novel radioligand for targeting metabotropic glutamate receptor 5 (mGlu5)

Energy Technology Data Exchange (ETDEWEB)

Milicevic Sephton, Selena; Mueller Herde, Adrienne; Keller, Claudia; Ruedisuehli, Sonja; Schibli, Roger; Kraemer, Stefanie D.; Ametamey, Simon M. [Center for Radiopharmaceutical Sciences of ETH, PSI and USZ, Zurich (Switzerland); Mu, Linjing [University Hospital Zuerich, Department of Nuclear Medicine, Zuerich (Switzerland); Auberson, Yves [Novartis Institutes for Biomedical Research, Novartis Pharma AG, Basel (Switzerland)

2015-01-15

A novel, 18F-labelled metabotropic glutamate receptor subtype 5 (mGlu5) derivative of [11C]ABP688 ([11C]1), [18F]PSS232 ([18F]5), was evaluated in vitro and in vivo for its potential as a PET agent and was used in test-retest reliability studies. The radiosynthesis of [18F]5 was accomplished via a one-step reaction using a mesylate precursor. In vitro stability was determined in PBS and plasma, and with liver microsomal enzymes. Metabolite studies were performed using rat brain extracts, blood and urine. In vitro autoradiography was performed on horizontal slices of rat brain using 1 and 8, antagonists for mGlu5 and mGlu1, respectively. Small-animal PET, biodistribution, and test-retest studies were performed in Wistar rats. In vivo, dose-dependent displacement studies were performed using 6 and blocking studies with 7. [18F]5 was obtained in decay-corrected maximal radiochemical yield of 37% with a specific activity of 80-400 GBq/μmol. Treatment with rat and human microsomal enzymes in vitro for 60 min resulted in 20% and 4% of hydrophilic radiometabolites, respectively. No hydrophilic decomposition products or radiometabolites were found in PBS or plasma. In vitro autoradiography on rat brain slices showed a heterogeneous distribution consistent with the known distribution of mGlu5 with high binding to hippocampal and cortical regions, and negligible radioactivity in the cerebellum. Similar distribution of radioactivity was found in PET images. Under displacement conditions with 6, reduced [18F]5 binding was found in all brain regions except the cerebellum. 7 reduced binding in the striatum by 84% on average. Test-retest studies were reproducible with a variability ranging from 6.8% to 8.2%. An extended single-dose toxicity study in Wistar rats showed no compound-related adverse effects. The new mGlu5 radiotracer, [18F]5, showed specific and selective in vitro and in vivo

Reliability and validity of a questionnaire to measure personal, social and environmental correlates of fruit and vegetable intake in 10-11-year-old children in five European countries

DEFF Research Database (Denmark)

De Bourdeaudhuij, I; Klepp, K-I; Due, P

2005-01-01

To investigate the internal consistency of the scales and the test-retest reliability and predictive validity of behaviour theory-based constructs measuring personal, social and environmental correlates of fruit and vegetable intake in 10-11-year-old children.
Interaction between morphine and noradrenergic system of basolateral amygdala on anxiety and memory in the elevated plus-maze test based on a test-retest paradigm.

Science.gov (United States)

Valizadegan, Farhad; Oryan, Shahrbanoo; Nasehi, Mohammad; Zarrindast, Mohammad Reza

2013-05-01

The amygdala is the key brain structure for anxiety and emotional memory storage. We examined the involvement of β-adrenoreceptors in the basolateral amygdala (BLA) and their interaction with morphine in modulating these behaviors. The elevated plus-maze has been employed for investigating anxiety and memory. Male Wistar rats were used for this test. We injected morphine (4, 5, and 6 mg/kg) intraperitoneally, while salbutamol (albuterol) (1, 2, and 4 μg/rat) and propranolol (1, 2, and 4 μg/rat) were injected into the BLA. Open-arms time percentage (%OAT), open-arms entry percentage (%OAE), and locomotor activity were determined by this behavioral test. Retention was tested 24 hours later. Intraperitoneal injection of morphine (6 mg/kg) had an anxiolytic-like effect and improved memory. The highest dose of salbutamol decreased the anxiety parameters in the test session and improved memory in the retest session. Coadministration of salbutamol and an ineffective dose of morphine produced an anxiolytic response; in this case, memory was improved. Intra-BLA administration of propranolol (4 μg/rat) decreased %OAT in the test session but had no effect on memory formation. Coadministration of propranolol and morphine (6 mg/kg) showed an increase in %OAT. There was no significant change in the above-mentioned parameter in the retest session. Coadministration of morphine and propranolol with the effective dose of salbutamol showed that propranolol could reverse the anxiolytic-like effect. We found that opioidergic and β-adrenergic systems have the same effects on anxiety and memory in the BLA, but these effects are independent of each other.
Short-interval test-retest interrater reliability of the Dutch version of the structured clinical interview for DSM-IV personality disorders (SCID-II)

NARCIS (Netherlands)

Weertman, A; Arntz, A; Dreessen, L; van Velzen, C; Vertommen, S

2003-01-01

This study examined the short-interval test-retest reliability of the Structured Clinical Interview (SCID-II: First, Spitzer, Gibbon, & Williams, 1995) for DSM-IV personality disorders (PDs). The SCID-II was administered to 69 in- and outpatients on two occasions separated by 1 to 6 weeks. The
Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project

Directory of Open Access Journals (Sweden)

Singh Amika S

2012-08-01

Background: Insight into parental energy balance-related behaviours, their determinants and parenting practices is important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10–12 year old children. Findings: We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10–12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement. All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. Of the other 29 items, construct validity was moderate for 24 and poor for 5 items. Conclusions: The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appear to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
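
As an illustration of the two agreement statistics named in the abstract above, the following minimal Python sketch computes percentage agreement and an intraclass correlation for paired test-retest scores. The abstract does not say which ICC form was used, so the one-way random-effects ICC(1,1) shown here is an assumption, and the function names and sample scores are hypothetical.

```python
def percentage_agreement(test, retest):
    """Share of respondents (in %) who gave the identical answer both times."""
    matches = sum(1 for a, b in zip(test, retest) if a == b)
    return 100.0 * matches / len(test)


def icc_one_way(test, retest):
    """One-way random-effects ICC(1,1) for two measurements per subject.

    Assumption: the source abstract does not state which ICC form was used.
    """
    n, k = len(test), 2
    pairs = list(zip(test, retest))
    grand_mean = sum(a + b for a, b in pairs) / (n * k)
    subject_means = [(a + b) / k for a, b in pairs]
    # Between-subject mean square.
    msb = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    # Within-subject mean square.
    msw = sum((a - m) ** 2 + (b - m) ** 2
              for (a, b), m in zip(pairs, subject_means)) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)


# Hypothetical item scores from two administrations one week apart.
week1 = [1, 3, 5, 2, 4]
week2 = [2, 4, 6, 2, 4]
print(percentage_agreement(week1, week2))
print(round(icc_one_way(week1, week2), 3))
```

Under the thresholds quoted in the abstract, an item would count as good to excellent if the ICC exceeds .60 or the percentage agreement is at least 75%.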
Cross-cultural Adaptation of a Questionnaire on Self-perceived Level of Skills, Abilities and Competencies of Family Physicians in Albania.

Science.gov (United States)

Alla, Arben; Czabanowska, Katarzyna; Kijowska, Violetta; Roshi, Enver; Burazeri, Genc

2012-01-01

Our aim was to validate an international instrument measuring self-perceived competency level of family physicians in Albania. A representative sample of 57 family physicians operating in primary health care services was interviewed twice in March-April 2012 in Tirana (26 men and 31 women; median age: 46 years, inter-quartile range: 38-56 years). A structured questionnaire was administered [and subsequently re-administered after two weeks (test-retest)] to all family physicians aiming to self-assess physicians' level of abilities, skills and competencies regarding different domains of quality of health care. The questionnaire included 37 items organized into 6 subscales/domains. Answers for each item of the tool ranged from 1 ("novice" physicians) to 5 ("expert" physicians). An overall summary score (range: 37-185) and a subscale summary score for each domain were calculated for the test and retest procedures. Cronbach's alpha was used to assess the internal consistency for both the test and the retest procedures, whereas Spearman's rho was employed to assess the stability over time (test-retest reliability) of the instrument. Cronbach's alpha was 0.87 for the test and 0.86 for the retest procedure. Overall, Spearman's rho was 0.84 (P ... cross-cultural adaptation of an international instrument tapping self-perceived level of competencies of family physicians in Albania. The questionnaire displayed a satisfactory internal consistency for both test and retest procedures in this sample of family physicians in Albania. Furthermore, the high test-retest reliability (stability over time) of the instrument suggests a good potential for wide scale application to nationally representative samples of family physicians in Albanian populations.
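
The two statistics used in the abstract above, Cronbach's alpha for internal consistency and Spearman's rho for stability over time, can be sketched in a few lines of plain Python. This is only an illustrative sketch: the function names and the small data sets are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """scores: list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's summed score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


def average_ranks(values):
    """Rank values from 1..n, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed scores."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical 1-5 ratings: 5 respondents x 4 items (not data from the study).
ratings = [
    [4, 4, 5, 4],
    [2, 3, 2, 2],
    [5, 5, 4, 5],
    [3, 3, 3, 4],
    [1, 2, 1, 2],
]
# Hypothetical summary scores at test and at retest two weeks later.
test_totals = [17, 9, 19, 13, 6]
retest_totals = [16, 10, 18, 14, 7]

print(round(cronbach_alpha(ratings), 3))
print(round(spearman_rho(test_totals, retest_totals), 3))
```

Alpha is computed separately for each administration, whereas rho compares the same respondents' scores across the two administrations.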
Test-retest reproducibility of accommodative facility measures in primary school children.

Science.gov (United States)

Adler, Paul; Scally, Andrew J; Barrett, Brendan T

2018-05-08

To determine the test-retest reproducibility of accommodative facility (AF) measures in an unselected sample of UK primary school children. Using ±2.00 DS flippers and a viewing distance of 40 cm, AF was measured in 136 children (range 4-12 years, average 8.1 ± 2.1) by five testers on three occasions (average interval between successive tests: eight days, range 1-21 days). On each occasion, AF was measured monocularly and binocularly, for two minutes. Full datasets were obtained in 111 children (81.6 per cent). Intra-individual variation in AF was large (standard deviation [SD] = 3.8 cycles per minute [cpm]) and there was variation due to the identity of the tester (SD = 1.6 cpm). On average, AF was greater: (i) in monocular compared to binocular testing (by 1.4 cpm, p cpm, p cpm lower than in children ≥ 10 years old, p = 0.009); and (iv) on subsequent testing occasions (for example, visit-2 AF was 2.0 cpm higher than visit-1 AF, p cpm monocularly and ≥ 8 cpm binocularly), but this rose to 83.8 per cent after the third test. Using less stringent pass criteria (≥ 6 cpm monocularly and ≥ 3 cpm binocularly), the equivalent figures were 82.9 and 96.4 per cent, respectively. Reduced AF did not co-exist with abnormal near point of accommodation or reduced visual acuity. The results reveal considerable intra-individual variability in raw AF measures in children. When the results are considered as pass/fail, children who initially exhibit normal AF continued to do so on repeat testing. Conversely, the vast majority of children with initially reduced AF exhibit normal performance on repeat testing. Using established pass/fail criteria, the prevalence of persistently reduced AF in this sample is 3.6 per cent. © 2018 Optometry Australia.
Community Laboratory Testing for Cryptosporidium: Multicenter Study Retesting Public Health Surveillance Stool Samples Positive for Cryptosporidium by Rapid Cartridge Assay with Direct Fluorescent Antibody Testing.

Directory of Open Access Journals (Sweden)

Dawn M Roellig

Cryptosporidium is a common cause of sporadic diarrheal disease and outbreaks in the United States. Increasingly, immunochromatography-based rapid cartridge assays (RCAs) are providing community laboratories with a quick cryptosporidiosis diagnostic method. In the current study, the Centers for Disease Control and Prevention (CDC), the Association of Public Health Laboratories (APHL), and four state health departments evaluated RCA-positive samples obtained during routine Cryptosporidium testing. All samples underwent "head to head" re-testing using both RCA and direct fluorescence assay (DFA). Community level results from three sites indicated that 54.4% (166/305) of Meridian ImmunoCard STAT! positives and 87.0% (67/77) of Remel Xpect positives were confirmed by DFA. When samples were retested by RCA at state laboratories and compared with DFA, 83.3% (155/186) of Meridian ImmunoCard STAT! positives and 95.2% (60/63) of Remel Xpect positives were confirmed. The percentage of confirmed community results varied by site: Minnesota, 39.0%; New York, 63.9%; and Wisconsin, 72.1%. The percentage of confirmed community results decreased with patient age; 12.5% of community positive tests could be confirmed by DFA for patients 60 years of age or older. The percentage of confirmed results did not differ significantly by sex, storage temperature, time between sample collection and testing, or season. Findings from this study demonstrate a lower confirmation rate of community RCA positives when compared to RCA positives identified at state laboratories. Elucidating the causes of decreased test performance in order to improve overall community laboratory performance of these tests is critical for understanding the epidemiology of cryptosporidiosis in the United States (US).
Motivational Interviewing Skills in Health Care Encounters (MISHCE): Development and psychometric testing of an assessment tool.

Science.gov (United States)

Petrova, Tatjana; Kavookjian, Jan; Madson, Michael B; Dagley, John; Shannon, David; McDonough, Sharon K

2015-01-01

Motivational interviewing (MI) has demonstrated a significant impact as an intervention strategy for addiction management, change in lifestyle behaviors, and adherence to prescribed medication and other treatments. Key elements to studying MI include training in MI of professionals who will use it, assessment of skills acquisition in trainees, and the use of a validated skills assessment tool. The purpose of this research project was to develop a psychometrically valid and reliable tool that has been designed to assess MI skills competence in health care provider trainees. The goal was to develop an assessment tool that would evaluate the acquisition and use of specific MI skills and principles, as well as the quality of the patient-provider therapeutic alliance in brief health care encounters. To address this purpose, specific steps were followed, beginning with a literature review. This review contributed to the development of relevant conceptual and operational definitions, selecting a scaling technique and response format, and methods for analyzing validity and reliability. Internal consistency reliability was established on 88 video recorded interactions. The inter-rater and test-retest reliability were established using randomly selected 18 from the 88 interactions. The assessment tool Motivational Interviewing Skills for Health Care Encounters (MISHCE) and a manual for use of the tool were developed. Validity and reliability of MISHCE were examined. Face and content validity were supported with well-defined conceptual and operational definitions and feedback from an expert panel. Reliability was established through internal consistency, inter-rater reliability, and test-retest reliability. The overall internal consistency reliability (Cronbach's alpha) for all fifteen items was 0.75. MISHCE demonstrated good inter-rater reliability and good to excellent test-retest reliability. MISHCE assesses the health provider's level of knowledge and skills in brief
The QUASAR reproducibility study, Part II: Results from a multi-center Arterial Spin Labeling test-retest study

DEFF Research Database (Denmark)

Petersen, Esben Thade; Mouridsen, Kim; Golay, Xavier

2010-01-01

Quantitative STAR labeling of Arterial Regions or QUASAR), a method providing user independent quantification of CBF in a large test-retest study across sites from around the world, dubbed "The QUASAR reproducibility study". Altogether, 28 sites located in Asia, Europe and North America participated...... and a total of 284 healthy volunteers were scanned. Minimal operator dependence was assured by using an automatic planning tool and its accuracy and potential usefulness in multi-center trials was evaluated as well. Accurate repositioning between sessions was achieved with the automatic planning tool showing...
The QUASAR reproducibility study, Part II: Results from a multi-center Arterial Spin Labeling test-retest study

DEFF Research Database (Denmark)

Petersen, Esben; Mouridsen, Kim; Golay, Xavier

2009-01-01

Quantitative STAR labeling of Arterial Regions or QUASAR), a method providing user independent quantification of CBF in a large test-retest study across sites from around the world, dubbed "The QUASAR reproducibility study". Altogether, 28 sites located in Asia, Europe and North America participated...... and a total of 284 healthy volunteers were scanned. Minimal operator dependence was assured by using an automatic planning tool and its accuracy and potential usefulness in multi-center trials was evaluated as well. Accurate repositioning between sessions was achieved with the automatic planning tool showing...
The development and psychometric testing of a Disaster Response Self-Efficacy Scale among undergraduate nursing students.

Science.gov (United States)

Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling

2017-12-01

Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, pself-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Test-retest repeatability of myocardial blood flow and infarct size using 11C-acetate micro-PET imaging in mice

International Nuclear Information System (INIS)

Croteau, Etienne; Renaud, Jennifer M.; McDonald, Matthew; Klein, Ran; DaSilva, Jean N.; Beanlands, Rob S.B.; DeKemp, Robert A.

2015-01-01

Global and regional responses of absolute myocardial blood flow index (iMBF) are used as surrogate markers to assess response to therapies in coronary artery disease. In this study, we assessed the test-retest repeatability of iMBF imaging, and the accuracy of infarct sizing in mice using 11C-acetate PET. 11C-Acetate cardiac PET images were acquired in healthy controls, endothelial nitric oxide synthase (eNOS) knockout transgenic mice, and mice after myocardial infarction (MI) to estimate global and regional iMBF, and myocardial infarct size compared to 18F-FDG PET and ex-vivo histology results. Global test-retest iMBF values had good coefficients of repeatability (CR) in healthy mice, eNOS knockout mice and normally perfused regions in MI mice (CR = 1.6, 2.0 and 1.5 mL/min/g, respectively). Infarct size measured on 11C-acetate iMBF images was also repeatable (CR = 17%) and showed a good correlation with the infarct sizes found on 18F-FDG PET and histopathology (r² > 0.77; p < 0.05). 11C-Acetate micro-PET assessment of iMBF and infarct size is repeatable and suitable for serial investigation of coronary artery disease progression and therapy. (orig.)
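
For readers unfamiliar with the coefficient of repeatability (CR) reported above, here is a minimal Python sketch. The abstract does not give the formula the authors used; the version below assumes the common Bland-Altman definition (1.96 times the standard deviation of the paired test-retest differences), and the function name and flow values are hypothetical.

```python
from statistics import stdev


def coefficient_of_repeatability(test, retest):
    """Bland-Altman style CR: 1.96 x SD of the paired differences.

    Assumption: the source abstract does not state which CR definition
    was applied. The result is in the same units as the measurements.
    """
    diffs = [b - a for a, b in zip(test, retest)]
    return 1.96 * stdev(diffs)


# Hypothetical flow values in mL/min/g for the same animals scanned twice.
scan1 = [5.1, 4.8, 6.0, 5.5, 4.9]
scan2 = [5.6, 4.2, 6.3, 5.1, 5.4]
print(round(coefficient_of_repeatability(scan1, scan2), 2))
```

Under this definition, about 95% of repeat-scan differences are expected to fall within plus or minus CR of the mean difference, so a smaller CR indicates better repeatability.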
The Karen instruments for measuring quality of nursing care: construct validity and internal consistency.

Science.gov (United States)

Lindgren, Margareta; Andersson, Inger S

2011-06-01

Valid and reliable instruments for measuring the quality of care are needed for evaluation and improvement of nursing care. Previously developed and evaluated instruments, the Karen-patient and the Karen-personnel based on Donabedian's Structure-Process-Outcome triad (S-P-O triad) had promising content validity, discriminative power and internal consistency. The objective of this study was to further develop the instruments with regard to construct validity and internal consistency. This prospective study was carried out in medical and surgical wards at a hospital in Sweden. A total of 95 patients and 120 personnel were included. The instruments were tested for construct validity by performing factor analyses in two steps and for internal consistency using Cronbach's alpha coefficient. The first confirmatory factor analyses, with a pre-determined three-factor solution did not load well according to the S-P-O triad, but the second exploratory factor analysis with a six-factor solution appeared to be more coherent and the distribution of variables seemed to be logical. The reliability, i.e. internal consistency, was good in both factor analyses. The Karen-patient and the Karen-personnel instruments have achieved acceptable levels of construct validity. The internal consistency of the instruments is good. This indicates that the instruments may be suitable to use in clinical practice for measuring the quality of nursing care.
Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.

Science.gov (United States)

Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J

2012-07-01

We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.
Investigation of four self-report instruments (FABT, TSK-HC, Back-PAQ, HC-PAIRS) to measure healthcare practitioners' attitudes and beliefs toward low back pain: Reliability, convergent validity and survey of New Zealand osteopaths and manipulative physiotherapists.

Science.gov (United States)

Moran, Robert W; Rushworth, Wendy M; Mason, Jesse

2017-12-01

Healthcare practitioner beliefs influence advice and management provided to patients with back pain. Several instruments measuring practitioner beliefs have been developed but psychometric properties for some have not been investigated. To investigate internal consistency, test-retest reliability and convergent validity of the Fear Avoidance Beliefs Tool (FABT), the Tampa Scale of Kinesiophobia for Health Care Providers (TSK-HC), the Back Pain Attitudes Questionnaire (Back-PAQ), and the Health Care Pain and Impairment Relationship Scale (HC-PAIRS). A secondary aim was to explore beliefs of New Zealand osteopaths and physiotherapists regarding low back pain. FABT, TSK-HC, Back-PAQ, and HC-PAIRS were administered twice, 14 days apart. Data from 91 osteopaths and 35 physiotherapists were analysed. The FABT, TSK-HC and Back-PAQ each demonstrated excellent internal consistency, (Cronbach's α = 0.92, 0.91, and 0.91 respectively), and excellent test-retest reliability (lower limit of 95% CI for intraclass correlation coefficient >0.75). Correlations between instruments (Pearson's r = 0.51 to 0.77, p 0.47) for mean differences in scores, for all instruments, between professions. This study found excellent internal consistency, test-retest reliability and good convergent validity for the FABT, TSK-HC, and Back-PAQ. Previously reported internal consistency, test-retest and convergent validity of the HC-PAIRS were confirmed, and test-retest reliability was excellent. There were significant scoring differences on each instrument between professions, and while both groups demonstrated fear avoidant beliefs, physiotherapist respondent scores indicated that as a group, they held fewer fear-avoidant beliefs than osteopath respondents. Copyright © 2017 Elsevier Ltd. All rights reserved.
Internal Branding and Employee Brand Consistent Behaviours

DEFF Research Database (Denmark)

Mazzei, Alessandra; Ravazzani, Silvia

2017-01-01

constitutive processes. In particular, the paper places emphasis on the role and kinds of communication practices as a central part of the nonnormative and constitutive internal branding process. The paper also discusses an empirical study based on interviews with 32 Italian and American communication managers...... and 2 focus groups with Italian communication managers. Findings show that, in order to enhance employee brand consistent behaviours, the most effective communication practices are those characterised as enablement-oriented. Such a communication creates the organizational conditions adequate to sustain......Employee behaviours conveying brand values, named brand consistent behaviours, affect the overall brand evaluation. Internal branding literature highlights a knowledge gap in terms of communication practices intended to sustain such behaviours. This study contributes to the development of a non...
14 CFR 65.19 - Retesting after failure.

Science.gov (United States)

2010-01-01

... 14 Aeronautics and Space 2 2010-01-01 2010-01-01 false Retesting after failure. 65.19 Section 65.19 Aeronautics and Space FEDERAL AVIATION ADMINISTRATION, DEPARTMENT OF TRANSPORTATION (CONTINUED) AIRMEN CERTIFICATION: AIRMEN OTHER THAN FLIGHT CREWMEMBERS General § 65.19 Retesting after failure. An...
Blink frequency and duration during perimetry and their relationship to test-retest threshold variability.

Science.gov (United States)

Wang, Yanfang; Toor, Sonia S; Gautam, Ramesh; Henson, David B

2011-06-28

To describe different patterns of blinking in patients undergoing a visual field test and to establish whether the blink parameters are related to threshold variability. Thirty-nine patients with diagnosed or suspected glaucoma were recruited to undertake a perimetric task twice. Blinks were detected with a video eye-tracker system that records at a sampling rate of 60 Hz. Blink frequency, duration, and episodes of microsleep (eye closures >500 ms) were analyzed, and correlated with test-retest threshold variability. The timing of blinks with respect to stimulus presentation was analyzed and the percentage of seen stimuli for all presentations (POS(overall)) and those overlapped with blinks (POS(overlapped)) were compared. Blink frequency ranged from 0 to 58 per minute. A significant increase in blink frequency was observed in the second test (P POS(overall) and POS(overlapped) was significant (P POS(overlapped) was observed with the increase of overlap duration. A wide range of blink frequencies was observed during perimetric testing. Although no blink parameters showed significant influence on threshold variability, when the blinks overlapped with a stimulus presentation, the probability of seeing was reduced. For suprathreshold stimuli, blinks often occurred after the presentation, whereas for subthreshold presentations, there was no relationship to presentation time.
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan.

Science.gov (United States)

Moore, Amy Lawson; Miller, Terissa M

2018-01-01

The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
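
The split-half reliability mentioned in the abstract above can be illustrated with a short Python sketch. The abstract does not describe how the halves were formed or whether a correction was applied, so the odd/even item split and the Spearman-Brown step-up used here are assumptions, and the function names and scores are hypothetical.

```python
def pearson(x, y):
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def spearman_brown(r_half):
    """Step a half-test correlation up to an estimate for the full-length test."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(scores):
    """scores: respondents x items. Odd/even split (an assumed convention)."""
    odd_half = [sum(row[0::2]) for row in scores]
    even_half = [sum(row[1::2]) for row in scores]
    return spearman_brown(pearson(odd_half, even_half))


# Hypothetical item scores (1 = correct, 0 = incorrect) for 5 test takers.
answers = [
    [1, 1, 1, 1, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
    [1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 1, 0],
]
print(round(split_half_reliability(answers), 3))
```

The correction is applied because each half contains only half the items, and shorter tests tend to yield lower correlations than the full-length test would.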
Test your memory-Turkish version (TYM-TR): reliability and validity study of a cognitive screening test.

Science.gov (United States)

Maviş, Ilknur; Özbabalik Adapinar, Belgin Demet; Yenilmez, Çinar; Aydin, Ayşe; Olgun, Engin; Bal, Cengiz

2015-01-01

The test your memory (TYM) is reported to be a sensitive cognitive function assessment scale for people with dementia. The aim of the present study was to investigate the reliability and validity of an adapted Turkish version of the TYM (TYM-TR) among Turkish dementia patients. The TYM-TR was given to 59 patients with dementia aged 60+ and 336 normal controls aged 23-75+. The diagnostic utility of the TYM-TR was compared with that of the mini-mental state examination (MMSE) to validate it. The internal consistency of the TYM-TR was a = 0.85. The test-retest reliability was 0.97 (P reliability and validity to distinguish dementia in the Turkish population.

The Parsing Syllable Envelopes Test for Assessment of Amplitude Modulation Discrimination Skills in Children: Development, Normative Data, and Test-Retest Reliability Studies.

Science.gov (United States)

Cameron, Sharon; Chong-White, Nicky; Mealings, Kiri; Beechey, Tim; Dillon, Harvey; Young, Taegan

2018-02-01

Intensity peaks and valleys in the acoustic signal are salient cues to syllable structure, which is accepted to be a crucial early step in phonological processing. As such, the ability to detect low-rate (envelope) modulations in signal amplitude is essential to parse an incoming speech signal into smaller phonological units. The Parsing Syllable Envelopes (ParSE) test was developed to quantify the ability of children to recognize syllable boundaries using an amplitude modulation detection paradigm. The envelope of a 750-msec steady-state /a/ vowel is modulated into two or three pseudo-syllables using notches with modulation depths varying between 0% and 100% along an 11-step continuum. In an adaptive three-alternative forced-choice procedure, the participant identified whether one, two, or three pseudo-syllables were heard. Development of the ParSE stimuli and test protocols, and collection of normative and test-retest reliability data. Eleven adults (aged 23 yr 10 mo to 50 yr 9 mo, mean 32 yr 10 mo) and 134 typically developing, primary-school children (aged 6 yr 0 mo to 12 yr 4 mo, mean 9 yr 3 mo). There were 73 males and 72 females. Data were collected using a touchscreen computer. Psychometric functions (PFs) were automatically fit to individual data by the ParSE software. Performance was related to the modulation depth at which syllables can be detected with 88% accuracy (referred to as the upper boundary of the uncertainty region [UBUR]). A shallower PF slope reflected a greater level of uncertainty. Age effects were determined based on raw scores. z Scores were calculated to account for the effect of age on performance. Outliers, and individual data for which the confidence interval of the UBUR exceeded a maximum allowable value, were removed. Nonparametric tests were used as the data were skewed toward negative performance. Across participants, the performance criterion (UBUR) was met with a median modulation depth of 42%. The effect of age on the UBUR was
Test-retest reliability of behavioral measures of impulsive choice, impulsive action, and inattention.

Science.gov (United States)

Weafer, Jessica; Baggott, Matthew J; de Wit, Harriet

2013-12-01

Behavioral measures of impulsivity are widely used in substance abuse research, yet relatively little attention has been devoted to establishing their psychometric properties, especially their reliability over repeated administration. The current study examined the test-retest reliability of a battery of standardized behavioral impulsivity tasks, including measures of impulsive choice (i.e., delay discounting, probability discounting, and the Balloon Analogue Risk Task), impulsive action (i.e., the stop signal task, the go/no-go task, and commission errors on the continuous performance task), and inattention (i.e., attention lapses on a simple reaction time task and omission errors on the continuous performance task). Healthy adults (n = 128) performed the battery on two separate occasions. Reliability estimates for the individual tasks ranged from moderate to high, with Pearson correlations within the specific impulsivity domains as follows: impulsive choice (r range: .76-.89, ps reliable measures and thus can be confidently used to assess various facets of impulsivity as intermediate phenotypes for drug abuse.
The Internal Consistency and Validity of the Vaccination Attitudes Examination Scale: A Replication Study.

Science.gov (United States)

Wood, Louise; Smith, Michael; Miller, Christopher B; O'Carroll, Ronan E

2018-06-19

Vaccinations are important preventative health behaviors. The recently developed Vaccination Attitudes Examination (VAX) Scale aims to measure the reasons behind refusal/hesitancy regarding vaccinations. The aim of this replication study is to conduct an independent test of the newly developed VAX Scale in the UK. We tested (a) internal consistency (Cronbach's α); (b) convergent validity by assessing its relationships with beliefs about medication, medical mistrust, and perceived sensitivity to medicines; and (c) construct validity by testing how well the VAX Scale discriminated between vaccinators and nonvaccinators. A sample of 243 UK adults completed the VAX Scale, the Beliefs About Medicines Questionnaire, the Perceived Sensitivity to Medicines Scale, and the Medical Mistrust Index, in addition to demographics of age, gender, education levels, and social deprivation. Participants were asked (a) whether they received an influenza vaccination in the past year and (b) if they had a young child, whether they had vaccinated the young child against influenza in the past year. The VAX (a) demonstrated high internal consistency (α = .92); (b) was positively correlated with medical mistrust and beliefs about medicines, and less strongly correlated with perceived sensitivity to medicines; and (c) successfully differentiated parental influenza vaccinators from nonvaccinators. The VAX demonstrated good internal consistency, convergent validity, and construct validity in an independent UK sample. It appears to be a useful measure to help us understand the health beliefs that promote or deter vaccination behavior.
Escala Razões para Fumar Modificada: tradução e adaptação cultural para o português para uso no Brasil e avaliação da confiabilidade teste-reteste Modified Reasons for Smoking Scale: translation to Portuguese, cross-cultural adaptation for use in Brazil and evaluation of test-retest reliability

Directory of Open Access Journals (Sweden)

Elisa Sebba Tosta de Souza

2009-07-01

OBJECTIVE: To translate the Modified Reasons for Smoking Scale (MRSS) to Portuguese, to submit it to cross-cultural adaptation for use in Brazil and to evaluate the test-retest reliability of the translated version. METHODS: An English-language version of the MRSS was translated to Portuguese by Brazilian doctors who have thorough knowledge of the English language. A consensus version was produced by a multidisciplinary group comprising two pulmonologists, a psychiatrist and a psychologist. This version was back-translated to English by an American translator. Cultural adaptation of the final version was evaluated in a sample of 20 healthy smokers. Test-retest reliability was evaluated by administering the translated scale to 54 healthy smokers on two occasions 15 days apart. RESULTS: The translated version of the MRSS showed excellent cultural equivalence, being well understood by 95% of the smokers. Agreement between the responses given on the two occasions was almost perfect for two questions, substantial for ten questions, moderate for eight questions and fair for one question. The intraclass correlation coefficients for the motivational factors on the two occasions, computed using previously published theoretical models, were greater than 0.7 in six of the seven domains. CONCLUSIONS: The present version of the MRSS shows satisfactory cultural equivalence and test-retest reliability, and may be useful in the treatment and evaluation of smokers in Brazil.
Test-retest repeatability of myocardial blood flow and infarct size using ¹¹C-acetate micro-PET imaging in mice

Energy Technology Data Exchange (ETDEWEB)

Croteau, Etienne; Renaud, Jennifer M.; McDonald, Matthew; Klein, Ran; DaSilva, Jean N.; Beanlands, Rob S.B.; DeKemp, Robert A. [University of Ottawa Heart Institute, National Cardiac PET Centre, Ottawa, Ontario (Canada)

2015-09-15

Global and regional responses of absolute myocardial blood flow index (iMBF) are used as surrogate markers to assess response to therapies in coronary artery disease. In this study, we assessed the test-retest repeatability of iMBF imaging, and the accuracy of infarct sizing in mice using ¹¹C-acetate PET. ¹¹C-Acetate cardiac PET images were acquired in healthy controls, endothelial nitric oxide synthase (eNOS) knockout transgenic mice, and mice after myocardial infarction (MI) to estimate global and regional iMBF, and myocardial infarct size compared to ¹⁸F-FDG PET and ex-vivo histology results. Global test-retest iMBF values had good coefficients of repeatability (CR) in healthy mice, eNOS knockout mice and normally perfused regions in MI mice (CR = 1.6, 2.0 and 1.5 mL/min/g, respectively). Infarct size measured on ¹¹C-acetate iMBF images was also repeatable (CR = 17%) and showed a good correlation with the infarct sizes found on ¹⁸F-FDG PET and histopathology (r² > 0.77; p < 0.05). ¹¹C-Acetate micro-PET assessment of iMBF and infarct size is repeatable and suitable for serial investigation of coronary artery disease progression and therapy.
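The abstract above does not state which formula was used for the coefficient of repeatability. One common convention, assumed here, takes CR as 1.96 times the standard deviation of the paired test-retest differences. The following sketch uses hypothetical flow values and a hypothetical function name.

```python
from statistics import stdev


def coefficient_of_repeatability(test, retest):
    """CR = 1.96 * SD of paired differences (one common convention, assumed)."""
    diffs = [a - b for a, b in zip(test, retest)]
    return 1.96 * stdev(diffs)


# Hypothetical paired measurements in mL/min/g (not data from the study).
test_scan = [5.0, 6.0, 7.0, 8.0]
retest_scan = [5.5, 5.5, 7.5, 7.5]
cr = coefficient_of_repeatability(test_scan, retest_scan)
print(round(cr, 2))
```

Under this convention, about 95% of repeat measurements on the same subject would be expected to differ by less than CR.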
Brain GABA Detection in vivo with the J-editing 1H MRS Technique: A Comprehensive Methodological Evaluation of Sensitivity Enhancement, Macromolecule Contamination and Test-Retest Reliability

Science.gov (United States)

Shungu, Dikoma C.; Mao, Xiangling; Gonzales, Robyn; Soones, Tacara N.; Dyke, Jonathan P.; van der Veen, Jan Willem; Kegeles, Lawrence S.

2016-01-01

Abnormalities in brain γ-aminobutyric acid (GABA) have been implicated in various neuropsychiatric and neurological disorders. However, in vivo GABA detection by proton magnetic resonance spectroscopy (1H MRS) presents significant challenges arising from low brain concentration, overlap by much stronger resonances, and contamination by mobile macromolecule (MM) signals. This study addresses these impediments to reliable brain GABA detection with the J-editing difference technique on a 3T MR system in healthy human subjects by (a) assessing the sensitivity gains attainable with an 8-channel phased-array head coil, (b) determining the magnitude and anatomic variation of the contamination of GABA by MM, and (c) estimating the test-retest reliability of measuring GABA with this method. Sensitivity gains and test-retest reliability were examined in the dorsolateral prefrontal cortex (DLPFC), while MM levels were compared across three cortical regions: the DLPFC, the medial prefrontal cortex (MPFC) and the occipital cortex (OCC). A 3-fold higher GABA detection sensitivity was attained with the 8-channel head coil compared to the standard single-channel head coil in DLPFC. Despite significant anatomic variation in GABA+MM and MM across the three brain regions (p GABA+MM was relatively stable across the three voxels, ranging from 41% to 49%, a non-significant regional variation (p = 0.58). The test-retest reliability of GABA measurement, expressed either as ratios to voxel tissue water (W) or total creatine, was found to be very high for both the single-channel coil and the 8-channel phased-array coil. For the 8-channel coil, for example, Pearson’s correlation coefficient of test vs. retest for GABA/W was 0.98 (R² = 0.96, p = 0.0007), the percent coefficient of variation (CV) was 1.25%, and the intraclass correlation coefficient (ICC) was 0.98. Similar reliability was also found for the co-edited resonance of combined glutamate and glutamine (Glx) for both coils. PMID
A comparison between the original and Tablet-based Symbol Digit Modalities Test in patients with schizophrenia: Test-retest agreement, random measurement error, practice effect, and ecological validity.

Science.gov (United States)

Tang, Shih-Fen; Chen, I-Hui; Chiang, Hsin-Yu; Wu, Chien-Te; Hsueh, I-Ping; Yu, Wan-Hui; Hsieh, Ching-Lin

2017-11-27

We aimed to compare the test-retest agreement, random measurement error, practice effect, and ecological validity of the original and Tablet-based Symbol Digit Modalities Test (T-SDMT) over five serial assessments, and to examine the concurrent validity of the T-SDMT in patients with schizophrenia. Sixty patients with chronic schizophrenia completed five serial assessments (one week apart) of the SDMT and T-SDMT and one assessment of the Activities of Daily Living Rating Scale III at the first time point. Both measures showed high test-retest agreement and similar levels of random measurement error over five serial assessments. Moreover, the practice effects of the two measures did not reach a plateau phase after five serial assessments in young and middle-aged participants. Nevertheless, only the practice effect of the T-SDMT became trivial after the first assessment. Like the SDMT, the T-SDMT had good ecological validity. The T-SDMT also had good concurrent validity with the SDMT. In addition, only the T-SDMT had discriminative validity to discriminate processing speed in young and middle-aged participants. Compared to the SDMT, the T-SDMT had overall slightly better psychometric properties, so it can be an alternative measure to the SDMT for assessing processing speed in patients with schizophrenia.
Response process and test-retest reliability of the Context Assessment for Community Health tool in Vietnam.

Science.gov (United States)

Duc, Duong M; Bergström, Anna; Eriksson, Leif; Selling, Katarina; Thi Thu Ha, Bui; Wallin, Lars

2016-01-01

The recently developed Context Assessment for Community Health (COACH) tool aims to measure aspects of the local healthcare context perceived to influence knowledge translation in low- and middle-income countries. The tool measures eight dimensions (organizational resources, community engagement, monitoring services for action, sources of knowledge, commitment to work, work culture, leadership, and informal payment) through 49 items. The study aimed to explore the understanding and stability of the COACH tool among health providers in Vietnam. To investigate the response process, think-aloud interviews were undertaken with five community health workers, six nurses and midwives, and five physicians. Identified problems were classified according to Conrad and Blair's taxonomy and grouped according to an estimation of the magnitude of the problem's effect on the response data. Further, the stability of the tool was examined using a test-retest survey among 77 respondents. The reliability was analyzed for items (intraclass correlation coefficient (ICC) and percent agreement) and dimensions (ICC and Bland-Altman plots). In general, the think-aloud interviews revealed that the COACH tool was perceived as clear, well organized, and easy to answer. Most items were understood as intended. However, seven prominent problems in the items were identified and the content of three dimensions was perceived to be of a sensitive nature. In the test-retest survey, two-thirds of the items and seven of eight dimensions were found to have an ICC agreement ranging from moderate to substantial (0.5-0.7), demonstrating that the instrument has an acceptable level of stability. This study provides evidence that the Vietnamese translation of the COACH tool is generally perceived to be clear and easy to understand and has acceptable stability. There is, however, a need to rephrase and add generic examples to clarify some items and to further review items with low ICC.
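The Bland-Altman analysis mentioned above summarizes test-retest agreement as a mean difference (bias) with 95% limits of agreement. A minimal sketch follows, assuming the usual bias ± 1.96 SD definition and using hypothetical dimension scores; the abstract does not report the underlying data.

```python
from statistics import mean, stdev


def bland_altman_limits(test, retest):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(test, retest)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


# Hypothetical dimension scores at test and retest (not data from the study).
first_round = [3.0, 4.0, 5.0, 4.0]
second_round = [2.0, 4.0, 4.0, 4.0]
bias, lower, upper = bland_altman_limits(first_round, second_round)
print(round(bias, 2), round(lower, 2), round(upper, 2))
```

In a Bland-Altman plot, each pair's difference is plotted against its mean, with horizontal lines drawn at these three values.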
Online self-report questionnaire on computer work-related exposure (OSCWE): validity and internal consistency.

Science.gov (United States)

Mekhora, Keerin; Jalayondeja, Wattana; Jalayondeja, Chutima; Bhuanantanondh, Petcharatana; Dusadiisariyavong, Asadang; Upiriyasakul, Rujiret; Anuraktam, Khajornyod

2014-07-01

To develop an online, self-report questionnaire on computer work-related exposure (OSCWE) and to determine its internal consistency and its face and content validity. The online, self-report questionnaire was developed to determine the risk factors related to musculoskeletal disorders in computer users. It comprised five domains: personal, work-related, work environment, physical health and psychosocial factors. The questionnaire's content was validated by an occupational medical doctor and three physical therapy lecturers involved in ergonomic teaching. Twenty-five lay people examined the feasibility of computer administration and the user-friendliness of the language. The item correlation in each domain was analyzed by internal consistency (Cronbach's alpha). The content of the questionnaire was considered congruent with the testing purposes. Eight hundred and thirty-five computer users at the PTT Exploration and Production Public Company Limited registered for the online self-report questionnaire. The internal consistency of the five domains was: personal (alpha = 0.58), work-related (alpha = 0.348), work environment (alpha = 0.72), physical health (alpha = 0.68) and psychosocial factors (alpha = 0.93). The findings suggested that the OSCWE had acceptable internal consistency for the work environment and psychosocial factor domains. The OSCWE is available for use in population-based survey research among computer office workers.
Comparison of airtightness retesting results

Energy Technology Data Exchange (ETDEWEB)

1988-01-01

Polyethylene vapour barrier and airtight drywall are two methods used by the building industry to reduce air leakage in residential homes. Concern has been expressed that polyethylene air/vapour barriers degrade over time. This concern has led various agencies to test and retest homes for air leakage. This report is the compilation of the data collected as a result of that testing. Raw data were collected on 145 homes from various sources. Data were screened, and tests of homes were omitted from the analysis if the fan tests were done on the same house by different firms, if the construction of the house was not sufficiently complete, or if the initial air change rate per hour (ACH) was greater than 3. With these omissions from the database, 90 homes remained to be analyzed. The 90 homes were separated into two groups, those with an initial ACH less than 1.5 and those with an initial ACH between 1.5 and 3.0. The data were recorded in two tables which included the ACH, the time in months, the percentage change, and the difference in change between the first test and each subsequent test. These data indicate a relatively minor average change in airtightness. Keeping in mind the quantity of data collected and the time period examined, there is no indication that significant problems exist that would necessitate a change to the current building practice. 2 figs., 5 tabs.
Reliability, validity and sensitivity to change of neurogenic bowel dysfunction score in patients with spinal cord injury

DEFF Research Database (Denmark)

Erdem, D.; Hava, D.; Keskinoglu, P.

2017-01-01

cord injury (SCI). The reliability of NBD score was assessed by test-retest reliability and internal consistency. Cronbach's alpha coefficient was calculated to determine internal consistency. The construct validity was evaluated by exploring correlations between the NBD score and SF-36 scales, patient...... assessment of impact of NBD on quality of life (QoL) and the physician global assessment (PGA). The Global Rating of Change (GRC) scale was used to assess the change of NBD to investigate the sensitivity of the score to change. Results: Cronbach's alpha coefficient was 0.547. In test-retest reliability...
Static and Dynamic Handgrip Strength Endurance: Test-Retest Reproducibility.

Science.gov (United States)

Gerodimos, Vassilis; Karatrantou, Konstantina; Psychou, Dimitra; Vasilopoulou, Theodora; Zafeiridis, Andreas

2017-03-01

This study investigated the reliability of static and dynamic handgrip strength endurance using different protocols and indicators for the assessment of strength endurance. Forty young, healthy men and women (age, 18-22 years) performed 2 handgrip strength endurance protocols: a static protocol (sustained submaximal contraction at 50% of maximal voluntary contraction) and a dynamic one (8, 10, and 12 maximal repetitions). The participants executed each protocol twice to assess the test-retest reproducibility. Total work and total time were used as indicators of strength endurance in the static protocol; the strength recorded at each maximal repetition, the percentage change, and fatigue index were used as indicators of strength endurance in the dynamic protocol. The static protocol showed high reliability irrespective of sex and hand for total time and work. The 12-repetition dynamic protocol exhibited moderate-high reliability for repeated maximal repetitions and percentage change; the 8- and 10-repetition protocols demonstrated lower reliability irrespective of sex and hand. The fatigue index was not a reliable indicator for the assessment of dynamic handgrip endurance. Static handgrip endurance can be measured reliably using the total time and total work as indicators of strength endurance. For the evaluation of dynamic handgrip endurance, the 12-repetition protocol is recommended, using the repeated maximal repetitions and percentage change as indicators of strength endurance. Practitioners should consider the static (50% maximal voluntary contraction) and dynamic (12 repeated maximal repetitions) protocols as reliable for the assessment of handgrip strength endurance. The evaluation of static endurance in conjunction with dynamic endurance would provide more complete information about hand function.
Test-retest reliability of tibiofemoral joint space width measurements made using a low-dose standing CT scanner

Energy Technology Data Exchange (ETDEWEB)

Segal, Neil A. [University of Kansas Medical Center, Department of Rehabilitation Medicine, 3901 Rainbow Boulevard, Mailstop 1046, Kansas City, KS (United States); The University of Iowa, Iowa City, IA (United States); Bergin, John; Kern, Andrew; Findlay, Christian [The University of Iowa, Iowa City, IA (United States); Anderson, Donald D. [The University of Iowa, Department of Orthopaedics and Rehabilitation, Iowa City, IA (United States)

2017-02-15

To determine the test-retest reliability of knee joint space width (JSW) measurements made using standing CT (SCT) imaging. This prospective two-visit study included 50 knees from 30 subjects (66% female; mean ± SD age 58.2 ± 11.3 years; BMI 29.1 ± 5.6 kg/m²; 38% KL grade 0-1). Tibiofemoral geometry was obtained from bilateral, approximately 20° fixed-flexed SCT images acquired at visits 2 weeks apart. For each compartment, the total joint area was defined as the area with a JSW <10 mm. The summary measurements of interest were the percentage of the total joint area with a JSW less than 0.5-mm thresholds between 2.0 and 5.0 mm in each tibiofemoral compartment. Test-retest reliability of the summary JSW measurements was assessed by intraclass correlation coefficients (ICC 2,1) for the percentage area engaged at each threshold of JSW and root-mean-square errors (RMSE) were calculated to assess reproducibility. The ICCs were excellent for each threshold assessed, ranging from 0.95 to 0.97 for the lateral and 0.90 to 0.97 for the medial compartment. RMSE ranged from 1.1 to 7.2% for the lateral and from 3.1 to 9.1% for the medial compartment, with better reproducibility at smaller JSW thresholds. The knee joint positioning protocol used demonstrated high day-to-day reliability for SCT 3D tibiofemoral JSW summary measurements repeated 2 weeks apart. Low-dose SCT provides a great deal of information about the joint while maintaining high reliability, making it a suitable alternative to plain radiographs for evaluating JSW in people with knee OA.
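For readers unfamiliar with the ICC(2,1) notation used above, it denotes the two-way random-effects, absolute-agreement, single-measurement intraclass correlation in the Shrout and Fleiss scheme. The sketch below computes it from ANOVA mean squares for a subjects-by-sessions table; the function name and data are hypothetical and this is not the study's own code.

```python
def icc_2_1(data):
    """ICC(2,1) for rows = subjects, columns = sessions (or raters).

    ICC = (MSR - MSE) / (MSR + (k - 1) * MSE + k * (MSC - MSE) / n)
    """
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(col) / n for col in zip(*data)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)                 # between-subject mean square
    msc = ss_cols / (k - 1)                 # between-session mean square
    mse = ss_error / ((n - 1) * (k - 1))    # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical data: the retest is always exactly 1 unit higher than the test.
shifted = [[1.0, 2.0], [2.0, 3.0], [3.0, 4.0]]
print(round(icc_2_1(shifted), 3))
```

Because ICC(2,1) measures absolute agreement, the constant 1-unit shift in this toy data lowers the coefficient even though the two sessions are perfectly correlated.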
Test-retest reliability of tibiofemoral joint space width measurements made using a low-dose standing CT scanner

International Nuclear Information System (INIS)

Segal, Neil A.; Bergin, John; Kern, Andrew; Findlay, Christian; Anderson, Donald D.

2017-01-01

To determine the test-retest reliability of knee joint space width (JSW) measurements made using standing CT (SCT) imaging. This prospective two-visit study included 50 knees from 30 subjects (66% female; mean ± SD age 58.2 ± 11.3 years; BMI 29.1 ± 5.6 kg/m²; 38% KL grade 0-1). Tibiofemoral geometry was obtained from bilateral, approximately 20° fixed-flexed SCT images acquired at visits 2 weeks apart. For each compartment, the total joint area was defined as the area with a JSW <10 mm. The summary measurements of interest were the percentage of the total joint area with a JSW less than 0.5-mm thresholds between 2.0 and 5.0 mm in each tibiofemoral compartment. Test-retest reliability of the summary JSW measurements was assessed by intraclass correlation coefficients (ICC 2,1) for the percentage area engaged at each threshold of JSW and root-mean-square errors (RMSE) were calculated to assess reproducibility. The ICCs were excellent for each threshold assessed, ranging from 0.95 to 0.97 for the lateral and 0.90 to 0.97 for the medial compartment. RMSE ranged from 1.1 to 7.2% for the lateral and from 3.1 to 9.1% for the medial compartment, with better reproducibility at smaller JSW thresholds. The knee joint positioning protocol used demonstrated high day-to-day reliability for SCT 3D tibiofemoral JSW summary measurements repeated 2 weeks apart. Low-dose SCT provides a great deal of information about the joint while maintaining high reliability, making it a suitable alternative to plain radiographs for evaluating JSW in people with knee OA.
The Perceived Efficacy and Goal Setting System (PEGS), part II: evaluation of test-retest reliability and differences between child and parental reports in the Swedish version.

Science.gov (United States)

Vroland-Nordstrand, Kristina; Krumlinde-Sundholm, Lena

2012-11-01

To evaluate the test-retest reliability of children's perceptions of their own competence in performing daily tasks and of their choice of goals for intervention using the Swedish version of the Perceived Efficacy and Goal Setting System (PEGS). A second aim was to evaluate agreement between children's and parents' perceptions of the child's competence and choices of intervention goals. Forty-four children with disabilities and their parents completed the Swedish version of the PEGS. Thirty-six of the children completed a retest session allocated into one of two groups: (A) for evaluation of perceived competence and (B) for evaluation of choice of goals. Cohen's kappa, weighted kappa and absolute agreement were calculated. Test-retest reliability for children's perceived competence showed good agreement for the dichotomized scale of competent/non-competent performance; however, using the four-point scale the agreement varied. The children's own goals were relatively stable over time; 78% had an absolute agreement ranging from 50% to 100%. There was poor agreement between the children's and their parents' ratings. Goals identified by the children differed from those identified by their parents, with 48% of the children having no goals identical to those chosen by their parents. These results indicate that the Swedish version of the PEGS produces reliable outcomes comparable to the original version.
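Cohen's kappa, used above, corrects the observed agreement between two ratings for the agreement expected by chance. A minimal sketch of the unweighted statistic follows; the weighted variant is not shown, and the category labels and ratings are hypothetical rather than taken from the study.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Unweighted Cohen's kappa: (p_observed - p_chance) / (1 - p_chance)."""
    n = len(first)
    p_observed = sum(a == b for a, b in zip(first, second)) / n
    count_first, count_second = Counter(first), Counter(second)
    categories = set(first) | set(second)
    p_chance = sum(count_first[c] * count_second[c] for c in categories) / n**2
    return (p_observed - p_chance) / (1 - p_chance)


# Hypothetical dichotomized ratings at test and retest for four tasks.
test_ratings = ["competent", "competent", "not", "not"]
retest_ratings = ["competent", "not", "not", "not"]
kappa = cohen_kappa(test_ratings, retest_ratings)
print(kappa)
```

Here the raw agreement is 75%, but half of that would be expected by chance given the marginal frequencies, so kappa is lower than the raw figure.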
Immunization knowledge and practice among Malaysian parents: a questionnaire development and pilot-testing.

Science.gov (United States)

Awadh, Ammar Ihsan; Hassali, Mohamed Azmi; Al-lela, Omer Qutaiba; Bux, Siti Halimah; Elkalmi, Ramadan M; Hadi, Hazrina

2014-10-27

Parents are the main decision makers for their children's vaccinations, which makes parents' immunization knowledge and practices predictors of immunization uptake and timeliness. The aim of this pilot study was to develop a reliable and valid instrument in the Malaysian language to measure the immunization knowledge and practice (KP) of Malaysian parents. A cross-sectional prospective pilot survey was conducted among 88 Malaysian parents who attended public health facilities that provide vaccinations. Translated immunization KP questionnaires (Bahasa Melayu version) were used. Descriptive statistics were applied, face and content validity were assessed, and internal consistency, test-retest reliability, and construct validity were determined. The mean ± standard deviation (SD) of the knowledge scores was 7.36 ± 2.29 and for practice scores was 7.13 ± 2.20. Good internal consistency was found for knowledge and practice items (Cronbach's alpha = 0.757 and 0.743 respectively); the test-retest reliability value was 0.740 (p = 0.014). A panel of three specialist pharmacists who are experts in this field judged the face and content validity of the final questionnaire. Parents with up-to-date immunized children had significantly better knowledge and practice scores than parents who did not (p Malaysian parents and therefore this version can be used in future research.
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

Science.gov (United States)

Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

2015-12-01

To translate, culturally adapt, and validate the Simple Shoulder Test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICC agreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7% of variance) and a two-factor solution for the POSS (61.6% of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
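The abstract above does not state how the SEM and SDC were derived. A commonly used pair of formulas, assumed here, is SEM = SD × √(1 − ICC) and SDC = 1.96 × √2 × SEM. The sketch below applies the second formula to the reported SEM values; the function names and the baseline SD in the last example are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and an ICC (assumed formula)."""
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem):
    """SDC at the 95% level: 1.96 * sqrt(2) * SEM (assumed formula)."""
    return 1.96 * math.sqrt(2.0) * sem


# SEM values reported in the abstract above.
sdc_psst = smallest_detectable_change(5.5)
sdc_poss = smallest_detectable_change(6.8)
print(round(sdc_psst, 1), round(sdc_poss, 1))

# Hypothetical baseline SD of 10 points combined with an ICC of 0.90.
print(round(sem_from_icc(10.0, 0.90), 2))
```

Under these assumptions the reported SEMs give SDCs close to the published 15.3 and 18.8; the small gap for the PSST is consistent with the SEM having been rounded before reporting.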
The Phoneme Identification Test for Assessment of Spectral and Temporal Discrimination Skills in Children: Development, Normative Data, and Test-Retest Reliability Studies.

Science.gov (United States)

Cameron, Sharon; Chong-White, Nicky; Mealings, Kiri; Beechey, Tim; Dillon, Harvey; Young, Taegan

2018-02-01

Previous research suggests that a proportion of children experiencing reading and listening difficulties may have an underlying primary deficit in the way that the central auditory nervous system analyses the perceptually important, rapidly varying, formant frequency components of speech. The Phoneme Identification Test (PIT) was developed to investigate the ability of children to use spectro-temporal cues to perceptually categorize speech sounds based on their rapidly changing formant frequencies. The PIT uses an adaptive two-alternative forced-choice procedure whereby the participant identifies a synthesized consonant-vowel (CV) (/ba/ or /da/) syllable. CV syllables differed only in the second formant (F2) frequency along an 11-step continuum (between 0% and 100%-representing an ideal /ba/ and /da/, respectively). The CV syllables were presented in either quiet (PIT Q) or noise at a 0 dB signal-to-noise ratio (PIT N). Development of the PIT stimuli and test protocols, and collection of normative and test-retest reliability data. Twelve adults (aged 23 yr 10 mo to 50 yr 9 mo, mean 32 yr 5 mo) and 137 typically developing, primary-school children (aged 6 yr 0 mo to 12 yr 4 mo, mean 9 yr 3 mo). There were 73 males and 76 females. Data were collected using a touchscreen computer. Psychometric functions were automatically fit to individual data by the PIT software. Performance was determined by the width of the continuum for which responses were neither clearly /ba/ nor /da/ (referred to as the uncertainty region [UR]). A shallower psychometric function slope reflected greater uncertainty. Age effects were determined based on raw scores. Z scores were calculated to account for the effect of age on performance. Outliers, and individual data for which the confidence interval of the UR exceeded a maximum allowable value, were removed. Nonparametric tests were used as the data were skewed toward negative performance. Across participants, the median value of the F2 range
Implementation of repeat HIV testing during pregnancy in southwestern Kenya: progress and missed opportunities.

Science.gov (United States)

Rogers, Anna J; Akama, Eliud; Weke, Elly; Blackburn, Justin; Owino, George; Bukusi, Elizabeth A; Oyaro, Patrick; Kwena, Zachary A; Cohen, Craig R; Turan, Janet M

2017-12-01

Repeat HIV testing during the late antenatal period is crucial to identify and initiate treatment for pregnant women with incident HIV infection to prevent perinatal HIV transmission and keep mothers alive. In 2012, the Kenya Ministry of Health adopted international guidelines suggesting that pregnant women be offered retesting three months after an initial negative HIV test. Our objectives were to determine the current rate of antenatal repeat HIV testing; identify successes, missed opportunities and factors associated with retesting; and estimate the incidence of HIV during pregnancy. Retrospective analysis of longitudinal data was conducted for a cohort of 2145 women attending antenatal care clinic at a large district hospital in southwestern Kenya. Data were abstracted from registers for all women who attended the clinic from the years 2011 to 2014. Although 90.2% of women first came to clinic prior to their third trimester and 27.5% had at least four clinic visits, 58.0% of all women went to delivery without a retest. Missed opportunities for retesting included not returning to clinic at all, not returning when eligible, or late gestational age (>28 weeks) at first clinic visit making them ineligible for retesting (accounting for 14.2%, 26.8% and 9.6% of all clinic attendees respectively); and failure to be retested even when eligible at one or more visits (accounting for 73.2% of eligible returnees). Being unmarried and aged 20 or younger was associated with an increase in mean gestational age of first visit by 2.52 weeks (95% CI: 1.56, 3.48) and a 2.59 increased odds (95% CI: 1.90, 3.54) of failing to return to clinic, compared to those who were married and over 20 years of age. On retest, two women tested HIV positive, suggesting an incidence rate of 4.4 per 100 person-years. After adjusting for potential confounders, only later year of last menstrual period (2013 vs. 2012 and 2011) was associated with retesting. Adoption of retesting guidelines in 2012
Test-Retest Reliability of Measurements of Hand-Grip Strength Obtained by Dynamometry from Older Adults: A Systematic Review of Research in the PubMed Database.

Science.gov (United States)

Bohannon, R W

2017-01-01

A systematic review was performed to summarize literature describing the test-retest reliability of grip strength measures obtained from older adults. Relevant literature was identified via a PubMed search. Seventeen articles were deemed appropriate based on inclusion and exclusion criteria. The relative test-retest reliability of grip strength measures obtained by dynamometry was good to excellent (intra-class correlation coefficients > 0.80) in all but 3 studies, which involved older adults with severe dementia. Absolute reliability, as indicated by summary statistics such as the minimum detectable change (95%), was more variable. As a percentage, that change ranged from 14.5% to 98.5%. Consequently, clinicians can be confident in the relative reliability of grip strength measures obtained from at risk older adults. However, relatively large percentage changes in grip strength may be necessary to conclude with confidence that a real change has occurred over time in some populations.

Using personality item characteristics to predict single-item reliability, retest reliability, and self-other agreement

NARCIS (Netherlands)

de Vries, Reinout Everhard; Realo, Anu; Allik, Jüri

2016-01-01

The use of reliability estimates is increasingly scrutinized as scholars become more aware that test–retest stability and self–other agreement provide a better approximation of the theoretical and practical usefulness of an instrument than its internal reliability. In this study, we investigate item
Validation of the Stroke Specific Quality of Life Scale (SS-QOL): test of reliability and validity of the Danish version (SS-QOL-DK).

Science.gov (United States)

Muus, Ingrid; Williams, Linda S; Ringsberg, Karin C

2007-07-01

To test the reliability and validity of the Danish version of the Stroke Specific Quality of Life Scale version 2.0 (SS-QOL-DK), an instrument for evaluation of health-related quality of life. A correlational study. A stroke unit that provides acute care and rehabilitation for stroke patients in Frederiksborg County, Denmark. One hundred and fifty-two stroke survivors participated; 24 of these performed test-retest. Questionnaires were sent out and returned by mail. A subsequent telephone interview assessed functional level and missing items. Test-retest was measured using Spearman's r, internal consistency was estimated using Cronbach's alpha, and evaluation of floor and ceiling values in proportion of minimum and maximum scores. Construct validity was assessed by comparing patients' scores on the SS-QOL-DK with those obtained by other test methods: Beck's Depression Index, the General Health Survey Short Form 36 (SF-36), the Barthel Index and the National Institutes of Health Stroke Scale, evaluating shared variance using coefficient of determination, r². Comparing groups with known scores assessed known-group validity. Convergent and discriminant validity were assessed. Test-retest of SS-QOL-DK showed excellent stability, Spearman's r = 0.65-0.99. Internal consistency for all domains showed Cronbach's alpha = 0.81-0.94. Missing items rate was 1.0%. Most SS-QOL-DK domains showed moderately shared variance with similar domains of other test methods, r² = 0.03-0.62. Groups with known differences showed statistically significant difference in scores. Item-to-scale correlation coefficients of 0.37-0.88 supported convergent validity. SS-QOL-DK is a reliable and valid instrument for measuring self-reported health-related quality of life on group level among people with mild to moderate stroke.
¹¹C-PBR28 imaging in multiple sclerosis patients and healthy controls: test-retest reproducibility and focal visualization of active white matter areas

Energy Technology Data Exchange (ETDEWEB)

Park, Eunkyung; Gallezot, Jean-Dominique; Planeta, Beata; Lin, Shu-Fei; Lim, Keunpoong; Chen, Ming-Kai; Huang, Yiyun; Carson, Richard E. [Yale School of Medicine, PET Center, Department of Diagnostic Radiology, 801 Howard Avenue, PO Box 208048, New Haven, CT (United States); Delgadillo, Aracely; Liu, Shuang; O' Connor, Kevin C.; Lee, Jae-Yun; Chastre, Anne; Pelletier, Daniel [Yale School of Medicine, Department of Neurology, New Haven, CT (United States); Seneca, Nicholas; Leppert, David [Hoffmann-La Roche Ltd, Pharmaceuticals Division, Basel (Switzerland)

2015-04-02

Activated microglia play a key role in inflammatory demyelinating injury in multiple sclerosis (MS). Microglial activation can be measured in vivo using the positron emission tomography (PET) ligand ¹¹C-PBR28. We evaluated the test-retest variability (TRV) and lesion detectability of ¹¹C-PBR28 binding in MS subjects and healthy controls (HCs) with high-resolution PET. Four clinically and radiologically stable relapsing-remitting MS subjects (age 41 ± 7 years, two men/two women) and four HCs (age 42 ± 8 years, two men/two women), matched for translocator protein genotype [two high- and two medium-affinity binders according to DNA polymorphism (rs6971) in each group], were studied for TRV. Another MS subject (age 41 years, male) with clinical and radiological activity was studied for lesion detectability. Dynamic data were acquired over 120 min after injection of 634 ± 101 MBq ¹¹C-PBR28. For the TRV study, subjects were scanned twice, on average 1.4 weeks apart. Volume of distribution (V_T) derived from multilinear analysis (MA1) modeling (t* = 30 min, using arterial input data) was the main outcome measure. Mean test V_T values (ml cm⁻³) were 3.9 ± 1.4 in the whole brain gray matter (GM), 3.6 ± 1.2 in the whole brain white matter (WM) or normal-appearing white matter (NAWM), and 3.3 ± 0.6 in MS WM lesions; mean retest V_T values were 3.7 ± 1.0 in GM, 3.3 ± 0.9 in WM/NAWM, and 3.3 ± 0.7 in MS lesions. Test-retest results showed a mean absolute TRV ranging from 7 to 9% across GM, WM/NAWM, and MS lesions. High-affinity binders demonstrated 30% higher V_T than medium-affinity binders in GM. Focal ¹¹C-PBR28 uptake was detected in two enhancing lesions of the active MS patient. High-resolution ¹¹C-PBR28 PET can visualize focal areas where microglial activation is known to be present and has good test-retest reproducibility in the human brain. ¹¹C-PBR28 PET is likely to be valuable for monitoring both
Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

Science.gov (United States)

McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

2010-01-01

Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Reliability of short form-36 in an Internet- and a pen-and-paper version

DEFF Research Database (Denmark)

Basnov, Maja; Kongsved, Sissel Marie; Bech, Per

2009-01-01

Use of Internet versions of questionnaires may have several advantages in clinical and epidemiological research, but we know little about if Internet versions differ with respect to validity and reliability. We aimed to compare Internet- and pen-and-paper versions of short form-36 (SF-36......) with respect to test-retest reliability and internal consistency. Women referred to mammography (n = 782) were randomised to receive either a paper version with a prepaid return envelope or a guideline on how to fill in the Internet version. A subgroup was asked to answer the questionnaire once again...... in the alternative version. Test-retest reliability was assessed by the intra-class correlation coefficient. Internal consistency was calculated as Cronbach's alpha. The between-version test-retest reliability for the eight subscales were between 0.63 and 0.92. Cronbach's alpha for the two versions were all between...
Development and Validation of an Instrument for the Measurement of Health-Related Quality of Life Based on View of Traditional Chinese Medicine Perspective

Directory of Open Access Journals (Sweden)

Hen-Hong Chang

2012-10-01

Results: The test-retest reliability coefficients of the six domains ranged from 0.46 for spleen to 0.69 for liver-male and kidney. The internal consistency coefficients of the six domains varied from 0.38 for spleen to 0.72 for heart. All scales except that of liver for females could significantly classify different health conditions (evidence of abnormality assessed by TCM physicians). Ten factors were identified through factor analysis. Some items were found to be correlated with more than one domain. Most domains in the questionnaire had fair test-retest reliability and fair to good internal consistency, and could differentiate patients' health conditions. The low internal consistency of the spleen scale and the inter-related scale structures need further evaluation.
A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders.

Science.gov (United States)

Stupar, Maja; Côté, Pierre; Beaton, Dorcas E; Boyle, Eleanor; Cassidy, J David

2015-01-01

The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). We performed a test-retest reliability study. We included insurance claimants from Ontario who were at least 18 years of age, within 21 days of their motor vehicle collision and diagnosed as having acute WAD grades I to III. The WDQ, a 13-item questionnaire scored from 0 (no disability) to 130 (complete disability), was administered to all participants at baseline and by telephone 3 days later. We computed the intraclass correlation coefficient (model 2,1) and the MDC with 95% confidence intervals (CIs; MDC95). The mean (SD) age of the 66 participants was 41.6 (12.7) years and 71.2% were female. Twenty-nine percent had WAD I and 71.2% had WAD II. Time since injury ranged from 0 to 19 days. The mean (SD) baseline WDQ score was 49.3 (28.8) and 46.5 (29.8) 3 days later. The intraclass correlation coefficient for the WDQ total score was 0.89 (95% CI, 0.85-0.92) in the entire sample and 0.83 (95% CI, 0.69-0.93) for the 15 participants reporting no change in neck pain. The MDC95 of the WDQ was 21.4 (SD = 14.9) for participants reporting no change. The WDQ was reliable in individuals with acute WAD. There is 95% confidence that a change of approximately one-sixth of the total score is beyond the daily variation of a stable condition. This level of measurement error must be taken into consideration when interpreting change in WDQ scores. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.
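The abstract above reports an intraclass correlation coefficient "model 2,1". As a rough illustration of what that statistic computes, the following is a minimal sketch of ICC(2,1) (two-way random effects, absolute agreement, single measurement) built from the usual two-way ANOVA mean squares. The function name `icc_2_1` and the example scores are hypothetical and are not taken from the study.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a table with one row per subject and one column per
    occasion (or rater). Hypothetical helper, for illustration only."""
    n = len(ratings)          # number of subjects
    k = len(ratings[0])       # number of occasions per subject
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares.
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)               # between-subjects mean square
    msc = ss_cols / (k - 1)               # between-occasions mean square
    mse = ss_err / ((n - 1) * (k - 1))    # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Made-up baseline and 3-day questionnaire totals for five respondents.
example = [[40, 42], [75, 70], [20, 25], [98, 90], [55, 57]]
print(round(icc_2_1(example), 3))
```

Because ICC(2,1) treats occasions as a random effect, a systematic shift between the first and second administration lowers the coefficient, which is one reason it is often preferred over a plain correlation for test-retest designs.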
Adaptation and validation of the Alcohol Use Disorders Identification Test (AUDIT) for a river population in the Brazilian Amazon

Directory of Open Access Journals (Sweden)

Rodrigo Otávio Moretti-Pires

2011-03-01

The objective of this study was to validate the Alcohol Use Disorders Identification Test (AUDIT) for a river population in the Brazilian Amazon. The original English version of AUDIT was translated into Portuguese, using the procedure recommended by the World Health Organization. The text was then back-translated and submitted to a native English translator, who approved the translation. AUDIT was administered to 361 inhabitants a total of three times in two weeks. Data were analyzed for test/retest reliability and internal consistency. Cronbach's alpha was 0.87 at the first interview, 0.87 at the second, and 0.86 at the third. Test/retest reliability assessed via the intra-class correlation coefficient for the total AUDIT scale was 0.93. Area under the ROC curve was 0.805 for a cutoff of seven (sensitivity 76.4%; specificity 75%). Conclusions: the validated version of AUDIT proved to be internally consistent and stable in the context investigated, and other psychometric properties still need to be evaluated.
Italian validation of the Purpose In Life (PIL) test and the Seeking Of Noetic Goals (SONG) test in a population of cancer patients.

Science.gov (United States)

Brunelli, C; Bianchi, E; Murru, L; Monformoso, P; Bosisio, M; Gangeri, L; Miccinesi, G; Scrignaro, M; Ripamonti, C; Borreani, C

2012-11-01

The first instruments developed to evaluate specific logotherapeutic dimensions were the Purpose In Life (PIL) and the Seeking Of Noetic Goals (SONG) tests, designed to reflect Frankl's concepts of, respectively, meaning in life attainment and will to meaning. This study aims to perform the Italian cultural adaptation and the psychometric validation of the PIL and SONG questionnaires. We administered the PIL and SONG, culturally adapted into the Italian language, to 266 cancer patients. The psychometric validation appraised construct validity, internal consistency, test-retest reliability, known-group validity, and convergent validity of the two questionnaires with respect to one another. The factorial analysis indicates that the original single-factor solution can be maintained for both instruments (proportion of variance explained by the first factor 77% and 71% for the PIL and SONG, respectively). The results show excellent internal consistency (Cronbach's alpha of 0.91 for the PIL and 0.90 for the SONG) and test-retest reliability (intraclass correlation coefficient of 0.92 for the PIL and 0.81 for the SONG). As expected, males, believers, patients nearer to the diagnosis, and patients not undergoing psychological therapy have higher PIL and lower SONG scores, while expectations for age were not confirmed. The average level for the PIL was 107.3, while for the SONG, it was 66.1, and a negative correlation (-0.47) between PIL and SONG scores indicates good convergent validity of the two instruments. Italian versions of the PIL and SONG are adequate and reliable self-report instruments for evaluating purpose in life and the motivation to find purpose for cancer patient populations.
Initial validation of the Yin-Yang Assessment Questionnaire for persons with diabetes mellitus.

Science.gov (United States)

Wong, Yee Chi Peggy; Pang, Mei Che Samantha

2015-09-10

To initially test the content validity, comprehensibility, test-retest reliability and internal consistency reliability of the Yin-Yang Assessment Questionnaire (YY-AQ). The process of initial validity and reliability testing covered: (1) content validation from the findings of 18 multiple-case studies, validated Yin- and Yang-deficiency assessment questionnaires, relevant literature and registered Chinese medicine practitioners; (2) comprehension, with the level of comprehensibility for each item categorized on a 3-point scale (not comprehensible; moderately comprehensible; highly comprehensible). An item was regarded as comprehensible when at least three respondents rated it moderately or highly comprehensible; (3) test-retest reliability assessed over a 2-wk interval. The intraclass correlation coefficients (ICCs) and their 95% CIs were calculated using a two-way random effects model. The Wilcoxon signed-rank test for related samples was adopted to compare the medians of test-retest scores. An ICC value of 0.85 or higher, together with P > 0.05, was considered acceptable; and (4) internal consistency of the total items was measured and evaluated by Cronbach's coefficient alpha (α). A Cronbach's α of 0.7 or higher was considered to represent good internal consistency. Eighteen Yin-deficiency and 14 Yang-deficiency presentation items were finalized from content validation. Five participants with type 2 diabetes mellitus (T2DM) performed the comprehensibility and test-retest reliability tests. The comprehensibility level of each presentation item was found to be moderate or high in three out of the five participants. Test-retest reliability showed that the single measure ICC of the total Yin-deficiency presentation items was 0.99 (95% CI: 0.89-0.99) and the median scores on the first and 14th days were 17 (IQR 6.5-27) and 21 (IQR 6-29) (P = 0.144) respectively. The single measure ICC of the total Yang-deficiency presentation items was 0.88 (95% CI: 0
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 5. Technical Report #1220

Science.gov (United States)

Lai, Cheng-Fei; Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 2. Technical Report #1217

Science.gov (United States)

Anderson, Daniel; Lai, Cheng-Fei; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest. Due to…
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Reading Assessments: Grade 1. Technical Report #1216

Science.gov (United States)

Anderson, Daniel; Park, Bitnara Jasmine; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest. Due…
Reliability of the Cooking Task in adults with acquired brain injury.

Science.gov (United States)

Poncet, Frédérique; Swaine, Bonnie; Taillefer, Chantal; Lamoureux, Julie; Pradat-Diehl, Pascale; Chevignard, Mathilde

2015-01-01

Acquired brain injury (ABI) often leads to deficits in executive functioning (EF) responsible for severe and long-standing disabilities in daily life activities. The Cooking Task is an ecological and valid test of EF involving multi-tasking in a real environment. Given its complex scoring system, it is important to establish the tool's reliability. The objective of the study was to examine the reliability of the Cooking Task (internal consistency, inter-rater and test-retest reliability). A total of 160 patients with ABI (113 men, mean age 37 years, SD = 14.3) were tested using the Cooking Task. For test-retest reliability, patients were assessed by the same rater on two occasions (mean interval 11 days) while two raters independently and simultaneously observed and scored patients' performances to estimate inter-rater reliability. Internal consistency was high for the global scale (Cronbach α = .74). Inter-rater reliability (n = 66) for total errors was also high (ICC = .93), however the test-retest reliability (n = 11) was poor (ICC = .36). In general the Cooking Task appears to be a reliable tool. The low test-retest results were expected given the importance of EF in the performance of novel tasks.
Evaluation of Factorial Validity and Reliability of a Food Behavior Checklist for Low-Income Filipinos.

Science.gov (United States)

Suzuki, Asuka; Choi, So Yung; Lim, Eunjung; Tauyan, Socorro; Banna, Jinan C

To examine factorial validity, test-retest reliability, and internal consistency of a Tagalog-language food behavior checklist (FBC) for a low-income Filipino population. Participants (n = 160) completed the FBC on 2 occasions 3 weeks apart. Factor structure was examined using principal component analysis. For internal consistency, Cronbach α was calculated. For test-retest reliability, Spearman correlation or intraclass correlation coefficient (ICC) was calculated between scores at the 2 points. All but 1 item loaded on 6 factors: fruit and vegetable quantity, fruit and vegetable variety, fast food, sweetened beverage, healthy fat, and diet quality. Cronbach α was .75 for the total scale (range, .39-.76 for subscales). Spearman correlation was 0.78 (ICC, 0.79) for the total scale (range, 0.66-0.80 [ICC, 0.68-0.80] for subscales). The FBC demonstrated adequate factorial validity, test-retest reliability, and internal consistency. With additional testing, the FBC may be used to evaluate the US Department of Agriculture's nutrition education programs for Tagalog speakers. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
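Several records in this listing, including the one above, report internal consistency as Cronbach's α. As a hedged illustration, the sketch below computes α from a respondents-by-items score table using the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores). The function name `cronbach_alpha` and the example responses are hypothetical, not data from any study listed here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item
    scores. Hypothetical helper, for illustration only."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Made-up responses from four people on a two-item subscale.
responses = [[1, 2], [2, 1], [3, 3], [4, 4]]
print(round(cronbach_alpha(responses), 3))
```

Short subscales tend to yield lower α than long ones for the same inter-item correlation, which may partly explain wide subscale ranges such as the .39-.76 reported above.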
Reliability of the Handgrip Strength Test in Elderly Subjects With Parkinson Disease.

Science.gov (United States)

Villafañe, Jorge H; Valdes, Kristin; Buraschi, Riccardo; Martinelli, Marco; Bissolotti, Luciano; Negrini, Stefano

2016-03-01

The handgrip strength test is widely used by clinicians; however, little is known about its reliability when used in subjects with Parkinson disease (PD). The purpose of this study was to investigate the test-retest reliability of the handgrip strength test for subjects with PD. The PD group consisted of 15 patients, and the control group consisted of 15 healthy subjects. Each participant performed 3 pain-free maximal isometric contractions on each hand on 2 occasions, 1 week apart. Intraclass correlation coefficient (ICC), standard error of measurement (SEM), and 95% limits of agreement (LOA) were calculated. A 2-way analysis of variance (ANOVA) was conducted to determine the differences between sides and groups. Test-retest reliability of grip strength measurements was excellent for the dominant (ICC = 0.97; P = .001) and non-dominant (ICC = 0.98; P = .001) hands of participants with PD, and for the dominant (ICC = 0.99; P = .001) and non-dominant (ICC = 0.99; P = .001) hands of the healthy group. The Jamar hand dynamometer had fair to excellent test-retest reliability to test grip strength in participants with PD.
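The abstract above mentions 95% limits of agreement (LOA) without giving the computation. A minimal sketch of the conventional Bland-Altman form (mean of the paired differences ± 1.96 times their standard deviation) is shown below. The function name `limits_of_agreement` and the grip-strength values are hypothetical, and the study may have used a variant of this calculation.

```python
from statistics import mean, stdev


def limits_of_agreement(test, retest):
    """Return (lower, bias, upper) 95% limits of agreement for paired
    measurements. Hypothetical helper, for illustration only."""
    diffs = [second - first for first, second in zip(test, retest)]
    bias = mean(diffs)    # systematic difference between sessions
    sd = stdev(diffs)     # sample standard deviation of the differences
    return bias - 1.96 * sd, bias, bias + 1.96 * sd


# Made-up grip strength (kg) for five people, measured one week apart.
session_1 = [28.0, 31.5, 24.0, 35.0, 29.5]
session_2 = [27.5, 32.0, 25.0, 34.0, 30.0]
lower, bias, upper = limits_of_agreement(session_1, session_2)
print(f"bias = {bias:.2f} kg, 95% LOA = [{lower:.2f}, {upper:.2f}] kg")
```

Unlike the ICC, the LOA are expressed in the measurement's own units, so they indicate directly how far two sessions can differ for an individual.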
Barriers to repeated assessment of verbal learning and memory: a comparison of international shopping list task and rey auditory verbal learning test on build-up of proactive interference.

Science.gov (United States)

Rahimi-Golkhandan, S; Maruff, P; Darby, D; Wilson, P

2012-11-01

Proactive interference (PI) that remains unidentified can confound the assessment of verbal learning, particularly when its effects vary from one population to another. The International Shopping List Task (ISLT) is a new measure that provides multiple forms that can be equated for linguistic factors across cultural groups. The aim of this study was to examine the build-up of PI on two measures of verbal learning-a traditional test of list learning (Rey Auditory Verbal Learning Test, RAVLT) and the ISLT. The sample consisted of 61 healthy adults aged 18-40. Each test had three parallel forms, each recalled three times. Results showed that repeated administration of the ISLT did not result in significant PI effects, unlike the RAVLT. Although these PI effects, observed during short retest intervals, may not be as robust under normal clinical administrations of the tests, the results suggest that the choice of the verbal learning test should be guided by the knowledge of PI effects and the susceptibility of particular patient groups to this effect.
Validation of the International Index of Erectile Function (IIFE) for Use in Brazil

International Nuclear Information System (INIS)

Gonzáles, Ana Inês; Sties, Sabrina Weiss; Wittkopf, Priscilla Geraldine; Mara, Lourenço Sampaio de; Ulbrich, Anderson Zampier; Cardoso, Fernando Luiz; Carvalho, Tales de

2013-01-01

The International Index of Erectile Function (IIEF) has been proposed as a method for assessing sexual function, assisting the diagnosis and classification of erectile dysfunction. However, the IIEF had not been validated for the Portuguese language. The objective was to validate the International Index of Erectile Function in patients with cardiopulmonary and metabolic diseases. The sample consisted of 108 participants in a Cardiopulmonary and Metabolic Rehabilitation (CPMR) program in southern Brazil. The clarity assessment of the instrument was performed using a scale ranging from zero to 10. Construct validity was assessed by confirmatory factor analysis (KMO = 0.85; Bartlett p < 0.001), internal consistency by Cronbach's alpha, and reproducibility and interrater reliability via the test-retest method. The items were considered very clear, with averages above 9. The internal consistency was 0.89. The majority of items related correctly to their domains, with the exception of three questions from the sexual satisfaction domain and one from erectile function. All items showed excellent stability of measure and substantial to almost perfect agreement. The present study showed that the IIEF is valid and reliable for use in participants of a cardiopulmonary and metabolic rehabilitation program.
Validation of the International Index of Erectile Function (IIFE) for Use in Brazil

Energy Technology Data Exchange (ETDEWEB)

Gonzáles, Ana Inês; Sties, Sabrina Weiss; Wittkopf, Priscilla Geraldine, E-mail: sabrinasties@yahoo.com.br; Mara, Lourenço Sampaio de; Ulbrich, Anderson Zampier; Cardoso, Fernando Luiz; Carvalho, Tales de [Universidade do Estado de Santa Catarina, Florianópolis, SC (Brazil)

2013-08-15

The International Index of Erectile Function (IIEF) has been proposed as a method for assessing sexual function, assisting the diagnosis and classification of erectile dysfunction. However, the IIEF had not been validated for the Portuguese language. The objective was to validate the International Index of Erectile Function in patients with cardiopulmonary and metabolic diseases. The sample consisted of 108 participants in a Cardiopulmonary and Metabolic Rehabilitation (CPMR) program in southern Brazil. The clarity assessment of the instrument was performed using a scale ranging from zero to 10. Construct validity was assessed by confirmatory factor analysis (KMO = 0.85; Bartlett p < 0.001), internal consistency by Cronbach's alpha, and reproducibility and interrater reliability via the test-retest method. The items were considered very clear, with averages above 9. The internal consistency was 0.89. The majority of items related correctly to their domains, with the exception of three questions from the sexual satisfaction domain and one from erectile function. All items showed excellent stability of measure and substantial to almost perfect agreement. The present study showed that the IIEF is valid and reliable for use in participants of a cardiopulmonary and metabolic rehabilitation program.
Test-retest repeatability of strength capacity, aerobic power and pericranial tenderness of neck and shoulder muscles in children - relevant for tension-type headache

Directory of Open Access Journals (Sweden)

Tornøe B

2013-08-01

Birte Tornøe,1,2,5,6 Lars L Andersen,3 Jørgen H Skotte,3 Rigmor Jensen,4 Gunvor Gard,1 Liselotte Skov,2 Inger Hallström1 1Department of Health Sciences, Lund University, Scania, Sweden; 2Children's Headache Clinic, Department of Pediatrics, University of Copenhagen, Herlev Hospital, Herlev, Denmark; 3National Research Centre for the Working Environment, Copenhagen, Denmark; 4Danish Headache Center, Department of Neurology, University of Copenhagen, Glostrup Hospital, Glostrup, Denmark; 5Department of Physiotherapy and Occupational Therapy, University of Copenhagen, Glostrup Hospital, Glostrup, Denmark; 6Department of Physiotherapy, Medical Department, University of Copenhagen, Herlev Hospital, Herlev, Denmark. Background: Frequent or chronic tension-type headache in children is a prevalent and debilitating condition for the child, often leading to medication overuse. To explore the relationship between physical factors and tension-type headache in children, the quality of repeated measures was examined. The aim of the present study was to determine the test-retest repeatability of parameters determining isometric neck and shoulder strength and stability, aerobic power, and pericranial tenderness in children. Methods: Twenty-five healthy children, 9 to 18 years of age, participated in test-retest procedures within a 1-week interval. A computerized padded force transducer was used for testing. The tests included the isometric maximal voluntary contraction and force steadiness of neck flexion and extension, and the isometric maximal voluntary contraction and rate of force of the dominant shoulder. Pericranial tenderness was recorded by means of standardized manual palpation, and a submaximal cycle ergometer test predicted maximal oxygen uptake (VO2 max). The measurements were evaluated in steps, using the intraclass correlation coefficient (ICC); changes in the mean between the two test occasions; the levels of agreement, visualized in Bland

TEST-RETEST RELIABILITY OF THE CLOSED KINETIC CHAIN UPPER EXTREMITY STABILITY TEST (CKCUEST) IN ADOLESCENTS: RELIABILITY OF CKCUEST IN ADOLESCENTS.

Science.gov (United States)

de Oliveira, Valéria M A; Pitangui, Ana C R; Nascimento, Vinícius Y S; da Silva, Hítalo A; Dos Passos, Muana H P; de Araújo, Rodrigo C

2017-02-01

The Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) has been proposed as an option to assess upper limb function and stability; however, there are few studies that support the use of this test in adolescents. The purpose of the present study was to investigate the intersession reliability and agreement of three CKCUEST scores in adolescents and establish clinimetric values for this test. Test-retest reliability. Twenty-five healthy adolescents of both sexes were evaluated. The subjects performed the CKCUEST twice, with an interval of one week between the tests. An intraclass correlation coefficient (ICC 3,3) two-way mixed model with a 95% confidence interval was utilized to determine intersession reliability. A Bland-Altman graph was plotted to analyze the agreement between assessments. The presence of systematic error was evaluated by a one-sample t test. The difference between the evaluation and reevaluation was assessed using a paired-sample t test. The level of significance was set at 0.05. Standard errors of measurement and minimum detectable changes were calculated. The intersession reliability values of the average touches score, normalized score, and power score were 0.68, 0.68 and 0.87, the standard errors of measurement were 2.17, 1.35 and 6.49, and the minimal detectable changes were 6.01, 3.74 and 17.98, respectively. The presence of systematic error (p test with moderate to excellent reliability when used with adolescents. The CKCUEST is a measurement with moderate to excellent reliability for adolescents. 2b.
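The abstract above reports both standard errors of measurement (SEM) and minimal detectable changes (MDC) but not the formula linking them. A commonly used relation is MDC95 = 1.96 × √2 × SEM. The sketch below assumes that relation (the abstract does not confirm which one the authors used) and applies it to the three SEM values reported above. The function name `mdc95` is hypothetical.

```python
import math


def mdc95(sem):
    """Minimal detectable change at 95% confidence from a standard error
    of measurement, assuming MDC95 = 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem


# SEM values as reported in the abstract for the three CKCUEST scores.
reported_sem = {"average touches": 2.17, "normalized": 1.35, "power": 6.49}
for score, sem in reported_sem.items():
    print(f"{score}: MDC95 = {mdc95(sem):.2f}")
```

Under this assumption the results agree with the reported MDC values of 6.01, 3.74 and 17.98 to within about 0.01, a gap that rounding the multiplier to 2.77 would account for.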
Morpho-Functional 1H-MRI of the Lung in COPD: Short-Term Test-Retest Reliability.

Directory of Open Access Journals (Sweden)

Bertram J Jobst

Non-invasive end-points for interventional trials and tailored treatment regimes in chronic obstructive pulmonary disease (COPD) for monitoring regionally different manifestations of lung disease instead of global assessment of lung function with spirometry would be valuable. Proton nuclear magnetic resonance imaging (1H-MRI) allows for a radiation-free assessment of regional structure and function. The aim of this study was to evaluate the short-term reproducibility of a comprehensive morpho-functional lung MRI protocol in COPD. Twenty prospectively enrolled COPD patients (GOLD I-IV) underwent 1H-MRI of the lung at 1.5T on two consecutive days, including sequences for morphology, 4D contrast-enhanced perfusion, and respiratory mechanics. Image quality and COPD-related morphological and functional changes were evaluated in consensus by three chest radiologists using a dedicated MRI-based visual scoring system. Test-retest reliability was calculated per each individual lung lobe for the extent of large airway (bronchiectasis, wall thickening, mucus plugging) and small airway abnormalities (tree in bud, peripheral bronchiectasis, mucus plugging), consolidations, nodules, parenchymal defects and perfusion defects. The presence of tracheal narrowing, dystelectasis, pleural effusion, pulmonary trunk ectasia, right ventricular enlargement and, finally, motion patterns of diaphragm and chest wall were addressed. Median global scores [10 (Q1: 8.00; Q3: 16.00) vs. 11 (Q1: 6.00; Q3: 15.00)] as well as category subscores were similar between both timepoints, and kappa statistics indicated "almost perfect" global agreement (κ = 0.86, 95% CI = 0.81-0.91). Most subscores showed at least "substantial" agreement of MRI1 and MRI2 (κ = 0.64-1.00), whereas the agreement for the diagnosis of dystelectasis/effusion (κ = 0.42, 95% CI = 0.00-0.93) was "moderate" and of tracheal abnormalities (κ = 0.21, 95% CI = 0.00-0.75) "fair". Most MRI acquisitions showed at least diagnostic quality at
Inter-rater and test-retest reliability, internal consistency, and factorial structure of the instrument for forensic treatment evaluation

NARCIS (Netherlands)

Schuringa, E.; Spreen, M.; Bogaerts, S.

2014-01-01

In this study, the Instrument for Forensic Treatment Evaluation (IFTE) is introduced. The IFTE includes 14 dynamic items of the risk assessment scheme HKT-R and eight items specifically related to the treatment of forensic psychiatric patients. The items are divided over three factors: protective
Test-retest variability of high resolution positron emission tomography (PET) imaging of cortical serotonin (5HT2A) receptors in older, healthy adults

International Nuclear Information System (INIS)

Chow, Tiffany W; Mamo, David C; Uchida, Hiroyuki; Graff-Guerrero, Ariel; Houle, Sylvain; Smith, Gwenn S; Pollock, Bruce G; Mulsant, Benoit H

2009-01-01

Positron emission tomography (PET) imaging using [18F]-setoperone to quantify cortical 5-HT2A receptors has the potential to inform pharmacological treatments for geriatric depression and dementia. Prior reports indicate a significant normal aging effect on serotonin 5-HT2A receptor (5-HT2AR) binding potential. The purpose of this study was to assess the test-retest variability of [18F]-setoperone PET with a high resolution scanner (HRRT) for measuring 5-HT2AR availability in subjects greater than 60 years old. Methods: Six healthy subjects (age range = 65–78 years) completed two [18F]-setoperone PET scans on two separate occasions 5–16 weeks apart. The average difference in the binding potential (BP_ND) as measured on the two occasions in the frontal and temporal cortical regions ranged between 2 and 12%, with the lowest intraclass correlation coefficient in anterior cingulate regions. We conclude that the test-retest variability of [18F]-setoperone PET in elderly subjects is comparable to that of [18F]-setoperone and other 5-HT2AR radiotracers in younger subject samples.
Comment on the internal consistency of thermodynamic databases supporting repository safety assessments

International Nuclear Information System (INIS)

Arthur, R.C.

2001-11-01

This report addresses the concept of internal consistency and its relevance to the reliability of thermodynamic databases used in repository safety assessments. In addition to being internally consistent, a reliable database should be accurate over a range of relevant temperatures and pressures, complete in the sense that all important aqueous species, gases and solid phases are represented, and traceable to original experimental results. No single definition of internal consistency needs to be universally accepted as the most appropriate under all conditions, however. As a result, two databases that are each internally consistent may be inconsistent with respect to each other, and a database derived from two or more such databases must itself be internally inconsistent. The consequences of alternative definitions that are reasonably attributable to the concept of internal consistency can be illustrated with reference to the thermodynamic database supporting SKB's recent SR 97 safety assessment. This database is internally inconsistent because it includes equilibrium constants calculated over a range of temperatures: using conflicting reference values for some solids, gases and aqueous species that are common to two internally consistent databases (the OECD/NEA database for radioelements and SUPCRT databases for non-radioactive elements) that serve as source databases for the SR 97 TDB, using different definitions in these source databases of standard states for condensed phases and aqueous species, based on different mathematical expressions used in these source databases representing the temperature dependence of the heat capacity, and based on different chemical models adopted in these source databases for the aqueous phase. The importance of such inconsistencies must be considered in relation to the other database reliability criteria noted above, however. Thus, accepting a certain level of internal inconsistency in a database it is probably preferable to use a
Comment on the internal consistency of thermodynamic databases supporting repository safety assessments

Energy Technology Data Exchange (ETDEWEB)

Arthur, R.C. [Monitor Scientific, LLC, Denver, CO (United States)]

2001-11-01

This report addresses the concept of internal consistency and its relevance to the reliability of thermodynamic databases used in repository safety assessments. In addition to being internally consistent, a reliable database should be accurate over a range of relevant temperatures and pressures, complete in the sense that all important aqueous species, gases and solid phases are represented, and traceable to original experimental results. No single definition of internal consistency needs to be universally accepted as the most appropriate under all conditions, however. As a result, two databases that are each internally consistent may be inconsistent with respect to each other, and a database derived from two or more such databases must itself be internally inconsistent. The consequences of alternative definitions that are reasonably attributable to the concept of internal consistency can be illustrated with reference to the thermodynamic database supporting SKB's recent SR 97 safety assessment. This database is internally inconsistent because it includes equilibrium constants calculated over a range of temperatures: using conflicting reference values for some solids, gases and aqueous species that are common to two internally consistent databases (the OECD/NEA database for radioelements and SUPCRT databases for non-radioactive elements) that serve as source databases for the SR 97 TDB, using different definitions in these source databases of standard states for condensed phases and aqueous species, based on different mathematical expressions used in these source databases representing the temperature dependence of the heat capacity, and based on different chemical models adopted in these source databases for the aqueous phase. The importance of such inconsistencies must be considered in relation to the other database reliability criteria noted above, however. Thus, accepting a certain level of internal inconsistency in a database it is probably preferable to
Results of assembly test of HTTR reactor internals

International Nuclear Information System (INIS)

Maruyama, S.; Saikusa, A.; Shiozawa, S.; Tsuji, N.; Miki, T.

1996-01-01

The assembly test of the actual HTTR reactor internals was carried out at the works, prior to their installation in the actual reactor pressure vessel (RPV) at the construction site. The assembly test consisted of several items, such as examining the fabricating precision of each component and the alignment of piled-up structures, measuring the circumferential coolant velocity profile in the passage between the simulated RPV and the reactor internals as well as under the support plates, measuring the by-pass flow rate through gaps between the reactor internals, and measuring the binding force of the core restraint mechanism. Results of the test showed good performance of the HTTR reactor internals. Installation of the reactor internals in the actual RPV was started at the construction site of HTTR in April, 1995. In the installation process, the main items of the assembly test at the works were repeated to investigate the reproducibility of installation. (author). 5 refs, 11 figs
Pad-weighing test performed with standardized bladder volume

DEFF Research Database (Denmark)

Lose, G; Rosenkilde, P; Gammelgaard, J

1988-01-01

The result of the one-hour pad-weighing test proposed by the International Continence Society has been demonstrated to depend on the urine load during the test. To increase reproducibility of the pad-weighing test by minimizing the influence of variation in urine load, the test was done with a standardized bladder volume (50% of the cystometric bladder capacity). Twenty-five female patients with stress or mixed incontinence underwent two separate tests. Test-retest results were highly correlated (r = 0.97, p < 0.001). Nonetheless, analysis of test-retest differences revealed a variation up to +/- 24 g between two tests. It is concluded that this setup (i.e., standardized bladder volume) of the one-hour pad-weighing test allows for a more reliable assessment of urinary incontinence for quantitative purposes.
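The abstract reports a very high test-retest correlation together with differences of up to +/- 24 g, which shows that correlation and agreement are different things. One common way to summarise agreement is the Bland-Altman limits of agreement (mean difference plus or minus 1.96 standard deviations of the differences). The sketch below assumes that approach. The abstract does not state how the authors analysed the differences, and the pad weights are hypothetical.

```python
from statistics import mean, stdev


def limits_of_agreement(test, retest):
    """Return (bias, lower limit, upper limit) of the retest-minus-test differences."""
    if len(test) != len(retest) or len(test) < 2:
        raise ValueError("need two sequences of equal length with >= 2 values")
    diffs = [b - a for a, b in zip(test, retest)]
    bias = mean(diffs)                 # systematic shift between occasions
    spread = 1.96 * stdev(diffs)       # approximate 95% range of differences
    return bias, bias - spread, bias + spread


# Hypothetical pad-weight gains in grams on two occasions.
first = [10, 20, 30, 40]
second = [12, 18, 33, 39]
print(limits_of_agreement(first, second))
```

The width of the interval is expressed in the original unit (grams), so it can be compared directly with a clinically relevant change.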
Rationale and design of REACT: a randomised controlled trial assessing the effectiveness of home-collection to increase chlamydia retesting and detect repeat positive tests

OpenAIRE

Smith, Kirsty S; Hocking, Jane S; Chen, Marcus; Fairley, Christopher K; McNulty, Anna; Read, Phillip; Bradshaw, Catriona S; Tabrizi, Sepehr N; Wand, Handan; Saville, Marion; Rawlinson, William; Garland, Suzanne M; Donovan, Basil; Kaldor, John M; Guy, Rebecca

2014-01-01

Background: Repeat infection with Chlamydia trachomatis is common and increases the risk of sequelae in women and HIV seroconversion in men who have sex with men (MSM). Despite guidelines recommending chlamydia retesting three months after treatment, retesting rates are low. We are conducting the first randomised controlled trial to assess the effectiveness of home collection combined with short message service (SMS) reminders on chlamydia retesting and reinfection rates in three risk groups. ...
Adaptation and Preliminary Testing of the Developmental Coordination Disorder Questionnaire (DCDQ) for Children in India.

Science.gov (United States)

Patel, Priya; Gabbard, Carl

2017-05-01

While Developmental Coordination Disorder (DCD) has gained worldwide attention, in India it is relatively unknown. The revised DCD Questionnaire (DCDQ'07) is one of the most utilized screening tools for DCD. The aim of this study was to translate the DCDQ'07 into the Hindi language (DCDQ-Hindi) and test its basic psychometric properties. The DCDQ'07 was translated following guidelines for cross-cultural adaptation of instruments. Parents of 1100 children (5-15 years) completed the DCDQ-Hindi, of which 955 were considered for data analysis and 60 were retested randomly after 3 weeks for test-retest reliability. The DCDQ-Hindi showed high internal consistency (α = .86) and moderate test-retest reliability (.73). Confirmatory factor analysis showed equivalence to the DCDQ'07. The percentage of probable DCD using DCDQ'07 cutoff scores (≤57) ranged from 22% to 68%. Using more stringent cutoffs (≤36) it ranged from 5% to 9%. A significant difference was seen for gender (p < .05) in subset 1 (gross-motor skills) total scores. The DCDQ-Hindi shows promise for initial identification of Hindi-speaking Indian children with DCD. Based on more stringent cut-off scores, the "probable prevalence" of children with risk of DCD in India appears to be around 6-7%. Research with a larger sample and comparison with the MABC-2 or equivalent is needed.
The reliability of eyetracking to assess attentional bias to threatening words in healthy individuals.

Science.gov (United States)

Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H

2017-08-15

Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error ( .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
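The ICC(2, 1) named in this abstract is the two-way random-effects, absolute-agreement, single-measurement form of the intraclass correlation. The following is a minimal sketch of that formula from the two-way ANOVA mean squares. It is not the authors' analysis code, and the function name and the dwell-time values are hypothetical.

```python
def icc_2_1(scores):
    """ICC(2,1) for a table with one row per participant, one column per session."""
    n = len(scores)          # participants
    k = len(scores[0])       # sessions (or raters)
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need a rectangular table with >= 2 rows and columns")
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    # Sums of squares of the two-way layout without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical dwell times (ms) for five participants in two sessions.
dwell = [[410, 395], [520, 560], [300, 340], [450, 430], [610, 590]]
print(round(icc_2_1(dwell), 2))
```

Because the numerator is a difference of mean squares, the estimate can be negative when within-participant noise exceeds between-participant differences, which is how values such as -.31 can arise. Under the criterion quoted in the abstract, an estimate would need to exceed .70 to count as adequate.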
Delimiting coefficient alpha from internal consistency and unidimensionality

NARCIS (Netherlands)

Sijtsma, K.

2015-01-01

I discuss the contribution by Davenport, Davison, Liou, & Love (2015) in which they relate reliability represented by coefficient α to formal definitions of internal consistency and unidimensionality, both proposed by Cronbach (1951). I argue that coefficient α is a lower bound to reliability and
The importance of retesting the hearing screening as an indicator of the real early hearing disorder

Directory of Open Access Journals (Sweden)

Daniela Polo Camargo da Silva

2015-08-01

Full Text Available INTRODUCTION: Early diagnosis of hearing loss minimizes its impact on child development. We studied factors that influence the effectiveness of screening programs. OBJECTIVE: To investigate the relationship between gender, weight at birth, gestational age, risk factors for hearing loss, venue for newborn hearing screening and "pass" and "fail" results in the retest. METHODS: A prospective cohort study was carried out in a tertiary referral hospital. The screening was performed in 565 newborns through transient evoked otoacoustic emissions in three admission units before hospital discharge, with retest in the outpatient clinic. Gender, weight at birth, gestational age, presence of risk indicators for hearing loss and venue for newborn hearing screening were considered. RESULTS: Full-term infants comprised 86% of the cases, preterm 14%, and risk factors for hearing loss were identified in 11%. Considering the 165 newborns retested, only the venue for screening, Intermediate Care Unit, was related to a "fail" result in the retest. CONCLUSIONS: Gender, weight at birth, gestational age and presence of risk factors for hearing loss were not related to "pass" and/or "fail" results in the retest. Screening performed in intermediate care units increases the chance of a continued "fail" result in the transient evoked otoacoustic emissions test.
Extensive validation of the pain disability index in 3 groups of patients with musculoskeletal pain.

Science.gov (United States)

Soer, Remko; Köke, Albère J A; Vroomen, Patrick C A J; Stegeman, Patrick; Smeets, Rob J E M; Coppes, Maarten H; Reneman, Michiel F

2013-04-20

A cross-sectional study design was used. To validate the pain disability index (PDI) extensively in 3 groups of patients with musculoskeletal pain. The PDI is a widely used and studied instrument for disability related to various pain syndromes, although there is conflicting evidence concerning factor structure, test-retest reliability, and missing items. Additionally, an official translation of the Dutch language version has never been performed. For reliability, internal consistency, factor structure, test-retest reliability and measurement error were calculated. Validity was tested with hypothesized correlations with pain intensity, kinesiophobia, Rand-36 subscales, Depression, Roland-Morris Disability Questionnaire, Quality of Life, and Work Status. Structural validity was tested with independent backward translation and approval from the original authors. One hundred seventy-eight patients with acute back pain, 425 patients with chronic low back pain and 365 with widespread pain were included. Internal consistency of the PDI was good. One factor was identified with factor analyses. Test-retest reliability was good for the PDI (intraclass correlation coefficient, 0.76). Standard error of measurement was 6.5 points and smallest detectable change was 17.9 points. Weak correlations were observed between the PDI and kinesiophobia and depression, fair correlations with pain intensity, work status, and vitality, and moderate correlations with the Rand-36 subscales and the Roland-Morris Disability Questionnaire. The PDI-Dutch language version is internally consistent with a 1-factor structure and has good test-retest reliability. The number of missing items seems high for the sexual and professional items. Using the PDI as a 2-factor questionnaire has no additional value and is unreliable.
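The relation between the standard error of measurement (SEM) and the smallest detectable change (SDC) can be illustrated with the usual convention SDC = 1.96 × √2 × SEM. This is a sketch under that assumption, since the abstract does not state which formula the authors applied. The function names are hypothetical.

```python
import math


def sem_from_icc(sd, icc):
    """SEM estimated from a sample SD and a reliability coefficient."""
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level; sqrt(2) reflects two measurements."""
    return z * math.sqrt(2.0) * sem


# SEM of 6.5 points as reported in the abstract.
print(round(smallest_detectable_change(6.5), 1))
```

With the rounded SEM of 6.5 points this gives roughly 18.0 points, close to the reported 17.9. The small gap would be consistent with the authors having used an unrounded SEM, although that is an inference and not something the abstract states.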
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan

Directory of Open Access Journals (Sweden)

Moore AL

2018-02-01

Full Text Available Amy Lawson Moore, Terissa M Miller; Gibson Institute of Cognitive Research, Colorado Springs, CO, USA. Purpose: The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. Methods: This study included 2,737 participants aged 5–85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test–retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results: Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test–retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. Conclusion: The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan. Keywords: testing, cognitive skills, memory, processing speed, visual processing, auditory processing
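Split-half reliability, one of the coefficients listed above, is often obtained by correlating scores on two halves of a test and then applying the Spearman-Brown correction to estimate the reliability of the full-length test. The sketch below assumes an odd/even item split. The abstract does not describe how the halves were formed, and the item scores are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long numeric sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def split_half_reliability(item_scores):
    """Odd/even split-half reliability with Spearman-Brown correction."""
    odd_half = [sum(row[0::2]) for row in item_scores]    # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in item_scores]   # items 2, 4, 6, ...
    r = pearson(odd_half, even_half)
    return 2.0 * r / (1.0 + r)


# Hypothetical item scores: one row per participant, one column per item.
answers = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(split_half_reliability(answers))
```

The correction is needed because each half is only half as long as the full test, and shorter tests are less reliable.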
Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

Science.gov (United States)

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to join the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version achieved a Cronbach's alpha of 0.75 for internal consistency and excellent test-retest reliability, with a mean intraclass correlation coefficient of 0.90. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1, with a mean scale-level index of 0.96. Conclusion. The Arabic version proved to be a reliable and valid measure of patients' knowledge that is ready to be used in clinical practice. PMID:27995149
Psychometric Characteristics of the Korean Version of the Satisfaction with Life Scale Adapted for Children

Science.gov (United States)

Lim, Young-Jin

2015-01-01

The aim of this study was to examine the internal consistency reliability, test-retest reliability, factorial structure validity, and convergent validity of a Korean version of the Satisfaction With Life Scale adapted for children (K-SWLS-C). Participants consisted of 653 elementary school students (48% were male). The internal consistency of the…
The development and validation of diabetes knowledge questionnaire for the Indigenous population in Malaysia.

Science.gov (United States)

Ahmad, B; Ramadas, A; Quek, K F

2010-12-01

The study's aim was to construct and validate a diabetes mellitus knowledge questionnaire in Bahasa Malaysia for Orang Asli (OA-DKQ). The questionnaire was administered to case (Orang Asli) and control (administrative staff) groups at baseline and retested two weeks later. Cronbach's alpha was used to determine internal consistency and the intraclass correlation coefficient (ICC) was used to determine test-retest reliability. The OA-DKQ has an internal consistency of 0.806. These findings suggest the OA-DKQ is an acceptable instrument to assess knowledge and preventive behaviour in Orang Asli.
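Cronbach's alpha, the internal-consistency statistic reported here, compares the sum of the item variances with the variance of the total score. A minimal sketch of the standard formula follows. The function name and the response matrix are hypothetical and are not taken from the OA-DKQ data.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha; rows are respondents, columns are items."""
    k = len(item_scores[0])
    if k < 2 or len(item_scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in item_scores]) for j in range(k)]
    # Sample variance of the total score.
    total_var = variance([sum(row) for row in item_scores])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical responses: four respondents, two items.
responses = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(cronbach_alpha(responses))
```

Alpha rises when items covary strongly, so that the total-score variance is large relative to the summed item variances.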
Test-retest reliability of diffusion tensor imaging of the liver at 3.0 T.

Science.gov (United States)

Girometti, Rossano; Maieron, Marta; Lissandrello, Giovanni; Bazzocchi, Massimo; Zuiani, Chiara

2015-06-01

This study was done to evaluate test-retest reliability of liver diffusion tensor imaging (LDTI). Ten healthy volunteers (median age 23 years) underwent two LDTI scans on a 3.0 T magnet during two imaging sessions separated by 2 weeks (session 1 and session 2, respectively). Fifteen gradient directions and b values of 0-1,000 s/mm² were used. Two radiologists in consensus assessed liver apparent diffusion coefficient (ADC) and fractional anisotropy (FA) values on ADC and FA maps at four reference levels, namely: right upper level (RUL), right lower level (RLL), left upper level (LUL) and left lower level (LLL). We then assessed (a) whether ADC and FA values overlapped when measured on different levels within the same imaging session or between different imaging sessions; (b) the degree of variability on an intra-session and inter-session basis, respectively, using the coefficient of variation (CV). In sessions 1 and 2, the ADC/FA values were significantly larger in the left liver lobe (LUL/LLL) compared to the right liver lobe (RUL/RLL) (p < 0.05/6). Intra-session CVs were 9.51% (session 1) and 9.73% (session 2) for ADC, and 12.93% (session 1) and 11.82% (session 2) for FA, respectively. When comparing RUL, RLL, LUL and LLL on an inter-session basis, CVs were 6.52, 8.20, 6.52 and 11.06% for ADC, and 15.42, 15.80, 15.42 and 6.80% for FA, respectively. LDTI provides consistent and repeatable measurements. However, since larger left lobe ADC/FA values can be attributed to artefacts, right lobe values should be considered the most reliable measurements of water diffusivity within the liver.
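The coefficient of variation (CV) used above expresses the standard deviation as a percentage of the mean. The abstract does not specify how the per-level CVs were aggregated, so the sketch below makes one plausible assumption: a CV is computed per subject from the two sessions and then averaged. The ADC values are hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """CV in percent: sample standard deviation relative to the mean."""
    return 100.0 * stdev(values) / mean(values)


def inter_session_cv(session_1, session_2):
    """Mean of per-subject CVs across two sessions (an assumed aggregation)."""
    if len(session_1) != len(session_2) or not session_1:
        raise ValueError("need two non-empty sequences of equal length")
    per_subject = [coefficient_of_variation([a, b])
                   for a, b in zip(session_1, session_2)]
    return mean(per_subject)


# Hypothetical ADC values at one liver level for three volunteers.
adc_s1 = [1.10, 1.25, 1.05]
adc_s2 = [1.18, 1.20, 1.12]
print(round(inter_session_cv(adc_s1, adc_s2), 2))
```

Because the CV is unit-free, it allows ADC and FA repeatability to be compared on the same percentage scale, as the abstract does.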
Feasibility and test-retest reliability of measuring lower‑limb strength in young children with cerebral palsy.

Science.gov (United States)

Van Vulpen, L F; De Groot, S; Becher, J G; De Wolf, G S; Dallmeijer, A J

2013-12-01

Quantifying leg muscle strength in young children with cerebral palsy (CP) is essential for identifying muscle groups for treatment and for monitoring progress. To study the feasibility, intratester reliability and the optimal test design (number of test occasions and repetitions) of measuring lower-limb strength with handheld dynamometry (HHD) and dynamic ankle plantar flexor strength with the standing heel-rise (SH) test in children with CP aged 3-10 years. Test-retest design. Rehabilitation centre, special needs school for children with disabilities, and university medical centre. Knee extensor, hip abductor and calf muscle strength was assessed in 20 ambulatory children with spastic CP (3-5 years [n = 10] and 6-10 years [n = 10]) on two test occasions. Intraclass correlation coefficients (ICC) and Smallest Detectable Differences (SDD) were calculated to determine the optimal test design for detecting changes in strength. All isometric strength tests had acceptable SDDs (9-30%) when taking the mean values of 2-3 test occasions (separate days) and 2-3 repetitions. The one-leg SH test had large SDDs (40-128% for the younger group, 23-48% for the older group). Isometric strength (improvements) can only be measured reliably with HHD in young children with CP when the average values over at least 2 test occasions are taken. Reliability of the SH test is not sufficient for measuring individual changes in dynamic muscle strength in the younger children. Results of this study can be used to determine the optimal number of test occasions and repetitions for reliable HHD measurements depending on expected changes, muscle group and age in children with CP aged 3-10 years.

Portuguese validation of the children's eating attitudes test

Directory of Open Access Journals (Sweden)

Maria Del Carmen Bento Teixeira

2012-01-01

Full Text Available BACKGROUND: The Eating Attitudes Test (EAT) is the most widely used instrument for evaluating eating disorders in adults and adolescents in a variety of cultures and samples. OBJECTIVE: The aim of this study was to analyse the psychometric properties of the Portuguese version of the Children's Eating Attitudes Test (ChEAT). METHOD: Nine hundred and fifty-six Portuguese secondary students (565 girls and 391 boys) answered the ChEAT. The test-retest reliability was obtained with data from 206 participants from the total sample who re-answered the questionnaire after 4-6 weeks. Psychometric analyses were carried out for the total sample and separately for girls and boys. RESULTS: Internal consistency and test-retest reliability were satisfactory. Principal components factorial analysis yielded four factors in the total sample, accounting for 42.35% of the total variance. Factor structure was similar in the total sample and in both genders. Factors were labelled: F1 "Fear of Getting Fat", F2 "Restrictive and Purgative Behaviours", F3 "Food Preoccupation" and F4 "Social Pressure to Eat". The concurrent validity, explored using the Contour Drawing Figure Rating Scale (CDRS), was high. DISCUSSION: The Portuguese version of the ChEAT is a valid and useful instrument for the evaluation of abnormal eating attitudes and behaviours among Portuguese adolescents.
Validation of the German version of the Ford Insomnia Response to Stress Test.

Science.gov (United States)

Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

2018-06-01

The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.
MicroPET imaging of 5-HT1A receptors in rat brain: a test-retest [18F]MPPF study

Energy Technology Data Exchange (ETDEWEB)

Aznavour, Nicolas [McGill University, Department of Psychiatry, Montreal, QC (Canada)]|[Laboratory of Neuroenergetics and Cellular Dynamics, EPFL, SV, BMI, Lausanne (Switzerland); Benkelfat, Chawki; Gravel, Paul [McGill University, Department of Psychiatry, Montreal, QC (Canada)]|[McGill University, Department of Neurology and Neurosurgery, Montreal, QC (Canada); Aliaga, Antonio [McGill University, Department of Small Animal Imaging Laboratory, Montreal, QC (Canada); Rosa-Neto, Pedro [Douglas Hospital, Molecular NeuroImaging Laboratory, Montreal, QC (Canada); Bedell, Barry [McGill University, Department of Neurology and Neurosurgery, Montreal, QC (Canada)]|[McGill University, Department of Small Animal Imaging Laboratory, Montreal, QC (Canada); Zimmer, Luc [CERMEP, ANIMAGE Department, Lyon (France)]|[Universite Lyon 1 and CNRS, Lyon (France); Descarries, Laurent [Universite de Montreal, Department of Pathology and Cell Biology, Montreal, QC (Canada)]|[Universite de Montreal, Department of Physiology, Montreal, QC (Canada)]|[Universite de Montreal, GRSNC, Montreal, QC (Canada)

2009-01-15

Earlier studies have shown that positron emission tomography (PET) imaging with the radioligand [18F]MPPF allows for measuring the binding potential of serotonin 5-hydroxytryptamine 1A (5-HT1A) receptors in different regions of animal and human brain, including that of 5-HT1A autoreceptors in the raphe nuclei. In the present study, we sought to determine if such data could be obtained in rat, with a microPET (R4, Concorde Microsystems). Scans from isoflurane-anaesthetised rats (n = 18, including six test-retest) were co-registered with magnetic resonance imaging data, and binding potential, blood to plasma ratio and radiotracer efflux were estimated according to a simplified reference tissue model. Values of binding potential for hippocampus (1.2), entorhinal cortex (1.1), septum (1.1), medial prefrontal cortex (1.0), amygdala (0.8), raphe nuclei (0.6), paraventricular hypothalamic nucleus (0.5) and raphe obscurus (0.5) were comparable to those previously measured with PET in cats, non-human primates or humans. Test-retest variability was in the order of 10% in the larger brain regions (hippocampus, medial prefrontal and entorhinal cortex) and less than 20% in small nuclei such as the septum and the paraventricular hypothalamic, basolateral amygdaloid and raphe nuclei. MicroPET brain imaging of 5-HT1A receptors with [18F]MPPF thus represents a promising avenue for investigating 5-HT1A receptor function in rat. (orig.)
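In PET test-retest studies, variability is commonly expressed as the absolute difference between the two scans divided by their mean, in percent. The abstract does not give its exact definition, so the sketch below assumes that convention. The function names are hypothetical, and the retest values are invented for illustration.

```python
def percent_variability(test_value, retest_value):
    """Absolute test-retest difference as a percentage of the two-scan mean."""
    average = (test_value + retest_value) / 2.0
    return 100.0 * abs(test_value - retest_value) / average


def mean_variability(test_values, retest_values):
    """Average percent variability over animals for one brain region."""
    if len(test_values) != len(retest_values) or not test_values:
        raise ValueError("need two non-empty sequences of equal length")
    pairs = zip(test_values, retest_values)
    return sum(percent_variability(t, r) for t, r in pairs) / len(test_values)


# Hypothetical hippocampal binding potentials for three rats scanned twice.
scan_1 = [1.20, 1.15, 1.28]
scan_2 = [1.10, 1.22, 1.19]
print(round(mean_variability(scan_1, scan_2), 1))
```

Dividing by the mean of the two scans makes the measure symmetric, so it does not matter which scan is labelled test and which retest.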
Scale for positive aspects of caregiving experience: development, reliability, and factor structure.

Science.gov (United States)

Kate, N; Grover, S; Kulhara, P; Nehra, R

2012-06-01

OBJECTIVE. To develop an instrument (Scale for Positive Aspects of Caregiving Experience [SPACE]) that evaluates positive caregiving experience and assess its psychometric properties. METHODS. Available scales which assess some aspects of positive caregiving experience were reviewed and a 50-item questionnaire with a 5-point rating was constructed. In all, 203 primary caregivers of patients with severe mental disorders were asked to complete the questionnaire. Internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity were evaluated. Principal component factor analysis was run to assess the factorial validity of the scale. RESULTS. The scale developed as part of the study was found to have good internal consistency, test-retest reliability, cross-language reliability, split-half reliability, and face validity. Principal component factor analysis yielded a 4-factor structure, which also had good test-retest reliability and cross-language reliability. There was a strong correlation between the 4 factors obtained. CONCLUSION. The SPACE developed as part of this study has good psychometric properties.
WOrk-Related Questionnaire for UPper extremity disorders (WORQ-UP): Factor Analysis and Internal Consistency.

Science.gov (United States)

Aerts, Bas R; Kuijer, P Paul; Beumer, Annechien; Eygendaal, Denise; Frings-Dresen, Monique H

2018-04-17

To test a 17-item questionnaire, the WOrk-Related Questionnaire for UPper extremity disorders (WORQ-UP), for dimensionality of the items (factor analysis) and internal consistency. Cross-sectional study. Outpatient clinic. A consecutive sample of patients (N=150) consisting of all new referral patients (either from a general physician or other hospital) who visited the orthopedic outpatient clinic because of an upper extremity musculoskeletal disorder. Not applicable. Number and dimensionality of the factors in the WORQ-UP. Four factors with eigenvalues (EVs) >1.0 were found. The factors were named exertion, dexterity, tools & equipment, and mobility. The EVs of the factors were, respectively, 5.78, 2.38, 1.81, and 1.24. The factors together explained 65.9% of the variance. The Cronbach alpha values for these factors were, respectively, .88, .74, .87, and .66. The 17 items of the WORQ-UP resemble 4 factors-exertion, dexterity, tools & equipment, and mobility-with a good internal consistency. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
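The reported share of explained variance can be checked from the eigenvalues given in the abstract. The following is a minimal sketch, assuming the factor analysis was run on standardized items so that total variance equals the number of items (17); the abstract does not state this, and the function name is illustrative.

```python
# Eigenvalues reported in the abstract for the four retained factors.
eigenvalues = [5.78, 2.38, 1.81, 1.24]
n_items = 17  # number of WORQ-UP items


def variance_explained(eigs, n_variables):
    """Percentage of total variance carried by the given eigenvalues.

    Assumes standardized variables, so total variance == n_variables.
    """
    return 100.0 * sum(eigs) / n_variables


print(round(variance_explained(eigenvalues, n_items), 1))  # 65.9
```

Under that assumption the four eigenvalues sum to 11.21, and 11.21 / 17 reproduces the 65.9% stated in the abstract.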
Test-retest repeatability of strength capacity, aerobic power and pericranial tenderness of neck and shoulder muscles in children - relevant for tension-type headache

DEFF Research Database (Denmark)

Tornøe, Birte; Andersen, Lars L; Skotte, J H

2013-01-01

Frequent or chronic tension-type headache in children is a prevalent and debilitating condition for the child, often leading to medication overuse. To explore the relationship between physical factors and tension-type headache in children, the quality of repeated measures was examined. The aim of...... of the present study was to determine the test-retest repeatability of parameters determining isometric neck and shoulder strength and stability, aerobic power, and pericranial tenderness in children....
Reliability of the Client-Centeredness of Goal Setting (C-COGS) Scale in Acquired Brain Injury Rehabilitation.

Science.gov (United States)

Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim

2016-01-01

To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
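Exact and close percentage agreement for a single item can be sketched as follows. This is an illustration only: the ratings are hypothetical, and "close" is assumed here to mean within one scale point, which the abstract does not define.

```python
def percent_agreement(test, retest, tolerance=0):
    """Percentage of respondents whose two ratings differ by at most `tolerance`.

    tolerance=0 gives exact agreement; tolerance=1 is used here as an
    assumed definition of "close" agreement.
    """
    if len(test) != len(retest) or not test:
        raise ValueError("test and retest must be non-empty and equally long")
    hits = sum(1 for a, b in zip(test, retest) if abs(a - b) <= tolerance)
    return 100.0 * hits / len(test)


# Hypothetical ratings of one item by 12 participants on two occasions.
test_scores = [4, 3, 5, 2, 4, 4, 1, 3, 5, 2, 3, 4]
retest_scores = [4, 3, 4, 2, 4, 5, 1, 3, 5, 4, 3, 4]

print(percent_agreement(test_scores, retest_scores))                  # 75.0
print(round(percent_agreement(test_scores, retest_scores, 1), 1))     # 91.7
```

Averaging the per-item exact agreement over all items would give a summary figure of the kind reported in the abstract (67%).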
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Passage Reading Fluency Assessments: Grade 4. Technical Report #1219

Science.gov (United States)

Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…
Test-retest reliability of knee extensor rate of velocity and power development in older adults using the isotonic mode on a Biodex System 3 dynamometer.

Science.gov (United States)

Van Driessche, Stijn; Van Roie, Evelien; Vanwanseele, Benedicte; Delecluse, Christophe

2018-01-01

Isotonic testing and measures of rapid power production are emerging as functionally relevant test methods for detection of muscle aging. Our objective was to assess reliability of rapid velocity and power measures in older adults using the isotonic mode of an isokinetic dynamometer. Sixty-three participants (aged 65 to 82 years) underwent a test-retest protocol with one week time interval. Isotonic knee extension tests were performed at four different loads: 0%, 25%, 50% and 75% of maximal isometric strength. Peak velocity (pV) and power (pP) were determined as the highest values of the velocity and power curve. Rate of velocity (RVD) and power development (RPD) were calculated as the linear slopes of the velocity- and power-time curve. Relative and absolute measures of test-retest reliability were analyzed using intraclass correlation coefficients (ICC), standard error of measurement (SEM) and Bland-Altman analyses. Overall, reliability was high for pV, pP, RVD and RPD at 0%, 25% and 50% load (ICC: .85 - .98, SEM: 3% - 10%). A trend for increased reliability at lower loads seemed apparent. The tests at 75% load led to range of motion failure and should be avoided. In addition, results demonstrated that caution is advised when interpreting early phase results (first 50ms). To conclude, our results support the use of the isotonic mode of an isokinetic dynamometer for testing rapid power and velocity characteristics in older adults, which is of high clinical relevance given that these muscle characteristics are emerging as the primary outcomes for preventive and rehabilitative interventions in aging research.
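The relative (ICC) and absolute (SEM) reliability measures named in the abstract can be sketched as below. The abstract does not say which ICC form was used; this example assumes the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1), and the common relation SEM = SD × sqrt(1 − ICC). The data are hypothetical.

```python
from math import sqrt


def icc_2_1(scores):
    """ICC(2,1) from a list of rows (one row per subject, one column per session)."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)              # between-subjects mean square
    msc = ss_cols / (k - 1)              # between-sessions mean square
    mse = ss_err / ((n - 1) * (k - 1))   # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


def sem_from_icc(sd, icc):
    """Standard error of measurement from a between-subject SD and an ICC."""
    return sd * sqrt(1.0 - icc)


# Hypothetical data: every subject scores exactly one unit higher at retest.
shifted = [[1, 2], [2, 3], [3, 4]]
print(round(icc_2_1(shifted), 3))  # 0.667
```

The example shows why an absolute-agreement ICC is informative for test-retest work: a perfectly consistent ranking still scores below 1 when there is a systematic shift between sessions.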
Test-retest reliability of pure-tone thresholds from 0.5 to 16 kHz using Sennheiser HDA 200 and Etymotic Research ER-2 earphones.

Science.gov (United States)

Schmuziger, Nicolas; Probst, Rudolf; Smurzynski, Jacek

2004-04-01

The purposes of the study were: (1) To evaluate the intrasession test-retest reliability of pure-tone thresholds measured in the 0.5-16 kHz frequency range for a group of otologically healthy subjects using Sennheiser HDA 200 circumaural and Etymotic Research ER-2 insert earphones and (2) to compare the data with existing criteria of significant threshold shifts related to ototoxicity and noise-induced hearing loss. Auditory thresholds in the frequency range from 0.5 to 6 kHz and in the extended high-frequency range from 8 to 16 kHz were measured in one ear of 138 otologically healthy subjects (77 women, 61 men; mean age, 24.4 yr; range, 12-51 yr) using HDA 200 and ER-2 earphones. For each subject, measurements of thresholds were obtained twice for both transducers during the same test session. For analysis, the extended high-frequency range from 8 to 16 kHz was subdivided into 8 to 12.5 and 14 to 16 kHz ranges. Data for each frequency and frequency range were analyzed separately. There were no significant differences in repeatability for the two transducer types for all frequency ranges. The intrasession variability increased slightly, but significantly, as frequency increased with the greatest amount of variability in the 14 to 16 kHz range. Analyzing each individual frequency, variability was increased particularly at 16 kHz. At each individual frequency and for both transducer types, intrasession test-retest repeatability from 0.5 to 6 kHz and 8 to 16 kHz was within 10 dB for >99% and >94% of measurements, respectively. The results indicated a false-positive rate of HDA 200. Repeatability was similar for both transducer types. Intrasession test-retest repeatability from 0.5 to 12.5 kHz at each individual frequency including the frequency range susceptible to noise-induced hearing loss was excellent for both transducers. Repeatability was slightly, but significantly poorer in the frequency range from 14 to 16 kHz compared with the frequency ranges from 0.5 to 6
Questionnaire for low back pain in the garment industry workers.

Science.gov (United States)

Bindra, Supreet; Sinha, A G K; Benjamin, A I

2013-05-01

Low back pain affects up to 90% of the world's population at some point in their lives. To date, no questionnaire has been designed for back pain in garment industry workers. Therefore, the objective of this study is to design a questionnaire to determine the prevalence, risk factors, impact, health care service utilization and back pain features in garment industry workers and to gain preliminary experience of its use. The content validity and reliability of the questionnaire were established. Items showing acceptable internal consistency and moderate to high test-retest reliability were retained in the questionnaire. Items showing unacceptable internal consistency, low test-retest reliability or poor differentiation were reworded, redrafted and re-tested on the workers. It took 20 min to complete one interview schedule. Environmental factors such as the absence of the garment industry owner/supervisor or co-workers at the time of the interview and interview during leisure hours need to be standardized. Thus, the final questionnaire is ready for use after necessary amendments and will be used on a larger sample in the main study.
Cross-Cultural Translation, Adaptation and Reliability of the Danish M. D. Anderson Dysphagia Inventory (MDADI) in Patients with Head and Neck Cancer.

Science.gov (United States)

Hajdú, Sara Fredslund; Plaschke, Christina Caroline; Johansen, Christoffer; Dalton, Susanne Oksbjerg; Wessel, Irene

2017-08-01

The objectives were to translate and culturally adapt the M.D. Anderson Dysphagia Inventory (MDADI) into Danish and subsequently test the reliability of the Danish version. The MDADI was translated into Danish and cross-culturally adapted through cognitive interviews. The final version was test-retest evaluated in a group of head and neck cancer (HNC) patients who responded to the questionnaire twice, a mean of eight days apart. Intraclass correlation coefficient, Cronbach's alpha, floor and ceiling effects, standard error of measurement and minimal detectable change were investigated. Fourteen patients were interviewed on the comprehensibility of the Danish MDADI, and all found the questionnaire meaningful, easy to understand, non-offensive and to include relevant aspects of dysphagia related to HNC. Sixty-four patients were included in the test-retest study. In particular, one item in the emotional scale (E7) appeared to be often misinterpreted, and ceiling effects were found in all four subdomains (global, emotional, functional and physical). The four subdomains and the composite score showed acceptable test-retest reliability and internal consistency in a Danish population of HNC patients. The Danish MDADI is reliable in terms of internal consistency and test-retest reproducibility and can be used in assessing the health-related quality of life in head and neck cancer patients with dysphagia.
Arabic validation of the Urogenital Distress Inventory and Adapted Incontinence Impact Questionnaires--short forms.

Science.gov (United States)

El-Azab, Ahmed S; Mascha, Edward J

2009-01-01

The purpose of this study was to adapt the IIQ-7 to suit the Egyptian culture and then to assess validity and reliability of the adapted and translated IIQ-7 and UDI-6. IIQ-7 was modified to suit Egyptian culture. Linguistic validation of the two questionnaires was done. Initial test-retest reliability and internal consistency of adapted translated questionnaires were done in a pilot study. The final validity, test-retest reliability and internal consistency study included 204 women with urinary incontinence (UI). Participants completed the two questionnaires at enrollment and after 2 weeks. All participants underwent urodynamics. Baseline urodynamic diagnosis was compared with diagnoses made by questionnaires to assess validity. Test-retest reliability was excellent for both the IIQ-7 and UDI-6. For the UDI-6, the mean difference (SD) between first and second visits was -1.63 (7.0), and the 95% CI for the mean difference was -2.6 and -0.68. The 95% limits of agreement were -15.3 and 12.0. Lin's concordance correlation coefficient (LCCC) (95% CI) for the UDI was 0.89 (0.85 and 0.91). For the IIQ-7, the mean difference (SD) was 0.37 (7.1), and the 95% CI for the mean difference was -0.60 and 1.3. The 95% limits of agreement were -13.5 and 14.2. LCCC (95% CI) for the IIQ was 0.90 (0.87 and 0.92). Internal consistency as assessed using Cronbach's alpha was 0.32 and 0.31 for the UDI-6 and IIQ-7, respectively. Validity assessments indicated that both IIQ and UDI scales can distinguish objective disease states. UDI-6 and the modified IIQ-7 are easy to administer, test-retest reliable, and valid questionnaires, with relatively low internal consistency. (c) 2008 Wiley-Liss, Inc.
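The agreement statistics quoted for the UDI-6 can be approximately reproduced from the reported mean difference and SD. This is a sketch under stated assumptions: a normal approximation (z = 1.96), and all 204 women contributing both visits, which the abstract implies but does not state outright. The function name is illustrative.

```python
from math import sqrt


def bland_altman_summary(mean_diff, sd_diff, n, z=1.96):
    """95% CI for the mean difference and 95% limits of agreement."""
    half_ci = z * sd_diff / sqrt(n)   # uncertainty of the mean difference
    half_loa = z * sd_diff            # spread of individual differences
    return {
        "ci_mean": (mean_diff - half_ci, mean_diff + half_ci),
        "loa": (mean_diff - half_loa, mean_diff + half_loa),
    }


# UDI-6 values as reported: mean difference -1.63, SD 7.0, n = 204.
udi = bland_altman_summary(-1.63, 7.0, 204)
print(udi["ci_mean"])  # close to the reported -2.6 and -0.68
print(udi["loa"])      # close to the reported -15.3 and 12.0
```

Small discrepancies in the last digit are expected because the abstract rounds the SD to one decimal place.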
Comparison of airtightness retesting results

Energy Technology Data Exchange (ETDEWEB)

1989-01-01

Polyethylene vapour barrier and airtight drywall are two methods used by the building industry to reduce air leakage in residential homes. Concern has been expressed that polyethylene air/vapour barriers degrade over time. This concern has led various agencies to test and retest homes for air leakage. This report is the compilation of the data collected as a result of that testing. Raw data were collected on 145 homes from various sources. Data were screened and the tests of homes were omitted from the analysis if the fan tests were done on the same house by different firms, if the construction of the house was not sufficiently complete, or if the initial air change rate per hour (ACH) was greater than 3. With these omissions from the database, 90 homes remained to be analyzed. The 90 homes were separated into two groups, those with an initial ACH less than 1.5 and those with an initial ACH between 1.5 and 3.0. The data were recorded in two tables which included the ACH, the time in months, the percentage change, and the difference in change between the first test and each subsequent test. These data indicate a relatively minor average change in airtightness. Keeping in mind the quantity of data collected and the time period examined, there is no indication that significant problems exist that would necessitate a change to the current building practice. 2 figs., 5 tabs.
The effect of sample storage on the performance and reproducibility of the galactomannan EIA test.

Science.gov (United States)

Kimpton, George; White, P Lewis; Barnes, Rosemary A

2014-08-01

Galactomannan enzyme immune assay (GM EIA) is a nonculture test for detecting invasive aspergillosis (IA) forming a key part of diagnosis and management. Recent reports have questioned the reproducibility of indices after sample storage. To investigate this, 198 serum samples (72 from cases and 126 from controls) and 61 plasma samples (24 from cases and 37 from controls), initially tested between 2010 and 2013, were retested to determine any change in index. Data were also collected on circulatory protein levels for false-positive serum samples. Serum indices significantly declined on retesting (median: initial, 0.50, retest, 0.23; P < 0.0001). This was shown to be diagnosis dependent as the decline was apparent on retesting of control samples (median: initial 0.50, retest 0.12; P < 0.0001), but was not evident with case samples (median: initial, 0.80, retest, 0.80; P = 0.724). Plasma samples showed little change on reanalysis after long-term storage at 4°C. Retesting after freezing showed a decrease in index values for controls (median: initial 0.40, retest 0.26; P = 0.0505), but no significant change in cases. Circulatory proteins showed a correlation between serum albumin concentration and difference in index value on retesting. Overall, this study suggests that a lack of reproducibility in GM EIA positivity is only significant when disease is absent. Retesting after freezing helps to differentiate false-positive GM EIA results and, with consecutive positivity, could help to improve accuracy in predicting disease status. The freezing of samples prior to testing could potentially reduce false-positivity rates and the need to retest. © The Author 2014. Published by Oxford University Press on behalf of The International Society for Human and Animal Mycology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Attentional and visual demands for sprint performance in non-fatigued and fatigued conditions: reliability of a repeated sprint test

Directory of Open Access Journals (Sweden)

Diercks Ron L

2010-05-01

Abstract Background Physical performance measures are widely used to assess physical function, providing information about physiological and biomechanical aspects of motor performance. However, they do not provide insight into the attentional and visual demands for motor performance. A figure-of-eight sprint test was therefore developed to measure the attentional and visual demands for repeated-sprint performance. The aims of the study were: (1) to assess test-retest reliability of the figure-of-eight sprint test, and (2) to study the attentional and visual demands for sprint performance in a non-fatigued and fatigued condition. Methods Twenty-seven healthy athletes were included in the study. To determine test-retest reliability, a subgroup of 19 athletes performed the figure-of-eight sprint test twice. The figure-of-eight sprint test consisted of nine 30-second sprints. The sprint test consisted of three test parts: sprinting without any restriction, with an attention-demanding task, and with restricted vision. Increases in sprint times with the attention-demanding task or restricted vision are reflective of the attentional and visual demands for sprinting. Intraclass correlation coefficients (ICCs) and mean difference between test and retest with 95% confidence limits (CL) were used to assess test-retest reliability. Repeated-measures ANOVAs were used for comparisons between the sprint times and fatigue measurements of the test parts in both a non-fatigued and fatigued condition. Results The figure-of-eight sprint test showed good test-retest reliability, with ICCs ranging from 0.75 to 0.94 (95% CL: 0.40-0.98). Zero lay within the 95% CL of the mean differences, indicating that no bias existed between sprint performance at test and retest. Sprint times during the test parts with attention-demanding task (P = 0.01) and restricted vision (P Conclusions High ICCs and the absence of systematic variation indicate good test-retest reliability of the figure
Investigating univariate temporal patterns for intrinsic connectivity networks based on complexity and low-frequency oscillation: a test-retest reliability study.

Science.gov (United States)

Wang, X; Jiao, Y; Tang, T; Wang, H; Lu, Z

2013-12-19

Intrinsic connectivity networks (ICNs) are composed of spatial components and time courses. The spatial components of ICNs have been found to have moderate-to-high reliability. To our knowledge, few studies have focused on the reliability of the temporal patterns of ICNs based on their individual time courses. The goals of this study were twofold: to investigate the test-retest reliability of temporal patterns for ICNs, and to analyze these informative univariate metrics. Additionally, a correlation analysis was performed to enhance interpretability. Our study included three datasets: (a) short- and long-term scans, (b) multi-band echo-planar imaging (mEPI), and (c) eyes open or closed. Using dual regression, we obtained the time courses of ICNs for each subject. To produce temporal patterns for ICNs, we applied two categories of univariate metrics: network-wise complexity and network-wise low-frequency oscillation. Furthermore, we validated the test-retest reliability for each metric. The network-wise temporal patterns for most ICNs (especially for the default mode network, DMN) exhibited moderate-to-high reliability and reproducibility under different scan conditions. Network-wise complexity for the DMN exhibited fair reliability (ICC<0.5) based on eyes-closed sessions. In particular, our results supported that mEPI could be a useful method with high reliability and reproducibility. In addition, these temporal patterns were physiologically meaningful, and certain temporal patterns were correlated with the node strength of the corresponding ICN. Overall, network-wise temporal patterns of ICNs were reliable and informative and could be complementary to spatial patterns of ICNs for further study. Copyright © 2013 IBRO. Published by Elsevier Ltd. All rights reserved.
Two Year Longitudinal Change and Test-Retest-Precision of Knee Cartilage Morphology in a Pilot Study for the Osteoarthritis Initiative

Science.gov (United States)

Eckstein, Felix; Kunz, Manuela; Schutzer, Matt; Hudelmaier, Martin; Jackson, Rebecca D.; Yu, Joseph; Eaton, Charles B.; Schneider, Erika

2009-01-01

Objective Fast low angle shot (FLASH) and double echo steady state (DESS) MRI sequences were recently cross-calibrated for quantification of cartilage morphology at 3 Tesla. In this pilot study for the Osteoarthritis Initiative we compare their test-retest precision and sensitivity to longitudinal change. Method 9 participants with mild to moderate clinical OA were imaged at baseline, year 1 and year 2. Coronal 1.5mm FLASH and sagittal 0.7mm DESS sequences were acquired; 1.5mm coronal multiplanar reformats (MPR) were obtained from the DESS. Patellar, femoral and tibial cartilage plates were quantified in paired fashion, with blinding to time point. Results In the weight-bearing femorotibial joint, average precision errors across plates were 1.8% for FLASH, 2.6% for DESS, and 3.0% for MPR-DESS. Volume loss at year 1 was not significant; at year 2 the average change across the femorotibial cartilage plates was −1.7% for FLASH, −2.8% for DESS, and −0.3% for MPR-DESS. Volume change in the lateral tibia (−5.5%; p<0.03), and in the medial (−2.9%; p<0.04) and lateral femorotibial compartment (−3.8%; p<0.03) were significant for DESS. Conclusion FLASH, MPR-DESS and DESS all displayed adequate test-retest precision. Although the comparison between protocols is limited by the small number of participants and by the relatively small longitudinal change in cartilage morphology in this pilot study, the data suggest that significant change can be detected with MRI in a small sample of OA subjects over 2 years. PMID:17560813
An overview of coefficient alpha and a reliability matrix for estimating adequacy of internal consistency coefficients with psychological research measures.

Science.gov (United States)

Ponterotto, Joseph G; Ruckdeschel, Daniel E

2007-12-01

The present article addresses issues in reliability assessment that are often neglected in psychological research such as acceptable levels of internal consistency for research purposes, factors affecting the magnitude of coefficient alpha (alpha), and considerations for interpreting alpha within the research context. A new reliability matrix anchored in classical test theory is introduced to help researchers judge adequacy of internal consistency coefficients with research measures. Guidelines and cautions in applying the matrix are provided.
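For reference, coefficient alpha itself can be computed from item-level scores as in the sketch below. This uses the standard classical test theory formula with sample variances; the function name and the toy data are illustrative and do not come from the article.

```python
from statistics import variance


def cronbach_alpha(items):
    """Coefficient alpha from a list of item-score lists.

    `items[i][j]` is the score of respondent j on item i; all items must
    cover the same respondents in the same order.
    """
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    totals = [sum(scores) for scores in zip(*items)]  # per-respondent total score
    sum_item_var = sum(variance(item) for item in items)
    return k / (k - 1) * (1.0 - sum_item_var / variance(totals))


# Three perfectly redundant items give alpha = 1.
print(cronbach_alpha([[1, 2, 3, 4], [1, 2, 3, 4], [1, 2, 3, 4]]))
# Two uncorrelated items give alpha = 0.
print(cronbach_alpha([[1, 2, 1, 2], [1, 1, 2, 2]]))
```

Because alpha rises with both the number of items and the average inter-item correlation, the same coefficient can mean different things for short and long scales, which is one of the interpretive cautions the abstract alludes to.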
Psychometrics and the neuroscience of individual differences: Internal consistency limits between-subjects effects.

Science.gov (United States)

Hajcak, Greg; Meyer, Alexandria; Kotov, Roman

2017-08-01

In the clinical neuroscience literature, between-subjects differences in neural activity are presumed to reflect reliable measures-even though the psychometric properties of neural measures are almost never reported. The current article focuses on the critical importance of assessing and reporting internal consistency reliability-the homogeneity of "items" that comprise a neural "score." We demonstrate how variability in the internal consistency of neural measures limits between-subjects (i.e., individual differences) effects. To this end, we utilize error-related brain activity (i.e., the error-related negativity or ERN) in both healthy and generalized anxiety disorder (GAD) participants to demonstrate options for psychometric analyses of neural measures; we examine between-groups differences in internal consistency, between-groups effect sizes, and between-groups discriminability (i.e., ROC analyses)-all as a function of increasing items (i.e., number of trials). Overall, internal consistency should be used to inform experimental design and the choice of neural measures in individual differences research. The internal consistency of neural measures is necessary for interpreting results and guiding progress in clinical neuroscience-and should be routinely reported in all individual differences studies. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
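The dependence of internal consistency on the number of trials can be illustrated with the classical Spearman-Brown formula. This is a sketch from classical test theory, not necessarily the procedure used in the article; the reliability values are hypothetical.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability when a measure is lengthened by `length_factor`.

    With length_factor=2 this converts a split-half correlation into an
    estimate for the full-length measure.
    """
    return length_factor * reliability / (1.0 + (length_factor - 1.0) * reliability)


# Hypothetical split-half correlation of 0.6 between odd and even trials.
print(round(spearman_brown(0.6, 2), 2))  # 0.75

# Predicted reliability as the number of trials grows from a baseline block.
for factor in (1, 2, 4, 8):
    print(factor, round(spearman_brown(0.4, factor), 2))
```

Under this model reliability increases with trial count but with diminishing returns, which is one reason to examine internal consistency when deciding how many trials a neural score should average over.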

Diffusion-weighted (DW) MRI in lung cancers. ADC test-retest repeatability

Energy Technology Data Exchange (ETDEWEB)

Weller, Alex; Papoutsaki, Marianthi Vasiliki; Blackledge, Matthew; DeSouza, Nandita M. [Institute of Cancer Research and Royal Marsden NHS Foundation Trust, CRUK Cancer Imaging Centre, Surrey (United Kingdom); Waterton, John C. [University of Manchester, Manchester (United Kingdom); Chiti, Arturo [Humanitas University, Milan (Italy); Stroobants, Sigrid [Universiteit Antwerpen, Antwerpen (Belgium); Kuijer, Joost [Vrije Universiteit Medisch Centrum, Amsterdam (Netherlands); Morgan, Veronica [Royal Marsden NHS Foundation Trust, Department of Medicine, London (United Kingdom)

2017-11-15

To determine the test-retest repeatability of Apparent Diffusion Coefficient (ADC) measurements across institutions and MRI vendors, plus investigate the effect of post-processing methodology on measurement precision. Thirty malignant lung lesions >2 cm in size (23 patients) were scanned on two occasions, using echo-planar-Diffusion-Weighted (DW)-MRI to derive whole-tumour ADC (b = 100, 500 and 800 s/mm²). Scanning was performed at 4 institutions (3 MRI vendors). Whole-tumour volumes-of-interest were copied from first visit onto second visit images and from one post-processing platform to an open-source platform, to assess ADC repeatability and cross-platform reproducibility. Whole-tumour ADC values ranged from 0.66-1.94 × 10⁻³ mm² s⁻¹ (mean = 1.14). Within-patient coefficient-of-variation (wCV) was 7.1% (95% CI 5.7-9.6%), limits-of-agreement (LoA) -18.0 to 21.9%. Lesions >3 cm had improved repeatability: wCV 3.9% (95% CI 2.9-5.9%); and LoA -10.2 to 11.4%. Variability for lesions <3 cm was 2.46 times higher. ADC reproducibility across different post-processing platforms was excellent: Pearson's R² = 0.99; CoV 2.8% (95% CI 2.3-3.4%); and LoA -7.4 to 8.0%. A free-breathing DW-MRI protocol for imaging malignant lung tumours achieved satisfactory within-patient repeatability and was robust to changes in post-processing software, justifying its use in multi-centre trials. For response evaluation in individual patients, a change in ADC >21.9% will reflect treatment-related change. (orig.)
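The link between a within-patient coefficient of variation and percentage limits of agreement can be sketched as follows. The abstract does not describe its estimator; this example assumes the analysis was done on log-transformed ADC values with negligible mean bias, an inference from the asymmetric limits rather than something the source states. Function names are illustrative.

```python
from math import exp, log, sqrt


def within_subject_log_sd(pairs):
    """Within-subject SD on the log scale from (visit 1, visit 2) pairs."""
    squared = [(log(a) - log(b)) ** 2 for a, b in pairs]
    return sqrt(sum(squared) / (2 * len(pairs)))


def wcv_from_log_sd(sigma_w):
    """Within-subject coefficient of variation implied by a log-scale SD."""
    return sqrt(exp(sigma_w ** 2) - 1.0)


def percent_limits_of_agreement(sigma_w, z=1.96):
    """Asymmetric percentage limits for the change between two visits."""
    half_width = z * sqrt(2.0) * sigma_w
    return 100.0 * (exp(-half_width) - 1.0), 100.0 * (exp(half_width) - 1.0)


# A log-scale SD of about 0.071 corresponds to a wCV near 7.1%.
lower, upper = percent_limits_of_agreement(0.071)
print(round(lower, 1), round(upper, 1))
```

With these assumptions the limits come out at roughly -18% and +22%, close to the reported -18.0 to 21.9%, which is why a change larger than the upper limit is treated as exceeding measurement noise in an individual patient.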
Reliability of the Five Digit Test in Brazilian adults (Confiabilidade do Teste dos Cinco Dígitos em adultos brasileiros)

Directory of Open Access Journals (Sweden)

Maene Cristina Campos

2016-06-01

ABSTRACT Objective The present study analyzed the reliability of the Five Digit Test (FDT), an instrument for assessing attentional processes based on the Stroop paradigm. The test uses numbers and quantities to assess the attentional interference effect. Methods We assessed 49 Brazilian adults using the FDT. Participants took the test on two occasions, approximately two weeks apart. The reliability of the test was estimated by internal consistency (split-half method) and by assessing test-retest stability (intraclass correlation coefficient and Wilcoxon test for repeated samples). Results Participants' mean response time showed a slight improvement in the simpler stages of the test and a more pronounced improvement in the more complex stages. The internal consistency of the test was above 0.9. Test-retest stability varied according to the stage of the test, and all correlations were significant (p < 0.01) and explained between 60% and 90% of the variance found. Conclusion The FDT shows robust evidence of reliability in the sample assessed. This was the first Brazilian study to evaluate this property by the test-retest method. The results support better applicability of the FDT in clinical and research contexts.
Reliability of perceived neighbourhood conditions and the effects of measurement error on self-rated health across urban and rural neighbourhoods.

Science.gov (United States)

Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario

2012-04-01

Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Test-retest reliability, smallest real difference and concurrent validity of six different balance tests on young people with mild to moderate intellectual disability.

Science.gov (United States)

Blomqvist, Sven; Wester, Anita; Sundelin, Gunnevi; Rehn, Börje

2012-12-01

Some studies have reported that people with intellectual disability may have reduced balance ability compared with the population in general. However, none of these studies involved adolescents, and the reliability and validity of balance tests in this population are not known. The purpose of this study was to examine the reliability of six different balance tests and to investigate their concurrent validity. Test-retest reliability assessment. All subjects were recruited from a special school for people with intellectual disability in Bollnäs, Sweden. Eighty-nine adolescents (35 females and 54 males) with mild to moderate intellectual disability with a mean age of 18 years (range 16 to 20 years). All subjects followed the same test protocol on two occasions within an 11-day period. Balance test performances. Intraclass correlation coefficients greater than 0.80 were achieved for four of the balance tests: Extended Timed Up and Go Test, Modified Functional Reach Test, One-leg Stance Test and Force Platform Test. The smallest real differences ranged from 12% to 40%; less than 20% is considered to be low. Concurrent validity among these balance tests varied between no and low correlation. The results indicate that these tests could be used to evaluate changes in balance ability over time in people with mild to moderate intellectual disability. The low concurrent validity illustrates the importance of knowing more about the influence of various sensory subsystems that are significant for balance among adolescents with intellectual disability. Copyright © 2011 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
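The smallest real difference (SRD) is commonly derived from the standard error of measurement (SEM) as SRD = 1.96 × sqrt(2) × SEM and then expressed as a percentage of the mean score. The sketch below assumes that conventional definition, which the abstract does not spell out, and uses hypothetical numbers.

```python
from math import sqrt


def smallest_real_difference(sem, z=1.96):
    """Smallest change between two measurements that exceeds measurement error."""
    return z * sqrt(2.0) * sem


def srd_percent(sem, mean_score, z=1.96):
    """SRD expressed as a percentage of the mean score."""
    return 100.0 * smallest_real_difference(sem, z) / mean_score


# Hypothetical balance test: SEM of 1.5 s around a mean time of 12.0 s.
print(round(srd_percent(1.5, 12.0), 1))  # 34.6
```

A value like this would fall well above the 20% threshold that the abstract describes as low, meaning an individual would need to change by about a third of the mean score before the change could be distinguished from measurement error.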
Reliability, validity and description of timed performance of the Jebsen-Taylor Test in patients with muscular dystrophies.

Science.gov (United States)

Artilheiro, Mariana Cunha; Fávero, Francis Meire; Caromano, Fátima Aparecida; Oliveira, Acary de Souza Bulle; Carvas, Nelson; Voos, Mariana Callil; Sá, Cristina Dos Santos Cardoso de

2017-12-08

The Jebsen-Taylor Test evaluates upper limb function by measuring timed performance on everyday activities. The test is used to assess and monitor the progression of patients with Parkinson disease, cerebral palsy, stroke and brain injury. This study aimed to analyze the reliability, internal consistency and validity of the Jebsen-Taylor Test in people with Muscular Dystrophy and to describe and classify their upper limb timed performance. Fifty patients with Muscular Dystrophy were assessed. Non-dominant and dominant upper limb performances on the Jebsen-Taylor Test were filmed. Two raters evaluated timed performance for inter-rater reliability analysis. Test-retest reliability was investigated by using intraclass correlation coefficients. Internal consistency was assessed using Cronbach's alpha. Construct validity was assessed by comparing the Jebsen-Taylor Test with the Performance of Upper Limb. The internal consistency of the Jebsen-Taylor Test was good (Cronbach's α=0.98). Inter-rater reliability was very high (intraclass correlation coefficient 0.903-0.999), except for writing (intraclass correlation coefficient 0.772-1.000). Strong correlations between the Jebsen-Taylor Test and the Performance of Upper Limb Module were found (rho=-0.712). The Jebsen-Taylor Test is a reliable and valid measure of timed performance for people with Muscular Dystrophy. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Published by Elsevier Editora Ltda. All rights reserved.
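The record above relies on intraclass correlation coefficients but does not say which ICC form was used. As an illustration only, the sketch below assumes the two-way random-effects, absolute-agreement, single-measure form, often written ICC(2,1), computed from the mean squares of a subjects × occasions table. The function name and data are hypothetical.

```python
def icc_2_1(data):
    """ICC(2,1) for a table with one row per subject and one column per occasion or rater."""
    n, k = len(data), len(data[0])
    if n < 2 or k < 2:
        raise ValueError("need at least 2 subjects and 2 occasions")
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]
    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_err = ss_total - ss_rows - ss_cols
    msr = ss_rows / (n - 1)              # between-subject mean square
    msc = ss_cols / (k - 1)              # between-occasion mean square
    mse = ss_err / ((n - 1) * (k - 1))   # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Hypothetical timings: the retest is exactly 1 unit slower for every subject
shifted = [[1, 2], [2, 3], [3, 4], [4, 5]]
icc_shifted = icc_2_1(shifted)
```

Because this form measures absolute agreement, a systematic shift between test and retest lowers the coefficient even when subjects keep exactly the same rank order, which is one reason the choice of ICC form matters when comparing studies.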
Hypertension Knowledge-Level Scale (HK-LS): A Study on Development, Validity and Reliability

OpenAIRE

Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin

2012-01-01

This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensio...
The Development of a Tracheostomy-Specific Quality of Life Questionnaire: A Pilot Study.

Science.gov (United States)

Smith, Kristine A; Bosch, John Douglas; Pelletier, Guy; MacKenzie, Marianne; Hoy, Monica Y

2016-08-01

A long-term tracheostomy can be a life-altering event and can have significant effects on patients' quality of life (QOL). There is currently no instrument available to evaluate tracheostomy-specific QOL. To address this deficiency, the objective of this study was to create and preliminarily validate a pilot tracheostomy-specific QOL questionnaire to assess its feasibility. A multidisciplinary team developed the pilot tracheostomy-specific QOL questionnaire (TQOL) in 3 phases: item generation, item review, and scale construction. The survey was administered at 0 and 2 weeks to a pilot group of tracheostomy patients with concurrent administration of a validated general QOL questionnaire at week 0. Convergence validity, test-retest reliability, and internal consistency were the primary outcome measures. A total of 37 patients completed the study (mean tracheostomy duration = 90 weeks). The convergence validity of the TQOL was moderately strong (r = 0.72), and the test-retest reliability was strong (r = 0.75). The TQOL also demonstrated good internal consistency (Cronbach's alpha = 0.82). The TQOL has moderately strong internal consistency, convergence validity, and test-retest reliability. While additional refinement and validation may improve the questionnaire, these initial results are promising and support further development of this tool. © The Author(s) 2016.
Validation of the Social Inclusion Scale with Students

Directory of Open Access Journals (Sweden)

Ceri Wilson

2015-07-01

Interventions (such as participatory arts projects) aimed at increasing social inclusion are increasingly in operation, as social inclusion is proving to play a key role in recovery from mental ill health and the promotion of mental wellbeing. These interventions require evaluation with a systematically developed and validated measure of social inclusion; however, a “gold-standard” measure does not yet exist. The Social Inclusion Scale (SIS) has three subscales measuring social isolation, relations and acceptance. This scale has been partially validated with arts and mental health project users, demonstrating good internal consistency. However, test-retest reliability and construct validity require assessment, along with validation in the general population. The present study aimed to validate the SIS in a sample of university students. Test-retest reliability, internal consistency, and convergent validity (one aspect of construct validity) were assessed by comparing SIS scores with scores on other measures of social inclusion and related concepts. Participants completed the measures at two time-points seven-to-14 days apart. The SIS demonstrated high internal consistency and test-retest reliability, although convergent validity was less well-established and possible reasons for this are discussed. This systematic validation of the SIS represents a further step towards the establishment of a “gold-standard” measure of social inclusion.
Psychometric Evaluation of the Young Children's Participation and Environment Measure (YC-PEM) for use in Singapore.

Science.gov (United States)

Lim, Chun Yi; Law, Mary; Khetani, Mary; Rosenbaum, Peter; Pollock, Nancy

2018-08-01

To estimate the psychometric properties of a culturally adapted version of the Young Children's Participation and Environment Measure (YC-PEM) for use among Singaporean families. This is a prospective cohort study. Caregivers of 151 Singaporean children with (n = 83) and without (n = 68) developmental disabilities, between 0 and 7 years, completed the YC-PEM (Singapore) questionnaire with 3 participation scales (frequency, involvement, and change desired) and 1 environment scale for three settings: home, childcare/preschool, and community. Setting-specific estimates of internal consistency, test-retest reliability, and construct validity were obtained. Internal consistency estimates varied from .59 to .92 for the participation scales and .73 to .79 for the environment scale. Test-retest reliability estimates from the YC-PEM conducted on two occasions, 2-3 weeks apart, varied from .39 to .89 for the participation scales and from .65 to .80 for the environment scale. Moderate to large differences were found in participation and perceived environmental support between children with and without a disability. YC-PEM (Singapore) scales have adequate psychometric properties except for low internal consistency for the childcare/preschool participation frequency scale and low test-retest reliability for home participation frequency scale. The YC-PEM (Singapore) may be used for population-level studies involving young children with and without developmental disabilities.
Repeat HIV Testing at Voluntary Testing and Counseling Centers in Croatia: Successful HIV Prevention or Failure to Modify Risk Behaviors?

Science.gov (United States)

Matković Puljić, Vlatka; Kosanović Ličina, Mirjana Lana; Kavić, Marija; Nemeth Blažić, Tatjana

2014-01-01

HIV testing plays a critical role in preventing the spread of the virus and identifying infected individuals in need of care. Voluntary counseling and testing centers (VCTs) not only conduct testing but they also provide counseling. Since a proportion of people who test negative for HIV on their previous visit will return for retesting, the frequency of retesting and the characteristics of those who retest may provide insights into the efficacy of testing and counseling strategies. In this cross-sectional, retrospective study of 1,482 VCT clients in Croatia in 2010, 44.3% had been tested for HIV before. The rate of repeat HIV testing is lower in Croatia than in other countries. Men who have sex with men (MSM) clients, those with three or more sexual partners in the last 12 months, consistent condom users with steady partners, and intravenous drug users were more likely to be repeat testers. This finding suggests that clients presenting for repeat HIV testing are those who self-identify as being at a higher risk of infection. Our data showed that testing positive for HIV was not associated with repeat testing. However, the effects of repeat testing on HIV epidemiology need to be explored. PMID:24705595
Validation of Portuguese version of Quality of Erection Questionnaire (QEQ) and comparison to International Index of Erectile Function (IIEF) and RAND 36-Item Health Survey.

Science.gov (United States)

Reis, Ana Luiza; Reis, Leonardo Oliveira; Saade, Ricardo Destro; Santos, Carlos Alberto; Lima, Marcelo Lopes de; Fregonesi, Adriano

2015-01-01

To validate the Quality of Erection Questionnaire (QEQ) considering Brazilian social-cultural aspects. To determine equivalence between the Portuguese and the English QEQ versions, the Portuguese version was back-translated by two professors who are native English speakers. After language equivalence had been determined, urologists considered the QEQ Portuguese version suitable. Men with self-reported erectile dysfunction (ED) and infertile men who had a stable sexual relationship for at least 6 months were invited to answer the QEQ, the International Index of Erectile Function (IIEF) and the RAND 36-Item Health Survey (RAND-36). The questionnaires were presented together and answered without help in a private room. Internal consistency (Cronbach's α), test-retest reliability (Spearman), convergent validity (Spearman correlation) coefficients and known-groups validity (the ability of the QEQ Portuguese version to differentiate erectile dysfunction severity groups) were assessed. We recruited 197 men (167 ED patients and 30 non-ED patients), mean age of 53.3 and median of 55.5 years (23-82 years). The Portuguese version of the QEQ had high internal consistency (Cronbach's α=0.93), high stability between test and retest (ICC 0.83, 95% CI: 0.76-0.88, p [...] Portuguese version presented good psychometric properties and high convergent validity in relation to IIEF. The low correlations between the QEQ and the RAND-36, as well as between the IIEF and the RAND-36, indicated IIEF and QEQ specificity, which may have resulted from the patients' psychological adaptations that minimized the impact of ED on Quality of Life (QoL) and reestablished the feeling of well-being.
Test-retest reliability of the novel 5-HT1B receptor PET radioligand [11C]P943

International Nuclear Information System (INIS)

Saricicek, Aybala; Chen, Jason; Ruf, Barbara; Planeta, Beata; Labaree, David; Gallezot, Jean-Dominique; Huang, Yiyun; Subramanyam, Kalyani; Maloney, Kathleen; Matuskey, David; Deserno, Lorenz; Neumeister, Alexander; Krystal, John H.; Carson, Richard E.; Bhagwagar, Zubin

2015-01-01

[11C]P943 is a novel, highly selective 5-HT1B PET radioligand. The aim of this study was to determine the test-retest reliability of [11C]P943 using two different modeling methods and to perform a power analysis with each quantification technique. Seven healthy volunteers underwent two PET scans on the same day. Regions of interest (ROIs) were the amygdala, hippocampus, pallidum, putamen, insula, frontal, anterior cingulate, parietal, temporal and occipital cortices, and cerebellum. Two multilinear radioligand quantification techniques were used to estimate binding potential: MA1, using arterial input function data, and the second version of the multilinear reference tissue model analysis (MRTM2), using the cerebellum as the reference region. Between-scan percent variability and intraclass correlation coefficients (ICC) were used to assess test-retest reliability. We also performed power analyses to determine the method that would allow the least number of subjects using within-subject or between-subject study designs. A voxel-wise ICC analysis for MRTM2 BP_ND was performed for the whole brain and all the ROIs studied. Mean percent variability between two scans across regions ranged between 0.4% and 12.4% for MA1 BP_ND, 0.5% and 11.5% for MA1 BP_P, 16.7% and 28.3% for MA1 BP_F, and between 0.2% and 5.4% for MRTM2 BP_ND. The power analyses showed a greater number of subjects were required using MA1 BP_F compared with other outcome measures for both within-subject and between-subject study designs. ICC values were the highest using MRTM2 BP_ND and the lowest with MA1 BP_F in ten ROIs. Small regions and regions with low binding had lower ICC values than large regions and regions with high binding. Reliable measures of 5-HT1B receptor binding can be obtained using the novel PET radioligand [11C]P943. Quantification of 5-HT1B receptor binding with MRTM2 BP_ND and with MA1 BP_P provided the least variability and optimal power for within-subject and
Validation of the Brazilian Portuguese Version of Geriatric Anxiety Inventory--GAI-BR.

Science.gov (United States)

Massena, Patrícia Nitschke; de Araújo, Narahyana Bom; Pachana, Nancy; Laks, Jerson; de Pádua, Analuiza Camozzato

2015-07-01

The Geriatric Anxiety Inventory (GAI) is a recently developed scale aiming to evaluate symptoms of anxiety in later life. This 20-item scale uses dichotomous answers highlighting non-somatic anxiety complaints of elderly people. The present study aimed to evaluate the psychometric properties of the Brazilian Portuguese version of the GAI (GAI-BR) in a sample from the community and an outpatient psychogeriatric clinic. A mixed convenience sample of 72 subjects was recruited to answer the research protocol. The interview procedures were structured with questionnaires about sociodemographic data and clinical health status, previously validated instruments for anxiety and depression, the Mini-Mental State Examination, the Mini International Neuropsychiatric Interview, and the GAI-BR. Twenty-two percent of the sample were interviewed twice for test-retest reliability. For internal consistency analyses, Cronbach's α was applied. The Spearman correlation test was applied to evaluate the test-retest reliability of the GAI-BR. A ROC (receiver operating characteristic) curve study was made to estimate the GAI-BR area under the curve, cut-off points, sensitivity, and specificity for the Generalized Anxiety Disorder diagnosis. The GAI-BR version showed high internal consistency (Cronbach's α = 0.91) and strong and significant test-retest reliability (ρ = 0.85, p [...] BR has demonstrated very good psychometric properties and can be a reliable instrument to measure anxiety in Brazilian elderly people.
Testing the Psychometric Properties of a Chinese Version of the Level of Expressed Emotion Scale

Directory of Open Access Journals (Sweden)

Wai Tong Chien

2014-01-01

This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76–0.95 and ICC = 0.81–0.92), test-retest reliability (r = 0.89–0.95, P<0.01), and internal consistency (Cronbach’s α = 0.86–0.92). Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement), accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ2/df = 1.93, P=0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78) to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = −0.54) and family burden (r = 0.49) and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90). The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P<0.01). The Chinese version demonstrates sound psychometric properties to measure families’ expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.
Brief International Cognitive Assessment for MS (BICAMS): international standards for validation.

Science.gov (United States)

Benedict, Ralph H B; Amato, Maria Pia; Boringa, Jan; Brochet, Bruno; Foley, Fred; Fredrikson, Stan; Hamalainen, Paivi; Hartung, Hans; Krupp, Lauren; Penner, Iris; Reder, Anthony T; Langdon, Dawn

2012-07-16

An international expert consensus committee recently recommended a brief battery of tests for cognitive evaluation in multiple sclerosis. The Brief International Cognitive Assessment for MS (BICAMS) battery includes tests of mental processing speed and memory. Recognizing that resources for validation will vary internationally, the committee identified validation priorities, to facilitate international acceptance of BICAMS. Practical matters pertaining to implementation across different languages and countries were discussed. Five steps to achieve optimal psychometric validation were proposed. In Step 1, test stimuli should be standardized for the target culture or language under consideration. In Step 2, examiner instructions must be standardized and translated, including all information from manuals necessary for administration and interpretation. In Step 3, samples of at least 65 healthy persons should be studied for normalization, matched to patients on demographics such as age, gender and education. The objective of Step 4 is test-retest reliability, which can be investigated in a small sample of MS and/or healthy volunteers over 1-3 weeks. Finally, in Step 5, criterion validity should be established by comparing MS and healthy controls. At this time, preliminary studies are underway in a number of countries as we move forward with this international assessment tool for cognition in MS.
An Examination of Test-Retest, Alternate Form Reliability, and Generalizability Theory Study of the easyCBM Word and Passage Reading Fluency Assessments: Grade 3. Technical Report #1218

Science.gov (United States)

Park, Bitnara Jasmine; Anderson, Daniel; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

2012-01-01

This technical report is one in a series of five describing the reliability (test/retest and alternate form) and G-Theory/D-Study research on the easyCBM reading measures, grades 1-5. Data were gathered in the spring of 2011 from a convenience sample of students nested within classrooms at a medium-sized school district in the Pacific Northwest.…
The Children's Body Image Scale: reliability and use with international standards for body mass index.

Science.gov (United States)

Truby, Helen; Paxton, Susan J

2008-03-01

To test the reliability of the Children's Body Image Scale (CBIS) and assess its usefulness in the context of new body size charts for children. Participants were 281 primary schoolchildren with 50% being retested after 3 weeks. The CBIS figure scale was compared with a range of international body mass index (BMI) reference standards. Children had a high degree of body image dissatisfaction. The test-retest reliability of the CBIS was supported. The CBIS is a useful tool for assessing body image in children with sound scale properties. It can also be used to identify the body size of children, which lies outside the healthy weight range of BMI.
Tradução, adaptação e avaliação da consistência interna do Eating Behaviours and Body Image Test para uso com crianças do sexo feminino Translation, adaptation and internal consistency evaluation of the Eating Behaviours and Body Image Test for female children

Directory of Open Access Journals (Sweden)

Elizângela Moreira Careta Galindo

2007-02-01

This study aimed to translate, adapt and validate the Eating Behaviours and Body Image Test for use with children in a city in the interior of the state of São Paulo. Study subjects were 261 female students aged 9 to 12 years. The internal consistency of the instrument was evaluated by means of factor analysis with varimax rotation. This analysis, performed with the Statistical Package for Social Sciences, version 10.0, revealed two factors. Internal consistency was adequate for the total instrument (Cronbach's alpha=0.89), and the alpha values for the two factors (1 and 2) were also considered satisfactory (alpha=0.90 and alpha=0.80, respectively), which demonstrated that the Eating Behaviours and Body Image Test is useful for early evaluation, screening for attitudes that indicate possible eating behaviour disorders. The psychometric characteristics of the original instrument were maintained.
Developing a Danish version of the "Impact on Participation and Autonomy Questionnaire"

DEFF Research Database (Denmark)

Ghaziani, E.; Krogh, A.G.; Lund, Hans

2013-01-01

Objective: To translate the "Impact on Participation and Autonomy Questionnaire" into Danish (IPAQ-DK), and estimate its internal consistency and test-retest reliability in order to promote participation-based interventions and research. Design: Translation and two successive reliability...... and cultural adaptation of the instrument. The revised version (IPAQ-DK) was subsequently subjected to a similar assessment demonstrating Cronbach's alpha values from 0.698 to 0.817. Weighted kappa ranged from 0.370 to 0.880; 78% of these values were higher than 0.600. The intraclass correlation coefficient...
Psychometric Properties of the Autism-Spectrum Quotient for Assessing Low and High Levels of Autistic Traits in College Students.

Science.gov (United States)

Stevenson, Jennifer L; Hart, Kari R

2017-06-01

The current study systematically investigated the effects of scoring and categorization methods on the psychometric properties of the Autism-Spectrum Quotient. Four hundred and three college students completed the Autism-Spectrum Quotient at least once. Total scores on the Autism-Spectrum Quotient had acceptable internal consistency and test-retest reliability using a binary or Likert scoring method, but the results were more varied for the subscales. Overall, Likert scoring yielded higher internal consistency and test-retest reliability than binary scoring. However, agreement in categorization of low and high autistic traits was poor over time (except for a median split on Likert scores). The results support using Likert scoring and administering the Autism-Spectrum Quotient at the same time as the task of interest with neurotypical participants.

Research Review: Test-retest reliability of standardized diagnostic interviews to assess child and adolescent psychiatric disorders: a systematic review and meta-analysis.

Science.gov (United States)

Duncan, Laura; Comeau, Jinette; Wang, Li; Vitoroulis, Irene; Boyle, Michael H; Bennett, Kathryn

2018-02-19

A better understanding of factors contributing to the observed variability in estimates of test-retest reliability in published studies on standardized diagnostic interviews (SDI) is needed. The objectives of this systematic review and meta-analysis were to estimate the pooled test-retest reliability for parent and youth assessments of seven common disorders, and to examine sources of between-study heterogeneity in reliability. Following a systematic review of the literature, multilevel random effects meta-analyses were used to analyse 202 reliability estimates (Cohen's kappa = κ) from 31 eligible studies and 5,369 assessments of 3,344 children and youth. Pooled reliability was moderate at κ = .58 (CI 95% 0.53-0.63) and between-study heterogeneity was substantial (Q = 2,063 (df = 201), p [...] reliability varied across informants for specific types of psychiatric disorder (κ = .53-.69 for parent vs. κ = .39-.68 for youth) with estimates significantly higher for parents on attention deficit hyperactivity disorder, oppositional defiant disorder and the broad groupings of externalizing and any disorder. Reliability was also significantly higher in studies with indicators of poor or fair study methodology quality (sample size [...] reliability of SDIs and the usefulness of these tools in both clinical and research contexts. Potential remedies include the introduction of standardized study and reporting requirements for reliability studies, and exploration of other approaches to assessing and classifying child and adolescent psychiatric disorder. © 2018 Association for Child and Adolescent Mental Health.
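The reliability estimates pooled in the record above are Cohen's kappa values. For orientation, a minimal sketch of how a single kappa is computed from two assessments of the same subjects follows, using κ = (observed agreement − chance agreement) / (1 − chance agreement). The function name and the diagnosis vectors are hypothetical and are not data from the review.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's kappa for two equally long sequences of categorical ratings."""
    if len(first) != len(second) or not first:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(first)
    # Proportion of subjects given the same category on both occasions
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected by chance from the marginal category frequencies
    c1, c2 = Counter(first), Counter(second)
    expected = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / n ** 2
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)


# Hypothetical diagnoses (1 = disorder present) at test and retest for 10 youths
test_dx = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
retest_dx = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
kappa = cohens_kappa(test_dx, retest_dx)
```

In this made-up example 80% raw agreement yields a kappa of only about 0.52, which illustrates why kappa can look moderate even when most individual classifications agree, particularly for low-prevalence disorders.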
Quick screening tool for patients with severe negative emotional reactions to chronic illness: psychometric study of the negative emotions due to chronic illness screening test (NECIS).

Science.gov (United States)

Huang, Yun-Hsin; Wu, Chih-Hsun; Chen, Hsiu-Jung; Cheng, Yih-Ru; Hung, Fu-Chien; Leung, Kai-Kuan; Lue, Bee-Horng; Chen, Ching-Yu; Chiu, Tai-Yuan; Wu, Yin-Chang

2018-01-16

Severe negative emotional reactions to chronic illness are maladaptive to patients and they need to be addressed in a primary care setting. The psychometric properties of a quick screening tool, the Negative Emotions due to Chronic Illness Screening Test (NECIS), for general emotional problems among patients with chronic illness being treated in a primary care setting were investigated. Three studies including 375 patients with chronic illness were used to assess and analyze internal consistency, test-retest reliability, criterion-related validity, a cut-off point for distinguishing maladaptive emotions and clinical application validity of the NECIS. Self-report questionnaires were used. Internal consistency (Cronbach's α) ranged from 0.78 to 0.82, and the test-retest reliability was 0.71 (P [...] analysis reference, the receiver-operating characteristic curve analysis revealed an area under the curve of 0.81 and 0.82 (ps [...] emotions, with a sensitivity and specificity of 83.3 and 69.0%, and 68.5 and 83.0%, respectively. The clinical application validity analysis revealed that the low NECIS group showed significantly better adaptation to chronic illness on the scales of subjective health, general satisfaction with life, self-efficacy of self-care for disease, illness perception and stressors in everyday life. The NECIS has satisfactory psychometric properties for use in the primary care setting. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
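The record above quotes sensitivity and specificity at a screening cut-off. As a sketch of what those two figures mean, the snippet below classifies a score at or above the cut-off as screen-positive and compares it with a reference classification. The direction of the cut-off, the function name and all data are assumptions made for illustration and are not taken from the NECIS study.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Return (sensitivity, specificity), treating score >= cutoff as screen-positive."""
    tp = fn = tn = fp = 0
    for score, case in zip(scores, has_condition):
        positive = score >= cutoff
        if case and positive:
            tp += 1      # true positive: case correctly flagged
        elif case:
            fn += 1      # false negative: case missed
        elif positive:
            fp += 1      # false positive: non-case flagged
        else:
            tn += 1      # true negative: non-case correctly passed
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case and one non-case")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical screening scores and reference classifications
scores = [2, 5, 7, 8, 3, 6, 1, 9]
is_case = [False, True, True, True, False, False, False, True]
sens, spec = sensitivity_specificity(scores, is_case, cutoff=6)
```

Lowering the cut-off generally raises sensitivity at the cost of specificity, which is the trade-off a receiver-operating characteristic curve traces across all possible cut-offs.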
Retesting of liquefaction and nonliquefaction case histories from the 1976 Tangshan earthquake

Science.gov (United States)

Moss, R.E.S.; Kayen, R.E.; Tong, L.-Y.; Liu, S.-Y.; Cai, G.-J.; Wu, J.

2011-01-01

A field investigation was performed to retest liquefaction and nonliquefaction sites from the 1976 Tangshan earthquake in China. These sites were carefully investigated in 1978 and 1979 by using standard penetration test (SPT) and cone penetration test (CPT) equipment; however, the CPT measurements are obsolete because of the now nonstandard cone that was used at the time. In 2007, a modern cone was mobilized to retest 18 selected sites that are particularly important because of the intense ground shaking they sustained despite their high fines content and/or because the site did not liquefy. Of the sites reinvestigated and carefully reprocessed, 13 were considered accurate representative case histories. Two of the sites that were originally investigated for liquefaction have been reinvestigated for cyclic failure of fine-grained soil and removed from consideration for liquefaction triggering. The most important outcome of these field investigations was the collection of more accurate data for three nonliquefaction sites that experienced intense ground shaking. Data for these three case histories is now included in an area of the liquefaction triggering database that was poorly populated and will help constrain the upper bound of future liquefaction triggering curves. © 2011 American Society of Civil Engineers.
Reliability and Validity of the Beijing Version of the Montreal Cognitive Assessment in the Evaluation of Cognitive Function of Adult Patients with OSAHS.

Science.gov (United States)

Chen, Xiong; Zhang, Rui; Xiao, Ying; Dong, Jiaqi; Niu, Xun; Kong, Weijia

2015-01-01

Patients with obstructive sleep apnea hypopnea syndrome (OSAHS) tend to develop cognitive deficits, which usually go unrecognized and can affect their daily life. The Beijing version of the Montreal cognitive assessment (MoCA-BJ), a Chinese version of MoCA, has been used for the assessment of cognitive functions of OSAHS patients in clinical practice. So far, its reliability and validity have not been tested. This study examined the reliability and validity of MoCA-BJ in a cohort of adult OSAHS patients. A total of 152 OSAHS patients, ranging from mild to moderate to severe, 49 primary snoring subjects and 40 normal controls were evaluated for cognitive functions by employing both MoCA-BJ and the Mini Mental State Examination (MMSE). Forty of them were re-tested by MoCA-BJ 14 days after the first test. Internal consistency, test-retest reliability, and discriminant and concurrent validity of MoCA-BJ were analyzed. Internal consistency reliability by Cronbach's alpha was adequate (0.73). The intra-class correlation coefficient (ICC), a measure of test-retest reliability, was 0.87 (P [...] reliable and stable. The MoCA-BJ was capable of detecting cognitive dysfunction by visuospatial and total MoCA-BJ score.
Development of a Saudi Food Frequency Questionnaire and testing its reliability and validity.

Science.gov (United States)

Gosadi, Ibrahim M; Alatar, Abdullah A; Otayf, Mojahed M; AlJahani, Dhaherah M; Ghabbani, Hisham M; AlRajban, Waleed A; Alrsheed, Abdullah M; Al-Nasser, Khalid A

2017-06-01

To create a food frequency questionnaire specifically designed to capture the dietary habits of Saudis and test its validity and reliability. Methods: This investigation is a longitudinal, test-retest study conducted in King Saud University, Riyadh, Kingdom of Saudi Arabia between December 2015 and March 2016. A list of 140 food items was included in the questionnaire where a closed-ended and open-ended approach was used. Regarding past year food frequency consumption and 24-hour dietary recall, body weight and height were collected. Internal consistency, test-retest reliability, completeness of the food list, and criterion validity were assessed. Results: One hundred and thirty-eight participants were interviewed to complete the 24-hour dietary recall and the constructed questionnaire. Approximately 85% of the food items reported in the dietary recall were covered in the food frequency questionnaire. The association of body mass index with meats (regression coefficient: 2.28) and dairy products consumption frequency (regression coefficient: 2.31) was statistically significant. A high overall reproducibility rate of the questionnaire was detected (Pearson's correlation coefficient: 0.78, p < 0.001). Conclusion: The developed questionnaire has high reliability and reasonable validity, and is suitable for use in nutritional epidemiological investigations in Saudi Arabia.
Cross-cultural adaptation of the Neck Disability Index and Copenhagen Neck Functional Disability Scale for patients with neck pain due to degenerative and discopathic disorders. Psychometric properties of the Polish versions

Directory of Open Access Journals (Sweden)

Glowacki Maciej

2011-04-01

Abstract Background Even though there are several region-specific functional outcome questionnaires measuring neck disorders that have been developed in English-speaking countries, no Polish version has ever been validated. The purpose of our study was to translate, culturally adapt and validate the Neck Disability Index (NDI) and Copenhagen Neck Functional Disability Scale (CDS) for Polish-speaking patients with neck pain. Methods The translation was carried out according to the International Quality of Life Association (IQOLA) Project. Sixty patients who were treated for degenerative and discopathic disorders in the cervical spine filled out the NDI-PL and the CDS-PL. The pain level was evaluated using the Visual Analog Scale. The mean age of the assessed group was 47.1 years (SD 8.9). We used Cronbach's alpha to assess internal consistency. We assessed the test-retest reliability using the Intraclass Correlation Coefficients (ICCs). The Spearman's rank correlation coefficient (rS) was used to determine dependency between quantitative characteristics. The Mann-Whitney test was applied to determine dependency between quantitative and qualitative characteristics. Results The Cronbach's alpha values were excellent for the NDI-PL in the test and in the retest (0.84 and 0.85, respectively) and for the CDS-PL (0.90 in the test and in the retest). Intraclass Correlation Coefficients were excellent for the CDS-PL and NDI-PL and equalled 0.93 (95% CI from 0.89 to 0.95) and 0.87 (95% CI from 0.80 to 0.92), respectively. The concurrent validity was good in the test and in the retest (rs = 0.42 p Conclusions The present versions of the NDI-PL and CDS-PL, the first to be published in Polish, have proven to be reliable and valid for patients with degenerative changes in the cervical spine. The NDI-PL and CDS-PL have excellent internal consistency and test-retest reliability, and good concurrent validity. The adapted questionnaires showed a strong inter-correlation both
Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

Directory of Open Access Journals (Sweden)

Penny Moss

Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot). Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%). Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56) years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%). Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add
Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

Science.gov (United States)

Moss, Penny; Whitnell, Jasmine; Wright, Anthony

2016-01-01

Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot). Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%). Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56) years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%). Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add validity and
Internal Consistency of Reliability Assessment of the Persian version of the ‘Home Falls and Accident Screening Tool’

Directory of Open Access Journals (Sweden)

Afsoon Hassani Mehraban

2013-10-01

Objectives: Falling is a common problem among the elderly. Falling indoors and outdoors is highly prevalent among the Iranian elderly. Therefore, identification of the contributing factors at home and their modification can reduce falls and subsequent injuries in the elderly. The goal of this study was to identify the elderly at risk of falling, using the ‘Home Falls and Accident Screening Tool’ (HOME FAST), and to determine the reliability of this tool. Methods: Sixty elderly people were selected from five geographical regions of Tehran through the Local Town Councils. Participants were aged 60 to 65 years, and HOME FAST was used to assess inter-rater and test-retest reliability. Results: Test-retest reliability in the study showed that agreement between the items of the Persian version of HOME FAST was over 0.8, which indicates very good reliability. The agreement between the domains was 0.65-1.00, indicative of moderate to high reliability. Moreover, the inter-rater reliability of the items was over 0.8, which is also very good. The correlation of each item between the domains was 0.01-1.00, which shows poor to high reliability. Discussion: This study showed that the reliability of the Persian version of HOME FAST is high. This tool can therefore be used as an appropriate screening tool by professionals to take necessary preventive measures for the Iranian elderly population.
Polish Adult Reading Test (PART) - construction of Polish test for estimating the level of premorbid intelligence in schizophrenia.

Science.gov (United States)

Karakuła-Juchnowicz, Hanna; Stecka, Mariola

2017-08-29

In view of the unavailability in Poland of standardized methods to measure PIQ, the aim of the work was to develop a Polish test to assess the premorbid level of intelligence, PART (Polish Adult Reading Test), and to measure its psychometric properties, such as validity and reliability, as well as its standardization in a group of schizophrenia patients. The principles of PART construction were based on the idea of the National Adult Reading Test by Hazel Nelson, which is popular worldwide. The research comprised a group of 122 subjects (65 schizophrenia patients and 57 healthy people), aged 18-60 years, matched for age and gender. PART appears to be a method with high internal consistency, high test-retest and inter-rater reliability, and acceptable diagnostic and prognostic validity. The standardized procedures of PART have been investigated and described. Considering the psychometric values of PART and the short time needed to perform it, the test may be a useful diagnostic instrument in the assessment of the premorbid level of intelligence in a group of schizophrenic patients.
Reliability of two social cognition tests: The combined stories test and the social knowledge test.

Science.gov (United States)

Thibaudeau, Élisabeth; Cellard, Caroline; Legendre, Maxime; Villeneuve, Karèle; Achim, Amélie M

2018-04-01

Deficits in social cognition are common in psychiatric disorders. Validated social cognition measures with good psychometric properties are necessary to assess and target social cognitive deficits. Two recent social cognition tests, the Combined Stories Test (COST) and the Social Knowledge Test (SKT), respectively assess theory of mind and social knowledge. Previous studies have shown good psychometric properties for these tests, but the test-retest reliability has never been documented. The aim of this study was to evaluate the test-retest reliability and the inter-rater reliability of the COST and the SKT. The COST and the SKT were administered twice to a group of forty-two healthy adults, with a delay of approximately four weeks between the assessments. Excellent test-retest reliability was observed for the COST, and a good test-retest reliability was observed for the SKT. There was no evidence of practice effect. Furthermore, an excellent inter-rater reliability was observed for both tests. This study shows a good reliability of the COST and the SKT that adds to the good validity previously reported for these two tests. These good psychometrics properties thus support that the COST and the SKT are adequate measures for the assessment of social cognition. Copyright © 2018. Published by Elsevier B.V.
Validation of the Turkish Version of the Cognitive Test Anxiety Scale–Revised

Directory of Open Access Journals (Sweden)

Sati Bozkurt

2017-01-01

The current study explored the psychometric properties of the newly designed Turkish version of the Cognitive Test Anxiety Scale–Revised (CTAR). Results of an exploratory factor analysis revealed a unidimensional structure consistent with the conceptualized nature of cognitive test anxiety and previous examinations of the English version of the CTAR. Examination of the factor loadings revealed two items that were weakly related to the test anxiety construct and as such were prime candidates for removal. Confirmatory factor analyses were conducted to compare model fit for the 25- and 23-item versions of the measure. Results indicated that the 23-item version of the measure provided a better fit to the data, which supports the removal of the problematic items in the Turkish version of the CTAR. Additional analyses demonstrated the internal consistency, test–retest reliability, concurrent validity, and gender equivalence for responses offered on the Turkish version of the measure. Results of the analysis revealed that a 23-item Turkish version of the T-CTAR is a valid and reliable measure of cognitive test anxiety for use among Turkish students.
Standardization of Brief Inventory of Social Support Exchange Network (BISSEN) in Japan.

Science.gov (United States)

Aiba, Miyuki; Tachikawa, Hirokazu; Fukuoka, Yoshiharu; Lebowitz, Adam; Shiratori, Yuki; Doi, Nagafumi; Matsui, Yutaka

2017-07-01

This study describes the Brief Inventory of Social Support Exchange Network (BISSEN) as a standardized brief inventory measuring various aspects of social support. We confirmed the reliability and validity for function and direction of support and standardized the BISSEN. For Sample 1, a stratified random sampling method was used to select 5200 residents in Japan. We conducted mail surveys and responses were retrieved from 2274 participants (collection rate 43.7%). Participants completed a questionnaire packet that included BISSEN, suicidal ideation, depression, support seeking, and Multidimensional Scale of Perceived Social Support (MSPSS). Sample 2 surveys for test-retest reliability were conducted on 23 residents at approximately two-week intervals. Participants were asked about gender, age, and BISSEN. First, we assessed the internal consistency, test-retest reliability, construct, convergent, and concurrent validity. McDonald's omega (.73-.92) and test-retest correlations (.78-.85) demonstrated adequate internal consistency and test-retest reliability. Depression, support seeking, and MSPSS were significantly correlated with all scores of BISSEN. The non-suicidal ideation group had significantly more support compared to the suicidal ideation group. Therefore, function and direction of support in BISSEN had sufficient reliability and validity. Next, we standardized BISSEN using Z-scores and percentile rank with respect to each 12 norm groups by age and gender. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
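The abstract above mentions standardizing scores with Z-scores and percentile ranks within norm groups, but does not give the exact procedure. The following is a minimal sketch under stated assumptions: the population standard deviation is used for the Z-score, and ties count as half for the percentile rank. The function names and the sample norm group are hypothetical.

```python
from statistics import mean, pstdev


def z_score(score, norm_scores):
    """Standardize a raw score against one norm group.

    Assumption: population standard deviation; a study may use the
    sample standard deviation instead.
    """
    return (score - mean(norm_scores)) / pstdev(norm_scores)


def percentile_rank(score, norm_scores):
    """Percentage of the norm group scoring below, counting ties as half."""
    below = sum(s < score for s in norm_scores)
    equal = sum(s == score for s in norm_scores)
    return 100 * (below + 0.5 * equal) / len(norm_scores)


# Hypothetical raw scores for a single age-by-gender norm group.
norm_group = [2, 4, 4, 4, 5, 5, 7, 9]
print(z_score(9, norm_group), percentile_rank(5, norm_group))
```

In practice each of the norm groups would have its own reference scores, so the same raw score can map to different standardized values depending on age and gender.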
Test-retest reproducibility of dopamine D2/3 receptor binding in human brain measured by PET with [11C]MNPA and [11C]raclopride

Energy Technology Data Exchange (ETDEWEB)

Kodaka, Fumitoshi [National Institute of Radiological Sciences, Molecular Neuroimaging Program, Molecular Imaging Center, Chiba (Japan); Jikei University School of Medicine, Department of Psychiatry, Tokyo (Japan); Ito, Hiroshi [National Institute of Radiological Sciences, Molecular Neuroimaging Program, Molecular Imaging Center, Chiba (Japan); National Institute of Radiological Sciences, Biophysics Program, Molecular Imaging Center, Chiba (Japan); Kimura, Yasuyuki; Fujie, Saori; Takano, Harumasa; Fujiwara, Hironobu; Sasaki, Takeshi; Suhara, Tetsuya [National Institute of Radiological Sciences, Molecular Neuroimaging Program, Molecular Imaging Center, Chiba (Japan); Nakayama, Kazuhiko [Jikei University School of Medicine, Department of Psychiatry, Tokyo (Japan); Halldin, Christer; Farde, Lars [Karolinska Institutet, Department of Clinical Neuroscience, Stockholm (Sweden)

2013-04-15

Dopamine D2/3 receptors (D2/3Rs) have two affinity states for endogenous dopamine, referred to as high-affinity state (D2/3^HIGH), which has a high affinity for endogenous dopamine, and low-affinity state (D2/3^LOW). The density of D2/3^HIGH can be measured with (R)-2-11CH3O-N-n-propylnorapomorphine ([11C]MNPA), while total density of D2/3^HIGH and D2/3^LOW (D2/3Rs) can be measured with [11C]raclopride using positron emission tomography (PET). Thus, the ratio of the binding potential (BP) of [11C]MNPA to that of [11C]raclopride ([11C]MNPA/[11C]raclopride) may reflect the proportion of the density of D2/3^HIGH to that of D2/3Rs. In the caudate and putamen, [11C]MNPA/[11C]raclopride reflects the proportion of the density of D2^HIGH to that of D2Rs. To evaluate the reliability of the PET paradigm with [11C]MNPA and [11C]raclopride, we investigated the test-retest reproducibility of non-displaceable BP (BP_ND) measured with [11C]MNPA and of [11C]MNPA/[11C]raclopride in healthy humans. Eleven healthy male volunteers underwent two sets of PET studies on separate days that each included [11C]MNPA and [11C]raclopride scans. BP_ND values in the caudate and putamen were calculated. Test-retest reproducibility of BP_ND of [11C]MNPA and [11C]MNPA/[11C]raclopride was assessed by intra-subject variability (absolute variability) and test-retest reliability (intraclass correlation coefficient: ICC). The absolute variability of [11C]MNPA BP_ND was 5.30 ± 3.96 % and 12.3 ± 7.95 % and the ICC values of [11C]MNPA BP_ND were 0.72 and 0.82 in the caudate and putamen, respectively. The absolute variability of [11C]MNPA/[11C]raclopride was 6.11 ± 3.68 % and 11.60 ± 5.70 % and the ICC values of [
Internal consistency & validity of Indian Disability Evaluation and Assessment Scale (IDEAS) in patients with schizophrenia

Directory of Open Access Journals (Sweden)

Sandeep Grover

2014-01-01

Background & objectives: The Indian Disability Evaluation and Assessment Scale (IDEAS) has been recommended for assessment and certification of disability by the Government of India (GOI). However, the psychometric properties of IDEAS as adopted by GOI remain understudied. Our aim, thus, was to study the internal consistency and validity of IDEAS in patients with schizophrenia. Methods: A total of 103 consenting patients with residual schizophrenia were assessed for disability, quality of life (QOL) and psychopathology using the IDEAS, WHO QOL-100 and Positive and Negative symptom scale (PANSS), respectively. Internal consistency was calculated using Cronbach's alpha. For construct validity, relations between IDEAS, and psychopathology and QOL were studied. Results: The inter-item correlations for IDEAS were significant with a Cronbach's alpha of 0.721. All item scores other than score on communication and understanding; total and global IDEAS scores correlated significantly with the positive, negative and general sub-scales, and total PANSS scores. Communication and understanding was significantly related to negative sub-scale score only. Total and global disability scores correlated negatively with all the domains of WHOQOL-100 (ρ<0.01). The individual IDEAS item scores correlated negatively with various WHOQOL-100 domains (ρ<0.01). Interpretation & conclusions: The study findings showed that the GOI-modified IDEAS had good internal consistency and construct validity as tested in patients with residual schizophrenia. Similar studies need to be done with other groups of patients.
Development and evaluation of the McKnight Risk Factor Survey for assessing potential risk and protective factors for disordered eating in preadolescent and adolescent girls.

Science.gov (United States)

Shisslak, C M; Renger, R; Sharpe, T; Crago, M; McKnight, K M; Gray, N; Bryson, S; Estes, L S; Parnaby, O G; Killen, J; Taylor, C B

1999-03-01

To describe the development, test-retest reliability, internal consistency, and convergent validity of the McKnight Risk Factor Survey-III (MRFS-III). The MRFS-III was designed to assess a number of potential risk and protective factors for the development of disordered eating in preadolescent and adolescent girls. Several versions of the MRFS were pilot tested before the MRFS-III was administered to a sample of 651 4th- through 12th-grade girls to establish its psychometric properties. Most of the test-retest reliability coefficients of individual items on the MRFS-III were r > .40. Alpha coefficients for each risk and protective factor domain on the MRFS-III were also computed. The majority of these coefficients were r > .60. High convergent validity coefficients were obtained for specific items on the MRFS-III and measures of self-esteem (Rosenberg Self-Esteem Scale) and weight concerns (Weight Concerns Scale). The test-retest reliability, internal consistency, and convergent validity of the MRFS-III suggest that it is a useful new instrument to assess potential risk and protective factors for the development of disordered eating in preadolescent and adolescent girls.
Quality of prenatal care questionnaire: instrument development and testing.

Science.gov (United States)

Heaman, Maureen I; Sword, Wendy A; Akhtar-Danesh, Noori; Bradford, Amanda; Tough, Suzanne; Janssen, Patricia A; Young, David C; Kingston, Dawn A; Hutton, Eileen K; Helewa, Michael E

2014-06-03

Utilization indices exist to measure quantity of prenatal care, but currently there is no published instrument to assess quality of prenatal care. The purpose of this study was to develop and test a new instrument, the Quality of Prenatal Care Questionnaire (QPCQ). Data for this instrument development study were collected in five Canadian cities. Items for the QPCQ were generated through interviews with 40 pregnant women and 40 health care providers and a review of prenatal care guidelines, followed by assessment of content validity and rating of importance of items. The preliminary 100-item QPCQ was administered to 422 postpartum women to conduct item reduction using exploratory factor analysis. The final 46-item version of the QPCQ was then administered to another 422 postpartum women to establish its construct validity, and internal consistency and test-retest reliability. Exploratory factor analysis reduced the QPCQ to 46 items, factored into 6 subscales, which subsequently were validated by confirmatory factor analysis. Construct validity was also demonstrated using a hypothesis testing approach; there was a significant positive association between women's ratings of the quality of prenatal care and their satisfaction with care (r = 0.81). Convergent validity was demonstrated by a significant positive correlation (r = 0.63) between the "Support and Respect" subscale of the QPCQ and the "Respectfulness/Emotional Support" subscale of the Prenatal Interpersonal Processes of Care instrument. The overall QPCQ had acceptable internal consistency reliability (Cronbach's alpha = 0.96), as did each of the subscales. The test-retest reliability result (Intra-class correlation coefficient = 0.88) indicated stability of the instrument on repeat administration approximately one week later. Temporal stability testing confirmed that women's ratings of their quality of prenatal care did not change as a result of giving birth or between the early postpartum
Quality of prenatal care questionnaire: instrument development and testing

Science.gov (United States)

2014-01-01

Background Utilization indices exist to measure quantity of prenatal care, but currently there is no published instrument to assess quality of prenatal care. The purpose of this study was to develop and test a new instrument, the Quality of Prenatal Care Questionnaire (QPCQ). Methods Data for this instrument development study were collected in five Canadian cities. Items for the QPCQ were generated through interviews with 40 pregnant women and 40 health care providers and a review of prenatal care guidelines, followed by assessment of content validity and rating of importance of items. The preliminary 100-item QPCQ was administered to 422 postpartum women to conduct item reduction using exploratory factor analysis. The final 46-item version of the QPCQ was then administered to another 422 postpartum women to establish its construct validity, and internal consistency and test-retest reliability. Results Exploratory factor analysis reduced the QPCQ to 46 items, factored into 6 subscales, which subsequently were validated by confirmatory factor analysis. Construct validity was also demonstrated using a hypothesis testing approach; there was a significant positive association between women’s ratings of the quality of prenatal care and their satisfaction with care (r = 0.81). Convergent validity was demonstrated by a significant positive correlation (r = 0.63) between the “Support and Respect” subscale of the QPCQ and the “Respectfulness/Emotional Support” subscale of the Prenatal Interpersonal Processes of Care instrument. The overall QPCQ had acceptable internal consistency reliability (Cronbach’s alpha = 0.96), as did each of the subscales. The test-retest reliability result (Intra-class correlation coefficient = 0.88) indicated stability of the instrument on repeat administration approximately one week later. Temporal stability testing confirmed that women’s ratings of their quality of prenatal care did not change as a result of giving
Validity and internal consistency of a whiplash-specific disability measure

NARCIS (Netherlands)

Pinfold, Melanie; Niere, Ken R.; O'Leary, Elizabeth F.; Hoving, Jan Lucas; Green, Sally; Buchbinder, Rachelle

2004-01-01

STUDY DESIGN: Cross-sectional study of patients with whiplash-associated disorders investigating the internal consistency, factor structure, response rates, and presence of floor and ceiling effects of the Whiplash Disability Questionnaire (WDQ). OBJECTIVES: The aim of this study was to confirm the
Reliability of the detailed assessment of speed of handwriting on Flemish children.

Science.gov (United States)

Simons, Johan; Probst, Michel

2014-01-01

This study evaluates the reliability of the Detailed Assessment of Speed of Handwriting (DASH) in a Dutch-speaking sample of children. The sample included 650 boys and 513 girls (age range = 9-16 years). Handwriting speed measurements were obtained using the DASH. Interrater agreement, test-retest reliability, and internal consistency were calculated; gender and age effects were analyzed. Interrater agreement shows excellent reliability with intraclass correlation coefficients of at least 0.94. Test-retest correlations ranged from r = 0.65 to r = 0.81. The internal consistency measures, calculated with Cronbach's alpha, were between 0.88 and 0.94. Both gender and age have a significant effect on handwriting speed, with F (7.1144) = 17.43 (P handwriting speed of Dutch-speaking children. There is a tendency of girls to write faster than boys.

Can health workers reliably assess their own work? A test-retest study of bias among data collectors conducting a Lot Quality Assurance Sampling survey in Uganda.

Science.gov (United States)

Beckworth, Colin A; Davis, Rosemary H; Faragher, Brian; Valadez, Joseph J

2015-03-01

Lot Quality Assurance Sampling (LQAS) is a classification method that enables local health staff to assess health programmes for which they are responsible. While LQAS has been favourably reviewed by the World Bank and World Health Organization (WHO), questions remain about whether using local health staff as data collectors can lead to biased data. In this test-retest research, Pallisa Health District in Uganda is subdivided into four administrative units called supervision areas (SA). Data collectors from each SA conducted an LQAS survey. A week later, the data collectors were swapped to a different SA, outside their area of responsibility, to repeat the LQAS survey with the same respondents. The two data sets were analysed for agreement using Cohen's kappa coefficient and disagreements were analysed. Kappa values ranged from 0.19 to 0.97. On average, there was a moderate degree of agreement for knowledge indicators and a substantial level for practice indicators. Respondents were found to be systematically more knowledgeable on retest, indicating bias favouring the retest, although no evidence of bias was found for practices indicators. In this initial study, using local health care providers to collect data did not bias data collection. The bias observed in the knowledge indicators is most likely due to the 'practice effect', whereby respondents increased their knowledge as a result of completing the first survey, as no corresponding effect was seen in the practices indicators. Published by Oxford University Press in association with The London School of Hygiene and Tropical Medicine © The Author 2014; all rights reserved.
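The agreement statistic named in the abstract above, Cohen's kappa, compares observed agreement with the agreement expected by chance. A minimal sketch for two sets of categorical responses from the same respondents follows; the function name `cohens_kappa` and the sample answers are hypothetical and do not come from the survey.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's kappa for two equally long lists of categorical ratings.

    kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement
    and p_e is the agreement expected from the marginal frequencies.
    Undefined (division by zero) when p_e equals 1.
    """
    n = len(first)
    p_o = sum(a == b for a, b in zip(first, second)) / n
    counts_first, counts_second = Counter(first), Counter(second)
    categories = counts_first.keys() | counts_second.keys()
    p_e = sum(counts_first[c] * counts_second[c] for c in categories) / (n * n)
    return (p_o - p_e) / (1 - p_e)


# Hypothetical answers to one indicator in the first and the repeat survey.
survey_1 = ["yes", "yes", "no", "no"]
survey_2 = ["yes", "no", "no", "no"]
print(cohens_kappa(survey_1, survey_2))
```

Kappa is computed per indicator here, which matches the abstract's report of a range of kappa values rather than a single pooled figure.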
Equivalence of Laptop and Tablet Administrations of the Minnesota Multiphasic Personality Inventory-2 Restructured Form.

Science.gov (United States)

Menton, William H; Crighton, Adam H; Tarescavage, Anthony M; Marek, Ryan J; Hicks, Adam D; Ben-Porath, Yossef S

2017-06-01

The present study investigated the comparability of laptop computer- and tablet-based administration modes for the Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF). Employing a counterbalanced within-subjects design, the MMPI-2-RF was administered via both modes to a sample of college undergraduates (N = 133). Administration modes were compared in terms of mean scale scores, internal consistency, test-retest consistency, external validity, and administration time. Mean scores were generally similar, and scores produced via both methods appeared approximately equal in terms of internal consistency and test-retest consistency. Scores from the two modalities also evidenced highly similar patterns of associations with external criteria. Notably, tablet administration of the MMPI-2-RF was substantially longer than laptop administration in the present study (mean difference 7.2 minutes, Cohen's d = .95). Overall, results suggest that varying administration mode between laptop and tablet has a negligible influence on MMPI-2-RF scores, providing evidence that these modes of administration can be considered psychometrically equivalent.
Internal consistency of a Spanish translation of the Francis Scale of Attitude Toward Christianity Short Form.

Science.gov (United States)

Campo-Arias, Adalberto; Oviedo, Heidi Celina; Díaz, Carmen Elena; Cogollo, Zuleima

2006-12-01

This study evaluated the internal consistency of a Spanish version of the short form of the Francis Scale of Attitude Toward Christianity based on responses of 405 Colombian adolescent students ages 13 to 17 years. This translated short-form version of the scale had an internal consistency of .80. This estimate indicates suitable internal consistency reliability for research use in this population.
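Internal consistency figures such as the .80 reported above are commonly Cronbach's alpha, although this abstract does not name the coefficient. As a minimal sketch under that assumption, alpha can be computed from item-level responses as follows. The function name `cronbach_alpha` and the sample responses are hypothetical.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha from rows of item scores (one row per respondent).

    alpha = k / (k - 1) * (1 - sum of item variances / variance of totals),
    using sample variances throughout.
    """
    k = len(responses[0])    # number of items in the scale
    item_variances = [
        variance([row[j] for row in responses]) for j in range(k)
    ]
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses of three people to a two-item scale.
consistent = [[1, 1], [2, 2], [3, 3]]
mixed = [[1, 2], [2, 1], [3, 3]]
print(cronbach_alpha(consistent), cronbach_alpha(mixed))
```

Alpha rises as items covary more strongly, so two items that move together perfectly give 1.0, while partly inconsistent items give a lower value.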
[New questionnaire to assess self-efficacy toward physical activity in children].

Science.gov (United States)

Aedo, Angeles; Avila, Héctor

2009-10-01

To design a questionnaire for assessment of self-efficacy toward physical activity in school children, as well as to measure its construct validity, test-retest reliability, and internal consistency. A four-stage multimethod approach was used: (1) bibliographic research followed by exploratory study and the formulation of questions and responses based on a dichotomous scale of 14 items; (2) validation of the content by a panel of experts; (3) application of the preliminary version of the questionnaire to a sample of 900 school-aged children in Mexico City; and (4) determination of the construct validity, test-retest reliability, and internal consistency (Cronbach's alpha). Three factors were identified that explain 64.15% of the variance: the search for positive alternatives to physical activity, ability to deal with possible barriers to exercising, and expectations of skill or competence. The model was validated using the goodness of fit, and the result of 65% less than 0.05 indicated that the estimated factor model fit the data. Cronbach's consistency alpha was 0.733; test-retest reliability was 0.867. The scale designed has adequate reliability and validity. These results are a good indicator of self-efficacy toward physical activity in school children, which is important when developing programs intended to promote such behavior in this age group.
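
As an illustration of the internal-consistency statistic reported in this record, the following is a minimal sketch of Cronbach's alpha computed from a respondents-by-items score matrix. The function name and the demo responses are hypothetical and are not taken from the study; with dichotomous (0/1) items such as those described above, the same computation is closely related to the KR-20 coefficient.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    item_variances = [variance(column) for column in zip(*scores)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical yes/no (1/0) answers from six respondents to four items.
demo = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 1, 0, 0],
    [0, 0, 1, 0],
]
alpha_demo = cronbach_alpha(demo)
```

Values closer to 1 indicate that the items vary together; what counts as "adequate" is a convention that differs between the records in this list.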
Test-retest reproducibility of the metabotropic glutamate receptor 5 ligand [18F]FPEB with bolus plus constant infusion in humans

International Nuclear Information System (INIS)

Park, Eunkyung; Sullivan, Jenna M.; Planeta, Beata; Gallezot, Jean-Dominique; Lim, Keunpoong; Lin, Shu-Fei; Ropchan, Jim; Huang, Yiyun; Carson, Richard E.; McCarthy, Timothy J.; Ding, Yu-Shin; Morris, Evan D.; Williams, Wendol A.

2015-01-01

[18F]FPEB is a promising PET radioligand for the metabotropic glutamate receptor 5 (mGluR5), a potential target for the treatment of neuropsychiatric diseases. The purpose of this study was to evaluate the test-retest reproducibility of [18F]FPEB in the human brain. Seven healthy male subjects were scanned twice, 3-11 weeks apart. Dynamic data were acquired using bolus plus infusion of 162 ± 32 MBq [18F]FPEB. Four methods were used to estimate volume of distribution (V_T): equilibrium analysis (EQ) using arterial (EQ_A) or venous input data (EQ_V), MA1, and a two-tissue compartment model (2T). Binding potential (BP_ND) was also estimated using cerebellar white matter (CWM) or gray matter (CGM) as the reference region using EQ, 2T and MA1. Absolute test-retest variability (aTRV) of V_T and BP_ND were calculated for each method. Venous blood measurements (C_V) were compared with arterial input (C_A) to examine their usability in EQ analysis. Regional V_T estimated by the four methods displayed a high degree of agreement (r² ranging from 0.83 to 0.99 among the methods), although EQ_A and EQ_V overestimated V_T by a mean of 9% and 7%, respectively, compared to 2T. Mean values of aTRV of V_T were 11% by EQ_A, 12% by EQ_V, 14% by MA1 and 14% by 2T. Regional BP_ND also agreed well among the methods and mean aTRV of BP_ND was 8-12% (CWM) and 7-9% (CGM). Venous and arterial blood concentrations of [18F]FPEB were well matched during equilibrium (C_V = 1.01 × C_A, r² = 0.95). [18F]FPEB binding shows good TRV with minor differences among analysis methods. Venous blood can be used as an alternative for input function measurement instead of arterial blood in EQ analysis. Thus, [18F]FPEB is an excellent PET imaging tracer for mGluR5 in humans. (orig.)
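
The abstract does not define how absolute test-retest variability was computed. A minimal sketch under one commonly used definition, assumed here rather than confirmed by the source, takes the absolute scan-to-scan difference relative to the mean of the two scans and averages it over subjects. The function name is hypothetical.

```python
def abs_test_retest_variability(test, retest):
    """Mean absolute test-retest variability in percent.

    Assumed definition: for each subject, |test - retest| divided by the
    mean of the two measurements; the result is averaged across subjects.
    """
    if not test or len(test) != len(retest):
        raise ValueError("test and retest must be non-empty and equally long")
    per_subject = [
        abs(t - r) / ((t + r) / 2) * 100.0
        for t, r in zip(test, retest)
    ]
    return sum(per_subject) / len(per_subject)
```

For example, a regional value of 11 on the first scan and 9 on the second differs by 2 around a mean of 10, giving 20% for that subject under this definition.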
Reliability of a science admission test (HAM-Nat) at Hamburg medical school.

Science.gov (United States)

Hissbach, Johanna; Klusmann, Dietrich; Hampe, Wolfgang

2011-01-01

The University Hospital in Hamburg (UKE) started to develop a test of knowledge in natural sciences for admission to medical school in 2005 (Hamburger Auswahlverfahren für Medizinische Studiengänge, Naturwissenschaftsteil, HAM-Nat). This study is a step towards establishing the HAM-Nat. We are investigating parallel forms reliability, the effect of a crash course in chemistry on test results, and correlations of HAM-Nat test results with a test of scientific reasoning (similar to a subtest of the "Test for Medical Studies", TMS). 316 first-year students participated in the study in 2007. They completed different versions of the HAM-Nat test which consisted of items that had already been used (HN2006) and new items (HN2007). Four weeks later half of the participants were tested on the HN2007 version of the HAM-Nat again, while the other half completed the test of scientific reasoning. Within this four-week interval students were offered a five-day chemistry course. Parallel forms reliability for four different test versions ranged from r(tt)=.53 to r(tt)=.67. The retest reliabilities of the HN2007 halves were r(tt)=.54 and r(tt)=.61. Correlations of the two HAM-Nat versions with the test of scientific reasoning were r=.34 and r=.21. The crash course in chemistry had no effect on HAM-Nat scores. The results suggest that further versions of the test of natural sciences will not easily conform to the standards of internal consistency, parallel-forms reliability and retest reliability. Much care has to be taken in order to assemble items which could be used interchangeably for the construction of new test versions. The test of scientific reasoning and the HAM-Nat are tapping different constructs. Participation in a chemistry course did not improve students' achievement, probably because the content of the course was not coordinated with the test and many students lacked motivation to do well in the second test.
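
Parallel-forms and retest coefficients of the kind reported above are correlations between two sets of scores obtained from the same people. The sketch below assumes a plain Pearson product-moment correlation, which the abstract does not explicitly state; the function name is hypothetical.

```python
from math import sqrt


def pearson_r(first, second):
    """Pearson correlation between two equally long lists of scores."""
    n = len(first)
    if n < 2 or n != len(second):
        raise ValueError("need two equally long lists with at least 2 scores")
    mean_first = sum(first) / n
    mean_second = sum(second) / n
    # Sums of cross-products and squared deviations around the means.
    sxy = sum((a - mean_first) * (b - mean_second) for a, b in zip(first, second))
    sxx = sum((a - mean_first) ** 2 for a in first)
    syy = sum((b - mean_second) ** 2 for b in second)
    return sxy / sqrt(sxx * syy)
```

Passing each participant's score on the first administration as `first` and on the second administration (or the parallel form) as `second` yields the reliability estimate under this assumption.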
[Factor analysis and internal consistency of pedagogical practices questionnaire among health care teachers].

Science.gov (United States)

Pérez V, Cristhian; Vaccarezza G, Giulietta; Aguilar A, César; Coloma N, Katherine; Salgado F, Horacio; Baquedano R, Marjorie; Chavarría R, Carla; Bastías V, Nancy

2016-06-01

Teaching practice is one of the most complex topics of the training process in medicine and other health care careers. The Teaching Practices Questionnaire (TPQ) evaluates teaching skills. To assess the factor structure and internal consistency of the Spanish version of the TPQ among health care teachers. The TPQ was answered by 315 university teachers from 13 of the 15 administrative Chilean regions, who were selected through non-probabilistic volunteer sampling. The internal consistency of the TPQ factors was calculated and the correlation between them was analyzed. Six factors were identified: Student-centered teaching, Teaching planning, Assessment process, Dialogue relationship, Teacher-centered teaching and Use of technological resources. They had Cronbach alphas ranging from 0.60 to 0.85. The factorial structure of the TPQ differentiates the most important functions of teaching. It also shows theoretical consistency and practical relevance for diagnosis and continuous evaluation of teaching practices. Additionally, it has adequate internal consistency. Thus, the TPQ is valid and reliable for evaluating pedagogical practices in health care careers.
The development and psychometric testing of East Asian Acculturation Scale among Asian immigrant women in Taiwan.

Science.gov (United States)

Kuo, Shu-Fen; Chang, Wen-Yin; Chang, Lu-I; Chou, Yu-Hua; Chen, Ching-Min

2013-01-01

This is a report of the development and psychometric testing of the East Asian Acculturation Measure-Chinese version (EAAM-C) scale. An instrument validation design with a cross-sectional survey was used. The process was carried out in two phases. In Phase 1, Barry's East Asian Acculturation Measure was translated and back-translated to evaluate its content, face validity, and feasibility validity. In Phase 2, the 16-item EAAM-C was pilot-tested among 485 female immigrants for test-retest reliability, internal consistency, theoretically supported construct validity and concurrent validity. The pilot work and the survey results indicated that the tool possessed adequate content and face validity. Cronbach's alpha was 0.72 for the EAAM-C and 0.76-0.79 for its subscales, and the test-retest reliability correlation (at 3 weeks) was 0.75. After dropping one item, four theoretically supported factors, which explained 61.82% of the variance, were extracted using exploratory factor analysis: assimilation, integration, separation, and marginalization. Based on the underlying four-factor theoretical structure of the EAAM, the EAAM-C was further examined with confirmatory factor analysis. The analysis revealed that the four-factor model was an acceptable fit for the data, which supported its construct validity. These factors were inter-correlated, and showed statistically significant correlation with the Chinese Health Questionnaire, indicating adequate concurrent validity. The scale shows acceptable validity and consistency, and suggests that immigrant acculturation is a complex construct. This quick evaluation instrument can be applied to assess clients' acculturation and to further develop interventions to improve their health.
Development, initial content validation and reliability of Nigerian ...

African Journals Online (AJOL)

Prevention strategies are effective only when there are epidemiological data for the targeted populations. The collection of such .... Proquest, Sport discuss and Cochrane as these are ... 0.74, test retest reliability 0.70; Diet: internal consistency:.
Reliability and validity of the Japanese version of the Resilience Scale and its short version

Directory of Open Access Journals (Sweden)

Kondo Maki

2010-11-01

Background: The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS) and the short version of the RS (RS-14). Findings: The original English version of the RS was translated into Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D), Rosenberg Self-Esteem Scale (RSES), Social Support Questionnaire (SSQ), Perceived Stress Scale (PSS), and Sheehan Disability Scale (SDS) were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p ...). Conclusions: This study demonstrates that the Japanese version of the RS has psychometric properties with high degrees of internal consistency, high test-retest reliability, and relatively low concurrent validity. The RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
[Discomfort associated with dental extraction surgery and development of a questionnaire (QCirDental). Part I: Impacts and internal consistency].

Science.gov (United States)

Bortoluzzi, Marcelo Carlos; Martins, Luciana Dorochenko; Takahashi, André; Ribeiro, Bianca; Martins, Ligiane; Pinto, Marcia Helena Baldani

2018-01-01

The aim of this study was to develop and validate a questionnaire (QCirDental) to measure the impacts associated with dental extraction surgery. The QCirDental questionnaire was developed in two steps: (1) question and item generation and selection, and (2) pretest of the questionnaire with evaluation of its measurement properties (internal consistency and responsiveness). The sample was composed of 123 patients. None of the patients had any difficulty in understanding the QCirDental. The instrument was found to have excellent internal consistency, with a Cronbach's alpha reliability coefficient of 0.83. The principal component analysis (Kaiser-Meyer-Olkin Measure of Sampling Adequacy 0.72 and Bartlett's Test of Sphericity with p < 0.001) showed six dimensions explaining 67.5% of the variance. The QCirDental presented excellent internal consistency, being a questionnaire that is easy to read and understand with adequate semantic and content validity. More than 80% of the patients who underwent dental extraction reported some degree of discomfort within the perioperative period, which highlights the necessity to assess the quality of care and impacts of dental extraction surgery.
Test-retest reliability of the assessment of postural stability in typically developing children and in hearing impaired children.

Science.gov (United States)

De Kegel, A; Dhooge, I; Cambier, D; Baetens, T; Palmans, T; Van Waelvelde, H

2011-04-01

The purpose of this study was to establish test-retest reliability of centre of pressure (COP) measurements obtained by an AccuGait portable forceplate (ACG), mean COG sway velocity measured by a Basic Balance Master (BBM) and clinical balance tests in children with and without balance difficulties. 49 typically developing children and 23 hearing impaired children, with a higher risk for stability problems, between 6 and 12 years of age participated. Each child performed the modified Clinical Test of Sensory Interaction on Balance (mCTSIB), Unilateral Stance (US) and Tandem Stance on ACG, mCTSIB and US on BBM and clinical balance tests: one-leg standing, balance beam walking and one-leg hopping. All subjects completed 2 test sessions on 2 different days in the same week assessed by the same examiner. Among COP measurements obtained by the ACG, mean sway velocity was the most reliable parameter with all ICCs higher than 0.72. The standard deviation (SD) of sway velocity, sway area, SD of anterior-posterior and SD of medio-lateral COP data showed moderate to excellent reliability with ICCs between 0.55 and 0.96 but some caution must be taken into account in some conditions. BBM is less reliable but clinical balance tests are as reliable as ACG. Hearing impaired children exhibited better relative reliability (ICC) and comparable absolute reliability (SEM) for most balance parameters compared to typically developing children. Reliable information regarding postural stability of typically developing children and hearing impaired children may be obtained utilizing COP measurements generated by an AccuGait system and clinical balance tests. Copyright © 2011 Elsevier B.V. All rights reserved.
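
To make the two reliability indices named in this record concrete, here is a minimal sketch of a single-measure intraclass correlation and the standard error of measurement (SEM). The abstract does not say which ICC form was used; the two-way random-effects, absolute-agreement form ICC(2,1) and the formula SEM = SD × sqrt(1 − ICC) are assumptions, and the function names are hypothetical.

```python
from math import sqrt
from statistics import stdev


def icc_2_1(scores):
    """ICC(2,1) for a list of subjects, each a list of k session scores."""
    n, k = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(col) / n for col in zip(*scores)]
    # Two-way ANOVA sums of squares: subjects, sessions, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


def standard_error_of_measurement(scores, icc):
    """SEM = SD of all observed scores * sqrt(1 - ICC)."""
    all_scores = [x for row in scores for x in row]
    return stdev(all_scores) * sqrt(max(0.0, 1.0 - icc))
```

Because this form measures absolute agreement, a systematic shift between sessions lowers the ICC even when subjects keep their rank order; a consistency form would not penalize that shift.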
Modeling and Testing Legacy Data Consistency Requirements

DEFF Research Database (Denmark)

Nytun, J. P.; Jensen, Christian Søndergaard

2003-01-01

An increasing number of data sources are available on the Internet, many of which offer semantically overlapping data, but based on different schemas, or models. While it is often of interest to integrate such data sources, the lack of consistency among them makes this integration difficult....... This paper addresses the need for new techniques that enable the modeling and consistency checking for legacy data sources. Specifically, the paper contributes to the development of a framework that enables consistency testing of data coming from different types of data sources. The vehicle is UML and its...... accompanying XMI. The paper presents techniques for modeling consistency requirements using OCL and other UML modeling elements: it studies how models that describe the required consistencies among instances of legacy models can be designed in standard UML tools that support XMI. The paper also considers...
Psychometric Properties of the Thai Internalized Stigma Scale (TIS-LCH) for Care Home Residents.

Science.gov (United States)

Tosangwarn, Suhathai; Clissett, Philip; Blake, Holly

2017-01-01

Living in a care home is a source of stigma in Thai culture, although there is currently no measurement tool in the Thai language specifically designed to assess internalized stigma in care home residents. The Thai Version of Internalized Stigma of Living in a Care Home (TIS-LCH) scale was developed and tested for its psychometric properties among Thai older residents. The Thai version of the Internalized Stigma of Mental Health Illness (ISMI) Scale was revised into the TIS-LCH by replacing the words "mental health illness" with "living in a care home." Content validity of the TIS-LCH was determined through expert review (n = 6), and reliability testing was undertaken with older care home residents (n = 128). The TIS-LCH showed good internal consistency, with a Cronbach's alpha of .87. The test-retest reliability coefficient of the TIS-LCH was excellent for the full scale (ICC = .90). The Thai version of IS-LCH (TIS-LCH) is a valid and reliable measurement tool for assessing internalized stigma in Thai care home residents. The IS-LCH will be a useful research tool to assess internalized stigma in older adults living in care settings. Understanding stigma will help health and social care professionals to plan interventions aimed at reducing or preventing negative emotional reactions and negative behavioural responses toward stigma, which are known to be associated with mental illness and particularly depression among this population.
Evaluation of the Relative Validity and Test-Retest Reliability of a 15-Item Beverage Intake Questionnaire in Children and Adolescents.

Science.gov (United States)

Hill, Catelyn E; MacDougall, Carly R; Riebl, Shaun K; Savla, Jyoti; Hedrick, Valisa E; Davy, Brenda M

2017-11-01

Added sugar intake, in the form of sugar-sweetened beverages (SSBs), may contribute to weight gain and obesity development in children and adolescents. A valid and reliable brief beverage intake assessment tool for children and adolescents could facilitate research in this area. The purpose of this investigation was to evaluate the relative validity and test-retest reliability of a 15-item beverage intake questionnaire (BEVQ) for assessing usual beverage intake in children and adolescents. This cross-sectional investigation included four study visits within a 2- to 3-week time period. Participants (333 enrolled; 98% completion rate) were children aged 6 to 11 years and adolescents aged 12 to 18 years recruited from the New River Valley, VA, region from January 2014 to September 2015. Study visits included assessment of height/weight, health history, and four 24-hour dietary recalls (24HRs). The BEVQ was completed at two visits (BEVQ 1, BEVQ 2). To evaluate relative validity, BEVQ 1 was compared with habitual beverage intake determined by the averaged 24HR. To evaluate test-retest reliability, BEVQ 1 was compared with BEVQ 2. Analyses included descriptive statistics, independent sample t tests, χ² tests, one-way analysis of variance, paired sample t tests, and correlational analyses. In the full sample, self-reported water and total SSB intake were not different between BEVQ 1 and 24HR (mean differences 0±1 fl oz and 0±1 fl oz, respectively; both P values >0.05). Reported intake across all beverage categories was significantly correlated between BEVQ 1 and BEVQ 2 (Pbeverages was not different (all P values >0.05) between BEVQ 1 and 24HR (mean differences: whole milk=3±4 kcal, reduced-fat milk=9±5 kcal, and fat-free milk=7±6 kcal, which is 7±15 total beverage kilocalories). In adolescents (n=200), water and SSB kilocalories were not different (both P values >0.05) between BEVQ 1 and 24HR (mean differences: -1±1 fl oz and 12±9 kcal, respectively). A 15
Translation and field testing of the family functioning, family health and social support questionnaire in Danish outpatients with heart failure

DEFF Research Database (Denmark)

Østergaard, Birte; Pedersen, Karen Steenvinkel; Lauridsen, Jørgen

2018-01-01

factor analysis. METHODS: A cross-sectional design was used to study a sample of 330 patients with heart failure who completed the FAFHES. The validity (dimensionality) and reliability (internal consistency and test-retest) were assessed for each of the three scales. The scales were constructed using...... by the analysis. There were strong correlations within the factors, with Cronbach's alpha ranging from 0.73 to 0.95 across the three scales, and significant, though weak, correlations between most of the factors. None of the revised scales showed good model fit according to the goodness-of-fit indices used...
Psychometric Evaluation of the Mini International Neuropsychiatric Interview for Children and Adolescents (MINI-KID).

Science.gov (United States)

Duncan, Laura; Georgiades, Kathy; Wang, Li; Van Lieshout, Ryan J; MacMillan, Harriet L; Ferro, Mark A; Lipman, Ellen L; Szatmari, Peter; Bennett, Kathryn; Kata, Anna; Janus, Magdalena; Boyle, Michael H

2017-12-04

The goals of the study were to examine test-retest reliability, informant agreement and convergent and discriminant validity of nine DSM-IV-TR psychiatric disorders classified by parent and youth versions of the Mini International Neuropsychiatric Interview for Children and Adolescents (MINI-KID). Using samples drawn from the general population and child mental health outpatient clinics, 283 youth aged 9 to 18 years and their parents separately completed the MINI-KID with trained lay interviewers on two occasions 7 to 14 days apart. Test-retest reliability estimates based on kappa (κ) ranged from 0.33 to 0.79 across disorders, samples and informants. Parent-youth agreement on disorders was low (average κ = 0.20). Confirmatory factor analysis provided evidence supporting convergent and discriminant validity. The MINI-KID disorder classifications yielded estimates of test-retest reliability and validity comparable to other standardized diagnostic interviews in both general population and clinic samples. These findings, in addition to the brevity and low administration cost, make the MINI-KID a good candidate for use in epidemiological research and clinical practice. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
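
Kappa, the agreement statistic used in this record, corrects the observed proportion of matching classifications for the agreement expected by chance. A minimal sketch of unweighted Cohen's kappa for two sets of categorical classifications (for example, disorder present or absent at each interview) follows; the function name is hypothetical and the study's exact estimation procedure is not described in the abstract.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Unweighted Cohen's kappa for two equally long lists of category labels."""
    n = len(first)
    if n == 0 or n != len(second):
        raise ValueError("need two non-empty, equally long lists")
    # Observed agreement: share of cases given the same label both times.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement: product of the marginal proportions, summed over labels.
    counts_first, counts_second = Counter(first), Counter(second)
    labels = set(counts_first) | set(counts_second)
    expected = sum(counts_first[c] * counts_second[c] for c in labels) / n ** 2
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return (observed - expected) / (1.0 - expected)
```

A kappa of 0 means agreement no better than chance and 1 means perfect agreement, which is why a value such as 0.20 is read as low even if the raw percentage of matching classifications looks high.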
Test-retest reliability of the novel 5-HT1B receptor PET radioligand [11C]P943

Energy Technology Data Exchange (ETDEWEB)

Saricicek, Aybala [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Izmir Katip Celebi University, Department of Psychiatry, Izmir (Turkey)]; Chen, Jason; Ruf, Barbara [Yale University, Department of Psychiatry, New Haven, CT (United States)]; Planeta, Beata; Labaree, David; Gallezot, Jean-Dominique; Huang, Yiyun [Yale University, PET Center, Department of Diagnostic Radiology, New Haven, CT (United States)]; Subramanyam, Kalyani; Maloney, Kathleen [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States)]; Matuskey, David [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Yale University, PET Center, Department of Diagnostic Radiology, New Haven, CT (United States)]; Deserno, Lorenz [Charite - Universitaetsmedizin Berlin, Department of Psychiatry and Psychotherapy, Campus Charite Mitte, Berlin (Germany); Max-Planck-Institute for Human Cognitive and Brain Sciences, Leipzig, Berlin (Germany)]; Neumeister, Alexander [Yale University, Department of Psychiatry, New Haven, CT (United States); Mount Sinai School of Medicine, Department of Psychiatry, New York, NY (United States); VA Connecticut Healthcare System, Clinical Neuroscience Division, VA National Center for PTSD, West Haven, CT (United States)]; Krystal, John H. [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); VA Connecticut Healthcare System, Clinical Neuroscience Division, VA National Center for PTSD, West Haven, CT (United States)]; Carson, Richard E. [Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States)]; Bhagwagar, Zubin [Yale University, Department of Psychiatry, New Haven, CT (United States); Connecticut Mental Health Center, Abraham Ribicoff Research Facilities, New Haven, CT (United States); Bristol-Myers Squibb, Wallingford, CT (United States)]

2014-11-27

[11C]P943 is a novel, highly selective 5-HT1B PET radioligand. The aim of this study was to determine the test-retest reliability of [11C]P943 using two different modeling methods and to perform a power analysis with each quantification technique. Seven healthy volunteers underwent two PET scans on the same day. Regions of interest (ROIs) were the amygdala, hippocampus, pallidum, putamen, insula, frontal, anterior cingulate, parietal, temporal and occipital cortices, and cerebellum. Two multilinear radioligand quantification techniques were used to estimate binding potential: MA1, using arterial input function data, and the second version of the multilinear reference tissue model analysis (MRTM2), using the cerebellum as the reference region. Between-scan percent variability and intraclass correlation coefficients (ICC) were used to assess test-retest reliability. We also performed power analyses to determine the method that would allow the least number of subjects using within-subject or between-subject study designs. A voxel-wise ICC analysis for MRTM2 BP_ND was performed for the whole brain and all the ROIs studied. Mean percent variability between two scans across regions ranged between 0.4% and 12.4% for MA1 BP_ND, 0.5% and 11.5% for MA1 BP_P, 16.7% and 28.3% for MA1 BP_F, and between 0.2% and 5.4% for MRTM2 BP_ND. The power analyses showed a greater number of subjects were required using MA1 BP_F compared with other outcome measures for both within-subject and between-subject study designs. ICC values were the highest using MRTM2 BP_ND and the lowest with MA1 BP_F in ten ROIs. Small regions and regions with low binding had lower ICC values than large regions and regions with high binding. Reliable measures of 5-HT1B receptor binding can be obtained using the novel PET radioligand [11C]P943. Quantification of 5-HT1B receptor binding with MRTM2 BP_ND and with MA1 BP_P
Validation of Farsi translation of the ocular surface disease index

Directory of Open Access Journals (Sweden)

Farzad Pakdel

2017-01-01

Conclusion: The obtained F-OSDI showed acceptable internal consistency and test-retest reliability. This F-OSDI could be used for assessment of dry eye, ocular surface discomfort and quality of life in Iranian and Farsi speaking populations.
[Validity and internal consistency of the Maslach Burnout Inventory in Dental Students from Cartagena, Colombia].

Science.gov (United States)

Simancas-Pallares, Miguel Angel; Fortich Mesa, Natalia; González Martínez, Farith Damián

To determine the internal consistency and content validity of the Maslach Burnout Inventory-Student Survey (MBI-SS) in dental students from Cartagena, Colombia. Scale validation study in 886 dental students from Cartagena, Colombia. Factor structure was determined through exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). Internal consistency was measured using Cronbach's alpha coefficient. Analyses were performed using Stata v.13.2 for Windows (Statacorp., USA) and Mplus v.7.31 for Windows (Muthén & Muthén, USA) software. Internal consistency was α=.806. The factor structure showed three factors that accounted for 56.6% of the variance. CFA revealed: χ²=926.036; df=85; RMSEA=.106 (90%CI, .100-.112); CFI=.947; TLI=.934. The MBI showed adequate internal consistency and a factor structure consistent with the originally proposed structure, but with a poor fit, which does not reflect adequate content validity in this sample. Copyright © 2016 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
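
The fit indices in this record can be cross-checked against each other. The sketch below assumes the textbook point estimate RMSEA = sqrt(max(χ² − df, 0) / (df × (N − 1))); the abstract does not state the estimator, and software may use N instead of N − 1 or a scaled χ², so this is a plausibility check rather than a reproduction of the authors' analysis. The function name is hypothetical.

```python
from math import sqrt


def rmsea(chi_square, df, n):
    """RMSEA point estimate from a model chi-square, its df, and sample size n."""
    if df <= 0 or n <= 1:
        raise ValueError("df must be positive and n greater than 1")
    return sqrt(max(chi_square - df, 0.0) / (df * (n - 1)))


# Values reported in the abstract: chi-square = 926.036, df = 85, N = 886.
reported = rmsea(926.036, 85, 886)
```

Under this assumed formula the reported χ², degrees of freedom, and sample size give a value that rounds to .106, in line with the RMSEA stated in the abstract and above the cut-offs usually taken to indicate acceptable fit.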

Validity and reliability assessment of the Brazilian version of the game addiction scale (GAS).

Science.gov (United States)

Lemos, Igor Lins; Cardoso, Adriana; Sougey, Everton Botelho

2016-05-01

The uncontrolled use of video games can be addictive. The Game Addiction Scale (GAS) is an instrument that was developed to assess this type of addiction. The GAS consists of 21 items that are divided into the following seven factors: salience, tolerance, mood modification, relapse, withdrawal, conflict and problems. This study assessed the convergent validity and reliability of the GAS according to measures of internal consistency and test-retest stability. Three hundred and eighty-four students completed the GAS, the Internet Addiction Test (IAT), the Liebowitz Social Anxiety Scale (LSAS), the Beck Depression Inventory (BDI) and the Video Game Addiction Test (VAT). A subgroup of the participants (n=76) completed the GAS again after 30 days to determine test-retest stability. The GAS demonstrated excellent internal consistency (Cronbach's alpha=0.92), was highly correlated with the VAT (r=0.883) and was moderately correlated with the BDI (r=0.358), the LSAS (r=0.326) and the IAT (r=0.454). In the Brazilian Portuguese population, the GAS shows good internal consistency. These data indicate that the GAS can be used to assess video game addiction due to its demonstrated psychometric validity. Copyright © 2016 Elsevier Inc. All rights reserved.
RELIABILITY OF THE DYNAMIC OCCUPATIONAL THERAPY COGNITIVE ASSESSMENT FOR CHILDREN (DOTCA-CH): THAI VERSION OF ORIENTATION, SPATIAL PERCEPTION, AND THINKING OPERATIONS SUBTESTS

Directory of Open Access Journals (Sweden)

Suchitporn Lersilp

2014-06-01

The Dynamic Occupational Therapy Cognitive Assessment for Children (DOTCA-Ch) is a tool for identifying cognitive problems in school-aged children. However, the DOTCA-Ch was developed in English for Western children. For this reason, it is not appropriate for Thai children because of differences in culture and language. The objectives of this study were to translate the Orientation, Spatial Perception, and Thinking Operations subtests of the DOTCA-Ch into a Thai version using a World Health Organization back-translation process, and to examine its internal consistency, inter-rater reliability and test-retest reliability. The participants consisted of 38 intellectually impaired and learning disabled individuals between the ages of 6 and 12. Results from this study revealed high internal consistency in the Orientation subtest (α=.83), Spatial Perception subtest (α=.82) and Thinking Operations subtest (α=.82); high inter-rater reliability in the Orientation subtest (ICC =.83), Spatial Perception subtest (ICC =.84) and Thinking Operations subtest (ICC =.74); and high test-retest reliability in the Orientation subtest (ICC =.84), Spatial Perception subtest (ICC =.86) and Thinking Operations subtest (ICC =.85). These results indicate that the Thai version of the DOTCA-Ch in Orientation, Spatial Perception, and Thinking Operations subtests might be used as an appropriate assessment tool for Thai children, based on psychometric evidence including internal consistency, inter-rater reliability and test-retest reliability. However, additional study of other psychometric properties, including predictive validity, concurrent reliability, and inter-rater reliability during the mediation process of this assessment tool, needs to be carried out.
Test-retest reliability of the KINARM end-point robot for assessment of sensory, motor and neurocognitive function in young adult athletes.

Directory of Open Access Journals (Sweden)

Cameron S Mang

Current assessment tools for sport-related concussion are limited by a reliance on subjective interpretation and patient symptom reporting. Robotic assessments may provide more objective and precise measures of neurological function than traditional clinical tests. The objective was to determine the reliability of assessments of sensory, motor and cognitive function conducted with the KINARM end-point robotic device in young adult elite athletes. Sixty-four randomly selected healthy, young adult elite athletes participated. Twenty-five individuals (25 M; mean age ± SD, 20.2 ± 2.1 years) participated in a within-season study, where three assessments were conducted within a single season (assessments labeled by session: S1, S2, S3). An additional 39 individuals (28 M; 22.8 ± 6.0 years) participated in a year-to-year study, where annual pre-season assessments were conducted for three consecutive seasons (assessments labeled by year: Y1, Y2, Y3). Forty-four parameters from five robotic tasks (Visually Guided Reaching, Position Matching, Object Hit, Object Hit and Avoid, and Trail Making B) and overall Task Scores describing performance on each task were quantified. Test-retest reliability was determined by intra-class correlation coefficients (ICCs) between the first and second, and second and third assessments. In the within-season study, ICCs were ≥0.50 for 68% of parameters between S1 and S2, 80% of parameters between S2 and S3, and for three of the five Task Scores both between S1 and S2, and S2 and S3. In the year-to-year study, ICCs were ≥0.50 for 64% of parameters between Y1 and Y2, 82% of parameters between Y2 and Y3, and for four of the five Task Scores both between Y1 and Y2, and Y2 and Y3. Overall, the results suggest moderate-to-good test-retest reliability for the majority of parameters measured by the KINARM robot in healthy young adult elite athletes. Future work will consider the potential use of this information for clinical assessment of concussion.
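The abstract does not state which ICC form was used. As a sketch under that caveat, the Python below computes the two-way random-effects, absolute-agreement, single-measurement coefficient, often written ICC(2,1), from a subjects-by-sessions table. It then reports the share of parameters meeting a 0.50 threshold in the way the results are summarised above. The names `icc_2_1` and `share_at_least`, and all numbers, are hypothetical.

```python
def icc_2_1(data):
    """ICC(2,1): two-way random effects, absolute agreement, single
    measurement. `data` has one row per subject, one column per session."""
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)                 # between-subject mean square
    ms_cols = ss_cols / (k - 1)                 # between-session mean square
    ms_error = ss_error / ((n - 1) * (k - 1))   # residual mean square
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


def share_at_least(iccs, threshold=0.50):
    """Fraction of parameters whose ICC meets the threshold."""
    return sum(v >= threshold for v in iccs) / len(iccs)


# Hypothetical scores for 4 athletes on one parameter at two sessions.
# Every athlete improves by exactly one point at the second session.
sessions = [[1, 2], [2, 3], [3, 4], [4, 5]]
print(f"ICC(2,1) = {icc_2_1(sessions):.3f}")
print(f"share >= 0.50: {share_at_least([0.2, 0.5, 0.7, 0.9]):.2f}")
```

Because this form measures absolute agreement, a uniform shift between sessions lowers the coefficient even though the rank order of athletes is perfectly preserved; a consistency-type ICC would not penalise that shift.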
Development and reliability testing of a Health Action Process Approach inventory for physical activity participation among individuals with schizophrenia

Directory of Open Access Journals (Sweden)

Kelly Arbour-Nicitopoulos

2014-06-01

Individuals with schizophrenia tend to have high levels of cardiovascular disease and lower physical activity (PA) levels than the general population. Research is urgently required to develop evidence-based behavioral interventions for increasing PA in this population. One model that has been increasingly used to understand the mechanisms underlying PA is the Health Action Process Approach (HAPA). The purpose of this study was to adapt and pilot-test a HAPA-based inventory that reliably captures salient, modifiable PA determinants for individuals with schizophrenia. Initially, twelve outpatients with schizophrenia reviewed the inventory and provided verbal feedback regarding comprehension, item relevance, and potential new content. A content analysis framework was used to inform modifications to the inventory. The resultant inventory underwent a quantitative assessment of internal consistency and test-retest reliability. Twenty-five outpatients (Mage = 41.5 ± 13.5 years; 64% male) completed the inventory on two separate occasions, one week apart. All but two scales showed good internal consistency (Cronbach’s α = 0.62–0.98) and test-retest correlations (rs = .21–.96). Preliminary assessment of criterion validity of the HAPA inventory showed significant, large-sized correlations between behavioural intentions and both affective outcome expectancies and task self-efficacy, and small-to-moderate correlations between self-reported minutes of moderate-to-vigorous PA and the volitional constructs of the HAPA model. These findings provide preliminary support for the reliability and validity of the first-ever inventory for examining theory-based predictors of moderate to vigorous PA intentions and behavior among individuals with schizophrenia. Further validation research with this inventory using an objective measure of PA behavior will provide additional support for its psychometric properties within the schizophrenia population.
Internal Consistency and Convergent Validity of the Klontz Money Behavior Inventory (KMBI)

Directory of Open Access Journals (Sweden)

Colby D. Taylor

2015-12-01

The Klontz Money Behavior Inventory (KMBI) is a standalone, multi-scale measure that can screen for the presence of eight distinct money disorders. Given the well-established relationship between mental health and financial behaviors, results from the KMBI can be used to inform both mental health care professionals and financial planners. The present study examined the internal consistency and convergent validity of the KMBI, through comparison with similar measures, among a sample of college students (n = 232). Results indicate that the KMBI demonstrates acceptable internal consistency reliability and some convergence for most subscales when compared to other analogous measures. These findings highlight a need for literature and assessments to identify and describe disordered money behaviors.
Development and psychometric properties of the Patient-Head Injury Participation Scale (P-HIPS) and the Patient-Head Injury Neurobehavioral Assessment Scale (P-HINAS): patient and family determined outcomes scales.

Science.gov (United States)

Deb, Shoumitro; Bryant, Eleanor; Morris, Paul G; Prior, Lindsay; Lewis, Glyn; Haque, Sayeed

2007-06-01

To develop a measure to assess post-acute outcome following traumatic brain injury (TBI), with particular emphasis on the emotional and the behavioral outcome. The second objective was to assess the test-retest reliability, internal consistency, and factor structure of the newly developed patient version of the Head Injury Participation Scale (P-HIPS) and Patient-Head Injury Neurobehavioral Scale (P-HINAS). Thirty-two TBI individuals and 27 carers took part in in-depth qualitative interviews exploring the consequences of the TBI. Interview transcripts were analyzed and key themes and concepts were used to construct the 49-item P-HIPS. A postal survey was then conducted on a cohort of 113 TBI patients to 'field test' the P-HIPS and the P-HINAS. All 49 individual items of the P-HIPS and their total score showed good test-retest reliability (0.93) and internal consistency (0.95). The P-HIPS showed a very good correlation with the Mayo Portland Adaptability Inventory-3 (MPAI-3) (0.87) and a moderate negative correlation with the Glasgow Outcome Scale-Extended (GOSE) (-0.51). Factor analysis extracted the following domains: 'Emotion/Behavior,' 'Independence/Community Living,' 'Cognition' and 'Physical'. The 'Emotion/Behavior' factor constituted the P-HINAS, which showed good internal consistency (0.93), test-retest reliability (0.91) and concurrent validity with the MPAI subscale (0.82). Both the P-HIPS and the P-HINAS show strong psychometric properties. The qualitative methodology employed in the construction stage of the questionnaires provided good evidence of face and content validity.
Health Belief Model Scale for Human Papilloma Virus and its Vaccination: Adaptation and Psychometric Testing.

Science.gov (United States)

Guvenc, Gulten; Seven, Memnun; Akyuz, Aygul

2016-06-01

To adapt and psychometrically test the Health Belief Model Scale for Human Papilloma Virus (HPV) and Its Vaccination (HBMS-HPVV) for use in a Turkish population and to assess the Human Papilloma Virus Knowledge score (HPV-KS) among female college students. Instrument adaptation and psychometric testing study. The sample consisted of 302 nursing students at a nursing school in Turkey between April and May 2013. Questionnaire-based data were collected from the participants. Information regarding HBMS-HPVV, HPV knowledge, and descriptive characteristics of participants was collected using the translated HBMS-HPVV and HPV-KS. Test-retest reliability was evaluated, Cronbach α was used to assess internal consistency reliability, and exploratory factor analysis was used to assess construct validity of the HBMS-HPVV. The scale consists of 4 subscales that measure 4 constructs of the Health Belief Model covering the perceived susceptibility and severity of HPV and the benefits and barriers. The final 14-item scale had satisfactory validity and internal consistency. Cronbach α values for the 4 subscales ranged from 0.71 to 0.78. Total HPV-KS ranged from 0 to 8 (scale range, 0-10; 3.80 ± 2.12). The HBMS-HPVV is a valid and reliable instrument for measuring young Turkish women's beliefs and attitudes about HPV and its vaccination. Copyright © 2015 North American Society for Pediatric and Adolescent Gynecology. Published by Elsevier Inc. All rights reserved.
Psychometric Evaluation of the Persian Version of Barkley Adult Attention Deficit/Hyperactivity Disorder Screening Tool among the Elderly

Directory of Open Access Journals (Sweden)

Mostafa Sadeghi

2017-01-01

Background. The Barkley Adult Attention Deficit/Hyperactivity Disorder (ADHD) Rating Scale-IV (BAARS-IV) was developed, and it demonstrated good psychometric properties. The BAARS-IV includes 27 questions on the symptoms of adult ADHD. The purpose of the present study is to investigate the psychometric properties of the Persian version of the BAARS-IV among the elderly in Tabriz City. Method. This cross-sectional study was conducted in Tabriz City, in the west of Iran, in 2015 by enrolling 121 elderly people. We translated and adapted the BAARS-IV and examined its concurrent validity, internal consistency, and test-retest reliability. Result. The BAARS-IV demonstrated good internal consistency and test-retest reliability. Correlations between the BAARS-IV and the CAARS-S: SV were high, and evidence supporting concurrent validity was revealed. Cronbach’s alpha for the overall scale and subscales stood at 0.89, 0.81, 0.66, 0.56, and 0.82, respectively. Conclusion. The Persian BAARS-IV showed acceptable reliability and validity. BAARS-IV was determined to be composed of internally consistent and psychometrically sound items.
Testing the Zimbardo Time Perspective Inventory in the Chinese context.

Science.gov (United States)

Wang, Ya; Chen, Xing-Jie; Cui, Ji-Fang; Liu, Lu-Lu

2015-09-01

In this study, the authors evaluated the Chinese version of the Zimbardo Time Perspective Inventory (ZTPI). The ZTPI was tested among a sample of 303 university students. A subsample of 51 participants was then asked to complete the ZTPI again along with another set of questionnaires. The five-factor model of a 20-item short version of the ZTPI showed good model fit, internal consistency, and test-retest reliability. The 20-item Chinese version of the ZTPI also provided good validity, showing correlations with other variables in expected directions. Past-Positive was positively correlated with reappraisal and negatively correlated with suppression emotion regulation strategies, and Present-Hedonistic was positively correlated with reappraisal emotion regulation strategies. These findings indicate that the ZTPI is a reliable and valid instrument for measuring time perspective in the Chinese setting. © 2015 The Institute of Psychology, Chinese Academy of Sciences and Wiley Publishing Asia Pty Ltd.
Measuring standing balance in multiple sclerosis: Further progress towards an automatic and reliable method in clinical practice.

Science.gov (United States)

Keune, Philipp M; Young, William R; Paraskevopoulos, Ioannis T; Hansen, Sascha; Muenssinger, Jana; Oschmann, Patrick; Müller, Roy

2017-08-15

Balance deficits in multiple sclerosis (MS) are often monitored by means of observer-rated tests. These may provide reliable data, but may also be time-consuming, subject to inter-rater variability, and potentially insensitive to mild fluctuations throughout the clinical course. On the other hand, laboratory assessments are often not available. The Nintendo Wii Balance Board (WBB) may represent a low-cost solution. The purpose of the current study was to examine the methodological quality of WBB data in MS (internal consistency, test-retest reliability), convergent validity with observer-rated tests (Berg Balance Scale, BBS; Timed-Up and Go Test, TUG), and discriminative validity concerning clinical status (Expanded Disability Status Scale, EDSS). Standing balance was assessed with the WBB for 4 min in 63 MS patients at two assessment points, four months apart. Additionally, patients were examined with the BBS, TUG and the EDSS. A period of 4 min on the WBB provided data characterized by excellent internal consistency and test-retest reliability. Significant correlations between WBB data and results of the BBS and TUG were obtained after merely 2 min on the board. An EDSS median-split revealed that higher EDSS values (>3) were associated with significantly increased postural sway on the WBB. WBB measures reflecting postural sway are methodologically robust in MS, involving excellent internal consistency and test-retest reliability. They are also characterized by convergent validity with other considerably lengthier observer-rated balance measures (BBS) and sensitive to broader clinical characteristics (EDSS). The WBB may hence represent an effective, easy-to-use monitoring tool for MS patients in clinical practice. Copyright © 2017 Elsevier B.V. All rights reserved.
Validity and reliability of a Nigerian-Yoruba version of the stroke-specific quality of life scale 2.0.

Science.gov (United States)

Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana

2017-10-19

Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of the Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). The Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) were administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was assessed among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p [...] validity of SS-QoL(Y) showed that items' r values ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha = 0.61 to 0.82) for domain scores. These correlations were also significant for the domains and overall scores (p [...] validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Combination of classical test theory (CTT) and item response theory (IRT) analysis to study the psychometric properties of the French version of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF).

Science.gov (United States)

Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric

2015-02-01

The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.
Development, validity and reliability testing of the East Midlands Evaluation Tool (EMET) for measuring impacts on trainees' confidence and competence following end of life care training.

Science.gov (United States)

Whittaker, B; Parry, R; Bird, L; Watson, S; Faull, C

2017-02-02

To develop, test and validate a versatile questionnaire, the East Midlands Evaluation Tool (EMET), for measuring effects of end of life care training events on trainees' self-reported confidence and competence. A paper-based questionnaire was designed on the basis of the English Department of Health's core competences for end of life care, with sections for completion pretraining, immediately post-training and also for longer term follow-up. Preliminary versions were field tested at 55 training events delivered by 13 organisations to 1793 trainees working in diverse health and social care backgrounds. Iterative rounds of development aimed to maximise relevance to events and trainees. Internal consistency was assessed by calculating interitem correlations on questionnaire responses during field testing. Content validity was assessed via qualitative content analysis of (1) responses to questionnaires completed by field tester trainers and (2) field notes from a workshop with a separate cohort of experienced trainers. Test-retest reliability was assessed via repeat administration to a cohort of student nurses. The EMET comprises 27 items with Likert-scaled responses supplemented with questions seeking free-text responses. It measures changes in self-assessed confidence and competence on 5 subscales: communication skills; assessment and care planning; symptom management; advance care planning; overarching values and knowledge. Test-retest reliability was found to be good, as was internal consistency: the questions successfully assess different aspects of the same underlying concept. The EMET provides a time-efficient, reliable and flexible means of evaluating effects of training on self-reported confidence and competence in the key elements of end of life care. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Standardization, Validity and Reliability Study of Gülhane Aphasia Test-2 (GAT-2)

Directory of Open Access Journals (Sweden)

İlknur Maviş

2007-04-01

OBJECTIVE: The Gülhane Aphasia Test-2 (GAT-2) has been developed to show the presence of the language disorder ‘aphasia’ and to give the clinician implications for the accompanying speech disorders such as apraxia and dysarthria. The aim of the study was to report the standardization, validity and reliability study of GAT-2. METHODS: Ten healthy individuals were tested initially for the pilot study. 134 healthy individuals were included in the standardization study, and 30 individuals with aphasia and 11 individuals with right brain injury were included in the validation study. Between-group differences in GAT-2 scores and the effects of age, years of education, and sex were examined. GAT-2 cut-off scores were calculated from the scores of healthy individuals. GAT-2 test-retest reliability and inter-observer reliability were calculated. RESULTS: Healthy individuals’ GAT-2 scores were significantly different from the GAT-2 scores of aphasic patients, but not from those of right brain injured patients. Healthy individuals’ GAT-2 scores were not affected by sex or age but were affected by years of education, so cut-off scores were calculated according to years of education. GAT-2 scores of aphasic patients were not affected by age, sex or years of education. Test-retest and inter-observer reliability and internal consistency results showed that GAT-2 is a highly reliable aphasia screening test. CONCLUSION: GAT-2 was found to be a standardized, highly reliable and valid aphasia test for Turkish stroke patients with aphasia.
The development and validation of a test of science critical thinking for fifth graders.

Science.gov (United States)

Mapeala, Ruslan; Siew, Nyet Moi

2015-01-01

The paper described the development and validation of the Test of Science Critical Thinking (TSCT) to measure three critical thinking skill constructs: comparing and contrasting, sequencing, and identifying cause and effect. The initial TSCT consisted of 55 multiple choice test items, each of which required participants to select a correct response and a correct choice of critical thinking used for their response. Data were obtained from a purposive sample of 30 fifth graders in a pilot study carried out in a primary school in Sabah, Malaysia. Students underwent sessions of teaching and learning activities for 9 weeks using the Thinking Maps-aided Problem-Based Learning Module before they answered the TSCT. Analyses were conducted to check the difficulty index (p), the discrimination index (d), internal consistency reliability, content validity, and face validity. Analysis of the test-retest reliability data was conducted separately for a group of fifth graders with similar ability. Findings of the pilot study showed that, out of the 55 items initially administered, only 30 items were selected, those with a relatively good difficulty index (p) ranging from 0.40 to 0.60 and a good discrimination index (d) ranging from 0.20 to 1.00. The Kuder-Richardson reliability value was found to be appropriate and relatively high, at 0.70, 0.73 and 0.92 for identifying cause and effect, sequencing, and comparing and contrasting, respectively. The content validity index obtained from three expert judgments equalled or exceeded 0.95. In addition, test-retest reliability showed good, statistically significant correlations ([Formula: see text]). From the above results, the selected 30-item TSCT was found to have sufficient reliability and validity and would therefore represent a useful tool for measuring critical thinking ability among fifth graders in primary science.
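The item-screening step described above can be sketched in Python. The sketch assumes dichotomous (0/1) item scores and an upper-versus-lower 27% grouping for the discrimination index, a common convention that the abstract does not confirm. It also assumes KR-20 as the Kuder-Richardson coefficient. The `answers` matrix and all function names are hypothetical.

```python
from statistics import pvariance


def difficulty(responses):
    """Difficulty index p: proportion of examinees answering each item correctly."""
    n = len(responses)
    return [sum(col) / n for col in zip(*responses)]


def discrimination(responses, fraction=0.27):
    """Discrimination index d: proportion correct in the top-scoring group
    minus proportion correct in the bottom-scoring group."""
    ranked = sorted(responses, key=sum, reverse=True)
    g = max(1, round(len(ranked) * fraction))   # size of each extreme group
    upper, lower = ranked[:g], ranked[-g:]
    return [sum(u) / g - sum(l) / g for u, l in zip(zip(*upper), zip(*lower))]


def kr20(responses):
    """Kuder-Richardson formula 20 for dichotomous items."""
    k = len(responses[0])
    pq_sum = sum(p * (1 - p) for p in difficulty(responses))
    total_var = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - pq_sum / total_var)


# Hypothetical answers: 6 examinees x 3 items, 1 = correct (illustrative only).
answers = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 1],
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 0],
]
p_values = difficulty(answers)
d_values = discrimination(answers)
# Keep items inside the ranges reported in the abstract.
selected = [
    i for i, (p, d) in enumerate(zip(p_values, d_values))
    if 0.40 <= p <= 0.60 and d >= 0.20
]
print("selected items:", selected)
print(f"KR-20 = {kr20(answers):.2f}")
```

Ties in total score make the extreme groups depend on row order in so small a sample; a real analysis would use the full cohort and a documented tie-breaking rule.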
A Bayesian Decision-Theoretic Approach to Logically-Consistent Hypothesis Testing

Directory of Open Access Journals (Sweden)

Gustavo Miranda da Silva

2015-09-01

This work addresses an important issue regarding the performance of simultaneous test procedures: the construction of multiple tests that at the same time are optimal from a statistical perspective and that also yield logically-consistent results that are easy to communicate to practitioners of statistical methods. For instance, if hypothesis A implies hypothesis B, is it possible to create optimal testing procedures that reject A whenever they reject B? Unfortunately, several standard testing procedures fail to have such logical consistency. Although this has been deeply investigated under a frequentist perspective, the literature lacks analyses under a Bayesian paradigm. In this work, we contribute to the discussion by investigating three rational relationships under a Bayesian decision-theoretic standpoint: coherence, invertibility and union consonance. We characterize and illustrate through simple examples optimal Bayes tests that fulfill each of these requisites separately. We also explore how far one can go by putting these requirements together. We show that although fairly intuitive tests satisfy both coherence and invertibility, no Bayesian testing scheme meets the desiderata as a whole, strengthening the understanding that logical consistency cannot be combined with statistical optimality in general. Finally, we associate Bayesian hypothesis testing with Bayes point estimation procedures. We prove the performance of logically-consistent hypothesis testing by means of a Bayes point estimator to be optimal only under very restrictive conditions.
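The requirement that a procedure reject A whenever it rejects B, for A implying B, can be checked mechanically on a small example. The Python sketch below is an illustration and not the authors' construction. It assumes a finite parameter space, treats each hypothesis as a set of parameter values, and uses a simple rule that rejects a hypothesis when its posterior probability falls below a cutoff. The posterior values and all names are hypothetical.

```python
from itertools import combinations


def posterior_prob(hypothesis, posterior):
    """Posterior probability of a hypothesis given as a set of parameter values."""
    return sum(posterior[theta] for theta in hypothesis)


def rejects(hypothesis, posterior, cutoff=0.5):
    """Illustrative rule: reject when posterior probability is below the cutoff."""
    return posterior_prob(hypothesis, posterior) < cutoff


def is_coherent(posterior, cutoff=0.5):
    """Check that for every nested pair A within B, rejecting B implies rejecting A."""
    thetas = list(posterior)
    hypotheses = [
        frozenset(c)
        for r in range(1, len(thetas) + 1)
        for c in combinations(thetas, r)
    ]
    for a in hypotheses:
        for b in hypotheses:
            if a <= b and rejects(b, posterior, cutoff) and not rejects(a, posterior, cutoff):
                return False
    return True


# Hypothetical posterior over four parameter values (illustrative only).
posterior = {0: 0.1, 1: 0.2, 2: 0.3, 3: 0.4}
print("coherent:", is_coherent(posterior))
```

The rule passes because a subset can never have more posterior mass than its superset, so any cutoff that rejects B also rejects A. Whether such a rule is also optimal under a given loss function is the separate question the abstract addresses.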
Reliability, Dimensionality, and Internal Consistency as Defined by Cronbach: Distinct Albeit Related Concepts

Science.gov (United States)

Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U.

2015-01-01

This article uses definitions provided by Cronbach in his seminal paper for coefficient α to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's α. Internal consistency…
Validation of a pediatric caregiver diary to measure symptoms of postacute respiratory syncytial virus bronchiolitis

DEFF Research Database (Denmark)

Santanello, Nancy C; Norquist, Josephine M; Nelsen, Linda M

2005-01-01

... consistent, supporting a unidimensional scale structure. Test-retest reliabilities for the percentage of SFD and CSS were above the recommended cut point of 0.70. Cross-sectional and longitudinal correlations were sizeable and statistically significant, demonstrating construct validity. Hypothesized known ... Acute respiratory syncytial virus (RSV)-induced bronchiolitis is often associated with continuing respiratory symptoms following hospitalization. To date, there is no validated objective measure to evaluate symptoms of RSV-induced bronchiolitis. We report on the reliability, validity ... the 4-week treatment period of the reported prospective, placebo-controlled trial of montelukast for treatment of postacute RSV were used to assess reliability (internal consistency and test-retest), construct validity (cross-sectional and longitudinal correlations), discriminant validity (known ...
Processes and Procedures for Estimating Score Reliability and Precision

Science.gov (United States)

Bardhoshi, Gerta; Erford, Bradley T.

2017-01-01

Precision is a key facet of test development, with score reliability determined primarily according to the types of error one wants to approximate and demonstrate. This article identifies and discusses several primary forms of reliability estimation: internal consistency (i.e., split-half, KR-20, α), test-retest, alternate forms, interscorer, and…
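Of the internal-consistency estimates listed above, split-half reliability is the simplest to show in code. The Python sketch below assumes an odd-even item split and the Spearman-Brown correction, r_full = 2r / (1 + r), which steps the half-test correlation up to full test length. The data and function names are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to the reliability of the full test."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(scores):
    """Odd-even split-half reliability with Spearman-Brown correction."""
    odd_half = [sum(row[0::2]) for row in scores]    # items 1, 3, 5, ...
    even_half = [sum(row[1::2]) for row in scores]   # items 2, 4, 6, ...
    return spearman_brown(pearson_r(odd_half, even_half))


# Hypothetical item scores: 5 examinees x 4 items (illustrative only).
item_scores = [
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 1, 0, 0],
    [1, 1, 0, 1],
]
print(f"split-half reliability = {split_half_reliability(item_scores):.2f}")
```

The estimate depends on how the items are split, which is one reason coefficients such as KR-20 and α, which do not depend on a particular split, are often reported instead.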
The Meaningful Activity Participation Assessment: A Measure of Engagement in Personally Valued Activities

Science.gov (United States)

Eakman, Aaron M.; Carlson, Mike E.; Clark, Florence A.

2010-01-01

The Meaningful Activity Participation Assessment (MAPA), a recently developed 28-item tool designed to measure the meaningfulness of activity, was tested in a sample of 154 older adults. The MAPA evidenced a sufficient level of internal consistency and test-retest reliability and correlated as theoretically predicted with the Life Satisfaction…

Retesting young STI clinic visitors with urogenital Chlamydia trachomatis infection in the Netherlands; response to a text message reminder and reinfection rates: a prospective study with historical controls.

Science.gov (United States)

Kampman, Cjg; Koedijk, Fdh; Driessen-Hulshof, Hcm; Hautvast, Jla; van den Broek, Ivf

2016-03-01

The objective of this study is to assess the effect of reminder text messages 6 months after the initial treatment on retest and chlamydia reinfection rates in young heterosexuals compared with a historical control group and to assess factors associated with both outcomes. Heterosexual people (aged 16-23 years), testing positive for urogenital chlamydia, were offered a retest after 6 months. Participants received a text message reminder at 6 months after the initial chlamydia diagnosis. Rates of retest uptake and the result of the retest were analysed using Cox regression. Prevalence ratios (PRs) were calculated to identify factors associated with these outcomes. Furthermore, the retest rate was compared with the retest rate of a historical control group. 30.6% (253/838) of the study group returned within 5-8 months compared with 9.2% (140/1530) in the historical control group. Women and persons who were not notified for a sexually transmitted infection (STI) at inclusion were more likely to return for a retest. 20.4% (56/275) of participants had a chlamydia reinfection upon retesting. Reinfection was higher in participants reporting STI-related symptoms (PR 3.2, 95% CI 1.8 to 5.6) and in participants who were notified for an STI at retest (PR 5.3, 95% CI 2.4 to 11.5). A text message reminder appeared to have a clear, positive impact on the resulting retest rate. These results also indicate that retesting is necessary to identify chlamydia reinfections. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Test-Taking Strategies in L2 Assessment: The Test of English for International Communication Speaking Test.

Science.gov (United States)

Huang, Heng-Tsung Danny

2016-08-01

This research explored the test-taking strategies associated with the Test of English for International Communication Speaking Test (TOEIC-S) and their relationship with test performance. Capitalizing on two sets of TOEIC-S and a custom-made strategy inventory, the researcher collected data from a total of 215 Taiwanese English learners consisting of 84 males and 131 females with an average age of 20.1 years (SD = 2.6). Quantitative data analysis gave rise to three major findings. First, TOEIC-S test-taking strategy use constituted a multi-faceted construct that involved multiple types of strategic behaviors. Second, these strategic behaviors matched those allowing test-takers to communicate both in real life and in the workplace. Third, communication strategy use and cognitive strategy use both contributed significantly to TOEIC-S performance. © The Author(s) 2016.
Can local staff reliably assess their own programs? A confirmatory test-retest study of Lot Quality Assurance Sampling data collectors in Uganda.

Science.gov (United States)

Beckworth, Colin A; Anguyo, Robert; Kyakulaga, Francis Cranmer; Lwanga, Stephen K; Valadez, Joseph J

2016-08-17

Data collection techniques that routinely provide health system information at the local level are in demand and needed. Lot Quality Assurance Sampling (LQAS) is intended for use by local health teams to collect data at the district and sub-district levels. Our question is whether local health staff produce biased results, as they are responsible for implementing the programs they also assess. This test-retest study replicates on a larger scale an earlier LQAS reliability assessment in Uganda. We conducted an LQAS survey in two districts using 15 local health staff as data collectors. A week later, the data collectors swapped districts, where they acted as disinterested non-local data collectors, repeating the LQAS survey with the same respondents. We analysed the resulting two data sets for agreement using Cohen's kappa. The average kappa score for the knowledge indicators was κ = 0.43 (SD = 0.16) and for practice indicators κ = 0.63 (SD = 0.17). These scores show moderate agreement for knowledge indicators and substantial agreement for practice indicators. Analyses confirm that respondents were more knowledgeable on retest; no evidence of bias was found for practice indicators. The findings of this study are remarkably similar to those produced in the first reliability study. There is no evidence that using local healthcare staff to collect LQAS data biases data collection in an LQAS study. The bias observed in the knowledge indicators was most likely due to a 'practice effect', whereby respondents increased their knowledge as a result of completing the first survey; no corresponding effect was seen in the practice indicators.
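Cohen's kappa corrects the observed agreement between two sets of ratings for the agreement expected by chance: κ = (p_o − p_e) / (1 − p_e). The Python sketch below computes it for paired categorical responses and attaches a verbal label. The abstract does not name its benchmark, so the Landis and Koch bands are assumed here because its wording ("moderate", "substantial") matches them. The example responses are hypothetical.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Cohen's kappa for two equal-length lists of categorical ratings."""
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    counts_1, counts_2 = Counter(first), Counter(second)
    categories = set(first) | set(second)
    # Chance agreement: product of the two marginal proportions per category.
    expected = sum(counts_1[c] * counts_2[c] for c in categories) / n ** 2
    return (observed - expected) / (1 - expected)


def landis_koch_label(kappa):
    """Verbal label using the Landis and Koch bands (an assumed benchmark)."""
    if kappa < 0:
        return "poor"
    for upper, label in [(0.20, "slight"), (0.40, "fair"),
                         (0.60, "moderate"), (0.80, "substantial")]:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical yes/no answers from the same respondents at test and retest.
test_round = [1, 1, 0, 0]
retest_round = [1, 0, 0, 0]
k = cohens_kappa(test_round, retest_round)
print(f"kappa = {k:.2f} ({landis_koch_label(k)})")
```

Applied to the averages reported above, these bands label 0.43 as moderate and 0.63 as substantial, consistent with the abstract's wording.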
The retest distribution of the visual field summary index mean deviation is close to normal.

Science.gov (United States)

Anderson, Andrew J; Cheng, Allan C Y; Lau, Samantha; Le-Pham, Anne; Liu, Victor; Rahman, Farahnaz

2016-09-01

When modelling optimum strategies for how best to determine visual field progression in glaucoma, it is commonly assumed that the summary index mean deviation (MD) is normally distributed on repeated testing. Here we tested whether this assumption is correct. We obtained 42 reliable 24-2 Humphrey Field Analyzer SITA standard visual fields from one eye of each of five healthy young observers, with the first two fields excluded from analysis. Previous work has shown that although MD variability is higher in glaucoma, the shape of the MD distribution is similar to that found in normal visual fields. A Shapiro-Wilk test determined any deviation from normality. Kurtosis values for the distributions were also calculated. Data from each observer passed the Shapiro-Wilk normality test. Bootstrapped 95% confidence intervals for kurtosis encompassed the value for a normal distribution in four of five observers. When examined with quantile-quantile plots, distributions were close to normal and showed no consistent deviations across observers. The retest distribution of MD is not significantly different from normal in healthy observers, and so is likely also normally distributed - or nearly so - in those with glaucoma. Our results increase our confidence in the results of influential modelling studies where a normal distribution for MD was assumed. © 2016 The Authors Ophthalmic & Physiological Optics © 2016 The College of Optometrists.
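The bootstrapped kurtosis interval mentioned in this abstract could be sketched roughly as below. This is an illustration under stated assumptions: the abstract does not name the kurtosis estimator or the bootstrap variant, so the moment-based excess kurtosis and the percentile bootstrap are choices made here, and the 40 MD values are simulated rather than taken from the study.

```python
import random

def excess_kurtosis(values):
    """Moment-based excess kurtosis; 0 for a normal distribution."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2 - 3.0

def bootstrap_ci(values, stat, n_boot=2000, level=0.95, seed=0):
    """Percentile bootstrap interval for stat(values).

    Assumes no resample is constant (true in practice for continuous data).
    """
    rng = random.Random(seed)
    estimates = sorted(
        stat(rng.choices(values, k=len(values))) for _ in range(n_boot)
    )
    lower_idx = max(0, int((1 - level) / 2 * n_boot))
    upper_idx = min(n_boot - 1, int((1 + level) / 2 * n_boot) - 1)
    return estimates[lower_idx], estimates[upper_idx]

# Simulated stand-in for one observer's 40 retest MD values (hypothetical).
rng = random.Random(1)
md_values = [rng.gauss(0.0, 1.0) for _ in range(40)]
low, high = bootstrap_ci(md_values, excess_kurtosis)
# The distribution is compatible with normal kurtosis if low <= 0 <= high.
```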
Test-retest reliability of prefrontal transcranial Direct Current Stimulation (tDCS) effects on functional MRI connectivity in healthy subjects.

Science.gov (United States)

Wörsching, Jana; Padberg, Frank; Helbich, Konstantin; Hasan, Alkomiet; Koch, Lena; Goerigk, Stephan; Stoecklein, Sophia; Ertl-Wagner, Birgit; Keeser, Daniel

2017-07-15

Transcranial Direct Current Stimulation (tDCS) of the prefrontal cortex (PFC) can be used for probing functional brain connectivity and is attracting general interest as a novel therapeutic intervention in psychiatric and neurological disorders. Along with a more extensive use, it is important to understand the interplay between neural systems and stimulation protocols, which requires basic methodological work. Here, we examined the test-retest (TRT) characteristics of tDCS-induced modulations in resting-state functional-connectivity MRI (RS fcMRI). Twenty healthy subjects received 20 minutes of either active or sham tDCS of the dorsolateral PFC (2 mA, anode over F3 and cathode over F4, international 10-20 system), preceded and followed by a RS fcMRI (10 minutes each). All subjects underwent three tDCS sessions with one-week intervals in between. Effects of tDCS on RS fcMRI were determined at an individual as well as at a group level using both ROI-based and independent-component analyses (ICA). To evaluate the TRT reliability of individual active-tDCS and sham effects on RS fcMRI, voxel-wise intra-class correlation coefficients (ICC) of post-tDCS maps between testing sessions were calculated. For both approaches, results revealed low reliability of RS fcMRI after active tDCS (ICC (2,1) = -0.09 to 0.16). Reliability of RS fcMRI (baselines only) was low to moderate for ROI-derived (ICC (2,1) = 0.13 to 0.50) and low for ICA-derived connectivity (ICC (2,1) = 0.19 to 0.34). Thus, for ROI-based analyses, the distribution of voxel-wise ICC was shifted to lower TRT reliability after active, but not after sham tDCS, for which the distribution was similar to baseline. The intra-individual variation observed here resembles variability of tDCS effects in motor regions and may be one reason why in this study robust tDCS effects at a group level were missing. The data can be used for appropriately designing large scale studies investigating methodological issues such as sources of variability and
Development and Psychometric Testing of a Novel Food Service Satisfaction Questionnaire for Food Service Staff of Aged Care Homes.

Science.gov (United States)

Miller, M; Hamilton, J; Scupham, R; Matwiejczyk, L; Prichard, I; Farrer, O; Yaxley, A

2018-01-01

Food service staff are integral to delivery of quality food in aged care homes, yet their satisfaction cannot be measured because no valid and reliable questionnaire exists. The aim of this study was to develop and perform psychometric testing for a new Food Service Satisfaction Questionnaire developed in Australia specifically for use by food service staff working in residential aged care homes (Flinders FSSQFSAC). A mixed methods design utilizing both a qualitative (in-depth interviews, focus groups) and a quantitative approach (cross sectional survey) was used. Content validity was determined from focus groups and interviews with food service staff currently working in aged care homes, related questionnaires from the literature and consultation with an expert panel. The questionnaire was tested for construct validity and internal consistency using data from food service staff currently working in aged care homes who responded to an electronic invitation circulated to Australian aged care homes using a national database of email addresses. Construct validity was tested via principal components analysis and internal consistency through Cronbach's alpha. Temporal stability of the questionnaire was determined from food service staff undertaking the Flinders FSSQFSAC on two occasions, two weeks apart, and analysed using Pearson's correlations. Content validity for the Flinders FSSQFSAC was established from a panel of experts and stakeholders. Principal components analysis revealed food service staff satisfaction was represented by 61 items divided into eight domains: job satisfaction (α=0.832), food quality (α=0.871), staff training (α=0.922), consultation (α=0.840), eating environment (α=0.777), reliability (α=0.695), family expectations (α=0.781) and resident relationships (α=0.429), establishing construct validity in all domains, and internal consistency in all (α>0.5) except for "resident relationships" (α=0.429). Test-retest
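Several records in this listing report Cronbach's alpha per domain. As a hedged illustration of how that coefficient is obtained from item-level responses, the sketch below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the five respondents' scores are hypothetical.

```python
from statistics import pvariance

def cronbach_alpha(item_scores):
    """item_scores: one row per respondent, one column per questionnaire item."""
    k = len(item_scores[0])
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    # Variance of each item (column) across respondents.
    item_variance_sum = sum(pvariance(col) for col in zip(*item_scores))
    # Variance of each respondent's total score.
    total_variance = pvariance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)

# Hypothetical responses of five staff members to a three-item domain.
responses = [
    [3, 4, 3],
    [5, 5, 4],
    [2, 2, 3],
    [4, 3, 4],
    [1, 2, 1],
]
alpha = cronbach_alpha(responses)
```

Population variances are used for both numerator and denominator; using sample variances throughout gives the same ratio and therefore the same alpha.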
Reliability and Validity of the Persian Language Version of the International Consultation on Incontinence Questionnaire - Male Lower Urinary Tract Symptoms (ICIQ-MLUTS).

Science.gov (United States)

Pourmomeny, Abbas Ali; Ghanei, Behnaz; Alizadeh, Farshid

2018-05-01

Assessment instruments are essential for research, allowing diagnosis and evaluation of treatment outcomes in subjects of both genders with lower urinary tract disorders. The purpose of this study was to translate the Male Lower Urinary Tract Symptoms (MLUTS) Questionnaire and determine its psychometric properties in Persian subjects. After obtaining permission from the International Consultation on Incontinence Modular Questionnaire (ICIQ) web site, the forward and backward translations of the MLUTS questionnaire were carried out by the research team. The content/face validity, construct validity and reliability were assessed in a sample of Iranian MLUTS patients, with internal consistency measured by Cronbach's alpha. In total, 121 male patients were included in the study. The mean age of the patients was 60.5 years. The Cronbach's alpha value was 0.757, confirming the internal consistency of the form (r > 0.7). The internal consistency of each question was examined separately and found to be over 0.7. To evaluate test-retest reliability, the test was administered to 20% of the patients a second time after an interval of 1-2 weeks. The intraclass correlation coefficient (ICC) score was 0.901. The correlation coefficient between the MLUTS and International Prostate Symptoms Score (IPSS) was 0.879. ICIQ-MLUTS is a robust instrument, which can be used for evaluating male LUTS in Persian patients. We believe that the Persian version of the MLUTS is an important tool for research and clinical settings. © 2017 John Wiley & Sons Australia, Ltd.
Cosmological consistency tests of gravity theory and cosmic acceleration

Science.gov (United States)

Ishak-Boushaki, Mustapha B.

2017-01-01

Testing general relativity at cosmological scales and probing the cause of cosmic acceleration are among the important objectives targeted by incoming and future astronomical surveys and experiments. I present our recent results on consistency tests that can provide insights about the underlying gravity theory and cosmic acceleration using cosmological data sets. We use statistical measures, the rate of cosmic expansion, the growth rate of large scale structure, and the physical consistency of these probes with one another.
A Scale of Mobbing Impacts

Science.gov (United States)

Yaman, Erkan

2012-01-01

The aim of this research was to develop the Mobbing Impacts Scale and to examine its validity and reliability. The sample of the study consisted of 509 teachers from Sakarya. In this study, construct validity, internal consistency, test-retest reliability and item analysis of the scale were examined. As a result of factor analysis for…
Test-Retest Reliability of Isokinetic Knee Strength Measurements in Children Aged 8 to 10 Years.

Science.gov (United States)

Fagher, Kristina; Fritzson, Annelie; Drake, Anna Maria

Isokinetic dynamometry is a useful tool to objectively assess muscle strength of children and adults in athletic and rehabilitative settings. This study examined test-retest reliability of isokinetic knee strength measurements in children aged 8 to 10 years and defined limits for the minimum difference (MD) in strength that indicates a clinically important change. The hypothesis was that isokinetic knee strength measurements (using the Biodex System 4) in children would provide reliable results. This was a descriptive laboratory study. In 22 healthy children, 5 maximal concentric (CON) knee extensor (KE) and knee flexor (KF) contractions at 2 angular velocities (60 deg/s and 180 deg/s) and 5 maximal eccentric (ECC) KE/KF contractions at 60 deg/s were assessed 7 days apart. The intraclass correlation coefficient (ICC 2.1) was used to examine relative reliability, and the MD was calculated on the basis of standard error of measurement. ICCs for CON KE/KF peak torque measurements were fair to excellent (range, 0.49-0.81). The MD% values for CON KE and KF ranged from 31% to 37% at 60 deg/s and from 34% to 39% at 180 deg/s. ICCs in the ECC mode were good (range, 0.60-0.70), but associated MD% values were high (>50%). There was no systematic error for CON KE/KF and ECC KE strength measurements at 60 deg/s, but systematic error was found for all other measurements. The dynamometer provides a reliable analysis of isokinetic CON knee strength measurements at 60 deg/s in children aged 8 to 10 years. Measurements at 180 deg/s and in the ECC mode were not reliable, indicating a need for more familiarization prior to testing. The MD values may help clinicians to determine whether a change in knee strength is due to error or intervention.
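The relative-reliability index used in this abstract, the two-way random-effects, absolute-agreement, single-measurement ICC (commonly written ICC(2,1)), can be sketched from the two-way ANOVA mean squares as below. The function name and the peak torque values are hypothetical and only illustrate the calculation; they are not the study's data.

```python
def icc_2_1(scores):
    """ICC(2,1) from a subjects-by-sessions table.

    scores: one row per subject, one column per session (or rater).
    """
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(col) / n for col in zip(*scores)]
    # Sums of squares for subjects (rows), sessions (columns) and total.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )

# Hypothetical peak torque (Nm) for five children, tested 7 days apart.
torque = [
    [52.0, 55.0],
    [61.0, 58.0],
    [47.0, 50.0],
    [70.0, 66.0],
    [58.0, 60.0],
]
icc = icc_2_1(torque)
```

Because the session mean square enters the denominator, a systematic shift between sessions lowers this ICC, which is why it is described as an absolute-agreement coefficient.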
Psychometric properties of the Neck OutcOme Score, Neck Disability Index, and Short Form-36 were evaluated in patients with neck pain

DEFF Research Database (Denmark)

Juul, Tina; Søgaard, Karen; Davis, Aileen M.

2016-01-01

Objective: To assess reliability, construct validity, responsiveness, and interpretability for Neck OutcOme Score (NOOS), Neck Disability Index (NDI), and Short Form–36 (SF-36) in neck pain patients. Study Design and Setting: Internal consistency was assessed by Cronbach alpha. Test-retest reliabi...
Measurement of acute nonspecific low back pain perception in primary care physical therapy: reliability and validity of the brief illness perception questionnaire.

Science.gov (United States)

Hallegraeff, Joannes M; van der Schans, Cees P; Krijnen, Wim P; de Greef, Mathieu H G

2013-02-01

The eight-item Brief Illness Perception Questionnaire is used as a screening instrument in physical therapy to assess mental defeat in patients with acute low back pain; in addition, patient perception might determine the course of, and risk for, chronic low back pain. However, the psychometric properties of the Brief Illness Perception Questionnaire in common musculoskeletal disorders like acute low back pain have not been adequately studied. Patients' perceptions vary across different populations and affect coping styles. Thus, our aim was to determine the internal consistency, test-retest reliability and validity of the Dutch language version of the Brief Illness Perception Questionnaire in acute non-specific low back pain patients in primary care physical therapy. A non-experimental cross-sectional study with two measurements was performed. Eighty-four acute low back pain patients, in a multidisciplinary health care center in Dutch primary care with a sample mean (SD) age of 42 (12) years, participated in the study. Internal consistency (Cronbach's α) and test-retest procedures (Intraclass Correlation Coefficients and limits of agreement) were evaluated at a one-week interval. The concurrent validity of the Brief Illness Perception Questionnaire was examined by using the Mental Health Component of the Short Form 36 Health Survey. The Cronbach's α for internal consistency was 0.73 (95% CI, 0.67 - 0.83), and the Intraclass Correlation Coefficient for test-retest reliability was acceptable at 0.72 (95% CI, 0.53 - 0.82); however, the limits of agreement were large. The Intraclass Correlation Coefficient measuring concurrent validity was 0.65 (95% CI, 0.46 - 0.80). The Dutch version of the Brief Illness Perception Questionnaire is an appropriate instrument for measuring patients' perceptions in acute low back pain patients, showing acceptable internal consistency and reliability. Concurrent validity is adequate; however, the instrument may be unsuitable for detecting changes in low
Potential application of the consistency approach for vaccine potency testing.

Science.gov (United States)

Arciniega, J; Sirota, L A

2012-01-01

The Consistency Approach offers the possibility of reducing the number of animals used for a potency test. However, it is critical to assess the effect that such reduction may have on assay performance. Consistency of production, sometimes referred to as consistency of manufacture or manufacturing, is an old concept implicit in regulation, which aims to ensure the uninterrupted release of safe and effective products. Consistency of manufacture can be described in terms of process capability, or the ability of a process to produce output within specification limits. For example, the standard method for potency testing of inactivated rabies vaccines is a multiple-dilution vaccination challenge test in mice that gives a quantitative, although highly variable estimate. On the other hand, a single-dilution test that does not give a quantitative estimate, but rather shows if the vaccine meets the specification has been proposed. This simplified test can lead to a considerable reduction in the number of animals used. However, traditional indices of process capability assume that the output population (potency values) is normally distributed, which clearly is not the case for the simplified approach. Appropriate computation of capability indices for the latter case will require special statistical considerations.
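The traditional process capability indices referred to in this abstract are usually written Cp = (USL - LSL) / (6σ) and Cpk = min(USL - μ, μ - LSL) / (3σ). The sketch below computes both from a sample. The potency values and specification limits are invented for illustration, and, as the abstract itself stresses, these indices presuppose normally distributed output, so they would not apply directly to a pass/fail single-dilution test.

```python
from statistics import mean, stdev

def process_capability(values, lsl, usl):
    """Return (Cp, Cpk) for a sample and lower/upper specification limits.

    Assumes approximately normal output; sigma is the sample standard deviation.
    """
    mu = mean(values)
    sigma = stdev(values)
    cp = (usl - lsl) / (6 * sigma)
    cpk = min(usl - mu, mu - lsl) / (3 * sigma)
    return cp, cpk

# Hypothetical potency estimates (arbitrary units) for eight vaccine lots,
# with made-up specification limits of 2.5 and 9.0.
potencies = [5.1, 4.8, 5.6, 5.0, 4.7, 5.3, 5.2, 4.9]
cp, cpk = process_capability(potencies, lsl=2.5, usl=9.0)
```

Cpk is never larger than Cp; the two coincide only when the process mean sits midway between the limits.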
A Brazilian-Portuguese version of the Kinesthetic and Visual Motor Imagery Questionnaire.

Science.gov (United States)

Demanboro, Alan; Sterr, Annette; Anjos, Sarah Monteiro Dos; Conforto, Adriana Bastos

2018-01-01

Motor imagery has emerged as a potential rehabilitation tool in stroke. The goals of this study were: 1) to develop a translated and culturally-adapted Brazilian-Portuguese version of the Kinesthetic and Visual Motor Imagery Questionnaire (KVIQ20-P); 2) to evaluate the psychometric characteristics of the scale in a group of patients with stroke and in an age-matched control group; 3) to compare the KVIQ20 performance between the two groups. Test-retest, inter-rater reliabilities, and internal consistencies were evaluated in 40 patients with stroke and 31 healthy participants. In the stroke group, ICC confidence intervals showed excellent test-retest and inter-rater reliabilities. Cronbach's alpha also indicated excellent internal consistency. Results for controls were comparable to those obtained in persons with stroke. The excellent psychometric properties of the KVIQ20-P should be considered during the design of studies of motor imagery interventions for stroke rehabilitation.
Construct Validity of the Nutrition and Activity Knowledge Scale in a French Sample of Adolescents with Mild to Moderate Intellectual Disability

Science.gov (United States)

Maiano, Christophe; Begarie, Jerome; Morin, Alexandre J. S.; Garbarino, Jean-Marie; Ninot, Gregory

2010-01-01

The purpose of this study was to test the reliability (i.e. internal consistency and test-retest reliability) and construct validity (i.e. content validity, factor validity, measurement invariance, and latent mean invariance) of the Nutrition and Activity Knowledge Scale (NAKS) in a sample of French adolescents with mild to moderate Intellectual…
Developing a Danish version of the "Impact on Participation and Autonomy Questionnaire".

Science.gov (United States)

Ghaziani, Emma; Krogh, Anne Grethe; Lund, Hans

2013-05-01

To translate the "Impact on Participation and Autonomy Questionnaire" into Danish (IPAQ-DK), and estimate its internal consistency and test-retest reliability in order to promote participation-based interventions and research. Translation and two successive reliability assessments through test-retest. 137 adults with varying degrees of impairment; of these, 67 participated in the final reliability assessment. The translation followed guidelines set forth by the "European Group for Quality of Life Assessment and Health Measurement". Internal consistency for subscales was estimated by Cronbach's alpha. Weighted kappa coefficients and intraclass correlation coefficients were calculated to assess the test-retest reliability at item and subscale level, respectively. A preliminary reliability assessment revealed residual issues regarding the translation and cultural adaptation of the instrument. The revised version (IPAQ-DK) was subsequently subjected to a similar assessment demonstrating Cronbach's alpha values from 0.698 to 0.817. Weighted kappa ranged from 0.370 to 0.880; 78% of these values were higher than 0.600. The intraclass correlation coefficient covered values from 0.701 to 0.818. IPAQ-DK is a useful instrument for identifying person-perceived participation restrictions and satisfaction with participation. Further studies are recommended on IPAQ-DK's floor/ceiling effects and responsiveness to change, and on whether certain items need further linguistic improvement.
Design and validation of a comprehensive fecal incontinence questionnaire.

Science.gov (United States)

Macmillan, Alexandra K; Merrie, Arend E H; Marshall, Roger J; Parry, Bryan R

2008-10-01

Fecal incontinence can have a profound effect on quality of life. Its prevalence remains uncertain because of stigma, lack of consistent definition, and dearth of validated measures. This study was designed to develop a valid clinical and epidemiologic questionnaire, building on current literature and expertise. Patients and experts undertook face validity testing. Construct validity, criterion validity, and test-retest reliability were assessed. Construct validity comprised factor analysis and internal consistency of the quality of life scale. The validity of known groups was tested against 77 control subjects by using regression models. Questionnaire results were compared with a stool diary for criterion validity. Test-retest reliability was calculated from repeated questionnaire completion. The questionnaire achieved good face validity. It was completed by 104 patients. The quality of life scale had four underlying traits (factor analysis) and high internal consistency (overall Cronbach alpha = 0.97). Patients and control subjects answered the questionnaire significantly differently (P validity testing. Criterion validity assessment found mean differences close to zero. Median reliability for the whole questionnaire was 0.79 (range, 0.35-1). This questionnaire compares favorably with other available instruments, although the interpretation of stool consistency requires further research. Its sensitivity to treatment still needs to be investigated.
Reliability of the Dutch translation of the Kujala Patellofemoral Score Questionnaire.

Science.gov (United States)

Ummels, P E J; Lenssen, A F; Barendrecht, M; Beurskens, A J H M

2017-01-01

There are no Dutch language disease-specific questionnaires for patients with patellofemoral pain syndrome available that could help Dutch physiotherapists to assess and monitor these symptoms and functional limitations. The aim of this study was to translate the original disease-specific Kujala Patellofemoral Score into Dutch and evaluate its reliability. The questionnaire was translated from English into Dutch in accordance with internationally recommended guidelines. Reliability was determined in 50 stable subjects with an interval of 1 week. The patient inclusion criteria were age between 14 and 60 years; knowledge of the Dutch language; and the presence of at least three of the following symptoms: pain while taking the stairs, pain when squatting, pain when running, pain when cycling, pain when sitting with knees flexed for a prolonged period, grinding of the patella and a positive clinical patella test. The internal consistency, test-retest reliability, measurement error and limits of agreement were calculated. Internal consistency was 0.78 for the first assessment and 0.80 for the second assessment. The intraclass correlation coefficient (ICC agreement) between the first and second assessments was 0.98. The mean difference between the first and second measurements was 0.64, and the standard deviation was 5.51. The standard error of measurement was 3.9, and the smallest detectable change was 11. The Bland and Altman plot shows that the limits of agreement are -10.37 and 11.65. The results of the present study indicate that the test-retest reliability of the translated Dutch version of the Kujala Patellofemoral Score questionnaire is equivalent to that of the original English-language version and that it has good internal consistency. Trial registration NTR (TC = 3258). Copyright © 2015 John Wiley & Sons, Ltd.
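The measurement-error figures in this abstract can be related to one another with a short sketch. It assumes the common conventions SEM = SD of the differences / sqrt(2), smallest detectable change = 1.96 * sqrt(2) * SEM, and limits of agreement = mean difference ± 1.96 * SD of the differences; the abstract does not state which formulas the authors used, and the function name is hypothetical.

```python
import math

def agreement_stats(mean_diff, sd_diff, z=1.96):
    """Return (SEM, SDC, limits of agreement) from test-retest differences."""
    sem = sd_diff / math.sqrt(2)      # standard error of measurement
    sdc = z * math.sqrt(2) * sem      # smallest detectable change
    limits = (mean_diff - z * sd_diff, mean_diff + z * sd_diff)
    return sem, sdc, limits

# Mean difference and SD of differences as reported in the abstract.
sem, sdc, limits = agreement_stats(mean_diff=0.64, sd_diff=5.51)
```

Under these assumptions the SEM rounds to 3.9 and the SDC to 11, matching the abstract. The limits computed with z = 1.96 are slightly narrower than the reported -10.37 and 11.65, which suggests the authors may have used a multiplier closer to 2; that is an inference, not something the abstract states.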
The Validity and Reliability of the Mobbing Scale (MS)

Science.gov (United States)

Yaman, Erkan

2009-01-01

The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Test-retest reliability of 11C-ORM-13070 in PET imaging of α2C-adrenoceptors in vivo in the human brain

Energy Technology Data Exchange (ETDEWEB)

Lehto, Jussi; Peltonen, Juha M.; Volanen, Iina; Scheinin, Mika [University of Turku, Clinical Research Services Turku CRST, Turku (Finland); TYKSLAB, Unit of Clinical Pharmacology, Turku (Finland); Virta, Jere R. [University of Turku and Turku University Hospital, Turku PET Centre, Turku (Finland); Turku University Hospital, Division of Clinical Neurosciences, Turku (Finland); Oikonen, Vesa; Roivainen, Anne; Luoto, Pauliina; Arponen, Eveliina; Helin, Semi; Virtanen, Kirsi [University of Turku and Turku University Hospital, Turku PET Centre, Turku (Finland); Hietamaeki, Johanna; Holopainen, Aila; Rouru, Juha; Sallinen, Jukka [Orion Pharma, Turku (Finland); Kailajaervi, Marita [Turku Imanet, GE Healthcare, Turku (Finland); Rinne, Juha O. [University of Turku and Turku University Hospital, Turku PET Centre, Turku (Finland); Turku University Hospital, Division of Clinical Neurosciences, Turku (Finland); University of Turku, Clinical Research Services Turku CRST, Turku (Finland)

2015-01-15

α2C-Adrenoceptors share inhibitory presynaptic functions with the more abundant α2A-adrenoceptor subtype, but they also have widespread postsynaptic modulatory functions in the brain. Research on the noradrenergic system of the human brain has been hampered by the lack of suitable PET tracers targeted to the α2-adrenoceptor subtypes. PET imaging with the specific α2C-adrenoceptor antagonist tracer [11C]ORM-13070 was performed twice in six healthy male subjects to investigate the test-retest reliability of tracer binding. The bound/free ratio of tracer uptake relative to nonspecific uptake into the cerebellum during the time interval of 5 - 30 min was most prominent in the dorsal striatum: 0.77 in the putamen and 0.58 in the caudate nucleus. Absolute test-retest variability in bound/free ratios of tracer ranged from 4.3 % in the putamen to 29 % in the hippocampus. Variability was also <10 % in the caudate nucleus and thalamus. Intraclass correlation coefficients (ICC) ranged from 0.50 in the hippocampus to 0.89 in the thalamus (ICC >0.70 was also reached in the caudate nucleus, putamen, lateral frontal cortex and parietal cortex). The pattern of [11C]ORM-13070 binding, as determined by PET, was in good agreement with receptor density results previously derived from post-mortem autoradiography. PET data analysis results obtained with a compartmental model fit, the simplified reference tissue model and a graphical reference tissue analysis method were convergent with the tissue ratio method. The results of this study support the use of [11C]ORM-13070 PET in the quantitative assessment of α2C-adrenoceptors in the human brain in vivo. Reliable assessment of specific tracer binding in the dorsal striatum is possible with the help of reference tissue ratios.

Development, content validity and test-retest reliability of the Lifelong Physical Activity Skills Battery in adolescents.

Science.gov (United States)

Hulteen, Ryan M; Barnett, Lisa M; Morgan, Philip J; Robinson, Leah E; Barton, Christian J; Wrotniak, Brian H; Lubans, David R

2018-03-28

Numerous skill batteries assess fundamental motor skill (e.g., kick, hop) competence. Few skill batteries examine lifelong physical activity skill competence (e.g., resistance training). This study aimed to develop and assess the content validity, test-retest and inter-rater reliability of the "Lifelong Physical Activity Skills Battery". Development of the skill battery occurred in three stages: i) systematic reviews of lifelong physical activity participation rates and existing motor skill assessment tools, ii) practitioner consultation and iii) research expert consultation. The final battery included eight skills: grapevine, golf swing, jog, push-up, squat, tennis forehand, upward dog and warrior I. Adolescents (28 boys, 29 girls; M = 15.8 years, SD = 0.4 years) completed the Lifelong Physical Activity Skills Battery on two occasions two weeks apart. The skill battery was highly reliable (ICC = 0.84, 95% CI = 0.72-0.90) with individual skill reliability scores ranging from moderate (warrior I; ICC = 0.56) to high (tennis forehand; ICC = 0.82). Typical error (4.0; 95% CI 3.4-5.0) and proportional bias (r = -0.21, p = .323) were low. This study has provided preliminary evidence for the content validity and reliability of the Lifelong Physical Activity Skills Battery in an adolescent population.
SEQUenCE: a service user-centred quality of care instrument for mental health services.

Science.gov (United States)

Hester, Lorraine; O'Doherty, Lorna Jane; Schnittger, Rebecca; Skelly, Niamh; O'Donnell, Muireann; Butterly, Lisa; Browne, Robert; Frorath, Charlotte; Morgan, Craig; McLoughlin, Declan M; Fearon, Paul

2015-08-01

To develop a quality of care instrument that is grounded in the service user perspective and validate it in a mental health service. The instrument (SEQUenCE (SErvice user QUality of CarE)) was developed through analysis of focus group data and clinical practice guidelines, and refined through field-testing and psychometric analyses. All participants were attending an independent mental health service in Ireland. Participants had a diagnosis of bipolar affective disorder (BPAD) or a psychotic disorder. Twenty-nine service users participated in six focus group interviews. Seventy-one service users participated in field-testing: 10 judged the face validity of an initial 61-item instrument; 28 completed a revised 52-item instrument from which 12 items were removed following test-retest and convergent validity analyses; 33 completed the resulting 40-item instrument. Test-retest reliability, internal consistency and convergent validity of the instrument. The final instrument showed acceptable test-retest reliability at 5-7 days (r = 0.65; P Service Satisfaction Scale (r = 0.84, P internal consistency (Cronbach's alpha = 0.87). SEQUenCE is a valid, reliable scale that is grounded in the service user perspective and suitable for routine use. It may serve as a useful tool in individual care planning, service evaluation and research. The instrument was developed and validated with service users with a diagnosis of either BPAD or a psychotic disorder; it does not yet have established external validity for other diagnostic groups. © The Author 2015. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Reliabilities of mental rotation tasks: limits to the assessment of individual differences.

Science.gov (United States)

Hirschfeld, Gerrit; Thielsch, Meinald T; Zernikow, Boris

2013-01-01

Mental rotation tasks with objects and body parts as targets are widely used in cognitive neuropsychology. Even though these tasks are well established to study between-groups differences, the reliability on an individual level is largely unknown. We present a systematic study on the internal consistency and test-retest reliability of individual differences in mental rotation tasks comparing different target types and orders of presentations. In total n = 99 participants (n = 63 for the retest) completed the mental rotation tasks with hands, feet, faces, and cars as targets. Different target types were presented in either randomly mixed blocks or blocks of homogeneous targets. Across all target types, the consistency (split-half reliability) and stability (test-retest reliabilities) were good or acceptable both for intercepts and slopes. At the level of individual targets, only intercepts showed acceptable reliabilities. Blocked presentations resulted in significantly faster and numerically more consistent and stable responses. Mental rotation tasks, especially in blocked variants, can be used to reliably assess individual differences in global processing speed. However, the assessment of the theoretically important slope parameter for individual targets requires further adaptations to mental rotation tests.
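Split-half reliability, the consistency measure named in this abstract, can be sketched as follows. The odd/even split and the Spearman-Brown correction are assumptions made for illustration, since the abstract does not describe how the halves were formed or corrected, and the reaction times are invented.

```python
import math

def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def spearman_brown(r_half):
    """Step a half-test correlation up to full test length."""
    return 2 * r_half / (1 + r_half)

def split_half_reliability(trials_by_participant):
    """Correlate per-participant means of odd and even trials, then correct."""
    odd = [sum(t[0::2]) / len(t[0::2]) for t in trials_by_participant]
    even = [sum(t[1::2]) / len(t[1::2]) for t in trials_by_participant]
    return spearman_brown(pearson_r(odd, even))

# Hypothetical reaction times (ms) over six trials for five participants.
reaction_times = [
    [820, 790, 845, 810, 830, 805],
    [1010, 980, 1040, 995, 1000, 1020],
    [650, 700, 640, 690, 660, 675],
    [900, 870, 930, 910, 880, 905],
    [760, 800, 750, 770, 790, 765],
]
reliability = split_half_reliability(reaction_times)
```

The same two helper functions would apply to slope or intercept estimates fitted separately to each half, which is closer to what the abstract describes.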
Validation and Test-Retest Reliability of New Thermographic Technique Called Thermovision Technique of Dry Needling for Gluteus Minimus Trigger Points in Sciatica Subjects and TrPs-Negative Healthy Volunteers

Science.gov (United States)

Rychlik, Michał; Samborski, Włodzimierz

2015-01-01

The aim of this study was to assess the validity and test-retest reliability of Thermovision Technique of Dry Needling (TTDN) for the gluteus minimus muscle. TTDN is a new thermography approach used to support trigger points (TrPs) diagnostic criteria by presence of short-term vasomotor reactions occurring in the area where TrPs refer pain. Method. Thirty chronic sciatica patients (n=15 TrP-positive and n=15 TrPs-negative) and 15 healthy volunteers were evaluated by TTDN three times during two consecutive days based on TrPs of the gluteus minimus muscle confirmed additionally by referred pain presence. TTDN employs average temperature (T avr), maximum temperature (T max), low/high isothermal-area, and autonomic referred pain phenomenon (AURP) that reflects vasodilatation/vasoconstriction. Validity and test-retest reliability were assessed concurrently. Results. Two components of TTDN validity and reliability, T avr and AURP, had almost perfect agreement according to κ (e.g., thigh: 0.880 and 0.938; calf: 0.902 and 0.956, resp.). The sensitivity for T avr, T max, AURP, and high isothermal-area was 100% for everyone, but specificity of 100% was for T avr and AURP only. Conclusion. TTDN is a valid and reliable method for T avr and AURP measurement to support TrPs diagnostic criteria for the gluteus minimus muscle when digitally evoked referred pain pattern is present. PMID:26137486
The IPR inventory: development and psychometric characteristics.

Science.gov (United States)

Tilden, V P; Nelson, C A; May, B A

1990-01-01

The purpose of this study was to develop, validate, and norm a measure of dimensions of interpersonal relationships that are salient to nursing: social support, reciprocity, and conflict. The selection of these concepts was guided by social exchange and equity theories. In the first phase of the study, 44 respondents were interviewed to provide narrative data from which to develop items so that items would be grounded in lived experience. Content validity of items was judged by a panel of 11 experts. The revised 39-item instrument was tested in successive steps with a total of 340 students, patients, and community residents for reliability and validity, including internal consistency reliability, test-retest reliability, factor analysis, and three forms of validity assessment (theory testing, contrasted groups, and multitrait-multimethod comparison). The three subscales of social support, reciprocity, and conflict demonstrated repeated internal consistency and test-retest reliability. Strong evidence of construct validity was demonstrated for the social support and the conflict subscales; validity of the reciprocity subscale was equivocal.
Test-retest reproducibility of the metabotropic glutamate receptor 5 ligand [18F]FPEB with bolus plus constant infusion in humans

Energy Technology Data Exchange (ETDEWEB)

Park, Eunkyung; Sullivan, Jenna M.; Planeta, Beata; Gallezot, Jean-Dominique; Lim, Keunpoong; Lin, Shu-Fei; Ropchan, Jim; Huang, Yiyun; Carson, Richard E. [Yale School of Medicine, PET Center, Department of Diagnostic Radiology, 801 Howard Avenue, PO Box 208048, New Haven, CT (United States); McCarthy, Timothy J. [Pfizer Worldwide Research and Development, Cambridge, MA (United States); Ding, Yu-Shin [New York University School of Medicine, Department of Radiology, New York, NY (United States); Morris, Evan D.; Williams, Wendol A. [Yale School of Medicine, PET Center, Department of Diagnostic Radiology, 801 Howard Avenue, PO Box 208048, New Haven, CT (United States); Yale School of Medicine, Department of Psychiatry, New Haven, CT (United States)

2015-09-15

[18F]FPEB is a promising PET radioligand for the metabotropic glutamate receptor 5 (mGluR5), a potential target for the treatment of neuropsychiatric diseases. The purpose of this study was to evaluate the test-retest reproducibility of [18F]FPEB in the human brain. Seven healthy male subjects were scanned twice, 3-11 weeks apart. Dynamic data were acquired using bolus plus infusion of 162 ± 32 MBq [18F]FPEB. Four methods were used to estimate volume of distribution (V_T): equilibrium analysis (EQ) using arterial (EQ_A) or venous input data (EQ_V), MA1, and a two-tissue compartment model (2T). Binding potential (BP_ND) was also estimated using cerebellar white matter (CWM) or gray matter (CGM) as the reference region using EQ, 2T and MA1. Absolute test-retest variability (aTRV) of V_T and BP_ND was calculated for each method. Venous blood measurements (C_V) were compared with arterial input (C_A) to examine their usability in EQ analysis. Regional V_T estimated by the four methods displayed a high degree of agreement (r^2 ranging from 0.83 to 0.99 among the methods), although EQ_A and EQ_V overestimated V_T by a mean of 9% and 7%, respectively, compared to 2T. Mean values of aTRV of V_T were 11% by EQ_A, 12% by EQ_V, 14% by MA1 and 14% by 2T. Regional BP_ND also agreed well among the methods and mean aTRV of BP_ND was 8-12% (CWM) and 7-9% (CGM). Venous and arterial blood concentrations of [18F]FPEB were well matched during equilibrium (C_V = 1.01 × C_A, r^2 = 0.95). [18F]FPEB binding shows good TRV with minor differences among analysis methods. Venous blood can be used as an alternative for input function measurement instead of arterial blood in EQ analysis. Thus, [18F]FPEB is an excellent PET imaging tracer for mGluR5 in humans. (orig.)
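The abstract does not state how absolute test-retest variability was computed. The sketch below assumes one commonly used definition, the absolute difference between the two scans divided by their mean, expressed as a percentage and averaged over subjects. The function names are hypothetical.

```python
from statistics import mean


def atrv_percent(test, retest):
    """Per-subject absolute test-retest variability in percent.

    Assumed definition: |test - retest| / mean(test, retest) * 100.
    """
    return [abs(t - r) / ((t + r) / 2) * 100 for t, r in zip(test, retest)]


def mean_atrv_percent(test, retest):
    """Average aTRV across subjects for one region and one analysis method."""
    return mean(atrv_percent(test, retest))
```

Dividing by the mean of the two scans rather than by the first scan keeps the measure symmetric, so swapping test and retest gives the same value.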
Greek cultural adaption and validation of the Kujala anterior knee pain scale in patients with patellofemoral pain syndrome.

Science.gov (United States)

Papadopoulos, Costas; Constantinou, Antonis; Cheimonidou, Areti-Zoi; Stasinopoulos, Dimitrios

2017-04-01

To cross-culturally adapt and validate the Greek version of the Kujala anterior knee pain scale (KAKPS). The Greek KAKPS was translated from the original English version following standard forward and backward translation procedures. The survey was then conducted in clinical settings by a questionnaire comprising the Greek KAKPS and patellofemoral pain syndrome (PFPS) severity scale. A total of 130 (62 women and 68 men) Greek-reading patients between 18 and 45 years old with anterior knee pain (AKP) for at least four weeks were recruited from physical therapy clinics. To establish test-retest reliability, the patients were asked to complete the KAKPS at initial visit and 2-3 days after the initial visit. The Greek version of the PFPS severity scale was also administered once at initial visit. Internal consistency of the translated instrument was measured using Cronbach's α. An intraclass correlation coefficient was used to assess the test-retest reliability of the KAKPS. Concurrent validity was measured by correlating the KAKPS with the PFPS severity scale using Pearson's correlation coefficient. The results showed that the Greek KAKPS has good internal consistency (Cronbach's α = 0.942), test-retest reliability (ICC = 0.921) and concurrent validity (r > 0.7). This study has shown that the Greek KAKPS has good internal consistency, test-retest reliability and concurrent validity when correlated with the PFPS severity scale in adult patients with AKP for at least four weeks. Implications for rehabilitation: The Greek version of the KAKPS has been found to be reliable and valid when used in adult patients with AKP for at least four weeks. The results of the psychometric characteristics were compatible with those of the original English version. The KAKPS could be applied in a Greek-speaking population to assess functional limitations and symptoms in patients aged 18-45 years old with AKP for at least four weeks.
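Cronbach's α, used here and in many other records in this listing, can be sketched as follows. This is a minimal illustration of the standard formula, α = k/(k−1) × (1 − Σ item variances / variance of total scores), not the software the authors used. The function name and data layout are assumptions.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of respondents, each a list of k item scores
    (hypothetical layout; every respondent must answer all k items).
    """
    k = len(scores[0])
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(item) for item in zip(*scores)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)
```

Alpha approaches 1 when the items rise and fall together across respondents, and drops when item scores vary independently of one another.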
Psychometric evaluation of the impact of weight on quality of life-lite questionnaire (IWQOL-lite) in a community sample.

Science.gov (United States)

Kolotkin, Ronette L; Crosby, Ross D

2002-03-01

The short form of impact of weight on quality of life (IWQOL)-Lite is a 31-item, self-report, obesity-specific measure of health-related quality of life (HRQOL) that consists of a total score and scores on each of five scales--physical function, self-esteem, sexual life, public distress, and work--and that exhibits strong psychometric properties. This study was undertaken in order to assess test-retest reliability and discriminant validity in a heterogeneous sample of individuals not in treatment. Individuals were recruited from the community to complete questionnaires that included the IWQOL-Lite, SF-36, Rosenberg self-esteem (RSE) scale, Marlowe-Crowne social desirability scale, global ratings of quality of life, and sexual functioning and public distress ratings. Persons currently enrolled in weight loss programs or with a body mass index (BMI) of less than 18.5 were dropped from the analyses, leaving 341 females and 153 males for analysis, with an average BMI of 27.4. For test-retest reliability, 112 participants completed the IWQOL-Lite again. ANOVA revealed significant main effects for BMI for all IWQOL-Lite scales and total score. Females showed greater impairment than males on all scales except public distress. Internal consistency ranged from 0.816 to 0.944 for IWQOL-Lite scales and was 0.958 for total score. Test-retest reliability ranged from 0.814 to 0.877 for scales and was 0.937 for total score. Internal consistency and test-retest results for overweight/obese subjects were similar to those obtained for the total sample. There was strong evidence for convergent and discriminant validity of the IWQOL-Lite in overweight/obese subjects. As in previous studies conducted on treatment-seeking obese persons, the IWQOL-Lite appears to be a reliable and valid measure of obesity-specific quality of life in overweight/obese persons not seeking treatment.
[Developing Perceived Competence Scale (PCS) for Adolescents].

Science.gov (United States)

Özer, Arif; Gençtanirim Kurt, Dilek; Kizildağ, Seval; Demırtaş Zorbaz, Selen; Arici Şahın, Fatma; Acar, Tülin; Ergene, Tuncay

2016-01-01

In this study, the Perceived Competence Scale was developed to measure high school students' perceived competence. The scale development process was verified on three different samples. The participants were high school students from Ankara in the 2011-2012 academic year. The numbers of participants in the exploratory factor analysis, confirmatory factor analysis, and test-retest reliability samples were 372, 668, and 75, respectively. Internal consistency coefficients (Cronbach's and stratified α) were calculated separately for each group. The Factor 8.02 and LISREL 8.70 packages were used for data analysis. According to the results, the internal consistency coefficients (α) were .90 - .93 for academic competence and .82 - .86 for social competence in the samples on which the exploratory and confirmatory factor analyses were performed. For the whole scale, the internal consistency coefficient (stratified α) was .91. For test-retest reliability, the adjusted correlation coefficients (r) were .94 for social competence and .90 for academic competence. In addition to the fit indexes and regression weights obtained from factor analysis, findings on convergent and discriminant validity indicate that competence can be addressed in two dimensions: academic (16 items) and social (14 items).
Content validation: clarity/relevance, reliability and internal consistency of enunciative signs of language acquisition.

Science.gov (United States)

Crestani, Anelise Henrich; Moraes, Anaelena Bragança de; Souza, Ana Paula Ramos de

2017-08-10

To analyze the results of the validation of enunciative signs of language acquisition constructed for children aged 3 to 12 months. The signs were built based on mechanisms of language acquisition in an enunciative perspective and on clinical experience with language disorders. The signs were submitted to judgment of clarity and relevance by a sample of six experts, all holding doctorates in linguistics, with knowledge of psycholinguistics and the language clinic. In the validation of reliability, two judges/evaluators helped to apply the instruments to videos of 20% of the total sample of mother-infant dyads using the inter-evaluator method. The method known as internal consistency was applied to the total sample, which consisted of 94 mother-infant dyads for the contents of Phase 1 (3 to 6 months) and 61 mother-infant dyads for the contents of Phase 2 (7 to 12 months). The data were collected through the analysis of mother-infant interaction based on filming of dyads and application of the parameters to be validated according to the child's age. Data were organized in a spreadsheet and then imported into statistical analysis software. The judgments of clarity/relevance indicated that no modifications to the instruments were needed. The reliability test showed an almost perfect agreement between judges (0.8 ≤ Kappa ≤ 1.0); only item 2 of Phase 1 showed substantial agreement (0.6 ≤ Kappa ≤ 0.79). The internal consistency was alpha = 0.84 for Phase 1 and alpha = 0.74 for Phase 2. This demonstrates the reliability of the instruments. The results suggest adequacy as to content validity of the instruments created for both age groups, demonstrating the relevance of the content of enunciative signs of language acquisition.
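The Kappa statistic behind the inter-judge agreement figures can be sketched as below, assuming Cohen's kappa for two raters who each assign one category per case. The abstract does not say which kappa variant was used, and the function name is hypothetical.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labelling the same cases.

    Undefined (division by zero) when chance agreement equals 1,
    i.e. when both raters use a single identical category throughout.
    """
    n = len(rater_a)
    # Proportion of cases on which the two raters agree.
    observed = sum(1 for a, b in zip(rater_a, rater_b) if a == b) / n
    # Agreement expected by chance from each rater's category frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    return (observed - expected) / (1 - expected)
```

Unlike raw percent agreement, kappa subtracts the agreement that two raters would reach by chance given how often each uses every category.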
Studies on the consistency of internally taken contrast medium for pancreas CT

Energy Technology Data Exchange (ETDEWEB)

Matsushima, Kishio; Mimura, Seiichi; Tahara, Seiji; Kitayama, Takuichi; Inamura, Keiji; Mikami, Yasutaka; Hashimoto, Keiji; Hiraki, Yoshio; Aono, Kaname

1985-02-01

A problem in pancreatic CT scanning is the discrimination between the pancreas and the adjacent gastrointestinal tract. Generally, we administer a dilution of gastrografin internally to make the discrimination. The degree of dilution has been decided by experience at each hospital. When the consistency of the contrast medium is low, an enhancement effect cannot be expected, but when the consistency is high, artifacts appear. We experimented with the degree of dilution and the resulting CT number to decide the optimum consistency of gastrografin for the diagnosis of pancreatic disease. Statistical analysis of the results shows the optimum dilution of gastrografin to be 1.5%.
Reliability of attitude and knowledge items and behavioral consistency in the validated sun exposure questionnaire in a Danish population based sample

DEFF Research Database (Denmark)

Køster, Brian; Søndergaard, Jens; Nielsen, Jesper Bo

2018-01-01

An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high. The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed... questionnaire for monitoring and evaluating population sun-related behavior. Sun-related behavior, attitude and knowledge were measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly... in protection behavior was low. To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability...
Internal consistency of the CHAMPS physical activity questionnaire for Spanish speaking older adults.

Science.gov (United States)

Rosario, Martín G; Vázquez, Jenniffer M; Cruz, Wanda I; Ortiz, Alexis

2008-09-01

The Community Healthy Activities Model Program for Seniors (CHAMPS) is a physical activity monitoring questionnaire for people between 65 and 90 years old. This questionnaire has previously been translated into Spanish for use in the Latin American population. To adapt the Spanish version of the CHAMPS questionnaire to Puerto Rico and assess its internal consistency. An external review committee adapted the existing Spanish version of the CHAMPS for use in the Puerto Rican population. Three older adults participated in a second phase with the purpose of training the research team. After the second phase, 35 older adults participated in a third content adaptation phase. During the third phase, the preliminary Spanish version of the CHAMPS for Puerto Rico was given to the 35 participants to assess clarity, vocabulary and understandability. Each participant in the third phase was interviewed to obtain feedback and create a final Spanish version of the CHAMPS for Puerto Rico. After analyses of this phase, the external review committee prepared a final Spanish version of the CHAMPS for Puerto Rico. The final version was administered to 15 older adults (76 +/- 6.5 years) to assess internal consistency using Cronbach's alpha. The questionnaire showed a strong internal consistency of 0.76. The total time to answer the questionnaire was 17.4 minutes. The Spanish version of the CHAMPS questionnaire for Puerto Rico appears to be an easy-to-administer and consistent measurement tool for assessing physical activity in older adults.
SCALE DEVELOPMENT FOR MEASURING AND PREDICTING ADOLESCENTS' LEISURE TIME PHYSICAL ACTIVITY BEHAVIOR

Directory of Open Access Journals (Sweden)

Silvia Arribas Galarraga

2009-12-01

The aim of this study was to develop a scale for assessing and predicting adolescents' physical activity behavior in Spain and Luxembourg using the Theory of Planned Behavior as a framework. The sample comprised 613 Spanish (boys = 309, girls = 304; M age = 15.28, SD = 1.127) and 752 Luxembourgish adolescents (boys = 343, girls = 409; M age = 14.92, SD = 1.198), selected from students of two secondary schools in both countries, with a similar socio-economic status. The initial 43 items were all scored on a 4-point response format using the structured alternative format and translated into Spanish, French and German. In order to ensure the accuracy of the translation, standardized parallel back-translation techniques were employed. Following two pilot tests and subsequent revisions, a second order exploratory factor analysis with oblimin direct rotation was used for factor extraction. Internal consistency and test-retest reliabilities were also tested. The 4-week test-retest correlations confirmed the items' time stability. The same five factors were obtained, explaining 63.76% and 63.64% of the total variance in the two samples. Internal consistency for the five factors ranged from α = 0.759 to α = 0.949 in the Spanish sample and from α = 0.735 to α = 0.952 in the Luxembourgish sample. For both samples, inter-factor correlations were all reported significant and positive, except for Factor 5 where they were significant but negative. The high internal consistency of the subscales, the reported item test-retest reliabilities and the identical factor structure confirm the adequacy of the elaborated questionnaire for assessing the TPB-based constructs when used with a population of adolescents in Spain and Luxembourg. The results give some indication that they may have value in measuring the hypothesized TPB constructs for PA behavior in a cross-cultural context.
Translation and validation of the Danish Foot Function Index (FFI-DK).

Science.gov (United States)

Jorgensen, J E; Andreasen, J; Rathleff, M S

2015-08-01

The objective of this study was to translate the Foot Function Index (FFI) for use in Danish-speaking patients with foot complaints. The FFI consists of 23 items scored on a numeric rating scale from 0 to 10. The 23 items are grouped into three subscales: pain (nine items), activity limitation (five items), and disability (nine items). The Danish FFI was developed according to the recommended forward/backward translation protocol. The data analysis included reliability [intraclass correlation coefficient (ICC) 2.1] and internal consistency (Cronbach's alpha). Excellent internal consistency was shown for the three subscales: pain (0.99), disability (0.98), and activity limitation (0.98), as for the total score (0.97). The test-retest reliability was excellent: pain subscale: ICC 0.98 [95% confidence interval (CI): 0.97-0.99]; activity limitation subscale: ICC: 0.95 (95% CI: 0.91-0.98); disability subscale: ICC 0.97 (95% CI: 0.95-0.98); total score: ICC: 0.95 (95% CI: 0.91 to 0.98). The mean difference between test and retest was below 1 point and P > 0.08. Bland-Altman plots showed no significant or clinically relevant differences from test to retest in any of the subscales or in the total score. The Danish version of the FFI was found to be valid and reliable and therefore acceptable for use in the Danish population. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
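The "ICC 2.1" in this abstract presumably denotes ICC(2,1) in Shrout and Fleiss's notation: two-way random effects, absolute agreement, single measurement. Under that assumption, a minimal sketch from the two-way ANOVA mean squares looks like this. The function name and data layout are hypothetical.

```python
def icc_2_1(data):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    data: n subjects (rows) by k occasions or raters (columns), no missing values.
    """
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(col) / n for col in zip(*data)]

    # Sums of squares for subjects (rows), occasions (columns) and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)                # between-subjects mean square
    msc = ss_cols / (k - 1)                # between-occasions mean square
    mse = ss_error / ((n - 1) * (k - 1))   # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)
```

Because this form measures absolute agreement, a constant shift between test and retest lowers the coefficient even when subjects keep exactly the same rank order.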
Minimum joint space width (mJSW) of patellofemoral joint on standing "skyline" radiographs: test-retest reproducibility and comparison with quantitative magnetic resonance imaging (qMRI)

International Nuclear Information System (INIS)

Simoni, Paolo; Jamali, Sanaa; Alvarez Miezentseva, Victoria; Albert, Adelin; Totterman, Saara; Schreyer, Edward; Tamez-Pena, Jose G.; Zobel, Bruno Beomonte; Gillet, Philippe

2013-01-01

To assess the intraobserver, interobserver, and test-retest reproducibility of minimum joint space width (mJSW) measurement of medial and lateral patellofemoral joints on standing "skyline" radiographs and to compare the mJSW of the patellofemoral joint to the mean cartilage thickness calculated by quantitative magnetic resonance imaging (qMRI). A couple of standing "skyline" radiographs of the patellofemoral joints and MRI of 55 knees of 28 volunteers (18 females, ten males, mean age, 48.5 ± 16.2 years) were obtained on the same day. The mJSW of the patellofemoral joint was manually measured and Kellgren and Lawrence grade (KLG) was independently assessed by two observers. The mJSW was compared to the mean cartilage thickness of patellofemoral joint calculated by qMRI. mJSW of the medial and lateral patellofemoral joint showed an excellent intraobserver agreement (intraclass correlation (ICC) = 0.94 and 0.96), interobserver agreement (ICC = 0.90 and 0.95) and test-retest agreement (ICC = 0.92 and 0.96). The mJSW measured on radiographs was correlated to mean cartilage thickness calculated by qMRI (r = 0.71, p < 0.0001 for the medial PFJ and r = 0.81, p < 0.0001 for the lateral PFJ). However, there was a lack of concordance between radiographs and qMRI for extreme values of joint width and KLG. Radiographs yielded higher joint space measures than qMRI in knees with a normal joint space, while qMRI yielded higher joint space measures than radiographs in knees with joint space narrowing and higher KLG. Standing "skyline" radiographs are a reproducible tool for measuring the mJSW of the patellofemoral joint. The mJSW of the patellofemoral joint on radiographs is correlated with, but not concordant with, qMRI measurements. (orig.)
Test-retest reliability and validity of a web-based food-frequency questionnaire for adolescents aged 13-14 to be used in the Norwegian Mother and Child Cohort Study (MoBa).

Science.gov (United States)

Overby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha

2014-01-01

The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. In total, 58 students (aged 13-14) from four different schools in the southern part of Norway participated in the reproducibility study, filling out the FFQ twice, 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. The median Spearman correlation coefficient for all nutrients in the test-retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlation coefficients were low in the absolute validity study, with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). The test-retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents.
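The Spearman correlation used for the test-retest and validity comparisons above can be sketched as a Pearson correlation computed on ranks, with tied values sharing their average rank. This is a generic illustration with hypothetical function names, not the authors' analysis code.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for idx in order[i:j + 1]:
            ranks[idx] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    sxy = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sxx = sum((a - mx) ** 2 for a in rx)
    syy = sum((b - my) ** 2 for b in ry)
    return sxy / (sxx * syy) ** 0.5
```

Working on ranks makes the coefficient insensitive to the skewed intake distributions that are typical of food-frequency data.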
Personal Hypothesis Testing: The Role of Consistency and Self-Schema.

Science.gov (United States)

Strohmer, Douglas C.; And Others

1988-01-01

Studied how individuals test hypotheses about themselves. Examined extent to which Snyder's bias toward confirmation persists when negative or nonconsistent personal hypothesis is tested. Found negativity or positivity did not affect hypothesis testing directly, though hypothesis consistency did. Found cognitive schematic variable (vulnerability…
Validation of the Social Appearance Anxiety Scale: Factor, Convergent, and Divergent Validity

Science.gov (United States)

Levinson, Cheri A.; Rodebaugh, Thomas L.

2011-01-01

The Social Appearance Anxiety Scale (SAAS) was created to assess fear of overall appearance evaluation. Initial psychometric work indicated that the measure had a single-factor structure and exhibited excellent internal consistency, test-retest reliability, and convergent validity. In the current study, the authors further examined the factor,…
Adaptation of the Bath measures on disease activity and function in ankylosing spondylitis into Danish

DEFF Research Database (Denmark)

Pedersen, Ole Birger; Hansen, G O; Svendsen, Anders Jørgen

2007-01-01

. RESULTS: Test-retest reliability was high (>0.90) and the random measurement error was within+/-2.0 for the BASG and within approximately+/-1.5 for BASDAI and BASFI, which is acceptable for most clinical settings. The measures have good internal consistency and are able to discriminate between functional...

Migraine patients consistently show abnormal vestibular bedside tests.

Science.gov (United States)

Maranhão, Eliana Teixeira; Maranhão-Filho, Péricles; Luiz, Ronir Raggio; Vincent, Maurice Borges

2016-01-01

Migraine and vertigo are common disorders, with lifetime prevalences of 16% and 7% respectively, and co-morbidity around 3.2%. Vestibular syndromes and dizziness occur more frequently in migraine patients. We investigated bedside clinical signs indicative of vestibular dysfunction in migraineurs. To test the hypothesis that vestibulo-ocular reflex, vestibulo-spinal reflex and fall risk (FR) responses as measured by 14 bedside tests are abnormal in migraineurs without vertigo, as compared with controls. Cross-sectional study including sixty individuals - thirty migraineurs, 25 women, 19-60 y-o; and 30 gender/age healthy paired controls. Migraineurs showed a tendency to perform worse in almost all tests, albeit only the Romberg tandem test was statistically different from controls. A combination of four abnormal tests better discriminated the two groups (93.3% specificity). Migraine patients consistently showed abnormal vestibular bedside tests when compared with controls.
Measuring leprosy-related stigma - a pilot study to validate a toolkit of instruments.

Science.gov (United States)

Rensen, Carin; Bandyopadhyay, Sudhakar; Gopal, Pala K; Van Brakel, Wim H

2011-01-01

Stigma negatively affects the quality of life of leprosy-affected people. Instruments are needed to assess levels of stigma and to monitor and evaluate stigma reduction interventions. We conducted a validation study of such instruments in Tamil Nadu and West Bengal, India. Four instruments were tested in a 'Community Based Rehabilitation' (CBR) setting: the Participation Scale, the Internalised Scale of Mental Illness (ISMI) adapted for leprosy-affected persons, the Explanatory Model Interview Catalogue (EMIC) for leprosy-affected and non-affected persons, and the General Self-Efficacy (GSE) Scale. We evaluated the following components of validity: construct validity, internal consistency, test-retest reproducibility and reliability to distinguish between groups. Construct validity was tested by correlating instrument scores and by triangulating quantitative and qualitative findings. Reliability was evaluated by comparing levels of stigma among people affected by leprosy and community controls, and among affected people living in CBR project areas and those in non-CBR areas. For the Participation, ISMI and EMIC scores, significant differences were observed between those affected by leprosy and those not affected (p = 0.0001), and between affected persons in the CBR and Control group (p < 0.05). The internal consistency of the instruments measured with Cronbach's α ranged from 0.83 to 0.96 and was very good for all instruments. Test-retest reproducibility coefficients were 0.80 for the Participation score, 0.70 for the EMIC score, 0.62 for the ISMI score and 0.50 for the GSE score. The construct validity of all instruments was confirmed. The Participation and EMIC Scales met all validity criteria, but test-retest reproducibility of the ISMI and GSE Scales needs further evaluation with a shorter test-retest interval and longer training and additional adaptations for the latter.
Development and Validation of the User Version of the Mobile Application Rating Scale (uMARS).

Science.gov (United States)

Stoyanov, Stoyan R; Hides, Leanne; Kavanagh, David J; Wilson, Hollie

2016-06-10

The Mobile Application Rating Scale (MARS) provides a reliable method to assess the quality of mobile health (mHealth) apps. However, training and expertise in mHealth and the relevant health field are required to administer it. This study describes the development and reliability testing of an end-user version of the MARS (uMARS). The MARS was simplified and piloted with 13 young people to create the uMARS. The internal consistency and test-retest reliability of the uMARS were then examined in a second sample of 164 young people participating in a randomized controlled trial of an mHealth app. App ratings were collected using the uMARS at 1-, 3-, and 6-month follow-up. The uMARS had excellent internal consistency (alpha = .90), with high individual alphas for all subscales. The total score and subscales had good test-retest reliability over both 1-2 months and 3 months. The uMARS is a simple tool that can be reliably used by end-users to assess the quality of mHealth apps.
Reliability and Validity of the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2) in Adults with Non-Cancer Pain

Science.gov (United States)

Hayes, Corey J.; Bhandari, Naleen Raj; Kathe, Niranjan; Payakachat, Nalin

2017-01-01

Limited evidence exists on how non-cancer pain (NCP) affects an individual’s health-related quality of life (HRQoL). This study aimed to validate the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2), a generic measure of HRQoL, in an NCP cohort using the Medical Expenditure Panel Survey Longitudinal Files. The SF Mental Component Summary (MCS12) and SF Physical Component Summary (PCS12) were tested for reliability (internal consistency and test-retest reliability) and validity (construct: convergent and discriminant; criterion: concurrent and predictive). A total of 15,716 patients with NCP were included in the final analysis. The MCS12 and PCS12 demonstrated high internal consistency (Cronbach’s alpha and Mosier’s alpha > 0.8), and moderate and high test-retest reliability, respectively (MCS12 intraclass correlation coefficient (ICC): 0.64; PCS12 ICC: 0.73). Both scales were significantly associated with a number of chronic conditions (p [...] reliable and valid measure of HRQoL for patients with NCP. PMID:28445438
Reliability of the Discounting Inventory: An extension into substance-use population

Directory of Open Access Journals (Sweden)

Malesza Marta

2017-06-01

Recent research introduced the Discounting Inventory, which allows the measurement of individual differences in the delay, probabilistic, effort, and social discounting rates. The goal of this investigation was to determine several aspects of the reliability of the Discounting Inventory using the responses of 385 participants (200 non-smokers and 185 current smokers). Two types of reliability are of interest: internal consistency and test-retest stability. A secondary aim was to extend such reliability measures beyond non-clinical participants. The current study aimed to measure the reliability of the DI in nicotine-dependent and non-nicotine-dependent individuals. It is concluded that the internal consistency of the DI is excellent, and that the test-retest reliability results suggest that items intended to measure three types of discounting were likely testing trait, rather than state, factors, regardless of whether “non-smokers” were included in, or excluded from, the analyses (probabilistic discounting scale scores being the exception). With these cautions in mind, however, the psychometric properties of the DI appear to be very good.
Reliability and concurrent validity of the Dutch hip and knee replacement expectations surveys.

Science.gov (United States)

van den Akker-Scheek, Inge; van Raay, Jos J A M; Reininga, Inge H F; Bulstra, Sjoerd K; Zijlstra, Wiebren; Stevens, Martin

2010-10-19

Preoperative expectations of outcome of total hip and knee arthroplasty are important determinants of patients' satisfaction and functional outcome. Aims of the study were (1) to translate the Hospital for Special Surgery Hip Replacement Expectations Survey and Knee Replacement Expectations Survey into Dutch and (2) to study test-retest reliability and concurrent validity. Patients scheduled for total hip (N = 112) or knee replacement (N = 101) were sent the Dutch Expectations Surveys twice with a 2-week interval to determine test-retest reliability. To determine concurrent validity, the Expectation WOMAC was sent. The results for the Dutch Hip Replacement Expectations Survey revealed good test-retest reliability (ICC 0.87), no bias and good internal consistency (alpha 0.86) (N = 72). The correlation between the Hip Expectations Score and the Expectation WOMAC score was 0.59 (N = 86). The results for the Dutch Knee Replacement Expectations Survey revealed good test-retest reliability (ICC 0.79), no bias and good internal consistency (alpha 0.91) (N = 46). The correlation with the Expectation WOMAC score was 0.52 (N = 57). Both Dutch Expectations Surveys are reliable instruments to determine patients' expectations before total hip or knee arthroplasty. As for concurrent validity, the correlation between both surveys and the Expectation WOMAC was moderate, confirming that the same construct was determined. However, patients scored systematically lower on the Expectation WOMAC compared to the Dutch Expectation Surveys. Research on patients' expectations before total hip and knee replacement has only been performed in a limited number of countries. With the Dutch Expectations Surveys it is now possible to determine patients' expectations in another culture and healthcare setting.
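Many records in this listing report internal consistency as Cronbach's alpha. As a rough illustration of what that coefficient measures, the sketch below computes alpha from a respondents-by-items score table. The function name `cronbach_alpha` and the toy data are assumptions made for illustration; they are not taken from any of the studies listed here.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of item scores.

    scores: list of respondents, each a list with one score per item.
    Hypothetical helper for illustration only.
    """
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # transpose: one tuple per item
    item_var_sum = sum(pvariance(col) for col in items)
    total_var = pvariance([sum(row) for row in scores])
    # alpha = k/(k-1) * (1 - sum of item variances / variance of total score)
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Toy data (made up): three respondents answering three items identically,
# so the items are perfectly consistent with one another.
toy = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
print(cronbach_alpha(toy))
```

With perfectly consistent items, as in the toy table, alpha reaches its maximum of 1; real questionnaires such as those above fall below that.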
Data quality and factor analysis of the Danish version of the Relationship Scale Questionnaire

DEFF Research Database (Denmark)

Andersen, Christina Maar; Pedersen, Anette Fischer; Carlsen, Anders Helles

2017-01-01

properties of the Danish translation of the RSQ and to test whether the results are consistent with the hypothesized model of attachment. METHODS: The study included two samples: 602 general practitioners and 611 cancer patients. The two samples were analyzed separately. Data quality was assessed by mean......, median and missing values for each item, floor and ceiling effects, average inter-item correlations and Cronbach's α for each subscale. Test-retest was assessed by intra-class correlations among 76 general practitioners. A confirmatory factor analysis was conducted to establish evidence of the four......-factor structure which was validated through a confirmatory factor analysis in a second subsample comprised of 278 cancer patients and 289 general practitioners. RESULTS: The data quality of the RSQ was generally good, except low internal consistency and low to moderate test-retest reliability. The four subscales...
Computer interviewing in urogynaecology: concept, development and psychometric testing of an electronic pelvic floor assessment questionnaire in primary and secondary care.

Science.gov (United States)

Radley, S C; Jones, G L; Tanguy, E A; Stevens, V G; Nelson, C; Mathers, N J

2006-02-01

To develop and evaluate a Web-based, electronic pelvic floor symptoms assessment questionnaire (e-PAQ) for women. A cross-sectional study in primary and secondary care. Two general practices, two community health clinics and a secondary care urogynaecology clinic. A total of 432 women (204 in primary care and 228 in secondary care) were recruited between June 2003 and January 2004. The e-PAQ was located on a workstation (computer, touchscreen and printer). Women completed the e-PAQ prior to their appointment. Untreated women in primary care were asked to return seven days later to complete the e-PAQ a second time (test-retest). Factor analysis, reliability, validity, patient satisfaction, completion times and system costs. In secondary care, factor analysis identified 14 domains within the four dimensions (urinary, bowel, vaginal and sexual symptoms) with internal consistency (Cronbach's alpha) ≥ 0.7 in 11 of these. In primary care, alpha values were all ≥ 0.7 and test-retest analysis found acceptable intraclass correlations of 0.50-0.95 (PPAQ offers a user-friendly clinical tool, which provides valid and reliable data. The system offers comprehensive symptoms and quality of life evaluation and may enhance the clinical episode as well as the quality of care for women with pelvic floor disorders.
A validation study on the traditional Chinese version of Spinal Appearance Questionnaire for adolescent idiopathic scoliosis.

Science.gov (United States)

Guo, Jing; Lau, Ajax Hong Yin; Chau, Jack; Ng, Bobby Kin Wah; Lee, Kwong Man; Qiu, Yong; Cheng, Jack Chun Yiu; Lam, Tsz Ping

2016-10-01

"Simplified Chinese" version of Spinal Appearance Questionnaire (SC-SAQ) for patients with adolescent idiopathic scoliosis (AIS) was available but did not fit for communities using "Traditional Chinese" as their primary language. We developed a traditional Chinese version of SAQ (TC-SAQ) and evaluated its reliability and validity. TC-SAQ was administered to 112 AIS patients, of which 101 bilingual (English and Chinese) patients completed E-SAQ and the traditional Chinese version of Scoliosis Research Society-22 questionnaire (TC-SRS-22). Internal consistency and test-retest reliability were evaluated. Concurrent validity was evaluated by comparing TC-SAQ score with E-SAQ score, and convergent validity by comparing TC-SAQ score with TC-SRS-22 self-image domain score, and discriminant validity by analyzing the relationship between TC-SAQ score and patients' characteristics. Internal consistency of individual TC-SAQ domain was high (Cronbach's α = 0.785 to 0.940), except for general (Cronbach's α = 0.665) and shoulders (Cronbach's α = 0.421) domain. Test-retest reliability of TC-SAQ was good (ICCs of each domain from 0.798 to 0.865). Concurrent validity demonstrated an excellent correlation between TC-SAQ and E-SAQ scores (r = 0.820 to 0.954, P self-image domain was weak to moderate. TC-SAQ total score and individual domain scores (except waist and chest domains) were positively correlated to major curve magnitude. TC-SAQ had good internal consistency and test-retest reliability. Concurrent validity evaluated against the original English version was excellent. TC-SAQ was both reliable and valid for clinical use for AIS patients using traditional Chinese as their primary language.
Cross-cultural adaptation and psychometric assessment of the Chinese version of the comprehensive needs assessment tool for cancer caregivers (CNAT-C).

Science.gov (United States)

Zhang, Yin-Ping; Zhao, Xin-Shuang; Zhang, Bei; Zhang, Lu-Lu; Ni, Chun-Ping; Hao, Nan; Shi, Chang-Bei; Porr, Caroline

2015-07-01

The comprehensive needs assessment tool for cancer caregivers (CNAT-C) is a systematic and comprehensive needs assessment tool for the family caregivers. The purpose of this project was twofold: (1) to adapt the CNAT-C to Mainland China's cultural context and (2) to evaluate the psychometric properties of the newly adapted Chinese CNAT-C. Cross-cultural adaptation of the original CNAT-C was performed according to published guidelines. A pilot study was conducted in Mainland China with 30 Chinese family cancer caregivers. A subsequent validation study was conducted with 205 Chinese cancer caregivers from Mainland China. Construct validity was determined through exploratory and confirmatory factor analyses. Reliability was determined using internal consistency and test-retest reliability. The split-half coefficient for the overall Chinese CNAT-C scale was 0.77. Principal component analysis resulted in an eight-factor structure explaining 68.11 % of the total variance. The comparative fit index (CFI) was 0.91 from the modified model confirmatory factor analysis. The Chi-square divided by degrees of freedom was 1.98, and the root mean squared error of approximation (RMSEA) was 0.079. In relation to the known-group validation, significant differences were found in the Chinese CNAT-C scale according to various caregiver characteristics. Internal consistency was high for the Chinese CNAT-C reaching a Cronbach α value of 0.94. Test-retest reliability was 0.85. The newly adapted Chinese CNAT-C scale possesses adequate validity, test-retest reliability, and internal consistency and therefore may be used to ascertain holistic health and support needs of cancer patients' family caregivers in Mainland China.
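Split-half coefficients such as the one reported above are often stepped up to full test length with the Spearman-Brown formula. The abstract does not say whether its 0.77 is already corrected, so the sketch below only illustrates the formula itself. The function name `spearman_brown` and the input value are assumptions, not figures from the study.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Spearman-Brown prophecy formula (illustrative sketch).

    r_half: correlation between the two halves of a test.
    length_factor: how many times longer the full test is than each part
                   (2 for a classic split-half design).
    """
    return length_factor * r_half / (1 + (length_factor - 1) * r_half)


# Hypothetical half-test correlation of 0.5, stepped up to full length.
print(round(spearman_brown(0.5), 3))  # 0.667
```

The corrected value is always at least as large as the raw half-test correlation when that correlation is positive, which is why it matters which of the two a paper reports.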
Influence of enrichment on behavioral and neurogenic effects of antidepressants in Wistar rats submitted to repeated forced swim test.

Science.gov (United States)

Possamai, Fernanda; dos Santos, Juliano; Walber, Thais; Marcon, Juliana C; dos Santos, Tiago Souza; Lino de Oliveira, Cilene

2015-04-03

Repeated forced swimming test (rFST) may detect gradual effects of antidepressants in adult rats. Antidepressants, as enrichment, affected behavior and neurogenesis in rats. However, the influence of enrichment on behavioral and neurogenic effects of antidepressants is unknown. Here, effects of antidepressants on rFST and hippocampal neurogenesis were investigated in rats under enriched conditions. Behaviors of male Wistar rats, housed from weaning in standard (SE) or enriched environment (EE), were registered during rFST. The rFST consisted of 15min of swimming (pretest) followed by 5min of swimming in the first (test), seventh (retest 1) and fourteenth (retest 2) days after pretest. One hour before the test, rats received an intraperitoneal injection of saline (1ml/kg), fluoxetine (2.5mg/kg) or imipramine (2.5 or 5mg/kg). These treatments were performed daily until the day of the retest 2. After retest 2, rats were euthanized for the identification of markers for neurogenesis in the hippocampus. Fluoxetine or imipramine decreased immobility in retests 1 and 2, as compared to saline. EE abolished these differences. In EE, fluoxetine or imipramine (5mg/kg) reduced immobility time in retest 2, as compared to the test. Independent of the housing conditions, fluoxetine and imipramine (5mg/kg) increased the ratio of immature neurons per progenitor cell in the hippocampus. In summary, antidepressants or enrichment counteracted the high immobility in rFST. Enrichment changed the effects of antidepressants in rFST depending on the type, and the dose of a substance but failed to change neurogenesis in control or antidepressant treated-rats. Effects of antidepressants and enrichment on rFST seemed neurogenesis-independent. Copyright © 2014 Elsevier Inc. All rights reserved.
Validity and reliability of the novel thyroid-specific quality of life questionnaire, ThyPRO

DEFF Research Database (Denmark)

Watt, Torquil; Hegedüs, Laszlo; Groenvold, Mogens

2010-01-01

Background Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated. Aim To investigate clinical ('known-groups') validity and test-retest reliability of the Danish version of the ThyPRO. Methods For each of the 13 ThyPRO scales, we defined groups expected to have high versus low scores ('known-groups'). The clinical validity (known......-groups validity) was evaluated by whether the ThyPRO scales could detect expected differences in a cross-sectional study of 907 thyroid patients. Test-retest reliability was evaluated by intra-class correlations of two responses to the ThyPRO 2 weeks apart in a subsample of 87 stable patients. Results On all 13...
Psychometric Properties of the Chinese Version of the Eating Attitudes Test in Young Female Patients with Eating Disorders in Mainland China.

Science.gov (United States)

Kang, Qing; Chan, Raymond C K; Li, Xiaoping; Arcelus, Jon; Yue, Ling; Huang, Jiabin; Gu, Lian; Fan, Qing; Zhang, Haiyin; Xiao, Zeping; Chen, Jue

2017-11-01

The study aimed to investigate the reliability and validity of the Chinese version of the eating attitudes test (EAT-26) among female adolescents and young adults in Mainland China. This scale was administered to 396 female eating disorder patients and 406 healthy controls without eating disorders; in addition, 35 healthy controls completed a retest after a 4-week interval. Tests for reliability, convergent validity and receiver operating characteristic analysis were performed to detect the psychometric properties. The EAT-26 demonstrated good internal consistency (Cronbach's alpha = 0.822-0.922), test-retest reliability (interclass correlation coefficient = 0.817) and convergent validity (r = 0.450-0.750). The receiver operating characteristic analysis showed that the cut-off 14 for anorexia nervosa and 15 for bulimia nervosa represented good compromises with approximate sensitivity (0.66-0.68) and specificity (0.85-0.86). Our findings provided evidence that the Chinese version of the EAT-26 was a psychometrically reliable and valid self-rating instrument for identifying people suffering from an eating disorder in Mainland China. A clinical cut-off range between 14 and 15 could be used, but caution should be exercised because of the low sensitivity of the tool. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association.
Psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C) for patients with alcoholism.

Science.gov (United States)

Hsueh, Yu-Jung; Chu, Hsin; Huang, Chang-Chih; Ou, Keng-Liang; Chen, Chiung-Hua; Chou, Kuei-Ru

2014-04-01

The aim of this study was to examine the psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C). The sensitivity, specificity, and positive and negative predictive values for the MAST-C were examined in this study. The MAST-C had an internal consistency of 0.83 and a test-retest reliability of 0.89. It had a good content validity index of 0.92. Factor analysis identified four factors and the optimal cutoff point for the MAST-C was a score of 6/7, which yielded a sensitivity of 0.92, a specificity of 0.83, a positive predictive value of 0.92, and a negative predictive value of 0.83. The MAST-C provides a fast, accurate, and sensitive method for clinically diagnosing alcoholism and clinical management. © 2013 Wiley Periodicals, Inc.
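The four screening statistics named above all derive from a 2x2 table of test result against true status. The sketch below shows the arithmetic with hypothetical counts chosen for illustration; they are not the MAST-C data, and the function name `diagnostic_metrics` is an assumption.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Screening statistics from a 2x2 confusion table (illustrative sketch).

    tp: cases the test flags correctly     fp: non-cases flagged in error
    fn: cases the test misses              tn: non-cases correctly cleared
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases detected
        "specificity": tn / (tn + fp),  # share of non-cases cleared
        "ppv": tp / (tp + fp),          # chance a positive result is a case
        "npv": tn / (tn + fn),          # chance a negative result is a non-case
    }


# Hypothetical counts: 100 cases and 100 non-cases.
for name, value in diagnostic_metrics(tp=92, fp=17, fn=8, tn=83).items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity are properties of the cut-off, whereas the predictive values also depend on how common the condition is in the sample being screened.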
Development and psychometric testing of the active aging scale for Thai adults.

Science.gov (United States)

Thanakwang, Kattika; Isaramalai, Sang-Arun; Hatthakit, Urai

2014-01-01

Active aging is central to enhancing the quality of life for older adults, but its conceptualization is not often made explicit for Asian elderly people. Little is known about active aging in older Thai adults, and there has been no development of scales to measure the expression of active aging attributes. The aim of this study was to develop a culturally relevant composite scale of active aging for Thai adults (AAS-Thai) and to evaluate its reliability and validity. Eight steps of scale development were followed: 1) using focus groups and in-depth interviews, 2) gathering input from existing studies, 3) developing preliminary quantitative measures, 4) reviewing for content validity by an expert panel, 5) conducting cognitive interviews, 6) pilot testing, 7) performing a nationwide survey, and 8) testing psychometric properties. In a nationwide survey, 500 subjects were randomly recruited using a stratified sampling technique. Statistical analyses included exploratory factor analysis, item analysis, and measures of internal consistency, concurrent validity, and test-retest reliability. Principal component factor analysis with varimax rotation resulted in a final 36-item scale consisting of seven factors of active aging: 1) being self-reliant, 2) being actively engaged with society, 3) developing spiritual wisdom, 4) building up financial security, 5) maintaining a healthy lifestyle, 6) engaging in active learning, and 7) strengthening family ties to ensure care in later life. These factors explained 69% of the total variance. Cronbach's alpha coefficient for the overall AAS-Thai was 0.95 and varied between 0.81 and 0.91 for the seven subscales. Concurrent validity and test-retest reliability were confirmed. The AAS-Thai demonstrated acceptable overall validity and reliability for measuring the multidimensional attributes of active aging in a Thai context. This newly developed instrument is ready for use as a screening tool to assess active aging levels among older
Choice consistency and preference stability in test-retests of discrete choice experiment and open-ended willingness to pay elicitation formats

NARCIS (Netherlands)

Brouwer, R.; Logar, I.; Sheremet, O.I.

2017-01-01

This study tests the temporal stability of preferences, choices and willingness to pay (WTP) values using both discrete choice experiment (DCE) and open-ended (OE) WTP elicitation formats. The same sample is surveyed three times over the course of two years using each time the same choice sets.
Migraine patients consistently show abnormal vestibular bedside tests

Directory of Open Access Journals (Sweden)

Eliana Teixeira Maranhão

2015-01-01

Migraine and vertigo are common disorders, with lifetime prevalences of 16% and 7% respectively, and co-morbidity around 3.2%. Vestibular syndromes and dizziness occur more frequently in migraine patients. We investigated bedside clinical signs indicative of vestibular dysfunction in migraineurs. Objective: To test the hypothesis that vestibulo-ocular reflex, vestibulo-spinal reflex and fall risk (FR) responses as measured by 14 bedside tests are abnormal in migraineurs without vertigo, as compared with controls. Method: Cross-sectional study including sixty individuals – thirty migraineurs, 25 women, 19-60 y-o; and 30 gender/age healthy paired controls. Results: Migraineurs showed a tendency to perform worse in almost all tests, albeit only the Romberg tandem test was statistically different from controls. A combination of four abnormal tests better discriminated the two groups (93.3% specificity). Conclusion: Migraine patients consistently showed abnormal vestibular bedside tests when compared with controls.
A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing.

Science.gov (United States)

DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M

2017-10-27

The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no
Translation and testing of measurement properties of the Swedish version of the IKDC subjective knee form.

Science.gov (United States)

Tigerstrand Grevnerts, H; Grävare Silbernagel, K; Sonesson, S; Ardern, C; Österberg, A; Gauffin, H; Kvist, J

2017-05-01

To translate into Swedish and cross-culturally adapt the IKDC-SKF and to test the measurement properties of the Swedish version of IKDC-SKF in ACL-injured patients undergoing reconstruction surgery. The translation and cross-cultural adaption was performed according to guidelines. Seventy-six patients with an ACL injury filled out the IKDC-SKF and other questionnaires before ACL reconstruction and at 4, 6, and 12 months after surgery. A total of 203 patients from the Swedish ACL Registry participated at 8 months post-operative. Measurement properties were tested according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guidelines. The Swedish IKDC-SKF had high internal consistency (Cronbach's alpha = 0.90) and test-retest reliability (ICC2,1 = 0.92, CI 95%: 0.81-0.97, Pmeasurement properties and can be recommended for use in a population of ACL-deficient patients undergoing ACL reconstruction. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Assessment of the reliability and consistency of the "malnutrition inflammation score" (MIS) in Mexican adults with chronic kidney disease for diagnosis of protein-energy wasting syndrome (PEW).

Science.gov (United States)

González-Ortiz, Ailema Janeth; Arce-Santander, Celene Viridiana; Vega-Vega, Olynka; Correa-Rotter, Ricardo; Espinosa-Cuevas, María de Los Angeles

2014-10-04

The protein-energy wasting syndrome (PEW) is a condition of malnutrition, inflammation, anorexia and wasting of body reserves resulting from inflammatory and non-inflammatory conditions in patients with chronic kidney disease (CKD). One way of assessing PEW, extensively described in the literature, is using the Malnutrition Inflammation Score (MIS). To assess the reliability and consistency of MIS for diagnosis of PEW in Mexican adults with CKD on hemodialysis (HD). Study of diagnostic tests. A sample of 45 adults with CKD on HD were analyzed during the period June-July 2014. The instrument was applied on 2 occasions; the test-retest reliability was calculated using the Intraclass Correlation Coefficient (ICC); the internal consistency of the questionnaire was analyzed using Cronbach's α coefficient. A weighted Kappa test was used to estimate the validity of the instrument; the result was subsequently compared with the Bilbrey nutritional index (BNI). The reliability of the questionnaires, evaluated in the patient sample, was ICC = 0.829. The agreement between MIS observations was considered adequate, k = 0.585 (p < 0.001); when comparing it with BNI, a value of k = 0.114 was obtained (p < 0.001). In order to estimate the tendency, a correlation test was performed. The r² correlation coefficient was 0.488 (P < 0.001). MIS has adequate reliability and validity for diagnosing PEW in the population with chronic kidney disease on HD. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.

Reliability of a computer software angle tool for measuring spine and pelvic flexibility during the sit-and-reach test.

Science.gov (United States)

Mier, Constance M; Shapiro, Belinda S

2013-02-01

The purpose of this study was to determine the reliability of a computer software angle tool that measures thoracic (T), lumbar (L), and pelvic (P) angles as a means of evaluating spine and pelvic flexibility during the sit-and-reach (SR) test. Thirty adults performed the SR twice on separate days. The SR test was captured on video and later analyzed for T, L, and P angles using the computer software angle tool. During the test, 3 markers were placed over T1, T12, and L5 vertebrae to identify T, L, and P angles. Intraclass correlation coefficient (ICC) indicated a very high internal consistency (between trials) for T, L, and P angles (0.95-0.99); thus, the average of trials was used for test-retest (between days) reliability. Mean (±SD) values did not differ between days for T (51.0 ± 14.3 vs. 52.3 ± 16.2°), L (23.9 ± 7.1 vs. 23.0 ± 6.9°), or P (98.4 ± 15.6 vs. 98.3 ± 14.7°) angles. Test-retest reliability (ICC) was high for T (0.96) and P (0.97) angles and moderate for L angle (0.84). Both intrarater and interrater reliabilities were high for T (0.95, 0.94) and P (0.97, 0.97) angles and moderate for L angle (0.87, 0.82). Thus, the computer software angle tool is a highly objective method for assessing spine and pelvic flexibility during a video-captured SR test.
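Test-retest reliability in records like the one above is usually summarized with an intraclass correlation coefficient. The abstract does not state which ICC form was used, so the sketch below assumes the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1). The function name `icc_2_1` and the toy angles are hypothetical.

```python
def icc_2_1(data):
    """ICC(2,1): two-way random effects, absolute agreement, single measure.

    data: list of subjects, each a list with one score per session.
    Illustrative sketch; the ICC form is an assumption, not the study's.
    """
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)   # between subjects
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)   # between sessions
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_err = ss_total - ss_rows - ss_cols                    # residual

    msr = ss_rows / (n - 1)
    msc = ss_cols / (k - 1)
    mse = ss_err / ((n - 1) * (k - 1))
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


# Toy angles (made up): day 2 reads exactly 5 degrees higher than day 1.
shifted = [[50, 55], [60, 65], [70, 75]]
print(round(icc_2_1(shifted), 3))  # 0.889
```

Because this form measures absolute agreement, a constant shift between days lowers the coefficient even though subjects keep exactly the same rank order; a consistency-type ICC would ignore that shift.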
Short-Cut Estimators of Criterion-Referenced Test Consistency.

Science.gov (United States)

Brown, James Dean

1990-01-01

Presents simplified methods for deriving estimates of the consistency of criterion-referenced, English-as-a-Second-Language tests, including (1) the threshold loss agreement approach using agreement or kappa coefficients, (2) the squared-error loss agreement approach using the phi(lambda) dependability approach, and (3) the domain score…
Testing Practices and Attitudes Toward Tests and Testing: An International Survey

Czech Academy of Sciences Publication Activity Database

Evers, A.; McCormick, C. M.; Hawley, L. R.; Muñiz, J.; Balboni, G.; Bartram, D.; Boben, D.; Egeland, J.; El-Hassan, K.; Fernández-Hermida, J.R.; Fine, S.; Frans, Ö.; Gintiliéne, G.; Hagemeister, C.; Halama, P.; Iliescu, D.; Jaworowska, A.; Jiménez, P.; Manthouili, M.; Matesic, K.; Michaelsen, L.; Mogaji, A.; Morley-Kirk, J.; Rózsa, S.; Rowlands, L.; Schittekatte, M.; Sümer, H.C.; Suwartono, T.; Urbánek, Tomáš; Wechsler, S.; Zelenevska, T.; Zanev, S.; Zhang, J.

2017-01-01

Vol. 17, No. 2 (2017), pp. 158-190 ISSN 1530-5058 Institutional support: RVO:68081740 Keywords: psychological testing * testing practices * test use * International Test Commission * European Federation of Psychologists' Associations RIV subject: AN - Psychology OECD field: Psychology (including human - machine relations)
Test-retest reliability and agreement of the SPI-Questionnaire to detect symptoms of digital ischemia in elite volleyball players.

Science.gov (United States)

van de Pol, Daan; Zacharian, Tigran; Maas, Mario; Kuijer, P Paul F M

2017-06-01

The Shoulder posterior circumflex humeral artery Pathology and digital Ischemia - questionnaire (SPI-Q) has been developed to enable periodic surveillance of elite volleyball players, who are at risk for digital ischemia. Prior to implementation, assessing reliability is mandatory. Therefore, the test-retest reliability and agreement of the SPI-Q were evaluated among the population at risk. A questionnaire survey was performed with a 2-week interval among 65 elite male volleyball players assessing symptoms of cold, pale and blue digits in the dominant hand during or after practice or competition using a 4-point Likert scale (never, sometimes, often and always). Kappa (κ) and percentage of agreement (POA) were calculated for individual symptoms, and to distinguish symptomatic and asymptomatic players. For the individual symptoms, κ ranged from "poor" (0.25) to "good" (0.63), and POA ranged from "moderate" (78%) to "good" (97%). To classify symptomatic players, the SPI-Q showed "good" reliability (κ = 0.83; 95%CI 0.69-0.97) and "good" agreement (POA = 92%). The current study has proven the SPI-Q to be reliable for detecting elite male indoor volleyball players with symptoms of digital ischemia.
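Kappa and percentage of agreement, as used above, compare two categorical ratings of the same people. A minimal sketch of both follows, assuming unweighted Cohen's kappa; the abstract does not say whether a weighted variant was applied to the Likert items. The labels and the function name `agreement_and_kappa` are made up for illustration.

```python
from collections import Counter


def agreement_and_kappa(first, second):
    """Percentage of agreement and unweighted Cohen's kappa (sketch).

    first, second: equal-length lists of category labels from the
    test and the retest administration.
    """
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    c1, c2 = Counter(first), Counter(second)
    # Agreement expected by chance from the marginal frequencies.
    expected = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / n ** 2
    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical classifications of four players at test and retest.
test = ["symptomatic", "symptomatic", "asymptomatic", "asymptomatic"]
retest = ["symptomatic", "asymptomatic", "asymptomatic", "asymptomatic"]
poa, kappa = agreement_and_kappa(test, retest)
print(poa, kappa)  # 0.75 0.5
```

Kappa sits below the raw percentage because it discounts the agreement that would occur by chance, which helps explain how individual symptoms in the study can show high POA alongside a low kappa.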
Improving biobank consent comprehension: a national randomized survey to assess the effect of a simplified form and review/retest intervention

Science.gov (United States)

Beskow, Laura M.; Lin, Li; Dombeck, Carrie B.; Gao, Emily; Weinfurt, Kevin P.

2017-01-01

Purpose: To determine the individual and combined effects of a simplified form and a review/retest intervention on biobanking consent comprehension. Methods: We conducted a national online survey in which participants were randomized within four educational strata to review a simplified or traditional consent form. Participants then completed a comprehension quiz; for each item answered incorrectly, they reviewed the corresponding consent form section and answered another quiz item on that topic. Results: Consistent with our first hypothesis, comprehension among those who received the simplified form was not inferior to that among those who received the traditional form. Contrary to expectations, receipt of the simplified form did not result in significantly better comprehension compared with the traditional form among those in the lowest educational group. The review/retest procedure significantly improved quiz scores in every combination of consent form and education level. Although improved, comprehension remained a challenge in the lowest-education group. Higher quiz scores were significantly associated with willingness to participate. Conclusion: Ensuring consent comprehension remains a challenge, but simplified forms have virtues independent of their impact on understanding. A review/retest intervention may have a significant effect, but assessing comprehension raises complex questions about setting thresholds for understanding and consequences of not meeting them. Genet Med advance online publication 13 October 2016 PMID:27735922
Psychometric Evaluation of the Theory of Mind Inventory (ToMI): A Study of Typically Developing Children and Children with Autism Spectrum Disorder

Science.gov (United States)

Hutchins, Tiffany L.; Prelock, Patricia A.; Bonazinga, Laura

2012-01-01

Two studies examined the psychometric properties of the Theory of Mind Inventory (ToMI). In Study One, 135 caregivers completed the ToMI for children (ages 3 through 17) with autism spectrum disorder (ASD). Findings revealed excellent test-retest reliability and internal consistency. Principal Components Analysis revealed three subscales related…
Meta-Analysis of the English Version of the Beck Depression Inventory-Second Edition

Science.gov (United States)

Erford, Bradley T.; Johnson, Erin; Bardoshi, Gerta

2016-01-01

This meta-analysis reviewed 144 studies from 1996 to 2013 using the Beck Depression Inventory-Second Edition. Internal consistency was 0.89 and test-retest reliability 0.75. Convergent comparisons were robust across 43 depression instruments. Structural validity supported both one- and two-factor solutions and diagnostic accuracy varied according…
The Measurement of Psychological Maltreatment: Early Data on the Child Abuse and Trauma Scale.

Science.gov (United States)

Sanders, Barbara; Becker-Lausen, Evvie

1995-01-01

The Child Abuse and Trauma Scale, a self-report measure yielding a quantitative index of the frequency and extent of negative experiences in childhood and adolescence, was administered to 1,198 college students and 17 subjects with Multiple Personality Disorder. Results revealed the scale's strong internal consistency, test-retest reliability, and…
Factorial Validity and Psychometric Examination of the Exercise Dependence Scale-Revised

Science.gov (United States)

Downs, Danielle Symons; Hausenblas, Heather A.; Nigg, Claudio R.

2004-01-01

The research purposes were to examine the factorial and convergent validity, internal consistency, and test-retest reliability of the Exercise Dependence Scale (EDS). Two separate studies, containing a total of 1,263 college students, were undertaken to accomplish these purposes. Participants completed the EDS and measures of exercise behavior and…
Test-retest reliability of evoked BOLD signals from a cognitive-emotive fMRI test battery.

Science.gov (United States)

Plichta, Michael M; Schwarz, Adam J; Grimm, Oliver; Morgen, Katrin; Mier, Daniela; Haddad, Leila; Gerdes, Antje B M; Sauer, Carina; Tost, Heike; Esslinger, Christine; Colman, Peter; Wilson, Frederick; Kirsch, Peter; Meyer-Lindenberg, Andreas

2012-04-15

Even more than in cognitive research applications, moving fMRI to the clinic and the drug development process requires the generation of stable and reliable signal changes. The performance characteristics of the fMRI paradigm constrain experimental power and may require different study designs (e.g., crossover vs. parallel groups), yet fMRI reliability characteristics can be strongly dependent on the nature of the fMRI task. The present study investigated both within-subject and group-level reliability of a combined three-task fMRI battery targeting three systems of wide applicability in clinical and cognitive neuroscience: an emotional (face matching), a motivational (monetary reward anticipation) and a cognitive (n-back working memory) task. A group of 25 young, healthy volunteers were scanned twice on a 3T MRI scanner with a mean test-retest interval of 14.6 days. FMRI reliability was quantified using the intraclass correlation coefficient (ICC) applied at three different levels ranging from a global to a localized and fine spatial scale: (1) reliability of group-level activation maps over the whole brain and within targeted regions of interest (ROIs); (2) within-subject reliability of ROI-mean amplitudes and (3) within-subject reliability of individual voxels in the target ROIs. Results showed robust evoked activation of all three tasks in their respective target regions (emotional task=amygdala; motivational task=ventral striatum; cognitive task=right dorsolateral prefrontal cortex and parietal cortices) with high effect sizes (ES) of ROI-mean summary values (ES=1.11-1.44 for the faces task, 0.96-1.43 for the reward task, 0.83-2.58 for the n-back task). Reliability of group level activation was excellent for all three tasks with ICCs of 0.89-0.98 at the whole brain level and 0.66-0.97 within target ROIs. Within-subject reliability of ROI-mean amplitudes across sessions was fair to good for the reward task (ICCs=0.56-0.62) and, dependent on the particular ROI
Test–retest repeatability of quantitative cardiac 11C-meta-hydroxyephedrine measurements in rats by small animal positron emission tomography

International Nuclear Information System (INIS)

Thackeray, James T.; Renaud, Jennifer M.; Kordos, Myra; Klein, Ran; Kemp, Robert A. de; Beanlands, Rob S.B.; DaSilva, Jean N.

2013-01-01

Introduction: The norepinephrine analogue 11C-meta-hydroxyephedrine (HED) has been used to interrogate sympathetic neuronal reuptake in cardiovascular disease. Application for longitudinal studies in small animal models of disease necessitates an understanding of test–retest variability. This study evaluated the repeatability of multiple quantitative cardiac measurements of HED retention and washout and the pharmacological response to reuptake blockade and enhanced norepinephrine levels. Methods: Small animal PET images were acquired over 60 min following HED administration to healthy male Sprague Dawley rats. Paired test and retest scans were undertaken in individual animals over 7 days. Additional HED scans were conducted following administration of norepinephrine reuptake inhibitor desipramine or continuous infusion of exogenous norepinephrine. HED retention was quantified by retention index, standardized uptake value (SUV), monoexponential and one-compartment washout. Plasma and cardiac norepinephrine were measured by high performance liquid chromatography. Results: Test–retest variability was lower for retention index (15% ± 12%) and SUV (19% ± 15%) as compared to monoexponential washout rates (21% ± 13%). Desipramine pretreatment reduced myocardial HED retention index by 69% and SUV by 85%. Chase treatment with desipramine increased monoexponential HED washout by 197% compared to untreated controls. Norepinephrine infusion dose-dependently reduced HED accumulation, reflected by both retention index and SUV, with a corresponding increase in monoexponential washout. Plasma and cardiac norepinephrine levels correlated with HED quantitative measurements. Conclusion: The repeatability of HED retention index, SUV, and monoexponential washout supports its suitability for longitudinal PET studies in rats. Uptake and washout of HED are sensitive to acute increases in norepinephrine concentration
Cross-Cultural Translation, Adaptation and Reliability of the Danish M. D. Anderson Dysphagia Inventory (MDADI) in Patients with Head and Neck Cancer

DEFF Research Database (Denmark)

Hajdú, Sara Fredslund; Plaschke, Christina Caroline; Johansen, Christoffer

2017-01-01

The objectives were to translate and culturally adapt the M.D. Anderson Dysphagia Inventory (MDADI) into Danish and subsequently test the reliability of the Danish version. The MDADI was translated into Danish and cross culturally adapted through cognitive interviews. The final version was test...... patients were interviewed on the comprehensibility of the Danish MDADI, and all found the questionnaire meaningful, easy to understand, non-offensive and to include relevant aspects of dysphagia related to HNC. Sixty-four patients were included in the test-retest study. Especially, one item....... The Danish MDADI is reliable in terms of internal consistency and test-retest reproducibility and can be used in assessing the health-related quality of life in head and neck cancer patients with dysphagia....
The Persian version of auditory word discrimination test (P-AWDT) for children: Development, validity, and reliability.

Science.gov (United States)

Hashemi, Nassim; Ghorbani, Ali; Soleymani, Zahra; Kamali, Mohmmad; Ahmadi, Zohreh Ziatabar; Mahmoudian, Saeid

2018-07-01

Auditory discrimination of speech sounds is an important perceptual ability and a precursor to the acquisition of language. Auditory information is at least partially necessary for the acquisition and organization of phonological rules. There are few standardized behavioral tests to evaluate phonemic distinctive features in children with or without speech and language disorders. The main objective of the present study was to develop the Persian version of the auditory word discrimination test (P-AWDT) for 4-8-year-old children and to assess its validity and reliability. A total of 120 typical children and 40 children with speech sound disorder (SSD) participated in the present study. The test comprised 160 monosyllabic paired words distributed across Forms A-1 and A-2 for the initial consonants (80 words) and Forms B-1 and B-2 for the final consonants (80 words). Moreover, the discrimination of vowels was randomly included in all forms. Content validity was calculated, and 50 children took the test twice, two weeks apart (test-retest reliability). Further analyses included validity, intraclass correlation coefficient (ICC), Cronbach's alpha (internal consistency), age groups, and gender. The content validity index (CVI) and the test-retest reliability of the P-AWDT were 63%-86% and 81%-96%, respectively. Moreover, the total Cronbach's alpha for internal consistency was relatively high (0.93). Comparison of the mean scores of the P-AWDT in the typical children and the children with SSD revealed a significant difference. The results revealed that the group with SSD had a more severe deficit in auditory word discrimination than the typical group. In addition, the difference between the age groups was statistically significant, especially in 4-4.11-year-old children. The performance of the two gender groups was relatively similar. The comparison of the P-AWDT scores between the typical children
Development of an instrument based on the protection motivation theory to measure factors influencing women's intention to first pap test practice.

Science.gov (United States)

Hassani, Lale; Dehdari, Tahereh; Hajizadeh, Ebrahim; Shojaeizadeh, Davoud; Abedini, Mehrandokht; Nedjat, Saharnaz

2014-01-01

Given that there are many Iranian women who have never had a Pap smear, this study was designed to develop and validate a measurement tool based on the Protection Motivation Theory to assess factors influencing Iranian women's intention to undergo a first Pap test. In this psychometric research, a panel of experts (n=10) reviewed the scale items to determine the Content Validity Index (CVI) and the Content Validity Ratio (CVR). Reliability was estimated through the Intraclass Correlation Coefficient (n=30) and internal consistency (n=240). Also, factor analysis (exploratory and confirmatory) was performed on the data of the sample of women who had never had a Pap smear test (n=240). A 26-item questionnaire was developed. The CVI and CVR scores of the scale were 0.89 and 0.90, respectively. Exploratory factor analysis yielded a 26-item questionnaire with seven factors (perceived vulnerability and severity, fear, response costs, response efficacy, self-efficacy, and protection motivation (or intention)) that jointly accounted for 72.76% of the observed variance. Confirmatory factor analysis indicated a good fit for the data. Internal consistency (range 0.70-0.93) and test-retest reliability (range 0.72-0.96) of the sub-scales were acceptable. This study showed that the designed instrument was a valid and reliable tool for measuring the factors influencing women's intention to undergo their first Pap test.
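Note on the content-validity figures: the abstract above does not state which formulas were used for the CVR and CVI. A minimal sketch of two commonly used definitions, Lawshe's content validity ratio and an item-level CVI on a 4-point relevance scale, might look like the following. The function names, the 4-point scale, and the example ratings are illustrative assumptions, not details taken from the study.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR: (n_e - N/2) / (N/2), ranging from -1 to 1."""
    half = n_experts / 2
    return (n_essential - half) / half


def item_cvi(ratings, relevant_from=3):
    """Item-level CVI: share of experts rating the item as relevant.

    Assumes a 4-point relevance scale where 3 and 4 count as relevant.
    """
    return sum(1 for r in ratings if r >= relevant_from) / len(ratings)


def scale_cvi_average(ratings_per_item):
    """Scale-level CVI as the mean of the item-level CVIs."""
    values = [item_cvi(r) for r in ratings_per_item]
    return sum(values) / len(values)


# Hypothetical panel of 10 experts, 9 of whom rate an item as essential.
print(content_validity_ratio(9, 10))
# Hypothetical relevance ratings from the same 10 experts for one item.
print(item_cvi([4, 4, 3, 2, 4, 3, 4, 4, 3, 4]))
```

With ten experts, as in the abstract, a single dissenting rating moves an item's CVR from 1.0 to 0.8, which is one reason small panels produce coarse-grained values.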
Determining the Feasibility, Content Validity, and Internal Consistency of a Newly Developed Care Coordination Scale for People with Brain Injury

Directory of Open Access Journals (Sweden)

Brian P. Johnson

2017-07-01

Background: With the increasing complexity of care, people with disabilities and supportive significant others (SSO) must often coordinate key aspects of their own care, but no validated scale currently exists to comprehensively characterize the activities done to manage and coordinate their care. Method: This study aimed to improve the feasibility, acceptability, and content validity of the Care and Service Coordination and Management (CASCAM) scale and to test its internal consistency. Questionnaire items were administered to 23 individuals with acquired brain injury and 17 SSO. Results: Respondents confirmed content validity and that the instrument addresses important care coordination and management issues. The internal consistency of care coordination domains for medical/rehabilitative and independent living needs for people with brain injury and their SSO ranged from α = .774 to .945. Conclusion: Care coordination activities by persons with disabilities, including brain injury, and their SSO are multifaceted but feasibly measurable and should be assessed to improve care.
The Clinical Impression of Severity Index for Parkinson's Disease: international validation study.

Science.gov (United States)

Martínez-Martín, Pablo; Rodríguez-Blázquez, Carmen; Forjaz, Maria João; de Pedro, Jesús

2009-01-30

This study sought to provide further information about the psychometric properties of the Clinical Impression of Severity Index for Parkinson's Disease (CISI-PD), in a large, international, cross-culturally diverse sample. Six hundred and fourteen patients with PD participated in the study. Apart from the CISI-PD, assessments were based on Hoehn & Yahr (HY) staging, the Scales for Outcomes in PD-Motor (SCOPA-M), -Cognition (SCOPA-COG) and -Psychosocial (SCOPA-PS), the Cumulative Illness Rating Scale-Geriatrics, and the Hospital Anxiety and Depression Scale. The total CISI-PD score displayed no floor or ceiling effects. Internal consistency was 0.81, the test-retest intraclass correlation coefficient was 0.84, and item homogeneity was 0.52. Exploratory and confirmatory factor analysis (CFI = 0.99, RMSEA = 0.07) confirmed CISI-PD's unifactorial structure. The CISI-PD showed adequate convergent validity with SCOPA-COG and SCOPA-M (r(S) = 0.46-0.85, respectively) and discriminative validity for HY stages and disease duration (P validation study, thus showing that the CISI-PD is a valid instrument to measure clinical impression of severity in PD. Its simplicity and easy application make it an attractive and useful tool for clinical practice and research.
Polish adaptation of three self-report measures of job stressors: the Interpersonal Conflict at Work Scale, the Quantitative Workload Inventory and the Organizational Constraints Scale.

Science.gov (United States)

Baka, Łukasz; Bazińska, Róża

2016-01-01

The objective of the present study was to test the psychometric properties, reliability and validity of three job stressor measures, namely, the Interpersonal Conflict at Work Scale, the Organizational Constraints Scale and the Quantitative Workload Inventory. The study was conducted on two samples (N = 382 and 3368) representing a wide range of occupations. The estimation of internal consistency with Cronbach's α and the test-retest method as well as both exploratory and confirmatory factor analyses were the main statistical methods. The internal consistency of the scales proved satisfactory, ranging from 0.80 to 0.90 for Cronbach's α test and from 0.72 to 0.86 for the test-retest method. The one-dimensional structure of the three measurements was confirmed. The three scales have acceptable fit to the data. The one-factor structures and other psychometric properties of the Polish version of the scales seem to be similar to those found in the US version of the scales. It was also proved that the three job stressors are positively related to all the job strain measures. The Polish versions of the three analysed scales can be used to measure the job stressors in Polish conditions.
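Note on the two reliability estimates reported above: Cronbach's α is computed from a single administration, while the test-retest figure compares two administrations. A minimal sketch of both, assuming the standard α formula and a Pearson correlation for the retest coefficient (the abstract does not say which coefficient was used), is given below. All data and function names are hypothetical.

```python
from math import sqrt
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for rows of respondents and columns of items.

    alpha = k/(k-1) * (1 - sum of item variances / variance of total scores)
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    items = list(zip(*scores))  # transpose: one tuple per item
    item_var_sum = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var_sum / total_var)


def pearson_r(x, y):
    """Pearson correlation between two equally long score lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Hypothetical responses: 4 respondents, 2 items.
responses = [[2, 3], [4, 4], [3, 5], [5, 5]]
print(round(cronbach_alpha(responses), 3))

# Hypothetical total scores at test and retest for the same respondents.
print(round(pearson_r([5, 8, 8, 10], [6, 8, 9, 10]), 3))
```

The two numbers answer different questions: α reflects how strongly the items covary within one sitting, whereas the retest correlation reflects how stable the rank order of respondents is over time.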
Translation, cultural adaptation and validation of the Diabetes Attitudes Scale - third version into Brazilian Portuguese

Science.gov (United States)

Vieira, Gisele de Lacerda Chaves; Pagano, Adriana Silvino; Reis, Ilka Afonso; Rodrigues, Júlia Santos Nunes; Torres, Heloísa de Carvalho

2018-01-01

ABSTRACT Objective: to perform the translation, adaptation and validation of the Diabetes Attitudes Scale - third version instrument into Brazilian Portuguese. Methods: methodological study carried out in six stages: initial translation, synthesis of the initial translation, back-translation, evaluation of the translated version by the Committee of Judges (27 Linguists and 29 health professionals), pre-test and validation. The pre-test and validation (test-retest) steps included 22 and 120 health professionals, respectively. The Content Validity Index, the analyses of internal consistency and reproducibility were performed using the R statistical program. Results: in the content validation, the instrument presented good acceptance among the Judges with a mean Content Validity Index of 0.94. The scale presented acceptable internal consistency (Cronbach’s alpha = 0.60), while the correlation of the total score at the test and retest moments was considered high (Polychoric Correlation Coefficient = 0.86). The Intra-class Correlation Coefficient, for the total score, presented a value of 0.65. Conclusion: the Brazilian version of the instrument (Escala de Atitudes dos Profissionais em relação ao Diabetes Mellitus) was considered valid and reliable for application by health professionals in Brazil. PMID:29319739
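Note on the intraclass correlation: the abstract above reports both a correlation and an ICC for the total score but does not say which ICC form was computed (the study itself used R). As an illustration only, the sketch below computes two common single-measure forms from a two-way ANOVA decomposition: ICC(3,1) for consistency and ICC(2,1) for absolute agreement. The function name and the data are hypothetical.

```python
def icc_two_way(data):
    """Return (ICC(2,1), ICC(3,1)) for rows of subjects, columns of sessions."""
    n, k = len(data), len(data[0])
    grand = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]
    col_means = [sum(row[j] for row in data) / n for j in range(k)]

    # Sums of squares for subjects (rows), sessions (columns), and residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in data for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    # Absolute agreement: systematic session differences lower the value.
    icc_agreement = (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )
    # Consistency: systematic session differences are ignored.
    icc_consistency = (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)
    return icc_agreement, icc_consistency


# Hypothetical scores: every subject scores exactly 1 point higher at retest.
scores = [[1, 2], [2, 3], [3, 4]]
agreement, consistency = icc_two_way(scores)
print(round(agreement, 3), round(consistency, 3))
```

In this toy data set the rank order is perfectly preserved, so the consistency form equals 1, while the uniform shift between sessions pulls the agreement form down. This shows in general terms how an agreement-type ICC can sit below a correlation computed on the same scores; it is not a claim about the cause of the difference reported in the study.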
Technical Analysis of the Slosson-Diagnostic Math Screener (S-DMS)

Science.gov (United States)

Erford, Bradley T.; Klein, Lauren

2007-01-01

The Slosson-Diagnostic Math Screener (S-DMS) was designed to help identify students in Grades 1 to 8 at risk for mathematics failure. Internal consistency, test-retest reliability, item analysis, decision efficiency, convergent validity, and factorial validity of all five levels of the S-DMS were studied using 20 independent samples of students…
Further Validation of the Learning Alliance Inventory: The Roles of Working Alliance, Rapport, and Immediacy in Student Learning

Science.gov (United States)

Rogers, Daniel T.

2015-01-01

This study further examined the reliability and validity of the Learning Alliance Inventory (LAI), a self-report measure designed to assess the working alliance between a student and a teacher. The LAI was found to have good internal consistency and test-retest reliability, and it demonstrated the predicted convergence with measures of immediacy…
