reliability intraclass correlation: Topics by WorldWideScience.org

Sample records for reliability intraclass correlation

A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.

Science.gov (United States)

Koo, Terry K; Li, Mae Y

2016-06-01

Intraclass correlation coefficient (ICC) is a widely used reliability index in test-retest, intrarater, and interrater reliability analyses. This article introduces the basic concept of ICC in the content of reliability analysis. There are 10 forms of ICCs. Because each form involves distinct assumptions in their calculation and will lead to different interpretations, researchers should explicitly specify the ICC form they used in their calculation. A thorough review of the research design is needed in selecting the appropriate form of ICC to evaluate reliability. The best practice of reporting ICC should include software information, "model," "type," and "definition" selections. When coming across an article that includes ICC, readers should first check whether information about the ICC form has been reported and if an appropriate ICC form was used. Based on the 95% confident interval of the ICC estimate, values less than 0.5, between 0.5 and 0.75, between 0.75 and 0.9, and greater than 0.90 are indicative of poor, moderate, good, and excellent reliability, respectively. This article provides a practical guideline for clinical researchers to choose the correct form of ICC and suggests the best practice of reporting ICC parameters in scientific publications. This article also gives readers an appreciation for what to look for when coming across ICC while reading an article.
A comparison of two indices for the intraclass correlation coefficient.

Science.gov (United States)

Shieh, Gwowen

2012-12-01

In the present study, we examined the behavior of two indices for measuring the intraclass correlation in the one-way random effects model: the prevailing ICC(1) (Fisher, 1938) and the corrected eta-squared (Bliese & Halverson, 1998). These two procedures differ both in their methods of estimating the variance components that define the intraclass correlation coefficient and in their performance of bias and mean squared error in the estimation of the intraclass correlation coefficient. In contrast with the natural unbiased principle used to construct ICC(1), in the present study it was analytically shown that the corrected eta-squared estimator is identical to the maximum likelihood estimator and the pairwise estimator under equal group sizes. Moreover, the empirical results obtained from the present Monte Carlo simulation study across various group structures revealed the mutual dominance relationship between their truncated versions for negative values. The corrected eta-squared estimator performs better than the ICC(1) estimator when the underlying population intraclass correlation coefficient is small. Conversely, ICC(1) has a clear advantage over the corrected eta-squared for medium and large magnitudes of population intraclass correlation coefficient. The conceptual description and numerical investigation provide guidelines to help researchers choose between the two indices for more accurate reliability analysis in multilevel research.
Choosing the best index for the average score intraclass correlation coefficient.

Science.gov (United States)

Shieh, Gwowen

2016-09-01

The intraclass correlation coefficient (ICC)(2) index from a one-way random effects model is widely used to describe the reliability of mean ratings in behavioral, educational, and psychological research. Despite its apparent utility, the essential property of ICC(2) as a point estimator of the average score intraclass correlation coefficient is seldom mentioned. This article considers several potential measures and compares their performance with ICC(2). Analytical derivations and numerical examinations are presented to assess the bias and mean square error of the alternative estimators. The results suggest that more advantageous indices can be recommended over ICC(2) for their theoretical implication and computational ease.
Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies.

Science.gov (United States)

Mehta, Shraddha; Bastero-Caballero, Rowena F; Sun, Yijun; Zhu, Ray; Murphy, Diane K; Hardas, Bhushan; Koch, Gary

2018-04-29

Many published scale validation studies determine inter-rater reliability using the intra-class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how interaction of subject distribution, sample size, and levels of rater disagreement affects ICC and provides an approach for obtaining relevant ICC estimates under suboptimal conditions. Simulation results suggest that for a fixed number of subjects, ICC from the convex distribution is smaller than ICC for the uniform distribution, which in turn is smaller than ICC for the concave distribution. The variance component estimates also show that the dissimilarity of ICC among distributions is attributed to the study design (ie, distribution of subjects) component of subject variability and not the scale quality component of rater error variability. The dependency of ICC on the distribution of subjects makes it difficult to compare results across reliability studies. Hence, it is proposed that reliability studies should be designed using a uniform distribution of subjects because of the standardization it provides for representing objective disagreement. In the absence of uniform distribution, a sampling method is proposed to reduce the non-uniformity. In addition, as expected, high levels of disagreement result in low ICC, and when the type of distribution is fixed, any increase in the number of subjects beyond a moderately large specification such as n = 80 does not have a major impact on ICC. Copyright © 2018 John Wiley & Sons, Ltd.
Reliability of environmental sampling culture results using the negative binomial intraclass correlation coefficient.

Science.gov (United States)

Aly, Sharif S; Zhao, Jianyang; Li, Ben; Jiang, Jiming

2014-01-01

The Intraclass Correlation Coefficient (ICC) is commonly used to estimate the similarity between quantitative measures obtained from different sources. Overdispersed data is traditionally transformed so that linear mixed model (LMM) based ICC can be estimated. A common transformation used is the natural logarithm. The reliability of environmental sampling of fecal slurry on freestall pens has been estimated for Mycobacterium avium subsp. paratuberculosis using the natural logarithm transformed culture results. Recently, the negative binomial ICC was defined based on a generalized linear mixed model for negative binomial distributed data. The current study reports on the negative binomial ICC estimate which includes fixed effects using culture results of environmental samples. Simulations using a wide variety of inputs and negative binomial distribution parameters (r; p) showed better performance of the new negative binomial ICC compared to the ICC based on LMM even when negative binomial data was logarithm, and square root transformed. A second comparison that targeted a wider range of ICC values showed that the mean of estimated ICC closely approximated the true ICC.
The Children's Global Assessment Scale (CGAS) and Global Assessment of Psychosocial Disability (GAPD) in clinical practice--substance and reliability as judged by intraclass correlations

DEFF Research Database (Denmark)

Dyrborg, J; Larsen, F W; Nielsen, S

2000-01-01

Studies on the inter-rater reliability on the Children's Global Assessment Scale (CGAS) and the Global Assessment of Psychosocial Disability (GAPD) involving different subgroups of 145 outpatients from 4 to 16 years of age showed fair to substantial intraclass correlations of 0.59 to 0.90. Raters...
Exact Distributions of Intraclass Correlation and Cronbach's Alpha with Gaussian Data and General Covariance

Science.gov (United States)

Kistner, Emily O.; Muller, Keith E.

2004-01-01

Intraclass correlation and Cronbach's alpha are widely used to describe reliability of tests and measurements. Even with Gaussian data, exact distributions are known only for compound symmetric covariance (equal variances and equal correlations). Recently, large sample Gaussian approximations were derived for the distribution functions. New exact…
Intraclass reliability of the Alberta Infant Motor Scale in the Brazilian version

Directory of Open Access Journals (Sweden)

Larissa Paiva Silva

2013-10-01

Full Text Available This study had as its objective to analyze the intraclass reliability of the Alberta Infant Motor Scale (AIMS, in the Brazilian version, in preterm and term infants. It was a methodological study, conducted from November 2009 to April 2010, with 50 children receiving care in two public institutions in Fortaleza, Ceará, Brazil. Children were grouped according to gestational age as preterm and term, and evaluated by three evaluators in the communication laboratory of a public institution or at home. The intraclass correlation indices for the categories prone, supine, sitting and standing ranged from 0.553 to 0.952; most remained above 0.800, except for the standing category of the third evaluator, in which the index was 0.553. As for the total score and percentile, rates ranged from 0.843 to 0.954. The scale proved to be a reliable instrument for assessing gross motor performance of Brazilian children, particularly in Ceará, regardless of gestational age at birth.
Pitfalls and important issues in testing reliability using intraclass correlation coefficients in orthopaedic research.

Science.gov (United States)

Lee, Kyoung Min; Lee, Jaebong; Chung, Chin Youb; Ahn, Soyeon; Sung, Ki Hyuk; Kim, Tae Won; Lee, Hui Jong; Park, Moon Seok

2012-06-01

Intra-class correlation coefficients (ICCs) provide a statistical means of testing the reliability. However, their interpretation is not well documented in the orthopedic field. The purpose of this study was to investigate the use of ICCs in the orthopedic literature and to demonstrate pitfalls regarding their use. First, orthopedic articles that used ICCs were retrieved from the Pubmed database, and journal demography, ICC models and concurrent statistics used were evaluated. Second, reliability test was performed on three common physical examinations in cerebral palsy, namely, the Thomas test, the Staheli test, and popliteal angle measurement. Thirty patients were assessed by three orthopedic surgeons to explore the statistical methods testing reliability. Third, the factors affecting the ICC values were examined by simulating the data sets based on the physical examination data where the ranges, slopes, and interobserver variability were modified. Of the 92 orthopedic articles identified, 58 articles (63%) did not clarify the ICC model used, and only 5 articles (5%) described all models, types, and measures. In reliability testing, although the popliteal angle showed a larger mean absolute difference than the Thomas test and the Staheli test, the ICC of popliteal angle was higher, which was believed to be contrary to the context of measurement. In addition, the ICC values were affected by the model, type, and measures used. In simulated data sets, the ICC showed higher values when the range of data sets were larger, the slopes of the data sets were parallel, and the interobserver variability was smaller. Care should be taken when interpreting the absolute ICC values, i.e., a higher ICC does not necessarily mean less variability because the ICC values can also be affected by various factors. The authors recommend that researchers clarify ICC models used and ICC values are interpreted in the context of measurement.
Tutorial on use of intraclass correlation coefficients for assessing intertest reliability and its application in functional near-infrared spectroscopy-based brain imaging.

Science.gov (United States)

Li, Lin; Zeng, Li; Lin, Zi-Jing; Cazzell, Mary; Liu, Hanli

2015-05-01

Test-retest reliability of neuroimaging measurements is an important concern in the investigation of cognitive functions in the human brain. To date, intraclass correlation coefficients (ICCs), originally used in interrater reliability studies in behavioral sciences, have become commonly used metrics in reliability studies on neuroimaging and functional near-infrared spectroscopy (fNIRS). However, as there are six popular forms of ICC, the adequateness of the comprehensive understanding of ICCs will affect how one may appropriately select, use, and interpret ICCs toward a reliability study. We first offer a brief review and tutorial on the statistical rationale of ICCs, including their underlying analysis of variance models and technical definitions, in the context of assessment on intertest reliability. Second, we provide general guidelines on the selection and interpretation of ICCs. Third, we illustrate the proposed approach by using an actual research study to assess interest reliability of fNIRS-based, volumetric diffuse optical tomography of brain activities stimulated by a risk decision-making protocol. Last, special issues that may arise in reliability assessment using ICCs are discussed and solutions are suggested.
Tutorial on use of intraclass correlation coefficients for assessing intertest reliability and its application in functional near-infrared spectroscopy-based brain imaging

Science.gov (United States)

Li, Lin; Zeng, Li; Lin, Zi-Jing; Cazzell, Mary; Liu, Hanli

2015-05-01

Test-retest reliability of neuroimaging measurements is an important concern in the investigation of cognitive functions in the human brain. To date, intraclass correlation coefficients (ICCs), originally used in inter-rater reliability studies in behavioral sciences, have become commonly used metrics in reliability studies on neuroimaging and functional near-infrared spectroscopy (fNIRS). However, as there are six popular forms of ICC, the adequateness of the comprehensive understanding of ICCs will affect how one may appropriately select, use, and interpret ICCs toward a reliability study. We first offer a brief review and tutorial on the statistical rationale of ICCs, including their underlying analysis of variance models and technical definitions, in the context of assessment on intertest reliability. Second, we provide general guidelines on the selection and interpretation of ICCs. Third, we illustrate the proposed approach by using an actual research study to assess intertest reliability of fNIRS-based, volumetric diffuse optical tomography of brain activities stimulated by a risk decision-making protocol. Last, special issues that may arise in reliability assessment using ICCs are discussed and solutions are suggested.
Intraclass Correlation Coefficients in Hierarchical Designs: Evaluation Using Latent Variable Modeling

Science.gov (United States)

Raykov, Tenko

2011-01-01

Interval estimation of intraclass correlation coefficients in hierarchical designs is discussed within a latent variable modeling framework. A method accomplishing this aim is outlined, which is applicable in two-level studies where participants (or generally lower-order units) are clustered within higher-order units. The procedure can also be…
Estimating a graphical intra-class correlation coefficient (GICC) using multivariate probit-linear mixed models.

Science.gov (United States)

Yue, Chen; Chen, Shaojie; Sair, Haris I; Airan, Raag; Caffo, Brian S

2015-09-01

Data reproducibility is a critical issue in all scientific experiments. In this manuscript, the problem of quantifying the reproducibility of graphical measurements is considered. The image intra-class correlation coefficient (I2C2) is generalized and the graphical intra-class correlation coefficient (GICC) is proposed for such purpose. The concept for GICC is based on multivariate probit-linear mixed effect models. A Markov Chain Monte Carlo EM (mcm-cEM) algorithm is used for estimating the GICC. Simulation results with varied settings are demonstrated and our method is applied to the KIRBY21 test-retest dataset.
A comparison of confidence interval methods for the intraclass correlation coefficient in community-based cluster randomization trials with a binary outcome.

Science.gov (United States)

Braschel, Melissa C; Svec, Ivana; Darlington, Gerarda A; Donner, Allan

2016-04-01

Many investigators rely on previously published point estimates of the intraclass correlation coefficient rather than on their associated confidence intervals to determine the required size of a newly planned cluster randomized trial. Although confidence interval methods for the intraclass correlation coefficient that can be applied to community-based trials have been developed for a continuous outcome variable, fewer methods exist for a binary outcome variable. The aim of this study is to evaluate confidence interval methods for the intraclass correlation coefficient applied to binary outcomes in community intervention trials enrolling a small number of large clusters. Existing methods for confidence interval construction are examined and compared to a new ad hoc approach based on dividing clusters into a large number of smaller sub-clusters and subsequently applying existing methods to the resulting data. Monte Carlo simulation is used to assess the width and coverage of confidence intervals for the intraclass correlation coefficient based on Smith's large sample approximation of the standard error of the one-way analysis of variance estimator, an inverted modified Wald test for the Fleiss-Cuzick estimator, and intervals constructed using a bootstrap-t applied to a variance-stabilizing transformation of the intraclass correlation coefficient estimate. In addition, a new approach is applied in which clusters are randomly divided into a large number of smaller sub-clusters with the same methods applied to these data (with the exception of the bootstrap-t interval, which assumes large cluster sizes). These methods are also applied to a cluster randomized trial on adolescent tobacco use for illustration. When applied to a binary outcome variable in a small number of large clusters, existing confidence interval methods for the intraclass correlation coefficient provide poor coverage. However, confidence intervals constructed using the new approach combined with Smith
Test-Retest Reliability of the Salutogenic Wellness Promotion Scale (SWPS)

Science.gov (United States)

Anderson, L. M.; Moore, J. B.; Hayden, B. M.; Becker, C. M.

2014-01-01

Objective: This study examined the temporal stability (i.e. test-retest reliability) of the Salutogenic Wellness Promotion Scale (SWPS) using intraclass correlation coefficients (ICC). Current intraclass results were also compared to previously published interclass correlations to support the use of the intraclass method for test-retest…
MIDAS and HIT-6 French translation: reliability and correlation between tests.

Science.gov (United States)

Magnoux, E; Freeman, M A; Zlotnik, G

2008-01-01

, with a median of 19 days. The Shrout-Fleiss intraclass correlation coefficients for MIDAS and HIT-6 were, respectively, 0.76 and 0.77 for episodic headaches and 0.83 and 0.80 for chronic headaches. The Pearson correlation coefficient between the MIDAS and HIT-6 questionnaires was 0.48 for episodic headaches and 0.58 for chronic headaches at the first compilation and 0.42 and 0.59 at the second compilation. The test-retest intraclass correlation of the French versions for both MIDAS and HIT-6 questionnaires indicates moderate reliability for episodic headache and substantial reliability for chronic headache. The correlation between the MIDAS and HIT-6 questionnaires is weak for episodic headaches, but approaches a level of 'good' for chronic headaches.
Intraclass Correlation Coefficients in Hierarchical Design Studies with Discrete Response Variables: A Note on a Direct Interval Estimation Procedure

Science.gov (United States)

Raykov, Tenko; Marcoulides, George A.

2015-01-01

A latent variable modeling procedure that can be used to evaluate intraclass correlation coefficients in two-level settings with discrete response variables is discussed. The approach is readily applied when the purpose is to furnish confidence intervals at prespecified confidence levels for these coefficients in setups with binary or ordinal…
The intraclass correlation coefficient applied for evaluation of data correction, labeling methods and rectal biopsy sampling in DNA microarray experiments

NARCIS (Netherlands)

Pellis, E.P.M.; Franssen-Hal, van N.L.W.; Burema, J.; Keijer, J.

2003-01-01

We show that the intraclass correlation coefficient (ICC) can be used as a relatively simple statistical measure to assess methodological and biological variation in DNA microarray analysis. The ICC is a measure that determines the reproducibility of a variable, which can easily be calculated from
Reliability of Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory in a test-retest design.

Science.gov (United States)

Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian

2014-02-01

The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of > or = .60. Cohen's kappa indi- cated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.
Intraclass correlation values for adolescent health outcomes in secondary schools in 21 European countries

Directory of Open Access Journals (Sweden)

N. Shackleton

2016-12-01

Full Text Available Background: Cluster randomised controlled trials (CRCTs are increasingly used to evaluate the effectiveness of interventions for improving health. A key feature of CRCTs is that individuals in clusters are often more alike than individuals in different clusters, irrespective of treatment. This similarity within clusters needs to be taken into account when planning CRCTs to obtain adequate sample sizes, and when analysing clustered data to obtain correct estimates. Methods: Nationally representative data from 15 to 16 year olds were analysed, from 21 of the 35 countries that participated in the 2007 European School Survey Project on Alcohol and Other Drugs. Within country school level intra-class correlation coefficients (ICCs were calculated for substance use (self-reported alcohol use, regular alcohol use, binge drinking, any smoking, regular smoking, and illicit drug use and psychosocial health (depressive mood and self-esteem. Unadjusted and adjusted ICCs are presented. ICCs are adjusted for student sex and socioeconomic status. Results: ICCs ranged from 0.01 to 0.21, with the highest (0.21 reported for regular smoking. Within country school level ICCs varied substantially across health outcomes, and among countries for the same health outcomes. Estimated ICCs were consistently higher for substance use (range 0.01–0.21, than for psychosocial health (range 0.01–0.07. Within country ICCs for health outcomes varied by changes in the measurement of particular health outcomes, for example the ICCs for regular smoking (range 0.06–0.21 were higher than those for having smoked at all in the last month (range 0.03–0.17. Conclusions: For school level ICCs to be effectively utilised in informing sample size requirements for CRCTs and adjusting estimates from meta-analyses, the school level ICCs need to be both country and outcome specific. Keywords: Intra-class correlation, Schools, Adolescents, Substance use, Mental health

Reliability of one-repetition maximum performance in people with chronic heart failure.

Science.gov (United States)

Ellis, Rachel; Holland, Anne E; Dodd, Karen; Shields, Nora

2018-02-24

Evaluate intra-rater and inter-rater reliability of the one-repetition maximum strength test in people with chronic heart failure. Intra-rater and inter-rater reliability study. A public tertiary hospital in northern metropolitan Melbourne. Twenty-four participants (nine female, mean age 71.8 ± 13.1 years) with mild to moderate heart failure of any aetiology. Lower limb strength was assessed by determining the maximum weight that could be lifted using a leg press. Intra-rater reliability was tested by one assessor on two separate occasions . Inter-rater reliability was tested by two assessors in random order. Intra-class correlation coefficients and 95% confidence intervals were calculated. Bland and Altman analyses were also conducted, including calculation of mean differences between measures ([Formula: see text]) and limits of agreement . Ten intra-rater and 21 inter-rater assessments were completed. Excellent intra-rater (intra-class correlation coefficient 2,1 0.96) and inter-rater (intra-class correlation coefficient 2,1 0.93) reliability was found. Intra-rater assessment showed less variability (mean difference 4.5 kg, limits of agreement -8.11 to 17.11 kg) than inter-rater agreement (mean difference -3.81 kg, limits of agreement -23.39 to 15.77 kg). One-repetition maximum determined using a leg press is a reliable measure in people with heart failure. Given its smaller limits of agreement, intra-rater testing is recommended. Implications for Rehabilitation Using a leg press to determine a one-repetition maximum we were able to demonstrate excellent inter-rater and intra-rater reliability using an intra-class correlation coefficient. The Bland and Altman levels of agreement were wide for inter-rater reliability and so we recommend using one assessor if measuring change in strength within an individual over time.
One Iota Fills the Quota: A Paradox in Multifacet Reliability Coefficients.

Science.gov (United States)

Conger, Anthony J.

1983-01-01

A paradoxical phenomenon of decreases in reliability as the number of elements averaged over increases is shown to be possible in multifacet reliability procedures (intraclass correlations or generalizability coefficients). Conditions governing this phenomenon are presented along with implications and cautions. (Author)
[The reliability of a questionnaire regarding Colombian children's physical activity].

Science.gov (United States)

Herazo-Beltrán, Aliz Y; Domínguez-Anaya, Regina

2012-10-01

Reporting the Physical Activity Questionnaire for school children's (PAQ-C) test-retest reliability and internal consistency. This was a descriptive study of 100 school-aged children aged 9 to 11 years old attending a school in Cartagena, Colombia. The sample was randomly selected. The PAQ-C was given twice, one week apart, after the informed consent forms had been signing by the children's parents and school officials. Cronbach's alpha coefficient of reliability was used for assessing internal consistency and an intra-class correlation coefficient for test-retest reliability SPSS (version 17.0) was used for statistical analysis. The questionnaire scored 0.73 internal consistencies during the first measurement and 0.78 on the second; intra-class correlation coefficient was 0.60. There were differences between boys and girls regarding both measurements. The PAQ-C had acceptable internal consistency and test-retest reliability, thereby making it useful for measuring children's self-reported physical activity and a valuable tool for population studies in Colombia.
Intra-class correlation estimates for assessment of vitamin A intake in children.

Science.gov (United States)

Agarwal, Girdhar G; Awasthi, Shally; Walter, Stephen D

2005-03-01

In many community-based surveys, multi-level sampling is inherent in the design. In the design of these studies, especially to calculate the appropriate sample size, investigators need good estimates of intra-class correlation coefficient (ICC), along with the cluster size, to adjust for variation inflation due to clustering at each level. The present study used data on the assessment of clinical vitamin A deficiency and intake of vitamin A-rich food in children in a district in India. For the survey, 16 households were sampled from 200 villages nested within eight randomly-selected blocks of the district. ICCs and components of variances were estimated from a three-level hierarchical random effects analysis of variance model. Estimates of ICCs and variance components were obtained at village and block levels. Between-cluster variation was evident at each level of clustering. In these estimates, ICCs were inversely related to cluster size, but the design effect could be substantial for large clusters. At the block level, most ICC estimates were below 0.07. At the village level, many ICC estimates ranged from 0.014 to 0.45. These estimates may provide useful information for the design of epidemiological studies in which the sampled (or allocated) units range in size from households to large administrative zones.
Reliability of Computerized Neurocognitive Tests for Concussion Assessment: A Meta-Analysis.

Science.gov (United States)

Farnsworth, James L; Dargo, Lucas; Ragan, Brian G; Kang, Minsoo

2017-09-01

Although widely used, computerized neurocognitive tests (CNTs) have been criticized because of low reliability and poor sensitivity. A systematic review was published summarizing the reliability of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores; however, this was limited to a single CNT. Expansion of the previous review to include additional CNTs and a meta-analysis is needed. Therefore, our purpose was to analyze reliability data for CNTs using meta-analysis and examine moderating factors that may influence reliability. A systematic literature search (key terms: reliability, computerized neurocognitive test, concussion) of electronic databases (MEDLINE, PubMed, Google Scholar, and SPORTDiscus) was conducted to identify relevant studies. Studies were included if they met all of the following criteria: used a test-retest design, involved at least 1 CNT, provided sufficient statistical data to allow for effect-size calculation, and were published in English. Two independent reviewers investigated each article to assess inclusion criteria. Eighteen studies involving 2674 participants were retained. Intraclass correlation coefficients were extracted to calculate effect sizes and determine overall reliability. The Fisher Z transformation adjusted for sampling error associated with averaging correlations. Moderator analyses were conducted to evaluate the effects of the length of the test-retest interval, intraclass correlation coefficient model selection, participant demographics, and study design on reliability. Heterogeneity was evaluated using the Cochran Q statistic. The proportion of acceptable outcomes was greatest for the Axon Sports CogState Test (75%) and lowest for the ImPACT (25%). Moderator analyses indicated that the type of intraclass correlation coefficient model used significantly influenced effect-size estimates, accounting for 17% of the variation in reliability. The Axon Sports CogState Test, which
Reliability and validity of the modifiable activity questionnaire for an Iranian urban adolescent population

Directory of Open Access Journals (Sweden)

Maryam Delshad

2015-01-01

Full Text Available Background: The purpose of this study was to evaluate the validity and reliability on the Persian translation of the Modifiable Activity Questionnaire (MAQ in a sample of Tehranian adolescents. Methods: Of a total of 52 subjects, a sub-sample of 40 participations (55.0% boys was used to assess the reliability and the validity of the physical activity questionnaire. The reliability of the two MAQs was calculated by intraclass correlation coefficients, and validation was evaluated using Pearson correlation coefficients to compare data between mean of the two MAQs and mean of four physical activity records. Results: Intraclass correlation coefficient was calculated to assess the reliability between two MAQs and the results of leisure time physical activity over the past year were 0.97. Pearson correlation coefficients between mean of two MAQs and mean of four physical activity records were 0.49 (P < 0.001, for leisure time physical activities. Conclusions: High reliability and relatively moderate validity were found for the Persian translation of the MAQ in a Tehranian adolescent population. Further studies with large sample size are suggested to assess the validity more precisely.
Complex versus Simple Modeling for DIF Detection: When the Intraclass Correlation Coefficient (?) of the Studied Item Is Less Than the ? of the Total Score

Science.gov (United States)

Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon

2014-01-01

Previous research has demonstrated that differential item functioning (DIF) methods that do not account for multilevel data structure could result in too frequent rejection of the null hypothesis (i.e., no DIF) when the intraclass correlation coefficient (?) of the studied item was the same as the ? of the total score. The current study extended…
Reliable computation from contextual correlations

Science.gov (United States)

Oestereich, André L.; Galvão, Ernesto F.

2017-12-01

An operational approach to the study of computation based on correlations considers black boxes with one-bit inputs and outputs, controlled by a limited classical computer capable only of performing sums modulo-two. In this setting, it was shown that noncontextual correlations do not provide any extra computational power, while contextual correlations were found to be necessary for the deterministic evaluation of nonlinear Boolean functions. Here we investigate the requirements for reliable computation in this setting; that is, the evaluation of any Boolean function with success probability bounded away from 1 /2 . We show that bipartite CHSH quantum correlations suffice for reliable computation. We also prove that an arbitrarily small violation of a multipartite Greenberger-Horne-Zeilinger noncontextuality inequality also suffices for reliable computation.
Face validity and reliability of a pictorial instrument for assessing fundamental movement skill perceived competence in young children.

Science.gov (United States)

Barnett, Lisa M; Ridgers, Nicola D; Zask, Avigdor; Salmon, Jo

2015-01-01

To determine reliability and face validity of an instrument to assess young children's perceived fundamental movement skill competence. Validation and reliability study. A pictorial instrument based on the Test Gross Motor Development-2 assessed perceived locomotor (six skills) and object control (six skills) competence using the format and item structure from the physical competence subscale of the Pictorial Scale of Perceived Competence and Acceptance for Young Children. Sample 1 completed object control items in May (n=32) and locomotor items in October 2012 (n=23) at two time points seven days apart. Children were asked at the end of the test-retest their understanding of what was happening in each picture to determine face validity. Sample 2 (n=58) completed 12 items in November 2012 on a single occasion to test internal reliability only. Sample 1 children were aged 5-7 years (M=6.0, SD=0.8) at object control assessment and 5-8 years at locomotor assessment (M=6.5, SD=0.9). Sample 2 children were aged 6-8 years (M=7.2, SD=0.73). Intra-class correlations assessed in Sample 1 children were excellent for object control (intra-class correlation=0.78), locomotor (intra-class correlation=0.82) and all 12 skills (intra-class correlations=0.83). Face validity was acceptable. Internal consistency was adequate in both samples for each subscale and all 12 skills (alpha range 0.60-0.81). This study has provided preliminary evidence for instrument reliability and face validity. This enables future alignment between the measurement of perceived and actual fundamental movement skill competence in young children. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.
Validity and test-retest reliability of a novel simple back extensor muscle strength test.

Science.gov (United States)

Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

2017-01-01

To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r = 0.824, p strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p strength ( p strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.
Test-retest reliability of the Progressive Isoinertial Lifting Evaluation (PILE).

Science.gov (United States)

Lygren, Hildegunn; Dragesund, Tove; Joensen, Jón; Ask, Tove; Moe-Nilssen, Rolf

2005-05-01

A repeated measures single group design. To investigate test-retest reliability of Progressive Isoinertial Lifting Evaluation on patients with long lasting musculoskeletal problems related to the lumbar spine. Test-retest reliability has been satisfactory in healthy men. Test-retest reliability for clinical populations has not been reported. A total of 31 patients (17 women and 14 men) with long lasting low back pain participated in the study. The patients were tested twice at an interval of 2 days and at the same time of the day. The heaviest load that the patient could lift 4 times was used as outcome measure. The error of measurement indicates that the true result in 95% of cases will be within +/-4.5 kg from the measured value, while the difference between 2 measurements in 95% of cases will be less than 6.4 kg. Intra-class correlation (1,1) was 0.91. Relative test-retest reliability was high assessed by intra-class correlation, but absolute measurement variability reported as the smallest detectable difference has relevance for the interpretation of clinical test results and should also be considered.
Understanding Intra-Class Knowledge Inside CNN

OpenAIRE

Wei, Donglai; Zhou, Bolei; Torrabla, Antonio; Freeman, William

2015-01-01

Convolutional Neural Network (CNN) has been successful in image recognition tasks, and recent works shed lights on how CNN separates different classes with the learned inter-class knowledge through visualization. In this work, we instead visualize the intra-class knowledge inside CNN to better understand how an object class is represented in the fully-connected layers. To invert the intra-class knowledge into more interpretable images, we propose a non-parametric patch prior upon previous CNN...
Neighborhood walkability scale (News - Brazil: Back translation and Reliability

Directory of Open Access Journals (Sweden)

Jorge Both

2007-12-01

Full Text Available Existem, no Brasil, poucos instrumentos para avaliar a relação entre o ambiente físico e a prática de atividades físicas. O objetivo do estudo foi analisar a tradução, a retradução e a reprodutibilidade do questionário NEWS (Neighborhood Environment Walkability Scale para o português do Brasil. Os procedimentos metodológicos foram estruturados em duas etapas. Primeiramente, efetuou-se a tradução e a retradução do NEWS com o intuito de verificar a linguagem do instrumento. Em seguida, realizou-se a reprodutibilidade do questionário por meio de teste e re-teste. A amostra desta pesquisa teve a participação de 75 pessoas (45 mulheres e 30 homens, com média de 33 ± 15 anos. A correlação intraclasse, a fidedignidade para as dimensões, o teste de correlação de Spearman e a correlação intraclasse para os indicadores de cada dimensão deste instrumento foram analisados com o auxílio do pacote estatístico SPSS (versão 11.0. O nível de significância adotado foi de p ABSTRACT In Brazil, there are few validated scales that establish the relationship between environmental barriers and physical activity. Therefore, the aim of this study was to analyze the translation, back translation and reliability of the Neighborhood Environment Walkability Scale (NEWS into Brazilian Portuguese. The methodological procedures were structured in two phases. The first phase was to translate and back translate NEWS to verify the instrument language. The second phase was he test and re-test reliability of the questionnaire. The sample was composed of 75 people (45 women and 30 men, mean age of 33 ± 15 years. The statistical analyses to verify the Brazilian NEWS were performed with the SPSS program (version 11.0 for intra-class correlation and reliability for the dimensions; Spearman correlation test and intra-class correlation for all indicators from this questionnaire. The significance level adopted in this survey was p<0.05. The results in
Reliability of a questionnaire on substance use among adolescent students, Brazil.

Science.gov (United States)

Machado Neto, Adelmo de Souza; Andrade, Tarcisio Matos; Fernandes, Gilênio Borges; Zacharias, Helder Paulo; Carvalho, Fernando Martins; Machado, Ana Paula Souza; Dias, Ana Carmen Costa; Garcia, Ana Carolina Rocha; Santana, Lauro Reis; Rolin, Carlos Eduardo; Sampaio, Cyntia; Ghiraldi, Gisele; Bastos, Francisco Inácio

2010-10-01

To analyze reliability of a self-applied questionnaire on substance use and misuse among adolescent students. Two cross-sectional studies were carried out for the instrument test-retest. The sample comprised male and female students aged 1119 years from public and private schools (elementary, middle, and high school students) in the city of Salvador, Northeastern Brazil, in 2006. A total of 591 questionnaires were applied in the test and 467 in the retest. Descriptive statistics, the Kappa index, Cronbach's alpha and intraclass correlation were estimated. The prevalence of substance use/misuse was similar in both test and retest. Sociodemographic variables showed a "moderate" to "almost perfect" agreement for the Kappa index, and a "satisfactory" (>0.75) consistency for Cronbach's alpha and intraclass correlation. The age which psychoactive substances (tobacco, alcohol, and cannabis) were first used and chronological age were similar in both studies. Test-retest reliability was found to be a good indicator of students' age of initiation and their patterns of substance use. The questionnaire reliability was found to be satisfactory in the population studied.
Reliability and Validity of a Chinese-Translated Version of a Pregnancy Physical Activity Questionnaire.

Science.gov (United States)

Xiang, Mi; Konishi, Massayuki; Hu, Huanhuan; Takahashi, Masaki; Fan, Wenbi; Nishimaki, Mio; Ando, Karina; Kim, Hyeon-Ki; Tabata, Hiroki; Arao, Takashi; Sakamoto, Shizuo

2016-09-01

Objectives The objectives of the present study were to translate the English version of the Pregnancy Physical Activity Questionnaire into Chinese (PPAQ-C) and to determine its reliability and validity for use by pregnant Chinese women. Methods The study included 224 pregnant women during their first, second, or third trimesters of pregnancy who completed the PPAQ-C on their first visit and wore a uniaxial accelerometer (Lifecorder; Suzuken Co. Ltd) for 7 days. One week after the first visit, we collected the data from the uniaxial accelerometer records, and the women were asked to complete the PPAQ-C again. Results We used intraclass correlation coefficients to determine the reliability of the PPAQ-C. The intraclass correlation coefficients were 0.77 for total activity (light and above), 0.76 for sedentary activity, 0.75 for light activity, 0.59 for moderate activity, and 0.28 for vigorous activity. The intraclass correlation coefficients were 0.74 for "household and caregiving", 0.75 for "occupational" activities, and 0.34 for "sports/exercise". Validity between the PPAQ-C and accelerometer data was determined by Spearman correlation coefficients. Although there were no significant correlations for moderate activity (r = 0.19, P > 0.05) or vigorous activity (r = 0.15, P > 0.05), there were significant correlations for total activity [light and above; r = 0.35, P < 0.01)] and for light activity (r = 0.33, P < 0.01). Conclusions for Practice The PPAQ-C is reliable and moderately accurate for measuring physical activity in pregnant Chinese women.
Reliability and comparison of acromion assessment techniques on x-ray and magnetic resonance imaging (reliability of acromion assessment techniques)

International Nuclear Information System (INIS)

Viskontas, D.G.; MacDermid, J.C.; Drosdowech, D.S.; Garvin, G.J.; Romano, W.M.; Faber, K.J.

2005-01-01

To determine the reliability and correlation of plain radiography and magnetic resonance imaging (MRI) in the assessment of acromion morphology. Materials and Methods: Acromion morphology was assessed using the lateral acromion angle (LAA) and the acromion-humeral interval (AHI). Thirty patients who had x-rays and MRI for impingement syndrome were included. Six blinded observers assessed the acromion morphology subjectively and objectively. Results: Neither acromion assessment technique demonstrated a positive correlation (kappa and intraclass coefficient 0.55) when measured objectively by experienced observers. Conclusion: The LAA and the AHI are both reliable acromion assessment techniques on X-ray and MRI when measured objectively and by experienced observers. (author)
Inter-rater reliability of the Sødring Motor Evaluation of Stroke patients (SMES).

Science.gov (United States)

Halsaa, K E; Sødring, K M; Bjelland, E; Finsrud, K; Bautz-Holter, E

1999-12-01

The Sødring Motor Evaluation of Stroke patients is an instrument for physiotherapists to evaluate motor function and activities in stroke patients. The rating reflects quality as well as quantity of the patient's unassisted performance within three domains: leg, arm and gross function. The inter-rater reliability of the method was studied in a sample of 30 patients admitted to a stroke rehabilitation unit. Three therapists were involved in the study; two therapists assessed the same patient on two consecutive days in a balanced design. Cohen's weighted kappa and McNemar's test of symmetry were used as measures of item reliability, and the intraclass correlation coefficient was used to express the reliability of the sumscores. For 24 out of 32 items the weighted kappa statistic was excellent (0.75-0.98), while 7 items had a kappa statistic within the range 0.53-0.74 (fair to good). The reliability of one item was poor (0.13). The intraclass correlation coefficient for the three sumscores was 0.97, 0.91 and 0.97. We conclude that the Sødring Motor Evaluation of Stroke patients is a reliable measure of motor function in stroke patients undergoing rehabilitation.
Validity and reliability of wii fit balance board for the assessment of balance of healthy young adults and the elderly.

Science.gov (United States)

Chang, Wen-Dien; Chang, Wan-Yi; Lee, Chia-Lun; Feng, Chi-Yen

2013-10-01

[Purpose] Balance is an integral part of human ability. The smart balance master system (SBM) is a balance test instrument with good reliability and validity, but it is expensive. Therefore, we modified a Wii Fit balance board, which is a convenient balance assessment tool, and analyzed its reliability and validity. [Subjects and Methods] We recruited 20 healthy young adults and 20 elderly people, and administered 3 balance tests. The correlation coefficient and intraclass correlation of both instruments were analyzed. [Results] There were no statistically significant differences in the 3 tests between the Wii Fit balance board and the SBM. The Wii Fit balance board had a good intraclass correlation (0.86-0.99) for the elderly people and positive correlations (r = 0.58-0.86) with the SBM. [Conclusions] The Wii Fit balance board is a balance assessment tool with good reliability and high validity for elderly people, and we recommend it as an alternative tool for assessing balance ability.
Intra-rater reliability of cervical sensory motor function and cervical reconstruction test in healthy subjects

Directory of Open Access Journals (Sweden)

Hatamvand S

2016-07-01

Full Text Available Impairment of cervicocephalic and head joint position sense has an important role in the recurrent and chronic of cervicocephalic pain. The various tools are suggested for evaluating the cervicocephalic joint position sense. Although reconstruction of cervical angle is a clinical criterion for measuring the cervicocephalic proprioception, the reliability of this method has not been completely accepted. The purpose of this study was to evaluate intra-rater reliability of cervical sensory motor function and cervical reconstruction test in healthy subjects. twenty four healthy subjects (25.70±6.08 y through simple non-probability sampling participated in this single-group repeatedmeasures reliability study. Participants were asked to relocate the neck, as accurately as possible, after full active cervical flexion, extension and rotation to the left and right sides. Five trials were performed for each movement. Laser pointer was used in head of patient. The distance between zero spot and joint position which patient had been reconstructed, was measured by centimeter. Intra-class correlation Coefficient (ICCs and Pearson's correlation coefficient test was used to determine intra-rater reliability of variables. The results showed that intra-class correlation Coefficient (ICCs values with 95% confidence interval (CI and the standard error of the measurement (SEM were good to excellent agreement for a single investigator between measurement occasions. Intra-class correlation Coefficient (ICCs values were obtained for flexion movement (ICCs:0.75, good, extension movement (ICCs:0.81, very good, right rotation (ICCs:0.64, good and left rotation (ICCs:0.64, good. The cervicocephalic relocation test to neutral head position by laser pointer is a reliable method to measure cervical sensory motor function. Therefore, it can be used for evaluating cervicocephalic proprioception of patient with cervicocephalic pain.
Reliability of the Brazilian version of the Physical Activity Checklist Interview in children.

Science.gov (United States)

Adami, Fernando; Cruciani, Fernanda; Douek, Michelle; Sewell, Carolina Dumit; Mariath, Aline Brandão; Hinnig, Patrícia de Fragas; Freaza, Silvia Rafaela Mascarenhas; Bergamaschi, Denise Pimentel

2011-04-01

To assess the reliability of the Lista de Atividades Físicas (Brazilian version of the Physical Activity Checklist Interview) in children. The study is part of a cross-cultural adaptation of the Physical Activity Checklist Interview, conducted with 83 school children aged between seven and ten years, enrolled between the 2nd and 5th grades of primary education in the city of São Paulo, Southeastern Brazil, in 2008. The questionnaire was responded by children through individual interviews. It is comprised of a list of 21 moderate to vigorous physical activities performed on the previous day, it is divided into periods (before, during and after school) and it has a section for interview assessment. This questionnaire enables the quantification of time spent in physical and sedentary activities and the total and weighed metabolic costs. Reliability was assessed by comparing two interviews conducted with a mean interval of three hours. For the interview assessment, data from the first interview and those from an external evaluator were compared. Bland-Altman's proposal, the intraclass correlation coefficient and Lin's concordance correlation coefficient were used to assess reliability. The intraclass correlation coefficient lower limits for the outcomes analyzed varied from 0.84 to 0.96. Precision and agreement varied between 0.83 and 0.97 and between 0.99 and 1, respectively. The line estimated from the pairs of values obtained in both interviews indicates high data precision. The interview item showing the poorest result was the ability to estimate time (fair in 27.7% of interviews). Interview assessment items showed intraclass correlation coefficients between 0.60 and 0.70, except for level of cooperation (0.46). The Brazilian version of the Physical Activity Checklist Interview shows high reliability to assess physical and sedentary activity on the previous day in children.

Development and reliability testing of a food store observation form.

Science.gov (United States)

Rimkus, Leah; Powell, Lisa M; Zenk, Shannon N; Han, Euna; Ohri-Vachaspati, Punam; Pugach, Oksana; Barker, Dianne C; Resnick, Elissa A; Quinn, Christopher M; Myllyluoma, Jaana; Chaloupka, Frank J

2013-01-01

To develop a reliable food store observational data collection instrument to be used for measuring product availability, pricing, and promotion. Observational data collection. A total of 120 food stores (26 supermarkets, 34 grocery stores, 54 gas/convenience stores, and 6 mass merchandise stores) in the Chicago metropolitan statistical area. Inter-rater reliability for product availability, pricing, and promotion measures on a food store observational data collection instrument. Cohen's kappa coefficient and proportion of overall agreement for dichotomous variables and intra-class correlation coefficient for continuous variables. Inter-rater reliability, as measured by average kappa coefficient, was 0.84 for food and beverage product availability measures, 0.80 for interior store characteristics, and 0.70 for exterior store characteristics. For continuous measures, average intra-class correlation coefficient was 0.82 for product pricing measures; 0.90 for counts of fresh, frozen, and canned fruit and vegetable options; and 0.85 for counts of advertisements on the store exterior and property. The vast majority of measures demonstrated substantial or almost perfect agreement. Although some items may require revision, results suggest that the instrument may be used to reliably measure the food store environment. Copyright © 2013 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Context-sensitive intra-class clustering

KAUST Repository

Yu, Yingwei; Gutierrez-Osuna, Ricardo; Choe, Yoonsuck

2014-01-01

This paper describes a new semi-supervised learning algorithm for intra-class clustering (ICC). ICC partitions each class into sub-classes in order to minimize overlap across clusters from different classes. This is achieved by allowing partitioning
Reliable scar scoring system to assess photographs of burn patients.

Science.gov (United States)

Mecott, Gabriel A; Finnerty, Celeste C; Herndon, David N; Al-Mousawi, Ahmed M; Branski, Ludwik K; Hegde, Sachin; Kraft, Robert; Williams, Felicia N; Maldonado, Susana A; Rivero, Haidy G; Rodriguez-Escobar, Noe; Jeschke, Marc G

2015-12-01

Several scar-scoring scales exist to clinically monitor burn scar development and maturation. Although scoring scars through direct clinical examination is ideal, scars must sometimes be scored from photographs. No scar scale currently exists for the latter purpose. We modified a previously described scar scale (Yeong et al., J Burn Care Rehabil 1997) and tested the reliability of this new scale in assessing burn scars from photographs. The new scale consisted of three parameters as follows: scar height, surface appearance, and color mismatch. Each parameter was assigned a score of 1 (best) to 4 (worst), generating a total score of 3-12. Five physicians with burns training scored 120 representative photographs using the original and modified scales. Reliability was analyzed using coefficient of agreement, Cronbach alpha, intraclass correlation coefficient, variance, and coefficient of variance. Analysis of variance was performed using the Kruskal-Wallis test. Color mismatch and scar height scores were validated by analyzing actual height and color differences. The intraclass correlation coefficient, the coefficient of agreement, and Cronbach alpha were higher for the modified scale than those of the original scale. The original scale produced more variance than that in the modified scale. Subanalysis demonstrated that, for all categories, the modified scale had greater correlation and reliability than the original scale. The correlation between color mismatch scores and actual color differences was 0.84 and between scar height scores and actual height was 0.81. The modified scar scale is a simple, reliable, and useful scale for evaluating photographs of burn patients. Copyright © 2015 Elsevier Inc. All rights reserved.
Test-retest and interrater reliability of the functional lower extremity evaluation.

Science.gov (United States)

Haitz, Karyn; Shultz, Rebecca; Hodgins, Melissa; Matheson, Gordon O

2014-12-01

Repeated-measures clinical measurement reliability study. To establish the reliability and face validity of the Functional Lower Extremity Evaluation (FLEE). The FLEE is a 45-minute battery of 8 standardized functional performance tests that measures 3 components of lower extremity function: control, power, and endurance. The reliability and normative values for the FLEE in healthy athletes are unknown. A face validity survey for the FLEE was sent to sports medicine personnel to evaluate the level of importance and frequency of clinical usage of each test included in the FLEE. The FLEE was then administered and rated for 40 uninjured athletes. To assess test-retest reliability, each athlete was tested twice, 1 week apart, by the same rater. To assess interrater reliability, 3 raters scored each athlete during 1 of the testing sessions. Intraclass correlation coefficients were used to assess the test-retest and interrater reliability of each of the FLEE tests. In the face validity survey, the FLEE tests were rated as highly important by 58% to 71% of respondents but frequently used by only 26% to 45% of respondents. Interrater reliability intraclass correlation coefficients ranged from 0.83 to 1.00, and test-retest reliability ranged from 0.71 to 0.95. The FLEE tests are considered clinically important for assessing lower extremity function by sports medicine personnel but are underused. The FLEE also is a reliable assessment tool. Future studies are required to determine if use of the FLEE to make return-to-play decisions may reduce reinjury rates.
Interrater and Intrarater Reliability of the Balance Computerized Adaptive Test in Patients With Stroke.

Science.gov (United States)

Chiang, Hsin-Yu; Lu, Wen-Shian; Yu, Wan-Hui; Hsueh, I-Ping; Hsieh, Ching-Lin

2018-04-11

To examine the interrater and intrarater reliability of the Balance Computerized Adaptive Test (Balance CAT) in patients with chronic stroke having a wide range of balance functions. Repeated assessments design (1wk apart). Seven teaching hospitals. A pooled sample (N=102) including 2 independent groups of outpatients (n=50 for the interrater reliability study; n=52 for the intrarater reliability study) with chronic stroke. Not applicable. Balance CAT. For the interrater reliability study, the values of intraclass correlation coefficient, minimal detectable change (MDC), and percentage of MDC (MDC%) for the Balance CAT were .84, 1.90, and 31.0%, respectively. For the intrarater reliability study, the values of intraclass correlation coefficient, MDC, and MDC% ranged from .89 to .91, from 1.14 to 1.26, and from 17.1% to 18.6%, respectively. The Balance CAT showed sufficient intrarater reliability in patients with chronic stroke having balance functions ranging from sitting with support to independent walking. Although the Balance CAT may have good interrater reliability, we found substantial random measurement error between different raters. Accordingly, if the Balance CAT is used as an outcome measure in clinical or research settings, same raters are suggested over different time points to ensure reliable assessments. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs.

Science.gov (United States)

Stolarova, Margarita; Wolf, Corinna; Rinker, Tanja; Brielmann, Aenne

2014-01-01

This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to dis-entangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent-teacher and 19 mother-father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent-teacher ratings of children's early vocabulary can achieve agreement and correlation comparable to those of mother-father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters' agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings.
How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs

Science.gov (United States)

Stolarova, Margarita; Wolf, Corinna; Rinker, Tanja; Brielmann, Aenne

2014-01-01

This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to dis-entangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent–teacher and 19 mother–father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent–teacher ratings of children's early vocabulary can achieve agreement and correlation comparable to those of mother–father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters' agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings. PMID:24994985
Dynamic control of the lumbopelvic complex; lack of reliability of established test procedures

DEFF Research Database (Denmark)

Henriksen, Marius; Lund, Hans; Bliddal, Henning

2007-01-01

used in order to account for learning effects. Intraclass correlation coefficients were low for the sitting (0.54) and supported standing positions (0.36). In the standing position, a significant difference between test and retest was observed (P = 0.003) and further reliability analysis was therefore...
How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs

Directory of Open Access Journals (Sweden)

Margarita eStolarova

2014-06-01

Full Text Available This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to dis-entangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire deve-loped for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent-teacher and 19 mother-father pairs collected for two-year-old children (12 bilingual are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC. Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent-teacher ratings of children’s early vocabulary can achieve agreement and correlation comparable to those of mother-father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters’ agreement. We conclude that future reports of agree-ment, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings.
Reliability of the Wii Balance Board in kayak.

Science.gov (United States)

Vando, Stefano; Laffaye, Guillaume; Masala, Daniele; Falese, Lavinia; Padulo, Johnny

2015-01-01

the seat of the kayaker represent the principal contact point to express mechanical Energy. therefore we investigated the reliability of the Wii Balance Board measures in the kayak vs. on the ground. Bland-Altman test showed a low systematic bias on the ground (2.85%) and in kayak (-2.13%) respectively; while 0.996 for Intra-class correlation coefficient. the Wii Balance Board is useful to assess postural sway in kayak.
Correlation and Reliability of Cervical Sagittal Alignment Parameters between Lateral Cervical Radiograph and Lateral Whole-Body EOS Stereoradiograph

Science.gov (United States)

Singhatanadgige, Weerasak; Kang, Daniel G.; Luksanapruksa, Panya; Peters, Colleen; Riew, K. Daniel

2015-01-01

Study Design Retrospective analysis. Objective To evaluate the correlation and reliability of cervical sagittal alignment parameters obtained from lateral cervical radiographs (XRs) compared with lateral whole-body stereoradiographs (SRs). Methods We evaluated adults with cervical deformity using both lateral XRs and lateral SRs obtained within 1 week of each other between 2010 and 2014. XR and SR images were measured by two independent spine surgeons using the following sagittal alignment parameters: C2–C7 sagittal Cobb angle (SCA), C2–C7 sagittal vertical axis (SVA), C1–C7 translational distance (C1–7), T1 slope (T1-S), neck tilt (NT), and thoracic inlet angle (TIA). Pearson correlation and paired t test were used for statistical analysis, with intra- and interrater reliability analyzed using intraclass correlation coefficient (ICC). Results A total of 35 patients were included in the study. We found excellent intrarater reliability for all sagittal alignment parameters in both the XR and SR groups with ICC ranging from 0.799 to 0.994 for XR and 0.791 to 0.995 for SR. Interrater reliability was also excellent for all parameters except NT and TIA, which had fair reliability. We also found excellent correlations between XR and SR measurements for most sagittal alignment parameters; SCA, SVA, and C1–C7 had r > 0.90, and only NT had r < 0.70. There was a significant difference between groups, with SR having lower measurements compared with XR for both SVA (0.68 cm lower, p < 0.001) and C1–C7 (1.02 cm lower, p < 0.001). There were no differences between groups for SCA, T1-S, NT, and TIA. Conclusion Whole-body stereoradiography appears to be a viable alternative for measuring cervical sagittal alignment parameters compared with standard radiography. XR and SR demonstrated excellent correlation for most sagittal alignment parameters except NT. However, SR had significantly lower average SVA and C1–C7 measurements than XR
A comparison of confidence interval methods for the concordance correlation coefficient and intraclass correlation coefficient with small number of raters.

Science.gov (United States)

Feng, Dai; Svetnik, Vladimir; Coimbra, Alexandre; Baumgartner, Richard

2014-01-01

The intraclass correlation coefficient (ICC) with fixed raters or, equivalently, the concordance correlation coefficient (CCC) for continuous outcomes is a widely accepted aggregate index of agreement in settings with small number of raters. Quantifying the precision of the CCC by constructing its confidence interval (CI) is important in early drug development applications, in particular in qualification of biomarker platforms. In recent years, there have been several new methods proposed for construction of CIs for the CCC, but their comprehensive comparison has not been attempted. The methods consisted of the delta method and jackknifing with and without Fisher's Z-transformation, respectively, and Bayesian methods with vague priors. In this study, we carried out a simulation study, with data simulated from multivariate normal as well as heavier tailed distribution (t-distribution with 5 degrees of freedom), to compare the state-of-the-art methods for assigning CI to the CCC. When the data are normally distributed, the jackknifing with Fisher's Z-transformation (JZ) tended to provide superior coverage and the difference between it and the closest competitor, the Bayesian method with the Jeffreys prior was in general minimal. For the nonnormal data, the jackknife methods, especially the JZ method, provided the coverage probabilities closest to the nominal in contrast to the others which yielded overly liberal coverage. Approaches based upon the delta method and Bayesian method with conjugate prior generally provided slightly narrower intervals and larger lower bounds than others, though this was offset by their poor coverage. Finally, we illustrated the utility of the CIs for the CCC in an example of a wake after sleep onset (WASO) biomarker, which is frequently used in clinical sleep studies of drugs for treatment of insomnia.
System Reliability Analysis Considering Correlation of Performances

Energy Technology Data Exchange (ETDEWEB)

Kim, Saekyeol; Lee, Tae Hee [Hanyang Univ., Seoul (Korea, Republic of); Lim, Woochul [Mando Corporation, Seongnam (Korea, Republic of)

2017-04-15

Reliability analysis of a mechanical system has been developed in order to consider the uncertainties in the product design that may occur from the tolerance of design variables, uncertainties of noise, environmental factors, and material properties. In most of the previous studies, the reliability was calculated independently for each performance of the system. However, the conventional methods cannot consider the correlation between the performances of the system that may lead to a difference between the reliability of the entire system and the reliability of the individual performance. In this paper, the joint probability density function (PDF) of the performances is modeled using a copula which takes into account the correlation between performances of the system. The system reliability is proposed as the integral of joint PDF of performances and is compared with the individual reliability of each performance by mathematical examples and two-bar truss example.
System Reliability Analysis Considering Correlation of Performances

International Nuclear Information System (INIS)

Kim, Saekyeol; Lee, Tae Hee; Lim, Woochul

2017-01-01

Reliability analysis of a mechanical system has been developed in order to consider the uncertainties in the product design that may occur from the tolerance of design variables, uncertainties of noise, environmental factors, and material properties. In most of the previous studies, the reliability was calculated independently for each performance of the system. However, the conventional methods cannot consider the correlation between the performances of the system that may lead to a difference between the reliability of the entire system and the reliability of the individual performance. In this paper, the joint probability density function (PDF) of the performances is modeled using a copula which takes into account the correlation between performances of the system. The system reliability is proposed as the integral of joint PDF of performances and is compared with the individual reliability of each performance by mathematical examples and two-bar truss example.
Validation and reliability of the Baecke questionnaire for the evaluation of habitual physical activity in adult men

Directory of Open Access Journals (Sweden)

Alex Antonio Florindo

2003-06-01

Full Text Available The aim of this study was to verify validity and reliability of the scores for physical exercise in leisure (PEL, leisure and locomotion activities (LLA, and total score (TS of the Baecke habitual physical activity questionnaire in adult males. Twenty-one students of Physical Education were evaluated. For validation, the maximum oxygen uptake (O2max and the decrease of the heart rate in percentile (%DHR were measured through the Cooper's 12-minute walk or run test, and an annual index of physical exercise (IPE, and a week index of locomotion activities (ILA. The reliability was verified through test-retest with interval of 45 days. The Pearson correlation coefficient, and partial correlation adjusted for age and body mass index were used for validation. The intraclass correlation and paired t-test were used for reliability. The results indicated that %DHR was correlated with LLA and TS (r = 0.47 and p = 0.030; r = 0.48 and p = 0.027, respectively. IPE was correlated with PEL and TS (r = 0.56 and p = 0.008; r = 0.46 and p = 0.036, respectively. ILA was correlated with LLA and TS (r = 0.64 and p = 0.002 and r = 0.51 and p = 0.017, respectively. There was no significant difference in PEL, LLA and TS means in test-retest. The intraclass correlations were r = 0.69; r = 0.80 and r = 0.77, respectively for PEL, LLA and TS. In conclusion, the Baecke questionnaire is valid and reliable to measure habitual physical activity in Brazilian adult men.
Reliability of the Alzheimer's disease assessment scale (ADAS-Cog) in longitudinal studies.

Science.gov (United States)

Khan, Anzalee; Yavorsky, Christian; DiClemente, Guillermo; Opler, Mark; Liechti, Stacy; Rothman, Brian; Jovic, Sofija

2013-11-01

Considering the scarcity of longitudinal assessments of reliability, there is need for a more precise understanding of cognitive decline in Alzheimer's Disease (AD). The primary goal was to assess longitudinal changes in inter-rater reliability, test retest reliability and internal consistency of scores of the ADAS-Cog. 2,618 AD subjects were enrolled in seven randomized, double-blind, placebo-controlled, multicenter-trials from 1986 to 2009. Reliability, internal-consistency and cross-sectional analysis of ADAS-Cog and MMSE across seven visits were examined. Intra-class correlation (ICC) for ADAS-Cog was moderate to high supporting their reliability. Absolute Agreement ICCs 0.392 (Visit-7) to 0.806 (Visit-2) showed a progressive decrease in correlations across time. Item analysis revealed a decrease in item correlations, with the lowest correlations for Visit 7 for Commands (ICC=0.148), Comprehension (ICC=0.092), Spoken Language (ICC=0.044). Suitable assessment of AD treatments is maintained through accurate measurement of clinically significant outcomes. Targeted rater education ADAS-Cog items over-time can improve ability to administer and score the scale.
Wii Balance Board: Reliability and Clinical Use in Assessment of Balance in Healthy Elderly Women.

Science.gov (United States)

Monteiro-Junior, Renato Sobral; Ferreira, Arthur Sá; Puell, Vivian Neiva; Lattari, Eduardo; Machado, Sérgio; Otero Vaghetti, César Augusto; da Silva, Elirez Bezerra

2015-01-01

Force plate is considered gold standard tool to assess body balance. However the Wii Balance Board (WBB) platform is a trustworthy equipment to assess stabilometric components in young people. Thus, we aim to examine the reliability of measures of center of pressure with WBB in healthy elderly women. Twenty one healthy and physically active women were enrolled in the study (age: 64 ± 7 years; body mass index: 29 ± 5 kg/m2. The WBB was used to assess the center of pressure measures in the individuals. Pressure was linearly applied to different points to test the platform precision. Three assessments were performed, with two of them being held on the same day at a 5- to 10-minute interval, and the third one was performed 48 h later. A linear regression analysis was used to find out linearity, while the intraclass correlation coefficient was used to assess reliability. The platform precision was adequate (R2 = 0.997, P = 0.01). Center of pressure measures showed an excellent reliability (all intraclass correlation coefficient values were > 0.90; p < 0.01). The WBB is a precise and reliable tool of body stability quantitative measure in healthy active elderly women and its use should be encouraged in clinical settings.
Reliability and responsiveness of algometry for measuring pressure pain threshold in patients with knee osteoarthritis.

Science.gov (United States)

Mutlu, Ebru Kaya; Ozdincler, Arzu Razak

2015-06-01

[Purpose] This study aimed to establish the intrarater reliability and responsiveness of a clinically available algometer in patients with knee osteoarthritis as well as to determine the minimum-detectable-change and standard error of measurement of testing to facilitate clinical interpretation of temporal changes. [Subjects] Seventy-three patients with knee osteoarthritis were included. [Methods] Pressure pain threshold measured by algometry was evaluated 3 times at 2-min intervals over 2 clinically relevant sites-mediolateral to the medial femoral tubercle (distal) and lateral to the medial malleolus (local)-on the same day. Intrarater reliability was estimated by intraclass correlation coefficients. The minimum-detectable-change and standard error of measurement were calculated. As a measure of responsiveness, the effect size was calculated for the results at baseline and after treatment. [Results] The intrarater reliability was almost perfect (intraclass correlation coefficient = 0.93-0.97). The standard error of measurement and minimum-detectable-change were 0.70-0.66 and 1.62-1.53, respectively. The pressure pain threshold over the distal site was inadequately responsive in knee osteoarthritis, but the local site was responsive. The effect size was 0.70. [Conclusion] Algometry is reliable and responsive to assess measures of pressure pain threshold for evaluating pain patients with knee osteoarthritis.
Reliability of the Serbian version of the International Physical Activity Questionnaire for older adults

Directory of Open Access Journals (Sweden)

Milanović Z

2014-04-01

Full Text Available Zoran Milanović,1 Saša Pantelić,1 Nebojša Trajković,1 Bojan Jorgić,1 Goran Sporiš,2 Milovan Bratić1 1Faculty of Sport and Physical Education, University of Niš, Niš, Serbia; 2Faculty of Kinesiology, University of Zagreb, Zagreb, Croatia Abstract: The purpose of this study was to determine the test–retest reliability of the International Physical Activity Questionnaire (IPAQ for older adults in Serbia. Six hundred and sixty older adults (352 men, 53%; 308 women, 47%; mean age 67.65±5.76 years participated in the study. To examine test–retest reliability, the participants were asked to complete the IPAQ on two occasions 2 weeks apart. Moderate reliability was observed between the repeated IPAQ, with intraclass correlation coefficients ranging from 0.53 to 0.91. The least reliability was established in leisure time activity (0.53 and the most reliability in the transport domain (0.91. Men and women had similar intraclass correlation coefficients for total physical activity (0.71 versus 0.74, respectively, while the biggest difference was obtained for housework in men (0.68 and in women (0.90. Our study shows that the long version of the IPAQ is a reliable instrument for assessing physical activity levels in older adults and that it may be useful for generating internationally comparable data. Keywords: questionnaire, elderly, IPAQ, physical activity
Reliability and minimal detectable difference in multisegment foot kinematics during shod walking and running.

Science.gov (United States)

Milner, Clare E; Brindle, Richard A

2016-01-01

There has been increased interest recently in measuring kinematics within the foot during gait. While several multisegment foot models have appeared in the literature, the Oxford foot model has been used frequently for both walking and running. Several studies have reported the reliability for the Oxford foot model, but most studies to date have reported reliability for barefoot walking. The purpose of this study was to determine between-day (intra-rater) and within-session (inter-trial) reliability of the modified Oxford foot model during shod walking and running and calculate minimum detectable difference for common variables of interest. Healthy adult male runners participated. Participants ran and walked in the gait laboratory for five trials of each. Three-dimensional gait analysis was conducted and foot and ankle joint angle time series data were calculated. Participants returned for a second gait analysis at least 5 days later. Intraclass correlation coefficients and minimum detectable difference were determined for walking and for running, to indicate both within-session and between-day reliability. Overall, relative variables were more reliable than absolute variables, and within-session reliability was greater than between-day reliability. Between-day intraclass correlation coefficients were comparable to those reported previously for adults walking barefoot. It is an extension in the use of the Oxford foot model to incorporate wearing a shoe while maintaining marker placement directly on the skin for each segment. These reliability data for walking and running will aid in the determination of meaningful differences in studies which use this model during shod gait. Copyright © 2015 Elsevier B.V. All rights reserved.

Reliability of pulse waveform separation analysis: effects of posture and fasting.

Science.gov (United States)

Stoner, Lee; Credeur, Daniel; Fryer, Simon; Faulkner, James; Lambrick, Danielle; Gibbs, Bethany Barone

2017-03-01

Oscillometric pulse wave analysis devices enable, with relative simplicity and objectivity, the measurement of central hemodynamic parameters. The important parameters are central blood pressures and indices of arterial wave reflection, including wave separation analysis (backward pressure component Pb and reflection magnitude). This study sought to determine whether the measurement precision (between-day reliability) of Pb and reflection magnitude: exceeds the criterion for acceptable reliability; and is affected by posture (supine, seated) and fasting state. Twenty healthy adults (50% female, 27.9 years, 24.2 kg/m) were tested on six different mornings: 3 days fasted, 3 days nonfasted condition. On each occasion, participants were tested in supine and seated postures. Oscillometric pressure waveforms were recorded on the left upper arm. The criterion intra-class correlation coefficient value of 0.75 was exceeded for Pb (0.76) and reflection magnitude (0.77) when participants were assessed under the combined supine-fasted condition. The intra-class correlation coefficient was lowest for Pb in seated-nonfasted condition (0.57), and lowest for reflection magnitude in the seated-fasted condition (0.56). For Pb, the smallest detectible change that must be exceeded in order for a significant change to occur in an individual was 2.5 mmHg, and for reflection magnitude, the smallest detectable change was 8.5%. Assessments of Pb and reflection magnitude are as follows: exceed the criterion for acceptable reliability; and are most reliable when participants are fasted in a supine position. The demonstrated reliability suggests sufficient precision to detect clinically meaningful changes in reflection magnitude and Pb.
Measurement of the Inter-Rater Reliability Rate Is Mandatory for Improving the Quality of a Medical Database: Experience with the Paulista Lung Cancer Registry.

Science.gov (United States)

Lauricella, Leticia L; Costa, Priscila B; Salati, Michele; Pego-Fernandes, Paulo M; Terra, Ricardo M

2018-06-01

Database quality measurement should be considered a mandatory step to ensure an adequate level of confidence in data used for research and quality improvement. Several metrics have been described in the literature, but no standardized approach has been established. We aimed to describe a methodological approach applied to measure the quality and inter-rater reliability of a regional multicentric thoracic surgical database (Paulista Lung Cancer Registry). Data from the first 3 years of the Paulista Lung Cancer Registry underwent an audit process with 3 metrics: completeness, consistency, and inter-rater reliability. The first 2 methods were applied to the whole data set, and the last method was calculated using 100 cases randomized for direct auditing. Inter-rater reliability was evaluated using percentage of agreement between the data collector and auditor and through calculation of Cohen's κ and intraclass correlation. The overall completeness per section ranged from 0.88 to 1.00, and the overall consistency was 0.96. Inter-rater reliability showed many variables with high disagreement (>10%). For numerical variables, intraclass correlation was a better metric than inter-rater reliability. Cohen's κ showed that most variables had moderate to substantial agreement. The methodological approach applied to the Paulista Lung Cancer Registry showed that completeness and consistency metrics did not sufficiently reflect the real quality status of a database. The inter-rater reliability associated with κ and intraclass correlation was a better quality metric than completeness and consistency metrics because it could determine the reliability of specific variables used in research or benchmark reports. This report can be a paradigm for future studies of data quality measurement. Copyright © 2018 American College of Surgeons. Published by Elsevier Inc. All rights reserved.
Content validity and reliability of test of gross motor development in Chilean children

Directory of Open Access Journals (Sweden)

Marcelo Cano-Cappellacci

2015-01-01

Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2 for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
Reliability of the AMA Guides to the Evaluation of Permanent Impairment.

Science.gov (United States)

Forst, Linda; Friedman, Lee; Chukwu, Abraham

2010-12-01

AMA's Guides to the Evaluation of Permanent Impairment is used to rate loss of function and determine compensation and ability to work after injury or illness; however, there are few studies that evaluate reliability or construct validity. To evaluate the reliability of the fifth and sixth editions for back injury; to determine best methods for further study. Intra-class correlation coefficients within and between raters were relatively high. There was wider variability for individual cases. Impairment ratings were lower and correlated less well for the sixth edition, though confidence intervals overlapped. The sixth edition may not be an improvement over the fifth. A research agenda should include investigations of reliability and construct validity for different body sites and organ systems along the entire rating scale and among different categories of raters.
Evaluating the validity and reliability of the V-scale instrument (Turkish version) used to determine nurses' attitudes towards vital sign monitoring.

Science.gov (United States)

Ertuğ, Nurcan

2018-06-01

The aim of this study was to determine the validity and reliability of the Turkish version of the V-scale, which measures nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. This validity and reliability study was conducted at a tertiary hospital in Ankara, Turkey, in 2016. A total of 169 ward nurses participated in the study. Exploratory factor analysis, Cronbach's alpha coefficient, and the intraclass correlation coefficient were used to determine the validity and reliability of the scale. A 5-factor, 16-item scale explained 60.823% of the total variance according to the validity analysis. Our version matched the original scale in terms of the number of items and factor structure. Cronbach's alpha coefficient of the Turkish version of the V-scale was 0.764. The test-retest reliability results were 0.855 for the overall intraclass correlation coefficient, and the t-test result was P > 0.05. The V-scale is a reliable and valid instrument to measure Turkish nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. © 2018 John Wiley & Sons Australia, Ltd.
A Systematic Review of Statistical Methods Used to Test for Reliability of Medical Instruments Measuring Continuous Variables

Directory of Open Access Journals (Sweden)

Rafdzah Zaki

2013-06-01

Full Text Available Objective(s: Reliability measures precision or the extent to which test results can be replicated. This is the first ever systematic review to identify statistical methods used to measure reliability of equipment measuring continuous variables. This studyalso aims to highlight the inappropriate statistical method used in the reliability analysis and its implication in the medical practice. Materials and Methods: In 2010, five electronic databases were searched between 2007 and 2009 to look for reliability studies. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and finally 42 fitted the inclusion criteria. Results: The Intra-class Correlation Coefficient (ICC is the most popular method with 25 (60% studies having used this method followed by the comparing means (8 or 19%. Out of 25 studies using the ICC, only 7 (28% reported the confidence intervals and types of ICC used. Most studies (71% also tested the agreement of instruments. Conclusion: This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.
Reliability of Lactation Assessment Tools Applied to Overweight and Obese Women.

Science.gov (United States)

Chapman, Donna J; Doughty, Katherine; Mullin, Elizabeth M; Pérez-Escamilla, Rafael

2016-05-01

The interrater reliability of lactation assessment tools has not been evaluated in overweight/obese women. This study aimed to compare the interrater reliability of 4 lactation assessment tools in this population. A convenience sample of 45 women (body mass index > 27.0) was videotaped while breastfeeding (twice daily on days 2, 4, and 7 postpartum). Three International Board Certified Lactation Consultants independently rated each videotaped session using 4 tools (Infant Breastfeeding Assessment Tool [IBFAT], modified LATCH [mLATCH], modified Via Christi [mVC], and Riordan's Tool [RT]). For each day and tool, we evaluated interrater reliability with 1-way repeated-measures analyses of variance, intraclass correlation coefficients (ICCs), and percentage absolute agreement between raters. Analyses of variance showed significant differences between raters' scores on day 2 (all scales) and day 7 (RT). Intraclass correlation coefficient values reflected good (mLATCH) to excellent reliability (IBFAT, mVC, and RT) on days 2 and 7. All day 4 ICCs reflected good reliability. The ICC for mLATCH was significantly lower than all others on day 2 and was significantly lower than IBFAT (day 7). Percentage absolute interrater agreement for scale components ranged from 31% (day 2: observable swallowing, RT) to 92% (day 7: IBFAT, fixing; and mVC, latch time). Swallowing scores on all scales had the lowest levels of interrater agreement (31%-64%). We demonstrated differences in the interrater reliability of 4 lactation assessment tools when applied to overweight/obese women, with the lowest values observed on day 4. Swallowing assessment was particularly unreliable. Researchers and clinicians using these scales should be aware of the differences in their psychometric behavior. © The Author(s) 2015.
Reliability, validity and description of timed performance of the Jebsen-Taylor Test in patients with muscular dystrophies.

Science.gov (United States)

Artilheiro, Mariana Cunha; Fávero, Francis Meire; Caromano, Fátima Aparecida; Oliveira, Acary de Souza Bulle; Carvas, Nelson; Voos, Mariana Callil; Sá, Cristina Dos Santos Cardoso de

2017-12-08

The Jebsen-Taylor Test evaluates upper limb function by measuring timed performance on everyday activities. The test is used to assess and monitor the progression of patients with Parkinson disease, cerebral palsy, stroke and brain injury. To analyze the reliability, internal consistency and validity of the Jebsen-Taylor Test in people with Muscular Dystrophy and to describe and classify upper limb timed performance of people with Muscular Dystrophy. Fifty patients with Muscular Dystrophy were assessed. Non-dominant and dominant upper limb performances on the Jebsen-Taylor Test were filmed. Two raters evaluated timed performance for inter-rater reliability analysis. Test-retest reliability was investigated by using intraclass correlation coefficients. Internal consistency was assessed using the Cronbach alpha. Construct validity was conducted by comparing the Jebsen-Taylor Test with the Performance of Upper Limb. The internal consistency of Jebsen-Taylor Test was good (Cronbach's α=0.98). A very high inter-rater reliability (0.903-0.999), except for writing with an Intraclass correlation coefficient of 0.772-1.000. Strong correlations between the Jebsen-Taylor Test and the Performance of Upper Limb Module were found (rho=-0.712). The Jebsen-Taylor Test is a reliable and valid measure of timed performance for people with Muscular Dystrophy. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
Validity and reliability of the Baecke questionnaire for the evaluation of habitual physical activity among people living with HIV/AIDS

Directory of Open Access Journals (Sweden)

Florindo Alex Antonio

2006-01-01

Full Text Available This study evaluates the validity and reliability of the Baecke questionnaire on habitual physical activity when applied to a population of HIV/AIDS subjects. Validity was determined by comparing measurements for 30 subjects of peak oxygen uptake, peak workload, and energy expenditure with scores for occupational physical activity (OPA, physical exercise in leisure (PEL, leisure and locomotion activities (LLA, and total score (TS. Reliability was determined by testing and retesting 29 subjects at intervals of 15-30 days. Validity was evaluated with the Pearson correlation and reliability analyses were done using the intraclass correlation, paired Student t-test, and Bland-Altman methods. Peak VO2 and peak workload had significant correlation with PEL (r = 0.41; r = 0.43; respectively. Energy expenditure had a significant correlation with OPA (r = 0.64. The intraclass coefficients were 0.70 or more for OPA, PEL and TS. There was no difference in OPA, PEL, LLA and TS between the two evaluations. The Bland-Altman methods showed that there was good agreement between the measurements for all habitual physical activities scores. Results show that the Baecke questionnaire is valid for the evaluation of habitual physical activity among people living with HIV/AIDS.
Time to competency, reliability of flexible transnasal laryngoscopy by training level: a pilot study.

Science.gov (United States)

Brook, Christopher D; Platt, Michael P; Russell, Kimberly; Grillone, Gregory A; Aliphas, Avner; Noordzij, J Pieter

2015-05-01

To determine the progression of flexible transnasal laryngoscopy reliability and competency in otolaryngology residency training. Prospective case control study. Academic otolaryngology department. Medical students, otolaryngology residents, and otolaryngology attending physicians. Fourteen otolaryngology residents from PGY-1 to PGY-5 and 3 attending otolaryngologists viewed 25 selected and digitally recorded flexible transnasal laryngoscopies. The evaluators were asked to rate 13 items relating to abnormalities in the oropharynx, hypopharynx, larynx, and subglottis. The level of concern and level of comfort with the diagnosis were assessed. Intraclass correlations were calculated for each topic and by level of training to determine reliability within each class and compare competency versus attending interpretations. Intraclass correlation of residents compared to attending physicians demonstrated significant improvements by year for left and right vocal fold immobility, subglottic stenosis, laryngeal mass, left and right vocal cord abnormalities, and level of concern. Additionally, pooled vocal cord mobility and pooled results in categories with good attending reliability demonstrated stepwise improvement as well. For these categories, resident reliability was found to be statistically similar to attending physicians in all categories by PGY-3. There were no trends for base of tongue abnormalities, pharyngeal abnormalities, and pharyngeal and hypopharyngeal masses. Resident competency for flexible transnasal laryngoscopy progresses during residency to reliability with attending otolaryngologists by the PGY-3 year over key facets of the examination. © American Academy of Otolaryngology-Head and Neck Surgery Foundation 2015.
Improved estimation of subject-level functional connectivity using full and partial correlation with empirical Bayes shrinkage.

Science.gov (United States)

Mejia, Amanda F; Nebel, Mary Beth; Barber, Anita D; Choe, Ann S; Pekar, James J; Caffo, Brian S; Lindquist, Martin A

2018-05-15

Reliability of subject-level resting-state functional connectivity (FC) is determined in part by the statistical techniques employed in its estimation. Methods that pool information across subjects to inform estimation of subject-level effects (e.g., Bayesian approaches) have been shown to enhance reliability of subject-level FC. However, fully Bayesian approaches are computationally demanding, while empirical Bayesian approaches typically rely on using repeated measures to estimate the variance components in the model. Here, we avoid the need for repeated measures by proposing a novel measurement error model for FC describing the different sources of variance and error, which we use to perform empirical Bayes shrinkage of subject-level FC towards the group average. In addition, since the traditional intra-class correlation coefficient (ICC) is inappropriate for biased estimates, we propose a new reliability measure denoted the mean squared error intra-class correlation coefficient (ICC MSE ) to properly assess the reliability of the resulting (biased) estimates. We apply the proposed techniques to test-retest resting-state fMRI data on 461 subjects from the Human Connectome Project to estimate connectivity between 100 regions identified through independent components analysis (ICA). We consider both correlation and partial correlation as the measure of FC and assess the benefit of shrinkage for each measure, as well as the effects of scan duration. We find that shrinkage estimates of subject-level FC exhibit substantially greater reliability than traditional estimates across various scan durations, even for the most reliable connections and regardless of connectivity measure. Additionally, we find partial correlation reliability to be highly sensitive to the choice of penalty term, and to be generally worse than that of full correlations except for certain connections and a narrow range of penalty values. This suggests that the penalty needs to be chosen carefully
Reliability of the detailed assessment of speed of handwriting on Flemish children.

Science.gov (United States)

Simons, Johan; Probst, Michel

2014-01-01

This study evaluates the reliability of the Detailed Assessment of Speed of Handwriting (DASH) in a Dutch-speaking sample of children. The sample included 650 boys and 513 girls (age range = 9-16 years). Handwriting speed measurements were obtained using the DASH. Interrater agreement, test-retest reliability, and internal consistency were calculated; gender and age effects were analyzed. Interrater agreement shows excellent reliability with intraclass correlation coefficients of at least 0.94. Test-retest correlations ranged from r = 0.65 to r = 0.81. The internal consistency measures, calculated with Cronbach's alpha, were between 0.88 and 0.94. Both gender and age have a significant effect on handwriting speed, with F (7.1144) = 17.43 (P handwriting speed of Dutch-speaking children. There is a tendency of girls to write faster than boys.
Reliability and validity of migraine disability assessment questionnaire-Thai version (Thai-MIDAS).

Science.gov (United States)

Seethong, Piman; Nimmannit, Akarin; Chaisewikul, Rungsan; Prayoonwiwat, Naraporn; Chotinaiwattarakul, Wattanachai

2013-02-01

To assess the validity and test-retest reliability of a Thai translation of the Migraine Disability Assessment (MIDAS) Questionnaire in Thai patients with migraine. Migraineurs from the Headache Clinic in Siriraj Hospital were recruited and asked to complete a 13-weeks diary and answered the Thai-MIDAS at once. Some participants were asked to provide the 2nd Thai-MIDAS in the next 2 weeks for test-retest reliability. Ninety-three patients had completed the 13-weeks diaries. Age range was 18-58 years with mean 37.69 +/- 9.60 years. All 5 items and the total score of Thai-MIDAS were moderately correlated with data from 13-weeks diary (Spearman's correlation coefficient = 0.32-0.62). The test-retest reliability of the total score of Thai-MIDAS in 30 patients demonstrated a highly reliable degree of intraclass correlation (ICC = 0.76, 95% CI 0.49-0.88). The present study reveals that the Thai-MIDAS has satisfactory validity and reliability in comparison with the original English MIDAS version.
Reliability and validity of psychosocial and environmental correlates measures of physical activity and screen-based behaviors among Chinese children in Hong Kong

Directory of Open Access Journals (Sweden)

Salmon Jo

2011-03-01

Full Text Available Abstract Background Insufficient participation in physical activity and excessive screen time have been observed among Chinese children. The role of social and environmental factors in shaping physical activity and sedentary behaviors among Chinese children is under-investigated. The purpose of the present study was to assess the reliability and validity of a questionnaire to measure child- and parent-reported psychosocial and environmental correlates of physical activity and screen-based behaviors among Chinese children in Hong Kong. Methods A total of 303 schoolchildren aged 9-14 years and their parents volunteered to participate in this study and 160 of them completed the questionnaire twice within an interval of 10 days. Intraclass correlation coefficients (ICCs, kappa statistics, and percent agreement were performed to evaluate test-retest reliability of the continuous and categorical variables, respectively. Exploratory factor analyses (EFAs were conducted to assess convergent validity of the emergent scales. Cronbach's alpha and ICCs were performed to assess internal and test-retest reliability of the emergent scales. Criterion validity was assessed by correlating psychosocial and environmental measures with self-reported physical activity and screen-based behaviors, measured by a validated questionnaire. Results Reliability statistics for both child- and parent-reported continuous variables showed acceptable consistency for all of the ICC values greater than 0.70. Kappa statistics showed fair to perfect test-retest reliability for the categorical items. Adequate internal consistency and test-retest reliability were observed in most of the emergent scales. Criterion validity assessed by correlating psychosocial and environmental measures with child-reported physical activity found associations with physical activity in the self-efficacy scale (r = 0.25, P r = 0.25, P r = 0.14, P r = -0.22, P r = 0.12, P = 0.053. Conclusions The findings
Reliability of instruments in a cooperative, multisite study: employment intervention demonstration program.

Science.gov (United States)

Salyers, M P; McHugo, G J; Cook, J A; Razzano, L A; Drake, R E; Mueser, K T

2001-09-01

Reliability of well-known instruments was examined in 202 people with severe mental illness participating in a multisite vocational study. We examined interrater reliability of the Positive and Negative Syndrome Scale (PANSS) and the internal consistency and test-retest reliability of the PANSS, the Rosenberg Self-Esteem Scale, the Medical Outcomes Study Short Form-36 (SF-36), and the Quality of Life Interview. Most scales had good levels of reliability, with intraclass correlation coefficients (ICCs) and coefficient alphas above .70. However, the SF-36 scales were generally less stable over time, particularly Social Functioning (ICC = .55). Test-retest reliability was lower among less educated respondents and among ethnic minorities. We recommend close monitoring of psychometric issues in future multisite studies.
Computational area measurement of orbital floor fractures: Reliability, accuracy and rapidity

International Nuclear Information System (INIS)

Schouman, Thomas; Courvoisier, Delphine S.; Imholz, Benoit; Van Issum, Christopher; Scolozzi, Paolo

2012-01-01

Objective: To evaluate the reliability, accuracy and rapidity of a specific computational method for assessing the orbital floor fracture area on a CT scan. Method: A computer assessment of the area of the fracture, as well as that of the total orbital floor, was determined on CT scans taken from ten patients. The ratio of the fracture's area to the orbital floor area was also calculated. The test–retest precision of measurement calculations was estimated using the Intraclass Correlation Coefficient (ICC) and Dahlberg's formula to assess the agreement across observers and across measures. The time needed for the complete assessment was also evaluated. Results: The Intraclass Correlation Coefficient across observers was 0.92 [0.85;0.96], and the precision of the measures across observers was 4.9%, according to Dahlberg's formula .The mean time needed to make one measurement was 2 min and 39 s (range, 1 min and 32 s to 4 min and 37 s). Conclusion: This study demonstrated that (1) the area of the orbital floor fracture can be rapidly and reliably assessed by using a specific computer system directly on CT scan images; (2) this method has the potential of being routinely used to standardize the post-traumatic evaluation of orbital fractures
Reliability and Concurrent Validity of the Narrow Path Walking Test in Persons With Multiple Sclerosis.

Science.gov (United States)

Rosenblum, Uri; Melzer, Itshak

2017-01-01

About 90% of people with multiple sclerosis (PwMS) have gait instability and 50% fall. Reliable and clinically feasible methods of gait instability assessment are needed. The study investigated the reliability and validity of the Narrow Path Walking Test (NPWT) under single-task (ST) and dual-task (DT) conditions for PwMS. Thirty PwMS performed the NPWT on 2 different occasions, a week apart. Number of Steps, Trial Time, Trial Velocity, Step Length, Number of Step Errors, Number of Cognitive Task Errors, and Number of Balance Losses were measured. Intraclass correlation coefficients (ICC2,1) were calculated from the average values of NPWT parameters. Absolute reliability was quantified from standard error of measurement (SEM) and smallest real difference (SRD). Concurrent validity of NPWT with Functional Reach Test, Four Square Step Test (FSST), 12-item Multiple Sclerosis Walking Scale (MSWS-12), and 2 Minute Walking Test (2MWT) was determined using partial correlations. Intraclass correlation coefficients (ICCs) for most NPWT parameters during ST and DT ranged from 0.46-0.94 and 0.55-0.95, respectively. The highest relative reliability was found for Number of Step Errors (ICC = 0.94 and 0.93, for ST and DT, respectively) and Trial Velocity (ICC = 0.83 and 0.86, for ST and DT, respectively). Absolute reliability was high for Number of Step Errors in ST (SEM % = 19.53%) and DT (SEM % = 18.14%) and low for Trial Velocity in ST (SEM % = 6.88%) and DT (SEM % = 7.29%). Significant correlations for Number of Step Errors and Trial Velocity were found with FSST, MSWS-12, and 2MWT. In persons with PwMS performing the NPWT, Number of Step Errors and Trial Velocity were highly reliable parameters. Based on correlations with other measures of gait instability, Number of Step Errors was the most valid parameter of dynamic balance under the conditions of our test.Video Abstract available for more insights from the authors (see Supplemental Digital Content 1, available at: http
Intraclass reliability for assessing how well Taiwan constrained hospital-provided medical services using statistical process control chart techniques.

Science.gov (United States)

Chien, Tsair-Wei; Chou, Ming-Ting; Wang, Wen-Chung; Tsai, Li-Shu; Lin, Weir-Sen

2012-05-15

Few studies discuss the indicators used to assess the effect on cost containment in healthcare across hospitals in a single-payer national healthcare system with constrained medical resources. We present the intraclass correlation coefficient (ICC) to assess how well Taiwan constrained hospital-provided medical services in such a system. A custom Excel-VBA routine to record the distances of standard deviations (SDs) from the central line (the mean over the previous 12 months) of a control chart was used to construct and scale annual medical expenditures sequentially from 2000 to 2009 for 421 hospitals in Taiwan to generate the ICC. The ICC was then used to evaluate Taiwan's year-based convergent power to remain unchanged in hospital-provided constrained medical services. A bubble chart of SDs for a specific month was generated to present the effects of using control charts in a national healthcare system. ICCs were generated for Taiwan's year-based convergent power to constrain its medical services from 2000 to 2009. All hospital groups showed a gradually well-controlled supply of services that decreased from 0.772 to 0.415. The bubble chart identified outlier hospitals that required investigation of possible excessive reimbursements in a specific time period. We recommend using the ICC to annually assess a nation's year-based convergent power to constrain medical services across hospitals. Using sequential control charts to regularly monitor hospital reimbursements is required to achieve financial control in a single-payer nationwide healthcare system.
Validity and reliability of the single-trial line drill test of anaerobic power in basketball players.

Science.gov (United States)

Fatouros, I G; Laparidis, K; Kambas, A; Chatzinikolaou, A; Techlikidou, E; Katrabasas, I; Douroudos, I; Leontsini, D; Berberidou, F; Draganidis, D; Christoforidis, C; Tsoukas, D; Kelis, S; Taxildaris, K

2011-03-01

This study evaluated the validity, reliability, and sensitivity of the single-trial line drill test (SLDT) for anaerobic power assessment. Twenty-four volunteers were assigned to either a control (C, N.=12) or an experimental (BP, N.=12 basketball players) group. SLDT's (time-to-complete) concurrent validity was evaluated against the Wingate testing (WAnT: mean [MP] and peak power [PP]) and a 30-sec vertical jump testing test (VJT: mean height and MP). Blood lactate concentration was measured at rest and immediately post-test. SLDT's reliability [test-retest intraclass correlation coefficients (ICC), coefficient of variation (CV), Bland-Altman plots] and sensitivity were determined (one-way ANOVA). Kendall's tau correlation analysis revealed correlations (Pbasketball players.
Interobserver Reliability of the Total Body Score System for Quantifying Human Decomposition.

Science.gov (United States)

Dabbs, Gretchen R; Connor, Melissa; Bytheway, Joan A

2016-03-01

Several authors have tested the accuracy of the Total Body Score (TBS) method for quantifying decomposition, but none have examined the reliability of the method as a scoring system by testing interobserver error rates. Sixteen participants used the TBS system to score 59 observation packets including photographs and written descriptions of 13 human cadavers in different stages of decomposition (postmortem interval: 2-186 days). Data analysis used a two-way random model intraclass correlation in SPSS (v. 17.0). The TBS method showed "almost perfect" agreement between observers, with average absolute correlation coefficients of 0.990 and average consistency correlation coefficients of 0.991. While the TBS method may have sources of error, scoring reliability is not one of them. Individual component scores were examined, and the influences of education and experience levels were investigated. Overall, the trunk component scores were the least concordant. Suggestions are made to improve the reliability of the TBS method. © 2016 American Academy of Forensic Sciences.

Reliability of the hospital nutrition environment scan for cafeterias, vending machines, and gift shops.

Science.gov (United States)

Winston, Courtney P; Sallis, James F; Swartz, Michael D; Hoelscher, Deanna M; Peskin, Melissa F

2013-08-01

According to ecological models, the physical environment plays a major role in determining individual health behaviors. As such, researchers have started targeting the consumer nutrition environment of large-scale foodservice operations when implementing obesity-prevention programs. In 2010, the American Hospital Association released a call-to-action encouraging health care facilities to join in this movement and improve their facilities' consumer nutrition environments. The Hospital Nutrition Environment Scan (HNES) for Cafeterias, Vending Machines, and Gift Shops was developed in 2011, and the present study evaluated the inter-rater reliability of this instrument. Two trained raters visited 39 hospitals in southern California and completed the HNES. Percent agreement, kappa statistics, and intraclass correlation coefficients were calculated. Percent agreement between raters ranged from 74.4% to 100% and kappa statistics ranged from 0.458 to 1.0. The intraclass correlation coefficient for the overall nutrition composite scores was 0.961. Given these results, the HNES demonstrated acceptable reliability metrics and can now be disseminated to assess the current state of hospital consumer nutrition environments. Copyright © 2013 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Reliability of the craniocervical posture assessment: visual and angular measurements using photographs and radiographs.

Science.gov (United States)

Gadotti, Inae C; Armijo-Olivo, Susan; Silveira, Anelise; Magee, David

2013-01-01

The purposes of this study were to determine the intrarater and interrater reliability of the craniocervical posture in a sagittal view using quantitative measurements on photographs and radiographs and to determine the agreement of the visual assessment of posture between raters. One photograph and 1 radiograph of the sagittal craniocervical posture were simultaneously taken from 39 healthy female subjects. Three angles were measured on the photographs and 10 angles on the radiographs of 22 subjects using Alcimage software (Alcimage; Uberlândia, MG, Brazil). Two repeated measurements were performed by 2 raters. The measurements were compared within and between raters to test the intrarater and interrater reliability, respectively. Intraclass correlation coefficient and SEM were used. κ Agreement was calculated for the visual assessment of 39 subjects using photographs and radiographs between 2 raters. Good to excellent intrarater and interrater intraclass correlation coefficient values were found on both photographs and radiographs. Interrater SEM was large and clinically significant for cervical lordosis photogrammetry and for 1 angle measuring cervical lordosis on radiographs. Interrater κ agreement for the visual assessment using photographs was poor (κ = 0.37). The raters were reliable to measure angles in photographs and radiographs to quantify craniocervical posture with exception of 2 angles measuring lordosis of the cervical spine when compared between raters. The visual assessment of posture between raters was not reliable. © 2013. Published by National University of Health Sciences All rights reserved.
Reliability-guided digital image correlation for image deformation measurement

International Nuclear Information System (INIS)

Pan Bing

2009-01-01

A universally applicable reliability-guided digital image correlation (DIC) method is proposed for reliable image deformation measurement. The zero-mean normalized cross correlation (ZNCC) coefficient is used to identify the reliability of the point computed. The correlation calculation begins with a seed point and is then guided by the ZNCC coefficient. That means the neighbors of the point with the highest ZNCC coefficient in a queue for computed points will be processed first. Thus the calculation path is always along the most reliable direction, and possible error propagation of the conventional DIC method can be avoided. The proposed novel DIC method is universally applicable to the images with shadows, discontinuous areas, and deformation discontinuity. Two image pairs were used to evaluate the performance of the proposed technique, and the successful results clearly demonstrate its robustness and effectiveness
Precision of lumbar intervertebral measurements: does a computer-assisted technique improve reliability?

Science.gov (United States)

Pearson, Adam M; Spratt, Kevin F; Genuario, James; McGough, William; Kosman, Katherine; Lurie, Jon; Sengupta, Dilip K

2011-04-01

Comparison of intra- and interobserver reliability of digitized manual and computer-assisted intervertebral motion measurements and classification of "instability." To determine if computer-assisted measurement of lumbar intervertebral motion on flexion-extension radiographs improves reliability compared with digitized manual measurements. Many studies have questioned the reliability of manual intervertebral measurements, although few have compared the reliability of computer-assisted and manual measurements on lumbar flexion-extension radiographs. Intervertebral rotation, anterior-posterior (AP) translation, and change in anterior and posterior disc height were measured with a digitized manual technique by three physicians and by three other observers using computer-assisted quantitative motion analysis (QMA) software. Each observer measured 30 sets of digital flexion-extension radiographs (L1-S1) twice. Shrout-Fleiss intraclass correlation coefficients for intra- and interobserver reliabilities were computed. The stability of each level was also classified (instability defined as >4 mm AP translation or 10° rotation), and the intra- and interobserver reliabilities of the two methods were compared using adjusted percent agreement (APA). Intraobserver reliability intraclass correlation coefficients were substantially higher for the QMA technique THAN the digitized manual technique across all measurements: rotation 0.997 versus 0.870, AP translation 0.959 versus 0.557, change in anterior disc height 0.962 versus 0.770, and change in posterior disc height 0.951 versus 0.283. The same pattern was observed for interobserver reliability (rotation 0.962 vs. 0.693, AP translation 0.862 vs. 0.151, change in anterior disc height 0.862 vs. 0.373, and change in posterior disc height 0.730 vs. 0.300). The QMA technique was also more reliable for the classification of "instability." Intraobserver APAs ranged from 87 to 97% for QMA versus 60% to 73% for digitized manual
Several submaximal exercise tests are reliable, valid and acceptable in people with chronic pain, fibromyalgia or chronic fatigue: a systematic review.

Science.gov (United States)

Ratter, Julia; Radlinger, Lorenz; Lucas, Cees

2014-09-01

Are submaximal and maximal exercise tests reliable, valid and acceptable in people with chronic pain, fibromyalgia and fatigue disorders? Systematic review of studies of the psychometric properties of exercise tests. People older than 18 years with chronic pain, fibromyalgia and chronic fatigue disorders. Studies of the measurement properties of tests of physical capacity in people with chronic pain, fibromyalgia or chronic fatigue disorders were included. Studies were required to report: reliability coefficients (intraclass correlation coefficient, alpha reliability coefficient, limits of agreements and Bland-Altman plots); validity coefficients (intraclass correlation coefficient, Spearman's correlation, Kendal T coefficient, Pearson's correlation); or dropout rates. Fourteen studies were eligible: none had low risk of bias, 10 had unclear risk of bias and four had high risk of bias. The included studies evaluated: Åstrand test; modified Åstrand test; Lean body mass-based Åstrand test; submaximal bicycle ergometer test following another protocol other than Åstrand test; 2-km walk test; 5-minute, 6-minute and 10-minute walk tests; shuttle walk test; and modified symptom-limited Bruce treadmill test. None of the studies assessed maximal exercise tests. Where they had been tested, reliability and validity were generally high. Dropout rates were generally acceptable. The 2-km walk test was not recommended in fibromyalgia. Moderate evidence was found for reliability, validity and acceptability of submaximal exercise tests in patients with chronic pain, fibromyalgia or chronic fatigue. There is no evidence about maximal exercise tests in patients with chronic pain, fibromyalgia and chronic fatigue. Copyright © 2014. Published by Elsevier B.V.
Reliability measures in item response theory: manifest versus latent correlation functions.

Science.gov (United States)

Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel; Verbeke, Geert; De Boeck, Paul

2015-02-01

For item response theory (IRT) models, which belong to the class of generalized linear or non-linear mixed models, reliability at the scale of observed scores (i.e., manifest correlation) is more difficult to calculate than latent correlation based reliability, but usually of greater scientific interest. This is not least because it cannot be calculated explicitly when the logit link is used in conjunction with normal random effects. As such, approximations such as Fisher's information coefficient, Cronbach's α, or the latent correlation are calculated, allegedly because it is easy to do so. Cronbach's α has well-known and serious drawbacks, Fisher's information is not meaningful under certain circumstances, and there is an important but often overlooked difference between latent and manifest correlations. Here, manifest correlation refers to correlation between observed scores, while latent correlation refers to correlation between scores at the latent (e.g., logit or probit) scale. Thus, using one in place of the other can lead to erroneous conclusions. Taylor series based reliability measures, which are based on manifest correlation functions, are derived and a careful comparison of reliability measures based on latent correlations, Fisher's information, and exact reliability is carried out. The latent correlations are virtually always considerably higher than their manifest counterparts, Fisher's information measure shows no coherent behaviour (it is even negative in some cases), while the newly introduced Taylor series based approximations reflect the exact reliability very closely. Comparisons among the various types of correlations, for various IRT models, are made using algebraic expressions, Monte Carlo simulations, and data analysis. Given the light computational burden and the performance of Taylor series based reliability measures, their use is recommended. © 2014 The British Psychological Society.
Northern Chinese dental ages estimated from southern Chinese reference datasets closely correlate with chronological age

Directory of Open Access Journals (Sweden)

Hai Ming Wong

2016-12-01

Full Text Available While northern and southern Chinese are genetically correlated, there exists notable environmental differences in their living conditions. This study aimed to evaluate validity of the southern Chinese reference dataset for dental age estimation applied to northern Chinese. Dental panoramic tomographs of 437 northern Chinese aged 3 to 21 years were analysed. All the left maxillary and mandibular permanent teeth plus the 2 third molars on the right side were scored based on Demirjian’s classification of tooth development stages. Mean and standard error of dental age were obtained for each tooth development stage, followed by random effect meta-analysis for mean dental age estimation. Validity of the method was examined through measures of agreement (95% limits of agreement, standard error of measurement, and Lin’s concordance correlation coefficient and measure of reliability (Intraclass correlation coefficient. On average, the estimated dental age overestimated chronological age by only around 1 month in both females and males. The Intraclass correlation coefficient values were 0.99 for both sexes, suggesting excellent reliability of the method. Reference dataset for dental age estimation developed on the basis of southern Chinese was applicable for use among the northern Chinese.
Health Service Quality Scale: Brazilian Portuguese translation, reliability and validity.

Science.gov (United States)

Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako

2013-01-17

The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.
BurnCase 3D software validation study: Burn size measurement accuracy and inter-rater reliability.

Science.gov (United States)

Parvizi, Daryousch; Giretzlehner, Michael; Wurzer, Paul; Klein, Limor Dinur; Shoham, Yaron; Bohanon, Fredrick J; Haller, Herbert L; Tuca, Alexandru; Branski, Ludwik K; Lumenta, David B; Herndon, David N; Kamolz, Lars-P

2016-03-01

The aim of this study was to compare the accuracy of burn size estimation using the computer-assisted software BurnCase 3D (RISC Software GmbH, Hagenberg, Austria) with that using a 2D scan, considered to be the actual burn size. Thirty artificial burn areas were pre planned and prepared on three mannequins (one child, one female, and one male). Five trained physicians (raters) were asked to assess the size of all wound areas using BurnCase 3D software. The results were then compared with the real wound areas, as determined by 2D planimetry imaging. To examine inter-rater reliability, we performed an intraclass correlation analysis with a 95% confidence interval. The mean wound area estimations of the five raters using BurnCase 3D were in total 20.7±0.9% for the child, 27.2±1.5% for the female and 16.5±0.1% for the male mannequin. Our analysis showed relative overestimations of 0.4%, 2.8% and 1.5% for the child, female and male mannequins respectively, compared to the 2D scan. The intraclass correlation between the single raters for mean percentage of the artificial burn areas was 98.6%. There was also a high intraclass correlation between the single raters and the 2D Scan visible. BurnCase 3D is a valid and reliable tool for the determination of total body surface area burned in standard models. Further clinical studies including different pediatric and overweight adult mannequins are warranted. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.
Reliability and validity of the Turkish version of the Berg Balance Scale.

Science.gov (United States)

Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu

2008-01-01

The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (pr=0.67 pr=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
The reliability of four widely used patellar height ratios.

Science.gov (United States)

van Duijvenbode, Dennis; Stavenuiter, Michel; Burger, Bart; van Dijke, Cees; Spermon, Jacco; Hoozemans, Marco

2016-03-01

The objective of this study was to evaluate the inter-observer reliability and the intra-observer reliability of four patellar height ratios: Insall-Salvati (IS), modified Insall-Salvati (MIS), Blackburne-Peel (BP) and Caton-Deschamps (CD). The patellar height ratios were assessed by four independent examiners using weight-bearing lateral knee radiographs in 30° flexion. Intra-class correlation coefficients and Fleiss' kappa's were determined. The inter-observer reliability was excellent for the IS and moderate for the other ratios. When the ratio values were categorized, the inter-observer reliability was strong for the IS, moderate for the MIS and BP, and poor for the CD. The intra-observer reliability was excellent for the IS, MIS and CD, and strong for the BP. When the ratio values were categorized, the intra-observer reliability was strong for the IS and MIS, and moderate for the other ratios. Although the IS showed best reliability, we advise to use the MIS as it showed the second best reliability but is, according to the literature, associated with better validity.
Reliability And Validity Of Turkish Version Of Motor Activity Log-28

Directory of Open Access Journals (Sweden)

Burcu Ersöz Hüseyinsinoğlu

2011-06-01

Full Text Available OBJECTIVE: The aim of this study was to adapt the Motor Activity Log-28 (MAL-28 into Turkish and probe the reliability and validity of this questionnaire in stroke patients. METHODS: Following the translation of the MAL-28 into Turkish, its reliability and construct validity was examined in 30 stroke patients. For the reliability study, patients were interviewed twice within a three day period, during which no rehabilitative activities were undertaken. The test-retest reliability was determined by using intra-class correlation coefficient (ICC and Spearman correlation coefficient (r; internal consistency was determined by Cronbach's alpha (α. The construct validity was examined by comparing MAL-28 Quality Of Movement (QOM scale and Amount Of Use (AOU scale with Wolf Motor Function Test (WMFT-Performance Time (PT and Functional Ability (FA scores. Furthermore, item-to-scale correlations of AOU and QOM scales were determined and correlation between totol scores of two scales was examined. RESULTS: Turkish version of MAL-28 AOU and QOM scales were reliable (ICC scores were 0.97 and 0.96, respectively and internally consistent (Cronbach’s α value was 0.96 for both scales. Test-retest reliability was supported (AOU, r=0.94; QOM, r=0.93. WMFT FA scores was correlated with both scales (r=0.63. Correlation between WMFT PT and AOU and QOM scales were -0.56 and -0.55. AOU and QOM scales were highly correlated (r=0.95. CONCLUSION: The findings indicate that Turkish version of MAL-28 is reliable and valid in individuals with stroke. Further investigation about its responsiveness is needed before using that version as a primary measurement in clinical trials
Assessment of the nursing care product (APROCENF: a reliability and construct validity study

Directory of Open Access Journals (Sweden)

Danielle Fabiana Cucolo

Full Text Available ABSTRACT Objectives: to verify the reliability and construct validity estimates of the "Assessment of nursing care product" scale (APROCENF and its applicability. Methods: this validation study included a sample of 40 (inter-rater reliability and 172 (construct validity assessments performed by nurses at the end of the work shift at nine inpatient services of a teaching hospital in the Brazilian Southeast. The data were collected between February and September/2014 with interruptions. Cronbach's alpha and Spearman's correlation coefficients were calculated, as well as the intraclass correlation and the weighted kappa index (inter-rater reliability. Exploratory factor analysis was used with principal component extraction and varimax rotation (construct validity. Results: the internal consistency revealed an alpha coefficient of 0.85, item-item correlation ranging between 0.13 and 0.61 and item-total correlation between 0.43 and 0.69. Inter-rater equivalence was obtained and all items evidenced significant factor loadings. Conclusion: this research evidenced the reliability and construct validity of the scale to assess the nursing care product. Its application in nursing practice permits identifying improvements needed in the production process, contributing to management and care decisions.
The reliability of a VISION COACH task as a measure of psychomotor skills.

Science.gov (United States)

Xi, Yubin; Rosopa, Patrick J; Mossey, Mary; Crisler, Matthew C; Drouin, Nathalie; Kopera, Kevin; Brooks, Johnell O

2014-10-01

The VISION COACH™ interactive light board is designed to test and enhance participants' psychomotor skills. The primary goal of this study was to examine the test-retest reliability of the Full Field 120 VISION COACH task. One hundred eleven male and 131 female adult participants completed six trials where they responded to 120 randomly distributed lights displayed on the VISION COACH interactive light board. The mean time required for a participant to complete a trial was 101 seconds. Intraclass correlation coefficients, ranging from 0.962 to 0.987 suggest the VISION COACH Full Field 120 task was a reliable task. Cohen's d's of adjacent pairs of trials suggest learning effects did not negatively affect reliability after the third trial.
Trunk Muscle Size and Composition Assessment in Older Adults with Chronic Low Back Pain: An Intra-Examiner and Inter-Examiner Reliability Study.

Science.gov (United States)

Sions, Jaclyn Megan; Smith, Andrew Craig; Hicks, Gregory Evan; Elliott, James Matthew

2016-08-01

To evaluate intra- and inter-examiner reliability for the assessment of relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area, i.e., total cross-sectional area minus intramuscular fat, from T1-weighted magnetic resonance images obtained in older adults with chronic low back pain. Reliability study. n = 13 (69.3 ± 8.2 years old) After lumbar magnetic resonance imaging, two examiners produced relative cross-sectional area measurements of multifidi, erector spinae, psoas, and quadratus lumborum by tracing regions of interest just inside fascial borders. Pixel-intensity summaries were used to determine muscle-to-fat infiltration indices; relative muscle cross-sectional area was calculated. Intraclass correlation coefficients were used to estimate intra- and inter-examiner reliability; standard error of measurement was calculated. Intra-examiner intraclass correlation coefficient point estimates for relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area were excellent for multifidi and erector spinae across levels L2-L5 (ICC = 0.77-0.99). At L3, intra-examiner reliability was excellent for relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area for both psoas and quadratus lumborum (ICC = 0.81-0.99). Inter-examiner intraclass correlation coefficients ranged from poor to excellent for relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area. Assessment of relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area in older adults with chronic low back pain can be reliably determined by one examiner from T1-weighted images. Such assessments provide valuable information, as muscle-to-fat infiltration indices and relative muscle cross-sectional area indicate that a substantial amount of
Forward lunge as a functional performance test in ACL deficient subjects: test-retest reliability

DEFF Research Database (Denmark)

Alkjaer, Tine; Henriksen, Marius; Dyhre-Poulsen, Poul

2009-01-01

The forward lunge movement may be used as a functional performance test of anterior cruciate ligament (ACL) deficient and reconstructed subjects. The purposes were 1) to determine the test-retest reliability of a forward lunge in healthy subjects and 2) to determine the required numbers...... of repetitions necessary to yield satisfactory reliability. Nineteen healthy subjects performed four trials of a forward lunge on two different days. The movement time, impulses of the ground reaction forces (IFz, IFy), knee joint kinematics and dynamics during the forward lunge were calculated. The relative...... reliability was determined by calculation of Intraclass Correlation Coefficients (ICC). The IFz, IFy and the positive work of the knee extensors showed excellent reliability (ICC >0.75). All other variables demonstrated acceptable reliability (0.4>ICCreliability increased when more than...
Reliability of infrared thermometric measurements of skin temperature in the hand.

Science.gov (United States)

Packham, Tara L; Fok, Diana; Frederiksen, Karen; Thabane, Lehana; Buckley, Norman

2012-01-01

Clinical measurement study. Skin temperature asymmetries (STAs) are used in the diagnosis of complex regional pain syndrome (CRPS), but little evidence exists for reliability of the equipment and methods. This study examined the reliability of an inexpensive infrared (IR) thermometer and measurement points in the hand for the study of STA. ST was measured three times at five points on both hands with an IR thermometer by two raters in 20 volunteers (12 normals and 8 CRPS). ST measurement results using IR thermometers support inter-rater reliability: intraclass correlation coefficient (ICC) estimate for single measures 0.80; all ST measurement points were also highly reliable (ICC single measures, 0.83-0.91). The equipment demonstrated excellent reliability, with little difference in the reliability of the five measurement sites. These preliminary findings support their use in future CRPS research. Not applicable. Copyright © 2012 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Reliability of externally fixed dynamometry hamstring strength testing in elite youth football players.

Science.gov (United States)

Wollin, Martin; Purdam, Craig; Drew, Michael K

2016-01-01

To investigate inter and intra-tester reliability of an externally fixed dynamometry unilateral hamstring strength test, in the elite sports setting. Reliability study. Sixteen, injury-free, elite male youth football players (age=16.81±0.54 years, height=180.22±5.29cm, weight 73.88±6.54kg, BMI=22.57±1.42) gave written informed consent. Unilateral maximum isometric peak hamstring force was evaluated by externally fixed dynamometry for inter-tester, intra-day and intra-tester, inter-week reliability. The test position was standardised to correlate with the terminal swing phase of the gait running cycle. Inter and intra-tester values demonstrated good to high levels of reliability. The intra-class coefficient (ICC) for inter-tester, intra-day reliability was 0.87 (95% CI=0.75-0.93) with standard error of measure percentage (SEM%) 4.7 and minimal detectable change percentage (MDC%) 12.9. Intra-tester, inter-week reliability results were ICC 0.86 (95% CI, 0.74-0.93), SEM% 5.0 and MDC% 14.0. This study demonstrates good to high inter and intra-tester reliability of isometric externally fixed dynamometry unilateral hamstring strength testing in the regular elite sport setting involving elite male youth football players. The intra-class coefficient in association with the low standard error of measure and minimal detectable change percentages suggest that this procedure is appropriate for clinical and academic use as well as monitoring hamstring strength in the elite sport setting. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
Health service quality scale: Brazilian Portuguese translation, reliability and validity

Science.gov (United States)

2013-01-01

Background The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. Methods We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson’s correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach’s alpha coefficient; the intraclass (ICC) and Pearson’s correlation coefficients were used for test-retest reliability. Results One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson’s correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson’s correlation coefficient was 0.89 and ICC was 0.90. Conclusions The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality. PMID:23327598
Measuring physical activity in young people with cerebral palsy: validity and reliability of the ActivPAL™ monitor.

Science.gov (United States)

Bania, Theofani

2014-09-01

We determined the criterion validity and the retest reliability of the ΑctivPAL™ monitor in young people with diplegic cerebral palsy (CP). Activity monitor data were compared with the criterion of video recording for 10 participants. For the retest reliability, activity monitor data were collected from 24 participants on two occasions. Participants had to have diplegic CP and be between 14 and 22 years of age. They also had to be of Gross Motor Function Classification System level II or III. Outcomes were time spent in standing, number of steps (physical activity) and time spent in sitting (sedentary behaviour). For criterion validity, coefficients of determination were all high (r(2) ≥ 0.96), and limits of group agreement were relatively narrow, but limits of agreement for individuals were narrow only for number of steps (≥5.5%). Relative reliability was high for number of steps (intraclass correlation coefficient = 0.87) and moderate for time spent in sitting and lying, and time spent in standing (intraclass correlation coefficients = 0.60-0.66). For groups, changes of up to 7% could be due to measurement error with 95% confidence, but for individuals, changes as high as 68% could be due to measurement error. The results support the criterion validity and the retest reliability of the ActivPAL™ to measure physical activity and sedentary behaviour in groups of young people with diplegic CP but not in individuals. Copyright © 2014 John Wiley & Sons, Ltd.

The Children's Play Therapy Instrument (CPTI). Description, development, and reliability studies.

Science.gov (United States)

Kernberg, P F; Chazan, S E; Normandin, L

1998-01-01

The Children's Play Therapy Instrument (CPTI), its development, and reliability studies are described. The CPTI is a new instrument to examine a child's play activity in individual psychotherapy. Three independent raters used the CPTI to rate eight videotaped play therapy vignettes. Results were compared with the authors' consensual scores from a preliminary study. Generally good to excellent levels of interrater reliability were obtained for the independent raters on intraclass correlation coefficients for ordinal categories of the CPTI. Likewise, kappa levels were acceptable to excellent for nominal categories of the scale. The CPTI holds promise to become a reliable measure of play activity in child psychotherapy. Further research is needed to assess discriminant validity of the CPTI for use as a diagnostic tool and as a measure of process and outcome.
4-Meter Gait Speed Test in Chronic Obstructive Pulmonary Disease: INTERRATER RELIABILITY USING A STOPWATCH.

Science.gov (United States)

Bisca, Gianna Waldrich; Fava, Lucas Rodrigues; Morita, Andrea Akemi; Machado, Felipe Vilaça Cavallari; Pitta, Fabio; Hernandes, Nidia Aparecida

2017-12-14

4-meter gait speed (4MGS) is increasingly used to assess functional performance in patients with chronic obstructive pulmonary disease. However, the current literature lacks information regarding some technical standards for this test. Therefore, the purpose of this study was to compare and to evaluate the interrater reliability between a stopwatch and video recording used as timing systems for the 4MGS in patients with chronic obstructive pulmonary disease, as well as to verify the interrater reliability between 2 observers measuring the 4MGS time using a manual stopwatch. Fifty-one patients performed the 4MGS using 4 different protocols (random order): walking at the usual and maximum speed in a 4-meter course and walking at the same 2 speeds on an 8-m course using a 2-m acceleration zone, a 4-meter timing area, and a 2-m deceleration zone. Gait speed was measured simultaneously using a stopwatch and a video recording. In a subanalysis (n = 24), 2 independent observers timed the 4MGS using a stopwatch. There was no significant difference in comparison between the 2 timing methods (P > .05 for all), and the reliability between video recording and stopwatch was excellent in all 4MGS studied protocols (intraclass correlation coefficient ≥ 0.91). Moreover, when comparing gait speed measured by 2 observers using a stopwatch, no significant difference was found among all proposed protocols (P > .05 for all), and there was also excellent reliability between the 2 independent observers (intraclass correlation coefficient ≥ 0.94). The stopwatch, a low-cost and feasible tool, is reliable as a timing device for the 4MGS in patients with chronic obstructive pulmonary disease.
Reliability of fully automated versus visually controlled pre- and post-processing of resting-state EEG.

Science.gov (United States)

Hatz, F; Hardmeier, M; Bousleiman, H; Rüegg, S; Schindler, C; Fuhr, P

2015-02-01

To compare the reliability of a newly developed Matlab® toolbox for the fully automated, pre- and post-processing of resting state EEG (automated analysis, AA) with the reliability of analysis involving visually controlled pre- and post-processing (VA). 34 healthy volunteers (age: median 38.2 (20-49), 82% female) had three consecutive 256-channel resting-state EEG at one year intervals. Results of frequency analysis of AA and VA were compared with Pearson correlation coefficients, and reliability over time was assessed with intraclass correlation coefficients (ICC). Mean correlation coefficient between AA and VA was 0.94±0.07, mean ICC for AA 0.83±0.05 and for VA 0.84±0.07. AA and VA yield very similar results for spectral EEG analysis and are equally reliable. AA is less time-consuming, completely standardized, and independent of raters and their training. Automated processing of EEG facilitates workflow in quantitative EEG analysis. Copyright © 2014 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
The relative and absolute reliability of the Functional Independence and Difficulty Scale in community-dwelling frail elderly Japanese people using long-term care insurance services.

Science.gov (United States)

Saito, Takashi; Izawa, Kazuhiro P; Watanabe, Shuichiro

2017-06-01

The newly developed Functional Independence and Difficulty Scale is a tool for assessing the performance of basic activities of daily living in terms of both independence and difficulty. The reliability of this new scale has not been assessed. The aim of this study was to examine the relative reliability and absolute reliability of the newly developed scale in community-dwelling frail elderly people in Japan. Participants were 47 community-dwelling elderly subjects (22 for assessing test-retest reliability and 25 for assessing inter-rater reliability). As relative reliability indices, intra-class correlation coefficients were used. From an absolute reliability perspective, we conducted Bland-Altman analysis and calculated the limit of agreement or minimal detectable change to determine the acceptable range of error. Intra-class correlation coefficients for test-retest and inter-rater reliability were 0.90 (P reliability was -5.2 to 1.8, representing an increase of over six points for improvement and a decrease of over two points for decline of basic activities of daily living ability. The minimal detectable change for inter-rater reliability was 3.7, indicating that a three-point difference might be existed between difference raters. The results of this study demonstrated that the FIDS appeared to be a reliable instrument for use in Japanese community-dwelling frail elderly people. While further research using a large and more diverse sample of participants is needed, our findings support the use of FIDS in clinical practice or clinical research targeting frail elderly Japanese people.
Inter-rater reliability of the Greek version of CAARMS among two groups of mental health professionals.

Science.gov (United States)

Kollias, C; Kontaxakis, V; Havaki-Kontaxaki, B; Simmons, M B; Stefanis, N; Papageorgiou, C

2015-01-01

There is increasing interest within the Greek psychiatric community in the early detection and prevention of psychotic disorders. To support this, there is a need for a valid and reliable tool to identify young people that may be at risk of developing a psychotic disorder. Our team has previously translated the Comprehensive Assessment of At-Risk Mental States (CAARMS). The validity of the CAARMS was ensured by the procedure of translation and the aim of the current study was to estimate the interrater reliability of the CAARMS Greek translation among residents in psychiatry and specialized mental health professionals. 43 mental health workers (27 residents in psychiatry and 16 specialized mental health professionals (i.e. 11 psychiatrists and 5 psychologist) participated in two seminars that covered theoretical information about the ultra high risk concept and training in the CAARMS. During the seminars, 10 vignettes with psychiatric history cases were presented, including healthy, ultra high risk and first episode psychosis. The mean correlated percentage of agreement with the correct answers regarding diagnosis of the presented history cases among all our subjects was 81.42, among specialized mental health professionals 77.88, and among residents 84.46. Intraclass correlation co-efficients were 0.994 for specialized mental health professionals and 0.997 for residents. The translated Greek version of CAARMS presents a satisfying interrater reliability when used by both residents and specialized mental health professionals. Residents declare even higher intraclass correlation co-efficients and mean correlated percentage of agreement than specialized mental health professionals, which indicate that residents are capable of using the CAARMS in early intervention units.
The Turkish version of the Physical Activity Scale for the Elderly (PASE): its cultural adaptation, validation, and reliability.

Science.gov (United States)

Ayvat, Ender; Kilinç, Muhammed; Kirdi, Nuray

2017-06-12

This study aimed to describe the cultural adaptation of the Turkish Physical Activity Scale for the Elderly (PASE) and to examine the reliability and validity of the scale in older Turkish adults. Eighty elderly people were recruited for the study. The assessments included the PASE, the International Physical Activity Questionnaire (IPAQ), the Short Physical Performance Battery and Short Form-36 Quality of Life Questionnaire (SF-36), and the Mini Mental State Test. Outcome measures were conducted twice within a week (test-retest) for reliability. Cronbach's α coefficient was 0.714 for the initial evaluation. The intraclass correlation coefficient for the test-retest reliability was 0.995 with a 95% confidence interval of 0.993-0.997. A high level of positive correlation (0.742, P reliable and valid scale for the fields of research and practice.
Measurement of acute nonspecific low back pain perception in primary care physical therapy: reliability and validity of the brief illness perception questionnaire.

Science.gov (United States)

Hallegraeff, Joannes M; van der Schans, Cees P; Krijnen, Wim P; de Greef, Mathieu H G

2013-02-01

The eight-item Brief Illness Perception Questionnaire is used as a screening instrument in physical therapy to assess mental defeat in patients with acute low back pain, besides patient perception might determine the course and risk for chronic low back pain. However, the psychometric properties of the Brief Illness Perception Questionnaire in common musculoskeletal disorders like acute low back pain have not been adequately studied. Patients' perceptions vary across different populations and affect coping styles. Thus, our aim was to determine the internal consistency, test-retest reliability and validity of the Dutch language version of the Brief Illness Perception Questionnaire in acute non-specific low back pain patients in primary care physical therapy. A non-experimental cross-sectional study with two measurements was performed. Eighty-four acute low back pain patients, in multidisciplinary health care center in Dutch primary care with a sample mean (SD) age of 42 (12) years, participated in the study. Internal consistency (Cronbach's α) and test-retest procedures (Intraclass Correlation Coefficients and limits of agreement) were evaluated at a one-week interval. The concurrent validity of the Brief Illness Perception Questionnaire was examined by using the Mental Health Component of the Short Form 36 Health Survey. The Cronbach's α for internal consistency was 0.73 (95% CI, 0.67 - 0.83); and the Intraclass Correlation Coefficient test-retest reliability was acceptable: 0.72 (95% CI, 0.53 - 0.82), however, the limits of agreement were large. The Intraclass Correlation Coefficient measuring concurrent validity 0.65 (95% CI, 0.46 - 0.80). The Dutch version of the Brief Illness Perception Questionnaire is an appropriate instrument for measuring patients' perceptions in acute low back pain patients, showing acceptable internal consistency and reliability. Concurrent validity is adequate, however, the instrument may be unsuitable for detecting changes in low
Food and beverage environment analysis and monitoring system: a reliability study in the school food and beverage environment.

Science.gov (United States)

Bullock, Sally Lawrence; Craypo, Lisa; Clark, Sarah E; Barry, Jason; Samuels, Sarah E

2010-07-01

States and school districts around the country are developing policies that set nutrition standards for competitive foods and beverages sold outside of the US Department of Agriculture's reimbursable school lunch program. However, few tools exist for monitoring the implementation of these new policies. The objective of this research was to develop a computerized assessment tool, the Food and Beverage Environment Analysis and Monitoring System (FoodBEAMS), to collect data on the competitive school food environment and to test the inter-rater reliability of the tool among research and nonresearch professionals. FoodBEAMS was used to collect data in spring 2007 on the competitive foods and beverages sold in 21 California high schools. Adherence of the foods and beverages to California's competitive food and beverage nutrition policies for schools (Senate Bills 12 and 965) was determined using the data collected by both research and nonresearch professionals. The inter-rater reliability between the data collectors was assessed using the intraclass correlation coefficient. Researcher vs researcher and researcher vs nonresearcher inter-rater reliability was high for both foods and beverages, with intraclass correlation coefficients ranging from .972 to .987. Results of this study provide evidence that FoodBEAMS is a promising tool for assessing and monitoring adherence to nutrition standards for competitive foods sold on school campuses and can be used reliably by both research and nonresearch professionals. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Short-term test-retest-reliability of conditioned pain modulation using the cold-heat-pain method in healthy subjects and its correlation to parameters of standardized quantitative sensory testing.

Science.gov (United States)

Gehling, Julia; Mainka, Tina; Vollert, Jan; Pogatzki-Zahn, Esther M; Maier, Christoph; Enax-Krumova, Elena K

2016-08-05

Conditioned Pain Modulation (CPM) is often used to assess human descending pain inhibition. Nine different studies on the test-retest-reliability of different CPM paradigms have been published, but none of them has investigated the commonly used heat-cold-pain method. The results vary widely and therefore, reliability measures cannot be extrapolated from one CPM paradigm to another. Aim of the present study was to analyse the test-retest-reliability of the common heat-cold-pain method and its correlation to pain thresholds. We tested the short-term test-retest-reliability within 40 ± 19.9 h using a cold-water immersion (10 °C, left hand) as conditioning stimulus (CS) and heat pain (43-49 °C, pain intensity 60 ± 5 on the 101-point numeric rating scale, right forearm) as test stimulus (TS) in 25 healthy right-handed subjects (12females, 31.6 ± 14.1 years). The TS was applied 30s before (TSbefore), during (TSduring) and after (TSafter) the 60s CS. The difference between the pain ratings for TSbefore and TSduring represents the early CPM-effect, between TSbefore and TSafter the late CPM-effect. Quantitative sensory testing (QST, DFNS protocol) was performed on both sessions before the CPM assessment. paired t-tests, Intraclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Pearson's correlation, Bland-Altman analysis, significance level p Pain ratings during CPM correlated significantly (ICC: 0.411…0.962) between both days, though ratings for TSafter were lower on day 2 (p pain thresholds. The short-term test-retest-reliability of the early CPM-effect using the heat-cold-pain method in healthy subjects achieved satisfying results in terms of the ICC. The SRD of the early CPM effect showed that an individual change of > 20 NRS can be attributed to a real change rather than chance. The late CPM-effect was weaker and not reliable.
Foundations for a time reliability correlation system to quantify human reliability

International Nuclear Information System (INIS)

Dougherty, E.M. Jr.; Fragola, J.R.

1988-01-01

Time reliability correlations (TRCs) have been used in human reliability analysis (HRA) in conjunction with probabilistic risk assessment (PRA) to quantify post-initiator human failure events. The first TRCs were judgmental but recent data taken from simulators have provided evidence for development of a system of TRCs. This system has the equational form: t = tau R X tau U , where the first factor is the lognormally distributed random variable of successful response time, derived from the simulator data, and the second factor is a unitary lognormal random variable to account for uncertainty in the model. The first random variable is further factored into a median response time and a factor to account for the dominant type of behavior assumed to be involved in the response and a second factor to account for other influences on the reliability of the response
Reliability and validity of two isometric squat tests.

Science.gov (United States)

Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U

2002-05-01

The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p squat and FHS test performances (r squat and FHS test performance can be attributed to differences in the movement patterns of the tests
Validation and reliability of a Behcet's Syndrome Activity Scale in Korea.

Science.gov (United States)

Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo

2016-01-01

We prepared a cross-cultural adaptation of the Behcet's Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Fifty patients with Behcet's disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet's Disease Current Activity Form (BDCAF) and a Behcet's Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). The Korean version of BSAS is a reliable and valid instrument to measure BD activity.
Reliability of the Bulb Dynamometer for Assessing Grip Strength

Directory of Open Access Journals (Sweden)

Colleen Maher

2018-04-01

Full Text Available Background: Hand function is an overall indicator of health and is often measured using grip strength. Handheld dynamometry is the most common method of measuring grip strength. The purpose of this study was to determine the inter-rater and test-retest reliability, the reliability of one trial versus three trials, and the preliminary norms for a young adult population using the Baseline® Pneumatic Squeeze Bulb Dynamometer (30 psi. Methods: This study used a one-group methodological design. One hundred and three healthy adults (30 males and 73 females were recruited. Six measurements were collected for each hand per participant. The data was analyzed using Intraclass Correlation Coefficients (ICC two-way effects model (2,2 and paired-samples t-tests. Results: The ICC for inter-rater reliability ranged from 0.955 to 0.977. Conclusion: The results of this study suggest that the bulb dynamometer is a reliable tool to measure grip strength and should be further explored for reliable and valid use in diverse populations and as an alternative to the Jamar dynamometer.
Accuracy and correlates of maternal recall of birthweight and gestational age

DEFF Research Database (Denmark)

Adegboye, A R A; Heitmann, Berit Lilienthal

2008-01-01

the two sources was evaluated by mean differences (MD), intraclass correlation coefficient (ICC) and Bland-Altman's plots. The misclassification of the various BW and GA categories were also estimated. MAIN OUTCOME MEASURES: Differences between recalled and registered BW and GA. RESULTS: There was high......OBJECTIVE: To determine the accuracy of maternal recall of children birthweight (BW) and gestational age (GA), using the Danish Medical Birth Register (DBR) as reference and to examine the reliability of recalled BW and its potential correlates. DESIGN: Comparison of data from the DBR...
Patient Assessment of Constipation Quality of Life Questionnaire: Translation, Cultural Adaptation, Reliability, and Validity of the Persian Version.

Science.gov (United States)

Nikjooy, Afsaneh; Jafari, Hassan; Saba, Maryam A; Ebrahimi, Naghmeh; Mirzaei, Rezvan

2018-05-01

The Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire is the most validated and the most specific tool for measuring the quality of life of patients with constipation. Over 120 million people live in countries whose official language is Persian. There is no reported Persian version of the PAC-QOL questionnaire yet. The aim of this study was to translate and culturally adapt the PAC-QOL questionnaire and to assess its reliability and validity among Persian patients with chronic constipation. Following the translation and cultural adaptation of the PAC-QOL questionnaire to Persian, 100 patients (mean±SD age=40.51±13.67) with constipation were recruited for validity measurement and 20 patients were re-examined for reliability. Content validity was assessed based on the opinions of an expert committee and the floor/ceiling effect. Construct validity was evaluated according to the hypothesis test. The SF-36 questionnaire was used for concurrent criterion validity, intra-class correlation coefficient for reliability, and Cronbach's alpha for internal consistency. The content validity of the PAC-QOL questionnaire was proven, and there was no floor/ceiling effect. Construct validity also was confirmed based on the hypothesis test. The overall Cronbach's alpha of the PAC-QOL questionnaire was 0.92 (range=0.72-0.92), and the overall intra-class correlation coefficient of the questionnaire was 0.88 (range=0.69-0.87). The correlation between the SF-36 and PAC-QOL questionnaires was moderate. The Persian version of the PAC-QOL questionnaire demonstrated good validity and reliability properties in chronic constipation. Accordingly, Persian researchers and clinicians can benefit from this questionnaire in further research and assessment of treatment outcomes.
Reliability and concurrent validity of a Smartphone, bubble inclinometer and motion analysis system for measurement of hip joint range of motion.

Science.gov (United States)

Charlton, Paula C; Mentiplay, Benjamin F; Pua, Yong-Hao; Clark, Ross A

2015-05-01

Traditional methods of assessing joint range of motion (ROM) involve specialized tools that may not be widely available to clinicians. This study assesses the reliability and validity of a custom Smartphone application for assessing hip joint range of motion. Intra-tester reliability with concurrent validity. Passive hip joint range of motion was recorded for seven different movements in 20 males on two separate occasions. Data from a Smartphone, bubble inclinometer and a three dimensional motion analysis (3DMA) system were collected simultaneously. Intraclass correlation coefficients (ICCs), coefficients of variation (CV) and standard error of measurement (SEM) were used to assess reliability. To assess validity of the Smartphone application and the bubble inclinometer against the three dimensional motion analysis system, intraclass correlation coefficients and fixed and proportional biases were used. The Smartphone demonstrated good to excellent reliability (ICCs>0.75) for four out of the seven movements, and moderate to good reliability for the remaining three movements (ICC=0.63-0.68). Additionally, the Smartphone application displayed comparable reliability to the bubble inclinometer. The Smartphone application displayed excellent validity when compared to the three dimensional motion analysis system for all movements (ICCs>0.88) except one, which displayed moderate to good validity (ICC=0.71). Smartphones are portable and widely available tools that are mostly reliable and valid for assessing passive hip range of motion, with potential for large-scale use when a bubble inclinometer is not available. However, caution must be taken in its implementation as some movement axes demonstrated only moderate reliability. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Inter-rater reliability of kinesthetic measurements with the KINARM robotic exoskeleton.

Science.gov (United States)

Semrau, Jennifer A; Herter, Troy M; Scott, Stephen H; Dukelow, Sean P

2017-05-22

Kinesthesia (sense of limb movement) has been extremely difficult to measure objectively, especially in individuals who have survived a stroke. The development of valid and reliable measurements for proprioception is important to developing a better understanding of proprioceptive impairments after stroke and their impact on the ability to perform daily activities. We recently developed a robotic task to evaluate kinesthetic deficits after stroke and found that the majority (~60%) of stroke survivors exhibit significant deficits in kinesthesia within the first 10 days post-stroke. Here we aim to determine the inter-rater reliability of this robotic kinesthetic matching task. Twenty-five neurologically intact control subjects and 15 individuals with first-time stroke were evaluated on a robotic kinesthetic matching task (KIN). Subjects sat in a robotic exoskeleton with their arms supported against gravity. In the KIN task, the robot moved the subjects' stroke-affected arm at a preset speed, direction and distance. As soon as subjects felt the robot begin to move their affected arm, they matched the robot movement with the unaffected arm. Subjects were tested in two sessions on the KIN task: initial session and then a second session (within an average of 18.2 ± 13.8 h of the initial session for stroke subjects), which were supervised by different technicians. The task was performed both with and without the use of vision in both sessions. We evaluated intra-class correlations of spatial and temporal parameters derived from the KIN task to determine the reliability of the robotic task. We evaluated 8 spatial and temporal parameters that quantify kinesthetic behavior. We found that the parameters exhibited moderate to high intra-class correlations between the initial and retest conditions (Range, r-value = [0.53-0.97]). The robotic KIN task exhibited good inter-rater reliability. This validates the KIN task as a reliable, objective method for quantifying
Reliability of risk assessment measures used in sexually violent predator proceedings.

Science.gov (United States)

Miller, Cailey S; Kimonis, Eva R; Otto, Randy K; Kline, Suzonne M; Wasserman, Adam L

2012-12-01

The field interrater reliability of three assessment tools frequently used by mental health professionals when evaluating sex offenders' risk for reoffending--the Psychopathy Checklist-Revised (PCL-R), the Minnesota Sex Offender Screening Tool-Revised (MnSOST-R) and the Static-99-was examined within the context of sexually violent predator program proceedings. Rater agreement was highest for the Static--99 (intraclass correlation coefficient [ICC₁] = .78) and lowest for the PCL-R (ICC₁ = .60; MnSOST-R ICC₁ = .74), although all instruments demonstrated lower field reliability than that reported in their test manuals. Findings raise concerns about the reliability of risk assessment tools that are used to inform judgments of risk in high-stake sexually violent predator proceedings. Implications for future research and suggestions for improving evaluator training to increase accuracy when informing legal decision making are discussed.
[Reliability of nursing outcomes classification label "Knowledge: cardiac disease management (1830)" in outpatients with heart failure].

Science.gov (United States)

Cañón-Montañez, Wilson; Oróstegui-Arenas, Myriam

2015-01-01

To determine the reliability (internal consistency, inter-rater reproducibility and level of agreement) of nursing outcome: "Knowledge: cardiac disease management (1830)" of the version published in Spanish, in outpatients with heart failure. A reliability study was conducted on 116 outpatients with heart failure. Six indicators of nursing outcome were operationalized. All participants were assessed simultaneously by two evaluators. Three evaluation periods were defined: initial (at baseline), final (a month later), and follow-up (two months later). Internal consistency by Cronbach alpha coefficient, inter-rater reproducibility with intraclass correlation coefficient of reproducibility or agreement and level agreement using the 95% limits of Bland and Altman. Cronbach's alpha was 0.83 (95% CI: 0.77 - 0.89) in the final evaluation, and follow-up values of 0.85 (95% CI: 0.82-0.89) and 0.83 (95% CI: 0.78 - 0.88) were found for the first and second evaluator, respectively. The intraclass correlation coefficient showed values greater 0.9 in the three evaluation periods in both the random and mixed model. The Bland-Altman 95% limits of agreement were close to zero in the three evaluations performed. The questionnaire operationalized to assess the nursing outcome: "Knowledge: cardiac disease management (1830)" in its Spanish version, is a reliable method to measure skills and knowledge in outpatients with heart failure in the Colombian context. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
Accuracy and reliability of observational gait analysis data: judgments of push-off in gait after stroke.

Science.gov (United States)

McGinley, Jennifer L; Goldie, Patricia A; Greenwood, Kenneth M; Olney, Sandra J

2003-02-01

Physical therapists routinely observe gait in clinical practice. The purpose of this study was to determine the accuracy and reliability of observational assessments of push-off in gait after stroke. Eighteen physical therapists and 11 subjects with hemiplegia following a stroke participated in the study. Measurements of ankle power generation were obtained from subjects following stroke using a gait analysis system. Concurrent videotaped gait performances were observed by the physical therapists on 2 occasions. Ankle power generation at push-off was scored as either normal or abnormal using two 11-point rating scales. These observational ratings were correlated with the measurements of peak ankle power generation. A high correlation was obtained between the observational ratings and the measurements of ankle power generation (mean Pearson r=.84). Interobserver reliability was moderately high (mean intraclass correlation coefficient [ICC (2,1)]=.76). Intraobserver reliability also was high, with a mean ICC (2,1) of.89 obtained. Physical therapists were able to make accurate and reliable judgments of push-off in videotaped gait of subjects following stroke using observational assessment. Further research is indicated to explore the accuracy and reliability of data obtained with observational gait analysis as it occurs in clinical practice.

Translation, cross-cultural adaptation and reliability of the German version of the migraine disability assessment (MIDAS) questionnaire.

Science.gov (United States)

Benz, Thomas; Lehmann, Susanne; Gantenbein, Andreas R; Sandor, Peter S; Stewart, Walter F; Elfering, Achim; Aeschlimann, André G; Angst, Felix

2018-03-09

The Migraine Disability Assessment (MIDAS) is a brief questionnaire and measures headache-related disability. This study aimed to translate and cross-culturally adapt the original English version of the MIDAS to German and to test its reliability. The standardized translation process followed international guidelines. The pre-final version was tested for clarity and comprehensibility by 34 headache sufferers. Test-retest reliability of the final version was quantified by 36 headache patients completing the MIDAS twice with an interval of 48 h. Reliability was determined by intraclass correlation coefficients and internal consistency by Cronbach's α. All steps of the translation process were followed, documented and approved by the developer of the MIDAS. The expert committee discussed in detail the complex phrasing of the questions that refer to one to another, especially exclusion of headache-days from one item to the next. The German version contains more active verb sentences and prefers the perfect to the imperfect tense. The MIDAS scales intraclass correlation coefficients ranged from 0.884 to 0.994 and was 0.991 (95% CI: 0.982-0.995) for the MIDAS total score. Cronbach's α for the MIDAS as a whole was 0.69 at test and 0.67 at retest. The translation process was challenged by the comprehensibility of the questionnaire. The German version of the MIDAS is a highly reliable instrument for assessing headache related disability with moderate internal consistency. Provided validity testing of the German MIDAS is successful, it can be recommended for use in clinical practice as well as in research.
Measuring the validity and reliability of the Apple Watch as a physical activity monitor.

Science.gov (United States)

Zhang, Peng; Godin, Steven D; Owens, Matthew V

2018-04-04

This study aimed to investigate the validity and reliability of the energy expenditure (EE) estimation of Apple Watch among college students. Thirty college students completed two sets of three 10-minute treadmill walking and running trials while wearing three Apple Watches and being connected to indirect calorimetry. The walking trials were at speeds of 54, 80, and 107 m•min-1 while the running trials were at 134, 161, 188m•min-1. Energy expenditure comparisons were made using Two-way ANOVA with repeatedmeasures. Reliability was analyzed by Intraclass Correlation. There was no significant device x speed interactions (F (15, 696) = 1.113, p = 0.341) between the indirect calorimetry (criterion) and Apple Watch. The lowest Inter-Class Correlation (ICC) scores were 0.49 (95%CI) at 54 while the highest were 0.72 (95%CI) at 107 and 134 m•min-1. Apple Watch demonstrated a low to moderate validity and reliability on measuring EE.
The reliability of the Adelaide in-shoe foot model.

Science.gov (United States)

Bishop, Chris; Hillier, Susan; Thewlis, Dominic

2017-07-01

Understanding the biomechanics of the foot is essential for many areas of research and clinical practice such as orthotic interventions and footwear development. Despite the widespread attention paid to the biomechanics of the foot during gait, what largely remains unknown is how the foot moves inside the shoe. This study investigated the reliability of the Adelaide In-Shoe Foot Model, which was designed to quantify in-shoe foot kinematics and kinetics during walking. Intra-rater reliability was assessed in 30 participants over five walking trials whilst wearing shoes during two data collection sessions, separated by one week. Sufficient reliability for use was interpreted as a coefficient of multiple correlation and intra-class correlation coefficient of >0.61. Inter-rater reliability was investigated separately in a second sample of 10 adults by two researchers with experience in applying markers for the purpose of motion analysis. The results indicated good consistency in waveform estimation for most kinematic and kinetic data, as well as good inter-and intra-rater reliability. The exception is the peak medial ground reaction force, the minimum abduction angle and the peak abduction/adduction external hindfoot joint moments which resulted in less than acceptable repeatability. Based on our results, the Adelaide in-shoe foot model can be used with confidence for 24 commonly measured biomechanical variables during shod walking. Copyright © 2017 Elsevier B.V. All rights reserved.
Interrater reliability of the Melbourne Assessment of Unilateral Upper Limb Function for children with hemiplegic cerebral palsy.

LENUS (Irish Health Repository)

Spirtos, Michelle

2012-02-01

OBJECTIVE: We examined the interrater reliability of the Melbourne Assessment of Unilateral Upper Limb Function. METHOD: Three occupational therapists independently scored 34 videotaped assessments of children with hemiplegic cerebral palsy aged 6 yr, 1 mo, to 14 yr, 5 mo. Intraclass correlation coefficients (ICCs) at a 95% confidence interval were calculated for total scores, category scores, and item scores. RESULTS: The correlation between raters\\' total scores was high (ICC = .961). The highest correlation for test components between raters was found for fluency (ICC = .902), followed by range of movement (ICC = .866), and the lowest correlation was found for quality of movement (ICC = .683). The ICCs for individual test item scores varied and ranged from .368 to .899. CONCLUSION: This study demonstrated high interrater reliability for total scores, with scoring of some individual components and items requiring further consideration from both a clinical and a research perspective.
The Reliability of Individualized Load-Velocity Profiles.

Science.gov (United States)

Banyard, Harry G; Nosaka, K; Vernon, Alex D; Haff, G Gregory

2017-11-15

This study examined the reliability of peak velocity (PV), mean propulsive velocity (MPV), and mean velocity (MV) in the development of load-velocity profiles (LVP) in the full depth free-weight back squat performed with maximal concentric effort. Eighteen resistance-trained men performed a baseline one-repetition maximum (1RM) back squat trial and three subsequent 1RM trials used for reliability analyses, with 48-hours interval between trials. 1RM trials comprised lifts from six relative loads including 20, 40, 60, 80, 90, and 100% 1RM. Individualized LVPs for PV, MPV, or MV were derived from loads that were highly reliable based on the following criteria: intra-class correlation coefficient (ICC) >0.70, coefficient of variation (CV) ≤10%, and Cohen's d effect size (ES) 0.05) between trials, movement velocities, or between linear regression versus second order polynomial fits. PV 20-100% , MPV 20-90% , and MV 20-90% are reliable and can be utilized to develop LVPs using linear regression. Conceptually, LVPs can be used to monitor changes in movement velocity and employed as a method for adjusting sessional training loads according to daily readiness.
Intertester reliability of the talk test in a cardiac rehabilitation population

DEFF Research Database (Denmark)

Petersen, Annemette Krintel; Maribo, Thomas; Hjortdal, Vibeke Elisabeth

2013-01-01

PURPOSE: The validity of the Talk Test (TT) is well documented, but the reliability of the test is not clear. The aim of this study was to assess the absolute and relative intertester reliability of the TT in cardiac patients. METHODS: Cardiac patients (n = 64) who had completed an exercise...... randomized to tests. Workload in watts at the first negative stage of the TT was registered as the test result. Patients and physiotherapists were blinded to test results of the first test. Absolute reliability of the TT was assessed with Bland-Altman plot, standard error of measurement, and minimal...... detectable change. Relative reliability was assessed using the intraclass correlation coefficient (ICC). RESULTS: Mean difference in peak workload between test and retest was 0.8 W (95% CI: -4.8 to 3.3). Limit of agreement was estimated to be +31/-32 W. Standard error of measurement was 11 W (95% CI: 10...
Rating scales for dystonia in cerebral palsy: reliability and validity.

Science.gov (United States)

Monbaliu, E; Ortibus, E; Roelens, F; Desloovere, K; Deklerck, J; Prinzie, P; de Cock, P; Feys, H

2010-06-01

This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System and six patients were classified at level V. Reliability was measured by (1) intraclass correlation coefficient (ICC) for interrater reliability, (2) standard error of measurement (SEM) and smallest detectable difference (SDD), and (3) Cronbach's alpha for internal consistency. Validity was assessed by Pearson's correlations among the three scales used and by content analysis. Moderate to good interrater reliability was found for total scores of the three scales (ICC: BADS=0.87; BFMMS=0.86; UDRS=0.79). However, many subitems showed low reliability, in particular for the UDRS. SEM and SDD were respectively 6.36% and 17.72% for the BADS, 9.88% and 27.39% for the BFMMS, and 8.89% and 24.63% for the UDRS. High internal consistency was found. Pearson's correlations were high. Content validity showed insufficient accordance with the new CP definition and classification. Our results support the internal consistency and concurrent validity of the scales; however, taking into consideration the limitations in reliability, including the large SDD values and the content validity, further research on methods of assessment of dystonia is warranted.
Reliability of short form-36 in an Internet- and a pen-and-paper version

DEFF Research Database (Denmark)

Basnov, Maja; Kongsved, Sissel Marie; Bech, Per

2009-01-01

Use of Internet versions of questionnaires may have several advantages in clinical and epidemiological research, but we know little about if Internet versions differ with respect to validity and reliability. We aimed to compare Internet- and pen-and-paper versions of short form-36 (SF-36......) with respect to test-retest reliability and internal consistency. Women referred to mammography (n = 782) were randomised to receive either a paper version with a prepaid return envelope or a guideline on how to fill in the Internet version. A subgroup was asked to answer the questionnaire once again...... in the alternative version. Test-retest reliability was assessed by the intra-class correlation coefficient. Internal consistency was calculated as Cronbach's alpha. The between-version test-retest reliability for the eight subscales were between 0.63 and 0.92. Cronbach's alpha for the two versions were all between...
Reliability and validity of the upper-body dressing scale in Japanese patients with vascular dementia with hemiparesis.

Science.gov (United States)

Endo, Arisa; Suzuki, Makoto; Akagi, Atsumi; Chiba, Naoyuki; Ishizaka, Ikuyo; Matsunaga, Atsuhiko; Fukuda, Michinari

2015-03-01

The purpose of this study was to examine the reliability and validity of the Upper-body Dressing Scale (UBDS) for buttoned shirt dressing, which evaluates the learning process of new component actions of upper-body dressing in patients diagnosed with dementia and hemiparesis. This was a preliminary correlational study of concurrent validity and reliability in which 10 vascular dementia patients with hemiparesis were enrolled and assessed repeatedly by six occupational therapists by means of the UBDS and the dressing item of the Functional Independence Measure (FIM). Intraclass correlation coefficient was 0.97 for intra-rater reliability and 0.99 for inter-rater reliability. The level of correlation between UBDS score and FIM dressing item scores was -0.93. UBDS scores for paralytic hand passed into the sleeve and sleeve pulled up beyond the shoulder joint were worse than the scores for the other components of the task. The UBDS has good reliability and validity for vascular dementia patients with hemiparesis. Further research is needed to investigate the relation between UBDS score and the effect of intervention and to clarify sensitivity or responsiveness of the scale to clinical change. Copyright © 2014 John Wiley & Sons, Ltd.
Reliability and validity of an adapted Arabic version of the Scoliosis Research Society-22r Questionnaire.

Science.gov (United States)

Haidar, Rachid K; Kassak, Kassem; Masrouha, Karim; Ibrahim, Kamal; Mhaidli, Hani

2015-09-01

Cross-sectional validation and reliability assessment study of Arabic version of Scoliosis Research Society-22 (SRS-22r) Questionnaire. To develop and validate the Arabic version of the SRS-22r questionnaire. The diagnosis and treatment of adolescent idiopathic scoliosis may influence patient quality of life. SRS-22r is an internationally validated questionnaire used to assess function/activity, pain, self-image, and mental health of patients with scoliosis. It has been translated into several languages but not into Arabic language. Therefore, a valid health-related quality-of-life outcome questionnaire for patients with spinal deformity is still lacking in Arabic language. The English version of SRS-22r questionnaire was translated, back-translated, and culturally adapted to Arabic language. Then, 81 patients with idiopathic adolescent scoliosis were allocated randomly into either the reliability testing group (group 1) or the validity testing group (group 2). Group 1 patients completed Arabic version of SRS-22r questionnaire twice with 1-week interval in-between. Cronbach α and intraclass correlation coefficient were measured to determine internal consistency and temporal reliability. Group 2 patients completed the Arabic version of SRS-22r questionnaire and the previously validated Arabic version of 36-Item Short Form Health Survey (Short Form-36) questionnaire concurrently, and Pearson correlation coefficient was obtained to assess validity. Content analysis, internal consistency reliability, test/retest reproducibility (intraclass correlation coefficient range: 0.82-0.90), and test of concurrent validity showed satisfactory results. Function/activity and satisfaction with management domains had a lower Cronbach α (0.58 and 0.44, respectively, vs. 0.71-0.85 range for others). Self-image/appearance and satisfaction with management had a lower correlation with domains of the 36-Item Short Form Health Survey. An Arabic version of the SRS-22r questionnaire has
Non-Weight-Bearing and Weight-Bearing Ultrasonography of Select Foot Muscles in Young, Asymptomatic Participants: A Descriptive and Reliability Study.

Science.gov (United States)

Battaglia, Patrick J; Mattox, Ross; Winchester, Brett; Kettner, Norman W

The primary aim of this study was to determine the reliability of diagnostic ultrasound imaging for select intrinsic foot muscles using both non-weight-bearing and weight-bearing postures. Our secondary aim was to describe the change in muscle cross-sectional area (CSA) and dorsoplantar thickness when bearing weight. An ultrasound examination was performed with a linear ultrasound transducer operating between 9 and 12 MHz. Long-axis and short-axis ultrasound images of the abductor hallucis, flexor digitorum brevis, and quadratus plantae were obtained in both the non-weight-bearing and weight-bearing postures. Two examiners independently collected ultrasound images to allow for interexaminer and intraexaminer reliability calculation. The change in muscle CSA and dorsoplantar thickness when bearing weight was also studied. There were 26 participants (17 female) with a mean age of 25.5 ± 3.8 years and a mean body mass index of 28.0 ± 7.8 kg/m 2 . Inter-examiner reliability was excellent when measuring the muscles in short axis (intraclass correlation coefficient >0.75) and fair to good in long axis (intraclass correlation coefficient >0.4). Intraexaminer reliability was excellent for the abductor hallucis and flexor digitorum brevis and ranged from fair to good to excellent for the quadratus plantae. Bearing weight did not reduce interexaminer or intraexaminer reliability. All muscles exhibited a significant increase in CSA when bearing weight. This is the first report to describe weight-bearing diagnostic ultrasound of the intrinsic foot muscles. Ultrasound imaging is reliable when imaging these muscles bearing weight. Furthermore, muscle CSA increases in the weight-bearing posture. Copyright Â© 2016. Published by Elsevier Inc.
Reliability and Validity of Food Frequency Questions to Assess Beverage and Food Group Intakes among Low-Income 2- to 4-Year-Old Children.

Science.gov (United States)

Koleilat, Maria; Whaley, Shannon E

2016-06-01

Fruits, vegetables, sweetened foods, and beverages have been found to have positive and negative associations with obesity in early childhood, yet no rapid assessment tools are available to measure intake of these foods among preschoolers. This study examines the test-retest reliability and validity of a 10-item Child Food and Beverage Intake Questionnaire designed to assess fruits, vegetables, and sweetened foods and beverages intake among 2- to 4-year-old children. The Child Food and Beverage Intake Questionnaire was developed for use in periodic phone surveys conducted with low-income families with preschool-aged children. Seventy primary caregivers of 2- to 4-year-old children completed two Child Food and Beverage Intake Questionnaires within a 2-week period for test-retest reliability. Participants also completed three 24-hour recalls to allow assessment of validity. Intraclass correlations were used to examine test-retest reliability. Spearman rank correlation coefficients, Bland-Altman plots, and linear regression analyses were used to examine validity of the Child Food and Beverage Intake Questionnaire compared with three 24-hour recalls. Intraclass correlations between Child Food and Beverage Intake Questionnaire administrations ranged from 0.48 for sweetened drinks to 0.87 for regular sodas. Intraclass correlations for fruits, vegetables, and sweetened food were 0.56, 0.49, and 0.56, respectively. Spearman rank correlation coefficients ranged from 0.15 to 0.59 for beverages, with 0.46 for sugar-sweetened beverages. Spearman rank correlation coefficients for fruits, vegetables, and sweetened food were 0.30, 0.33, and 0.30, respectively. Although observation of the Bland-Altman plots and linear regression analyses showed a slight upward trend in mean differences, with increasing mean intake for five beverage groups, at least 90% of data plots fell within the limits of agreement for all food/beverage groups. The Child Food and Beverage Intake Questionnaire
Balance Assessment in Sports-Related Concussion: Evaluating Test-Retest Reliability of the Equilibrate System.

Science.gov (United States)

Odom, Mitchell J; Lee, Young M; Zuckerman, Scott L; Apple, Rachel P; Germanos, Theodore; Solomon, Gary S; Sills, Allen K

2016-01-01

This study evaluated the test-retest reliability of a novel computer-based, portable balance assessment tool, the Equilibrate System (ES), used to diagnose sports-related concussion. Twenty-seven students participated in ES testing consisting of three sessions over 4 weeks. The modified Balance Error Scoring System was performed. For each participant, test-retest reliability was established using the intraclass correlation coefficient (ICC). The ES test-retest reliability from baseline to week 2 produced an ICC value of 0.495 (95% CI, 0.123-0.745). Week 2 testing produced ICC values of 0.602 (95% CI, 0.279-0.803) and 0.610 (95% CI, 0.299-0.804), respectively. All other single measures test-retest reliability values produced poor ICC values. Same-day ES testing showed fair to good test-retest reliability while interweek measures displayed poor to fair test-retest reliability. Testing conditions should be controlled when using computerized balance assessment methods. ES testing should only be used as a part of a comprehensive assessment.
Test-Retest Reliability of Rating of Perceived Exertion and Agreement With 1-Repetition Maximum in Adults.

Science.gov (United States)

Bove, Allyn M; Lynch, Andrew D; DePaul, Samantha M; Terhorst, Lauren; Irrgang, James J; Fitzgerald, G Kelley

2016-09-01

Study Design Clinical measurement. Background It has been suggested that rating of perceived exertion (RPE) may be a useful alternative to 1-repetition maximum (1RM) to determine proper resistance exercise dosage. However, the test-retest reliability of RPE for resistance exercise has not been determined. Additionally, prior research regarding the relationship between 1RM and RPE is conflicting. Objectives The purpose of this study was to (1) determine test-retest reliability of RPE related to resistance exercise and (2) assess agreement between percentages of 1RM and RPE during quadriceps resistance exercise. Methods A sample of participants with and without knee pathology completed a series of knee extension exercises and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale, then repeated the procedure 1 to 2 weeks later for test-retest reliability. To determine agreement between RPE and 1RM, participants completed knee extension exercises at various percentages of their 1RM (10% to 130% of predicted 1RM) and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale. Percent agreement was calculated between the 1RM and RPE at each resistance interval. Results The intraclass correlation coefficient indicated excellent test-retest reliability of RPE for quadriceps resistance exercises (intraclass correlation coefficient = 0.895; 95% confidence interval: 0.866, 0.918). Overall percent agreement between RPE and 1RM was 60%, but agreement was poor within the ranges that would typically be used for training (50% 1RM for muscle endurance, 70% 1RM and greater for strength). Conclusion Test-retest reliability of perceived exertion during quadriceps resistance exercise was excellent. However, agreement between the RPE and 1RM was poor, especially in common training zones for knee extensor strengthening. J Orthop Sports Phys Ther 2016;46(9):768-774. Epub 5 Aug 2016. doi:10.2519/jospt.2016.6498.
Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.

Science.gov (United States)

Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J

2012-07-01

We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.
The Children's Play Therapy Instrument (CPTI): Description, Development, and Reliability Studies

Science.gov (United States)

Kernberg, Paulina F.; Chazan, Saralea E.; Normandin, Lina

1998-01-01

The Children's Play Therapy Instrument (CPTI), its development, and reliability studies are described. The CPTI is a new instrument to examine a child's play activity in individual psychotherapy. Three independent raters used the CPTI to rate eight videotaped play therapy vignettes. Results were compared with the authors' consensual scores from a preliminary study. Generally good to excellent levels of interrater reliability were obtained for the independent raters on intraclass correlation coefficients for ordinal categories of the CPTI. Likewise, kappa levels were acceptable to excellent for nominal categories of the scale. The CPTI holds promise to become a reliable measure of play activity in child psychotherapy. Further research is needed to assess discriminant validity of the CPTI for use as a diagnostic tool and as a measure of process and outcome.(The Journal of Psychotherapy Practice and Research 1998; 7:196–207) PMID:9631341
Functional claudication distance: a reliable and valid measurement to assess functional limitation in patients with intermittent claudication

Directory of Open Access Journals (Sweden)

Prins Martin H

2009-03-01

Full Text Available Abstract Background Disease severity and functional impairment in patients with intermittent claudication is usually quantified by the measurement of pain-free walking distance (intermittent claudication distance, ICD and maximal walking distance (absolute claudication distance, ACD. However, the distance at which a patient would prefer to stop because of claudication pain seems a definition that is more correspondent with the actual daily life walking distance. We conducted a study in which the distance a patient prefers to stop was defined as the functional claudication distance (FCD, and estimated the reliability and validity of this measurement. Methods In this clinical validity study we included patients with intermittent claudication, following a supervised exercise therapy program. The first study part consisted of two standardised treadmill tests. During each test ICD, FCD and ACD were determined. Primary endpoint was the reliability as represented by the calculated intra-class correlation coefficients. In the second study part patients performed a standardised treadmill test and filled out the Rand-36 questionnaire. Spearman's rho was calculated to assess validity. Results The intra-class correlation coefficients of ICD, FCD and ACD were 0.940, 0.959, and 0.975 respectively. FCD correlated significantly with five out of nine domains, namely physical function (rho = 0.571, physical role (rho = 0.532, vitality (rho = 0.416, pain (rho = 0.416 and health change (rho = 0.414. Conclusion FCD is a reliable and valid measurement for determining functional capacity in trained patients with intermittent claudication. Furthermore it seems that FCD better reflects the actual functional impairment. In future studies, FCD could be used alongside ICD and ACD.
The coefficient of determination R2 and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded.

Science.gov (United States)

Nakagawa, Shinichi; Johnson, Paul C D; Schielzeth, Holger

2017-09-01

The coefficient of determination R 2 quantifies the proportion of variance explained by a statistical model and is an important summary statistic of biological interest. However, estimating R 2 for generalized linear mixed models (GLMMs) remains challenging. We have previously introduced a version of R 2 that we called [Formula: see text] for Poisson and binomial GLMMs, but not for other distributional families. Similarly, we earlier discussed how to estimate intra-class correlation coefficients (ICCs) using Poisson and binomial GLMMs. In this paper, we generalize our methods to all other non-Gaussian distributions, in particular to negative binomial and gamma distributions that are commonly used for modelling biological data. While expanding our approach, we highlight two useful concepts for biologists, Jensen's inequality and the delta method, both of which help us in understanding the properties of GLMMs. Jensen's inequality has important implications for biologically meaningful interpretation of GLMMs, whereas the delta method allows a general derivation of variance associated with non-Gaussian distributions. We also discuss some special considerations for binomial GLMMs with binary or proportion data. We illustrate the implementation of our extension by worked examples from the field of ecology and evolution in the R environment. However, our method can be used across disciplines and regardless of statistical environments. © 2017 The Author(s).
The Malay Version of the Perceived Stress Scale (PSS)-10 is a Reliable and Valid Measure for Stress among Nurses in Malaysia.

Science.gov (United States)

Sandhu, Sukhvinder Singh; Ismail, Noor Hassim; Rampal, Krishna Gopal

2015-11-01

The Perceived Stress Scale-10 (PSS-10) is widely used to assess stress perception. The aim of this study was to translate the original PSS-10 into Malay and assess the reliability and validity of the Malay version among nurses. The Malay version of the PSS-10 was distributed among 229 nurses from four government hospitals in Selangor State. Test-retest reliability and concurrent validity was conducted with 25 nurses with the Malay version of the Depression Anxiety Stress Scales (DASS) 21. Cronbach's alpha, confirmatory factor analysis (CFA), intraclass correlation coefficient and Pearson's r correlation coefficient were used to determine the psychometric properties of the Malay PSS-10. Two factor components were yielded through exploratory factor analysis with eigenvalues of 3.37 and 2.10, respectively. Both of the factors accounted for 54.6% of the variance. CFA yielded a two-factor structure with satisfactory goodness-of-fit indices [x 2 /df = 2.43; comparative fit index (CFI) = 0.92, goodness-of-fit Index (GFI) = 0.94; standardised root mean square residual (SRMR) = 0.07 and root mean square error of approximation (RMSEA) = 0.08 (90% CI = 0.07-0.09)]. The Cronbach's alpha coefficient for the total items was 0.63 (0.82 for factor 1 and 0.72 for factor 2). The intraclass correlation coefficient (ICC) was 0.81 (95% CI: 0.62-0.91) for test-retest reliability testing after seven days. The total score and the negative component of the PSS-10 correlated significantly with the stress component of the DASS-21: (r = 0.61, P < 0.001) and (r = 0.56, P < 0.004), respectively. The Malay version of the PSS-10 demonstrated a satisfactory level of validity and reliability to assess stress perception. Therefore, this questionnaire is valid in assessing stress perception among nurses in Malaysia.
Reliability, Validity, and Responsiveness of InFLUenza Patient-Reported Outcome (FLU-PRO©) Scores in Influenza-Positive Patients.

Science.gov (United States)

Powers, John H; Bacci, Elizabeth D; Guerrero, M Lourdes; Leidy, Nancy Kline; Stringer, Sonja; Kim, Katherine; Memoli, Matthew J; Han, Alison; Fairchok, Mary P; Chen, Wei-Ju; Arnold, John C; Danaher, Patrick J; Lalani, Tahaniyat; Ridoré, Michelande; Burgess, Timothy H; Millar, Eugene V; Hernández, Andrés; Rodríguez-Zulueta, Patricia; Smolskis, Mary C; Ortega-Gallegos, Hilda; Pett, Sarah; Fischer, William; Gillor, Daniel; Macias, Laura Moreno; DuVal, Anna; Rothman, Richard; Dugas, Andrea; Ruiz-Palacios, Guillermo M

2018-02-01

To assess the reliability, validity, and responsiveness of InFLUenza Patient-Reported Outcome (FLU-PRO©) scores for quantifying the presence and severity of influenza symptoms. An observational prospective cohort study of adults (≥18 years) with influenza-like illness in the United States, the United Kingdom, Mexico, and South America was conducted. Participants completed the 37-item draft FLU-PRO daily for up to 14 days. Item-level and factor analyses were used to remove items and determine factor structure. Reliability of the final tool was estimated using Cronbach α and intraclass correlation coefficients (2-day reliability). Convergent and known-groups validity and responsiveness were assessed using global assessments of influenza severity and return to usual health. Of the 536 patients enrolled, 221 influenza-positive subjects comprised the analytical sample. The mean age of the patients was 40.7 years, 60.2% were women, and 59.7% were white. The final 32-item measure has six factors/domains (nose, throat, eyes, chest/respiratory, gastrointestinal, and body/systemic), with a higher order factor representing symptom severity overall (comparative fit index = 0.92; root mean square error of approximation = 0.06). Cronbach α was high (total = 0.92; domain range = 0.71-0.87); test-retest reliability (intraclass correlation coefficient, day 1-day 2) was 0.83 for total scores and 0.57 to 0.79 for domains. Day 1 FLU-PRO domain and total scores were moderately to highly correlated (≥0.30) with Patient Global Rating of Flu Severity (except nose and throat). Consistent with known-groups validity, scores differentiated severity groups on the basis of global rating (total: F = 57.2, P FLU-PRO score improvement by day 7 than did those who did not, suggesting score responsiveness. Results suggest that FLU-PRO scores are reliable, valid, and responsive to change in influenza-positive adults. Copyright © 2018 International Society for Pharmacoeconomics and Outcomes

Interrater reliability of videotaped observational gait-analysis assessments.

Science.gov (United States)

Eastlack, M E; Arvidson, J; Snyder-Mackler, L; Danoff, J V; McGarvey, C L

1991-06-01

The purpose of this study was to determine the interrater reliability of videotaped observational gait-analysis (VOGA) assessments. Fifty-four licensed physical therapists with varying amounts of clinical experience served as raters. Three patients with rheumatoid arthritis who demonstrated an abnormal gait pattern served as subjects for the videotape. The raters analyzed each patient's most severely involved knee during the four subphases of stance for the kinematic variables of knee flexion and genu valgum. Raters were asked to determine whether these variables were inadequate, normal, or excessive. The temporospatial variables analyzed throughout the entire gait cycle were cadence, step length, stride length, stance time, and step width. Generalized kappa coefficients ranged from .11 to .52. Intraclass correlation coefficients (2,1) and (3,1) were slightly higher. Our results indicate that physical therapists' VOGA assessments are only slightly to moderately reliable and that improved interrater reliability of the assessments of physical therapists utilizing this technique is needed. Our data suggest that there is a need for greater standardization of gait-analysis training.
Test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy.

Science.gov (United States)

Savva, Christos; Giakas, Giannis; Efstathiou, Michalis; Karagiannis, Christos

2014-01-01

The purpose of this study was to evaluate the test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy (CR). A convenience sample of 19 participants (14 men and 5 women; mean ± SD age, 50.5 ± 12 years) with CR was measured using a Jamar hydraulic hand dynamometer by the same rater on 2 different testing sessions with an interval of 7 days between sessions. Data collection procedures followed standardized grip strength testing guidelines established by the American Society of Hand Therapists. During the repeated measures, patients were advised to rest their upper limb in the standardized arm position and encouraged to exert 3 maximum gripping efforts. The mean value of the 3 efforts (measured in kilogram force [Kgf]) was used for data analysis. The intraclass correlation coefficient, SEM, and the Bland-Altman plot were used to estimate test-retest reliability and measurement precision. Grip strength measurement in CR demonstrated an intraclass correlation coefficient of 0.976, suggesting excellent test-retest reliability. The small SEM in both testing sessions (SEM1, 2.41 Kgf; SEM2, 2.51 Kgf) as well as the narrow width of the 95% limits of agreements (95% limits of agreement, -4.9 to 4.4 Kgf) in the Bland-Altman plot reflected precise measurements of grip strength in both occasions. Excellent test-retest reliability for grip strength measurement was measured in patients with CR, demonstrating that a hydraulic hand dynamometer could be used as an outcome measure for these patients. Copyright © 2014 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
Inter-rater reliability of measures to characterize the tobacco retail environment in Mexico

Directory of Open Access Journals (Sweden)

Marissa G Hall

2015-11-01

Full Text Available Objective. To evaluate the inter-rater reliability of a data collection instrument to assess the tobacco retail environ- ment in Mexico, after major marketing regulations were implemented. Materials and methods. In 2013, two data collectors independently evaluated 21 stores in two census tracts, through a data collection instrument that assessed the presence of price promotions, whether single cigarettes were sold, the number of visible advertisements, the pre- sence of signage prohibiting the sale of cigarettes to minors, and characteristics of cigarette pack displays. We evaluated the inter-rater reliability of the collected data, through the calculation of metrics such as intraclass correlation coefficient, percent agreement, Cohen’s kappa and Krippendorff’s alpha. Results. Most measures demonstrated substantial or perfect inter-rater reliability. Conclusions. Our results indicate the potential utility of the data collection instrument for future point-of-sale research.
Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis.

Science.gov (United States)

Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

2017-01-18

To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.
Validity and reliability of the novel thyroid-specific quality of life questionnaire, ThyPRO

DEFF Research Database (Denmark)

Watt, Torquil; Hegedüs, Laszlo; Groenvold, Mogens

2010-01-01

Background Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated. Aim To investigate clinical ('known-groups') validity and test-retest reliability of the Danish version of the ThyPRO. Methods For each of the 13 ThyPRO scales, we defined groups expected to have high versus low scores ('known-groups'). The clinical validity (known......-groups validity) was evaluated by whether the ThyPRO scales could detect expected differences in a cross-sectional study of 907 thyroid patients. Test-retest reliability was evaluated by intra-class correlations of two responses to the ThyPRO 2 weeks apart in a subsample of 87 stable patients. Results On all 13...
Validation and reliability of a Behcet’s Syndrome Activity Scale in Korea

Science.gov (United States)

Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo

2016-01-01

Background/Aims: We prepared a cross-cultural adaptation of the Behcet’s Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Methods: Fifty patients with Behcet’s disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet’s Disease Current Activity Form (BDCAF) and a Behcet’s Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Results: Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). Conclusions: The Korean version of BSAS is a reliable and valid instrument to measure BD activity. PMID:26767871
Reliability and Validity of the Multidimensional Scale of Perceived Social Support (MSPSS): Thai Version.

Science.gov (United States)

Wongpakaran, Tinakon; Wongpakaran, Nahathai; Ruktrakul, Ruk

2011-01-01

This study examines the Thai version of the Multidimensional Scale of Perceived Social Support (MSPSS) for its psychometric properties. In total 462 participants were recruited - 310 medical students from Chiang Mai University and 152 psychiatric patients, and they completed the Thai version of the MSPSS, the State Trait Anxiety Inventory (STAI), the Rosenberg Self-Esteem Scale (RSES) and the Thai Depression Inventory (TDI). Test-retest reliability was conducted over a four week period. Factor analysis produced three-factor solutions for both patient (PG) and student groups (SG), and overall the model demonstrated adequate fit indices. The mean total score and the sub-scale score for the SG were statistically higher than those in the PG, except for 'Significant Others'. The internal consistency of the scale was good, with a Cronbach's alpha of 0.91 for the SG and 0.87 for the PG. After a four week retest for reliability exercise, the intra-class correlation coefficient (ICC) was found to be 0.84. The Thai-MSPSS was found to have a negative correlation with the STAI and the TDI, but was positively correlated with the RSES. The Thai MSPSS is a reliable and valid instrument to use.
Validity and reliability of a Nigerian-Yoruba version of the stroke-specific quality of life scale 2.0.

Science.gov (United States)

Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana

2017-10-19

Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) was administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was carried out among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p validity of SS-QoL(Y) showed that items' r value ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha =0.61 to 0.82) for domains scores. These correlations were also significant for the domains and overall scores (p validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity

Directory of Open Access Journals (Sweden)

Adebisi Bisi-Balogun

2016-04-01

Full Text Available This study aimed to determine the relative and absolute reliability of ultrasound (US measurements of the thickness and echogenicity of the plantar fascia (PF at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity.

Science.gov (United States)

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-04-13

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.
Education Research: Bias and poor interrater reliability in evaluating the neurology clinical skills examination

Science.gov (United States)

Schuh, L A.; London, Z; Neel, R; Brock, C; Kissela, B M.; Schultz, L; Gelb, D J.

2009-01-01

Objective: The American Board of Psychiatry and Neurology (ABPN) has recently replaced the traditional, centralized oral examination with the locally administered Neurology Clinical Skills Examination (NEX). The ABPN postulated the experience with the NEX would be similar to the Mini-Clinical Evaluation Exercise, a reliable and valid assessment tool. The reliability and validity of the NEX has not been established. Methods: NEX encounters were videotaped at 4 neurology programs. Local faculty and ABPN examiners graded the encounters using 2 different evaluation forms: an ABPN form and one with a contracted rating scale. Some NEX encounters were purposely failed by residents. Cohen’s kappa and intraclass correlation coefficients (ICC) were calculated for local vs ABPN examiners. Results: Ninety-eight videotaped NEX encounters of 32 residents were evaluated by 20 local faculty evaluators and 18 ABPN examiners. The interrater reliability for a determination of pass vs fail for each encounter was poor (kappa 0.32; 95% confidence interval [CI] = 0.11, 0.53). ICC between local faculty and ABPN examiners for each performance rating on the ABPN NEX form was poor to moderate (ICC range 0.14-0.44), and did not improve with the contracted rating form (ICC range 0.09-0.36). ABPN examiners were more likely than local examiners to fail residents. Conclusions: There is poor interrater reliability between local faculty and American Board of Psychiatry and Neurology examiners. A bias was detected for favorable assessment locally, which is concerning for the validity of the examination. Further study is needed to assess whether training can improve interrater reliability and offset bias. GLOSSARY ABIM = American Board of Internal Medicine; ABPN = American Board of Psychiatry and Neurology; CI = confidence interval; HFH = Henry Ford Hospital; ICC = intraclass correlation coefficients; IM = internal medicine; mini-CEX = Mini-Clinical Evaluation Exercise; NEX = Neurology Clinical
Reliability and validity of the AutoCAD software method in lumbar lordosis measurement.

Science.gov (United States)

Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe

2011-12-01

The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis.
Concurrent validity and reliability of the Alberta Infant Motor Scale in premature infants.

Science.gov (United States)

Almeida, Kênnea Martins; Dutra, Maria Virginia Peixoto; Mello, Rosane Reis de; Reis, Ana Beatriz Rodrigues; Martins, Priscila Silveira

2008-01-01

To verify the concurrent validity and interobserver reliability of the Alberta Infant Motor Scale (AIMS) in premature infants followed-up at the outpatient clinic of Instituto Fernandes Figueira, Fundação Oswaldo Cruz (IFF/Fiocruz), in Rio de Janeiro, Brazil. A total of 88 premature infants were enrolled at the follow-up clinic at IFF/Fiocruz, between February and December of 2006. For the concurrent validity study, 46 infants were assessed at either 6 (n = 26) or 12 (n = 20) months' corrected age using the AIMS and the second edition of the Bayley Scales of Infant Development, by two different observers, and applying Pearson's correlation coefficient to analyze the results. For the reliability study, 42 infants between 0 and 18 months were assessed using the Alberta Infant Motor Scale, by two different observers and the results analyzed using the intraclass correlation coefficient. The concurrent validity study found a high level of correlation between the two scales (r = 0.95) and one that was statistically significant (p system.
Inter-arch digital model vs. manual cast measurements: Accuracy and reliability.

Science.gov (United States)

Kiviahde, Heikki; Bukovac, Lea; Jussila, Päivi; Pesonen, Paula; Sipilä, Kirsi; Raustia, Aune; Pirttiniemi, Pertti

2017-06-28

The purpose of this study was to evaluate the accuracy and reliability of inter-arch measurements using digital dental models and conventional dental casts. Thirty sets of dental casts with permanent dentition were examined. Manual measurements were done with a digital caliper directly on the dental casts, and digital measurements were made on 3D models by two independent examiners. Intra-class correlation coefficients (ICC), a paired sample t-test or Wilcoxon signed-rank test, and Bland-Altman plots were used to evaluate intra- and inter-examiner error and to determine the accuracy and reliability of the measurements. The ICC values were generally good for manual and excellent for digital measurements. The Bland-Altman plots of all the measurements showed good agreement between the manual and digital methods and excellent inter-examiner agreement using the digital method. Inter-arch occlusal measurements on digital models are accurate and reliable and are superior to manual measurements.
Content Validity and Reliability of Multiple Intelligences Developmental Assessment Scales (MIDAS Translated into Persian

Directory of Open Access Journals (Sweden)

Mahnaz Saeidi

2012-11-01

Full Text Available This study aimed to translate MIDAS questionnaire from English into Persian and determine its content validity and reliability. MIDAS was translated and validated on a sample (N = 110 of Iranian adult population. The participants were both male and female with the age range of 17-57. They were at different educational levels and from different ethnic groups in Iran. A translating team, consisting of five members, bilingual in English and Persian and familiar with multiple intelligences (MI theory and practice, were involved in translating and determining content validity, which included the processes of forward translation, back-translation, review, final proof-reading, and testing. The statistical analyses of inter-scale correlation were performed using the Cronbach's alpha coefficient. In an intra-class correlation, the Cronbach's alpha was high for all of the questions. Translation and content validity of MIDAS questionnaire was completed by a proper process leading to high reliability and validity. The results suggest that Persian MIDAS (P-MIDAS could serve as a valid and reliable instrument for measuring Iranian adults MIs.
Accuracy and correlates of maternal recall of birthweight and gestational age

DEFF Research Database (Denmark)

Adegboye, Amanda Rodrigues Amorim; Heitmann, B.

2008-01-01

OBJECTIVE: To determine the accuracy of maternal recall of children birthweight (BW) and gestational age (GA), using the Danish Medical Birth Register (DBR) as reference and to examine the reliability of recalled BW and its potential correlates. DESIGN: Comparison of data from the DBR...... and the European Youth Heart Study (EYHS). SETTING: Schools in Odense, Denmark. POPULATION: A total of 1271 and 678 mothers of school children participated with information in the accuracy studies of BW and GA, respectively. The reliability sample of BW was composed of 359 women. METHOD: The agreement between...... the two sources was evaluated by mean differences (MD), intraclass correlation coefficient (ICC) and Bland-Altman's plots. The misclassification of the various BW and GA categories were also estimated. MAIN OUTCOME MEASURES: Differences between recalled and registered BW and GA. RESULTS: There was high...
Reliability analysis of a sensitive and independent stabilometry parameter set.

Science.gov (United States)

Nagymáté, Gergely; Orlovits, Zsanett; Kiss, Rita M

2018-01-01

Recent studies have suggested reduced independent and sensitive parameter sets for stabilometry measurements based on correlation and variance analyses. However, the reliability of these recommended parameter sets has not been studied in the literature or not in every stance type used in stabilometry assessments, for example, single leg stances. The goal of this study is to evaluate the test-retest reliability of different time-based and frequency-based parameters that are calculated from the center of pressure (CoP) during bipedal and single leg stance for 30- and 60-second measurement intervals. Thirty healthy subjects performed repeated standing trials in a bipedal stance with eyes open and eyes closed conditions and in a single leg stance with eyes open for 60 seconds. A force distribution measuring plate was used to record the CoP. The reliability of the CoP parameters was characterized by using the intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimal detectable change (MDC), coefficient of variation (CV) and CV compliance rate (CVCR). Based on the ICC, SEM and MDC results, many parameters yielded fair to good reliability values, while the CoP path length yielded the highest reliability (smallest ICC > 0.67 (0.54-0.79), largest SEM% = 19.2%). Usually, frequency type parameters and extreme value parameters yielded poor reliability values. There were differences in the reliability of the maximum CoP velocity (better with 30 seconds) and mean power frequency (better with 60 seconds) parameters between the different sampling intervals.
Validity and Reliability of the 8-Item Work Limitations Questionnaire.

Science.gov (United States)

Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C

2017-12-01

Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Reliability analysis of a sensitive and independent stabilometry parameter set

Science.gov (United States)

Nagymáté, Gergely; Orlovits, Zsanett

2018-01-01

Recent studies have suggested reduced independent and sensitive parameter sets for stabilometry measurements based on correlation and variance analyses. However, the reliability of these recommended parameter sets has not been studied in the literature or not in every stance type used in stabilometry assessments, for example, single leg stances. The goal of this study is to evaluate the test-retest reliability of different time-based and frequency-based parameters that are calculated from the center of pressure (CoP) during bipedal and single leg stance for 30- and 60-second measurement intervals. Thirty healthy subjects performed repeated standing trials in a bipedal stance with eyes open and eyes closed conditions and in a single leg stance with eyes open for 60 seconds. A force distribution measuring plate was used to record the CoP. The reliability of the CoP parameters was characterized by using the intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimal detectable change (MDC), coefficient of variation (CV) and CV compliance rate (CVCR). Based on the ICC, SEM and MDC results, many parameters yielded fair to good reliability values, while the CoP path length yielded the highest reliability (smallest ICC > 0.67 (0.54–0.79), largest SEM% = 19.2%). Usually, frequency type parameters and extreme value parameters yielded poor reliability values. There were differences in the reliability of the maximum CoP velocity (better with 30 seconds) and mean power frequency (better with 60 seconds) parameters between the different sampling intervals. PMID:29664938
Scoring sacroiliac joints by magnetic resonance imaging. A Multiple-reader reliability experiment

DEFF Research Database (Denmark)

Landewé, RB; Hermann, KG; van der Heijde, DM

2005-01-01

Magnetic resonance imaging (MRI) of the sacroiliac (SI) joints and the spine is increasingly important in the assessment of inflammatory activity and structural damage in clinical trials with patients with ankylosing spondylitis (AS). We investigated inter-reader reliability and sensitivity...... for 'depth' and 'intensity,' and the fifth method included the SPARCC slice with the maximum score. Inter-reader reliability was investigated by calculating intraclass correlation coefficients (ICC) for all readers together and for all possible reader pairs. Sensitivity to change was investigated...... values close to zero (no agreement) and highest observed values over 0.80 (excellent agreement). In general, agreement of status scores was somewhat better than agreement of change scores, and agreement of the comprehensive SPARCC scoring system was somewhat better than agreement of the more condensed...

Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

Science.gov (United States)

Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

2014-01-01

The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQIntelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4). Thus, intellectual disability could be ruled out or determined. The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests have previously been difficult.
Reliability and validation of the Dutch Achilles tendon Total Rupture Score.

Science.gov (United States)

Opdam, K T M; Zwiers, R; Wiegerinck, J I; Kleipool, A E B; Haverlag, R; Goslings, J C; van Dijk, C N

2018-03-01

Patient-reported outcome measures (PROMs) have become a cornerstone for the evaluation of the effectiveness of treatment. The Achilles tendon Total Rupture Score (ATRS) is a PROM for outcome and assessment of an Achilles tendon rupture. The aim of this study was to translate the ATRS to Dutch and evaluate its reliability and validity in the Dutch population. A forward-backward translation procedure was performed according to the guidelines of cross-cultural adaptation process. The Dutch ATRS was evaluated for reliability and validity in patients treated for a total Achilles tendon rupture from 1 January 2012 to 31 December 2014 in one teaching hospital and one academic hospital. Reliability was assessed by the intraclass correlation coefficients (ICC), Cronbach's alpha and minimal detectable change (MDC). We assessed construct validity by calculation of Spearman's rho correlation coefficient with domains of the Foot and Ankle Outcome Score (FAOS), Victorian Institute of Sports Assessment-Achilles questionnaire (VISA-A) and Numeric Rating Scale (NRS) for pain in rest and during running. The Dutch ATRS had a good test-retest reliability (ICC = 0.852) and a high internal consistency (Cronbach's alpha = 0.96). MDC was 30.2 at individual level and 3.5 at group level. Construct validity was supported by 75 % of the hypothesized correlations. The Dutch ATRS had a strong correlation with NRS for pain during running (r = -0.746) and all the five subscales of the Dutch FAOS (r = 0.724-0.867). There was a moderate correlation with the VISA-A-NL (r = 0.691) and NRS for pain in rest (r = -0.580). The Dutch ATRS shows an adequate reliability and validity and can be used in the Dutch population for measuring the outcome of treatment of a total Achilles tendon rupture and for research purposes. Diagnostic study, Level I.
System reliability with correlated components: Accuracy of the Equivalent Planes method

NARCIS (Netherlands)

Roscoe, K.; Diermanse, F.; Vrouwenvelder, A.C.W.M.

2015-01-01

Computing system reliability when system components are correlated presents a challenge because it usually requires solving multi-fold integrals numerically, which is generally infeasible due to the computational cost. In Dutch flood defense reliability modeling, an efficient method for computing
System reliability with correlated components : Accuracy of the Equivalent Planes method

NARCIS (Netherlands)

Roscoe, K.; Diermanse, F.; Vrouwenvelder, T.

2015-01-01

Computing system reliability when system components are correlated presents a challenge because it usually requires solving multi-fold integrals numerically, which is generally infeasible due to the computational cost. In Dutch flood defense reliability modeling, an efficient method for computing
Design and reliability of a didactic inphographic rubric assessment

Directory of Open Access Journals (Sweden)

Yunuen Ixchel GUZMÁN-CEDILLO

2017-12-01

Full Text Available The objective of this study is to describe design, validity process and reliability of a rubric assessment to evaluate didactic infographics quality. Participants were fifteen judges who participate in different moments of elaboration rubric process; it was made in three process phases: design, settings and reliability determination. Content validity was obtained by percentage agreement between 3 judges by component of the rubric; likewise a Krippendorff’s alpha were applied (a = .710 in pilot assessment with 5 infographics in order to set possible writings contradictions between components and criteria of performance. The intern consistence was determined by Cronbach’s alpha (? = .806 in 22 infographics gradation. An Intraclass correlation coefficient icc (a = .909 was applied to 6 judges qualifications also a Krippendorff’s alpha (a = .538 both of them in ordinal levels. The rubric is composed by 9 components, 3 performance levels, definitions of each component and assignments how to use the rubric. Results suggest the rubric is valid and reliable to grade quality of didactic infographic.
Reliability of force-velocity relationships during deadlift high pull.

Science.gov (United States)

Lu, Wei; Boyas, Sébastien; Jubeau, Marc; Rahmani, Abderrahmane

2017-11-13

This study aimed to evaluate the within- and between-session reliability of force, velocity and power performances and to assess the force-velocity relationship during the deadlift high pull (DHP). Nine participants performed two identical sessions of DHP with loads ranging from 30 to 70% of body mass. The force was measured by a force plate under the participants' feet. The velocity of the 'body + lifted mass' system was calculated by integrating the acceleration and the power was calculated as the product of force and velocity. The force-velocity relationships were obtained from linear regression of both mean and peak values of force and velocity. The within- and between-session reliability was evaluated by using coefficients of variation (CV) and intraclass correlation coefficients (ICC). Results showed that DHP force-velocity relationships were significantly linear (R² > 0.90, p 0.94), mean and peak velocities showed a good agreement (CV reliable and can therefore be utilised as a tool to characterise individuals' muscular profiles.
Development and reliability of a structured interview guide for the Montgomery Asberg Depression Rating Scale (SIGMA).

Science.gov (United States)

Williams, Janet B W; Kobak, Kenneth A

2008-01-01

The Montgomery-Asberg Depression Rating Scale (MADRS) is often used in clinical trials to select patients and to assess treatment efficacy. The scale was originally published without suggested questions for clinicians to use in gathering the information necessary to rate the items. Structured and semi-structured interview guides have been found to improve reliability with other scales. To describe the development and test-retest reliability of a structured interview guide for the MADRS (SIGMA). A total of 162 test-retest interviews were conducted by 81 rater pairs. Each patient was interviewed twice, once by each rater conducting an independent interview. The intraclass correlation for total score between raters using the SIGMA was r=0.93, Preliability. Use of the SIGMA can result in high reliability of MADRS scores in evaluating patients with depression.
Evaluation of Factorial Validity and Reliability of a Food Behavior Checklist for Low-Income Filipinos.

Science.gov (United States)

Suzuki, Asuka; Choi, So Yung; Lim, Eunjung; Tauyan, Socorro; Banna, Jinan C

To examine factorial validity, test-retest reliability, and internal consistency of a Tagalog-language food behavior checklist (FBC) for a low-income Filipino population. Participants (n = 160) completed the FBC on 2 occasions 3 weeks apart. Factor structure was examined using principal component analysis. For internal consistency, Cronbach α was calculated. For test-retest reliability, Spearman correlation or intraclass correlation coefficient (ICC) was calculated between scores at the 2 points. All but 1 item loaded on 6 factors: fruit and vegetable quantity, fruit and vegetable variety, fast food, sweetened beverage, healthy fat, and diet quality. Cronbach α was .75 for the total scale (range, .39-.76 for subscales). Spearman correlation was 0.78 (ICC, 0.79) for the total scale (range, 0.66-0.80 [ICC, 0.68-0.80] for subscales). The FBC demonstrated adequate factorial validity, test-retest reliability, and internal consistency. With additional testing, the FBC may be used to evaluate the US Department of Agriculture's nutrition education programs for Tagalog speakers. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
The reliability of commonly used electrophysiology measures.

Science.gov (United States)

Brown, K E; Lohse, K R; Mayer, I M S; Strigaro, G; Desikan, M; Casula, E P; Meunier, S; Popa, T; Lamy, J-C; Odish, O; Leavitt, B R; Durr, A; Roos, R A C; Tabrizi, S J; Rothwell, J C; Boyd, L A; Orth, M

Electrophysiological measures can help understand brain function both in healthy individuals and in the context of a disease. Given the amount of information that can be extracted from these measures and their frequent use, it is essential to know more about their inherent reliability. To understand the reliability of electrophysiology measures in healthy individuals. We hypothesized that measures of threshold and latency would be the most reliable and least susceptible to methodological differences between study sites. Somatosensory evoked potentials from 112 control participants; long-latency reflexes, transcranial magnetic stimulation with resting and active motor thresholds, motor evoked potential latencies, input/output curves, and short-latency sensory afferent inhibition and facilitation from 84 controls were collected at 3 visits over 24 months at 4 Track-On HD study sites. Reliability was assessed using intra-class correlation coefficients for absolute agreement, and the effects of reliability on statistical power are demonstrated for different sample sizes and study designs. Measures quantifying latencies, thresholds, and evoked responses at high stimulator intensities had the highest reliability, and required the smallest sample sizes to adequately power a study. Very few between-site differences were detected. Reliability and susceptibility to between-site differences should be evaluated for electrophysiological measures before including them in study designs. Levels of reliability vary substantially across electrophysiological measures, though there are few between-site differences. To address this, reliability should be used in conjunction with theoretical calculations to inform sample size and ensure studies are adequately powered to detect true change in measures of interest. Copyright © 2017 Elsevier Inc. All rights reserved.
Reliability, validity, and minimal detectable change of the push-off test scores in assessing upper extremity weight-bearing ability.

Science.gov (United States)

Mehta, Saurabh P; George, Hannah R; Goering, Christian A; Shafer, Danielle R; Koester, Alan; Novotny, Steven

2017-11-01

Clinical measurement study. The push-off test (POT) was recently conceived and found to be reliable and valid for assessing weight bearing through injured wrist or elbow. However, further research with larger sample can lend credence to the preliminary findings supporting the use of the POT. This study examined the interrater reliability, construct validity, and measurement error for the POT in patients with wrist conditions. Participants with musculoskeletal (MSK) wrist conditions were recruited. The performance on the POT, grip isometric strength of wrist extensors was assessed. The shortened version of the Disabilities of the Arm, Shoulder and Hand and numeric pain rating scale were completed. The intraclass correlation coefficient assessed interrater reliability of the POT. Pearson correlation coefficients (r) examined the concurrent relationships between the POT and other measures. The standard error of measurement and the minimal detectable change at 90% confidence interval were assessed as measurement error and index of true change for the POT. A total of 50 participants with different elbow or wrist conditions (age: 48.1 ± 16.6 years) were included in this study. The results of this study strongly supported the interrater reliability (intraclass correlation coefficient: 0.96 and 0.93 for the affected and unaffected sides, respectively) of the POT in patients with wrist MSK conditions. The POT showed convergent relationships with the grip strength on the injured side (r = 0.89) and the wrist extensor strength (r = 0.7). The POT showed smaller standard error of measurement (1.9 kg). The minimal detectable change at 90% confidence interval for the POT was 4.4 kg for the sample. This study provides additional evidence to support the reliability and validity of the POT. This is the first study that provides the values for the measurement error and true change on the POT scores in patients with wrist MSK conditions. Further research should examine the
A clinical assessment tool used for physiotherapy students--is it reliable?

Science.gov (United States)

Lewis, Lucy K; Stiller, Kathy; Hardy, Frances

2008-01-01

Educational institutions providing professional programs such as physiotherapy must provide high-quality student assessment procedures. To ensure that assessment is consistent, assessment tools should have an acceptable level of reliability. There is a paucity of research evaluating the reliability of clinical assessment tools used for physiotherapy students. This study evaluated the inter- and intrarater reliability of an assessment tool used for physiotherapy students during a clinical placement. Five clinical educators and one academic participated in the study. Each rater independently marked 22 student written assessments that had been completed by students after viewing a videotaped patient physiotherapy assessment. The raters repeated the marking process 7 weeks later, with the assessments provided in a randomised order. The interrater reliability (Intraclass Correlation Coefficient) for the total scores was 0.32, representing a poor level of reliability. A high level of intrarater reliability (percentage agreement) was found for the clinical educators, with a difference in section scores of one mark or less on 93.4% of occasions. Further research should be undertaken to reevaluate the reliability of this clinical assessment tool following training. The reliability of clinical assessment tools used in other areas of physiotherapy education should be formally measured rather than assumed.
Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

Science.gov (United States)

Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

2015-01-01

Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
The relationship between multilevel models and non-parametric multilevel mixture models: Discrete approximation of intraclass correlation, random coefficient distributions, and residual heteroscedasticity.

Science.gov (United States)

Rights, Jason D; Sterba, Sonya K

2016-11-01

Multilevel data structures are common in the social sciences. Often, such nested data are analysed with multilevel models (MLMs) in which heterogeneity between clusters is modelled by continuously distributed random intercepts and/or slopes. Alternatively, the non-parametric multilevel regression mixture model (NPMM) can accommodate the same nested data structures through discrete latent class variation. The purpose of this article is to delineate analytic relationships between NPMM and MLM parameters that are useful for understanding the indirect interpretation of the NPMM as a non-parametric approximation of the MLM, with relaxed distributional assumptions. We define how seven standard and non-standard MLM specifications can be indirectly approximated by particular NPMM specifications. We provide formulas showing how the NPMM can serve as an approximation of the MLM in terms of intraclass correlation, random coefficient means and (co)variances, heteroscedasticity of residuals at level 1, and heteroscedasticity of residuals at level 2. Further, we discuss how these relationships can be useful in practice. The specific relationships are illustrated with simulated graphical demonstrations, and direct and indirect interpretations of NPMM classes are contrasted. We provide an R function to aid in implementing and visualizing an indirect interpretation of NPMM classes. An empirical example is presented and future directions are discussed. © 2016 The British Psychological Society.
Affective traits link to reliable neural markers of incentive anticipation.

Science.gov (United States)

Wu, Charlene C; Samanez-Larkin, Gregory R; Katovich, Kiefer; Knutson, Brian

2014-01-01

While theorists have speculated that different affective traits are linked to reliable brain activity during anticipation of gains and losses, few have directly tested this prediction. We examined these associations in a community sample of healthy human adults (n=52) as they played a Monetary Incentive Delay task while undergoing functional magnetic resonance imaging (FMRI). Factor analysis of personality measures revealed that subjects independently varied in trait Positive Arousal and trait Negative Arousal. In a subsample (n=14) retested over 2.5years later, left nucleus accumbens (NAcc) activity during anticipation of large gains (+$5.00) and right anterior insula activity during anticipation of large losses (-$5.00) showed significant test-retest reliability (intraclass correlations>0.50, p'santicipation of large gains, while trait Negative Arousal correlated with individual differences in right anterior insula activity during anticipation of large losses. Associations of affective traits with neural activity were not attributable to the influence of other potential confounds (including sex, age, wealth, and motion). Together, these results demonstrate selective links between distinct affective traits and reliably-elicited activity in neural circuits associated with anticipation of gain versus loss. The findings thus reveal neural markers for affective dimensions of healthy personality, and potentially for related psychiatric symptoms. © 2013. Published by Elsevier Inc. All rights reserved.
Validity and reliability of the Portuguese-Brazilian version of the Quality of Life in Epilepsy Inventory-89.

Science.gov (United States)

Azevedo, Auro Mauro; Alonso, Neide Barreira; Vidal-Dourado, Marcos; Noffs, Maria Helena da Silva; Pascalicchio, Tatiana Frascarelli; Caboclo, Luís Otávio Sales Ferreira; Ciconelli, Rozana Mesquita; Sakamoto, Américo Ceiki; Yacubian, Elza Márcia Targas

2009-03-01

The purpose of this article was to report the translation of the Quality of Life in Epilepsy Inventory-89 (QOLIE-89) into a Portuguese-Brazilian version and evaluate its reliability and validity. This study involved 105 outpatients: 54 patients with refractory temporal lobe epilepsy (TLE) with mesial temporal sclerosis (MTS) and 51 with juvenile myoclonic epilepsy (JME). Reliability and test-retest reliability were assessed. Relationships between QOLIE-89 domains and other questionnaires (Nottingham Health Profile, Beck Depression Inventory, Adverse Event Profile, Neuropsychological Evaluation), and external measures such as demographic and clinical variables were analyzed to examine construct validity. Internal consistency (Cronbach's alpha=0.73-0.92) and test-retest reliability (intraclass correlation coefficient=0.60-0.84) for individual domains were acceptable. For construct validity, we verified high correlations between the QOLIE-89 and the Nottingham Health Profile, Beck Depression Inventory, Adverse Event Profile, and Neuropsychological Evaluation. For clinical characteristics, the patients with juvenile myoclonic epilepsy had better quality-of-life scores on 11 of 17 QOLIE-89 subscales compared with patients with temporal lobe epilepsy (P<0.05). These results support the reliability and validity of the Portuguese-Brazilian translation of QOLIE-89.
Cross-cultural adaptation, reliability and validity of the Turkish version of the Hospital for Special Surgery (HSS) Knee Score.

Science.gov (United States)

Narin, Selnur; Unver, Bayram; Bakırhan, Serkan; Bozan, Ozgür; Karatosun, Vasfi

2014-01-01

The purpose of this study was to adapt the English version of the Hospital for Special Surgery (HSS) knee score for use in a Turkish population and to evaluate its validity, reliability and cultural adaptation. Standard forward-back translation of the HSS knee score was performed and the Turkish version was applied in 73 patients. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Mini-Mental State Examination and sit-to-stand test were also performed and analyzed. Internal consistency reliability was tested using Cronbach's alpha. The intraclass correlation coefficient (ICC) was used to calculate the test-retest reliability at one-week intervals. Validity was assessed by calculating the Pearson correlation between the HSS, WOMAC and sit-to-stand test scores. The ICC ranged from 0.98 to 0.99 with high internal consistency (Cronbach's alpha: 0.87). The WOMAC score correlated with total HSS score (r: -0.80, p<0.001) and sit-to-stand score (r: 0.12, p: 0.312). The Turkish version of the HSS knee score is reliable and valid in evaluating the total knee arthroplasty in Turkish patients.
Reliability and Validity of Digital Imagery Methodology for Measuring Starting Portions and Plate Waste from School Salad Bars.

Science.gov (United States)

Bean, Melanie K; Raynor, Hollie A; Thornton, Laura M; Sova, Alexandra; Dunne Stewart, Mary; Mazzeo, Suzanne E

2018-04-12

Scientifically sound methods for investigating dietary consumption patterns from self-serve salad bars are needed to inform school policies and programs. To examine the reliability and validity of digital imagery for determining starting portions and plate waste of self-serve salad bar vegetables (which have variable starting portions) compared with manual weights. In a laboratory setting, 30 mock salads with 73 vegetables were made, and consumption was simulated. Each component (initial and removed portion) was weighed; photographs of weighed reference portions and pre- and post-consumption mock salads were taken. Seven trained independent raters visually assessed images to estimate starting portions to the nearest ¼ cup and percentage consumed in 20% increments. These values were converted to grams for comparison with weighed values. Intraclass correlations between weighed and digital imagery-assessed portions and plate waste were used to assess interrater reliability and validity. Pearson's correlations between weights and digital imagery assessments were also examined. Paired samples t tests were used to evaluate mean differences (in grams) between digital imagery-assessed portions and measured weights. Interrater reliabilities were excellent for starting portions and plate waste with digital imagery. For accuracy, intraclass correlations were moderate, with lower accuracy for determining starting portions of leafy greens compared with other vegetables. However, accuracy of digital imagery-assessed plate waste was excellent. Digital imagery assessments were not significantly different from measured weights for estimating overall vegetable starting portions or waste; however, digital imagery assessments slightly underestimated starting portions (by 3.5 g) and waste (by 2.1 g) of leafy greens. This investigation provides preliminary support for use of digital imagery in estimating starting portions and plate waste from school salad bars. Results might inform
Reliability and responsiveness of dynamic contrast-enhanced magnetic resonance imaging in rheumatoid arthritis

DEFF Research Database (Denmark)

Axelsen, M.B.; Poggenborg, R.P.; Stoltenberg, M.

2013-01-01

intraarticular injection with 80 mg methylprednisolone. Using semi-automated image processing software, DCE-MRI parameters, including the initial rate of enhancement (IRE) and maximal enhancement (ME), were generated for three regions of interest (ROIs): ‘Whole slice’, ‘Quick ROI’, and ‘Precise ROI......Objectives: To investigate the responsiveness to treatment and the reliability of dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) in rheumatoid arthritis (RA) knee joints. Methods: DCE-MRI was performed in 12 clinically active RA knee joints before and 1, 7, 30, and 180 days after......’. The smallest detectable difference (SDD), the smallest detectable change (SDC), and intra- and inter-reader intraclass correlation coefficients (ICCs) were used to assess the reliability of DCE-MRI. Responsiveness to treatment was assessed by the standardized response mean (SRM). Results: In all patients...
Computer-assisted radiographic calculation of spinal curvature in brachycephalic "screw-tailed" dog breeds with congenital thoracic vertebral malformations: reliability and clinical evaluation.

Directory of Open Access Journals (Sweden)

Julien Guevar

Full Text Available The objectives of this study were: To investigate computer-assisted digital radiographic measurement of Cobb angles in dogs with congenital thoracic vertebral malformations, to determine its intra- and inter-observer reliability and its association with the presence of neurological deficits. Medical records were reviewed (2009-2013 to identify brachycephalic screw-tailed dog breeds with radiographic studies of the thoracic vertebral column and with at least one vertebral malformation present. Twenty-eight dogs were included in the study. The end vertebrae were defined as the cranial end plate of the vertebra cranial to the malformed vertebra and the caudal end plate of the vertebra caudal to the malformed vertebra. Three observers performed the measurements twice. Intraclass correlation coefficients were used to calculate the intra- and inter-observer reliabilities. The intraclass correlation coefficient was excellent for all intra- and inter-observer measurements using this method. There was a significant difference in the kyphotic Cobb angle between dogs with and without associated neurological deficits. The majority of dogs with neurological deficits had a kyphotic Cobb angle higher than 35°. No significant difference in the scoliotic Cobb angle was observed. We concluded that the computer assisted digital radiographic measurement of the Cobb angle for kyphosis and scoliosis is a valid, reproducible and reliable method to quantify the degree of spinal curvature in brachycephalic screw-tailed dog breeds with congenital thoracic vertebral malformations.
Reliability of the Handgrip Strength Test in Elderly Subjects With Parkinson Disease.

Science.gov (United States)

Villafañe, Jorge H; Valdes, Kristin; Buraschi, Riccardo; Martinelli, Marco; Bissolotti, Luciano; Negrini, Stefano

2016-03-01

The handgrip strength test is widely used by clinicians; however, little has been investigated about its reliability when used in subjects with Parkinson disease (PD). The purpose of this study was to investigate the test-retest reliability of the handgrip strength test for subjects with PD. The PD group consisted of 15 patients, and the control group consisted of 15 healthy subjects. Each patient performed 3 pain-free maximal isometric contractions on each hand on 2 occasions, 1 week apart. Intraclass correlation coefficient (ICC), standard error of measurement (SEM), and 95% limits of agreement (LOA) were calculated. The 2-way analysis of variance (ANOVA) was conducted to determine the differences between sides and groups. Test-retest reliability of measurements of grip strength was excellent for dominant (ICC = 0.97; P = .001) and non-dominant (ICC = 0.98; P = .001) hand of participant with PD and (ICC = 0.99; P = .001) and (ICC = 0.99; P = .001) respectively, of healthy group. The Jamar hand dynamometer had fair to excellent test-retest reliability to test grip strength in participants with PD.

Reliability and Validity of Athletes Disability Index Questionnaire.

Science.gov (United States)

Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin

2018-03-01

The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P visual analog scale was r = 0.626 (P disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
Reliability Worth Analysis of Distribution Systems Using Cascade Correlation Neural Networks

DEFF Research Database (Denmark)

Heidari, Alireza; Agelidis, Vassilios; Pou, Josep

2018-01-01

Reliability worth analysis is of great importance in the area of distribution network planning and operation. The reliability worth's precision can be affected greatly by the customer interruption cost model used. The choice of the cost models can change system and load point reliability indices....... In this study, a cascade correlation neural network is adopted to further develop two cost models comprising a probabilistic distribution model and an average or aggregate model. A contingency-based analytical technique is adopted to conduct the reliability worth analysis. Furthermore, the possible effects...
Reliability of a visual analog scale for determining the preferred mastication side.

Science.gov (United States)

Flores-Orozco, Elan Ignacio; Rovira-Lastra, Bernat; Peraire, Maria; Salsench, Juan; Martinez-Gomis, Jordi

2016-02-01

Although the visual analog scale (VAS) is a simple tool for quantitatively measuring symptom perception, no studies have used the VAS to assess the degree of subjective masticatory laterality. The purpose of this study was to assess the reliability of the VAS for determining the preferred mastication side (PMS) and to compare it with other methods. A cross-sectional study was conducted in which 42 adults with natural dentition performed 2 masticatory sessions. Eight different methods were used to determine the PMS by combining different definitions, food tests, measurements, and number of cycles assessed. A test-retest was performed in 10 participants to evaluate the reliability of each method using the intraclass correlation coefficient. To assess the validity of the different methods, the Pearson correlations were performed (α=.05) between the 8 methods. Self-assessment using the VAS had the highest reliability; it also had a positive and significant relationship with 6 of the 7 other methods. The method that showed the best validity used bagged silicone as the test food, determined the PMS by video recording, and assessed all masticatory cycles using the asymmetry index. Low reliability was found for methods using the location of gum bolus at standardized time intervals or electromyographic recordings. The VAS provided a highly reliable means of assessing the degree of masticatory laterality perceived by the participant, with a positive and significant correlation with the majority of the other methods. Copyright © 2016 Editorial Council for the Journal of Prosthetic Dentistry. Published by Elsevier Inc. All rights reserved.
Supersonic shear imaging provides a reliable measurement of resting muscle shear elastic modulus

International Nuclear Information System (INIS)

Lacourpaille, Lilian; Hug, François; Bouillard, Killian; Nordez, Antoine; Hogrel, Jean-Yves

2012-01-01

The aim of the present study was to assess the reliability of shear elastic modulus measurements performed using supersonic shear imaging (SSI) in nine resting muscles (i.e. gastrocnemius medialis, tibialis anterior, vastus lateralis, rectus femoris, triceps brachii, biceps brachii, brachioradialis, adductor pollicis obliquus and abductor digiti minimi) of different architectures and typologies. Thirty healthy subjects were randomly assigned to the intra-session reliability (n = 20), inter-day reliability (n = 21) and the inter-observer reliability (n = 16) experiments. Muscle shear elastic modulus ranged from 2.99 (gastrocnemius medialis) to 4.50 kPa (adductor digiti minimi and tibialis anterior). On the whole, very good reliability was observed, with a coefficient of variation (CV) ranging from 4.6% to 8%, except for the inter-operator reliability of adductor pollicis obliquus (CV = 11.5%). The intraclass correlation coefficients were good (0.871 ± 0.045 for the intra-session reliability, 0.815 ± 0.065 for the inter-day reliability and 0.709 ± 0.141 for the inter-observer reliability). Both the reliability and the ease of use of SSI make it a potentially interesting technique that would be of benefit to fundamental, applied and clinical research projects that need an accurate assessment of muscle mechanical properties. (note)
Inertial Measurement Units for Clinical Movement Analysis: Reliability and Concurrent Validity

Directory of Open Access Journals (Sweden)

Mohammad Al-Amri

2018-02-01

Full Text Available The aim of this study was to investigate the reliability and concurrent validity of a commercially available Xsens MVN BIOMECH inertial-sensor-based motion capture system during clinically relevant functional activities. A clinician with no prior experience of motion capture technologies and an experienced clinical movement scientist each assessed 26 healthy participants within each of two sessions using a camera-based motion capture system and the MVN BIOMECH system. Participants performed overground walking, squatting, and jumping. Sessions were separated by 4 ± 3 days. Reliability was evaluated using intraclass correlation coefficient and standard error of measurement, and validity was evaluated using the coefficient of multiple correlation and the linear fit method. Day-to-day reliability was generally fair-to-excellent in all three planes for hip, knee, and ankle joint angles in all three tasks. Within-day (between-rater reliability was fair-to-excellent in all three planes during walking and squatting, and poor-to-high during jumping. Validity was excellent in the sagittal plane for hip, knee, and ankle joint angles in all three tasks and acceptable in frontal and transverse planes in squat and jump activity across joints. Our results suggest that the MVN BIOMECH system can be used by a clinician to quantify lower-limb joint angles in clinically relevant movements.
Rater reliability and concurrent validity of the Keyboard Personal Computer Style instrument (K-PeCS).

Science.gov (United States)

Baker, Nancy A; Cook, James R; Redfern, Mark S

2009-01-01

This paper describes the inter-rater and intra-rater reliability, and the concurrent validity of an observational instrument, the Keyboard Personal Computer Style instrument (K-PeCS), which assesses stereotypical postures and movements associated with computer keyboard use. Three trained raters independently rated the video clips of 45 computer keyboard users to ascertain inter-rater reliability, and then re-rated a sub-sample of 15 video clips to ascertain intra-rater reliability. Concurrent validity was assessed by comparing the ratings obtained using the K-PeCS to scores developed from a 3D motion analysis system. The overall K-PeCS had excellent reliability [inter-rater: intra-class correlation coefficients (ICC)=.90; intra-rater: ICC=.92]. Most individual items on the K-PeCS had from good to excellent reliability, although six items fell below ICC=.75. Those K-PeCS items that were assessed for concurrent validity compared favorably to the motion analysis data for all but two items. These results suggest that most items on the K-PeCS can be used to reliably document computer keyboarding style.
The Persian Version of the "Life Satisfaction Scale": Construct Validity and Test-Re-Test Reliability among Iranian Older Adults.

Science.gov (United States)

Moghadam, Manije; Salavati, Mahyar; Sahaf, Robab; Rassouli, Maryam; Moghadam, Mojgan; Kamrani, Ahmad Ali Akbari

2018-03-01

After forward-backward translation, the LSS was administered to 334 Persian speaking, cognitively healthy elderly aged 60 years and over recruited through convenience sampling. To analyze the validity of the model's constructs and the relationships between the constructs, a confirmatory factor analysis followed by PLS analysis was performed. The Construct validity was further investigated by calculating the correlations between the LSS and the "Short Form Health Survey" (SF-36) subscales measuring similar and dissimilar constructs. The LSS was re-administered to 50 participants a month later to assess the reliability. For the eight-factor model of the life satisfaction construct, adequate goodness of fit between the hypothesized model and the model derived from the sample data was attained (positive and statistically significant beta coefficients, good R-squares and acceptable GoF). Construct validity was supported by convergent and discriminant validity, and correlations between the LSS and SF-36 subscales. Minimum Intraclass Correlation Coefficient level of 0.60 was exceeded by all subscales. Minimum level of reliability indices (Cronbach's α, composite reliability and indicator reliability) was exceeded by all subscales. The Persian-version of the Life Satisfaction Scale is a reliable and valid instrument, with psychometric properties which are consistent with the original version.
Nerve ultrasound reliability of upper limbs: Effects of examiner training.

Science.gov (United States)

Garcia-Santibanez, Rocio; Dietz, Alexander R; Bucelli, Robert C; Zaidman, Craig M

2018-02-01

Duration of training to reliably measure nerve cross-sectional area with ultrasound is unknown. A retrospective review was performed of ultrasound data, acquired and recorded by 2 examiners-an expert and either a trainee with 2 months (novice) or a trainee with 12 months (experienced) of experience. Data on median, ulnar, and radial nerves were reviewed for 42 patients. Interrater reliability was good and varied most with nerve site but little with experience. Coefficient of variation (CoV) range was 9.33%-22.5%. Intraclass correlation coefficient (ICC) was good to excellent (0.65-95) except ulnar nerve-wrist/forearm and radial nerve-humerus (ICC = 0.39-0.59). Interrater differences did not vary with nerve size or body mass index. Expert-novice and expert-experienced interrater differences and CoV were similar. The ulnar nerve-wrist expert-novice interrater difference decreased with time (r s = -0.68, P = 0.001). A trainee with at least 2 months of experience can reliably measure upper limb nerves. Reliability varies by nerve and location and slightly improves with time. Muscle Nerve 57: 189-192, 2018. © 2017 Wiley Periodicals, Inc.
Intra- and interobserver reliability of glenoid fracture classifications by Ideberg, Euler and AO.

Science.gov (United States)

Gilbert, F; Eden, L; Meffert, R; Konietschke, F; Lotz, J; Bauer, L; Staab, W

2018-03-27

Representing 3%-5% of shoulder girdle injuries scapula fractures are rare. Furthermore, approximately 1% of scapula fractures are intraarticularfractures of the glenoid fossa. Because of uncertain fracture morphology and limited experience, the treatment of glenoid fossa fractures is difficult. The glenoid fracture classification by Ideberg (1984) and Euler (1996) is still commonly used in literature. In 2013 a new glenoid fracture classification was introduced by the AO. The purpose of this study was to examine the new AO classification in clinical practice in comparison with the classifications by Ideberg and Euler. In total CT images of 84 patients with glenoid fossa fractures from 2005 to 2018 were included. Parasagittal, paracoronary and axial reconstructions were examined according to the classifications of Ideberg, Euler and the AO by 3 investigators (orthopedic surgeon, radiologist, student of medicine) at three individual time settings. Inter- and intraobserver reliability of the three classification systems were ascertained by computing Inter- and Intraclass (ICCs) correlation coefficients using Spearman's rank correlation coefficient, 95%-confidence intervals as well as F-tests for correlation coefficients. Inter- and intraobserver reliability for the AO classification showed a perspicuous coherence (R = 0.74 and R = 0.79). Low to moderate intraobserver reliability for Ideberg (R = 0.46) and Euler classification (R = 0.41) was found. Furthermore, data show a low Interobserver reliability for both Ideberg and Euler classification (R reliability using AO is significantly higher than those using Ideberg and Euler (p reliable grading of glenoid fossa fractures with high inter- and intraobserver reliability in 84 patients using CT images. It should possibly be applied in order to enable a valid, reliable and consistent academic description of glenoid fossa fractures. The established classifications by Euler and Ideberg are not capable of
Effect of knee angle on neuromuscular assessment of plantar flexor muscles: A reliability study

Science.gov (United States)

Cornu, Christophe; Jubeau, Marc

2018-01-01

Introduction This study aimed to determine the intra- and inter-session reliability of neuromuscular assessment of plantar flexor (PF) muscles at three knee angles. Methods Twelve young adults were tested for three knee angles (90°, 30° and 0°) and at three time points separated by 1 hour (intra-session) and 7 days (inter-session). Electrical (H reflex, M wave) and mechanical (evoked and maximal voluntary torque, activation level) parameters were measured on the PF muscles. Intraclass correlation coefficients (ICC) and coefficients of variation were calculated to determine intra- and inter-session reliability. Results The mechanical measurements presented excellent (ICC>0.75) intra- and inter-session reliabilities regardless of the knee angle considered. The reliability of electrical measurements was better for the 90° knee angle compared to the 0° and 30° angles. Conclusions Changes in the knee angle may influence the reliability of neuromuscular assessments, which indicates the importance of considering the knee angle to collect consistent outcomes on the PF muscles. PMID:29596480
Validity and Reliability of the Questionnaire for Assessing Women’s Reproductive History in Azar Cohort Study

Directory of Open Access Journals (Sweden)

Mohammad Zakaria Pezeshki

2017-06-01

Full Text Available This study was done to evaluate the validity and reliability of women’s reproductive history questionnaire which will be used in Azar Cohort study; a cohort that is conducted by Tabriz University of Medical Science in Shabestar county for identifying risk factors of no communicable diseases. Content and face validity were evaluated by ten experts in the field and quantified as content validity index (CVI and content validity ratio (CVR. To assess the reliability, using test-retest approach, kappa statistic was calculated for categorical variables and intra-class correlation coefficient (ICC was used for the quantitative items. The calculated CVI and CVR were 0.91and 0.94, respectively. Reliability for all items was high. The ICC was 0.99 and kappa statistic was equal to 1. The final version of questionnaire was redesigned in 26 items with 7 subscales.
Cross-cultural adaptation and determination of the reliability and validity of PRTEE-S (Patientskattad Utvärdering av Tennisarmbåge, a questionnaire for patients with lateral epicondylalgia, in a Swedish population

Directory of Open Access Journals (Sweden)

Baigi Amir

2008-06-01

Full Text Available Abstract Background In Sweden, as well as in Scandinavia, there is no easy way to evaluate patients' difficulties when they suffer from lateral epicondylitis/epicondylalgia. However, there is a Canadian questionnaire, in English, that could make the evaluation of a patient's pain and functional loss both quick and inexpensive. Therefore, the aim of this study was to translate and cross-culturally adapt the questionnaire "Patient-rated Tennis Elbow Evaluation" into Swedish (PRTEE-S; "Patientskattad Utvärdering av Tennisarmbåge", and to evaluate the reliability and validity of the test. Methods The Patient-rated Tennis Elbow Evaluation was cross-culturally adapted for the Swedish language according to well-established guidelines. Fifty-four patients with unilateral epicondylitis/epicondylalgia were assessed using the PRTEE-S (Patientskattad Utvärdering av Tennisarmbåge, the Disabilities of Arm, Shoulder, and Hand questionnaire, and the Roles & Maudsley score to establish the validity and reliability of the PRTEE-S. Reliability was determined via calculation of the intra-class correlation coefficient (ICC the internal consistency was assessed by Cronbach's alpha, and validity was calculated using Spearman's correlation coefficient. Results The test-retest reliability, using the PRTEE-S (Patientskattad Utvärdering av Tennisarmbåge intraclass correlation coefficient, was 0.95 and the internal consistency was 0.94. The PRTEE-S correlated well with the Disabilities of the Arm, Shoulder, and Hand questionnaire (r = 0.88 and the Roles & Maudsley score (r = 0.78. Conclusion The PRTEE-S (Patientskattad Utvärdering av Tennisarmbåge represents a reliable and valid instrument to evaluate the subjective outcome in Swedish speaking patients with lateral epicondylitis/epicondylalgia, and can be used in both research and clinical settings.
Cross-cultural adaptation, validation, and reliability of the Michigan Hand Outcomes Questionnaire among Persian population.

Science.gov (United States)

Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza

2015-01-01

We aimed to validate a cross-culturally adapted version of the Persian Michigan Hand Outcomes Questionnaire (MHOQ). We followed the Beaton's guideline to translate the questionnaire to Persian. We administered the final version to 223 patients among which 79 patients returned 3 days later to respond to the Persian MHOQ for the second time. In the first visit, respondents also filled the Disabilities of the Arm Shoulder and Hand (DASH) and rated the pain based on the Visual Analogue Scale (VAS). Cronbach's alpha for the total MHOQ was 0.79 which showed good internal consistency. Intraclass correlation coefficient (ICC) for the total MHOQ was 0.84 which demonstrated good reliability between test and retest. The absolute correlation coefficient between total MHOQ and the DASH was as high as 0.74. Persian version of the MHOQ proved to be a reliable and valid instrument to be implemented among Persian population with the hand and wrist disorders.
Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

Directory of Open Access Journals (Sweden)

Yota Uno

Full Text Available OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70 was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. RESULTS: The Cronbach's alpha for the new Tanaka B Intelligence Scale IQ (BIQ was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96. In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9, and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4. Thus, intellectual disability could be ruled out or determined. CONCLUSION: The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests have previously been difficult.
Palliative sedation: reliability and validity of sedation scales.

Science.gov (United States)

Arevalo, Jimmy J; Brinkkemper, Tijn; van der Heide, Agnes; Rietjens, Judith A; Ribbe, Miel; Deliens, Luc; Loer, Stephan A; Zuurmond, Wouter W A; Perez, Roberto S G M

2012-11-01

Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. To study the reliability and validity of observer-based sedation scales in palliative sedation. A prospective evaluation of 54 patients under intermittent or continuous sedation with four sedation scales was performed by 52 nurses. Included scales were the Minnesota Sedation Assessment Tool (MSAT), Richmond Agitation-Sedation Scale (RASS), Vancouver Interaction and Calmness Scale (VICS), and a sedation score proposed in the Guideline for Palliative Sedation of the Royal Dutch Medical Association (KNMG). Inter-rater reliability was tested with the intraclass correlation coefficient (ICC) and Cohen's kappa coefficient. Correlations between the scales using Spearman's rho tested concurrent validity. We also examined construct, discriminative, and evaluative validity. In addition, nurses completed a user-friendliness survey. Overall moderate to high inter-rater reliability was found for the VICS interaction subscale (ICC = 0.85), RASS (ICC = 0.73), and KNMG (ICC = 0.71). The largest correlation between scales was found for the RASS and KNMG (rho = 0.836). All scales showed discriminative and evaluative validity, except for the MSAT motor subscale and VICS calmness subscale. Finally, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. The RASS and KNMG scales stand as the most reliable and valid among the evaluated scales. In addition, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. Further research is needed to evaluate the impact of the scales on better symptom control and patient comfort. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
The reliability of eyetracking to assess attentional bias to threatening words in healthy individuals.

Science.gov (United States)

Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H

2017-08-15

Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error ( .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
The Reliability of a Novel Mobile 3-dimensional Wound Measurement Device.

Science.gov (United States)

Anghel, Ersilia L; Kumar, Anagha; Bigham, Thomas E; Maselli, Kathryn M; Steinberg, John S; Evans, Karen K; Kim, Paul J; Attinger, Christopher E

2016-11-01

Objective assessment of wound dimensions is essential for tracking progression and determining treatment effectiveness. A reliability study was designed to establish intrarater and interrater reliability of a novel mobile 3-dimensional wound measurement (3DWM) device. Forty-five wounds were assessed by 2 raters using a 3DWM device to obtain length, width, area, depth, and volume measurements. Wounds were also measured manually, using a disposable ruler and digital planimetry. The intraclass correlation coefficient (ICC) was used to establish intrarater and interrater reliability. High levels of intrarater and interrater agreement were observed for area, length, and width; ICC = 0.998, 0.977, 0.955 and 0.999, 0.997, 0.995, respectively. Moderate levels of intrarater (ICC = 0.888) and interrater (ICC = 0.696) agreement were observed for volume. Lastly, depth yielded an intrarater ICC of 0.360 and an interrater ICC of 0.649. Measures from the 3DWM device were highly correlated with those obtained from scaled photography for length, width, and area (ρ = 0.997, 0.988, 0.997, P device yielded correlations of ρ = 0.990, 0.987, 0.996 with P device was found to be highly reliable for measuring wound areas for a range of wound sizes and types as compared to manual measurement and digital planimetry. The depth and therefore volume measurement using the 3DWM device was found to have a lower ICC, but volume ICC alone was moderate. Overall, this device offers a mobile option for objective wound measurement in the clinical setting.
[Reliability and validity of the PAQ-A questionnaire to assess physical activity in Spanish adolescents].

Science.gov (United States)

Martínez-Gómez, David; Martínez-de-Haro, Vicente; Pozo, Tamara; Welk, Gregory J; Villagra, Ariel; Calle, Marisa E; Marcos, Ascensión; Veiga, Oscar L

2009-01-01

Questionnaires are feasible instruments to assess physical activity (PA) in large samples. The aim of the current study was to evaluate the reliability and validity of the PAQ-A questionnaire in Spanish adolescents using the measurement of PA by accelerometer as criterion. In a sample of 82 adolescents, aged 12 to 17 years, 1-week PAQ-A test-retest was administered. Reliability was analyzed by the Intraclass Correlation Coefficient (ICC) and the internal consistency by the Cronbach's alpha Coefficient. Two hundred thirty-two adolescents, aged 13-17 years, completed the PAQ-A and wore the ActiGraph GT1M accelerometer during 7-days. The PAQ-A was compared against total PA and moderate to vigorous PA (MVPA) obtained by the accelerometer. Test-retest reliability showed ICC = 0.71 for the final score of PAQ-A. Internal consistency was alpha = 0.65 in the first self-report, alpha = 0.67 in the retest in 82 adolescents sample, and alpha = 0.74 in the 232 adolescents sample. The PAQ-A was moderately correlated with total PA (rho = 0.39) and MVPA (rho= 0.34) assessed by the accelerometer. The PAQ-A obtained significantly moderate correlations in boys but not in girls against the accelerometer. The PAQ-A questionnaire shows an adequate reliability and a reasonable validity for assessing PA in Spanish adolescents.
SPSS Macros for Assessing the Reliability and Agreement of Student Evaluations of Teaching

Science.gov (United States)

Morley, Donald D.

2009-01-01

This article reports and demonstrates two SPSS macros for calculating Krippendorff's alpha and intraclass reliability coefficients in repetitive situations where numerous coefficients are needed. Specifically, the reported SPSS macros were used to evaluate the interrater agreement and reliability of student evaluations of teaching in thousands of…
Toward a Common Language for Measuring Patient Mobility in the Hospital: Reliability and Construct Validity of Interprofessional Mobility Measures.

Science.gov (United States)

Hoyer, Erik H; Young, Daniel L; Klein, Lisa M; Kreif, Julie; Shumock, Kara; Hiser, Stephanie; Friedman, Michael; Lavezza, Annette; Jette, Alan; Chan, Kitty S; Needham, Dale M

2018-02-01

The lack of common language among interprofessional inpatient clinical teams is an important barrier to achieving inpatient mobilization. In The Johns Hopkins Hospital, the Activity Measure for Post-Acute Care (AM-PAC) Inpatient Mobility Short Form (IMSF), also called "6-Clicks," and the Johns Hopkins Highest Level of Mobility (JH-HLM) are part of routine clinical practice. The measurement characteristics of these tools when used by both nurses and physical therapists for interprofessional communication or assessment are unknown. The purposes of this study were to evaluate the reliability and minimal detectable change of AM-PAC IMSF and JH-HLM when completed by nurses and physical therapists and to evaluate the construct validity of both measures when used by nurses. A prospective evaluation of a convenience sample was used. The test-retest reliability and the interrater reliability of AM-PAC IMSF and JH-HLM for inpatients in the neuroscience department (n = 118) of an academic medical center were evaluated. Each participant was independently scored twice by a team of 2 nurses and 1 physical therapist; a total of 4 physical therapists and 8 nurses participated in reliability testing. In a separate inpatient study protocol (n = 69), construct validity was evaluated via an assessment of convergent validity with other measures of function (grip strength, Katz Activities of Daily Living Scale, 2-minute walk test, 5-times sit-to-stand test) used by 5 nurses. The test-retest reliability values (intraclass correlation coefficients) for physical therapists and nurses were 0.91 and 0.97, respectively, for AM-PAC IMSF and 0.94 and 0.95, respectively, for JH-HLM. The interrater reliability values (intraclass correlation coefficients) between physical therapists and nurses were 0.96 for AM-PAC IMSF and 0.99 for JH-HLM. Construct validity (Spearman correlations) ranged from 0.25 between JH-HLM and right-hand grip strength to 0.80 between AM-PAC IMSF and the Katz Activities of

Reliability of concentrations of organophosphate pesticide metabolites in serial urine specimens from pregnancy in the Generation R study

Science.gov (United States)

Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M.; Jusko, Todd A.; Jaddoe, Vincent W.V.; Shaw, Pamela A.; Tiemeier, Henning M.; Hofman, Albert; Pierik, Frank H.; Longnecker, Matthew P.

2014-01-01

The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at 25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiologic studies. PMID:25515376
How reliable are Functional Movement Screening scores? A systematic review of rater reliability.

Science.gov (United States)

Moran, Robert W; Schneiders, Anthony G; Major, Katherine M; Sullivan, S John

2016-05-01

Several physical assessment protocols to identify intrinsic risk factors for injury aetiology related to movement quality have been described. The Functional Movement Screen (FMS) is a standardised, field-expedient test battery intended to assess movement quality and has been used clinically in preparticipation screening and in sports injury research. To critically appraise and summarise research investigating the reliability of scores obtained using the FMS battery. Systematic literature review. Systematic search of Google Scholar, Scopus (including ScienceDirect and PubMed), EBSCO (including Academic Search Complete, AMED, CINAHL, Health Source: Nursing/Academic Edition), MEDLINE and SPORTDiscus. Studies meeting eligibility criteria were assessed by 2 reviewers for risk of bias using the Quality Appraisal of Reliability Studies checklist. Overall quality of evidence was determined using van Tulder's levels of evidence approach. 12 studies were appraised. Overall, there was a 'moderate' level of evidence in favour of 'acceptable' (intraclass correlation coefficient ≥0.6) inter-rater and intra-rater reliability for composite scores derived from live scoring. For inter-rater reliability of composite scores derived from video recordings there was 'conflicting' evidence, and 'limited' evidence for intra-rater reliability. For inter-rater reliability based on live scoring of individual subtests there was 'moderate' evidence of 'acceptable' reliability (κ≥0.4) for 4 subtests (Deep Squat, Shoulder Mobility, Active Straight-leg Raise, Trunk Stability Push-up) and 'conflicting' evidence for the remaining 3 (Hurdle Step, In-line Lunge, Rotary Stability). This review found 'moderate' evidence that raters can achieve acceptable levels of inter-rater and intra-rater reliability of composite FMS scores when using live ratings. Overall, there were few high-quality studies, and the quality of several studies was impacted by poor study reporting particularly in relation to
Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial

Directory of Open Access Journals (Sweden)

Kevin A. Hallgren

2012-02-01

Full Text Available Many research designs require the assessment of inter-rater reliability (IRR to demonstrate consistency among observational ratings provided by multiple coders. However, many studies use incorrect statistical procedures, fail to fully report the information necessary to interpret their results, or do not address how IRR affects the power of their subsequent analyses for hypothesis testing. This paper provides an overview of methodological issues related to the assessment of IRR with a focus on study design, selection of appropriate statistics, and the computation, interpretation, and reporting of some commonly-used IRR statistics. Computational examples include SPSS and R syntax for computing Cohens kappa and intra-class correlations to assess IRR.
Reliability assessment of a peer evaluation instrument in a team-based learning course

Directory of Open Access Journals (Sweden)

Wahawisan J

2016-03-01

Full Text Available Objective: To evaluate the reliability of a peer evaluation instrument in a longitudinal team-based learning setting. Methods: Student pharmacists were instructed to evaluate the contributions of their peers. Evaluations were analyzed for the variance of the scores by identifying low, medium, and high scores. Agreement between performance ratings within each group of students was assessed via intra-class correlation coefficient (ICC. Results: We found little variation in the standard deviation (SD based on the score means among the high, medium, and low scores within each group. The lack of variation in SD of results between groups suggests that the peer evaluation instrument produces precise results. The ICC showed strong concordance among raters. Conclusions: Findings suggest that our student peer evaluation instrument provides a reliable method for peer assessment in team-based learning settings.
Intra-operative reliability of ShapeMatch cutting guide placement in total knee arthroplasty.

Science.gov (United States)

Clark, Gavin; Leong, Anthony; McEwen, Peter; Steele, Robert; Tran, Ton; Trivett, Adrian

2013-01-01

Custom cutting guides based on pre-operative imaging have been introduced for total knee arthroplasty (TKA). The aim of this prospective cohort study was to assess the reliability of repeated placement of custom cutting guides by multiple surgeons in a group of patients undergoing TKA. Custom cutting guides (ShapeMatch®, Stryker Orthopaedics) were designed from pre-operative MRI scans. The treating surgeon placed each guide on the femur and tibia of each patient three times without pinning the block. The three-dimensional position and orientation of the guide was measured for each repetition using a computer navigation system. The surgeon was blinded to the navigation system display. Data from 24 patients and 6 surgeons were analyzed. Intraclass correlation coefficients for all measurement parameters were in the range 0.889-0.997 (excellent), and all comparisons were statistically significant (p reliable.
Inter-Rater Reliability of Cyclotorsion Measurements Using Fundus Photography.

Science.gov (United States)

Dysli, Muriel; Kanku, Madeleine; Traber, Ghislaine L

2018-04-01

The foveo-papillary angle (FPA) on fundus photographs is the accepted standard for the measurement of ocular cyclotorsion. We assessed the inter-rater reliability of this method in healthy subjects and in patients with trochlear nerve palsies. In this methodological study, fundus photographs of healthy subjects and of patients with trochlear nerve palsies were made with a fundus camera (Zeiss Fundus Camera FF 450 plus, Jena, Germany). Three independent observers measured the FPA on the fundus photographs of all subjects in synedra View (synedra View 16, Version 16.0.0.11, Innsbruck, Austria). One hundred and four eyes of 52 subjects (26 healthy controls and 26 patients) were assessed. The mean FPA of the healthy controls was 5.80 degrees (°) [± 0.44 standard error of the mean (SEM)] compared to 11.55° (± 0.80 SEM) for patients with trochlear nerve palsies. The inter-rater reliability of all measured FPAs showed an intraclass correlation coefficient (ICC) of 0.98 (95% CI 0.97 - 0.98). The inter-rater reliability of objective cyclotorsion measurements using fundus photographs was very high. Georg Thieme Verlag KG Stuttgart · New York.
Validity and Reliability of the Upper Extremity Work Demands Scale.

Science.gov (United States)

Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K

2017-12-01

Purpose To evaluate validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise to investigate other types of validity, such as construct validity, in further research.
The reliability of the Glasgow Coma Scale: a systematic review.

Science.gov (United States)

Reith, Florence C M; Van den Brande, Ruben; Synnot, Anneliese; Gruen, Russell; Maas, Andrew I R

2016-01-01

The Glasgow Coma Scale (GCS) provides a structured method for assessment of the level of consciousness. Its derived sum score is applied in research and adopted in intensive care unit scoring systems. Controversy exists on the reliability of the GCS. The aim of this systematic review was to summarize evidence on the reliability of the GCS. A literature search was undertaken in MEDLINE, EMBASE and CINAHL. Observational studies that assessed the reliability of the GCS, expressed by a statistical measure, were included. Methodological quality was evaluated with the consensus-based standards for the selection of health measurement instruments checklist and its influence on results considered. Reliability estimates were synthesized narratively. We identified 52 relevant studies that showed significant heterogeneity in the type of reliability estimates used, patients studied, setting and characteristics of observers. Methodological quality was good (n = 7), fair (n = 18) or poor (n = 27). In good quality studies, kappa values were ≥0.6 in 85%, and all intraclass correlation coefficients indicated excellent reliability. Poor quality studies showed lower reliability estimates. Reliability for the GCS components was higher than for the sum score. Factors that may influence reliability include education and training, the level of consciousness and type of stimuli used. Only 13% of studies were of good quality and inconsistency in reported reliability estimates was found. Although the reliability was adequate in good quality studies, further improvement is desirable. From a methodological perspective, the quality of reliability studies needs to be improved. From a clinical perspective, a renewed focus on training/education and standardization of assessment is required.
Intraclass Correlation Coefficients for Obesity Indicators and Energy Balance-Related Behaviors Among New York City Public Elementary Schools.

Science.gov (United States)

Gray, Heewon Lee; Burgermaster, Marissa; Tipton, Elizabeth; Contento, Isobel R; Koch, Pamela A; Di Noia, Jennifer

2016-04-01

Sample size and statistical power calculation should consider clustering effects when schools are the unit of randomization in intervention studies. The objective of the current study was to investigate how student outcomes are clustered within schools in an obesity prevention trial. Baseline data from the Food, Health & Choices project were used. Participants were 9- to 13-year-old students enrolled in 20 New York City public schools (n= 1,387). Body mass index (BMI) was calculated based on measures of height and weight, and body fat percentage was measured with a Tanita® body composition analyzer (Model SC-331s). Energy balance-related behaviors were self-reported with a frequency questionnaire. To examine the cluster effects, intraclass correlation coefficients (ICCs) were calculated as school variance over total variance for outcome variables. School-level covariates, percentage students eligible for free and reduced-price lunch, percentage Black or Hispanic, and English language learners were added in the model to examine ICC changes. The ICCs for obesity indicators are: .026 for BMI-percentile, .031 for BMIz-score, .035 for percentage of overweight students, .037 for body fat percentage, and .041 for absolute BMI. The ICC range for the six energy balance-related behaviors are .008 to .044 for fruit and vegetables, .013 to .055 for physical activity, .031 to .052 for recreational screen time, .013 to .091 for sweetened beverages, .033 to .121 for processed packaged snacks, and .020 to .083 for fast food. When school-level covariates were included in the model, ICC changes varied from -95% to 85%. This is the first study reporting ICCs for obesity-related anthropometric and behavioral outcomes among New York City public schools. The results of the study may aid sample size estimation for future school-based cluster randomized controlled trials in similar urban setting and population. Additionally, identifying school-level covariates that can reduce cluster
Analysis of the reliability and validity of the Turkish version of the intermittent and constant osteoarthritis pain questionnaire.

Science.gov (United States)

Erel, Suat; Şimşek, İbrahim Engin; Özkan, Hüseyin

2015-01-01

The aim of this study was to analyze the validity and reliability of the Turkish version (ICOAP-TR) of the intermittent and constant osteoarthritis pain (ICOAP) questionnaire in patients with knee osteoarthritis (OA). Thirty-eight volunteer patients diagnosed with knee OA answered the questionnaire twice with an interval of 2-4 days. The reliability of the measurement was assessed using Cronbach's alpha coefficient and intraclass correlation (ICC) for test-retest reliability. Criterion validity was tested against the Western Ontario and McMaster Universities Arthritis Index (WOMAC) pain score and visual analog scale (VAS) designed to assess the perceived discomfort rated by the patient. Test-retest reliability was found to be ICC=0.942 for total score, 0.902 for constant pain subscale, and 0.945 for intermittent pain subscale. Internal consistency was tested using Cronbach's alpha and was found to be 0.970 for total score, 0.948 for constant pain subscale, and 0.972 for intermittent pain subscale. For criterion validity, the correlation between the total score of ICOAP-TR and WOMAC pain subscale was r=0.779 (p<0.05), and correlation between total score of ICOAP-TR and VAS was r=0.570 (p<0.05). The ICOAP-TR is a reliable and valid instrument to be used with patients with knee OA.
Confiabilidade de medidas volumétricas de estruturas temporais mesiais Reliability of mesial temporal lobe volumetric measures

Directory of Open Access Journals (Sweden)

Renato L. Marchetti

2002-06-01

Full Text Available MOTIVO DO ESTUDO: O desenvolvimento de técnicas confiáveis para a realização de medidas volumétricas de estruturas temporais mesiais (amígdala, hipocampo e giro para-hipocampal em exames de ressonância magnética (RM pode fornecer dados para o estudo de vários transtornos neuropsiquiátricos, particularmente epilepsia do lobo temporal, doença de Alzheimer e esquizofrenia. MÉTODO: Investigamos essas técnicas realizando estudo de confiabilidade intra-observador (IO e entre-observador (EO, envolvendo controles normais, pacientes com epilepsia e pacientes com doença de Alzheimer, através do coeficiente de correlação intra-classe (CCI. RESULTADOS: A confiabilidade IO para as estruturas analisadas variou de 0,93 a 0,99 (pRATIONALE: The development of reliable techniques for volumetric measurement of mesial temporal structures (amygdala, hypocampus and parahypocampal gyrus on magnetic resonance imaging (MRI can provide data for the study of neuropsychiatric disorders, mainly temporal lobe epilepsy, Alzheimer´s disease and schizophrenia. METHOD: We investigated these techniques performing intraobserver and interobserver reliability study concerning normal controls, epilepsy and Alzheimer's disease patients using the intra-class correlation coefficient. RESULTS: Intra-observer reliability of evaluated structures ranged from 0.93 to 0.99 (p<0.001. Inter-observer reliability ranged from 0.70 to 0.95 (p <= 0.001. CONCLUSION: The results suggest that the technique of MRI morphometry of mesial temporal regions can be considered a reliable tool which may help in the investigation of neuropsychiatric disorders, since used by adequately trained clinicians and researchers.
Context-sensitive intra-class clustering

KAUST Repository

Yu, Yingwei

2014-02-01

This paper describes a new semi-supervised learning algorithm for intra-class clustering (ICC). ICC partitions each class into sub-classes in order to minimize overlap across clusters from different classes. This is achieved by allowing partitioning of a certain class to be assisted by data points from other classes in a context-dependent fashion. The result is that overlap across sub-classes (both within- and across class) is greatly reduced. ICC is particularly useful when combined with algorithms that assume that each class has a unimodal Gaussian distribution (e.g., Linear Discriminant Analysis (LDA), quadratic classifiers), an assumption that is not always true in many real-world situations. ICC can help partition non-Gaussian, multimodal distributions to overcome such a problem. In this sense, ICC works as a preprocessor. Experiments with our ICC algorithm on synthetic data sets and real-world data sets indicated that it can significantly improve the performance of LDA and quadratic classifiers. We expect our approach to be applicable to a broader class of pattern recognition problems where class-conditional densities are significantly non-Gaussian or multi-modal. © 2013 Elsevier Ltd. All rights reserved.
Method of administration of PROMIS scales did not significantly impact score level, reliability, or validity

DEFF Research Database (Denmark)

Bjorner, Jakob B; Rose, Matthias; Gandek, Barbara

2014-01-01

OBJECTIVES: To test the impact of the method of administration (MOA) on score level, reliability, and validity of scales developed in the Patient Reported Outcomes Measurement Information System (PROMIS). STUDY DESIGN AND SETTING: Two nonoverlapping parallel forms each containing eight items from......, no significant mode differences were found and all confidence intervals were within the prespecified minimal important difference of 0.2 standard deviation. Parallel-forms reliabilities were very high (ICC = 0.85-0.93). Only one across-mode ICC was significantly lower than the same-mode ICC. Tests of validity...... questionnaire (PQ), personal digital assistant (PDA), or personal computer (PC) and a second form by PC, in the same administration. Method equivalence was evaluated through analyses of difference scores, intraclass correlations (ICCs), and convergent/discriminant validity. RESULTS: In difference score analyses...
Validity and reliability of the Turkish version of the Manchester-Oxford Foot Questionnaire for hallux valgus deformity evaluation.

Science.gov (United States)

Talu, Burcu; Bayramlar, Kezban; Bek, Nilgün; Yakut, Yavuz

2016-01-01

The aim of this study was to evaluate the reliability and validity of the Turkish version of the Manchester-Oxford Foot Questionnaire (MOXFQ) in patients affected by hallux valgus in order to assess the accuracy of this cross-cultural adaption. Thirty female volunteers aged between 18 and 55 years were included in the study. Subjects with hallux valgus were asked to complete the MOXFQ and the Short-Form 36 Health Survey (SF-36). After receiving permission from the author, the MOXFQ was translated into Turkish twice and then back translated to English, after which its compatibility was evaluated. The Turkish version of the MOXFO was applied twice, 1-3 days apart, to the study subjects. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and intraclass correlation coefficient (ICC), respectively. Construct validity was assessed with the use of Spearman's rank correlation coefficient, using a priori hypothesized correlations with SF-36 domains. Subjects achieved similar scores at the first and second administration of the questionnaire (validity was supported by the presence of all the hypothesized correlations, with SF-36 within its physical parameters. The Turkish version of the MOXFQ is a valid and reliable tool for evaluating foot pain and functional status in patients affected by hallux valgus.
Validade concorrente e confiabilidade da Alberta Infant Motor Scale em lactentes nascidos prematuros Concurrent validity and reliability of the Alberta Infant Motor Scale in premature infants

Directory of Open Access Journals (Sweden)

Kênnea Martins Almeida

2008-10-01

Full Text Available OBJETIVO: Verificar a validade concorrente e a confiabilidade interobservador da Alberta Infant Motor Scale (AIMS em lactentes prematuros acompanhados no ambulatório de seguimento do Instituto Fernandes Figueira, Fundação Oswaldo Cruz (IFF/Fiocruz. MÉTODOS: Foram avaliados 88 lactentes nascidos prematuros no ambulatório de seguimento do IFF/Fiocruz entre fevereiro e dezembro de 2006. No estudo de validade concorrente, 46 lactentes com 6 (n = 26 ou 12 (n = 20 meses de idade corrigida foram avaliados pela AIMS e pela escala motora da Bayley Scales of Infant Development, 2ª edição, por dois observadores diferentes, utilizando-se o coeficiente de correlação de Pearson para análise dos resultados. No estudo de confiabilidade, 42 lactentes entre 0 e 18 meses foram avaliados pela AIMS por dois observadores diferentes, utilizando-se o intraclass correlation coefficient (ICC para análise dos resultados. RESULTADOS: No estudo de validade concorrente, a correlação encontrada entre as duas escalas foi alta (r = 0,95 e estatisticamente significativa (p OBJECTIVE: To verify the concurrent validity and interobserver reliability of the Alberta Infant Motor Scale (AIMS in premature infants followed-up at the outpatient clinic of Instituto Fernandes Figueira, Fundação Oswaldo Cruz (IFF/Fiocruz, in Rio de Janeiro, Brazil. METHODS: A total of 88 premature infants were enrolled at the follow-up clinic at IFF/Fiocruz, between February and December of 2006. For the concurrent validity study, 46 infants were assessed at either 6 (n = 26 or 12 (n = 20 months' corrected age using the AIMS and the second edition of the Bayley Scales of Infant Development, by two different observers, and applying Pearson's correlation coefficient to analyze the results. For the reliability study, 42 infants between 0 and 18 months were assessed using the Alberta Infant Motor Scale, by two different observers and the results analyzed using the intraclass correlation
Absolute and Relative Reliability of the Timed 'Up & Go' Test and '30second Chair-Stand' Test in Hospitalised Patients with Stroke

DEFF Research Database (Denmark)

Lyders Johansen, Katrine; Derby Stistrup, Rikke; Skibdal Schjøtt, Camilla

2016-01-01

OBJECTIVE: The timed 'Up & Go' test and '30second Chair-Stand' test are simple clinical outcome measures widely used to assess functional performance. The reliability of both tests in hospitalised stroke patients is unknown. The purpose was to investigate the relative and absolute reliability...... of both tests in patients admitted to an acute stroke unit. METHODS: Sixty-two patients (men, n = 41) attended two test sessions separated by a one hours rest. Intraclass correlation coefficients (ICC2,1) were calculated to assess relative reliability. Absolute reliability was expressed as Standard Error...... of Measurement (with 95% certainty-SEM95) and Smallest Real Difference (SRD) and as percentage of their respective means if heteroscedasticity was observed in Bland Altman plots (SEM95% and SRD%). RESULTS: ICC values for interrater reliability were 0.97 and 0.99 for the timed 'Up & Go' test and 0.88 and 0...
[Reliability of the PRISCUS-PAQ. Questionnaire to assess physical activity of persons aged 70 years and older].

Science.gov (United States)

Trampisch, U; Platen, P; Burghaus, I; Moschny, A; Wilm, S; Thiem, U; Hinrichs, T

2010-12-01

A questionnaire (Q) to measure physical activity (PA) of persons ≥70 years for epidemiological research is lacking. The aim was to develop the PRISCUS-PAQ and test the reliability in community-dwelling people (≥70 years). Validated PA questionnaires were translated and adapted to design the PRISCUS-PAQ. Its test-retest reliability for 91 randomly selected people (36% men) aged 70-98 (76±5) years ranged from 0.47 (walking) to 0.82 (riding a bicycle). The overall activity score was 0.59 as determined by the intraclass correlation coefficient (ICC). Recording of general activities, e.g., housework (ICC=0.59), was in general less reliable than athletic activities, e.g., gymnastics (ICC=0.76). The PRISCUS-PAQ, which is a short instrument with acceptable reliability to collect the physical activity of the elderly in a telephone interview, will be used to collect data in a large cohort of older people in the German research consortium PRISCUS.
Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

Science.gov (United States)

Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

2018-05-01

Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80º knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.
Reliability and validity of a Chinese version of the Diagnostic Interview for Borderlines-Revised.

Science.gov (United States)

Wang, Lanlan; Yuan, Chenmei; Qiu, Jianying; Gunderson, John; Zhang, Min; Jiang, Kaida; Leung, Freedom; Zhong, Jie; Xiao, Zeping

2014-09-01

Borderline personality disorder (BPD) is the most studied of the axis II disorders. One of the most widely used diagnostic instruments is the Diagnostic Interview for Borderline Patients-Revised (DIB-R). The aim of this study was to test the reliability and validity of DIB-R for use in the Chinese culture. The reliability and validity of the DIB-R Chinese version were assessed in a sample of 236 outpatients with a probable BPD diagnosis. The Structured Clinical Interview for DSM-IV Personality Disorders (SCID-II) was used as a standard. Test-retest reliability was tested six months later with 20 patients, and inter-rater reliability was tested on 32 patients. The Chinese version of the DIB-R showed good internal global consistency (Cronbach's α of 0.916), good test-retest reliability (Pearson correlation of 0.704), good inter-rater reliability (intra-class correlation coefficient of 0.892 and kappa of 0.861). When compared with the DSM-IV diagnosis as measured by the SCID-II, the DIB-R showed relatively good sensitivity (0.768) and specificity (0.891) at the cutoff of 7, moderate diagnostic convergence (kappa of 0.631), as well as good discriminating validity. The Chinese version of the DIB-R has good psychometric properties, which renders it a valuable method for examining the presence, the severity, and component phenotypes of BPD in Chinese samples. © 2013 Wiley Publishing Asia Pty Ltd.
Reliability of levator scapulae index in subjects with and without scapular downward rotation syndrome.

Science.gov (United States)

Lee, Ji-Hyun; Cynn, Heon-Seock; Choi, Woo-Jeong; Jeong, Hyo-Jung; Yoon, Tae-Lim

2016-05-01

The objective of this study was to introduce levator scapulae (LS) measurement using a caliper and the levator scapulae index (LSI) and to investigate intra- and interrater reliability of the LSI in subjects with and without scapular downward rotation syndrome (SDRS). Two raters measured LS length twice in 38 subjects (19 with SDRS and 19 without SDRS). For reliability testing, intraclass correlation coefficients (ICCs), standard error of measurement (SEM), and minimal detectable change (MDC) were calculated. Intrarater reliability analysis resulted with ICCs ranging from 0.94 to 0.98 in subjects with SDRS and 0.96 to 0.98 in subjects without SDRS. These results represented that intrarater reliability in both groups were excellent for measuring LS length with the LSI. Interrater reliability was good (ICC: 0.82) in subjects with SDRS; however, interrater reliability was moderate (ICC: 0.75) in subjects without SDRS. Additionally, SEM and MDC were 0.13% and 0.36% in subjects with SDRS and 0.35% and 0.97% in subjects without SDRS. In subjects with SDRS, low dispersion of the measurement errors and MDC were shown. This study suggested that the LSI is a reliable method to measure LS length and is more reliable for subjects with SDRS. Copyright © 2015 Elsevier Ltd. All rights reserved.

Test-retest and between-site reliability in a multicenter fMRI study.

Science.gov (United States)

Friedman, Lee; Stern, Hal; Brown, Gregory G; Mathalon, Daniel H; Turner, Jessica; Glover, Gary H; Gollub, Randy L; Lauriello, John; Lim, Kelvin O; Cannon, Tyrone; Greve, Douglas N; Bockholt, Henry Jeremy; Belger, Aysenil; Mueller, Bryon; Doty, Michael J; He, Jianchun; Wells, William; Smyth, Padhraic; Pieper, Steve; Kim, Seyoung; Kubicki, Marek; Vangel, Mark; Potkin, Steven G

2008-08-01

In the present report, estimates of test-retest and between-site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR-deconvolution analysis with FMRISTAT. Six functionally-derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast-to-noise-ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test-retest reliability was high, but initially, between-site reliability was low, indicating a strong contribution from site and site-by-subject variance. However, a number of factors that can markedly improve between-site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between-site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance toothers on the best practices for future multicenter studies.
The reliability of surface EMG recorded from the pelvic floor muscles.

Science.gov (United States)

Auchincloss, Cindy C; McLean, Linda

2009-08-30

The neuromuscular function of the pelvic floor muscles (PFMs) is frequently evaluated using surface electrodes embedded on vaginal probes. The purpose of this study was to determine the between-trial and between-day reliability of EMG data recorded from the PFM using two different vaginal probes while subjects performed PFM maximum voluntary contractions and a coughing task. The Femiscan and the Periform vaginal probes were used to acquire EMG data while the subjects performed the tasks. Peak RMS amplitudes were computed for each instrument, task, and side of the pelvic floor using a sliding window technique. The between-trial reliability was evaluated using intraclass correlation coefficients (ICCs) and coefficients of variation (CV). Between-trial reliability was determined using ICCs, Pearson's correlation coefficients, computing the mean absolute difference between days, and calculating the standard error the measurement (SEM) for each instrument and task. EMG amplitude differences were detected between the left and right PFM (pperformed separately for each side. Overall, between-trial reliability was fair to high for the Femiscan (ICC((3,1))=0.58-0.98, CV=8.5-20.7%) and good to high for the Periform (ICC((3,1))=0.80-0.98, CV=9.6-19.5%), however between-day reliability was generally poor for both vaginal probes (ICC((3,1))=0.08-0.84). The results suggest that although it is acceptable to use PFM surface EMG as a biofeedback tool for training purposes, it is not recommended for use to make between-subject comparisons or to use as an outcome measure between-days when evaluating PFM function.
Reliability and validity of the Turkish version of ABILHAND-Kids' questionnaire in a group of patients with neuromuscular disorders.

Science.gov (United States)

Öksüz, Çigdem; Alemdaroglu, Ipek; Kilinç, Muhammed; Abaoğlu, Hatice; Demirci, Cevher; Karahan, Sevilay; Yilmaz, Oznur; Yildirim, Sibel Aksu

2017-10-01

This study was performed to examine the reliability and validity of the Turkish version of ABILHAND-Kids questionnaire which assesses manual functions of children with neuromuscular diseases (NMDs). A cross sectional survey study design and Rasch analysis were used to assess the reliability and validity of the Turkish version of scale. Ninety-three children with different neuromuscular disorders and their parents were included in the study. The scale was applied to the parents with face-to-face interview twice; on their first visit and after an interval of 15 days. The test-retest reliability was assessed with intraclass correlation coefficient (ICC), and internal consistency of the multi-item subscales by calculating Cronbach alpha values. Brooke Upper Extremity Functional Classification (BUEFC) and Wee-Functional Independency Measurement (Wee-FIM) were correlated to determine the construct validity. The ICC value for the test/retest reliability was 0.94. The internal consistency was 0.81. Floor (1.1%) and ceiling (11.8%) effects were not significant. There were moderate correlations between the Turkish version of ABILHAND-Kids and Wee-FIM (0.67) and BUEFC (-0.37). Rasch analysis indicated good item ﬁt, unidimensionality, and model ﬁt. The Turkish version of ABILHAND-Kids questionnaire was found to be a reliable and valid scale for the assessment of the manual ability of children with NMDs.
A Test-Retest Reliability Study of the Whiplash Disability Questionnaire in Patients With Acute Whiplash-Associated Disorders.

Science.gov (United States)

Stupar, Maja; Côté, Pierre; Beaton, Dorcas E; Boyle, Eleanor; Cassidy, J David

2015-01-01

The purpose of this study was to determine the test-retest reliability and the Minimal Detectable Change (MDC) of the Whiplash Disability Questionnaire (WDQ) in individuals with acute whiplash-associated disorders (WADs). We performed a test-retest reliability study. We included insurance claimants from Ontario who were at least 18 years of age, within 21 days of their motor vehicle collision and diagnosed as having acute WAD grades I to III. The WDQ, a 13-item questionnaire scored from 0 (no disability) to 130 (complete disability), was administered to all participants at baseline and by telephone 3 days later. We computed the intraclass correlation coefficient (model 2,1) and the MDC with 95% confidence intervals (CIs; MDC95). The mean (SD) age of the 66 participants was 41.6 (12.7) years and 71.2% were female. Twenty-nine percent had WAD I and 71.2% had WAD II. Time since injury ranged from 0 to 19 days. The mean (SD) baseline WDQ score was 49.3 (28.8) and 46.5 (29.8) 3 days later. The intraclass correlation coefficient for the WDQ total score was 0.89 (95% CI, 0.85-0.92) in the entire sample and 0.83 (95% CI, 0.69-0.93) for the 15 participants reporting no change in neck pain. The MDC95 of the WDQ was 21.4 (SD = 14.9) for participants reporting no change. The WDQ was reliable in individuals with acute WAD. There is 95% confidence that a change of approximately one-sixth of the total score is beyond the daily variation of a stable condition. This level of measurement error must be taken into consideration when interpreting change in WDQ scores. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.
Intra- and Interobserver Reliability of Three Classification Systems for Hallux Rigidus.

Science.gov (United States)

Dillard, Sarita; Schilero, Christina; Chiang, Sharon; Pham, Peter

2018-04-18

There are over ten classification systems currently used in the staging of hallux rigidus. This results in confusion and inconsistency with radiographic interpretation and treatment. The reliability of hallux rigidus classification systems has not yet been tested. The purpose of this study was to evaluate intra- and interobserver reliability using three commonly used classifications for hallux rigidus. Twenty-one plain radiograph sets were presented to ten ACFAS board-certified foot and ankle surgeons. Each physician classified each radiograph based on clinical experience and knowledge according to the Regnauld, Roukis, and Hattrup and Johnson classification systems. The two-way mixed single-measure consistency intraclass correlation was used to calculate intra- and interrater reliability. The intrarater reliability of individual sets for the Roukis and Hattrup and Johnson classification systems was "fair to good" (Roukis, 0.62±0.19; Hattrup and Johnson, 0.62±0.28), whereas the intrarater reliability of individual sets for the Regnauld system bordered between "fair to good" and "poor" (0.43±0.24). The interrater reliability of the mean classification was "excellent" for all three classification systems. Conclusions Reliable and reproducible classification systems are essential for treatment and prognostic implications in hallux rigidus. In our study, Roukis classification system had the best intrarater reliability. Although there are various classification systems for hallux rigidus, our results indicate that all three of these classification systems show reliability and reproducibility.
The prone bridge test: Performance, validity, and reliability among older and younger adults.

Science.gov (United States)

Bohannon, Richard W; Steffl, Michal; Glenney, Susan S; Green, Michelle; Cashwell, Leah; Prajerova, Kveta; Bunn, Jennifer

2018-04-01

The prone bridge maneuver, or plank, has been viewed as a potential alternative to curl-ups for assessing trunk muscle performance. The purpose of this study was to assess prone bridge test performance, validity, and reliability among younger and older adults. Sixty younger (20-35 years old) and 60 older (60-79 years old) participants completed this study. Groups were evenly divided by sex. Participants completed surveys regarding physical activity and abdominal exercise participation. Height, weight, body mass index (BMI), and waist circumference were measured. On two occasions, 5-9 days apart, participants held a prone bridge until volitional exhaustion or until repeated technique failure. Validity was examined using data from the first session: convergent validity by calculating correlations between survey responses, anthropometrics, and prone bridge time, known groups validity by using an ANOVA comparing bridge times of younger and older adults and of men and women. Test-retest reliability was examined by using a paired t-test to compare prone bridge times for Session1 and Session 2. Furthermore, an intraclass correlation coefficient (ICC) was used to characterize relative reliability and minimal detectable change (MDC 95% ) was used to describe absolute reliability. The mean prone bridge time was 145.3 ± 71.5 s, and was positively correlated with physical activity participation (p ≤ 0.001) and negatively correlated with BMI and waist circumference (p ≤ 0.003). Younger participants had significantly longer plank times than older participants (p = 0.003). The ICC between testing sessions was 0.915. The prone bridge test is a valid and reliable measure for evaluating abdominal performance in both younger and older adults. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reliability of a new questionnaire for the evaluation of habitual physical activity and food consumption in children DOI:10.5007/1980-0037.2010v12n1p21

Directory of Open Access Journals (Sweden)

Filipe Ferreira da Costa

2010-12-01

Full Text Available The aim of this study was to determine the reliability of the Physical Activity and Food Consumption (PAFC questionnaire in schoolchildren from a private school in Natal-RN, Brazil. A total of 101 children, 57 boys and 44 girls (mean age: 9.4 years, SD: 1.03, range: 7.3 to 11.6 in the second to fourth grade of elementary school were recruited. An expanded version of the PAFC questionnaire was applied at the school by a single researcher, with an average of 15 days between test and retest. The coefficient of relative agreement, intraclass correlation coefficient, Spearman’s correlation coefficient, kappa index of agreement, PABAK, and Wilcoxon signed-rank test were used to determine reliability. In general, relatively consistent measures between the two questionnaire sessions were found for items related to attitude towards exercise (0.41, means of transportation used to travel to and from school (0.79, and the remaining 11 physical activities (0.69. An intraclass correlation of 0.87 as obtained for the overall physical activity index. Twenty-seven of the 42 items presented moderate to good agreement (mean kappa index: 0.51. The PAFC questionnaire showed moderate to good reliability for most of its items and seems to be a suitable instrument for the evaluation of physical activity and food intake behavior in schoolchildren. Moreover, the questionnaire might be used as an alternative for the classification of more and less active individuals as well as for the identification of healthy and inadequate dietary patterns.
TEST-RETEST RELIABILITY OF THE CLOSED KINETIC CHAIN UPPER EXTREMITY STABILITY TEST (CKCUEST) IN ADOLESCENTS: RELIABILITY OF CKCUEST IN ADOLESCENTS.

Science.gov (United States)

de Oliveira, Valéria M A; Pitangui, Ana C R; Nascimento, Vinícius Y S; da Silva, Hítalo A; Dos Passos, Muana H P; de Araújo, Rodrigo C

2017-02-01

The Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) has been proposed as an option to assess upper limb function and stability; however, there are few studies that support the use of this test in adolescents. The purpose of the present study was to investigate the intersession reliability and agreement of three CKCUEST scores in adolescents and establish clinimetric values for this test. Test-retest reliability. Twenty-five healthy adolescents of both sexes were evaluated. The subjects performed two CKCUEST with an interval of one week between the tests. An intraclass correlation coefficient (ICC 3,3 ) two-way mixed model with a 95% interval of confidence was utilized to determine intersession reliability. A Bland-Altman graph was plotted to analyze the agreement between assessments. The presence of systematic error was evaluated by a one-sample t test. The difference between the evaluation and reevaluation was observed using a paired-sample t test. The level of significance was set at 0.05. Standard error of measurements and minimum detectable changes were calculated. The intersession reliability of the average touches score, normalized score, and power score were 0.68, 0.68 and 0.87, the standard error of measurement were 2.17, 1.35 and 6.49, and the minimal detectable change was 6.01, 3.74 and 17.98, respectively. The presence of systematic error (p test with moderate to excellent reliability when used with adolescents. The CKCUEST is a measurement with moderate to excellent reliability for adolescents. 2b.
Test of gross motor development-2 for Filipino children with intellectual disability: validity and reliability.

Science.gov (United States)

Capio, Catherine M; Eguia, Kathlynne F; Simons, Johan

2016-01-01

This study aimed to examine aspects of validity and reliability of the Test of Gross Motor Development-2 (TGMD-2) in Filipino children with intellectual disability. Content and construct validity were verified, as well as inter-rater and intra-rater reliability. Two paediatric physiotherapists tested 81 children with intellectual disability (mean age = 9.29 ± 2.71 years) on locomotor and object control skills. Analysis of covariance, confirmatory factor analysis and analysis of variance were used to test validity, while Cronbach's alpha, intra-class correlation coefficients (ICC) and Bland-Altman plots were used to examine reliability. Age was a significant predictor of locomotor and object control scores (P = 0.004). The data fit the hypothesised two-factor model with fit indices as follows: χ(2) = 33.525, DF = 34, P = 0.491, χ(2)/DF = 0.986. As hypothesised, gender was a significant predictor for object control skills (P = 0.038). Participants' mean scores were significantly below mastery (locomotor, P intellectual disability.
Reliability characteristics and applicability of a repeated sprint ability test in male young soccer players

DEFF Research Database (Denmark)

Castagna, Carlo; Francini, Lorenzo; Krustrup, Peter

2018-01-01

The aim of this study was to examine the usefulness and reliability characteristics of a repeated sprint ability test considering 5 line sprints of 30-m interspersed with 30-s of active recovery in non-elite outfield young male soccer players. Twenty-six (age 14.9±1.2 years, height 1.72±0.12 cm......, body mass 62.2±5.1 kg) players were tested 48 hours and 7 days apart for 5x30-m performance over 5 trials (T1-T5). Short- (T1-T2) and long-term reliability (T1-T3-T4-T5) were assessed with Intraclass Correlation Coefficient (ICC) and with typical error for measurement (TEM). Short- and long...... study revealed that the 5x30-m sprint test is a reliable field test in the short and long-term when the sum of sprint times and the best sprint performance are considered as outcome variables. Sprint performance decrements variables showed large variability across trials....
Reliability of Strength Testing using the Advanced Resistive Exercise Device and Free Weights

Science.gov (United States)

English, Kirk L.; Loehr, James A.; Laughlin, Mitzi A.; Lee, Stuart M. C.; Hagan, R. Donald

2008-01-01

The Advanced Resistive Exercise Device (ARED) was developed for use on the International Space Station as a countermeasure against muscle atrophy and decreased strength. This investigation examined the reliability of one-repetition maximum (1RM) strength testing using ARED and traditional free weight (FW) exercise. Methods: Six males (180.8 +/- 4.3 cm, 83.6 +/- 6.4 kg, 36 +/- 8 y, mean +/- SD) who had not engaged in resistive exercise for at least six months volunteered to participate in this project. Subjects completed four 1RM testing sessions each for FW and ARED (eight total sessions) using a balanced, randomized, crossover design. All testing using one device was completed before progressing to the other. During each session, 1RM was measured for the squat, heel raise, and deadlift exercises. Generalizability (G) and intraclass correlation coefficients (ICC) were calculated for each exercise on each device and were used to predict the number of sessions needed to obtain a reliable 1RM measurement (G . 0.90). Interclass reliability coefficients and Pearson's correlation coefficients (R) also were calculated for the highest 1RM value (1RM9sub peak)) obtained for each exercise on each device to quantify 1RM relationships between devices.
The reliability and validity of the standardized Mensendieck test in relation to disability in patients with chronic pain.

Science.gov (United States)

Keessen, Paul; Maaskant, Jolanda; Visser, Bart

2018-08-01

The standardized Mensendieck test (SMT) was developed to quantify posture, movement, gait, and respiration. In the hands of an experienced therapist, the SMT is proven to be a reliable tool. It is unclear whether posture, movement, gait, and respiration are related to the degree of functional disability in patients with chronic pain. The objective of this study was to assess the reliability and convergent validity of the SMT in a heterogeneous sample of 50 patients with chronic pain. Internal consistency was determined by Cronbach's α and interrater reliability by the intraclass correlation coefficient (ICC). Convergent validity was assessed by determining the Spearman rank correlation coefficient between the movement quality measured in the SMT and functional limitation measured on the disability rating index (DRI). The internal consistency was Cronbach's α 0.91. Substantial reliability was found for the items: movement (ICC = 0.68), gait (ICC = 0.69), sitting posture (ICC = 0.63), and respiration (ICC = 0.64). Insufficient reliability was found for standing posture (ICC = 0.23). A moderate correlation was found between average test score SMT and the DRI (r = -0.37) and respiration and DRI (r = -0.45). The SMT is a reasonably reliable tool to assess movement, gait, sitting posture, and respiration. None of the items in the domain standing posture has sufficient reliability. A thorough study of this domain should be considered. The results show little evidence for convergent validity. Several items of the SMT correlated moderately with functional limitation with the DRI. These items were global movement, hip flexion, pelvis rotation, and all respiration items.
Reliability of the information about the history of diagnosis and treatment of hypertension. Differences in regard to sex, age, and educational level. The pró-saúde study

Directory of Open Access Journals (Sweden)

Faerstein Eduardo

2001-01-01

Full Text Available OBJECTIVE: To assess the intraobserver reliability of the information about the history of diagnosis and treatment of hypertension. METHODS: A multidimensional health questionnaire, which was filled out by the interviewees, was applied twice with an interval of 2 weeks, in July '99, to 192 employees of the University of the State of Rio de Janeiro (UERJ, stratified by sex, age, and educational level. The intraobserver reliability of the answers provided was estimated by the kappa statistic and by the coefficient of intraclass correlation (CICC. RESULTS: The general kappa (k statistic was 0.75 (95% CI=0.73-0.77. Reliability was higher among females (k=0.88, 95% CI=0.85-0.91 than among males (k=0.62, 95% CI=0.59-0.65.The reliability was higher among individuals 40 years of age or older (k=0.79; 95% CI=0.73-0.84 than those from 18 to 39 years (k=0.52; 95% CI=0.45-0.57. Finally, the kappa statistic was higher among individuals with a university educational level (k=0.86; 95% CI=0.81-0.91 than among those with high school educational level (k=0.61; 95% CI=0.53-0.70 or those with middle school educational level (k=0.68; 95% CI=0.64-0.72. The coefficient of intraclass correlation estimated by the intraobserver agreement in regard to age at the time of the diagnosis of hypertension was 0.74. A perfect agreement between the 2 answers (k=1.00 was observed for 22 interviewees who reported prior prescription of antihypertensive medication. CONCLUSION: In the population studied, estimates of the reliability of the history of medical diagnosis of hypertension and its treatment ranged from substantial to almost perfect reliability.
Strength and Pain Threshold Handheld Dynamometry Test Reliability in Patellofemoral Pain.

Science.gov (United States)

van der Heijden, R A; Vollebregt, T; Bierma-Zeinstra, S M A; van Middelkoop, M

2015-12-01

Patellofemoral pain syndrome (PFPS), characterized by peri- and retropatellar pain, is a common disorder in young, active people. The etiology is unclear; however, quadriceps strength seems to be a contributing factor, and sensitization might play a role. The study purpose is determining the inter-rater reliability of handheld dynamometry to test both quadriceps strength and pressure pain threshold (PPT), a measure for sensitization, in patients with PFPS. This cross-sectional case-control study comprises 3 quadriceps strength and one PPT measurements performed by 2 independent investigators in 22 PFPS patients and 16 matched controls. Inter-rater reliability was analyzed using intraclass correlation coefficients (ICC) and Bland-Altman plots. Inter-rater reliability of quadriceps strength testing was fair to good in PFPS patients (ICC=0.72) and controls (ICC=0.63). Bland-Altman plots showed an increased difference between assessors when average quadriceps strength values exceeded 250 N. Inter-rater reliability of PPT was excellent in patients (ICC=0.79) and fair to good in controls (ICC=0.52). Handheld dynamometry seems to be a reliable method to test both quadriceps strength and PPT in PFPS patients. Inter-rater reliability was higher in PFPS patients compared to control subjects. With regard to quadriceps testing, a higher variance between assessors occurs when quadriceps strength increases. © Georg Thieme Verlag KG Stuttgart · New York.
The Test-Retest Reliability of New Generation Power Indices of Wingate All-Out Test

Directory of Open Access Journals (Sweden)

Ozgur Ozkaya

2018-04-01

Full Text Available Although reliability correlations of traditional power indices of the Wingate test have been well documented, no study has analyzed new generation power indices based on milliseconds obtained from a Peak Bike. The purpose of this study was to investigate the retest reliability of new generation power indices. Thirty-two well-trained male athletes who were specialized in basketball, football, tennis, or track and field volunteered to take part in the study (age: 24.3 ± 2.2 years; body mass: 77 ± 8.3 kg; height: 180.3 ± 6.3 cm. Participants performed two Wingate all-out sessions on two separate days. Intra-class correlation coefficient (ICC, standard error measurement (SEM, smallest real differences (SRD and coefficient of variation (CV scores were analyzed based on the test and retest data. Reliability results of traditional power indices calculated based on 5-s means such as peak power, average power, power drop, and fatigue index ratio were similar with the previous findings in literature (ICC ≥ 0.94; CV ≤ 2.8%; SEM ≤ 12.28; SRD% ≤ 7.7%. New generation power indices such as peak power, average power, lowest power, power drop, fatigue index, power decline, maximum speed as rpm, and amount of total energy expenditure demonstrated high reliability (ICC ≥ 0.94; CV ≤ 4.3%; SEM ≤ 10.36; SRD% ≤ 8.8%. Time to peak power, time at maximum speed, and power at maximum speed showed a moderate level of reliability (ICC ≥ 0.73; CV ≤ 8.9%; SEM ≤ 63.01; SRD% ≤ 22.4%. The results of this study indicate that reliability correlations and SRD% of new generation power and fatigue-related indices are similar with traditional 5-s means. However, new time-related indices are very sensitive and moderately reliable.
Intra- and interobserver reliability of quantitative ultrasound measurement of the plantar fascia.

Science.gov (United States)

Rathleff, Michael Skovdal; Moelgaard, Carsten; Lykkegaard Olesen, Jens

2011-01-01

To determine intra- and interobserver reliability and measurement precision of sonographic assessment of plantar fascia thickness when using one, the mean of two, or the mean of three measurements. Two experienced observers scanned 20 healthy subjects twice with 60 minutes between test and retest. A GE LOGIQe ultrasound scanner was used in the study. The built-in software in the scanner was used to measure the thickness of the plantar fascia (PF). Reliability was calculated using intraclass correlation coefficient (ICC) and limits of agreement (LOA). Intraobserver reliability (ICC) using one measurement was 0.50 for one observer and 0.52 for the other, and using the mean of three measurements intraobserver reliability increased up to 0.77 and 0.67, respectively. Interobserver reliability (ICC) when using one measurement was 0.62 and increased to 0.82 when using the average of three measurements. LOA showed that when using the average of three measurements, LOA decreased to 0.6 mm, corresponding to 17.5% of the mean thickness of the PF. The results showed that reliability increases when using the mean of three measurements compared with one. Limits of agreement based on intratester reliability shows that changes in thickness that are larger than 0.6 mm can be considered actual changes in thickness and not a result of measurement error. Copyright © 2011 Wiley Periodicals, Inc.
Night-to-night arousal variability and interscorer reliability of arousal measurements.

Science.gov (United States)

Loredo, J S; Clausen, J L; Ancoli-Israel, S; Dimsdale, J E

1999-11-01

Measurement of arousals from sleep is clinically important, however, their definition is not well standardized, and little data exist on reliability. The purpose of this study is to determine factors that affect arousal scoring reliability and night-to-night arousal variability. The night-to-night arousal variability and interscorer reliability was assessed in 20 subjects with and without obstructive sleep apnea undergoing attended polysomnography during two consecutive nights. Five definitions of arousal were studied, assessing duration of electroencephalographic (EEG) frequency changes, increases in electromyographic (EMG) activity and leg movement, association with respiratory events, as well as the American Sleep Disorders Association (ASDA) definition of arousals. NA. NA. NA. Interscorer reliability varied with the definition of arousal and ranged from an Intraclass correlation (ICC) of 0.19 to 0.92. Arousals that included increases in EMG activity or leg movement had the greatest reliability, especially when associated with respiratory events (ICC 0.76 to 0.92). The ASDA arousal definition had high interscorer reliability (ICC 0.84). Reliability was lowest for arousals consisting of EEG changes lasting <3 seconds (ICC 0.19 to 0.37). The within subjects night-to-night arousal variability was low for all arousal definitions In a heterogeneous population, interscorer arousal reliability is enhanced by increases in EMG activity, leg movements, and respiratory events and decreased by short duration EEG arousals. The arousal index night-to-night variability was low for all definitions.
Reliability of radiographic measurement of lateral capitellohumeral angle in healthy children.

Science.gov (United States)

Hasegawa, Masaki; Suzuki, Taku; Kuroiwa, Takashi; Oka, Yusuke; Maeda, Atsushi; Takeda, Hiroki; Shizu, Kanae; Tsuji, Takashi; Suzuki, Katsuji; Yamada, Harumoto

2018-04-01

This retrospective cohort study was designed to validate the reliability of measurement of the lateral capitellohumeral angle (LCHA), an index of sagittal angulation of the elbow, in healthy children. The results were compared to the Baumann angle (BA), which is a similar concept to LCHA.Sixty-two radiographs of the elbow in healthy children (range, 2-11 years) were reviewed by 6 examiners at 2 sessions. The mean value and reliability of the measurement of LCHA and BA were assessed. Intraobserver reliability and interobserver reliability were calculated using intraclass correlation coefficients (ICCs).The mean LCHA value was 45° (range, 22° to 70°) and the mean BA was 71° (range, 56° to 86°). The ICCs for intraobserver reliability of the LCHA measurements were almost perfect for 2 examiners, substantial for 3 examiners, and moderate for 1 examiner with a mean value of 0.77 (range, 0.57-0.95). For BA measurements, the ICCs were almost perfect for 1 examiner and substantial for 5 examiners with a mean value of 0.74 (range, 0.66-0.83). The ICCs for interobserver reliability between the first and second measurements were both moderate for LCHA (0.56 and 0.51) and for BA (0.52 and 0.50).LCHA showed almost the same reliability in measurement as BA, which is the gold standard assessment for coronal alignment of the elbow. LCHA showed moderate-to-good reliability in the evaluation of sagittal plane elbow alignment.
The Reliability of Anthropometric Measurements Used Preoperatively in Aesthetic Breast Surgery.

Science.gov (United States)

Isaac, Kathryn V; Murphy, Blake D; Beber, Brett; Brown, Mitchell

2016-04-01

Patient outcomes in aesthetic breast surgery are highly dependent on breast measurements used in preoperative planning. The purpose of this study is to determine the reliability of anthropometric breast measurements. Four raters measured 28 women using 7 measurements: sternal notch to nipple distance (Sn-N), nipple to midline (N-M), nipple to inframammary-fold distance under maximal stretch (N-IMF), breast base width (BW), soft tissue pinch thickness of the upper pole (STPT:UP), STPT at the inframammary fold (STPT:IMF), and anterior pull skin stretch (APSS). Reliability was assessed using intra-class correlation coefficients (ICCs). Inter-rater reliability was excellent for Sn-N, N-M, and BW (ICC = 0.94, 0.90, and 0.76, respectively) and was good for N-IMF (ICC = 0.70). The STPT:UP, STPT:IMF, and APSS measurements were not reliable between raters (ICC reliability was excellent for Sn-N, N-M, and BW for all raters (all ICC > 0.75). The N-IMF intra-rater reliability was excellent in senior raters (ICC > 0.75) and good in junior raters (ICC > 0.6). The STPT:UP, STPT:IMF, and APSS measurements showed fair or poor reliability for most raters (ICC reliable. Dynamic measurements including APSS, STPT:UP, and STUP:IMF are unreliable. N-IMF is the only reliable dynamic measurement, and its reliability improves with increasing clinical experience. The variable reliability of preoperative measurements must be considered in the planning of aesthetic breast surgery. 4 Diagnostic. © 2015 The American Society for Aesthetic Plastic Surgery, Inc. Reprints and permission: journals.permissions@oup.com.
The gothic arch: a reliable measurement for developmental dysplasia of the hip.

Science.gov (United States)

Herickhoff, Paul K; O'Brien, Megan K; Dolan, Lori A; Morcuende, Jose A; Peterson, Jonathan B; Weinstein, Stuart L

2013-01-01

The "Gothic Arch" is a radio-graphic finding on AP pelvis x-rays postulated to be predictive of hip osteoarthritis. The purpose of this study was to determine the reliability of measurement of the Gothic Arch in patients with no known hip pathology and patients with unilateral developmental dysplasia of the hip (DDH). After obtaining IRB approval, nine skeletally mature patients (18 hips) with no known hip pathology were selected to serve as the control group. The AP pelvis x-rays at skeletal maturity of eight patients (16 hips) with unilateral DDH treated with closed reduction and casting comprised the comparison group. A digitizing program was designed to measure the Gothic Arch based on landmarks identified by the user. Two pediatric orthopaedic surgeons and two orthopaedic residents completed the program on two separate occasions. Intra-and interobserver reliability were determined using intraclass cor-relation coefficients (ICC) for continuous variables. Both the unilateral DDH group and the control group demonstrated excellent inter- and intraobserver reliability (ICC >0.70) for base, height, area, and orientation of the Gothic Arch, but poor reliability (ICC Gothic Arch can be reliably measured on AP pelvis x-rays of patients with normal and dysplastic hips. III, Diagnostic study. See the Guidelines for Authors for a complete description of levels of evidence.

Five times sit-to-stand test in subjects with total knee replacement: Reliability and relationship with functional mobility tests.

Science.gov (United States)

Medina-Mirapeix, Francesc; Vivo-Fernández, Iván; López-Cañizares, Juan; García-Vidal, José A; Benítez-Martínez, Josep Carles; Del Baño-Aledo, María Elena

2018-01-01

The objective was to determine the inter-observer and test/retest reliability of the "Five-repetition sit-to-stand" (5STS) test in patients with total knee replacement (TKR). To explore correlation between 5STS and two mobility tests. A reliability study was conducted among 24 (mean age 72.13, S.D. 10.67; 50% were women) outpatients with TKR. They were recruited from a traumatology unit of a public hospital via convenience sampling. A physiotherapist and trauma physician assessed each patient at the same time. The same physiotherapist realized a 5STS second measurement 45-60min after the first one. Reliability was assessed with intraclass correlation coefficients (ICCs) and Bland-Altman plots. Pearson coefficient was calculated to assess the correlation between 5STS, time up to go test (TUG) and four meters gait speed (4MGS). ICC for inter-observer and test-retest reliability of the 5STS were 0.998 (95% confidence interval [CI], 0.995-0.999) and 0.982 (95% CI, 0.959-0.992). Bland-Altman plot inter-observer showed limits between -0.82 and 1.06 with a mean of 0.11 and no heteroscedasticity within the data. Bland-Altman plot for test-retest showed the limits between 1.76 and 4.16, a mean of 1.20 and heteroscedasticity within the data. Pearson correlation coefficient revealed significant correlation between 5STS and TUG (r=0.7, ptest-retest reliability when it is used in people with TKR, and also significant correlation with other functional mobility tests. These findings support the use of 5STS as outcome measure in TKR population. Copyright © 2017 Elsevier B.V. All rights reserved.
Test-retest reliability of the Military Pre-training Questionnaire.

Science.gov (United States)

Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

2010-09-01

Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Reliability and validity of the French-Canadian version of the scoliosis research society 22 questionnaire in France.

Science.gov (United States)

Lonjon, Guillaume; Ilharreborde, Brice; Odent, Thierry; Moreau, Sébastien; Glorion, Christophe; Mazda, Keyvan

2014-01-01

Outcome study to determine the internal consistency, reproducibility, and concurrent validity of the French-Canadian version of the Scoliosis Research Society 22 (SRS-22 fcv) patient questionnaire in France. To determine whether the SRS-22 fcv can be used in a population from France. The SRS-22 has been translated and validated in multiple countries, notably in the French-Canadian language in Quebec, Canada. Use of SRS-22 fcv seems appropriate for evaluating adolescent idiopathic scoliosis in France. However, French-Canadian French is noticeably different from the French spoken in France, and no study has investigated the use of a French-Canadian version of a health-quality questionnaire in another French population. The methods used for validating the SRS-22 fcv in Quebec were adopted for use with a group of 200 adolescents with idiopathic scoliosis and 60 healthy adolescents in France. Reliability and reproducibility were measured by the Cronbach α and intraclass correlation coefficient (ICC), construct validity by factorial analysis, concurrent validity by the Short-Form of the survey, and discriminant validity by analysis of variance and multivariate linear regression. In France, the SRS-22 fcv showed good global internal consistency (Cronbach α = 0.87, intraclass correlation coefficient = 0.92), a coherent factorial structure, and high correlation coefficients between the SRS-22 fcv and Short-Form of the survey (P < 0.001). However, reliability and validity were slightly less than that for the instrument's original validation and the validation of the SRS-22 fcv in Quebec. These differences could be explained by language and cultural differences. The SRS-22 fcv is relevant for use in France, but further development and validation of a specific French questionnaire remain necessary to improve the assessment of functional outcomes of adolescents with scoliosis in France. N/A.
Reliability and validity of the Children's Fear Survey Schedule-Dental Subscale for Arabic-speaking children: a cross-sectional study.

Science.gov (United States)

El-Housseiny, Azza A; Alsadat, Farah A; Alamoudi, Najlaa M; El Derwi, Douaa A; Farsi, Najat M; Attar, Moaz H; Andijani, Basil M

2016-04-14

Early recognition of dental fear is essential for the effective delivery of dental care. This study aimed to test the reliability and validity of the Arabic version of the Children's Fear Survey Schedule-Dental Subscale (CFSS-DS). A school-based sample of 1546 children was randomly recruited. The Arabic version of the CFSS-DS was completed by children during class time. The scale was tested for internal consistency and test-retest reliability. To test criterion validity, children's behavior was assessed using the Frankl scale during dental examination, and results were compared with children's CFSS-DS scores. To test the scale's construct validity, scores on "fear of going to the dentist soon" were correlated with CFSS-DS scores. Factor analysis was also used. The Arabic version of the CFSS-DS showed high reliability regarding both test-retest reliability (intraclass correlation = 0.83, p children with negative behavior had significantly higher fear scores (t = 13.67, p fear of invasive dental procedures," "fear of less invasive dental procedures" and "fear of strangers." The Arabic version of the CFSS-DS is a reliable and valid measure of dental fear in Arabic-speaking children. Pediatric dentists and researchers may use this validated version of the CFSS-DS to measure dental fear in Arabic-speaking children.
Intrarater Reliability and Other Psychometrics of the Health Promoting Activities Scale (HPAS).

Science.gov (United States)

Muskett, Rachel; Bourke-Taylor, Helen; Hewitt, Alana

The Health Promoting Activities Scale (HPAS) measures the self-rated frequency with which adults participate in activities that promote health. We evaluated the internal consistency, construct validity, and intrarater reliability of the HPAS with a cohort of mothers (N = 56) of school-age children. We used an online survey that included the HPAS and measures of mental and physical health. Statistical analysis included intraclass correlation coefficients (ICCs), measurement error, error range, limits of agreement, and minimum detectable change (MDC). The HPAS showed good internal consistency (Cronbach's α = .73). Construct validity was supported by a significant difference in HPAS scores among participants grouped by physical activity level; no other differences were significant. Results included a high aggregate ICC of .90 and an MDC of 5 points. Our evaluation of the HPAS revealed good reliability and stability, suggesting suitability for ongoing evaluation as an outcome measure. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Reliability, validity, and significance of assessment of sense of contribution in the workplace.

Science.gov (United States)

Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito

2014-01-29

The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%-80.2%). Fifty-four workers were included in the analysis of test-retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach's α coefficients in men and women were 0.85 and 0.86, respectively) and test-retest reliability (intraclass correlation coefficient = 0.91). Significant (p workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Interrater reliability of Violence Risk Appraisal Guide scores provided in Canadian criminal proceedings.

Science.gov (United States)

Edens, John F; Penson, Brittany N; Ruchensky, Jared R; Cox, Jennifer; Smith, Shannon Toney

2016-12-01

Published research suggests that most violence risk assessment tools have relatively high levels of interrater reliability, but recent evidence of inconsistent scores among forensic examiners in adversarial settings raises concerns about the "field reliability" of such measures. This study specifically examined the reliability of Violence Risk Appraisal Guide (VRAG) scores in Canadian criminal cases identified in the legal database, LexisNexis. Over 250 reported cases were located that made mention of the VRAG, with 42 of these cases containing 2 or more scores that could be submitted to interrater reliability analyses. Overall, scores were skewed toward higher risk categories. The intraclass correlation (ICCA1) was .66, with pairs of forensic examiners placing defendants into the same VRAG risk "bin" in 68% of the cases. For categorical risk statements (i.e., low, moderate, high), examiners provided converging assessment results in most instances (86%). In terms of potential predictors of rater disagreement, there was no evidence for adversarial allegiance in our sample. Rater disagreement in the scoring of 1 VRAG item (Psychopathy Checklist-Revised; Hare, 2003), however, strongly predicted rater disagreement in the scoring of the VRAG (r = .58). (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Stair negotiation in women with fibromyalgia: A descriptive correlational study.

Science.gov (United States)

Collado-Mateo, Daniel; Domínguez-Muñoz, Francisco J; Olivares, Pedro R; Adsuar, José C; Gusi, Narcis

2017-10-01

Walking up and down stairs is a common and important activity of daily living. Women with fibromyalgia often show a reduced ability to perform this task.The objective of this study was to evaluate the test-retest reliability of stair negotiation tasks and to assess the impact of fibromyalgia symptoms on the ability to negotiate stairs.Forty-two women with fibromyalgia participated in this descriptive correlational study. The relevance of the stair negotiation (both walking up and down) was evaluated by assessing its association with the revised version of the fibromyalgia impact questionnaire (FIQ-R) and other health-related variables. Test-retest reliability was also analyzed. The main outcome measures were time spent walking up and down stairs and impact of fibromyalgia, quality of life, number of falls, weight, and lower limb strength and endurance.The intraclass correlation coefficient (ICC) for stair descent was 0.929 whereas that for ascent was 0.972. The score in these tests correlated significantly with the total score for the FIQ-R and the score for many of dimensions and symptoms: that is, physical function, overall impact of fibromyalgia, pain, energy, stiffness, restorative sleep, tenderness, self-perceived balance problems, and sensitivity.Given the importance of the stair negotiation as activity of daily living and the high reliability, both stair ascent and descent tasks may be useful as outcome measures in studies on patients with fibromyalgia.
Reliability Assessment of IGBT Modules Modeled as Systems with Correlated Components

DEFF Research Database (Denmark)

Kostandyan, Erik; Sørensen, John Dalsgaard

2013-01-01

configuration. The estimated system reliability by the proposed method is a conservative estimate. Application of the suggested method could be extended for reliability estimation of systems composing of welding joints, bolts, bearings, etc. The reliability model incorporates the correlation between...... was applied for the systems failure functions estimation. It is desired to compare the results with the true system failure function, which is possible to estimate using simulation techniques. Theoretical model development should be applied for the further research. One of the directions for it might...... be modeling the system based on the Sequential Order Statistics, by considering the failure of the minimum (weakest component) at each loading level. The proposed idea to represent the system by the independent components could also be used for modeling reliability by Sequential Order Statistics....
Development and Reliability Testing of a Fast-Food Restaurant Observation Form.

Science.gov (United States)

Rimkus, Leah; Ohri-Vachaspati, Punam; Powell, Lisa M; Zenk, Shannon N; Quinn, Christopher M; Barker, Dianne C; Pugach, Oksana; Resnick, Elissa A; Chaloupka, Frank J

2015-01-01

To develop a reliable observational data collection instrument to measure characteristics of the fast-food restaurant environment likely to influence consumer behaviors, including product availability, pricing, and promotion. The study used observational data collection. Restaurants were in the Chicago Metropolitan Statistical Area. A total of 131 chain fast-food restaurant outlets were included. Interrater reliability was measured for product availability, pricing, and promotion measures on a fast-food restaurant observational data collection instrument. Analysis was done with Cohen's κ coefficient and proportion of overall agreement for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. Interrater reliability, as measured by average κ coefficient, was .79 for menu characteristics, .84 for kids' menu characteristics, .92 for food availability and sizes, .85 for beverage availability and sizes, .78 for measures on the availability of nutrition information,.75 for characteristics of exterior advertisements, and .62 and .90 for exterior and interior characteristics measures, respectively. For continuous measures, average ICC was .88 for food pricing measures, .83 for beverage prices, and .65 for counts of exterior advertisements. Over 85% of measures demonstrated substantial or almost perfect agreement. Although some measures required revision or protocol clarification, results from this study suggest that the instrument may be used to reliably measure the fast-food restaurant environment.
Validity and Reliability of a Portable Balance Tracking System, BTrackS, in Older Adults.

Science.gov (United States)

Levy, Susan S; Thralls, Katie J; Kviatkovsky, Shiloah A

Falls are the leading cause of disability, injury, hospital admission, and injury-related death among older adults. Balance limitations have consistently been identified as predictors of falls and increased fall risk. Field measures of balance are limited by issues of subjectivity, ceiling effects, and low sensitivity to change. The gold standard for measuring balance is the force plate; however, its field use is untenable due to high cost and lack of portability. Thus, a critical need is observed for valid objective field measures of balance to accurately assess balance and identify limitations over time. The purpose of this study was to examine the concurrent validity and 3-day test-retest reliability of Balance Tracking System (BTrackS) in community-dwelling older adults. Minimal detectable change values were also calculated to reflect changes in balance beyond measurement error. Postural sway data were collected from community-dwelling older adults (N = 49, mean [SD] age = 71.3 [7.3] years) with a force plate and BTrackS in multitrial eyes open (EO) and eyes closed (EC) static balance conditions. Force sensors transmitted BTrackS data via a USB to a computer running custom software. Three approaches to concurrent validity were taken including calculation of Pearson product moment correlation coefficients, repeated-measures ANOVAs, and Bland-Altman plots. Three-day test-retest reliability of BTrackS was examined in a second sample of 47 community-dwelling older adults (mean [SD] age = 75.8 [7.7] years) using intraclass correlation coefficients and MDC values at 95% CI (MDC95) were calculated. BTrackS demonstrated good validity using Pearson product moment correlations (r > 0.90). Repeated-measures ANOVA and Bland-Altman plots indicated some BTrackS bias with center of pressure (COP) values higher than FP COP values in the EO (mean [SD] bias = 4.0 [6.8]) and EC (mean [SD] bias = 9.6 [12.3]) conditions. Test-retest reliability using intraclass correlation
Validity and reliability of Preschool Language Scale 4 for measuring language development in children 48-59 months of age

Directory of Open Access Journals (Sweden)

Nuryani Sidarta

2016-04-01

Full Text Available Prevalence rates for speech and language delay have been reported across wide ranges. Speech and language delay affects 5% to 8% of preschool children, often persisting into the school years. A cross-sectional study was conducted in 208 children aged 48-59 months to determine the validity and reliability of the Indonesian edition of the Preschool Language Scale version 4 (PLS4 as a screening tool for the identification of language development disorders. Construct validity was examined by using Pearson correlation coefficient. Internal consistency was tested and repeated measurements were taken to establish the stability coefficient and intraclass correlation coefficients (ICC for test-retest reliability. For construct validity, the Pearson correlation coefficient ranged from 0.151-0.526, indicating that all questions in this instrument were valid for measuring auditory comprehension (AC and expressive communication skills (EC. Cronbach’s alpha level ranged from 0.81-0.95 with standard error of measurement (SEM ranging from 3.1-3.3. Stability coefficients ranged from 0.98-.0.99 with ICC coefficient ranging from 0.97-0.99 both of which showed an excellent reliability. This study found that PLS-4 is a valid and reliable instrument. It is easy to handle and can be recommended for assessing language development in children aged 48-59 months.
Reliability of histologic assessment in patients with eosinophilic oesophagitis.

Science.gov (United States)

Warners, M J; Ambarus, C A; Bredenoord, A J; Verheij, J; Lauwers, G Y; Walsh, J C; Katzka, D A; Nelson, S; van Viegen, T; Furuta, G T; Gupta, S K; Stitt, L; Zou, G; Parker, C E; Shackelton, L M; D Haens, G R; Sandborn, W J; Dellon, E S; Feagan, B G; Collins, M H; Jairath, V; Pai, R K

2018-04-01

The validity of the eosinophilic oesophagitis (EoE) histologic scoring system (EoEHSS) has been demonstrated, but only preliminary reliability data exist. Formally assess the reliability of the EoEHSS and additional histologic features. Four expert gastrointestinal pathologists independently reviewed slides from adult patients with EoE (N = 45) twice, in random order, using standardised training materials and scoring conventions for the EoEHSS and additional histologic features agreed upon during a modified Delphi process. Intra- and inter-rater reliability for scoring the EoEHSS, a visual analogue scale (VAS) of overall histopathologic disease severity, and additional histologic features were assessed using intra-class correlation coefficients (ICCs). Almost perfect intra-rater reliability was observed for the composite EoEHSS scores and the VAS. Inter-rater reliability was also almost perfect for the composite EoEHSS scores and substantial for the VAS. Of the EoEHSS items, eosinophilic inflammation was associated with the highest ICC estimates and consistent with almost perfect intra- and inter-rater reliability. With the exception of dyskeratotic epithelial cells and surface epithelial alteration, ICC estimates for the remaining EoEHSS items were above the benchmarks for substantial intra-rater, and moderate inter-rater reliability. Estimation of peak eosinophil count and number of lamina propria eosinophils were associated with the highest ICC estimates among the exploratory items. The composite EoEHSS and most component items are associated with substantial reliability when assessed by central pathologists. Future studies should assess responsiveness of the score to change after a therapeutic intervention to facilitate its use in clinical trials. © 2018 John Wiley & Sons Ltd.
The 6-min push test is reliable and predicts low fitness in spinal cord injury.

Science.gov (United States)

Cowan, Rachel E; Callahan, Morgan K; Nash, Mark S

2012-10-01

The objective of this study is to assess 6-min push test (6MPT) reliability, determine whether the 6MPT is sensitive to fitness differences, and assess if 6MPT distance predicts fitness level in persons with spinal cord injury (SCI) or disease. Forty individuals with SCI who could self-propel a manual wheelchair completed an incremental arm crank peak oxygen consumption assessment and two 6MPTs across 3 d (37% tetraplegia (TP), 63% paraplegia (PP), 85% men, 70% white, 63% Hispanic, mean age = 34 ± 10 yr, mean duration of injury = 13 ± 10 yr, and mean body mass index = 24 ± 5 kg.m). Intraclass correlation and Bland-Altman plots assessed 6MPT distance (m) reliability. Mann-Whitney U test compared 6MPT distance (m) of high and low fitness groups for TP and PP. The fitness status prediction was developed using N = 30 and validated in N = 10 (validation group (VG)). A nonstatistical prediction approach, below or above a threshold distance (TP = 445 m and PP = 604 m), was validated statistically by binomial logistic regression. Accuracy, sensitivity, and specificity were computed to evaluate the threshold approach. Intraclass correlation coefficients exceeded 0.90 for the whole sample and the TP/PP subsets. High fitness persons propelled farther than low fitness persons for both TP/PP (both P < 0.05). Binomial logistic regression (P < 0.008) predicted the same fitness levels in the VG as the threshold approach. In the VG, overall accuracy was 70%. Eighty-six percent of low fitness persons were correctly identified (sensitivity), and 33% of high fitness persons were correctly identified (specificity). The 6MPT may be a useful tool for SCI clinicians and researchers. 6MPT distance demonstrates excellent reliability and is sensitive to differences in fitness level. 6MPT distances less than a threshold distance may be an effective approach to identify low fitness in person with SCI.
The Swedish Exercise Self-Efficacy Scale (ESES-S): reliability and validity in a rheumatoid arthritis population.

Science.gov (United States)

Nessen, Thomas; Demmelmaier, Ingrid; Nordgren, Birgitta; Opava, Christina H

2015-01-01

The aim of the present study was to investigate aspects of reliability and validity of the Exercise Self-Efficacy Scale (ESES-S) in a rheumatoid arthritis (RA) population. A total of 244 people with RA participating in a physical activity study were included. The six-item ESES-S, exploring confidence in performing exercise, was assessed for test-retest reliability over 4-6 months, and for internal consistency. Construct validity investigated correlation with similar and other constructs. An intraclass correlation coefficient (ICC) of 0.59 (95% CI 0.37-0.73) was found for 84 participants with stable health perceptions between measurement occasions. Cronbach's alpha coefficients of 0.87 and 0.89 were found at the first and second measurements. Corrected item-total correlation single ESES-S items ranged between 0.53 and 0.73. Construct convergent validity for the ESES-S was partly confirmed by correlations with health-enhancing physical activity and outcome expectations respectively (Pearson's r = 0.18, p exercise is crucial for management of symptoms and co-morbidity in rheumatoid arthritis. Self-efficacy for exercise is important to address in rehabilitation as it regulates exercise motivation and behavior. Measurement properties of self-efficacy scales need to be assessed in specific populations and different languages.
Reliability and prevalence of physical performance examination assessing mobility and balance in older persons in the US: data from the Third National Health and Nutrition Examination Survey.

Science.gov (United States)

Ostchega, Y; Harris, T B; Hirsch, R; Parsons, V L; Kington, R; Katzoff, M

2000-09-01

This report provides reliability and prevalence estimates by sex, age, and race/ethnicity of an observed physical performance examination (PPE) assessing mobility and balance. The Third National Health and Nutrition Examination Survey (NHANES III) 1988-1994. A cross-sectional nationally representative survey. All persons aged 60 and older (n = 5,403) who performed the PPE either in the mobile examination center (MEC) or in the home during NHANES III (conducted 1988-1994). The PPE included timed chair stand, full tandem stand, and timed 8-foot walk. Timed chair stand and 8-foot timed walk were reliable measurements (Intraclass Correlations > 0.5). Women were significantly slower (P physically limited than men.
Reliability of conditioned pain modulation: a systematic review

Science.gov (United States)

Kennedy, Donna L.; Kemp, Harriet I.; Ridout, Deborah; Yarnitsky, David; Rice, Andrew S.C.

2016-01-01

Abstract A systematic literature review was undertaken to determine if conditioned pain modulation (CPM) is reliable. Longitudinal, English language observational studies of the repeatability of a CPM test paradigm in adult humans were included. Two independent reviewers assessed the risk of bias in 6 domains; study participation; study attrition; prognostic factor measurement; outcome measurement; confounding and analysis using the Quality in Prognosis Studies (QUIPS) critical assessment tool. Intraclass correlation coefficients (ICCs) less than 0.4 were considered to be poor; 0.4 and 0.59 to be fair; 0.6 and 0.75 good and greater than 0.75 excellent. Ten studies were included in the final review. Meta-analysis was not appropriate because of differences between studies. The intersession reliability of the CPM effect was investigated in 8 studies and reported as good (ICC = 0.6-0.75) in 3 studies and excellent (ICC > 0.75) in subgroups in 2 of those 3. The assessment of risk of bias demonstrated that reporting is not comprehensive for the description of sample demographics, recruitment strategy, and study attrition. The absence of blinding, a lack of control for confounding factors, and lack of standardisation in statistical analysis are common. Conditioned pain modulation is a reliable measure; however, the degree of reliability is heavily dependent on stimulation parameters and study methodology and this warrants consideration for investigators. The validation of CPM as a robust prognostic factor in experimental and clinical pain studies may be facilitated by improvements in the reporting of CPM reliability studies. PMID:27559835
Evidence for validity and reliability of a french version of the FAAM

Directory of Open Access Journals (Sweden)

Ballabeni Pierluigi

2011-02-01

Full Text Available Abstract Background The Foot and Ankle Ability Measure (FAAM is a self reported questionnaire for patients with foot and ankle disorders available in English, German, and Persian. This study plans to translate the FAAM from English to French (FAAM-F and assess the validity and reliability of this new version. Methods The FAAM-F Activities of Daily Living (ADL and sports subscales were completed by 105 French-speaking patients (average age 50.5 years presenting various chronic foot and ankle disorders. Convergent and divergent validity was assessed by Pearson's correlation coefficients between the FAAM-F subscales and the SF-36 scales: Physical Functioning (PF, Physical Component Summary (PCS, Mental Health (MH and Mental Component Summary (MCS. Internal consistency was calculated by Cronbach's Alpha (CA. To assess test re-test reliability, 22 patients filled out the questionnaire a second time to estimate minimal detectable changes (MDC and intraclass correlation coefficients (ICC. Results Correlations for FAAM-F ADL subscale were 0.85 with PF, 0.81 with PCS, 0.26 with MH, 0.37 with MCS. Correlations for FAAM-F Sports subscale were 0.72 with PF, 0.72 with PCS, 0.21 with MH, 0.29 with MCS. CA estimates were 0.97 for both subscales. Respectively for the ADL and Sports subscales, ICC were 0.97 and 0.94, errors for a single measure were 8 and 10 points at 95% confidence and the MDC values at 95% confidence were 7 and 18 points. Conclusion The FAAM-F is valid and reliable for the self-assessment of physical function in French-speaking patients with a wide range of chronic foot and ankle disorders.
Validity and reliability of a new tool to evaluate handwriting difficulties in Parkinson's disease.

Directory of Open Access Journals (Sweden)

Evelien Nackaerts

Full Text Available Handwriting in Parkinson's disease (PD features specific abnormalities which are difficult to assess in clinical practice since no specific tool for evaluation of spontaneous movement is currently available.This study aims to validate the 'Systematic Screening of Handwriting Difficulties' (SOS-test in patients with PD.Handwriting performance of 87 patients and 26 healthy age-matched controls was examined using the SOS-test. Sixty-seven patients were tested a second time within a period of one month. Participants were asked to copy as much as possible of a text within 5 minutes with the instruction to write as neatly and quickly as in daily life. Writing speed (letters in 5 minutes, size (mm and quality of handwriting were compared. Correlation analysis was performed between SOS outcomes and other fine motor skill measurements and disease characteristics. Intrarater, interrater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC and Spearman correlation coefficient.Patients with PD had a smaller (p = 0.043 and slower (p 0.769 for both groups.The SOS-test is a short and effective tool to detect handwriting problems in PD with excellent reliability. It can therefore be recommended as a clinical instrument for standardized screening of handwriting deficits in PD.
Validity and reliability of a novel immunosuppressive adverse effects scoring system in renal transplant recipients.

Science.gov (United States)

Meaney, Calvin J; Arabi, Ziad; Venuto, Rocco C; Consiglio, Joseph D; Wilding, Gregory E; Tornatore, Kathleen M

2014-06-12

After renal transplantation, many patients experience adverse effects from maintenance immunosuppressive drugs. When these adverse effects occur, patient adherence with immunosuppression may be reduced and impact allograft survival. If these adverse effects could be prospectively monitored in an objective manner and possibly prevented, adherence to immunosuppressive regimens could be optimized and allograft survival improved. Prospective, standardized clinical approaches to assess immunosuppressive adverse effects by health care providers are limited. Therefore, we developed and evaluated the application, reliability and validity of a novel adverse effects scoring system in renal transplant recipients receiving calcineurin inhibitor (cyclosporine or tacrolimus) and mycophenolic acid based immunosuppressive therapy. The scoring system included 18 non-renal adverse effects organized into gastrointestinal, central nervous system and aesthetic domains developed by a multidisciplinary physician group. Nephrologists employed this standardized adverse effect evaluation in stable renal transplant patients using physical exam, review of systems, recent laboratory results, and medication adherence assessment during a clinic visit. Stable renal transplant recipients in two clinical studies were evaluated and received immunosuppressive regimens comprised of either cyclosporine or tacrolimus with mycophenolic acid. Face, content, and construct validity were assessed to document these adverse effect evaluations. Inter-rater reliability was determined using the Kappa statistic and intra-class correlation. A total of 58 renal transplant recipients were assessed using the adverse effects scoring system confirming face validity. Nephrologists (subject matter experts) rated the 18 adverse effects as: 3.1 ± 0.75 out of 4 (maximum) regarding clinical importance to verify content validity. The adverse effects scoring system distinguished 1.75-fold increased gastrointestinal adverse

The Validity and Reliability Test of the Indonesian Version of Gastroesophageal Reflux Disease Quality of Life (GERD-QOL) Questionnaire.

Science.gov (United States)

Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti

2017-01-01

to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; pGERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Reproducibility and interoperator reliability of obtaining images and measurements of the cervix and uterus with brachytherapy treatment applicators in situ using transabdominal ultrasound.

Science.gov (United States)

van Dyk, Sylvia; Garth, Margaret; Oates, Amanda; Kondalsamy-Chennakesavan, Srinivas; Schneider, Michal; Bernshaw, David; Narayan, Kailash

2016-01-01

To validate interoperator reliability of brachytherapy radiation therapists (RTs) in obtaining an ultrasound image and measuring the cervix and uterine dimensions using transabdominal ultrasound. Patients who underwent MRI with applicators in situ after the first insertion were included in the study. Imaging was performed by three RTs (RT1, RT2, and RT3) with varying degrees of ultrasound experience. All RTs were required to obtain a longitudinal planning image depicting the applicator in the uterine canal and measure the cervix and uterus. The MRI scan, taken 1 hour after the ultrasound, was used as the reference standard against which all measurements were compared. Measurements were analyzed with intraclass correlation coefficient and Bland-Altman plots. All RTs were able to obtain a suitable longitudinal image for each patient in the study. Mean differences (SD) between MRI and ultrasound measurements obtained by RTs ranged from 3.5 (3.6) to 4.4 (4.23) mm and 0 (3.0) to 0.9 (2.5) mm on the anterior and posterior surface of the cervix, respectively. Intraclass correlation coefficient for absolute agreement between MRI and RTs was >0.9 for all posterior measurement points in the cervix and ranged from 0.41 to 0.92 on the anterior surface. Measurements were not statistically different between RTs at any measurement point. RTs with variable training attained high levels of interoperator reliability when using transabdominal ultrasound to obtain images and measurements of the uterus and cervix with brachytherapy applicators in situ. Access to training and use of a well-defined protocol assist in achieving these high levels of reliability. Copyright © 2016 American Brachytherapy Society. Published by Elsevier Inc. All rights reserved.
Validity and reliability of a low-cost digital dynamometer for measuring isometric strength of lower limb.

Science.gov (United States)

Romero-Franco, Natalia; Jiménez-Reyes, Pedro; Montaño-Munuera, Juan A

2017-11-01

Lower limb isometric strength is a key parameter to monitor the training process or recognise muscle weakness and injury risk. However, valid and reliable methods to evaluate it often require high-cost tools. The aim of this study was to analyse the concurrent validity and reliability of a low-cost digital dynamometer for measuring isometric strength in lower limb. Eleven physically active and healthy participants performed maximal isometric strength for: flexion and extension of ankle, flexion and extension of knee, flexion, extension, adduction, abduction, internal and external rotation of hip. Data obtained by the digital dynamometer were compared with the isokinetic dynamometer to examine its concurrent validity. Data obtained by the digital dynamometer from 2 different evaluators and 2 different sessions were compared to examine its inter-rater and intra-rater reliability. Intra-class correlation (ICC) for validity was excellent in every movement (ICC > 0.9). Intra and inter-tester reliability was excellent for all the movements assessed (ICC > 0.75). The low-cost digital dynamometer demonstrated strong concurrent validity and excellent intra and inter-tester reliability for assessing isometric strength in the main lower limb movements.
Quantitative measurement of hypertrophic scar: interrater reliability and concurrent validity.

Science.gov (United States)

Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo

2008-01-01

Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit ( congruent with0.50). The individual subscales of the mVSS fell well below the acceptable level (0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars shows high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.
The interrater and intrarater reliability of the Philpott-Javer staging system based on level of training.

Science.gov (United States)

Parhar, Harman S; Thamboo, Andrew; Habib, Al-Rahim; Chang, Brent; Gan, Eng Cern; Javer, Amin R

2014-04-01

The Philpott-Javer postoperative endoscopic mucosal staging system for allergic fungal rhinosinusitis has previously demonstrated acceptable interrater reliability among rhinologists. There are, however, numerous learners involved in patient care at tertiary centers. This study aims to analyze the interrater and intrarater reliability of this system among learners in otolaryngology at different stages in training. A prospective analysis of retrospectively collected endoscopic photographs. A tertiary care teaching hospital (January 2013). Fifty patients undergoing routine follow-up. Three photographs from each of 50 patients undergoing routine postsurgical nasoendoscopy were reviewed. Images were played twice, 1 week apart, in 2 differently randomized cycles and scored according to Philpott-Javer criteria by a rhinologist, a rhinology fellow, a senior otolaryngology resident, a junior otolaryngology resident, and a medical student. Interobserver reliability was assessed using the intraclass correlation coefficient, while intrarater reliability was assessed by Shrout-Fleiss κ values. Agreement between each learner and the rhinologist was also assessed using κ values. The interclass correlation among the 5 raters was 0.7600 (95% confidence interval, 0.6917-0.8161) for the Philpott-Javer scoring system, suggesting substantial reliability. Intrarater data showed substantial to almost-perfect reliability (κ values between 0.668 and 0.815) among all raters using this system. There was also moderate to substantial agreement between the learners and the rhinologist (κ values between 0.534 and 0.710). Results suggest that the Philpott-Javer staging system has acceptable intrarater and interrater reliability among learners of differing levels of clinical experience and is suitable for evaluating progress following surgery.
Reliability and validity of two multidimensional self-reported physical activity questionnaires in people with chronic low back pain.

Science.gov (United States)

Carvalho, Flávia A; Morelhão, Priscila K; Franco, Marcia R; Maher, Chris G; Smeets, Rob J E M; Oliveira, Crystian B; Freitas Júnior, Ismael F; Pinto, Rafael Z

2017-02-01

Although there is some evidence for reliability and validity of self-report physical activity (PA) questionnaires in the general adult population, it is unclear whether we can assume similar measurement properties in people with chronic low back pain (LBP). To determine the test-retest reliability of the International Physical Activity Questionnaire (IPAQ) long-version and the Baecke Physical Activity Questionnaire (BPAQ) and their criterion-related validity against data derived from accelerometers in patients with chronic LBP. Cross-sectional study. Patients with non-specific chronic LBP were recruited. Each participant attended the clinic twice (one week interval) and completed self-report PA. Accelerometer measures >7 days included time spent in moderate-and-vigorous physical activity, steps/day, counts/minute, and vector magnitude counts/minute. Intraclass Correlation Coefficients (ICC) and Bland and Altman method were used to determine reliability and spearman rho correlation were used for criterion-related validity. A total of 73 patients were included in our analyses. The reliability analyses revealed that the BPAQ and its subscales have moderate to excellent reliability (ICC 2,1 : 0.61 to 0.81), whereas IPAQ and most IPAQ domains (except walking) showed poor reliability (ICC 2,1 : 0.20 to 0.40). The Bland and Altman method revealed larger discrepancies for the IPAQ. For the validity analysis, questionnaire and accelerometer measures showed at best fair correlation (rho reliability than the IPAQ long-version, both questionnaires did not demonstrate acceptable validity against accelerometer data. These findings suggest that questionnaire and accelerometer PA measures should not be used interchangeably in this population. Copyright © 2016 Elsevier Ltd. All rights reserved.
The Neck Disability Index-Russian Language Version (NDI-RU): A Study of Validity and Reliability.

Science.gov (United States)

Bakhtadze, Maxim A; Vernon, Howard; Zakharova, Olga B; Kuzminov, Kirill O; Bolotov, Dmitry A

2015-07-15

Cross-cultural adaptation and psychometric testing. To perform a validated Russian translation and then to evaluate the validity and reliability of the Russian language version of the Neck Disability Index (NDI-RU). Neck pain is highly prevalent and can greatly affect daily activity. The Neck Disability Index (NDI) is the most frequently used scale for self-rating of disability due to neck pain. Its translated versions are applied in many countries. However, the Russian language version of the NDI has not been developed yet. Cross-cultural adaptation of the NDI-RU was performed according to established guidelines. Then, the NDI-RU was evaluated for content validity, concurrent criterion validity, internal consistency, test-retest reliability, factor structure, and minimum detectable change. Two hundred thirty-two patients took part in the study in total: 109 in validity (39.5 ± 10 yr), 123 in reliability (38.4 ± 11 yr; 80 in the test-retest phase). A culturally valid translation was achieved. NDI-RU total scores were distributed normally. Floor/ceiling effects were absent. Good values of Cronbach α were obtained for each item (from 0.80 to 0.84) and for the total NDI-RU (0.83). A 2-factor solution was found for the NDI-RU. The average interitem correlation coefficient was 0.53. Intraclass correlation coefficients for test-retest reliability coefficients ranged from 0.65 to 0.92 for different items and 0.91 for the total NDI-RU. Moderate correlation (Spearman rs = 0.62; P Russian language version of the Neck Disability Index resulted in a valid, reliable instrument that can be used both in clinical practice and scientific investigations. 1.
Test-Retest Reliability of Diffusion Tensor Imaging in Huntington's Disease.

Science.gov (United States)

Cole, James H; Farmer, Ruth E; Rees, Elin M; Johnson, Hans J; Frost, Chris; Scahill, Rachael I; Hobbs, Nicola Z

2014-03-21

Diffusion tensor imaging (DTI) has shown microstructural abnormalities in patients with Huntington's Disease (HD) and work is underway to characterise how these abnormalities change with disease progression. Using methods that will be applied in longitudinal research, we sought to establish the reliability of DTI in early HD patients and controls. Test-retest reliability, quantified using the intraclass correlation coefficient (ICC), was assessed using region-of-interest (ROI)-based white matter atlas and voxelwise approaches on repeat scan data from 22 participants (10 early HD, 12 controls). T1 data was used to generate further ROIs for analysis in a reduced sample of 18 participants. The results suggest that fractional anisotropy (FA) and other diffusivity metrics are generally highly reliable, with ICCs indicating considerably lower within-subject compared to between-subject variability in both HD patients and controls. Where ICC was low, particularly for the diffusivity measures in the caudate and putamen, this was partly influenced by outliers. The analysis suggests that the specific DTI methods used here are appropriate for cross-sectional research in HD, and give confidence that they can also be applied longitudinally, although this requires further investigation. An important caveat for DTI studies is that test-retest reliability may not be evenly distributed throughout the brain whereby highly anisotropic white matter regions tended to show lower relative within-subject variability than other white or grey matter regions.
The Construct Validity and Reliability of an Assessment Tool for Competency in Cochlear Implant Surgery

Directory of Open Access Journals (Sweden)

Patorn Piromchai

2014-01-01

Full Text Available Introduction. We introduce a rating tool that objectively evaluates the skills of surgical trainees performing cochlear implant surgery. Methods. Seven residents and seven experts performed cochlear implant surgery sessions from mastoidectomy to cochleostomy on a standardized virtual reality temporal bone. A total of twenty-eight assessment videos were recorded and two consultant otolaryngologists evaluated the performance of each participant using these videos. Results. Interrater reliability was calculated using the intraclass correlation coefficient for both the global and checklist components of the assessment instrument. The overall agreement was high. The construct validity of this instrument was strongly supported by the significantly higher scores in the expert group for both components. Conclusion. Our results indicate that the proposed assessment tool for cochlear implant surgery is reliable, accurate, and easy to use. This instrument can thus be used to provide objective feedback on overall and task-specific competency in cochlear implantation.
[Reliability study in the measurement of the cusp inclination angle of a chairside digital model].

Science.gov (United States)

Xinggang, Liu; Xiaoxian, Chen

2018-02-01

This study aims to evaluate the reliability of the software Picpick in the measurement of the cusp inclination angle of a digital model. Twenty-one trimmed models were used as experimental objects. The chairside digital impression was then used for the acquisition of 3D digital models, and the software Picpick was employed for the measurement of the cusp inclination of these models. The measurements were repeated three times, and the results were compared with a gold standard, which was a manually measured experimental model cusp angle. The intraclass correlation coefficient (ICC) was calculated. The paired t test value of the two measurement methods was 0.91. The ICCs between the two measurement methods and three repeated measurements were greater than 0.9. The digital model achieved a smaller coefficient of variation (9.9%). The software Picpick is reliable in measuring the cusp inclination of a digital model.
Magnetic resonance imaging of shoulders with idiopathic adhesive capsulitis: reliability of measures

Energy Technology Data Exchange (ETDEWEB)

Lefevre-Colau, Marie-Martine; Fayad, Fouad; Rannou, Francois; Demaille-Wlodyka, Samantha; Mayoux-Benhamou, Marie-Anne; Poiraudeau, Serge; Revel, Michel [Universite Rene Descartes, Department of Physical and Rehabilitation Medicine, Hopital Cochin (AP-HP), Paris (France); Drape, Jean-Luc; Diche, Thierry; Minvielle, Francois [Hopital Cochin (AP-HP), Department of Radiology B, Paris (France); Fermanian, Jacques [Universite Rene Descartes, Department of Biostatistics, Hopital Necker (AP-HP), Paris (France)

2005-12-01

The magnetic resonance imaging (MRI) findings in idiopathic adhesive capsulitis (AC) were compared with those of contralateral healthy shoulders and the reliability of measures assessed. Twenty-six consecutive patients (26 AC and 14 healthy shoulders) were prospectively assessed. The main measurements were thickness of the joint capsule and synovial membrane in the axillary recess and rotator interval in T1-weighted spin-echo sequence enhanced with intravenous (IV) gadolinium chelate (Gd-chelate). Reliability was studied by use of the intraclass correlation coefficient (ICC). The mean thickness of the axillary recess on the coronal plane was 9.0{+-}2.2 mm in AC shoulders and 0.4{+-}0.7 mm in healthy shoulders. The mean thickness of the rotator interval on the sagittal plane was 8.4{+-}2.8 in AC shoulders and 0.6{+-}0.8 mm in healthy shoulders. Interobserver reliability was good for the axillary recess, with ICC values of 0.84 for the coronal plane, and good for the rotator interval, with ICC values of 0.80 for the sagittal plane. MRI with IV Gd-chelate injection can show, with acceptable reliability, signal and thickness abnormalities of the shoulder joint capsule and synovial membrane in AC. (orig.)
Magnetic resonance imaging of shoulders with idiopathic adhesive capsulitis: reliability of measures

International Nuclear Information System (INIS)

Lefevre-Colau, Marie-Martine; Fayad, Fouad; Rannou, Francois; Demaille-Wlodyka, Samantha; Mayoux-Benhamou, Marie-Anne; Poiraudeau, Serge; Revel, Michel; Drape, Jean-Luc; Diche, Thierry; Minvielle, Francois; Fermanian, Jacques

2005-01-01

The magnetic resonance imaging (MRI) findings in idiopathic adhesive capsulitis (AC) were compared with those of contralateral healthy shoulders and the reliability of measures assessed. Twenty-six consecutive patients (26 AC and 14 healthy shoulders) were prospectively assessed. The main measurements were thickness of the joint capsule and synovial membrane in the axillary recess and rotator interval in T1-weighted spin-echo sequence enhanced with intravenous (IV) gadolinium chelate (Gd-chelate). Reliability was studied by use of the intraclass correlation coefficient (ICC). The mean thickness of the axillary recess on the coronal plane was 9.0±2.2 mm in AC shoulders and 0.4±0.7 mm in healthy shoulders. The mean thickness of the rotator interval on the sagittal plane was 8.4±2.8 in AC shoulders and 0.6±0.8 mm in healthy shoulders. Interobserver reliability was good for the axillary recess, with ICC values of 0.84 for the coronal plane, and good for the rotator interval, with ICC values of 0.80 for the sagittal plane. MRI with IV Gd-chelate injection can show, with acceptable reliability, signal and thickness abnormalities of the shoulder joint capsule and synovial membrane in AC. (orig.)
The reliability of three psoriasis assessment tools: Psoriasis area and severity index, body surface area and physician global assessment.

Science.gov (United States)

Bożek, Agnieszka; Reich, Adam

2017-08-01

A wide variety of psoriasis assessment tools have been proposed to evaluate the severity of psoriasis in clinical trials and daily practice. The most frequently used clinical instrument is the psoriasis area and severity index (PASI); however, none of the currently published severity scores used for psoriasis meets all the validation criteria required for an ideal score. The aim of this study was to compare and assess the reliability of 3 commonly used assessment instruments for psoriasis severity: the psoriasis area and severity index (PASI), body surface area (BSA) and physician global assessment (PGA). On the scoring day, 10 trained dermatologists evaluated 9 adult patients with plaque-type psoriasis using the PASI, BSA and PGA. All the subjects were assessed twice by each physician. Correlations between the assessments were analyzed using the Pearson correlation coefficient. Intra-class correlation coefficient (ICC) was calculated to analyze intra-rater reliability, and the coefficient of variation (CV) was used to assess inter-rater variability. Significant correlations were observed among the 3 scales in both assessments. In all 3 scales the ICCs were > 0.75, indicating high intra-rater reliability. The highest ICC was for the BSA (0.96) and the lowest one for the PGA (0.87). The CV for the PGA and PASI were 29.3 and 36.9, respectively, indicating moderate inter-rater variability. The CV for the BSA was 57.1, indicating high inter-rater variability. Comparing the PASI, PGA and BSA, it was shown that the PGA had the highest inter-rater reliability, whereas the BSA had the highest intra-rater reliability. The PASI showed intermediate values in terms of interand intra-rater reliability. None of the 3 assessment instruments showed a significant advantage over the other. A reliable assessment of psoriasis severity requires the use of several independent evaluations simultaneously.
Evaluating validity and reliability of Persian version of Supports Intensity Scale in adults with intellectual disability

Directory of Open Access Journals (Sweden)

Shahin Soltani

2013-12-01

Full Text Available Background: Shifting paradigms regarding the ways to assess the support needs of people with intellectual disability in 1980 necessitates the design and development of appropriate tools more than ever. In this regard, American Association on Intellectual and Developmental Disabilities (AAIDD developed Supports Intensity Scale (SIS to respond the lack of an appropriate measurement tool. The aim of this study is the cultural adaptation and evaluation of psychometric properties of Supports Intensity Scale in adults with intellectual disability. Methods: Validity of Persian version of SIS was assessed by Content validity. The reliability of the scale was evaluated using Cronbach's alpha and test–retest reliability with a 3-week interval. In this study, the sample contained 43 adults (29 men and 14 women with intellectual disability. Results: The content of the Persian version of SIS was approved by the experts. The Cronbach's alpha reliability coefficients for the subscales ranged between 0.80 and 0.99. Also, Intraclass correlation coefficients ranged between 0.90 and 0.99 (P<0.001. Furthermore, all Pearson correlation coefficients among the SIS subscales ranged between 0.63 and 0.98 (P<0.01. Conclusion: The results of this study indicated that the validity and reliability of the equivalent Persian version of SIS for identifying pattern and required supports intensity in adults with intellectual disability is acceptable.
Validity and reliability of the European portuguese version of neuropsychiatric inventory in an institutionalized sample.

Science.gov (United States)

Ferreira, Ana Rita; Martins, Sonia; Ribeiro, Orquidea; Fernandes, Lia

2015-01-01

Neuropsychiatric symptoms are very common in dementia and have been associated with patient and caregiver distress, increased risk of institutionalization and higher costs of care. In this context, the neuropsychiatric inventory (NPI) is the most widely used comprehensive tool designed to measure neuropsychiatric Symptoms in geriatric patients with dementia. The aim of this study was to present the validity and reliability of the European Portuguese version of NPI. A cross-sectional study was carried out with a convenience sample of institutionalized patients (≥ 50 years old) in three nursing homes in Portugal. All patients were also assessed with mini-mental state examination (MMSE) (cognition), geriatric depression scale (GDS) (depression) and adults and older adults functional assessment inventory (IAFAI) (functionality). NPI was administered to a formal caregiver, usually from the clinical staff. Inter-rater and test-retest reliability were assessed in a subsample of 25 randomly selected subjects. The sample included 166 elderly, with a mean age of 80.9 (standard deviation: 10.2) years. Three out of the NPI behavioral items had negative correlations with MMSE: delusions (rs = -0.177, P = 0.024), disinhibition (rs = -0.174, P = 0.026) and aberrant motor activity (rs = -0.182, P = 0.020). The NPI subsection of depression/dysphoria correlated positively with GDS total score (rs = 0.166, P = 0.038). NPI showed good internal consistency (overall α = 0.766; frequency α = 0.737; severity α = 0.734). The inter-rater reliability was excellent (intraclass correlation coefficient (ICC): 1.00, 95% confidence interval (CI) 1.00 - 1.00), as well as test-retest reliability (ICC: 0.91, 95% CI 0.80 - 0.96). The results found for convergent validity, inter-rater and test-retest reliability, showed that this version appears to be a valid and reliable instrument for evaluation of neuropsychiatric symptoms in institutionalized elderly.
Reliability and accuracy of three imaging software packages used for 3D analysis of the upper airway on cone beam computed tomography images.

Science.gov (United States)

Chen, Hui; van Eijnatten, Maureen; Wolff, Jan; de Lange, Jan; van der Stelt, Paul F; Lobbezoo, Frank; Aarab, Ghizlane

2017-08-01

The aim of this study was to assess the reliability and accuracy of three different imaging software packages for three-dimensional analysis of the upper airway using CBCT images. To assess the reliability of the software packages, 15 NewTom 5G ® (QR Systems, Verona, Italy) CBCT data sets were randomly and retrospectively selected. Two observers measured the volume, minimum cross-sectional area and the length of the upper airway using Amira ® (Visage Imaging Inc., Carlsbad, CA), 3Diagnosys ® (3diemme, Cantu, Italy) and OnDemand3D ® (CyberMed, Seoul, Republic of Korea) software packages. The intra- and inter-observer reliability of the upper airway measurements were determined using intraclass correlation coefficients and Bland & Altman agreement tests. To assess the accuracy of the software packages, one NewTom 5G ® CBCT data set was used to print a three-dimensional anthropomorphic phantom with known dimensions to be used as the "gold standard". This phantom was subsequently scanned using a NewTom 5G ® scanner. Based on the CBCT data set of the phantom, one observer measured the volume, minimum cross-sectional area, and length of the upper airway using Amira ® , 3Diagnosys ® , and OnDemand3D ® , and compared these measurements with the gold standard. The intra- and inter-observer reliability of the measurements of the upper airway using the different software packages were excellent (intraclass correlation coefficient ≥0.75). There was excellent agreement between all three software packages in volume, minimum cross-sectional area and length measurements. All software packages underestimated the upper airway volume by -8.8% to -12.3%, the minimum cross-sectional area by -6.2% to -14.6%, and the length by -1.6% to -2.9%. All three software packages offered reliable volume, minimum cross-sectional area and length measurements of the upper airway. The length measurements of the upper airway were the most accurate results in all software packages. All
The reliability and validity of a Venezuelan version of the Bath Ankylosing Spondylitis Functional Index (BASFI) and Bath Ankylosing Spondylitis Disease Activity Index (BASDAI).

Science.gov (United States)

Rauseo Vera, Mayra; Gutiérrez-González, Luis Arturo; Maldonado, Irama; Al Snih, Soham

2017-09-21

Spondyloarthropathies (SpA) are disabling diseases with a prevalence of 1.9% in the general population. The indices designed for monitoring the disease should be valid, reliable and cross-culturally adapted for decision-making concerning the appropriate treatment. Changing an adjective or pronoun in a self-administered questionnaire could be the big difference in condensing an idea in a few words and transmitting that concept to all those who share the same language. To develop a Venezuelan version of the original English version of the BASDAI/BASFI and to evaluate its reliability and validity in Venezuelan patients with SpA. Certified linguists were needed for the translation of a Venezuelan version of the BASDAI/BASFI. The evaluation of reliability and validity was performed by calculating correlation coefficients in addition to Cronbach's alpha correlation between the BASDAI score and the clinical parameters (for example: erythrocyte sedimentation rate, C-reactive protein, modified Schöber test, occiput-to-wall distance and enthesis count). We studied 40 patients including 31 men (77.5%) and 9 women (22.5%). The mean age was 35.9 years ± standard deviation (SD) 12.01 and the disease duration was 11.5 years (± SD 9.5). The most common diagnoses were undifferentiated spondyloarthritis (45%), ankylosing spondylitis (27.5%) and psoriatic arthritis (20%). The incidences of reactive arthritis, ankylosing spondylitis and juvenile Reiter's syndrome were 2.5% each. The test-retest reliability of the BASDAI and BASFI was high (R = 0.99 and 0.99, respectively; P<.0001). The internal consistency for the BASDAI was high (Cronbach's alpha = 0.88; P=.002) and the intraclass correlation coefficient for internal consistency: 0.9867 (P=.001). Internal consistency for the BASFI: Cronbach's alpha = 0.7985 (P=.002), intraclass correlation coefficient for internal consistency: 0.9055 (P=.001). Construct validity of the BASDAI was high for general well-being of the patient (R = 0
The Vocal Cord Dysfunction Questionnaire: Validity and Reliability of the Persian Version.

Science.gov (United States)

Ghaemi, Hamide; Khoddami, Seyyedeh Maryam; Soleymani, Zahra; Zandieh, Fariborz; Jalaie, Shohreh; Ahanchian, Hamid; Khadivi, Ehsan

2017-12-25

The aim of this study was to develop, validate, and assess the reliability of the Persian version of Vocal Cord Dysfunction Questionnaire (VCDQ P ). The study design was cross-sectional or cultural survey. Forty-four patients with vocal fold dysfunction (VFD) and 40 healthy volunteers were recruited for the study. To assess the content validity, the prefinal questions were given to 15 experts to comment on its essential. Ten patients with VFD rated the importance of VCDQ P in detecting face validity. Eighteen of the patients with VFD completed the VCDQ 1 week later for test-retest reliability. To detect absolute reliability, standard error of measurement and smallest detected change were calculated. Concurrent validity was assessed by completing the Persian Chronic Obstructive Pulmonary Disease (COPD) Assessment Test (CAT) by 34 patients with VFD. Discriminant validity was measured from 34 participants. The VCDQ was further validated by administering the questionnaire to 40 healthy volunteers. Validation of the VCDQ as a treatment outcome tool was conducted in 18 patients with VFD using pre- and posttreatment scores. The internal consistency was confirmed (Cronbach α = 0.78). The test-retest reliability was excellent (intraclass correlation coefficient = 0.97). The standard error of measurement and smallest detected change values were acceptable (0.39 and 1.08, respectively). There was a significant correlation between the VCDQ P and the CAT total scores (P validity was significantly different. The VCDQ scores in patients with VFD before and after treatment was significantly different (P valid and reliable self-administered questionnaire in Persian-speaking population. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reliability analysis based on a novel density estimation method for structures with correlations

Directory of Open Access Journals (Sweden)

Baoyu LI

2017-06-01

Full Text Available Estimating the Probability Density Function (PDF of the performance function is a direct way for structural reliability analysis, and the failure probability can be easily obtained by integration in the failure domain. However, efficiently estimating the PDF is still an urgent problem to be solved. The existing fractional moment based maximum entropy has provided a very advanced method for the PDF estimation, whereas the main shortcoming is that it limits the application of the reliability analysis method only to structures with independent inputs. While in fact, structures with correlated inputs always exist in engineering, thus this paper improves the maximum entropy method, and applies the Unscented Transformation (UT technique to compute the fractional moments of the performance function for structures with correlations, which is a very efficient moment estimation method for models with any inputs. The proposed method can precisely estimate the probability distributions of performance functions for structures with correlations. Besides, the number of function evaluations of the proposed method in reliability analysis, which is determined by UT, is really small. Several examples are employed to illustrate the accuracy and advantages of the proposed method.
Reliability and validity of the Wii Balance Board for assessment of standing balance: A systematic review.

Science.gov (United States)

Clark, Ross A; Mentiplay, Benjamin F; Pua, Yong-Hao; Bower, Kelly J

2018-03-01

The use of force platform technologies to assess standing balance is common across a range of clinical areas. Numerous researchers have evaluated the low-cost Wii Balance Board (WBB) for its utility in assessing balance, with variable findings. This review aimed to systematically evaluate the reliability and concurrent validity of the WBB for assessment of static standing balance. Articles were retrieved from six databases (Medline, SCOPUS, EMBASE, CINAHL, Web of Science, Inspec) from 2007 to 2017. After independent screening by two reviewers, 25 articles were included. Two reviewers performed the data extraction and quality assessment. Test-retest reliability was investigated in 12 studies, with intraclass correlation coefficients or Pearson's correlation values showing a range from poor to excellent reliability (range: 0.27 to 0.99). Concurrent validity (i.e. comparison with another force platform) was examined in 21 studies, and was generally found to be excellent in studies examining the association between the same outcome measures collected on both devices. For studies reporting predominantly poor to moderate validity, potentially influential factors included the choice of 1) criterion reference (e.g. not a common force platform), 2) test duration (e.g. balance. Protocol registration number: PROSPERO 2017: CRD42017058122. Copyright © 2018 Elsevier B.V. All rights reserved.

Reliability of muscle strength assessment in chronic post-stroke hemiparesis: a systematic review and meta-analysis.

Science.gov (United States)

Rabelo, Michelle; Nunes, Guilherme S; da Costa Amante, Natália Menezes; de Noronha, Marcos; Fachin-Martins, Emerson

2016-02-01

Muscle weakness is the main cause of motor impairment among stroke survivors and is associated with reduced peak muscle torque. To systematically investigate and organize the evidence of the reliability of muscle strength evaluation measures in post-stroke survivors with chronic hemiparesis. Two assessors independently searched four electronic databases in January 2014 (Medline, Scielo, CINAHL, Embase). Inclusion criteria comprised studies on reliability on muscle strength assessment in adult post-stroke patients with chronic hemiparesis. We extracted outcomes from included studies about reliability data, measured by intraclass correlation coefficient (ICC) and/or similar. The meta-analyses were conducted only with isokinetic data. Of 450 articles, eight articles were included for this review. After quality analysis, two studies were considered of high quality. Five different joints were analyzed within the included studies (knee, hip, ankle, shoulder, and elbow). Their reliability results varying from low to very high reliability (ICCs from 0.48 to 0.99). Results of meta-analysis for knee extension varying from high to very high reliability (pooled ICCs from 0.89 to 0.97), for knee flexion varying from high to very high reliability (pooled ICCs from 0.84 to 0.91) and for ankle plantar flexion showed high reliability (pooled ICC = 0.85). Objective muscle strength assessment can be reliably used in lower and upper extremities in post-stroke patients with chronic hemiparesis.
Reliability and validity of two frequently used self-administered physical activity questionnaires in adolescents

Directory of Open Access Journals (Sweden)

Kurtze Nanna

2008-07-01

Full Text Available Abstract Background To create and find accurate and reliable instruments for the measurement of physical activity has been a challenge in epidemiological studies. We investigated the reliability and validity of two different physical activity questionnaires in 71 adolescents aged 13–18 years; the WHO, Health Behaviour in Schoolchildren (HBSC questionnaire, and the International Physical Activity Questionnaire (IPAQ, short version. Methods The questionnaires were administered twice (8–12 days apart to measure reliability. Validity was assessed by comparing answers from the questionnaires with a cardiorespiratory fitness test (VO2peak and seven days activity monitoring with the ActiReg, an instrument measuring physical activity level (PAL and total energy expenditure (TEE. Results Intraclass correlation coefficients for reliability for the WHO HBSC questionnaire were 0.71 for frequency and 0.73 for duration. For the frequency question, there was a significant difference between genders; 0.87 for girls and 0.59 for boys (p 2peak were fair, ranging between 0.29 – 0.39. The WHO HBSC questionnaire measured against VO2peak for girls were acceptable, ranging between 0.30 – 0.55. Both questionnaires, except the walking question in IPAQ, showed a low correlation with PAL and TEE, ranging between 0.01 and 0.29. Conclusion These data indicate that the WHO HBSC questionnaire had substantial reliability and were acceptable instrument for measuring cardiorespiratory fitness, especially among girls. None of the questionnaires however seemed to be a valid instrument for measuring physical activity compared to TEE and PAL in adolescents.
Validity and reliability of isometric muscle strength measurements of hip abduction and abduction with external hip rotation in a bent-hip position using a handheld dynamometer with a belt.

Science.gov (United States)

Aramaki, Hidefumi; Katoh, Munenori; Hiiragi, Yukinobu; Kawasaki, Tsubasa; Kurihara, Tomohisa; Ohmi, Yorikatsu

2016-07-01

[Purpose] This study aimed to investigate the relatedness, reliability, and validity of isometric muscle strength measurements of hip abduction and abduction with an external hip rotation in a bent-hip position using a handheld dynamometer with a belt. [Subjects and Methods] Twenty healthy young adults, with a mean age of 21.5 ± 0.6 years were included. Isometric hip muscle strength in the subjects' right legs was measured under two posture positions using two devices: a handheld dynamometer with a belt and an isokinetic dynamometer. Reliability was evaluated using an intra-class correlation coefficient (ICC); relatedness and validity were evaluated using Pearson's product moment correlation coefficient. Differences in measurements of devices were assessed by two-way ANOVA. [Results] ICC (1, 1) was ≥0.9; significant positive correlations in measurements were found between the two devices under both conditions. No main effect was found between the measurement values. [Conclusion] Our findings revealed that there was relatedness, reliability, and validity of this method for isometric muscle strength measurements using a handheld dynamometer with a belt.
Accuracy and Reliability of the Klales et al. (2012) Morphoscopic Pelvic Sexing Method.

Science.gov (United States)

Lesciotto, Kate M; Doershuk, Lily J

2018-01-01

Klales et al. (2012) devised an ordinal scoring system for the morphoscopic pelvic traits described by Phenice (1969) and used for sex estimation of skeletal remains. The aim of this study was to test the accuracy and reliability of the Klales method using a large sample from the Hamann-Todd collection (n = 279). Two observers were blinded to sex, ancestry, and age and used the Klales et al. method to estimate the sex of each individual. Sex was correctly estimated for females with over 95% accuracy; however, the male allocation accuracy was approximately 50%. Weighted Cohen's kappa and intraclass correlation coefficient analysis for evaluating intra- and interobserver error showed moderate to substantial agreement for all traits. Although each trait can be reliably scored using the Klales method, low accuracy rates and high sex bias indicate better trait descriptions and visual guides are necessary to more accurately reflect the range of morphological variation. © 2017 American Academy of Forensic Sciences.
Reliability and validity of the Youth Leisure-time Sedentary Behavior Questionnaire (YLSBQ).

Science.gov (United States)

Cabanas-Sánchez, Verónica; Martínez-Gómez, David; Esteban-Cornejo, Irene; Castro-Piñero, José; Conde-Caveda, Julio; Veiga, Óscar L

2018-01-01

To develop a questionnaire able to assess time spent by youth in a wide range of leisure-time sedentary behaviors (SB) and evaluate its test-retest reliability and criterion validity. Cross-sectional observational. The reliability sample included 194 youth, aged 10-18 years, who completed the questionnaire twice, separated by one-week interval. The validity study comprised 1207 participants aged 8-18 years. Participants wore an accelerometer for 7 consecutive days. The questionnaire was designed to assess the amount of time spent in twelve different SB during weekdays and weekends, separately. In order to avoid usual phenomenon of time over reporting, values were adjusted to real available leisure-time (LT) for each participant. Reliability was assessed by using Intraclass Correlation Coefficients (ICC) and weighted (quadratic) kappa (k), and validity was assessed by using Pearson correlation and Bland-Altman plots. The reliability of questionnaire showed a moderate-to-substantial agreement for the most (91%) of items (k=0.43-0.74; ICC=0.41-0.79) with three items (4%) reaching an almost perfect agreement (ICC=0.82-0.83). Only 'sitting and talking' evidenced fair-to-moderate reliability (k=0.27-0.39; ICC=0.34-0.46). The relationship between average sedentary time assessed by the questionnaire and accelerometry was moderate (r=0.36; pquestionnaire and accelerometer sedentary time for average day (r=0.05; p=0.11) but Bland-Altman plots suggest moderate discrepancies between both methods of SB measurement (mean=19.86; limits of agreement=-280.04 to 319.76). The questionnaire showed moderate to good test-retest reliability and a moderate level of validity for assessing SB in youth, similar or slightly better to previously published in this population. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Validity and reliability of an adapted Thai version of Scoliosis Research Society-22 questionnaire for adolescent idiopathic scoliosis.

Science.gov (United States)

Sathira-Angkura, Vera; Pithankuakul, Kongkit; Sakulpipatana, Susana; Piyaskulkaew, Chaiwat; Kunakornsawat, Sombat

2012-04-20

Cross-sectional observational study to investigate psychometric properties of an adapted Thai version of the refined Scoliosis Research Society-22 (SRS-22) questionnaire. To evaluate the reliability and validity of the adapted Thai version of the refined SRS-22 questionnaire. The SRS-22 questionnaire is a valid instrument for assessing the health-related quality of life for patients with adolescent idiopathic scoliosis. Recently, the questionnaire has been translated and validated in many languages for non-English-speaking countries. Translation/retranslation of the English version of the SRS-22 was conducted, and the cross-cultural adaptation process was performed. The Thai version SRS-22 and previously validated Thai version Short-Form survey version 2.0 (SF-36V2) questionnaires were administered to 77 patients with adolescent idiopathic scoliosis who had surgical treatment. Fifty-eight patients (52 adolescent girls) had filled out the first set of questionnaires. Thirty patients of the first-time responders completed the second set of questionnaires. The mean age at the time of operation was 14.6 years and the mean age at the time of the final follow-up was 18.7 years. The mean preoperative scoliosis curve magnitude was 55.4° (range, 30°-95°) and postoperative curve magnitude was 20.1° (range, 0°-60°). Internal consistency was determined with Cronbach α coefficient. Intraclass correlation coefficient was used for test-retest reliability. Concurrent validity was evaluated by comparing SRS-22 domains with relevant domains in the SF-36V2 questionnaire, using the Pearson correlation coefficient. The mean overall Cronbach α coefficient of the adapted Thai version SRS-22 was 0.76. The 2 of corresponding domains (mental health = 0.80 and self-image = 0.83) had satisfactory internal consistency and the remaining domains (pain = 0.78; function/activity = 0.74; and satisfaction = 0.76) were good. The intraclass correlation coefficient for 5 domains was ranged from
A comparison of manual anthropometric measurements with Kinect-based scanned measurements in terms of precision and reliability.

Science.gov (United States)

Bragança, Sara; Arezes, Pedro; Carvalho, Miguel; Ashdown, Susan P; Castellucci, Ignacio; Leão, Celina

2018-01-01

Collecting anthropometric data for real-life applications demands a high degree of precision and reliability. It is important to test new equipment that will be used for data collectionOBJECTIVE:Compare two anthropometric data gathering techniques - manual methods and a Kinect-based 3D body scanner - to understand which of them gives more precise and reliable results. The data was collected using a measuring tape and a Kinect-based 3D body scanner. It was evaluated in terms of precision by considering the regular and relative Technical Error of Measurement and in terms of reliability by using the Intraclass Correlation Coefficient, Reliability Coefficient, Standard Error of Measurement and Coefficient of Variation. The results obtained showed that both methods presented better results for reliability than for precision. Both methods showed relatively good results for these two variables, however, manual methods had better results for some body measurements. Despite being considered sufficiently precise and reliable for certain applications (e.g. apparel industry), the 3D scanner tested showed, for almost every anthropometric measurement, a different result than the manual technique. Many companies design their products based on data obtained from 3D scanners, hence, understanding the precision and reliability of the equipment used is essential to obtain feasible results.
Attenuation of the Squared Canonical Correlation Coefficient under Varying Estimates of Score Reliability

Science.gov (United States)

Wilson, Celia M.

2010-01-01

Research pertaining to the distortion of the squared canonical correlation coefficient has traditionally been limited to the effects of sampling error and associated correction formulas. The purpose of this study was to compare the degree of attenuation of the squared canonical correlation coefficient under varying conditions of score reliability.…
Reliability and Validity of 3 Methods of Assessing Orthopedic Resident Skill in Shoulder Surgery.

Science.gov (United States)

Bernard, Johnathan A; Dattilo, Jonathan R; Srikumaran, Uma; Zikria, Bashir A; Jain, Amit; LaPorte, Dawn M

Traditional measures for evaluating resident surgical technical skills (e.g., case logs) assess operative volume but not level of surgical proficiency. Our goal was to compare the reliability and validity of 3 tools for measuring surgical skill among orthopedic residents when performing 3 open surgical approaches to the shoulder. A total of 23 residents at different stages of their surgical training were tested for technical skill pertaining to 3 shoulder surgical approaches using the following measures: Objective Structured Assessment of Technical Skills (OSATS) checklists, the Global Rating Scale (GRS), and a final pass/fail assessment determined by 3 upper extremity surgeons. Adverse events were recorded. The Cronbach α coefficient was used to assess reliability of the OSATS checklists and GRS scores. Interrater reliability was calculated with intraclass correlation coefficients. Correlations among OSATS checklist scores, GRS scores, and pass/fail assessment were calculated with Spearman ρ. Validity of OSATS checklists was determined using analysis of variance with postgraduate year (PGY) as a between-subjects factor. Significance was set at p shoulder approaches. Checklist scores showed superior interrater reliability compared with GRS and subjective pass/fail measurements. GRS scores were positively correlated across training years. The incidence of adverse events was significantly higher among PGY-1 and PGY-2 residents compared with more experienced residents. OSATS checklists are a valid and reliable assessment of technical skills across 3 surgical shoulder approaches. However, checklist scores do not measure quality of technique. Documenting adverse events is necessary to assess quality of technique and ultimate pass/fail status. Multiple methods of assessing surgical skill should be considered when evaluating orthopedic resident surgical performance. Copyright Â© 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights
Reliability analysis for manual radiographic measures of rotatory subluxation or lateral listhesis in adult scoliosis.

Science.gov (United States)

Freedman, Brett A; Horton, William C; Rhee, John M; Edwards, Charles C; Kuklo, Timothy R

2009-03-15

Retrospective observational study. To define the inter- and intraobserver reliability of 3 measures of rotatory subluxation (RS) in adult scoliosis (AS). RS is a hallmark of AS. To accurately track this measure, one must know its reliability. Reliability testing has not been performed. PA 36" films of 29 AS patients were collected from one surgeon's practice. Three observers on 2 separate occasions measured all levels with >or=3-mm RS (60 levels, 360 measurements) on the convexity of the involved segment using 3 different techniques-midbody (MB), endplate (EP), and centroid (C). These data were then analyzed to determine the intraclass correlation coefficient (ICC) for inter- and intraobserver reliability. The thoracolumbar/lumbar curve (average 58 degrees ) was the major curve for the majority (62%) of patients. RS at L3/4 was most common (35%). The overall inter- and intraobserver reliability was good-excellent for all methods, but the centroid method consistently had the highest ICC. ICC correlated with observer experience. Moderate-severe arthritic change (present in 55%) and poor image quality (52%) decreased ICC, but it still remained good-excellent for each measure. The reproducibility coefficient for each measure was 4 mm for MB and 2.8 mm for C and EP. MB, EP, and C are reliable techniques to measure RS even in elderly arthritic spines, but the methods inherently produce different values for a given level. The centroid method is most reliable and least influenced by experience. The EP method is easy to perform and very reliable. Spine surgeons should pick their preferred method and apply it consistently. Changes >3 mm suggest RS progression. RS may be a useful measure in addition to Cobb angle in AS. Having defined measurement reliability, the role of RS progression in surgical indications and patient outcomes can be evaluated.
Strength of pelvic floor in men: reliability intra examiners

Directory of Open Access Journals (Sweden)

Patricia Zaidan

2018-05-01

Full Text Available Abstract Introduction: The obtaining of urinary continence is due to the strength of the pelvic floor muscles (MAPs at the moment of muscle contraction, when there are sudden increases in intra-abdominal pressure, which increases urethral closure pressure and decreases the possibility of urinary loss. Objective: To verify the reliability, type: stability, intra-examiner, of the measure of the strength of MAPs held with Peritron. Methods: Test and retest study to assess the intra-rater reliability of Peritron to measure the strength of MAPs. The sample consisted of 36 male patients, mean age 65.3 ± 7.2 years, all with urinary incontinence (UI after radical prostatectomy. The physical therapist conducted a training for familiarization with the procedures of MAPs strength assessment with Peritron for two weeks. The strength of MAPs was measured by a perineometer of the Peritron brand (PFX 9300®, Cardio-Design Pty. Ltd, Baulkham Hills, Australia, 2153. Results: The intraclass correlation coefficient (ICC was equal to 0.99; P = 0.0001. The typical measurement error (ETM was equal to 3.1 cmH2O and ETM% of 4. Conclusion: Peritron showed high reliability for measuring the strength of MAPs in men, both for clinical practice and for the production of scientific knowledge. It should be noted that such measures were carried out in stability, so it is suggested that in internal consistency reliability is equivalent.
Assessing physiotherapists' communication skills for promoting patient autonomy for self-management: reliability and validity of the communication evaluation in rehabilitation tool.

Science.gov (United States)

Murray, Aileen; Hall, Amanda; Williams, Geoffrey C; McDonough, Suzanne M; Ntoumanis, Nikos; Taylor, Ian; Jackson, Ben; Copsey, Bethan; Hurley, Deirdre A; Matthews, James

2018-02-27

To assess the inter-rater reliability and concurrent validity of the Communication Evaluation in Rehabilitation Tool, which aims to externally assess physiotherapists competency in using Self-Determination Theory-based communication strategies in practice. Audio recordings of initial consultations between 24 physiotherapists and 24 patients with chronic low back pain in four hospitals in Ireland were obtained as part of a larger randomised controlled trial. Three raters, all of whom had Ph.Ds in psychology and expertise in motivation and physical activity, independently listened to the 24 audio recordings and completed the 18-item Communication Evaluation in Rehabilitation Tool. Inter-rater reliability between all three raters was assessed using intraclass correlation coefficients. Concurrent validity was assessed using Pearson's r correlations with a reference standard, the Health Care Climate Questionnaire. The total score for the Communication Evaluation in Rehabilitation Tool is an average of all 18 items. Total scores demonstrated good inter-rater reliability (Intraclass Correlation Coefficient (ICC) = 0.8) and concurrent validity with the Health Care Climate Questionnaire total score (range: r = 0.7-0.88). Item-level scores of the Communication Evaluation in Rehabilitation Tool identified five items that need improvement. Results provide preliminary evidence to support future use and testing of the Communication Evaluation in Rehabilitation Tool. Implications for Rehabilitation Promoting patient autonomy is a learned skill and while interventions exist to train clinicians in these skills there are no tools to assess how well clinicians use these skills when interacting with a patient. The lack of robust assessment has severe implications regarding both the fidelity of clinician training packages and resulting outcomes for promoting patient autonomy. This study has developed a novel measurement tool Communication Evaluation in Rehabilitation Tool and a
Reproducibility of tender point examination in chronic low back pain patients as measured by intrarater and inter-rater reliability and agreement

DEFF Research Database (Denmark)

Jensen, Ole Kudsk; Callesen, Jacob; Nielsen, Merete Graakjaer

2013-01-01

back examination and return-to-work intervention, 43 and 39 patients, respectively (18 women, 46%) entered and completed the study. MAIN OUTCOME MEASURES: The reliability was estimated by the intraclass correlation coefficient (ICC), and agreement was calculated for up to ±3 TPs. Furthermore......, the smallest detectable difference was calculated. RESULTS: TP examination was performed twice by two consultants in rheumatology and rehabilitation at 20 min intervals and repeated 1 week later. Intrarater reliability in the more and less experienced rater was ICC 0.84 (95% CI 0.69 to 0.98) and 0.72 (95% CI 0.......49 to 0.95), respectively. The figures for inter-rater reliability were intermediate between these figures. In more than 70% of the cases, the raters agreed within ±3 TPs in both men and women and between test days. The smallest detectable difference between raters was 5, and for the more and less...
Intra- and interrater reliability of the 'lumbar-locked thoracic rotation test' in competitive swimmers ages 10 through 18 years.

Science.gov (United States)

Feijen, Stef; Kuppens, Kevin; Tate, Angela; Baert, Isabel; Struyf, Thomas; Struyf, Filip

2018-04-17

Measuring thoracic spine mobility can be of interest to competitive swimmers as it has been associated with shoulder girdle function and scapular position in subjects with and without shoulder pain. At present, no reliability data of thoracic spine mobility measurements are available in the swimming population. This study aims to evaluate the within-session intra- and interrater reliability of the "lumbar-locked rotation test" for thoracic spine rotation in competitive swimmers aged 10 to 18 years. This reliability study is part of a larger prospective cohort study investigating potential risk factors for the development of shoulder pain in competitive swimmers. Within-session, intra- and inter-rater reliability. Competitive swimming clubs in Belgium. 21 competitive swimmers. Intra- and inter-rater reliability of the lumbar-locked thoracic rotation test. Intraclass correlation coefficients (ICCs) ranged from 0.91 (95% CI 0.78 to 0.96) to 0.96 (0.89-0.98) for intra-rater reliability. Results for inter-rater reliability ranged from 0.89 (0.72-0.95) to 0.86 (0.65-0.94) respectively for right and left thoracic rotation. Results suggest good to excellent reliability of the lumbar-locked thoracic rotation test, indicating this test can be used reliably in clinical practice. Copyright © 2018 Elsevier Ltd. All rights reserved.
Reliability of bounce drop jump parameters within elite male rugby players.

Science.gov (United States)

Costley, Lisa; Wallace, Eric; Johnston, Michael; Kennedy, Rodney

2017-07-25

The aims of the study were to investigate the number of familiarisation sessions required to establish reliability of the bounce drop jump (BDJ) and subsequent reliability once familiarisation is achieved. Seventeen trained male athletes completed 4 BDJs in 4 separate testing sessions. Force-time data from a 20 cm BDJ was obtained using two force plates (ensuring ground contact < 250 ms). Subjects were instructed to 'jump for maximal height and minimal contact time' while the best and average of four jumps were compared. A series of performance variables were assessed in both eccentric and concentric phases including jump height, contact time, flight time, reactive strength index (RSI), peak power, rate of force development (RFD) and actual dropping height (ADH). Reliability was assessed using the intraclass correlation coefficient (ICC) and coefficient of variation (CV) while familiarisation was assessed using a repeated measures analysis of variance (ANOVA). The majority of DJ parameters exhibited excellent reliability with no systematic bias evident, while the average of 4 trials provided greater reliability. With the exception of vertical stiffness (CV: 12.0 %) and RFD (CV: 16.2 %) all variables demonstrated low within subject variation (CV range: 3.1 - 8.9 %). Relative reliability was very poor for ADH, with heights ranging from 14.87 - 29.85 cm. High levels of reliability can be obtained from the BDJ with the exception of vertical stiffness and RFD, however, extreme caution must be taken when comparing DJ results between individuals and squads due to large discrepancies between actual drop height and platform height.
Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.

Science.gov (United States)

Taylor, Karen; Bulsara, Max; Monterosso, Leanne

2018-01-01

Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors ( n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors ( n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
Reliability of Eustachian tube function measurements in a hypobaric and hyperbaric pressure chamber.

Science.gov (United States)

Meyer, M F; Jansen, S; Mordkovich, O; Hüttenbrink, K-B; Beutner, D

2017-12-01

Measurement of the Eustachian tube (ET) function is a challenge. The demand for a precise and meaningful diagnostic tool increases-especially because more and more operative therapies are being offered without objective evidence. The measurement of the ET function by continuous impedance recording in a pressure chamber is an established method, although the reliability of the measurements is still unclear. Twenty-five participants (50 ears) were exposed to phases of compression and decompression in a hypo- and hyperbaric pressure chamber. The ET function reflecting parameters-ET opening pressure (ETOP), ET opening duration (ETOD) and ET opening frequency (ETOF)-were determined under exactly the same preconditions three times in a row. The intraclass correlation coefficient (ICC) and Bland and Altman plot were used to assess test-retest reliability. ICCs revealed a high correlation for ETOP and ETOF in phases of decompression (passive equalisation) as well as ETOD and ETOP in phases of compression (active induced equalisation). Very high correlation could be shown for ETOD in decompression and ETOF in compression phases. The Bland and Altman graphs could show that measurements provide results within a 95 % confidence interval in compression and decompression phases. We conclude that measurements in a pressure chamber are a very valuable tool in terms of estimating the ET opening and closing function. Measurements show some variance comparing participants, but provide reliable results within a 95 % confidence interval in retest. This study is the basis for enabling efficacy measurements of ET treatment modalities. © 2017 John Wiley & Sons Ltd.
Reliability and Validity Assessment of a Linear Position Transducer

Directory of Open Access Journals (Sweden)

Manuel V. Garnacho-Castaño

2015-03-01

Full Text Available The objectives of the study were to determine the validity and reliability of peak velocity (PV, average velocity (AV, peak power (PP and average power (AP measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain during two resistance exercises, bench press (BP and full back squat (BS, performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2. Intraclass correlation coefficients (ICCs indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W. Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W. Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP make this device a useful tool for monitoring resistance training.
Reliability of sonographic assessment of tendinopathy in tennis elbow.

Science.gov (United States)

Poltawski, Leon; Ali, Syed; Jayaram, Vijay; Watson, Tim

2012-01-01

To assess the reliability and compute the minimum detectable change using sonographic scales to quantify the extent of pathology and hyperaemia in the common extensor tendon in people with tennis elbow. The lateral elbows of 19 people with tennis elbow were assessed sonographically twice, 1-2 weeks apart. Greyscale and power Doppler images were recorded for subsequent rating of abnormalities. Tendon thickening, hypoechogenicity, fibrillar disruption and calcification were each rated on four-point scales, and scores were summed to provide an overall rating of structural abnormality; hyperaemia was scored on a five point scale. Inter-rater reliability was established using the intraclass correlation coefficient (ICC) to compare scores assigned independently to the same set of images by a radiologist and a physiotherapist with training in musculoskeletal imaging. Test-retest reliability was assessed by comparing scores assigned by the physiotherapist to images recorded at the two sessions. The minimum detectable change (MDC) was calculated from the test-retest reliability data. ICC values for inter-rater reliability ranged from 0.35 (95% CI: 0.05, 0.60) for fibrillar disruption to 0.77 (0.55, 0.88) for overall greyscale score, and 0.89 (0.79, 0.95) for hyperaemia. Test-retest reliability ranged from 0.70 (0.48, 0.84) for tendon thickening to 0.82 (0.66, 0.90) for overall greyscale score and 0.86 (0.73, 0.93) for calcification. The MDC for the greyscale total score was 2.0/12 and for the hyperaemia score was 1.1/5. The sonographic scoring system used in this study may be used reliably to quantify tendon abnormalities and change over time. A relatively inexperienced imager can conduct the assessment and use the rating scales reliably.
Test-retest reliability of Physical Activity Neighborhood Environment Scale among urban men and women in Nanjing, China.

Science.gov (United States)

Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F

2018-03-01

The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. Intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. Totally, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.

Reliability and convergent validity of the five-step test in people with chronic stroke.

Science.gov (United States)

Ng, Shamay S M; Tse, Mimi M Y; Tam, Eric W C; Lai, Cynthia Y Y

2018-01-10

(i) To estimate the intra-rater, inter-rater and test-retest reliabilities of the Five-Step Test (FST), as well as the minimum detectable change in FST completion times in people with stroke. (ii) To estimate the convergent validity of the FST with other measures of stroke-specific impairments. (iii) To identify the best cut-off times for distinguishing FST performance in people with stroke from that of healthy older adults. A cross-sectional study. University-based rehabilitation centre. Forty-eight people with stroke and 39 healthy controls. None. The FST, along with (for the stroke survivors only) scores on the Fugl-Meyer Lower Extremity Assessment (FMA-LE), the Berg Balance Scale (BBS), Limits of Stability (LOS) tests, and Activities-specific Balance Confidence (ABC) scale were tested. The FST showed excellent intra-rater (intra-class correlation coefficient; ICC = 0.866-0.905), inter-rater (ICC = 0.998), and test-retest (ICC = 0.838-0.842) reliabilities. A minimum detectable change of 9.16 s was found for the FST in people with stroke. The FST correlated significantly with the FMA-LE, BBS, and LOS results in the forward and sideways directions (r = -0.411 to -0.716, p people with stroke and healthy older adults. The FST is a reliable, easy-to-administer clinical test for assessing stroke survivors' ability to negotiate steps and stairs.
Reliability of cervical lordosis and global sagittal spinal balance measurements in adolescent idiopathic scoliosis.

Science.gov (United States)

Vidal, Christophe; Ilharreborde, Brice; Azoulay, Robin; Sebag, Guy; Mazda, Keyvan

2013-06-01

Radiological reproducibility study. To assess intra and interobserver reliability of radiographic measurements for global sagittal balance parameters and sagittal spine curves, including cervical spine. Sagittal spine balance in adolescent idiopathic scoliosis (AIS) is a main issue and many studies have been reported, showing that coronal and sagittal deformities often involve sagittal cervical unbalance. Global sagittal balance aims to obtain a horizontal gaze and gravity line at top of hips when subject is in a static position, involving adjustment of each spine curvature in the sagittal plane. To our knowledge, no study did use a methodologically validated imaging analysis tool able to appreciate sagittal spine contours and distances in AIS and especially in the cervical region. Lateral full-spine low-dose EOS radiographs were performed in 75 patients divided in three groups (control subjects, AIS, operated AIS). Three observers digitally analyzed twice each radiograph and 11 sagittal measures were collected for each image. Reliability was assessed calculating intraobserver Pearson's r correlation coefficient, interobserver intra-class correlation coefficient (ICC) completed with a two-by-two Bland-Altman plot analysis. This measurement method has shown excellent intra and interobserver reliability in all parameters, sagittal curvatures, pelvic parameters and global sagittal balance. This study validated a simple and efficient tool in AIS sagittal contour analysis. It defined new relevant landmarks allowing to characterize cervical segmental curvatures and cervical involvement in global balance.
Validity and Reliability of Visual Analog Scale Foot and Ankle: The Turkish Version.

Science.gov (United States)

Gur, Gozde; Turgut, Elif; Dilek, Burcu; Baltaci, Gul; Bek, Nilgun; Yakut, Yavuz

The present study tested the reliability and validity of the Turkish version of the visual analog scale foot and ankle (VAS-FA) among healthy subjects and patients with foot problems. A total of 128 participants, 65 healthy subjects and 63 patients with foot problems, were evaluated. The VAS-FA was translated into Turkish and administered to the 128 subjects on 2 separate occasions with a 5-day interval. The test-retest reliability and internal consistency were assessed with the intraclass correlation coefficient and Cronbach's α. The validity was assessed using the correlations with Turkish versions of the Foot Function Index, the Foot and Ankle Outcome Score, and the Short-Form 36-item Health Survey. A statistically significant difference was found between the healthy group and the patient group in the overall score and subscale scores of the VAS-FA (p Foot Function Index, Foot and Ankle Outcome Score, and Short-Form 36-item Health Survey scores in the healthy and patient groups both. The Turkish version of the VAS-FA is sensitive enough to distinguish foot and ankle-specific pathologic conditions from asymptomatic conditions. The Turkish version of the VAS-FA is a reliable and valid method and can be used for foot-related problems. Copyright © 2017 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Test-Retest Reliability of Measurements of Hand-Grip Strength Obtained by Dynamometry from Older Adults: A Systematic Review of Research in the PubMed Database.

Science.gov (United States)

Bohannon, R W

2017-01-01

A systematic review was performed to summarize literature describing the test-retest reliability of grip strength measures obtained from older adults. Relevant literature was identified via a PubMed search. Seventeen articles were deemed appropriate based on inclusion and exclusion criteria. The relative test-retest reliability of grip strength measures obtained by dynamometry was good to excellent (intra-class correlation coefficients > 0.80) in all but 3 studies, which involved older adults with severe dementia. Absolute reliability, as indicated by summary statistics such as the minimum detectable change (95%), was more variable. As a percentage, that change ranged from 14.5% to 98.5%. Consequently, clinicians can be confident in the relative reliability of grip strength measures obtained from at risk older adults. However, relatively large percentage changes in grip strength may be necessary to conclude with confidence that a real change has occurred over time in some populations.
Reliability and validity of the foot and ankle outcome score: a validation study from Iran.

Science.gov (United States)

Negahban, Hossein; Mazaheri, Masood; Salavati, Mahyar; Sohani, Soheil Mansour; Askari, Marjan; Fanian, Hossein; Parnianpour, Mohamad

2010-05-01

The aims of this study were to culturally adapt and validate the Persian version of Foot and Ankle Outcome Score (FAOS) and present data on its psychometric properties for patients with different foot and ankle problems. The Persian version of FAOS was developed after a standard forward-backward translation and cultural adaptation process. The sample included 93 patients with foot and ankle disorders who were asked to complete two questionnaires: FAOS and Short-Form 36 Health Survey (SF-36). To determine test-retest reliability, 60 randomly chosen patients completed the FAOS again 2 to 6 days after the first administration. Test-retest reliability and internal consistency were assessed using intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. To evaluate convergent and divergent validity of FAOS compared to similar and dissimilar concepts of SF-36, the Spearman's rank correlation was used. Dimensionality was determined by assessing item-subscale correlation corrected for overlap. The results of test-retest reliability show that all the FAOS subscales have a very high ICC, ranging from 0.92 to 0.96. The minimum Cronbach's alpha level of 0.70 was exceeded by most subscales. The Spearman's correlation coefficient for convergent construct validity fell within 0.32 to 0.58 for the main hypotheses presented a priori between FAOS and SF-36 subscales. For dimensionality, the minimum Spearman's correlation coefficient of 0.40 was exceeded by most items. In conclusion, the results of our study show that the Persian version of FAOS seems to be suitable for Iranian patients with various foot and ankle problems especially lateral ankle sprain. Future studies are needed to establish stronger psychometric properties for patients with different foot and ankle problems.
Reliability and validity of the Spanish version of the 10-item Connor-Davidson Resilience Scale (10-item CD-RISC in young adults

Directory of Open Access Journals (Sweden)

García-Campayo Javier

2011-08-01

Full Text Available Abstract Background The 10-item Connor-Davidson Resilience Scale (10-item CD-RISC is an instrument for measuring resilience that has shown good psychometric properties in its original version in English. The aim of this study was to evaluate the validity and reliability of the Spanish version of the 10-item CD-RISC in young adults and to verify whether it is structured in a single dimension as in the original English version. Findings Cross-sectional observational study including 681 university students ranging in age from 18 to 30 years. The number of latent factors in the 10 items of the scale was analyzed by exploratory factor analysis. Confirmatory factor analysis was used to verify whether a single factor underlies the 10 items of the scale as in the original version in English. The convergent validity was analyzed by testing whether the mean of the scores of the mental component of SF-12 (MCS and the quality of sleep as measured with the Pittsburgh Sleep Index (PSQI were higher in subjects with better levels of resilience. The internal consistency of the 10-item CD-RISC was estimated using the Cronbach α test and test-retest reliability was estimated with the intraclass correlation coefficient. The Cronbach α coefficient was 0.85 and the test-retest intraclass correlation coefficient was 0.71. The mean MCS score and the level of quality of sleep in both men and women were significantly worse in subjects with lower resilience scores. Conclusions The Spanish version of the 10-item CD-RISC showed good psychometric properties in young adults and thus can be used as a reliable and valid instrument for measuring resilience. Our study confirmed that a single factor underlies the resilience construct, as was the case of the original scale in English.
The Trojan Lifetime Champions Health Survey: Development, Validity, and Reliability

Science.gov (United States)

Sorenson, Shawn C.; Romano, Russell; Scholefield, Robin M.; Schroeder, E. Todd; Azen, Stanley P.; Salem, George J.

2015-01-01

Context Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. Objective To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Design Descriptive laboratory study. Setting A large National Collegiate Athletic Association Division I university. Patients or Other Participants A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Intervention(s) Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Main Outcome Measure(s) Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Results Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent
Reliability of internal oblique elbow radiographs for measuring displacement of medial epicondyle humerus fractures: a cadaveric study.

Science.gov (United States)

Gottschalk, Hilton P; Bastrom, Tracey P; Edmonds, Eric W

2013-01-01

Standard elbow radiographs (AP and lateral views) are not accurate enough to measure true displacement of medial epicondyle fractures of the humerus. The amount of perceived displacement has been used to determine treatment options. This study assesses the utility of internal oblique radiographs for measurement of true displacement in these fractures. A medial epicondyle fracture was created in a cadaveric specimen. Displacement of the fragment (mm) was set at 5, 10, and 15 in line with the vector of the flexor pronator mass. The fragment was sutured temporarily in place. Radiographs were obtained at 0 (AP), 15, 30, 45, 60, 75, and 90 degrees (lateral) of internal rotation, with the elbow in set positions of flexion. This was done with and without radio-opaque markers placed on the fragment and fracture bed. The 45 and 60 degrees internal oblique radiographs were then presented to 5 separate reviewers (of different levels of training) to evaluate intraobserver and interobserver agreement. Change in elbow position did not affect the perceived displacement (P=0.82) with excellent intraobserver reliability (intraclass correlation coefficient range, 0.979 to 0.988) and interobserver agreement of 0.953. The intraclass correlation coefficient for intraobserver reliability on 45 degrees internal oblique films for all groups ranged from 0.985 to 0.998, with interobserver agreement of 0.953. For predicting displacement, the observers were 60% accurate in predicting the true displacement on the 45 degrees internal oblique films and only 35% accurate using the 60 degrees internal oblique view. Standardizing to a 45 degrees internal oblique radiograph of the elbow (regardless of elbow flexion) can augment the treating surgeon's ability to determine true displacement. At this degree of rotation, the measured number can be multiplied by 1.4 to better estimate displacement. The addition of a 45 degrees internal oblique radiograph in medial humeral epicondyle fractures has good
Reliability of peripheral arterial tonometry in patients with heart failure, diabetic nephropathy and arterial hypertension.

Science.gov (United States)

Weisrock, Fabian; Fritschka, Max; Beckmann, Sebastian; Litmeier, Simon; Wagner, Josephine; Tahirovic, Elvis; Radenovic, Sara; Zelenak, Christine; Hashemi, Djawid; Busjahn, Andreas; Krahn, Thomas; Pieske, Burkert; Dinh, Wilfried; Düngen, Hans-Dirk

2017-08-01

Endothelial dysfunction plays a major role in cardiovascular diseases and pulse amplitude tonometry (PAT) offers a non-invasive way to assess endothelial dysfunction. However, data about the reliability of PAT in cardiovascular patient populations are scarce. Thus, we evaluated the test-retest reliability of PAT using the natural logarithmic transformed reactive hyperaemia index (LnRHI). Our cohort consisted of 91 patients (mean age: 65±9.7 years, 32% female), who were divided into four groups: those with heart failure with preserved ejection fraction (HFpEF) ( n=25), heart failure with reduced ejection fraction (HFrEF) ( n=22), diabetic nephropathy ( n=21), and arterial hypertension ( n=23). All subjects underwent two separate PAT measurements at a median interval of 7 days (range 4-14 days). LnRHI derived by PAT showed good reliability in subjects with diabetic nephropathy (intra-class correlation (ICC) = 0.863) and satisfactory reliability in patients with both HFpEF (ICC = 0.557) and HFrEF (ICC = 0.576). However, in subjects with arterial hypertension, reliability was poor (ICC = 0.125). We demonstrated that PAT is a reliable technique to assess endothelial dysfunction in adults with diabetic nephropathy, HFpEF or HFrEF. However, in subjects with arterial hypertension, we did not find sufficient reliability, which can possibly be attributed to variations in heart rate and the respective time of the assessments. Clinical Trial Registration Identifier: NCT02299960.
Reliability and validity of an internet-based questionnaire measuring lifetime physical activity.

Science.gov (United States)

De Vera, Mary A; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek

2010-11-15

Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005-2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity.
Interrater reliability assessment using the Test of Gross Motor Development-2.

Science.gov (United States)

Barnett, Lisa M; Minto, Christine; Lander, Natalie; Hardy, Louise L

2014-11-01

The aim was to examine interrater reliability of the object control subtest from the Test of Gross Motor Development-2 by live observation in a school field setting. Reliability Study--cross sectional. Raters were rated on their ability to agree on (1) the raw total for the six object control skills; (2) each skill performance and (3) the skill components. Agreement for the object control subtest and the individual skills was assessed by an intraclass correlation (ICC) and a kappa statistic assessed for skill component agreement. A total of 37 children (65% girls) aged 4-8 years (M = 6.2, SD = 0.8) were assessed in six skills by two raters; equating to 222 skill tests. Interrater reliability was excellent for the object control subset (ICC = 0.93), and for individual skills, highest for the dribble (ICC = 0.94) followed by strike (ICC = 0.85), overhand throw (ICC = 0.84), underhand roll (ICC = 0.82), kick (ICC = 0.80) and the catch (ICC = 0.71). The strike and the throw had more components with less agreement. Even though the overall subtest score and individual skill agreement was good, some skill components had lower agreement, suggesting these may be more problematic to assess. This may mean some skill components need to be specified differently in order to improve component reliability. Crown Copyright © 2013. Published by Elsevier Ltd. All rights reserved.
Testing the reliability of the Fall Risk Screening Tool in an elderly ambulatory population.

Science.gov (United States)

Fielding, Susan J; McKay, Michael; Hyrkas, Kristiina

2013-11-01

To identify and test the reliability of a fall risk screening tool in an ambulatory outpatient clinic. The Fall Risk Screening Tool (Albert Lea Medical Center, MN, USA) was scripted for an interview format. Two interviewers separately screened a convenience sample of 111 patients (age ≥ 65 years) in an ambulatory outpatient clinic in a northeastern US city. The interviewers' scoring of fall risk categories was similar. There was good internal consistency (Cronbach's α = 0.834-0.889) and inter-rater reliability [intra-class correlation coefficients (ICC) = 0.824-0.881] for total, Risk Factor and Client's Health Status subscales. The Physical Environment scores indicated acceptable internal consistency (Cronbach's α = 0.742) and adequate reliability (ICC = 0.688). Two Physical Environment items (furniture and medical equipment condition) had low reliabilities [Kappa (K) = 0.323, P = 0.08; K = -0.078, P = 0.648), respectively. The scripted Fall Risk Screening Tool demonstrated good reliability in this sample. Rewording two Physical Environment items will be considered. A reliable instrument such as the scripted Fall Risk Screening Tool provides a standardised assessment for identifying high fall risk patients. This tool is especially useful because it assesses personal, behavioural and environmental factors specific to community-dwelling patients; the interview format also facilitates patient-provider interaction. © 2013 John Wiley & Sons Ltd.
Reliability of the Superimposed-Burst Technique in Patients With Patellofemoral Pain: A Technical Report.

Science.gov (United States)

Norte, Grant E; Frye, Jamie L; Hart, Joseph M

2015-11-01

The superimposed-burst (SIB) technique is commonly used to quantify central activation failure after knee-joint injury, but its reliability has not been established in pathologic cohorts. To assess within-session and between-sessions reliability of the SIB technique in patients with patellofemoral pain. Descriptive laboratory study. University laboratory. A total of 10 patients with self-reported patellofemoral pain (1 man, 9 women; age = 24.1 ± 3.8 years, height = 167.8 ± 15.2 cm, mass = 71.6 ± 17.5 kg) and 10 healthy control participants (3 men, 7 women; age = 27.4 ± 5.0 years, height = 173.5 ± 9.9 cm, mass = 78.2 ± 16.5 kg) volunteered. Participants were assessed at 6 intervals spanning 21 days. Intraclass correlation coefficients (ICCs [3,3]) were used to assess reliability. Quadriceps central activation ratio, knee-extension maximal voluntary isometric contraction force, and SIB force. The quadriceps central activation ratio was highly reliable within session (ICC [3,3] = 0.97) and between sessions through day 21 (ICC [3,3] = 0.90-0.95). Acceptable reliability of knee extension (ICC [3,3] = 0.75-0.91) and SIB force (ICC [3,3] = 0.77-0.89) was observed through day 21. The SIB technique was reliable for clinical research up to 21 days in patients with patellofemoral pain.
Validity and reliability of 3D US for the detection of erosions in patients with rheumatoid arthritis using MRI as the gold standard

DEFF Research Database (Denmark)

Ellegaard, K; Bliddal, H; Møller Døhn, U

2014-01-01

PURPOSE: To test the reliability and validity of a 3D US erosion score in RA using MRI as the gold standard. MATERIALS AND METHODS: RA patients were examined with 3D US and 3 T MRI over the 2nd and 3rd metacarpophalangeal joints. 3D blocks were evaluated by two investigators. The erosions were...... estimated according to a semi-quantitative score (SQS) (0 - 3) and a quantitative score (QS) (mm²). MRI was evaluated according to the RAMRIS score. For the estimation of reliability, intra-class correlation coefficients (ICC) were used. Validity was tested using Spearman's rho (rs). The sensitivity...... and specificity were also calculated. RESULTS: 28 patients with RA were included. The ICC for the inter-observer reliability in the QS was 0.41 and 0.13 for the metacarpal bone and phalangeal bone, respectively, and 0.86 and 0.16, respectively, in the SQS. The ICC for the intra-observer reliability in the QS...
Cross-Cultural Adaptation, Validity, and Reliability of the Persian Version of the Orebro Musculoskeletal Pain Screening Questionnaire.

Science.gov (United States)

Shafeei, Asrin; Mokhtarinia, Hamid Reza; Maleki-Ghahfarokhi, Azam; Piri, Leila

2017-08-01

Observational study. To cross-culturally translate the Orebro Musculoskeletal Pain Screening Questionnaire (OMPQ) into Persian and then evaluate its psychometric properties (reliability, validity, ceiling, and flooring effects). To the authors' knowledge, prior to this study there has been no validated instrument to screen the risk of chronicity in Persian-speaking patients with low back pain (LBP) in Iran. The OMPQ was specifically developed as a self-administered screening tool for assessing the risk of LBP chronicity. The forward-backward translation method was used for the translation and cross-cultural adaptation of the original questionnaire. In total, 202 patients with subacute LBP completed the OMPQ and the pain disability questionnaire (PDQ), which was used to assess convergent validity. 62 patients completed the OMPQ a week later as a retest. Slight changes were made to the OMPQ during the translation/cultural adaptation process; face validity of the Persian version was obtained. The Persian OMPQ showed excellent test-retest reliability (intraclass correlation coefficient=0.89). Its internal consistency was 0.71, and its convergent validity was confirmed by good correlation coefficient between the OMPQ and PDQ total scores ( r =0.72, p validity, construct validity, reliability, and consistency. It is therefore considered a useful instrument for screening Iranian patients with LBP.
Reliable and valid assessment of Lichtenstein hernia repair skills.

Science.gov (United States)

Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L

2014-08-01

Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups which indicates construct validity, p fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.
The bridge crane mechanism shaft reliability calculating in case of the fatigue fracture parameters correlation

Directory of Open Access Journals (Sweden)

Krutitskiy M.N.

2016-03-01

Full Text Available The method of statistical tests examines the impact of the correlation of the parameters of fatigue-such as the durability of the shaft mechanism of an overhead traveling crane for General use is under consideration in this article. It is be-lieved that the normal and shear stresses together affect the overall durability of the shaft. There may be a correlation between endurance limits and coefficients of block similarity of loading. To calculate resource used corrected linear theory of fatigue damage accumulation. Parameters on the reliability are computed after building the function, the reli-ability function directly or through private functions the reliability function for each type of stress.
Cross-cultural adaptation and validation of the reliability of the Thai version of the Hip disability and Osteoarthritis Outcome Score (HOOS).

Science.gov (United States)

Trathitiphan, Warayos; Paholpak, Permsak; Sirichativapee, Winai; Wisanuyotin, Taweechok; Laupattarakasem, Pat; Sukhonthamarn, Kamolsak; Jeeravipoolvarn, Polasak; Kosuwon, Weerachai

2016-10-01

HOOS was developed as an extension of the Western Ontario and McMaster Universities' Osteoarthritis Index questionnaire for measuring symptoms and functional limitations related to the hip(s) of patients with osteoarthritis. To determine the validity and reliability of the Thai version of the Hip disability and Osteoarthritis Outcome Score (HOOS) vis-à-vis hip osteoarthritis, the original HOOS was translated into a Thai version of HOOS, according to international recommendations. Patients with hip osteoarthritis (n = 57; 25 males) were asked to complete the Thai version of HOOS twice: once then again after a 3-week interval. The test-retest reliability was analyzed using the intraclass correlation coefficient (ICC). Internal consistencies were analyzed using Cronbach's alpha, while the construct validity was tested by comparing the Thai HOOS with the Thai modified SF-36 and calculating the Spearman's rank correlation coefficients. The Thai HOOS produced good reliability (i.e., the ICC was greater than 0.9 in all five subscales). All of the Cronbach's alpha showed that the Thai HOOS had high internal consistency (Cronbach's alpha greater than 0.8), especially for the pain and ADL subscales (0.89 and 0.90, respectively). The Spearman's rank correlation for all five subscales of the Thai HOOS had moderate correlation with the Bodily Pain subscale of the Thai SF-36. The pain subscale of the Thai HOOS had a high correlation with the Vitality and Social Function subscales of the Thai SF-36 (r = 0.55 and 0.54)-with which the symptom subscale had a moderate correlation. The Thai version of HOOS had excellent internal consistency, excellent test-retest reliability, and good construct validity. It can be used as a reliable tool for assessing quality of life for patients with hip osteoarthritis in Thailand.
Reliability and cross-cultural validation of the Turkish version of Manual Ability Classification System (MACS) for children with cerebral palsy.

Science.gov (United States)

Akpinar, Pinar; Tezel, Canan G; Eliasson, Ann-Christin; Icagasioglu, Afitap

2010-01-01

To determine the reliability and cross-cultural validation of the Turkish translation of the Manual Ability Classification System (MACS) for children with cerebral palsy (CP) and to investigate the relation to gross motor function and other comorbidities. After the forward and backward translation procedures, inter-rater and test-retest reliability was assessed between parents, physiotherapists and physicians using the intra-class correlation coefficient (ICC). Children (N = 118, 4 to 18 years, mean age 9 years 4 months; 68 boys, 50 girls) with various types of CP were classified. Additional data on the Gross Motor Function Classification System (GMFCS), intellectual delay, visual acuity, and epilepsy were collected. The inter-rater reliability was high; the ICC ranged from 0.89 to 0.96 among different professionals and parents. Between two persons of the same profession it ranged from 0.97 to 0.98. For the test-retest reliability it ranged from 0.91 to 0.98. Total agreement between the GMFCS and the MACS occurred in only 45% of the children. The level of the MACS was found to correlate with the accompanying comorbidities, namely intellectual delay and epilepsy. The Turkish version of the MACS is found to be valid and reliable, and is suggested to be appropriate for the assessment of manual ability within the Turkish population.
A Comparison of Three Methods for the Analysis of Skin Flap Viability: Reliability and Validity.

Science.gov (United States)

Tim, Carla Roberta; Martignago, Cintia Cristina Santi; da Silva, Viviane Ribeiro; Dos Santos, Estefany Camila Bonfim; Vieira, Fabiana Nascimento; Parizotto, Nivaldo Antonio; Liebano, Richard Eloin

2018-05-01

Objective: Technological advances have provided new alternatives to the analysis of skin flap viability in animal models; however, the interrater validity and reliability of these techniques have yet to be analyzed. The present study aimed to evaluate the interrater validity and reliability of three different methods: weight of paper template (WPT), paper template area (PTA), and photographic analysis. Approach: Sixteen male Wistar rats had their cranially based dorsal skin flap elevated. On the seventh postoperative day, the viable tissue area and the necrotic area of the skin flap were recorded using the paper template method and photo image. The evaluation of the percentage of viable tissue was performed using three methods, simultaneously and independently by two raters. The analysis of interrater reliability and viability was performed using the intraclass correlation coefficient and Bland Altman Plot Analysis was used to visualize the presence or absence of systematic bias in the evaluations of data validity. Results: The results showed that interrater reliability for WPT, measurement of PTA, and photographic analysis were 0.995, 0.990, and 0.982, respectively. For data validity, a correlation >0.90 was observed for all comparisons made between the three methods. In addition, Bland Altman Plot Analysis showed agreement between the comparisons of the methods and the presence of systematic bias was not observed. Innovation: Digital methods are an excellent choice for assessing skin flap viability; moreover, they make data use and storage easier. Conclusion: Independently from the method used, the interrater reliability and validity proved to be excellent for the analysis of skin flaps' viability.

Translation, reliability, and clinical utility of the Melbourne Assessment 2.

Science.gov (United States)

Gerber, Corinna N; Plebani, Anael; Labruyère, Rob

2017-10-12

The aims were to (i) provide a German translation of the Melbourne Assessment 2 (MA2), a quantitative test to measure unilateral upper limb function in children with neurological disabilities and (ii) to evaluate its reliability and aspects of clinical utility. After its translation into German and approval of the back translation by the original authors, the MA2 was performed and videotaped twice with 30 children with neuromotor disorders. For each participant, two raters scored the video of the first test for inter-rater reliability. To determine test-retest reliability, one rater additionally scored the video of the second test while the other rater repeated the scoring of the first video to evaluate intra-rater reliability. Time needed for rater training, test administration, and scoring was recorded. The four subscale scores showed excellent intra-, inter-rater, and test-retest reliability with intraclass correlation coefficients of 0.90-1.00 (95%-confidence intervals 0.78-1.00). Score items revealed substantial to almost perfect intra-rater reliability (weighted kappa k w = 0.66-1.00) for the more affected side. Score item inter-rater and test-retest reliability of the same extremity were, with one exception, moderate to almost perfect (k w = 0.42-0.97; k w = 0.40-0.89). Furthermore, the MA2 was feasible and acceptable for patients and clinicians. The MA2 showed excellent subscale and moderate to almost perfect score item reliability. Implications for Rehabilitation There is a lack of high-quality studies about psychometric properties of upper limb measurement tools in the neuropediatric population. The Melbourne Assessment 2 is a promising tool for reliable measurement of unilateral upper limb movement quality in the neuropediatric population. The Melbourne Assessment 2 is acceptable and practicable to therapists and patients for routine use in clinical care.
Reliability and construct validity of a new Danish translation of the Prosthesis Evaluation Questionnaire in a population of Danish amputees

DEFF Research Database (Denmark)

Christensen, Jan; Doherty, Patrick; Bjorner, Jakob Bue

2017-01-01

. Estimates for standard error of measurement (SEM) were calculated based on reliability estimates. Construct validity was evaluated by testing using hypotheses testing. Results: Reliability estimates (ICC/Cronbach’s alpha) for the nine subscales were: Social Burden (0.85/0.76), Appearance (0....... Methods: Lower limb amputees responded to electronic versions of the PEQ and SF-36v2 at baseline (n=64), after two weeks (n=51), and after 12 weeks (n=50). Reliability was assessed using Cronbach’s alpha and intraclass correlation coefficient (ICC) analyses of the baseline and two weeks test-retest data.......85/0.72), Residual Limb Health (0.80/0.69), Well-Being (0.78/0.90), Utility (0.76/0.89), Frustration (0.74/0.90), Perceived Response (0.62/0.80), Ambulation (0.61/0.94), Sounds (0.51/0.65). Construct validity was supported in three out of four subscales evaluated. Conclusions: The subscales Social Burden, Appearance...
The reliability of repeated TMS measures in older adults and in patients with subacute and chronic stroke

Directory of Open Access Journals (Sweden)

Heidi M. Schambra

2015-09-01

Full Text Available The reliability of transcranial magnetic stimulation (TMS measures in healthy older adults and stroke patients has been insufficiently characterized. We determined whether common TMS measures could reliably evaluate change in individuals and in groups using the smallest detectable change (SDC, or could tell subjects apart using the intraclass correlation coefficient (ICC. We used a single-rater test-retest design in older healthy, subacute stroke, and chronic stroke subjects. At twice daily sessions on two consecutive days, we recorded resting motor threshold, test stimulus intensity, recruitment curves, short-interval intracortical inhibition and facilitation, and long-interval intracortical inhibition. Using variances estimated from a random effects model, we calculated the SDC and ICC for each TMS measure. For all TMS measures in all groups, SDCs for single subjects were large; only with modest group sizes did the SDCs become low. Thus, while these TMS measures cannot be reliably used as a biomarker to detect individual change, they can reliably detect change exceeding measurement noise in moderate-sized groups. For several of the TMS measures, ICCs were universally high, suggesting that they can reliably discriminate between subjects. Though most TMS measures have sufficient reliability in particular contexts, work establishing their validity, responsiveness, and clinical relevance is still needed.
Validity and reliability of isometric, isokinetic and isoinertial modalities for the assessment of quadriceps muscle strength in patients with total knee arthroplasty.

Science.gov (United States)

Lienhard, K; Lauermann, S P; Schneider, D; Item-Glatthorn, J F; Casartelli, N C; Maffiuletti, N A

2013-12-01

Reliability of isometric, isokinetic and isoinertial modalities for quadriceps strength evaluation, and the relation between quadriceps strength and physical function was investigated in 29 total knee arthroplasty (TKA) patients, with an average age of 63 years. Isometric maximal voluntary contraction torque, isokinetic peak torque, and isoinertial one-repetition maximum load of the involved and uninvolved quadriceps were evaluated as well as objective (walking parameters) and subjective physical function (WOMAC). Reliability was good and comparable for the isometric, isokinetic, and isoinertial strength outcomes on both sides (intraclass correlation coefficient range: 0.947-0.966; standard error of measurement range: 5.1-9.3%). Involved quadriceps strength was significantly correlated to walking speed (r range: 0.641-0.710), step length (r range: 0.685-0.820) and WOMAC function (r range: 0.575-0.663), independent from the modality (P strength was also significantly correlated to walking speed (r range: 0.413-0.539), step length (r range: 0.514-0.608) and WOMAC function (r range: 0.374-0.554) (P 0.05). In conclusion, isometric, isokinetic, and isoinertial modalities ensure valid and reliable assessment of quadriceps muscle strength in TKA patients. Copyright © 2013 Elsevier Ltd. All rights reserved.
Reliability of the Balance Evaluation Systems Test (BESTest) and BESTest sections for adults with hemiparesis

Science.gov (United States)

Rodrigues, Letícia C.; Marques, Aline P.; Barros, Paula B.; Michaelsen, Stella M.

2014-01-01

BACKGROUND: The Balance Evaluation Systems Test (BESTest) was recently created to allow the development of treatments according to the specific balance system affected in each patient. The Brazilian version of the BESTest has not been specifically tested after stroke. OBJECTIVE: To evaluate the intra- and inter-rater reliability and concurrent and convergent validity of the total score of the BESTest and BESTest sections for adults with hemiparesis after stroke. METHOD: The study included 16 subjects (61.1±7.5 years) with chronic hemiparesis (54.5±43.5 months after stroke). The BESTest was administered by two raters in the same week and one of the raters repeated the test after a one-week interval. Intraclass correlation coefficient (ICC) was calculated to assess intra- and interrater reliability. Concurrent validity with the Berg Balance Scale (BBS) and convergent validity with the Activities-specific Balance Confidence scale (ABC-Brazil) were assessed using Pearson's correlation coefficient. RESULTS: Both the BESTest total score (ICC=0.98) and the BESTest sections (ICC between 0.85 and 0.96) have excellent intrarater reliability. Interrater reliability for the total score was excellent (ICC=0.93) and, for the sections, it ranged between 0.71 and 0.94. The correlation coefficient between the BESTest and the BBS and ABC-Brazil were 0.78 and 0.59, respectively. CONCLUSIONS: The Brazilian version of the BESTest demonstrated adequate reliability when measured by sections and could identify what balance system was affected in patients after stroke. Concurrent validity was excellent with the BBS total score and good to excellent with the sections. The total scores but not the sections present adequate convergent validity with the ABC-Brazil. However, other psychometric properties should be further investigated. PMID:25003281
The reliability of WorkWell Systems Functional Capacity Evaluation: a systematic review

Science.gov (United States)

2014-01-01

Background Functional capacity evaluation (FCE) determines a person’s ability to perform work-related tasks and is a major component of the rehabilitation process. The WorkWell Systems (WWS) FCE (formerly known as Isernhagen Work Systems FCE) is currently the most commonly used FCE tool in German rehabilitation centres. Our systematic review investigated the inter-rater, intra-rater and test-retest reliability of the WWS FCE. Methods We performed a systematic literature search of studies on the reliability of the WWS FCE and extracted item-specific measures of inter-rater, intra-rater and test-retest reliability from the identified studies. Intraclass correlation coefficients ≥ 0.75, percentages of agreement ≥ 80%, and kappa coefficients ≥ 0.60 were categorised as acceptable, otherwise they were considered non-acceptable. The extracted values were summarised for the five performance categories of the WWS FCE, and the results were classified as either consistent or inconsistent. Results From 11 identified studies, 150 item-specific reliability measures were extracted. 89% of the extracted inter-rater reliability measures, all of the intra-rater reliability measures and 96% of the test-retest reliability measures of the weight handling and strength tests had an acceptable level of reliability, compared to only 67% of the test-retest reliability measures of the posture/mobility tests and 56% of the test-retest reliability measures of the locomotion tests. Both of the extracted test-retest reliability measures of the balance test were acceptable. Conclusions Weight handling and strength tests were found to have consistently acceptable reliability. Further research is needed to explore the reliability of the other tests as inconsistent findings or a lack of data prevented definitive conclusions. PMID:24674029
Reliability and Validity Assessment of a Linear Position Transducer

Science.gov (United States)

Garnacho-Castaño, Manuel V.; López-Lastra, Silvia; Maté-Muñoz, José L.

2015-01-01

The objectives of the study were to determine the validity and reliability of peak velocity (PV), average velocity (AV), peak power (PP) and average power (AP) measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain) during two resistance exercises, bench press (BP) and full back squat (BS), performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2). Intraclass correlation coefficients (ICCs) indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W). Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W). Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP) make this device a useful tool for monitoring resistance training. Key points This study determined the validity and reliability of peak velocity, average velocity, peak power and average power measurements made using a linear position transducer The Tendo Weight-lifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and power. PMID:25729300
Validity and reliability of the TED-QOL: a new three-item questionnaire to assess quality of life in thyroid eye disease.

Science.gov (United States)

Fayers, Tessa; Dolman, Peter J

2011-12-01

To develop and test a user-friendly questionnaire for rapidly assessing quality of life (QOL) in thyroid eye disease (TED). A three-item questionnaire, the TED-QOL, was designed and compared to the 16-item Graves Ophthalmopathy (GO)-QOL and the nine-item GO-Quality of Life Scale (QLS). 100 patients with TED were administered all three questionnaires on two occasions. Results were compared to clinical severity scores (Vision, Inflammation, Strabismus, Appearance (VISA) classification). Main outcomes were construct and criterion validity, test-retest reliability, duration, comprehension and completion rates. TED-QOL correlated strongly with the other questionnaires for corresponding items (Pearson correlation: appearance 0.71, 0.62; functioning 0.69, 0.66; overall QOL 0.53). Test-retest analysis demonstrated good reliability for all three questionnaires (intraclass correlations: TED-QOL 0.81, 0.74, 0.87; GO-QOL 0.81, 0.82; GO-QLS 0.74, 0.86, 0.67). TED-QOL was significantly faster to complete (1.6 min vs GO-QOL 3.1 min, GO-QLS 2.7 min, p<0.0001) and had a higher completion rate (100% vs GO-QOL 78%, GO-QLS 94%). There was only moderate correlation between items on all three questionnaires and VISA scores. The TED-QOL is rapid and easy to complete and analyse and has similar validity and reliability to longer questionnaires. All questionnaires showed only moderate correlation with disease severity, emphasising the discrepancy between objective and subjective assessments and the importance of measuring both.
Validity and reliability of Nintendo Wii Fit balance scores.

Science.gov (United States)

Wikstrom, Erik A

2012-01-01

Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Descriptive laboratory study. Sports medicine research laboratory. Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Participants completed a single-limb-stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with scores ranging from fair (ICC = 0.74) to poor (ICC = 0.29). Wii Fit balance activity scores had poor concurrent validity relative to COP outcomes and SEBT
Reliability of reflectance measures in passive filters

Science.gov (United States)

Saldiva de André, Carmen Diva; Afonso de André, Paulo; Rocha, Francisco Marcelo; Saldiva, Paulo Hilário Nascimento; Carvalho de Oliveira, Regiani; Singer, Julio M.

2014-08-01

Measurements of optical reflectance in passive filters impregnated with a reactive chemical solution may be transformed to ozone concentrations via a calibration curve and constitute a low cost alternative for environmental monitoring, mainly to estimate human exposure. Given the possibility of errors caused by exposure bias, it is common to consider sets of m filters exposed during a certain period to estimate the latent reflectance on n different sample occasions at a certain location. Mixed models with sample occasions as random effects are useful to analyze data obtained under such setups. The intra-class correlation coefficient of the mean of the m measurements is an indicator of the reliability of the latent reflectance estimates. Our objective is to determine m in order to obtain a pre-specified reliability of the estimates, taking possible outliers into account. To illustrate the procedure, we consider an experiment conducted at the Laboratory of Experimental Air Pollution, University of São Paulo, Brazil (LPAE/FMUSP), where sets of m = 3 filters were exposed during 7 days on n = 9 different occasions at a certain location. The results show that the reliability of the latent reflectance estimates for each occasion obtained under homoskedasticity is km = 0.74. A residual analysis suggests that the within-occasion variance for two of the occasions should be different from the others. A refined model with two within-occasion variance components was considered, yielding km = 0.56 for these occasions and km = 0.87 for the remaining ones. To guarantee that all estimates have a reliability of at least 80% we require measurements on m = 10 filters on each occasion.
Inter-tester reliability of selected clinical tests for long-lasting temporomandibular disorders.

Science.gov (United States)

Julsvoll, Elisabeth Heggem; Vøllestad, Nina Køpke; Opseth, Gro; Robinson, Hilde Stendal

2017-09-01

Clinical tests used to examine patients with temporomandibular disorders vary in methodological quality, and some are not tested for reliability. The purpose of this cross-sectional study was to evaluate inter-tester reliability of clinical tests and a cluster of tests used to examine patients with long-lasting temporomandibular disorders. Forty patients with pain in the temporomandibular area treated by health-professionals were included. They were between 18-70 years, had 65 symptomatic (33 right/32 left) and 15 asymptomatic joints. Two manual therapists examined all participants with selected tests. Percentage agreement and the kappa coefficient ( k ) with 95% confidence interval (CI) were used to evaluate the tests with categorical outcomes. For tests with continuous outcomes, the relative inter-tester reliability was assessed by the intraclass-correlation-coefficient (ICC 3,1 , 95% CI) and the absolute reliability was calculated by the smallest detectable change (SDC). The best reliability among single tests was found for the dental stick test, the joint-sound test ( k = 0.80-1.0) and range of mouth-opening (ICC 3,1 (95% CI) = 0.97 (0.95-0.98) and SDC = 4 mm). The reliability of cluster of tests was excellent with both four and five positive tests out of seven. The reliability was good to excellent for the clinical tests and the cluster of tests when performed by experienced therapists. The tests are feasible for use in the clinical setting. They require no advanced equipment and are easy to perform.
The validity and reliability of the Persian version of the Revised Fibromyalgia Impact Questionnaire.

Science.gov (United States)

Ghavidel Parsa, Banafsheh; Amir Maafi, Alireza; Haghdoost, Afrooz; Arabi, Yasaman; Khojamli, Monire; Chatrnour, Gelayol; Bidari, Ali

2014-02-01

The Revised Fibromyalgia Impact Questionnaire (FIQR), an updated version of the Fibromyalgia Impact Questionnaire (FIQ) achieved a better balance among different domains (i.e., function, overall impact, and symptom severity) and attempts to address the limitations of FIQ. As there is no Persian version of the FIQR available, we aimed to investigate the validity and reliability of a Persian translation of the FIQR in Iranian patients. After translating the FIQR into Persian, it was administered to 77 female patients with fibromyalgia syndrome. All of the patients filled out the questionnaire together with a Persian version of the FIQ, short form-12 (SF-12). The tender-point count was also calculated. One week later, FM patients filled out the Persian FIQR at their second visit. Reliability was analyzed by internal consistency and reproducibility including Cronbach's α coefficient and intra-class correlation coefficient. Construct validity was evaluated by Spearman's correlation coefficient and Pearson's correlation coefficient. Statistical analysis was performed using SPSS for Windows version 17.0. All patients included in this study were female, and the mean age was 38.23 ± 10.68 years. The total scores of the FIQR and FIQ were 49.77 ± 18.27 and 54.05 ± 14.00 that were closely correlated (r = 0.63, p FIQ domains (r = 0.36-0.63, p fibromyalgia.
Chest computed tomography-based scoring of thoracic sarcoidosis: Inter-rater reliability of CT abnormalities

Energy Technology Data Exchange (ETDEWEB)

Heuvel, D.A.V. den; Es, H.W. van; Heesewijk, J.P. van; Spee, M. [St. Antonius Hospital Nieuwegein, Department of Radiology, Nieuwegein (Netherlands); Jong, P.A. de [University Medical Center Utrecht, Department of Radiology, Utrecht (Netherlands); Zanen, P.; Grutters, J.C. [University Medical Center Utrecht, Division Heart and Lungs, Utrecht (Netherlands); St. Antonius Hospital Nieuwegein, Center of Interstitial Lung Diseases, Department of Pulmonology, Nieuwegein (Netherlands)

2015-09-15

To determine inter-rater reliability of sarcoidosis-related computed tomography (CT) findings that can be used for scoring of thoracic sarcoidosis. CT images of 51 patients with sarcoidosis were scored by five chest radiologists for various abnormal CT findings (22 in total) encountered in thoracic sarcoidosis. Using intra-class correlation coefficient (ICC) analysis, inter-rater reliability was analysed and reported according to the Guidelines for Reporting Reliability and Agreement Studies (GRRAS) criteria. A pre-specified sub-analysis was performed to investigate the effect of training. Scoring was trained in a distinct set of 15 scans in which all abnormal CT findings were represented. Median age of the 51 patients (36 men, 70 %) was 43 years (range 26 - 64 years). All radiographic stages were present in this group. ICC ranged from 0.91 for honeycombing to 0.11 for nodular margin (sharp versus ill-defined). The ICC was above 0.60 in 13 of the 22 abnormal findings. Sub-analysis for the best-trained observers demonstrated an ICC improvement for all abnormal findings and values above 0.60 for 16 of the 22 abnormalities. In our cohort, reliability between raters was acceptable for 16 thoracic sarcoidosis-related abnormal CT findings. (orig.)
The reliability and validity of hand-held refractometry water content measures of hydrogel lenses.

Science.gov (United States)

Nichols, Jason J; Mitchell, G Lynn; Good, Gregory W

2003-06-01

To investigate within- and between-examiner reliability and validity of hand-held refractometry water content measures of hydrogel lenses. Nineteen lenses of various nominal water contents were examined by two examiners on two occasions separated by 1 hour. An Atago N2 hand-held refractometer was used for all water content measures. Lenses were presented in a random order to each examiner by a third party, and examiners were masked to any potential lens identifiers. Intraclass correlation coefficients (ICC), 95% limits of agreement, and Wilcoxon signed rank test were used to characterize the within- and between-examiner reliability and validity of lens water content measures. Within-examiner reliability was excellent (ICC, 0.97; 95% limits of agreement, -3.6% to +5.7%), and the inter-visit mean difference of 1.1 +/- 2.4% was not biased (p = 0.08). Between-examiner reliability was also excellent (ICC, 0.98; 95% limits of agreement, -4.1% to +3.9%). The mean difference between examiners was -0.1 +/- 2.1% (p = 0.83). The mean difference between the nominally reported water content and our water content measures was -2.1 +/- 1.7% (p refractometry and is material dependent. Therefore, investigators may need to account for bias when measuring hydrogel lens water content via hand-held refractometry.
Chest computed tomography-based scoring of thoracic sarcoidosis: Inter-rater reliability of CT abnormalities

International Nuclear Information System (INIS)

Heuvel, D.A.V. den; Es, H.W. van; Heesewijk, J.P. van; Spee, M.; Jong, P.A. de; Zanen, P.; Grutters, J.C.

2015-01-01

To determine inter-rater reliability of sarcoidosis-related computed tomography (CT) findings that can be used for scoring of thoracic sarcoidosis. CT images of 51 patients with sarcoidosis were scored by five chest radiologists for various abnormal CT findings (22 in total) encountered in thoracic sarcoidosis. Using intra-class correlation coefficient (ICC) analysis, inter-rater reliability was analysed and reported according to the Guidelines for Reporting Reliability and Agreement Studies (GRRAS) criteria. A pre-specified sub-analysis was performed to investigate the effect of training. Scoring was trained in a distinct set of 15 scans in which all abnormal CT findings were represented. Median age of the 51 patients (36 men, 70 %) was 43 years (range 26 - 64 years). All radiographic stages were present in this group. ICC ranged from 0.91 for honeycombing to 0.11 for nodular margin (sharp versus ill-defined). The ICC was above 0.60 in 13 of the 22 abnormal findings. Sub-analysis for the best-trained observers demonstrated an ICC improvement for all abnormal findings and values above 0.60 for 16 of the 22 abnormalities. In our cohort, reliability between raters was acceptable for 16 thoracic sarcoidosis-related abnormal CT findings. (orig.)
Development, reliability, and validity testing of Toddler NutriSTEP: a nutrition risk screening questionnaire for children 18-35 months of age.

Science.gov (United States)

Randall Simpson, Janis; Gumbley, Jillian; Whyte, Kylie; Lac, Jane; Morra, Crystal; Rysdale, Lee; Turfryer, Mary; McGibbon, Kim; Beyers, Joanne; Keller, Heather

2015-09-01

Nutrition is vital for optimal growth and development of young children. Nutrition risk screening can facilitate early intervention when followed by nutritional assessment and treatment. NutriSTEP (Nutrition Screening Tool for Every Preschooler) is a valid and reliable nutrition risk screening questionnaire for preschoolers (aged 3-5 years). A need was identified for a similar questionnaire for toddlers (aged 18-35 months). The purpose was to develop a reliable and valid Toddler NutriSTEP. Toddler NutriSTEP was developed in 4 phases. Content and face validity were determined with a literature review, parent focus groups (n = 6; 48 participants), and experts (n = 13) (phase A). A draft questionnaire was refined with key intercept interviews of 107 parents/caregivers (phase B). Test-retest reliability (phase C), based on intra-class correlations (ICC), Kappa (κ) statistics, and Wilcoxon tests was assessed with 133 parents/caregivers. Criterion validity (phase D) was assessed using Receiver Operating Characteristic (ROC) curves by comparing scores on the Toddler NutriSTEP to a comprehensive nutritional assessment of 200 toddlers with a registered dietitian (RD). The Toddler NutriSTEP was reliable between 2 administrations (ICC = 0.951, F = 20.53, p Toddler NutriSTEP were correlated (r = 0.67, p Toddler NutriSTEP questionnaire is both reliable and valid for screening for nutritional risk in toddlers.
Reliability of cervical lordosis measurement techniques on long-cassette radiographs.

Science.gov (United States)

Janusz, Piotr; Tyrakowski, Marcin; Yu, Hailong; Siemionow, Kris

2016-11-01

Lateral radiographs are commonly used to assess cervical sagittal alignment. Three assessment methods have been described and are commonly utilized in clinical practice. These methods are described for perfect lateral cervical radiographs, however in everyday practice radiograph quality varies. The aim of this study was to compare the reliability and reproducibility of 3 cervical lordosis (CL) measurement methods. Forty-four standing lateral radiographs were randomly chosen from a lateral long-cassette radiograph database. Measurements of CL were performed with: Cobb method C2-C7 (CM), C2-C7 posterior tangent method (PTM), sum of posterior tangent method for each segment (SPTM). Three independent orthopaedic surgeons measured CL using the three methods on 44 lateral radiographs. One researcher used the three methods to measured CL three times at 4-week time intervals. Agreement between the methods as well as their intra- and interobserver reliability were tested and quantified by intraclass correlation coefficient (ICC) and median error for a single measurement (SEM). ICC of 0.75 or more reflected an excellent agreement/reliability. The results were compared with repeated ANOVA test, with p 0.05). All three methods appeared to be highly reliable. Although, high agreement between all measurement methods was shown, we do not recommend using Cobb measurement method interchangeably with PTM or SPTM within a single study as this could lead to error, whereas, such a comparison between tangent methods can be considered.
Interrater reliability of quantitative ultrasound using force feedback among examiners with varied levels of experience

Directory of Open Access Journals (Sweden)

Michael O. Harris-Love

2016-06-01

Full Text Available Background. Quantitative ultrasound measures are influenced by multiple external factors including examiner scanning force. Force feedback may foster the acquisition of reliable morphometry measures under a variety of scanning conditions. The purpose of this study was to determine the reliability of force-feedback image acquisition and morphometry over a range of examiner-generated forces using a muscle tissue-mimicking ultrasound phantom. Methods. Sixty material thickness measures were acquired from a muscle tissue mimicking phantom using B-mode ultrasound scanning by six examiners with varied experience levels (i.e., experienced, intermediate, and novice. Estimates of interrater reliability and measurement error with force feedback scanning were determined for the examiners. In addition, criterion-based reliability was determined using material deformation values across a range of examiner scanning forces (1–10 Newtons via automated and manually acquired image capture methods using force feedback. Results. All examiners demonstrated acceptable interrater reliability (intraclass correlation coefficient, ICC = .98, p .90, p < .001, independent of their level of experience. The measurement error among all examiners was 1.5%–2.9% across all applied stress conditions. Conclusion. Manual image capture with force feedback may aid the reliability of morphometry measures across a range of examiner scanning forces, and allow for consistent performance among examiners with differing levels of experience.
The reliability, validity, and feasibility of physical activity measurement in adults with traumatic brain injury: an observational study.

Science.gov (United States)

Hassett, Leanne; Moseley, Anne; Harmer, Alison; van der Ploeg, Hidde P

2015-01-01

To determine the reliability and validity of the Physical Activity Scale for Individuals with a Physical Disability (PASIPD) in adults with severe traumatic brain injury (TBI) and estimate the proportion of the sample participants who fail to meet the World Health Organization guidelines for physical activity. A single-center observational study recruited a convenience sample of 30 community-based ambulant adults with severe TBI. Participants completed the PASIPD on 2 occasions, 1 week apart, and wore an accelerometer (ActiGraph GT3X; ActiGraph LLC, Pensacola, Florida) for the 7 days between these 2 assessments. The PASIPD test-retest reliability was substantial (intraclass correlation coefficient = 0.85; 95% confidence interval, 0.70-0.92), and the correlation with the accelerometer ranged from too low to be meaningful (R = 0.09) to moderate (R = 0.57). From device-based measurement of physical activity, 56% of participants failed to meet the World Health Organization physical activity guidelines. The PASIPD is a reliable measure of the type of physical activity people with severe TBI participate in, but it is not a valid measure of the amount of moderate to vigorous physical activity in which they engage. Accelerometers should be used to quantify moderate to vigorous physical activity in people with TBI.
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

Science.gov (United States)

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (Test possessed low CV and significant (pTest possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

Manual muscle testing and hand-held dynamometry in people with inflammatory myopathy: An intra- and interrater reliability and validity study.

Science.gov (United States)

Baschung Pfister, Pierrette; de Bruin, Eling D; Sterkele, Iris; Maurer, Britta; de Bie, Rob A; Knols, Ruud H

2018-01-01

Manual muscle testing (MMT) and hand-held dynamometry (HHD) are commonly used in people with inflammatory myopathy (IM), but their clinimetric properties have not yet been sufficiently studied. To evaluate the reliability and validity of MMT and HHD, maximum isometric strength was measured in eight muscle groups across three measurement events. To evaluate reliability of HHD, intra-class correlation coefficients (ICC), the standard error of measurements (SEM) and smallest detectable changes (SDC) were calculated. To measure reliability of MMT linear Cohen`s Kappa was computed for single muscle groups and ICC for total score. Additionally, correlations between MMT8 and HHD were evaluated with Spearman Correlation Coefficients. Fifty people with myositis (56±14 years, 76% female) were included in the study. Intra-and interrater reliability of HHD yielded excellent ICCs (0.75-0.97) for all muscle groups, except for interrater reliability of ankle extension (0.61). The corresponding SEMs% ranged from 8 to 28% and the SDCs% from 23 to 65%. MMT8 total score revealed excellent intra-and interrater reliability (ICC>0.9). Intrarater reliability of single muscle groups was substantial for shoulder and hip abduction, elbow and neck flexion, and hip extension (0.64-0.69); moderate for wrist (0.53) and knee extension (0.49) and fair for ankle extension (0.35). Interrater reliability was moderate for neck flexion (0.54) and hip abduction (0.44); fair for shoulder abduction, elbow flexion, wrist and ankle extension (0.20-0.33); and slight for knee extension (0.08). Correlations between the two tests were low for wrist, knee, ankle, and hip extension; moderate for elbow flexion, neck flexion and hip abduction; and good for shoulder abduction. In conclusion, the MMT8 total score is a reliable assessment to consider general muscle weakness in people with myositis but not for single muscle groups. In contrast, our results confirm that HHD can be recommended to evaluate strength of
Development of the Italian version of the revised Scoliosis Research Society-22 Patient Questionnaire, SRS-22r-I: cross-cultural adaptation, factor analysis, reliability, and validity.

Science.gov (United States)

Monticone, Marco; Baiardi, Paola; Calabrò, David; Calabrò, Fabio; Foti, Calogero

2010-11-15

Evaluation of the psychometric properties of a translated and culturally adapted questionnaire. Translating, culturally adapting, and validating the Italian version of the revised Scoliosis Research Society-22 Patient Questionnaire (SRS-22r-I) in order to allow its use with Italian-speaking patients with adolescent idiopathic scoliosis (AIS). Increasing attention is being given to health-related quality of life measures as a means of adding information about the evaluation of AIS. A translated form of the revised SRS-22 has never been validated in Italian patients with AIS. The development of the SRS-22 questionnaire involved its translation and back-translation, a final review by an Expert Committee, and testing of the prefinal version to establish its correspondence to the original English version. Psychometric testing included factor analysis, reliability by internal consistency (Cronbach alpha) and test-retest repeatability (Intraclass Coefficient Correlation), and concurrent validity (Pearson correlation) by comparing the SRS-22r-I domains with the Short-Form Health Survey (SF-36) subscales. It took 4 months to develop a shared version of the SRS-22r-I, which proved to be satisfactorily acceptable when administered to 223 subjects with AIS. Factor analysis indicated a 4-factor solution (54% of the explained variance), and the questionnaire had an acceptable level of internal consistency (α = 0.77) and a high level of test-retest reliability (intraclass correlation coefficient = 0.957). In terms of concurrent validity, the correlations with the related Short-Form-36 subscales were moderate to good in the case of the Pain and Mental Health domains, and moderate in the case of the Function and Self-Image domains. The Italian translation of the SRS-22r has a good factorial structure and psychometric properties, and replicates the results of existing English versions of the questionnaire. Its use for research purposes can therefore be recommended.
Inter- and intra-rater reliability of 3D kinematics during maximum mouth opening of asymptomatic subjects.

Science.gov (United States)

Calixtre, Leticia Bojikian; Nakagawa, Theresa Helissa; Alburquerque-Sendín, Francisco; da Silva Grüninger, Bruno Leonardo; de Sena Rosa, Lianna Ramalho; Oliveira, Ana Beatriz

2017-11-07

Previous studies evaluated 3D human jaw movements using kinematic analysis systems during mouth opening, but information on the reliability of such measurements is still scarce. The purpose of this study was to analyze within- and between-session reliabilities, inter-rater reliability, standard error of measurement (SEM), minimum detectable change (MDC) and consistency of agreement across raters and sessions of 3D kinematic variables during maximum mouth opening (MMO). Thirty-six asymptomatic subjects from both genders were evaluated on two different days, five to seven days apart. Subjects performed three MMO movements while kinematic data were collected. Intraclass correlation coefficient (ICC), SEM and MDC were calculated for all variables, and Bland-Altman plots were constructed. Jaw radius and width were the most reproducible variables (ICC>0.81) and demonstrated minor error. Incisor displacement during MMO and angular movements in the sagittal plane presented good reliability (ICC from 0.61 to 0.8) and small errors and, consequently, could be used in future studies with the same methodology and population. The variables with smaller amplitudes (condylar translations during mouth opening and closing and mandibular movements on the frontal and transversal planes) were less reliable (ICCmandibular movements in the frontal and transversal planes. Copyright © 2017 Elsevier Ltd. All rights reserved.
Interobserver and Intraobserver Reliability of Three-Dimensional Preoperative Planning Software in Total Hip Arthroplasty.

Science.gov (United States)

Wako, Yasushi; Nakamura, Junichi; Miura, Michiaki; Kawarai, Yuya; Sugano, Masahiko; Nawata, Kento

2018-02-01

The purpose of this study is to clarify interobserver and intraobserver reliabilities of the three-dimensional (3D) templating of total hip arthroplasty (THA). We selected preoperative computed tomography from 60 hips in 46 patients (14 men and 32 women) who underwent primary THA. To evaluate interobserver and intraobserver reliability, 6 orthopedic surgeons performed 3D templating twice over a 4-week interval. We investigated intraclass correlation coefficients (ICCs) and percent agreement of component size and alignment, comparing morphological differences in the hip. Reproducibility was also compared between groups with osteoarthritis (OA) and those with osteonecrosis (ON). The interobserver reliabilities for mean cup size and stem size were excellent, with ICC = 0.907 and 0.944, respectively. The value was significantly higher in the ON group than in the OA group. In the OA group, the reliability of cup size and alignment decreased in hips with severe subluxation. Percent agreement of stem size was significantly different between the shapes of femoral canal. For intraobserver reliability, the mean ICC of cup size was 0.965 overall, while the value in the ON group was significantly higher than in the OA group. The mean ICC of stem size was 0.972 overall. Computed tomography-based 3D templating showed excellent reliability for component size and alignment in THA. Deformity of the affected joint influenced the reliability of preoperative planning. Copyright © 2017 Elsevier Inc. All rights reserved.
Reliability of ultrasound thickness measurement of the abdominal muscles during clinical isometric endurance tests.

Science.gov (United States)

ShahAli, Shabnam; Arab, Amir Massoud; Talebian, Saeed; Ebrahimi, Esmaeil; Bahmani, Andia; Karimi, Noureddin; Nabavi, Hoda

2015-07-01

The study was designed to evaluate the intra-examiner reliability of ultrasound (US) thickness measurement of abdominal muscles activity when supine lying and during two isometric endurance tests in subjects with and without Low back pain (LBP). A total of 19 women (9 with LBP, 10 without LBP) participated in the study. Within-day reliability of the US thickness measurements at supine lying and the two isometric endurance tests were assessed in all subjects. The intra-class correlation coefficient (ICC) was used to assess the relative reliability of thickness measurement. The standard error of measurement (SEM), minimal detectable change (MDC) and the coefficient of variation (CV) were used to evaluate the absolute reliability. Results indicated high ICC scores (0.73-0.99) and also small SEM and MDC scores for within-day reliability assessment. The Bland-Altman plots of agreement in US measurement of the abdominal muscles during the two isometric endurance tests demonstrated that 95% of the observations fall between the limits of agreement for test and retest measurements. Together the results indicate high intra-tester reliability for the US measurement of the thickness of abdominal muscles in all the positions tested. According to the study's findings, US imaging can be used as a reliable method for assessment of abdominal muscles activity in supine lying and the two isometric endurance tests employed, in participants with and without LBP. Copyright © 2014 Elsevier Ltd. All rights reserved.
How Many Sleep Diary Entries Are Needed to Reliably Estimate Adolescent Sleep?

Science.gov (United States)

Arora, Teresa; Gradisar, Michael; Taheri, Shahrad; Carskadon, Mary A.

2017-01-01

Abstract Study Objectives: To investigate (1) how many nights of sleep diary entries are required for reliable estimates of five sleep-related outcomes (bedtime, wake time, sleep onset latency [SOL], sleep duration, and wake after sleep onset [WASO]) and (2) the test–retest reliability of sleep diary estimates of school night sleep across 12 weeks. Methods: Data were drawn from four adolescent samples (Australia [n = 385], Qatar [n = 245], United Kingdom [n = 770], and United States [n = 366]), who provided 1766 eligible sleep diary weeks for reliability analyses. We performed reliability analyses for each cohort using complete data (7 days), one to five school nights, and one to two weekend nights. We also performed test–retest reliability analyses on 12-week sleep diary data available from a subgroup of 55 US adolescents. Results: Intraclass correlation coefficients for bedtime, SOL, and sleep duration indicated good-to-excellent reliability from five weekday nights of sleep diary entries across all adolescent cohorts. Four school nights was sufficient for wake times in the Australian and UK samples, but not the US or Qatari samples. Only Australian adolescents showed good reliability for two weekend nights of bedtime reports; estimates of SOL were adequate for UK adolescents based on two weekend nights. WASO was not reliably estimated using 1 week of sleep diaries. We observed excellent test–rest reliability across 12 weeks of sleep diary data in a subsample of US adolescents. Conclusion: We recommend at least five weekday nights of sleep dairy entries to be made when studying adolescent bedtimes, SOL, and sleep duration. Adolescent sleep patterns were stable across 12 consecutive school weeks. PMID:28199718
The reliability and validity of fatigue measures during short-duration maximal-intensity intermittent cycling.

Science.gov (United States)

Glaister, Mark; Stone, Michael H; Stewart, Andrew M; Hughes, Michael; Moir, Gavin L

2004-08-01

The purpose of the present study was to assess the reliability and validity of fatigue measures, as derived from 4 separate formulae, during tests of repeat sprint ability. On separate days over a 3-week period, 2 groups of 7 recreationally active men completed 6 trials of 1 of 2 maximal (20 x 5 seconds) intermittent cycling tests with contrasting recovery periods (10 or 30 seconds). All trials were conducted on a friction-braked cycle ergometer, and fatigue scores were derived from measures of mean power output for each sprint. Apart from formula 1, which calculated fatigue from the percentage difference in mean power output between the first and last sprint, all remaining formulae produced fatigue scores that showed a reasonably good level of test-retest reliability in both intermittent test protocols (intraclass correlation range: 0.78-0.86; 95% likely range of true values: 0.54-0.97). Although between-protocol differences in the magnitude of the fatigue scores suggested good construct validity, within-protocol differences highlighted limitations with each formula. Overall, the results support the use of the percentage decrement score as the most valid and reliable measure of fatigue during brief maximal intermittent work.
The reliability and validity of fatigue measures during multiple-sprint work: an issue revisited.

Science.gov (United States)

Glaister, Mark; Howatson, Glyn; Pattison, John R; McInnes, Gill

2008-09-01

The ability to repeatedly produce a high-power output or sprint speed is a key fitness component of most field and court sports. The aim of this study was to evaluate the validity and reliability of eight different approaches to quantify this parameter in tests of multiple-sprint performance. Ten physically active men completed two trials of each of two multiple-sprint running protocols with contrasting recovery periods. Protocol 1 consisted of 12 x 30-m sprints repeated every 35 seconds; protocol 2 consisted of 12 x 30-m sprints repeated every 65 seconds. All testing was performed in an indoor sports facility, and sprint times were recorded using twin-beam photocells. All but one of the formulae showed good construct validity, as evidenced by similar within-protocol fatigue scores. However, the assumptions on which many of the formulae were based, combined with poor or inconsistent test-retest reliability (coefficient of variation range: 0.8-145.7%; intraclass correlation coefficient range: 0.09-0.75), suggested many problems regarding logical validity. In line with previous research, the results support the percentage decrement calculation as the most valid and reliable method of quantifying fatigue in tests of multiple-sprint performance.
Student Practice Evaluation Form-Revised Edition online comment bank: development and reliability analysis.

Science.gov (United States)

Rodger, Sylvia; Turpin, Merrill; Copley, Jodie; Coleman, Allison; Chien, Chi-Wen; Caine, Anne-Maree; Brown, Ted

2014-08-01

The reliable evaluation of occupational therapy students completing practice education placements along with provision of appropriate feedback is critical for both students and for universities from a quality assurance perspective. This study describes the development of a comment bank for use with an online version of the Student Practice Evaluation Form-Revised Edition (SPEF-R Online) and investigates its reliability. A preliminary bank of 109 individual comments (based on previous students' placement performance) was developed via five stages. These comments reflected all 11 SPEF-R domains. A purpose-designed online survey was used to examine the reliability of the comment bank. A total of 37 practice educators returned surveys, 31 of which were fully completed. Participants were asked to rate each individual comment using the five-point SPEF-R rating scale. One hundred and two of 109 comments demonstrated satisfactory agreement with their respective default ratings that were determined by the development team. At each domain level, the intra-class correlation coefficients (ranging between 0.86 and 0.96) also demonstrated good to excellent inter-rater reliability. There were only seven items that required rewording prior to inclusion in the final SPEF-R Online comment bank. The development of the SPEF-R Online comment bank offers a source of reliable comments (consistent with the SPEF-R rating scale across different domains) and aims to assist practice educators in providing reliable and timely feedback to students in a user-friendly manner. © 2014 Occupational Therapy Australia.
Cultural Adaptation and Reliability of the Compliance with Standard Precautions Scale (CSPS) for Nurses in Brazil 1

Science.gov (United States)

Pereira, Fernanda Maria Vieira; Lam, Simon Ching; Gir, Elucir

2017-01-01

ABSTRACT Objective: this study aimed to carry of the cultural adaptation and to evaluate the reliability of the Compliance with Standard Precautions Scale (CSPS) for nurses in Brazil. Method: the adaptation process entailed translation, consensus among judges, back-translation, semantic validation and pretest. The reliability was evaluated by internal consistency (Cronbach alpha) and stability (test-retest). The instrument was administered to a sample group of 300 nurses who worked in a large hospital located in the city of São Paulo/SP, Brazil. Results: through the semantic validation, the items from the scale were considered understandable and deemed important for the nurse´s clinical practice. The CSPS Brazilian Portuguese version (CSPS-PB) revealed excellent interpretability. The Cronbach`s alpha was 0.61 and the intraclass correlation coefficient was 0.85. Conclusion: the initial study showed that CSPS-PB is appropriate to assess compliance with standard precautions among nurses in Brazil. The reliability was considered acceptable. Furhter study is necessary to evaluate its comprehensive psychometric properties. PMID:28301030
Reliability of contractile properties of the knee extensor muscles in individuals with post-polio syndrome.

Directory of Open Access Journals (Sweden)

Eric L Voorn

Full Text Available To assess the reliability of contractile properties of the knee extensor muscles in 23 individuals with post-polio syndrome (PPS and 18 age-matched healthy individuals.Contractile properties of the knee extensors were assessed from repeated electrically evoked contractions on 2 separate days, with the use of a fixed dynamometer. Reliability was determined for fatigue resistance, rate of torque development (MRTD, and early and late relaxation time (RT50 and RT25, using the intraclass correlation coefficient (ICC and standard error of measurement (SEM, expressed as % of the mean.In both groups, reliability for fatigue resistance was good, with high ICCs (>0.90 and small SEM values (PPS: 7.1%, healthy individuals: 7.0%. Reliability for contractile speed indices varied, with the best values found for RT50 (ICCs>0.82, SEM values <2.8%. We found no systematic differences between test and retest occasions, except for RT50 in healthy subjects (p = 0.016.In PPS and healthy individuals, the reliability of fatigue resistance, as obtained from electrically evoked contractions is high. The reliability of contractile speed is only moderate, except for RT50 in PPS, demonstrating high reliability.This was the first study to examine the reliability of electrically evoked contractile properties in individuals with PPS. Our results demonstrate its potential to study mechanisms underlying muscle fatigue in PPS and to evaluate changes in contractile properties over time in response to interventions or from natural course.
The PRECIS-2 tool has good interrater reliability and modest discriminant validity.

Science.gov (United States)

Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank M; Donnan, Peter T; Gágyor, Ildikó; Hobbelen, Hans J S M; Althabe, Fernando; Krishnan, Jerry A; Treweek, Shaun

2017-08-01

PRagmatic Explanatory Continuum Indicator Summary (PRECIS)-2 is a tool that could improve design insight for trialists. Our aim was to validate the PRECIS-2 tool, unlike its predecessor, testing the discriminant validity and interrater reliability. Over 80 international trialists, methodologists, clinicians, and policymakers created PRECIS-2 helping to ensure face validity and content validity. The interrater reliability of PRECIS-2 was measured using 19 experienced trialists who used PRECIS-2 to score a diverse sample of 15 randomized controlled trial protocols. Discriminant validity was tested with two raters to independently determine if the trial protocols were more pragmatic or more explanatory, with scores from the 19 raters for the 15 trials as predictors of pragmatism. Interrater reliability was generally good, with seven of nine domains having an intraclass correlation coefficient over 0.65. Flexibility (adherence) and recruitment had wide confidence intervals, but raters found these difficult to rate and wanted more information. Each of the nine PRECIS-2 domains could be used to differentiate between trials taking more pragmatic or more explanatory approaches with better than chance discrimination for all domains. We have assessed the validity and reliability of PRECIS-2. An elaboration study and web site provide guidance to help future users of the tool which is continuing to be tested by trial teams, systematic reviewers, and funders. Copyright © 2017 Elsevier Inc. All rights reserved.
Reliability and validity of a treatment fidelity assessment for motivational interviewing targeting sexual risk behaviors in people living with HIV/AIDS.

Science.gov (United States)

Seng, Elizabeth K; Lovejoy, Travis I

2013-12-01

This study psychometrically evaluates the Motivational Interviewing Treatment Integrity Code (MITI) to assess fidelity to motivational interviewing to reduce sexual risk behaviors in people living with HIV/AIDS. 74 sessions from a pilot randomized controlled trial of motivational interviewing to reduce sexual risk behaviors in people living with HIV were coded with the MITI. Participants reported sexual behavior at baseline, 3-month, and 6-months. Regarding reliability, excellent inter-rater reliability was achieved for measures of behavior frequency across the 12 sessions coded by both coders; global scales demonstrated poor intraclass correlations, but adequate percent agreement. Regarding validity, principle components analyses indicated that a two-factor model accounted for an adequate amount of variance in the data. These factors were associated with decreases in sexual risk behaviors after treatment. The MITI is a reliable and valid measurement of treatment fidelity for motivational interviewing targeting sexual risk behaviors in people living with HIV/AIDS.
The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals

Energy Technology Data Exchange (ETDEWEB)

Fuller, Joel T.; Buckley, Jonathan D.; Tsiros, Margarita D.; Thewlis, Dominic [University of South Australia, Alliance for Research in Exercise, Nutrition and Activity (ARENA), Sansom Institute for Health Research, GPO Box 2471, Adelaide, South Australia (Australia); Archer, Jane [University of South Australia, Medical Radiation, School of Health Sciences, Adelaide (Australia)

2016-01-15

To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5 %; limits of agreement [LOA] ≤4.2 %). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2 %; LOA ≤11.9 %). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2 %; LOA ≤23.6 %) and fourth metatarsals (ICC ≥0.67; CV ≤9.6 %; LOA ≤27.5 %). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals. (orig.)
The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals

International Nuclear Information System (INIS)

Fuller, Joel T.; Buckley, Jonathan D.; Tsiros, Margarita D.; Thewlis, Dominic; Archer, Jane

2016-01-01

To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5 %; limits of agreement [LOA] ≤4.2 %). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2 %; LOA ≤11.9 %). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2 %; LOA ≤23.6 %) and fourth metatarsals (ICC ≥0.67; CV ≤9.6 %; LOA ≤27.5 %). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals. (orig.)
The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals.

Science.gov (United States)

Fuller, Joel T; Archer, Jane; Buckley, Jonathan D; Tsiros, Margarita D; Thewlis, Dominic

2016-01-01

To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5%; limits of agreement [LOA] ≤4.2%). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2%; LOA ≤11.9%). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2%; LOA ≤23.6%) and fourth metatarsals (ICC ≥0.67; CV ≤9.6%; LOA ≤27.5%). BMD was greatest in the first and second metatarsals (P Reliable measurements of BMD were achieved for the first, second and fifth metatarsals.
Reliability and Convergent Validity of the Algometer for Vestibular Pain Assessment in Women with Provoked Vestibulodynia.

Science.gov (United States)

Cyr, Marie-Pierre; Bourbonnais, Daniel; Pinard, Alexandra; Dubois, Olivia; Morin, Mélanie

2016-07-01

Women with provoked vestibulodynia (PVD) suffer pain at the entry of the vagina elicited by pressure as during vaginal penetration. To quantify vestibular pain, we developed a new instrument, an algometer. The aim of this study was to investigate the test-retest reliability of the algometer and evaluate its convergent validity for vestibular pain assessment in women with PVD. Twenty-six women with PVD participated in the study. Vestibular pain was assessed with the new algometer and the already known vulvalgesiometer during two different sessions 2 to 4 weeks apart. At each session, the pressure pain threshold (PPT) and pressure pain tolerance (PPTol) were measured twice at the 3, 6, and 9 o'clock sites of the vestibule in random order. The test-retest reliability (intra- and inter-session) of the algometer was calculated using the intraclass correlation coefficient (ICC) and standard error of measurement (SEM). Its convergent validity was evaluated by the correlation coefficients between PPTs and PPTols measured by the algometer and those measured with the vulvalgesiometer. Intra-session reliability at all three sites for PPTs and PPTols in both sessions was excellent (ICC = 0.859 to 0.988, P ≤ 0.002). Inter-session reliability was good to excellent (ICC = 0.683 to 0.922, SEM = 15.06 to 47.04 g, P ≤ 0.001). Significant correlations were found between the two tools for all sites for PPTs (r = 0.500 to 0.614, P ≤ 0.009) and PPTols (r = 0.809 to 0.842, P algometer is a reliable and valid instrument for measuring PPTs and PPTols in the vestibular area in women with PVD. This technology is promising for pinpointing treatment mechanisms and efficacy. © 2015 American Academy of Pain Medicine. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Cross-Cultural Adaptation, Validation, and Reliability Testing of the Modified Oswestry Disability Questionnaire in Persian Population with Low Back Pain.

Science.gov (United States)

Baradaran, Aslan; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza

2016-04-01

Prospective study. We aimed to validate the Persian version of the modified Oswestry disability questionnaire (MODQ) in patients with low back pain. Modified Oswestry low back pain disability questionnaire is a well-known condition-specific outcome measure that helps quantify disability in patients with lumbar syndromes. To test the validity in a pilot study, the Persian MODQ was administered to 25 individuals with low back pain. We then enrolled 200 consecutive patients with low back pain to fill the Persian MODQ as well as the short form 36 (SF-36) questionnaire. Convergent validity of the MODQ was tested using the Spearman's correlation coefficient between the MODQ and SF-36 subscales. Intraclass correlation coefficient (ICC) and Cronbach's α coefficient were measured to test the reliability between test and retest and internal consistency of all items, respectively. ICC for individual items ranged from 0.43 to 0.80 showing good reliability and reproducibility of each individual item. Cronbach's α coefficient was 0.69 showing good internal consistency across all 10 items of the Persian MODQ. Total MODQ score showed moderate to strong correlation with the eight subscales and the two domains of the SF-36. The highest correlation was between the MODQ and the physical functioning subscale of the SF-36 (r=-0.54, pPersian version of the MODQ is a valid and reliable tool for the assessment of the disability following low back pain.
Clinical assessment of scapular positioning in musicians: an intertester reliability study.

Science.gov (United States)

Struyf, Filip; Nijs, Jo; De Coninck, Kris; Giunta, Marco; Mottram, Sarah; Meeusen, Romain

2009-01-01

The reliability of the measurement of the distance between the posterior border of the acromion and the wall and the reliability of the modified lateral scapular slide test have not been studied. Overall, the reliability of the clinical tools used to assess scapular positioning has not been studied in musicians. To examine the intertester reliability of scapular observation and 2 clinical tests for the assessment of scapular positioning in musicians. Intertester reliability study. University research laboratory. Thirty healthy student musicians at a single university. Two assessors performed a standardized observation protocol, the measurement of the distance between the posterior border of the acromion and the wall, and the modified lateral scapular slide test. Each assessor was blinded to the other's findings. The intertester reliability coefficients (kappa) for the observation in relaxed position, during unloaded movement, and during loaded movement were 0.41, 0.63, and 0.36, respectively. The kappa values for the observation of tilting and winging at rest were 0.48 and 0.42, respectively; during unloaded movement, the kappa values were 0.52 and 0.78, respectively; and with a 1-kg load, the kappa values were 0.24 and 0.50, respectively. The intraclass correlation coefficient (ICC) of the measurement of the acromial distance was 0.72 in relaxed position and 0.75 with the participant actively retracting both shoulders. The ICCs for the modified lateral scapular slide test varied between 0.63 and 0.58. Our results demonstrated that the modified lateral scapular slide test was not a reliable tool to assess scapular positioning in these participants. Our data indicated that scapular observation in the relaxed position and during unloaded abduction in the frontal plane was a reliable assessment tool. The reliability of the measurement of the distance between the posterior border of the acromion and the wall in healthy musicians was moderate.
Validity and reliability of an adapted arabic version of the long international physical activity questionnaire.

Science.gov (United States)

Helou, Khalil; El Helou, Nour; Mahfouz, Maya; Mahfouz, Yara; Salameh, Pascale; Harmouche-Karaki, Mireille

2017-07-24

The International Physical Actvity Questionnaire (IPAQ) is a validated tool for physical activity assessment used in many countries however no Arabic version of the long-form of this questionnaire exists to this date. Hence, the aim of this study was to cross-culturally adapt and validate an Arabic version of the long International Physical Activity Questionnaire (AIPAQ) equivalent to the French version (F-IPAQ) in a Lebanese population. The guidelines for cross-cultural adaptation provided by the World Health Organization and the International Physical Activity Questionnaire committee were followed. One hundred fifty-nine students and staff members from Saint Joseph University of Beirut were randomly recruited to participate in the study. Items of the A-IPAQ were compared to those from the F-IPAQ for concurrent validity using Spearman's correlation coefficient. Content validity of the questionnaire was assessed using factor analysis for the A-IPAQ's items. The physical activity indicators derived from the A-IPAQ were compared with the body mass index (BMI) of the participants for construct validity. The instrument was also evaluated for internal consistency reliability using Cronbach's alpha and Intraclass Correlation Coefficient (ICC). Finally, thirty-one participants were asked to complete the A-IPAQ on two occasions three weeks apart to examine its test-retest reliability. Bland-Altman analyses were performed to evaluate the extent of agreement between the two versions of the questionnaire and its repeated administrations. A high correlation was observed between answers of the F-IPAQ and those of the A-IPAQ, with Spearman's correlation coefficients ranging from 0.91 to 1.00 (p reliability with Cronbach's alpha ranging from 0.769-1.00 (p reliability for most of its items (ICC ranging from 0.66-0.96; p validity and reliability for the assessment of physical activity among Lebanese adults. More studies are necessary in the future to assess its validity compared

The validity and reliability of Systemic Lupus Erythematosus Quality of Life Questionnaire (L-QoL) in a Turkish population.

Science.gov (United States)

Duruöz, M T; Unal, C; Toprak, C Sanal; Sezer, I; Yilmaz, F; Ulutatar, F; Atagündüz, P; Baklacioglu, H S

2017-12-01

Background Systemic lupus erythematosus (SLE) may have a profound impact on quality of life. There is increasing interest in measuring quality of life in lupus patients. The purpose of this study was to investigate the validity and reliability of SLE Quality of Life Questionnaire (L-QoL) in Turkish SLE patients. Methods SLE according to 2012 Systemic Lupus International Collaborating Clinics Classification Criteria were recruited into the study. Demographic data, clinical parameters and disease activity measured with the Systemic Lupus Erythematosus Disease Activity Index-2000 (SLEDAI-2K); were noted. Nottingham Health Profile and Health Assessment Questionnaire were filled out in addition to the Turkish L-QoL (LQoL-TR). Internal consistency, test-retest reliability, and convergent and discriminant validity were evaluated. Results The mean age of participants was 43.55 ± 14.33 years and the mean disease duration was 89.8 ± 92.1 months. The patients filled out LQoL-TR in 2.5 min. Strong correlation of LQoL-TR with all subgroups of the Nottingham Health Profile and the Health Assessment Questionnaire were established showing the convergent validity. The highest correlation was demonstrated with emotional reactions (rho = 0.72) and sleep component (rho = 0.65) of the Nottingham Health Profile scale ( p < 0.0001). Its poor and not significant correlation with nonfunctional parameters (age, disease duration, perceived general health, SLEDAI-2K) showed its discriminative properties. LQoL-TR demonstrated good internal reliability with a Cronbach's α of 0.93 and test-retest reliability with intraclass correlation coefficient of 0.87. Conclusion The LQoL-TR is a practical and useful tool which demonstrates good validity and reliability.
Reliability and Validity of the Turkish Adaptation of VITACORA-19 in Patients with Psoriatic Arthritis.

Science.gov (United States)

Tander, Berna; Ulus, Yasemin; Terzi, Yüksel; Zahiroğlu, Yeliz; Kesmen, Hakan; Farisoğullari, Bayram; Akyol, Yeşim; Bilgici, Ayhan; Kuru, Ömer

2016-12-01

This study aims to evaluate the reliability and validity of the Turkish language version of VITACORA-19 (psoriatic arthritis quality of life questionnaire) in patients with psoriatic arthritis. The Turkish version of VITACORA-19 questionnaire was obtained after a translation and back translation process. The study sample included 61 PsA patients (22 males, 39 females; mean age 46.5±12.2 years; range 19 to 71 years). To assess the test-retest reliability of the Turkish VITACORA-19, the questionnaire was reapplied 10 to 15 days after the first interview (interclass correlation coefficient). Cronbach's alpha (a) was used to evaluate the internal consistency. VITACORA-19 was compared with visual analog scale for physician and patient global assessments, the Health Assessment Questionnaire, and Nottingham Health Profile for construct validity. The internal structure of VITACORA-19 was examined by factor analysis. The individual item intraclass correlation coefficient ranged from 0.77 to 0.98 and Cronbach's alpha ranged from 0.77 to 0.98. The Cronbach's alpha value for whole scale was determined as 0.96. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.90, and Bartlett's test of sphericity had a p<0.001. Turkish VITACORA-19 total scores were correlated negatively with Health Assessment Questionnaire, visual analog scale for pain, and Nottingham Health Profile subgroups, and positively with physician and patient global assessments (p<0.01). Turkish version of VITACORA-19 questionnaire is a reliable and valid measure for health-related quality of life in Turkish patients with psoriatic arthritis.
Reliability of scoring arousals in normal children and children with obstructive sleep apnea syndrome.

Science.gov (United States)

Wong, Tat Kong; Galster, Patricia; Lau, Tai Shing; Lutz, Janita M; Marcus, Carole L

2004-09-15

Scoring of arousals in children is based on an extension of adult criteria, as defined by the American Sleep Disorders Association (ASDA). By this, a minimum duration of 3 seconds is required. A few recent studies utilized modified criteria for the study of children, with durations as short as 1 second. However, the validity and reliability of scoring these shorter arousals have never been verified. Based on studies in adults, we hypothesized that interscorer agreement for scoring arousals shorter than 3 seconds was poor. Retrospective review of polysomnograms by 2 experienced sleep practitioners who independently scored arousals according to the ASDA 3-second criteria and modified duration criteria of 1 and 2 seconds. Academic hospital. 20 polysomnographic studies from children aged 3 to 8 years with mild to severe obstructive sleep apnea syndrome, and 16 polysomnographic studies from normal children. None. The intraclass correlation coefficient for scoring ASDA arousals was 0.90 (95% confidence interval: 0.81-0.95), indicating excellent interscorer agreement. The intraclass correlation coefficient for scoring modified 1-second and 2-second arousals were 0.35 (95% confidence interval: 0.02-0.61) and 0.42 (95% confidence interval: 0.12-0.65) respectively, indicating poor to fair interscorer agreement. Furthermore, modified 1-second and 2-second arousals accounted for less than 15% of all arousals scored. We conclude that there is much poorer interscorer agreement for scoring arousals shorter than 3 seconds, when compared to the standard ASDA criteria. We propose that scoring of arousals in children should follow the standard ASDA criteria.
The Korean Version of the Cognitive Assessment Scale for Stroke Patients (K-CASP): A Reliability and Validity Study.

Science.gov (United States)

Park, Kwon-Hee; Lee, Hee-Won; Park, Kee-Boem; Lee, Jin-Youn; Cho, Ah-Ra; Oh, Hyun-Mi; Park, Joo Hyun

2017-06-01

To develop the Korean version of the Cognitive Assessment Scale for Stroke Patients (K-CASP) and to evaluate the test reliability and validity of the K-CASP in stroke patients. The original CASP was translated into Korean, back-translated into English, then reviewed and compared with the original version. Thirty-three stroke patients were assessed independently by two examiners using the K-CASP twice, with a one-day interval, for a total of four test results. To evaluate the reliability of the K-CASP, intra-class correlation coefficients were used. Pearson correlations were calculated and simple regression analyses performed with the Korean version of Mini-Mental State Examination (K-MMSE) and the aphasia quotient (AQ) to assess the validity. The mean score was 24.42±9.47 (total score 36) for the K-CASP and 21.50±7.01 (total score 30) for the K-MMSE. The inter-rater correlation coefficients of the K-CASP were 0.992 on the first day and 0.995 on the second day. The intra-rater correlation coefficients of the K-CASP were 0.997 for examiner 1 and 0.996 for examiner 2. In the Pearson correlation analysis, the K-CASP score significantly correlated with the K-MMSE score (r=0.825, preliable and valid instrument for cognitive dysfunction screening in post-stroke patients. It is more applicable than other cognitive assessment tools in stroke patients with aphasia.
Reliability and consistency of plantarflexor stretch-shortening cycle function using an adapted force sledge apparatus

International Nuclear Information System (INIS)

Furlong, Laura-Anne M; Harrison, Andrew J

2013-01-01

There are various limitations to existing methods of studying plantarflexor stretch-shortening cycle (SSC) function and muscle-tendon unit (MTU) mechanics, predominantly related to measurement validity and reliability. This study utilizes an innovative adaptation to a force sledge which isolates the plantarflexors and ankle for analysis. The aim of this study was to determine the sledge loading protocol to be used, most appropriate method of data analysis and measurement reliability in a group of healthy, non-injured subjects. Twenty subjects (11 males, 9 females; age: 23.5 ±2.3 years; height: 1.73 ±0.08 m; mass: 74.2 ±11.3 kg) completed 11 impacts at five different loadings rated on a scale of perceived exertion from 1 to 5, where 5 is a loading that the subject could only complete the 11 impacts using the adapted sledge. Analysis of impacts 4–8 or 5–7 using loading 2 provided consistent results that were highly reliable (single intra-class correlation, ICC > 0.85, average ICC > 0.95) and replicated kinematics found in hopping and running. Results support use of an adapted force sledge apparatus as an ecologically valid, reliable method of investigating plantarflexor SSC function and MTU mechanics in a dynamic controlled environment. (paper)
Reliability, precision, and gender differences in knee internal/external rotation proprioception measurements.

Science.gov (United States)

Nagai, Takashi; Sell, Timothy C; Abt, John P; Lephart, Scott M

2012-11-01

To develop and assess the reliability and precision of knee internal/external rotation (IR/ER) threshold to detect passive motion (TTDPM) and determine if gender differences exist. Test-retest for the reliability/precision and cross-sectional for gender comparisons. University neuromuscular and human performance research laboratory. Ten subjects for the reliability and precision aim. Twenty subjects (10 males and 10 females) for gender comparisons. All TTDPM tests were performed using a multi-mode dynamometer. Subjects performed TTDPM at two knee positions (near IR or ER end-range). Intraclass correlation coefficient (ICC (3,k)) and standard error of measurement (SEM) were used to evaluate the reliability and precision. Independent t-tests were used to compare genders. TTDPM toward IR and ER at two knee positions. Intrasession and intersession reliability and precision were good (ICC=0.68-0.86; SEM=0.22°-0.37°). Females had significantly diminished TTDPM toward IR at IR-test position (males: 0.77°±0.14°, females: 1.18°±0.46°, p=0.021) and TTDPM toward IR at the ER-test position (males: 0.87°±0.13°, females: 1.36°±0.58°, p=0.026). No other significant gender differences were found (p>0.05). The current IR/ER TTDPM methods are reliable and accurate for the test-retest or cross-section research design. Gender differences were found toward IR where the ACL acts as the secondary restraint. Copyright © 2011 Elsevier Ltd. All rights reserved.
Reliability of Rehabilitative Ultrasonography to Measure Transverse Abdominis and Multifidus Muscle Dimensions

International Nuclear Information System (INIS)

Nabavi, Narjes; Mosallanezhad, Zahra; Haghighatkhah, Hamid Reza; Mohseni Bandpeid, Mohammad Ali

2014-01-01

Lumbar paraspinal muscles play an important role in providing both mobility and stability during dynamic tasks. Among paraspinal muscles, transverse abdominis and lumbar multifidus have been of particular interest as active stabilizers of the lumbar spine. These muscles may become dysfunctional in chronic low back pain (CLBP). Low back injury can result in muscle inhibition and control loss that cannot recover spontaneously, and specific exercises are required to stimulate their recovery. The purpose of this study was to test the reliability of ultrasonography to measure muscle dimensions and to present a reliable method for measuring transverse abdominis and lumbar multifidus as stabilizing muscles of the lumbar spine. Fifteen healthy participants (18-55 year olds) were evaluated by a radiologist using ultrasonography (ES500) with two probes (50mm linear 7.5 MHZ and 70 mm curvilinear 3.5 MHz). The muscle thickness of transverse abdominis and the anterior-posterior diameter and cross sectional area of the LMF were measured. To determine within and between days reliabilities, second and third measurements were repeated with half an hour and one week intervals, respectively. Intraclass correlation coefficient for left and right showed good to high reliability for the cross sectional area of lumbar multifidi (0.74 and 0.88, respectively) as well as the anterior-posterior dimensions of lumbar multifidi (0.89 and 0.91, respectively) and transverse abdomini thickness (0.73 and 0.85, respectively). Rehabilitative ultrasonography is a reliable and non-invasive instrument to measure muscle thickness. The method used in this study is a reliable way to measure lumbar stabilizing muscles
Reliability and validity of the brief multidimensional measure of religiousness/spirituality among adolescents.

Science.gov (United States)

Harris, Sion Kim; Sherritt, Lon R; Holder, David W; Kulig, John; Shrier, Lydia A; Knight, John R

2008-12-01

Developed for use in health research, the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) consists of brief measures of a broad range of religiousness and spirituality (R/S) dimensions. It has established psychometric properties among adults, but little is known about its appropriateness for use with adolescents. We assessed the psychometric properties of the BMMRS among adolescents. We recruited a racially diverse (85% non-White) sample of 305 adolescents aged 12-18 years (median 16 yrs, IQR 14-17) from 3 urban medical clinics; 93 completed a retest 1 week later. We assessed internal consistency and test-retest reliability. We assessed construct validity by examining how well the measures discriminated groups expected to differ based on self-reported religious preference, and how they related to a hypothesized correlate, depressive symptoms. Religious preference was categorized into "No religion/Atheist" (11%), "Don't know/Confused" (9%), or "Named a religion" (80%). Responses to multi-item measures were generally internally consistent (alpha > or = 0.70 for 12/16 measures) and stable over 1 week (intraclass correlation coefficients > or = 0.70 for 14/16). Forgiveness, Negative R/S Coping, and Commitment items showed lower internal cohesiveness. Scores on most measures were higher (p Atheist" group. Forgiveness, Commitment, and Anticipated Support from members of one's congregation were inversely correlated with depressive symptoms, while BMMRS measures assessing negative R/S experiences (Negative R/S Coping, Negative Interactions with others in congregation, Loss in Faith) were positively correlated with depressive symptoms. These findings suggest that most BMMRS measures are reliable and valid for use among adolescents.
Validity and reliability of English and Marathi Oswestry Disability Index (version 2.1a) in Indian population.

Science.gov (United States)

Joshi, Veena D; Raiturker, Pradyumna P Pai; Kulkarni, Aditi A

2013-05-15

A total of 200 patients with low back pain (LBP) completed an English and Marathi Oswestry Disability Index (ODI) questionnaires (100 each), visual analogue scale, and Roland-Morris Disability Questionnaire. To validate the English and Marathi versions of ODI (version 2.1a). Patient-orientated assessment methods are important in the evaluation of treatment outcome. The ODI is one of the condition-specific questionnaires recommended for the use of patients with LBP. An adaptation of the ODI (version 2.1a) for Marathi language was carried out according to established guidelines. Average age of patients who answered the English ODI was 42 ± 15, whereas that of Marathi-speaking patients was 52 ± 15 years. About 40% were males. The Cronbach α reliability score was 0.877 for English and 0.943 for Marathi. Forty-seven and 53 of these patients were retested with English and Marathi ODI within 2 weeks (to assess test-retest reliability). The intraclass correlation coefficient (ICC) for the test-retest reliability of the questionnaire was 0.877 and 0.943 for English and Marathi respectively. The ODI scores correlated with visual analogue scale pain intensity (r = 0.67, P Disability Questionnaire score (r = 0.71, P Disability Questionnaire scores (r = 0.503, P Oswestry questionnaire is reliable and valid, and shows psychometric characteristics as good as the English version. It should represent a valuable tool for use in future patient-orientated outcome studies for population with LBP in India.
The reliability and validity of a three-camera foot image system for obtaining foot anthropometrics.

Science.gov (United States)

O'Meara, Damien; Vanwanseele, Benedicte; Hunt, Adrienne; Smith, Richard

2010-08-01

The purpose was to develop a foot image capture and measurement system with web cameras (the 3-FIS) to provide reliable and valid foot anthropometric measures with efficiency comparable to that of the conventional method of using a handheld anthropometer. Eleven foot measures were obtained from 10 subjects using both methods. Reliability of each method was determined over 3 consecutive days using the intraclass correlation coefficient and root mean square error (RMSE). Reliability was excellent for both the 3-FIS and the handheld anthropometer for the same 10 variables, and good for the fifth metatarsophalangeal joint height. The RMSE values over 3 days ranged from 0.9 to 2.2 mm for the handheld anthropometer, and from 0.8 to 3.6 mm for the 3-FIS. The RMSE values between the 3-FIS and the handheld anthropometer were between 2.3 and 7.4 mm. The 3-FIS required less time to collect and obtain the final variables than the handheld anthropometer. The 3-FIS provided accurate and reproducible results for each of the foot variables and in less time than the conventional approach of a handheld anthropometer.
Reliability and accuracy of a video analysis protocol to assess core ability.

Science.gov (United States)

McDonald, Dawn A; Delgadillo, James Q; Fredericson, Michael; McConnell, Jennifer; Hodgins, Melissa; Besier, Thor F

2011-03-01

To develop and test a method to measure core ability in healthy athletes with 2-dimensional video analysis software (SiliconCOACH). Specific objectives were to: (1) develop a standardized exercise battery with progressions of increasing difficulty to evaluate areas of core ability in elite athletes; (2) develop an objective and quantitative grading rubric with the use of video analysis software; (3) assess the test-retest reliability of the exercise battery; (4) assess the interrater and intrarater reliability of the video analysis system; and (5) assess the accuracy of the assessment. Test-retest repeatability and accuracy. Testing was conducted in the Stanford Human Performance Laboratory, Stanford University, Stanford, CA. Nine female gymnasts currently training with the Stanford Varsity Women's Gymnastics Team participated in testing. Participants completed a test battery composed of planks, side planks, and leg bridges of increasing difficulty. Subjects completed two 20-minute testing sessions within a 4- to 10-day period. Two-dimensional sagittal-plane video was captured simultaneously with 3-dimensional motion capture. The main outcome measures were pelvic displacement and time that elapsed until failure occurred, as measured with SiliconCOACH video analysis software. Test-retest and interrater and intrarater reliability of the video analysis measures was assessed. Accuracy as compared with 3-dimensional motion capture also was assessed. Levels reached during the side planks and leg bridges had an excellent test-retest correlation (r(2) = 0.84, r(2) = 0.95). Pelvis displacements measured by examiner 1 and examiner 2 had an excellent correlation (r(2) = 0.86, intraclass correlation coefficient = 0.92). Pelvis displacements measured by examiner 1 during independent grading sessions had an excellent correlation (r(2) = 0.92). Pelvis displacements from the plank and from a set of combined plank and side plank exercises both had an excellent correlation with 3
Inter-rater reliability of shoulder measurements in middle-aged women.

Science.gov (United States)

De Groef, A; Van Kampen, M; Vervloesem, N; Clabau, E; Christiaens, M-R; Neven, P; Geraerts, I; Struyf, F; Devoogdt, N

2017-06-01

To investigate inter-rater reliability of a set of shoulder measurements including inclinometry [shoulder range of motion (ROM)], acromion-table distance and pectoralis minor muscle length (static scapular positioning), upward rotation with two inclinometers (scapular kinematics) and pain pressure thresholds (muscle tenderness) in middle-aged women. Observational study. Thirty symptom-free middle-aged women (first cohort) were measured by two raters. All measurements with an intraclass correlation coefficient (ICC) below 0.75 were retested after an additional training period in a second cohort of 30 symptom-free middle-aged women. Inter-rater reliability of all variables was measured with the ICC (95% confidence interval) and standard error of measurement (SEM). Acromion-table distance (ICC=0.91, SEM 0.22 to 0.28% of body length), pectoralis minor muscle length (ICC=0.91, SEM 0.16% of body length), pain pressure thresholds (ICC=0.78 to 0.85, SEM 0.39 to 0.70kg) and abduction ROM (ICC=0.77, SEM 5°) showed good to excellent inter-rater reliability in the first cohort. After an additional training period, forward flexion ROM showed good inter-rater reliability (ICC=0.83, SEM 5°), scapular upward rotation in resting position showed moderate reliability (ICC=0.52, SEM 2°), and other scaption angles showed weak reliability (ICC=0.26 to 0.43, SEM 3 to 8°). In a battery of clinical tools to evaluate factors contributing to shoulder pain, static scapular positioning and pressure pain thresholds were found to have good to excellent inter-rater reliability in middle-aged women. Additional training is recommended for measurements with a gravity inclinometer. Copyright © 2016 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Hippocampal MRI volumetry at 3 Tesla: reliability and practical guidance.

Science.gov (United States)

Jeukens, Cécile R L P N; Vlooswijk, Mariëlle C G; Majoie, H J Marian; de Krom, Marc C T F M; Aldenkamp, Albert P; Hofman, Paul A M; Jansen, Jacobus F A; Backes, Walter H

2009-09-01

Although volumetry of the hippocampus is considered to be an established technique, protocols reported in literature are not described in great detail. This article provides a complete and detailed protocol for hippocampal volumetry applicable to T1-weighted magnetic resonance (MR) images acquired at 3 Tesla, which has become the standard for structural brain research. The protocol encompasses T1-weighted image acquisition at 3 Tesla, anatomic guidelines for manual hippocampus delineation, requirements of delineation software, reliability measures, and criteria to assess and ensure sufficient reliability. Moreover, the validity of the correction for total intracranial volume size was critically assessed. The protocol was applied by 2 readers to the MR images of 36 patients with cryptogenic localization-related epilepsy, 4 patients with unilateral hippocampal sclerosis, and 20 healthy control subjects. The uncorrected hippocampal volumes were 2923 +/- 500 mm3 (mean +/- SD) (left) and 3120 +/- 416 mm3 (right) for the patient group and 3185 +/- 411 mm3 (left) and 3302 +/- 411 mm3 (right) for the healthy control group. The volume of the 4 pathologic hippocampi of the patients with unilateral hippocampal sclerosis was 2980 +/- 422 mm3. The inter-reader reliability values were determined: intraclass-correlation-coefficient (ICC) = 0.87 (left) and 0.86 (right), percentage volume difference (VD) = 7.0 +/- 4.7% (left) and 6.0 +/- 3.8% (right), and overlap ratio (OR) = 0.82 +/- 0.04 (left) and 0.82 +/- 0.03 (right). The positive Pearson correlation between hippocampal volume and total intracranial volume was found to be low: r = 0.48 (P = 0.03, left) and r = 0.62 (P = 0.004, right) and did not significantly reduce the volumetric variances, showing the limited benefit of the brain size correction. A protocol was described to determine hippocampal volumes based on 3 Tesla MR images with high inter-reader reliability. Although the reliability of hippocampal volumetry at 3 Tesla
Reliability of Doppler and stethoscope methods of determining systolic blood pressures: considerations for calculating an ankle-brachial index.

Science.gov (United States)

Chesbro, Steven B; Asongwed, Elmira T; Brown, Jamesha; John, Emmanuel B

2011-01-01

The purposes of this study were to: (1) identify the interrater and intrarater reliability of systolic blood pressures using a stethoscope and Doppler to determine an ankle-brachial index (ABI), and (2) to determine the correlation between the 2 methods. Peripheral arterial disease (PAD) affects approximately 8 to 12 million people in the United States, and nearly half of those with this disease are asymptomatic. Early detection and prompt treatment of PAD will improve health outcomes. It is important that clinicians perform tests that determine the presence of PAD. Two individual raters trained in ABI procedure measured the systolic blood pressures of 20 individuals' upper and lower extremities. Standard ABI measurement protocols were observed. Raters individually recorded the systolic blood pressures of each extremity using a stethoscope and a Doppler, for a total of 640 independent measures. Interrater reliability of Doppler measurements to determine SBP at the ankle was very strong (intraclass correlation coefficient [ICC], 0.93-0.99) compared to moderate to strong reliability using a stethoscope (ICC, 0.64-0.87). Agreement between the 2 devices to determine SBP was moderate to very weak (ICC, 0.13-0.61). Comparisons of the use of Doppler and stethoscope to determine ABI showed weak to very weak intrarater correlation (ICC, 0.17-0.35). Linear regression analysis of the 2 methods to determine ABI showed positive but weak to very weak correlations (r2 = .013, P = .184). A Doppler ultrasound is recommended over a stethoscope for accuracy in systolic pressure readings for ABI measurements.
The reliability and validity study of the Kinesthetic and Visual Imagery Questionnaire in individuals with Multiple Sclerosis

Directory of Open Access Journals (Sweden)

Yousef Moghadas Tabrizi

2013-12-01

Full Text Available OBJECTIVE: Motor imagery (MI has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS. It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. METHOD: Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14days apart by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R as the gold standard. Intra-class correlation coefficients (ICCs were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha and factorial structure of the KVIQ were studied. RESULTS: The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93, and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79. The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84. Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. CONCLUSIONS: The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients.
The reliability and validity study of the Kinesthetic and Visual Imagery Questionnaire in individuals with multiple sclerosis.

Science.gov (United States)

Tabrizi, Yousef Moghadas; Zangiabadi, Nasser; Mazhari, Shahrzad; Zolala, Farzaneh

2013-01-01

Motor imagery (MI) has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS). It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ) was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14 days apart) by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R) as the gold standard. Intra-class correlation coefficients (ICCs) were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha) and factorial structure of the KVIQ were studied. The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93), and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79). The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84). Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients.
Reliability, sensitivity and validity of the assistant referee intermittent endurance test (ARIET) - a modified Yo-Yo IE2 test for elite soccer assistant referees

DEFF Research Database (Denmark)

Castagna, Carlo; Bendiksen, Mads; Impellizzeri, Franco M

2012-01-01

We examined the reliability and validity of the assistant referee intermittent endurance test (ARIET), a modified Yo-Yo IE2 test including shuttles of sideways running. The ARIET was carried out on 198 Italian (Serie A-B, Lega-Pro and National Level) and 47 Danish elite soccer assistant referees....... Reproducibility was tested for 41 assistant referees on four occasions each separated by one week. The ARIET intraclass correlation coefficients and typical error of measurement ranged from 0.96 to 0.99 and 3.1 to 5.7%, respectively. ARIET performance for Serie A and B was 23 and 25% greater than in Lega-Pro (P...... ARIET performance was significantly correlated with VO(2max) (r = 0.78, P ARIET (r = - 0.81, P
A quick aphasia battery for efficient, reliable, and multidimensional assessment of language function.

Science.gov (United States)

Wilson, Stephen M; Eriksson, Dana K; Schneck, Sarah M; Lucanie, Jillian M

2018-01-01

This paper describes a quick aphasia battery (QAB) that aims to provide a reliable and multidimensional assessment of language function in about a quarter of an hour, bridging the gap between comprehensive batteries that are time-consuming to administer, and rapid screening instruments that provide limited detail regarding individual profiles of deficits. The QAB is made up of eight subtests, each comprising sets of items that probe different language domains, vary in difficulty, and are scored with a graded system to maximize the informativeness of each item. From the eight subtests, eight summary measures are derived, which constitute a multidimensional profile of language function, quantifying strengths and weaknesses across core language domains. The QAB was administered to 28 individuals with acute stroke and aphasia, 25 individuals with acute stroke but no aphasia, 16 individuals with chronic post-stroke aphasia, and 14 healthy controls. The patients with chronic post-stroke aphasia were tested 3 times each and scored independently by 2 raters to establish test-retest and inter-rater reliability. The Western Aphasia Battery (WAB) was also administered to these patients to assess concurrent validity. We found that all QAB summary measures were sensitive to aphasic deficits in the two groups with aphasia. All measures showed good or excellent test-retest reliability (overall summary measure: intraclass correlation coefficient (ICC) = 0.98), and excellent inter-rater reliability (overall summary measure: ICC = 0.99). Sensitivity and specificity for diagnosis of aphasia (relative to clinical impression) were 0.91 and 0.95 respectively. All QAB measures were highly correlated with corresponding WAB measures where available. Individual patients showed distinct profiles of spared and impaired function across different language domains. In sum, the QAB efficiently and reliably characterized individual profiles of language deficits.
Validity and reliability of a self-administered foot evaluation questionnaire (SAFE-Q).

Science.gov (United States)

Niki, Hisateru; Tatsunami, Shinobu; Haraguchi, Naoki; Aoki, Takafumi; Okuda, Ryuzo; Suda, Yasunori; Takao, Masato; Tanaka, Yasuhito

2013-03-01

The Japanese Society for Surgery of the Foot (JSSF) is developing a QOL questionnaire instrument for use in pathological conditions related to the foot and ankle. The main body of the outcome instrument (the Self-Administered Foot Evaluation Questionnaire, SAFE-Q version 2) consists of 34 questionnaire items, which provide five subscale scores (1: Pain and Pain-Related; 2: Physical Functioning and Daily Living; 3: Social Functioning; 4: Shoe-Related; and 5: General Health and Well-Being). In addition, the instrument has nine optional questionnaire items that provide a Sports Activity subscale score. The purpose of this study was to evaluate the test-retest reliability of the SAFE-Q. Version 2 of the SAFE-Q was administered to 876 patients and 491 non-patients, and the test-retest reliability was evaluated for 131 patients. In addition, the SF-36 questionnaire and the JSSF Scale scoring form were administered to all of the participants. Subscale scores were scaled such that the final sum of scores ranged between zero (least healthy) to 100 (healthiest). The intraclass correlation coefficients were larger than 0.7 for all of the scores. The means of the five subscale scores were between 60 and 75. The five subscales easily separated patients from non-patients. The coefficients for the correlations of the subscale scores with the scores on the JSSF Scale and the SF-36 subscales were all highly statistically significantly greater than zero (p valid and reliable. In the future, it will be beneficial to test the responsiveness of the SAFE-Q.
RELIABILITY OF THE ONE REPETITION-MAXIMUM POWER CLEAN TEST IN ADOLESCENT ATHLETES

Science.gov (United States)

Faigenbaum, Avery D.; McFarland, James E.; Herman, Robert; Naclerio, Fernando; Ratamess, Nicholas A.; Kang, Jie; Myer, Gregory D.

2013-01-01

Although the power clean test is routinely used to assess strength and power performance in adult athletes, the reliability of this measure in younger populations has not been examined. Therefore, the purpose of this study was to determine the reliability of the one repetition maximum (1 RM) power clean in adolescent athletes. Thirty-six male athletes (age 15.9 ± 1.1 yrs, body mass 79.1 ± 20.3 kg, height 175.1 ±7.4 cm) who had more than 1 year of training experience with weightlifting exercises performed a 1 RM power clean on two nonconsecutive days in the afternoon following standardized procedures. All test procedures were supervised by a senior level weightlifting coach and consisted of a systematic progression in test load until the maximum resistance that could be lifted for one repetition using proper exercise technique was determined. Data were analyzed using an intraclass correlation coefficient (ICC [2,k]), Pearson correlation coefficient (r), repeated measures ANOVA, Bland-Altman plot, and typical error analyses. Analysis of the data revealed that the test measures were highly reliable demonstrating a test-retest ICC of 0.98 (95% CI = 0.96–0.99). Testing also demonstrated a strong relationship between 1 RM measures on trial 1 and trial 2 (r=0.98, pinjuries occurred during the study period and the testing protocol was well-tolerated by all subjects. These findings indicate that 1 RM power clean testing has a high degree of reproducibility in trained male adolescent athletes when standardized testing procedures are followed and qualified instruction is present. PMID:22233786

Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

Science.gov (United States)

Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

2016-03-03

The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria, completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two way random effect model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SRMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r 0.69) CONCLUSIONS: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
Reliable sagittal plane kinematic gait assessments are feasible using low-cost webcam technology.

Science.gov (United States)

Saner, Robert J; Washabaugh, Edward P; Krishnan, Chandramouli

2017-07-01

Three-dimensional (3-D) motion capture systems are commonly used for gait analysis because they provide reliable and accurate measurements. However, the downside of this approach is that it is expensive and requires technical expertise; thus making it less feasible in the clinic. To address this limitation, we recently developed and validated (using a high-precision walking robot) a low-cost, two-dimensional (2-D) real-time motion tracking approach using a simple webcam and LabVIEW Vision Assistant. The purpose of this study was to establish the repeatability and minimal detectable change values of hip and knee sagittal plane gait kinematics recorded using this system. Twenty-one healthy subjects underwent two kinematic assessments while walking on a treadmill at a range of gait velocities. Intraclass correlation coefficients (ICC) and minimal detectable change (MDC) values were calculated for commonly used hip and knee kinematic parameters to demonstrate the reliability of the system. Additionally, Bland-Altman plots were generated to examine the agreement between the measurements recorded on two different days. The system demonstrated good to excellent reliability (ICC>0.75) for all the gait parameters tested on this study. The MDC values were typically low (gait assessments using webcam technology can be reliably used for clinical and research purposes. Copyright © 2017 Elsevier B.V. All rights reserved.
Reliability of the CMT neuropathy score (second version) in Charcot-Marie-Tooth disease.

LENUS (Irish Health Repository)

Murphy, Sinéad M

2011-09-01

The Charcot-Marie-Tooth neuropathy score (CMTNS) is a reliable and valid composite score comprising symptoms, signs, and neurophysiological tests, which has been used in natural history studies of CMT1A and CMT1X and as an outcome measure in treatment trials of CMT1A. Following an international workshop on outcome measures in Charcot-Marie-Tooth disease (CMT), the CMTNS was modified to attempt to reduce floor and ceiling effects and to standardize patient assessment, aiming to improve its sensitivity for detecting change over time and the effect of an intervention. After agreeing on the modifications made to the CMTNS (CMTNS2), three examiners evaluated 16 patients to determine inter-rater reliability; one examiner evaluated 18 patients twice within 8 weeks to determine intra-rater reliability. Three examiners evaluated 63 patients using the CMTNS and the CMTNS2 to determine how the modifications altered scoring. For inter- and intra-rater reliability, intra-class correlation coefficients (ICCs) were ≥0.96 for the CMT symptom score and the CMT examination score. There were small but significant differences in some of the individual components of the CMTNS compared with the CMTNS2, mainly in the components that had been modified the most. A longitudinal study is in progress to determine whether the CMTNS2 is more sensitive than the CMTNS for detecting change over time.
Validity and Reliability of the Clinical Competency Evaluation Instrument for Use among Physiotherapy Students: Pilot study.

Science.gov (United States)

Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh

2015-05-01

The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at University Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts were identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regards to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass coefficient correlation range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It showed high internal consistency, thereby providing evidence that the CCEVI has moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
Intra- and inter-rater reliability of the Knee Society Knee Score when used by two physiotherapists in patients post total knee arthroplasty

Directory of Open Access Journals (Sweden)

S. Gopal

2010-01-01

Full Text Available Background and Purpose: It has yet to be shown whether routine physiotherapy plays a role in the rehabilitation of patients post totalknee arthroplasty (Rajan et al 2004. Physiotherapists should be using validoutcome measures to provide evidence of the benefit of their intervention. The aim of this study was to establish the intra and inter-rater reliability of the Knee Society Knee Score, a scoring system developed by Insall et al(1989. The Knee Society Knee Score can be used to assess the integrity of theknee joint of patients undergoing total knee arthroplasty. Since the scoreinvolves clinical testing, the intra-rater reliability of the clinician should be established prior to using the scores as datain clinical research. W here multiple clinicians are involved, inter-rater reliability should also be established.Design: This was a correlation study.Subjects: A sample of thirty patients post total knee arthroplasty attending the arthroplasty clinic at Johannesburg Hospital between six weeks and twelve months postoperatively.M ethod: Recruited patients were evaluated twice with a time interval of one hour between each assessment. Statistical A nalysis: The intra- and inter-rater reliability were estimated using Intraclass Correlation Coefficient (ICC. R esults: The intra-rater reliability showed excellent reliability (h= 0.95 for Examiner A and good reliability (h= 0.71for Examiner B. The inter-rater reliability showed moderate reliability (h= 0.67 during test one and h= 0.66 during test two.Conclusion: The KSKS has good intra-rater reliability when tested within a period of one hour. The KSKS demonstrated moderate agreement for inter rater reliability.
Reliability of proxy respondents for patients with stroke: a systematic review.

Science.gov (United States)

Oczkowski, Colin; O'Donnell, Martin

2010-01-01

Proxy respondents are an important aspect of stroke medicine and research. We performed a systematic review of studies evaluating the reliability of proxy respondents for stroke patients. Studies were identified by searches of MEDLINE, Google, and the Cochrane Library between January 1969 and June 2008. All were prospective or cross-sectional studies reporting the reliability of proxy respondents for patients with a history of previous stroke or transient ischemic attack. One author abstracted data. For each study, intraclass correlation (ICC) or the k-statistic was categorized as poor (0.80). Thirteen studies, with a total of 2618 participants, met our inclusion criteria. Most studies recruited patients >3 months after their stroke. Of these studies, 5 (360 participants; 5 scales) evaluated reliability of proxy respondents for activities of daily living (ADL), and 9 (2334 participants; 9 scales) evaluated reliability of proxy respondents for quality of life (QoL). One study evaluated both. In studies, the ICC/k for scales ranged from 0.61 to 0.91 for ADL and from 0.41 to 0.8 for QoL. Most studies reported that proxy respondents overestimated impairments compared with patient self-reports. Stroke severity and objective nature of questions were the most consistent determinants of disagreement between stroke patient and proxy respondent. Our data indicate that beyond the acute stroke period, the reliability of proxy respondents for validated scales of ADL was substantial to excellent, while that of scales for QoL was moderate to substantial. Copyright (c) 2010 National Stroke Association. Published by Elsevier Inc. All rights reserved.
Regional reliability of quantitative signal targeting with alternating radiofrequency (STAR) labeling of arterial regions (QUASAR).

Science.gov (United States)

Tatewaki, Yasuko; Higano, Shuichi; Taki, Yasuyuki; Thyreau, Benjamin; Murata, Takaki; Mugikura, Shunji; Ito, Daisuke; Takase, Kei; Takahashi, Shoki

2014-01-01

Quantitative signal targeting with alternating radiofrequency labeling of arterial regions (QUASAR) is a recent spin labeling technique that could improve the reliability of brain perfusion measurements. Although it is considered reliable for measuring gray matter as a whole, it has never been evaluated regionally. Here we assessed this regional reliability. Using a 3-Tesla Philips Achieva whole-body system, we scanned four times 10 healthy volunteers, in two sessions 2 weeks apart, to obtain QUASAR images. We computed perfusion images and ran a voxel-based analysis within all brain structures. We also calculated mean regional cerebral blood flow (rCBF) within regions of interest configured for each arterial territory distribution. The mean CBF over whole gray matter was 37.74 with intraclass correlation coefficient (ICC) of .70. In white matter, it was 13.94 with an ICC of .30. Voxel-wise ICC and coefficient-of-variation maps showed relatively lower reliability in watershed areas and white matter especially in deeper white matter. The absolute mean rCBF values were consistent with the ones reported from PET, as was the relatively low variability in different feeding arteries. Thus, QUASAR reliability for regional perfusion is high within gray matter, but uncertain within white matter. © 2014 The Authors. Journal of Neuroimaging published by the American Society of Neuroimaging.
Inter- and intra- observer reliability of risk assessment of repetitive work without an explicit method.

Science.gov (United States)

Eliasson, Kristina; Palm, Peter; Nyman, Teresia; Forsman, Mikael

2017-07-01

A common way to conduct practical risk assessments is to observe a job and report the observed long term risks for musculoskeletal disorders. The aim of this study was to evaluate the inter- and intra-observer reliability of ergonomists' risk assessments without the support of an explicit risk assessment method. Twenty-one experienced ergonomists assessed the risk level (low, moderate, high risk) of eight upper body regions, as well as the global risk of 10 video recorded work tasks. Intra-observer reliability was assessed by having nine of the ergonomists repeat the procedure at least three weeks after the first assessment. The ergonomists made their risk assessment based on his/her experience and knowledge. The statistical parameters of reliability included agreement in %, kappa, linearly weighted kappa, intraclass correlation and Kendall's coefficient of concordance. The average inter-observer agreement of the global risk was 53% and the corresponding weighted kappa (K w ) was 0.32, indicating fair reliability. The intra-observer agreement was 61% and 0.41 (K w ). This study indicates that risk assessments of the upper body, without the use of an explicit observational method, have non-acceptable reliability. It is therefore recommended to use systematic risk assessment methods to a higher degree. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Reliability of Semiautomated Computational Methods for Estimating Tibiofemoral Contact Stress in the Multicenter Osteoarthritis Study

Directory of Open Access Journals (Sweden)

Donald D. Anderson

2012-01-01

Full Text Available Recent findings suggest that contact stress is a potent predictor of subsequent symptomatic osteoarthritis development in the knee. However, much larger numbers of knees (likely on the order of hundreds, if not thousands need to be reliably analyzed to achieve the statistical power necessary to clarify this relationship. This study assessed the reliability of new semiautomated computational methods for estimating contact stress in knees from large population-based cohorts. Ten knees of subjects from the Multicenter Osteoarthritis Study were included. Bone surfaces were manually segmented from sequential 1.0 Tesla magnetic resonance imaging slices by three individuals on two nonconsecutive days. Four individuals then registered the resulting bone surfaces to corresponding bone edges on weight-bearing radiographs, using a semi-automated algorithm. Discrete element analysis methods were used to estimate contact stress distributions for each knee. Segmentation and registration reliabilities (day-to-day and interrater for peak and mean medial and lateral tibiofemoral contact stress were assessed with Shrout-Fleiss intraclass correlation coefficients (ICCs. The segmentation and registration steps of the modeling approach were found to have excellent day-to-day (ICC 0.93–0.99 and good inter-rater reliability (0.84–0.97. This approach for estimating compartment-specific tibiofemoral contact stress appears to be sufficiently reliable for use in large population-based cohorts.
Cross-cultural adaptation, reliability, and validity of the Persian version of the Cumberland Ankle Instability Tool.

Science.gov (United States)

Hadadi, Mohammad; Ebrahimi Takamjani, Ismail; Ebrahim Mosavi, Mohammad; Aminian, Gholamreza; Fardipour, Shima; Abbasi, Faeze

2017-08-01

The purpose of the present study was to translate and to cross-culturally adapt the Cumberland Ankle Instability Tool (CAIT) into Persian language and to evaluate its psychometric properties. The International Quality of Life Assessment process was pursued to translate CAIT into Persian. Two groups of Persian-speaking individuals, 105 participants with a history of ankle sprain and 30 participants with no history of ankle sprain, were asked to fill out Persian version of CAIT (CAIT-P), Foot and Ankle Ability Measure (FAAM), and Visual Analog Scale (VAS). Data obtained from the first administration of CAIT were used to evaluate floor and ceiling effects, internal consistency, dimensionality, and criterion validity. To determine the test-retest reliability, 45 individuals re-filled CAIT 5-7 days after the first session. Cronbach's alpha was over the cutoff point of 0.70 for both ankles and in both groups. The intra-class correlation coefficient was high for right (0.95) and left (0.91) ankles. There was a strong correlation between each item and the total score of the CAIT-P. Although the CAIT-P had strong correlation with VAS, its correlation with both subscales of FAAM was moderate. The CAIT-P has good validity and reliability and it can be used by clinicians and researchers for identification and investigation of functional ankle instability. Implications for Rehabilitation Chronic ankle instability is one of the most common consequences of acute ankle sprain. Cumberland Ankle Instability Tool is an acceptable measure to determine functional ankle instability and its severity. The Persian version of Cumberland Ankle Instability Tool is a valid and reliable tool for clinical and research purpose in Persian-speaking individuals.
Reliability of anthropometric measurements in young male and female artistic gymnasts.

Science.gov (United States)

Siatras, Theophanis; Skaperda, Malamati; Mameletzi, Dimitra

2010-12-01

Body dimensions and body composition of children participating in artistic activities, such as gymnastics and many types of dancing, are important factors in performance improvement. The present study aimed to determine the reliability of a series of selected anthropometric measurements in young male and female gymnasts. Segment lengths, body breadths, circumferences, and skinfold thickness were measured in 20 young gymnasts by the same experienced examiner, using portable and easy-to-use instruments. All parameters were measured twice (test-retest) under the same conditions within a week's period. The high intra-class correlation coefficient (ICC) values ranging from 0.87 to 0.99, as well as the low coefficient of variation (CV) values (artistic gymnasts. Therefore, these measurements could contribute to further research in this field of investigation, helping to monitor young artistic gymnasts' growth status and identify specific characteristics for increased performance in this sport.
Measuring Outcomes for Dysphagia: Validity and Reliability of the European Portuguese Eating Assessment Tool (P-EAT-10).

Science.gov (United States)

Nogueira, Dália Santos; Ferreira, Pedro Lopes; Reis, Elizabeth Azevedo; Lopes, Inês Sousa

2015-10-01

The purpose of this study was to evaluate the validity and the reliability of the European Portuguese version of the EAT-10 (P-EAT-10). This research was conducted in three phases: (i) cultural and linguistic adaptation; (ii) feasibility and reliability test; and (iii) validity tests. The final sample was formed by a cohort of 520 subjects. The P-EAT-10 index was compared for socio-demographic and clinic variables. It was also compared for both dysphagic and non-dysphagic groups as well as for the results of the 3Oz wst. Lastly, the P-EAT-10 scores were correlated with the EuroQol Group Portuguese EQ-5D index. The Cronbach's α obtained for the P-EAT-10 scale was 0.952 and it remained excellent even if any item was deleted. The item-total and the intraclass correlation coefficients were very good. The P-EAT-10 mean of the non-dysphagic cohort was 0.56 and that of the dysphagic cohort was 14.26, the mean comparison between the 3Oz wst groups and the P-EAT-10 scores were significant. A significant higher perception of QoL was also found among the non-dysphagic subjects. P-EAT-10 is a valid and reliable measure that may be used to document dysphagia which makes it useful both for screening in clinical practice and in research.
Advances in population surveillance for physical activity and sedentary behavior: reliability and validity of time use surveys.

Science.gov (United States)

van der Ploeg, Hidde P; Merom, Dafna; Chau, Josephine Y; Bittman, Michael; Trost, Stewart G; Bauman, Adrian E

2010-11-15

Many countries conduct regular national time use surveys, some of which date back as far as the 1960s. Time use surveys potentially provide more detailed and accurate national estimates of the prevalence of sedentary and physical activity behavior than more traditional self-report surveillance systems. In this study, the authors determined the reliability and validity of time use surveys for assessing sedentary and physical activity behavior. In 2006 and 2007, participants (n = 134) were recruited from work sites in the Australian state of New South Wales. Participants completed a 2-day time use diary twice, 7 days apart, and wore an accelerometer. The 2 diaries were compared for test-retest reliability, and comparison with the accelerometer determined concurrent validity. Participants with similar activity patterns during the 2 diary periods showed reliability intraclass correlations of 0.74 and 0.73 for nonoccupational sedentary behavior and moderate/vigorous physical activity, respectively. Comparison of the diary with the accelerometer showed Spearman correlations of 0.57-0.59 and 0.45-0.69 for nonoccupational sedentary behavior and moderate/vigorous physical activity, respectively. Time use surveys appear to be more valid for population surveillance of nonoccupational sedentary behavior and health-enhancing physical activity than more traditional surveillance systems. National time use surveys could be used to retrospectively study nonoccupational sedentary and physical activity behavior over the past 5 decades.
The Dutch language anterior cruciate ligament return to sport after injury scale (ACL-RSI) - validity and reliability.

Science.gov (United States)

Slagers, Anton J; Reininga, Inge H F; van den Akker-Scheek, Inge

2017-02-01

The ACL-Return to Sport after Injury scale (ACL-RSI) measures athletes' emotions, confidence in performance, and risk appraisal in relation to return to sport after ACL reconstruction. Aim of this study was to study the validity and reliability of the Dutch version of the ACL-RSI (ACL-RSI (NL)). Total 150 patients, who were 3-16 months postoperative, completed the ACL-RSI(NL) and 5 other questionnaires regarding psychological readiness to return to sports, knee-specific physical functioning, kinesiophobia, and health-specific locus of control. Construct validity of the ACL-RSI(NL) was determined with factor analysis and by exploring 10 hypotheses regarding correlations between ACL-RSI(NL) and the other questionnaires. For test-retest reliability, 107 patients (5-16 months postoperative) completed the ACL-RSI(NL) again 2 weeks after the first administration. Cronbach's alpha, Intraclass Correlation Coefficient (ICC), SEM, and SDC, were calculated. Bland-Altman analysis was conducted to assess bias between test and retest. Nine hypotheses (90%) were confirmed, indicating good construct validity. The ACL-RSI(NL) showed good internal consistency (Cronbach's alpha 0.94) and test-retest reliability (ICC 0.93). SEM was 5.5 and SDC was 15. A significant bias of 3.2 points between test and retest was found. Therefore, the ACL-RSI(NL) can be used to investigate psychological factors relevant to returning to sport after ACL reconstruction.
The modified gait abnormality rating scale in patients with a conversion disorder: a reliability and responsiveness study.

Science.gov (United States)

Vandenberg, Justin M; George, Deanna R; O'Leary, Andrea J; Olson, Lindsay C; Strassburg, Kaitlyn R; Hollman, John H

2015-01-01

Individuals with conversion disorder have neurologic symptoms that are not identified by an underlying organic cause. Often the symptoms manifest as gait disturbances. The modified gait abnormality rating scale (GARS-M) may be useful for quantifying gait abnormalities in these individuals. The purpose of this study was to examine the reliability, responsiveness and concurrent validity of GARS-M scores in individuals with conversion disorder. Data from 27 individuals who completed a rehabilitation program were included in this study. Pre- and post-intervention videos were obtained and walking speed was measured. Five examiners independently evaluated gait performance according to the GARS-M criteria. Inter- and intrarater reliability of GARS-M scores were estimated with intraclass correlation coefficients (ICCs). Responsiveness was estimated with the minimum detectable change (MDC). Pre- to post-treatment changes in GARS-M scores were analyzed with a dependent t-test. The correlation between GARS-M scores and walking speed was analyzed to assess concurrent validity. GARS-M scores were quantified with good-to-excellent inter- (ICC = 0.878) and intrarater reliability (ICC = 0.989). The MDC was 2 points. Mean GARS-M scores decreased from 7 ± 5 at baseline to 1 ± 2 at discharge (t26 = 7.411, p conversion disorder. GARS-M scores provide objective measures upon which treatment effects can be assessed. Copyright © 2014 Elsevier B.V. All rights reserved.
An Indication of Reliability of the Two-Level Approach of the AWIN Welfare Assessment Protocol for Horses

Directory of Open Access Journals (Sweden)

Irena Czycholl

2018-01-01

Full Text Available To enhance feasibility, the Animal Welfare Indicators (AWIN assessment protocol for horses consists of two levels: the first is a visual inspection of a sample of horses performed from a distance, the second a close-up inspection of all horses. The aim was to analyse whether information would be lost if only the first level were performed. In this study, 112 first and 112 second level assessments carried out on a subsequent day by one observer were compared by calculating the Spearman’s Rank Correlation Coefficient (RS, Intraclass Correlation Coefficients (ICC, Smallest Detectable Changes (SDC and Limits of Agreements (LoA. Most indicators demonstrated sufficient reliability between the two levels. Exceptions were the Horse Grimace Scale, the Avoidance Distance Test and the Voluntary Human Approach Test (e.g., Voluntary Human Approach Test: RS: 0.38, ICC: 0.38, SDC: 0.21, LoA: −0.25–0.17, which could, however, be also interpreted as a lack of test-retest reliability. Further disagreement was found for the indicator consistency of manure (RS: 0.31, ICC: 0.38, SDC: 0.36, LoA: −0.38–0.36. For these indicators, an adaptation of the first level would be beneficial. Overall, in this study, the division into two levels was reliable and might therewith have the potential to enhance feasibility in other welfare assessment schemes.
Validity and reliability of the patient assessment of constipation quality of life questionnaire for the Turkish population.

Science.gov (United States)

Bengi, Göksel; Yalçın, Mustafa; Akpınar, Hale; Keskinoğlu, Pembe; Ellidokuz, Hülya

2015-07-01

There are few specific evaluation forms for evaluating the quality of life among patients with chronic constipation. Our study aimed to determine the validity and reliability of the translated Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire for the Turkish population because evidence of its reliability and validity is required to justify its use in other studies and clinical practice. This study included 154 patients with constipation who were treated at the Department of Gastroenterology, Dokuz Eylül University Hospital between January and June 2012. The translated PAC-QOL questionnaire was completed by patients at the clinic and also at a 2-week follow-up to test its reliability. Cronbach's alpha coefficient (internal consistency) was 0.91 (good) for the translated PAC-QOL questionnaire. Time validity was evaluated using the intraclass correlation coefficient (ICC) method, and the ICC value for all questions was confirmed as 0.68 at the 2-week follow-up. The validity of the tool in the study group was evaluated using factor analysis, and the results were highly significant (Kaiser-Meyer-Olkin value: 0.857; Bartlett's test: p=0.001). Questions were categorized according to six factors based on the factor analysis, and these factors explained 65.1% of the total variation. For hypothesis verification of the tool, the correlation coefficient for PAC-QOL and PAC Symptoms (PAC-SYM) was r=0.577 (p<0.001), whereas the correlation coefficient for PAC-QOL and constipation severity score was r=0.457 (p<0.001). The PAC-QOL questionnaire was reliable, although not valid because of the limited sample group.
Quantifying frontal plane knee motion during single limb squats: reliability and validity of 2-dimensional measures.

Science.gov (United States)

Gwynne, Craig R; Curran, Sarah A

2014-12-01

Clinical assessment of lower limb kinematics during dynamic tasks may identify individuals who demonstrate abnormal movement patterns that may lead to etiology of exacerbation of knee conditions such as patellofemoral joint (PFJt) pain. The purpose of this study was to determine the reliability, validity and associated measurement error of a clinically appropriate two-dimensional (2-D) procedure of quantifying frontal plane knee alignment during single limb squats. Nine female and nine male recreationally active subjects with no history of PFJt pain had frontal plane limb alignment assessed using three-dimensional (3-D) motion analysis and digital video cameras (2-D analysis) while performing single limb squats. The association between 2-D and 3-D measures was quantified using Pearson's product correlation coefficients. Intraclass correlation coefficients (ICCs) were determined for within- and between-session reliability of 2-D data and standard error of measurement (SEM) was used to establish measurement error. Frontal plane limb alignment assessed with 2-D analysis demonstrated good correlation compared with 3-D methods (r = 0.64 to 0.78, p < 0.001). Within-session (0.86) and between-session ICCs (0.74) demonstrated good reliability for 2-D measures and SEM scores ranged from 2° to 4°. 2-D measures have good consistency and may provide a valid measure of lower limb alignment when compared to existing 3-D methods. Assessment of lower limb kinematics using 2-D methods may be an accurate and clinically useful alternative to 3-D motion analysis when identifying individuals who demonstrate abnormal movement patterns associated with PFJt pain. 2b.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran

Directory of Open Access Journals (Sweden)

Davoud Pourmarzi

2016-04-01

Full Text Available Objective: To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran.Materials and methods: Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA. Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test–retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC.Results: Cronbach’s alpha coefficient for overall scale was 0.85. Also Cronbach’s alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84, negative self-worth (4 items, α = 0.70, perceived interpersonal insecurity (2 items, α = 0.57, financial insecurity (3 items, α = 0.70, discretionary disclosure (2 items, α = 0.83. Test–retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales.Conclusion: This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran.
The precision and reliability evaluation of 3-dimensional printed damaged bone and prosthesis models by stereo lithography appearance.

Science.gov (United States)

Zou, Yun; Han, Qing; Weng, Xisheng; Zou, Yongwei; Yang, Yingying; Zhang, Kesong; Yang, Kerong; Xu, Xiaolin; Wang, Chenyu; Qin, Yanguo; Wang, Jincheng

2018-02-01

Recently, clinical application of 3D printed model was increasing. However, there was no systemic study for confirming the precision and reliability of 3D printed model. Some senior clinical doctors mistrusted its reliability in clinical application. The purpose of this study was to evaluate the precision and reliability of stereolithography appearance (SLA) 3D printed model.Some related parameters were selected to research the reliability of SLA 3D printed model. The computed tomography (CT) data of bone/prosthesis and model were collected and 3D reconstructed. Some anatomical parameters were measured and statistical analysis was performed; the intraclass correlation coefficient (ICC) was used to was used to evaluate the similarity between the model and real bone/prosthesis. the absolute difference (mm) and relative difference (%) were conducted. For prosthesis model, the 3-dimensional error was measured.There was no significant difference in the anatomical parameters except max height (MH) of long bone. All the ICCs were greater than 0.990. The maximum absolute and relative difference were 0.45 mm and 1.10%; The 3-dimensional error analysis showed that positive/minus distance were 0.273 mm/0.237 mm.The application of SLA 3D printed model in diagnosis and treatment process of complex orthopedic disease was reliable and precise.

Diurnal variation and reliability of the urine lactate concentration after maximal exercise.

Science.gov (United States)

Nikolaidis, Stefanos; Kosmidis, Ioannis; Sougioultzis, Michail; Kabasakalis, Athanasios; Mougios, Vassilis

2018-01-01

The postexercise urine lactate concentration is a novel valid exercise biomarker, which has exhibited satisfactory reliability in the morning hours under controlled water intake. The aim of the present study was to investigate the diurnal variation of the postexercise urine lactate concentration and its reliability in the afternoon hours. Thirty-two healthy children (11 boys and 21 girls) and 23 adults (13 men and 10 women) participated in the study. All participants performed two identical sessions of eight 25 m bouts of maximal freestyle swimming executed every 2 min with passive recovery in between. These sessions were performed in the morning and afternoon and were separated by 3-4 days. Adults performed an additional afternoon session that was also separated by 3-4 days. All swimmers drank 500 mL of water before and another 500 mL after each test. Capillary blood and urine samples were collected before and after each test for lactate determination. Urine creatinine, urine density and body water content were also measured. The intraclass correlation coefficient was used as a reliability index between the morning and afternoon tests, as well as between the afternoon test and retest. Swimming performance and body water content exhibited excellent reliability in both children and adults. The postexercise blood lactate concentration did not show diurnal variation, showing a good reliability between the morning and afternoon tests, as well as high reliability between the afternoon test and retest. The postexercise urine density and lactate concentration were affected by time of day. However, when lactate was normalized to creatinine, it exhibited excellent reliability in children and good-to-high reliability in adults. The postexercise urine lactate concentration showed high reliability between the afternoon test and retest, independent of creatinine normalization. The postexercise blood and urine lactate concentrations were significantly correlated in all
Is a sphygmomanometer a valid and reliable tool to measure the isometric strength of hip muscles? A systematic review.

Science.gov (United States)

Toohey, Liam Anthony; De Noronha, Marcos; Taylor, Carolyn; Thomas, James

2015-02-01

Muscle strength measurement is a key component of physiotherapists' assessment and is frequently used as an outcome measure. A sphygmomanometer is an instrument commonly used to measure blood pressure that can be potentially used as a tool to assess isometric muscle strength. To systematically review the evidence on the reliability and validity of a sphygmomanometer for measuring isometric strength of hip muscles. A literature search was conducted across four databases. Studies were eligible if they presented data on reliability and/or validity, used a sphygmomanometer to measure isometric muscle strength of the hip region, and were peer reviewed. The individual studies were evaluated for quality using a standardized critical appraisal tool. A total of 644 articles were screened for eligibility, with five articles chosen for inclusion. The use of a sphygmomanometer to objectively assess isometric muscle strength of the hip muscles appears to be reliable with intraclass correlation coefficient values ranging from 0.66 to 0.94 in elderly and young populations. No studies were identified that have assessed the validity of a sphygmomanometer. The sphygmomanometer appears to be reliable for assessment of isometric muscle strength around the hip joint, but further research is warranted to establish its validity.
Chinese version of the Constant-Murley questionnaire for shoulder pain and disability: a reliability and validation study.

Science.gov (United States)

Yao, Min; Yang, Long; Cao, Zuo-Yuan; Cheng, Shao-Dan; Tian, Shuang-Lin; Sun, Yue-Li; Wang, Jing; Xu, Bao-Ping; Hu, Xiao-Chun; Wang, Yong-Jun; Zhang, Ying; Cui, Xue-Jun

2017-09-18

Shoulder pain is a common musculoskeletal disorder in Chinese population, which affects more than 1,3 billion individuals. To the best of our knowledge, there has been no available Chinese-language version of measurements of shoulder pain and disability so far. Moreover, the Constant-Murley score (CMS) questionnaire is a universally recognized patient-reported questionnaire for clinical practice and research. The present study was designed to evaluate a Chinese translational version of CMS and subsequently assess its reliability and validity. The Chinese translational version of CMS was formulated by means of forward-backward translation. Meanwhile, a final review was carried out by an expert committee, followed by conducting a test of the pre-final version. Therefore, the reliability and validity of the Chinese translational version of CMS could be assessed using the internal consistency, construct validity, factor analysis, reliability and floor and ceiling effects. Specifically, the reliability was assessed by testing the internal consistency (Cronbach's α) and test-retest reliability (intraclass coefficient correlation [ICC]), while the construct validity was evaluated via comparison between the Chinese translational version of CMS with visual analog scale (VAS) score and the 36-Item Short Form Health Survey (SF-36, Spearman correlation). The questionnaire was verified to be acceptable after distribution among 120 subjects with unilateral shoulder pain. Factor analysis had revealed a two-factor and 10-item solution. Moreover, the assessment results indicated that the Chinese translational version of CMS questionnaire harbored good internal consistency (Cronbach's α = 0.739) and test-retest reliability (ICC = 0.827). In addition, the Chinese translational version of CMS was moderately correlated with VAS score (r = 0.497) and SF-36 (r = 0.135). No obvious floor and ceiling effects were observed in the Chinese translational version of CMS questionnaire
Reliability of a computer and Internet survey (Computer User Profile) used by adults with and without traumatic brain injury (TBI).

Science.gov (United States)

Kilov, Andrea M; Togher, Leanne; Power, Emma

2015-01-01

To determine test-re-test reliability of the 'Computer User Profile' (CUP) in people with and without TBI. The CUP was administered on two occasions to people with and without TBI. The CUP investigated the nature and frequency of participants' computer and Internet use. Intra-class correlation coefficients and kappa coefficients were conducted to measure reliability of individual CUP items. Descriptive statistics were used to summarize content of responses. Sixteen adults with TBI and 40 adults without TBI were included in the study. All participants were reliable in reporting demographic information, frequency of social communication and leisure activities and computer/Internet habits and usage. Adults with TBI were reliable in 77% of their responses to survey items. Adults without TBI were reliable in 88% of their responses to survey items. The CUP was practical and valuable in capturing information about social, leisure, communication and computer/Internet habits of people with and without TBI. Adults without TBI scored more items with satisfactory reliability overall in their surveys. Future studies may include larger samples and could also include an exploration of how people with/without TBI use other digital communication technologies. This may provide further information on determining technology readiness for people with TBI in therapy programmes.
Reliability of a single objective measure in assessing sleepiness.

Science.gov (United States)

Sunwoo, Bernie Y; Jackson, Nicholas; Maislin, Greg; Gurubhagavatula, Indira; George, Charles F; Pack, Allan I

2012-01-01

To evaluate reliability of single objective tests in assessing sleepiness. Subjects who completed polysomnography underwent a 4-nap multiple sleep latency test (MSLT) the following day. Prior to each nap opportunity on MSLT, subjects performed the psychomotor vigilance test (PVT) and divided attention driving task (DADT). Results of single versus multiple test administrations were compared using the intraclass correlation coefficient (ICC) and adjusted for test administration order effects to explore time of day effects. Measures were explored as continuous and binary (i.e., impaired or not impaired). Community-based sample evaluated at a tertiary, university-based sleep center. 372 adult commercial vehicle operators oversampled for increased obstructive sleep apnea risk. N/A. AS CONTINUOUS MEASURES, ICC WERE AS FOLLOWS: MSLT 0.45, PVT median response time 0.69, PVT number of lapses 0.51, 10-min DADT tracking error 0.87, 20-min DADT tracking error 0.90. Based on binary outcomes, ICC were: MSLT 0.63, PVT number of lapses 0.85, 10-min DADT 0.95, 20-min DADT 0.96. Statistically significant time of day effects were seen in both the MSLT and PVT but not the DADT. Correlation between ESS and different objective tests was strongest for MSLT, range [-0.270 to -0.195] and persisted across all time points. Single DADT and PVT administrations are reliable measures of sleepiness. A single MSLT administration can reasonably discriminate individuals with MSL < 8 minutes. These results support the use of a single administration of some objective tests of sleepiness when performed under controlled conditions in routine clinical care.
Reliability and Validity of the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2) in Adults with Non-Cancer Pain

Science.gov (United States)

Hayes, Corey J.; Bhandari, Naleen Raj; Kathe, Niranjan; Payakachat, Nalin

2017-01-01

Limited evidence exists on how non-cancer pain (NCP) affects an individual’s health-related quality of life (HRQoL). This study aimed to validate the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2), a generic measure of HRQoL, in a NCP cohort using the Medical Expenditure Panel Survey Longitudinal Files. The SF Mental Component Summary (MCS12) and SF Physical Component Summary (PCS12) were tested for reliability (internal consistency and test-retest reliability) and validity (construct: convergent and discriminant; criterion: concurrent and predictive). A total of 15,716 patients with NCP were included in the final analysis. The MCS12 and PCS12 demonstrated high internal consistency (Cronbach’s alpha and Mosier’s alpha > 0.8), and moderate and high test-retest reliability, respectively (MCS12 intraclass correlation coefficient (ICC): 0.64; PCS12 ICC: 0.73). Both scales were significantly associated with a number of chronic conditions (p reliable and valid measure of HRQoL for patients with NCP. PMID:28445438
Measuring reliable change in cognition using the Edinburgh Cognitive and Behavioural ALS Screen (ECAS).

Science.gov (United States)

Crockford, Christopher; Newton, Judith; Lonergan, Katie; Madden, Caoifa; Mays, Iain; O'Sullivan, Meabhdh; Costello, Emmet; Pinto-Grau, Marta; Vajda, Alice; Heverin, Mark; Pender, Niall; Al-Chalabi, Ammar; Hardiman, Orla; Abrahams, Sharon

2018-02-01

Cognitive impairment affects approximately 50% of people with amyotrophic lateral sclerosis (ALS). Research has indicated that impairment may worsen with disease progression. The Edinburgh Cognitive and Behavioural ALS Screen (ECAS) was designed to measure neuropsychological functioning in ALS, with its alternate forms (ECAS-A, B, and C) allowing for serial assessment over time. The aim of the present study was to establish reliable change scores for the alternate forms of the ECAS, and to explore practice effects and test-retest reliability of the ECAS's alternate forms. Eighty healthy participants were recruited, with 57 completing two and 51 completing three assessments. Participants were administered alternate versions of the ECAS serially (A-B-C) at four-month intervals. Intra-class correlation analysis was employed to explore test-retest reliability, while analysis of variance was used to examine the presence of practice effects. Reliable change indices (RCI) and regression-based methods were utilized to establish change scores for the ECAS alternate forms. Test-retest reliability was excellent for ALS Specific, ALS Non-Specific, and ECAS Total scores of the combined ECAS A, B, and C (all > .90). No significant practice effects were observed over the three testing sessions. RCI and regression-based methods produced similar change scores. The alternate forms of the ECAS possess excellent test-retest reliability in a healthy control sample, with no significant practice effects. The use of conservative RCI scores is recommended. Therefore, a change of ≥8, ≥4, and ≥9 for ALS Specific, ALS Non-Specific, and ECAS Total score is required for reliable change.
Reliability of the Brazilian Portuguese version of the Gross Motor Function Measure in children with cerebral palsy

Science.gov (United States)

Almeida, Kênnea M.; Albuquerque, Karolina A.; Ferreira, Marina L.; Aguiar, Stéphany K. B.; Mancini, Marisa C.

2016-01-01

OBJECTIVE: To test the intra- and interrater reliability of the Brazilian Portuguese version of the 66-item Gross Motor Function Measure (GMFM-66). METHOD: The sample included 48 children with cerebral palsy (CP), ranging from 2-17 years old, classified at levels I to IV of the Gross Motor Function Classification System (GMFCS) and four child rehabilitation examiners. A main examiner evaluated all children using the GMFM-66 and video-recorded the assessments. The other examiners watched the video recordings and scored them independently for the assessment of interrater reliability. For the intrarater reliability evaluation, the main examiner watched the video recordings one month after the evaluation and re-scored each child. We calculated reliability by using intraclass correlation coefficients (ICC) with their respective 95% confidence intervals. RESULTS: Excellent test reliability was documented. The intrarater reliability of the total sample was ICC=0.99 (95% CI 0.98-0.99), and the interrater reliability was ICC=0.97 (95% CI 0.95-0.98). The reliability across GMFCS levels ranged from ICC=0.92 (95% CI 0.72-0.98) to ICC=0.99 (95% CI 0.99-0.99); the lowest value was the interrater reliability for the GMFCS IV group. Reliability in the five GMFM dimensions varied from ICC=0.95 (95% CI 0.93-0.97) to ICC=0.99 (95% CI 0.99-0.99). CONCLUSION: The Brazilian Portuguese version of the GMFM-66 showed excellent intra- and interrater reliability when used in Brazilian children with CP levels GMFCS I to IV. PMID:26786081
Reliability measures of functional magnetic resonance imaging in a longitudinal evaluation of mild cognitive impairment.

Science.gov (United States)

Zanto, Theodore P; Pa, Judy; Gazzaley, Adam

2014-01-01

As the aging population grows, it has become increasingly important to carefully characterize amnestic mild cognitive impairment (aMCI), a preclinical stage of Alzheimer's disease (AD). Functional magnetic resonance imaging (fMRI) is a valuable tool for monitoring disease progression in selectively vulnerable brain regions associated with AD neuropathology. However, the reliability of fMRI data in longitudinal studies of older adults with aMCI is largely unexplored. To address this, aMCI participants completed two visual working tasks, a Delayed-Recognition task and a One-Back task, on three separate scanning sessions over a three-month period. Test-retest reliability of the fMRI blood oxygen level dependent (BOLD) activity was assessed using an intraclass correlation (ICC) analysis approach. Results indicated that brain regions engaged during the task displayed greater reliability across sessions compared to regions that were not utilized by the task. During task-engagement, differential reliability scores were observed across the brain such that the frontal lobe, medial temporal lobe, and subcortical structures exhibited fair to moderate reliability (ICC=0.3-0.6), while temporal, parietal, and occipital regions exhibited moderate to good reliability (ICC=0.4-0.7). Additionally, reliability across brain regions was more stable when three fMRI sessions were used in the ICC calculation relative to two fMRI sessions. In conclusion, the fMRI BOLD signal is reliable across scanning sessions in this population and thus a useful tool for tracking longitudinal change in observational and interventional studies in aMCI. © 2013.
Reliability of rehabilitative ultrasonographic imaging for muscle thickness measurement of the rhomboid major.

Science.gov (United States)

Jeong, Ju Ri; Ko, Young Jun; Ha, Hyun Geun; Lee, Wan Hee

2016-03-01

This study was to establish inter-rater and intrarater reliability of the rehabilitative ultrasonographic imaging (RUSI) technique for muscle thickness measurement of the rhomboid major at rest and with the shoulder abducted to 90°. Twenty-four young adults (eight men, 16 women; right-handed; mean age [±SD], 24·4 years [±2·6]) with no history of neck, shoulder, or arm pain were recruited. Rhomboid major muscle images were obtained in the resting position and with shoulder in 90° abduction using an ultrasonography system with a 7·5-MHz linear transducer. In these two positions, the examiners found the site at which the transducer could be placed. Two examiners obtained the images of all participants in three test sessions at random. Intraclass correlation coefficients (ICC) were used to estimate reliability. All ICCs (95% CI) were >0·75, ranging from 0·93 to 0·98, which indicates good reliability. The ICCs for inter-rater reliability ranged from 0·75 to 0·94. For the absolute value of the difference in the intra-examiner reliability between the right and left ratios, the ICCs ranged from 0·58 to 0·91. In this study, the intra- and interexaminer reliability of muscle thickness measurements of the rhomboid major were good. Therefore, we suggest that muscle thickness measurements of the rhomboid major obtained with the RUSI technique would be useful for clinical rehabilitative assessment. © 2014 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.
Construct validity, test-retest reliability and internal consistency of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) in patients with carpal tunnel syndrome.

Science.gov (United States)

Buntragulpoontawee, Montana; Phutrit, Suphatha; Tongprasert, Siam; Wongpakaran, Tinakon; Khunachiva, Jeeranan

2018-03-27

This study evaluated additional psychometric properties of the Thai version of the disabilities of the arm, shoulder and hand questionnaire (DASH-TH) which included, test-retest reliability, construct validity, internal consistency of in patients with carpal tunnel syndrome. As for determining construct validity, the Thai EuroQOL questionnaire (EQ-5D-5L) was also administered in order to examine convergent and divergent validity. Fifty patients completed both questionnaires. The DASH-TH showed excellent test-retest reliability (intraclass correlation coefficient = 0.811) and internal consistency (Cronbach's alpha = 0.911). The exploratory factor analysis yielded a six-factor solution while the confirmatory factor analysis denoted that the hypothesized model adequately fit the data with a comparative fit index of 0.967 and a Tucker-Lewis index of 0.964. The related subscales between the DASH-TH and the Thai EQ-5D-5L were significantly correlated, indicating the DASH-TH's convergent and discriminant validity. The DASH-TH demonstrated good reliability, internal consistency construct validity, and multidimensionality, in assessing the upper extremity function in carpal tunnel syndrome patients.
Meaningful Effect Sizes, Intraclass Correlations, and Proportions of Variance Explained by Covariates for Planning Two- and Three-Level Cluster Randomized Trials of Social and Behavioral Outcomes.

Science.gov (United States)

Dong, Nianbo; Reinke, Wendy M; Herman, Keith C; Bradshaw, Catherine P; Murray, Desiree W

2016-09-30

There is a need for greater guidance regarding design parameters and empirical benchmarks for social and behavioral outcomes to inform assumptions in the design and interpretation of cluster randomized trials (CRTs). We calculated the empirical reference values on critical research design parameters associated with statistical power for children's social and behavioral outcomes, including effect sizes, intraclass correlations (ICCs), and proportions of variance explained by a covariate at different levels (R 2 ). Children from kindergarten to Grade 5 in the samples from four large CRTs evaluating the effectiveness of two classroom- and two school-level preventive interventions. Teacher ratings of students' social and behavioral outcomes using the Teacher Observation of Classroom Adaptation-Checklist and the Social Competence Scale-Teacher. Two types of effect size benchmarks were calculated: (1) normative expectations for change and (2) policy-relevant demographic performance gaps. The ICCs and R 2 were calculated using two-level hierarchical linear modeling (HLM), where students are nested within schools, and three-level HLM, where students were nested within classrooms, and classrooms were nested within schools. Comprehensive tables of benchmarks and ICC values are provided to inform prevention researchers in interpreting the effect size of interventions and conduct power analyses for designing CRTs of children's social and behavioral outcomes. The discussion also provides a demonstration for how to use the parameter reference values provided in this article to calculate the sample size for two- and three-level CRTs designs. © The Author(s) 2016.
Test-retest reliability of a balance testing protocol with external perturbations in young healthy adults.

Science.gov (United States)

Robbins, Shawn M; Caplan, Ryan M; Aponte, Daniel I; St-Onge, Nancy

2017-10-01

External perturbations are utilized to challenge balance and mimic realistic balance threats in patient populations. The reliability of such protocols has not been established. The purpose was to examine test-retest reliability of balance testing with external perturbations. Healthy adults (n=34; mean age 23 years) underwent balance testing over two visits. Participants completed ten balance conditions in which the following parameters were combined: perturbation or non-perturbation, single or double leg, and eyes open or closed. Three trials were collected for each condition. Data were collected on a force plate and external perturbations were applied by translating the plate. Force plate center of pressure (CoP) data were summarized using 13 different CoP measures. Test-retest reliability was examined using intraclass correlation coefficients (ICC) and Bland-Altman plots. CoP measures of total speed and excursion in both anterior-posterior and medial-lateral directions generally had acceptable ICC values for perturbation conditions (ICC=0.46 to 0.87); however, many other CoP measures (e.g. range, area of ellipse) had unacceptable test-retest reliability (ICCbalance testing protocols that include external perturbations should be made to improve test-retest reliability and diminish learning including more extensive participant training and increasing the number of trials. CoP measures that consider all data points (e.g. total speed) are more reliable than those that only consider a few data points. Copyright © 2017 Elsevier B.V. All rights reserved.
Reliability and validity of the adapted Resistance Training Skills Battery for Children.

Science.gov (United States)

Furzer, Bonnie J; Bebich-Philip, Marc D; Wright, Kemi E; Reid, Siobhan L; Thornton, Ashleigh L

2017-12-29

Resistance training (RT) is emerging as a training modality to improve motor function and facilitate physical activity participation in children across the motor proficiency spectrum. Although RT competency assessments have been established and validated among adolescent cohorts, the extent to which these methods are suitable for assessing children's RT skills is unknown. This project aimed to assess the psychometric properties of the adapted Resistance Training Skills Battery for Children (RTSBc), in children with varying motor proficiency. Repeated measures design with 40 participants (M age=8.2±1.7years) displaying varying levels of motor proficiency. Participants performed the adapted RTSBc on two occasions, receiving a score for their execution of each component, in addition to an overall RT skill quotient child (RTSQc). Cronbach's alpha, intra-class correlation (ICC), Bland-Altman analysis, and typical error were used to assess test-retest reliability. To examine construct validity, exploratory factor analysis was performed alongside computing correlations between participants' muscle strength, motor proficiency, age, lean muscle mass, and RTSQc. The RTSBc displayed an acceptable level of internal consistency (alpha=0.86) and test-retest reliability (ICC range=0.86-0.99). Exploratory factor analysis supported internal test structure, with all six RT skills loading strongly on a single factor (range 0.56-0.89). Analyses of structural validity revealed positive correlations for RTSQc in relation to motor proficiency (r=0.52, preliability of the RTSBc, providing preliminary evidence that the RTSBc is appropriate for use in the assessment of children's RT competency. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Reliability of the Swedish version of the Exercise Self-Efficacy Scale (S-ESES): a test-retest study in adults with neurological disease.

Science.gov (United States)

Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth

2015-03-01

To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including: Parkinson's disease; Multiple Sclerosis; Cervical Dystonia; and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. Completion interval mean was 16 days apart. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and a very high-reliability ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low-absolute reliability, however, indicates a limited ability to measure changes on an individual level.
Test–Retest Reliability of Self-Reported Sexual Behavior History in Urbanized Nigerian Women

Directory of Open Access Journals (Sweden)

Eileen O. Dareng

2017-07-01

Full Text Available BackgroundStudies assessing risk of sexual behavior and disease are often plagued by questions about the reliability of self-reported sexual behavior. In this study, we evaluated the reliability of self-reported sexual history among urbanized women in a prospective study of cervical HPV infections in Nigeria.MethodsWe examined test–retest reliability of sexual practices using questionnaires administered at study entry and at follow-up visits. We used the root mean squared approach to calculate within-person coefficient of variation (CVw and calculated the intra-class correlation coefficient (ICC using two way, mixed effects models for continuous variables and (κ^ statistics for discrete variables. To evaluate the potential predictors of reliability, we used linear regression and log binomial regression models for the continuous and categorical variables, respectively.ResultsWe found that self-reported sexual history was generally reliable, with overall ICC ranging from 0.7 to 0.9; however, the reliability varied by nature of sexual behavior evaluated. Frequency reports of non-vaginal sex (agreement = 63.9%, 95% CI: 47.5–77.6% were more reliable than those of vaginal sex (agreement = 59.1%, 95% CI: 55.2–62.8%. Reports of time-invariant behaviors were also more reliable than frequency reports. The CVw for age at sexual debut was 10.7 (95% CI: 10.6–10.7 compared with the CVw for lifetime number of vaginal sex partners, which was 35.2 (95% CI: 35.1–35.3. The test–retest interval was an important predictor of reliability of responses, with longer intervals resulting in increased inconsistency (average change in unreliability for each 1 month increase = 0.04, 95% CI = 0.07–0.38, p = 0.005.ConclusionOur findings suggest that overall, the self-reported sexual history among urbanized Nigeran women is reliable.
The reliabilty of isokinetic strength measurement

OpenAIRE

Kadlec, Miroslav

2011-01-01

Title: Reliability of isometric and isokinetic strength testing in the knee flexion and extension Objectives: To compare the reliability of isometric and isokinetic testing of the knee strength in flexion and extension Methods: I used intraclass correlation coefficient and Pearson's correlation coefficient. Results: I have discovered that the reliability measured on isokinetic and isometric dynamometer is high. Furthermore the reliability of the maximum strength measurement was higher with-us...
Reliability and concurrent validity of the adapted Chinese version of Scoliosis Research Society-22 (SRS-22) questionnaire.

Science.gov (United States)

Cheung, Kenneth M C; Senkoylu, Alpaslan; Alanay, Ahmet; Genc, Yasemin; Lau, Sarah; Luk, Keith D

2007-05-01

Validation study to define validity and reliability of an adapted and translated questionnaire. Assessment of the concurrent validity and reliability of a Chinese version of SRS-22 outcome instrument. No valid health-related quality of life (HRQL) outcome instrument exists for patients with spinal deformity in Chinese. The modified SRS-22 questionnaire was proven to be an appropriate outcome instrument in English, and has already been translated and validated in several other languages. The English version of the SRS-22 questionnaire was adapted to Chinese according to the International Quality of Life Assessment Project guidelines. To assess reliability, 48 subjects with adolescent idiopathic scoliosis (mean age, 16.5 years) filled the questionnaire on 2 separate occasions (Group 1). To assess concurrent validity, 50 subjects (mean age, 21 years) filled in the same questionnaire and a previously validated Chinese version of the Short Form-36 (SF36) questionnaire (Group 2). Internal consistency, reproducibility and concurrent validity were determined with Cronbach's alpha coefficient, interclass correlation coefficient and Pearson correlation coefficient, respectively. Cronbach's alpha coefficient for the 4 major domains (function/activity, pain, self-image/appearance and mental health) were high. Intraclass correlation was also excellent for all domains. For concurrent validity, excellent correlation was found in 1 domain, good in 12 domains, moderate in 3 domains, and poor in 1 domain of the 17 relevant domains. Both cultural adaptation and linguistic translation are essential in any attempt to use a HRQL questionnaire across cultures. The Chinese version of the SRS-22 outcome instrument has satisfactory internal consistency and excellent reproducibility. It is ready for use in clinical studies on idiopathic scoliosis in Chinese-speaking societies.
A competency based selection procedure for Dutch postgraduate GP training: a pilot study on validity and reliability.

Science.gov (United States)

Vermeulen, Margit I; Tromp, Fred; Zuithoff, Nicolaas P A; Pieters, Ron H M; Damoiseaux, Roger A M J; Kuyvenhoven, Marijke M

2014-12-01

Abstract Background: Historically, semi-structured interviews (SSI) have been the core of the Dutch selection for postgraduate general practice (GP) training. This paper describes a pilot study on a newly designed competency-based selection procedure that assesses whether candidates have the competencies that are required to complete GP training. The objective was to explore reliability and validity aspects of the instruments developed. The new selection procedure comprising the National GP Knowledge Test (LHK), a situational judgement tests (SJT), a patterned behaviour descriptive interview (PBDI) and a simulated encounter (SIM) was piloted alongside the current procedure. Forty-seven candidates volunteered in both procedures. Admission decision was based on the results of the current procedure. Study participants did hardly differ from the other candidates. The mean scores of the candidates on the LHK and SJT were 21.9 % (SD 8.7) and 83.8% (SD 3.1), respectively. The mean self-reported competency scores (PBDI) were higher than the observed competencies (SIM): 3.7(SD 0.5) and 2.9(SD 0.6), respectively. Content-related competencies showed low correlations with one another when measured with different instruments, whereas more diverse competencies measured by a single instrument showed strong to moderate correlations. Moreover, a moderate correlation between LHK and SJT was found. The internal consistencies (intraclass correlation, ICC) of LHK and SJT were poor while the ICC of PBDI and SIM showed acceptable levels of reliability. Findings on content validity and reliability of these new instruments are promising to realize a competency based procedure. Further development of the instruments and research on predictive validity should be pursued.
Inter- and intrarater reliability of goniometry and hand held dynamometry for patients with subacromial impingement syndrome.

Science.gov (United States)

Fieseler, Georg; Laudner, Kevin G; Irlenbusch, Lars; Meyer, Henrike; Schulze, Stephan; Delank, Karl-Stefan; Hermassi, Souhail; Bartels, Thomas; Schwesig, René

2017-12-01

The purpose of this study was to examine the intra- and interrater reliability of measuring shoulder range of motion (ROM) and strength among patients diagnosed with subacromial impingement syndrome (SAIS). Twenty-five patients (14 female patients; mean age, 60.4± 7.84 years) diagnosed with SAIS were assessed to determine the intrarater reliability for glenohumeral ROM. Twenty-five patients (16 female patients; mean age, 60.4± 7.80 years) and 76 asymptomatic volunteers (52 female volunteers; mean age, 29.4± 14.1 years) were assessed for interrater reliability. Dependent variables were active shoulder ROM and isometric strength. Intrarater reliability was fair-to-excellent for the SAIS patients (intraclass correlation coefficient [ICC], 0.52-0.97; standard error of measurement [SEM], 4.4°-9.9° N; coefficient of variation [CV], 7.1%-44.9%). Based on the ICC, 11 of 12 parameters (92%) displayed an excellent reliability (ICC> 0.75). The interrater reliability showed fair-to-excellent results (SAIS patients: ICC, 0.13-0.98; SEM, 2.3°-8.8°; CV, 3.6%-37.0%; controls: ICC, 0.11-0.96; SEM, 3.0°-35.4°; CV, 5.6%-26.4%). In accordance with the intrarater reliability, glenohumeral adduction ROM was the only parameter with an ICC below 0.75 for both samples. Painful shoulder ROM in the SAIS patients showed no influence on the quality of reliability for measurement. Therefore, these protocols should be considered reliable assessment techniques in the prevention, diagnosis, and treatment of painful shoulder conditions such as SAIS.

Reliability and relationships among handgrip strength, leg extensor strength and power, and balance in older men.

Science.gov (United States)

Jenkins, Nathaniel D M; Buckner, Samuel L; Bergstrom, Haley C; Cochrane, Kristen C; Goldsmith, Jacob A; Housh, Terry J; Johnson, Glen O; Schmidt, Richard J; Cramer, Joel T

2014-10-01

To quantify the reliability of isometric leg extension torque (LEMVC), rate of torque development (LERTD), isometric handgrip force (HGMVC) and RFD (HGRFD), isokinetic leg extension torque and power at 1.05rad·s(-1) and 3.14rad·s(-1); and explore relationships among strength, power, and balance in older men. Sixteen older men completed 3 isometric handgrips, 3 isometric leg extensions, and 3 isokinetic leg extensions at 1.05rad·s(-1) and 3.14rad·s(-1) during two visits. Intraclass correlation coefficients (ICCs), ICC confidence intervals (95% CI), coefficients of variation (CVs), and Pearson correlation coefficients were calculated. LERTD demonstrated no reliability. The CVs for LERTD and HGRFD were ≤23.26%. HGMVC wasn't related to leg extension torque or power, or balance (r=0.14-0.47; p>0.05). However, moderate to strong relationships were found among isokinetic leg extension torque at 1.05rad·s(-1) and 3.14rad·s(-1), leg extension mean power at 1.05rad·s(-1), and functional reach (r=0.51-0.95; p≤0.05). LERTD and HGRFD weren't reliable and shouldn't be used as outcome variables in older men. Handgrip strength may not be an appropriate surrogate for lower body strength, power, or balance. Instead, perhaps handgrip strength should only be used to describe upper body strength or functionality, which may compliment isokinetic assessments of lower body strength, which were reliable and related to balance. Copyright © 2014 Elsevier Inc. All rights reserved.
Reliability of Using Motion Sensors to Measure Children’s Physical Activity Levels in Exergaming

Directory of Open Access Journals (Sweden)

Nan Zeng

2018-05-01

Full Text Available Objectives: This study examined the reliability of two objective measurement tools in assessing children’s physical activity (PA levels in an exergaming setting. Methods: A total of 377 children (190 girls, Mage = 8.39, SD = 1.55 attended the 30-min exergaming class every other day for 18 weeks. Children’s PA levels were concurrently measured by NL-1000 pedometer and ActiGraph GT3X accelerometer, while children’s steps per min and time engaged in sedentary, light, and moderate-to-vigorous PA were estimated, respectively. Results: The results of intraclass correlation coefficient (ICC indicated a low degree of reliability (single measures ICC = 0.03 in accelerometers. ANOVA did detect a possible learning effect for 27 classes (p < 0.01, and the single measures ICC was 0.20 for pedometers. Moreover, there was no significant positive relationship between steps per min and time spent in moderate-to-vigorous physical activity (MVPA. Finally, only 1.3% variance was explained by pedometer as a predictor using Hierarchical Linear Modeling to further explore the relationship between pedometer and accelerometer data. Conclusions: The NL-1000 pedometers and ActiGraph GT3X accelerometers have low reliability in assessing elementary school children’s PA levels during exergaming. More research is warranted in determining the reliable and accurate measurement information regarding the use of modern devices in exergaming setting.
The Structured Interview & Scoring Tool-Massachusetts Alzheimer's Disease Research Center (SIST-M): development, reliability, and cross-sectional validation of a brief structured clinical dementia rating interview.

Science.gov (United States)

Okereke, Olivia I; Copeland, Maura; Hyman, Bradley T; Wanggaard, Taylor; Albert, Marilyn S; Blacker, Deborah

2011-03-01

The Clinical Dementia Rating (CDR) and CDR Sum-of-Boxes can be used to grade mild but clinically important cognitive symptoms of Alzheimer disease. However, sensitive clinical interview formats are lengthy. To develop a brief instrument for obtaining CDR scores and to assess its reliability and cross-sectional validity. Using legacy data from expanded interviews conducted among 347 community-dwelling older adults in a longitudinal study, we identified 60 questions (from a possible 131) about cognitive functioning in daily life using clinical judgment, inter-item correlations, and principal components analysis. Items were selected in 1 cohort (n=147), and a computer algorithm for generating CDR scores was developed in this same cohort and re-run in a replication cohort (n=200) to evaluate how well the 60 items retained information from the original 131 items. Short interviews based on the 60 items were then administered to 50 consecutively recruited older individuals, with no symptoms or mild cognitive symptoms, at an Alzheimer's Disease Research Center. Clinical Dementia Rating scores based on short interviews were compared with those from independent long interviews. In the replication cohort, agreement between short and long CDR interviews ranged from κ=0.65 to 0.79, with κ=0.76 for Memory, κ=0.77 for global CDR, and intraclass correlation coefficient for CDR Sum-of-Boxes=0.89. In the cross-sectional validation, short interview scores were slightly lower than those from long interviews, but good agreement was observed for global CDR and Memory (κ≥0.70) as well as for CDR Sum-of-Boxes (intraclass correlation coefficient=0.73). The Structured Interview & Scoring Tool-Massachusetts Alzheimer's Disease Research Center is a brief, reliable, and sensitive instrument for obtaining CDR scores in persons with symptoms along the spectrum of mild cognitive change.
Reliability of joint count assessment in rheumatoid arthritis: a systematic literature review.

Science.gov (United States)

Cheung, Peter P; Gossec, Laure; Mak, Anselm; March, Lyn

2014-06-01

Joint counts are central to the assessment of rheumatoid arthritis (RA) but reliability is an issue. To evaluate the reliability and agreement of joint counts (intra-observer and inter-observer) by health care professionals (physicians, nurses, and metrologists) and patients in RA, and the impact of training and standardization on joint count reliability through a systematic literature review. Articles reporting joint count reliability or agreement in RA in PubMed, EMBase, and the Cochrane library between 1960 and 2012 were selected. Data were extracted regarding tender joint counts (TJCs) and swollen joint counts (SJCs) derived by physicians, metrologists, or patients for intra-observer and inter-observer reliability. In addition, methods and effects of training or standardization were extracted. Statistics expressing reliability such as intraclass correlation coefficients (ICCs) were extracted. Data analysis was primarily descriptive due to high heterogeneity. Twenty-eight studies on health care professionals (HCP) and 20 studies on patients were included. Intra-observer reliability for TJCs and SJCs was good for HCPs and patients (range of ICC: 0.49-0.98). Inter-observer reliability between HCPs for TJCs was higher than for SJCs (range of ICC: 0.64-0.88 vs. 0.29-0.98). Patient inter-observer reliability with HCPs as comparators was better for TJCs (range of ICC: 0.31-0.91) compared to SJCs (0.16-0.64). Nine studies (7 with HCPs and 2 with patients) evaluated consensus or training, with improvement in reliability of TJCs but conflicting evidence for SJCs. Intra- and inter-observer reliability was high for TJCs for HCPs and patients: among all groups, reliability was better for TJCs than SJCs. Inter-observer reliability of SJCs was poorer for patients than HCPs. Data were inconclusive regarding the potential for training to improve SJC reliability. Overall, the results support further evaluation for patient-reported joint counts as an outcome measure. © 2013
Reliability of cognitive tests of ELSA-Brasil, the brazilian longitudinal study of adult health

Science.gov (United States)

Batista, Juliana Alves; Giatti, Luana; Barreto, Sandhi Maria; Galery, Ana Roscoe Papini; Passos, Valéria Maria de Azeredo

2013-01-01

Cognitive function evaluation entails the use of neuropsychological tests, applied exclusively or in sequence. The results of these tests may be influenced by factors related to the environment, the interviewer or the interviewee. OBJECTIVES We examined the test-retest reliability of some tests of the Brazilian version from the Consortium to Establish a Registry for Alzheimer's disease. METHODS The ELSA-Brasil is a multicentre study of civil servants (35-74 years of age) from public institutions across six Brazilian States. The same tests were applied, in different order of appearance, by the same trained and certified interviewer, with an approximate 20-day interval, to 160 adults (51% men, mean age 52 years). The Intraclass Correlation Coefficient (ICC) was used to assess the reliability of the measures; and a dispersion graph was used to examine the patterns of agreement between them. RESULTS We observed higher retest scores in all tests as well as a shorter test completion time for the Trail Making Test B. ICC values for each test were as following: Word List Learning Test (0.56), Word Recall (0.50), Word Recognition (0.35), Phonemic Verbal Fluency Test (VFT, 0.61), Semantic VFT (0.53) and Trail B (0.91). The Bland-Altman plot showed better correlation of executive function (VFT and Trail B) than of memory tests. CONCLUSIONS Better performance in retest may reflect a learning effect, and suggest that retest should be repeated using alternate forms or after longer periods. In this sample of adults with high schooling level, reliability was only moderate for memory tests whereas the measurement of executive function proved more reliable. PMID:29213860
Reliability of Cognitive Tests of ELSA-Brasil, the Brazilian Longitudinal Study of Adult Health

Directory of Open Access Journals (Sweden)

Juliana Alves Batista

Full Text Available ABSTRACT Cognitive function evaluation entails the use of neuropsychological tests, applied exclusively or in sequence. The results of these tests may be influenced by factors related to the environment, the interviewer or the interviewee. Objectives: We examined the test-retest reliability of some tests of the Brazilian version from the Consortium to Establish a Registry for Alzheimer's disease. Methods: The ELSA-Brasil is a multicentre study of civil servants (35-74 years of age from public institutions across six Brazilian States. The same tests were applied, in different order of appearance, by the same trained and certified interviewer, with an approximate 20-day interval, to 160 adults (51% men, mean age 52 years. The Intraclass Correlation Coefficient (ICC was used to assess the reliability of the measures; and a dispersion graph was used to examine the patterns of agreement between them. Results: We observed higher retest scores in all tests as well as a shorter test completion time for the Trail Making Test B. ICC values for each test were as following: Word List Learning Test (0.56, Word Recall (0.50, Word Recognition (0.35, Phonemic Verbal Fluency Test (VFT, 0.61, Semantic VFT (0.53 and Trail B (0.91. The Bland-Altman plot showed better correlation of executive function (VFT and Trail B than of memory tests. Conclusions: Better performance in retest may reflect a learning effect, and suggest that retest should be repeated using alternate forms or after longer periods. In this sample of adults with high schooling level, reliability was only moderate for memory tests whereas the measurement of executive function proved more reliable.
Reliability and number of trials of Y Balance Test in adolescent athletes.

Science.gov (United States)

Linek, Pawel; Sikora, Damian; Wolny, Tomasz; Saulicz, Edward

2017-10-01

The Star Excursion Balance Test (SEBT) is commonly used to evaluate dynamic equilibrium. The Y Balance Test (Y-BT) is a shortened version of the SEBT where a Y- Balance Kit is commonly used. To date, research concerning the protocol and reliability of the SEBT and Y-BT has been conducted only for adults. The aim of the study was to assess the protocol (the necessary number of trials to stabilize the results) and reliability of the Y-BT in adolescent athletes. One-way repeated-measures analysis of variance (ANOVA) and reliability study. The sample of 38 athletes (mean age: 15.6 years) was selected from a football club. A Y-Balance test kit was applied for the evaluation of dynamic balance. The analysis used the values normalized to the relative length of the lower limbs. After six attempts, three consecutive ones achieved stability for all directions and both extremities (p > 0.05). The intraclass correlation coefficient (ICC 3,1 ), standard error of measurement and minimal detectable change values for the three attempts ranged from 0.57 to 0.82, from 3 to less than 6% and from 7.68 to 13.7%, respectively. In the study of adolescent dynamic equilibrium using the Y-BT, it is recommended to perform nine attempts (including six trial attempts and three measurements). In order to increase reliability it is recommended that the average of the three measured attempts is analysed. Copyright © 2017 Elsevier Ltd. All rights reserved.
Validation and reliability of a modified sphygmomanometer for the assessment of handgrip strength in Parkinson´s disease

Directory of Open Access Journals (Sweden)

Soraia M. Silva

2015-04-01

Full Text Available BACKGROUND: Handgrip strength is currently considered a predictor of overall muscle strength and functional capacity. Therefore, it is important to find reliable and affordable instruments for this analysis, such as the modified sphygmomanometer test (MST. OBJECTIVES: To assess the concurrent criterion validity of the MST, to compare the MST with the Jamar dynamometer, and to analyze the reproducibility (i.e. reliability and agreement of the MST in individuals with Parkinson's disease (PD. METHOD: The authors recruited 50 subjects, 24 with PD (65.5±6.2 years of age and 26 healthy elderly subjects (63.4±7.2 years of age. The handgrip strength was measured using the Jamar dynamometer and modified sphygmomanometer. The concurrent criterion validity was analyzed using Pearson's correlation coefficient and a simple linear regression test. The reproducibility of the MST was evaluated with the coefficient of intra-class correlation (ICC2,1, the standard error of measurement (SEM, the minimal detectable change (MDC, and the Bland-Altman plot. For all of the analyses, α≤0.05 was considered a risk. RESULTS: There was a significant correlation of moderate magnitude (r≥0.45 between the MST and the Jamar dynamometer. The MST had excellent reliability (ICC2,1≥0.7. The SEM and the MDC were adequate; however, the Bland-Altman plot indicated an unsatisfactory interrater agreement. CONCLUSIONS: The MST exhibited adequate validity and excellent reliability and is, therefore, suitable for monitoring the handgrip strength in PD. However, if the goal is to compare the measurements between examiners, the authors recommend that the data be interpreted with caution.
Assessment of Lower Limb Muscle Strength and Power Using Hand-Held and Fixed Dynamometry: A Reliability and Validity Study

Science.gov (United States)

Perraton, Luke G.; Bower, Kelly J.; Adair, Brooke; Pua, Yong-Hao; Williams, Gavin P.; McGaw, Rebekah

2015-01-01

Introduction Hand-held dynamometry (HHD) has never previously been used to examine isometric muscle power. Rate of force development (RFD) is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power. Methods 30 healthy young adults (age: 23±5yrs, male: 15) were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2) and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC), Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change. Results Comparison of RFD methods revealed that a peak 200ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70) for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70) for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31–0.79). Conclusions Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability
Intra and inter-rater reliability study of pelvic floor muscle dynamometric measurements

Directory of Open Access Journals (Sweden)

Natalia M. Martinho

2015-04-01

Full Text Available OBJECTIVE: The aim of this study was to evaluate the intra and inter-rater reliability of pelvic floor muscle (PFM dynamometric measurements for maximum and average strengths, as well as endurance. METHOD: A convenience sample of 18 nulliparous women, without any urogynecological complaints, aged between 19 and 31 (mean age of 25.4±3.9 participated in this study. They were evaluated using a pelvic floor dynamometer based on load cell technology. The dynamometric evaluations were repeated in three successive sessions: two on the same day with a rest period of 30 minutes between them, and the third on the following day. All participants were evaluated twice in each session; first by examiner 1 followed by examiner 2. The vaginal dynamometry data were analyzed using three parameters: maximum strength, average strength, and endurance. The Intraclass Correlation Coefficient (ICC was applied to estimate the PFM dynamometric measurement reliability, considering a good level as being above 0.75. RESULTS: The intra and inter-raters' analyses showed good reliability for maximum strength (ICCintra-rater1=0.96, ICCintra-rater2=0.95, and ICCinter-rater=0.96, average strength (ICCintra-rater1=0.96, ICCintra-rater2=0.94, and ICCinter-rater=0.97, and endurance (ICCintra-rater1=0.88, ICCintra-rater2=0.86, and ICCinter-rater=0.92 dynamometric measurements. CONCLUSIONS: The PFM dynamometric measurements showed good intra- and inter-rater reliability for maximum strength, average strength and endurance, which demonstrates that this is a reliable device that can be used in clinical practice.
Reliability of Examination Findings in Suspected Community-Acquired Pneumonia.

Science.gov (United States)

Florin, Todd A; Ambroggio, Lilliam; Brokamp, Cole; Rattan, Mantosh S; Crotty, Eric J; Kachelmeyer, Andrea; Ruddy, Richard M; Shah, Samir S

2017-09-01

The authors of national guidelines emphasize the use of history and examination findings to diagnose community-acquired pneumonia (CAP) in outpatient children. Little is known about the interrater reliability of the physical examination in children with suspected CAP. This was a prospective cohort study of children with suspected CAP presenting to a pediatric emergency department from July 2013 to May 2016. Children aged 3 months to 18 years with lower respiratory signs or symptoms who received a chest radiograph were included. We excluded children hospitalized ≤14 days before the study visit and those with a chronic medical condition or aspiration. Two clinicians performed independent examinations and completed identical forms reporting examination findings. Interrater reliability for each finding was reported by using Fleiss' kappa (κ) for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. No examination finding had substantial agreement (κ/ICC > 0.8). Two findings (retractions, wheezing) had moderate to substantial agreement (κ/ICC = 0.6-0.8). Nine findings (abdominal pain, pleuritic pain, nasal flaring, skin color, overall impression, cool extremities, tachypnea, respiratory rate, and crackles/rales) had fair to moderate agreement (κ/ICC = 0.4-0.6). Eight findings (capillary refill time, cough, rhonchi, head bobbing, behavior, grunting, general appearance, and decreased breath sounds) had poor to fair reliability (κ/ICC = 0-0.4). Only 3 examination findings had acceptable agreement, with the lower 95% confidence limit >0.4: wheezing, retractions, and respiratory rate. In this study, we found fair to moderate reliability of many findings used to diagnose CAP. Only 3 findings had acceptable levels of reliability. These findings must be considered in the clinical management and research of pediatric CAP. Copyright © 2017 by the American Academy of Pediatrics.
Low-Budget Instrumentation of a Conventional Leg Press to Measure Reliable Isometric-Strength Capacity.

Science.gov (United States)

Baur, Heiner; Groppa, Alessia Severina; Limacher, Regula; Radlinger, Lorenz

2016-02-02

Maximum strength and rate of force development (RFD) are 2 important strength characteristics for everyday tasks and athletic performance. Measurements of both parameters must be reliable. Expensive isokinetic devices with isometric modes are often used. The possibility of cost-effective measurements in a practical setting would facilitate quality control. The purpose of this study was to assess the reliability of measurements of maximum isometric strength (Fmax) and RFD on a conventional leg press. Sixteen subjects (23 ± 2 y, 1.68 ± 0.05 m, 59 ± 5 kg) were tested twice within 1 session. After warm-up, subjects performed 2 times 5 trials eliciting maximum voluntary isometric contractions on an instrumented leg press (1- and 2-legged randomized). Fmax (N) and RFD (N/s) were extracted from force-time curves. Reliability was determined for Fmax and RFD by calculating the intraclass correlation coefficient (ICC), the test-retest variability (TRV), and the bias and limits of agreement. Reliability measures revealed good to excellent ICCs of .80-.93. TRV showed mean differences between measurement sessions of 0.4-6.9%. The systematic error was low compared with the absolute mean values (Fmax 5-6%, RFD 1-4%). The implementation of a force transducer into a conventional leg press provides a viable procedure to assess Fmax and RFD. Both performance parameters can be assessed with good to excellent reliability allowing quality control of interventions.
Assessment of reliability, validity, responsiveness and minimally important change of the German Hip dysfunction and osteoarthritis outcome score (HOOS) in patients with osteoarthritis of the hip.

Science.gov (United States)

Arbab, Dariusch; van Ochten, Johannes H M; Schnurr, Christoph; Bouillon, Bertil; König, Dietmar

2017-12-01

Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures. The intention of this study was to evaluate reliability, validity, responsiveness and minimally important change of the German version of the Hip dysfunction and osteoarthritis outcome score (HOOS). The German HOOS was investigated in 251 consecutive patients before and 6 months after total hip arthroplasty. All patients completed HOOS, Oxford-Hip Score, Short-Form (SF-36) and numeric scales for pain and disability. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German HOOS demonstrated excellent test-retest reliability with intraclass correlation coefficient values > 0.7. Cronbach´s alpha values demonstrated strong internal consistency. As hypothesized, HOOS subscales strongly correlated with corresponding OHS and SF-36 domains. All subscales showed excellent (effect size/standardized response means > 0.8) responsiveness between preoperative assessment and postoperative follow-up. The HOOS and all subdomains showed higher changes than the minimal detectable change which indicates true changes. The German version of the HOOS demonstrated good psychometric properties. It proved to be valid, reliable and responsive to the changes instrument for use in patients with hip osteoarthritis undergoing total hip replacement.
Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

Science.gov (United States)

Mills, Tamara L; Holm, Margo B; Schmeler, Mark

2007-01-01

The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.
Reliability of the Identification of Functional Ankle Instability (IdFAI) Scale Across Different Age Groups in Adults.

Science.gov (United States)

Gurav, Reshma S; Ganu, Sneha S; Panhale, Vrushali P

2014-10-01

Functional ankle instability (FAI) is the tendency of the foot to 'give way'. Identification of Functional Ankle Instability questionnaire (IdFAI) is a newly developed questionnaire to detect whether individuals meet the minimum criteria necessary for inclusion in an FAI population. However, the reliability of the questionnaire was studied only in a restricted age group. The purpose of this investigation was to examine the reliability of IdFAI across different age groups in adults. One hundred and twenty participants in the age group of 20-60 years consisting of 30 individuals in each age group were asked to complete the IdFAI on two occasions. Test-retest reliability was evaluated by intraclass correlation coefficient (ICC2,1). The study revealed that IdFAI has excellent test-retest reliability when studied across different age groups. The ICC2,1 in the age groups 20-30 years, 30-40 years, 40-50 years and 50-60 years was 0.978, 0.975, 0.961 and 0.922, respectively with Cronbach's alpha >0.9 in all the age groups. The IdFAI can accurately predict if an individual meets the minimum criterion for FAI across different age groups in adults. Thus, the questionnaire can be applied over different age groups in clinical and research set-ups.
Reliability of a Computerized Neurocognitive Test in Baseline Concussion Testing of High School Athletes.

Science.gov (United States)

MacDonald, James; Duerson, Drew

2015-07-01

Baseline assessments using computerized neurocognitive tests are frequently used in the management of sport-related concussions. Such testing is often done on an annual basis in a community setting. Reliability is a fundamental test characteristic that should be established for such tests. Our study examined the test-retest reliability of a computerized neurocognitive test in high school athletes over 1 year. Repeated measures design. Two American high schools. High school athletes (N = 117) participating in American football or soccer during the 2011-2012 and 2012-2013 academic years. All study participants completed 2 baseline computerized neurocognitive tests taken 1 year apart at their respective schools. The test measures performance on 4 cognitive tasks: identification speed (Attention), detection speed (Processing Speed), one card learning accuracy (Learning), and one back speed (Working Memory). Reliability was assessed by measuring the intraclass correlation coefficient (ICC) between the repeated measures of the 4 cognitive tasks. Pearson and Spearman correlation coefficients were calculated as a secondary outcome measure. The measure for identification speed performed best (ICC = 0.672; 95% confidence interval, 0.559-0.760) and the measure for one card learning accuracy performed worst (ICC = 0.401; 95% confidence interval, 0.237-0.542). All tests had marginal or low reliability. In a population of high school athletes, computerized neurocognitive testing performed in a community setting demonstrated low to marginal test-retest reliability on baseline assessments 1 year apart. Further investigation should focus on (1) improving the reliability of individual tasks tested, (2) controlling for external factors that might affect test performance, and (3) identifying the ideal time interval to repeat baseline testing in high school athletes. Computerized neurocognitive tests are used frequently in high school athletes, often within a model of baseline testing
INTRA-RATER RELIABILITY OF THE MULTIPLE SINGLE-LEG HOP-STABILIZATION TEST AND RELATIONSHIPS WITH AGE, LEG DOMINANCE AND TRAINING.

Science.gov (United States)

Sawle, Leanne; Freeman, Jennifer; Marsden, Jonathan

2017-04-01

Balance is a complex construct, affected by multiple components such as strength and co-ordination. However, whilst assessing an athlete's dynamic balance is an important part of clinical examination, there is no gold standard measure. The multiple single-leg hop-stabilization test is a functional test which may offer a method of evaluating the dynamic attributes of balance, but it needs to show adequate intra-tester reliability. The purpose of this study was to assess the intra-rater reliability of a dynamic balance test, the multiple single-leg hop-stabilization test on the dominant and non-dominant legs. Intra-rater reliability study. Fifteen active participants were tested twice with a 10-minute break between tests. The outcome measure was the multiple single-leg hop-stabilization test score, based on a clinically assessed numerical scoring system. Results were analysed using an Intraclass Correlations Coefficient (ICC 2,1 ) and Bland-Altman plots. Regression analyses explored relationships between test scores, leg dominance, age and training (an alpha level of p = 0.05 was selected). ICCs for intra-rater reliability were 0.85 for the dominant and non-dominant legs (confidence intervals = 0.62-0.95 and 0.61-0.95 respectively). Bland-Altman plots showed scores within two standard deviations. A significant correlation was observed between the dominant and non-dominant leg on balance scores (R 2 =0.49, ptest demonstrated strong intra-tester reliability with active participants. Younger participants who trained more, have better balance scores. This test may be a useful measure for evaluating the dynamic attributes of balance. 3.
Reliability and Reproducibility of Advanced ECG Parameters in Month-to-Month and Year-to-Year Recordings in Healthy Subjects

Science.gov (United States)

Starc, Vito; Abughazaleh, Ahmed S.; Schlegel, Todd T.

2014-01-01

Advanced resting ECG parameters such the spatial mean QRS-T angle and the QT variability index (QTVI) have important diagnostic and prognostic utility, but their reliability and reproducibility (R&R) are not well characterized. We hypothesized that the spatial QRS-T angle would have relatively higher R&R than parameters such as QTVI that are more responsive to transient changes in the autonomic nervous system. The R&R of several conventional and advanced ECG para-meters were studied via intraclass correlation coefficients (ICCs) and coefficients of variation (CVs) in: (1) 15 supine healthy subjects from month-to-month; (2) 27 supine healthy subjects from year-to-year; and (3) 25 subjects after transition from the supine to the seated posture. As hypothesized, for the spatial mean QRS-T angle and many conventional ECG parameters, ICCs we-re higher, and CVs lower than QTVI, suggesting that the former parameters are more reliable and reproducible.
The Reliability of a Functional Agility Test for Water Polo

Directory of Open Access Journals (Sweden)

Tucher Guilherme

2014-07-01

Full Text Available Few functional agility tests for water polo take into consideration its specific characteristics. The preliminary objective of this study was to evaluate the reliability of an agility test for water polo players. Fifteen players (16.3 ± 1.8 years old with a minimum of two years of competitive experience were evaluated. A Functional Test for Agility Performance (FTAP was designed to represent the context of this sport. Several trials were performed to familiarize the athlete with the movement. Two experienced coaches measured three repetitions of the FTAP. Descriptive statistics, repeated measures analysis of variance (ANOVA, 95% limit of agreement (LOA, intraclass correlation coefficient (ICC and standard error of measurements (SEM were used for data analysis. It was considered that certain criteria of reliability measures were met. There was no significant difference between the repetitions, which may be explained by an effect of the evaluator, the ability of the players or fatigue (p > 0.05. The ICC average from evaluators was high (0.88. The SEM varied between 0.13 s and 0.49 s. The CV average considering each individual was near 6-7%. These values depended on the condition of measurement. As the FTAP contains some characteristics that create a degree of unpredictability, the same athlete may reach different performance results, increasing variability. An adjustment in the sample, familiarization and careful selection of subjects help to improve this situation and enhance the reliability of the indicators.
Reliability and Validity of the Beijing Version of the Montreal Cognitive Assessment in the Evaluation of Cognitive Function of Adult Patients with OSAHS.

Science.gov (United States)

Chen, Xiong; Zhang, Rui; Xiao, Ying; Dong, Jiaqi; Niu, Xun; Kong, Weijia

2015-01-01

The patients with obstructive sleep apnea hypopnea syndrome (OSAHS) tend to develop cognitive deficits, which usually go unrecognized, and can affect their daily life. The Beijing version of the Montreal cognitive assessment (MoCA-BJ), a Chinese version of MoCA, has been used for the assessment of cognitive functions of OSAHS patients in clinical practice. So far, its reliability and validity have not been tested. This study examined the reliability and validity of MoCA-BJ in a cohort of adult OSAHS patients. 152 OSAHS patients, ranging from mild, moderate to severe, 49 primary snoring subjects and 40 normal controls were evaluated for cognitive functions by employing both MoCA-BJ and the Mini Mental State Examination (MMSE). Forty of them were re-tested by MoCA-BJ 14 days after the first test. Internal consistency, test-retest reliability, discriminate and concurrent validity of MoCA-BJ were analyzed. Internal consistency reliability by Cronbach's alpha was adequate (0.73). Intra-class correlation coefficient (ICC), an measure of test-retest reliability, was 0.87 (Preliable and stable. The MoCA-BJ was capable of detecting cognitive dysfunction by visuospatial and total MoCA-BJ score.

Reliability and Repetition Effect of the Center of Pressure and Kinematics Parameters That Characterize Trunk Postural Control During Unstable Sitting Test.

Science.gov (United States)

Barbado, David; Moreside, Janice; Vera-Garcia, Francisco J

2017-03-01

Although unstable seat methodology has been used to assess trunk postural control, the reliability of the variables that characterize it remains unclear. To analyze reliability and learning effect of center of pressure (COP) and kinematic parameters that characterize trunk postural control performance in unstable seating. The relationships between kinematic and COP parameters also were explored. Test-retest reliability design. Biomechanics laboratory setting. Twenty-three healthy male subjects. Participants volunteered to perform 3 sessions at 1-week intervals, each consisting of five 70-second balancing trials. A force platform and a motion capture system were used to measure COP and pelvis, thorax, and spine displacements. Reliability was assessed through standard error of measurement (SEM) and intraclass correlation coefficients (ICC 2,1 ) using 3 methods: (1) comparing the last trial score of each day; (2) comparing the best trial score of each day; and (3) calculating the average of the three last trial scores of each day. Standard deviation and mean velocity were calculated to assess balance performance. Although analyses of variance showed some differences in balance performance between days, these differences were not significant between days 2 and 3. Best result and average methods showed the greatest reliability. Mean velocity of the COP showed high reliability (0.71 reliability (0.37 reliability using the average method (0.62 reliability than kinematics ones. Specifically, mean velocity of COP showed the highest test-retest reliability, especially for the average and best methods. Although correlations between COP and mean joint angular velocity were high, the few relationships between COP and kinematic standard deviation suggest different postural behavior can lead to a similar balance performance during an unstable sitting protocol. III. Copyright © 2017 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights
Training less-experienced faculty improves reliability of skills assessment in cardiac surgery.

Science.gov (United States)

Lou, Xiaoying; Lee, Richard; Feins, Richard H; Enter, Daniel; Hicks, George L; Verrier, Edward D; Fann, James I

2014-12-01

Previous work has demonstrated high inter-rater reliability in the objective assessment of simulated anastomoses among experienced educators. We evaluated the inter-rater reliability of less-experienced educators and the impact of focused training with a video-embedded coronary anastomosis assessment tool. Nine less-experienced cardiothoracic surgery faculty members from different institutions evaluated 2 videos of simulated coronary anastomoses (1 by a medical student and 1 by a resident) at the Thoracic Surgery Directors Association Boot Camp. They then underwent a 30-minute training session using an assessment tool with embedded videos to anchor rating scores for 10 components of coronary artery anastomosis. Afterward, they evaluated 2 videos of a different student and resident performing the task. Components were scored on a 1 to 5 Likert scale, yielding an average composite score. Inter-rater reliabilities of component and composite scores were assessed using intraclass correlation coefficients (ICCs) and overall pass/fail ratings with kappa. All components of the assessment tool exhibited improvement in reliability, with 4 (bite, needle holder use, needle angles, and hand mechanics) improving the most from poor (ICC range, 0.09-0.48) to strong (ICC range, 0.80-0.90) agreement. After training, inter-rater reliabilities for composite scores improved from moderate (ICC, 0.76) to strong (ICC, 0.90) agreement, and for overall pass/fail ratings, from poor (kappa = 0.20) to moderate (kappa = 0.78) agreement. Focused, video-based anchor training facilitates greater inter-rater reliability in the objective assessment of simulated coronary anastomoses. Among raters with less teaching experience, such training may be needed before objective evaluation of technical skills. Published by Elsevier Inc.
Validity and Reliability of Nintendo Wii Fit Balance Scores

Science.gov (United States)

Wikstrom, Erik A.

2012-01-01

Context: Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. Objective: To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Design: Descriptive laboratory study. Setting: Sports medicine research laboratory. Patients or Other Participants: Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Intervention(s): Participants completed a single-limb–stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Main Outcome Measure(s): Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. Results: All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with
Inter- and Intrarater Reliability Using Different Software Versions of E4D Compare in Dental Education.

Science.gov (United States)

Callan, Richard S; Cooper, Jeril R; Young, Nancy B; Mollica, Anthony G; Furness, Alan R; Looney, Stephen W

2015-06-01

The problems associated with intra- and interexaminer reliability when assessing preclinical performance continue to hinder dental educators' ability to provide accurate and meaningful feedback to students. Many studies have been conducted to evaluate the validity of utilizing various technologies to assist educators in achieving that goal. The purpose of this study was to compare two different versions of E4D Compare software to determine if either could be expected to deliver consistent and reliable comparative results, independent of the individual utilizing the technology. Five faculty members obtained E4D digital images of students' attempts (sample model) at ideal gold crown preparations for tooth #30 performed on typodont teeth. These images were compared to an ideal (master model) preparation utilizing two versions of E4D Compare software. The percent correlations between and within these faculty members were recorded and averaged. The intraclass correlation coefficient was used to measure both inter- and intrarater agreement among the examiners. The study found that using the older version of E4D Compare did not result in acceptable intra- or interrater agreement among the examiners. However, the newer version of E4D Compare, when combined with the Nevo scanner, resulted in a remarkable degree of agreement both between and within the examiners. These results suggest that consistent and reliable results can be expected when utilizing this technology under the protocol described in this study.
Reliability and Validity of the Early Years Physical Activity Questionnaire (EY-PAQ

Directory of Open Access Journals (Sweden)

Daniel D. Bingham

2016-05-01

Full Text Available Measuring physical activity (PA and sedentary time (ST in young children (<5 years is complex. Objective measures have high validity but require specialist expertise, are expensive, and can be burdensome for participants. A proxy-report instrument for young children that accurately measures PA and ST is needed. The aim of this study was to assess the reliability and validity of the Early Years Physical Activity Questionnaire (EY-PAQ. In a setting where English and Urdu are the predominant languages spoken by parents of young children, a sample of 196 parents and their young children (mean age 3.2 ± 0.8 years from Bradford, UK took part in the study. A total of 156 (79.6% questionnaires were completed in English and 40 (20.4% were completed in transliterated Urdu. A total of 109 parents took part in the reliability aspect of the study, which involved completion of the EY-PAQ on two occasions (7.2 days apart; standard deviation (SD = 1.1. All 196 participants took part in the validity aspect which involved comparison of EY-PAQ scores against accelerometry. Validty anaylsis used all data and data falling with specific MVPA and ST boundaries. Reliability was assessed using intra-class correlations (ICC and validity by Bland–Altman plots and rank correlation coefficients. The test re-test reliability of the EY-PAQ was moderate for ST (ICC = 0.47 and fair for moderate-to-vigorous physical activity (MVPA(ICC = 0.35. The EY-PAQ had poor agreement with accelerometer-determined ST (mean difference = −87.5 min·day−1 and good agreement for MVPA (mean difference = 7.1 min·day−1 limits of agreement were wide for all variables. The rank correlation coefficient was non-significant for ST (rho = 0.19 and significant for MVPA (rho = 0.30. The EY-PAQ has comparable validity and reliability to other PA self-report tools and is a promising population-based measure of young children’s habitual MVPA but not ST. In situations when objective methods are not
Interrater and Intrarater Reliability of the Tuck Jump Assessment by Health Professionals of Varied Educational Backgrounds

Directory of Open Access Journals (Sweden)

Lisa A. Dudley

2013-01-01

Full Text Available Objective. The Tuck Jump Assessment (TJA, a clinical plyometric assessment, identifies 10 jumping and landing technique flaws. The study objective was to investigate TJA interrater and intrarater reliability with raters of different educational and clinical backgrounds. Methods. 40 participants were video recorded performing the TJA using published protocol and instructions. Five raters of varied educational and clinical backgrounds scored the TJA. Each score of the 10 technique flaws was summed for the total TJA score. Approximately one month later, 3 raters scored the videos again. Intraclass correlation coefficients determined interrater (5 and 3 raters for first and second session, resp. and intrarater (3 raters reliability. Results. Interrater reliability with 5 raters was poor (ICC = 0.47; 95% confidence intervals (CI 0.33–0.62. Interrater reliability between 3 raters who completed 2 scoring sessions improved from 0.52 (95% CI 0.35–0.68 for session one to 0.69 (95% CI 0.55–0.81 for session two. Intrarater reliability was poor to moderate, ranging from 0.44 (95% CI 0.22–0.68 to 0.72 (95% CI 0.55–0.84. Conclusion. Published protocol and training of raters were insufficient to allow consistent TJA scoring. There may be a learned effect with the TJA since interrater reliability improved with repetition. TJA instructions and training should be modified and enhanced before clinical implementation.
Validity and reliability of International Physical Activity Questionnaire-Short Form in Chinese youth.

Science.gov (United States)

Wang, Chao; Chen, Peijie; Zhuang, Jie

2013-12-01

The psychometric profiles of the widely used International Physical Activity Questionnaire-Short Form (IPAQ-SF) in Chinese youth have not been reported. The purpose of this study was to examine the validity and reliability of the IPAQ-SF using a sample of Chinese youth. One thousand and twenty-one youth (M(age) = 14.26 +/- 1.63 years, 52.8% boys) from 11 cities in China wore accelerometers for 7 consecutive days and completed the IPAQ-SF on the 8th day to recall their physical activity (PA) during accelerometer-wearing days. A subsample of 92 youth (M(age) = 15.90 +/- 1.35 years, 46.7% boys) completed the IPAQ-SF again a week later to recall their PA during accelerometer-wearing days. Differences in PA estimated by the IPAQ-SF and accelerometer were examined by paired-sample t test. Spearman correlation coefficients were used to examine the correlation between the IPAQ-SF and accelerometer. Test-retest reliability of the IPAQ-SF was determined by the intraclass correlation coefficient (ICC). Compared with accelerometer, the IPAQ-SF overestimated sedentary time, moderate PA (MPA), vigorous PA (VPA), and moderate-to-vigorous PA (MVPA). Correlations between PA (total PA, MPA, VPA, and MVPA) and sedentary time measured by 2 instruments ranged from "none" to "low" (p = .08-.31). Test-retest ICC of the IPAQ-SF ranged from "moderate" to "high" (ICC = .43-.83), except for sitting in boys (ICC = .06), sitting for the whole sample (ICC = .32), and VPA in girls (ICC = .35). The IPAQ-SF was not a valid instrument for measuring PA and sedentary behavior in Chinese youth.
Low Carbon-Oriented Optimal Reliability Design with Interval Product Failure Analysis and Grey Correlation Analysis

Directory of Open Access Journals (Sweden)

Yixiong Feng

2017-03-01

Full Text Available The problem of large amounts of carbon emissions causes wide concern across the world, and it has become a serious threat to the sustainable development of the manufacturing industry. The intensive research into technologies and methodologies for green product design has significant theoretical meaning and practical value in reducing the emissions of the manufacturing industry. Therefore, a low carbon-oriented product reliability optimal design model is proposed in this paper: (1 The related expert evaluation information was prepared in interval numbers; (2 An improved product failure analysis considering the uncertain carbon emissions of the subsystem was performed to obtain the subsystem weight taking the carbon emissions into consideration. The interval grey correlation analysis was conducted to obtain the subsystem weight taking the uncertain correlations inside the product into consideration. Using the above two kinds of subsystem weights and different caution indicators of the decision maker, a series of product reliability design schemes is available; (3 The interval-valued intuitionistic fuzzy sets (IVIFSs were employed to select the optimal reliability and optimal design scheme based on three attributes, namely, low carbon, correlation and functions, and economic cost. The case study of a vertical CNC lathe proves the superiority and rationality of the proposed method.
[Reliability and Validity of the Behavioral Check List for Preschool Children to Measure Attention Deficit Hyperactivity Behaviors].

Science.gov (United States)

Tsuno, Kanami; Yoshimasu, Kouichi; Hayashi, Takashi; Tatsuta, Nozomi; Ito, Yuki; Kamijima, Michihiro; Nakai, Kunihiko

2018-01-01

Nowadays, attention deficit hyperactivity (ADH) problems are observed commonly among school-age children. However, questionnaires specific to ADH behaviors among preschool children are very few. The aim of this study was to investigate the reliability and validity of the 25-item Behavioral Check List (BCL), which was developed from interviews of parents with children who were diagnosed as having Attention-deficit/hyperactivity disorder (ADHD) and measures ADH behaviors in preschool age. We recruited 22 teachers from 10 nurseries/kindergartens in Miyagi Prefecture, Japan. A total of 138 preschool children were assessed using the BCL. To investigate inter-rater reliability, two teachers from each facility assess seven to twenty children in their class, and intraclass correlation coefficients (ICCs) were calculated. The teachers additionally answered questions in the 1/5-5 Caregiver-Teacher Report Form (C-TRF) to investigate the criterion validity of the BCL. To investigate structural validity, exploratory factor analysis with promax rotation and confirmatory factor analysis were performed. The internal consistency reliability of the BCL was good (α = 0.92) and correlation analyses also confirmed its excellent criterion validity. Although exploratory factor analysis for the BCL yielded a five-factor model that consisted of a factor structure different from that of the original one, the results were similar to the original six factors. The ICCs of the BCL were 0.38-0.99 and it was not high enough for inter-rater reliability in some facilities. However, there is a possibility to improve it by giving raters adequate explanations when using BCL. The present study showed acceptable levels of reliability and validity of the BCL among Japanese preschool children.
Reliability of capturing foot parameters using digital scanning and the neutral suspension casting technique

Science.gov (United States)

2011-01-01

Background A clinical study was conducted to determine the intra and inter-rater reliability of digital scanning and the neutral suspension casting technique to measure six foot parameters. The neutral suspension casting technique is a commonly utilised method for obtaining a negative impression of the foot prior to orthotic fabrication. Digital scanning offers an alternative to the traditional plaster of Paris techniques. Methods Twenty one healthy participants volunteered to take part in the study. Six casts and six digital scans were obtained from each participant by two raters of differing clinical experience. The foot parameters chosen for investigation were cast length (mm), forefoot width (mm), rearfoot width (mm), medial arch height (mm), lateral arch height (mm) and forefoot to rearfoot alignment (degrees). Intraclass correlation coefficients (ICC) with 95% confidence intervals (CI) were calculated to determine the intra and inter-rater reliability. Measurement error was assessed through the calculation of the standard error of the measurement (SEM) and smallest real difference (SRD). Results ICC values for all foot parameters using digital scanning ranged between 0.81-0.99 for both intra and inter-rater reliability. For neutral suspension casting technique inter-rater reliability values ranged from 0.57-0.99 and intra-rater reliability values ranging from 0.36-0.99 for rater 1 and 0.49-0.99 for rater 2. Conclusions The findings of this study indicate that digital scanning is a reliable technique, irrespective of clinical experience, with reduced measurement variability in all foot parameters investigated when compared to neutral suspension casting. PMID:21375757
Reliability of widefield nailfold capillaroscopy and video capillaroscopy in the assessment of patients with Raynaud’s phenomenon.

Science.gov (United States)

Sekiyama, Juliana Y; Camargo, Cintia Z; Eduardo, Luís; Andrade, C; Kayser, Cristiane

2013-11-01

To analyze the diagnostic performance and reliability of different parameters evaluated by widefield nailfold capillaroscopy (NFC) with those obtained by video capillaroscopy in patients with Raynaud’s phenomenon (RP). Two hundred fifty-two individuals were assessed, including 101 systemic sclerosis (SSc; scleroderma) patients,61 patients with undifferentiated connective tissue disease, 37 patients with primary RP, and 53 controls. Widefield NFC was performed using a stereomicroscope under 10–25 x magnification and direct measurement of all parameters. Video capillaroscopy was performed under 200 x magnification, with the acquirement of 32 images per individual (4 fields per finger in 8 fingers). The following parameters were analyzed in 8 fingers of the hands (excluding thumbs) by both methods: number of capillaries/mm, number of enlarged and giant capillaries, microhemorrhages, and avascular score.Intra- and interobserver reliability was evaluated by performing both examinations in 20 individuals on 2 different days and by 2 long-term experienced observers. There was a significant correlation (P capillaroscopy in the comparison of all parameters. Kappa values and intraclass correlation coefficient analysis showed excellent intra- and interobserver reproducibility for all parameters evaluated by widefield NFC and video capillaroscopy. Bland-Altman analysis showed high agreement of all parameters evaluated in both methods. According to receiver operating characteristic curve analysis, both methods showed a similar performance in discriminating SSc patients from controls. Widefield NFC and video capillaroscopy are reliable and accurate methods and can be used equally for assessing peripheral microangiopathy in RP and SSc patients. Nonetheless, the high reliability obtained may not be similar for less experienced examiners.
Test-retest reliability and predictors of unreliable reporting for a sexual behavior questionnaire for U.S. men.

Science.gov (United States)

Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R

2010-12-01

Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.
Validity and Reliability of Gait and Postural Control Analysis Using the Tri-axial Accelerometer of the iPod Touch.

Science.gov (United States)

Kosse, Nienke M; Caljouw, Simone; Vervoort, Danique; Vuillerme, Nicolas; Lamoth, Claudine J C

2015-08-01

Accelerometer-based assessments can identify elderly with an increased fall risk and monitor interventions. Smart devices, like the iPod Touch, with built-in accelerometers are promising for clinical gait and posture assessments due to easy use and cost-effectiveness. The aim of the present study was to establish the validity and reliability of the iPod Touch for gait and posture assessment. Sixty healthy participants (aged 18-75 years) were measured with an iPod Touch and stand-alone accelerometer while they walked under single- and dual-task conditions, and while standing in parallel and semi-tandem stance with eyes open, eyes closed and when performing a dual task. Cross-correlation values (CCV) showed high correspondence of anterior-posterior and medio-lateral signal patterns (CCV's ≥ 0.88). Validity of gait parameters (foot contacts, index of harmonicity, and amplitude variability) and standing posture parameters [root mean square of accelerations, median power frequency (MPF) and sway area] as indicated by intra-class correlation (ICC) was high (ICC = 0.85-0.99) and test-retest reliability was good (ICC = 0.81-0.97), except for MPF (ICC = 0.59-0.87). Overall, the iPod Touch obtained valid and reliable measures of gait and postural control in healthy adults of all ages under different conditions. Additionally, smart devices have the potential to be used for clinical gait and posture assessments.
Cross-Cultural Adaptation of the Profile Fitness Mapping Neck Questionnaire to Brazilian Portuguese: Internal Consistency, Reliability, and Construct and Structural Validity.

Science.gov (United States)

Ferreira, Mariana Cândido; Björklund, Martin; Dach, Fabiola; Chaves, Thais Cristina

The purpose of this study was to adapt and evaluate the psychometric properties of the ProFitMap-neck to Brazilian Portuguese. The cross-cultural adaptation consisted of 5 stages, and 180 female patients with chronic neck pain participated in the study. A subsample (n = 30) answered the pretest, and another subsample (n = 100) answered the questionnaire a second time. Internal consistency, test-retest reliability, and construct validity (hypothesis testing and structural validity) were estimated. For construct validity, the scores of the questionnaire were correlated with the Neck Disability Index (NDI), and the Hospital Anxiety and Depression Scale (HADS), the Tampa Scale of Kinesiophobia (TSK), and the 36-item Short-Form Health Survey (SF-36). Internal consistency was determined by adequate Cronbach's α values (α > 0.70). Strong reliability was identified by high intraclass correlation coefficients (ICC > 0.75). Construct validity was identified by moderate and strong correlations of the Br-ProFitMap-neck with total NDI score (-0.56 50%, Kaiser-Meyer-Olkin index > 0.50, eigenvalue > 1, and factor loadings > 0.2. Br-ProFitMap-neck had adequate psychometric properties and can be used in clinical settings, as well as research, in patients with chronic neck pain. Copyright © 2017. Published by Elsevier Inc.
Reliability of maximal isometric knee strength testing with modified hand-held dynamometry in patients awaiting total knee arthroplasty: useful in research and individual patient settings? A reliability study

Directory of Open Access Journals (Sweden)

Koblbauer Ian FH

2011-10-01

Full Text Available Abstract Background Patients undergoing total knee arthroplasty (TKA often experience strength deficits both pre- and post-operatively. As these deficits may have a direct impact on functional recovery, strength assessment should be performed in this patient population. For these assessments, reliable measurements should be used. This study aimed to determine the inter- and intrarater reliability of hand-held dynamometry (HHD in measuring isometric knee strength in patients awaiting TKA. Methods To determine interrater reliability, 32 patients (81.3% female were assessed by two examiners. Patients were assessed consecutively by both examiners on the same individual test dates. To determine intrarater reliability, a subgroup (n = 13 was again assessed by the examiners within four weeks of the initial testing procedure. Maximal isometric knee flexor and extensor strength were tested using a modified Citec hand-held dynamometer. Both the affected and unaffected knee were tested. Reliability was assessed using the Intraclass Correlation Coefficient (ICC. In addition, the Standard Error of Measurement (SEM and the Smallest Detectable Difference (SDD were used to determine reliability. Results In both the affected and unaffected knee, the inter- and intrarater reliability were good for knee flexors (ICC range 0.76-0.94 and excellent for knee extensors (ICC range 0.92-0.97. However, measurement error was high, displaying SDD ranges between 21.7% and 36.2% for interrater reliability and between 19.0% and 57.5% for intrarater reliability. Overall, measurement error was higher for the knee flexors than for the knee extensors. Conclusions Modified HHD appears to be a reliable strength measure, producing good to excellent ICC values for both inter- and intrarater reliability in a group of TKA patients. High SEM and SDD values, however, indicate high measurement error for individual measures. This study demonstrates that a modified HHD is appropriate to
The reliability and sensitivity of the National Institutes of Health Stroke Scale for spontaneous intracerebral hemorrhage in an uncontrolled setting.

Directory of Open Access Journals (Sweden)

Adrian V Specogna

Full Text Available BACKGROUND AND PURPOSE: The National Institutes of Health Stroke Scale (NIHSS is commonly used to measure neurologic function and guide treatment after spontaneous intracerebral hemorrhage (ICH in routine stroke clinics. We evaluated its reliability and sensitivity to detect change with consecutive and unique rater combinations in a real-world setting. METHODS: Conservative measures of interrater reliability (unweighted Kappa (κ, Intraclass Correlation Coefficient (ICC1,1 and sensitivity to detect change (Minimal Detectable Difference (MDD were estimated. Sixty-one repeated ratings were completed within 1 week after ICH by physicians and nurses with no investigator intervention. RESULTS: Reliability (consistency of the NIHSS total score was good for both physicians vs. nurses and nurses vs. nurses (ICC=0.78, 95%CI: 0.58-0.89 and ICC=0.75, 95%CI: 0.55-0.87 respectively in this scenario. Reliability (agreement of items 1C and 9 were excellent (κ>=0.61 for both rater comparisons, however, reliability was poor to fair on most remaining items (κ:0.01-0.60, with item 11 being completely unreliable in this scenario (κ=10 points need to be observed for clinicians to be confident that real changes had occurred within 1 week after ICH.
Reliability and reproducibility of disc-foveal angle measurements by non-mydriatic fundus photography.

Science.gov (United States)

Le Jeune, Caroline; Chebli, Fayçal; Leon, Lorette; Anthoine, Emmanuelle; Weber, Michel; Péchereau, Alain; Lebranchu, Pierre

2018-01-01

Abnormal torsion could be associated with cyclovertical strabismus, but torsion measurements are not reliable in children. To assess an objective fundus torsion evaluation in a paediatric population, we used Non-Mydriatic Fundus photography (NMFP) in healthy and cyclovertical strabismus patients to evaluate the disc-foveal angle over time and observers. We used a retrospective set of NMFP including 24 A or V-pattern strabismus and 27 age-matched normal children (mean age 6.4 and 6.7 years respectively), taken during 2 distinct follow-up consultations (separated by 251 and 479 days respectively). Each disc-foveal angle measurement (from which the ocular torsion can be assessed) was performed by 5 different observers, using graphical software and based on reproducible fundus anatomical marks. Statistical analysis was performed with a multivariate ANOVA using group, time and observers as factors, in addition to intraclass coefficient correlation (ICC) to assess measurement reproducibility. A significant difference of disc-foveal angle measures was observed between groups (p0,97). Abnormal amount of objective torsion could be associated with alphabet-pattern strabismus. Disc-foveal angle evaluation by NMFP in a children population appears as a non-invasive, reliable and reproducible method.
Reliability of the Star Excursion Balance Test and Two New Similar Protocols to Measure Trunk Postural Control.

Science.gov (United States)

López-Plaza, Diego; Juan-Recio, Casto; Barbado, David; Ruiz-Pérez, Iñaki; Vera-Garcia, Francisco J

2018-05-18

Although the Star Excursion Balance test (SEBT) has shown a good intrasession reliability, the intersession reliability of this test has not been deeply studied. Furthermore, there is an evident high influence of the lower limbs in the performance of the SEBT, so even if it has been used to measure core stability, it is possibly not the most suitable measurement. The aims of this study were to (1) to assess the absolute and relative between-session reliability of the SEBT and 2 novel variations of this test to assess trunk postural control while sitting, ie, the Star Excursion Sitting Test (SEST) and the Star Excursion Timing Test (SETT); and (2) to analyze the relationships between these 3 test scores. Correlational and reliability test-retest study. Controlled laboratory environment. Twenty-seven physically active men (age: 24.54 ± 3.05 years). Relative and absolute reliability of the SEBT, SEST, and SETT were calculated through the intraclass correlation coefficient (ICC) and standard error of measurement (SEM), respectively. A Pearson correlation analysis was carried out between the variables of the 3 tests. Maximum normalized reach distances were assessed for different SEBT and SEST directions. In addition, composite indexes were calculated for SEBT, SEST, and SETT. The SEBT (dominant leg: ICC = 0.87 [0.73-0.94], SEM = 2.12 [1.66-2.93]; nondominant leg: ICC = 0.74 [0.50-0.87], SEM = 3.23 [2.54-4.45]), SEST (ICC = 0.85 [0.68-0.92], SEM = 1.27 [1.03-1.80]), and SETT (ICC = 0.61 [0.30-0.80], SEM = 2.31 [1.82-3.17]) composite indexes showed moderate-to-high 1-month reliability. A learning effect was detected for some SEBT and SEST directions and for SEST and SETT composite indexes. No significant correlations were found between SEBT and its 2 variations (r ≤ .366; P > .05). A significant correlation was found between the SEST and SETT composite indexes (r = .520; P > .01). SEBT, SEST, and SETT are reliable field protocols to measure postural control. However
Cross-cultural adaptation, reliability and construct validity of the Tampa scale for kinesiophobia for temporomandibular disorders (TSK/TMD-Br) into Brazilian Portuguese.

Science.gov (United States)

Aguiar, A S; Bataglion, C; Visscher, C M; Bevilaqua Grossi, D; Chaves, T C

2017-07-01

Fear of movement (kinesiophobia) seems to play an important role in the development of chronic pain. However, for temporomandibular disorders (TMD), there is a scarcity of studies about this topic. The Tampa Scale for Kinesiophobia for TMD (TSK/TMD) is the most widely used instrument to measure fear of movement and it is not available in Brazilian Portuguese. The purpose of this study was to culturally adapt the TSK/TMD to Brazilian Portuguese and to assess its psychometric properties regarding internal consistency, reliability, and construct and structural validity. A total of 100 female patients with chronic TMD participated in the validation process of the TSK/TMD-Br. The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Spearman's rank correlation for construct validity and confirmatory factor analysis (CFA) for structural validity. CFA endorsed the pre-specified model with two domains and 12-items (Activity Avoidance - AA/Somatic Focus - SF) and all items obtained a loading factor greater than 0·4. Acceptable levels of reliability were found (ICC > 0·75) for all questions and domains of the TSK/TMD-Br. For internal consistency, Cronbach's α of 0·78 for both domains were found. Moderate correlations (0·40 Br scores versus catastrophising, depression and jaw functional limitation. TSK/TMD-Br 12 items and two-factor demonstrated sound psychometric properties (transcultural validity, reliability, internal consistency and structural validity). In such a way, the instrument can be used in clinical settings and for research purposes. © 2017 John Wiley & Sons Ltd.
Reliability and Validity of the Footprint Assessment Method Using Photoshop CS5 Software.

Science.gov (United States)

Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam

2015-05-01

Several sophisticated methods of footprint analysis currently exist. However, it is sometimes useful to apply standard measurement methods of recognized evidence with an easy and quick application. We sought to assess the reliability and validity of a new method of footprint assessment in a healthy population using Photoshop CS5 software (Adobe Systems Inc, San Jose, California). Forty-two footprints, corresponding to 21 healthy individuals (11 men with a mean ± SD age of 20.45 ± 2.16 years and 10 women with a mean ± SD age of 20.00 ± 1.70 years) were analyzed. Footprints were recorded in static bipedal standing position using optical podography and digital photography. Three trials for each participant were performed. The Hernández-Corvo, Chippaux-Smirak, and Staheli indices and the Clarke angle were calculated by manual method and by computerized method using Photoshop CS5 software. Test-retest was used to determine reliability. Validity was obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed high values (ICC, 0.98-0.99). Moreover, the validity test clearly showed no difference between techniques (ICC, 0.99-1). The reliability and validity of a method to measure, assess, and record the podometric indices using Photoshop CS5 software has been demonstrated. This provides a quick and accurate tool useful for the digital recording of morphostatic foot study parameters and their control.

Quality Evaluation Scores are no more Reliable than Gestalt in Evaluating the Quality of Emergency Medicine Blogs: A METRIQ Study.

Science.gov (United States)

Thoma, Brent; Sebok-Syer, Stefanie S; Colmers-Gray, Isabelle; Sherbino, Jonathan; Ankel, Felix; Trueger, N Seth; Grock, Andrew; Siemens, Marshall; Paddock, Michael; Purdy, Eve; Kenneth Milne, William; Chan, Teresa M

2018-01-30

Construct: We investigated the quality of emergency medicine (EM) blogs as educational resources. Online medical education resources such as blogs are increasingly used by EM trainees and clinicians. However, quality evaluations of these resources using gestalt are unreliable. We investigated the reliability of two previously derived quality evaluation instruments for blogs. Sixty English-language EM websites that published clinically oriented blog posts between January 1 and February 24, 2016, were identified. A random number generator selected 10 websites, and the 2 most recent clinically oriented blog posts from each site were evaluated using gestalt, the Academic Life in Emergency Medicine (ALiEM) Approved Instructional Resources (AIR) score, and the Medical Education Translational Resources: Impact and Quality (METRIQ-8) score, by a sample of medical students, EM residents, and EM attendings. Each rater evaluated all 20 blog posts with gestalt and 15 of the 20 blog posts with the ALiEM AIR and METRIQ-8 scores. Pearson's correlations were calculated between the average scores for each metric. Single-measure intraclass correlation coefficients (ICCs) evaluated the reliability of each instrument. Our study included 121 medical students, 88 EM residents, and 100 EM attendings who completed ratings. The average gestalt rating of each blog post correlated strongly with the average scores for ALiEM AIR (r = .94) and METRIQ-8 (r = .91). Single-measure ICCs were fair for gestalt (0.37, IQR 0.25-0.56), ALiEM AIR (0.41, IQR 0.29-0.60) and METRIQ-8 (0.40, IQR 0.28-0.59). The average scores of each blog post correlated strongly with gestalt ratings. However, neither ALiEM AIR nor METRIQ-8 showed higher reliability than gestalt. Improved reliability may be possible through rater training and instrument refinement.
Reliability and validity of the multimedia activity recall in children and adults (MARCA) in people with chronic obstructive pulmonary disease.

Science.gov (United States)

Hunt, Toby; Williams, Marie T; Olds, Tim S

2013-01-01

To determine the reliability and validity of the Multimedia Activity Recall for Children and Adults (MARCA) in people with chronic obstructive pulmonary disease (COPD). People with COPD and their carers completed the Multimedia Activity Recall for Children and Adults (MARCA) for four, 24-hour periods (including test-retest of 2 days) while wearing a triaxial accelerometer (Actigraph GT3X+®), a multi-sensor armband (Sensewear Pro3®) and a pedometer (New Lifestyles 1000®). Self reported activity recalls (MARCA) and objective activity monitoring (Accelerometry) were recorded under free-living conditions. 24 couples were included in the analysis (COPD; age 74.4 ± 7.9 yrs, FEV1 54 ± 13% Carer; age 69.6 ± 10.9 yrs, FEV1 99 ± 24%). Not applicable. Test-retest reliability was compared for MARCA activity domains and different energy expenditure zones. Validity was assessed between MARCA-derived physical activity level (in metabolic equivalent of task (MET) per minute), duration of moderate to vigorous physical activity (min) and related data from the objective measurement devices. Analysis included intra-class correlation coefficients (ICC), Bland-Altman analyses, paired t-tests (p) and Spearman's rank correlation coefficients (rs). Reliability between occasions of recall for all activity domains was uniformly high, with test-retest correlations consistently >0.9. Validity correlations were moderate to strong (rs = 0.43-0.80) across all comparisons. The MARCA yields comparable PAL estimates and slightly higher moderate to vigorous physical activity (MVPA) estimates. In older adults with chronic illness, the MARCA is a valid and reliable tool for capturing not only the time and energy expenditure associated with physical and sedentary activities but also information on the types of activities.
Reliability, Validity, and Sensitivity of a Novel Smartphone-Based Eccentric Hamstring Strength Test in Professional Football Players.

Science.gov (United States)

Lee, Justin W Y; Cai, Ming-Jing; Yung, Patrick S H; Chan, Kai-Ming

2018-05-01

To evaluate the test-retest reliability, sensitivity, and concurrent validity of a smartphone-based method for assessing eccentric hamstring strength among male professional football players. A total of 25 healthy male professional football players performed the Chinese University of Hong Kong (CUHK) Nordic break-point test, hamstring fatigue protocol, and isokinetic hamstring strength test. The CUHK Nordic break-point test is based on a Nordic hamstring exercise. The Nordic break-point angle was defined as the maximum point where the participant could no longer support the weight of his body against gravity. The criterion for the sensitivity test was the presprinting and postsprinting difference of the Nordic break-point angle with a hamstring fatigue protocol. The hamstring fatigue protocol consists of 12 repetitions of the 30-m sprint with 30-s recoveries between sprints. Hamstring peak torque of the isokinetic hamstring strength test was used as the criterion for validity. A high test-retest reliability (intraclass correlation coefficient = .94; 95% confidence interval, .82-.98) was found in the Nordic break-point angle measurements. The Nordic break-point angle significantly correlated with isokinetic hamstring peak torques at eccentric action of 30°/s (r = .88, r 2 = .77, P hamstring strength measures among male professional football players.
Reliability of the Pictorial Scale of Perceived Movement Skill Competence in 2 Diverse Samples of Young Children.

Science.gov (United States)

Barnett, Lisa M; Robinson, Leah E; Webster, E Kipling; Ridgers, Nicola D

2015-08-01

The purpose was to determine the reliability of an instrument designed to assess young children's perceived movement skill competence in 2 diverse samples. A pictorial instrument assessed 12 perceived Fundamental Movement Skills (FMS) based on the Test of Gross Motor Development 2nd edition. Intra-Class Correlations (ICC) and internal consistency analyses were conducted. Paired sample t tests assessed change in mean perceived skill scores. Bivariate correlations between the intertrial difference and the mean of the trials explored proportional bias. Sample 1 (S1) were culturally diverse Australian children (n = 111; 52% boys) aged 5 to 8 years (mean = 6.4, SD = 1.0) with educated parents. Sample 2 (S2) were racially diverse and socioeconomically disadvantaged American children (n = 110; 57% boys) aged 5 to 10 years (mean = 6.8, SD = 1.1). For all children, the internal consistency for 12 FMS was acceptable (S1 = 0.72, 0.75, S2 = 0.66, 0.67). ICCs were higher in S1 (0.73) than S2 (0.50). Mean changes between trials were small. There was little evidence of proportional bias. Lower values in S2 may be due to differences in study demographic and execution. While the instrument demonstrated reliability/internal consistency, further work is recommended in diverse samples.
Relative and Absolute Reliability of Timed Up and Go Test in Community Dwelling Older Adult and Healthy Young People

Directory of Open Access Journals (Sweden)

Farhad Azadi

2014-01-01

Full Text Available Objectives: Relative and absolute reliability are psychometric properties of the test that many clinical decisions are based on them. In many cases, only relative reliability takes into consideration while the absolute reliability is also very important. Methods & Materials: Eleven community-dwelling older adults aged 65 years and older (69.64±3.58 and 20 healthy young in the age range 20 to 35 years (28.80±4.15 using three versions of Timed Up and Go test were evaluated twice with an interval of 2 to 5 days. Results: Generally, the non-homogeneity of the study population was stratified to increase the Intra-class Correlation Coefficient (ICC this coefficient in elderly people is greater than young people and with a secondary task is reduced. In This study, absolute reliability indices using different data sources and equations lead to in more or less similar results. At general, in test–retest situations, the elderly more than the young people must be changed to be interpreted as a real change, not random. The random error contribution is slightly greater in elderly than young and with a secondary task is increased.It seems, heterogeneity leads to moderation in absolute reliability indices. Conclusion: In relative reliability studies, researchers and clinicians should pay attention to factors such as homogeneity of population and etc. As well as, absolute reliability beside relative reliability is needed and necessary in clinical decision making.
Reliability and responsiveness of a goniometric device for measuring the range of motion in the dart-throwing motion plane.

Science.gov (United States)

Kasubuchi, Kenji; Dohi, Yoshihiro; Fujita, Hiroyuki; Fukumoto, Takahiko

2018-02-26

Dart-throwing motion (DTM) is an important component of wrist function and, consequently, has the potential to become an evaluation tool in rehabilitation. However, no measurement method is currently available to reliably measure range of motion (ROM) of the wrist in the DTM plane. To determine the reliability and responsiveness of a goniometric device to measure wrist ROM in the DTM plane. ROM of the wrist in the DTM plane was measured in 70 healthy participants. The intra-class correlation coefficient (ICC) was used to evaluate the relative reliability of measurement, and a Bland-Altman analysis conducted to establish its absolute reliability, including the 95% limits of agreement (95% LOA). The standard error of the measurement (SEM) and minimal detectable change at the 95% confidence level (MDC 95 ) were calculated as measures of responsiveness. The intra-rater ICC was 0.87, and an inter-rater ICC of 0.71. There was no evidence of a fixed or proportional bias. For intra- and inter-rater reliability, 95% LOA ranged from -13.83 to 11.12 and from -17.75 to 16.19, respectively. The SEM and MDC 95 were 4.5° and 12.4°, respectively, for intra-rater reliability, and 6.0° and 16.6°, respectively, for inter-rater reliability. The ROM of the wrist in the DTM plane was measured with fair-to-good reliability and responsiveness and, therefore, has the potential to become an evaluation tool for rehabilitation.
Reliability and Validity of a New Method for Isometric Back Extensor Strength Evaluation Using A Hand-Held Dynamometer.

Science.gov (United States)

Park, Hee-Won; Baek, Sora; Kim, Hong Young; Park, Jung-Gyoo; Kang, Eun Kyoung

2017-10-01

To investigate the reliability and validity of a new method for isometric back extensor strength measurement using a portable dynamometer. A chair equipped with a small portable dynamometer was designed (Power Track II Commander Muscle Tester). A total of 15 men (mean age, 34.8±7.5 years) and 15 women (mean age, 33.1±5.5 years) with no current back problems or previous history of back surgery were recruited. Subjects were asked to push the back of the chair while seated, and their isometric back extensor strength was measured by the portable dynamometer. Test-retest reliability was assessed with intraclass correlation coefficient (ICC). For the validity assessment, isometric back extensor strength of all subjects was measured by a widely used physical performance evaluation instrument, BTE PrimusRS system. The limit of agreement (LoA) from the Bland-Altman plot was evaluated between two methods. The test-retest reliability was excellent (ICC=0.82; 95% confidence interval, 0.65-0.91). The Bland-Altman plots demonstrated acceptable agreement between the two methods: the lower 95% LoA was -63.1 N and the upper 95% LoA was 61.1 N. This study shows that isometric back extensor strength measurement using a portable dynamometer has good reliability and validity.
Adaptation, test-retest reliability, and construct validity of the Physical Activity Neighborhood Environment Scale in Nigeria (PANES-N).

Science.gov (United States)

Oyeyemi, Adewale L; Sallis, James F; Oyeyemi, Adetoyeje Y; Amin, Mariam M; De Bourdeaudhuij, Ilse; Deforche, Benedicte

2013-11-01

This study adapted the Physical Activity Neighborhood Environment Scale (PANES) to the Nigerian context and assessed the test-retest reliability and construct validity of the Nigerian version (PANESN). A multidisciplinary panel of experts adapted the original PANES to reflect the built and social environment of Nigeria. The adapted PANES was subjected to cognitive testing and test retest reliability in a diverse sample of Nigerian adults (N = 132) from different neighborhood types. Intraclass Correlation Coefficients (ICC) was used to assess test-retest reliability, and construct validity was investigated with Analysis of Covariance for differences in environmental attributes between neighborhoods. Four of the 17 items on the original PANES were significantly modified, 3 were removed and 2 new items were incorporated into the final version of adapted PANES-N. Test-retest reliability was substantial to almost perfect (ICC = 0.62-1.00) for all items on the PANES-N, and residents of neighborhoods in the inner city reported higher residential density, land use mix and safety, but lower pedestrian facilities and aesthetics than did residents of government reserved area/new layout neighborhoods. The PANES-N appears promising for assessing environmental perceptions related to physical activity in Nigeria, but further testing is required to assess its applicability across Africa.
The reliability of jump kinematics and kinetics in children of different maturity status.

Science.gov (United States)

Meylan, Cesar M P; Cronin, John B; Oliver, Jon L; Hughes, Michael G; McMaster, D Travis

2012-04-01

The purpose of this study was to determine the reliability of eccentric (ECC) and concentric (CON) kinematic and kinetic variables thought to be critical to jump performance during bilateral vertical countermovement jump (VCMJ) and horizontal countermovement jump (HCMJ) across children of different maturity status. Forty-two athletic male and female participants between 9 and 16 years of age were divided into 3 maturity groups according to peak height velocity (PHV) offset (Post-PHV, At-PHV, and Pre-PHV) and percent of predicted adult stature. All the participants performed 3 VCMJ and HCMJ trials and the kinematics, and kinetics of these jumps were measured via a force plate over 3 testing sessions. In both jumps, vertical CON mean and peak power and jump height or distance were the most reliable measures across all groups (change in the mean [CM] = -5.4 to 6.2%; coefficient of variation [CV] = 2.1-9.4%; Intraclass correlation coefficient [ICC] = 0.82-0.98), whereas vertical ECC mean power was the only ECC variable with acceptable reliability for both jumps (CM = -0.7 to 10.1%; CV = 5.2-15.6%; ICC = 0.74-0.97). A less mature state was "likely" to "very likely" to reduce the reliability of the HCMJ ECC kinetics and kinematics. These findings suggested that movement variability is associated with the ECC phase of CMJs, especially in Pre-PHV during the HCMJ. Vertical CON mean and peak power and ECC mean power were deemed reliable and appropriate to be used in children as indicators of jump and stretch-shortening cycle performance.
INTRA-RATER RELIABILITY OF WII BALANCE BOARD (WBB IN ASSESSING STANDING BALANCE IN OLDER ADULTS

Directory of Open Access Journals (Sweden)

Shilpa Dugani Burji

2014-06-01

Full Text Available Background: WII Balance Board (WBB being one of the latest, advanced technologies of high sensitivity in monitoring change in balance over time and owing to, ease of use, and portability, it is being used in physical therapy clinics as a popular substitute for the expensive and complicated force plates to improve dynamic strength and balance. Despite its growing popularity, the WBB’s reliability as an intervention and assessment tool for balance is still being investigated. So this study aims in finding the accuracy of WBB. The objectives of the study are to find the Intraclass Correlation Coefficient and Standard Error Measurement on both day 1 and day 2 with eyes closed and eyes open in older adults. Method: 30 subjects over the age of 65 years were assessed for balance using WBB. Subjects were measured in double limb stance with eyes open and closed with feet comfortably distant apart on the board. The same procedure was repeated after 24 hours. Results: The study showed to be statistically significant for eyes open on day 1 and day 2, but was not statistically significant for eyes closed on day 1 and day 2. Conclusion: The study suggested that the WBB was reliable for eyes open and not reliable with eyes closed.
Intersession reliability of fMRI activation for heat pain and motor tasks.

Science.gov (United States)

Quiton, Raimi L; Keaser, Michael L; Zhuo, Jiachen; Gullapalli, Rao P; Greenspan, Joel D

2014-01-01

As the practice of conducting longitudinal fMRI studies to assess mechanisms of pain-reducing interventions becomes more common, there is a great need to assess the test-retest reliability of the pain-related BOLD fMRI signal across repeated sessions. This study quantitatively evaluated the reliability of heat pain-related BOLD fMRI brain responses in healthy volunteers across 3 sessions conducted on separate days using two measures: (1) intraclass correlation coefficients (ICC) calculated based on signal amplitude and (2) spatial overlap. The ICC analysis of pain-related BOLD fMRI responses showed fair-to-moderate intersession reliability in brain areas regarded as part of the cortical pain network. Areas with the highest intersession reliability based on the ICC analysis included the anterior midcingulate cortex, anterior insula, and second somatosensory cortex. Areas with the lowest intersession reliability based on the ICC analysis also showed low spatial reliability; these regions included pregenual anterior cingulate cortex, primary somatosensory cortex, and posterior insula. Thus, this study found regional differences in pain-related BOLD fMRI response reliability, which may provide useful information to guide longitudinal pain studies. A simple motor task (finger-thumb opposition) was performed by the same subjects in the same sessions as the painful heat stimuli were delivered. Intersession reliability of fMRI activation in cortical motor areas was comparable to previously published findings for both spatial overlap and ICC measures, providing support for the validity of the analytical approach used to assess intersession reliability of pain-related fMRI activation. A secondary finding of this study is that the use of standard ICC alone as a measure of reliability may not be sufficient, as the underlying variance structure of an fMRI dataset can result in inappropriately high ICC values; a method to eliminate these false positive results was used in this
Intersession reliability of fMRI activation for heat pain and motor tasks

Science.gov (United States)

Quiton, Raimi L.; Keaser, Michael L.; Zhuo, Jiachen; Gullapalli, Rao P.; Greenspan, Joel D.

2014-01-01

As the practice of conducting longitudinal fMRI studies to assess mechanisms of pain-reducing interventions becomes more common, there is a great need to assess the test–retest reliability of the pain-related BOLD fMRI signal across repeated sessions. This study quantitatively evaluated the reliability of heat pain-related BOLD fMRI brain responses in healthy volunteers across 3 sessions conducted on separate days using two measures: (1) intraclass correlation coefficients (ICC) calculated based on signal amplitude and (2) spatial overlap. The ICC analysis of pain-related BOLD fMRI responses showed fair-to-moderate intersession reliability in brain areas regarded as part of the cortical pain network. Areas with the highest intersession reliability based on the ICC analysis included the anterior midcingulate cortex, anterior insula, and second somatosensory cortex. Areas with the lowest intersession reliability based on the ICC analysis also showed low spatial reliability; these regions included pregenual anterior cingulate cortex, primary somatosensory cortex, and posterior insula. Thus, this study found regional differences in pain-related BOLD fMRI response reliability, which may provide useful information to guide longitudinal pain studies. A simple motor task (finger-thumb opposition) was performed by the same subjects in the same sessions as the painful heat stimuli were delivered. Intersession reliability of fMRI activation in cortical motor areas was comparable to previously published findings for both spatial overlap and ICC measures, providing support for the validity of the analytical approach used to assess intersession reliability of pain-related fMRI activation. A secondary finding of this study is that the use of standard ICC alone as a measure of reliability may not be sufficient, as the underlying variance structure of an fMRI dataset can result in inappropriately high ICC values; a method to eliminate these false positive results was used in this
Multicenter reliability of semiautomatic retinal layer segmentation using OCT

Science.gov (United States)

Oberwahrenbrock, Timm; Traber, Ghislaine L.; Lukas, Sebastian; Gabilondo, Iñigo; Nolan, Rachel; Songster, Christopher; Balk, Lisanne; Petzold, Axel; Paul, Friedemann; Villoslada, Pablo; Brandt, Alexander U.; Green, Ari J.

2018-01-01

Objective To evaluate the inter-rater reliability of semiautomated segmentation of spectral domain optical coherence tomography (OCT) macular volume scans. Methods Macular OCT volume scans of left eyes from 17 subjects (8 patients with MS and 9 healthy controls) were automatically segmented by Heidelberg Eye Explorer (v1.9.3.0) beta-software (Spectralis Viewing Module v6.0.0.7), followed by manual correction by 5 experienced operators from 5 different academic centers. The mean thicknesses within a 6-mm area around the fovea were computed for the retinal nerve fiber layer, ganglion cell layer (GCL), inner plexiform layer (IPL), inner nuclear layer, outer plexiform layer (OPL), and outer nuclear layer (ONL). Intraclass correlation coefficients (ICCs) were calculated for mean layer thickness values. Spatial distribution of ICC values for the segmented volume scans was investigated using heat maps. Results Agreement between raters was good (ICC > 0.84) for all retinal layers, particularly inner retinal layers showed excellent agreement across raters (ICC > 0.96). Spatial distribution of ICC showed highest values in the perimacular area, whereas the ICCs were poorer for the foveola and the more peripheral macular area. The automated segmentation of the OPL and ONL required the most correction and showed the least agreement, whereas differences were less prominent for the remaining layers. Conclusions Automated segmentation with manual correction of macular OCT scans is highly reliable when performed by experienced raters and can thus be applied in multicenter settings. Reliability can be improved by restricting analysis to the perimacular area and compound segmentation of GCL and IPL. PMID:29552598
Reliability and fall experience discrimination of Cross Step Moving on Four Spots Test in the elderly.

Science.gov (United States)

Yamaji, Shunsuke; Demura, Shinichi

2013-07-01

To examine the reliability and fall experience discrimination of the Cross Step Moving on Four Spots Test (CSFT) and the relationship between CSFT and fall-related physical function. The reliability of the CSFT was examined in a test-retest format with the same tester. Fall history, fall risk, fear of falling, activities of daily living (ADL), and various physical parameters were measured for all participants. A community center and university medical school. Elderly community-dwelling subjects (N=533; 62 men, 471 women) aged 65 to 94 years living independently. Not applicable. Time to complete all the CSFT steps required, fall risk score, ADL score, and fall-related physical function (isometric muscle strength: toe grip, plantar flexion, knee extension, hip flexion, hand grip; balance: 1-leg standing time with eyes open, functional reach test using an elastic stick; and gait: 10-m maximal walking speed). The trial-to-trial reliability test indicated good reliability of the CSFT in both sexes (intraclass correlation coefficient =.833 in men, .825 in women). However, trial-to-trial errors increased with an increase in the CSFT values in both sexes. Significant correlations were observed between the CSFT values and scores for most fall-related physical function tests in both sexes. However, the correlation coefficient for all significant correlations was fall experience) revealed that the fall experience is a significant factor affecting CSFT values; values in fallers were significantly lower than those in nonfallers. The odds ratios in logistic regression analysis were significant in both sexes (men, 1.35; women, 1.48). As determined by the Youden index, the optimal cutoff value for identifying fall experience was 7.32 seconds, with an area under the curve of .676. The CSFT can detect fall experience and is useful in the evaluation of different fall-related physical functions including muscle strength, balance, and mobility. Copyright © 2013 American Congress of
Reliability and Validity of the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2 in Adults with Non-Cancer Pain

Directory of Open Access Journals (Sweden)

Corey J. Hayes

2017-04-01

Full Text Available Limited evidence exists on how non-cancer pain (NCP affects an individual’s health-related quality of life (HRQoL. This study aimed to validate the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2, a generic measure of HRQoL, in a NCP cohort using the Medical Expenditure Panel Survey Longitudinal Files. The SF Mental Component Summary (MCS12 and SF Physical Component Summary (PCS12 were tested for reliability (internal consistency and test-retest reliability and validity (construct: convergent and discriminant; criterion: concurrent and predictive. A total of 15,716 patients with NCP were included in the final analysis. The MCS12 and PCS12 demonstrated high internal consistency (Cronbach’s alpha and Mosier’s alpha > 0.8, and moderate and high test-retest reliability, respectively (MCS12 intraclass correlation coefficient (ICC: 0.64; PCS12 ICC: 0.73. Both scales were significantly associated with a number of chronic conditions (p < 0.05. The PCS12 was strongly correlated with perceived health (r = 0.52 but weakly correlated with perceived mental health (r = 0.25. The MCS12 was moderately correlated with perceived mental health (r = 0.42 and perceived health (r = 0.33. Increasing PCS12 and MCS12 scores were significantly associated with lower odds of reporting future physical and cognitive limitations (PCS12: OR = 0.90 95%CI: 0.89–0.90, MCS12: OR = 0.94 95%CI: 0.93–0.94. In summary, the SF-12v2 is a reliable and valid measure of HRQoL for patients with NCP.
Verification of the reliability and validity of a Japanese version of the Quality of Life in Childhood Epilepsy Questionnaire (QOLCE-J).

Science.gov (United States)

Moriguchi, Eri; Ito, Mikiko; Nagai, Toshisaburo

2015-11-01

A Japanese version of the Quality of Life in Childhood Epilepsy Questionnaire (QOLCE-J) was developed using international guidelines as a QOL scale for childhood epilepsy; its reliability and validity were examined, focusing on Japanese pediatric epilepsy patients applicability. A pilot test questionnaire survey was conducted; involving parents of pediatric epilepsy patients aged 4-15 undergoing outpatient treatment. 278 responses were obtained and analyzed. Internal consistency for the 16 QOLCE-J subscales, except for , was sufficient, and a high overall coefficient α was obtained. The intraclass correlation coefficient was also high, supporting the test-retest reliability of this version. Associations among the subscales, high correlations of r>0.7 were observed among , , and , representing cognitive and behavioral aspects, and among these and . In contrast, correlations among others were moderate or weaker. Furthermore, correlations of r>0.35 were observed among the subscales of the SDQ (Strength and Difficulties Questionnaire) used as an external criterion and the QOLCE-J, confirming the criterion validity of the study version. Analysis of associations between the total QOLCE-J score and pathology of epilepsy, found significant correlation with age of onset and frequency of seizures, ADL, and antiepileptics side effects' symptoms. QOLCE has mostly been used in treatment resistant pediatric patients, the influence of interictal period presently observed, like antiepileptic side effects' symptoms; suggest usefulness for pediatric patients with seizures under control. The QOLCE-J with sufficient reliability and validity may be applicable as a QOL scale for Japanese children with epilepsy. Copyright © 2015 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.
Validity and reliability of the 6 minute walk in persons with fibromyalgia.

Science.gov (United States)

King, S; Wessel, J; Bhambhani, Y; Maikala, R; Sholter, D; Maksymowych, W

1999-10-01

To assess the reliability and construct validity of the 6 minute walk (6MW) in persons with fibromyalgia (FM) and to determine an equation for predicting peak oxygen consumption (pVO2) from the distance covered in 6 minutes. Ninety-six women who met the American College of Rheumatology (ACR) criteria for FM were tested on the 6MW and the Fibromyalgia Impact Questionnaire (FIQ). A subset (n = 23) were tested on a separate day for pVO2 during a symptom-limited, incremental treadmill test. Twelve subjects repeated the 6MW five times over 10 days. Heart rate and rating of perceived exertion (RPE) were recorded for each walk. Intraclass correlations were used to determine the reliability of the 6MW. Validity was examined by correlating the 6MW with pVO2 and the FIQ. Body mass index (BMI) and 6MW were independent variables in a stepwise regression to predict pVO2. A significant increase in distance occurred from Walk 1 to Walk 2 (p = 0.000) with the distance maintained on the remaining walks (p = 0.148) The correlations of the 6MW with the FIQ and pVO2 were -0.325 and 0.657, respectively. The regression equation to predict pVO2 from 6MW distance and BMI was: pVO2 (ml/kg/min) = 21.48 + (-0.4316 x BMI) + [0.0304 x distance(m)] (R = 0.76, R2 = 0.66). When using the 6MW it is necessary to conduct a practice walk, with the second walk taken as the baseline measure. It was determined from the correlations that the 6MW cannot replace the FIQ as a measure of function. The 6MW may be used as an indicator of aerobic fitness, although obtaining VO2 by means of a graded exercise test is preferable.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

Science.gov (United States)

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
Reliability and validity of the 12-item WHODAS 2.0 in patients with Kashin-Beck disease.

Science.gov (United States)

Younus, Mohammad Imran; Wang, Di-Miao; Yu, Fang-Fang; Fang, Hua; Guo, Xiong

2017-09-01

The purpose of this study was to check the reliability and validity of the 12-item Chinese version of the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) for the assessment of disability in patients with Kashin-Beck disease (KBD). We recruited 219 patients with KBD from the high-risk KBD area in the Shaanxi province, using stratified multistage random sampling. We assessed each patient using the Chinese version of the 12-item WHODAS 2.0 and the Western Ontario and McMaster Universities Index of Osteoarthritis (WOMAC). Statistical evaluations of the instruments consisted of Cronbach's alpha, intraclass correlation coefficient (ICC), confirmatory factor analysis (CFA), and Pearson's correlation coefficient. Cronbach's alpha and ICC for the six domains ranged from 0.704 to 0.906 and 0.690 to 0.852, respectively. A six-factor structure fits the data well (CFI = 0.967, TLI = 0.944, RMSEA = 0.08). Regarding convergent validity, the four domains of the 12-item WHODAS 2.0 (getting around, self-care, life activity, and participation) showed moderate-to-strong correlation for all three domains of the WOMAC (0.428 < |r| < 0.804). Regarding divergent validity, the two domains of the 12-item WHODAS 2.0 (understanding and communication, and getting along with people) showed weak correlation for the three domains of WOMAC (0.182 < |r| < 0.295). The Chinese version of 12-item WHODAS 2.0 questionnaire is a reliable and valid instrument when administered to KBD patients.
The reliability of a modified Kalamazoo Consensus Statement Checklist for assessing the communication skills of multidisciplinary clinicians in the simulated environment.

Science.gov (United States)

Peterson, Eleanor B; Calhoun, Aaron W; Rider, Elizabeth A

2014-09-01

With increased recognition of the importance of sound communication skills and communication skills education, reliable assessment tools are essential. This study reports on the psychometric properties of an assessment tool based on the Kalamazoo Consensus Statement Essential Elements Communication Checklist. The Gap-Kalamazoo Communication Skills Assessment Form (GKCSAF), a modified version of an existing communication skills assessment tool, the Kalamazoo Essential Elements Communication Checklist-Adapted, was used to assess learners in a multidisciplinary, simulation-based communication skills educational program using multiple raters. 118 simulated conversations were available for analysis. Internal consistency and inter-rater reliability were determined by calculating a Cronbach's alpha score and intra-class correlation coefficients (ICC), respectively. The GKCSAF demonstrated high internal consistency with a Cronbach's alpha score of 0.844 (faculty raters) and 0.880 (peer observer raters), and high inter-rater reliability with an ICC of 0.830 (faculty raters) and 0.89 (peer observer raters). The Gap-Kalamazoo Communication Skills Assessment Form is a reliable method of assessing the communication skills of multidisciplinary learners using multi-rater methods within the learning environment. The Gap-Kalamazoo Communication Skills Assessment Form can be used by educational programs that wish to implement a reliable assessment and feedback system for a variety of learners. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

Intra- and inter-observer agreement and reliability of bone mineral density measurements around acetabular cup

DEFF Research Database (Denmark)

Mussmann, Bo Redder; Overgaard, Soren; Torfing, Trine

2017-01-01

in measuring bone density (BMD) in complex anatomic structures which might be overcome using dual-energy computed tomography (DECT).PurposeTo test inter- and intra-observer agreement and reliability of in-house segmentation software measuring BMD adjacent to acetabular cup and to compare measurements performed...... with single-energy CT (SECT) and DECT in cemented and cementless cups.Material and Methods: Twenty-four acetabular cups inserted in porcine hip specimens were scanned with SECT and DECT. Bone density was measured in a three-dimensional volume adjacent to the cup. Double measurements were performed.......Results: BMD derived from SECT was approximately four times higher than that of DECT. In both scan modes, intraclass correlation coefficient (ICC) was >0.90 with no differences between repeated measurements, except for uncemented cups where a statistically significant difference of 11 mg/cm3 was found...
Isokinetic Strength and Endurance Tests used Pre- and Post-Spaceflight: Test-Retest Reliability

Science.gov (United States)

Laughlin, Mitzi S.; Lee, Stuart M. C.; Loehr, James A.; Amonette, William E.

2009-01-01

To assess changes in muscular strength and endurance after microgravity exposure, NASA measures isokinetic strength and endurance across multiple sessions before and after long-duration space flight. Accurate interpretation of pre- and post-flight measures depends upon the reliability of each measure. The purpose of this study was to evaluate the test-retest reliability of the NASA International Space Station (ISS) isokinetic protocol. Twenty-four healthy subjects (12 M/12 F, 32.0 +/- 5.6 years) volunteered to participate. Isokinetic knee, ankle, and trunk flexion and extension strength as well as endurance of the knee flexors and extensors were measured using a Cybex NORM isokinetic dynamometer. The first weekly session was considered a familiarization session. Data were collected and analyzed for weeks 2-4. Repeated measures analysis of variance (alpha=0.05) was used to identify weekly differences in isokinetic measures. Test-retest reliability was evaluated by intraclass correlation coefficients (ICC) (3,1). No significant differences were found between weeks in any of the strength measures and the reliability of the strength measures were all considered excellent (ICC greater than 0.9), except for concentric ankle dorsi-flexion (ICC=0.67). Although a significant difference was noted in weekly endurance measures of knee extension (p less than 0.01), the reliability of endurance measure by week were considered excellent for knee flexion (ICC=0.97) and knee extension (ICC=0.96). Except for concentric ankle dorsi-flexion, the isokinetic strength and endurance measures are highly reliable when following the NASA ISS protocol. This protocol should allow accurate interpretation isokinetic data even with a small number of crew members.
Translation, Validation, and Reliability of the Dutch Late-Life Function and Disability Instrument Computer Adaptive Test.

Science.gov (United States)

Arensman, Remco M; Pisters, Martijn F; de Man-van Ginkel, Janneke M; Schuurmans, Marieke J; Jette, Alan M; de Bie, Rob A

2016-09-01

Adequate and user-friendly instruments for assessing physical function and disability in older adults are vital for estimating and predicting health care needs in clinical practice. The Late-Life Function and Disability Instrument Computer Adaptive Test (LLFDI-CAT) is a promising instrument for assessing physical function and disability in gerontology research and clinical practice. The aims of this study were: (1) to translate the LLFDI-CAT to the Dutch language and (2) to investigate its validity and reliability in a sample of older adults who spoke Dutch and dwelled in the community. For the assessment of validity of the LLFDI-CAT, a cross-sectional design was used. To assess reliability, measurement of the LLFDI-CAT was repeated in the same sample. The item bank of the LLFDI-CAT was translated with a forward-backward procedure. A sample of 54 older adults completed the LLFDI-CAT, World Health Organization Disability Assessment Schedule 2.0, RAND 36-Item Short-Form Health Survey physical functioning scale (10 items), and 10-Meter Walk Test. The LLFDI-CAT was repeated in 2 to 8 days (mean=4.5 days). Pearson's r and the intraclass correlation coefficient (ICC) (2,1) were calculated to assess validity, group-level reliability, and participant-level reliability. A correlation of .74 for the LLFDI-CAT function scale and the RAND 36-Item Short-Form Health Survey physical functioning scale (10 items) was found. The correlations of the LLFDI-CAT disability scale with the World Health Organization Disability Assessment Schedule 2.0 and the 10-Meter Walk Test were -.57 and -.53, respectively. The ICC (2,1) of the LLFDI-CAT function scale was .84, with a group-level reliability score of .85. The ICC (2,1) of the LLFDI-CAT disability scale was .76, with a group-level reliability score of .81. The high percentage of women in the study and the exclusion of older adults with recent joint replacement or hospitalization limit the generalizability of the results. The Dutch LLFDI
[Screening for dementia using telephone interviews. An evaluation and reliability study of the Telephone Interview for Cognitive Status (TICS) in its modified German version].

Science.gov (United States)

Matrisch, M; Trampisch, U; Klaassen-Mielke, R; Pientka, L; Trampisch, H J; Thiem, U

2012-04-01

To assess cognitive impairment or dementia in epidemiologic studies using telephone interviews for data acquisition, valid, reliable and short instruments suitable for telephone administration are required. For the Telephone Interview for Cognitive Status (TICS) in its modified German version, the only instrument used in Germany so far, more data on reliability and practicability are needed. Participants were recruited in the offices of nine primary care physicians. Data from 197 participants (115 females, mean age 78.5±4.1 years) who were tested by telephone and in the office by means of the Mini-Mental State Examination (MMSE) were used for the evaluation. For assessing reliability, a group of 91 participants (55 females, mean age 78.1±4.1 years) was contacted twice during 30 days to be tested during a telephone interview by means of the TICS in its modified German version. The intraclass correlation coefficient (ICC), a measure of reliability, was 0.67 [95% confidence interval (CI): 0.53; 0.77]. The Bland-Altman plot did not reveal any relationship between the variability of the difference between repeated measures and the total amount of the measure. For the overall TICS score, no differences were found between repeated measurements. However, the tasks recall of the word list and counting backwards showed some improvement in the repeated tests. TICS and MMSE showed only moderate correlation, with a correlation coefficient of 0.48 (95% CI: 0.36; 0.58). TICS values were dependent on age and educational level of the person tested. The TICS in its modified German version appears to be of acceptable reliability for the assessment of cognitive impairment during a telephone interview. TICS values depend on age and educational level of the person tested. TICS and MMSE correlate only moderately.
Between-day reliability of a method for non-invasive estimation of muscle composition.

Science.gov (United States)

Simunič, Boštjan

2012-08-01

Tensiomyography is a method for valid and non-invasive estimation of skeletal muscle fibre type composition. The validity of selected temporal tensiomyographic measures has been well established recently; there is, however, no evidence regarding the method's between-day reliability. Therefore it is the aim of this paper to establish the between-day repeatability of tensiomyographic measures in three skeletal muscles. For three consecutive days, 10 healthy male volunteers (mean±SD: age 24.6 ± 3.0 years; height 177.9 ± 3.9 cm; weight 72.4 ± 5.2 kg) were examined in a supine position. Four temporal measures (delay, contraction, sustain, and half-relaxation time) and maximal amplitude were extracted from the displacement-time tensiomyogram. A reliability analysis was performed with calculations of bias, random error, coefficient of variation (CV), standard error of measurement, and intra-class correlation coefficient (ICC) with a 95% confidence interval. An analysis of ICC demonstrated excellent agreement (ICC were over 0.94 in 14 out of 15 tested parameters). However, lower CV was observed in half-relaxation time, presumably because of the specifics of the parameter definition itself. These data indicate that for the three muscles tested, tensiomyographic measurements were reproducible across consecutive test days. Furthermore, we indicated the most possible origin of the lowest reliability detected in half-relaxation time. Copyright © 2012 Elsevier Ltd. All rights reserved.
Validity and Reliability of a New Device (WIMU®) for Measuring Hamstring Muscle Extensibility.

Science.gov (United States)

Muyor, José M

2017-09-01

The aims of the current study were 1) to evaluate the validity of the WIMU ® system for measuring hamstring muscle extensibility in the passive straight leg raise (PSLR) test using an inclinometer for the criterion and 2) to determine the test-retest reliability of the WIMU ® system to measure hamstring muscle extensibility during the PSLR test. 55 subjects were evaluated on 2 separate occasions. Data from a Unilever inclinometer and WIMU ® system were collected simultaneously. Intraclass correlation coefficients (ICCs) for the validity were very high (0.983-1); a very low systematic bias (-0.21°--0.42°), random error (0.05°-0.04°) and standard error of the estimate (0.43°-0.34°) were observed (left-right leg, respectively) between the 2 devices (inclinometer and the WIMU ® system). The R 2 between the devices was 0.999 (p<0.001) in both the left and right legs. The test-retest reliability of the WIMU ® system was excellent, with ICCs ranging from 0.972-0.995, low coefficients of variation (0.01%), and a low standard error of the estimate (0.19-0.31°). The WIMU ® system showed strong concurrent validity and excellent test-retest reliability for the evaluation of hamstring muscle extensibility in the PSLR test. © Georg Thieme Verlag KG Stuttgart · New York.
Reliability and validity of the transport and physical activity questionnaire (TPAQ) for assessing physical activity behaviour.

Science.gov (United States)

Adams, Emma J; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C; Cooper, Ashley R; Ogilvie, David

2014-01-01

No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, ptravel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires.
Cross-cultural Adaptation, Reliability, and Validity of the Yoruba Version of the Roland-Morris Disability Questionnaire.

Science.gov (United States)

Mbada, Chidozie Emmanuel; Idowu, Opeyemi Ayodiipo; Ogunjimi, Olawale Richard; Ayanniyi, Olusola; Orimolade, Elkanah Ayodele; Oladiran, Ajibola Babatunde; Johnson, Olubusola Esther; Akinsulore, Adesanmi; Oni, Temitope Olawale

2017-04-01

A translation, cross-cultural adaptation, and psychometric analysis. The aim of this study was to translate, cross-culturally adapt, and validate the Yoruba version of the RMDQ. The Roland-Morris Disability Questionnaire (RMDQ) is a valid outcome tool for low back pain (LBP) in clinical and research settings. There seems to be no valid and reliable version of the RMDQ in the Nigerian languages. Following the Guillemin criteria, the English version of the RMDQ was forward and back translated. Two Yoruba translated versions of the RMDQ were assessed for clarity, common language usage, and conceptual equivalence. Consequently, a harmonized Yoruba version was produced and was pilot-tested among 20 patients with nonspecific long-term LBP (NSLBP) for cognitive debriefing. The final version of the Yoruba RMDQ was tested for its construct validity and re-retest reliability among 120 and 87 patients with NSLBP, respectively. Pearson product moment correlation coefficient (r) of 0.82 was obtained for reliability of the Yoruba version of the RMDQ. The test-retest reliability of the Yoruba RMDQ yielded Cronbach alpha 0.932, while the intraclass correlation (ICC) ranged between 0.896 and 0.956. The analysis of the global scores of both the English and Yoruba versions of the RMDQ yielded ICC value of between 0.995 (95% confidence interval 0.996-0.997), with the item-by-item Kappa agreement ranging between 0.824 and 1.000. The external validity of RMDQ using Quadruple Visual Analogue Scale was r = -0.596 (P = 0.001). The Yoruba version of the RMDQ had no floor/ceiling effects, as no patient achieved either of the maximum or the minimum possible scores. The Yoruba version of the RMDQ has excellent reliability and validity and may be an appropriate outcome tool for clinical and research purposes among Yoruba-speaking patients with LBP. 3.
Within- and between-session reliability of medial gastrocnemius architectural properties

Directory of Open Access Journals (Sweden)

JJ McMahon

2016-05-01

Full Text Available This study aimed to determine the within- and between-session reliability of medial gastrocnemius (MG architecture (e.g. muscle thickness (MT, fascicle length (FL and pennation angle (PA, as derived via ultrasonography followed by manual digitization. A single rater recorded three ultrasound images of the relaxed MG muscle belly for both legs of 16 resistance trained males, who were positioned in a pronated position with their knees fully extended and the ankles in a neutral (e.g. 90° position. A subset of participants (n = 11 were retested under the same conditions ~48-72 hours after baseline testing. The same rater manually digitized each ultrasound image on three occasions to determine MG MT, FL and PA before pooling the data accordingly to allow for within-image (n = 96, between-image (n = 32 and between-session reliability (n = 22 to be determined. Intraclass correlation coefficients (ICCs demonstrated excellent within-image (ICCs = 0.99-1.00, P < 0.001 and very good between-image (ICCs = 0.83-0.95, P < 0.001 and between-session (ICCs = 0.89- 0.95, P < 0.001 reliability for MT, FL and PA. Between-session coefficient of variation was low (≤ 3.6% for each architectural parameter and smallest detectible difference values of 10.6%, 11.4% and 9.8% were attained for MT, FL and PA, respectively. Manually digitizing ultrasound images of the MG muscle at rest yields highly reliable measurements of its architectural properties. Furthermore, changes in MG MT, FL and PA of ≥ 10.6%, 11.4% and 9.8% respectively, as brought about by any form of intervention, should be considered meaningful.
The minimum sit-to-stand height test: reliability, responsiveness and relationship to leg muscle strength.

Science.gov (United States)

Schurr, Karl; Sherrington, Catherine; Wallbank, Geraldine; Pamphlett, Patricia; Olivetti, Lynette

2012-07-01

To determine the reliability of the minimum sit-to-stand height test, its responsiveness and its relationship to leg muscle strength among rehabilitation unit inpatients and outpatients. Reliability study using two measurers and two test occasions. Secondary analysis of data from two clinical trials. Inpatient and outpatient rehabilitation services in three public hospitals. Eighteen hospital patients and five others participated in the reliability study. Seventy-two rehabilitation unit inpatients and 80 outpatients participated in the clinical trials. The minimum sit-to-stand height test was assessed using a standard procedure. For the reliability study, a second tester repeated the minimum sit-to-stand height test on the same day. In the inpatient clinical trial the measures were repeated two weeks later. In the outpatient trial the measures were repeated five weeks later. Knee extensor muscle strength was assessed in the clinical trials using a hand-held dynamometer. The reliability for the minimum sit-to-stand height test was excellent (intraclass correlation coefficient (ICC) 0.91, 95% confidence interval (CI) 0.81-0.96). The standard error of measurement was 34 mm. Responsiveness was moderate in the inpatient trial (effect size: 0.53) but small in the outpatient trial (effect size: 0.16). A small proportion (8-17%) of variability in minimum sit-to-stand height test was explained by knee extensor muscle strength. The minimum sit-to-stand height test has excellent reliability and moderate responsiveness in an inpatient rehabilitation setting. Responsiveness in an outpatient rehabilitation setting requires further investigation. Performance is influenced by factors other than knee extensor muscle strength.
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

Science.gov (United States)

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
Strength Measurements in Acute Hamstring Injuries: Intertester Reliability and Prognostic Value of Handheld Dynamometry.

Science.gov (United States)

Reurink, Gustaaf; Goudswaard, Gert Jan; Moen, Maarten H; Tol, Johannes L; Verhaar, Jan A N; Weir, Adam

2016-08-01

Study Design Cohort study, repeated measures. Background Although hamstring strength measurements are used for assessing prognosis and monitoring recovery after hamstring injury, their actual clinical relevance has not been established. Handheld dynamometry (HHD) is a commonly used method of measuring muscle strength. The reliability of HHD has not been determined in athletes with acute hamstring injuries. Objectives To determine the intertester reliability and the prognostic value of hamstring HHD strength measurement in acute hamstring injuries. Methods We measured knee flexion strength with HHD in 75 athletes at 2 visits, at baseline (within 5 days of hamstring injury) and follow-up (5 to 7 days after the baseline measurement). We assessed isometric hamstring strength in 15° and 90° of knee flexion. Reliability analysis testing was performed by 2 testers independently at the follow-up visit. We recorded the time needed to return to play (RTP) up to 6 months following baseline. Results The intraclass correlation coefficients of the strength measurements in injured hamstrings were between 0.75 and 0.83. There was a statistically significant but weak correlation between the time to RTP and the strength deficit at 15° of knee flexion measured at baseline (Spearman r = 0.25, P = .045) and at the follow-up visit (Spearman r = 0.26, P = .034). Up to 7% of the variance in time to RTP is explained by this strength deficit. None of the other strength variables were significantly correlated with time to RTP. Conclusion Hamstring strength can be reliably measured with HHD in athletes with acute hamstring injuries. The prognostic value of strength measurements is limited, as there is only a weak association between the time to RTP and hamstring strength deficit after acute injury. Level of Evidence Prognosis, level 4. J Orthop Sports Phys Ther 2016;46(8):689-696. Epub 12 May 2016. doi:10.2519/jospt.2016.6363.
The reliability and validity of the informant AD8 by comparison with a series of cognitive assessment tools in primary healthcare.

Science.gov (United States)

Shaik, Muhammad Amin; Xu, Xin; Chan, Qun Lin; Hui, Richard Jor Yeong; Chong, Steven Shih Tsze; Chen, Christopher Li-Hsian; Dong, YanHong

2016-03-01

The validity and reliability of the informant AD8 in primary healthcare has not been established. Therefore, the present study examined the validity and reliability of the informant AD8 in government subsidized primary healthcare centers in Singapore. Eligible patients (≥60 years old) were recruited from primary healthcare centers and their informants received the AD8. Patient-informant dyads who agreed for further cognitive assessments received the Mini-Mental State Examination (MMSE), Montreal Cognitive Assessment (MoCA), Clinical Dementia Rating (CDR), and a locally validated formal neuropsychological battery at a research center in a tertiary hospital. 1,082 informants completed AD8 assessment at two primary healthcare centers. Of these, 309 patients-informant dyads were further assessed, of whom 243 (78.6%) were CDR = 0; 22 (7.1%) were CDR = 0.5; and 44 (14.2%) were CDR≥1. The mean administration time of the informant AD8 was 2.3 ± 1.0 minutes. The informant AD8 demonstrated good internal consistency (Cronbach's α = 0.85); inter-rater reliability (Intraclass Correlation Coefficient (ICC) = 0.85); and test-retest reliability (weighted κ = 0.80). Concurrent validity, as measured by the correlation between total AD8 scores and CDR global (R = 0.65, p validity, as measured by convergent validity (R ≥ 0.4) between individual items of AD8 with CDR and neuropsychological domains was acceptable. The informant AD8 demonstrated good concurrent and construct validity and is a reliable measure to detect cognitive dysfunction in primary healthcare.
Validity and reliability of central blood pressure estimated by upper arm oscillometric cuff pressure.

Science.gov (United States)

Climie, Rachel E D; Schultz, Martin G; Nikolic, Sonja B; Ahuja, Kiran D K; Fell, James W; Sharman, James E

2012-04-01

Noninvasive central blood pressure (BP) independently predicts mortality, but current methods are operator-dependent, requiring skill to obtain quality recordings. The aims of this study were first, to determine the validity of an automatic, upper arm oscillometric cuff method for estimating central BP (O(CBP)) by comparison with the noninvasive reference standard of radial tonometry (T(CBP)). Second, we determined the intratest and intertest reliability of O(CBP). To assess validity, central BP was estimated by O(CBP) (Pulsecor R6.5B monitor) and compared with T(CBP) (SphygmoCor) in 47 participants free from cardiovascular disease (aged 57 ± 9 years) in supine, seated, and standing positions. Brachial mean arterial pressure (MAP) and diastolic BP (DBP) from the O(CBP) device were used to calibrate in both devices. Duplicate measures were recorded in each position on the same day to assess intratest reliability, and participants returned within 10 ± 7 days for repeat measurements to assess intertest reliability. There was a strong intraclass correlation (ICC = 0.987, P difference (1.2 ± 2.2 mm Hg) for central systolic BP (SBP) determined by O(CBP) compared with T(CBP). Ninety-six percent of all comparisons (n = 495 acceptable recordings) were within 5 mm Hg. With respect to reliability, there were strong correlations but higher limits of agreement for the intratest (ICC = 0.975, P difference 0.6 ± 4.5 mm Hg) and intertest (ICC = 0.895, P difference 4.3 ± 8.0 mm Hg) comparisons. Estimation of central SBP using cuff oscillometry is comparable to radial tonometry and has good reproducibility. As a noninvasive, relatively operator-independent method, O(CBP) may be as useful as T(CBP) for estimating central BP in clinical practice.
Assessment of the reliability and consistency of the "malnutrition inflammation score" (MIS) in Mexican adults with chronic kidney disease for diagnosis of protein-energy wasting syndrome (PEW).

Science.gov (United States)

González-Ortiz, Ailema Janeth; Arce-Santander, Celene Viridiana; Vega-Vega, Olynka; Correa-Rotter, Ricardo; Espinosa-Cuevas, María de Los Angeles

2014-10-04

The protein-energy wasting syndrome (PEW) is a condition of malnutrition, inflammation, anorexia and wasting of body reserves resulting from inflammatory and non-inflammatory conditions in patients with chronic kidney disease (CKD).One way of assessing PEW, extensively described in the literature, is using the Malnutrition Inflammation Score (MIS). To assess the reliability and consistency of MIS for diagnosis of PEW in Mexican adults with CKD on hemodialysis (HD). Study of diagnostic tests. A sample of 45 adults with CKD on HD were analyzed during the period June-July 2014.The instrument was applied on 2 occasions; the test-retest reliability was calculated using the Intraclass Correlation Coefficient (ICC); the internal consistency of the questionnaire was analyzed using Cronbach's αcoefficient. A weighted Kappa test was used to estimate the validity of the instrument; the result was subsequently compared with the Bilbrey nutritional index (BNI). The reliability of the questionnaires, evaluated in the patient sample, was ICC=0.829.The agreement between MIS observations was considered adequate, k= 0.585 (p <0.001); when comparing it with BNI, a value of k = 0.114 was obtained (p <0.001).In order to estimate the tendency, a correlation test was performed. The r² correlation coefficient was 0.488 (P <0.001). MIS has adequate reliability and validity for diagnosing PEW in the population with chronic kidney disease on HD. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Reliability, Validity, and Significance of Assessment of Sense of Contribution in the Workplace

Directory of Open Access Journals (Sweden)

Jiro Takaki

2014-01-01

Full Text Available The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS, a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%–80.2%. Fifty-four workers were included in the analysis of test–retest reliability (response rate, 77.1%. The SCS showed high internal consistency (Cronbach’s α coefficients in men and women were 0.85 and 0.86, respectively and test–retest reliability (intraclass correlation coefficient = 0.91. Significant (p < 0.001, positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS’s convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05 and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort–reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Validity and reliability of global operative assessment of laparoscopic skills (GOALS) in novice trainees performing a laparoscopic cholecystectomy.

Science.gov (United States)

Kramp, Kelvin H; van Det, Marc J; Hoff, Christiaan; Lamme, Bas; Veeger, Nic J G M; Pierie, Jean-Pierre E N

2015-01-01

Global Operative Assessment of Laparoscopic Skills (GOALS) assessment has been designed to evaluate skills in laparoscopic surgery. A longitudinal blinded study of randomized video fragments was conducted to estimate the validity and reliability of GOALS in novice trainees. In total, 10 trainees each performed 6 consecutive laparoscopic cholecystectomies. Sixty procedures were recorded on video. Video fragments of (1) opening of the peritoneum; (2) dissection of Calot's triangle and achievement of critical view of safety; and (3) dissection of the gallbladder from the liver bed were blinded, randomized, and rated by 2 consultant surgeons using GOALS. Also, a grade was given for overall competence. The correlation of GOALS with live observation Objective Structured Assessment of Technical Skills (OSATS) scores was calculated. Construct validity was estimated using the Friedman 2-way analysis of variance by ranks and the Wilcoxon signed-rank test. The interrater reliability was calculated using the absolute and consistency agreement 2-way random-effects model intraclass correlation coefficient. A high correlation was found between mean GOALS score (r = 0.879, p = 0.021) and mean OSATS score. The GOALS score increased significantly across the 6 procedures (p = 0.002). The trainees performed significantly better on their sixth when compared with their first cholecystectomy (p = 0.004). The consistency agreement interrater reliability was 0.37 for the mean GOALS score (p = 0.002) and 0.55 for overall competence (p < 0.001) of the 3 video fragments. The validity observed in this randomized blinded longitudinal study supports the existing evidence that GOALS is a valid tool for assessment of novice trainees. A relatively low reliability was found in this study. Copyright © 2014 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Reliability and validity of the Japanese version of the simplified nutritional appetite questionnaire in community-dwelling older adults.

Science.gov (United States)

Nakatsu, Nobuyuki; Sawa, Ryuichi; Misu, Shogo; Ueda, Yuya; Ono, Rei

2015-12-01

To translate the Simplified Nutritional Appetite Questionnaire (SNAQ) into Japanese, and assess its reliability and validity in Japanese community-dwelling older adults. A total of 84 community-dwelling older adults people aged 65 years or older were included in the present study, and those with a Mini-Mental State Examination score of validity was evaluated by measuring the Pearson's correlation coefficient between the SNAQ and Mini-Nutritional Assessment Short-Form scores, Geriatric Depression Scale scores, walking speed test, chair-stand test, hand grip strength test, or the Timed Up and Go test. The mean score of the Japanese version of the SNAQ was 15.5, with a Cronbach's alpha coefficient of 0.545 and intraclass correlation coefficient of 0.754. Factor analysis showed a single factor with 50.0% explained variance. The SNAQ was significantly associated with the Mini-Nutritional Assessment Short-Form, Geriatric Depression Scale, walking speed test, chair-stand test and the Timed Up and Go test. Handgrip strength test did not show a significant association with the SNAQ. The Japanese version of the SNAQ had sufficient reliability and validity. Furthermore, SNAQ (Japanese version) is useful for evaluating the appetite of community-dwelling older adults in Japan. Geriatr Gerontol Int 2015; 15: 1264-1269. © 2014 Japan Geriatrics Society.
CPM Test-Retest Reliability: "Standard" vs "Single Test-Stimulus" Protocols.

Science.gov (United States)

Granovsky, Yelena; Miller-Barmak, Adi; Goldstein, Oren; Sprecher, Elliot; Yarnitsky, David

2016-03-01

Assessment of pain inhibitory mechanisms using conditioned pain modulation (CPM) is relevant clinically in prediction of pain and analgesic efficacy. Our objective is to provide necessary estimates of intersession CPM reliability, to enable transformation of the CPM paradigm into a clinical tool. Two cohorts of young healthy subjects (N = 65) participated in two dual-session studies. In Study I, a Bath-Thermode CPM protocol was used, with hot water immersion and contact heat as conditioning- and test-stimuli, respectively, in a classical parallel CPM design introducing test-stimulus first, and then the conditioning- and repeated test-stimuli in parallel. Study II consisted of two CPM protocols: 1) Two-Thermodes, one for each of the stimuli, in the same parallel design as above, and 2) single test-stimulus (STS) protocol with a single administration of a contact heat test-stimulus, partially overlapped in time by a remote shorter contact heat as conditioning stimulus. Test-retest reliability was assessed within 3-7 days. The STS-CPM had superior reliability intraclass correlation (ICC 2 ,: 1 = 0.59) over Bath-Thermode (ICC 2 ,: 1 = 0.34) or Two-Thermodes (ICC 2 ,: 1 = 0.21) protocols. The hand immersion conditioning pain had higher reliability than thermode pain (ICC 2 ,: 1 = 0.76 vs ICC 2 ,: 1 = 0.16). Conditioned test-stimulus pain scores were of good (ICC 2 ,: 1 = 0.62) or fair (ICC 2 ,: 1 = 0.43) reliability for the Bath-Thermode and the STS, respectively, but not for the Two-Thermodes protocol (ICC 2 ,: 1 = 0.20). The newly developed STS-CPM paradigm was more reliable than other CPM protocols tested here, and should be further investigated for its clinical relevance. It appears that large contact size of the conditioning-stimulus and use of single rather than dual test-stimulus pain contribute to augmentation of CPM reliability. © 2015 American Academy of Pain Medicine. All rights reserved. For permissions, please e
Reliability of Two Smartphone Applications for Radiographic Measurements of Hallux Valgus Angles.

Science.gov (United States)

Mattos E Dinato, Mauro Cesar; Freitas, Marcio de Faria; Milano, Cristiano; Valloto, Elcio; Ninomiya, André Felipe; Pagnano, Rodrigo Gonçalves

The objective of the present study was to assess the reliability of 2 smartphone applications compared with the traditional goniometer technique for measurement of radiographic angles in hallux valgus and the time required for analysis with the different methods. The radiographs of 31 patients (52 feet) with a diagnosis of hallux valgus were analyzed. Four observers, 2 with >10 years' experience in foot and ankle surgery and 2 in-training surgeons, measured the hallux valgus angle and intermetatarsal angle using a manual goniometer technique and 2 smartphone applications (Hallux Angles and iPinPoint). The interobserver and intermethod reliability were estimated using intraclass correlation coefficients (ICCs), and the time required for measurement of the angles among the 3 methods was compared using the Friedman test. A very good or good interobserver reliability was found among the 4 observers measuring the hallux valgus angle and intermetatarsal angle using the goniometer (ICC 0.913 and 0.821, respectively) and iPinPoint (ICC 0.866 and 0.638, respectively). Using the Hallux Angles application, a very good interobserver reliability was found for measurements of the hallux valgus angle (ICC 0.962) and intermetatarsal angle (ICC 0.935) only among the more experienced observers. The time required for the measurements was significantly shorter for the measurements using both smartphone applications compared with the goniometer method. One smartphone application (iPinPoint) was reliable for measurements of the hallux valgus angles by either experienced or nonexperienced observers. The use of these tools might save time in the evaluation of radiographic angles in the hallux valgus. Copyright © 2016 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.