WorldWideScience

Sample records for valid assessment measures

  1. Assessing the validity of parenting measures in a sample of chinese adolescents.

    Science.gov (United States)

    Supple, Andrew J; Peterson, Gary W; Bush, Kevin R

    2004-09-01

    The purpose of this study was to assess the construct validity of adolescent-report parenting behavior measures (primarily derived from the Parental Behavior Measure) in a sample of 480 adolescents from Beijing, China. Results suggest that maternal support, monitoring, and autonomy granting were valid measures when assessing maternal socialization strategies and Chinese adolescent development. Measures of punitiveness and love withdrawal demonstrated limited validity, whereas maternal positive induction demonstrated little validity. The major implications of these results are that measures of "negative" parenting that included physical or psychological manipulations may not have salience for the development of Chinese adolescents. Moreover, researchers and clinicians should question the applicability of instruments and measures designed to assess family process when working with individuals in families from diverse cultural backgrounds. Copyright 2004 American Psychological Association

  2. Discriminant content validity: a quantitative methodology for assessing content of theory-based measures, with illustrative applications.

    Science.gov (United States)

    Johnston, Marie; Dixon, Diane; Hart, Jo; Glidewell, Liz; Schröder, Carin; Pollard, Beth

    2014-05-01

    In studies involving theoretical constructs, it is important that measures have good content validity and that there is not contamination of measures by content from other constructs. While reliability and construct validity are routinely reported, to date, there has not been a satisfactory, transparent, and systematic method of assessing and reporting content validity. In this paper, we describe a methodology of discriminant content validity (DCV) and illustrate its application in three studies. Discriminant content validity involves six steps: construct definition, item selection, judge identification, judgement format, single-sample test of content validity, and assessment of discriminant items. In three studies, these steps were applied to a measure of illness perceptions (IPQ-R) and control cognitions. The IPQ-R performed well with most items being purely related to their target construct, although timeline and consequences had small problems. By contrast, the study of control cognitions identified problems in measuring constructs independently. In the final study, direct estimation response formats for theory of planned behaviour constructs were found to have as good DCV as Likert format. The DCV method allowed quantitative assessment of each item and can therefore inform the content validity of the measures assessed. The methods can be applied to assess content validity before or after collecting data to select the appropriate items to measure theoretical constructs. Further, the data reported for each item in Appendix S1 can be used in item or measure selection. Statement of contribution What is already known on this subject? There are agreed methods of assessing and reporting construct validity of measures of theoretical constructs, but not their content validity. Content validity is rarely reported in a systematic and transparent manner. What does this study add? The paper proposes discriminant content validity (DCV), a systematic and transparent method

  3. Validation of a measurement tool to assess awareness of breast cancer.

    Science.gov (United States)

    Linsell, Louise; Forbes, Lindsay J L; Burgess, Caroline; Kapari, Marcia; Thurnham, Angela; Ramirez, Amanda J

    2010-05-01

    Until now, there has been no universally accepted and validated measure of breast cancer awareness. This study aimed to validate the new Breast Cancer Awareness Measure (BCAM) which assesses, using a self-complete questionnaire, knowledge of breast cancer symptoms and age-related risk, and frequency of breast checking. We measured the psychometric properties of the BCAM in 1035 women attending the NHS Breast Screening Programme: acceptability was assessed using a feedback questionnaire (n=292); sensitivity to change after an intervention promoting breast cancer awareness (n=576), and test-retest reliability (n=167). We also assessed readability, and construct validity using the 'known-groups' method. The readability of the BCAM was high. Over 90% of women found it acceptable. The BCAM was sensitive to change: there was an increase in the proportion of women obtaining the full score for breast cancer awareness one month after receiving the intervention promoting breast cancer awareness; this was greater among those who received a more intensive version (less intensive version (booklet): 9.3%, 95% confidence interval (CI): 4.5-14.1%; more intensive version (interaction with health professional plus booklet): 30%, 95% CI: 23.4-36.6%). Test-retest reliability of the BCAM was moderate to good for most items. Cancer experts had higher levels of cancer awareness than non-medical academics (50% versus 6%, p=0.001), indicating good construct validity. The BCAM is a valid and robust measure of breast cancer awareness suitable for use in surveys of breast cancer awareness in the general population and to evaluate the impact of awareness-raising interventions. Copyright (c) 2010 Elsevier Ltd. All rights reserved.

  4. Feasibility and validity of accelerometer measurements to assess physical activity in toddlers

    Directory of Open Access Journals (Sweden)

    De Bourdeaudhuij Ilse

    2011-06-01

    Full Text Available Abstract Background Accelerometers are considered to be the most promising tool for measuring physical activity (PA in free-living young children. So far, no studies have examined the feasibility and validity of accelerometer measurements in children under 3 years of age. Therefore, the purpose of the present study was to examine the feasibility and validity of accelerometer measurements in toddlers (1- to 3-year olds. Methods Forty-seven toddlers (25 boys; 20 ± 4 months wore a GT1M ActiGraph accelerometer for 6 consecutive days and parental perceptions of the acceptability of wearing the monitor were assessed to examine feasibility. To investigate the validity of the ActiGraph and the predictive validity of three ActiGraph cut points, accelerometer measurements of 31 toddlers (17 boys; 20 ± 4 months during free play at child care were compared to directly observed PA, using the Observational System for Recording Physical Activity in Children-Preschool (OSRAC-P. Validity was assessed using Pearson and Spearman correlations and predictive validity using area under the Receiver Operating Characteristic curve (ROC-AUC. Results The feasibility examination indicated that accelerometer measurements of 30 toddlers (63.8% could be included with a mean registration time of 564 ± 62 min during weekdays and 595 ± 83 min during weekend days. According to the parental reports, 83% perceived wearing the accelerometer as 'not unpleasant and not pleasant' and none as 'unpleasant'. The validity evaluation showed that mean ActiGraph activity counts were significantly and positively associated with mean OSRAC-P activity intensity (r = 0.66; p Conclusions The present findings suggest that ActiGraph accelerometer measurements are feasible and valid for quantifying PA in toddlers. However, further research is needed to accurately identify PA intensities in toddlers using accelerometry.

  5. The development and validation of measures to assess cooking skills and food skills.

    Science.gov (United States)

    Lavelle, Fiona; McGowan, Laura; Hollywood, Lynsey; Surgenor, Dawn; McCloat, Amanda; Mooney, Elaine; Caraher, Martin; Raats, Monique; Dean, Moira

    2017-09-02

    With the increase use of convenience food and eating outside the home environment being linked to the obesity epidemic, the need to assess and monitor individuals cooking and food skills is key to help intervene where necessary to promote the usage of these skills. Therefore, this research aimed to develop and validate a measure for cooking skills and one for food skills, that are clearly described, relatable, user-friendly, suitable for different types of studies, and applicable across all sociodemographic levels. Two measures were developed in light of the literature and expert opinion and piloted for clarity and ease of use. Following this, four studies were undertaken across different cohorts (including a sample of students, both 'Food preparation novices' and 'Experienced food preparers', and a nationally representative sample) to assess temporal stability, psychometrics, internal consistency reliability and construct validity of both measures. Analysis included T-tests, Pearson's correlations, factor analysis, and Cronbach's alphas, with a significance level of 0.05. Both measures were found to have a significant level of temporal stability (P cooking skills confidence measure ranged from 0.78 to 0.93 across all cohorts. The food skills confidence measure's Cronbach's alpha's ranged from 0.85 to 0.94. The two measures also showed a high discriminate validity as there were significant differences (P cooking skills confidence and P cooking skills confidence measure and the food skills confidence measure have been shown to have a very satisfactory reliability, validity and are consistent over time. Their user-friendly applicability make both measures highly suitable for large scale cross-sectional, longitudinal and intervention studies to assess or monitor cooking and food skills levels and confidence.

  6. Assessing behavioural changes in ALS: cross-validation of ALS-specific measures.

    Science.gov (United States)

    Pinto-Grau, Marta; Costello, Emmet; O'Connor, Sarah; Elamin, Marwa; Burke, Tom; Heverin, Mark; Pender, Niall; Hardiman, Orla

    2017-07-01

    The Beaumont Behavioural Inventory (BBI) is a behavioural proxy report for the assessment of behavioural changes in ALS. This tool has been validated against the FrSBe, a non-ALS-specific behavioural assessment, and further comparison of the BBI against a disease-specific tool was considered. This study cross-validates the BBI against the ALS-FTD-Q. Sixty ALS patients, 8% also meeting criteria for FTD, were recruited. All patients were evaluated using the BBI and the ALS-FTD-Q, completed by a carer. Correlational analysis was performed to assess construct validity. Precision, sensitivity, specificity, and overall accuracy of the BBI when compared to the ALS-FTD-Q, were obtained. The mean score of the whole sample on the BBI was 11.45 ± 13.06. ALS-FTD patients scored significantly higher than non-demented ALS patients (31.6 ± 14.64, 9.62 ± 11.38; p ALS-FTD-Q was observed (r = 0.807, p ALS-FTD-Q. Good construct validity has been further confirmed when the BBI is compared to an ALS-specific tool. Furthermore, the BBI is a more comprehensive behavioural assessment for ALS, as it measures the whole behavioural spectrum in this condition.

  7. Construct Validity and Case Validity in Assessment

    Science.gov (United States)

    Teglasi, Hedwig; Nebbergall, Allison Joan; Newman, Daniel

    2012-01-01

    Clinical assessment relies on both "construct validity", which focuses on the accuracy of conclusions about a psychological phenomenon drawn from responses to a measure, and "case validity", which focuses on the synthesis of the full range of psychological phenomena pertaining to the concern or question at hand. Whereas construct validity is…

  8. Patient Experiences with the Preoperative Assessment Clinic (PEPAC): validation of an instrument to measure patient experiences

    NARCIS (Netherlands)

    Edward, G. M.; Lemaire, L. C.; Preckel, B.; Oort, F. J.; Bucx, M. J. L.; Hollmann, M. W.; de Haes, J. C. J. M.

    2007-01-01

    Background. Presently, no comprehensive and validated questionnaire to measure patient experiences of the preoperative assessment clinic (PAC) is available. We developed and validated the Patient Experiences with the Preoperative Assessment Clinic (PEPAC) questionnaire, which can be used for

  9. Validation of a measurement tool for self-assessment of teamwork in intensive care.

    Science.gov (United States)

    Weller, J; Shulruf, B; Torrie, J; Frengley, R; Boyd, M; Paul, A; Yee, B; Dzendrowskyj, P

    2013-09-01

    Teamwork is an important contributor to patient safety and a validated teamwork measurement tool could help healthcare teams identify areas for improvement and measure progress. We explored the psychometric properties of a teamwork measurement tool when used for self-assessment. We hypothesized that the tool had a valid factor structure and that scores from participants and external assessors would correlate. Forty intensive care teams (one doctor, three nurses) participated in four simulated emergencies, and each independently rated their team's performance at the end of each case using the teamwork measurement tool, without prior training in the use of the tool. We used exploratory factor analysis (EFA) and confirmatory factor analysis (CFA), and compared factor structure between participants and external assessors (using previously reported data). Scores from participants and external assessors were compared using Pearson's correlation coefficient. EFA demonstrated items loaded onto three distinct factors which were supported by the CFA. We found significant correlations between external and participant scores for overall teamwork scores and the three factors. Participants agreed with external assessors on the ranking of overall team performance but scored themselves significantly higher than external assessors. The teamwork measurement tool has a valid structure when used for self-assessment. Participant and external assessor scores correlated significantly, suggesting that participants could discriminate between different levels of performance, although leniency in self-assessed scores indicated the need for calibration. This tool could help structure reflection on teamwork and potentially facilitate self-directed, workplace-based improvement in teamwork.

  10. Assessing College Student-Athletes' Life Stress: Initial Measurement Development and Validation

    Science.gov (United States)

    Lu, Frank Jing-Horng; Hsu, Ya-Wen; Chan, Yuan-Shuo; Cheen, Jang-Rong; Kao, Kuei-Tsu

    2012-01-01

    College student-athletes have unique life stress that warrants close attention. The purpose of this study was to develop a reliable and valid measurement assessing college student-athletes' life stress. In Study 1, a focus group discussion and Delphi method produced a questionnaire draft, termed the College Student-Athletes' Life Stress Scale. In…

  11. Assessing personal initiative among vocational training students: development and validation of a new measure.

    Science.gov (United States)

    Balluerka, Nekane; Gorostiaga, Arantxa; Ulacia, Imanol

    2014-11-14

    Personal initiative characterizes people who are proactive, persistent and self-starting when facing the difficulties that arise in achieving goals. Despite its importance in the educational field there is a scarcity of measures to assess students' personal initiative. Thus, the aim of the present study was to develop a questionnaire to assess this variable in the academic environment and to validate it for adolescents and young adults. The sample comprised 244 vocational training students. The questionnaire showed a factor structure including three factors (Proactivity-Prosocial behavior, Persistence and Self-Starting) with acceptable indices of internal consistency (ranging between α = .57 and α =.73) and good convergent validity with respect to the Self-Reported Initiative scale. Evidence of external validity was also obtained based on the relationships between personal initiative and variables such as self-efficacy, enterprising attitude, responsibility and control aspirations, conscientiousness, and academic achievement. The results indicate that this new measure is very useful for assessing personal initiative among vocational training students.

  12. Commentary: moving toward cost-effectiveness in using psychophysiological measures in clinical assessment: validity, decision making, and adding value.

    Science.gov (United States)

    Youngstrom, Eric A; De Los Reyes, Andres

    2015-01-01

    Psychophysiological measures offer a variety of potential advantages, including more direct assessment of certain processes, as well as provision of information that may contrast with other sources. The role of psychophysiological measures in clinical practice will be best defined when researchers (a) switch to research designs and statistical models that better approximate how clinicians administer assessments and make clinical decisions in practice, (b) systematically compare the validity of psychophysiological measures to incumbent methods for assessing similar criteria, (c) test whether psychophysiological measures show either greater validity or clinically meaningful incremental validity, and (d) factor in fiscal costs as well as the utilities that the client attaches to different assessment outcomes. The statistical methods are now readily available, along with the interpretive models for integrating assessment results into client-centered decision making. These, combined with technology reducing the cost of psychophysiological measurement and improving ease of interpretation, poise the field for a rapid transformation of assessment practice, but only if we let go of old habits of research.

  13. Validation of the Cognitive Assessment of Later Life Status (CALLS instrument: a computerized telephonic measure

    Directory of Open Access Journals (Sweden)

    Parsons Thomas D

    2007-05-01

    Full Text Available Abstract Background Brief screening tests have been developed to measure cognitive performance and dementia, yet they measure limited cognitive domains and often lack construct validity. Neuropsychological assessments, while comprehensive, are too costly and time-consuming for epidemiological studies. This study's aim was to develop a psychometrically valid telephone administered test of cognitive function in aging. Methods Using a sequential hierarchical strategy, each stage of test development did not proceed until specified criteria were met. The 30 minute Cognitive Assessment of Later Life Status (CALLS measure and a 2.5 hour in-person neuropsychological assessment were conducted with a randomly selected sample of 211 participants 65 years and older that included equivalent distributions of men and women from ethnically diverse populations. Results Overall Cronbach's coefficient alpha for the CALLS test was 0.81. A principal component analysis of the CALLS tests yielded five components. The CALLS total score was significantly correlated with four neuropsychological assessment components. Older age and having a high school education or less was significantly correlated with lower CALLS total scores. Females scored better overall than males. There were no score differences based on race. Conclusion The CALLS test is a valid measure that provides a unique opportunity to reliably and efficiently study cognitive function in large populations.

  14. Assessing the validity of single-item life satisfaction measures: results from three large samples.

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E

    2014-12-01

    The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS)-a more psychometrically established measure. Two large samples from Washington (N = 13,064) and Oregon (N = 2,277) recruited by the Behavioral Risk Factor Surveillance System and a representative German sample (N = 1,312) recruited by the Germany Socio-Economic Panel were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Consistent across three samples, single-item life satisfaction measures demonstrated substantial degree of criterion validity with the SWLS (zero-order r = 0.62-0.64; disattenuated r = 0.78-0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001-0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015-0.042). Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answer to substantive questions regardless of which measure they use.

  15. Validation of the "Security Needs Assessment Profile" for measuring the profiles of security needs of Chinese forensic psychiatric inpatients.

    Science.gov (United States)

    Siu, B W M; Au-Yeung, C C Y; Chan, A W L; Chan, L S Y; Yuen, K K; Leung, H W; Yan, C K; Ng, K K; Lai, A C H; Davies, S; Collins, M

    Mapping forensic psychiatric services with the security needs of patients is a salient step in service planning, audit and review. A valid and reliable instrument for measuring the security needs of Chinese forensic psychiatric inpatients was not yet available. This study aimed to develop and validate the Chinese version of the Security Needs Assessment Profile for measuring the profiles of security needs of Chinese forensic psychiatric inpatients. The Security Needs Assessment Profile by Davis was translated into Chinese. Its face validity, content validity, construct validity and internal consistency reliability were assessed by measuring the security needs of 98 Chinese forensic psychiatric inpatients. Principal factor analysis for construct validity provided a six-factor security needs model explaining 68.7% of the variance. Based on the Cronbach's alpha coefficient, the internal consistency reliability was rated as acceptable for procedural security (0.73), and fair for both physical security (0.62) and relational security (0.58). A significant sex difference (p=0.002) in total security score was found. The Chinese version of the Security Needs Assessment Profile is a valid and reliable instrument for assessing the security needs of Chinese forensic psychiatric inpatients. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Validity and reproducibility of crutch force and heart rate measurements to assess energy expenditure of paraplegic gait

    NARCIS (Netherlands)

    IJzerman, Maarten Joost; Baardman, Gert; van 't Hof, Martin A.; Boom, H.B.K.; Hermens, Hermanus J.; Veltink, Petrus H.

    1999-01-01

    Objective: To determine the validity and reproducibility of heart rate (HR) and crutch force measurements to estimate energy expenditure during paraplegic walking. Usefulness of these outcome measures in comparative trials was assessed in terms of responsiveness. Design: Cross-sectional validity was

  17. Assessing Workplace Emotional Intelligence: Development and Validation of an Ability-based Measure.

    Science.gov (United States)

    Krishnakumar, Sukumarakurup; Hopkins, Kay; Szmerekovsky, Joseph G; Robinson, Michael D

    2016-01-01

    Existing measures of Emotional Intelligence (EI), defined as the ability to perceive, understand, and manage emotions for productive purposes, have displayed limitations in predicting workplace outcomes, likely in part because they do not target this context. Such considerations led to the development of an ability EI measure with work-related scenarios in which respondents infer the likely emotions (perception) and combinations of emotion (understanding) that would occur to protagonists while rating the effectiveness of ways of responding (management). Study 1 (n = 290 undergraduates) used item-total correlations to select scenarios from a larger pool and Study 2 (n = 578) reduced the measure-termed the NEAT-to 30 scenarios on the basis of structural equation modeling. Study 3 (n = 96) then showed that the NEAT had expected correlations with personality and cognitive ability and Study 4 (n = 85) demonstrated convergent validity with other ability EI measures. Last, study 5 (n = 91) established that the NEAT had predictive validity with respect to job satisfaction, job stress, and job performance. The findings affirm the importance of EI in the workplace in the context of a valid new instrument for assessing relevant skills.

  18. The Anaclitic-Introjective Depression Assessment: Development and preliminary validity of an observer-rated measure.

    Science.gov (United States)

    Rost, Felicitas; Luyten, Patrick; Fonagy, Peter

    2018-03-01

    The two-configurations model developed by Blatt and colleagues offers a comprehensive conceptual and empirical framework for understanding depression. This model suggests that depressed patients struggle, at different developmental levels, with issues related to dependency (anaclitic issues) or self-definition (introjective issues), or a combination of both. This paper reports three studies on the development and preliminary validation of the Anaclitic-Introjective Depression Assessment, an observer-rated assessment tool of impairments in relatedness and self-definition in clinical depression based on the item pool of the Shedler-Westen Assessment Procedure. Study 1 describes the development of the measure using expert consensus rating and Q-methodology. Studies 2 and 3 report the assessment of its psychometric properties, preliminary reliability, and validity in a sample of 128 patients diagnosed with treatment-resistant depression. Four naturally occurring clusters of depressed patients were identified using Q-factor analysis, which, overall, showed meaningful and theoretically expected relationships with anaclitic/introjective prototypes as formulated by experts, as well as with clinical, social, occupational, global, and relational functioning. Taken together, findings reported in this paper provide preliminary evidence for the reliability and validity of the Anaclitic-Introjective Depression Assessment, an observer-rated measure that allows the detection of important nuanced differentiations between and within anaclitic and introjective depression. Copyright © 2017 John Wiley & Sons, Ltd.

  19. Assessing the Validity of Single-item Life Satisfaction Measures: Results from Three Large Samples

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E.

    2014-01-01

    Purpose The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS) - a more psychometrically established measure. Methods Two large samples from Washington (N=13,064) and Oregon (N=2,277) recruited by the Behavioral Risk Factor Surveillance System (BRFSS) and a representative German sample (N=1,312) recruited by the Germany Socio-Economic Panel (GSOEP) were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Results Consistent across three samples, single-item life satisfaction measures demonstrated substantial degree of criterion validity with the SWLS (zero-order r = 0.62 – 0.64; disattenuated r = 0.78 – 0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001 – 0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS were very small (average absolute difference = 0.015 −0.042). Conclusions Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answer to substantive questions regardless of which measure they use. PMID:24890827

  20. Development and validation of measures to assess prevention and control of AMR in hospitals.

    Science.gov (United States)

    Flanagan, Mindy; Ramanujam, Rangaraj; Sutherland, Jason; Vaughn, Thomas; Diekema, Daniel; Doebbeling, Bradley N

    2007-06-01

    The rapid spread of antimicrobial resistance (AMR) in the US hospitals poses serious quality and safety problems. Expert panels, identifying strategies for optimizing antibiotic use and preventing AMR spread, have recommended hospitals undertake efforts to implement specific evidence-based practices. To develop and validate a measurement scale for assessing hospitals' efforts to implement recommended AMR prevention and control measures. Surveys were mailed to infection control professionals in a national sample of 670 US hospitals stratified by geographic region, bedsize, teaching status, and VA affiliation. : Four hundred forty-eight infection control professionals participated (67% response rate). Survey items measured implementation of guideline recommendations, practices for AMR monitoring and feedback, AMR-related outcomes (methicillin-resistant Staphylococcus aureus prevalence and outbreaks [MRSA]), and organizational features. "Derivation" and "validation" samples were randomly selected. Exploratory factor analysis was performed to identify factors underlying AMR prevention and control efforts. Multiple methods were used for validation. We identified 4 empirically distinct factors in AMR prevention and control: (1) practices for antimicrobial prescription/use, (2) information/resources for AMR control, (3) practices for isolating infected patients, and (4) organizational support for infection control policies. The Prevention and Control of Antimicrobial Resistance scale was reliable and had content and construct validity. MRSA prevalence was significantly lower in hospitals with higher resource/information availability and broader organizational support. The Prevention and Control of Antimicrobial Resistance scale offers a simple yet discriminating assessment of AMR prevention and control efforts. Use should complement assessment methods based exclusively on AMR outcomes.

  1. Assessing anger regulation in middle childhood: development and validation of a behavioral observation measure

    Directory of Open Access Journals (Sweden)

    Helena Lara Rohlf

    2015-04-01

    Full Text Available An observational measure of anger regulation in middle childhood was developed that facilitated the in situ assessment of five maladaptive regulation strategies in response to an anger-eliciting task. 599 children aged 6-10 years (M = 8.12, SD = 0.92 participated in the study. Construct validity of the measure was examined through correlations with parent- and self-reports of anger regulation and anger reactivity. Criterion validity was established through links with teacher-rated aggression and social rejection measured by parent-, teacher-, and self-reports. The observational measure correlated significantly with parent- and self-reports of anger reactivity, whereas it was unrelated to parent- and self-reports of anger regulation. It also made a unique contribution to predicting aggression and social rejection.

  2. Measuring awareness of financial skills: reliability and validity of a new measure.

    Science.gov (United States)

    Cramer, K; Tuokko, H A; Mateer, C A; Hultsch, D F

    2004-03-01

    This paper examines the psychometric properties of a three-part (participant, informant, and performance) Measure for assessing Awareness of Financial Skills (MAFS). The MAFS was administered to 10 seniors with dementia and 25 well-functioning seniors, and their informants. Measures of cognitive functioning, social desirability, neuroticism, and perceived control were administered to each participant to allow for an assessment of validity. Internal consistency estimates for the participant and informant questionnaires were found to be 0.92 and 0.97, respectively. Convergent validity analysis indicated that performance on this measure was related to level of cognitive functioning, with higher level of unawareness associated with decreased cognitive ability. Discriminant validity analysis showed that performance on this measure was not related to social desirability or neuroticism. This study provides evidence that the MAFS is a reliable and valid tool for assessing awareness of financial skills in older adults.

  3. Team Emergency Assessment Measure (TEAM) for the assessment of non-technical skills during resuscitation: Validation of the French version.

    Science.gov (United States)

    Maignan, Maxime; Koch, François-Xavier; Chaix, Jordane; Phellouzat, Pierre; Binauld, Gery; Collomb Muret, Roselyne; Cooper, Simon J; Labarère, José; Danel, Vincent; Viglino, Damien; Debaty, Guillaume

    2016-04-01

    Evaluation of team performances during medical simulation must rely on validated and reproducible tools. Our aim was to build and validate a French version of the Team Emergency Assessment Measure (TEAM) score, which was developed for the assessment of team performance and non-technical skills during resuscitation. A forward and backward translation of the initial TEAM score was made, with the agreement and the final validation by the original author. Ten medical teams were recruited and performed a standardized cardiac arrest simulation scenario. Teams were videotaped and nine raters evaluate non-technical skills for each team thanks to the French TEAM Score. Psychometric properties of the score were then evaluated. French TEAM score showed an excellent reliability with a Cronbach coefficient of 0.95. Mean correlation coefficient between each item and the global score range was 0.78. The inter-rater reliability measured by intraclass correlation coefficient of the global score was 0.93. Finally, expert teams had higher French TEAM score than intermediate and novice teams. The French TEAM score shows good psychometric properties to evaluate team performance during cardiac arrest simulation. Its utilization could help in the assessment of non-technical skills during simulation. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  4. External validation of a measurement tool to assess systematic reviews (AMSTAR.

    Directory of Open Access Journals (Sweden)

    Beverley J Shea

    Full Text Available BACKGROUND: Thousands of systematic reviews have been conducted in all areas of health care. However, the methodological quality of these reviews is variable and should routinely be appraised. AMSTAR is a measurement tool to assess systematic reviews. METHODOLOGY: AMSTAR was used to appraise 42 reviews focusing on therapies to treat gastro-esophageal reflux disease, peptic ulcer disease, and other acid-related diseases. Two assessors applied the AMSTAR to each review. Two other assessors, plus a clinician and/or methodologist applied a global assessment to each review independently. CONCLUSIONS: The sample of 42 reviews covered a wide range of methodological quality. The overall scores on AMSTAR ranged from 0 to 10 (out of a maximum of 11 with a mean of 4.6 (95% CI: 3.7 to 5.6 and median 4.0 (range 2.0 to 6.0. The inter-observer agreement of the individual items ranged from moderate to almost perfect agreement. Nine items scored a kappa of >0.75 (95% CI: 0.55 to 0.96. The reliability of the total AMSTAR score was excellent: kappa 0.84 (95% CI: 0.67 to 1.00 and Pearson's R 0.96 (95% CI: 0.92 to 0.98. The overall scores for the global assessment ranged from 2 to 7 (out of a maximum score of 7 with a mean of 4.43 (95% CI: 3.6 to 5.3 and median 4.0 (range 2.25 to 5.75. The agreement was lower with a kappa of 0.63 (95% CI: 0.40 to 0.88. Construct validity was shown by AMSTAR convergence with the results of the global assessment: Pearson's R 0.72 (95% CI: 0.53 to 0.84. For the AMSTAR total score, the limits of agreement were -0.19+/-1.38. This translates to a minimum detectable difference between reviews of 0.64 'AMSTAR points'. Further validation of AMSTAR is needed to assess its validity, reliability and perceived utility by appraisers and end users of reviews across a broader range of systematic reviews.

  5. Validity of Measures Assessing Oral Health Beliefs of American Indian Parents.

    Science.gov (United States)

    Wilson, Anne R; Brega, Angela G; Thomas, Jacob F; Henderson, William G; Lind, Kimberly E; Braun, Patricia A; Batliner, Terrence S; Albino, Judith

    2018-03-05

    This aimed to validate measures of constructs included in an extended Health Belief Model (EHBM) addressing oral health beliefs among American Indian (AI) parents. Questionnaire data were collected as part of a randomized controlled trial (n = 1016) aimed at reducing childhood caries. Participants were AI parents with a preschool-age child enrolled in the Navajo Nation Head Start program. Questionnaire items addressed five EHBM constructs: perceived susceptibility, severity, barriers, benefits, and parental self-efficacy. Subscales representing each construct underwent reliability and validity testing. Internal consistency reliability of each subscale was evaluated using Cronbach's alpha. Convergent validity was assessed using linear regression to evaluate the association of each EHBM subscale with oral health-related measures. Internal consistency reliability was high for self-efficacy (α = 0.83) and perceived benefits (α = 0.83) compared to remaining EHBM subscales (α Parents with more education (p parents (ps = 0.02) and those with more education (ps oral health behavior. Female parents (p Parental knowledge was associated with all EHBM measures (ps  0.05). Parents with increased self-efficacy had greater behavioral adherence (p parents who reported higher perceived barriers (p oral health outcomes were associated with higher levels of self-efficacy (p < 0.0001) and lower levels of perceived severity (p = 0.02) and barriers (p = 0.05). Results support the value of questionnaire items addressing the EHBM subscales, which functioned in a manner consistent with the EHBM theoretical framework in AI participants.

  6. Valid Competency Assessment in Higher Education

    Directory of Open Access Journals (Sweden)

    Olga Zlatkin-Troitschanskaia

    2017-01-01

    Full Text Available The aim of the 15 collaborative projects conducted during the new funding phase of the German research program Modeling and Measuring Competencies in Higher Education—Validation and Methodological Innovations (KoKoHs is to make a significant contribution to advancing the field of modeling and valid measurement of competencies acquired in higher education. The KoKoHs research teams assess generic competencies and domain-specific competencies in teacher education, social and economic sciences, and medicine based on findings from and using competency models and assessment instruments developed during the first KoKoHs funding phase. Further, they enhance, validate, and test measurement approaches for use in higher education in Germany. Results and findings are transferred at various levels to national and international research, higher education practice, and education policy.

  7. Assessment of validity with polytrauma Veteran populations.

    Science.gov (United States)

    Bush, Shane S; Bass, Carmela

    2015-01-01

    Veterans with polytrauma have suffered injuries to multiple body parts and organs systems, including the brain. The injuries can generate a triad of physical, neurologic/cognitive, and emotional symptoms. Accurate diagnosis is essential for the treatment of these conditions and for fair allocation of benefits. To accurately diagnose polytrauma disorders and their related problems, clinicians take into account the validity of reported history and symptoms, as well as clinical presentations. The purpose of this article is to describe the assessment of validity with polytrauma Veteran populations. Review of scholarly and other relevant literature and clinical experience are utilized. A multimethod approach to validity assessment that includes objective, standardized measures increases the confidence that can be placed in the accuracy of self-reported symptoms and physical, cognitive, and emotional test results. Due to the multivariate nature of polytrauma and the multiple disciplines that play a role in diagnosis and treatment, an ideal model of validity assessment with polytrauma Veteran populations utilizes neurocognitive, neurological, neuropsychiatric, and behavioral measures of validity. An overview of these validity assessment approaches as applied to polytrauma Veteran populations is presented. Veterans, the VA, and society are best served when accurate diagnoses are made.

  8. Psychometric validation of patient-reported outcome measures assessing chronic constipation

    Directory of Open Access Journals (Sweden)

    Nelson LM

    2014-09-01

    Full Text Available Lauren M Nelson,1 Valerie SL Williams,1 Sheri E Fehnel,1 Robyn T Carson,2 James MacDougall,3 Mollie J Baird,3 Stavros Tourkodimitris,2 Caroline B Kurtz,3 Jeffrey M Johnston31RTI Health Solutions, Durham, NC, USA; 2Forest Research Institute, Jersey City, NJ, USA; 3Ironwood Pharmaceuticals, Cambridge, MA, USABackground: Measures assessing treatment outcomes in previous CC clinical trials have not met the requirements described in the US Food and Drug Administration's guidance on patient-reported outcomes.Aim: Psychometric analyses using data from one Phase IIb study and two Phase III trials of linaclotide for the treatment of chronic constipation (CC were conducted to document the measurement properties of patient-reported CC Symptom Severity Measures.Study methods: Each study had a multicenter, randomized, double-blind, placebo-controlled, parallel-group design, comparing placebo to four doses of oral linaclotide taken once daily for 4 weeks in the Phase IIb dose-ranging study (n=307 and to two doses of linaclotide taken once daily for 12 weeks in the Phase III trials (n=1,272. The CC Symptom Severity Measures addressing bowel function (Bowel Movement Frequency, Stool Consistency, Straining and abdominal symptoms (Bloating, Abdominal Discomfort, Abdominal Pain were administered daily using interactive voice-response system technology. Intraclass correlations, Pearson correlations, factor analyses, F-tests, and effect sizes were computed.Results: The CC Symptom Severity Measures demonstrated satisfactory test–retest reliability and construct validity. Factor analyses indicated one factor for abdominal symptoms and another for bowel symptoms. Known-groups F-tests substantiated the discriminating ability of the CC Symptom Severity Measures. Responsiveness statistics were moderate to strong, indicating that these measures are capable of detecting change.Conclusion: In large studies of CC patients, linaclotide significantly improved abdominal and

  9. Competency measurements: testing convergent validity for two measures.

    Science.gov (United States)

    Cowin, Leanne S; Hengstberger-Sims, Cecily; Eagar, Sandy C; Gregory, Linda; Andrew, Sharon; Rolley, John

    2008-11-01

    This paper is a report of a study to investigate whether the Australian National Competency Standards for Registered Nurses demonstrate correlations with the Finnish Nurse Competency Scale. Competency assessment has become popular as a key regulatory requirement and performance indicator. The term competency, however, does not have a globally accepted definition and this has the potential to create controversy, ambiguity and confusion. Variations in meaning and definitions adopted in workplaces and educational settings will affect the interpretation of research findings and have implications for the nursing profession. A non-experimental cross-sectional survey design was used with a convenience sample of 116 new graduate nurses in 2005. The second version of the Australian National Competency Standards and the Nurse Competency Scale was used to elicit responses to self-assessed competency in the transitional year (first year as a Registered Nurse). Correlational analysis of self-assessed levels of competence revealed a relationship between the Australian National Competency Standards (ANCI) and the Nurse Competency Scale (NCS). The correlational relation between ANCI domains and NCS factors suggests that these scales are indeed used to measure related dimensions. A statistically significant relationship (r = 0.75) was found between the two competency measures. Although the finding of convergent validity is insufficient to establish construct validity for competency as used in both measures in this study, it is an important step towards this goal. Future studies on relationships between competencies must take into account the validity and reliability of the tools.

  10. Validity and Reliability of Field-Based Measures for Assessing Movement Skill Competency in Lifelong Physical Activities: A Systematic Review.

    Science.gov (United States)

    Hulteen, Ryan M; Lander, Natalie J; Morgan, Philip J; Barnett, Lisa M; Robertson, Samuel J; Lubans, David R

    2015-10-01

    It has been suggested that young people should develop competence in a variety of 'lifelong physical activities' to ensure that they can be active across the lifespan. The primary aim of this systematic review is to report the methodological properties, validity, reliability, and test duration of field-based measures that assess movement skill competency in lifelong physical activities. A secondary aim was to clearly define those characteristics unique to lifelong physical activities. A search of four electronic databases (Scopus, SPORTDiscus, ProQuest, and PubMed) was conducted between June 2014 and April 2015 with no date restrictions. Studies addressing the validity and/or reliability of lifelong physical activity tests were reviewed. Included articles were required to assess lifelong physical activities using process-oriented measures, as well as report either one type of validity or reliability. Assessment criteria for methodological quality were adapted from a checklist used in a previous review of sport skill outcome assessments. Movement skill assessments for eight different lifelong physical activities (badminton, cycling, dance, golf, racquetball, resistance training, swimming, and tennis) in 17 studies were identified for inclusion. Methodological quality, validity, reliability, and test duration (time to assess a single participant), for each article were assessed. Moderate to excellent reliability results were found in 16 of 17 studies, with 71% reporting inter-rater reliability and 41% reporting intra-rater reliability. Only four studies in this review reported test-retest reliability. Ten studies reported validity results; content validity was cited in 41% of these studies. Construct validity was reported in 24% of studies, while criterion validity was only reported in 12% of studies. Numerous assessments for lifelong physical activities may exist, yet only assessments for eight lifelong physical activities were included in this review

  11. Cognitive Assessment Interview (CAI): Validity as a co-primary measure of cognition across phases of schizophrenia.

    Science.gov (United States)

    Ventura, Joseph; Subotnik, Kenneth L; Ered, Arielle; Hellemann, Gerhard S; Nuechterlein, Keith H

    2016-04-01

    Progress has been made in developing interview-based measures for the assessment of cognitive functioning, such as the Cognitive Assessment Interview (CAI), as co-primary measures that compliment objective neurocognitive assessments and daily functioning. However, a few questions remain, including whether the relationships with objective cognitive measures and daily functioning are high enough to justify the CAI as an co-primary measure and whether patient-only assessments are valid. Participants were first-episode schizophrenia patients (n=60) and demographically-similar healthy controls (n=35), chronic schizophrenia patients (n=38) and demographically similar healthy controls (n=19). Participants were assessed at baseline with an interview-based measure of cognitive functioning (CAI), a test of objective cognitive functioning, functional capacity, and role functioning at baseline, and in the first episode patients again 6 months later (n=28). CAI ratings were correlated with objective cognitive functioning, functional capacity, and functional outcomes in first-episode schizophrenia patients at similar magnitudes as in chronic patients. Comparisons of first-episode and chronic patients with healthy controls indicated that the CAI sensitively detected deficits in schizophrenia. The relationship of CAI Patient-Only ratings with objective cognitive functioning, functional capacity, and daily functioning were comparable to CAI Rater scores that included informant information. These results confirm in an independent sample the relationship of the CAI ratings with objectively measured cognition, functional capacity, and role functioning. Comparison of schizophrenia patients with healthy controls further validates the CAI as an co-primary measure of cognitive deficits. Also, CAI change scores were strongly related to objective cognitive change indicating sensitivity to change. Copyright © 2016 Elsevier B.V. All rights reserved.

  12. Validation of the high performance leadership competencies as measured by an assessment centre in-basket

    Directory of Open Access Journals (Sweden)

    H. H. Spangenberg

    2003-10-01

    Full Text Available The purpose of this study was to validate Schroder’s High Performance Leadership Competencies (HPLCs, measured by a specially designed In-basket, against multiple criteria. These consisted of six measures of managerial success, representing managerial advancement and salary progress criteria, and a newly developed comprehensive measure of work unit performance, the Performance Index. An environmental dynamism and complexity questionnaire served as moderator variable. Results indicated disappointing predictive validity quotients for the HPLCs as measured by an In-basket, in contrast to satisfactory predictive and construct validity obtained in previous studies by means of a full assessment centre. The implications of the findings are discussed and suggestions are made for improving the validity of the In-basket. Opsomming Die doel van hierdie studie was die validering van Schroder se Hoëvlak Leierskapsbevoegdhede, gemeet deur ‘n spesiaal ontwerpte Posmandjie, teen veelvoudige kriteria. Dit behels ses metings van bestuursukses wat bestuursbevorderings- en salarisvorderingskriteria insluit, sowel as ‘n nuutontwikkelde, omvattende meting van werkeenheidsprestasie, die Prestasie indeks. ‘n Vraelys wat die dinamika en kompleksiteit van die omgewing meet, het as moderator veranderlike gedien. Resultate dui op teleurstellende geldigheidskwosiënte vir die Hoëvlak Leierskapsbevoegdhede soos gemeet deur ‘n posmandjie, in teenstelling met bevredigende voorspellings- en konstrukgeldigheid wat in vorige studies deur middel van ‘n volle takseersentrum verkry is. Die bevindinge word bespreek en voorstelle word gemaak om die geldigheidskwosiënte te verbeter.

  13. Validating a measure to assess factors that affect assistive technology use by students with disabilities in elementary and secondary education.

    Science.gov (United States)

    Zapf, Susan A; Scherer, Marcia J; Baxter, Mary F; H Rintala, Diana

    2016-01-01

    The purpose of this study was to measure the predictive validity, internal consistency and clinical utility of the Matching Assistive Technology to Child & Augmentative Communication Evaluation Simplified (MATCH-ACES) assessment. Twenty-three assistive technology team evaluators assessed 35 children using the MATCH-ACES assessment. This quasi-experimental study examined the internal consistency, predictive validity and clinical utility of the MATCH-ACES assessment. The MATCH-ACES assessment predisposition scales had good internal consistency across all three scales. A significant relationship was found between (a) high student perseverance and need for assistive technology and (b) high teacher comfort and interest in technology use (p = (0).002). Study results indicate that the MATCH-ACES assessment has good internal consistency and validity. Predisposition characteristics of student and teacher combined can influence the level of assistive technology use; therefore, assistive technology teams should assess predisposition factors of the user when recommending assistive technology. Implications for Rehabilitation Educational and medical professionals should be educated on evidence-based assistive technology assessments. Personal experience and psychosocial factors can influence the outcome use of assistive technology. Assistive technology assessments must include an intervention plan for assistive technology service delivery to measure effective outcome use.

  14. Development and Validation of the Life Sciences Assessment: A Measure of Preschool Children's Conceptions of Basic Life Sciences

    Science.gov (United States)

    Maherally, Uzma Nooreen

    2014-01-01

    The purpose of this study was to develop and validate a science assessment tool termed the Life Sciences Assessment (LSA) in order to assess preschool children's conceptions of basic life sciences. The hypothesis was that the four sub-constructs, each of which can be measured through a series of questions on the LSA, will make a significant…

  15. Initial development and preliminary validation of a new negative symptom measure: the Clinical Assessment Interview for Negative Symptoms (CAINS).

    Science.gov (United States)

    Forbes, Courtney; Blanchard, Jack J; Bennett, Melanie; Horan, William P; Kring, Ann; Gur, Raquel

    2010-12-01

    As part of an ongoing scale development process, this study provides an initial examination of the psychometric properties and validity of a new interview-based negative symptom instrument, the Clinical Assessment Interview for Negative Symptoms (CAINS), in outpatients with schizophrenia or schizoaffective disorder (N = 37). The scale was designed to address limitations of existing measures and to comprehensively assess five consensus-based negative symptoms: asociality, avolition, anhedonia (consummatory and anticipatory), affective flattening, and alogia. Results indicated satisfactory internal consistency reliability for the total CAINS scale score and promising inter-rater agreement, with clear areas identified in need of improvement. Convergent validity was evident in general agreement between the CAINS and alternative negative symptom measures. Further, CAINS subscales significantly correlated with relevant self-report emotional experience measures as well as with social functioning. Discriminant validity of the CAINS was strongly supported by its small, non-significant relations with positive symptoms, general psychiatric symptoms, and depression. These preliminary data on an early beta-version of the CAINS provide initial support for this new assessment approach to negative symptoms and suggest directions for further scale development. Copyright © 2010 Elsevier B.V. All rights reserved.

  16. Level validity of self-report whole-family measures.

    Science.gov (United States)

    Manders, Willeke A; Cook, William L; Oud, Johan H L; Scholte, Ron H J; Janssens, Jan M A M; De Bruyn, Eric E J

    2007-12-01

    This article introduces an approach to testing the level validity of family assessment instruments (i.e., whether a family instrument measures family functioning at the level of the system it purports to assess). Two parents and 2 adolescents in 69 families rated the warmth in each of their family relationships and in the family as a whole. Family members' ratings of whole-family warmth assessed family functioning not only at the family level (i.e., characteristics of the family as a whole) but also at the individual level of analysis (i.e., characteristics of family members as raters), indicating a lack of level validity. Evidence was provided for the level validity of a latent variable based on family members' ratings of whole-family warmth. The findings underscore the importance of assessing the level validity of individual ratings of whole-family functioning.

  17. Measuring Nutrition Literacy in Spanish-Speaking Latinos: An Exploratory Validation Study.

    Science.gov (United States)

    Gibbs, Heather D; Camargo, Juliana M T B; Owens, Sarah; Gajewski, Byron; Cupertino, Ana Paula

    2017-11-21

    Nutrition is important for preventing and treating chronic diseases highly prevalent among Latinos, yet no tool exists for measuring nutrition literacy among Spanish speakers. This study aimed to adapt the validated Nutrition Literacy Assessment Instrument for Spanish-speaking Latinos. This study was developed in two phases: adaptation and validity testing. Adaptation included translation, expert item content review, and interviews with Spanish speakers. For validity testing, 51 participants completed the Short Assessment of Health Literacy-Spanish (SAHL-S), the Nutrition Literacy Assessment Instrument in Spanish (NLit-S), and socio-demographic questionnaire. Validity and reliability statistics were analyzed. Content validity was confirmed with a Scale Content Validity Index of 0.96. Validity testing demonstrated NLit-S scores were strongly correlated with SAHL-S scores (r = 0.52, p internal consistency was excellent (Cronbach's α = 0.92). The NLit-S demonstrates validity and reliability for measuring nutrition literacy among Spanish-speakers.

  18. Knowing Every Child: Validation of the Holistic Student Assessment (HSA) as a Measure of Social-Emotional Development.

    Science.gov (United States)

    Malti, Tina; Zuffianò, Antonio; Noam, Gil G

    2018-04-01

    Knowing every child's social-emotional development is important as it can support prevention and intervention approaches to meet the developmental needs and strengths of children. Here, we discuss the role of social-emotional assessment tools in planning, implementing, and evaluating preventative strategies to promote mental health in all children and adolescents. We, first, selectively review existing tools and identify current gaps in the measurement literature. Next, we introduce the Holistic Student Assessment (HSA), a tool that is based in our social-emotional developmental theory, The Clover Model, and designed to measure social-emotional development in children and adolescents. Using a sample of 5946 students (51% boys, M age  = 13.16 years), we provide evidence for the psychometric validity of the self-report version of the HSA. First, we document the theoretically expected 7-dimension factor structure in a calibration sub-sample (n = 984) and cross-validate its structure in a validation sub-sample (n = 4962). Next, we show measurement invariance across development, i.e., late childhood (9- to 11-year-olds), early adolescence (12- to 14-year-olds), and middle adolescence (15- to 18-year-olds), and evidence for the HSA's construct validity in each age group. The findings support the robustness of the factor structure and confirm its developmental sensitivity. Structural equation modeling validity analysis in a multiple-group framework indicates that the HSA is associated with mental health in expected directions across ages. Overall, these findings show the psychometric properties of the tool, and we discuss how social-emotional tools such as the HSA can guide future research and inform large-scale dissemination of preventive strategies.

  19. Validation of the Organizational Culture Assessment Instrument

    Science.gov (United States)

    Heritage, Brody; Pollock, Clare; Roberts, Lynne

    2014-01-01

    Organizational culture is a commonly studied area in industrial/organizational psychology due to its important role in workplace behaviour, cognitions, and outcomes. Jung et al.'s [1] review of the psychometric properties of organizational culture measurement instruments noted many instruments have limited validation data despite frequent use in both theoretical and applied situations. The Organizational Culture Assessment Instrument (OCAI) has had conflicting data regarding its psychometric properties, particularly regarding its factor structure. Our study examined the factor structure and criterion validity of the OCAI using robust analysis methods on data gathered from 328 (females = 226, males = 102) Australian employees. Confirmatory factor analysis supported a four factor structure of the OCAI for both ideal and current organizational culture perspectives. Current organizational culture data demonstrated expected reciprocally-opposed relationships between three of the four OCAI factors and the outcome variable of job satisfaction but ideal culture data did not, thus indicating possible weak criterion validity when the OCAI is used to assess ideal culture. Based on the mixed evidence regarding the measure's properties, further examination of the factor structure and broad validity of the measure is encouraged. PMID:24667839

  20. Validation of the organizational culture assessment instrument.

    Directory of Open Access Journals (Sweden)

    Brody Heritage

    Full Text Available Organizational culture is a commonly studied area in industrial/organizational psychology due to its important role in workplace behaviour, cognitions, and outcomes. Jung et al.'s [1] review of the psychometric properties of organizational culture measurement instruments noted many instruments have limited validation data despite frequent use in both theoretical and applied situations. The Organizational Culture Assessment Instrument (OCAI has had conflicting data regarding its psychometric properties, particularly regarding its factor structure. Our study examined the factor structure and criterion validity of the OCAI using robust analysis methods on data gathered from 328 (females = 226, males = 102 Australian employees. Confirmatory factor analysis supported a four factor structure of the OCAI for both ideal and current organizational culture perspectives. Current organizational culture data demonstrated expected reciprocally-opposed relationships between three of the four OCAI factors and the outcome variable of job satisfaction but ideal culture data did not, thus indicating possible weak criterion validity when the OCAI is used to assess ideal culture. Based on the mixed evidence regarding the measure's properties, further examination of the factor structure and broad validity of the measure is encouraged.

  1. Reliable and valid assessment of Lichtenstein hernia repair skills

    DEFF Research Database (Denmark)

    Carlsen, C G; Lindorff Larsen, Karen; Funch-Jensen, P

    2014-01-01

    PURPOSE: Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity...... of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. METHODS: Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia...... a significant difference between the three groups which indicates construct validity, p skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment...

  2. The Patient Assessment Questionnaire: initial validation of a measure of treatment effectiveness for patients with schizophrenia and schizoaffective disorder.

    Science.gov (United States)

    Mojtabai, Ramin; Corey-Lisle, Patricia K; Ip, Edward Hak-Sing; Kopeykina, Irina; Haeri, Sophia; Cohen, Lisa Janet; Shumaker, Sally

    2012-12-30

    Investigation of patients' subjective perspective regarding the effectiveness - as opposed to efficacy - of antipsychotic medication has been hampered by a relative shortage of self-report measures of global clinical outcome. This paper presents data supporting the feasibility, inter-item consistency, and construct validity of the Patient Assessment Questionnaire (PAQ)-a self-report measure of psychiatric symptoms, medication side effects and general wellbeing, ultimately intended to assess effectiveness of interventions for schizophrenia-spectrum patients. The original 53-item instrument was developed by a multidisciplinary team which utilized brainstorming sessions for item generation and content analysis, patient focus groups, and expert panel reviews. This instrument and additional validation measures were administered, via Audio Computer-Assisted Self-Interviewing (ACASI), to 300 stable, medicated outpatients diagnosed with schizophrenia or schizoaffective disorder. Item elimination was based on psychometric properties and Item-Response Theory information functions and characteristic curves. Exploratory factor analysis of the resulting 40-item scale yielded a five factor solution. The five subscales (General Distress, Side Effects, Psychotic Symptoms, Cognitive Symptoms, Sleep) showed robust convergent (β's=0.34-0.75, average β=0.49) and discriminant validity. The PAQ demonstrates feasibility, reliability, and construct validity as a self-report measure of multiple domains pertinent to effectiveness. Future research needs to establish the PAQ's sensitivity to change. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  3. Validating the Assessment for Measuring Indonesian Secondary School Students Performance in Ecology

    Science.gov (United States)

    Rachmatullah, A.; Roshayanti, F.; Ha, M.

    2017-09-01

    The aims of this current study are validating the American Association for the Advancement of Science (AAAS) Ecology assessment and examining the performance of Indonesian secondary school students on the assessment. A total of 611 Indonesian secondary school students (218 middle school students and 393 high school students) participated in the study. Forty-five items of AAAS assessment in the topic of Interdependence in Ecosystems were divided into two versions which every version has 21 similar items. Linking item method was used as the method to combine those two versions of assessment and further Rasch analyses were utilized to validate the instrument. Independent sample t-test was also run to compare the performance of Indonesian students and American students based on the mean of item difficulty. We found that from the total of 45 items, three items were identified as misfitting items. Later on, we also found that both Indonesian middle and high school students were significantly lower performance with very large and medium effect size compared to American students. We will discuss our findings in the regard of validation issue and the connection to Indonesian student’s science literacy.

  4. Development of an assessment tool to measure students′ perceptions of respiratory care education programs: Item generation, item reduction, and preliminary validation

    Directory of Open Access Journals (Sweden)

    Ghazi Alotaibi

    2013-01-01

    Full Text Available Objectives: Students who perceived their learning environment positively are more likely to develop effective learning strategies, and adopt a deep learning approach. Currently, there is no validated instrument for measuring the educational environment of educational programs on respiratory care (RC. The aim of this study was to develop an instrument to measure students′ perception of the RC educational environment. Materials and Methods: Based on the literature review and an assessment of content validity by multiple focus groups of RC educationalists, potential items of the instrument relevant to RC educational environment construct were generated by the research group. The initial 71 item questionnaire was then field-tested on all students from the 3 RC programs in Saudi Arabia and was subjected to multi-trait scaling analysis. Cronbach′s alpha was used to assess internal consistency reliabilities. Results: Two hundred and twelve students (100% completed the survey. The initial instrument of 71 items was reduced to 65 across 5 scales. Convergent and discriminant validity assessment demonstrated that the majority of items correlated more highly with their intended scale than a competing one. Cronbach′s alpha exceeded the standard criterion of >0.70 in all scales except one. There was no floor or ceiling effect for scale or overall score. Conclusions: This instrument is the first assessment tool developed to measure the RC educational environment. There was evidence of its good feasibility, validity, and reliability. This first validation of the instrument supports its use by RC students to evaluate educational environment.

  5. Development and validation of the functional assessment of chronic illness therapy treatment satisfaction (FACIT TS) measures.

    Science.gov (United States)

    Peipert, John D; Beaumont, Jennifer L; Bode, Rita; Cella, Dave; Garcia, Sofia F; Hahn, Elizabeth A

    2014-04-01

    To develop and validate a new functional assessment of chronic illness therapy (FACIT) measure of satisfaction with treatment for chronic illnesses such as cancer and HIV/AIDS. To define domains and generate items, a literature review informed creation of semi-structured interview guides for patients and an international expert panel of clinicians and researchers. Patients and experts also rated 15 areas of satisfaction for relevance. The final list of items underwent further refinement by the original expert panel and a new group of clinical experts. Items were tested in four studies (primarily lung cancer) and data were pooled for analysis. Exploratory and confirmatory factor analyses (CFA), and item response theory modeling were conducted to evaluate dimensionality. Internal consistency reliability and test-retest reliability were both evaluated. Validity was evaluated by correlating the FACIT subscale scores and measures of comparable concepts and by testing the scales' ability to distinguish people according to their overall treatment satisfaction. Two instruments were created: the FACIT TS-general (G), an overall evaluation of current treatment, and the FACIT TS-patient satisfaction (PS), a measure of patient satisfaction. CFA results were not optimal for a five-factor solution for PS. Internal consistency reliability met psychometric standards (≥0.70) for all PS subscales. Construct validity was established for the PS subscales: Physician Communication, Treatment Staff Communication, Technical Competence, Confidence and Trust, and Nurse Communication. The two instruments generated here offer a new way to assess several key dimensions of patient satisfaction with treatment, especially for people with lung cancer.

  6. Measures of agreement between computation and experiment:validation metrics.

    Energy Technology Data Exchange (ETDEWEB)

    Barone, Matthew Franklin; Oberkampf, William Louis

    2005-08-01

    With the increasing role of computational modeling in engineering design, performance estimation, and safety assessment, improved methods are needed for comparing computational results and experimental measurements. Traditional methods of graphically comparing computational and experimental results, though valuable, are essentially qualitative. Computable measures are needed that can quantitatively compare computational and experimental results over a range of input, or control, variables and sharpen assessment of computational accuracy. This type of measure has been recently referred to as a validation metric. We discuss various features that we believe should be incorporated in a validation metric and also features that should be excluded. We develop a new validation metric that is based on the statistical concept of confidence intervals. Using this fundamental concept, we construct two specific metrics: one that requires interpolation of experimental data and one that requires regression (curve fitting) of experimental data. We apply the metrics to three example problems: thermal decomposition of a polyurethane foam, a turbulent buoyant plume of helium, and compressibility effects on the growth rate of a turbulent free-shear layer. We discuss how the present metrics are easily interpretable for assessing computational model accuracy, as well as the impact of experimental measurement uncertainty on the accuracy assessment.

  7. The Myotonometer: Not a Valid Measurement Tool for Active Hamstring Musculotendinous Stiffness.

    Science.gov (United States)

    Pamukoff, Derek N; Bell, Sarah E; Ryan, Eric D; Blackburn, J Troy

    2016-05-01

    Hamstring musculotendinous stiffness (MTS) is associated with lower-extremity injury risk (ie, hamstring strain, anterior cruciate ligament injury) and is commonly assessed using the damped oscillatory technique. However, despite a preponderance of studies that measure MTS reliably in laboratory settings, there are no valid clinical measurement tools. A valid clinical measurement technique is needed to assess MTS and permit identification of individuals at heightened risk of injury and track rehabilitation progress. To determine the validity and reliability of the Myotonometer for measuring active hamstring MTS. Descriptive laboratory study. Laboratory. 33 healthy participants (15 men, age 21.33 ± 2.94 y, height 172.03 ± 16.36 cm, mass 74.21 ± 16.36 kg). Hamstring MTS was assessed using the damped oscillatory technique and the Myotonometer. Intraclass correlations were used to determine the intrasession, intersession, and interrater reliability of the Myotonometer. Criterion validity was assessed via Pearson product-moment correlation between MTS measures obtained from the Myotonometer and from the damped oscillatory technique. The Myotonometer demonstrated good intrasession (ICC3,1 = .807) and interrater reliability (ICC2,k = .830) and moderate intersession reliability (ICC2,k = .693). However, it did not provide a valid measurement of MTS compared with the damped oscillatory technique (r = .346, P = .061). The Myotonometer does not provide a valid measure of active hamstring MTS. Although the Myotonometer does not measure active MTS, it possesses good reliability and portability and could be used clinically to measure tissue compliance, muscle tone, or spasticity associated with multiple musculoskeletal disorders. Future research should focus on portable and clinically applicable tools to measure active hamstring MTS in efforts to prevent and monitor injuries.

  8. Validation of Questionnaire-Assessed Physical Activity in Comparison With Objective Measures Using Accelerometers and Physical Performance Measures Among Community-Dwelling Adults Aged ≥85 Years in Tokyo, Japan.

    Science.gov (United States)

    Oguma, Yuko; Osawa, Yusuke; Takayama, Michiyo; Abe, Yukiko; Tanaka, Shigeho; Lee, I-Min; Arai, Yasumichi

    2017-04-01

    To date, there is no physical activity (PA) questionnaire with convergent and construct validity for the oldest-old. The aim of the current study was to investigate the validity of questionnaire-assessed PA in comparison with objective measures determined by uniaxial and triaxial accelerometers and physical performance measures in the oldest-old. Participants were 155 elderly (mean age 90 years) who were examined at the university and agreed to wear an accelerometer for 7 days in the 3-year-follow-up survey of the Tokyo Oldest-Old Survey of Total Health. Fifty-nine participants wore a uniaxial and triaxial accelerometer simultaneously. Self-rated walking, exercise, and household PA were measured using a modified Zutphen PA Questionnaire (PAQ). Several physical performance tests were done, and the associations among PAQ, accelerometer-assessed PA, and physical performances were compared by Spearman's correlation coefficients. Significant, low to moderate correlations between PA measures were seen on questionnaire and accelerometer assessments (ρ = 0.19 to 0.34). Questionnaireassessed PA measure were correlated with a range of lower extremity performance (ρ = 0.21 to 0.29). This PAQ demonstrated convergent and construct validity. Our findings suggest that the PAQ can reasonably be used in this oldest-old population to rank their PA level.

  9. Construct validity of 2 measures to assess reasons for antipsychotic discontinuation and continuation from patients’ and clinicians’ perspectives in a clinical trial

    Directory of Open Access Journals (Sweden)

    Faries Douglas

    2012-09-01

    Full Text Available Abstract Background Little is known about the specific reasons for antipsychotic discontinuation or continuation from patients’ or clinicians’ perspectives. This study aimed to assess the construct validity of 2 new measures of the Reasons for Antipsychotic Discontinuation/Continuation (RAD: RAD-I (a structured interview assessing the patient’s perspective and RAD-Q (a questionnaire assessing the clinician’s perspective. Methods Data were used from a 12-week antipsychotic trial of schizophrenia patients in which the RAD was administered at study entry and at study completion (or discontinuation. Construct validity was assessed through comparisons of RAD responses, clinicians’ responses to a standard patient disposition form identifying reasons for patient’s study discontinuation, and several standard psychiatric measures. Percent agreement quantified the correspondence between patient and clinician scores. Results Patients indicating lack of improvement/worsening of positive symptoms as a ‘somewhat’ to ‘primary’ reason for medication discontinuation had statistically significantly less improvement in Positive and Negative Syndrome Scale positive score than patients not reporting these as a reason (concurrent validity. Similar results were observed for the RAD negative symptom, functional, social support, and adherence items, whereas the mood and cognitive items were not significantly associated with change scores on standard psychiatric measures. Responses to the RAD were also weakly associated with variables that theoretically should not be related to them (divergent validity. Level of agreement between the clinician- and patient-rated RAD scores was high (60%-100%. Conclusions Initial validation of the RAD suggests that the instruments are valid tools for gathering detailed information regarding reasons for antipsychotic discontinuation and continuation from patients’ and clinicians’ perspectives.

  10. Method validation to measure Strontium-90 in urine sample for internal dosimetry assessment

    International Nuclear Information System (INIS)

    Bitar, A.; Maghrabi, M.; Alhamwi, A.

    2010-12-01

    Occupational individuals exposed at some scientific centers in Syrian Arab Republic to potentially significant intake by ingestion or inhalation during process of producing radiopharmaceutical compounds. The received radioactive intake differs in relation to the amount of radionuclides released during the preparation processes, to the work conditions and to the applying ways of the radiation protection procedures. TLD (Thermoluminescence Dosimeter) is usually used for external radiation monitoring for workers in radioisotope centers. During the external monitoring programme, it was noticed that some workers were exposed to high external dose resultant from radiation accident in their laboratory when preparing Y-90 from Sr-90. For internal dose assessment, chemical method to measure the amount of Sr-90 in urine samples was validated and explained in details in this study. Urine bioassays were carried out and the activities of 90 Sr were determined using liquid scintillation counter. Then, the validated method was used for internal occupational monitoring purposes through the design of internal monitoring programme. The programme was established for four workers who are dealing, twice per month, with an amount of about 20 mCi in each time. At the beginning, theoretical study was done to assess maximum risks for workers. Calculated internal doses showed that it is necessary to apply internal routine monitoring programme for those workers. (author)

  11. A preliminary study to assess the construct validity of a cultural intelligence measure on a South African sample

    Directory of Open Access Journals (Sweden)

    Bright Mahembe

    2014-09-01

    Research purpose: The purpose of the current study was to assess the construct validity of the CQS on a South African sample. The results of the psychometric assessment offer some important insights into the factor structure of the cultural intelligence construct. Motivation for the study: The current study sought to provide some practical validity confirmation of the CQS for the effective management of cultural diversity in the South African context. Research approach, design and method: The CQS was administered on a non-probability sample of 229 young adults in South Africa. Item analysis was performed to ascertain reliability. Exploratory factor analysis was used to test the unidimensionality of CQS subscales. The first-order and second-order factor structures underlying contemporary models of cultural intelligence were tested using confirmatory factor analysis. Main findings: Results indicated that the CQS is a reliable and valid measure of cultural intelligence as evidenced by the high internal consistency coefficients in all the subscales. Good construct validity for both the first-order and second-order models was obtained via confirmatory factor analysis. Practical/managerial implications: The study finds good measurement properties of the CQS in a South African context. The CQS can be confidently used for applications such as selecting, training and developing a more culturally competent workforce. Contribution: The study extends the body of knowledge on the reliability and construct validity of the CQS in the South African milieu. It further indicates that cultural intelligence can be represented by a general cultural intelligence factor that drives more specific dimensions of cultural intelligence.

  12. Issues in developing valid assessments of speech pathology students' performance in the workplace.

    Science.gov (United States)

    McAllister, Sue; Lincoln, Michelle; Ferguson, Alison; McAllister, Lindy

    2010-01-01

    Workplace-based learning is a critical component of professional preparation in speech pathology. A validated assessment of this learning is seen to be 'the gold standard', but it is difficult to develop because of design and validation issues. These issues include the role and nature of judgement in assessment, challenges in measuring quality, and the relationship between assessment and learning. Valid assessment of workplace-based performance needs to capture the development of competence over time and account for both occupation specific and generic competencies. This paper reviews important conceptual issues in the design of valid and reliable workplace-based assessments of competence including assessment content, process, impact on learning, measurement issues, and validation strategies. It then goes on to share what has been learned about quality assessment and validation of a workplace-based performance assessment using competency-based ratings. The outcomes of a four-year national development and validation of an assessment tool are described. A literature review of issues in conceptualizing, designing, and validating workplace-based assessments was conducted. Key factors to consider in the design of a new tool were identified and built into the cycle of design, trialling, and data analysis in the validation stages of the development process. This paper provides an accessible overview of factors to consider in the design and validation of workplace-based assessment tools. It presents strategies used in the development and national validation of a tool COMPASS, used in an every speech pathology programme in Australia, New Zealand, and Singapore. The paper also describes Rasch analysis, a model-based statistical approach which is useful for establishing validity and reliability of assessment tools. Through careful attention to conceptual and design issues in the development and trialling of workplace-based assessments, it has been possible to develop the

  13. Validity and Responsiveness of Concept Map Assessment Scores in Physical Education

    Science.gov (United States)

    Lee, Yun Soo; Jang, Yongkyu; Kang, Minsoo

    2015-01-01

    Concept map assessment has been applied to many education areas to measure students' knowledge structure. However, the proper and valid use of concept map assessment has not been examined in physical education. The purpose of this study was to evaluate the evidence of validity and responsiveness of the concept map assessment scores in physical…

  14. Development and Validation of a Video Measure for Assessing Women’s Risk Perception for Alcohol-Related Sexual Assault

    Science.gov (United States)

    Parks, Kathleen A.; Levonyan-Radloff, Kristine; Dearing, Ronda L.; Hequembourg, Amy; Testa, Maria

    2016-01-01

    Objective Using an iterative process, a series of three video scenarios were developed for use as a standardized measure for assessing women’s perception of risks for alcohol-related sexual assault (SA). The videos included ambiguous and clear behavioral and environmental risk cues. Method Focus group discussions with young, female heavy drinkers (N = 42) were used to develop three videos at different risk levels (low, moderate, and high) in Study 1. Realism, reliability, and validity of the videos were assessed using multiple methods in Studies 2 and 3. One hundred-four women were used to compare differences in risk perception across the video risk level in Study 2. In Study 3 (N = 60), we assessed women’s perceptions of the low and high risk videos under conditions of no alcohol and alcohol. Results The realism and reliability of the videos were good. Women who viewed the low risk video compared to women who viewed the moderate and high risk videos perceived less risk for SA. We found an interaction between alcohol and risk perception such that, women in the alcohol condition were less likely to perceive risk when watching the high risk video. Conclusions As the video risk level increased, women’s perception of risk increased. These findings provide convergent evidence for the validity of the video measure. Given the limited number of standardized scenarios for assessing risk perception for sexual assault, our findings suggest that these videos may provide a needed standardized measure. PMID:27747131

  15. The Nutrition Literacy Assessment Instrument is a Valid and Reliable Measure of Nutrition Literacy in Adults with Chronic Disease.

    Science.gov (United States)

    Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K

    2018-03-01

    To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.

  16. Reliability and validity of two self-report measures of cognitive flexibility.

    Science.gov (United States)

    Johnco, Carly; Wuthrich, Viviana M; Rapee, Ronald M

    2014-12-01

    Neuropsychological testing currently represents the gold standard in assessing cognitive flexibility. However, this format presents some challenges in terms of time and skills required for administration, scoring, and interpretation. Two self-report measures of cognitive flexibility have been developed to measure aspects of cognitive flexibility in everyday settings, although neither has been validated in an older sample. In this study, we investigated the psychometric properties of 2 self-report measures of cognitive flexibility, the Cognitive Flexibility Inventory (CFI; Dennis & Vander Wal, 2010) and the Cognitive Flexibility Scale (CFS; Martin & Rubin, 1995), against neuropsychological measures of cognitive flexibility in a clinical sample of 47 older adults with comorbid anxiety and depression and a nonclinical sample of 53 community-dwelling older adults. Internal consistency was good for the CFS and CFI in all samples. The clinical sample reported poorer cognitive flexibility than did the nonclinical sample on self-report measures and performed more poorly on some neuropsychological measures. There was evidence of convergent validity between the 2 self-report measures but little relationship between the self-report and neuropsychological measures of cognitive flexibility, suggesting that self-report measures assess a different aspect of cognitive flexibility than does neuropsychological testing. Divergent validity was weak from measures of anxiety and depression in the combined and nonclinical samples but acceptable in the clinical sample. Results suggest that these measures are suitable for use with an older adult sample but do not assess the same aspects of cognitive flexibility as are assessed by neuropsychological assessment. (c) 2014 APA, all rights reserved.

  17. Reliable and valid assessment of performance in thoracoscopy

    DEFF Research Database (Denmark)

    Konge, Lars; Lehnert, Per; Hansen, Henrik Jessen

    2012-01-01

    BACKGROUND: As we move toward competency-based education in medicine, we have lagged in developing competency-based evaluation methods. In the era of minimally invasive surgery, there is a need for a reliable and valid tool dedicated to measure competence in video-assisted thoracoscopic surgery....... The purpose of this study is to create such an assessment tool, and to explore its reliability and validity. METHODS: An expert group of physicians created an assessment tool consisting of 10 items rated on a five-point rating scale. The following factors were included: economy and confidence of movement...

  18. Validation of measures for assessing management and evaluation skills in university professors

    OpenAIRE

    Domínguez Guedea, Miriam; Laros, Jacobo; Domínguez Guedea, Rosario–Leticia

    2010-01-01

    We present the results of a validation process two scales to measure skills of teachers teaching in a public university in Mexico. The elements to consider in this project were checking to test the construct validity and reliability of the method of collection, as well as contributing to the technical dimension of the evaluation of teachers.

  19. The Children's Social Understanding Scale: construction and validation of a parent-report measure for assessing individual differences in children's theories of mind.

    Science.gov (United States)

    Tahiroglu, Deniz; Moses, Louis J; Carlson, Stephanie M; Mahy, Caitlin E V; Olofson, Eric L; Sabbagh, Mark A

    2014-11-01

    Children's theory of mind (ToM) is typically measured with laboratory assessments of performance. Although these measures have generated a wealth of informative data concerning developmental progressions in ToM, they may be less useful as the sole source of information about individual differences in ToM and their relation to other facets of development. In the current research, we aimed to expand the repertoire of methods available for measuring ToM by developing and validating a parent-report ToM measure: the Children's Social Understanding Scale (CSUS). We present 3 studies assessing the psychometric properties of the CSUS. Study 1 describes item analysis, internal consistency, test-retest reliability, and relation of the scale to children's performance on laboratory ToM tasks. Study 2 presents cross-validation data for the scale in a different sample of preschool children with a different set of ToM tasks. Study 3 presents further validation data for the scale with a slightly older age group and a more advanced ToM task, while controlling for several other relevant cognitive abilities. The findings indicate that the CSUS is a reliable and valid measure of individual differences in children's ToM that may be of great value as a complement to standard ToM tasks in many different research contexts. (PsycINFO Database Record (c) 2014 APA, all rights reserved).

  20. Using SMS Text Messaging to Assess Moderators of Smoking Reduction: Validating a New Tool for Ecological Measurement of Health Behaviors

    Science.gov (United States)

    Berkman, Elliot T.; Dickenson, Janna; Falk, Emily B.; Lieberman, Matthew D.

    2011-01-01

    Objective Understanding the psychological processes that contribute to smoking reduction will yield population health benefits. Negative mood may moderate smoking lapse during cessation, but this relationship has been difficult to measure in ongoing daily experience. We used a novel form of ecological momentary assessment to test a self-control model of negative mood and craving leading to smoking lapse. Design We validated short message service (SMS) text as a user-friendly and low-cost option for ecologically measuring real-time health behaviors. We sent text messages to cigarette smokers attempting to quit eight times daily for the first 21 days of cessation (N-obs = 3,811). Main outcome measures Approximately every two hours, we assessed cigarette count, mood, and cravings, and examined between- and within-day patterns and time-lagged relationships among these variables. Exhaled carbon monoxide was assessed pre- and posttreatment. Results Negative mood and craving predicted smoking two hours later, but craving mediated the mood–smoking relationship. Also, this mediation relationship predicted smoking over the next two, but not four, hours. Conclusion Results clarify conflicting previous findings on the relation between affect and smoking, validate a new low-cost and user-friendly method for collecting fine-grained health behavior assessments, and emphasize the importance of rapid, real-time measurement of smoking moderators. PMID:21401252

  1. Validation of a novel venous duplex ultrasound objective structured assessment of technical skills for the assessment of venous reflux.

    Science.gov (United States)

    Jaffer, Usman; Normahani, Pasha; Lackenby, Kimberly; Aslam, Mohammed; Standfield, Nigel J

    2015-01-01

    Duplex ultrasound measurement of reflux time is central to the diagnosis of venous incompetence. We have developed an assessment tool for Duplex measurement of venous reflux for both simulator and patient-based training. A novel assessment tool, Venous Duplex Ultrasound Assessment of Technical Skills (V-DUOSATS), was developed. A modified DUOSATS was used for simulator training. Participants of varying skill level were invited to viewed an instructional video and were allowed ample time to familiarize with the Duplex equipment. Attempts made by the participants were recorded and independently assessed by 3 expert assessors and 5 novice assessors using the modified V-DUOSATS. "Global" assessment was also done by expert assessors on a 4-point Likert scale. Content, construct, and concurrent validities as well as reliability were evaluated. Content and construct validity as well as reliability were demonstrated. Receiver operator characteristic analysis-established cut points of 19/22 and 21/30 were most appropriate for simulator and patient-based assessment, respectively. We have validated a novel assessment tool for Duplex venous reflux measurement. Further work is required to establish transference validity of simulator training to improve skill in scanning patients. We have developed and validated V-DUOSATS for simulator training. Copyright © 2015 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.

  2. Validity and reliability of simple measurement device to assess the velocity of the barbell during squats.

    Science.gov (United States)

    Lorenzetti, Silvio; Lamparter, Thomas; Lüthy, Fabian

    2017-12-06

    The velocity of a barbell can provide important insights on the performance of athletes during strength training. The aim of this work was to assess the validity and reliably of four simple measurement devices that were compared to 3D motion capture measurements during squatting. Nine participants were assessed when performing 2 × 5 traditional squats with a weight of 70% of the 1 repetition maximum and ballistic squats with a weight of 25 kg. Simultaneously, data was recorded from three linear position transducers (T-FORCE, Tendo Power and GymAware), an accelerometer based system (Myotest) and a 3D motion capture system (Vicon) as the Gold Standard. Correlations between the simple measurement devices and 3D motion capture of the mean and the maximal velocity of the barbell, as well as the time to maximal velocity, were calculated. The correlations during traditional squats were significant and very high (r = 0.932, 0.990, p squats and was less accurate. All the linear position transducers were able to assess squat performance, particularly during traditional squats and especially in terms of mean velocity and time to maximal velocity.

  3. Development and validation of a measure of display rule knowledge: the display rule assessment inventory.

    Science.gov (United States)

    Matsumoto, David; Yoo, Seung Hee; Hirayama, Satoko; Petrova, Galina

    2005-03-01

    As one component of emotion regulation, display rules, which reflect the regulation of expressive behavior, have been the topic of many studies. Despite their theoretical and empirical importance, however, to date there is no measure of display rules that assesses a full range of behavioral responses that are theoretically possible when emotion is elicited. This article reports the development of a new measure of display rules that surveys 5 expressive modes: expression, deamplification, amplification, qualification, and masking. Two studies provide evidence for its internal and temporal reliability and for its content, convergent, discriminant, external, and concurrent predictive validity. Additionally, Study 1, involving American, Russian, and Japanese participants, demonstrated predictable cultural differences on each of the expressive modes. Copyright 2005 APA, all rights reserved.

  4. Validation of an inertial measurement unit for the measurement of jump count and height.

    Science.gov (United States)

    MacDonald, Kerry; Bahr, Roald; Baltich, Jennifer; Whittaker, Jackie L; Meeuwisse, Willem H

    2017-05-01

    To validate the use of an inertial measurement unit (IMU) for the collection of total jump count and assess the validity of an IMU for the measurement of jump height against 3-D motion analysis. Cross sectional validation study. 3D motion-capture laboratory and field based settings. Thirteen elite adolescent volleyball players. Participants performed structured drills, played a 4 set volleyball match and performed twelve counter movement jumps. Jump counts from structured drills and match play were validated against visual count from recorded video. Jump height during the counter movement jumps was validated against concurrent 3-D motion-capture data. The IMU device captured more total jumps (1032) than visual inspection (977) during match play. During structured practice, device jump count sensitivity was strong (96.8%) while specificity was perfect (100%). The IMU underestimated jump height compared to 3D motion-capture with mean differences for maximal and submaximal jumps of 2.5 cm (95%CI: 1.3 to 3.8) and 4.1 cm (3.1-5.1), respectively. The IMU offers a valid measuring tool for jump count. Although the IMU underestimates maximal and submaximal jump height, our findings demonstrate its practical utility for field-based measurement of jump load. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Data validation and risk assessment -- some pitfalls when evaluating VOC measurements

    International Nuclear Information System (INIS)

    Korte, N.; Kearl, P.

    1993-01-01

    Data validation, as described in Environmental Protection Agency (EPA) protocols under the Contract Laboratory Program (CLP), yields false confidence in the data and drives up costs while providing little benefit (Korte and Brown 1992). Commonly, these data are then used to perform a risk assessment. Much of the published guidance for risk assessments in and arid soils is inadequate because it does not take into account vapor migration due to density-driven flow (Korte and others 1992). Investigations into both of these problems have been performed by personnel of Oak Ridge National Laboratory (ORNL) and are described in this presentation

  6. Development and initial validity of the in-hand manipulation assessment.

    Science.gov (United States)

    Klymenko, Gabrielle; Liu, Karen P Y; Bissett, Michelle; Fong, Kenneth N K; Welage, Nandana; Wong, Rebecca S M

    2018-04-01

    A review of the literature related to in-hand manipulation (IHM) revealed that there is no assessment which specifically measures this construct in the adult population. This study reports the face and content validity of an IHM assessment for adults with impaired hand function based on expert opinion. The definition of IHM skills, assessment tasks and scoring methods identified from literature was discussed in a focus group (n = 4) to establish face validity. An expert panel (n = 16) reviewed the content validity of the proposed assessment; evaluating the representativeness and relevance of encompassing the IHM skills in the proposed assessment tasks, the clarity and importance to daily life of the task and the clarity and applicability to clinical environment of the scoring method. The content validity was calculated using the content validity index for both the individual task and all tasks together (I-CVI and S-CVI). Feedback was incorporated to create the assessment. The focus group members agreed to include 10 assessment tasks that covered all IHM skills. In the expert panel review, all tasks received an I-CVI above 0.78 and S-CVI above 0.80 in representativeness and relevance ratings, representing good content validity. With the comments from the expert panel, tasks were modified to improve the clarity and importance to daily life. A four-point Likert scale was identified for assessing both the completion of the assessment tasks and the quality of IHM skills within the task performance. Face and content validity were established in this new IHM assessment. Further studies to examine psychometric properties and use within clinical practice are recommended. © 2018 Occupational Therapy Australia.

  7. Translating and validating a Training Needs Assessment tool into Greek

    Directory of Open Access Journals (Sweden)

    Hicks Carolyn M

    2007-05-01

    Full Text Available Abstract Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire into Greek language and discuss possibilities of its use in primary care settings. Methods A modified version of the English self-administered questionnaire consisting of 30 items was used. Internationally recommended methodology, mandating forward translation, backward translation, reconciliation and pretesting steps, was followed. Tool validation included assessing item internal consistency, using the alpha coefficient of Cronbach. Reproducibility (test – retest reliability was measured by the kappa correlation coefficient. Criterion validity was calculated for selected parts of the questionnaire by correlating respondents' research experience with relevant research item scores. An exploratory factor analysis highlighted how the items group together, using a Varimax (oblique rotation and subsequent Cronbach's alpha assessment. Results The psychometric properties of the Greek version of the TNA questionnaire for nursing staff employed in primary care were good. Internal consistency of the instrument was very good, Cronbach's alpha was found to be 0.985 (p 1.0, KMO (Kaiser-Meyer-Olkin measure of sampling adequacy = 0.680 and Bartlett's test of sphericity, p Conclusion The translated and adapted Greek version is comparable with the original English instrument in terms of validity and reliability and it is suitable to assess professional development needs of nursing staff in Greek primary care settings.

  8. Translating and validating a Training Needs Assessment tool into Greek

    Science.gov (United States)

    Markaki, Adelais; Antonakis, Nikos; Hicks, Carolyn M; Lionis, Christos

    2007-01-01

    Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA) is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire into Greek language and discuss possibilities of its use in primary care settings. Methods A modified version of the English self-administered questionnaire consisting of 30 items was used. Internationally recommended methodology, mandating forward translation, backward translation, reconciliation and pretesting steps, was followed. Tool validation included assessing item internal consistency, using the alpha coefficient of Cronbach. Reproducibility (test – retest reliability) was measured by the kappa correlation coefficient. Criterion validity was calculated for selected parts of the questionnaire by correlating respondents' research experience with relevant research item scores. An exploratory factor analysis highlighted how the items group together, using a Varimax (oblique) rotation and subsequent Cronbach's alpha assessment. Results The psychometric properties of the Greek version of the TNA questionnaire for nursing staff employed in primary care were good. Internal consistency of the instrument was very good, Cronbach's alpha was found to be 0.985 (p 1.0, KMO (Kaiser-Meyer-Olkin) measure of sampling adequacy = 0.680 and Bartlett's test of sphericity, p < 0.001. Conclusion The translated and adapted Greek version is comparable with the original English instrument in terms of validity and reliability and it is suitable to assess professional development needs of nursing staff in Greek primary care settings. PMID:17474989

  9. Reliable and valid assessment of Lichtenstein hernia repair skills.

    Science.gov (United States)

    Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L

    2014-08-01

    Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups which indicates construct validity, p fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.

  10. Validation of an organizational communication climate assessment toolkit.

    Science.gov (United States)

    Wynia, Matthew K; Johnson, Megan; McCoy, Thomas P; Griffin, Leah Passmore; Osborn, Chandra Y

    2010-01-01

    Effective communication is critical to providing quality health care and can be affected by a number of modifiable organizational factors. The authors performed a prospective multisite validation study of an organizational communication climate assessment tool in 13 geographically and ethnically diverse health care organizations. Communication climate was measured across 9 discrete domains. Patient and staff surveys with matched items in each domain were developed using a national consensus process, which then underwent psychometric field testing and assessment of domain coherence. The authors found meaningful within-site and between-site performance score variability in all domains. In multivariable models, most communication domains were significant predictors of patient-reported quality of care and trust. The authors conclude that these assessment tools provide a valid empirical assessment of organizational communication climate in 9 domains. Assessment results may be useful to track organizational performance, to benchmark, and to inform tailored quality improvement interventions.

  11. Self-report measures of prospective memory are reliable but not valid.

    Science.gov (United States)

    Uttl, Bob; Kibreab, Mekale

    2011-03-01

    Are self-report measures of prospective memory (ProM) reliable and valid? To examine this question, 240 undergraduate student volunteers completed several widely used self-report measures of ProM including the Prospective Memory Questionnaire (PMQ), the Prospective and Retrospective Memory Questionnaire (PRMQ), the Comprehensive Assessment of Prospective Memory (CAPM) questionnaire, self-reports of retrospective memory (RetM), objective measures of ProM and RetM, and measures of involvement in activities and events, memory strategies and aids use, personality and verbal intelligence. The results showed that both convergent and divergent validity of ProM self-reports are poor, even though we assessed ProM using a newly developed, reliable continuous measure. Further analyses showed that a substantial proportion of variability in ProM self-report scores was due to verbal intelligence, personality (conscientiousness, neuroticism), activities and event involvement (busyness), and use of memory strategies and aids. ProM self-reports have adequate reliability, but poor validity and should not be interpreted as reflecting ProM ability. (PsycINFO Database Record (c) 2011 APA, all rights reserved).

  12. Valid and Reliable Science Content Assessments for Science Teachers

    Science.gov (United States)

    Tretter, Thomas R.; Brown, Sherri L.; Bush, William S.; Saderholm, Jon C.; Holmes, Vicki-Lynn

    2013-01-01

    Science teachers' content knowledge is an important influence on student learning, highlighting an ongoing need for programs, and assessments of those programs, designed to support teacher learning of science. Valid and reliable assessments of teacher science knowledge are needed for direct measurement of this crucial variable. This paper…

  13. Validity of instruments to assess students' travel and pedestrian safety

    Directory of Open Access Journals (Sweden)

    Baranowski Tom

    2010-05-01

    Full Text Available Abstract Background Safe Routes to School (SRTS programs are designed to make walking and bicycling to school safe and accessible for children. Despite their growing popularity, few validated measures exist for assessing important outcomes such as type of student transport or pedestrian safety behaviors. This research validated the SRTS school travel survey and a pedestrian safety behavior checklist. Methods Fourth grade students completed a brief written survey on how they got to school that day with set responses. Test-retest reliability was obtained 3-4 hours apart. Convergent validity of the SRTS travel survey was assessed by comparison to parents' report. For the measure of pedestrian safety behavior, 10 research assistants observed 29 students at a school intersection for completion of 8 selected pedestrian safety behaviors. Reliability was determined in two ways: correlations between the research assistants' ratings to that of the Principal Investigator (PI and intraclass correlations (ICC across research assistant ratings. Results The SRTS travel survey had high test-retest reliability (κ = 0.97, n = 96, p Conclusions These validated instruments can be used to assess SRTS programs. The pedestrian safety behavior checklist may benefit from further formative work.

  14. Validity in assessment of prior learning

    DEFF Research Database (Denmark)

    Wahlgren, Bjarne; Aarkrog, Vibe

    2015-01-01

    , the article discusses the need for specific criteria for assessment. The reliability and validity of the assessment procedures depend on whether the competences are well-defined, and whether the teachers are adequately trained for the assessment procedures. Keywords: assessment, prior learning, adult...... education, vocational training, lifelong learning, validity...

  15. Functional claudication distance: a reliable and valid measurement to assess functional limitation in patients with intermittent claudication

    Directory of Open Access Journals (Sweden)

    Prins Martin H

    2009-03-01

    Full Text Available Abstract Background Disease severity and functional impairment in patients with intermittent claudication is usually quantified by the measurement of pain-free walking distance (intermittent claudication distance, ICD and maximal walking distance (absolute claudication distance, ACD. However, the distance at which a patient would prefer to stop because of claudication pain seems a definition that is more correspondent with the actual daily life walking distance. We conducted a study in which the distance a patient prefers to stop was defined as the functional claudication distance (FCD, and estimated the reliability and validity of this measurement. Methods In this clinical validity study we included patients with intermittent claudication, following a supervised exercise therapy program. The first study part consisted of two standardised treadmill tests. During each test ICD, FCD and ACD were determined. Primary endpoint was the reliability as represented by the calculated intra-class correlation coefficients. In the second study part patients performed a standardised treadmill test and filled out the Rand-36 questionnaire. Spearman's rho was calculated to assess validity. Results The intra-class correlation coefficients of ICD, FCD and ACD were 0.940, 0.959, and 0.975 respectively. FCD correlated significantly with five out of nine domains, namely physical function (rho = 0.571, physical role (rho = 0.532, vitality (rho = 0.416, pain (rho = 0.416 and health change (rho = 0.414. Conclusion FCD is a reliable and valid measurement for determining functional capacity in trained patients with intermittent claudication. Furthermore it seems that FCD better reflects the actual functional impairment. In future studies, FCD could be used alongside ICD and ACD.

  16. Are implicit self-esteem measures valid for assessing individual and cultural differences?

    Science.gov (United States)

    Falk, Carl F; Heine, Steven J; Takemura, Kosuke; Zhang, Cathy X J; Hsu, Chih-Wei

    2015-02-01

    Our research utilized two popular theoretical conceptualizations of implicit self-esteem: 1) implicit self-esteem as a global automatic reaction to the self; and 2) implicit self-esteem as a context/domain specific construct. Under this framework, we present an extensive search for implicit self-esteem measure validity among different cultural groups (Study 1) and under several experimental manipulations (Study 2). In Study 1, Euro-Canadians (N = 107), Asian-Canadians (N = 187), and Japanese (N = 112) completed a battery of implicit self-esteem, explicit self-esteem, and criterion measures. Included implicit self-esteem measures were either popular or provided methodological improvements upon older methods. Criterion measures were sampled from previous research on implicit self-esteem and included self-report and independent ratings. In Study 2, Americans (N = 582) completed a shorter battery of these same types of measures under either a control condition, an explicit prime meant to activate the self-concept in a particular context, or prime meant to activate self-competence related implicit attitudes. Across both studies, explicit self-esteem measures far outperformed implicit self-esteem measures in all cultural groups and under all experimental manipulations. Implicit self-esteem measures are not valid for individual or cross-cultural comparisons. We speculate that individuals may not form implicit associations with the self as an attitudinal object. © 2013 Wiley Periodicals, Inc.

  17. Construction and Initial Validation of the Multiracial Experiences Measure (MEM)

    Science.gov (United States)

    Yoo, Hyung Chol; Jackson, Kelly; Guevarra, Rudy P.; Miller, Matthew J.; Harrington, Blair

    2015-01-01

    This article describes the development and validation of the Multiracial Experiences Measure (MEM): a new measure that assesses uniquely racialized risks and resiliencies experienced by individuals of mixed racial heritage. Across two studies, there was evidence for the validation of the 25-item MEM with 5 subscales including Shifting Expressions, Perceived Racial Ambiguity, Creating Third Space, Multicultural Engagement, and Multiracial Discrimination. The 5-subscale structure of the MEM was supported by a combination of exploratory and confirmatory factor analyses. Evidence of criterion-related validity was partially supported with MEM subscales correlating with measures of racial diversity in one’s social network, color-blind racial attitude, psychological distress, and identity conflict. Evidence of discriminant validity was supported with MEM subscales not correlating with impression management. Implications for future research and suggestions for utilization of the MEM in clinical practice with multiracial adults are discussed. PMID:26460977

  18. An Extended Validity Argument for Assessing Feedback Culture.

    Science.gov (United States)

    Rougas, Steven; Clyne, Brian; Cianciolo, Anna T; Chan, Teresa M; Sherbino, Jonathan; Yarris, Lalena M

    2015-01-01

    NEGEA 2015 CONFERENCE ABSTRACT (EDITED): Measuring an Organization's Culture of Feedback: Can It Be Done? Steven Rougas and Brian Clyne. CONSTRUCT: This study sought to develop a construct for measuring formative feedback culture in an academic emergency medicine department. Four archetypes (Market, Adhocracy, Clan, Hierarchy) reflecting an organization's values with respect to focus (internal vs. external) and process (flexibility vs. stability and control) were used to characterize one department's receptiveness to formative feedback. The prevalence of residents' identification with certain archetypes served as an indicator of the department's organizational feedback culture. New regulations have forced academic institutions to implement wide-ranging changes to accommodate competency-based milestones and their assessment. These changes challenge residencies that use formative feedback from faculty as a major source of data for determining training advancement. Though various approaches have been taken to improve formative feedback to residents, there currently exists no tool to objectively measure the organizational culture that surrounds this process. Assessing organizational culture, commonly used in the business sector to represent organizational health, may help residency directors gauge their program's success in fostering formative feedback. The Organizational Culture Assessment Instrument (OCAI) is widely used, extensively validated, applicable to survey research, and theoretically based and may be modifiable to assess formative feedback culture in the emergency department. Using a modified Delphi technique and several iterations of focus groups amongst educators at one institution, four of the original six OCAI domains (which each contain 4 possible responses) were modified to create a 16-item Formative Feedback Culture Tool (FFCT) that was administered to 26 residents (response rate = 55%) at a single academic emergency medicine department. The mean

  19. Developing and Validating a New Classroom Climate Observation Assessment Tool.

    Science.gov (United States)

    Leff, Stephen S; Thomas, Duane E; Shapiro, Edward S; Paskewich, Brooke; Wilson, Kim; Necowitz-Hoffman, Beth; Jawad, Abbas F

    2011-01-01

    The climate of school classrooms, shaped by a combination of teacher practices and peer processes, is an important determinant for children's psychosocial functioning and is a primary factor affecting bullying and victimization. Given that there are relatively few theoretically-grounded and validated assessment tools designed to measure the social climate of classrooms, our research team developed an observation tool through participatory action research (PAR). This article details how the assessment tool was designed and preliminarily validated in 18 third-, fourth-, and fifth-grade classrooms in a large urban public school district. The goals of this study are to illustrate the feasibility of a PAR paradigm in measurement development, ascertain the psychometric properties of the assessment tool, and determine associations with different indices of classroom levels of relational and physical aggression.

  20. The Selective Mutism Questionnaire: Measurement Structure and Validity

    Science.gov (United States)

    Letamendi, Andrea M.; Chavira, Denise A.; Hitchcock, Carla A.; Roesch, Scott C.; Shipon-Blum, Elisa; Stein, Murray B.; Roesch, Scott C.

    2010-01-01

    Objective To evaluate the factor structure, reliability, and validity of the 17-item Selective Mutism Questionnaire. Method Diagnostic interviews were administered via telephone to 102 parents of children identified with selective mutism (SM) and 43 parents of children without SM from varying U.S. geographic regions. Children were between the ages of 3 and 11 inclusive and comprised 58% girls and 42% boys. SM diagnoses were determined using the Anxiety Disorders Interview Schedule for Children - Parent Version (ADIS-C/P); SM severity was assessed using the 17-item Selective Mutism Questionnaire (SMQ); and behavioral and affective symptoms were assessed using the Child Behavior Checklist (CBCL). An exploratory factor analysis (EFA) was conducted to investigate the dimensionality of the SMQ and a modified parallel analysis procedure was used to confirm EFA results. Internal consistency, construct validity, and incremental validity were also examined. Results The EFA yielded a 13-item solution consisting of three factors: a) Social Situations Outside of School, b) School Situations, and c) Home and Family Situations. Internal consistency of SMQ factors and total scale ranged from moderate to high. Convergent and incremental validity were also well supported. Conclusions Measure structure findings are consistent with the 3-factor solution found in a previous psychometric evaluation of the SMQ. Results also suggest that the SMQ provides useful and unique information in the prediction of SM phenomenon beyond other child anxiety measures. PMID:18698268

  1. Assessing trauma and mental health in refugee children and youth: a systematic review of validated screening and measurement tools.

    Science.gov (United States)

    Gadeberg, A K; Montgomery, E; Frederiksen, H W; Norredam, M

    2017-06-01

    : It is estimated that children below 18 years constitute 50% of the refugee population worldwide, which is the highest figure in a decade. Due to conflicts like the Syrian crises, children are continuously exposed to traumatic events. Trauma exposure can cause mental health problems that may in turn increase the risk of morbidity and mortality. Tools such as questionnaires and interview guides are being used extensively, despite the fact that only a few have been tested and their validity confirmed in refugee children and youth. : Our aim was to provide a systematic review of the validated screening and measurement tools available for assessment of trauma and mental health among refugee children and youth. : We systematically searched the databases PubMed, PsycINFO and PILOTS. The search yielded 913 articles and 97 were retained for further investigation. In accordance with the PRISMA guidelines two authors performed the eligibility assessment. The full text of 23 articles was assessed and 9 met the eligibility criteria. Results : Only nine studies had validated trauma and mental health tools in refugee children and youth populations. A serious lack of validated tools for refugee children below the age of 6 was identified. : There is a lack of validated trauma and mental health tools, especially for refugees below the age of 6. Detection and treatment of mental health issues among refugee children and youth should be a priority both within the scientific community and in practice in order to reduce morbidity and mortality. © The Author 2017. Published by Oxford University Press on behalf of the European Public Health Association. All rights reserved.

  2. Measuring stakeholder participation in evaluation: an empirical validation of the Participatory Evaluation Measurement Instrument (PEMI).

    Science.gov (United States)

    Daigneault, Pierre-Marc; Jacob, Steve; Tremblay, Joël

    2012-08-01

    Stakeholder participation is an important trend in the field of program evaluation. Although a few measurement instruments have been proposed, they either have not been empirically validated or do not cover the full content of the concept. This study consists of a first empirical validation of a measurement instrument that fully covers the content of participation, namely the Participatory Evaluation Measurement Instrument (PEMI). It specifically examines (1) the intercoder reliability of scores derived by two research assistants on published evaluation cases; (2) the convergence between the scores of coders and those of key respondents (i.e., authors); and (3) the convergence between the authors' scores on the PEMI and the Evaluation Involvement Scale (EIS). A purposive sample of 40 cases drawn from the evaluation literature was used to assess reliability. One author per case in this sample was then invited to participate in a survey; 25 fully usable questionnaires were received. Stakeholder participation was measured on nominal and ordinal scales. Cohen's κ, the intraclass correlation coefficient, and Spearman's ρ were used to assess reliability and convergence. Reliability results ranged from fair to excellent. Convergence between coders' and authors' scores ranged from poor to good. Scores derived from the PEMI and the EIS were moderately associated. Evidence from this study is strong in the case of intercoder reliability and ranges from weak to strong in the case of convergent validation. Globally, this suggests that the PEMI can produce scores that are both reliable and valid.

  3. Reliability and Validity of Selected PROMIS Measures in People with Rheumatoid Arthritis.

    Directory of Open Access Journals (Sweden)

    Susan J Bartlett

    Full Text Available To evaluate the reliability and validity of 11 PROMIS measures to assess symptoms and impacts identified as important by people with rheumatoid arthritis (RA.Consecutive patients (N = 177 in an observational study completed PROMIS computer adapted tests (CATs and a short form (SF assessing pain, fatigue, physical function, mood, sleep, and participation. We assessed test-test reliability and internal consistency using correlation and Cronbach's alpha. We assessed convergent validity by examining Pearson correlations between PROMIS measures and existing measures of similar domains and known groups validity by comparing scores across disease activity levels using ANOVA.Participants were mostly female (82% and white (83% with mean (SD age of 56 (13 years; 24% had ≤ high school, 29% had RA ≤ 5 years with 13% ≤ 2 years, and 22% were disabled. PROMIS Physical Function, Pain Interference and Fatigue instruments correlated moderately to strongly (rho's ≥ 0.68 with corresponding PROs. Test-retest reliability ranged from .725-.883, and Cronbach's alpha from .906-.991. A dose-response relationship with disease activity was evident in Physical Function with similar trends in other scales except Anger.These data provide preliminary evidence of reliability and construct validity of PROMIS CATs to assess RA symptoms and impacts, and feasibility of use in clinical care. PROMIS instruments captured the experiences of RA patients across the broad continuum of RA symptoms and function, especially at low disease activity levels. Future research is needed to evaluate performance in relevant subgroups, assess responsiveness and identify clinically meaningful changes.

  4. Validity of Dietary Assessment in Athletes: A Systematic Review

    Directory of Open Access Journals (Sweden)

    Louise Capling

    2017-12-01

    Full Text Available Dietary assessment methods that are recognized as appropriate for the general population are usually applied in a similar manner to athletes, despite the knowledge that sport-specific factors can complicate assessment and impact accuracy in unique ways. As dietary assessment methods are used extensively within the field of sports nutrition, there is concern the validity of methodologies have not undergone more rigorous evaluation in this unique population sub-group. The purpose of this systematic review was to compare two or more methods of dietary assessment, including dietary intake measured against biomarkers or reference measures of energy expenditure, in athletes. Six electronic databases were searched for English-language, full-text articles published from January 1980 until June 2016. The search strategy combined the following keywords: diet, nutrition assessment, athlete, and validity; where the following outcomes are reported but not limited to: energy intake, macro and/or micronutrient intake, food intake, nutritional adequacy, diet quality, or nutritional status. Meta-analysis was performed on studies with sufficient methodological similarity, with between-group standardized mean differences (or effect size and 95% confidence intervals (CI being calculated. Of the 1624 studies identified, 18 were eligible for inclusion. Studies comparing self-reported energy intake (EI to energy expenditure assessed via doubly labelled water were grouped for comparison (n = 11 and demonstrated mean EI was under-estimated by 19% (−2793 ± 1134 kJ/day. Meta-analysis revealed a large pooled effect size of −1.006 (95% CI: −1.3 to −0.7; p < 0.001. The remaining studies (n = 7 compared a new dietary tool or instrument to a reference method(s (e.g., food record, 24-h dietary recall, biomarker as part of a validation study. This systematic review revealed there are limited robust studies evaluating dietary assessment methods in athletes. Existing

  5. Reliability and Validity of Finger Strength and Endurance Measurements in Rock Climbing

    Science.gov (United States)

    Michailov, Michail Lubomirov; Baláš, Jirí; Tanev, Stoyan Kolev; Andonov, Hristo Stoyanov; Kodejška, Jan; Brown, Lee

    2018-01-01

    Purpose: An advanced system for the assessment of climbing-specific performance was developed and used to: (a) investigate the effect of arm fixation (AF) on construct validity evidence and reliability of climbing-specific finger-strength measurement; (b) assess reliability of finger-strength and endurance measurements; and (c) evaluate the…

  6. Assessing Measurement Error in Medicare Coverage

    Data.gov (United States)

    U.S. Department of Health & Human Services — Assessing Measurement Error in Medicare Coverage From the National Health Interview Survey Using linked administrative data, to validate Medicare coverage estimates...

  7. Validation of the Australian Midwifery Standards Assessment Tool (AMSAT): A tool to assess midwifery competence.

    Science.gov (United States)

    Sweet, Linda; Bazargan, Maryam; McKellar, Lois; Gray, Joanne; Henderson, Amanda

    2018-02-01

    There is no current validated clinical assessment tool to measure the attainment of midwifery student competence in the midwifery practice setting. The lack of a valid assessment tool has led to a proliferation of tools and inconsistency in assessment of, and feedback on student learning. This research aimed to develop and validate a tool to assess competence of midwifery students in practice-based settings. A mixed-methods approach was used and the study implemented in two phases. Phase one involved the development of the AMSAT tool with qualitative feedback from midwifery academics, midwife assessors of students, and midwifery students. In phase two the newly developed AMSAT tool was piloted across a range of midwifery practice settings and ANOVA was used to compare scores across year levels, with feedback being obtained from assessors. Analysis of 150 AMSAT forms indicate the AMSAT as: reliable (Cronbach alpha greater than 0.9); valid-data extraction loaded predominantly onto one factor; and sensitivity scores indicating level of proficiency increased across the three years. Feedback evaluation forms (n=83) suggest acceptance of this tool for the purpose of both assessing and providing feedback on midwifery student's practice performance and competence. The AMSAT is a valid, reliable and acceptable midwifery assessment tool enables consistent assessment of midwifery student competence. This assists benchmarking across midwifery education programs. Copyright © 2017 Australian College of Midwives. Published by Elsevier Ltd. All rights reserved.

  8. Measuring older adults' sedentary time: reliability, validity, and responsiveness.

    Science.gov (United States)

    Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville

    2011-11-01

    With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is

  9. Reliability and validity of subjective assessment of lumbar lordosis in ...

    African Journals Online (AJOL)

    Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. Objective: To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. Design: A blinded, repeated-measures diagnostic test was carried ...

  10. The Elbow Self-Assessment Score (ESAS): development and validation of a new patient-reported outcome measurement tool for elbow disorders.

    Science.gov (United States)

    Beirer, Marc; Friese, Henrik; Lenich, Andreas; Crönlein, Moritz; Sandmann, Gunther H; Biberthaler, Peter; Kirchhoff, Chlodwig; Siebenlist, Sebastian

    2017-07-01

    To develop and validate an elbow self-assessment score considering subjective as well as objective parameters. Each scale of the American Shoulder and Elbow Surgeons-Elbow Score, the Broberg and Morrey rating system (BMS), the Patient-Rated Elbow Evaluation (PREE) Questionnaire, the Mayo Elbow Performance Score (MEPS), the Oxford Elbow Score (OES) and the Quick Disabilities of the Arm, Shoulder and Hand (Quick-DASH) was analysed, and after matching of the general topics, the dedicated items underwent a fusion to the final ESAS's item and a score containing 22 items was created. In a prospective clinical study, validity, reliability and responsiveness in physically active patients with traumatic as well as degenerative elbow disorders were evaluated. Validation study included 103 patients (48 women, 55 men; mean age 43 years). A high test-retest reliability was found with intraclass correlation coefficients of at least 0.71. Construct validity and responsiveness were confirmed by correlation coefficients of -0.80 to -0.84 and 0.72-0.84 (p Self-Assessment Score (ESAS), a valid and reliable instrument for a qualitative self-assessment of subjective and objective parameters (e.g. range of motion) of the elbow joint is demonstrated. Quantitative measurement of elbow function may not longer be limited to specific elbow disorders or patient groups. The ESAS seems to allow for a broad application in clinical research studying elbow patients and may facilitate the comparison of treatment results in elbow disorders. The treatment efficacy can be easily evaluated, and treatment concepts could be reviewed and revised. Diagnostic study, Level III.

  11. Establishing best practices for the validation of atmospheric composition measurements from satellites

    Science.gov (United States)

    Lambert, Jean-Christopher

    As a contribution to the implementation of the Global Earth Observation System of Systems (GEOSS), the Committee on Earth Observation Satellites (CEOS) is developing a data quality strategy for satellite measurements. To achieve GEOSS requirements of consistency and interoperability (e.g. for comparison and for integrated interpretation) of the measurements and their derived data products, proper uncertainty assessment is essential and needs to be continuously monitored and traceable to standards. Therefore, CEOS has undertaken the task to establish a set of best practices and guidelines for satellite validation, starting with current practices that could be improved with time. Best practices are not intended to be imposed as firm requirements, but rather to be suggested as a baseline for comparing against, which could be used by the widest community and provide guidance to newcomers. The present paper reviews the current development of best practices and guidelines for the validation of atmospheric composition satellites. Terminologies and general principles of validation are reminded. Going beyond elementary definitions of validation like the assessment of uncertainties, the specific GEOSS context calls also for validation of individual service components and against user requirements. This paper insists on two important aspects. First one, the question of the "collocation". Validation generally involves comparisons with "reference" measurements of the same quantities, and the question of what constitutes a valid comparison is not the least of the challenges faced. We present a tentative scheme for defining the validity of a comparison and of the necessary "collocation" criteria. Second focus of this paper: the information content of the data product. Validation against user requirements, or the verification of the "fitness for purpose" of both the data products and their validation, needs to identify what information, in the final product, is contributed really

  12. Assessment of juveniles testimonies’ validity

    Directory of Open Access Journals (Sweden)

    Dozortseva E.G.

    2015-12-01

    Full Text Available The article presents a review of the English language publications concerning the history and the current state of differential psychological assessment of validity of testimonies produced by child and adolescent victims of crimes. The topicality of the problem in Russia is high due to the tendency of Russian specialists to use methodical means and instruments developed abroad in this sphere for forensic assessments of witness testimony veracity. A system of Statement Validity Analysis (SVA by means of Criteria-Based Content Analysis (CBCA and Validity Checklist is described. The results of laboratory and field studies of validity of CBCA criteria on the basis of child and adult witnesses are discussed. The data display a good differentiating capacity of the method, however, a high level of error probability. The researchers recommend implementation of SVA in the criminal investigation process, but not in the forensic assessment. New perspective developments in the field of methods for differentiation of witness statements based on the real experience and fictional are noted. The conclusion is drawn that empirical studies and a special work for adaptation and development of new approaches should precede their implementation into Russian criminal investigation and forensic assessment practice

  13. Ovarian and cervical cancer awareness: development of two validated measurement tools.

    Science.gov (United States)

    Simon, Alice E; Wardle, Jane; Grimmett, Chloe; Power, Emily; Corker, Elizabeth; Menon, Usha; Matheson, Lauren; Waller, Jo

    2012-07-01

    The aim of the study was to develop and validate measures of awareness of symptoms and risk factors for ovarian and cervical cancer (Ovarian and Cervical Cancer Awareness Measures). Potentially relevant items were extracted from the literature and generated by experts. Four validation studies were carried out to establish reliability and validity. Women aged 21-67 years (n=146) and ovarian and cervical cancer experts (n=32) were included in the studies. Internal reliability was assessed psychometrically. Test-retest reliability was assessed over a 1-week interval. To establish construct validity, Cancer Awareness Measure (CAM) scores of cancer experts were compared with equally well-educated comparison groups. Sensitivity to change was tested by randomly assigning participants to read either a leaflet giving information about ovarian/cervical cancer or a leaflet with control information, and then completing the ovarian/cervical CAM. Internal reliability (Cronbach's α=0.88 for the ovarian CAM and α=0.84 for the cervical CAM) and test-retest reliability (r=0.84 and r=0.77 for the ovarian and cervical CAMs, respectively) were both high. Validity was demonstrated with cancer experts achieving higher scores than controls [ovarian CAM: t(36)= -5.6, pcancer leaflet scored higher than those who received a control leaflet [ovarian CAM: t(49)=7.5, pcancer awareness in the general population.

  14. Investigating the construct validity of a development assessment centre

    Directory of Open Access Journals (Sweden)

    Nadia M. Brits

    2013-11-01

    Research purpose: The aim of this study was to determine the construct validity of a one-day development assessment centre (DAC using a convenience sample of 202 managers in a large South African banking institution. Motivation for the study: Although the AC method is popular, it has been widely criticised as to whether it predominantly measures the dimensions it is designed to measure. Research design, approach and method: The fit of the measurement models implied by the dimensions measured was analysed in a quantitative study using an ex post facto correlation design and structural equation modelling. Main findings: Bi-factor confirmatory factor analysis was used to assess the relative contribution of higher-order exercise and dimension effects. Empirical under-identification stemming from the small number of exercises designed to reflect designated latent dimensions restricted the number of DAC dimensions that could be evaluated. Ultimately, only one global dimension had enough measurement points and was analysed. The results suggested that dimension effects explained the majority of variance in the post-exercise dimension ratings. Practical/managerial implications: Candidates’ proficiency on each dimension was used as the basis for development reports. The validity of inferences holds important implications for candidates’ career development and growth. Contribution/value-add: The authors found only one study on construct validity of AC dimensions in the South African context. The present study is the first use the bi-factor approach. This study will consequently contribute to the scarce AC literature in South Africa.

  15. The Measurement of Entrepreneurial Outsourcing: Preliminary Scale Development, Dimensionality Assessment, and Construct Validation

    Directory of Open Access Journals (Sweden)

    Ali Davari

    2015-08-01

    Full Text Available Studying the outsourcing concept, as a strategy for efficient and effective business management, has been implemented less in the field of entrepreneurship. Accordingly, the present study aims to develop a measurement instrument for measuring entrepreneurial outsourcing construct utilizing empirical evidence in Iran’s telecommunications and automotive industries. Employing a sample of 203 senior managers and executive experts of companies operating in these industries, the gathered data were analyzed using PLS-SEM method. According to our results, the proposed scale of entrepreneurial outsourcing comprises six dimensions: strategic factors, economical factors, technological factors, task specifications, risk relating factors, and entrepreneurial performance. Moreover, the scale enjoys sufficient multidimensionality, reliability, and construct validity in terms of convergent and discriminate validity.

  16. Measuring couple relationship quality in a rural African population: Validation of a Couple Functionality Assessment Tool in Malawi.

    Directory of Open Access Journals (Sweden)

    Allison Ruark

    Full Text Available Available data suggest that individual and family well-being are linked to the quality of women's and men's couple relationships, but few tools exist to assess couple relationship functioning in low- and middle-income countries. In response to this gap, Catholic Relief Services has developed a Couple Functionality Assessment Tool (CFAT to capture valid and reliable data on various domains of relationship quality. This tool is designed to be used by interventions which aim to improve couple and family well-being as a means of measuring the effectiveness of these interventions, particularly related to couple relationship quality. We carried out a validation study of the CFAT among 401 married and cohabiting adults (203 women and 198 men in rural Chikhwawa District, Malawi. Using psychometric scales, the CFAT addressed six domains of couple relationship quality (intimacy, partner support, sexual satisfaction, gender roles, decision-making, and communication and conflict management, and included questions on intimate partner violence. We used exploratory factor analysis to assess scale performance of each domain and produce a shortened Relationship Quality Index (RQI composed of items from five relationship quality domains. This article reports the performance of the RQI. Internal reliability and validity of the RQI were found to be good. Regression analyses examined the relationship of the RQI to outcomes important to health and development: intra-household cooperation, positive health behaviors, intimate partner violence, and gender-equitable norms. We found many significant correlations between RQI scores and these couple- and family-level development issues. There is a need to further validate the tool with use in other populations as well as to continue to explore whether the observed linkages between couple functionality and development outcomes are causal relationships.

  17. Novel Automated Morphometric and Kinematic Handwriting Assessment: A Validity Study in Children with ASD and ADHD

    Science.gov (United States)

    Dirlikov, Benjamin; Younes, Laurent; Nebel, Mary Beth; Martinelli, Mary Katherine; Tiedemann, Alyssa Nicole; Koch, Carolyn A.; Fiorilli, Diana; Bastian, Amy J.; Denckla, Martha Bridge; Miller, Michael I.; Mostofsky, Stewart H.

    2017-01-01

    This study presents construct validity for a novel automated morphometric and kinematic handwriting assessment, including (1) convergent validity, establishing reliability of automated measures with traditional manual-derived Minnesota Handwriting Assessment (MHA), and (2) discriminant validity, establishing that the automated methods distinguish…

  18. Construct validity and inter-rater reliability of the Dutch activity measure for post-acute care "6-clicks" basic mobility form to assess the mobility of hospitalized patients.

    Science.gov (United States)

    Geelen, Sven Jacobus Gertruda; Valkenet, Karin; Veenhof, Cindy

    2018-05-12

    To evaluate the construct validity and the inter-rater reliability of the Dutch Activity Measure for Post-Acute Care "6-clicks" Basic Mobility short form measuring the patient's mobility in Dutch hospital care. First, the "6-clicks" was translated by using a forward-backward translation protocol. Next, 64 patients were assessed by the physiotherapist to determine the validity while being admitted to the Internal Medicine wards of a university medical center. Six hypotheses were tested regarding the construct "mobility" which showed that: Better "6-clicks" scores were related to less restrictive pre-admission living situations (p = 0.011), less restrictive discharge locations (p = 0.001), more independence in activities of daily living (p = 0.001) and less physiotherapy visits (p Dutch "6-clicks" shows a good construct validity and moderate-to-excellent inter-rater reliability when used to assess the mobility of hospitalized patients. Implications for Rehabilitation Even though various measurement tools have been developed, it appears the majority of physiotherapists working in a hospital currently do not use these tools as a standard part of their care. The Activity Measure for Post-Acute Care "6-clicks" Basic Mobility is the only tool which is designed to be short, easy to use within usual care and has been validated in the entire hospital population. This study shows that the Dutch version of the Activity Measure for Post-Acute Care "6-clicks" Basic Mobility form is a valid, easy to use, quick tool to assess the basic mobility of Dutch hospitalized patients.

  19. Are we really measuring what we say we're measuring? Using video techniques to supplement traditional construct validation procedures.

    Science.gov (United States)

    Podsakoff, Nathan P; Podsakoff, Philip M; Mackenzie, Scott B; Klinger, Ryan L

    2013-01-01

    Several researchers have persuasively argued that the most important evidence to consider when assessing construct validity is whether variations in the construct of interest cause corresponding variations in the measures of the focal construct. Unfortunately, the literature provides little practical guidance on how researchers can go about testing this. Therefore, the purpose of this article is to describe how researchers can use video techniques to test whether their scales measure what they purport to measure. First, we discuss how researchers can develop valid manipulations of the focal construct that they hope to measure. Next, we explain how to design a study to use this manipulation to test the validity of the scale. Finally, comparing and contrasting traditional and contemporary perspectives on validation, we discuss the advantages and limitations of video-based validation procedures. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  20. Discriminant Validity Assessment: Use of Fornell & Larcker criterion versus HTMT Criterion

    Science.gov (United States)

    Hamid, M. R. Ab; Sami, W.; Mohmad Sidek, M. H.

    2017-09-01

    Assessment of discriminant validity is a must in any research that involves latent variables for the prevention of multicollinearity issues. Fornell and Larcker criterion is the most widely used method for this purpose. However, a new method has emerged for establishing the discriminant validity assessment through heterotrait-monotrait (HTMT) ratio of correlations method. Therefore, this article presents the results of discriminant validity assessment using these methods. Data from previous study was used that involved 429 respondents for empirical validation of value-based excellence model in higher education institutions (HEI) in Malaysia. From the analysis, the convergent, divergent and discriminant validity were established and admissible using Fornell and Larcker criterion. However, the discriminant validity is an issue when employing the HTMT criterion. This shows that the latent variables under study faced the issue of multicollinearity and should be looked into for further details. This also implied that the HTMT criterion is a stringent measure that could detect the possible indiscriminant among the latent variables. In conclusion, the instrument which consisted of six latent variables was still lacking in terms of discriminant validity and should be explored further.

  1. Reliability and validity of expert assessment based on airborne and urinary measures of nickel and chromium exposure in the electroplating industry

    Science.gov (United States)

    Chen, Yu-Cheng; Coble, Joseph B; Deziel, Nicole C.; Ji, Bu-Tian; Xue, Shouzheng; Lu, Wei; Stewart, Patricia A; Friesen, Melissa C

    2014-01-01

    The reliability and validity of six experts’ exposure ratings were evaluated for 64 nickel-exposed and 72 chromium-exposed workers from six Shanghai electroplating plants based on airborne and urinary nickel and chromium measurements. Three industrial hygienists and three occupational physicians independently ranked the exposure intensity of each metal on an ordinal scale (1–4) for each worker's job in two rounds: the first round was based on responses to an occupational history questionnaire and the second round also included responses to an electroplating industry-specific questionnaire. Spearman correlation (rs) was used to compare each rating's validity to its corresponding subject-specific arithmetic mean of four airborne or four urinary measurements. Reliability was moderately-high (weighted kappa range=0.60–0.64). Validity was poor to moderate (rs= -0.37–0.46) for both airborne and urinary concentrations of both metals. For airborne nickel concentrations, validity differed by plant. For dichotomized metrics, sensitivity and specificity were higher based on urinary measurements (47–78%) than airborne measurements (16–50%). Few patterns were observed by metal, assessment round, or expert type. These results suggest that, for electroplating exposures, experts can achieve moderately-high agreement and (reasonably) distinguish between low and high exposures when reviewing responses to in-depth questionnaires used in population-based case-control studies. PMID:24736099

  2. Reliability and validity of expert assessment based on airborne and urinary measures of nickel and chromium exposure in the electroplating industry.

    Science.gov (United States)

    Chen, Yu-Cheng; Coble, Joseph B; Deziel, Nicole C; Ji, Bu-Tian; Xue, Shouzheng; Lu, Wei; Stewart, Patricia A; Friesen, Melissa C

    2014-11-01

    The reliability and validity of six experts' exposure ratings were evaluated for 64 nickel-exposed and 72 chromium-exposed workers from six Shanghai electroplating plants based on airborne and urinary nickel and chromium measurements. Three industrial hygienists and three occupational physicians independently ranked the exposure intensity of each metal on an ordinal scale (1-4) for each worker's job in two rounds: the first round was based on responses to an occupational history questionnaire and the second round also included responses to an electroplating industry-specific questionnaire. The Spearman correlation (r(s)) was used to compare each rating's validity to its corresponding subject-specific arithmetic mean of four airborne or four urinary measurements. Reliability was moderately high (weighted kappa range=0.60-0.64). Validity was poor to moderate (r(s)=-0.37-0.46) for both airborne and urinary concentrations of both metals. For airborne nickel concentrations, validity differed by plant. For dichotomized metrics, sensitivity and specificity were higher based on urinary measurements (47-78%) than airborne measurements (16-50%). Few patterns were observed by metal, assessment round, or expert type. These results suggest that, for electroplating exposures, experts can achieve moderately high agreement and (reasonably) distinguish between low and high exposures when reviewing responses to in-depth questionnaires used in population-based case-control studies.

  3. A Measure of Perceived Chronic Social Adversity: Development and Validation

    Directory of Open Access Journals (Sweden)

    Jingqiu Zhang

    2017-12-01

    Full Text Available The goal of this study was to develop a measure that assesses negative daily social encounters. Specifically, we examined the concept of perceived chronic social adversity and its assessment, the Perceived Chronic Social Adversity Questionnaire (PCSAQ. The PCSAQ focused on the subjective processing of daily social experiences. Psychometric properties were examined within two non-clinical samples (N = 331 and N = 390 and one clinical sample (N = 86. Exploratory and confirmatory factor analyses supported a three-factor model of the PCSAQ, which corresponds to three types of daily social stressors. The final 28-item PCSAQ was shown to be internally consistent, and to have good construct validity in terms of factor structure and group differences. It was also shown to have good concurrent validity in terms of association with outcome variables (sense of control, happiness, and mood and anxiety symptoms. Perceived chronic social adversity was also shown to be correlated with PTSD severity. Taken together, these findings suggest that the PCSAQ is a reliable, valid, and useful measure that can be used to assess negative social and clinical aspects of personal experiences. This study is an important exploratory step in improving our understanding of the relationship between the cumulative effect of negative social encounters and psychological difficulty.

  4. Preliminary validation of FastaReada as a measure of reading fluency

    Directory of Open Access Journals (Sweden)

    Zena eElhassan

    2015-10-01

    Full Text Available Fluent reading is characterized by speed and accuracy in the decoding and comprehension of connected text. Although a variety of measures are available for the assessment of reading skills most tests do not evaluate rate of text recognition as reflected in fluent reading. Here we evaluate FastaReada, a customized computer-generated task that was developed to address some of the limitations of currently available measures of reading skills. FastaReada provides a rapid assessment of reading fluency quantified as words read per minute for connected, meaningful text. To test the criterion validity of FastaReada, 124 mainstream school children with typical sensory, mental and motor development were assessed. Performance on FastaReada was correlated with the established Neale Analysis of Reading Ability (NARA measures of text reading accuracy, rate and comprehension, and common single word measures of pseudoword (non-word reading, phonetic decoding, phonological awareness and mode of word decoding (i.e., visual or eidetic versus auditory or phonetic. The results demonstrated strong positive correlations between FastaReada performance and NARA reading rate (r = .75, accuracy (r = .83 and comprehension (r = .63 scores providing evidence for criterion-related validity. Additional evidence for criterion validity was demonstrated through strong positive correlations between FastaReada and both single word eidetic (r = .81 and phonetic decoding skills (r = .68. The results also demonstrated FastaReada to be a stronger predictor of eidetic decoding than the NARA rate measure, with FastaReada predicting 14.4% of the variance compared to 2.6% predicted by NARA rate. FastaReada was therefore deemed to be a valid tool for educators, clinicians, and research related assessment of reading accuracy and rate. As expected, analysis with hierarchical regressions also highlighted the closer relationship of fluent reading to rapid visual word recognition than to

  5. Validation of an instrument to measure quality of life in British children with inflammatory bowel disease.

    Science.gov (United States)

    Ogden, C A; Akobeng, A K; Abbott, J; Aggett, P; Sood, M R; Thomas, A G

    2011-09-01

    To validate IMPACT-III (UK), a health-related quality of life (HRQoL) instrument, in British children with inflammatory bowel disease (IBD). One hundred six children and parents were invited to participate. IMPACT-III (UK) was validated by inspection by health professionals and children to assess face and content validity, factor analysis to determine optimum domain structure, use of Cronbach alpha coefficients to test internal reliability, ANOVA to assess discriminant validity, correlation with the Child Health Questionnaire to assess concurrent validity, and use of intraclass correlation coefficients to assess test-retest reliability. The independent samples t test was used to measure differences between sexes and age groups, and between paper and computerised versions of IMPACT-III (UK). IMPACT-III (UK) had good face and content validity. The most robust factor solution was a 5-domain structure: body image, embarrassment, energy, IBD symptoms, and worries/concerns about IBD, all of which demonstrated good internal reliability (α = 0.74-0.88). Discriminant validity was demonstrated by significant (P  measure HRQoL in British children with IBD.

  6. Anatomy education environment measurement inventory: A valid tool to measure the anatomy learning environment.

    Science.gov (United States)

    Hadie, Siti Nurma Hanim; Hassan, Asma'; Ismail, Zul Izhar Mohd; Asari, Mohd Asnizam; Khan, Aaijaz Ahmed; Kasim, Fazlina; Yusof, Nurul Aiman Mohd; Manan Sulong, Husnaida Abdul; Tg Muda, Tg Fatimah Murniwati; Arifin, Wan Nor; Yusoff, Muhamad Saiful Bahri

    2017-09-01

    Students' perceptions of the education environment influence their learning. Ever since the major medical curriculum reform, anatomy education has undergone several changes in terms of its curriculum, teaching modalities, learning resources, and assessment methods. By measuring students' perceptions concerning anatomy education environment, valuable information can be obtained to facilitate improvements in teaching and learning. Hence, it is important to use a valid inventory that specifically measures attributes of the anatomy education environment. In this study, a new 11-factor, 132-items Anatomy Education Environment Measurement Inventory (AEEMI) was developed using Delphi technique and was validated in a Malaysian public medical school. The inventory was found to have satisfactory content evidence (scale-level content validity index [total] = 0.646); good response process evidence (scale-level face validity index [total] = 0.867); and acceptable to high internal consistency, with the Raykov composite reliability estimates of the six factors are in the range of 0.604-0.876. The best fit model of the AEEMI is achieved with six domains and 25 items (X 2  = 415.67, P education environment in Malaysia. A concerted collaboration should be initiated toward developing a valid universal tool that, using the methods outlined in this study, measures the anatomy education environment across different institutions and countries. Anat Sci Educ 10: 423-432. © 2017 American Association of Anatomists. © 2017 American Association of Anatomists.

  7. Assessment of teacher competence using video portfolios: reliability, construct validity and consequential validity

    NARCIS (Netherlands)

    Admiraal, W.; Hoeksma, M.; van de Kamp, M.-T.; van Duin, G.

    2011-01-01

    The richness and complexity of video portfolios endanger both the reliability and validity of the assessment of teacher competencies. In a post-graduate teacher education program, the assessment of video portfolios was evaluated for its reliability, construct validity, and consequential validity.

  8. When Assessment Data Are Words: Validity Evidence for Qualitative Educational Assessments.

    Science.gov (United States)

    Cook, David A; Kuper, Ayelet; Hatala, Rose; Ginsburg, Shiphra

    2016-10-01

    Quantitative scores fail to capture all important features of learner performance. This awareness has led to increased use of qualitative data when assessing health professionals. Yet the use of qualitative assessments is hampered by incomplete understanding of their role in forming judgments, and lack of consensus in how to appraise the rigor of judgments therein derived. The authors articulate the role of qualitative assessment as part of a comprehensive program of assessment, and translate the concept of validity to apply to judgments arising from qualitative assessments. They first identify standards for rigor in qualitative research, and then use two contemporary assessment validity frameworks to reorganize these standards for application to qualitative assessment.Standards for rigor in qualitative research include responsiveness, reflexivity, purposive sampling, thick description, triangulation, transparency, and transferability. These standards can be reframed using Messick's five sources of validity evidence (content, response process, internal structure, relationships with other variables, and consequences) and Kane's four inferences in validation (scoring, generalization, extrapolation, and implications). Evidence can be collected and evaluated for each evidence source or inference. The authors illustrate this approach using published research on learning portfolios.The authors advocate a "methods-neutral" approach to assessment, in which a clearly stated purpose determines the nature of and approach to data collection and analysis. Increased use of qualitative assessments will necessitate more rigorous judgments of the defensibility (validity) of inferences and decisions. Evidence should be strategically sought to inform a coherent validity argument.

  9. Validity of instruments to assess students' travel and pedestrian safety.

    Science.gov (United States)

    Mendoza, Jason A; Watson, Kathy; Baranowski, Tom; Nicklas, Theresa A; Uscanga, Doris K; Hanfling, Marcus J

    2010-05-18

    Safe Routes to School (SRTS) programs are designed to make walking and bicycling to school safe and accessible for children. Despite their growing popularity, few validated measures exist for assessing important outcomes such as type of student transport or pedestrian safety behaviors. This research validated the SRTS school travel survey and a pedestrian safety behavior checklist. Fourth grade students completed a brief written survey on how they got to school that day with set responses. Test-retest reliability was obtained 3-4 hours apart. Convergent validity of the SRTS travel survey was assessed by comparison to parents' report. For the measure of pedestrian safety behavior, 10 research assistants observed 29 students at a school intersection for completion of 8 selected pedestrian safety behaviors. Reliability was determined in two ways: correlations between the research assistants' ratings to that of the Principal Investigator (PI) and intraclass correlations (ICC) across research assistant ratings. The SRTS travel survey had high test-retest reliability (kappa = 0.97, n = 96, p < 0.001) and convergent validity (kappa = 0.87, n = 81, p < 0.001). The pedestrian safety behavior checklist had moderate reliability across research assistants' ratings (ICC = 0.48) and moderate correlation with the PI (r = 0.55, p = < 0.01). When two raters simultaneously used the instrument, the ICC increased to 0.65. Overall percent agreement (91%), sensitivity (85%) and specificity (83%) were acceptable. These validated instruments can be used to assess SRTS programs. The pedestrian safety behavior checklist may benefit from further formative work.

  10. Process Skill Assessment Instrument: Innovation to measure student’s learning result holistically

    Science.gov (United States)

    Azizah, K. N.; Ibrahim, M.; Widodo, W.

    2018-01-01

    Science process skills (SPS) are very important skills for students. However, the fact that SPS is not being main concern in the primary school learning is undeniable. This research aimed to develop a valid, practical, and effective assessment instrument to measure student’s SPS. Assessment instruments comprise of worksheet and test. This development research used one group pre-test post-test design. Data were obtained with validation, observation, and test method to investigate validity, practicality, and the effectivenss of the instruments. Results showed that the validity of assessment instruments is very valid, the reliability is categorized as reliable, student SPS activities have a high percentage, and there is significant improvement on student’s SPS score. It can be concluded that assessment instruments of SPS are valid, practical, and effective to be used to measure student’s SPS result.

  11. Validation of Visual Caries Activity Assessment

    DEFF Research Database (Denmark)

    Guedes, R S; Piovesan, C; Ardenghi, T M

    2014-01-01

    We evaluated the predictive and construct validity of a caries activity assessment system associated with the International Caries Detection and Assessment System (ICDAS) in primary teeth. A total of 469 children were reexamined: participants of a caries survey performed 2 yr before (follow-up rate...... of 73.4%). At baseline, children (12-59 mo old) were examined with the ICDAS and a caries activity assessment system. The predictive validity was assessed by evaluating the risk of active caries lesion progression to more severe conditions in the follow-up, compared with inactive lesions. We also...... assessed if children with a higher number of active caries lesions were more likely to develop new lesions (construct validity). Noncavitated active caries lesions at occlusal surfaces presented higher risk of progression than inactive ones. Children with a higher number of active lesions and with higher...

  12. Review and evaluation of performance measures for survival prediction models in external validation settings

    Directory of Open Access Journals (Sweden)

    M. Shafiqur Rahman

    2017-04-01

    Full Text Available Abstract Background When developing a prediction model for survival data it is essential to validate its performance in external validation settings using appropriate performance measures. Although a number of such measures have been proposed, there is only limited guidance regarding their use in the context of model validation. This paper reviewed and evaluated a wide range of performance measures to provide some guidelines for their use in practice. Methods An extensive simulation study based on two clinical datasets was conducted to investigate the performance of the measures in external validation settings. Measures were selected from categories that assess the overall performance, discrimination and calibration of a survival prediction model. Some of these have been modified to allow their use with validation data, and a case study is provided to describe how these measures can be estimated in practice. The measures were evaluated with respect to their robustness to censoring and ease of interpretation. All measures are implemented, or are straightforward to implement, in statistical software. Results Most of the performance measures were reasonably robust to moderate levels of censoring. One exception was Harrell’s concordance measure which tended to increase as censoring increased. Conclusions We recommend that Uno’s concordance measure is used to quantify concordance when there are moderate levels of censoring. Alternatively, Gönen and Heller’s measure could be considered, especially if censoring is very high, but we suggest that the prediction model is re-calibrated first. We also recommend that Royston’s D is routinely reported to assess discrimination since it has an appealing interpretation. The calibration slope is useful for both internal and external validation settings and recommended to report routinely. Our recommendation would be to use any of the predictive accuracy measures and provide the corresponding predictive

  13. Predictive validity of measurements of clinical competence using the team objective structured bedside assessment (TOSBA): assessing the clinical competence of final year medical students.

    LENUS (Irish Health Repository)

    Meagher, Frances M

    2009-11-01

    The importance of valid and reliable assessment of student competence and performance is gaining increased recognition. Provision of valid patient-based formative assessment is an increasing challenge for clinical teachers in a busy hospital setting. A formative assessment tool that reliably predicts performance in the summative setting would be of value to both students and teachers.

  14. Understanding and Measuring Evaluation Capacity: A Model and Instrument Validation Study

    Science.gov (United States)

    Taylor-Ritzler, Tina; Suarez-Balcazar, Yolanda; Garcia-Iriarte, Edurne; Henry, David B.; Balcazar, Fabricio E.

    2013-01-01

    This study describes the development and validation of the Evaluation Capacity Assessment Instrument (ECAI), a measure designed to assess evaluation capacity among staff of nonprofit organizations that is based on a synthesis model of evaluation capacity. One hundred and sixty-nine staff of nonprofit organizations completed the ECAI. The 68-item…

  15. Accuracy limitations for low velocity measurements and draft assessment in rooms

    DEFF Research Database (Denmark)

    Melikov, Arsen Krikor; Popiolek, Zbigniew J.; Silva, M.G.

    2007-01-01

    must be known in order to perform reliable assessment and validation. At present, a low-velocity thermal anemometer (LVTA) with an omnidirectional (spherical) sensor is most often used in practice for measuring air speed due to its low price and easy and convenient operation. The accuracy of the speed......, the definition of realistic requirements in thermal comfort standards as well as validation of CFD predictions is made possible.......The measurement of air temperature, mean air speed, and turbulence intensity is required in order to assess air distribution and draft discomfort in ventilated rooms. The measurements are also used for validation of computational fluid dynamics (CFD) predictions. The uncertainty of the measurements...

  16. Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

    Science.gov (United States)

    Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

    2018-01-01

    The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.

  17. The consultation and relational empathy (CARE) measure: development and preliminary validation and reliability of an empathy-based consultation process measure.

    Science.gov (United States)

    Mercer, Stewart W; Maxwell, Margaret; Heaney, David; Watt, Graham Cm

    2004-12-01

    Empathy is a key aspect of the clinical encounter but there is a lack of patient-assessed measures suitable for general clinical settings. Our aim was to develop a consultation process measure based on a broad definition of empathy, which is meaningful to patients irrespective of their socio-economic background. Qualitative and quantitative approaches were used to develop and validate the new measure, which we have called the consultation and relational empathy (CARE) measure. Concurrent validity was assessed by correlational analysis against other validated measures in a series of three pilot studies in general practice (in areas of high or low socio-economic deprivation). Face and content validity was investigated by 43 interviews with patients from both types of areas, and by feedback from GPs and expert researchers in the field. The initial version of the new measure (pilot 1; high deprivation practice) correlated strongly (r = 0.85) with the Reynolds empathy measure (RES) and the Barrett-Lennard empathy subscale (BLESS) (r = 0.63), but had a highly skewed distribution (skew -1.879, kurtosis 3.563). Statistical analysis, and feedback from the 20 patients interviewed, the GPs and the expert researchers, led to a number of modifications. The revised, second version of the CARE measure, tested in an area of low deprivation (pilot 2) also correlated strongly with the established empathy measures (r = 0.84 versus RES and r = 0.77 versus BLESS) but had a less skewed distribution (skew -0.634, kurtosis -0.067). Internal reliability of the revised version was high (Cronbach's alpha 0.92). Patient feedback at interview (n = 13) led to only minor modification. The final version of the CARE measure, tested in pilot 3 (high deprivation practice) confirmed the validation with the other empathy measures (r = 0.85 versus RES and r = 0.84 versus BLESS) and the face validity (feedback from 10 patients). These preliminary results support the validity and reliability of the CARE

  18. Translating and validating a Training Needs Assessment tool into Greek

    OpenAIRE

    Markaki, Adelais; Antonakis, Nikos; Hicks, Carolyn M; Lionis, Christos

    2007-01-01

    Abstract Background The translation and cultural adaptation of widely accepted, psychometrically tested tools is regarded as an essential component of effective human resource management in the primary care arena. The Training Needs Assessment (TNA) is a widely used, valid instrument, designed to measure professional development needs of health care professionals, especially in primary health care. This study aims to describe the translation, adaptation and validation of the TNA questionnaire...

  19. Validity and reliability of balance assessment software using the Nintendo Wii balance board: usability and validation.

    Science.gov (United States)

    Park, Dae-Sung; Lee, GyuChang

    2014-06-10

    A balance test provides important information such as the standard to judge an individual's functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment software using the Nintendo Wii Balance Board, investigated its reliability and validity, and compared it with a laboratory-grade force platform. Twenty healthy adults participated in our study. The participants participated in the test for inter-rater reliability, intra-rater reliability, and concurrent validity. The tests were performed with balance assessment software using the Nintendo Wii balance board and a laboratory-grade force platform. Data such as Center of Pressure (COP) path length and COP velocity were acquired from the assessment systems. The inter-rater reliability, the intra-rater reliability, and concurrent validity were analyzed by an intraclass correlation coefficient (ICC) value and a standard error of measurement (SEM). The inter-rater reliability (ICC: 0.89-0.79, SEM in path length: 7.14-1.90, SEM in velocity: 0.74-0.07), intra-rater reliability (ICC: 0.92-0.70, SEM in path length: 7.59-2.04, SEM in velocity: 0.80-0.07), and concurrent validity (ICC: 0.87-0.73, SEM in path length: 5.94-0.32, SEM in velocity: 0.62-0.08) were high in terms of COP path length and COP velocity. The balance assessment software incorporating the Nintendo Wii balance board was used in our study and was found to be a reliable assessment device. In clinical settings, the device can be remarkably inexpensive, portable, and convenient for the balance assessment.

  20. Reliability and validity of non-radiographic methods of thoracic kyphosis measurement: a systematic review.

    Science.gov (United States)

    Barrett, Eva; McCreesh, Karen; Lewis, Jeremy

    2014-02-01

    A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.

  1. Structural Validation of the Holistic Wellness Assessment

    Science.gov (United States)

    Brown, Charlene; Applegate, E. Brooks; Yildiz, Mustafa

    2015-01-01

    The Holistic Wellness Assessment (HWA) is a relatively new assessment instrument based on an emergent transdisciplinary model of wellness. This study validated the factor structure identified via exploratory factor analysis (EFA), assessed test-retest reliability, and investigated concurrent validity of the HWA in three separate samples. The…

  2. RELIABILITY AND VALIDITY OF SUBJECTIVE ASSESSMENT OF LUMBAR LORDOSIS IN CONVENTIONAL RADIOGRAPHY.

    Science.gov (United States)

    Ruhinda, E; Byanyima, R K; Mugerwa, H

    2014-10-01

    Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented however there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. A blinded, repeated-measures diagnostic test was carried out on lumbar spine x-ray radiographs. Radiology Department at Joint Clinical Research Centre (JCRC), Mengo-Kampala-Uganda. Seventy (70) lateral lumbar x-ray films were used for this study and were obtained from the archive of JCRC radiology department at Butikiro house, Mengo-Kampala. Poor observer agreement, both inter- and intra-observer, with kappa values of 0.16 was found. Inter-observer agreement was poorer than intra-observer agreement. Kappa values significantly rose when the lumbar lordosis was clustered into four categories without grading each abnormality. The results confirm that subjective assessment of lumbar lordosis has low reliability and validity. Film quality has limited influence on the observer reliability. This study further shows that fewer scale categories of lordosis abnormalities produce better observer reliability.

  3. Validity of Cognitive Load Measures in Simulation-Based Training: A Systematic Review.

    Science.gov (United States)

    Naismith, Laura M; Cavalcanti, Rodrigo B

    2015-11-01

    Cognitive load theory (CLT) provides a rich framework to inform instructional design. Despite the applicability of CLT to simulation-based medical training, findings from multimedia learning have not been consistently replicated in this context. This lack of transferability may be related to issues in measuring cognitive load (CL) during simulation. The authors conducted a review of CLT studies across simulation training contexts to assess the validity evidence for different CL measures. PRISMA standards were followed. For 48 studies selected from a search of MEDLINE, EMBASE, PsycInfo, CINAHL, and ERIC databases, information was extracted about study aims, methods, validity evidence of measures, and findings. Studies were categorized on the basis of findings and prevalence of validity evidence collected, and statistical comparisons between measurement types and research domains were pursued. CL during simulation training has been measured in diverse populations including medical trainees, pilots, and university students. Most studies (71%; 34) used self-report measures; others included secondary task performance, physiological indices, and observer ratings. Correlations between CL and learning varied from positive to negative. Overall validity evidence for CL measures was low (mean score 1.55/5). Studies reporting greater validity evidence were more likely to report that high CL impaired learning. The authors found evidence that inconsistent correlations between CL and learning may be related to issues of validity in CL measures. Further research would benefit from rigorous documentation of validity and from triangulating measures of CL. This can better inform CLT instructional design for simulation-based medical training.

  4. Multi-Trait Multi-Method Matrices for the Validation of Creativity and Critical Thinking Assessments for Secondary School Students in England and Greece

    Directory of Open Access Journals (Sweden)

    Ourania Maria Ventista

    2017-08-01

    Full Text Available The aim of this paper is the validation of measurement tools which assess critical thinking and creativity as general constructs instead of subject-specific skills. Specifically, this research examined whether there is convergent and discriminant (or divergent validity between measurement tools of creativity and critical thinking. For this purpose, the multi-trait and multi-method matrix suggested by Campbell and Fiske (1959 was used. This matrix presented the correlation of scores that students obtain in different assessments in order to reveal whether the assessments measure the same or different constructs. Specifically, the two methods used were written and oral exams, and the two traits measured were critical thinking and creativity. For the validation of the assessments, 30 secondary-school students in Greece and 21 in England completed the assessments. The sample in both countries provided similar results. The critical thinking tools demonstrated convergent validity when compared with each other and discriminant validity with the creativity assessments. Furthermore, creativity assessments which measure the same aspect of creativity demonstrated convergent validity. To conclude, this research provided indicators that critical thinking and creativity as general constructs can be measured in a valid way. However, since the sample was small, further investigation of the validation of the assessment tools with a bigger sample is recommended.

  5. Validation of the Neuro-QoL measurement system in children with epilepsy.

    Science.gov (United States)

    Lai, Jin-Shei; Nowinski, Cindy J; Zelko, Frank; Wortman, Katy; Burns, James; Nordli, Douglas R; Cella, David

    2015-05-01

    Children with epilepsy often face complex psychosocial consequences that are not fully captured by existing patient-reported outcome (PRO) measures. The Neurology Quality of Life Measurement System "Neuro-QoL" was developed to provide a set of common PRO measures that address issues important to people with neurologic disorders. This paper reports Neuro-QoL (anxiety, depression, interaction with peers, fatigue, pain, cognitive function, stigma, and upper and lower extremity functions) validation in children with epilepsy. Patients (aged 10-18years) diagnosed with epilepsy completed Neuro-QoL and legacy measures at time 1 (initial study visit) and 6-month follow-up. Internal consistency reliability was also evaluated. Concurrent validity was assessed by comparing Neuro-QoL measures with more established "legacy" measures of the same concepts. Clinical validity was evaluated by comparing mean Neuro-QoL scores of patients grouped by clinical anchors such as disease severity. Responsiveness of the Neuro-QoL from time 1 (initial study visit) to 6months was evaluated using self-reported change as the primary anchor. Sixty-one patients (mean age=13.4years; 62.3% male, 75.9% white) participated. Most patients (64.2%) had been seizure-free in the 3months prior to participation, and seizure frequency was otherwise described as follows: 17.8% daily, 13.3% weekly, 35.6% monthly, and 33.3% yearly. All patients were taking antiepileptic drugs. Patients reported better function/less symptoms compared to the reference groups. Internal consistency (alpha) coefficients ranged from 0.76 to 0.87. Patients with different seizure frequencies differed on anxiety (pNeuro-QoL measures were significantly correlated with other measures assessing similar domains. Stigma was related to self-reported change in several areas of functioning but in sometimes unexpected directions. The Neurology Quality of Life Measurement System is a valid and reliable assessment tool for children with epilepsy

  6. Identification of validated questionnaires to measure adherence to pharmacological antihypertensive treatments

    Science.gov (United States)

    Pérez-Escamilla, Beatriz; Franco-Trigo, Lucía; Moullin, Joanna C; Martínez-Martínez, Fernando; García-Corpas, José P

    2015-01-01

    Background Low adherence to pharmacological treatments is one of the factors associated with poor blood pressure control. Questionnaires are an indirect measurement method that is both economic and easy to use. However, questionnaires should meet specific criteria, to minimize error and ensure reproducibility of results. Numerous studies have been conducted to design questionnaires that quantify adherence to pharmacological antihypertensive treatments. Nevertheless, it is unknown whether questionnaires fulfil the minimum requirements of validity and reliability. The aim of this study was to compile validated questionnaires measuring adherence to pharmacological antihypertensive treatments that had at least one measure of validity and one measure of reliability. Methods A literature search was undertaken in PubMed, the Excerpta Medica Database (EMBASE), and the Latin American and Caribbean Health Sciences Literature database (Literatura Latino-Americana e do Caribe em Ciências da Saúde [LILACS]). References from included articles were hand-searched. The included papers were all that were published in English, French, Portuguese, and Spanish from the beginning of the database’s indexing until July 8, 2013, where a validation of a questionnaire (at least one demonstration of the validity and at least one of reliability) was performed to measure adherence to antihypertensive pharmacological treatments. Results A total of 234 potential papers were identified in the electronic database search; of these, 12 met the eligibility criteria. Within these 12 papers, six questionnaires were validated: the Morisky–Green–Levine; Brief Medication Questionnaire; Hill-Bone Compliance to High Blood Pressure Therapy Scale; Morisky Medication Adherence Scale; Treatment Adherence Questionnaire for Patients with Hypertension (TAQPH); and Martín–Bayarre–Grau. Questionnaire length ranged from four to 28 items. Internal consistency, assessed by Cronbach’s α, varied from 0

  7. The Outcome and Assessment Information Set (OASIS): A Review of Validity and Reliability

    Science.gov (United States)

    O’CONNOR, MELISSA; DAVITT, JOAN K.

    2015-01-01

    The Outcome and Assessment Information Set (OASIS) is the patient-specific, standardized assessment used in Medicare home health care to plan care, determine reimbursement, and measure quality. Since its inception in 1999, there has been debate over the reliability and validity of the OASIS as a research tool and outcome measure. A systematic literature review of English-language articles identified 12 studies published in the last 10 years examining the validity and reliability of the OASIS. Empirical findings indicate the validity and reliability of the OASIS range from low to moderate but vary depending on the item studied. Limitations in the existing research include: nonrepresentative samples; inconsistencies in methods used, items tested, measurement, and statistical procedures; and the changes to the OASIS itself over time. The inconsistencies suggest that these results are tentative at best; additional research is needed to confirm the value of the OASIS for measuring patient outcomes, research, and quality improvement. PMID:23216513

  8. Validation of measures from the smartphone sway balance application: a pilot study.

    Science.gov (United States)

    Patterson, Jeremy A; Amick, Ryan Z; Thummar, Tarunkumar; Rogers, Michael E

    2014-04-01

    A number of different balance assessment techniques are currently available and widely used. These include both subjective and objective assessments. The ability to provide quantitative measures of balance and posture is the benefit of objective tools, however these instruments are not generally utilized outside of research laboratory settings due to cost, complexity of operation, size, duration of assessment, and general practicality. The purpose of this pilot study was to assess the value and validity of using software developed to access the iPod and iPhone accelerometers output and translate that to the measurement of human balance. Thirty healthy college-aged individuals (13 male, 17 female; age = 26.1 ± 8.5 years) volunteered. Participants performed a static Athlete's Single Leg Test protocol for 10 sec, on a Biodex Balance System SD while concurrently utilizing a mobile device with balance software. Anterior/posterior stability was recorded using both devices, described as the displacement in degrees from level, and was termed the "balance score." There were no significant differences between the two reported balance scores (p = 0.818. Mean balance score on the balance platform was 1.41 ± 0.90, as compared to 1.38 ± 0.72 using the mobile device. There is a need for a valid, convenient, and cost-effective tool to objectively measure balance. Results of this study are promising, as balance score derived from the Smartphone accelerometers were consistent with balance scores obtained from a previously validated balance system. However, further investigation is necessary as this version of the mobile software only assessed balance in the anterior/posterior direction. Additionally, further testing is necessary on a healthy populations and as well as those with impairment of the motor control system. Level 2b (Observational study of validity)(1.)

  9. Assessing Students' Understanding of Macroevolution: Concerns regarding the validity of the MUM

    Science.gov (United States)

    Novick, Laura R.; Catley, Kefyn M.

    2012-11-01

    In a recent article, Nadelson and Southerland (2010. Development and preliminary evaluation of the Measure of Understanding of Macroevolution: Introducing the MUM. The Journal of Experimental Education, 78, 151-190) reported on their development of a multiple-choice concept inventory intended to assess college students' understanding of macroevolutionary concepts, the Measure of Understanding Macroevolution (MUM). Given that the only existing evolution inventories assess understanding of natural selection, a microevolutionary concept, a valid assessment of students' understanding of macroevolution would be a welcome and necessary addition to the field of science education. Although the conceptual framework underlying Nadelson and Southerland's test is promising, we believe the test has serious shortcomings with respect to validity evidence for the construct being tested. We argue and provide evidence that these problems are serious enough that the MUM should not be used in its current form to measure students' understanding of macroevolution.

  10. Towards a realistic approach to validation of reactive transport models for performance assessment

    International Nuclear Information System (INIS)

    Siegel, M.D.

    1993-01-01

    Performance assessment calculations are based on geochemical models that assume that interactions among radionuclides, rocks and groundwaters under natural conditions, can be estimated or bound by data obtained from laboratory-scale studies. The data include radionuclide distribution coefficients, measured in saturated batch systems of powdered rocks, and retardation factors measured in short-term column experiments. Traditional approaches to model validation cannot be applied in a straightforward manner to the simple reactive transport models that use these data. An approach to model validation in support of performance assessment is described in this paper. It is based on a recognition of different levels of model validity and is compatible with the requirements of current regulations for high-level waste disposal. Activities that are being carried out in support of this approach include (1) laboratory and numerical experiments to test the validity of important assumptions inherent in current performance assessment methodologies,(2) integrated transport experiments, and (3) development of a robust coupled reaction/transport code for sensitivity analyses using massively parallel computers

  11. Development and content validity of a patient reported outcomes measure to assess symptoms of major depressive disorder

    Directory of Open Access Journals (Sweden)

    Lasch Kathryn

    2012-04-01

    had 15 daily and 20 weekly items. The cognitive interviews confirmed that the instructions, item content, and response scales were understood by the patients. Conclusions Rigorous qualitative research resulted in the development of a PRO measure for MDD with supported content validity. The MDD PRO can assist in understanding and assessing MDD symptoms from patients' perspectives as well as evaluating treatment benefit of new targeted therapies.

  12. Validation of geotechnical software for repository performance assessment

    International Nuclear Information System (INIS)

    LeGore, T.; Hoover, J.D.; Khaleel, R.; Thornton, E.C.; Anantatmula, R.P.; Lanigan, D.C.

    1989-01-01

    An important step in the characterization of a high level nuclear waste repository is to demonstrate that geotechnical software, used in performance assessment, correctly models validation. There is another type of validation, called software validation. It is based on meeting the requirements of specifications documents (e.g. IEEE specifications) and does not directly address the correctness of the specifications. The process of comparing physical experimental results with the predicted results should incorporate an objective measure of the level of confidence regarding correctness. This paper reports on a methodology developed that allows the experimental uncertainties to be explicitly included in the comparison process. The methodology also allows objective confidence levels to be associated with the software. In the event of a poor comparison, the method also lays the foundation for improving the software

  13. Reliability and validity of the Pragmatics Observational Measure (POM): a new observational measure of pragmatic language for children.

    Science.gov (United States)

    Cordier, Reinie; Munro, Natalie; Wilkes-Gillan, Sarah; Speyer, Renée; Pearce, Wendy M

    2014-07-01

    There is a need for a reliable and valid assessment of childhood pragmatic language skills during peer-peer interactions. This study aimed to evaluate the psychometric properties of a newly developed pragmatic assessment, the Pragmatic Observational Measure (POM). The psychometric properties of the POM were investigated from observational data of two studies - study 1 involved 342 children aged 5-11 years (108 children with ADHD; 108 typically developing playmates; 126 children in the control group), and study 2 involved 9 children with ADHD who attended a 7-week play-based intervention. The psychometric properties of the POM were determined based on the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) taxonomy of psychometric properties and definitions for health-related outcomes; the Pragmatic Protocol was used as the reference tool against which the POM was evaluated. The POM demonstrated sound psychometric properties in all the reliability, validity and interpretability criteria against which it was assessed. The findings showed that the POM is a reliable and valid measure of pragmatic language skills of children with ADHD between the age of 5 and 11 years and has clinical utility in identifying children with pragmatic language difficulty. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.

  14. Development and validation of a preference weight multiattribute health outcome measure for rheumatoid arthritis.

    Science.gov (United States)

    Chiou, Chiun-Fang; Suarez-Almazor, Maria E; Sherbourne, Cathy D; Chang, Chih-Hung; Reyes, Carolina; Dylan, Michelle; Ofman, Joshua; Wallace, Daniel J; Mizutani, Wesley; Weisman, Michael

    2006-12-01

    To develop and validate multiattribute measures for patients with rheumatoid arthritis (RA) to report health states and estimate preference weights. Survey materials were mailed to 748 patients. Factor analysis, an item response theory-based model, and an internal consistency test were used to identify attributes and evaluate items. Two multiattribute preference weight functions (MAPWF) were constructed. Construct validity of the new measures was then tested. Four hundred eighty-seven patients returned the survey; 24 items on 6 health attributes were selected to form the new outcomes measure. Two MAPWF were derived with preference weights measured with time tradeoff and visual analog scales as dependent variables. All validity test results were statistically significant. Our results reveal that the new measures are reliable and valid in assessing health states and associated preference weights of patients with RA.

  15. Geostatistical validation and cross-validation of magnetometric measurements of soil pollution with Potentially Toxic Elements in problematic areas

    Science.gov (United States)

    Fabijańczyk, Piotr; Zawadzki, Jarosław

    2016-04-01

    Field magnetometry is fast method that was previously effectively used to assess the potential soil pollution. One of the most popular devices that are used to measure the soil magnetic susceptibility on the soil surface is a MS2D Bartington. Single reading using MS2D device of soil magnetic susceptibility is low time-consuming but often characterized by considerable errors related to the instrument or environmental and lithogenic factors. In this connection, measured values of soil magnetic susceptibility have to be usually validated using more precise, but also much more expensive, chemical measurements. The goal of this study was to analyze validation methods of magnetometric measurements using chemical analyses of a concentration of elements in soil. Additionally, validation of surface measurements of soil magnetic susceptibility was performed using selected parameters of a distribution of magnetic susceptibility in a soil profile. Validation was performed using selected geostatistical measures of cross-correlation. The geostatistical approach was compared with validation performed using the classic statistics. Measurements were performed at selected areas located in the Upper Silesian Industrial Area in Poland, and in the selected parts of Norway. In these areas soil magnetic susceptibility was measured on the soil surface using a MS2D Bartington device and in the soil profile using MS2C Bartington device. Additionally, soil samples were taken in order to perform chemical measurements. Acknowledgment The research leading to these results has received funding from the Polish-Norwegian Research Programme operated by the National Centre for Research and Development under the Norwegian Financial Mechanism 2009-2014 in the frame of Project IMPACT - Contract No Pol-Nor/199338/45/2013.

  16. The measurement of instrumental ADL: content validity and construct validity

    DEFF Research Database (Denmark)

    Avlund, K; Schultz-Larsen, K; Kreiner, S

    1993-01-01

    do not depend on help. It is also possible to add the items in a valid way. However, to obtain valid IADL-scales, we omitted items that were highly relevant to especially elderly women, such as house-work items. We conclude that the criteria employed for this IADL-measure are somewhat contradictory....... showed that 14 items could be combined into two qualitatively different additive scales. The IADL-measure complies with demands for content validity, distinguishes between what the elderly actually do, and what they are capable of doing, and is a good discriminator among the group of elderly persons who...

  17. Development of assessment instruments to measure critical thinking skills

    Science.gov (United States)

    Sumarni, W.; Supardi, K. I.; Widiarti, N.

    2018-04-01

    Assessment instruments that is commonly used in the school generally have not been orientated on critical thinking skills. The purpose of this research is to develop assessment instruments to measure critical thinking skills, to test validity, reliability, and practicality. This type of research is Research and Development. There are two stages on the preface step, which are field study and literacy study. On the development steps, there some parts, which are 1) instrument construction, 2) expert validity, 3) limited scale tryout and 4) narrow scale try-out. The developed assessment instrument are analysis essay and problem solving. Instruments were declared valid, reliable and practical.

  18. Brief report: The Brief Alcohol Social Density Assessment (BASDA): convergent, criterion-related, and incremental validity.

    Science.gov (United States)

    MacKillop, James; Acker, John D; Bollinger, Jared; Clifton, Allan; Miller, Joshua D; Campbell, W Keith; Goodie, Adam S

    2013-09-01

    Alcohol misuse is substantially influenced by social factors, but systematic assessments of social network drinking are typically lengthy. The goal of the present study was to provide further validation of a brief measure of social network alcohol use, the Brief Alcohol Social Density Assessment (BASDA), in a sample of emerging adults. Specifically, the study sought to examine the BASDA's convergent, criterion, and incremental validity in relation to well-established measures of drinking motives and problematic drinking. Participants were 354 undergraduates who were assessed using the BASDA, the Alcohol Use Disorders Identification Test (AUDIT), and the Drinking Motives Questionnaire. Significant associations were observed between the BASDA index of alcohol-related social density and alcohol misuse, social motives, and conformity motives, supporting convergent validity. Criterion-related validity was supported by evidence that significantly greater alcohol involvement was present in the social networks of individuals scoring at or above an AUDIT score of 8, a validated criterion for hazardous drinking. Finally, the BASDA index was significantly associated with alcohol misuse above and beyond drinking motives in relation to AUDIT scores, supporting incremental validity. Taken together, these findings provide further support for the BASDA as an efficient measure of drinking in an individual's social network. Methodological considerations as well as recommendations for future investigations in this area are discussed.

  19. Validation of a scenario-based assessment of critical thinking using an externally validated tool.

    Science.gov (United States)

    Buur, Jennifer L; Schmidt, Peggy; Smylie, Dean; Irizarry, Kris; Crocker, Carlos; Tyler, John; Barr, Margaret

    2012-01-01

    With medical education transitioning from knowledge-based curricula to competency-based curricula, critical thinking skills have emerged as a major competency. While there are validated external instruments for assessing critical thinking, many educators have created their own custom assessments of critical thinking. However, the face validity of these assessments has not been challenged. The purpose of this study was to compare results from a custom assessment of critical thinking with the results from a validated external instrument of critical thinking. Students from the College of Veterinary Medicine at Western University of Health Sciences were administered a custom assessment of critical thinking (ACT) examination and the externally validated instrument, California Critical Thinking Skills Test (CCTST), in the spring of 2011. Total scores and sub-scores from each exam were analyzed for significant correlations using Pearson correlation coefficients. Significant correlations between ACT Blooms 2 and deductive reasoning and total ACT score and deductive reasoning were demonstrated with correlation coefficients of 0.24 and 0.22, respectively. No other statistically significant correlations were found. The lack of significant correlation between the two examinations illustrates the need in medical education to externally validate internal custom assessments. Ultimately, the development and validation of custom assessments of non-knowledge-based competencies will produce higher quality medical professionals.

  20. Positive mental health literacy: development and validation of a measure among Norwegian adolescents.

    Science.gov (United States)

    Bjørnsen, Hanne Nissen; Eilertsen, Mary Elizabeth Bradley; Ringdal, Regine; Espnes, Geir Arild; Moksnes, Unni Karin

    2017-09-18

    Mental health literacy (MHL), or the knowledge and abilities necessary to benefit mental health, is a significant determinant of mental health and has the potential to benefit both individual and public mental health. MHL and its measures have traditionally focused on knowledge and beliefs about mental -ill-health rather than on mental health. No measures of MHL addressing knowledge of good or positive mental health have been identified. This study aimed to develop and validate an instrument measuring adolescents' knowledge of how to obtain and maintain good mental health and to evaluate the psychometric properties of the instrument. More specifically, the factor structure, internal and construct validity, and test-retest reliability were assessed. The participants were Norwegian upper secondary school students aged 15-21 years. The development and validation of the instrument entailed three phases: 1) item generation based on the basic psychological needs theory (BPNT), focus group interviews, and a narrative literature review, 2) a pilot study (n = 479), and 3) test-retest (n = 149), known-groups validity (n = 44), and scale construction, item reduction through principal component analysis (PCA), and confirmatory factor analysis (CFA) for factor structure and psychometric properties assessment (n = 1888). Thirty-two items were initially generated, and 15 were selected for the pilot study. PCA identified cross-loadings, and a one-factor solution was examined. After removing five problematic items, CFA yielded a satisfactory fit for a 10-item one-factor model, referred to as the mental health-promoting knowledge (MHPK-10) measure. The test-retest evaluation supported the stability of the measure. McDonald's omega was 0.84, and known-groups validity test indicated good construct validity. A valid and reliable one-dimensional instrument measuring knowledge of factors promoting good mental health among adolescents was developed. The instrument has the

  1. Positive mental health literacy: development and validation of a measure among Norwegian adolescents

    Directory of Open Access Journals (Sweden)

    Hanne Nissen Bjørnsen

    2017-09-01

    Full Text Available Abstract Background Mental health literacy (MHL, or the knowledge and abilities necessary to benefit mental health, is a significant determinant of mental health and has the potential to benefit both individual and public mental health. MHL and its measures have traditionally focused on knowledge and beliefs about mental -ill-health rather than on mental health. No measures of MHL addressing knowledge of good or positive mental health have been identified. Aim: This study aimed to develop and validate an instrument measuring adolescents’ knowledge of how to obtain and maintain good mental health and to evaluate the psychometric properties of the instrument. More specifically, the factor structure, internal and construct validity, and test-retest reliability were assessed. Methods The participants were Norwegian upper secondary school students aged 15–21 years. The development and validation of the instrument entailed three phases: 1 item generation based on the basic psychological needs theory (BPNT, focus group interviews, and a narrative literature review, 2 a pilot study (n = 479, and 3 test-retest (n = 149, known-groups validity (n = 44, and scale construction, item reduction through principal component analysis (PCA, and confirmatory factor analysis (CFA for factor structure and psychometric properties assessment (n = 1888. Results Thirty-two items were initially generated, and 15 were selected for the pilot study. PCA identified cross-loadings, and a one-factor solution was examined. After removing five problematic items, CFA yielded a satisfactory fit for a 10-item one-factor model, referred to as the mental health-promoting knowledge (MHPK-10 measure. The test-retest evaluation supported the stability of the measure. McDonald’s omega was 0.84, and known-groups validity test indicated good construct validity. Conclusion A valid and reliable one-dimensional instrument measuring knowledge of factors promoting good mental

  2. Validation of screening tools to assess appetite among geriatric patients.

    Science.gov (United States)

    Hanisah, R; Suzana, S; Lee, F S

    2012-07-01

    Poor appetite is one of the main contributing factors of poor nutritional status among elderly individuals. Recognizing the importance of assessment of appetite, a cross sectional study was conducted to determine the validity of appetite screening tools namely, the Council on Nutrition Appetite questionnaire (CNAQ) and the simplified nutritional appetite questionnaire (SNAQ) against the appetite, hunger and sensory perception questionnaire (AHSPQ), measures of nutritional status and food intake among geriatric patients at the main general hospital in Malaysia. Nutritional status was assessed using the subjective global assessment (SGA) while food intake was measured using the dietary history questionnaire (DHQ). Anthropometric parameters included weight, height, body mass index (BMI), calf circumference (CC) and mid upper arm circumference (MUAC). A total of 145 subjects aged 60 to 86 years (68.3 ± 5.8 years) with 31.7% men and 68.3% women were recruited from outpatients (35 subjects) and inpatients (110 subjects) of Kuala Lumpur Hospital of Malaysia. As assessed by SGA, most subjects were classified as mild to moderately malnourished (50.4%), followed by normal (38.6%) and severely malnourished (11.0%). A total of 79.3% and 57.2% subjects were classified as having poor appetite according to CNAQ and SNAQ, respectively. CNAQ (80.9%) had a higher sensitivity than SNAQ (69.7%) when validated against nutritional status as assessed using SGA. However, the specificity of SNAQ (62.5%) was higher than CNAQ (23.2%). Positive predictive value for CNAQ and SNAQ were 62.6% and 74.7%, respectively. Cronbach's alpha for CNAQ and SNAQ were 0.546 and 0.578, respectively. History of weight loss over the past one year (Adjusted odds ratio 2.49) (p risk factors for poor appetite among subjects. In conclusion, malnutrition and poor appetite were prevalent among the geriatric outpatients and inpatients. SNAQ was more reliable and valid as an appetite screening tool among this special

  3. The Irvine, Beatties, and Bresnahan (IBB) Forelimb Recovery Scale: An Assessment of Reliability and Validity

    Science.gov (United States)

    Irvine, Karen-Amanda; Ferguson, Adam R.; Mitchell, Kathleen D.; Beattie, Stephanie B.; Lin, Amity; Stuck, Ellen D.; Huie, J. Russell; Nielson, Jessica L.; Talbott, Jason F.; Inoue, Tomoo; Beattie, Michael S.; Bresnahan, Jacqueline C.

    2014-01-01

    The IBB scale is a recently developed forelimb scale for the assessment of fine control of the forelimb and digits after cervical spinal cord injury [SCI; (1)]. The present paper describes the assessment of inter-rater reliability and face, concurrent and construct validity of this scale following SCI. It demonstrates that the IBB is a reliable and valid scale that is sensitive to severity of SCI and to recovery over time. In addition, the IBB correlates with other outcome measures and is highly predictive of biological measures of tissue pathology. Multivariate analysis using principal component analysis (PCA) demonstrates that the IBB is highly predictive of the syndromic outcome after SCI (2), and is among the best predictors of bio-behavioral function, based on strong construct validity. Altogether, the data suggest that the IBB, especially in concert with other measures, is a reliable and valid tool for assessing neurological deficits in fine motor control of the distal forelimb, and represents a powerful addition to multivariate outcome batteries aimed at documenting recovery of function after cervical SCI in rats. PMID:25071704

  4. Validation of the use of synthetic imagery for camouflage effectiveness assessment

    Science.gov (United States)

    Newman, Sarah; Gilmore, Marilyn A.; Moorhead, Ian R.; Filbee, David R.

    2002-08-01

    CAMEO-SIM was developed as a laboratory method to assess the effectiveness of aircraft camouflage schemes. It is a physically accurate synthetic image generator, rendering in any waveband between 0.4 and 14 microns. Camouflage schemes are assessed by displaying imagery to observers under controlled laboratory conditions or by analyzing the digital image and calculating the contrast statistics between the target and background. Code verification has taken place during development. However, validation of CAMEO-SIM is essential to ensure that the imagery produced is suitable to be used for camouflage effectiveness assessment. Real world characteristics are inherently variable, so exact pixel to pixel correlation is unnecessary. For camouflage effectiveness assessment it is more important to be confident that the comparative effects of different schemes are correct, but prediction of detection ranges is also desirable. Several different tests have been undertaken to validate CAMEO-SIM for the purpose of assessing camouflage effectiveness. Simple scenes have been modeled and measured. Thermal and visual properties of the synthetic and real scenes have been compared. This paper describes the validation tests and discusses the suitability of CAMEO-SIM for camouflage assessment.

  5. Reliability and validity of a nutrition and physical activity environmental self-assessment for child care

    Directory of Open Access Journals (Sweden)

    Ammerman Alice S

    2007-07-01

    Full Text Available Abstract Background Few assessment instruments have examined the nutrition and physical activity environments in child care, and none are self-administered. Given the emerging focus on child care settings as a target for intervention, a valid and reliable measure of the nutrition and physical activity environment is needed. Methods To measure inter-rater reliability, 59 child care center directors and 109 staff completed the self-assessment concurrently, but independently. Three weeks later, a repeat self-assessment was completed by a sub-sample of 38 directors to assess test-retest reliability. To assess criterion validity, a researcher-administered environmental assessment was conducted at 69 centers and was compared to a self-assessment completed by the director. A weighted kappa test statistic and percent agreement were calculated to assess agreement for each question on the self-assessment. Results For inter-rater reliability, kappa statistics ranged from 0.20 to 1.00 across all questions. Test-retest reliability of the self-assessment yielded kappa statistics that ranged from 0.07 to 1.00. The inter-quartile kappa statistic ranges for inter-rater and test-retest reliability were 0.45 to 0.63 and 0.27 to 0.45, respectively. When percent agreement was calculated, questions ranged from 52.6% to 100% for inter-rater reliability and 34.3% to 100% for test-retest reliability. Kappa statistics for validity ranged from -0.01 to 0.79, with an inter-quartile range of 0.08 to 0.34. Percent agreement for validity ranged from 12.9% to 93.7%. Conclusion This study provides estimates of criterion validity, inter-rater reliability and test-retest reliability for an environmental nutrition and physical activity self-assessment instrument for child care. Results indicate that the self-assessment is a stable and reasonably accurate instrument for use with child care interventions. We therefore recommend the Nutrition and Physical Activity Self-Assessment for

  6. Concurrent measurement of "real-world" stress and arousal in individuals with psychosis: assessing the feasibility and validity of a novel methodology.

    Science.gov (United States)

    Kimhy, David; Delespaul, Philippe; Ahn, Hongshik; Cai, Shengnan; Shikhman, Marina; Lieberman, Jeffrey A; Malaspina, Dolores; Sloan, Richard P

    2010-11-01

    Psychosis has been repeatedly suggested to be affected by increases in stress and arousal. However, there is a dearth of evidence supporting the temporal link between stress, arousal, and psychosis during "real-world" functioning. This paucity of evidence may stem from limitations of current research methodologies. Our aim is to the test the feasibility and validity of a novel methodology designed to measure concurrent stress and arousal in individuals with psychosis during "real-world" daily functioning. Twenty patients with psychosis completed a 36-hour ambulatory assessment of stress and arousal. We used experience sampling method with palm computers to assess stress (10 times per day, 10 AM → 10 PM) along with concurrent ambulatory measurement of cardiac autonomic regulation using a Holter monitor. The clocks of the palm computer and Holter monitor were synchronized, allowing the temporal linking of the stress and arousal data. We used power spectral analysis to determine the parasympathetic contributions to autonomic regulation and sympathovagal balance during 5 minutes before and after each experience sample. Patients completed 79% of the experience samples (75% with a valid concurrent arousal data). Momentary increases in stress had inverse correlation with concurrent parasympathetic activity (ρ = -.27, P feasibility and validity of our methodology in individuals with psychosis. The methodology offers a novel way to study in high time resolution the concurrent, "real-world" interactions between stress, arousal, and psychosis. The authors discuss the methodology's potential applications and future research directions.

  7. Measuring production loss due to health and work environment problems: construct validity and implications.

    Science.gov (United States)

    Karlsson, Malin Lohela; Bergström, Gunnar; Björklund, Christina; Hagberg, Jan; Jensen, Irene

    2013-12-01

    The aim was to validate two measures of production loss, health-related and work environment-related production loss, concerning their associations with health status and work environment factors. Validity was assessed by evaluating the construct validity. Health problems related and work environment-related problems (or factors) were included in separate analyses and evaluated regarding the significant difference in proportion of explained variation (R) of production loss. health problems production loss was not found to fulfill the criteria for convergent validity in this study; however, the measure of work environment-related production loss did fulfill the criteria that were set up. The measure of work environment-related production loss can be used to screen for production loss due to work environment problems as well as an outcome measure when evaluating the effect of organizational interventions.

  8. Validation of the Chinese version of functional assessment of anorexia-cachexia therapy (FAACT) scale for measuring quality of life in cancer patients with cachexia.

    Science.gov (United States)

    Zhou, Ting; Yang, Kaixiang; Thapa, Sudip; Fu, Qiang; Jiang, Yongsheng; Yu, Shiying

    2017-04-01

    The assessment of quality of life (QOL) is an important part of cachexia management for cancer patients. Functional assessment of anorexia-cachexia therapy (FAACT), a specific QOL instrument for cachexia patients, has not been validated in Chinese population. The aim of this study was to validate the FAACT scale in Chinese cancer patients for its future use. Eligible cancer patients were included in our study. Patients' demographic and clinical characteristics were collected from the electronic medical records. Patients were asked to complete the Chinese version of FAACT scale and the MD Anderson symptom inventory (MDASI), and then the reliability and validity were analyzed. A total of 285 patients were enrolled in our study, data of 241 patients were evaluated. Coefficients of Cronbach's alpha, test-retest and split-half analyses were all greater than 0.8, which indicated an excellent reliability for FAACT scale. In item-subscale correlation analysis and factor analysis, good construct validity for FAACT scale was found. The correlation between FAACT and MDASI interference subscale showed reasonable criterion-related validity, and for further clinical validation, the FAACT scale showed excellent discriminative validity for distinguishing patients in different cachexia status and in different performance status. The Chinese version of FAACT scale has good reliability and validity and is suitable for measuring QOL of cachexia patients in Chinese population.

  9. Publishing nutrition research: validity, reliability, and diagnostic test assessment in nutrition-related research.

    Science.gov (United States)

    Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara

    2010-03-01

    This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.

  10. Development and preliminary validation of a self-report measure of psychopathic personality traits in noncriminal populations.

    Science.gov (United States)

    Lilienfeld, S O; Andrews, B P

    1996-06-01

    Research on psychopathology has been hindered by persisting difficulties and controversies regarding its assessment. The primary goals of this set of studies were to (a) develop, and initiate the construct validation of, a self-report measure that assesses the major personality traits of psychopathy in noncriminal populations and (b) clarify the nature of these traits via an exploratory approach to test construction. This measure, the Psychopathic Personality Inventory (PPI), was developed by writing items to assess a large number of personality domains relevant to psychopathy and performing successive item-level factor analyses and revisions on three undergraduate samples. The PPI total score and its eight subscales were found to possess satisfactory internal consistency and test-retest reliability. In four studies with undergraduates, the PPI and its subscales exhibited a promising pattern of convergent and discriminant validity with self-report, psychiatric interview, observer rating, and family history data. In addition, the PPI total score demonstrated incremental validity relative to several commonly used self-report psychopathy-related measures. Future construct validation studies, unresolved conceptual issues regarding the assessment of psychopathy, and potential research uses of the PPI are outlined.

  11. Development and initial validation of a measure of work, family, and school conflict.

    Science.gov (United States)

    Olson, Kristine J

    2014-01-01

    This study reports the development and initial validation of a theoretically based measure of conflict between work, family, and college student roles. The measure was developed through the assessment of construct definitions and an assessment of measurement items by subject matter experts. Then, the measurement items were assessed with data from 500 college students who were engaged in work and family responsibilities. The results indicate that conflict between work, family, and school are effectively measured by 12 factors assessing the direction of conflict (e.g., work-to-school conflict, and school-to-work conflict) as well as the form of conflict (i.e., time, strain, and behavior based conflict). Sets of exploratory and confirmatory factor analyses demonstrated that the 12 factors of the new measure are distinct from the 6 factors of the Carlson, Kacmar, and Williams (2000) work-family conflict measure. Criterion validity of the measure was established through a series of regression analyses testing hypothesized relationships between antecedent and outcome variables with role conflict. Results indicate that role demand was a robust predictor of role conflict. To extend the literature, core self-evaluations and emotional stability were established as predictors of role conflict. Further, work, family, and school role satisfaction were significantly impacted with the presence of role conflict between work, family, and school. PsycINFO Database Record (c) 2014 APA, all rights reserved.

  12. Measurement of predictive validity in violence risk assessment studies: a second-order systematic review.

    Science.gov (United States)

    Singh, Jay P; Desmarais, Sarah L; Van Dorn, Richard A

    2013-01-01

    The objective of the present review was to examine how predictive validity is analyzed and reported in studies of instruments used to assess violence risk. We reviewed 47 predictive validity studies published between 1990 and 2011 of 25 instruments that were included in two recent systematic reviews. Although all studies reported receiver operating characteristic curve analyses and the area under the curve (AUC) performance indicator, this methodology was defined inconsistently and findings often were misinterpreted. In addition, there was between-study variation in benchmarks used to determine whether AUCs were small, moderate, or large in magnitude. Though virtually all of the included instruments were designed to produce categorical estimates of risk - through the use of either actuarial risk bins or structured professional judgments - only a minority of studies calculated performance indicators for these categorical estimates. In addition to AUCs, other performance indicators, such as correlation coefficients, were reported in 60% of studies, but were infrequently defined or interpreted. An investigation of sources of heterogeneity did not reveal significant variation in reporting practices as a function of risk assessment approach (actuarial vs. structured professional judgment), study authorship, geographic location, type of journal (general vs. specialized audience), sample size, or year of publication. Findings suggest a need for standardization of predictive validity reporting to improve comparison across studies and instruments. Copyright © 2013 John Wiley & Sons, Ltd.

  13. Validity of a parent-report measure of vocabulary and grammar for Spanish-speaking toddlers.

    Science.gov (United States)

    Thal, D; Jackson-Maldonado, D; Acosta, D

    2000-10-01

    The validity of the Fundación MacArthur Inventario del Desarrollo de Habilidades Comunicativas: Palabras y Enunciados (IDHC:PE) was examined with twenty 20- and nineteen 28-month-old, typically developing, monolingual, Spanish-speaking children living in Mexico. One measure of vocabulary (number of words) and two measures of grammar (mean of the three longest utterances and grammatical complexity score) from the IDHC:PE were compared to behavioral measures of vocabulary (number of different words from a language sample and number of objects named in a confrontation naming task) and one behavioral measure of grammar (mean length of utterance from a language sample). Only vocabulary measures were assessed in the 20-month-olds because of floor effects on the grammar measures. Results indicated validity for assessing expressive vocabulary in 20-month-olds and expressive vocabulary and grammar in 28-month-olds.

  14. Assessing Self-Regulated Strategies for School Writing: Cross-Cultural Validation of a Triadic Measure

    Science.gov (United States)

    Malpique, Anabela Abreu; Veiga Simão, Ana Margarida

    2015-01-01

    This study reports on the construction of a questionnaire to assess ninth-grade students' use of self-regulated strategies for school writing tasks. Exploratory and confirmatory factorial analyses were conducted to validate the factor structure of the instrument. The initial factor analytic stage (n = 296) revealed a 13-factor scale, accounting…

  15. A Turkish version of myocardial infarction dimensional assessment scale (TR-MIDAS): reliability-validity assesment.

    Science.gov (United States)

    Uysal, Hilal; Ozcan, Şeyda

    2011-06-01

    Many new measuring devices have been developed so that broader psychometric measurements in the coronary artery disease, disease-specific health status measurements, and identification of the broader quality of life can be performed in the recent years. The study was intended to determine whether, and to what extent, MIDAS is a valid and reliable measurement to the patients suffering from myocardial infarction for the first time in Turkey. The research was conducted with the patients hospitalized and treated with myocardial infarction in the cardiology departments of 2 hospitals in Istanbul, Turkey, between 2007 and 2008. Psychometric evaluations of TR-MIDAS were used for validity studies; language validity, content validity, construct validity were examined. For reliability studies; the tool's internal consistency reliability, Cronbach's alpha reliability coefficient, and test-retest reliability were completed. The instrument's content validity index was determined to be "0.95". Principal component analysis revealed six factors with an eigenvalue >1.5. Cronbach's alpha was found to be 0.89 for total scale which was an acceptable value. The total's test-retest reliability was 0.51 (p<0.01). Data obtained at the end of the study supports that Turkish Myocardial Infarction Dimensional Assessment Scale is a valid and reliable instrument as a disease-specific scale to assess the patients' quality of life suffering from myocardial infarction in Turkey. Copyright © 2010 European Society of Cardiology. Published by Elsevier B.V. All rights reserved.

  16. Quantum mechanics concept assessment: Development and validation study

    Directory of Open Access Journals (Sweden)

    Homeyra R. Sadaghiani

    2015-03-01

    Full Text Available As part of an ongoing investigation of students’ learning in first semester upper-division quantum mechanics, we needed a high-quality conceptual assessment instrument for comparing outcomes of different curricular approaches. The process of developing such a tool started with converting a preliminary version of a 14-item open-ended quantum mechanics assessment tool (QMAT to a multiple-choice (MC format. Further question refinement, development of effective distractors, adding new questions, and robust statistical analysis has led to a 31-item quantum mechanics concept assessment (QMCA test. The QMCA is used as post-test only to assess students’ knowledge about five main topics of quantum measurement: the time-independent Schrödinger equation, wave functions and boundary conditions, time evolution, and probability density. During two years of testing and refinement, the QMCA has been given in alpha (N=61 and beta versions (N=263 to students in upper division quantum mechanics courses at 11 different institutions with an average post-test score of 54%. By allowing for comparisons of student learning across different populations and institutions, the QMCA provides instructors and researchers a more standard measure of effectiveness of different curricula or teaching strategies on student conceptual understanding of quantum mechanics. In this paper, we discuss the construction of effective distractors and the use of student interviews and expert feedback to revise and validate both questions and distractors. We include the results of common statistical tests of reliability and validity, which suggest the instrument is presently in a stable, usable, and promising form.

  17. Clinical reliability and validity of elbow functional assessment in rheumatoid arthritis.

    NARCIS (Netherlands)

    Boer, Y.A. de; Ende, C.H.M. van den; Eygendaal, D.; Jolie, I.M.M.; Hazes, J.M.W.; Rozing, P.M.

    1999-01-01

    OBJECTIVES: (1) To investigate the measurement characteristics of the Hospital for Special Surgery (HSS) and Mayo Clinic elbow assessment instruments, utilizing methodological criteria including feasibility, reliability, validity, and discriminative ability; and (2) to develop an efficient and

  18. Clinical and psychometric validation of the psychotic depression assessment scale

    DEFF Research Database (Denmark)

    Østergaard, Søren D; Pedersen, Christina H; Uggerby, Peter

    2015-01-01

    BACKGROUND: Recent studies have indicated that the 11-item Psychotic Depression Assessment Scale (PDAS), consisting of the 6-item melancholia subscale (HAM-D6) of the Hamilton Depression Rating Scale and 5 psychosis items from the Brief Psychiatric Rating Scale (BPRS), is a valid measure for the ...

  19. A Validation Study of the Student Oral Proficiency Assessment (SOPA).

    Science.gov (United States)

    Thompson, Lynn E.; Kenyon, Dorry M.; Rhodes, Nancy C.

    This study validated the Student Oral Proficiency Assessment (SOPA), an oral proficiency instrument designed for students in elementary foreign language programs. Elementary students who were tested with the SOPA were also administered other instruments designed to measure proficiency. These instruments included the Stanford Foreign Language Oral…

  20. Is a sphygmomanometer a valid and reliable tool to measure the isometric strength of hip muscles? A systematic review.

    Science.gov (United States)

    Toohey, Liam Anthony; De Noronha, Marcos; Taylor, Carolyn; Thomas, James

    2015-02-01

    Muscle strength measurement is a key component of physiotherapists' assessment and is frequently used as an outcome measure. A sphygmomanometer is an instrument commonly used to measure blood pressure that can be potentially used as a tool to assess isometric muscle strength. To systematically review the evidence on the reliability and validity of a sphygmomanometer for measuring isometric strength of hip muscles. A literature search was conducted across four databases. Studies were eligible if they presented data on reliability and/or validity, used a sphygmomanometer to measure isometric muscle strength of the hip region, and were peer reviewed. The individual studies were evaluated for quality using a standardized critical appraisal tool. A total of 644 articles were screened for eligibility, with five articles chosen for inclusion. The use of a sphygmomanometer to objectively assess isometric muscle strength of the hip muscles appears to be reliable with intraclass correlation coefficient values ranging from 0.66 to 0.94 in elderly and young populations. No studies were identified that have assessed the validity of a sphygmomanometer. The sphygmomanometer appears to be reliable for assessment of isometric muscle strength around the hip joint, but further research is warranted to establish its validity.

  1. Improving the quality of discrete-choice experiments in health: how can we assess validity and reliability?

    Science.gov (United States)

    Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P

    2017-12-01

    The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.

  2. A Meta-Analysis of the Convergent Validity of Self-Control Measures

    Science.gov (United States)

    Duckworth, Angela Lee; Kern, Margaret L.

    2011-01-01

    There is extraordinary diversity in how the construct of self-control is operationalized in research studies. We meta-analytically examined evidence of convergent validity among executive function, delay of gratification, and self- and informant-report questionnaire measures of self-control. Overall, measures demonstrated moderate convergence (rrandom = .27 [95% CI = .24, .30]; rfixed = .34 [.33, .35], k = 282 samples, N = 33,564 participants), although there was substantial heterogeneity in the observed correlations. Correlations within and across types of self-control measures were strongest for informant-report questionnaires and weakest for executive function tasks. Questionnaires assessing sensation seeking impulses could be distinguished from questionnaires assessing processes of impulse regulation. We conclude that self-control is a coherent but multidimensional construct best assessed using multiple methods. PMID:21643479

  3. The reliability and validity of radiological assessment for patellar instability. A systematic review and meta-analysis

    Energy Technology Data Exchange (ETDEWEB)

    Smith, Toby O. [University of East Anglia, Faculty of Health, Norwich (United Kingdom); Davies, Leigh [Norfolk and Norwich University Hospital, Norwich (United Kingdom); Toms, Andoni P.; Donell, Simon T. [University of East Anglia, Faculty of Health, Norwich (United Kingdom); Norfolk and Norwich University Hospital, Norwich (United Kingdom); Hing, Caroline B. [St George' s Hospital, London (United Kingdom)

    2011-04-15

    To determine the discriminative validity and reliability of the evidence base using meta-analysis. A review of published sources using the databases AMED, CINHAL, EMBASE, MEDLINE, Scopus and the Cochrane Library, and for unpublished material was conducted. All studies assessing the reliability, validity, sensitivity or specificity of magnetic resonance imaging (MRI), computed tomography (CT) or ultrasound (US) of the patellofemoral joint of patients following patellar dislocation, subluxation or instability, were included. A meta-analysis was performed to assess the difference in radiological measurements between healthy controls and subjects with patellar instability in order to assess discrimination validity. A narrative assessment was used to evaluate the inter- and intra-observer reliability as well as the sensitivity and specificity of specific radiological measurements. A total of 27 studies were reviewed. The findings indicated that there was acceptable inter-observer and intra-observer reliability and validity for different methods of assessing patellar height and the sulcus angle with X-ray, MRI and CT methods, and the tibial tubercle-trochlear groove (TT-TG) assessed using CT. There was poor reliability or validity for the assessment of severity of trochlear dysplasia and the sulcus angle using US. There is insufficient evidence to determine the reliability, validity, sensitivity or specificity of tests such as the congruence angle, lateral patellar displacement, lateral patellar tilt, trochlear depth, boss height, the crossing sign or Wiberg patellar classification. A critical appraisal of the literature identified a number of recurrent methodological limitations. Further study is recommended to evaluate the reliability and validity of these radiological outcomes using well-designed radiological trials. (orig.)

  4. The reliability and validity of radiological assessment for patellar instability. A systematic review and meta-analysis

    International Nuclear Information System (INIS)

    Smith, Toby O.; Davies, Leigh; Toms, Andoni P.; Donell, Simon T.; Hing, Caroline B.

    2011-01-01

    To determine the discriminative validity and reliability of the evidence base using meta-analysis. A review of published sources using the databases AMED, CINHAL, EMBASE, MEDLINE, Scopus and the Cochrane Library, and for unpublished material was conducted. All studies assessing the reliability, validity, sensitivity or specificity of magnetic resonance imaging (MRI), computed tomography (CT) or ultrasound (US) of the patellofemoral joint of patients following patellar dislocation, subluxation or instability, were included. A meta-analysis was performed to assess the difference in radiological measurements between healthy controls and subjects with patellar instability in order to assess discrimination validity. A narrative assessment was used to evaluate the inter- and intra-observer reliability as well as the sensitivity and specificity of specific radiological measurements. A total of 27 studies were reviewed. The findings indicated that there was acceptable inter-observer and intra-observer reliability and validity for different methods of assessing patellar height and the sulcus angle with X-ray, MRI and CT methods, and the tibial tubercle-trochlear groove (TT-TG) assessed using CT. There was poor reliability or validity for the assessment of severity of trochlear dysplasia and the sulcus angle using US. There is insufficient evidence to determine the reliability, validity, sensitivity or specificity of tests such as the congruence angle, lateral patellar displacement, lateral patellar tilt, trochlear depth, boss height, the crossing sign or Wiberg patellar classification. A critical appraisal of the literature identified a number of recurrent methodological limitations. Further study is recommended to evaluate the reliability and validity of these radiological outcomes using well-designed radiological trials. (orig.)

  5. Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale.

    Science.gov (United States)

    Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe

    2018-04-01

    Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results.

  6. Evaluation of the Validity and Reliability of the Waterlow Pressure Ulcer Risk Assessment Scale

    Science.gov (United States)

    Charalambous, Charalambos; Koulori, Agoritsa; Vasilopoulos, Aristidis; Roupa, Zoe

    2018-01-01

    Introduction Prevention is the ideal strategy to tackle the problem of pressure ulcers. Pressure ulcer risk assessment scales are one of the most pivotal measures applied to tackle the problem, much criticisms has been developed regarding the validity and reliability of these scales. Objective To investigate the validity and reliability of the Waterlow pressure ulcer risk assessment scale. Method The methodology used is a narrative literature review, the bibliography was reviewed through Cinahl, Pubmed, EBSCO, Medline and Google scholar, 26 scientific articles where identified. The articles where chosen due to their direct correlation with the objective under study and their scientific relevance. Results The construct and face validity of the Waterlow appears adequate, but with regards to content validity changes in the category age and gender can be beneficial. The concurrent validity cannot be assessed. The predictive validity of the Waterlow is characterized by high specificity and low sensitivity. The inter-rater reliability has been demonstrated to be inadequate, this may be due to lack of clear definitions within the categories and differentiating level of knowledge between the users. Conclusion Due to the limitations presented regarding the validity and reliability of the Waterlow pressure ulcer risk assessment scale, the scale should be used in conjunction with clinical assessment to provide optimum results. PMID:29736104

  7. Validation of the Long- and Short-Form of the Ethical Values Assessment (EVA): A Questionnaire Measuring the Three Ethics Approach to Moral Psychology

    Science.gov (United States)

    Padilla-Walker, Laura Maria; Jensen, Lene Arnett

    2016-01-01

    Moral psychology has been moving toward consideration of multiple kinds of moral concepts and values, such as the Ethics of Autonomy, Community, and Divinity. While these three ethics have commonly been measured qualitatively, the current study sought to validate the long and short forms of the Ethical Values Assessment (EVA), which is a…

  8. Instruments to assess self-care among healthy children: A systematic review of measurement properties.

    Science.gov (United States)

    Urpí-Fernández, Ana-María; Zabaleta-Del-Olmo, Edurne; Montes-Hidalgo, Javier; Tomás-Sábado, Joaquín; Roldán-Merino, Juan-Francisco; Lluch-Canut, María-Teresa

    2017-12-01

    To identify, critically appraise and summarize the measurement properties of instruments to assess self-care in healthy children. Assessing self-care is a proper consideration for nursing practice and nursing research. No systematic review summarizes instruments of measurement validated in healthy children. Psychometric review in accordance with the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) panel. MEDLINE, CINAHL, PsycINFO, Web of Science and Open Grey were searched from their inception to December 2016. Validation studies with a healthy child population were included. Search was not restricted by language. Two reviewers independently assessed the methodological quality of included studies using the COSMIN checklist. Eleven studies were included in the review assessing the measurement properties of ten instruments. There was a maximum of two studies per instrument. None of the studies evaluated the properties of test-retest reliability, measurement error, criterion validity and responsiveness. Internal consistency and structural validity were rated as "excellent" or "good" in four studies. Four studies were rated as "excellent" in content validity. Cross-cultural validity was rated as "poor" in the two studies (three instruments) which cultural adaptation was carried out. The evidence available does not allow firm conclusions about the instruments identified in terms of reliability and validity. Future research should focus on generate evidence about a wider range of measurement properties of these instruments using a rigorous methodology, as well as instrument testing on different countries and child population. © 2017 John Wiley & Sons Ltd.

  9. The local lymph node assay and the assessment of relative potency: status of validation.

    Science.gov (United States)

    Basketter, David A; Gerberick, Frank; Kimber, Ian

    2007-08-01

    For the prediction of skin sensitization potential, the local lymph node assay (LLNA) is a fully validated alternative to guinea-pig tests. More recently, information from LLNA dose-response analyses has been used to assess the relative potency of skin sensitizing chemicals. These data are then deployed for risk assessment and risk management. In this commentary, the utility and validity of these relative potency measurements are reviewed. It is concluded that the LLNA does provide a valuable assessment of relative sensitizing potency in the form of the estimated concentration of a chemical required to produce a threefold stimulation of draining lymph node cell proliferation compared with concurrent controls (EC3 value) and that all reasonable validation requirements have been addressed successfully. EC3 measurements are reproducible in both intra- and interlaboratory evaluations and are stable over time. It has been shown also, by several independent groups, that EC3 values correlate closely with data on relative human skin sensitization potency. Consequently, the recommendation made here is that LLNA EC3 measurements should now be regarded as a validated method for the determination of the relative potency of skin sensitizing chemicals, a conclusion that has already been reached by a number of independent expert groups.

  10. Reliability and Validity of 3 Methods of Assessing Orthopedic Resident Skill in Shoulder Surgery.

    Science.gov (United States)

    Bernard, Johnathan A; Dattilo, Jonathan R; Srikumaran, Uma; Zikria, Bashir A; Jain, Amit; LaPorte, Dawn M

    Traditional measures for evaluating resident surgical technical skills (e.g., case logs) assess operative volume but not level of surgical proficiency. Our goal was to compare the reliability and validity of 3 tools for measuring surgical skill among orthopedic residents when performing 3 open surgical approaches to the shoulder. A total of 23 residents at different stages of their surgical training were tested for technical skill pertaining to 3 shoulder surgical approaches using the following measures: Objective Structured Assessment of Technical Skills (OSATS) checklists, the Global Rating Scale (GRS), and a final pass/fail assessment determined by 3 upper extremity surgeons. Adverse events were recorded. The Cronbach α coefficient was used to assess reliability of the OSATS checklists and GRS scores. Interrater reliability was calculated with intraclass correlation coefficients. Correlations among OSATS checklist scores, GRS scores, and pass/fail assessment were calculated with Spearman ρ. Validity of OSATS checklists was determined using analysis of variance with postgraduate year (PGY) as a between-subjects factor. Significance was set at p shoulder approaches. Checklist scores showed superior interrater reliability compared with GRS and subjective pass/fail measurements. GRS scores were positively correlated across training years. The incidence of adverse events was significantly higher among PGY-1 and PGY-2 residents compared with more experienced residents. OSATS checklists are a valid and reliable assessment of technical skills across 3 surgical shoulder approaches. However, checklist scores do not measure quality of technique. Documenting adverse events is necessary to assess quality of technique and ultimate pass/fail status. Multiple methods of assessing surgical skill should be considered when evaluating orthopedic resident surgical performance. Copyright © 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights

  11. Assessing Knowledge Sharing Among Academics: A Validation of the Knowledge Sharing Behavior Scale (KSBS).

    Science.gov (United States)

    Ramayah, T; Yeap, Jasmine A L; Ignatius, Joshua

    2014-04-01

    There is a belief that academics tend to hold on tightly to their knowledge and intellectual resources. However, not much effort has been put into the creation of a valid and reliable instrument to measure knowledge sharing behavior among the academics. To apply and validate the Knowledge Sharing Behavior Scale (KSBS) as a measure of knowledge sharing behavior within the academic community. Respondents (N = 447) were academics from arts and science streams in 10 local, public universities in Malaysia. Data were collected using the 28-item KSBS that assessed four dimensions of knowledge sharing behavior namely written contributions, organizational communications, personal interactions, and communities of practice. The exploratory factor analysis showed that the items loaded on the dimension constructs that they were supposed to represent, thus proving construct validity. A within-factor analysis revealed that each set of items representing their intended dimension loaded on only one construct, therefore establishing convergent validity. All four dimensions were not perfectly correlated with each other or organizational citizenship behavior, thereby proving discriminant validity. However, all four dimensions correlated with organizational commitment, thus confirming predictive validity. Furthermore, all four factors correlated with both tacit and explicit sharing, which confirmed their concurrent validity. All measures also possessed sufficient reliability (α > .70). The KSBS is a valid and reliable instrument that can be used to formally assess the types of knowledge artifacts residing among academics and the degree of knowledge sharing in relation to those artifacts. © The Author(s) 2014.

  12. Development of a reliable, valid measure to assess parents' and teachers' understanding of postural care for children with physical disabilities: the (UKC PostCarD) questionnaire.

    Science.gov (United States)

    Hotham, S; Hutton, E; Hamilton-West, K E

    2015-11-01

    Previous research has highlighted lack of knowledge, understanding and confidence among parents and teachers responsible for the postural care of children with physical disability. Interventions designed to improve these qualities require a reliable and validated tool to assess pre- and post-intervention levels. Currently, however, no validated measure of postural care confidence (i.e. self-efficacy) exists. Hence, the aim of this research was to develop a reliable and valid questionnaire to assess parents' and teachers' confidence, alongside knowledge and understanding of postural care - the Understanding Knowledge and Confidence in providing POSTural CARe for children with Disabilities (UKC PostCarD) questionnaire. Items were developed by a multidisciplinary team and designed to map onto the content of 'An A-to-Z of Postural Care'. Parents, teachers and therapists assessed items for face validity. Scale reliability was then assessed using Cronbach's alpha and known-group validity was assessed by comparing scores of an 'expert' group (physiotherapists and occupational therapists) with those of a 'non-expert' group (with no formal training in postural care). The total scale and all three subscales (understanding and knowledge, confidence and concerns) demonstrated adequate reliability (α > 0.83) and subscale correlations formed a logical pattern (understanding and knowledge correlated positively with confidence and negatively with concerns). Experts' (n = 111) scores were higher than non-experts' (n = 79) for the total scale and all subscales (P children with disabilities. © 2015 John Wiley & Sons Ltd.

  13. Identification of validated questionnaires to measure adherence to pharmacological antihypertensive treatments

    Directory of Open Access Journals (Sweden)

    Pérez-Escamilla B

    2015-04-01

    Full Text Available Beatriz Pérez-Escamilla,1 Lucía Franco-Trigo,1 Joanna C Moullin,2 Fernando Martínez-Martínez,1 José P García-Corpas1 1Academic Centre in Pharmaceutical Care, Faculty of Pharmacy, University of Granada, Granada, Spain; 2Graduate School of Health, Faculty of Pharmacy, University of Technology Sydney, Sydney, NSW, Australia Background: Low adherence to pharmacological treatments is one of the factors associated with poor blood pressure control. Questionnaires are an indirect measurement method that is both economic and easy to use. However, questionnaires should meet specific criteria, to minimize error and ensure reproducibility of results. Numerous studies have been conducted to design questionnaires that quantify adherence to pharmacological antihypertensive treatments. Nevertheless, it is unknown whether questionnaires fulfil the minimum requirements of validity and reliability. The aim of this study was to compile validated questionnaires measuring adherence to pharmacological antihypertensive treatments that had at least one measure of validity and one measure of reliability. Methods: A literature search was undertaken in PubMed, the Excerpta Medica Database (EMBASE, and the Latin American and Caribbean Health Sciences Literature database (Literatura Latino-Americana e do Caribe em Ciências da Saúde [LILACS]. References from included articles were hand-searched. The included papers were all that were published in English, French, Portuguese, and Spanish from the beginning of the database’s indexing until July 8, 2013, where a validation of a questionnaire (at least one demonstration of the validity and at least one of reliability was performed to measure adherence to antihypertensive pharmacological treatments. Results: A total of 234 potential papers were identified in the electronic database search; of these, 12 met the eligibility criteria. Within these 12 papers, six questionnaires were validated: the Morisky

  14. Many quality measurements, but few quality measures assessing the quality of breast cancer care in women: a systematic review.

    Science.gov (United States)

    Schachter, Howard M; Mamaladze, Vasil; Lewin, Gabriela; Graham, Ian D; Brouwers, Melissa; Sampson, Margaret; Morrison, Andra; Zhang, Li; O'Blenis, Peter; Garritty, Chantelle

    2006-12-18

    Breast cancer in women is increasingly frequent, and care is complex, onerous and expensive, all of which lend urgency to improvements in care. Quality measurement is essential to monitor effectiveness and to guide improvements in healthcare. Ten databases, including Medline, were searched electronically to identify measures assessing the quality of breast cancer care in women (diagnosis, treatment, followup, documentation of care). Eligible studies measured adherence to standards of breast cancer care in women diagnosed with, or in treatment for, any histological type of adenocarcinoma of the breast. Reference lists of studies, review articles, web sites, and files of experts were searched manually. Evidence appraisal entailed dual independent assessments of data (e.g., indicators used in quality measurement). The extent of each quality indicator's scientific validation as a measure was assessed. The American Society of Clinical Oncology (ASCO) was asked to contribute quality measures under development. Sixty relevant reports identified 58 studies with 143 indicators assessing adherence to quality breast cancer care. A paucity of validated indicators (n = 12), most of which assessed quality of life, only permitted a qualitative data synthesis. Most quality indicators evaluated processes of care. While some studies revealed patterns of under-use of care, all adherence data require confirmation using validated quality measures. ASCO's current development of a set of quality measures relating to breast cancer care may hold the key to conducting definitive studies.

  15. Development and validation of the Pediatric Anesthesia Behavior score--an objective measure of behavior during induction of anesthesia.

    Science.gov (United States)

    Beringer, Richard M; Greenwood, Rosemary; Kilpatrick, Nicky

    2014-02-01

    Measuring perioperative behavior changes requires validated objective rating scales. We developed a simple score for children's behavior during induction of anesthesia (Pediatric Anesthesia Behavior score) and assessed its reliability, concurrent validity, and predictive validity. Data were collected as part of a wider observational study of perioperative behavior changes in children undergoing general anesthesia for elective dental extractions. One-hundred and two healthy children aged 2-12 were recruited. Previously validated behavioral scales were used as follows: the modified Yale Preoperative Anxiety Scale (m-YPAS); the induction compliance checklist (ICC); the Pediatric Anesthesia Emergence Delirium scale (PAED); and the Post-Hospitalization Behavior Questionnaire (PHBQ). Pediatric Anesthesia Behavior (PAB) score was independently measured by two investigators, to allow assessment of interobserver reliability. Concurrent validity was assessed by examining the correlation between the PAB score, the m-YPAS, and the ICC. Predictive validity was assessed by examining the association between the PAB score, the PAED scale, and the PHBQ. The PAB score correlated strongly with both the m-YPAS (P risk of developing postoperative behavioral disturbance. This study provides evidence for its reliability and validity. © 2013 John Wiley & Sons Ltd.

  16. Infusion phlebitis assessment measures: a systematic review.

    Science.gov (United States)

    Ray-Barruel, Gillian; Polit, Denise F; Murfield, Jenny E; Rickard, Claire M

    2014-04-01

    Phlebitis is a common and painful complication of peripheral intravenous cannulation. The aim of this review was to identify the measures used in infusion phlebitis assessment and evaluate evidence regarding their reliability, validity, responsiveness and feasibility. We conducted a systematic literature review of the Cochrane library, Ovid MEDLINE and EBSCO CINAHL until September 2013. All English-language studies (randomized controlled trials, prospective cohort and cross-sectional) that used an infusion phlebitis scale were retrieved and analysed to determine which symptoms were included in each scale and how these were measured. We evaluated studies that reported testing the psychometric properties of phlebitis assessment scales using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) guidelines. Infusion phlebitis was the primary outcome measure in 233 studies. Fifty-three (23%) of these provided no actual definition of phlebitis. Of the 180 studies that reported measuring phlebitis incidence and/or severity, 101 (56%) used a scale and 79 (44%) used a definition alone. We identified 71 different phlebitis assessment scales. Three scales had undergone some psychometric analyses, but no scale had been rigorously tested. Many phlebitis scales exist, but none has been thoroughly validated for use in clinical practice. A lack of consensus on phlebitis measures has likely contributed to disparities in reported phlebitis incidence, precluding meaningful comparison of phlebitis rates. © 2014 The Authors. Journal of Evaluation in Clinical Practice published by John Wiley & Sons, Ltd.

  17. Assessing Irrational Beliefs and Emotional Distress: Evidence and Implications of Limited Discriminant Validity.

    Science.gov (United States)

    Zurawski, Raymond M.; Smith, Timothy W.

    1987-01-01

    Examined the disciminant validity of measures of irrational beliefs. The Irrational Beliefs Test and the Rational Behavior Inventory were highly correlated but were equally highly correlated with self-report measures of depression and anxiety. Thus, rather than assessing beliefs correlated with emotional distress, the measures may actually assess…

  18. Assessing communication skills in dietetic consultations: the development of the reliable and valid DIET-COMMS tool.

    Science.gov (United States)

    Whitehead, K A; Langley-Evans, S C; Tischler, V A; Swift, J A

    2014-04-01

    There is an increasing emphasis on the development of communication skills for dietitians but few evidence-based assessment tools available. The present study aimed to develop a dietetic-specific, short, reliable and valid assessment tool for measuring communication skills in patient consultations: DIET-COMMS. A literature review and feedback from 15 qualified dietitians were used to establish face and content validity during the development of DIET-COMMS. In total, 113 dietetic students and qualified dietitians were video-recorded undertaking mock consultations, assessed using DIET-COMMS by the lead author, and used to establish intra-rater reliability, as well as construct and predictive validity. Twenty recorded consultations were reassessed by nine qualified dietitians to assess inter-rater reliability: eight of these assessors were interviewed to determine user evaluation. Significant improvements in DIET-COMMS scores were achieved as students and qualified staff progressed through their training and gained experience, demonstrating construct validity, and also by qualified staff attending a training course, indicating predictive validity (P skills in practice was questioned. DIET-COMMS is a short, user-friendly, reliable and valid tool for measuring communication skills in patient consultations with both pre- and post-registration dietitians. Additional work is required to develop a training package for assessors and to identify how DIET-COMMS assessment can acceptably be incorporated into practice. © 2013 The British Dietetic Association Ltd.

  19. Reliability and Validity Assessment of a Linear Position Transducer

    Science.gov (United States)

    Garnacho-Castaño, Manuel V.; López-Lastra, Silvia; Maté-Muñoz, José L.

    2015-01-01

    The objectives of the study were to determine the validity and reliability of peak velocity (PV), average velocity (AV), peak power (PP) and average power (AP) measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain) during two resistance exercises, bench press (BP) and full back squat (BS), performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2). Intraclass correlation coefficients (ICCs) indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W). Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W). Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP) make this device a useful tool for monitoring resistance training. Key points This study determined the validity and reliability of peak velocity, average velocity, peak power and average power measurements made using a linear position transducer The Tendo Weight-lifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and power. PMID:25729300

  20. Development and Validation of Instruments to Measure Learning of Expert-Like Thinking

    Science.gov (United States)

    Adams, Wendy K.; Wieman, Carl E.

    2011-06-01

    This paper describes the process for creating and validating an assessment test that measures the effectiveness of instruction by probing how well that instruction causes students in a class to think like experts about specific areas of science. The design principles and process are laid out and it is shown how these align with professional standards that have been established for educational and psychological testing and the elements of assessment called for in a recent National Research Council study on assessment. The importance of student interviews for creating and validating the test is emphasized, and the appropriate interview procedures are presented. The relevance and use of standard psychometric statistical tests are discussed. Additionally, techniques for effective test administration are presented.

  1. Developing and validating an instrument for measuring mobile computing self-efficacy.

    Science.gov (United States)

    Wang, Yi-Shun; Wang, Hsiu-Yuan

    2008-08-01

    IT-related self-efficacy has been found to have a critical influence on system use. However, traditional measures of computer self-efficacy and Internet-related self-efficacy are perceived to be inapplicable in the context of mobile computing and commerce because they are targeted primarily at either desktop computer or wire-based technology contexts. Based on previous research, this study develops and validates a multidimensional instrument for measuring mobile computing self-efficacy (MCSE). This empirically validated instrument will be useful to researchers in developing and testing the theories of mobile user behavior, and to practitioners in assessing the mobile computing self-efficacy of users and promoting the use of mobile commerce systems.

  2. Validity of soft-tissue thickness of calf measured using MRI for assessing unilateral lower extremity lymphoedema secondary to cervical and endometrial cancer treatments

    International Nuclear Information System (INIS)

    Lu, Qing; Li, Yulai; Chen, Tian-Wu; Yao, Yuan; Zhao, Zizhou; Li, Yang; Xu, Jianrong; Jiang, Zhaohua; Hu, Jiani

    2014-01-01

    Aim: To determine whether soft-tissue thickness of the calf measured using MRI could be valid for assessing unilateral lower extremity lymphoedema (LEL) secondary to cervical and endometrial cancer treatments. Materials and methods: Seventy women with unilateral LEL and 25 without LEL after cervical or endometrial cancer treatments underwent MRI examinations of their calves. Total thickness of soft-tissue (TT), muscle thickness (MT), and subcutaneous tissue thickness (STT) of the calf, and the difference between the affected and contralateral unaffected calf regarding TT (DTT), MT (DMT), and STT (DSTT) were obtained using fat-suppressed T2-weighted imaging in the middle of the calves. The volume of the calf and difference in volume (DV) between calves were obtained by the method of water displacement. Statistical analysis was performed to determine the validity of MRI measurements by volume measurements in staging LEL. Results: There was a close correlation between volume and TT for the affected (r = 0.927) or unaffected calves (r = 0.896). STT of the affected calf, and DTT or DSTT of the calves were closely correlated with volume of the affected calf or DV of the calves (all p < 0.05). Multivariate analysis showed significant differences in TT, STT, volume of the affected calf, DTT, DSTT, and DV between stages except in volume of the affected calf or in DV between stage 0 and 1. For staging LEL, DSTT showed the best discrimination ability among all the parameters. Conclusions: Soft-tissue thickness of the calf measured at MRI could be valid for quantitatively staging unilateral LEL, and DSTT of the calves could be the best classifying factor. - Highlights: • The soft tissue thickness of calves on MRI could quantitatively assess secondary LEL. • Calf soft tissue thickness indicated concurrent or construct validity of calf volume. • The difference of subcutaneous tissue thickness of calves could be used to stage LEL

  3. Assessing Cognitive Performance in Badminton Players: A Reproducibility and Validity Study

    Directory of Open Access Journals (Sweden)

    van de Water Tanja

    2017-01-01

    Full Text Available Fast reaction and good inhibitory control are associated with elite sports performance. To evaluate the reproducibility and validity of a newly developed Badminton Reaction Inhibition Test (BRIT, fifteen elite (25 ± 4 years and nine non-elite (24 ± 4 years Dutch male badminton players participated in the study. The BRIT measured four components: domain-general reaction time, badminton-specific reaction time, domain-general inhibitory control and badminton-specific inhibitory control. Five participants were retested within three weeks on the badminton-specific components. Reproducibility was acceptable for badminton-specific reaction time (ICC = 0.626, CV = 6% and for badminton-specific inhibitory control (ICC = 0.317, CV = 13%. Good construct validity was shown for badminton-specific reaction time discriminating between elite and non-elite players (F = 6.650, p 0.05. Concurrent validity for domain-general reaction time was good, as it was associated with a national ranking for elite (p = 0.70, p 0.05. In conclusion, reproducibility and validity of inhibitory control assessment was not confirmed, however, the BRIT appears a reproducible and valid measure of reaction time in badminton players. Reaction time measured with the BRIT may provide input for training programs aiming to improve badminton players’ performance.

  4. Assessing Cognitive Performance in Badminton Players: A Reproducibility and Validity Study.

    Science.gov (United States)

    van de Water, Tanja; Huijgen, Barbara; Faber, Irene; Elferink-Gemser, Marije

    2017-01-01

    Fast reaction and good inhibitory control are associated with elite sports performance. To evaluate the reproducibility and validity of a newly developed Badminton Reaction Inhibition Test (BRIT), fifteen elite (25 ± 4 years) and nine non-elite (24 ± 4 years) Dutch male badminton players participated in the study. The BRIT measured four components: domain-general reaction time, badminton-specific reaction time, domain-general inhibitory control and badminton-specific inhibitory control. Five participants were retested within three weeks on the badminton-specific components. Reproducibility was acceptable for badminton-specific reaction time (ICC = 0.626, CV = 6%) and for badminton-specific inhibitory control (ICC = 0.317, CV = 13%). Good construct validity was shown for badminton-specific reaction time discriminating between elite and non-elite players (F = 6.650, p 0.05). Concurrent validity for domain-general reaction time was good, as it was associated with a national ranking for elite (p = 0.70, p badminton-specific reaction time, nor both components of inhibitory control (p > 0.05). In conclusion, reproducibility and validity of inhibitory control assessment was not confirmed, however, the BRIT appears a reproducible and valid measure of reaction time in badminton players. Reaction time measured with the BRIT may provide input for training programs aiming to improve badminton players' performance.

  5. Reliability and Validity of a Newly Developed Measure of Citizenship Among Persons with Mental Illnesses.

    Science.gov (United States)

    O'Connell, Maria J; Clayton, Ashley; Rowe, Michael

    2017-04-01

    Following development of a 46-item of measure citizenship, a framework for supporting the full membership in society of persons with mental illness, this study tested the measure's reliability and validity. 110 persons from a mental health center completed a questionnaire packet containing the citizenship measure and other measures to assess internal consistency and validity of the citizenship instrument. Correlation matrices were examined for associations between the citizenship instrument and other measures. Stepwise regression examines demographic factors, sense of community, and social capital as predictors of citizenship, recovery, and well-being. Analyses revealed that the measure is psychometrically sound. The measure captures subjective information about the degree to which individuals experience rights, sense of belonging, and other factors associated with community membership that have been previously difficult to assess. The measure establishes a platform for interventions to support the full participation in society of persons with mental illnesses.

  6. On the improvement of IT process maturity: assessment, recommendation and validation

    Directory of Open Access Journals (Sweden)

    Dirgahayu Teduh

    2018-01-01

    Full Text Available The use of information technology (IT in enterprises must be governed and managed appropriately using IT processes. The notion of IT process maturity is useful to measure the actual performance and to define the desired performance of IT processes. Improvements are necessary when there are gaps between the actual and desired performance. Most literatures focus on IT process maturity assessment. They do not address how to improve IT process maturity. This paper proposes an approach to enterprise IT process maturity improvement for COBIT processes. The approach consists of three activities, i.e. IT process maturity assessment, recommendation, and validation. Assessment is to recognise the process’ control objectives maturity. From the assessment results, recommendation identifies control objectives that must be improved and then suggests improvement actions. The prescriptive nature of the control objectives facilitates in suggesting those actions. Recommendations for managements are defined by abstracting similar actions. Validation checks whether the recommendations match with the enterprise needs and capability. It includes a scale for validation, in which enterprise’s capability is categorized into (i not capable, (ii capable with great efforts, and (iii fully capable. The paper illustrates the approach with a case study.

  7. What are validated self-report adherence scales really measuring?: a systematic review.

    Science.gov (United States)

    Nguyen, Thi-My-Uyen; La Caze, Adam; Cottrell, Neil

    2014-03-01

    Medication non-adherence is a significant health problem. There are numerous methods for measuring adherence, but no single method performs well on all criteria. The purpose of this systematic review is to (i) identify self-report medication adherence scales that have been correlated with comparison measures of medication-taking behaviour, (ii) assess how these scales measure adherence and (iii) explore how these adherence scales have been validated. Cinahl and PubMed databases were used to search articles written in English on the development or validation of medication adherence scales dating to August 2012. The search terms used were medication adherence, medication non-adherence, medication compliance and names of each scale. Data such as barriers identified and validation comparison measures were extracted and compared. Sixty articles were included in the review, which consisted of 43 adherence scales. Adherence scales include items that either elicit information regarding the patient's medication-taking behaviour and/or attempts to identify barriers to good medication-taking behaviour or beliefs associated with adherence. The validation strategies employed depended on whether the focus of the scale was to measure medication-taking behaviour or identify barriers or beliefs. Supporting patients to be adherent requires information on their medication-taking behaviour, barriers to adherence and beliefs about medicines. Adherence scales have the potential to explore these aspects of adherence, but currently there has been a greater focus on measuring medication-taking behaviour. Selecting the 'right' adherence scale(s) requires consideration of what needs to be measured and how (and in whom) the scale has been validated. © 2013 The British Pharmacological Society.

  8. Validation of Patient-Reported Outcomes Measurement Information System (PROMIS) computerized adaptive tests in cervical spine surgery.

    Science.gov (United States)

    Boody, Barrett S; Bhatt, Surabhi; Mazmudar, Aditya S; Hsu, Wellington K; Rothrock, Nan E; Patel, Alpesh A

    2018-03-01

    OBJECTIVE The Patient-Reported Outcomes Measurement Information System (PROMIS), which is funded by the National Institutes of Health, is a set of adaptive, responsive assessment tools that measures patient-reported health status. PROMIS measures have not been validated for surgical patients with cervical spine disorders. The objective of this project is to evaluate the validity (e.g., convergent validity, known-groups validity, responsiveness to change) of PROMIS computer adaptive tests (CATs) for pain behavior, pain interference, and physical function in patients undergoing cervical spine surgery. METHODS The legacy outcome measures Neck Disability Index (NDI) and SF-12 were used as comparisons with PROMIS measures. PROMIS CATs, NDI-10, and SF-12 measures were administered prospectively to 59 consecutive tertiary hospital patients who were treated surgically for degenerative cervical spine disorders. A subscore of NDI-5 was calculated from NDI-10 by eliminating the lifting, headaches, pain intensity, reading, and driving sections and multiplying the final score by 4. Assessments were administered preoperatively (baseline) and postoperatively at 6 weeks and 3 months. Patients presenting for revision surgery, tumor, infection, or trauma were excluded. Participants completed the measures in Assessment Center, an online data collection tool accessed by using a secure login and password on a tablet computer. Subgroup analysis was also performed based on a primary diagnosis of either cervical radiculopathy or cervical myelopathy. RESULTS Convergent validity for PROMIS CATs was supported with multiple statistically significant correlations with the existing legacy measures, NDI and SF-12, at baseline. Furthermore, PROMIS CATs demonstrated known-group validity and identified clinically significant improvements in all measures after surgical intervention. In the cervical radiculopathy and myelopathic cohorts, the PROMIS measures demonstrated similar responsiveness to the

  9. Construction and Validation of a Scale to Measure Maslow's Concept of Self-Actualization

    Science.gov (United States)

    Jones, Kenneth Melvin; Randolph, Daniel Lee

    1978-01-01

    Designed to measure self-actualization as defined by Abraham Maslow, the Jones Self Actualizing Scale, as assessed in this study, possesses content validity, reliability, and a number of other positive characteristics. (JC)

  10. Neighborhood walkability: field validation of geographic information system measures.

    Science.gov (United States)

    Hajna, Samantha; Dasgupta, Kaberi; Halparin, Max; Ross, Nancy A

    2013-06-01

    Given the health benefits of walking, there is interest in understanding how physical environments favor walking. Although GIS-derived measures of land-use mix, street connectivity, and residential density are commonly combined into indices to assess how conducive neighborhoods are to walking, field validation of these measures is limited. To assess the relationship between audit- and GIS-derived measures of overall neighborhood walkability and between objective (audit- and GIS-derived) and participant-reported measures of walkability. Walkability assessments were conducted in 2009. Street-level audits were conducted using a modified version of the Pedestrian Environmental Data Scan. GIS analyses were used to derive land-use mix, street connectivity, and residential density. Participant perceptions were assessed using a self-administered questionnaire. Audit, GIS, and participant-reported indices of walkability were calculated. Spearman correlation coefficients were used to assess the relationships between measures. All analyses were conducted in 2012. The correlation between audit- and GIS-derived measures of overall walkability was high (R=0.7 [95% CI=0.6, 0.8]); the correlations between objective (audit and GIS-derived) and participant-reported measures were low (R=0.2 [95% CI=0.06, 0.3]; R=0.2 [95% CI=0.04, 0.3], respectively). For comparable audit and participant-reported items, correlations were higher for items that appeared more objective (e.g., sidewalk presence, R=0.4 [95% CI=0.3, 0.5], versus safety, R=0.1 [95% CI=0.003, 0.3]). The GIS-derived measure of walkability correlated well with the in-field audit, suggesting that it is reasonable to use GIS-derived measures in place of more labor-intensive audits. Interestingly, neither audit- nor GIS-derived measures correlated well with participants' perceptions of walkability. Copyright © 2013 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.

  11. Validating Remotely Sensed Land Surface Evapotranspiration Based on Multi-scale Field Measurements

    Science.gov (United States)

    Jia, Z.; Liu, S.; Ziwei, X.; Liang, S.

    2012-12-01

    The land surface evapotranspiration plays an important role in the surface energy balance and the water cycle. There have been significant technical and theoretical advances in our knowledge of evapotranspiration over the past two decades. Acquisition of the temporally and spatially continuous distribution of evapotranspiration using remote sensing technology has attracted the widespread attention of researchers and managers. However, remote sensing technology still has many uncertainties coming from model mechanism, model inputs, parameterization schemes, and scaling issue in the regional estimation. Achieving remotely sensed evapotranspiration (RS_ET) with confident certainty is required but difficult. As a result, it is indispensable to develop the validation methods to quantitatively assess the accuracy and error sources of the regional RS_ET estimations. This study proposes an innovative validation method based on multi-scale evapotranspiration acquired from field measurements, with the validation results including the accuracy assessment, error source analysis, and uncertainty analysis of the validation process. It is a potentially useful approach to evaluate the accuracy and analyze the spatio-temporal properties of RS_ET at both the basin and local scales, and is appropriate to validate RS_ET in diverse resolutions at different time-scales. An independent RS_ET validation using this method was presented over the Hai River Basin, China in 2002-2009 as a case study. Validation at the basin scale showed good agreements between the 1 km annual RS_ET and the validation data such as the water balanced evapotranspiration, MODIS evapotranspiration products, precipitation, and landuse types. Validation at the local scale also had good results for monthly, daily RS_ET at 30 m and 1 km resolutions, comparing to the multi-scale evapotranspiration measurements from the EC and LAS, respectively, with the footprint model over three typical landscapes. Although some

  12. Life-Space Assessment questionnaire: Novel measurement properties for Brazilian community-dwelling older adults.

    Science.gov (United States)

    Simões, Maria do Socorro Mp; Garcia, Isabel Ff; Costa, Lucíola da Cm; Lunardi, Adriana C

    2018-05-01

    The Life-Space Assessment (LSA) assesses mobility from the spaces that older adults go, and how often and how independent they move. Despite its increased use, LSA measurement properties remain unclear. The aim of the present study was to analyze the content validity, reliability, construct validity and interpretability of the LSA for Brazilian community-dwelling older adults. In this clinimetric study we analyzed the measurement properties (content validity, reliability, construct validity and interpretability) of the LSA administered to 80 Brazilian community-dwelling older adults. Reliability was analyzed by Cronbach's alpha (internal consistency), intraclass correlation coefficients and 95% confidence interval (reproducibility), and standard error of measurement (measurement error). Construct validity was analyzed by Pearson's correlations between the LSA and accelerometry (time in inactivity and moderate-to-vigorous activities), and interpretability was analyzed by determination of the minimal detectable change, and floor and ceiling effects. The LSA met the criteria for content validity. The Cronbach's alpha was 0.92, intraclass correlation coefficient was 0.97 (95% confidence interval 0.95-0.98) and standard error of measurement was 4.12. The LSA showed convergence with accelerometry (negative correlation with time in inactivity and positive correlation with time in moderate to vigorous activities), the minimal detectable change was 0.36 and we observed no floor or ceiling effects. The LSA showed adequate reliability, validity and interpretability for life-space mobility assessment of Brazilian community-dwelling older adults. Geriatr Gerontol Int 2018; 18: 783-789. © 2018 Japan Geriatrics Society.

  13. Validity and inter-observer reliability of subjective hand-arm vibration assessments

    NARCIS (Netherlands)

    Coenen, P.; Formanoy, M.; Douwes, M.; Bosch, T.; Kraker, H. de

    2014-01-01

    Exposure to mechanical vibrations at work (e.g., due to handling powered tools) is a potential occupational risk as it may cause upper extremity complaints. However, reliable and valid assessment methods for vibration exposure at work are lacking. Measuring hand-arm vibration objectively is often

  14. Is the Scale for Measuring Motivational Interviewing Skills a valid and reliable instrument for measuring the primary care professionals motivational skills?: EVEM study protocol.

    Science.gov (United States)

    Pérula, Luis Á; Campiñez, Manuel; Bosch, Josep M; Barragán Brun, Nieves; Arboniés, Juan C; Bóveda Fontán, Julia; Martín Alvarez, Remedios; Prados, Jose A; Martín-Rioboó, Enrique; Massons, Josep; Criado, Margarita; Fernández, José Á; Parras, Juan M; Ruiz-Moral, Roger; Novo, Jesús M

    2012-11-22

    Lifestyle is one of the main determinants of people's health. It is essential to find the most effective prevention strategies to be used to encourage behavioral changes in their patients. Many theories are available that explain change or adherence to specific health behaviors in subjects. In this sense the named Motivational Interviewing has increasingly gained relevance. Few well-validated instruments are available for measuring doctors' communication skills, and more specifically the Motivational Interviewing. The hypothesis of this study is that the Scale for Measuring Motivational Interviewing Skills (EVEM questionnaire) is a valid and reliable instrument for measuring the primary care professionals skills to get behavior change in patients. To test the hypothesis we have designed a prospective, observational, multi-center study to validate a measuring instrument. - Thirty-two primary care centers in Spain. -Sampling and Size: a) face and consensual validity: A group composed of 15 experts in Motivational Interviewing. b) Assessment of the psychometric properties of the scale; 50 physician- patient encounters will be videoed; a total of 162 interviews will be conducted with six standardized patients, and another 200 interviews will be conducted with 50 real patients (n=362). Four physicians will be specially trained to assess 30 interviews randomly selected to test the scale reproducibility. -Measurements for to test the hypothesis: a) Face validity: development of a draft questionnaire based on a theoretical model, by using Delphi-type methodology with experts. b) Scale psychometric properties: intraobservers will evaluate video recorded interviews: content-scalability validity (Exploratory Factor Analysis), internal consistency (Cronbach alpha), intra-/inter-observer reliability (Kappa index, intraclass correlation coefficient, Bland & Altman methodology), generalizability, construct validity and sensitivity to change (Pearson product-moment correlation

  15. Is the Scale for Measuring Motivational Interviewing Skills a valid and reliable instrument for measuring the primary care professionals motivational skills?: EVEM study protocol

    Directory of Open Access Journals (Sweden)

    Pérula Luis Á

    2012-11-01

    Full Text Available Abstract Background Lifestyle is one of the main determinants of people’s health. It is essential to find the most effective prevention strategies to be used to encourage behavioral changes in their patients. Many theories are available that explain change or adherence to specific health behaviors in subjects. In this sense the named Motivational Interviewing has increasingly gained relevance. Few well-validated instruments are available for measuring doctors’ communication skills, and more specifically the Motivational Interviewing. Methods/Design The hypothesis of this study is that the Scale for Measuring Motivational Interviewing Skills (EVEM questionnaire is a valid and reliable instrument for measuring the primary care professionals skills to get behavior change in patients. To test the hypothesis we have designed a prospective, observational, multi-center study to validate a measuring instrument. –Scope: Thirty-two primary care centers in Spain. -Sampling and Size: a face and consensual validity: A group composed of 15 experts in Motivational Interviewing. b Assessment of the psychometric properties of the scale; 50 physician- patient encounters will be videoed; a total of 162 interviews will be conducted with six standardized patients, and another 200 interviews will be conducted with 50 real patients (n=362. Four physicians will be specially trained to assess 30 interviews randomly selected to test the scale reproducibility. -Measurements for to test the hypothesis: a Face validity: development of a draft questionnaire based on a theoretical model, by using Delphi-type methodology with experts. b Scale psychometric properties: intraobservers will evaluate video recorded interviews: content-scalability validity (Exploratory Factor Analysis, internal consistency (Cronbach alpha, intra-/inter-observer reliability (Kappa index, intraclass correlation coefficient, Bland & Altman methodology, generalizability, construct validity and

  16. Validity of a Measure of Assertiveness

    Science.gov (United States)

    Galassi, John P.; Galassi, Merna D.

    1974-01-01

    This study was concerned with further validation of a measure of assertiveness. Concurrent validity was established for the College Self-Expression Scale using the method of contrasted groups and through correlations of self-and judges' ratings of assertiveness. (Author)

  17. Assessment of patient empowerment - a systematic review of measures

    NARCIS (Netherlands)

    Barr, P.J.; Scholl, I.; Bravo, P.; Faber, M.J.; Elwyn, G.; Mcallister, M.

    2015-01-01

    BACKGROUND: Patient empowerment has gained considerable importance but uncertainty remains about the best way to define and measure it. The validity of empirical findings depends on the quality of measures used. This systematic review aims to provide an overview of studies assessing psychometric

  18. Validation of jump squats as a practical measure of post-activation potentiation.

    Science.gov (United States)

    Nibali, Maria L; Chapman, Dale W; Robergs, Robert A; Drinkwater, Eric J

    2013-03-01

    To determine if post-activation potentiation (PAP) can augment sports performance, it is pertinent that researchers be confident that any enhancement in performance is attributable to the PAP phenomenon. However, obtaining mechanistic measures of PAP in the daily training environment of highly trained athletes is impractical. We sought to validate jump squats as a practical measure with ecological validity to sports performance against a mechanistic measure of PAP. We assessed the evoked muscle twitch properties of the knee extensors and jump squat kinetics of 8 physically trained males in response to a 5-repetition-maximum back squat conditioning stimulus (CS). Evoked muscle twitch, followed by 3 jump squats, was assessed before and at 4, 8, and 12 min post CS. Time intervals were assessed on separate occasions using a Latin square design. Linear regression was used to determine the relationship between post-pre changes in kinetic variables and muscle twitch peak force (Ft) and twitch rate of force development (RFDt). Large correlations were observed for both concentric relative and absolute mean power and Ft (r = 0.50 ± 0.30) and RFDt (r = 0.56 ± 0.27 and r = 0.58 ± 0.26). Concentric rate of force development (RFD) showed moderate correlations with Ft (r = 0.45 ± 0.33) and RFDt (r = 0.49 ± 0.32). Small-to-moderate correlations were observed for a number of kinetic variables (r = -0.42-0.43 ± 0.32-0.38). Jump squat concentric mean power and RFD are valid ecological measures of muscle potentiation, capable of detecting changes in athletic performance in response to the PAP phenomenon.

  19. Reliability and Validity Evidence of Multiple Balance Assessments in Athletes With a Concussion

    Science.gov (United States)

    Murray, Nicholas; Salvatore, Anthony; Powell, Douglas; Reed-Jones, Rebecca

    2014-01-01

    Context: An estimated 300 000 sport-related concussion injuries occur in the United States annually. Approximately 30% of individuals with concussions experience balance disturbances. Common methods of balance assessment include the Clinical Test of Sensory Organization and Balance (CTSIB), the Sensory Organization Test (SOT), the Balance Error Scoring System (BESS), and the Romberg test; however, the National Collegiate Athletic Association recommended the Wii Fit as an alternative measure of balance in athletes with a concussion. A central concern regarding the implementation of the Wii Fit is whether it is reliable and valid for measuring balance disturbance in athletes with concussion. Objective: To examine the reliability and validity evidence for the CTSIB, SOT, BESS, Romberg test, and Wii Fit for detecting balance disturbance in athletes with a concussion. Data Sources: Literature considered for review included publications with reliability and validity data for the assessments of balance (CTSIB, SOT, BESS, Romberg test, and Wii Fit) from PubMed, PsycINFO, and CINAHL. Data Extraction: We identified 63 relevant articles for consideration in the review. Of the 63 articles, 28 were considered appropriate for inclusion and 35 were excluded. Data Synthesis: No current reliability or validity information supports the use of the CTSIB, SOT, Romberg test, or Wii Fit for balance assessment in athletes with a concussion. The BESS demonstrated moderate to high reliability (interclass correlation coefficient = 0.87) and low to moderate validity (sensitivity = 34%, specificity = 87%). However, the Romberg test and Wii Fit have been shown to be reliable tools in the assessment of balance in Parkinson patients. Conclusions: The BESS can evaluate balance problems after a concussion. However, it lacks the ability to detect balance problems after the third day of recovery. Further investigation is needed to establish the use of the CTSIB, SOT, Romberg test, and Wii Fit for

  20. Reliability and validity evidence of multiple balance assessments in athletes with a concussion.

    Science.gov (United States)

    Murray, Nicholas; Salvatore, Anthony; Powell, Douglas; Reed-Jones, Rebecca

    2014-01-01

    An estimated 300 000 sport-related concussion injuries occur in the United States annually. Approximately 30% of individuals with concussions experience balance disturbances. Common methods of balance assessment include the Clinical Test of Sensory Organization and Balance (CTSIB), the Sensory Organization Test (SOT), the Balance Error Scoring System (BESS), and the Romberg test; however, the National Collegiate Athletic Association recommended the Wii Fit as an alternative measure of balance in athletes with a concussion. A central concern regarding the implementation of the Wii Fit is whether it is reliable and valid for measuring balance disturbance in athletes with concussion. To examine the reliability and validity evidence for the CTSIB, SOT, BESS, Romberg test, and Wii Fit for detecting balance disturbance in athletes with a concussion. Literature considered for review included publications with reliability and validity data for the assessments of balance (CTSIB, SOT, BESS, Romberg test, and Wii Fit) from PubMed, PsycINFO, and CINAHL. We identified 63 relevant articles for consideration in the review. Of the 63 articles, 28 were considered appropriate for inclusion and 35 were excluded. No current reliability or validity information supports the use of the CTSIB, SOT, Romberg test, or Wii Fit for balance assessment in athletes with a concussion. The BESS demonstrated moderate to high reliability (interclass correlation coefficient = 0.87) and low to moderate validity (sensitivity = 34%, specificity = 87%). However, the Romberg test and Wii Fit have been shown to be reliable tools in the assessment of balance in Parkinson patients. The BESS can evaluate balance problems after a concussion. However, it lacks the ability to detect balance problems after the third day of recovery. Further investigation is needed to establish the use of the CTSIB, SOT, Romberg test, and Wii Fit for assessing balance in athletes with concussions.

  1. Many quality measurements, but few quality measures assessing the quality of breast cancer care in women: A systematic review

    Directory of Open Access Journals (Sweden)

    Zhang Li

    2006-12-01

    Full Text Available Abstract Background Breast cancer in women is increasingly frequent, and care is complex, onerous and expensive, all of which lend urgency to improvements in care. Quality measurement is essential to monitor effectiveness and to guide improvements in healthcare. Methods Ten databases, including Medline, were searched electronically to identify measures assessing the quality of breast cancer care in women (diagnosis, treatment, followup, documentation of care. Eligible studies measured adherence to standards of breast cancer care in women diagnosed with, or in treatment for, any histological type of adenocarcinoma of the breast. Reference lists of studies, review articles, web sites, and files of experts were searched manually. Evidence appraisal entailed dual independent assessments of data (e.g., indicators used in quality measurement. The extent of each quality indicator's scientific validation as a measure was assessed. The American Society of Clinical Oncology (ASCO was asked to contribute quality measures under development. Results Sixty relevant reports identified 58 studies with 143 indicators assessing adherence to quality breast cancer care. A paucity of validated indicators (n = 12, most of which assessed quality of life, only permitted a qualitative data synthesis. Most quality indicators evaluated processes of care. Conclusion While some studies revealed patterns of under-use of care, all adherence data require confirmation using validated quality measures. ASCO's current development of a set of quality measures relating to breast cancer care may hold the key to conducting definitive studies.

  2. Measuring patient activation in Italy: Translation, adaptation and validation of the Italian version of the patient activation measure 13 (PAM13-I).

    Science.gov (United States)

    Graffigna, Guendalina; Barello, Serena; Bonanomi, Andrea; Lozza, Edoardo; Hibbard, Judith

    2015-12-23

    The Patient Activation Measure (PAM13) is an instrument that assesses patient knowledge, skills, and confidence for disease self-management. This cross-sectional study was aimed to validate a culturally-adapted Italian Patient Activation Measure (PAM13-I) for patients with chronic conditions. 519 chronic patients were involved in the Italian validation study and responded to PAM13-I. The PAM 13 was translated into Italian by a standardized forward-backward translation. Data quality was assessed by mean, median, item response, missing values, floor and ceiling effects, internal consistency (Cronbach's alpha and average inter-item correlation), item-rest correlations. Rasch Model and differential item functioning assessed scale properties. Mean PAM13-I score was 66.2. Rasch analysis showed that the PAM13-I is a good measure of patient activation. The level of internal consistency was good (α = 0.88). For all items, the distribution of answers was left-skewed, with a small floor effect (range 1.7-4.5 %) and a moderate ceiling effect (range 27.6-55.0 %). The Italian version formed a unidimensional, probabilistic Guttman-like scale explaining 41 % of the variance. The PAM13-I has been demonstrated to be a valid and reliable measure of patient activation and the present study suggests its applicability to the Italian-speaking chronic patient population. The measure has good psychometric properties and appears to be consistent with the developmental nature of the patient activation phenomenon, although it presents a different ranking order of the items comparing to the American version. PAM13-I can be a useful assessment tool to evaluate interventions aimed at improving patient engagement in healthcare and to train doctors in attuning their communication to the level of patients' activation. Future research could be conducted to further confirm the validity of the PAM13-I.

  3. Reliability and validity of the brief multidimensional measure of religiousness/spirituality among adolescents.

    Science.gov (United States)

    Harris, Sion Kim; Sherritt, Lon R; Holder, David W; Kulig, John; Shrier, Lydia A; Knight, John R

    2008-12-01

    Developed for use in health research, the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) consists of brief measures of a broad range of religiousness and spirituality (R/S) dimensions. It has established psychometric properties among adults, but little is known about its appropriateness for use with adolescents. We assessed the psychometric properties of the BMMRS among adolescents. We recruited a racially diverse (85% non-White) sample of 305 adolescents aged 12-18 years (median 16 yrs, IQR 14-17) from 3 urban medical clinics; 93 completed a retest 1 week later. We assessed internal consistency and test-retest reliability. We assessed construct validity by examining how well the measures discriminated groups expected to differ based on self-reported religious preference, and how they related to a hypothesized correlate, depressive symptoms. Religious preference was categorized into "No religion/Atheist" (11%), "Don't know/Confused" (9%), or "Named a religion" (80%). Responses to multi-item measures were generally internally consistent (alpha > or = 0.70 for 12/16 measures) and stable over 1 week (intraclass correlation coefficients > or = 0.70 for 14/16). Forgiveness, Negative R/S Coping, and Commitment items showed lower internal cohesiveness. Scores on most measures were higher (p Atheist" group. Forgiveness, Commitment, and Anticipated Support from members of one's congregation were inversely correlated with depressive symptoms, while BMMRS measures assessing negative R/S experiences (Negative R/S Coping, Negative Interactions with others in congregation, Loss in Faith) were positively correlated with depressive symptoms. These findings suggest that most BMMRS measures are reliable and valid for use among adolescents.

  4. Validation of Transition Readiness Assessment Questionaire in Turkish Adolescents with Diabetes

    Directory of Open Access Journals (Sweden)

    Evrim Kızıler

    2018-02-01

    . Conclusion: The Turkish Transition Readiness Assessment Questionnaire is a valid and reliable measure of the transition readiness of adolescents/young adults with diabetes mellitus in Turkey. The Transition Readiness Assessment Questionnaire assesses the self-management abilities and health care transition knowledge of adolescents/young adults with diabetes mellitus who need special health care. It can also serve as a guide for health care professionals in detecting the educational fields that are necessary for acquiring self-management and self-care abilities

  5. Assessment of the validity of the CUDIT-R in a subpopulation of cannabis users.

    Science.gov (United States)

    Loflin, Mallory; Babson, Kimberly; Browne, Kendall; Bonn-Miller, Marcel

    2018-01-01

    The Cannabis Use Disorders Identification Test-Revised (CUDIT-R) is an 8-item measure used to screen for cannabis use disorders (CUD). Despite widespread use of the tool, assessments of the CUDIT-R's validity in subpopulations are limited. The current study tested the structural validity and internal consistency of one of the most widely used screening measures for CUD (i.e., CUDIT-R) among a sample of military veterans who use cannabis for medicinal purposes. The present study used confirmatory factor analysis (CFA) to test the internal consistency and validity of the single-factor structure of the original screener among a sample of veterans who use cannabis for medicinal purposes (n = 90 [90% male]; M age  = 55.31, SD = 15.37). Measures included demographics and the CUDIT-R, obtained from the baseline assessment of an ongoing longitudinal study. The CFA revealed that the single-factor model previously validated in recreational using samples only accounted for 38.34% of total variance in responses on the CUDIT-R (χ 2  = 66.09, df = 28, p medicinal cannabis and other subpopulations of cannabis users.

  6. Validating Machine Learning Algorithms for Twitter Data Against Established Measures of Suicidality.

    Science.gov (United States)

    Braithwaite, Scott R; Giraud-Carrier, Christophe; West, Josh; Barnes, Michael D; Hanson, Carl Lee

    2016-05-16

    One of the leading causes of death in the United States (US) is suicide and new methods of assessment are needed to track its risk in real time. Our objective is to validate the use of machine learning algorithms for Twitter data against empirically validated measures of suicidality in the US population. Using a machine learning algorithm, the Twitter feeds of 135 Mechanical Turk (MTurk) participants were compared with validated, self-report measures of suicide risk. Our findings show that people who are at high suicidal risk can be easily differentiated from those who are not by machine learning algorithms, which accurately identify the clinically significant suicidal rate in 92% of cases (sensitivity: 53%, specificity: 97%, positive predictive value: 75%, negative predictive value: 93%). Machine learning algorithms are efficient in differentiating people who are at a suicidal risk from those who are not. Evidence for suicidality can be measured in nonclinical populations using social media data.

  7. Assessing students' communication skills: validation of a global rating.

    Science.gov (United States)

    Scheffer, Simone; Muehlinghaus, Isabel; Froehmel, Annette; Ortwein, Heiderose

    2008-12-01

    Communication skills training is an accepted part of undergraduate medical programs nowadays. In addition to learning experiences its importance should be emphasised by performance-based assessment. As detailed checklists have been shown to be not well suited for the assessment of communication skills for different reasons, this study aimed to validate a global rating scale. A Canadian instrument was translated to German and adapted to assess students' communication skills during an end-of-semester-OSCE. Subjects were second and third year medical students at the reformed track of the Charité-Universitaetsmedizin Berlin. Different groups of raters were trained to assess students' communication skills using the global rating scale. Validity testing included concurrent validity and construct validity: Judgements of different groups of raters were compared to expert ratings as a defined gold standard. Furthermore, the amount of agreement between scores obtained with this global rating scale and a different instrument for assessing communication skills was determined. Results show that communication skills can be validly assessed by trained non-expert raters as well as standardised patients using this instrument.

  8. Validation of Experimental whole-body SAR Assessment Method in a Complex Indoor Environment

    DEFF Research Database (Denmark)

    Bamba, Aliou; Joseph, Wout; Vermeeren, Gunter

    2012-01-01

    Assessing experimentally the whole-body specific absorption rate (SARwb) in a complex indoor environment is very challenging. An experimental method based on room electromagnetics theory (accounting only the Line-Of-Sight as specular path) to assess the whole-body SAR is validated by numerical...... of the proposed method is that it allows discarding the computation burden because it does not use any discretizations. Results show good agreement between measurement and computation at 2.8 GHz, as long as the plane wave assumption is valid, i.e., for high distances from the transmitter. Relative deviations 0...

  9. Validation of a Measure of Family Resilience among Iraq and Afghanistan Veterans.

    Science.gov (United States)

    Finley, Erin P; Pugh, Mary Jo; Palmer, Raymond F

    2016-01-01

    Although interactions within veterans' families may support or inhibit resilient coping to stress and trauma across the deployment cycle, research on family resilience has been hampered by the lack of a brief assessment. Using a three-stage mixed-method study, we developed and conducted preliminary validation of a measure of family resilience tailored for Iraq and Afghanistan veterans (IAV), the Family Resilience Scale for Veterans (FRS-V) , which was field-tested using a survey of 151 IAV. Our findings indicate the resulting 6-item measure shows strong initial reliability and validity and support the application of existing models of family resilience in this population.

  10. Evaluation of the Gratitude Questionnaire in a Chinese Sample of Adults: Factorial Validity, Criterion-Related Validity, and Measurement Invariance Across Sex.

    Science.gov (United States)

    Kong, Feng; You, Xuqun; Zhao, Jingjing

    2017-01-01

    The Gratitude Questionnaire (GQ; McCullough et al., 2002) is one of the most widely used instruments to assess dispositional gratitude. The purpose of this study was to validate a Chinese version of the GQ by examining internal consistency, factor structure, convergent validity, and measurement invariance across sex. A total of 1151 Chinese adults were recruited to complete the GQ, Positive Affect and Negative Affect Scales, and Satisfaction with Life Scale. Confirmatory factor analysis indicated that the original unidimensional model fitted well, which is in accordance with the findings in Western populations. Furthermore, the GQ had satisfactory composite reliability and criterion-related validity with measures of life satisfaction and affective well-being. Evidence of configural, metric and scalar invariance across sex was obtained. Tests of the latent mean differences found females had higher latent mean scores than males. These findings suggest that the Chinese version of GQ is a reliable and valid tool for measuring dispositional gratitude and can generally be utilized across sex in the Chinese context.

  11. Visual Impairment Screening Assessment (VISA) tool: pilot validation.

    Science.gov (United States)

    Rowe, Fiona J; Hepworth, Lauren R; Hanna, Kerry L; Howard, Claire

    2018-03-06

    To report and evaluate a new Vision Impairment Screening Assessment (VISA) tool intended for use by the stroke team to improve identification of visual impairment in stroke survivors. Prospective case cohort comparative study. Stroke units at two secondary care hospitals and one tertiary centre. 116 stroke survivors were screened, 62 by naïve and 54 by non-naïve screeners. Both the VISA screening tool and the comprehensive specialist vision assessment measured case history, visual acuity, eye alignment, eye movements, visual field and visual inattention. Full completion of VISA tool and specialist vision assessment was achieved for 89 stroke survivors. Missing data for one or more sections typically related to patient's inability to complete the assessment. Sensitivity and specificity of the VISA screening tool were 90.24% and 85.29%, respectively; the positive and negative predictive values were 93.67% and 78.36%, respectively. Overall agreement was significant; k=0.736. Lowest agreement was found for screening of eye movement and visual inattention deficits. This early validation of the VISA screening tool shows promise in improving detection accuracy for clinicians involved in stroke care who are not specialists in vision problems and lack formal eye training, with potential to lead to more prompt referral with fewer false positives and negatives. Pilot validation indicates acceptability of the VISA tool for screening of visual impairment in stroke survivors. Sensitivity and specificity were high indicating the potential accuracy of the VISA tool for screening purposes. Results of this study have guided the revision of the VISA screening tool ahead of full clinical validation. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  12. Reliability and validity of measures used in assessing dental anxiety in 5- to 15-year-old Croatian children.

    Science.gov (United States)

    Majstorovic, M; Veerkamp, J S; Skrinjaric, I

    2003-12-01

    The aim of the study was to evaluate reliability and validity of different questionnaires and predict related causes, as concomitant factors in assessing different aspects of children's dental anxiety. Children were interviewed on dental anxiety, dispositional risk factors and satisfaction with the dentist after dental treatment had been accomplished. Parents were interviewed on dental anxiety as well. The study population included 165 children (91 boys) aged 5 to 15 years, referred to a university dental clinic by general dental practitioners because of a history of fear and uncooperative behaviour during previous dental visits. Children were treated by two dentists, both experienced in treating fearful children. Statistical analysis was performed in Statistics for Windows, Release 5.5 and Release 7.5. Pearson's correlation coefficients were calculated for validity and Cronbach alpha for reliability of the measures. Spearman Brown prophecy formula was used for correction of the alpha scores. Results The children's total average CFSS-DS score was 27.02, with no significant difference with respect to gender. The highest Cronbach alpha scores regarding reliability were obtained for the S-DAI, the CFSS-DS and the PDAS. Pearson's correlations regarding validity presented significant correlations between the CMFQ, the CDAS and the S-DAI, between the OAS, the CDAS and the S-DAI, as well as between the OAS and the DVSS-SV. Previous negative medical experience had significant influence on children's dental anxiety, supporting Rachman's conditioning theory. Anxious children were more likely to show behaviour problems (aggression) and more introvert in expressing their judgement regarding the dentist. Both the S-DAI and the CFSS-DS, which were standardized in the Croatian population sample, showed the highest reliability in assessment of children's dental anxiety.

  13. The ecological and construct validity of a newly developed measure of executive function: the Virtual Library Task.

    Science.gov (United States)

    Renison, Belinda; Ponsford, Jennie; Testa, Renee; Richardson, Barry; Brownfield, Kylie

    2012-05-01

    Virtual reality (VR) assessment paradigms have the potential to address the limited ecological validity of pen and paper measures of executive function (EF) and the pragmatic and reliability issues associated with functional measures. To investigate the ecological validity and construct validity of a newly developed VR measure of EF, the Virtual Library Task (VLT); a real life analogous task--the Real Library Task (RLT); and five neuropsychological measures of EF were administered to 30 patients with traumatic brain injury (TBI) and 30 healthy Controls. Significant others for each participant also completed the Dysexecutive Questionnaire (DEX), which is a behavioral rating scale of everyday EF. Performances on the VLT and the RLT were significantly positively correlated indicating that VR performance is similar to real world performance. The TBI group performed significantly worse than the Control group on the VLT and the Modified Six Elements Test (MSET) but the other four neuropsychological measures of EF failed to differentiate the groups. Both the MSET and the VLT significantly predicted everyday EF suggesting that they are both ecologically valid tools for the assessment of EF. The VLT has the advantage over the MSET of providing objective measurement of individual components of EF.

  14. Teamwork assessment in internal medicine: a systematic review of validity evidence and outcomes.

    Science.gov (United States)

    Havyer, Rachel D A; Wingo, Majken T; Comfere, Nneka I; Nelson, Darlene R; Halvorsen, Andrew J; McDonald, Furman S; Reed, Darcy A

    2014-06-01

    Valid teamwork assessment is imperative to determine physician competency and optimize patient outcomes. We systematically reviewed published instruments assessing teamwork in undergraduate, graduate, and continuing medical education in general internal medicine and all medical subspecialties. We searched MEDLINE, MEDLINE In-process, CINAHL and PsycINFO from January 1979 through October 2012, references of included articles, and abstracts from four professional meetings. Two content experts were queried for additional studies. Included studies described quantitative tools measuring teamwork among medical students, residents, fellows, and practicing physicians on single or multi-professional (interprofessional) teams. Instrument validity and study quality were extracted using established frameworks with existing validity evidence. Two authors independently abstracted 30 % of articles and agreement was calculated. Of 12,922 citations, 178 articles describing 73 unique teamwork assessment tools met inclusion criteria. Interrater agreement was intraclass correlation coefficient 0.73 (95 % CI 0.63-0.81). Studies involved practicing physicians (142, 80 %), residents/fellows (70, 39 %), and medical students (11, 6 %). The majority (152, 85 %) assessed interprofessional teams. Studies were conducted in inpatient (77, 43 %), outpatient (42, 24 %), simulation (37, 21 %), and classroom (13, 7 %) settings. Validity evidence for the 73 tools included content (54, 74 %), internal structure (51, 70 %), relationships to other variables (25, 34 %), and response process (12, 16 %). Attitudes and opinions were the most frequently assessed outcomes. Relationships between teamwork scores and patient outcomes were directly examined for 13 (18 %) of tools. Scores from the Safety Attitudes Questionnaire and Team Climate Inventory have substantial validity evidence and have been associated with improved patient outcomes. Review is limited to quantitative assessments of teamwork in internal

  15. Social Validation of the New England Center for Children-Core Skills Assessment

    Science.gov (United States)

    Dickson, Chata A.; MacDonald, Rebecca P. F.; Mansfield, Renee; Guilhardi, Paulo; Johnson, Cammarie; Ahearn, William H.

    2014-01-01

    We investigated the social validity of the NECC Core Skills Assessment (NECC-CSA) with parents and professionals as participants. The NECC-CSA is a measurement tool consisting of direct and indirect measures of skills important to all individuals with autism, across the lifespan. Participants (N = 245) were provided with a list of 66 skills, 47 of…

  16. Validity of parent-reported weight and height of preschool children measured at home or estimated without home measurement: a validation study

    Directory of Open Access Journals (Sweden)

    Cox Bianca

    2011-07-01

    Full Text Available Abstract Background Parental reports are often used in large-scale surveys to assess children's body mass index (BMI. Therefore, it is important to know to what extent these parental reports are valid and whether it makes a difference if the parents measured their children's weight and height at home or whether they simply estimated these values. The aim of this study is to compare the validity of parent-reported height, weight and BMI values of preschool children (3-7 y-old, when measured at home or estimated by parents without actual measurement. Methods The subjects were 297 Belgian preschool children (52.9% male. Participation rate was 73%. A questionnaire including questions about height and weight of the children was completed by the parents. Nurses measured height and weight following standardised procedures. International age- and sex-specific BMI cut-off values were employed to determine categories of weight status and obesity. Results On the group level, no important differences in accuracy of reported height, weight and BMI were identified between parent-measured or estimated values. However, for all 3 parameters, the correlations between parental reports and nurse measurements were higher in the group of children whose body dimensions were measured by the parents. Sensitivity for underweight and overweight/obesity were respectively 73% and 47% when parents measured their child's height and weight, and 55% and 47% when parents estimated values without measurement. Specificity for underweight and overweight/obesity were respectively 82% and 97% when parents measured the children, and 75% and 93% with parent estimations. Conclusions Diagnostic measures were more accurate when parents measured their child's weight and height at home than when those dimensions were based on parental judgements. When parent-reported data on an individual level is used, the accuracy could be improved by encouraging the parents to measure weight and height

  17. Developing a model for hospital inherent safety assessment: Conceptualization and validation.

    Science.gov (United States)

    Yari, Saeed; Akbari, Hesam; Gholami Fesharaki, Mohammad; Khosravizadeh, Omid; Ghasemi, Mohammad; Barsam, Yalda; Akbari, Hamed

    2018-01-01

    Paying attention to the safety of hospitals, as the most crucial institute for providing medical and health services wherein a bundle of facilities, equipment, and human resource exist, is of significant importance. The present research aims at developing a model for assessing hospitals' safety based on principles of inherent safety design. Face validity (30 experts), content validity (20 experts), construct validity (268 examples), convergent validity, and divergent validity have been employed to validate the prepared questionnaire; and the items analysis, the Cronbach's alpha test, ICC test (to measure reliability of the test), composite reliability coefficient have been used to measure primary reliability. The relationship between variables and factors has been confirmed at 0.05 significance level by conducting confirmatory factor analysis (CFA) and structural equations modeling (SEM) technique with the use of Smart-PLS. R-square and load factors values, which were higher than 0.67 and 0.300 respectively, indicated the strong fit. Moderation (0.970), simplification (0.959), substitution (0.943), and minimization (0.5008) have had the most weights in determining the inherent safety of hospital respectively. Moderation, simplification, and substitution, among the other dimensions, have more weight on the inherent safety, while minimization has the less weight, which could be due do its definition as to minimize the risk.

  18. Uncertainty estimates of purity measurements based on current information: toward a "live validation" of purity methods.

    Science.gov (United States)

    Apostol, Izydor; Kelner, Drew; Jiang, Xinzhao Grace; Huang, Gang; Wypych, Jette; Zhang, Xin; Gastwirt, Jessica; Chen, Kenneth; Fodor, Szilan; Hapuarachchi, Suminda; Meriage, Dave; Ye, Frank; Poppe, Leszek; Szpankowski, Wojciech

    2012-12-01

    To predict precision and other performance characteristics of chromatographic purity methods, which represent the most widely used form of analysis in the biopharmaceutical industry. We have conducted a comprehensive survey of purity methods, and show that all performance characteristics fall within narrow measurement ranges. This observation was used to develop a model called Uncertainty Based on Current Information (UBCI), which expresses these performance characteristics as a function of the signal and noise levels, hardware specifications, and software settings. We applied the UCBI model to assess the uncertainty of purity measurements, and compared the results to those from conventional qualification. We demonstrated that the UBCI model is suitable to dynamically assess method performance characteristics, based on information extracted from individual chromatograms. The model provides an opportunity for streamlining qualification and validation studies by implementing a "live validation" of test results utilizing UBCI as a concurrent assessment of measurement uncertainty. Therefore, UBCI can potentially mitigate the challenges associated with laborious conventional method validation and facilitates the introduction of more advanced analytical technologies during the method lifecycle.

  19. Development and validation of parenting measures for body image and eating patterns in childhood.

    Science.gov (United States)

    Damiano, Stephanie R; Hart, Laura M; Paxton, Susan J

    2015-01-01

    Evidence-based parenting interventions are important in assisting parents to help their children develop healthy body image and eating patterns. To adequately assess the impact of parenting interventions, valid parent measures are required. The aim of this study was to develop and assess the validity and reliability of two new parent measures, the Parenting Intentions for Body image and Eating patterns in Childhood (Parenting Intentions BEC) and the Knowledge Test for Body image and Eating patterns in Childhood (Knowledge Test BEC). Participants were 27 professionals working in research or clinical treatment of body dissatisfaction or eating disorders, and 75 parents of children aged 2-6 years, who completed the measures via an online questionnaire. Seven scenarios were developed for the Parenting Intentions BEC to describe common experiences about the body and food that parents might need to respond to in front of their child. Parents ranked four behavioural intentions, derived from the current literature on parenting risk factors for body dissatisfaction and unhealthy eating patterns in children. Two subscales were created, one representing positive behavioural intentions, the other negative behavioural intentions. After piloting a larger pool of items, 13 statements were used to construct the Knowledge Test BEC. These were designed to be factual statements about the influence of parent language, media, family meals, healthy eating, and self-esteem on child eating and body image. The validity of both measures was tested by comparing parent and professional scores, and reliability was assessed by comparing parent scores over two testing occasions. Compared with parents, professionals reported significantly higher scores on the Positive Intentions subscale and significantly lower on the Negative Intentions subscale of the Parenting Intentions BEC; confirming the discriminant validity of six out of the seven scenarios. Test-retest reliability was also confirmed as

  20. Incremental Validity of the Durand Adaptive Psychopathic Traits Questionnaire Above Self-Report Psychopathy Measures in Community Samples.

    Science.gov (United States)

    Durand, Guillaume

    2018-05-03

    Although highly debated, the notion of the existence of an adaptive side to psychopathy is supported by some researchers. Currently, 2 instruments assessing psychopathic traits include an adaptive component, which might not cover the full spectrum of adaptive psychopathic traits. The Durand Adaptive Psychopathic Traits Questionnaire (DAPTQ; Durand, 2017 ) is a 41-item self-reported instrument assessing adaptive traits known to correlate with the psychopathic personality. In this study, I investigated in 2 samples (N = 263 and N = 262) the incremental validity of the DAPTQ over the Psychopathic Personality Inventory-Short Form (PPI-SF) and the Triarchic Psychopathy Measure (TriPM) using multiple criterion measures. Results showed that the DAPTQ significantly increased the predictive validity over the PPI-SF on 5 factors of the HEXACO. Additionally, the DAPTQ provided incremental validity over both the PPI-SF and the TriPM on measures of communication adaptability, perceived stress, and trait anxiety. Overall, these results support the validity of the DAPTQ in community samples. Directions for future studies to further validate the DAPTQ are discussed.

  1. Validation of Modified Soft Skills Assessment Instrument (MOSSAI) for Use in Nigeria

    Science.gov (United States)

    Aworanti, O. A.; Taiwo, M. B.; Iluobe, O. I.

    2015-01-01

    Currently, it has become an accepted norm nearly all over the globe to teach and assess soft skills. However, in Nigeria, it is an emerging area of interest that needs to be addressed squarely. In the light of the fore-going, this study validated a modified version of Measuring and Assessment Soft Skills (MASS) (an instrument developed and used by…

  2. Assessing attitude toward same-sex marriage: scale development and validation.

    Science.gov (United States)

    Lannutti, Pamela J; Lachlan, Kenneth A

    2007-01-01

    This paper reports the results of three studies conducted to develop, refine, and validate a scale which assessed heterosexual adults' attitudes toward same-sex marriage, the Attitude Toward Same-Sex Marriage Scale (ASSMS). The need for such a scale is evidenced in the increasing importance of same-sex marriage in the political arena of the United States and other nations, as well as the growing body of empirical research examining same-sex marriage and related issues (e.g., Lannutti, 2005; Solomon, Rothblum, & Balsam, 2004). The results demonstrate strong reliability, convergent validity, and predictive validity for the ASSMS and suggest that the ASSMS may be adapted to measure attitudes toward civil unions and other forms of relational recognition for same-sex couples. Gender comparisons using the validated scale showed that in college and non-college samples, women had a significantly more positive attitude toward same-sex marriage than did men.

  3. Validity of Devices That Assess Body Temperature During Outdoor Exercise in the Heat

    OpenAIRE

    Casa, Douglas J; Becker, Shannon M; Ganio, Matthew S; Brown, Christopher M; Yeargin, Susan W; Roti, Melissa W; Siegler, Jason; Blowers, Julie A; Glaviano, Neal R; Huggins, Robert A; Armstrong, Lawrence E; Maresh, Carl M

    2007-01-01

    Context: Rectal temperature is recommended by the National Athletic Trainers' Association as the criterion standard for recognizing exertional heat stroke, but other body sites commonly are used to measure temperature. Few authors have assessed the validity of the thermometers that measure body temperature at these sites in athletic settings.

  4. Initial Verification and Validation Assessment for VERA

    Energy Technology Data Exchange (ETDEWEB)

    Dinh, Nam [North Carolina State Univ., Raleigh, NC (United States); Athe, Paridhi [North Carolina State Univ., Raleigh, NC (United States); Jones, Christopher [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Hetzler, Adam [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Sieger, Matt [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

    2017-04-01

    The Virtual Environment for Reactor Applications (VERA) code suite is assessed in terms of capability and credibility against the Consortium for Advanced Simulation of Light Water Reactors (CASL) Verification and Validation Plan (presented herein) in the context of three selected challenge problems: CRUD-Induced Power Shift (CIPS), Departure from Nucleate Boiling (DNB), and Pellet-Clad Interaction (PCI). Capability refers to evidence of required functionality for capturing phenomena of interest while capability refers to the evidence that provides confidence in the calculated results. For this assessment, each challenge problem defines a set of phenomenological requirements against which the VERA software is assessed. This approach, in turn, enables the focused assessment of only those capabilities relevant to the challenge problem. The evaluation of VERA against the challenge problem requirements represents a capability assessment. The mechanism for assessment is the Sandia-developed Predictive Capability Maturity Model (PCMM) that, for this assessment, evaluates VERA on 8 major criteria: (1) Representation and Geometric Fidelity, (2) Physics and Material Model Fidelity, (3) Software Quality Assurance and Engineering, (4) Code Verification, (5) Solution Verification, (6) Separate Effects Model Validation, (7) Integral Effects Model Validation, and (8) Uncertainty Quantification. For each attribute, a maturity score from zero to three is assigned in the context of each challenge problem. The evaluation of these eight elements constitutes the credibility assessment for VERA.

  5. Validation of the second-generation Olympus colonoscopy simulator for skills assessment.

    Science.gov (United States)

    Haycock, A V; Bassett, P; Bladen, J; Thomas-Gibson, S

    2009-11-01

    Simulators have potential value in providing objective evidence of technical skill for procedures within medicine. The aim of this study was to determine face and construct validity for the Olympus colonoscopy simulator and to establish which assessment measures map to clinical benchmarks of expertise. Thirty-four participants were recruited: 10 novices with no prior colonoscopy experience, 13 intermediate (trainee) endoscopists with fewer than 1000 previous colonoscopies, and 11 experienced endoscopists with more than 1000 previous colonoscopies. All participants completed three standardized cases on the simulator and experts gave feedback regarding the realism of the simulator. Forty metrics recorded automatically by the simulator were analyzed for their ability to distinguish between the groups. The simulator discriminated participants by experience level for 22 different parameters. Completion rates were lower for novices than for trainees and experts (37 % vs. 79 % and 88 % respectively, P variable stiffness function ( P = 0.004), number of sigmoid N-loops ( P = 0.02); size of sigmoid N-loops ( P = 0.01), and time to remove alpha loops ( P = 0.004). Out of 10, experts rated the realism of movement at 6.4, force feedback at 6.6, looping at 6.6, and loop resolution at 6.8. The Olympus colonoscopy simulator has good face validity and excellent construct validity. It provides an objective assessment of colonoscopic skill on multiple measures and benchmarks have been set to allow its use as both a formative and a summative assessment tool. Georg Thieme Verlag KG Stuttgart. New York.

  6. Development, content validity, and cross-cultural adaptation of a patient-reported outcome measure for real-time symptom assessment in irritable bowel syndrome.

    Science.gov (United States)

    Vork, L; Keszthelyi, D; Mujagic, Z; Kruimel, J W; Leue, C; Pontén, I; Törnblom, H; Simrén, M; Albu-Soda, A; Aziz, Q; Corsetti, M; Holvoet, L; Tack, J; Rao, S S; van Os, J; Quetglas, E G; Drossman, D A; Masclee, A A M

    2018-03-01

    End-of-day questionnaires, which are considered the gold standard for assessing abdominal pain and other gastrointestinal (GI) symptoms in irritable bowel syndrome (IBS), are influenced by recall and ecological bias. The experience sampling method (ESM) is characterized by random and repeated assessments in the natural state and environment of a subject, and herewith overcomes these limitations. This report describes the development of a patient-reported outcome measure (PROM) based on the ESM principle, taking into account content validity and cross-cultural adaptation. Focus group interviews with IBS patients and expert meetings with international experts in the fields of neurogastroenterology & motility and pain were performed in order to select the items for the PROM. Forward-and-back translation and cognitive interviews were performed to adapt the instrument for the use in different countries and to assure on patients' understanding with the final items. Focus group interviews revealed 42 items, categorized into five domains: physical status, defecation, mood and psychological factors, context and environment, and nutrition and drug use. Experts reduced the number of items to 32 and cognitive interviewing after translation resulted in a few slight adjustments regarding linguistic issues, but not regarding content of the items. An ESM-based PROM, suitable for momentary assessment of IBS symptom patterns was developed, taking into account content validity and cross-cultural adaptation. This PROM will be implemented in a specifically designed smartphone application and further validation in a multicenter setting will follow. © 2017 John Wiley & Sons Ltd.

  7. Assessment of patient empowerment--a systematic review of measures.

    Directory of Open Access Journals (Sweden)

    Paul J Barr

    Full Text Available Patient empowerment has gained considerable importance but uncertainty remains about the best way to define and measure it. The validity of empirical findings depends on the quality of measures used. This systematic review aims to provide an overview of studies assessing psychometric properties of questionnaires purporting to capture patient empowerment, evaluate the methodological quality of these studies and assess the psychometric properties of measures identified.Electronic searches in five databases were combined with reference tracking of included articles. Peer-reviewed articles reporting psychometric testing of empowerment measures for adult patients in French, German, English, Portuguese and Spanish were included. Study characteristics, constructs operationalised and psychometric properties were extracted. The quality of study design, methods and reporting was assessed using the COSMIN checklist. The quality of psychometric properties was assessed using Terwee's 2007 criteria.30 studies on 19 measures were included. Six measures are generic, while 13 were developed for a specific condition (N=4 or specialty (N=9. Most studies tested measures in English (N=17 or Swedish (N=6. Sample sizes of included studies varied from N=35 to N=8261. A range of patient empowerment constructs was operationalised in included measures. These were classified into four domains: patient states, experiences and capacities; patient actions and behaviours; patient self-determination within the healthcare relationship and patient skills development. Quality assessment revealed several flaws in methodological study quality with COSMIN scores mainly fair or poor. The overall quality of psychometric properties of included measures was intermediate to positive. Certain psychometric properties were not tested for most measures.Findings provide a basis from which to develop consensus on a core set of patient empowerment constructs and for further work to develop a

  8. Using the eating disorder examination in the assessment of bulimia and anorexia: issues of reliability and validity.

    Science.gov (United States)

    Guest, T

    2000-01-01

    The Eating Disorder Examination will be assessed according to its reliability and validity in the assessment of anorexia nervosa and bulimia nervosa. A thorough review of the literature was conducted to judge the reliability and validity of the Eating Disorder Examination and its subscales. The review shows that the EDE and its subscales have good interrater reliability and internal consistency reliability. Similarly, high levels of discriminant validity, construct validity, and treatment validity in the assessment of eating disorders were also found. A summary of each study concerning the various types of reliability and validity will be provided. The EDE is considered to be the "gold standard" by which to identify eating disorders, so this tool used in conjunction with other behavioral measures will be imperative for clinical social work practice.

  9. Measuring patient-provider communication skills in Rwanda: Selection, adaptation and assessment of psychometric properties of the Communication Assessment Tool.

    Science.gov (United States)

    Cubaka, Vincent Kalumire; Schriver, Michael; Vedsted, Peter; Makoul, Gregory; Kallestrup, Per

    2018-04-23

    To identify, adapt and validate a measure for providers' communication and interpersonal skills in Rwanda. After selection, translation and piloting of the measure, structural validity, test-retest reliability, and differential item functioning were assessed. Identification and adaptation: The 14-item Communication Assessment Tool (CAT) was selected and adapted. Content validation found all items highly relevant in the local context except two, which were retained upon understanding the reasoning applied by patients. Eleven providers and 291 patients were involved in the field-testing. Confirmatory factor analysis showed a good fit for the original one factor model. Test-retest reliability assessment revealed a mean quadratic weighted Kappa = 0.81 (range: 0.69-0.89, N = 57). The average proportion of excellent scores was 15.7% (SD: 24.7, range: 9.9-21.8%, N = 180). Differential item functioning was not observed except for item 1, which focuses on greetings, for age groups (p = 0.02, N = 180). The Kinyarwanda version of CAT (K-CAT) is a reliable and valid patient-reported measure of providers' communication and interpersonal skills. K-CAT was validated on nurses and its use on other types of providers may require further validation. K-CAT is expected to be a valuable feedback tool for providers in practice and in training. Copyright © 2018 Elsevier B.V. All rights reserved.

  10. Advantages and psychometric validation of proximal intensive assessments of patient-reported outcomes collected in daily life.

    Science.gov (United States)

    Carlson, Eve B; Field, Nigel P; Ruzek, Josef I; Bryant, Richard A; Dalenberg, Constance J; Keane, Terrence M; Spain, David A

    2016-03-01

    Ambulatory assessment data collection methods are increasingly used to study behavior, experiences, and patient-reported outcomes (PROs), such as emotions, cognitions, and symptoms in clinical samples. Data collected close in time at frequent and fixed intervals can assess PROs that are discrete or changing rapidly and provide information about temporal dynamics or mechanisms of change in clinical samples and individuals, but clinical researchers have not yet routinely and systematically investigated the reliability and validity of such measures or their potential added value over conventional measures. The present study provides a comprehensive, systematic evaluation of the psychometrics of several proximal intensive assessment (PIA) measures in a clinical sample and investigates whether PIA appears to assess meaningful differences in phenomena over time. Data were collected on a variety of psychopathology constructs on handheld devices every 4 h for 7 days from 62 adults recently exposed to traumatic injury of themselves or a family member. Data were also collected on standard self-report measures of the same constructs at the time of enrollment, 1 week after enrollment, and 2 months after injury. For all measure scores, results showed good internal consistency across items and within persons over time, provided evidence of convergent, divergent, and construct validity, and showed significant between- and within-subject variability. Results indicate that PIA measures can provide valid measurement of psychopathology in a clinical sample. PIA may be useful to study mechanisms of change in clinical contexts, identify targets for change, and gauge treatment progress.

  11. Empirical evaluation of the Process Overview Measure for assessing situation awareness in process plants.

    Science.gov (United States)

    Lau, Nathan; Jamieson, Greg A; Skraaning, Gyrd

    2016-03-01

    The Process Overview Measure is a query-based measure developed to assess operator situation awareness (SA) from monitoring process plants. A companion paper describes how the measure has been developed according to process plant properties and operator cognitive work. The Process Overview Measure demonstrated practicality, sensitivity, validity and reliability in two full-scope simulator experiments investigating dramatically different operational concepts. Practicality was assessed based on qualitative feedback of participants and researchers. The Process Overview Measure demonstrated sensitivity and validity by revealing significant effects of experimental manipulations that corroborated with other empirical results. The measure also demonstrated adequate inter-rater reliability and practicality for measuring SA in full-scope simulator settings based on data collected on process experts. Thus, full-scope simulator studies can employ the Process Overview Measure to reveal the impact of new control room technology and operational concepts on monitoring process plants. Practitioner Summary: The Process Overview Measure is a query-based measure that demonstrated practicality, sensitivity, validity and reliability for assessing operator situation awareness (SA) from monitoring process plants in representative settings.

  12. The reasons for betel-quid chewing scale: assessment of factor structure, reliability, and validity.

    Science.gov (United States)

    Little, Melissa A; Pokhrel, Pallav; Murphy, Kelle L; Kawamoto, Crissy T; Suguitan, Gil S; Herzog, Thaddeus A

    2014-06-03

    Despite the fact that betel-quid is one of the most commonly used psychoactive substances worldwide and a major risk-factor for head-and-neck cancer incidence and mortality globally, currently no standardized instrument is available to assess the reasons why individuals chew betel-quid. A measure to assess reasons for chewing betel-quid could help researchers and clinicians develop prevention and treatment strategies. In the current study, we sought to develop and evaluate a self-report instrument for assessing the reasons for chewing betel quid which contributes toward the goal of developing effective interventions to reduce betel quid chewing in vulnerable populations. The current study assessed the factor structure, reliability and convergent validity of the Reasons for Betel-quid Chewing Scale (RBCS), a newly developed 10 item measure adapted from several existing "reasons for smoking" scales. The measure was administered to 351 adult betel-quid chewers in Guam. Confirmatory factor analysis of this measure revealed a three factor structure: reinforcement, social/cultural, and stimulation. Further tests revealed strong support for the internal consistency and convergent validity of this three factor measure. The goal of designing an intervention to reduce betel-quid chewing necessitates an understanding of why chewers chew; the current study makes considerable contributions towards that objective.

  13. Assessing the internal validity of a household survey-based food security measure adapted for use in Iran

    Directory of Open Access Journals (Sweden)

    Sadeghizadeh Atefeh

    2009-06-01

    Full Text Available Abstract Background The prevalence of food insecurity is an indicator of material well-being in an area of basic need. The U.S. Food Security Module has been adapted for use in a wide variety of cultural and linguistic settings around the world. We assessed the internal validity of the adapted U.S. Household Food Security Survey Module to measure adult and child food insecurity in Isfahan, Iran, using statistical methods based on the Rasch measurement model. Methods The U.S. Household Food Security Survey Module was translated into Farsi and after adaptation, administered to a representative sample. Data were provided by 2,004 randomly selected households from all sectors of the population of Isfahan, Iran, during 2005. Results 53.1 percent reported that their food had run out at some time during the previous 12 months and they did not have money to buy more, while 26.7 percent reported that an adult had cut the size of a meal or skipped a meal because there was not enough money for food, and 7.2 percent reported that an adult did not eat for a whole day because there was not enough money for food. The severity of the items in the adult scale, estimated under Rasch-model assumptions, covered a range of 6.65 logistic units, and those in the child scale 11.68 logistic units. Most Item-infit statistics were near unity, and none exceeded 1.20. Conclusion The range of severity of items provides measurement coverage across a wide range of severity of food insecurity for both adults and children. Both scales demonstrated acceptable levels of internal validity, although several items should be improved. The similarity of the response patterns in the Isfahan and the U.S. suggests that food insecurity is experienced, managed, and described similarly in the two countries.

  14. A Reliability and Validity of an Instrument to Evaluate the School-Based Assessment System: A Pilot Study

    Science.gov (United States)

    Ghazali, Nor Hasnida Md

    2016-01-01

    A valid, reliable and practical instrument is needed to evaluate the implementation of the school-based assessment (SBA) system. The aim of this study is to develop and assess the validity and reliability of an instrument to measure the perception of teachers towards the SBA implementation in schools. The instrument is developed based on a…

  15. Validation of a Smartphone-Based Approach to In Situ Cognitive Fatigue Assessment

    Science.gov (United States)

    Linden, Mark

    2017-01-01

    Background Acquired Brain Injuries (ABIs) can result in multiple detrimental cognitive effects, such as reduced memory capability, concentration, and planning. These effects can lead to cognitive fatigue, which can exacerbate the symptoms of ABIs and hinder management and recovery. Assessing cognitive fatigue is difficult due to the largely subjective nature of the condition and existing assessment approaches. Traditional methods of assessment use self-assessment questionnaires delivered in a medical setting, but recent work has attempted to employ more objective cognitive tests as a way of evaluating cognitive fatigue. However, these tests are still predominantly delivered within a medical environment, limiting their utility and efficacy. Objective The aim of this research was to investigate how cognitive fatigue can be accurately assessed in situ, during the quotidian activities of life. It was hypothesized that this assessment could be achieved through the use of mobile assistive technology to assess working memory, sustained attention, information processing speed, reaction time, and cognitive throughput. Methods The study used a bespoke smartphone app to track daily cognitive performance, in order to assess potential levels of cognitive fatigue. Twenty-one participants with no prior reported brain injuries took place in a two-week study, resulting in 81 individual testing instances being collected. The smartphone app delivered three cognitive tests on a daily basis: (1) Spatial Span to measure visuospatial working memory; (2) Psychomotor Vigilance Task (PVT) to measure sustained attention, information processing speed, and reaction time; and (3) a Mental Arithmetic Test to measure cognitive throughput. A smartphone-optimized version of the Mental Fatigue Scale (MFS) self-assessment questionnaire was used as a baseline to assess the validity of the three cognitive tests, as the questionnaire has already been validated in multiple peer-reviewed studies. Results

  16. Validation of a Smartphone-Based Approach to In Situ Cognitive Fatigue Assessment.

    Science.gov (United States)

    Price, Edward; Moore, George; Galway, Leo; Linden, Mark

    2017-08-17

    Acquired Brain Injuries (ABIs) can result in multiple detrimental cognitive effects, such as reduced memory capability, concentration, and planning. These effects can lead to cognitive fatigue, which can exacerbate the symptoms of ABIs and hinder management and recovery. Assessing cognitive fatigue is difficult due to the largely subjective nature of the condition and existing assessment approaches. Traditional methods of assessment use self-assessment questionnaires delivered in a medical setting, but recent work has attempted to employ more objective cognitive tests as a way of evaluating cognitive fatigue. However, these tests are still predominantly delivered within a medical environment, limiting their utility and efficacy. The aim of this research was to investigate how cognitive fatigue can be accurately assessed in situ, during the quotidian activities of life. It was hypothesized that this assessment could be achieved through the use of mobile assistive technology to assess working memory, sustained attention, information processing speed, reaction time, and cognitive throughput. The study used a bespoke smartphone app to track daily cognitive performance, in order to assess potential levels of cognitive fatigue. Twenty-one participants with no prior reported brain injuries took place in a two-week study, resulting in 81 individual testing instances being collected. The smartphone app delivered three cognitive tests on a daily basis: (1) Spatial Span to measure visuospatial working memory; (2) Psychomotor Vigilance Task (PVT) to measure sustained attention, information processing speed, and reaction time; and (3) a Mental Arithmetic Test to measure cognitive throughput. A smartphone-optimized version of the Mental Fatigue Scale (MFS) self-assessment questionnaire was used as a baseline to assess the validity of the three cognitive tests, as the questionnaire has already been validated in multiple peer-reviewed studies. The most highly correlated results

  17. Anxiety measures validated in perinatal populations: a systematic review.

    Science.gov (United States)

    Meades, Rose; Ayers, Susan

    2011-09-01

    Research and screening of anxiety in the perinatal period is hampered by a lack of psychometric data on self-report anxiety measures used in perinatal populations. This paper aimed to review self-report measures that have been validated with perinatal women. A systematic search was carried out of four electronic databases. Additional papers were obtained through searching identified articles. Thirty studies were identified that reported validation of an anxiety measure with perinatal women. Most commonly validated self-report measures were the General Health Questionnaire (GHQ), State-Trait Anxiety Inventory (STAI), and Hospital Anxiety and Depression Scales (HADS). Of the 30 studies included, 11 used a clinical interview to provide criterion validity. Remaining studies reported one or more other forms of validity (factorial, discriminant, concurrent and predictive) or reliability. The STAI shows criterion, discriminant and predictive validity and may be most useful for research purposes as a specific measure of anxiety. The Kessler 10 (K-10) may be the best short screening measure due to its ability to differentiate anxiety disorders. The Depression Anxiety Stress Scales 21 (DASS-21) measures multiple types of distress, shows appropriate content, and remains to be validated against clinical interview in perinatal populations. Nineteen studies did not report sensitivity or specificity data. The early stages of research into perinatal anxiety, the multitude of measures in use, and methodological differences restrict comparison of measures across studies. There is a need for further validation of self-report measures of anxiety in the perinatal period to enable accurate screening and detection of anxiety symptoms and disorders. Copyright © 2010 Elsevier B.V. All rights reserved.

  18. Portuguese Adaptation and Input for the Validation of the Views on Inpatient Care (VOICE) Outcome Measure to Assess Service Users'Perceptions of Inpatient Psychiatric Care.

    Science.gov (United States)

    Palha, João; Palha, Filipa; Dias, Pedro; Gonçalves-Pereira, Manuel

    2017-11-29

    Patient satisfaction is an important measure of health care quality. Patients' views have seldom been considered in the construction of measures addressing satisfaction with inpatient facilities in psychiatry. The Views on Inpatient Care - VOICE - is a first service-user generated outcome measure relying solely on their perceptions of acute care, representing a valuable indicator of service users' perceived quality of care. The present study aimed to contribute to the validation of the Portuguese version of VOICE. The questionnaire was translated into Portuguese and applied to a sample of eighty-five female inpatients of a psychiatric institution. Data analysis focused on assessing reliability and exploring the impact of demographic and clinical variables on participants' satisfaction. Internal consistency of the questionnaire was high (α = 0.87). Participants' age and marital status were associated with differences in scores, with older patients and patients who were married or involved in a close relationship presenting higher satisfaction levels. The questionnaire demonstrated good internal consistency and acceptability, as well as construct validity. Further studies should expand the analysis of the psychometric properties of this measure e.g., test-retest reliability. The Portuguese version of VOICE is a promising tool to assess service users' perceptions of inpatient psychiatric care in Portugal.

  19. Validity of linear encoder measurement of sit-to-stand performance power in older people.

    Science.gov (United States)

    Lindemann, U; Farahmand, P; Klenk, J; Blatzonis, K; Becker, C

    2015-09-01

    To investigate construct validity of linear encoder measurement of sit-to-stand performance power in older people by showing associations with relevant functional performance and physiological parameters. Cross-sectional study. Movement laboratory of a geriatric rehabilitation clinic. Eighty-eight community-dwelling, cognitively unimpaired older women (mean age 78 years). Sit-to-stand performance power and leg power were assessed using a linear encoder and the Nottingham Power Rig, respectively. Gait speed was measured on an instrumented walkway. Maximum quadriceps and hand grip strength were assessed using dynamometers. Mid-thigh muscle cross-sectional area of both legs was measured using magnetic resonance imaging. Associations of sit-to-stand performance power with power assessed by the Nottingham Power Rig, maximum gait speed and muscle cross-sectional area were r=0.646, r=0.536 and r=0.514, respectively. A linear regression model explained 50% of the variance in sit-to-stand performance power including muscle cross-sectional area (p=0.001), maximum gait speed (p=0.002), and power assessed by the Nottingham Power Rig (p=0.006). Construct validity of linear encoder measurement of sit-to-stand power was shown at functional level and morphological level for older women. This measure could be used in routine clinical practice as well as in large-scale studies. DRKS00003622. Copyright © 2015 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.

  20. Validating the JobFit system functional assessment method

    Energy Technology Data Exchange (ETDEWEB)

    Jenny Legge; Robin Burgess-Limerick

    2007-05-15

    Workplace injuries are costing the Australian coal mining industry and its communities $410 Million a year. This ACARP study aims to meet those demands by developing a safe, reliable and valid pre-employment functional assessment tool. All JobFit System Pre-Employment Functional Assessments (PEFAs) consist of a musculoskeletal screen, balance test, aerobic fitness test and job-specific postural tolerances and material handling tasks. The results of each component are compared to the applicant's job demands and an overall PEFA score between 1 and 4 is given with 1 being the better score. The reliability study and validity study were conducted concurrently. The reliability study examined test-retest, intra-tester and inter-tester reliability of the JobFit System Functional Assessment Method. Overall, good to excellent reliability was found, which was sufficient to be used for comparison with injury data for determining the validity of the assessment. The overall assessment score and material handling tasks had the greatest reliability. The validity study compared the assessment results of 336 records from a Queensland underground and open cut coal mine with their injury records. A predictive relationship was found between PEFA score and the risk of a back/trunk/shoulder injury from manual handling. An association was also found between PEFA score of 1 and increased length of employment. Lower aerobic fitness test results had an inverse relationship with injury rates. The study found that underground workers, regardless of PEFA score, were more likely to have an injury when compared to other departments. No relationship was found between age and risk of injury. These results confirm the validity of the JobFit System Functional Assessment method.

  1. Validity and reliability of portfolio assessment of competency in a baccalaureate dental hygiene program

    Science.gov (United States)

    Gadbury-Amyot, Cynthia C.

    This study examined validity and reliability of portfolio assessment using Messick's (1996, 1995) unified framework of construct validity. Theoretical and empirical evidence was sought for six aspects of construct validity. The sample included twenty student portfolios. Each portfolio were evaluated by seven faculty raters using a primary trait analysis scoring rubric. There was a significant relationship (r = .81--.95; p Dental Hygiene Board Examination (r = .60; p Dental Testing Service examination was both weak and nonsignificant (r = .19; p > .05). An open-ended survey was used to elicit student feedback on portfolio development. A majority of the students (76%) perceived value in the development of programmatic portfolios. In conclusion, the pattern of findings from this study suggest that portfolios can serve as a valid and reliable measure for assessing student competency.

  2. Validity of the Nintendo Wii® balance board for the assessment of standing balance in Parkinson's disease.

    Science.gov (United States)

    Holmes, Jeffrey D; Jenkins, Mary E; Johnson, Andrew M; Hunt, Michael A; Clark, Ross A

    2013-04-01

    Impaired postural stability places individuals with Parkinson's at an increased risk for falls. Given the high incidence of fall-related injuries within this population, ongoing assessment of postural stability is important. To evaluate the validity of the Nintendo Wii(®) balance board as a measurement tool for the assessment of postural stability in individuals with Parkinson's. Twenty individuals with Parkinson's participated. Subjects completed testing on two balance tasks with eyes open and closed on a Wii(®) balance board and biomechanical force platform. Bland-Altman plots and a two-way, random-effects, single measure intraclass correlation coefficient model were used to assess concurrent validity of centre-of-pressure data. Concurrent validity was demonstrated to be excellent across balance tasks (intraclass correlation coefficients = 0.96, 0.98, 0.92, 0.94). This study suggests that the Wii(®) balance board is a valid tool for the quantification of postural stability among individuals with Parkinson's.

  3. Validation of the Dyadic Coping Inventory with Chinese couples: Factorial structure, measurement invariance, and construct validity.

    Science.gov (United States)

    Xu, Feng; Hilpert, Peter; Randall, Ashley K; Li, Qiuping; Bodenmann, Guy

    2016-08-01

    The Dyadic Coping Inventory (DCI, Bodenmann, 2008) assesses how couples support each other when facing individual (e.g., workload) and common (e.g., parenting) stressors. Specifically, the DCI measures partners' perceptions of their own (Self) and their partners' behaviors (Partner) when facing individual stressors, and partners' common coping behaviors when facing common stressors (Common). To date, the DCI has been validated in 6 different languages from individualistic Western cultures; however, because culture can affect interpersonal interactions, it is unknown whether the DCI is a reliable measure of coping behaviors for couples living in collectivistic Eastern cultures. Based on data from 474 Chinese couples (N = 948 individuals), the current study examined the Chinese version of the DCI's factorial structure, measurement invariance (MI), and construct validity of test scores. Using 3 cultural groups (China, Switzerland, and the United States [U.S.]), confirmatory factor analysis revealed a 5-factor structure regarding Self and Partner and a 2-factor structure regarding Common dyadic coping (DC). Results from analyses of MI indicated that the DCI subscales met the criteria for configural, metric, and full/partial scalar invariance across cultures (Chinese-Swiss and Chinese-U.S.) and genders (Chinese men and women). Results further revealed good construct validity of the DCI test scores. In all, the Chinese version of the DCI can be used for measuring Chinese couples' coping behaviors, and is available for cross-cultural studies examining DC behaviors between Western and Eastern cultures. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  4. The Brief Early Childhood Screening Assessment: Preliminary Validity in Pediatric Primary Care.

    Science.gov (United States)

    Fallucco, Elise M; Wysocki, Tim; James, Lauren; Kozikowski, Chelsea; Williams, Andre; Gleason, Mary M

    Brief, well-validated instruments are needed to facilitate screening for early childhood behavioral and emotional problems (BEPs). The objectives of this study were to empirically reduce the length of the Early Childhood Screening Assessment (ECSA) and to assess the validity and reliability of this shorter tool. Using caregiver ECSA responses for 2467 children aged 36 to 60 months seen in primary care, individual ECSA items were ranked on a scale ranging from "absolutely retain" to "absolutely delete." Items were deleted sequentially beginning with "absolutely delete" and going up the item prioritization list, resulting in 35 shorter versions of the ECSA. A separate primary care sample (n = 69) of mothers of children aged 18 to 60 months was used to determine the sensitivity and specificity of each shorter ECSA version using psychiatric diagnosis on the Diagnostic Infant and Preschool Assessment as the gold standard. The version with the optimal balance of sensitivity, specificity, and length was selected as the Brief ECSA. Associations between Brief ECSA scores and other pertinent measures were evaluated to estimate reliability and validity. A 22-item measure reflected the best combination of brevity, sensitivity and specificity. A cutoff score of 9 or higher on the 22-item Brief ECSA demonstrated acceptable sensitivity (89%) and specificity (85%) for predicting a psychiatric diagnosis. Brief ECSA scores correlated significantly and in expected directions with scores on pertinent measures and with demographic variables. The results indicate that the Brief ECSA has sound psychometric properties for identifying young children with BEPs in primary care.

  5. A Prospective Validation Study of a Rainbow Model of Integrated Care Measurement Tool in Singapore.

    Science.gov (United States)

    Nurjono, Milawaty; Valentijn, Pim P; Bautista, Mary Ann C; Wei, Lim Yee; Vrijhoef, Hubertus Johannes Maria

    2016-04-08

    The conceptual ambiguity of the integrated care concept precludes a full understanding of what constitutes a well-integrated health system, posing a significant challenge in measuring the level of integrated care. Most available measures have been developed from a disease-specific perspective and only measure certain aspects of integrated care. Based on the Rainbow Model of Integrated Care, which provides a detailed description of the complex concept of integrated care, a measurement tool has been developed to assess integrated care within a care system as a whole gathered from healthcare providers' and managerial perspectives. This paper describes the methodology of a study seeking to validate the Rainbow Model of Integrated Care measurement tool within and across the Singapore Regional Health System. The Singapore Regional Health System is a recent national strategy developed to provide a better-integrated health system to deliver seamless and person-focused care to patients through a network of providers within a specified geographical region. The validation process includes the assessment of the content of the measure and its psychometric properties. If the measure is deemed to be valid, the study will provide the first opportunity to measure integrated care within Singapore Regional Health System with the results allowing insights in making recommendations for improving the Regional Health System and supporting international comparison.

  6. Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

    Directory of Open Access Journals (Sweden)

    Formiga MF

    2018-05-01

    Full Text Available Magno F Formiga,1,2 Kathryn E Roach,1 Isabel Vital,3 Gisel Urdaneta,3 Kira Balestrini,3 Rafael A Calderon-Candelario,3,4 Michael A Campos,3,4,* Lawrence P Cahalin1,* 1Department of Physical Therapy, University of Miami Miller School of Medicine, Coral Gables, FL, USA; 2CAPES Foundation, Ministry of Education of Brazil, Brasilia, Brazil; 3Pulmonary Section, Miami Veterans Administration Medical Center, Miami, FL, USA; 4Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, University of Miami Miller School of Medicine, Miami, FL, USA *These authors contributed equally to this work Purpose: The Test of Incremental Respiratory Endurance (TIRE provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP over time. The integration of MIP over inspiratory duration (ID provides the sustained maximal inspiratory pressure (SMIP. Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods: Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results: All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP

  7. An Instrument to Measure Maturity of Integrated Care: A First Validation Study

    Directory of Open Access Journals (Sweden)

    Liset Grooten

    2018-01-01

    Full Text Available Introduction: Lessons captured from interviews with 12 European regions are represented in a new instrument, the B3-Maturity Model (B3-MM. B3-MM aims to assess maturity along 12 dimensions reflecting the various aspects that need to be managed in order to deliver integrated care. The objective of the study was to test the content validity of B3-MM as part of SCIROCCO (Scaling Integrated Care into Context, a European Union funded project. Methods: A literature review was conducted to compare B3-MM’s 12 dimensions and their measurement scales with existing measures and instruments that focus on assessing the development of integrated care. Subsequently, a three-round survey conducted through a Delphi study with international experts in the field of integrated care was performed to test the relevance of: 1 the dimensions, 2 the maturity indicators and 3 the assessment scale used in B3-MM. Results: The 11 articles included in the literature review confirmed all the dimensions described in the original version of B3-MM. The Delphi study rounds resulted in various phrasing amendments of indicators and assessment scale. Full agreement among the experts on the relevance of the 12 B3-MM dimensions, their indicators, and assessment scale was reached after the third Delphi round. Conclusion and discussion: The B3-MM dimensions, maturity indicators and assessment scale showed satisfactory content validity. While the B3-MM is a unique instrument based on existing knowledge and experiences of regions in integrated care, further testing is needed to explore other measurement properties of B3-MM.

  8. An Instrument to Measure Maturity of Integrated Care: A First Validation Study

    Science.gov (United States)

    2018-01-01

    Introduction: Lessons captured from interviews with 12 European regions are represented in a new instrument, the B3-Maturity Model (B3-MM). B3-MM aims to assess maturity along 12 dimensions reflecting the various aspects that need to be managed in order to deliver integrated care. The objective of the study was to test the content validity of B3-MM as part of SCIROCCO (Scaling Integrated Care into Context), a European Union funded project. Methods: A literature review was conducted to compare B3-MM’s 12 dimensions and their measurement scales with existing measures and instruments that focus on assessing the development of integrated care. Subsequently, a three-round survey conducted through a Delphi study with international experts in the field of integrated care was performed to test the relevance of: 1) the dimensions, 2) the maturity indicators and 3) the assessment scale used in B3-MM. Results: The 11 articles included in the literature review confirmed all the dimensions described in the original version of B3-MM. The Delphi study rounds resulted in various phrasing amendments of indicators and assessment scale. Full agreement among the experts on the relevance of the 12 B3-MM dimensions, their indicators, and assessment scale was reached after the third Delphi round. Conclusion and discussion: The B3-MM dimensions, maturity indicators and assessment scale showed satisfactory content validity. While the B3-MM is a unique instrument based on existing knowledge and experiences of regions in integrated care, further testing is needed to explore other measurement properties of B3-MM. PMID:29588644

  9. Reliability and Validity of the Footprint Assessment Method Using Photoshop CS5 Software.

    Science.gov (United States)

    Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam

    2015-05-01

    Several sophisticated methods of footprint analysis currently exist. However, it is sometimes useful to apply standard measurement methods of recognized evidence with an easy and quick application. We sought to assess the reliability and validity of a new method of footprint assessment in a healthy population using Photoshop CS5 software (Adobe Systems Inc, San Jose, California). Forty-two footprints, corresponding to 21 healthy individuals (11 men with a mean ± SD age of 20.45 ± 2.16 years and 10 women with a mean ± SD age of 20.00 ± 1.70 years) were analyzed. Footprints were recorded in static bipedal standing position using optical podography and digital photography. Three trials for each participant were performed. The Hernández-Corvo, Chippaux-Smirak, and Staheli indices and the Clarke angle were calculated by manual method and by computerized method using Photoshop CS5 software. Test-retest was used to determine reliability. Validity was obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed high values (ICC, 0.98-0.99). Moreover, the validity test clearly showed no difference between techniques (ICC, 0.99-1). The reliability and validity of a method to measure, assess, and record the podometric indices using Photoshop CS5 software has been demonstrated. This provides a quick and accurate tool useful for the digital recording of morphostatic foot study parameters and their control.

  10. Neuro-QoL health-related quality of life measurement system: Validation in Parkinson's disease.

    Science.gov (United States)

    Nowinski, Cindy J; Siderowf, Andrew; Simuni, Tanya; Wortman, Catherine; Moy, Claudia; Cella, David

    2016-05-01

    Neuro-QoL is a multidimensional patient-reported outcome measurement system assessing aspects of physical, mental, and social health identified by neurology patients and caregivers as important. One of the first neurology-specific patient-reported outcome measure systems created using modern test development methods, Neuro-Qol enables brief, yet precise, assessment and the ability to conduct both PD-specific and cross-disease comparisons. We present results of Neuro-QoL clinical validation using a sample of PD patients. A total of 120 PD patients recruited from academic medical centers were assessed at baseline, 1 week, and 6 months. Assessments included Neuro-QoL and general and PD-specific validity measures. Participants were 62% male and 95% white (average age = 66); H & Y stages were 1 (16%), 2 (61%), 3 (18%), and 4 (5%). Internal consistency and test-retest reliability of Neuro-QoL ranged from Cronbach's alphas = 0.81 to 0.94 with intraclass correlation coefficients = 0.66 to 0.80. Pearson's correlations between Neuro-QoL and legacy measures were generally moderate and in expected directions. UPDRS Part 2 was moderately correlated with Neuro-QoL Upper Extremity and Mobility, respectively (r's = -0.44; -0.59). Parkinson's Disease Questionnaire-39 and Neuro-QoL measures of similar constructs showed strong-to-moderate correlations (r's = 0.70-0.44). Neuro-QoL measures of fatigue, mobility, positive emotion, and emotional/behavioral control showed responsiveness to self-reported change. Neuro-QoL is valid for use in PD clinical research. Reliability for all but two measures is sufficient for group comparisons, with some evidence supporting responsiveness to change. Neuro-QoL possesses characteristics, such as brevity, flexibility in administration, and suitability, for cross-disease comparisons that may be advantageous to users in a variety of settings. © 2016 Movement Disorder Society. © 2016 International Parkinson and Movement Disorder

  11. Reliability and Validity Assessment of a Linear Position Transducer

    Directory of Open Access Journals (Sweden)

    Manuel V. Garnacho-Castaño

    2015-03-01

    Full Text Available The objectives of the study were to determine the validity and reliability of peak velocity (PV, average velocity (AV, peak power (PP and average power (AP measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain during two resistance exercises, bench press (BP and full back squat (BS, performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2. Intraclass correlation coefficients (ICCs indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W. Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W. Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP make this device a useful tool for monitoring resistance training.

  12. Development of caries risk assessment tool for Iranian preschoolers: A primary validation study

    Directory of Open Access Journals (Sweden)

    Shiva Mortazavi

    2017-01-01

    Full Text Available Background: The aim of the present study was to develop a dental caries risk assessment tool for Iranian preschoolers. Methods: In a validation and cross-sectional study, a random sample of 150 preschool children was involved. This study was conducted in three phases: questionnaire design (expert panel and peer evaluation, questionnaire testing (pilot evaluation and field testing, and validation study. The initial assessments include interview, dental examination, and laboratory investigations. Validity and reliability indices, content validity index (CVI, content validity ratio (CVR, impact score, and test-retest and Cronbach's alpha were measured. Decayed, missing, filled teeth (dmft scores were calculated according to the WHO guidelines. Results: The Iranian version of caries risk assessment (CRA questionnaire contained 17 items. Cronbach's alpha coefficient (0.86 indicated a suitable internal consistency. The mean scores for the CVI and the CVR were 0.87 and 0.78, respectively. The prevalence rate of dental caries in the study group was 69.3%, and the mean dmft was 4.57 (range 0–19. Conclusions: The Persian version of CRA questionnaire was adapted to the Iranian population. The findings demonstrated overall acceptable validity and also reliability in the application of test-retest. The results of the present study provide initial evidence that the designed CRA form could be a useful tool for CRA in the Iranian preschoolers.

  13. Validation of a new strength measurement device for amyotrophic lateral sclerosis clinical trials.

    Science.gov (United States)

    Andres, Patricia L; Skerry, Linda M; Munsat, Theodore L; Thornell, Brenda J; Szymonifka, Jackie; Schoenfeld, David A; Cudkowicz, Merit E

    2012-01-01

    Strength measures with reduced variability and higher sensitivity could improve efficiency in clinical trials of amyotrophic lateral sclerosis (ALS). The Accurate Test of Limb Isometric Strength (ATLIS) was developed to precisely and conveniently measure force in 12 muscle groups. In this study we evaluate the reliability and validity of the ATLIS testing protocol. Twenty healthy adults and 10 patients with ALS were tested twice by the same or by different evaluators to determine test-retest and interrater reliability. Twenty healthy adults were examined using ATLIS and a well-validated strength testing protocol (TQNE) to assess criterion-based validity. Mean absolute variation between tests was 8.6%, and intraclass correlation coefficients for each muscle group were high (range 0.82-0.99). The Pearson correlation coefficient of mean ATLIS and TQNE scores was 0.90. A subject survey demonstrated high user acceptance of ATLIS. ATLIS is convenient for patients and evaluators, produces precise strength measurements, and is easily moved between examining rooms. Copyright © 2011 Wiley Periodicals, Inc.

  14. The Child Behaviour Assessment Instrument: development and validation of a measure to screen for externalising child behavioural problems in community setting

    Directory of Open Access Journals (Sweden)

    Perera Hemamali

    2010-06-01

    Full Text Available Abstract Background In Sri Lanka, behavioural problems have grown to epidemic proportions accounting second highest category of mental health problems among children. Early identification of behavioural problems in children is an important pre-requisite of the implementation of interventions to prevent long term psychiatric outcomes. The objectives of the study were to develop and validate a screening instrument for use in the community setting to identify behavioural problems in children aged 4-6 years. Methods An initial 54 item questionnaire was developed following an extensive review of the literature. A three round Delphi process involving a panel of experts from six relevant fields was then undertaken to refine the nature and number of items and created the 15 item community screening instrument, Child Behaviour Assessment Instrument (CBAI. Validation study was conducted in the Medical Officer of Health area Kaduwela, Sri Lanka and a community sample of 332 children aged 4-6 years were recruited by two stage randomization process. The behaviour status of the participants was assessed by an interviewer using the CBAI and a clinical psychologist following clinical assessment concurrently. Criterion validity was appraised by assessing the sensitivity, specificity and predictive values at the optimum screen cut off value. Construct validity of the instrument was quantified by testing whether the data of validation study fits to a hypothetical model. Face and content validity of the CBAI were qualitatively assessed by a panel of experts. The reliability of the instrument was assessed by internal consistency analysis and test-retest methods in a 15% subset of the community sample. Results Using the Receiver Operating Characteristic analysis the CBAI score of >16 was identified as the cut off point that optimally differentiated children having behavioural problems, with a sensitivity of 0.88 (95% CI = 0.80-0.96 and specificity of 0.81 (95% CI = 0

  15. Validation of a rubric to assess innovation competence

    Directory of Open Access Journals (Sweden)

    Frances Watts

    2012-06-01

    Full Text Available This paper addresses the development and validation of rubrics, materials and situations for the assessment of innovation competence. Research was carried out to verify the viability of the first draft of the assessment criteria, which led to refinement of the criteria and proposals to enhance the ensuing validation process that will include students and raters of different language backgrounds.

  16. Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

    Science.gov (United States)

    Sanders, James L; Williams, Robert J

    2016-01-01

    Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.

  17. Reliability and validity of the transport and physical activity questionnaire (TPAQ) for assessing physical activity behaviour.

    Science.gov (United States)

    Adams, Emma J; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C; Cooper, Ashley R; Ogilvie, David

    2014-01-01

    No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, ptravel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires.

  18. Validating Measures of Mathematical Knowledge for Teaching

    Science.gov (United States)

    Kane, Michael

    2007-01-01

    According to Schilling, Blunk, and Hill, the set of papers presented in this journal issue had two main purposes: (1) to use an argument-based approach to evaluate the validity of the tests of mathematical knowledge for teaching (MKT), and (2) to critically assess the author's version of an argument-based approach to validation (Kane, 2001, 2004).…

  19. Assessing middle school students` understanding of science relationships and processes: Year 2 - instrument validation. Final report

    Energy Technology Data Exchange (ETDEWEB)

    Schau, C.; Mattern, N.; Weber, R.; Minnick, K.

    1997-01-01

    Our overall purpose for this multi-year project was to develop an alternative assessment format measuring rural middle school students understanding of science concepts and processes and the interrelationships among them. This kind of understanding is called structural knowledge. We had 3 major interrelated goals: (1) Synthesize the existing literature and critically evaluate the actual and potential use of measures of structural knowledge in science education. (2) Develop a structural knowledge alternative assessment format. (3) Examine the validity of our structural knowledge format. We accomplished the first two goals during year 1. The structural knowledge assessment we identified and developed further was a select-and-fill-in concept map format. The goal for our year 2 work was to begin to validate this assessment approach. This final report summarizes our year 2 work.

  20. Development and validation of the Bullying and Cyberbullying Scale for Adolescents: A multi-dimensional measurement model.

    Science.gov (United States)

    Thomas, Hannah J; Scott, James G; Coates, Jason M; Connor, Jason P

    2018-05-03

    Intervention on adolescent bullying is reliant on valid and reliable measurement of victimization and perpetration experiences across different behavioural expressions. This study developed and validated a survey tool that integrates measurement of both traditional and cyber bullying to test a theoretically driven multi-dimensional model. Adolescents from 10 mainstream secondary schools completed a baseline and follow-up survey (N = 1,217; M age  = 14 years; 66.2% male). The Bullying and cyberbullying Scale for Adolescents (BCS-A) developed for this study comprised parallel victimization and perpetration subscales, each with 20 items. Additional measures of bullying (Olweus Global Bullying and the Forms of Bullying Scale [FBS]), as well as measures of internalizing and externalizing problems, school connectedness, social support, and personality, were used to further assess validity. Factor structure was determined, and then, the suitability of items was assessed according to the following criteria: (1) factor interpretability, (2) item correlations, (3) model parsimony, and (4) measurement equivalence across victimization and perpetration experiences. The final models comprised four factors: physical, verbal, relational, and cyber. The final scale was revised to two 13-item subscales. The BCS-A demonstrated acceptable concurrent and convergent validity (internalizing and externalizing problems, school connectedness, social support, and personality), as well as predictive validity over 6 months. The BCS-A has sound psychometric properties. This tool establishes measurement equivalence across types of involvement and behavioural forms common among adolescents. An improved measurement method could add greater rigour to the evaluation of intervention programmes and also enable interventions to be tailored to subscale profiles. © 2018 The British Psychological Society.

  1. Reliability and Validity Study of a Tool to Measure Cancer Stigma: Patient Version.

    Science.gov (United States)

    Yılmaz, Medine; Dişsiz, Gülçin; Demir, Filiz; Irız, Sibel; Alacacioglu, Ahmet

    2017-01-01

    The aim of this methodological study is to establish the validity and reliability of the Turkish version of "A Questionnaire for Measuring Attitudes toward Cancer (Cancer Stigma) - Patient version." The sample comprised oncology patients who had active cancer treatment. The construct validity was assessed using the confirmatory and exploratory factor analysis. The mean age of the participants was 54.9±12.3 years. In the confirmatory factor analysis, fit values were determined as comparative fit index = 0.93, goodness of fit index = 0.91, normed-fit index=0.91, and root mean square error of approximation RMSEA = 0.09 ( P Kaiser-Meyer-Olkin = 0.88, χ 2 = 1084.41, Df = 66, and Barletta's test P <0.000). The first factor was "impossibility of recovery and experience of social discrimination" and the second factor was "stereotypes of cancer patients." The two-factor structure accounted for 56.74% of the variance. The Cronbach's alpha value was determined as 0.88 for the two-factor scale. "A questionnaire for measuring attitudes toward cancer (cancer stigma) - Patient version" is a reliable and valid questionnaire to assess stigmatization of cancer in cancer patients.

  2. Validity and repeatability of inertial measurement units for measuring gait parameters.

    Science.gov (United States)

    Washabaugh, Edward P; Kalyanaraman, Tarun; Adamczyk, Peter G; Claflin, Edward S; Krishnan, Chandramouli

    2017-06-01

    Inertial measurement units (IMUs) are small wearable sensors that have tremendous potential to be applied to clinical gait analysis. They allow objective evaluation of gait and movement disorders outside the clinic and research laboratory, and permit evaluation on large numbers of steps. However, repeatability and validity data of these systems are sparse for gait metrics. The purpose of this study was to determine the validity and between-day repeatability of spatiotemporal metrics (gait speed, stance percent, swing percent, gait cycle time, stride length, cadence, and step duration) as measured with the APDM Opal IMUs and Mobility Lab system. We collected data on 39 healthy subjects. Subjects were tested over two days while walking on a standard treadmill, split-belt treadmill, or overground, with IMUs placed in two locations: both feet and both ankles. The spatiotemporal measurements taken with the IMU system were validated against data from an instrumented treadmill, or using standard clinical procedures. Repeatability and minimally detectable change (MDC) of the system was calculated between days. IMUs displayed high to moderate validity when measuring most of the gait metrics tested. Additionally, these measurements appear to be repeatable when used on the treadmill and overground. The foot configuration of the IMUs appeared to better measure gait parameters; however, both the foot and ankle configurations demonstrated good repeatability. In conclusion, the IMU system in this study appears to be both accurate and repeatable for measuring spatiotemporal gait parameters in healthy young adults. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. A Community Based Study to Test the Reliability and Validity of Physical Activity Measurement Techniques

    Directory of Open Access Journals (Sweden)

    Puneet Misra

    2014-01-01

    Full Text Available Introduction: Physical activity (PA is protective against non-communicable diseases and it can reduce premature mortality. However, it is difficult to assess the frequency, duration, type and intensity of PA. The global physical activity questionnaire (GPAQ has been developed by World Health Organization with the aim of having valid and reliable estimates of PA. The primary aim of this study is to assess the repeatability of the GPAQ instrument and the secondary aim is to validate it against International Physical Activity Questionnaire (IPAQ and against an objective measure of PA (i.e., using pedometers in both rural and peri-urban areas of North India. Methods: A total of 262 subjects were recruited by random selection from Ballabgarh Block of Haryana State in India. For test retest repeatability of GPAQ and IPAQ, the instruments were administered on two occasions separated by at least 3 days. For concurrent validity, both questionnaires were administered in random order and for criterion validity step counters were used. Spearman′s correlation coefficient, intra-class correlation (ICC and Cohen′s kappa was used in the analysis. Results: For GPAQ validity, the spearman′s Rho ranged from 0.40 to 0.59 and ICC ranged from 0.43 to 0.81 while for IPAQ validity, spearman correlation coefficient ranged from 0.42 to 0.43 and ICC ranged from 0.56 to 0.68. The observed concurrent validity coefficients suggested that both the questionnaires had reasonable agreement (Spearman Rho of >0.90; P < 0.0001; ICC: 0.76-0.91, P < 0.05. Conclusions: GPAQ is similar to IPAQ in measuring PA and can be used for measurement of PA in community settings.

  4. Examining the Validity of Self-Reports on Scales Measuring Students' Strategic Processing

    Science.gov (United States)

    Samuelstuen, Marit S.; Braten, Ivar

    2007-01-01

    Background: Self-report inventories trying to measure strategic processing at a global level have been much used in both basic and applied research. However, the validity of global strategy scores is open to question because such inventories assess strategy perceptions outside the context of specific task performance. Aims: The primary aim was to…

  5. Validation of Karolinska Exhaustion Scale: psychometric properties of a measure of exhaustion syndrome.

    Science.gov (United States)

    Saboonchi, Fredrik; Perski, Aleksander; Grossi, Giorgio

    2013-12-01

    The syndrome of exhaustion is currently a medical diagnosis in Sweden. The description of the syndrome largely corresponds to the suggested core component of burnout, that is exhaustion. Karolinska Exhaustion Scale (KES) has been constructed to provide specific assessment of exhaustion in clinical and research settings. The purpose of the present study was to examine the psychometric properties of this scale in its original and revised versions by examining the factorial structure and measures of convergent and discriminant validity. Data gathered from two independent samples (n1 = 358 & n2 = 403) consisting of patients diagnosed with 'reaction to severe stress, and adjustment disorder' were subjected to confirmatory factor analysis. The study's instruments were Karolinska Exhaustion Scale and Shirom Melam Burnout Measure. Correlation analyses were employed to follow up the established factorial structure of the scale. The study was ethically approved by Karolinska Institute regional ethic committee. The findings demonstrated adequate fit of the data to the measurement model provided by the revised version of KES Limitations: The main limitation of the present study is the lack of a gold standard of exhaustion for direct comparison with KES. (KES-26) and partially supported convergent validity and discriminant validity of the scale. The demonstrated psychometric properties of KES-26 indicate sound construct validity for this scale encouraging use of this scale in assessment of exhaustion. The factorial structure of KES-26 may also be used to provide information concerning possible different clinical profiles. © 2012 The Authors Scandinavian Journal of Caring Sciences © 2012 Nordic College of Caring Science.

  6. Validation of a Tablet Application for Assessing Dietary Intakes Compared with the Measured Food Intake/Food Waste Method in Military Personnel Consuming Field Rations

    Directory of Open Access Journals (Sweden)

    Mavra Ahmed

    2017-02-01

    Full Text Available The collection of accurate dietary intakes using traditional dietary assessment methods (e.g., food records from military personnel is challenging due to the demanding physiological and psychological conditions of training or operations. In addition, these methods are burdensome, time consuming, and prone to measurement errors. Adopting smart-phone/tablet technology could overcome some of these barriers. The objective was to assess the validity of a tablet app, modified to contain detailed nutritional composition data, in comparison to a measured food intake/waste method. A sample of Canadian Armed Forces personnel, randomized to either a tablet app (n = 9 or a weighed food record (wFR (n = 9, recorded the consumption of standard military rations for a total of 8 days. Compared to the gold standard measured food intake/waste method, the difference in mean energy intake was small (−73 kcal/day for tablet app and −108 kcal/day for wFR (p > 0.05. Repeated Measures Bland-Altman plots indicated good agreement for both methods (tablet app and wFR with the measured food intake/waste method. These findings demonstrate that the tablet app, with added nutritional composition data, is comparable to the traditional dietary assessment method (wFR and performs satisfactorily in relation to the measured food intake/waste method to assess energy, macronutrient, and selected micronutrient intakes in a sample of military personnel.

  7. Corporate Entrepreneurship Assessment Instrument (CEAI): Refinement and Validation of a Survey Measure

    National Research Council Canada - National Science Library

    Cates, Michael S

    2007-01-01

    .... The measurement instrument known as the Corporate Entrepreneurship Assessment Index (CEAI) has been designed to tap the climate-related organizational factors that represent and potentially encourage corporate entrepreneurship...

  8. Reliability and validity of advanced theory-of-mind measures in middle childhood and adolescence.

    Science.gov (United States)

    Hayward, Elizabeth O; Homer, Bruce D

    2017-09-01

    Although theory-of-mind (ToM) development is well documented for early childhood, there is increasing research investigating changes in ToM reasoning in middle childhood and adolescence. However, the psychometric properties of most advanced ToM measures for use with older children and adolescents have not been firmly established. We report on the reliability and validity of widely used, conventional measures of advanced ToM with this age group. Notable issues with both reliability and validity of several of the measures were evident in the findings. With regard to construct validity, results do not reveal a clear empirical commonality between tasks, and, after accounting for comprehension, developmental trends were evident in only one of the tasks investigated. Statement of contribution What is already known on this subject? Second-order false belief tasks have acceptable internal consistency. The Eyes Test has poor internal consistency. Validity of advanced theory-of-mind tasks is often based on the ability to distinguish clinical from typical groups. What does this study add? This study examines internal consistency across six widely used advanced theory-of-mind tasks. It investigates validity of tasks based on comprehension of items by typically developing individuals. It further assesses construct validity, or commonality between tasks. © 2017 The British Psychological Society.

  9. The FLIR ONE thermal imager for the assessment of burn wounds: Reliability and validity study.

    Science.gov (United States)

    Jaspers, M E H; Carrière, M E; Meij-de Vries, A; Klaessens, J H G M; van Zuijlen, P P M

    2017-11-01

    Objective measurement tools may be of great value to provide early and reliable burn wound assessment. Thermal imaging is an easy, accessible and objective technique, which measures skin temperature as an indicator of tissue perfusion. These thermal images might be helpful in the assessment of burn wounds. However, before implementation of a novel measurement tool into clinical practice is considered, it is appropriate to test its clinimetric properties (i.e. reliability and validity). The objective of this study was to assess the reliability and validity of the recently introduced FLIR ONE thermal imager. Two observers obtained thermal images of burn wounds in adult patients at day 1-3, 4-7 and 8-10 after burn. Subsequently, temperature differences between the burn wound and healthy skin (ΔT) were calculated on an iPad mini containing the FLIR Tools app. To assess reliability, ΔT values of both observers were compared by calculating the intraclass correlation coefficient (ICC) and measurement error parameters. To assess validity, the ΔT values of the first observer were compared to the registered healing time of the burn wounds, which was specified into three categories: (I) ≤14 days, (II) 15-21 days and (III) >21 days. The ability of the FLIR ONE to discriminate between healing ≤21 days and >21 days was evaluated by means of a receiver operating characteristic curve and an optimal ΔT cut-off value. Reliability: ICCs were 0.99 for each time point, indicating excellent reliability up to 10 days after burn. The standard error of measurement varied between 0.17-0.22°C. the area under the curve was calculated at 0.69 (95% CI 0.54-0.84). A cut-off value of -1.15°C shows a moderate discrimination between burn wound healing ≤21 days and >21 days (46% sensitivity; 82% specificity). Our results show that the FLIR ONE thermal imager is highly reliable, but the moderate validity calls for additional research. However, the FLIR ONE is pre-eminently feasible

  10. Concurrent validation of an inertial measurement system to quantify kicking biomechanics in four football codes.

    Science.gov (United States)

    Blair, Stephanie; Duthie, Grant; Robertson, Sam; Hopkins, William; Ball, Kevin

    2018-05-17

    Wearable inertial measurement systems (IMS) allow for three-dimensional analysis of human movements in a sport-specific setting. This study examined the concurrent validity of a IMS (Xsens MVN system) for measuring lower extremity and pelvis kinematics in comparison to a Vicon motion analysis system (MAS) during kicking. Thirty footballers from Australian football (n = 10), soccer (n = 10), rugby league and rugby union (n = 10) clubs completed 20 kicks across four conditions. Concurrent validity was assessed using a linear mixed-modelling approach, which allowed the partition of between and within-subject variance from the device measurement error. Results were expressed in raw and standardised units for assessments of differences in means and measurement error, and interpreted via non-clinical magnitude-based inferences. Trivial to small differences were found in linear velocities (foot and pelvis), angular velocities (knee, shank and thigh), sagittal joint (knee and hip) and segment angle (shank and pelvis) means (mean difference: 0.2-5.8%) between the IMS and MAS in Australian football, soccer and the rugby codes. Trivial to small measurement errors (from 0.1 to 5.8%) were found between the IMS and MAS in all kinematic parameters. The IMS demonstrated acceptable levels of concurrent validity compared to a MAS when measuring kicking biomechanics across the four football codes. Wearable IMS offers various benefits over MAS, such as, out-of-laboratory testing, larger measurement range and quick data output, to help improve the ecological validity of biomechanical testing and the timing of feedback. The results advocate the use of IMS to quantify biomechanics of high-velocity movements in sport-specific settings. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. Lumbar segmental instability: a criterion-related validity study of manual therapy assessment

    Directory of Open Access Journals (Sweden)

    Chapple Cathy

    2005-11-01

    Full Text Available Abstract Background Musculoskeletal physiotherapists routinely assess lumbar segmental motion during the clinical examination of a patient with low back pain. The validity of manual assessment of segmental motion has not, however, been adequately investigated. Methods In this prospective, multi-centre, pragmatic, diagnostic validity study, 138 consecutive patients with recurrent or chronic low back pain (R/CLBP were recruited. Physiotherapists with post-graduate training in manual therapy performed passive accessory intervertebral motion tests (PAIVMs and passive physiological intervertebral motion tests (PPIVMs. Consenting patients were referred for flexion-extension radiographs. Sagittal angular rotation and sagittal translation of each lumbar spinal motion segment was measured from these radiographs, and compared to a reference range derived from a study of 30 asymptomatic volunteers. Motion beyond two standard deviations from the reference mean was considered diagnostic of rotational lumbar segmental instability (LSI and translational LSI. Accuracy and validity of the clinical assessments were expressed using sensitivity, specificity, and likelihood ratio statistics with 95% confidence intervals (CI. Results Only translation LSI was found to be significantly associated with R/CLBP (p Conclusion This study provides the first evidence reporting the concurrent validity of manual tests for the detection of abnormal sagittal planar motion. PAIVMs and PPIVMs are highly specific, but not sensitive, for the detection of translation LSI. Likelihood ratios resulting from positive test results were only moderate. This research indicates that manual clinical examination procedures have moderate validity for detecting segmental motion abnormality.

  12. Assessment of the factorial validity and reliability of the ALSFRS-R: a revision of its measurement model.

    Science.gov (United States)

    Bakker, Leonhard A; Schröder, Carin D; van Es, Michael A; Westers, Paul; Visser-Meily, Johanna M A; van den Berg, Leonard H

    2017-07-01

    The amyotrophic lateral sclerosis functional rating scale-revised (ALSFRS-R) is a widely used primary outcome measure in amyotrophic lateral sclerosis (ALS) clinical practice and clinical trials. ALSFRS-R items cannot, however, validly be summed to obtain a total score, but constitute domain scores reflecting a profile of disease severity. Currently, there are different measurement models for estimating domain scores. The objective of the present study is, therefore, to derive the measurement model that best fits the data for a valid and uniform estimation of ALSFRS-R domain scores. Data from 1556 patients with ALS were obtained from a population-based register in The Netherlands. A random split of the sample provided a calibration and validation set. Measurement models of the ALSFRS-R were investigated using both exploratory factor analyses and confirmatory factor analyses. The measurement model with a four-factor structure (i.e., bulbar, fine motor, gross motor, and respiratory function), with correlated factors and cross-loading items on dressing and hygiene and turning in bed and adjusting bed clothes on both motor function scales, provided the best fit to the data in both sets. Correlation between factors ranged from weak to modest, confirming that the ALSFRS-R constitutes a profile of four clinically relevant domain scores rather than a total score that expresses disease severity. The internal consistency of the four domain scores was satisfactory. Our revision of the measurement model may allow for a more adequate estimation of disease severity and disease progression in epidemiological studies and clinical trials.

  13. Design and validation of a three-instrument toolkit for the assessment of competence in electrocardiogram rhythm recognition.

    Science.gov (United States)

    Hernández-Padilla, José M; Granero-Molina, José; Márquez-Hernández, Verónica V; Suthers, Fiona; López-Entrambasaguas, Olga M; Fernández-Sola, Cayetano

    2017-06-01

    Rapid and accurate interpretation of cardiac arrhythmias by nurses has been linked with safe practice and positive patient outcomes. Although training in electrocardiogram rhythm recognition is part of most undergraduate nursing programmes, research continues to suggest that nurses and nursing students lack competence in recognising cardiac rhythms. In order to promote patient safety, nursing educators must develop valid and reliable assessment tools that allow the rigorous assessment of this competence before nursing students are allowed to practise without supervision. The aim of this study was to develop and psychometrically evaluate a toolkit to holistically assess competence in electrocardiogram rhythm recognition. Following a convenience sampling technique, 293 nursing students from a nursing faculty in a Spanish university were recruited for the study. The following three instruments were developed and psychometrically tested: an electrocardiogram knowledge assessment tool (ECG-KAT), an electrocardiogram skills assessment tool (ECG-SAT) and an electrocardiogram self-efficacy assessment tool (ECG-SES). Reliability and validity (content, criterion and construct) of these tools were meticulously examined. A high Cronbach's alpha coefficient demonstrated the excellent reliability of the instruments (ECG-KAT=0.89; ECG-SAT=0.93; ECG-SES=0.98). An excellent context validity index (scales' average content validity index>0.94) and very good criterion validity were evidenced for all the tools. Regarding construct validity, principal component analysis revealed that all items comprising the instruments contributed to measure knowledge, skills or self-efficacy in electrocardiogram rhythm recognition. Moreover, known-groups analysis showed the tools' ability to detect expected differences in competence between groups with different training experiences. The three-instrument toolkit developed showed excellent psychometric properties for measuring competence in

  14. Development and Validation of Instruments to Measure Learning of Expert-Like Thinking

    Science.gov (United States)

    Adams, Wendy K.; Wieman, Carl E.

    2011-01-01

    This paper describes the process for creating and validating an assessment test that measures the effectiveness of instruction by probing how well that instruction causes students in a class to think like experts about specific areas of science. The design principles and process are laid out and it is shown how these align with professional…

  15. Clinical assessment of dysphagia in neurodegeneration (CADN): development, validity and reliability of a bedside tool for dysphagia assessment.

    Science.gov (United States)

    Vogel, Adam P; Rommel, Natalie; Sauer, Carina; Horger, Marius; Krumm, Patrick; Himmelbach, Marc; Synofzik, Matthis

    2017-06-01

    Screening assessments for dysphagia are essential in neurodegenerative disease. Yet there are no purpose-built tools to quantify swallowing deficits at bedside or in clinical trials. A quantifiable, brief, easy to administer assessment that measures the impact of dysphagia and predicts the presence or absence of aspiration is needed. The Clinical Assessment of Dysphagia in Neurodegeneration (CADN) was designed by a multidisciplinary team (neurology, neuropsychology, speech pathology) validated against strict methodological criteria in two neurodegenerative diseases, Parkinson's disease (PD) and degenerative ataxia (DA). CADN comprises two parts, an anamnesis (part one) and consumption (part two). Two-thirds of patients were assessed using reference tests, the SWAL-QOL symptoms subscale (part one) and videofluoroscopic assessment of swallowing (part two). CADN has 11 items and can be administered and scored in an average of 7 min. Test-retest reliability was established using correlation and Bland-Altman plots. 125 patients with a neurodegenerative disease were recruited; 60 PD and 65 DA. Validity was established using ROC graphs and correlations. CADN has sensitivity of 79 and 84% and specificity 71 and 69% for parts one and two, respectively. Significant correlations with disease severity were also observed (p dysphagia symptomatology and risk of aspiration. The CADN is a reliable, valid, brief, quantifiable, and easily deployed assessment of swallowing in neurodegenerative disease. It is thus ideally suited for both clinical bedside assessment and future multicentre clinical trials in neurodegenerative disease.

  16. How to measure wisdom: content, reliability, and validity of five measures

    Science.gov (United States)

    Glück, Judith; König, Susanne; Naschenweng, Katja; Redzanowski, Uwe; Dorner, Lara; Straßer, Irene; Wiedermann, Wolfgang

    2013-01-01

    Wisdom is a field of growing interest both inside and outside academic psychology, and researchers are increasingly interested in using measures of wisdom in their work. However, wisdom is a highly complex construct, and its various operationalizations are based on quite different definitions. Which measure a researcher chooses for a particular research project may have a strong influence on the results. This study compares four well-established measures of wisdom—the Self-Assessed Wisdom Scale (Webster, 2003, 2007), the Three-Dimensional Wisdom Scale (Ardelt, 2003), the Adult Self-Transcendence Inventory (Levenson et al., 2005), and the Berlin Wisdom Paradigm (Baltes and Smith, 1990; Baltes and Staudinger, 2000)—with respect to content, reliability, factorial structure, and construct validity (relationships to wisdom nomination, interview-based wisdom ratings, and correlates of wisdom). The sample consisted of 47 wisdom nominees and 123 control participants. While none of the measures performed “better” than the others by absolute standards, recommendations are given for researchers to select the most suitable measure for their substantive interests. In addition, a “Brief Wisdom Screening Scale” is introduced that contains those 20 items from the three self-report scales that were most highly correlated with the common factor across the scales. PMID:23874310

  17. Reliability and validity of an internet-based questionnaire measuring lifetime physical activity.

    Science.gov (United States)

    De Vera, Mary A; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek

    2010-11-15

    Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005-2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity.

  18. Assessing functional mobility in survivors of lower-extremity sarcoma: reliability and validity of a new assessment tool.

    Science.gov (United States)

    Marchese, Victoria G; Rai, Shesh N; Carlson, Claire A; Hinds, Pamela S; Spearing, Elena M; Zhang, Lijun; Callaway, Lulie; Neel, Michael D; Rao, Bhaskar N; Ginsberg, Jill P

    2007-08-01

    Reliability and validity of a new tool, Functional Mobility Assessment (FMA), were examined in patients with lower-extremity sarcoma. FMA requires the patients to physically perform the functional mobility measures, unlike patient self-report or clinician administered measures. A sample of 114 subjects participated, 20 healthy volunteers and 94 patients with lower-extremity sarcoma after amputation, limb-sparing, or rotationplasty surgery. Reliability of the FMA was examined by three raters testing 20 healthy volunteers and 23 subjects with lower-extremity sarcoma. Concurrent validity was examined using data from 94 subjects with lower-extremity sarcoma who completed the FMA, Musculoskeletal Tumor Society (MSTS), Short-Form 36 (SF-36v2), and Toronto Extremity Salvage Scale (TESS) scores. Construct validity was measured by the ability of the FMA to discriminate between subjects with and without functional mobility deficits. FMA demonstrated excellent reliability (ICC [2,1] >or=0.97). Moderate correlations were found between FMA and SF-36v2 (r = 0.60, P < 0.01), FMA and MSTS (r = 0.68, P < 0.01), and FMA and TESS (r = 0.62, P < 0.01). The patients with lower-extremity sarcoma scored lower on the FMA as compared to healthy controls (P < 0.01). The FMA is a reliable and valid functional outcome measure for patients with lower-extremity sarcoma. This study supports the ability of the FMA to discriminate between patients with varying functional abilities and supports the need to include measures of objective functional mobility in examination of patients with lower-extremity sarcoma.

  19. Psychometric properties and validation of the Italian version of the Family Assessment Measure Third Edition – Short Version – in a nonclinical sample

    Directory of Open Access Journals (Sweden)

    Pellerone M

    2017-02-01

    reliability with Cronbach’s α coefficients equal to 0.96. The Brief FAM-III has satisfactory internal consistency, with Cronbach’s α equal to 0.90 for General Scale, 0.94 for Dyadic Relationships Scale, and 0.88 for the Self-Rating Scale. Conclusion: The Brief FAM-III can be a psychometrically reliable and valid measure for the assessment of family strengths and weaknesses within Italian contexts. The instrument can be used to obtain an overall idea of family functioning, for the purposes of preliminary screening, and for monitoring family functioning over time or during treatment. Keywords: family assessment, psychometric properties, Italian validation, family strengths, family weaknesses

  20. Validation and clinical significance of the Childhood Myositis Assessment Scale for assessment of muscle function in the juvenile idiopathic inflammatory myopathies.

    Science.gov (United States)

    Huber, Adam M; Feldman, Brian M; Rennebohm, Robert M; Hicks, Jeanne E; Lindsley, Carol B; Perez, Maria D; Zemel, Lawrence S; Wallace, Carol A; Ballinger, Susan H; Passo, Murray H; Reed, Ann M; Summers, Ronald M; White, Patience H; Katona, Ildy M; Miller, Frederick W; Lachenbruch, Peter A; Rider, Lisa G

    2004-05-01

    To examine the measurement characteristics of the Childhood Myositis Assessment Scale (CMAS) in children with juvenile idiopathic inflammatory myopathy (juvenile IIM), and to obtain preliminary data on the clinical significance of CMAS scores. One hundred eight children with juvenile IIM were evaluated on 2 occasions, 7-9 months apart, using various measures of physical function, strength, and disease activity. Interrater reliability, construct validity, and responsiveness of the CMAS were examined. The minimum clinically important difference (MID) and CMAS scores corresponding to various degrees of physical disability were estimated. The intraclass correlation coefficient for 26 patients assessed by 2 examiners was 0.89, indicating very good interrater reliability. The CMAS score correlated highly with the Childhood Health Assessment Questionnaire (C-HAQ) score and with findings on manual muscle testing (MMT) (r(s) = -0.73 and 0.73, respectively) and moderately with physician-assessed global disease activity and skin activity, parent-assessed global disease severity, and muscle magnetic resonance imaging (r(s) = -0.44 to -0.61), thereby demonstrating good construct validity. The standardized response mean was 0.81 (95% confidence interval 0.53, 1.09) in patients with at least 0.8 cm improvement on a 10-cm visual analog scale for physician-assessed global disease activity, indicating strong responsiveness. In bivariate regression models predicting physician-assessed global disease activity, MMT remained significant in models containing the CMAS (P = 0.03) while the C-HAQ did not (P = 0.4). Estimates of the MID ranged from 1.5 to 3.0 points on a 0-52-point scale. CMAS scores corresponding to no, mild, mild-to-moderate, and moderate physical disability, respectively, were 48, 45, 39, and 30. The CMAS exhibits good reliability, construct validity, and responsiveness, and is therefore a valid instrument for the assessment of physical function, muscle strength, and

  1. Validation of the one pass measure for motivational interviewing competence.

    Science.gov (United States)

    McMaster, Fiona; Resnicow, Ken

    2015-04-01

    This paper examines the psychometric properties of the OnePass coding system: a new, user-friendly tool for evaluating practitioner competence in motivational interviewing (MI). We provide data on reliability and validity with the current gold-standard: Motivational Interviewing Treatment Integrity tool (MITI). We compared scores from 27 videotaped MI sessions performed by student counselors trained in MI and simulated patients using both OnePass and MITI, with three different raters for each tool. Reliability was estimated using intra-class coefficients (ICCs), and validity was assessed using Pearson's r. OnePass had high levels of inter-rater reliability with 19/23 items found from substantial to almost perfect agreement. Taking the pair of scores with the highest inter-rater reliability on the MITI, the concurrent validity between the two measures ranged from moderate to high. Validity was highest for evocation, autonomy, direction and empathy. OnePass appears to have good inter-rater reliability while capturing similar dimensions of MI as the MITI. Despite the moderate concurrent validity with the MITI, the OnePass shows promise in evaluating both traditional and novel interpretations of MI. OnePass may be a useful tool for developing and improving practitioner competence in MI where access to MITI coders is limited. Copyright © 2015. Published by Elsevier Ireland Ltd.

  2. The Borderline Syndrome Index: a validation study using the personality assessment schedule.

    Science.gov (United States)

    Marlowe, M J; O'Neill-Byrne, K; Lowe-Ponsford, F; Watson, J P

    1996-01-01

    This study examines the validity and screening properties of the Borderline Syndrome Index--BSI (developed in the USA) for categories of the Personality Assessment Schedule--PAS (developed in the UK). Patients were recruited by case control sampling. Chance corrected agreement between instruments and screening properties of the BSI were calculated. The BSI proved a moderately sensitive but non-specific screen. Questionnaire scores were highly correlated with symptom measures. The results do not support the validity of the BSI or its use as a screening instrument. BSI scores may be distorted by current symptoms.

  3. Translation and validation of the assistive technology device predisposition assessment in Greek in order to assess satisfaction with use of the selected assistive device.

    Science.gov (United States)

    Koumpouros, Yiannis; Papageorgiou, Effie; Karavasili, Alexandra; Alexopoulou, Despoina

    2017-07-01

    To examine the Assistive Technology Device Predisposition Assessment scale and provide evidence of validity and reliability of the Greek version. We translated and adapted the original instrument in Greek according to the most well-known guidelines recommendations. Field test studies were conducted in a rehabilitation hospital to validate the appropriateness of the final results. Ratings of the different items were statistically analyzed. We recruited 115 subjects who were administered the Form E of the original questionnaire. The experimental analysis conducted revealed a three subscales structure: (i) Adaptability, (ii) Fit to Use, and (iii) Socializing. According to the results of our study the three subscales measure different constructs. Reliability measures (ICC = 0.981, Pearson's correlation = 0.963, Cronbach's α = 0.701) yielded high values. Test-retest outcome showed great stability. This is the first study, at least to the knowledge of the authors, which focuses merely on measuring the satisfaction of the users from the used assistive device, while exploring the Assistive Technology Device Predisposition Assessment - Device Form in such depth. According to the results, it is a stable, valid and reliable instrument and applicable to the Greek population. Thus, it can be used to measure the satisfaction of patients with assistive devices. Implications for Rehabilitation The paper explores the cultural adaptability and applicability of ATD PA - Device Form. ATD PA - Device Form can be used to assess user satisfaction by the selected assistive device. ATD PA - Device Form is a valid and reliable instrument in measuring users' satisfaction in Greekreality.

  4. Measurement and data analysis methods for field-scale wind erosion studies and model validation

    NARCIS (Netherlands)

    Zobeck, T.M.; Sterk, G.; Funk, R.F.; Rajot, J.L.; Stout, J.E.; Scott Van Pelt, R.

    2003-01-01

    Accurate and reliable methods of measuring windblown sediment are needed to confirm, validate, and improve erosion models, assess the intensity of aeolian processes and related damage, determine the source of pollutants, and for other applications. This paper outlines important principles to

  5. Animal-based measures for welfare assessment

    Directory of Open Access Journals (Sweden)

    Agostino Sevi

    2010-01-01

    Full Text Available Animal welfare assessment can’t be irrespective of measures taken on animals. Indeed, housing parametersrelatedtostructures, designandmicro-environment, evenifreliable parameters related to structures, design and micro-environment, even if reliable and easier to take, can only identify conditions which could be detrimental to animal welfare, but can’t predict poor welfare in animals per se. Welfare assessment through animal-based measures is almost complex, given that animals’ responses to stressful conditions largely depend on the nature, length and intensity of challenges and on physiological status, age, genetic susceptibility and previous experience of animals. Welfare assessment requires a multi-disciplinary approach and the monitoring of productive, ethological, endocrine, immunological and pathological param- eters to be exhaustive and reliable. So many measures are needed, because stresses can act only on some of the mentioned parameters or on all of them but at different times and degree. Under this point of view, the main aim of research is to find feasible and most responsive indicators of poor animal welfare. In last decades, studies focused on the following parameters for animal wel- fare assessment indexes of biological efficiency, responses to behavioral tests, cortisol secretion, neutrophil to lymphocyte ratio, lymphocyte proliferation, production of antigen specific IgG and cytokine release, somatic cell count and acute phase proteins. Recently, a lot of studies have been addressed to reduce handling and constraint of animals for taking measures to be used in welfare assessment, since such procedures can induce stress in animals and undermined the reliability of measures taken for welfare assessment. Range of animal-based measures for welfare assessment is much wider under experimental condition than at on-farm level. In welfare monitoring on-farm the main aim is to find feasible measures of proved validity and reliability

  6. Validation of behaviour measurement instrument of patients with diabetes mellitus and hypertension

    Science.gov (United States)

    Saputri, G. Z.; Akrom; Dini, S. M.

    2017-11-01

    Non-adherence to the treatment of chronic diseases such as hypertension and Diabetes Mellitus (DM) is a major obstacle in achieving patient therapy targets and quality of life of patients. A comprehensive approach involving pharmacists counselling has shown influences on changes in health behaviour and patient compliance. Behaviour changes in patients are one of the parameters to assess the effectiveness of counselling and education by pharmacists. Therefore, it is necessary to develop questionnaires of behaviour change measurement in DM-hypertension patients. This study aims to develop a measurement instrument in the form of questionnaires in assessing the behaviour change of DM-hypertension patients. Preparation of question items from the questionnaire research instrument refers to some guidelines and previous research references. Test of questionnaire instrument valid was done with expert validation, followed by pilot testing on 10 healthy respondents, and 10 DM-hypertension patients included in the inclusion criteria. Furthermore, field validation test was conducted on 37 patients who had undergone outpatient care at the PKU Muhammadiyah Yogyakarta City Hospital and The Gading Clinic in Yogyakarta. The inclusion criteria were male and female patients, aged 18-65, diagnosed with type 2 diabetes with hypertension who received oral antidiabetic drugs and antihypertensives, and who were not illiterate and co-operative. The data were collected by questionnaire interviews by a standardized pharmacist. The result of validation test using Person correlation shows the value of 0.33. The results of the questionnaire validation test on 37 patients showed 5 items of invalid questions with the value of r 0.33. The reliability value is shown from the Cronbach's alpha value of 0.722 (> 0.6), implying that the questionnaire is reliable for DM-hypertension patients. This Behavioural change questionnaire can be used on DM-hypertension patients, and an FGD approach is required

  7. Single-Item Measurement of Suicidal Behaviors: Validity and Consequences of Misclassification.

    Directory of Open Access Journals (Sweden)

    Alexander J Millner

    Full Text Available Suicide is a leading cause of death worldwide. Although research has made strides in better defining suicidal behaviors, there has been less focus on accurate measurement. Currently, the widespread use of self-report, single-item questions to assess suicide ideation, plans and attempts may contribute to measurement problems and misclassification. We examined the validity of single-item measurement and the potential for statistical errors. Over 1,500 participants completed an online survey containing single-item questions regarding a history of suicidal behaviors, followed by questions with more precise language, multiple response options and narrative responses to examine the validity of single-item questions. We also conducted simulations to test whether common statistical tests are robust against the degree of misclassification produced by the use of single-items. We found that 11.3% of participants that endorsed a single-item suicide attempt measure engaged in behavior that would not meet the standard definition of a suicide attempt. Similarly, 8.8% of those who endorsed a single-item measure of suicide ideation endorsed thoughts that would not meet standard definitions of suicide ideation. Statistical simulations revealed that this level of misclassification substantially decreases statistical power and increases the likelihood of false conclusions from statistical tests. Providing a wider range of response options for each item reduced the misclassification rate by approximately half. Overall, the use of single-item, self-report questions to assess the presence of suicidal behaviors leads to misclassification, increasing the likelihood of statistical decision errors. Improving the measurement of suicidal behaviors is critical to increase understanding and prevention of suicide.

  8. Content Validity of a Tool Measuring Medication Errors.

    Science.gov (United States)

    Tabassum, Nishat; Allana, Saleema; Saeed, Tanveer; Dias, Jacqueline Maria

    2015-08-01

    The objective of this study was to determine content and face validity of a tool measuring medication errors among nursing students in baccalaureate nursing education. Data was collected from the Aga Khan University School of Nursing and Midwifery (AKUSoNaM), Karachi, from March to August 2014. The tool was developed utilizing literature and the expertise of the team members, expert in different areas. The developed tool was then sent to five experts from all over Karachi for ensuring the content validity of the tool, which was measured on relevance and clarity of the questions. The Scale Content Validity Index (S-CVI) for clarity and relevance of the questions was found to be 0.94 and 0.98, respectively. The tool measuring medication errors has an excellent content validity. This tool should be used for future studies on medication errors, with different study populations such as medical students, doctors, and nurses.

  9. Development and validation of resource flexibility measures for manufacturing industry

    Directory of Open Access Journals (Sweden)

    Gulshan Chauhan

    2014-01-01

    Full Text Available Purpose: Global competition and ever changing customers demand have made manufacturing organizations to rapidly adjust to complexities, uncertainties, and changes. Therefore, flexibility in manufacturing resources is necessary to respond cost effectively and rapidly to changing production needs and requirements.  Ability of manufacturing resources to dynamically reallocate from one stage of a production process to another in response to shifting bottlenecks is recognized as resource flexibility. This paper aims to develop and validate resource flexibility measures for manufacturing industry that could be used by managers/ practitioners in assessing and improving the status of resource flexibility for the optimum utilization of resources. Design/methodology/approach: The study involves survey carried out in Indian manufacturing industry using a questionnaire to assess the status of various aspects of resource flexibility and their relationships. A questionnaire was specially designed covering various parameters of resource flexibility. Its reliability was checked by finding the value of Cronback alpha (0.8417. Relative weightage of various measures was found out by using Analytical Hierarchy Process (AHP. Pearson’s coefficient of correlation analysis was carried out to find out relationships between various parameters. Findings: From detailed review of literature on resource flexibility, 17 measures of resource flexibility and 47 variables were identified. The questionnaire included questions on all these measures and parameters. ‘Ability of machines to perform diverse set of operations’ and ability of workers to work on different machines’ emerged to be important measures with contributing weightage of 20.19% and 17.58% respectively.  All the measures were found to be significantly correlated with overall resource flexibility except ‘training of workers’, as shown by Pearson’s coefficient of correlation. This indicates that

  10. Assessment model validity document - HYDRASTAR. A stochastic continuum program for groundwater flow

    Energy Technology Data Exchange (ETDEWEB)

    Gylling, B. [Kemakta Konsult AB, Stockholm (Sweden); Eriksson, Lars [Equa Simulation AB, Sundbyberg (Sweden)

    2001-12-01

    The prevailing document addresses validation of the stochastic continuum model HYDRASTAR designed for Monte Carlo simulations of groundwater flow in fractured rocks. Here, validation is defined as a process to demonstrate that a model concept is fit for its purpose. Preferably, the validation is carried out by comparison of model predictions with independent field observations and experimental measurements. In addition, other sources can also be used to confirm that the model concept gives acceptable results. One method is to compare results with the ones achieved using other model concepts for the same set of input data. Another method is to compare model results with analytical solutions. The model concept HYDRASTAR has been used in several studies including performance assessments of hypothetical repositories for spent nuclear fuel. In the performance assessments, the main tasks for HYDRASTAR have been to calculate groundwater travel time distributions, repository flux distributions, path lines and their exit locations. The results have then been used by other model concepts to calculate the near field release and far field transport. The aim and framework for the validation process includes describing the applicability of the model concept for its purpose in order to build confidence in the concept. Preferably, this is made by comparisons of simulation results with the corresponding field experiments or field measurements. Here, two comparisons with experimental results are reported. In both cases the agreement was reasonably fair. In the broader and more general context of the validation process, HYDRASTAR results have been compared with other models and analytical solutions. Commonly, the approximation calculations agree well with the medians of model ensemble results. Additional indications that HYDRASTAR is suitable for its purpose were obtained from the comparisons with results from other model concepts. Several verification studies have been made for

  11. Application of a repeat-measure biomarker measurement error model to 2 validation studies: examination of the effect of within-person variation in biomarker measurements.

    Science.gov (United States)

    Preis, Sarah Rosner; Spiegelman, Donna; Zhao, Barbara Bojuan; Moshfegh, Alanna; Baer, David J; Willett, Walter C

    2011-03-15

    Repeat-biomarker measurement error models accounting for systematic correlated within-person error can be used to estimate the correlation coefficient (ρ) and deattenuation factor (λ), used in measurement error correction. These models account for correlated errors in the food frequency questionnaire (FFQ) and the 24-hour diet recall and random within-person variation in the biomarkers. Failure to account for within-person variation in biomarkers can exaggerate correlated errors between FFQs and 24-hour diet recalls. For 2 validation studies, ρ and λ were calculated for total energy and protein density. In the Automated Multiple-Pass Method Validation Study (n=471), doubly labeled water (DLW) and urinary nitrogen (UN) were measured twice in 52 adults approximately 16 months apart (2002-2003), yielding intraclass correlation coefficients of 0.43 for energy (DLW) and 0.54 for protein density (UN/DLW). The deattenuated correlation coefficient for protein density was 0.51 for correlation between the FFQ and the 24-hour diet recall and 0.49 for correlation between the FFQ and the biomarker. Use of repeat-biomarker measurement error models resulted in a ρ of 0.42. These models were similarly applied to the Observing Protein and Energy Nutrition Study (1999-2000). In conclusion, within-person variation in biomarkers can be substantial, and to adequately assess the impact of correlated subject-specific error, this variation should be assessed in validation studies of FFQs. © The Author 2011. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved.

  12. Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

    Science.gov (United States)

    Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

    2018-01-01

    The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.

  13. Examining the Reliability and Validity of the Effective Behavior Support Self-Assessment Survey

    Science.gov (United States)

    Solomon, Benjamin G.; Tobin, Kevin G.; Schutte, Gregory M.

    2015-01-01

    The Effective Behavior Support Self-Assessment Survey (SAS; Sugai, Horner, & Todd, 2003) is designed to measure perceived Positive Behavior Interventions and Supports (PBIS) implementation and identify priorities for improvement. Despite its longevity, little published research exists documenting its reliability or validity for these purposes.…

  14. A measure of smoking abstinence-related motivational engagement: development and initial validation.

    Science.gov (United States)

    Simmons, Vani N; Heckman, Bryan W; Ditre, Joseph W; Brandon, Thomas H

    2010-04-01

    Although a great deal of research has focused on measuring motivation and readiness to quit smoking, little research has assessed gross motivational changes after a smoker has made an attempt to quit smoking. Unlike previous single-item global measures of motivation to remain abstinent, we developed the abstinence-related motivational engagement (ARME) scale to evaluate the degree to which abstinence motivation is reflected by an ex-smoker's daily experience in areas that include cognitive effort, priority, vigilance, and excitement. The aim of this study was to collect reliability and initial construct validity data on this new measure. Participants were 199 ex-smokers recruited from the community and smoking cessation Web sites. Participants completed online measures including a global motivation measure, the ARME scale, demographic questionnaire, and a measure of cessation self-efficacy. The 16-item ARME questionnaire demonstrated high internal consistency reliability (alpha = .89). Analyses provided support for convergent, discriminant, and construct validity of the scale. ARME demonstrated the predicted correlation with a traditional measure of global cessation motivation, yet, also as predicted, only the ARME was negatively associated with length of abstinence. Moreover, as hypothesized, ex-smokers engaged in the quitting process via ongoing smoking Web site participation showed higher ARME scores than a comparison community sample. A five-item short form demonstrated similar psychometric properties. This study provided initial support for the ARME construct and offers two versions of a reliable instrument for assessing this construct. Future research will examine the ARME as a predictor of cessation outcome and a potential target for relapse prevention.

  15. Adapting social neuroscience measures for schizophrenia clinical trials, part 3: fathoming external validity.

    Science.gov (United States)

    Olbert, Charles M; Penn, David L; Kern, Robert S; Lee, Junghee; Horan, William P; Reise, Steven P; Ochsner, Kevin N; Marder, Stephen R; Green, Michael F

    2013-11-01

    It is unknown whether measures adapted from social neuroscience linked to specific neural systems will demonstrate relationships to external variables. Four paradigms adapted from social neuroscience were administered to 173 clinically stable outpatients with schizophrenia to determine their relationships to functionally meaningful variables and to investigate their incremental validity beyond standard measures of social and nonsocial cognition. The 4 paradigms included 2 that assess perception of nonverbal social and action cues (basic biological motion and emotion in biological motion) and 2 that involve higher level inferences about self and others' mental states (self-referential memory and empathic accuracy). Overall, social neuroscience paradigms showed significant relationships to functional capacity but weak relationships to community functioning; the paradigms also showed weak correlations to clinical symptoms. Evidence for incremental validity beyond standard measures of social and nonsocial cognition was mixed with additional predictive power shown for functional capacity but not community functioning. Of the newly adapted paradigms, the empathic accuracy task had the broadest external validity. These results underscore the difficulty of translating developments from neuroscience into clinically useful tasks with functional significance.

  16. Validation of a novel duplex ultrasound objective structured assessment of technical skills (DUOSATS) for arterial stenosis detection.

    Science.gov (United States)

    Jaffer, U; Singh, P; Pandey, V A; Aslam, M; Standfield, N J

    2014-01-01

    Duplex ultrasound facilitates bedside diagnosis and hence timely patient care. Its uptake has been hampered by training and accreditation issues. We have developed an assessment tool for Duplex arterial stenosis measurement for both simulator and patient based training. A novel assessment tool: duplex ultrasound assessment of technical skills was developed. A modified duplex ultrasound assessment of technical skills was used for simulator training. Novice, intermediate experience and expert users of duplex ultrasound were invited to participate. Participants viewed an instructional video and were allowed ample time to familiarize with the equipment. Participants' attempts were recorded and independently assessed by four experts using the modified duplex ultrasound assessment of technical skills. 'Global' assessment was also done on a four point Likert scale. Content, construct and concurrent validity as well as reliability were evaluated. Content and construct validity as well as reliability were demonstrated. The simulator had good satisfaction rating from participants: median 4; range 3-5. Receiver operator characteristic analysis has established a cut point of 22/ 34 and 25/ 40 were most appropriate for simulator and patient based assessment respectively. We have validated a novel assessment tool for duplex arterial stenosis detection. Further work is underway to establish transference validity of simulator training to improved skill in scanning patients. We have developed and validated duplex ultrasound assessment of technical skills for simulator training.

  17. Development and Validation of a Family Meeting Assessment Tool (FMAT).

    Science.gov (United States)

    Hagiwara, Yuya; Healy, Jennifer; Lee, Shuko; Ross, Jeanette; Fischer, Dixie; Sanchez-Reilly, Sandra

    2018-01-01

    A cornerstone procedure in Palliative Medicine is to perform family meetings. Learning how to lead a family meeting is an important skill for physicians and others who care for patients with serious illnesses and their families. There is limited evidence on how to assess best practice behaviors during end-of-life family meetings. Our aim was to develop and validate an observational tool to assess trainees' ability to lead a simulated end-of-life family meeting. Building on evidence from published studies and accrediting agency guidelines, an expert panel at our institution developed the Family Meeting Assessment Tool. All fourth-year medical students (MS4) and eight geriatric and palliative medicine fellows (GPFs) were invited to participate in a Family Meeting Objective Structured Clinical Examination, where each trainee assumed the physician role leading a complex family meeting. Two evaluators observed and rated randomly chosen students' performances using the Family Meeting Assessment Tool during the examination. Inter-rater reliability was measured using percent agreement. Internal consistency was measured using Cronbach α. A total of 141 trainees (MS4 = 133 and GPF = 8) and 26 interdisciplinary evaluators participated in the study. Internal reliability (Cronbach α) of the tool was 0.85. Number of trainees rated by two evaluators was 210 (MS4 = 202 and GPF = 8). Rater agreement was 84%. Composite scores, on average, were significantly higher for fellows than for medical students (P < 0.001). Expert-based content, high inter-rater reliability, good internal consistency, and ability to predict educational level provided initial evidence for construct validity for this novel assessment tool. Copyright © 2017 American Academy of Hospice and Palliative Medicine. All rights reserved.

  18. Evaluating trauma team performance in a Level I trauma center: Validation of the trauma team communication assessment (TTCA-24).

    Science.gov (United States)

    DeMoor, Stephanie; Abdel-Rehim, Shady; Olmsted, Richard; Myers, John G; Parker-Raley, Jessica

    2017-07-01

    Nontechnical skills (NTS), such as team communication, are well-recognized determinants of trauma team performance and good patient care. Measuring these competencies during trauma resuscitations is essential, yet few valid and reliable tools are available. We aimed to demonstrate that the Trauma Team Communication Assessment (TTCA-24) is a valid and reliable instrument that measures communication effectiveness during activations. Two tools with adequate psychometric strength (Trauma Nontechnical Skills Scale [T-NOTECHS], Team Emergency Assessment Measure [TEAM]) were identified during a systematic review of medical literature and compared with TTCA-24. Three coders used each tool to evaluate 35 stable and 35 unstable patient activations (defined according to Advanced Trauma Life Support criteria). Interrater reliability was calculated between coders using the intraclass correlation coefficient. Spearman rank correlation coefficient was used to establish concurrent validity between TTCA-24 and the other two validated tools. Coders achieved an intraclass correlation coefficient of 0.87 for stable patient activations and 0.78 for unstable activations scoring excellent on the interrater agreement guidelines. The median score for each assessment showed good team communication for all 70 videos (TEAM, 39.8 of 54; T-NOTECHS, 17.4 of 25; and TTCA-24, 87.4 of 96). A significant correlation between TTTC-24 and T-NOTECHS was revealed (p = 0.029), but no significant correlation between TTCA-24 and TEAM (p = 0.77). Team communication was rated slightly better across all assessments for stable versus unstable patient activations, but not statistically significant. TTCA-24 correlated with T-NOTECHS, an instrument measuring nontechnical skills for trauma teams, but not TEAM, a tool that assesses communication in generic emergency settings. TTCA-24 is a reliable and valid assessment that can be a useful adjunct when evaluating interpersonal and team communication during trauma

  19. Psychometric properties of three measures assessing advanced theory of mind: Evidence from people with schizophrenia.

    Science.gov (United States)

    Chen, Kuan-Wei; Lee, Shih-Chieh; Chiang, Hsin-Yu; Syu, Ya-Cing; Yu, Xiao-Xuan; Hsieh, Ching-Lin

    2017-11-01

    Patients with schizophrenia tend to have deficits in advanced Theory of Mind (ToM). The "Reading the mind in the eyes" test (RMET), the Faux Pas Task, and the Strange Stories are commonly used for assessing advanced ToM. However, most of the psychometric properties of these 3 measures in patients with schizophrenia are unknown. The aims of this study were to validate the psychometric properties of the 3 advanced ToM measures in patients with schizophrenia, including: (1) test-retest reliability; (2) random measurement error; (3) practice effect; (4) concurrent validity; and (5) ecological validity. We recruited 53 patients with schizophrenia, who completed the 3 measures twice, 4 weeks apart. The Revised Social Functioning Scale-Taiwan short version (R-SFST) was completed within 3 days of first session of assessments. We found that the intraclass correlation coefficients of the RMET, Strange Stories, and Faux Pas Task were 0.24, 0.5, and 0.76. All 3 advanced ToM measures had large random measurement error, trivial to small practice effects, poor concurrent validity, and low ecological validity. We recommend that the scores of the 3 advanced ToM measures be interpreted with caution because these measures may not provide reliable and valid results on patients' advanced ToM abilities. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Validating an instrument for measuring brand equity of CSR driven organizations in Malaysia

    Directory of Open Access Journals (Sweden)

    Singh Dara Singh Karpal

    2017-06-01

    Full Text Available The objective of this study is to develop and propose a valid and reliable instrument to measure brand equity of CSR driven organizations in Malaysia. An instrument to measure brand equity was constructed with adaptations from two key sources, namely Yew Leh and Lee (2011 and Yoo and Donthu (2001. As such the study only focuses on the development and validation of an instrument to measure brand equity of CSR driven organizations. The usable sample population included 909 respondents from 12 states of West Malaysia which were selected using a quota sampling plan. Confirmatory factor analysis (CFA and reliability analysis were carried out to test and validate the proposed brand equity instrument containing four components (brand awareness, brand association, perceived quality and brand loyalty with a total of 13 items. Results from the CFA and reliability analysis indicated that all the items representing the four components were valid and can be used to measure the brand equity of organizations that are practicing CSR. The study tried to set an empirical basis for brand equity and CSR related research which could be used by future researchers in different industries and geographical locations. The study also implies the need for organizations to assess the success of their CSR efforts through the use of the proposed instrument in order to gauge whether all their CSR efforts translate to improved brand equity.

  1. Patient Assessment of Constipation Quality of Life Questionnaire: Translation, Cultural Adaptation, Reliability, and Validity of the Persian Version.

    Science.gov (United States)

    Nikjooy, Afsaneh; Jafari, Hassan; Saba, Maryam A; Ebrahimi, Naghmeh; Mirzaei, Rezvan

    2018-05-01

    The Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire is the most validated and the most specific tool for measuring the quality of life of patients with constipation. Over 120 million people live in countries whose official language is Persian. There is no reported Persian version of the PAC-QOL questionnaire yet. The aim of this study was to translate and culturally adapt the PAC-QOL questionnaire and to assess its reliability and validity among Persian patients with chronic constipation. Following the translation and cultural adaptation of the PAC-QOL questionnaire to Persian, 100 patients (mean±SD age=40.51±13.67) with constipation were recruited for validity measurement and 20 patients were re-examined for reliability. Content validity was assessed based on the opinions of an expert committee and the floor/ceiling effect. Construct validity was evaluated according to the hypothesis test. The SF-36 questionnaire was used for concurrent criterion validity, intra-class correlation coefficient for reliability, and Cronbach's alpha for internal consistency. The content validity of the PAC-QOL questionnaire was proven, and there was no floor/ceiling effect. Construct validity also was confirmed based on the hypothesis test. The overall Cronbach's alpha of the PAC-QOL questionnaire was 0.92 (range=0.72-0.92), and the overall intra-class correlation coefficient of the questionnaire was 0.88 (range=0.69-0.87). The correlation between the SF-36 and PAC-QOL questionnaires was moderate. The Persian version of the PAC-QOL questionnaire demonstrated good validity and reliability properties in chronic constipation. Accordingly, Persian researchers and clinicians can benefit from this questionnaire in further research and assessment of treatment outcomes.

  2. Reliability and validity study of a tool to measure cancer stigma: Patient version

    Directory of Open Access Journals (Sweden)

    Medine Yilmaz

    2017-01-01

    Full Text Available Objective: The aim of this methodological study is to establish the validity and reliability of the Turkish version of “A Questionnaire for Measuring Attitudes toward Cancer (Cancer Stigma - Patient version.” Methods: The sample comprised oncology patients who had active cancer treatment. The construct validity was assessed using the confirmatory and exploratory factor analysis. Results: The mean age of the participants was 54.9±12.3 years. In the confirmatory factor analysis, fit values were determined as comparative fit index = 0.93, goodness of fit index = 0.91, normed-fit index=0.91, and root mean square error of approximation RMSEA = 0.09 (P<0.05 (Kaiser–Meyer–Olkin = 0.88, χ2 = 1084.41, Df = 66, and Barletta's test P<0.000. The first factor was “impossibility of recovery and experience of social discrimination” and the second factor was “stereotypes of cancer patients.” The two-factor structure accounted for 56.74% of the variance. The Cronbach's alpha value was determined as 0.88 for the two-factor scale. Conclusions: “A questionnaire for measuring attitudes toward cancer (cancer stigma - Patient version” is a reliable and valid questionnaire to assess stigmatization of cancer in cancer patients.

  3. Validating a Patient-Reported Comorbidity Measure with Respect to Quality of Life in End-Stage Renal Disease.

    Directory of Open Access Journals (Sweden)

    Maxi Robinski

    Full Text Available Medical record-derived comorbidity measures such as the Charlson Comorbidity Index (CCI do not predict functional limitations or quality of life (QoL in the chronically ill. Although these shortcomings are known since the 1980s, they have been largely ignored by the international literature. Recently, QoL has received growing interest as an end-point of interventional trials in Nephrology. The aim of this study is to compare a patient-reported comorbidity measure and the CCI with respect to its validity regarding QoL.The German Self-Administered Comorbidity Questionnaire (SCQ-G was completed by 780 adult end-stage renal disease-patients recruited from 55 dialysis units throughout Germany. Acceptance was evaluated via response rates. Content validity was examined by comparing the typical comorbidity pattern in dialysis patients and the pattern retrieved from our data. Convergent validity was assessed via kappa statistics. Data was compared to the CCI. Linear associations with QoL were examined (criterion validity.The SCQ-G was very well accepted by dialysis patients of all ages (response rate: 99%. Content validity can be interpreted as high (corresponding comorbidity items: 73.7%. Convergent validity was rather weak (.27≤ρ≤.29 but increased when comparing only concordant items (.39≤ρ≤.43. With respect to criterion validity, the SCQ-G performed better than the CCI regarding the correlation with QoL (e.g., SF-12-physical: SCQ-G total score: ρ = -.49 vs. CCI: ρ = -.36.The patient-reported measure proved to be more valid than the external assessment when aiming at insights on QoL. Due to the inclusion of subjective limitations, the SCQ-G is more substantial with respect to patient-centered outcomes and might be used as additional measure in clinical trials.

  4. Validation of a pediatric caregiver diary to measure symptoms of postacute respiratory syncytial virus bronchiolitis

    DEFF Research Database (Denmark)

    Santanello, Nancy C; Norquist, Josephine M; Nelsen, Linda M

    2005-01-01

    consistent, supporting a unidimensional scale structure. Test-retest reliabilities for the percentage of SFD and CSS were above the recommended cut point of 0.70. Cross-sectional and longitudinal correlations were sizeable and statistically significant, demonstrating construct validity. Hypothesized known......Acute respiratory syncytial virus (RSV)-induced bronchiolitis is often associated with continuing respiratory symptoms following hospitalization. To date, there is no validated objective measure to evaluate symptoms of RSV-induced bronchiolitis. We report on the reliability, validity...... the 4-week treatment period of the reported prospective, placebo-controlled trial of montelukast for treatment of postacute RSV were used to assess reliability (internal consistency and test-retest), construct validity (cross-sectional and longitudinal correlations), discriminant validity (known...

  5. Some considerations for validation of repository performance assessment models

    International Nuclear Information System (INIS)

    Eisenberg, N.

    1991-01-01

    Validation is an important aspect of the regulatory uses of performance assessment. A substantial body of literature exists indicating the manner in which validation of models is usually pursued. Because performance models for a nuclear waste repository cannot be tested over the long time periods for which the model must make predictions, the usual avenue for model validation is precluded. Further impediments to model validation include a lack of fundamental scientific theory to describe important aspects of repository performance and an inability to easily deduce the complex, intricate structures characteristic of a natural system. A successful strategy for validation must attempt to resolve these difficulties in a direct fashion. Although some procedural aspects will be important, the main reliance of validation should be on scientific substance and logical rigor. The level of validation needed will be mandated, in part, by the uses to which these models are put, rather than by the ideal of validation of a scientific theory. Because of the importance of the validation of performance assessment models, the NRC staff has engaged in a program of research and international cooperation to seek progress in this important area. 2 figs., 16 refs

  6. Validation of a global scale to assess the quality of interprofessional teamwork in mental health settings.

    Science.gov (United States)

    Tomizawa, Ryoko; Yamano, Mayumi; Osako, Mitue; Hirabayashi, Naotugu; Oshima, Nobuo; Sigeta, Masahiro; Reeves, Scott

    2017-12-01

    Few scales currently exist to assess the quality of interprofessional teamwork through team members' perceptions of working together in mental health settings. The purpose of this study was to revise and validate an interprofessional scale to assess the quality of teamwork in inpatient psychiatric units and to use it multi-nationally. A literature review was undertaken to identify evaluative teamwork tools and develop an additional 12 items to ensure a broad global focus. Focus group discussions considered adaptation to different care systems using subjective judgements from 11 participants in a pre-test of items. Data quality, construct validity, reproducibility, and internal consistency were investigated in the survey using an international comparative design. Exploratory factor analysis yielded five factors with 21 items: 'patient/community centred care', 'collaborative communication', 'interprofessional conflict', 'role clarification', and 'environment'. High overall internal consistency, reproducibility, adequate face validity, and reasonable construct validity were shown in the USA and Japan. The revised Collaborative Practice Assessment Tool (CPAT) is a valid measure to assess the quality of interprofessional teamwork in psychiatry and identifies the best strategies to improve team performance. Furthermore, the revised scale will generate more rigorous evidence for collaborative practice in psychiatry internationally.

  7. Infusion phlebitis assessment measures: a systematic review

    OpenAIRE

    Ray-Barruel, Gillian; Polit, Denise F; Murfield, Jenny E; Rickard, Claire M

    2014-01-01

    Rationale, aims and objectives Phlebitis is a common and painful complication of peripheral intravenous cannulation. The aim of this review was to identify the measures used in infusion phlebitis assessment and evaluate evidence regarding their reliability, validity, responsiveness and feasibility. Method We conducted a systematic literature review of the Cochrane library, Ovid MEDLINE and EBSCO CINAHL until September 2013. All English-language studies (randomized controlled trials, prospecti...

  8. Validity of a smartphone protractor to measure sagittal parameters in adult spinal deformity.

    Science.gov (United States)

    Kunkle, William Aaron; Madden, Michael; Potts, Shannon; Fogelson, Jeremy; Hershman, Stuart

    2017-10-01

    Smartphones have become an integral tool in the daily life of health-care professionals (Franko 2011). Their ease of use and wide availability often make smartphones the first tool surgeons use to perform measurements. This technique has been validated for certain orthopedic pathologies (Shaw 2012; Quek 2014; Milanese 2014; Milani 2014), but never to assess sagittal parameters in adult spinal deformity (ASD). This study was designed to assess the validity, reproducibility, precision, and efficiency of using a smartphone protractor application to measure sagittal parameters commonly measured in ASD assessment and surgical planning. This study aimed to (1) determine the validity of smartphone protractor applications, (2) determine the intra- and interobserver reliability of smartphone protractor applications when used to measure sagittal parameters in ASD, (3) determine the efficiency of using a smartphone protractor application to measure sagittal parameters, and (4) elucidate whether a physician's level of experience impacts the reliability or validity of using a smartphone protractor application to measure sagittal parameters in ASD. An experimental validation study was carried out. Thirty standard 36″ standing lateral radiographs were examined. Three separate measurements were performed using a marker and protractor; then at a separate time point, three separate measurements were performed using a smartphone protractor application for all 30 radiographs. The first 10 radiographs were then re-measured two more times, for a total of three measurements from both the smartphone protractor and marker and protractor. The parameters included lumbar lordosis, pelvic incidence, and pelvic tilt. Three raters performed all measurements-a junior level orthopedic resident, a senior level orthopedic resident, and a fellowship-trained spinal deformity surgeon. All data, including the time to perform the measurements, were recorded, and statistical analysis was performed to

  9. The reliability and validity of radiographic measurements for determining the three-dimensional position of the talus in varus and valgus osteoarthritic ankles.

    Science.gov (United States)

    Nosewicz, Tomasz L; Knupp, Markus; Bolliger, Lilianna; Hintermann, Beat

    2012-12-01

    To assess the most accurate radiographic method to determine talar three-dimensional position in varus and valgus osteoarthritic ankles, we evaluated the reliability and validity of different radiographic measurements. Nine radiographic measurements were performed blindly on weight-bearing mortise, sagittal, and horizontal radiographs of 33 varus and 33 valgus feet (63 patients). Intra- and interobserver reliability was determined with the intraclass coefficient (ICC). Discriminant validity of measurements between varus and valgus feet was assessed with effect size (ES). Convergent validity (Pearson's r) was evaluated by correlating measurements to the dichotomized varus and valgus groups. Obtained measurements in both groups were finally compared with each other and with 30 control feet. Reliability was excellent (ICC > 0.80) in all but two measurements. Whereas frontal plane validity was excellent (ES and r > 0.80), horizontal and sagittal measurements showed poor to moderate validity (ES and r between 0.00 and 0.60). Four measurements were significantly different among all groups (p reliability, validity, and difference among the groups. The frontal tibiotalar surface angle, sagittal talocalcaneal inclination angle, and horizontal talometatarsal I angle accurately determine talar three-dimensional radiographic position in weight-bearing varus and valgus osteoarthritic ankles. Careful radiographic evaluation is important, as these deformities affect talar position in all three planes.

  10. Using qualitative methods to improve questionnaires for Spanish speakers: assessing face validity of a food behavior checklist.

    Science.gov (United States)

    Banna, Jinan C; Vera Becerra, Luz E; Kaiser, Lucia L; Townsend, Marilyn S

    2010-01-01

    Development of outcome measures relevant to health nutrition behaviors requires a rigorous process of testing and revision. Whereas researchers often report performance of quantitative data collection to assess questionnaire validity and reliability, qualitative testing procedures are often overlooked. This report outlines a procedure for assessing face validity of a Spanish-language dietary assessment tool. Reviewing the literature produced no rigorously validated Spanish-language food behavior assessment tools for the US Department of Agriculture's food assistance and education programs. In response to this need, this study evaluated the face validity of a Spanish-language food behavior checklist adapted from a 16-item English version of a food behavior checklist shown to be valid and reliable for limited-resource English speakers. The English version was translated using rigorous methods involving initial translation by one party and creation of five possible versions. Photos were modified based on client input and new photos were taken as necessary. A sample of low-income, Spanish-speaking women completed cognitive interviews (n=20). Spanish translation experts (n=7) fluent in both languages and familiar with both cultures made minor modifications but essentially approved client preferences. The resulting checklist generated a readability score of 93, indicating low reading difficulty. The Spanish-language checklist has adequate face validity in the target population and is ready for further validation using convergent measures. At the conclusion of testing, this instrument may be used to evaluate nutrition education interventions in California. These qualitative procedures provide a framework for designing evaluation tools for low-literate audiences participating in the US Department of Agriculture food assistance and education programs. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.

  11. Cultural adaptation and validation of the Freiburg Life Quality Assessment - Wound Module to Brazilian Portuguese

    Directory of Open Access Journals (Sweden)

    Elaine Aparecida Rocha Domingues

    2016-01-01

    Full Text Available Objectives: to adapt the Freiburg Life Quality Assessment - Wound Module to Brazilian Portuguese and to measure its psychometric properties: reliability and validity. Method: the cultural adaptation was undertaken following the stages of translation, synthesis of the translations, back translation, committee of specialists, pre-test and focus group. A total of 200 patients participated in the study. These were recruited in Primary Care Centers, Family Health Strategy Centers, in a philanthropic hospital and in a teaching hospital. Reliability was assessed through internal consistency and stability. Validity was ascertained through the correlation of the instrument's values with those of the domains of the Ferrans and Powers Quality of Life Index - Wound Version and with the quality of life score of the visual analog scale. Results: the instrument presented adequate internal consistency (Cronbach alpha =0.86 and high stability in the test and retest (0.93. The validity presented correlations of moderate and significant magnitude (-0.24 to -0.48, p<0.0001. Conclusion: the results indicated that the adapted version presented reliable and valid psychometric measurements for the population with chronic wounds in the Brazilian culture.

  12. Validity evidence for the Security Scale as a measure of perceived attachment security in adolescence.

    Science.gov (United States)

    Van Ryzin, Mark J; Leve, Leslie D

    2012-04-01

    In this study, the validity of a self-report measure of children's perceived attachment security (the Kerns Security Scale) was tested using adolescents. With regards to predictive validity, the Security Scale was significantly associated with (1) observed mother-adolescent interactions during conflict and (2) parent- and teacher-rated social competence. With regards to convergent validity, the Security Scale was significantly associated with all subscales of the Adult Attachment Scale (i.e., Depend, Anxiety, and Close) as measured 3 years later. Further, these links were found even after controlling for mother-child relationship quality as assessed by the Inventory of Parent and Peer Attachment (IPPA), and chi-square difference tests indicated that the Security Scale was generally a stronger predictor as compared to the IPPA. These results suggest that the Security Scale can be used to assess perceived attachment security across both childhood and adolescence, and thus could contribute significantly to developmental research during this period. Copyright © 2011 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.

  13. Situational awareness of hazards: Validation of multi-source radiation measurements

    Science.gov (United States)

    Hultquist, C.; Cervone, G.

    2016-12-01

    Citizen-led movements producing scientific hazard data during disasters are increasingly common. After the Japanese earthquake-triggered tsunami in 2011, and the resulting radioactive releases at the damaged Fukushima Daiichi nuclear power plants, citizens monitored on-ground levels of radiation with innovative mobile devices built from off-the-shelf components. To date, the citizen-led SAFECAST project has recorded 50 million radiation measurements worldwide, with the majority of these measurements from Japan. The analysis of data which are multi-dimensional, not vetted, and provided from multiple devices presents big data challenges due to their volume, velocity, variety, and veracity. While the SAFECAST project produced massive open-source radiation measurements at specific coordinates and times, the reliability and validity of the overall data have not yet been assessed. The nuclear disaster provides a case for assessing the SAFECAST data with official aerial remote sensing radiation data jointly collected by the governments of the United States and Japan. A spatial and statistical assessment of SAFECAST requires several preprocessing steps. First, SAFECAST ionized radiation sensors collected data using different units of measure than the government data, and they had to be converted. Secondly, the normally occurring radiation and decay rates of Cesium from deposition surveys were used to properly compare measurements in space and time. Finally, the GPS located points were selected within overlapping extents at multiple spatial resolutions. Quantitative measures were used to assess the similarity and differences in the observed measurements. Radiation measurements from the same geographic extents show similar spatial variations and statistically significant correlations. The results suggest that actionable scientific data for disasters and emergencies can be inferred from non-traditional and not vetted data generated through citizen science projects. This

  14. Traditional Masculinity and Femininity: Validation of a New Scale Assessing Gender Roles

    Science.gov (United States)

    Kachel, Sven; Steffens, Melanie C.; Niedlich, Claudia

    2016-01-01

    Gender stereotype theory suggests that men are generally perceived as more masculine than women, whereas women are generally perceived as more feminine than men. Several scales have been developed to measure fundamental aspects of gender stereotypes (e.g., agency and communion, competence and warmth, or instrumentality and expressivity). Although omitted in later version, Bem's original Sex Role Inventory included the items “masculine” and “feminine” in addition to more specific gender-stereotypical attributes. We argue that it is useful to be able to measure these two core concepts in a reliable, valid, and parsimonious way. We introduce a new and brief scale, the Traditional Masculinity-Femininity (TMF) scale, designed to assess central facets of self-ascribed masculinity-femininity. Studies 1–2 used known-groups approaches (participants differing in gender and sexual orientation) to validate the scale and provide evidence of its convergent validity. As expected the TMF reliably measured a one-dimensional masculinity-femininity construct. Moreover, the TMF correlated moderately with other gender-related measures. Demonstrating incremental validity, the TMF predicted gender and sexual orientation in a superior way than established adjective-based measures. Furthermore, the TMF was connected to criterion characteristics, such as judgments as straight by laypersons for the whole sample, voice pitch characteristics for the female subsample, and contact to gay men for the male subsample, and outperformed other gender-related scales. Taken together, as long as gender differences continue to exist, we suggest that the TMF provides a valuable methodological addition for research into gender stereotypes. PMID:27458394

  15. Traditional Masculinity and Femininity: Validation of a New Scale Assessing Gender Roles.

    Science.gov (United States)

    Kachel, Sven; Steffens, Melanie C; Niedlich, Claudia

    2016-01-01

    Gender stereotype theory suggests that men are generally perceived as more masculine than women, whereas women are generally perceived as more feminine than men. Several scales have been developed to measure fundamental aspects of gender stereotypes (e.g., agency and communion, competence and warmth, or instrumentality and expressivity). Although omitted in later version, Bem's original Sex Role Inventory included the items "masculine" and "feminine" in addition to more specific gender-stereotypical attributes. We argue that it is useful to be able to measure these two core concepts in a reliable, valid, and parsimonious way. We introduce a new and brief scale, the Traditional Masculinity-Femininity (TMF) scale, designed to assess central facets of self-ascribed masculinity-femininity. Studies 1-2 used known-groups approaches (participants differing in gender and sexual orientation) to validate the scale and provide evidence of its convergent validity. As expected the TMF reliably measured a one-dimensional masculinity-femininity construct. Moreover, the TMF correlated moderately with other gender-related measures. Demonstrating incremental validity, the TMF predicted gender and sexual orientation in a superior way than established adjective-based measures. Furthermore, the TMF was connected to criterion characteristics, such as judgments as straight by laypersons for the whole sample, voice pitch characteristics for the female subsample, and contact to gay men for the male subsample, and outperformed other gender-related scales. Taken together, as long as gender differences continue to exist, we suggest that the TMF provides a valuable methodological addition for research into gender stereotypes.

  16. PROMIS GH (Patient-Reported Outcomes Measurement Information System Global Health) Scale in Stroke: A Validation Study.

    Science.gov (United States)

    Katzan, Irene L; Lapin, Brittany

    2018-01-01

    The International Consortium for Health Outcomes Measurement recently included the 10-item PROMIS GH (Patient-Reported Outcomes Measurement Information System Global Health) scale as part of their recommended Standard Set of Stroke Outcome Measures. Before collection of PROMIS GH is broadly implemented, it is necessary to assess its performance in the stroke population. The objective of this study was to evaluate the psychometric properties of PROMIS GH in patients with ischemic stroke and intracerebral hemorrhage. PROMIS GH and 6 PROMIS domain scales measuring same/similar constructs were electronically collected on 1102 patients with ischemic and hemorrhagic strokes at various stages of recovery from their stroke who were seen in a cerebrovascular clinic from October 12, 2015, through June 2, 2017. Confirmatory factor analysis was performed to evaluate the adequacy of 2-factor structure of component scores. Test-retest reliability and convergent validity of PROMIS GH items and component scores were assessed. Discriminant validity and responsiveness were compared between PROMIS GH and PROMIS domain scales measuring the same or related constructs. Analyses were repeated stratified by stroke subtype and modified Rankin Scale score validity was good with significant correlations between all PROMIS GH items and PROMIS domain scales ( P 0.5) was demonstrated for 8 of the 10 PROMIS GH items. Reliability and validity remained consistent across stroke subtype and disability level (modified Rankin Scale, <2 versus ≥2). PROMIS GH exhibits acceptable performance in patients with stroke. Our findings support International Consortium for Health Outcomes Measurement recommendation to use PROMIS GH as part of the standard set of outcome measures in stroke. © 2017 American Heart Association, Inc.

  17. Reliability and validity of the transport and physical activity questionnaire (TPAQ for assessing physical activity behaviour.

    Directory of Open Access Journals (Sweden)

    Emma J Adams

    Full Text Available No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ.The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA.In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59, cycling for transport (ICC = 0.61, walking for recreation (ICC = 0.48, cycling for recreation (ICC = 0.35, moderate leisure-time physical activity (ICC = 0.47, vigorous leisure-time physical activity (ICC = 0.63, and total physical activity (ICC = 0.56. The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60. In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, p<0.001, fair but non-significant agreement for moderate physical activity (r = 0.24, p = 0.09 and fair agreement for MVPA (r = 0.27, p = 0.05. Bland-Altman analysis showed a mean

  18. Reliability and Validity of the Transport and Physical Activity Questionnaire (TPAQ) for Assessing Physical Activity Behaviour

    Science.gov (United States)

    Adams, Emma J.; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C.; Cooper, Ashley R.; Ogilvie, David

    2014-01-01

    Background No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). Methods The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). Results In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, pphysical activity (r = 0.24, p = 0.09) and fair agreement for MVPA (r = 0.27, p = 0.05). Bland-Altman analysis showed a mean overestimation of MVPA of 87.6 min/week (p

  19. TWO CRITERIA FOR GOOD MEASUREMENTS IN RESEARCH: VALIDITY AND RELIABILITY

    Directory of Open Access Journals (Sweden)

    Haradhan Kumar Mohajan

    2017-12-01

    Full Text Available Reliability and validity are two most important and fundamental features in the evaluation of any measurement instrument or toll for a good research. The purpose of this research is to discuss the validity and reliability of measurement instruments that are used in research. Validity concerns what an instrument measures, and how well it does so. Reliability concerns the faith that one can have in the data obtained from use of an instrument, that is, the degree to which any measuring tool controls for random error. An attempt has been taken here to review the reliability and validity, and threat to them in some details.

  20. Validity of a questionnaire measuring motives for choosing foods including sustainable concerns.

    Science.gov (United States)

    Sautron, Valérie; Péneau, Sandrine; Camilleri, Géraldine M; Muller, Laurent; Ruffieux, Bernard; Hercberg, Serge; Méjean, Caroline

    2015-04-01

    Since the 1990s, sustainability of diet has become an increasingly important concern for consumers. However, there is no validated multidimensional measurement of motivation in the choice of foods including a concern for sustainability currently available. In the present study, we developed a questionnaire that measures food choice motives during purchasing, and we tested its psychometric properties. The questionnaire included 104 items divided into four predefined dimensions (environmental, health and well-being, economic and miscellaneous). It was administered to 1000 randomly selected subjects participating in the Nutrinet-Santé cohort study. Among 637 responders, one-third found the questionnaire complex or too long, while one-quarter found it difficult to fill in. Its underlying structure was determined by exploratory factor analysis and then internally validated by confirmatory factor analysis. Reliability was also assessed by internal consistency of selected dimensions and test-retest repeatability. After selecting the most relevant items, first-order analysis highlighted nine main dimensions: labeled ethics and environment, local and traditional production, taste, price, environmental limitations, health, convenience, innovation and absence of contaminants. The model demonstrated excellent internal validity (adjusted goodness of fit index = 0.97; standardized root mean square residuals = 0.07) and satisfactory reliability (internal consistency = 0.96, test-retest repeatability coefficient ranged between 0.31 and 0.68 over a mean 4-week period). This study enabled precise identification of the various dimensions in food choice motives and proposed an original, internally valid tool applicable to large populations for assessing consumer food motivation during purchasing, particularly in terms of sustainability. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Validating a new device for measuring tear evaporation rates.

    Science.gov (United States)

    Rohit, Athira; Ehrmann, Klaus; Naduvilath, Thomas; Willcox, Mark; Stapleton, Fiona

    2014-01-01

    To calibrate and validate a commercially available dermatology instrument to measure tear evaporation rate of contact lens wearers. A dermatology instrument was modified by attaching a swim goggle cup such that the cup sealed around the eye socket. Results for the unmodified instrument are dependent on probe area and enclosed volume. Calibration curves were established using a model eye, to account for individual variations in chamber volume and exposed area. Fifteen participants were recruited and the study included a contact lens wear and a no contact lens wear stage. Day and diurnal variation of the measurements were assessed by taking the measurement three times a day over 2 days. The coefficient of repeatability of the measurement was calculated and a linear mixed model assessed the influence of humidity, temperature, contact lens wear, day and diurnal variations on tear evaporation rate. The associations between variables were assessed using Pearson correlation coefficient. Absolute evaporation rates with and without contact lens wear were calculated based on the new calibration. The measurements were most repeatable during the evening with no lens wear (COR = 49 g m⁻² h) and least repeatable during the evening with contact lens wear (COR = 93 g m⁻² h). Humidity (p = 0.007), and contact lens wear (p evaporation rate. However, temperature (p = 0.54) diurnal variation (p = 0.85) and different days (p = 0.65) had no significant effect after controlling for humidity. Tear evaporation rates can be measured using a modified dermatology instrument. Measurements were higher and more variable with lens wear consistent with previous literature. Control of environmental conditions is important as a higher humidity results in a reduced evaporation rate. © 2013 The Authors Ophthalmic & Physiological Optics © 2013 The College of Optometrists.

  2. Exploring the validity of the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT) with established emotions measures.

    Science.gov (United States)

    Roberts, Richard D; Schulze, Ralf; O'Brien, Kristin; MacCann, Carolyn; Reid, John; Maul, Andy

    2006-11-01

    Emotions measures represent an important means of obtaining construct validity evidence for emotional intelligence (EI) tests because they have the same theoretical underpinnings. Additionally, the extent to which both emotions and EI measures relate to intelligence is poorly understood. The current study was designed to address these issues. Participants (N = 138) completed the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT), two emotions measures, as well as four intelligence tests. Results provide mixed support for the model hypothesized to underlie the MSCEIT, with emotions research and EI measures failing to load on the same factor. The emotions measures loaded on the same factor as intelligence measures. The validity of certain EI components (in particular, Emotion Perception), as currently assessed, appears equivocal. Copyright 2006 APA, all rights reserved.

  3. Cross-cultural adaptation and validation of the Hungarian version of the Core Outcome Measures Index for the back (COMI Back).

    Science.gov (United States)

    Klemencsics, Istvan; Lazary, Aron; Valasek, Tamas; Szoverfi, Zsolt; Bozsodi, Arpad; Eltes, Peter; Fekete, Tamás Fülöp; Varga, Peter Pal

    2016-01-01

    The Core Outcome Measure Index (COMI) is a short, multidimensional outcome instrument developed for the evaluation of patients with spinal conditions. The aim of this study was to produce a cross-culturally adapted and validated Hungarian version of the COMI Back questionnaire. A cross-cultural adaptation of the COMI into Hungarian was carried out using established guidelines. Low back pain patients completed a booklet of questionnaires containing the Hungarian versions of COMI, Oswestry Disability Index (ODI) and WHO Quality of Life-BREF assessment (WHOQOL-BREF). The validation of the COMI included assessment of its construct validity, reliability, and responsiveness. 145 patients participated in the assessment of reliability and 159 surgically treated patients were included in the responsiveness study. Excellent correlation was found between COMI and ODI scores (rho = 0.83, p cross-cultural adaptation of the COMI into the Hungarian language was successful, resulting in a reliable and valid measurement tool with good clinimetric properties.

  4. Life-Space Assessment scale to assess mobility: validation in Latin American older women and men.

    Science.gov (United States)

    Curcio, Carmen-Lucia; Alvarado, Beatriz E; Gomez, Fernando; Guerra, Ricardo; Guralnik, Jack; Zunzunegui, Maria Victoria

    2013-10-01

    The Life-Space Assessment (LSA) instrument of the University of Alabama and Birmingham study is a useful and innovative measure of mobility in older populations. The purpose of this article was to assess the reliability, construct and convergent validity of the LSA in Latin American older populations. In a cross-sectional study, a total of 150 women and 150 men, aged 65-74 years, were recruited from seniors' community centers in Manizales, Colombia and Natal, Brazil. The LSA questionnaire summarizes where people travel (5 levels from room to places outside of town), how often and any assistance needed. Four LSA variables were obtained according to the maximum life space achieved and the level of independence. As correlates of LSA, education, perception of income sufficiency, depression, cognitive function, and functional measures (objective and subjectively measured) were explored. The possible modifying effect of the city on correlates of LSA was examined. Reliability for the composite LSA score was substantial (ICC = 0.70; 95 % CI 0.49-0.83) in Manizales. Average levels of LSA scores were higher in those with better functional performance and those who reported less mobility difficulties. Low levels of education, insufficient income, depressive symptoms, and low scores of cognitive function were all significantly related to lower LSA scores. Women in both cities were more likely to be restricted to their neighborhood and had lower LSA scores. This study provides evidence for the validity of LSA in two Latin American populations. Our results suggest that LSA is a good measure of mobility that reflects the interplay of physical functioning with gender and the social and physical environment.

  5. Validity and Interrater Reliability of the Visual Quarter-Waste Method for Assessing Food Waste in Middle School and High School Cafeteria Settings.

    Science.gov (United States)

    Getts, Katherine M; Quinn, Emilee L; Johnson, Donna B; Otten, Jennifer J

    2017-11-01

    Measuring food waste (ie, plate waste) in school cafeterias is an important tool to evaluate the effectiveness of school nutrition policies and interventions aimed at increasing consumption of healthier meals. Visual assessment methods are frequently applied in plate waste studies because they are more convenient than weighing. The visual quarter-waste method has become a common tool in studies of school meal waste and consumption, but previous studies of its validity and reliability have used correlation coefficients, which measure association but not necessarily agreement. The aims of this study were to determine, using a statistic measuring interrater agreement, whether the visual quarter-waste method is valid and reliable for assessing food waste in a school cafeteria setting when compared with the gold standard of weighed plate waste. To evaluate validity, researchers used the visual quarter-waste method and weighed food waste from 748 trays at four middle schools and five high schools in one school district in Washington State during May 2014. To assess interrater reliability, researcher pairs independently assessed 59 of the same trays using the visual quarter-waste method. Both validity and reliability were assessed using a weighted κ coefficient. For validity, as compared with the measured weight, 45% of foods assessed using the visual quarter-waste method were in almost perfect agreement, 42% of foods were in substantial agreement, 10% were in moderate agreement, and 3% were in slight agreement. For interrater reliability between pairs of visual assessors, 46% of foods were in perfect agreement, 31% were in almost perfect agreement, 15% were in substantial agreement, and 8% were in moderate agreement. These results suggest that the visual quarter-waste method is a valid and reliable tool for measuring plate waste in school cafeteria settings. Copyright © 2017 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.

  6. Expert validation of a teamwork assessment rubric: A modified Delphi study.

    Science.gov (United States)

    Parratt, Jenny A; Fahy, Kathleen M; Hutchinson, Marie; Lohmann, Gui; Hastie, Carolyn R; Chaseling, Marilyn; O'Brien, Kylie

    2016-01-01

    Teamwork is a 'soft skill' employability competence desired by employers. Poor teamwork skills in healthcare have an impact on adverse outcomes. Teamwork skills are rarely the focus of teaching and assessment in undergraduate courses. The TeamUP Rubric is a tool used to teach and evaluate undergraduate students' teamwork skills. Students also use the rubric to give anonymised peer feedback during team-based academic assignments. The rubric's five domains focus on planning, environment, facilitation, conflict management and individual contribution; each domain is grounded in relevant theory. Students earn marks for their teamwork skills; validity of the assessment rubric is critical. To what extent do experts agree that the TeamUP Rubric is a valid assessment of 'teamwork skills'? Modified Delphi technique incorporating Feminist Collaborative Conversations. A heterogeneous panel of 35 professionals with recognised expertise in communications and/or teamwork. Three Delphi rounds using a survey that included the rubric were conducted either face-to-face, by telephone or online. Quantitative analysis yielded item content validity indices (I-CVI); minimum consensus was pre-set at 70%. An average of the I-CVI also yielded sub-scale (domain) (D-CVI/Ave) and scale content validity indices (S-CVI/Ave). After each Delphi round, qualitative data were analysed and interpreted; Feminist Collaborative Conversations by the research team aimed to clarify and confirm consensus about the wording of items on the rubric. Consensus (at 70%) was obtained for all but one behavioural descriptor of the rubric. We modified that descriptor to address expert concerns. The TeamUP Rubric (Version 4) can be considered to be well validated at that level of consensus. The final rubric reflects underpinning theory, with no areas of conceptual overlap between rubric domains. The final TeamUP Rubric arising from this study validly measures individual student teamwork skills and can be used with

  7. Cross-cultural validation of the Persian version of the Functional Independence Measure for patients with stroke.

    Science.gov (United States)

    Naghdi, Soofia; Ansari, Noureddin Nakhostin; Raji, Parvin; Shamili, Aryan; Amini, Malek; Hasson, Scott

    2016-01-01

    To translate and cross-culturally adapt the Functional Independence Measure (FIM) into the Persian language and to test the reliability and validity of the Persian FIM (PFIM) in patients with stroke. In this cross-sectional study carried out in an outpatient stroke rehabilitation center, 40 patients with stroke (mean age 60 years) were participated. A standard forward-backward translation method and expert panel validation was followed to develop the PFIM. Two experienced occupational therapists (OTs) assessed the patients independently in all items of the PFIM in a single session for inter-rater reliability. One of the OTs reassessed the patients after 1 week for intra-rater reliability. There were no floor or ceiling effects for the PFIM. Excellent inter-rater and intra-rater reliability was noted for the PFIM total score, motor and cognitive subscales (ICC(agreement)0.88-0.98). According to the Bland-Altman agreement analysis, there was no systematic bias between raters and within raters. The internal consistency of the PFIM was with Cronbach's alpha from 0.70 to 0.96. The principal component analysis with varimax rotation indicated a three-factor structure: (1) self-care and mobility; (2) sphincter control and (3) cognitive that jointly accounted for 74.8% of the total variance. Construct validity was supported by a significant Pearson correlation between the PFIM and the Persian Barthel Index (r = 0.95; p Persian patients with stroke. The Functional Independence Measure (FIM) is an outcome measure for disability based on the International Classification of Functioning, Disability and Health (ICF). The FIM was cross-culturally adapted and validated into Persian language. The Persian version of the FIM (PFIM) is reliable and valid for assessing functional status of patients with stroke. The PFIM can be used in Persian speaking countries to assess the limitations in activities of daily living of patients with stroke.

  8. Using the Rasch measurement model to design a report writing assessment instrument.

    Science.gov (United States)

    Carlson, Wayne R

    2013-01-01

    This paper describes how the Rasch measurement model was used to develop an assessment instrument designed to measure student ability to write law enforcement incident and investigative reports. The ability to write reports is a requirement of all law enforcement recruits in the state of Michigan and is a part of the state's mandatory basic training curriculum, which is promulgated by the Michigan Commission on Law Enforcement Standards (MCOLES). Recently, MCOLES conducted research to modernize its training and testing in the area of report writing. A structured validation process was used, which included: a) an examination of the job tasks of a patrol officer, b) input from content experts, c) a review of the professional research, and d) the creation of an instrument to measure student competency. The Rasch model addressed several measurement principles that were central to construct validity, which were particularly useful for assessing student performances. Based on the results of the report writing validation project, the state established a legitimate connectivity between the report writing standard and the essential job functions of a patrol officer in Michigan. The project also produced an authentic instrument for measuring minimum levels of report writing competency, which generated results that are valid for inferences of student ability. Ultimately, the state of Michigan must ensure the safety of its citizens by licensing only those patrol officers who possess a minimum level of core competency. Maintaining the validity and reliability of both the training and testing processes can ensure that the system for producing such candidates functions as intended.

  9. Development and validation of a questionnaire (the IRA-AGHN to assess teachers' knowledge of Attention Deficit Hyperactivity Disorder

    Directory of Open Access Journals (Sweden)

    Marian Soroa

    2014-10-01

    Full Text Available The purpose of this study was to develop a questionnaire, called IRA-AGHN, to assess infant and primary school teachers' knowledge of Attention Deficit Hyperactivity Disorder. The psychometric properties of this questionnaire were examined in a sample of 752 teachers aged between 20 and 64 years (M = 41.57; SD = 9.69. These teachers were employed at 84 randomly selected schools in the Autonomous Community of the Basque Country and Navarre. The factor validity, internal consistency, temporal stability, convergent validity and external validity of the instrument were all analysed. The results suggest that the IRA-AGHN is a valid and reliable measure for assessing teachers' knowledge of ADHD.

  10. Is the Veterans Specific Activity Questionnaire Valid to Assess Older Adults Aerobic Fitness?

    Science.gov (United States)

    de Carvalho Bastone, Alessandra; de Souza Moreira, Bruno; Teixeira, Claudine Patrícia; Dias, João Marcos Domingues; Dias, Rosângela Corrêa

    2016-01-01

    Aerobic fitness in older adults is related to health status, incident disability, nursing home admission, and all-cause mortality. The most accurate quantification of aerobic fitness, expressed as peak oxygen consumption in mL·kg·min, is the cardiorespiratory exercise test; however, it is not feasible in all settings and might offer risk to patients. The Veterans Specific Activity Questionnaire (VSAQ) is a 13-item self-administered symptom questionnaire that estimates aerobic fitness expressed in metabolic equivalents (METs) and has been validated to cardiovascular patients. The purpose of this study was to assess the validity and reliability of the VSAQ in older adults without specific health conditions. A methodological study with a cross-sectional design was conducted with 28 older adults (66-86 years). The VSAQ was administered on 3 occasions by 2 evaluators. Aerobic capacity in METs as measured by the VSAQ was compared with the METs found in an incremental shuttle walk test (ISWT) performed with a portable metabolic measurement system and with accelerometer data. The validity of the VSAQ was found to be moderate-to-good when compared with the METs and distance measured by the ISWT and with the moderate activity per day and steps per day obtained by accelerometry. The Bland-Altman graph analysis showed no values outside the limits of agreement, suggesting good precision between the METs estimated by questionnaire and the METs measured by the ISWT. Also, the intrarater and interrater reliabilities of the instrument were good. The results showed that the VSAQ is a valuable tool to assess the aerobic fitness of older adults.

  11. How to measure the impact of premenstrual symptoms? Development and validation of the German PMS-Impact Questionnaire.

    Science.gov (United States)

    Kues, Johanna N; Janda, Carolyn; Kleinstäuber, Maria; Weise, Cornelia

    2016-10-01

    With 75% of women of reproductive age affected, premenstrual symptoms are very common, ranging from emotional and cognitive to physical symptoms. Premenstrual Syndrome and Premenstrual Dysphoric Disorder can lead to substantial functional interference and psychological distress comparable to that of dysthymic disorders. The assessment of this impact is required as a part of the diagnostic procedure in the DSM-5. In the absence of a specific measure, the authors developed the PMS-Impact Questionnaire. A sample of 101 women reporting severe premenstrual complaints was assessed with the twenty-two items in the questionnaire during their premenstrual phase in an ongoing intervention study at the Philipps-University Marburg from August 2013 until January 2015. An exploratory factor analysis revealed a two-factor solution (labeled Psychological Impact and Functional Impact) with 18 items. A Cronbach's alpha of 0.90 for Psychological Impact and of 0.90 for Functional Impact indicated good reliability. Convergent construct validity was demonstrated by moderate to high correlations with the Pain Disability Index. Low correlations with the Big Five Inventory-10 indicated good divergent validity. The PMS-Impact Questionnaire was found to be a valid, reliable, and an economic measure to assess the impact of premenstrual symptoms. In future research, cross validations and confirmatory factor analyses should be conducted.

  12. Development and validation of a physics problem-solving assessment rubric

    Science.gov (United States)

    Docktor, Jennifer Lynn

    Problem solving is a complex process that is important for everyday life and crucial for learning physics. Although there is a great deal of effort to improve student problem solving throughout the educational system, there is no standard way to evaluate written problem solving that is valid, reliable, and easy to use. Most tests of problem solving performance given in the classroom focus on the correctness of the end result or partial results rather than the quality of the procedures and reasoning leading to the result, which gives an inadequate description of a student's skills. A more detailed and meaningful measure is necessary if different curricular materials or pedagogies are to be compared. This measurement tool could also allow instructors to diagnose student difficulties and focus their coaching. It is important that the instrument be applicable to any problem solving format used by a student and to a range of problem types and topics typically used by instructors. Typically complex processes such as problem solving are assessed by using a rubric, which divides a skill into multiple quasi-independent categories and defines criteria to attain a score in each. This dissertation describes the development of a problem solving rubric for the purpose of assessing written solutions to physics problems and presents evidence for the validity, reliability, and utility of score interpretations on the instrument.

  13. Validation of the spiritual distress assessment tool in older hospitalized patients

    Directory of Open Access Journals (Sweden)

    Monod Stefanie

    2012-03-01

    Full Text Available Abstract Background The Spiritual Distress Assessment Tool (SDAT is a 5-item instrument developed to assess unmet spiritual needs in hospitalized elderly patients and to determine the presence of spiritual distress. The objective of this study was to investigate the SDAT psychometric properties. Methods This cross-sectional study was performed in a Geriatric Rehabilitation Unit. Patients (N = 203, aged 65 years and over with Mini Mental State Exam score ≥ 20, were consecutively enrolled over a 6-month period. Data on health, functional, cognitive, affective and spiritual status were collected upon admission. Interviews using the SDAT (score from 0 to 15, higher scores indicating higher distress were conducted by a trained chaplain. Factor analysis, measures of internal consistency (inter-item and item-to-total correlations, Cronbach α, and reliability (intra-rater and inter-rater were performed. Criterion-related validity was assessed using the Functional Assessment of Chronic Illness Therapy-Spiritual well-being (FACIT-Sp and the question "Are you at peace?" as criterion-standard. Concurrent and predictive validity were assessed using the Geriatric Depression Scale (GDS, occurrence of a family meeting, hospital length of stay (LOS and destination at discharge. Results SDAT scores ranged from 1 to 11 (mean 5.6 ± 2.4. Overall, 65.0% (132/203 of the patients reported some spiritual distress on SDAT total score and 22.2% (45/203 reported at least one severe unmet spiritual need. A two-factor solution explained 60% of the variance. Inter-item correlations ranged from 0.11 to 0.41 (eight out of ten with P Conclusions SDAT has acceptable psychometrics properties and appears to be a valid and reliable instrument to assess spiritual distress in elderly hospitalized patients.

  14. Validation of the Danish version of the Patient Assessment of Care for Chronic Conditions questionnaire (PACIC)

    DEFF Research Database (Denmark)

    Sokolowski, Ineta; Maindal, Helle Terkildsen; Vedsted, Peter

    Objective: To evaluate the level of chronic care patients must be involved. The Danish version of the 20-item Patient Assessment of Care for Chronic Conditions PACIC questionnaire consisting of 5 scales and an overall summary score measuring patient reported assessment of structured chronic care ...... the same questionnaire is constructed and applied to different countries with diverse cultural backgrounds and health care systems. It is decisive, that translated questionnaires are validated in country they are used.......Objective: To evaluate the level of chronic care patients must be involved. The Danish version of the 20-item Patient Assessment of Care for Chronic Conditions PACIC questionnaire consisting of 5 scales and an overall summary score measuring patient reported assessment of structured chronic care...... interitem correlation), item-rest correlations. Model fit from confirmatory factor analysis (CFA). Results: We present the psychometric properties of the questionnaire and the first results evaluating chronic care in Danish people with diabetes. Conclusions: The complexity of validation is greater when...

  15. A Valid and Reliable Tool to Assess Nursing Students` Clinical Performance

    OpenAIRE

    Mehrnoosh Pazargadi; Tahereh Ashktorab; Sharareh Khosravi; Hamid Alavi majd

    2013-01-01

    Background: The necessity of a valid and reliable assessment tool is one of the most repeated issues in nursing students` clinical evaluation. But it is believed that present tools are not mostly valid and can not assess students` performance properly.Objectives: This study was conducted to design a valid and reliable assessment tool for evaluating nursing students` performance in clinical education.Methods: In this methodological study considering nursing students` performance definition; th...

  16. Validation of a Behavioral Approach for Measuring Saccades in Parkinson's Disease.

    Science.gov (United States)

    Turner, Travis H; Renfroe, Jenna B; Duppstadt-Delambo, Amy; Hinson, Vanessa K

    2017-01-01

    Speed and control of saccades are related to disease progression and cognitive functioning in Parkinson's disease (PD). Traditional eye-tracking complexities encumber application for individual evaluations and clinical trials. The authors examined psychometric properties of standalone tasks for reflexive prosaccade latency, volitional saccade initiation, and saccade inhibition (antisaccade) in a heterogeneous sample of 65 PD patients. Demographics had minimal impact on task performance. Thirty-day test-retest reliability estimates for behavioral tasks were acceptable and similar to traditional eye tracking. Behavioral tasks demonstrated concurrent validity with traditional eye-tracking measures; discriminant validity was less clear. Saccade initiation and inhibition discriminated PD patients with cognitive impairment. The present findings support further development and use of the behavioral tasks for assessing latency and control of saccades in PD.

  17. Validation of hand and foot anatomical feature measurements from smartphone images

    Science.gov (United States)

    Amini, Mohammad; Vasefi, Fartash; MacKinnon, Nicholas

    2018-02-01

    A smartphone mobile medical application, previously presented as a tool for individuals with hand arthritis to assess and monitor the progress of their disease, has been modified and expanded to include extraction of anatomical features from the hand (joint/finger width, and angulation) and foot (length, width, big toe angle, and arch height index) from smartphone camera images. Image processing algorithms and automated measurements were validated by performing tests on digital hand models, rigid plastic hand models, and real human hands and feet to determine accuracy and reproducibility compared to conventional measurement tools such as calipers, rulers, and goniometers. The mobile application was able to provide finger joint width measurements with accuracy better than 0.34 (+/-0.25) millimeters. Joint angulation measurement accuracy was better than 0.50 (+/-0.45) degrees. The automatically calculated foot length accuracy was 1.20 (+/-1.27) millimeters and the foot width accuracy was 1.93 (+/-1.92) millimeters. Hallux valgus angle (used in assessing bunions) accuracy was 1.30 (+/-1.29) degrees. Arch height index (AHI) measurements had an accuracy of 0.02 (+/-0.01). Combined with in-app documentation of symptoms, treatment, and lifestyle factors, the anatomical feature measurements can be used by both healthcare professionals and manufacturers. Applications include: diagnosing hand osteoarthritis; providing custom finger splint measurements; providing compression glove measurements for burn and lymphedema patients; determining foot dimensions for custom shoe sizing, insoles, orthotics, or foot splints; and assessing arch height index and bunion treatment effectiveness.

  18. Policy and Validity Prospects for Performance-Based Assessment.

    Science.gov (United States)

    Baker, Eva L.; And Others

    1994-01-01

    This article describes performance-based assessment as expounded by its proponents, comments on these conceptions, reviews evidence regarding the technical quality of performance-based assessment, and considers its validity under various policy options. (JDD)

  19. COCOA: A New Validated Instrument to Assess Medical Students' Attitudes towards Older Adults

    Science.gov (United States)

    Hollar, David; Roberts, Ellen; Busby-Whitehead, Jan

    2011-01-01

    This study tested the reliability and validity of the Carolina Opinions on Care of Older Adults (COCOA) survey compared with the Geriatric Assessment Survey (GAS). Participants were first year medical students (n = 160). A Linear Structural Relations (LISREL) measurement model for COCOA had a moderately strong fit that was significantly better…

  20. Toward a Common Language for Measuring Patient Mobility in the Hospital: Reliability and Construct Validity of Interprofessional Mobility Measures.

    Science.gov (United States)

    Hoyer, Erik H; Young, Daniel L; Klein, Lisa M; Kreif, Julie; Shumock, Kara; Hiser, Stephanie; Friedman, Michael; Lavezza, Annette; Jette, Alan; Chan, Kitty S; Needham, Dale M

    2018-02-01

    The lack of common language among interprofessional inpatient clinical teams is an important barrier to achieving inpatient mobilization. In The Johns Hopkins Hospital, the Activity Measure for Post-Acute Care (AM-PAC) Inpatient Mobility Short Form (IMSF), also called "6-Clicks," and the Johns Hopkins Highest Level of Mobility (JH-HLM) are part of routine clinical practice. The measurement characteristics of these tools when used by both nurses and physical therapists for interprofessional communication or assessment are unknown. The purposes of this study were to evaluate the reliability and minimal detectable change of AM-PAC IMSF and JH-HLM when completed by nurses and physical therapists and to evaluate the construct validity of both measures when used by nurses. A prospective evaluation of a convenience sample was used. The test-retest reliability and the interrater reliability of AM-PAC IMSF and JH-HLM for inpatients in the neuroscience department (n = 118) of an academic medical center were evaluated. Each participant was independently scored twice by a team of 2 nurses and 1 physical therapist; a total of 4 physical therapists and 8 nurses participated in reliability testing. In a separate inpatient study protocol (n = 69), construct validity was evaluated via an assessment of convergent validity with other measures of function (grip strength, Katz Activities of Daily Living Scale, 2-minute walk test, 5-times sit-to-stand test) used by 5 nurses. The test-retest reliability values (intraclass correlation coefficients) for physical therapists and nurses were 0.91 and 0.97, respectively, for AM-PAC IMSF and 0.94 and 0.95, respectively, for JH-HLM. The interrater reliability values (intraclass correlation coefficients) between physical therapists and nurses were 0.96 for AM-PAC IMSF and 0.99 for JH-HLM. Construct validity (Spearman correlations) ranged from 0.25 between JH-HLM and right-hand grip strength to 0.80 between AM-PAC IMSF and the Katz Activities of

  1. Assessment of Irrational Beliefs: The Question of Discriminant Validity.

    Science.gov (United States)

    Smith, Timothy W.; Zurawski, Raymond M.

    1983-01-01

    Evaluated discriminant validity in frequently used measures of irrational beliefs relative to measures of trait anxiety in college students (N=142). Results showed discriminant validity in the Rational Behavior Inventory but not in the Irrational Beliefs Test and correlated cognitive rather than somatic aspects of trait anxiety with both measures.…

  2. Cross-validation pitfalls when selecting and assessing regression and classification models.

    Science.gov (United States)

    Krstajic, Damjan; Buturovic, Ljubomir J; Leahy, David E; Thomas, Simon

    2014-03-29

    We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error.

  3. Cross-cultural validation of Cancer Communication Assessment Tool in Korea.

    Science.gov (United States)

    Shin, Dong Wook; Shin, Jooyeon; Kim, So Young; Park, Boram; Yang, Hyung-Kook; Cho, Juhee; Lee, Eun Sook; Kim, Jong Heun; Park, Jong-Hyock

    2015-02-01

    Communication between cancer patients and caregivers is often suboptimal. The Cancer Communication Assessment Tool for Patient and Families (CCAT-PF) is a unique tool developed to measure congruence in patient-family caregiver communication employing a dyadic approach. We aimed to examine the cross-cultural applicability of the CCAT in the Korean healthcare setting. Linguistic validation of the CCAT-PF was performed through a standard forward-backward translation process. Psychometric validation was performed with 990 patient-caregiver dyads recruited from 10 cancer centers. Mean scores of CCAT-P and CCAT-F were similar at 44.8 for both scales. Mean CCAT-PF score was 23.7 (8.66). Concordance of each items between patients and caregivers was low (weighted kappa values communication congruence between cancer patient and family caregivers. Copyright © 2014 John Wiley & Sons, Ltd.

  4. Cultural Orientations Framework (COF) Assessment Questionnaire in Cross-Cultural Coaching: A Cross-Validation with Wave Focus Styles

    OpenAIRE

    Rojon, C; McDowall, A

    2010-01-01

    This paper outlines a cross-validation of the Cultural Orientations Framework assessment questionnaire\\ud (COF, Rosinski, 2007; a new tool designed for cross-cultural coaching) with the Saville Consulting\\ud Wave Focus Styles questionnaire (Saville Consulting, 2006; an existing validated measure of\\ud occupational personality), using data from UK and German participants (N = 222). The convergent and\\ud divergent validity of the questionnaire was adequate. Contrary to previous findings which u...

  5. Psycho-oncology assessment in Chinese populations: a systematic review of quality of life and psychosocial measures.

    Science.gov (United States)

    Hyde, M K; Chambers, S K; Shum, D; Ip, D; Dunn, J

    2016-09-01

    This systematic review describes psychosocial and quality of life (QOL) measures used in psycho-oncology research with cancer patients and caregivers in China. Medline and PsycINFO databases were searched (1980-2014). Studies reviewed met the following criteria: English language; peer-reviewed; sampled Chinese cancer patients/caregivers; developed, validated or assessed psychometric properties of psychosocial or QOL outcome measures; and reported validation data. The review examined characteristics of measures and participants, translation and cultural adaptation processes and psychometric properties of the measures. Ninety five studies met review criteria. Common characteristics of studies reviewed were they: assessed primarily QOL measures, sampled patients with breast, colorectal, or head and neck cancer, and validated existing measures (>80%) originating in North America or Europe. Few studies reported difficulties translating measures. Regarding psychometric properties of the measures >50% of studies reported subscale reliabilities adaptation and psychometric testing of psychosocial measures is needed. Developing support structures for translating and validating psychosocial measures would enable this and ensure Chinese psycho-oncology clinical practice and research keeps pace with international focus on patient reported outcome measures and data management. © 2015 John Wiley & Sons Ltd.

  6. Development and Validation of an Instrument for Assessing Patient Experience of Chronic Illness Care

    Directory of Open Access Journals (Sweden)

    José Joaquín Mira

    2016-08-01

    Full Text Available Introduction: The experience of chronic patients with the care they receive, fuelled by the focus on patient-centeredness and the increasing evidence on its positive relation with other dimensions of quality, is being acknowledged as a key element in improving the quality of care. There are a dearth of accepted tools and metrics to assess patient experience from the patient’s perspective that have been adapted to the new chronic care context: continued, systemic, with multidisciplinary teams and new technologies. Methods: Development and validation of a scale conducting a literature review, expert panel, pilot and field studies with 356 chronic primary care patients, to assess content and face validities and reliability. Results: IEXPAC is an 11+1 item scale with adequate metric properties measured by Alpha Chronbach, Goodness of fit index, and satisfactory convergence validity around three factors named: productive interactions, new relational model and person’s self-management. Conclusions: IEXPAC allows measurement of the patient experience of chronic illness care. Together with other indicators, IEXPAC can determine the quality of care provided according to the Triple Aim framework, facilitating health systems reorientation towards integrated patient-centred care.

  7. The validity and reliability of the Functional Strength Measurement (FSM) in children with intellectual disabilities.

    Science.gov (United States)

    Aertssen, W F M; Steenbergen, B; Smits-Engelsman, B C M

    2018-06-07

    There is lack of valid and reliable field-based tests for assessing functional strength in young children with mild intellectual disabilities (IDs). The aim of this study was to investigate the test-retest reliability and construct validity of the Functional Strength Measurement in children with ID (FSM-ID). Fifty-two children with mild ID (40 boys and 12 girls, mean age 8.48 years, SD = 1.48) were tested with the FSM. Test-retest reliability (n = 32) was examined by a two-way interclass correlation coefficient for agreement (ICC 2.1A). Standard error of measurement and smallest detectable change were calculated. Construct validity was determined by calculating correlations between the FSM-ID and handheld dynamometry (HHD) (convergent validity), FSM-ID, FSM-ID and subtest strength of the Bruininks-Oseretsky test of motor proficiency - second edition (BOT-2) (convergent validity) and the FSM-ID and balance subtest of the BOT-2 (discriminant validity). Test-retest reliability ICC ranged 0.89-0.98. Correlation between the items of the FSM-ID and HHD ranged 0.39-0.79 and between FSM-ID and BOT-2 (strength items) 0.41-0.80. Correlation between items of the FSM-ID and BOT-2 (balance items) ranged 0.41-0.70. The FSM-ID showed good test-retest reliability and good convergent validity with the HHD and BOT-2 subtest strength. The correlations assessing discriminant validity were higher than expected. Poor levels of postural control and core stability in children with mild IDs may be the underlying factor of those higher correlations. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.

  8. Psychometric properties and validation of the Italian version of the Family Assessment Measure Third Edition - Short Version - in a nonclinical sample.

    Science.gov (United States)

    Pellerone, Monica; Ramaci, Tiziana; Parrello, Santa; Guariglia, Paola; Giaimo, Flavio

    2017-01-01

    Family functioning plays an important role in developing and maintaining dysfunctional behaviors, especially during adolescence. The lack of indicators of family functioning, as determinants of personal and interpersonal problems, represents an obstacle to the activities aimed at developing preventive and intervention strategies. The Process Model of Family Functioning provides a conceptual framework organizing and integrating various concepts into a comprehensive family assessment; this model underlines that through the process of task accomplishment, each family meets objectives central to its life as a group. The Family Assessment Measure Third Edition (FAM III), based on the Process Model of Family Functioning, is among the most frequently used self-report instruments to measure family functioning. The present study aimed to evaluate the psychometric properties of the Italian version of the Family Assessment Measure Third Edition - Short Version (Brief FAM-III). It consists of three modules: General Scale, which evaluates the family as a system; Dyadic Relationships Scale, which examines how each family member perceives his/her relationship with another member; and Self-Rating Scale, which indicates how each family member is perceived within the nucleus. The developed Brief FAM-III together with the Family Assessment Device were administered to 484 subjects, members of 162 Italian families, formed of 162 fathers aged between 35 and 73 years; 162 mothers aged between 34 and 69 years; and 160 children aged between 12 and 35 years. Correlation, paired-sample t -test, and reliability analyses were carried out. General item analysis shows good indices of reliability with Cronbach's α coefficients equal to 0.96. The Brief FAM-III has satisfactory internal consistency, with Cronbach's α equal to 0.90 for General Scale, 0.94 for Dyadic Relationships Scale, and 0.88 for the Self-Rating Scale. The Brief FAM-III can be a psychometrically reliable and valid measure for

  9. Validity and reliability of an instrumented leg-extension machine for measuring isometric muscle strength of the knee extensors.

    Science.gov (United States)

    Ruschel, Caroline; Haupenthal, Alessandro; Jacomel, Gabriel Fernandes; Fontana, Heiliane de Brito; Santos, Daniela Pacheco dos; Scoz, Robson Dias; Roesler, Helio

    2015-05-20

    Isometric muscle strength of knee extensors has been assessed for estimating performance, evaluating progress during physical training, and investigating the relationship between isometric and dynamic/functional performance. To assess the validity and reliability of an adapted leg-extension machine for measuring isometric knee extensor force. Validity (concurrent approach) and reliability (test and test-retest approach) study. University laboratory. 70 healthy men and women aged between 20 and 30 y (39 in the validity study and 31 in the reliability study). Intraclass correlation coefficient (ICC) values calculated for the maximum voluntary isometric torque of knee extensors at 30°, 60°, and 90°, measured with the prototype and with an isokinetic dynamometer (ICC2,1, validity study) and measured with the prototype in test and retest sessions, scheduled from 48 h to 72 h apart (ICC1,1, reliability study). In the validity analysis, the prototype showed good agreement for measurements at 30° (ICC2,1 = .75, SEM = 18.2 Nm) and excellent agreement for measurements at 60° (ICC2,1 = .93, SEM = 9.6 Nm) and at 90° (ICC2,1 = .94, SEM = 8.9 Nm). Regarding the reliability analysis, between-days' ICC1,1 were good to excellent, ranging from .88 to .93. Standard error of measurement and minimal detectable difference based on test-retest ranged from 11.7 Nm to 18.1 Nm and 32.5 Nm to 50.1 Nm, respectively, for the 3 analyzed knee angles. The analysis of validity and repeatability of the prototype for measuring isometric muscle strength has shown to be good or excellent, depending on the knee joint angle analyzed. The new instrument, which presents a relative low cost and easiness of transportation when compared with an isokinetic dynamometer, is valid and provides consistent data concerning isometric strength of knee extensors and, for this reason, can be used for practical, clinical, and research purposes.

  10. Reliability and Validity of the Footprint Assessment Method Using Photoshop CS5 Software in Young People with Down Syndrome.

    Science.gov (United States)

    Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Rey-Abella, Ferran; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam

    2016-05-01

    People with Down syndrome present skeletal abnormalities in their feet that can be analyzed by commonly used gold standard indices (the Hernández-Corvo index, the Chippaux-Smirak index, the Staheli arch index, and the Clarke angle) based on footprint measurements. The use of Photoshop CS5 software (Adobe Systems Software Ireland Ltd, Dublin, Ireland) to measure footprints has been validated in the general population. The present study aimed to assess the reliability and validity of this footprint assessment technique in the population with Down syndrome. Using optical podography and photography, 44 footprints from 22 patients with Down syndrome (11 men [mean ± SD age, 23.82 ± 3.12 years] and 11 women [mean ± SD age, 24.82 ± 6.81 years]) were recorded in a static bipedal standing position. A blinded observer performed the measurements using a validated manual method three times during the 4-month study, with 2 months between measurements. Test-retest was used to check the reliability of the Photoshop CS5 software measurements. Validity and reliability were obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed very good values for the Photoshop CS5 method (ICC, 0.982-0.995). Validity testing also found no differences between the techniques (ICC, 0.988-0.999). The Photoshop CS5 software method is reliable and valid for the study of footprints in young people with Down syndrome.

  11. Cross-cultural validation of health literacy measurement tools in Italian oncology patients.

    Science.gov (United States)

    Zotti, Paola; Cocchi, Simone; Polesel, Jerry; Cipolat Mis, Chiara; Bragatto, Donato; Cavuto, Silvio; Conficconi, Alice; Costanzo, Carla; De Giorgi, Melissa; Drace, Christina A; Fiorini, Federica; Gangeri, Laura; Lisi, Andrea; Martino, Rosalba; Mosconi, Paola; Paradiso, Angelo; Ravaioli, Valentina; Truccolo, Ivana; De Paoli, Paolo

    2017-06-19

    The aim of this study was to assess the psychometric characteristics of four Health Literacy (HL) measurement tools, viz. Newest Vital Sign (NVS), Short Test of Functional Health Literacy in Adults (STOFHLA), Single Item Literacy Screener (SILS) and Single question on Self-rated Reading Ability (SrRA) among Italian oncology patients. The original version of the tools were translated from the English language into Italian using a standard forward-backward procedure and according to internationally recognized good practices. Their internal consistency (reliability) and validity (construct, convergent and discriminative) were tested in a sample of 245 consecutive cancer patients recruited from seven Italian health care centers. The internal consistency of the STOFHLA-I was Chronbach's α=0.96 and that of NVS-I was α=0.74. The STOFHLA-I, NVS-I, SILS-I and SrRA-I scores were in a good relative correlation and in all tools the discriminative known-group validity was confirmed. The reliability and validity values were similar to those obtained from other cultural context studies. The psychometric characteristics of the Italian version of NVS, STHOFLA, SILS and SrRA were found to be good, with satisfactory reliability and validity. This indicates that they could be used as a screening tool in Italian patients. Moreover, the use of the same cross-cultural tools, validated in different languages, is essential for implementing multicenter studies to measure and compare the functional HL levels across countries.

  12. Organizational readiness for implementing change: a psychometric assessment of a new measure.

    Science.gov (United States)

    Shea, Christopher M; Jacobs, Sara R; Esserman, Denise A; Bruce, Kerry; Weiner, Bryan J

    2014-01-10

    Organizational readiness for change in healthcare settings is an important factor in successful implementation of new policies, programs, and practices. However, research on the topic is hindered by the absence of a brief, reliable, and valid measure. Until such a measure is developed, we cannot advance scientific knowledge about readiness or provide evidence-based guidance to organizational leaders about how to increase readiness. This article presents results of a psychometric assessment of a new measure called Organizational Readiness for Implementing Change (ORIC), which we developed based on Weiner's theory of organizational readiness for change. We conducted four studies to assess the psychometric properties of ORIC. In study one, we assessed the content adequacy of the new measure using quantitative methods. In study two, we examined the measure's factor structure and reliability in a laboratory simulation. In study three, we assessed the reliability and validity of an organization-level measure of readiness based on aggregated individual-level data from study two. In study four, we conducted a small field study utilizing the same analytic methods as in study three. Content adequacy assessment indicated that the items developed to measure change commitment and change efficacy reflected the theoretical content of these two facets of organizational readiness and distinguished the facets from hypothesized determinants of readiness. Exploratory and confirmatory factor analysis in the lab and field studies revealed two correlated factors, as expected, with good model fit and high item loadings. Reliability analysis in the lab and field studies showed high inter-item consistency for the resulting individual-level scales for change commitment and change efficacy. Inter-rater reliability and inter-rater agreement statistics supported the aggregation of individual level readiness perceptions to the organizational level of analysis. This article provides evidence in

  13. The development and validation of a custom built device for assessing frontal knee joint laxity.

    Science.gov (United States)

    Ismail, Shiek Abdullah; Simic, Milena; Clarke, Jillian L; Lopes, Thiago Jambo Alves; Pappas, Evangelos

    2017-12-01

    This study reports the development and validation of a quantitative technique of assessing frontal knee joint laxity through a custom built device named KLICP. The objectives of this study were to determine: (i) the intra- and inter-rater reliability and (ii) the validity of the device when compared to real time ultrasound. Twenty-five participants had their frontal knee joint laxity assessed by the KLICP, by manual varus/valgus tests and by ultrasound. Two raters independently assessed laxity manually by three repeated measurements, repeated at least 48h later. Results were validated by comparing them to the medial and lateral joint space opening measured by the ultrasound. Intraclass correlation coefficients and standard error of measurement reliability were calculated. Pearson's correlation coefficients were calculated to determine the correlation between the KLICP and the joint space. Intra-rater reliability (intra-session) for each rater was good on both sessions (0.91-0.98), intra-rater reliability (inter-sessions) was moderate to good (0.62-0.87), and inter-rater reliability (intra-session) was good (0.75-0.80). There is low agreement for intra-rater (inter-session) and for inter-rater (intra-session) reliability. The KLICP measurement has a significant positive fair to moderate correlation to the ultrasound measurement at the left (r: 0.61, p: 0.01) and right (r: 0.48, p: 0.02) knee in the valgus direction and at the left (r: 0.51, p: 0.01) and right (r: 0.39, p: 0.05) knee in the varus direction. There is low agreement between the KLICP and the RTU. Reliability and agreement was good only when measured for intra-rater, within session. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Development and validation of the Treatment Related Impact Measure of Weight (TRIM-Weight

    Directory of Open Access Journals (Sweden)

    Lessard Suzanne

    2010-02-01

    Full Text Available Abstract Background The use of prescription anti-obesity medication (AOM is becoming increasingly common as treatment options grow and become more accessible. However, AOM may not be without a wide range of potentially negative impacts on patient functioning and well being. The Treatment Related Impact Measure (TRIM-Weight is an obesity treatment-specific patient reported outcomes (PRO measure designed to assess the key impacts of prescription anti-obesity medication. This paper will present the validation findings for the TRIM-Weight. Methods The online validation battery survey was administered in four countries (the U.S., U.K., Australia, and Canada. Eligible subjects were over age eighteen, currently taking a prescription AOM and were currently or had been obese during their life. Validation analyses were conducted according to an a priori statistical analysis plan. Item level psychometric and conceptual criteria were used to refine and reduce the preliminary item pool and factor analysis to identify structural domains was performed. Reliability and validity testing was then performed and the minimally importance difference (MID explored. Results Two hundred and eight subjects completed the survey. Twenty-one of the 43 items were dropped and a five-factor structure was achieved: Daily Life, Weight Management, Treatment Burden, Experience of Side Effects, and Psychological Health. A-priori criteria for internal consistency and test-retest coefficients for the total score and all five subscales were met. All pre-specified hypotheses for convergent and known group validity were also met with the exception of the domain of Daily Life (proven in an ad hoc analysis as well as the 1/2 standard deviation threshold for the MID. Conclusion The development and validation of the TRIM-Weight has been conducted according to well-defined principles for the creation of a PRO measure. Based on the evidence to date, the TRIM-Weight can be considered a brief

  15. Measuring walking within and outside the neighborhood in Chinese elders: reliability and validity

    Directory of Open Access Journals (Sweden)

    Cerin Ester

    2011-11-01

    Full Text Available Abstract Background Walking is a preferred, prevalent and recommended activity for aging populations and is influenced by the neighborhood built environment. To study this influence it is necessary to differentiate whether walking occurs within or outside of the neighborhood. The Neighborhood Physical Activity Questionnaire (NPAQ collects information on setting-specific physical activity, including walking, inside and outside one's neighborhood. While the NPAQ has shown to be a reliable measure in adults, its reliability in older adults is unknown. Additionally its validity and the influence of type of neighborhood on reliability and validity have yet to be explored. Methods The NPAQ walking component was adapted for Chinese speaking elders (NWQ-CS. Ninety-six Chinese elders, stratified by social economic status and neighborhood walkability, wore an accelerometer and completed a log of walks for 7 days. Following the collection of valid data the NWQ-CS was interviewer-administered. Fourteen to 20 days (average of 17 days later the NWQ-CS was re-administered. Test-retest reliability and validity of the NWQ-CS were assessed. Results Reliability and validity estimates did not differ with type of neighborhood. NWQ-CS measures of walking showed moderate to excellent reliability. Reliability was generally higher for estimates of weekly frequency than minutes of walking. Total weekly minutes of walking were moderately related to all accelerometry measures. Moderate-to-strong associations were found between the NWQ-CS and log-of-walks variables. The NWQ-CS yielded statistically significantly lower mean values of total walking, weekly minutes of walking for transportation and weekly frequency of walking for transportation outside the neighborhood than the log-of-walks. Conclusions The NWQ-CS showed measurement invariance across types of neighborhoods. It is a valid measure of walking for recreation and frequency of walking for transport. However, it may

  16. Instruments assessing attitudes toward or capability regarding self-management of osteoarthritis: a systematic review of measurement properties.

    Science.gov (United States)

    Eyles, J P; Hunter, D J; Meneses, S R F; Collins, N J; Dobson, F; Lucas, B R; Mills, K

    2017-08-01

    To make a recommendation on the "best" instrument to assess attitudes toward and/or capabilities regarding self-management of osteoarthritis (OA) based on available measurement property evidence. Electronic searches were performed in MEDLINE, EMBASE, CINAHL and PsychINFO (inception to 27 December 2016). Two reviewers independently rated measurement properties using the Consensus-based Standards for the selection of Health Measurement Instruments (COSMIN) 4-point scale. Best evidence synthesis was determined by considering COSMIN ratings for measurement property results and the level of evidence available for each measurement property of each instrument. Eight studies out of 5653 publications met the inclusion criteria, with eight instruments identified for evaluation: Multidimensional Health Locus of Control (MHLC), Perceived Behavioural Control (PBC), Patient Activation Measure (PAM), Educational Needs Assessment (ENAT), Stages of Change Questionnaire in Osteoarthritis (SCQOA), Effective Consumer Scale (EC-17) and Perceived Efficacy in Patient-Physician Interactions five item (PEPPI-5) and ten item scales. Measurement properties assessed for these instruments included internal consistency (k = 8), structural validity (k = 8), test-retest reliability (k = 2), measurement error (k = 1), hypothesis testing (k = 3) and cross-cultural validity (k = 3). No information was available for content validity, responsiveness or minimal important change (MIC)/minimal important difference (MID). The Dutch PEPPI-5 demonstrated the best measurement property evidence; strong evidence for internal consistency and structural validity but limited evidence for reliability and construct validity. Although PEPPI-5 was identified as having the best measurement properties, overall there is a poor level of evidence currently available concerning measurement properties of instruments to assess attitudes toward and/or capabilities regarding osteoarthritis self-management. Further

  17. A content validity approach to creating an end-user computer skill assessment tool

    Directory of Open Access Journals (Sweden)

    Shirley Gibbs

    Full Text Available Practical assessment instruments are commonly used in the workplace and educational environments to assess a person\\'s level of digital literacy and end-user computer skill. However, it is often difficult to find statistical evidence of the actual validity of instruments being used. To ensure that the correct factors are being assessed for a particular purpose it is necessary to undertake some type of psychometric testing, and the first step is to study the content relevance of the measure. The purpose of this paper is to report on the rigorous judgment-quantification process using panels of experts in order to establish inter-rater reliability and agreement in the development of end-user instruments developed to measure workplace skills using spreadsheet and word-processing applications.

  18. The Validity of Two Education Requirement Measures

    Science.gov (United States)

    van der Meer, Peter H.

    2006-01-01

    In this paper we investigate the validity of two education requirement measures. This is important because a key part of the ongoing discussion concerning overeducation is about measurement. Thanks to the Dutch Institute for Labour Studies, we have been given a unique opportunity to compare two education requirement measures: first, Huijgen's…

  19. Exploring sarcasm detection in Amyotrophic Lateral Sclerosis using ecologically valid measures

    Directory of Open Access Journals (Sweden)

    Mathew eStaios

    2013-05-01

    Full Text Available Amyotrophic lateral sclerosis is a rapidly progressive condition involving degeneration of both upper and lower motor neurons. Recent research suggests that a proportion of persons with ALS show a profile similar to that of FTD, with this group of ALS patients exhibiting social cognitive deficits. Although social cognitive deficits have been partially explored in ALS, research has yet to investigate such changes using ecologically valid measures. Therefore, this study aimed to further characterise the scope of social cognitive and emotion recognition deficits in non-demented ALS patients using an ecologically valid measure of social cognition. A sample of 35 ALS patients and 30 age-and-education matched controls were assessed using the Addenbrooke’s Cognitive Examination, the Brixton Spatial Anticipation Test, and The Awareness of Social Inference Test, where participants were required to discriminate between various emotions and decipher socially challenging scenarios enacted in video vignettes. Participants with ALS showed significant difficulties in recognising both sarcastic and paradoxical sarcastic statements, but not sincere statements, when compared to controls. After controlling for executive difficulties, ALS patients still displayed significant difficulties on tasks that assessed their comprehension of both sarcastic and paradoxical sarcastic statements. The inability to read social cues and make social inferences has the potential to place significant strain on familial/interpersonal relationships in ALS. The findings of this study highlight the importance of employing a broader range of neuropsychological assessment tools to aid in early detection of frontal lobe impairment in non-demented ALS patients.

  20. Exploring sarcasm detection in amyotrophic lateral sclerosis using ecologically valid measures

    Science.gov (United States)

    Staios, Mathew; Fisher, Fiona; Lindell, Annukka K.; Ong, Ben; Howe, Jim; Reardon, Katrina

    2013-01-01

    Amyotrophic lateral sclerosis (ALS) is a rapidly progressive condition involving degeneration of both upper and lower motor neurons. Recent research suggests that a proportion of persons with ALS show a profile similar to that of frontotemporal dementia (FTD), with this group of ALS patients exhibiting social cognitive deficits. Although social cognitive deficits have been partially explored in ALS, research has yet to investigate such changes using ecologically valid measures. Therefore, this study aimed to further characterize the scope of social cognitive and emotion recognition deficits in non-demented ALS patients using an ecologically valid measure of social cognition. A sample of 35 ALS patients and 30 age-and-education matched controls were assessed using the Addenbrooke's Cognitive Examination, the Brixton Spatial Anticipation Test, and The Awareness of Social Inference Test, where participants were required to discriminate between various emotions and decipher socially challenging scenarios enacted in video vignettes. Participants with ALS showed significant difficulties in recognizing both sarcastic and paradoxical sarcastic statements, but not sincere statements, when compared to controls. After controlling for executive difficulties, ALS patients still displayed significant difficulties on tasks that assessed their comprehension of both sarcastic and paradoxical sarcastic statements. The inability to read social cues and make social inferences has the potential to place significant strain on familial/interpersonal relationships in ALS. The findings of this study highlight the importance of employing a broader range of neuropsychological assessment tools to aid in early detection of frontal lobe impairment in non-demented ALS patients. PMID:23734113

  1. Validation of innovative technologies and strategies for regulatory safety assessment methods: challenges and opportunities.

    Science.gov (United States)

    Stokes, William S; Wind, Marilyn

    2010-01-01

    Advances in science and innovative technologies are providing new opportunities to develop test methods and strategies that may improve safety assessments and reduce animal use for safety testing. These include high throughput screening and other approaches that can rapidly measure or predict various molecular, genetic, and cellular perturbations caused by test substances. Integrated testing and decision strategies that consider multiple types of information and data are also being developed. Prior to their use for regulatory decision-making, new methods and strategies must undergo appropriate validation studies to determine the extent that their use can provide equivalent or improved protection compared to existing methods and to determine the extent that reproducible results can be obtained in different laboratories. Comprehensive and optimal validation study designs are expected to expedite the validation and regulatory acceptance of new test methods and strategies that will support improved safety assessments and reduced animal use for regulatory testing.

  2. Measurement issues in the sonographic assessment of tennis elbow.

    Science.gov (United States)

    Poltawski, Leon; Jayaram, Vijay; Watson, Tim

    2010-05-01

    Sonography is increasingly being used for assessment in tennis elbow research and clinical practice, but there are a lack of data regarding its validity, reliability, and responsiveness to change for this application. Studies using the modality were reviewed to establish current levels of evidence for these measurement properties. There is reasonable evidence regarding its validity for identifying tennis elbow tendinopathy, but a lack of data addressing its reliability and responsiveness. Practical issues affecting image quality are discussed, and recommendations for further investigation are suggested, to enhance the credible use of sonography with this debilitating condition.

  3. Development and validation of an instrument for rapidly assessing symptoms: the general symptom distress scale.

    Science.gov (United States)

    Badger, Terry A; Segrin, Chris; Meek, Paula

    2011-03-01

    Symptom assessment has increasingly focused on the evaluation of total symptom distress or burden rather than assessing only individual symptoms. The challenge for clinicians and researchers alike is to assess symptoms, and to determine the symptom distress associated with the symptoms and the patient's ability for symptom management without a lengthy and burdensome assessment process. The objective of this article was to discuss the psychometric evaluation of a brief general symptom distress scale (GSDS) developed to assess specific symptoms and how they rank in relation to each other, the overall symptom distress associated with the symptom schema, and provide an assessment of how well or poorly that symptom schema is managed. Results from a pilot study about the initial development of the GSDS with 76 hospitalized patients are presented, followed by a more complete psychometric evaluation of the GSDS using three samples of cancer patients (n=190) and their social network members, called partners in these studies (n=94). Descriptive statistics were used to describe the GSDS symptoms, symptom distress, and symptom management. Point biserial correlations indexed the associations between dichotomous symptoms and continuous measures, and conditional probabilities were used to illustrate the substantial comorbidities of this sample. Internal consistency was examined using the KR-20 coefficient, and test-retest reliability was examined. Construct validity and predictive validity also were examined. The GSDS demonstrated satisfactory internal consistency and test-retest reliability, and good construct validity and predictive validity. The total score on the GSDS, symptom distress, and symptom management correlated significantly with related constructs of depression, positive and negative affect, and general health. The GSDS was able to demonstrate its ability to distinguish between those with or without chronic illness, and was able to significantly predict scores on

  4. Quantifying frontal plane knee motion during single limb squats: reliability and validity of 2-dimensional measures.

    Science.gov (United States)

    Gwynne, Craig R; Curran, Sarah A

    2014-12-01

    Clinical assessment of lower limb kinematics during dynamic tasks may identify individuals who demonstrate abnormal movement patterns that may lead to etiology of exacerbation of knee conditions such as patellofemoral joint (PFJt) pain. The purpose of this study was to determine the reliability, validity and associated measurement error of a clinically appropriate two-dimensional (2-D) procedure of quantifying frontal plane knee alignment during single limb squats. Nine female and nine male recreationally active subjects with no history of PFJt pain had frontal plane limb alignment assessed using three-dimensional (3-D) motion analysis and digital video cameras (2-D analysis) while performing single limb squats. The association between 2-D and 3-D measures was quantified using Pearson's product correlation coefficients. Intraclass correlation coefficients (ICCs) were determined for within- and between-session reliability of 2-D data and standard error of measurement (SEM) was used to establish measurement error. Frontal plane limb alignment assessed with 2-D analysis demonstrated good correlation compared with 3-D methods (r = 0.64 to 0.78, p < 0.001). Within-session (0.86) and between-session ICCs (0.74) demonstrated good reliability for 2-D measures and SEM scores ranged from 2° to 4°. 2-D measures have good consistency and may provide a valid measure of lower limb alignment when compared to existing 3-D methods. Assessment of lower limb kinematics using 2-D methods may be an accurate and clinically useful alternative to 3-D motion analysis when identifying individuals who demonstrate abnormal movement patterns associated with PFJt pain. 2b.

  5. The reliability and validity of the informant AD8 by comparison with a series of cognitive assessment tools in primary healthcare.

    Science.gov (United States)

    Shaik, Muhammad Amin; Xu, Xin; Chan, Qun Lin; Hui, Richard Jor Yeong; Chong, Steven Shih Tsze; Chen, Christopher Li-Hsian; Dong, YanHong

    2016-03-01

    The validity and reliability of the informant AD8 in primary healthcare has not been established. Therefore, the present study examined the validity and reliability of the informant AD8 in government subsidized primary healthcare centers in Singapore. Eligible patients (≥60 years old) were recruited from primary healthcare centers and their informants received the AD8. Patient-informant dyads who agreed for further cognitive assessments received the Mini-Mental State Examination (MMSE), Montreal Cognitive Assessment (MoCA), Clinical Dementia Rating (CDR), and a locally validated formal neuropsychological battery at a research center in a tertiary hospital. 1,082 informants completed AD8 assessment at two primary healthcare centers. Of these, 309 patients-informant dyads were further assessed, of whom 243 (78.6%) were CDR = 0; 22 (7.1%) were CDR = 0.5; and 44 (14.2%) were CDR≥1. The mean administration time of the informant AD8 was 2.3 ± 1.0 minutes. The informant AD8 demonstrated good internal consistency (Cronbach's α = 0.85); inter-rater reliability (Intraclass Correlation Coefficient (ICC) = 0.85); and test-retest reliability (weighted κ = 0.80). Concurrent validity, as measured by the correlation between total AD8 scores and CDR global (R = 0.65, p validity, as measured by convergent validity (R ≥ 0.4) between individual items of AD8 with CDR and neuropsychological domains was acceptable. The informant AD8 demonstrated good concurrent and construct validity and is a reliable measure to detect cognitive dysfunction in primary healthcare.

  6. Spanish validation of the Negative Symptom Assessment-16 (NSA-16) in patients with schizophrenia.

    Science.gov (United States)

    Garcia-Alvarez, Leticia; Garcia-Portilla, María Paz; Saiz, Pilar Alejandra; Fonseca-Pedrero, Eduardo; Bobes-Bascaran, María Teresa; Gomar, Jesús; Muñiz, José; Bobes, Julio

    2018-04-05

    Negative symptoms are prevalent in schizophrenia and associated with a poorer outcome. Validated newer psychometric instruments could contribute to better assessment and improved treatment of negative symptoms. The Negative Symptom Assessment-16 (NSA-16) has been shown to have strong psychometric properties, but there is a need for validation in non-English languages. This study aimed to examine the psychometric properties of a Spanish version of the NSA-16 (Sp-NSA-16). Observational, cross-sectional validation study in a sample of 123 outpatients with schizophrenia. NSA-16, PANSS, HDRS, CGI-SCH and PSP. The results indicate appropriate psychometric properties, high internal consistency (Cronbach's alpha=0.86), convergent validity (PANSS negative scale, PANSS Marder Negative Factor and CGI-negative symptoms r values between 0.81 and 0.94) and divergent validity (PANSS positive scale and the HDRS r values between 0.10 and 0.34). In addition, the NSA-16 also exhibited discriminant validity (ROC curve=0.97, 95% CI=0.94 to 1.00; 94.3% sensitivity and 83.3% specificity). The Sp-NSA-16 is reliable and valid for measuring negative symptoms in patients with schizophrenia. This provides Spanish clinicians with a new tool for clinical practice and research. However, it is necessary to provide further information about its inter-rater reliability. Copyright © 2018 SEP y SEPB. Publicado por Elsevier España, S.L.U. All rights reserved.

  7. Validation of the Adolescent Concerns Measure (ACM): Evidence from Exploratory and Confirmatory Factor Analysis

    Science.gov (United States)

    Ang, Rebecca P.; Chong, Wan Har; Huan, Vivien S.; Yeo, Lay See

    2007-01-01

    This article reports the development and initial validation of scores obtained from the Adolescent Concerns Measure (ACM), a scale which assesses concerns of Asian adolescent students. In Study 1, findings from exploratory factor analysis using 619 adolescents suggested a 24-item scale with four correlated factors--Family Concerns (9 items), Peer…

  8. The foundations of measurement and assessment in medical education.

    Science.gov (United States)

    Tavakol, Mohsen; Dennick, Reg

    2017-10-01

    As a medical educator, you may be directly or indirectly involved in the quality of assessments. Measurement has a substantial role in developing the quality of assessment questions and student learning. The information provided by psychometric data can improve pedagogical issues in medical education. Through measurement we are able to assess the learning experiences of students. Standard setting plays an important role in assessing the performance quality of students as doctors in the future. Presentation of performance data for standard setters may contribute towards developing a credible and defensible pass mark. Validity and reliability of test scores are the most important factors for developing quality assessment questions. Analysis of the answers to individual questions provides useful feedback for assessment leads to improve the quality of each question, and hence make students' marks fair in terms of diversity and ethnicity. Item Characteristic Curves (ICC) can send signals to assessment leads to improve the quality of individual questions.

  9. Validation of French upper limb Erasmus modified Nottingham Sensory Assessment in stroke.

    Science.gov (United States)

    Villepinte, Claire; Catella, Emilie; Martin, Magali; Hidalgo, Sylvie; Téchené, Sabrina; Lebely, Claire; Castel-Lacanal, Evelyne; de Boissezon, Xavier; Chih, HuiJun; Gasq, David

    2018-04-13

    Somatosensory impairment of the upper limb (UL) occurs in approximately 50% of adults post-stroke, associated with loss of hand motor function, activity and participation. Measurement of UL sensory impairment is a component of rehabilitation contributing to the selection of sensorimotor techniques optimizing recovery and providing a prognostic estimate of UL function. To date, no standardized official French version of a measure of somatosensory impairment has been established. To develop and validate a French version of the Erasmus modified Nottingham Sensory Assessment somatosensory (EmNSA-SS) and stereognosis (EmNSA-ST) component for evaluating the UL among adults with stroke. This study is a single-center observational cross-sectional study. A French version of the EmNSA for UL was developed by forward-backward translation and cross-cultural adaptation. Fifty stroke patients were recruited to establish concurrent-criterion-related validity, internal consistency, intra- and inter-rater reproducibility with intracorrelation coefficients (ICCs) for reliability and the minimal detectable change with 95% confidence interval (MDC95) for agreement, as well as ceiling and floor effects. Criterion validity was assessed against the Fugl-Meyer Assessment-Sensory (FMA-S) for the UL. The median (range) EmNSA-SS score was 41.5 (1-44). The Spearman rank correlation coefficient between EmNSA-SS and FMA-S total scores was moderate (rho=0.74, P<0.001). The EmNSA-SS/ST internal consistency was adequate across subscales; with Cronbach α ranging from 0.82-0.96. For the EmNSA-SS total score, intra- and inter-rater reliability was excellent (ICC=0.92 in both cases), with MDC95 of 12.3 and 14.6, respectively. EmNSA-SS/ST total scores demonstrated no ceiling or floor effects. The French EmNSA is a valid and reproducible scale that can be used for comprehensive and accurate assessment of somatosensory modalities in adults post-stroke. Taking less than 30min to administer, the

  10. Climate for Innovation impacts on Adaptive Performance. Conceptualization, Measurement, and Validation

    Directory of Open Access Journals (Sweden)

    Stańczyk Sylwia

    2017-05-01

    Full Text Available The main objective of this paper was to examine the relationship between organizational climate for innovation and adaptive performance. The study was carried out in business organisations in Poland (N=387, representing variety of industries. The Cimate for Innovation measure and Individual Adaptive Performance measure was adopted from previous studies. The results of presented research point out that certain measurements of the organizational climate for innovation are interrelated to adaptive performance, especially supervisory encouragement. The present study discusses some aspects concerning the adaptation of existing instruments and measurements. On the basis of the research presented we indicate that, in general, the adaptation, of the mesearuments were relatively effective. The questionnaire was assessed as to be valid in terms of content for the reseraching CI and AP aspects in Poland.

  11. Assessing medical professionalism: A systematic review of instruments and their measurement properties

    Science.gov (United States)

    Li, Honghe; Liu, Yang; Wen, Deliang

    2017-01-01

    Background Over the last three decades, various instruments were developed and employed to assess medical professionalism, but their measurement properties have yet to be fully evaluated. This study aimed to systematically evaluate these instruments’ measurement properties and the methodological quality of their related studies within a universally acceptable standardized framework and then provide corresponding recommendations. Methods A systematic search of the electronic databases PubMed, Web of Science, and PsycINFO was conducted to collect studies published from 1990–2015. After screening titles, abstracts, and full texts for eligibility, the articles included in this study were classified according to their respective instrument’s usage. A two-phase assessment was conducted: 1) methodological quality was assessed by following the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) checklist; and 2) the quality of measurement properties was assessed according to Terwee’s criteria. Results were integrated using best-evidence synthesis to look for recommendable instruments. Results After screening 2,959 records, 74 instruments from 80 existing studies were included. The overall methodological quality of these studies was unsatisfactory, with reasons including but not limited to unknown missing data, inadequate sample sizes, and vague hypotheses. Content validity, cross-cultural validity, and criterion validity were either unreported or negative ratings in most studies. Based on best-evidence synthesis, three instruments were recommended: Hisar’s instrument for nursing students, Nurse Practitioners’ Roles and Competencies Scale, and Perceived Faculty Competency Inventory. Conclusion Although instruments measuring medical professionalism are diverse, only a limited number of studies were methodologically sound. Future studies should give priority to systematically improving the performance of existing

  12. A comparison between the original and Tablet-based Symbol Digit Modalities Test in patients with schizophrenia: Test-retest agreement, random measurement error, practice effect, and ecological validity.

    Science.gov (United States)

    Tang, Shih-Fen; Chen, I-Hui; Chiang, Hsin-Yu; Wu, Chien-Te; Hsueh, I-Ping; Yu, Wan-Hui; Hsieh, Ching-Lin

    2017-11-27

    We aimed to compare the test-retest agreement, random measurement error, practice effect, and ecological validity of the original and Tablet-based Symbol Digit Modalities Test (T-SDMT) over five serial assessments, and to examine the concurrent validity of the T-SDMT in patients with schizophrenia. Sixty patients with chronic schizophrenia completed five serial assessments (one week apart) of the SDMT and T-SDMT and one assessment of the Activities of Daily Living Rating Scale III at the first time point. Both measures showed high test-retest agreement, similar levels of random measurement error over five serial assessments. Moreover, the practice effects of the two measures did not reach a plateau phase after five serial assessments in young and middle-aged participants. Nevertheless, only the practice effect of the T-SDMT became trivial after the first assessment. Like the SDMT, the T-SDMT had good ecological validity. The T-SDMT also had good concurrent validity with the SDMT. In addition, only the T-SDMT had discriminative validity to discriminate processing speed in young and middle-aged participants. Compared to the SDMT, the T-SDMT had overall slightly better psychometric properties, so it can be an alternative measure to the SDMT for assessing processing speed in patients with schizophrenia. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study.

    Science.gov (United States)

    Mokkink, Lidwine B; Terwee, Caroline B; Patrick, Donald L; Alonso, Jordi; Stratford, Paul W; Knol, Dirk L; Bouter, Lex M; de Vet, Henrica C W

    2010-05-01

    Aim of the COSMIN study (COnsensus-based Standards for the selection of health status Measurement INstruments) was to develop a consensus-based checklist to evaluate the methodological quality of studies on measurement properties. We present the COSMIN checklist and the agreement of the panel on the items of the checklist. A four-round Delphi study was performed with international experts (psychologists, epidemiologists, statisticians and clinicians). Of the 91 invited experts, 57 agreed to participate (63%). Panel members were asked to rate their (dis)agreement with each proposal on a five-point scale. Consensus was considered to be reached when at least 67% of the panel members indicated 'agree' or 'strongly agree'. Consensus was reached on the inclusion of the following measurement properties: internal consistency, reliability, measurement error, content validity (including face validity), construct validity (including structural validity, hypotheses testing and cross-cultural validity), criterion validity, responsiveness, and interpretability. The latter was not considered a measurement property. The panel also reached consensus on how these properties should be assessed. The resulting COSMIN checklist could be useful when selecting a measurement instrument, peer-reviewing a manuscript, designing or reporting a study on measurement properties, or for educational purposes.

  14. The modified Thomas test is not a valid measure of hip extension unless pelvic tilt is controlled

    Directory of Open Access Journals (Sweden)

    Andrew D. Vigotsky

    2016-08-01

    Full Text Available The modified Thomas test was developed to assess the presence of hip flexion contracture and to measure hip extensibility. Despite its widespread use, to the authors’ knowledge, its criterion reference validity has not yet been investigated. The purpose of this study was to assess the criterion reference validity of the modified Thomas test for measuring peak hip extension angle and hip extension deficits, as defined by the hip not being able to extend to 0º, or neutral. Twenty-nine healthy college students (age = 22.00 ± 3.80 years; height = 1.71 ± 0.09 m; body mass = 70.00 ± 15.60 kg were recruited for this study. Bland–Altman plots revealed poor validity for the modified Thomas test’s ability to measure hip extension, which could not be explained by differences in hip flexion ability alone. The modified Thomas test displayed a sensitivity of 31.82% (95% CI [13.86–54.87] and a specificity of 57.14% (95% CI [18.41–90.10] for testing hip extension deficits. It appears, however, that by controlling pelvic tilt, much of this variance can be accounted for (r = 0.98. When pelvic tilt is not controlled, the modified Thomas test displays poor criterion reference validity and, as per previous studies, poor reliability. However, when pelvic tilt is controlled, the modified Thomas test appears to be a valid test for evaluating peak hip extension angle.

  15. Psychometric properties of the Perceived Stress Scale (PSS: measurement invariance between athletes and non-athletes and construct validity

    Directory of Open Access Journals (Sweden)

    Yi-Hsiang Chiu

    2016-12-01

    Full Text Available Background Although Perceived Stress Scale (PSS, Cohen, Kamarack & Mermelstein, 1983 has been validated and widely used in many domains, there is still no validation in sports by comparing athletes and non-athletes and examining related psychometric indices. Purpose The purpose of this study was to examine the measurement invariance of PSS between athletes and non-athletes, and examine construct validity and reliability in the sports contexts. Methods Study 1 sampled 359 college student-athletes (males = 233; females = 126 and 242 non-athletes (males = 124; females = 118 and examined factorial structure, measurement invariance and internal consistency. Study 2 sampled 196 student-athletes (males = 139, females = 57, Mage = 19.88 yrs, SD = 1.35 and examined discriminant validity and convergent validity of PSS. Study 3 sampled 37 student-athletes to assess test-retest reliability of PSS. Results Results found that 2-factor PSS-10 fitted the model the best and had appropriate reliability. Also, there was a measurement invariance between athletes and non-athletes; and PSS positively correlated with athletic burnout and life stress but negatively correlated with coping efficacy provided evidence of discriminant validity and convergent validity. Further, the test-retest reliability for PSS subscales was significant (r = .66 and r = .50. Discussion It is suggested that 2-factor PSS-10 can be a useful tool in assessing perceived stress either in sports or non-sports settings. We suggest future study may use 2-factor PSS-10 in examining the effects of stress on the athletic injury, burnout, and psychiatry disorders.

  16. Measurement properties of existing clinical assessment methods evaluating scapular positioning and function. A systematic review

    DEFF Research Database (Denmark)

    Larsen, Camilla Marie; Juul-Kristensen, Birgit; Lund, Hans

    2014-01-01

    %) to "poor" (43%), with only one study rated as "good". The reliability domain was most often investigated. Few of the assessment methods in the included studies that had "fair" or "good" measurement property ratings demonstrated acceptable results for both reliability and validity. We found a substantially...... larger number of clinical scapular assessment methods than previously reported. Using the COSMIN checklist the methodological quality of the included measurement properties in the reliability and validity domains were in general "fair" to "poor". None were examined for all three domains: (1) reliability...... excluded for evaluation due to no/few clinimetric results, leaving 35 studies for evaluation. Graded according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN checklist), the methodological quality in the reliability and validity domains was "fair" (57...

  17. Ancillary outcome measures for assessment of individuals with cervical spondylotic myelopathy.

    Science.gov (United States)

    Kalsi-Ryan, Sukhvinder; Singh, Anoushka; Massicotte, Eric M; Arnold, Paul M; Brodke, Darrel S; Norvell, Daniel C; Hermsmeyer, Jeffrey T; Fehlings, Michael G

    2013-10-15

    Narrative review. To identify suitable outcome measures that can be used to quantify neurological and functional impairment in the management of cervical spondylotic myelopathy (CSM). CSM is the leading cause of acquired spinal cord disability, causing varying degrees of neurological impairment which impact on independence and quality of life. Because this impairment can have a heterogeneous presentation, a single outcome measure cannot define the broad range of deficits seen in this population. Therefore, it is necessary to define outcome measures that characterize the deficits with greater validity and sensitivity. This review was conducted in 3 stages. Stage I: To evaluate the current use of outcome measures in CSM, PubMed was searched using the name of the outcome measure and the common abbreviation combined with "CSM" or "myelopathy." Stage II: Having identified a lack of appropriate outcome measures, we constructed criteria by which measures appropriate for assessing the various aspects of CSM could be identified. Stage III: A second literature search was then conducted looking at specified outcomes that met these criteria. All literature was reviewed to determine specificity and psychometric properties of outcomes for CSM. Nurick grade, modified Japanese Orthopaedic Association Scale, visual analogue scale (VAS) for pain, Short Form (36) Health Survey (SF-36), and Neck Disability Index were the most commonly cited measures. The Short-Form 36 Health Survey and Myelopathy Disability Index have been validated in the CSM population with multiple studies, whereas the modified Japanese Orthopaedic Association Scale score, Nurick grade, and European Myelopathy Scale each had only one study assessing psychometric characteristics. No validity, reliability, or responsiveness studies were found for the VAS or Neck Disability Index in the CSM population. We recommend that the modified Japanese Orthopaedic Association Scale, Nurick grade, Myelopathy Disability Index

  18. Assessing the validity of commercial and municipal food environment data sets in Vancouver, Canada.

    Science.gov (United States)

    Daepp, Madeleine Ig; Black, Jennifer

    2017-10-01

    The present study assessed systematic bias and the effects of data set error on the validity of food environment measures in two municipal and two commercial secondary data sets. Sensitivity, positive predictive value (PPV) and concordance were calculated by comparing two municipal and two commercial secondary data sets with ground-truthed data collected within 800 m buffers surrounding twenty-six schools. Logistic regression examined associations of sensitivity and PPV with commercial density and neighbourhood socio-economic deprivation. Kendall's τ estimated correlations between density and proximity of food outlets near schools constructed with secondary data sets v. ground-truthed data. Vancouver, Canada. Food retailers located within 800 m of twenty-six schools RESULTS: All data sets scored relatively poorly across validity measures, although, overall, municipal data sets had higher levels of validity than did commercial data sets. Food outlets were more likely to be missing from municipal health inspections lists and commercial data sets in neighbourhoods with higher commercial density. Still, both proximity and density measures constructed from all secondary data sets were highly correlated (Kendall's τ>0·70) with measures constructed from ground-truthed data. Despite relatively low levels of validity in all secondary data sets examined, food environment measures constructed from secondary data sets remained highly correlated with ground-truthed data. Findings suggest that secondary data sets can be used to measure the food environment, although estimates should be treated with caution in areas with high commercial density.

  19. (Re)conceptualizing validity in (outcomes-based) assessment

    African Journals Online (AJOL)

    Erna Kinsey

    how the construct validity has evolved within social research discour- ses. Third, we invoke particular ..... understanding and, ideally, self-determination through research participation. .... Handbook of classroom assessment. San Diego, CA: ...

  20. The Cognitive Assessment Interview (CAI): development and validation of an empirically derived, brief interview-based measure of cognition.

    Science.gov (United States)

    Ventura, Joseph; Reise, Steven P; Keefe, Richard S E; Baade, Lyle E; Gold, James M; Green, Michael F; Kern, Robert S; Mesholam-Gately, Raquelle; Nuechterlein, Keith H; Seidman, Larry J; Bilder, Robert M

    2010-08-01

    Practical, reliable "real world" measures of cognition are needed to supplement neurocognitive performance data to evaluate possible efficacy of new drugs targeting cognitive deficits associated with schizophrenia. Because interview-based measures of cognition offer one possible approach, data from the MATRICS initiative (n=176) were used to examine the psychometric properties of the Schizophrenia Cognition Rating Scale (SCoRS) and the Clinical Global Impression of Cognition in Schizophrenia (CGI-CogS). We used classical test theory methods and item response theory to derive the 10-item Cognitive Assessment Interview (CAI) from the SCoRS and CGI-CogS ("parent instruments"). Sources of information for CAI ratings included the patient and an informant. Validity analyses examined the relationship between the CAI and objective measures of cognitive functioning, intermediate measures of cognition, and functional outcome. The rater's score from the newly derived CAI (10 items) correlate highly (r=.87) with those from the combined set of the SCoRS and CGI-CogS (41 items). Both the patient (r=.82) and the informant (r=.95) data were highly correlated with the rater's score. The CAI was modestly correlated with objectively measured neurocognition (r=-.32), functional capacity (r=-.44), and functional outcome (r=-.32), which was comparable to the parent instruments. The CAI allows for expert judgment in evaluating a patient's cognitive functioning and was modestly correlated with neurocognitive functioning, functional capacity, and functional outcome. The CAI is a brief, repeatable, and potentially valuable tool for rating cognition in schizophrenia patients who are participating in clinical trials. Copyright 2010 Elsevier B.V. All rights reserved.

  1. Validity in work-based assessment: expanding our horizons

    NARCIS (Netherlands)

    Govaerts, M.; Vleuten, C.P.M. van der

    2013-01-01

    CONTEXT: Although work-based assessments (WBA) may come closest to assessing habitual performance, their use for summative purposes is not undisputed. Most criticism of WBA stems from approaches to validity consistent with the quantitative psychometric framework. However, there is increasing

  2. Development and validation of the functional assessment of cancer therapy-antiangiogenesis subscale.

    Science.gov (United States)

    Kaiser, Karen; Beaumont, Jennifer L; Webster, Kimberly; Yount, Susan E; Wagner, Lynne I; Kuzel, Timothy M; Cella, David

    2015-05-01

    The Functional Assessment of Cancer Therapy (FACT)-Antiangiogenesis (AntiA) Subscale was developed and validated to enhance treatment decision-making and side effect management for patients receiving anti-angiogenesis therapies. Side effects related to anti-angiogenesis therapies were identified from the literature, clinician input, and patient input. Fifty-nine possible patient expressions of side effects were generated. Patient and clinician ratings of the importance of these expressions led us to develop a 24-item questionnaire with clinical and research potential. To assess the scale's reliability and validity, 167 patients completed the AntiA Subscale, the Functional Assessment of Cancer Therapy-general (FACT-G), the FACT-Kidney Symptom Index (FKSI), the FACIT-Fatigue Subscale, the Global Rating of Change Scale (GRC), and the PROMIS Global Health Scale. Patient responses to the AntiA were analyzed for internal consistency, test-retest reliability, convergent and discriminant validity, and responsiveness to change in clinical status. All tested scales were found to have good internal consistency reliability (Cronbach's alpha 0.70-0.92). Test-retest reliability was also good (0.72-0.88) for total and subscale scores and lower for individual items. The total score, subscale scores, and all single items (except nosebleeds) significantly differentiated between groups defined by level of side effect bother. Evaluation of responsiveness to change in this study was not conclusive, suggesting an area for further research. The AntiA is a reliable and valid measure of side effects from anti-angiogenesis therapy. © 2014 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

  3. Content Validity and Inter-Rater Reliability of the Halliwick-Concept-Based Instrument "Swimming with Independent Measure"

    Science.gov (United States)

    Srsen, Katja Groleger; Vidmar, Gaj; Pikl, Masa; Vrecar, Irena; Burja, Cirila; Krusec, Klavdija

    2012-01-01

    The Halliwick concept is widely used in different settings to promote joyful movement in water and swimming. To assess the swimming skills and progression of an individual swimmer, a valid and reliable measure should be used. The Halliwick-concept-based Swimming with Independent Measure (SWIM) was introduced for this purpose. We aimed to determine…

  4. Reliability and validity of the Performance Recorder 1 for measuring isometric knee flexor and extensor strength.

    Science.gov (United States)

    Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L

    2013-11-01

    Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n  =  15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC  =  0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC  =  0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC  =  0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC  =  0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC  =  0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.

  5. Criterion and convergent validity of the Montreal cognitive assessment with screening and standardized neuropsychological testing.

    Science.gov (United States)

    Lam, Benjamin; Middleton, Laura E; Masellis, Mario; Stuss, Donald T; Harry, Robin D; Kiss, Alex; Black, Sandra E

    2013-12-01

    To compare the validity of the Montreal Cognitive Assessment (MoCA) with the criterion standard of standardized neuropsychological testing and to compare the convergent validity of the MoCA with that of existing screening tools and global measures of cognition. Cross-sectional observational study. Tertiary care hospital-based cognitive neurology subspecialty clinic. A convenience sample of 107 individuals with mild Alzheimer's disease (AD, n=75) or mild cognitive impairment (MCI, n=32) from the Sunnybrook Dementia Study. In addition to the MoCA, all participants completed the Mini-Mental State Examination (MMSE), the Mattis Dementia Rating Scale (DRS), and detailed neuropsychological testing. Convergent validity was supported, with MoCA scores correlating well with the MMSE (correlation coefficient (r)=0.66, Pvalidity was supported, with MoCA subscores according to cognitive domain correlating well with analogous neuropsychological tests and, in the case of memory (area under the receiver operating characteristic curve (AUC)=0.86), executive (AUC=0.79), and visuospatial function (AUC=0.79), being reasonably sensitive to impairment in those domains. The MoCA is a valid assessment of cognition that shows good agreement with existing screening tools and global measures (convergent validity) and was superior to the MMSE in this regard. The MoCA domain-specific subscores align with performance on more-detailed neuropsychological tests, suggesting not only good criterion validity for the MoCA, but also that it may be useful in guiding further neuropsychological testing. © 2013, Copyright the Authors Journal compilation © 2013, The American Geriatrics Society.

  6. Validating health impact assessment: Prediction is difficult (especially about the future)

    International Nuclear Information System (INIS)

    Petticrew, Mark; Cummins, Steven; Sparks, Leigh; Findlay, Anne

    2007-01-01

    Health impact assessment (HIA) has been recommended as a means of estimating how policies, programmes and projects may impact on public health and on health inequalities. This paper considers the difference between predicting health impacts and measuring those impacts. It draws upon a case study of the building of a new hypermarket in a deprived area of Glasgow, which offered an opportunity to reflect on the issue of the predictive validity of HIA, and to consider the difference between potential and actual impacts. We found that the actual impacts of the new hypermarket on diet differed from that which would have been predicted based on previous studies. Furthermore, they challenge current received wisdom about the impact of food retail outlets in poorer areas. These results are relevant to the validity of HIA as a process and emphasise the importance of further research on the predictive validity of HIA, which should help improve its value to decision-makers

  7. Initial validation of the Yin-Yang Assessment Questionnaire for persons with diabetes mellitus.

    Science.gov (United States)

    Wong, Yee Chi Peggy; Pang, Mei Che Samantha

    2015-09-10

    To initially test for the content validity, comprehensibility, test-retest reliability and internal consistency reliability of the Yin-Yang Assessment Questionnaire (YY-AQ). The process of initial validity and reliability test covered: (1) content validation from the findings of 18 multiple-case studies, validated Yin- and Yang-deficiency assessment questionnaires, relevant literatures and registered Chinese medicine practitioners; (2) comprehension with the levels of comprehensibility for each item categorized on a 3-point scale (not comprehensible; moderately comprehensible; highly comprehensible). A minimum of three respondents selecting for each item of moderately or highly comprehensible were regarded as comprehensive; (3) test-retest reliability conducted with a 2-wk interval. The intraclass correlation coefficients (ICCs) and their 95%CIs were calculated using a two-way random effects model. Wilcoxon Signed Rank test for related samples was adopted to compare the medians of test-retest scores. An ICC value of 0.85 or higher together with P > 0.05, was considered acceptable; and (4) internal consistency of the total items was measured and evaluated by Cronbach's coefficient alpha (α). A Cronbach's α of 0.7 or higher was considered to represent good internal consistency. Eighteen Yin-deficiency and 14 Yang-deficiency presentation items were finalized from content validation. Five participants with type 2 diabetes mellitus (T2DM) performed the comprehensibility and test-retest reliability tests. Comprehensibility score level of each presentation item was found to be moderate or high in three out of the five participants. Test-retest reliability showed that the single measure ICC of the total Yin-deficiency presentation items was 0.99 (95%CI: 0.89-0.99) and the median scores on the first and 14(th) days were 17 (IQR 6.5-27) and 21 (IQR 6-29) (P = 0.144) respectively. The single measure ICC of the total Yang-deficiency presentation items was 0.88 (95%CI: 0

  8. Validation of the FEEL-KJ: An Instrument to Measure Emotion Regulation Strategies in Children and Adolescents.

    Directory of Open Access Journals (Sweden)

    Emiel Cracco

    Full Text Available Although the field of emotion regulation in children and adolescents is growing, there is need for age-adjusted measures that assess a large variety of strategies. An interesting instrument in this respect is the FEEL-KJ because it measures 7 adaptive and 5 maladaptive emotion regulation strategies in response to three different emotions. However, the FEEL-KJ has not yet been validated extensively. Therefore, the current study aims to test the internal structure and validity of the FEEL-KJ in a large sample of Dutch-speaking Belgian children and adolescents (N = 1102, 8-18 years old. The investigation of the internal structure confirms earlier reports of a two-factor structure with Adaptive and Maladaptive Emotion Regulation as overarching categories. However, it also suggests that the two-factor model is more complex than what was previously assumed. The evaluation of the FEEL-KJ validity furthermore provides evidence for its construct and external validity. In sum, the current study confirms that the FEEL-KJ is a valuable and reliable measure of emotion regulation strategies in children and adolescents.

  9. Reliability and validity of child/adolescent food frequency questionnaires that assess foods and/or food groups.

    Science.gov (United States)

    Kolodziejczyk, Julia K; Merchant, Gina; Norman, Gregory J

    2012-07-01

    Summarize the validity and reliability of child/adolescent food frequency questionnaires (FFQs) that assess food and/or food groups. We performed a systematic review of child/adolescent (6-18 years) FFQ studies published between January 2001 and December 2010 using MEDLINE, Cochrane Library, PsycINFO, and Google Scholar. Main inclusion criteria were peer reviewed, written in English, and reported reliability or validity of questionnaires that assessed intake of food/food groups. Studies were excluded that focused on diseased people or used a combined dietary assessment method. Two authors independently selected the articles and extracted questionnaire characteristics such as number of items, portion size information, time span, category intake frequencies, and method of administration. Validity and reliability coefficients were extracted and reported for food categories and averaged across food categories for each study. Twenty-one studies were selected from 873, 18 included validity data, and 14 included test-retest reliability data. Publications were from the United States, Europe, Africa, Brazil, and the south Pacific. Validity correlations ranged from 0.01 to 0.80, and reliability correlations ranged from 0.05 to 0.88. The highest average validity correlations were obtained when the questionnaire did not assess portion size, measured a shorter time span (ie, previous day/week), was of medium length (ie, ≈ 20-60 items), and was not administered to the child's parents. There are design and administration features of child/adolescent FFQs that should be considered to obtain reliable and valid estimates of dietary intake in this population.

  10. Measuring patient activation in The Netherlands: translation and validation of the American short form Patient Activation Measure (PAM13).

    Science.gov (United States)

    Rademakers, Jany; Nijman, Jessica; van der Hoek, Lucas; Heijmans, Monique; Rijken, Mieke

    2012-07-31

    The American short form Patient Activation Measure (PAM) is a 13-item instrument which assesses patient (or consumer) self-reported knowledge, skills and confidence for self-management of one's health or chronic condition. In this study the PAM was translated into a Dutch version; psychometric properties of the Dutch version were established and the instrument was validated in a panel of chronically ill patients. The translation was done according to WHO guidelines. The PAM 13-Dutch was sent to 4178 members of the Dutch National Panel of people with Chronic illness or Disability (NPCD) in April 2010 (study A) and again to a sub sample of this group (N = 973) in June 2010 (study B). Internal consistency, test-retest reliability and cross-validation with the SBSQ-D (a measure for Health literacy) were computed. The Dutch results were compared to similar Danish and American data. The psychometric properties of the PAM 13-Dutch were generally good. The level of internal consistency is good (α = 0.88) and item-rest correlations are moderate to strong. The Dutch mean PAM score (61.3) is comparable to the American (61.9) and lower than the Danish (64.2). The test-retest reliability was moderate. The association with Health literacy was weak to moderate. The PAM-13 Dutch is a reliable instrument to measure patient activation. More research is needed into the validity of the Patient Activation Measure, especially with respect to a more comprehensive measure of Health literacy.

  11. Development and validation of an instrument to assess job satisfaction in eye-care personnel.

    Science.gov (United States)

    Paudel, Prakash; Cronjé, Sonja; O'Connor, Patricia M; Khadka, Jyoti; Rao, Gullapalli N; Holden, Brien A

    2017-11-01

    The aim was to develop and validate an instrument to measure job satisfaction in eye-care personnel and assess the job satisfaction of one-year trained vision technicians in India. A pilot instrument for assessing job satisfaction was developed, based on a literature review and input from a public health expert panel. Rasch analysis was used to assess psychometric properties and to undertake an iterative item reduction. The instrument was then administered to vision technicians in vision centres of Andhra Pradesh in India. Associations between vision technicians' job satisfaction and factors such as age, gender and experience were analysed using t-test and one-way analysis of variance. Rasch analysis confirmed that the 15-item job satisfaction in eye-care personnel (JSEP) was a unidimensional instrument with good fit statistics, measurement precisions and absence of differential item functioning. Overall, vision technicians reported high rates of job satisfaction (0.46 logits). Age, gender and experience were not associated with high job satisfaction score. Item score analysis showed non-financial incentives, salary and workload were the most important determinants of job satisfaction. The 15-item JSEP instrument is a valid instrument for assessing job satisfaction among eye-care personnel. Overall, vision technicians in India demonstrated high rates of job satisfaction. © 2016 Optometry Australia.

  12. Validation of a Short Form of an Indecision Test: The Vocational Assessment Test

    Science.gov (United States)

    Picard, France; Frenette, Éric; Guay, Frédéric; Labrosse, Julie

    2015-01-01

    The purpose of this research was to validate the scores of a short form of a new instrument, "l'Épreuve de décision vocationnelle, forme scolaire" (EDV-9S; vocational assessment test), which measures six indecision-related problems (lack of self-knowledge, lack of readiness, lack of method in decision making, lack of information,…

  13. The Validity of Attribute-Importance Measurement: A Review

    NARCIS (Netherlands)

    Ittersum, van K.; Pennings, J.M.E.; Wansink, B.; Trijp, van J.C.M.

    2007-01-01

    A critical review of the literature demonstrates a lack of validity among the ten most common methods for measuring the importance of attributes in behavioral sciences. The authors argue that one of the key determinants of this lack of validity is the multi-dimensionality of attribute importance.

  14. Development of a tool to measure person-centered maternity care in developing settings: validation in a rural and urban Kenyan population.

    Science.gov (United States)

    Afulani, Patience A; Diamond-Smith, Nadia; Golub, Ginger; Sudhinaraset, May

    2017-09-22

    Person-centered reproductive health care is recognized as critical to improving reproductive health outcomes. Yet, little research exists on how to operationalize it. We extend the literature in this area by developing and validating a tool to measure person-centered maternity care. We describe the process of developing the tool and present the results of psychometric analyses to assess its validity and reliability in a rural and urban setting in Kenya. We followed standard procedures for scale development. First, we reviewed the literature to define our construct and identify domains, and developed items to measure each domain. Next, we conducted expert reviews to assess content validity; and cognitive interviews with potential respondents to assess clarity, appropriateness, and relevance of the questions. The questions were then refined and administered in surveys; and survey results used to assess construct and criterion validity and reliability. The exploratory factor analysis yielded one dominant factor in both the rural and urban settings. Three factors with eigenvalues greater than one were identified for the rural sample and four factors identified for the urban sample. Thirty of the 38 items administered in the survey were retained based on the factors loadings and correlation between the items. Twenty-five items load very well onto a single factor in both the rural and urban sample, with five items loading well in either the rural or urban sample, but not in both samples. These 30 items also load on three sub-scales that we created to measure dignified and respectful care, communication and autonomy, and supportive care. The Chronbach alpha for the main scale is greater than 0.8 in both samples, and that for the sub-scales are between 0.6 and 0.8. The main scale and sub-scales are correlated with global measures of satisfaction with maternity services, suggesting criterion validity. We present a 30-item scale with three sub-scales to measure person

  15. Technical skills assessment toolbox: a review using the unitary framework of validity.

    Science.gov (United States)

    Ghaderi, Iman; Manji, Farouq; Park, Yoon Soo; Juul, Dorthea; Ott, Michael; Harris, Ilene; Farrell, Timothy M

    2015-02-01

    The purpose of this study was to create a technical skills assessment toolbox for 35 basic and advanced skills/procedures that comprise the American College of Surgeons (ACS)/Association of Program Directors in Surgery (APDS) surgical skills curriculum and to provide a critical appraisal of the included tools, using contemporary framework of validity. Competency-based training has become the predominant model in surgical education and assessment of performance is an essential component. Assessment methods must produce valid results to accurately determine the level of competency. A search was performed, using PubMed and Google Scholar, to identify tools that have been developed for assessment of the targeted technical skills. A total of 23 assessment tools for the 35 ACS/APDS skills modules were identified. Some tools, such as Operative Performance Rating System (OSATS) and Objective Structured Assessment of Technical Skill (OPRS), have been tested for more than 1 procedure. Therefore, 30 modules had at least 1 assessment tool, with some common surgical procedures being addressed by several tools. Five modules had none. Only 3 studies used Messick's framework to design their validity studies. The remaining studies used an outdated framework on the basis of "types of validity." When analyzed using the contemporary framework, few of these studies demonstrated validity for content, internal structure, and relationship to other variables. This study provides an assessment toolbox for common surgical skills/procedures. Our review shows that few authors have used the contemporary unitary concept of validity for development of their assessment tools. As we progress toward competency-based training, future studies should provide evidence for various sources of validity using the contemporary framework.

  16. The multi-faceted assessment of independence in patients with rheumatoid arthritis: preliminary validation from the ATTAIN study.

    Science.gov (United States)

    Hassett, Afton L; Li, Tracy; Buyske, Steven; Savage, Shantal V; Gignac, Monique A M

    2008-05-01

    To consider the feasibility of assessing multiple facets of independence in rheumatoid arthritis (RA) using a measure developed from existing items and examining its face validity, construct validity and responsiveness to change. The ATTAIN (Abatacept Trial in Treatment of Anti-tumor necrosis factor [TNF] Inadequate responders) database was used. Patients with RA were randomized 2:1, abatacept (n = 258) and placebo (n = 133). A multi-faceted scale to measure physical and psychosocial independence was constructed using items from the Health Assessment Questionnaire (HAQ) and Short Form 36 Health Survey (SF-36). Questions assessing activity limitations and need for outside caregiver help were also examined. Interviews with 20 RA patients assessed face validity. Item Response Theory analysis yielded two traits - 'Psychosocial Independence', derived from the number of days with activity limitations plus the Role Emotional, Social Functioning and Role Physical subscale items from the SF-36; and 'Physical Independence', derived from 15 HAQ items assessing need for help from another. The two traits showed no significant differential item functioning for age or gender and demonstrated good face validity. Changes over 169 days on Psychosocial Independence were greater (mean 0.46 units, 95% confidence interval [CI]: 0.17-0.75) for the abatacept group than for placebo (p = 0.002). Changes in Physical Independence were greater (mean 0.59 units, 95% CI: 0.35-0.82) for the abatacept group than for placebo (p anti-TNF therapy. However, we caution against an interpretation that these data suggest that abatacept improves independence because the component parts of this assessment came from instruments used in the ATTAIN trial where data had been previously analyzed.

  17. Validated assessment scales for the lower face.

    Science.gov (United States)

    Narins, Rhoda S; Carruthers, Jean; Flynn, Timothy C; Geister, Thorin L; Görtelmeyer, Roman; Hardas, Bhushan; Himmrich, Silvia; Jones, Derek; Kerscher, Martina; de Maio, Maurício; Mohrmann, Cornelia; Pooth, Rainer; Rzany, Berthold; Sattler, Gerhard; Buchner, Larry; Benter, Ursula; Breitscheidel, Lusine; Carruthers, Alastair

    2012-02-01

    Aging in the lower face leads to lines, wrinkles, depression of the corners of the mouth, and changes in lip volume and lip shape, with increased sagging of the skin of the jawline. Refined, easy-to-use, validated, objective standards assessing the severity of these changes are required in clinical research and practice. To establish the reliability of eight lower face scales assessing nasolabial folds, marionette lines, upper and lower lip fullness, lip wrinkles (at rest and dynamic), the oral commissure and jawline, aesthetic areas, and the lower face unit. Four 5-point rating scales were developed to objectively assess upper and lower lip wrinkles, oral commissures, and the jawline. Twelve experts rated identical lower face photographs of 50 subjects in two separate rating cycles using eight 5-point scales. Inter- and intrarater reliability of responses was assessed. Interrater reliability was substantial or almost perfect for all lower face scales, aesthetic areas, and the lower face unit. Intrarater reliability was high for all scales, areas and the lower face unit. Our rating scales are reliable tools for valid and reproducible assessment of the aging process in lower face areas. © 2012 by the American Society for Dermatologic Surgery, Inc. Published by Wiley Periodicals, Inc.

  18. Quantitative assessment of skin erythema due to radiotherapy--evaluation of different measurements

    International Nuclear Information System (INIS)

    Wengstroem, Yvonne; Forsberg, Christina; Naeslund, Ingemar; Bergh, Jonas

    2004-01-01

    Background and purpose: Visual assessment is the most common clinical investigation of skin reactions in radiotherapy. Due to the unquantitative and subjective nature of this method additional non-invasive methods are needed for more accurate evaluation of the visible acute adverse skin reactions due to radiotherapy. The purpose of this study was to evaluate a new objective measure with regard to reliability and validity and compare it with an established objective measure and a visual assessment. Patients and methods: A sample of 53 consecutive patients commencing curative tangential radiation therapy to the breast parenchyma were included in the study. The skin area of the treated breast was divided into five sections and assessed individually at 0, 24 and 50 Gy. The RTOG scoring system was used for the visual assessment of the skin reactions. The first objective measure included reflectance spectrometry (DermaSpectrometer) measures at fixed points within the treatment area. For the second objective measure digital images (Camera) were taken with a system using a digital camera and software. The images were analyzed using the Adobe Photoshop 5.0 software program. Results: The results provided significant evidence of the test-retest reliability of the camera. The correlation between the objective measures proved to be significant as the treatment progressed. Conclusions: The results suggest that the camera may be used in a reliable and valid way to measure skin erythema due to radiotherapy

  19. Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis.

    Science.gov (United States)

    Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

    2017-01-18

    To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.

  20. Assessing the criterion validity of four highly abbreviated measures from the Minimal Assessment of Cognitive Function in Multiple Sclerosis (MACFIMS).

    Science.gov (United States)

    Gromisch, Elizabeth S; Zemon, Vance; Holtzer, Roee; Chiaravalloti, Nancy D; DeLuca, John; Beier, Meghan; Farrell, Eileen; Snyder, Stacey; Schairer, Laura C; Glukhovsky, Lisa; Botvinick, Jason; Sloan, Jessica; Picone, Mary Ann; Kim, Sonya; Foley, Frederick W

    2016-10-01

    Cognitive dysfunction is prevalent in multiple sclerosis. As self-reported cognitive functioning is unreliable, brief objective screening measures are needed. Utilizing widely used full-length neuropsychological tests, this study aimed to establish the criterion validity of highly abbreviated versions of the Brief Visuospatial Memory Test - Revised (BVMT-R), Symbol Digit Modalities Test (SDMT), Delis-Kaplan Executive Function System (D-KEFS) Sorting Test, and Controlled Oral Word Association Test (COWAT) in order to begin developing an MS-specific screening battery. Participants from Holy Name Medical Center and the Kessler Foundation were administered one or more of these four measures. Using test-specific criterion to identify impairment at both -1.5 and -2.0 SD, receiver-operating-characteristic (ROC) analyses of BVMT-R Trial 1, Trial 2, and Trial 1 + 2 raw data (N = 286) were run to calculate the classification accuracy of the abbreviated version, as well as the sensitivity and specificity. The same methods were used for SDMT 30-s and 60-s (N = 321), D-KEFS Sorting Free Card Sort 1 (N = 120), and COWAT letters F and A (N = 298). Using these definitions of impairment, each analysis yielded high classification accuracy (89.3 to 94.3%). BVMT-R Trial 1, SDMT 30-s, D-KEFS Free Card Sort 1, and COWAT F possess good criterion validity in detecting impairment on their respective overall measure, capturing much of the same information as the full version. Along with the first two trials of the California Verbal Learning Test - Second Edition (CVLT-II), these five highly abbreviated measures may be used to develop a brief screening battery.

  1. Measuring the food service environment: development and implementation of assessment tools.

    Science.gov (United States)

    Minaker, Leia M; Raine, Kim D; Cash, Sean B

    2009-01-01

    The food environment is increasingly being implicated in the obesity epidemic, though few reported measures of it exist. In order to assess the impact of the food environment on food intake, valid measures must be developed and tested. The current study describes the development of a food service environment assessment tool and its implementation in a community setting. A descriptive study with mixed qualitative and quantitative methods at a large, North American university campus was undertaken. Measures were developed on the basis of a conceptual model of nutrition environments. Measures of community nutrition environment were the number, type and hours of operation of each food service outlet on campus. Measures of consumer nutrition environment were food availability, food affordability, food promotion and nutrition information availability. Seventy-five food service outlets within the geographic boundaries were assessed. Assessment tools could be implemented in a reasonable amount of time and showed good face and content validity. The food environments were described and measures were grouped so that food service outlet types could be compared in terms of purchasing convenience, cost/value, healthy food promotion and health. Food service outlet types that scored higher in purchasing convenience and cost/value tended to score lower in healthy food promotion and health. This study adds evidence that food service outlet types that are convenient to consumers and supply high value (in terms of calories per dollar) tend to be less health-promoting. Results from this study also suggest the possibility of characterizing the food environment according to the type of food service outlet observed.

  2. Momentary assessment of adults’ physical activity and sedentary behavior: Feasibility and validity

    Directory of Open Access Journals (Sweden)

    Genevieve Fridlund Dunton

    2012-07-01

    Full Text Available Introduction: Mobile phones are ubiquitous and easy to use, and thus have the capacity to collect real-time data from large numbers of people. Research tested the feasibility and validity of an Ecological Momentary Assessment (EMA self-report protocol using electronic surveys on mobile phones to assess adults’ physical activity and sedentary behaviors. Methods: Adults (N = 110 (73% female, 30% Hispanic, 62% overweight/obese completed a four-day signal-contingent EMA protocol (Sat. - Tues. with eight surveys randomly spaced throughout each day. EMA items assessed current activity (e.g., Watching TV/Movies, Reading/Computer, Physical Activity/Exercise. EMA responses were time-matched to minutes of moderate-to-vigorous physical activity (MVPA and sedentary activity (SA measured by accelerometer immediately before and after each EMA prompt. Results: Unanswered EMA prompts had greater MVPA (±15 min. than answered EMA prompts (p = .029 for under/normal weight participants, indicating that activity level might influence the likelihood of responding. The 15-min. intervals before vs. after the EMA-reported physical activity (n = 296 occasions did not differ in MVPA (p > .05, suggesting that prompting did not disrupt physical activity. SA decreased after EMA-reported sedentary behavior (n = 904 occasions (p < .05 for overweight and obese participants. As compared with other activities, EMA-reported physical activity and sedentary behavior had significantly greater MVPA and SA, respectively, in the ±15 minutes of the EMA prompt (p’s < .001, providing evidence for criterion validity. Conclusions: Findings generally support the acceptability and validity of a four-day signal contingent EMA protocol using mobile phones to measure physical activity and sedentary behavior in adults. However, some MVPA may be missed among underweight and normal weight individuals, and EMA may disrupt sedentary behavior among overweight/obese individuals.

  3. Assessing the construct validity of aberrant salience

    Directory of Open Access Journals (Sweden)

    Kristin Schmidt

    2009-12-01

    Full Text Available We sought to validate the psychometric properties of a recently developed paradigm that aims to measure salience attribution processes proposed to contribute to positive psychotic symptoms, the Salience Attribution Test (SAT. The “aberrant salience” measure from the SAT showed good face validity in previous results, with elevated scores both in high-schizotypy individuals, and in patients with schizophrenia suffering from delusions. Exploring the construct validity of salience attribution variables derived from the SAT is important, since other factors, including latent inhibition/learned irrelevance, attention, probabilistic reward learning, sensitivity to probability, general cognitive ability and working memory could influence these measures. Fifty healthy participants completed schizotypy scales, the SAT, a learned irrelevance task, and a number of other cognitive tasks tapping into potentially confounding processes. Behavioural measures of interest from each task were entered into a principal components analysis, which yielded a five-factor structure accounting for ~75% percent of the variance in behaviour. Implicit aberrant salience was found to load onto its own factor, which was associated with elevated “Introvertive Anhedonia” schizotypy, replicating our previous finding. Learned irrelevance loaded onto a separate factor, which also included implicit adaptive salience, but was not associated with schizotypy. Explicit adaptive and aberrant salience, along with a measure of probabilistic learning, loaded onto a further factor, though this also did not correlate with schizotypy. These results suggest that the measures of learned irrelevance and implicit adaptive salience might be based on similar underlying processes, which are dissociable both from implicit aberrant salience and explicit measures of salience.

  4. Four tenets of modern validity theory for medical education assessment and evaluation.

    Science.gov (United States)

    Royal, Kenneth D

    2017-01-01

    Validity is considered by many to be the most important criterion for evaluating a set of scores, yet few agree on what exactly the term means. Since the mid-1800s, scholars have been concerned with the notion of validity, but over time, the term has developed a variety of meanings across academic disciplines and contexts. Accordingly, when scholars with different academic backgrounds, many of whom hold deeply entrenched perspectives about validity conceptualizations, converge in the field of medical education assessment, it is a recipe for confusion. Thus, it is important to work toward a consensus about validity in the context of medical education assessment. Thus, the purpose of this work was to present four fundamental tenets of modern validity theory in an effort to establish a framework for scholars in the field of medical education assessment to follow when conceptualizing validity, interpreting validity evidence, and reporting research findings.

  5. Validity and reproducibility of resting metabolic rate measurements in rural Bangladeshi women: comparison of measurements obtained by Medgem and by Deltatrac device

    NARCIS (Netherlands)

    Alam, D.S.; Hulshof, P.J.M.; Roordink, D.; Meltzer, M.; Yunus, M.; Salam, M.A.; Raaij, van J.M.A.

    2005-01-01

    Objective:To assess reproducibility and validity of resting metabolic rate (RMR) of Bangladeshi women as measured with the MedGem device and using the Deltatrac metabolic monitor as a reference; and (2) to evaluate the FAO/WHO/UNU basal metabolic rate (BMR)-prediction equations. Design:In each of

  6. Validation of the Elementary Social Behavior Assessment: A Measure of Student Prosocial School Behaviors

    Science.gov (United States)

    Pennefather, Jordan T.; Smolkowski, Keith

    2015-01-01

    We describe the psychometric evaluation of the "Elementary Social Behavior Assessment" (ESBA™), a 12-item scale measuring teacher-preferred, positive social skills. The ESBA was developed for use in elementary school classrooms to measure teacher perceptions of students using time-efficient, web-based data collection methods that allow…

  7. Validation and Error Characterization for the Global Precipitation Measurement

    Science.gov (United States)

    Bidwell, Steven W.; Adams, W. J.; Everett, D. F.; Smith, E. A.; Yuter, S. E.

    2003-01-01

    The Global Precipitation Measurement (GPM) is an international effort to increase scientific knowledge on the global water cycle with specific goals of improving the understanding and the predictions of climate, weather, and hydrology. These goals will be achieved through several satellites specifically dedicated to GPM along with the integration of numerous meteorological satellite data streams from international and domestic partners. The GPM effort is led by the National Aeronautics and Space Administration (NASA) of the United States and the National Space Development Agency (NASDA) of Japan. In addition to the spaceborne assets, international and domestic partners will provide ground-based resources for validating the satellite observations and retrievals. This paper describes the validation effort of Global Precipitation Measurement to provide quantitative estimates on the errors of the GPM satellite retrievals. The GPM validation approach will build upon the research experience of the Tropical Rainfall Measuring Mission (TRMM) retrieval comparisons and its validation program. The GPM ground validation program will employ instrumentation, physical infrastructure, and research capabilities at Supersites located in important meteorological regimes of the globe. NASA will provide two Supersites, one in a tropical oceanic and the other in a mid-latitude continental regime. GPM international partners will provide Supersites for other important regimes. Those objectives or regimes not addressed by Supersites will be covered through focused field experiments. This paper describes the specific errors that GPM ground validation will address, quantify, and relate to the GPM satellite physical retrievals. GPM will attempt to identify the source of errors within retrievals including those of instrument calibration, retrieval physical assumptions, and algorithm applicability. With the identification of error sources, improvements will be made to the respective calibration

  8. The public health disaster trust scale: validation of a brief measure.

    Science.gov (United States)

    Eisenman, David P; Williams, Malcolm V; Glik, Deborah; Long, Anna; Plough, Alonzo L; Ong, Michael

    2012-01-01

    Trust contributes to community resilience by the critical influence it has on the community's responses to public health recommendations before, during, and after disasters. However, trust in public health is a multifactorial concept that has rarely been defined and measured empirically in public health jurisdictional risk assessment surveys. Measuring trust helps public health departments identify and ameliorate a threat to effective risk communications and increase resilience. Such a measure should be brief to be incorporated into assessments conducted by public health departments. We report on a brief scale of public health disaster-related trust, its psychometric properties, and its validity. On the basis of a literature review, our conceptual model of public health disaster-related trust and previously conducted focus groups, we postulated that public health disaster-related trust includes 4 major domains: competency, honesty, fairness, and confidentiality. A random-digit-dialed telephone survey of the Los Angeles county population, conducted in 2004-2005 in 6 languages. Two thousand five hundred eighty-eight adults aged 18 years and older including oversamples of African Americans and Asian Americans. Trust was measured by 4 items scored on a 4-point Likert scale. A summary score from 4 to 16 was constructed. Scores ranged from 4 to 16 and were normally distributed with a mean of 8.5 (SD 2.7). Cronbach α = 0.79. As hypothesized, scores were lower among racial/ethnic minority populations than whites. Also, trust was associated with lower likelihood of following public health recommendations in a hypothetical disaster and lower likelihood of household disaster preparedness. The Public Health Disaster Trust scale may facilitate identifying communities where trust is low and prioritizing them for inclusion in community partnership building efforts under Function 2 of the Centers for Disease Control and Prevention's Public Health Preparedness Capability 1. The

  9. International cross-cultural validation study of the Canadian haemophilia outcomes: kids' life assessment tool.

    Science.gov (United States)

    McCusker, P J; Fischer, K; Holzhauer, S; Meunier, S; Altisent, C; Grainger, J D; Blanchette, V S; Burke, T A; Wakefield, C; Young, N L

    2015-05-01

    Health-related quality of life (HRQoL) assessment is recognized as an important outcome in the evaluation of different therapeutic regimens for persons with haemophilia. The Canadian Haemophilia Outcomes-Kids' Life Assessment Tool (CHO-KLAT) is a disease-specific measure of HRQoL for 4 to 18-year-old boys with haemophilia. The purpose of this study was to extend this disease-specific, child-centric, outcome measure for use in international clinical trials. We adapted the North American English CHO-KLAT version for use in five countries: France, Germany, the Netherlands, Spain and the United Kingdom (UK). The process included four stages: (i) translation; (ii) cognitive debriefing; (iii) validity assessment relative to the PedsQL (generic) and the Haemo-QoL (disease-specific) and (iv) assessment of inter and intra-rater reliability. Cognitive debriefing was performed in 57 boys (mean age 11.4 years), validation was performed in 144 boys (mean age 11.0 years) and reliability was assessed for a subgroup of 64 boys (mean age 12.0 years). Parents also participated. The mean scores reported by the boys were high: CHO-KLAT 77.0 (SD = 11.2); PedsQL 83.8 (SD = 11.9) and Haemo-QoL 79.6 (SD = 11.5). Correlations between the CHO-KLAT and PedsQL ranged from 0.63 in Germany to 0.39 in the Netherlands and Spain. Test-retest reliability (concordance) for child self-report was 0.67. Child-parent concordance was slightly lower at 0.57. The CHO-KLAT has been fully culturally adapted and validated for use in five different languages and cultures (in England, the Netherlands, France, Germany and Spain) where treatment is readily available either on demand or as prophylaxis. © 2014 John Wiley & Sons Ltd.

  10. Assessing tolerance for wildlife: Clarifying relations between concepts and measures

    Science.gov (United States)

    Bruskotter, Jeremy T.; Singh, Ajay; Fulton, David C.; Slagle, Kristina

    2015-01-01

    Two parallel lines of inquiry, tolerance for and acceptance of wildlife populations, have arisen in the applied literature on wildlife conservation to assess probability of successfully establishing or increasing populations of controversial species. Neither of these lines is well grounded in social science theory, and diverse measures have been employed to assess tolerance, which inhibits comparability across studies. We empirically tested behavioral measures of tolerance against self-reports of previous policy-relevant behavior and behavioral intentions. Both composite behavioral measures were strongly correlated (r > .70) with two attitudinal measures of tolerance commonly employed in the literature. The strong correlation between attitudinal and behavioral measures suggests existing attitudinal measures represent valid, parsimonious measures of tolerance that may be useful when behavioral measures are too cumbersome or misreporting of behavior is anticipated. Our results demonstrate how behavioral measures of tolerance provide additional, useful information beyond general attitudinal measures.

  11. Validity of the stroke rehabilitation assessment of movement scale in acute rehabilitation: a comparison with the functional independence measure and stroke impact scale-16.

    Science.gov (United States)

    Ward, Irene; Pivko, Susan; Brooks, Gary; Parkin, Kate

    2011-11-01

    To demonstrate sensitivity to change of the Stroke Rehabilitation Assessment of Movement (STREAM) as well as the concurrent and predictive validity of the STREAM in an acute rehabilitation setting. Prospective cohort study. Acute, in-patient rehabilitation department within a tertiary-care teaching hospital in the United States. Thirty adults with a newly diagnosed, first ischemic stroke. Clinical assessments were conducted on admission and then again on discharge from the rehabilitation hospital with the STREAM (total STREAM and upper extremity, lower extremity, and mobility subscales), Functional Independence Measure (FIM), and Stroke Impact Scale-16 (SIS-16). Sensitivity to change was determined with the Wilcoxon signed rank test and by the calculation of standardized response means. Spearman correlations were used to assess concurrent validity of the total STREAM and STREAM subscales with the FIM and SIS-16 on admission and discharge. We determined predictive validity for all instruments by correlating admission scores with actual and predicted length of stay and by testing associations between admission scores and discharge destination (home vs subacute facility). Not applicable. For all instruments, there was statistically significant improvement from admission to discharge. The standardized response means for the total STREAM and STREAM subscales were large. Spearman correlations between the total STREAM and STREAM subscales and the FIM and SIS-16 were moderate to excellent, both on admission and discharge. Among change scores, only the SIS-16 correlated with the total STREAM. All 3 instruments were significantly associated with discharge destination; however, the associations were strongest for the total STREAM and STREAM subscales. All instruments showed moderate-to-excellent correlations with predicted and actual length of stay. The STREAM is sensitive to change and demonstrates good concurrent and predictive validity as compared with the FIM and SIS-16

  12. Validation of self assessment patient knowledge questionnaire for heart failure patients.

    Science.gov (United States)

    Lainscak, Mitja; Keber, Irena

    2005-12-01

    Several studies showed insufficient knowledge and poor compliance to non-pharmacological management in heart failure patients. Only a limited number of validated tools are available to assess their knowledge. The aim of the study was to test our 10-item Patient knowledge questionnaire. The Patient knowledge questionnaire was administered to 42 heart failure patients from Heart failure clinic and to 40 heart failure patients receiving usual care. Construct validity (Pearson correlation coefficient), internal consistency (Cronbach alpha), reproducibility (Wilcoxon signed rank test), and reliability (chi-square test and Student's t-test for independent samples) were assessed. Overall score of the Patient knowledge questionnaire had the strongest correlation to the question about regular weighing (r=0.69) and the weakest to the question about presence of heart disease (r=0.33). There was a strong correlation between question about fluid retention and questions assessing regular weighing, (r=0.86), weight of one litre of water (r=0.86), and salt restriction (r=0.57). The Cronbach alpha was 0.74 and could be improved by exclusion of questions about clear explanation (Chronbach alpha 0.75), importance of fruit, soup, and vegetables (Chronbach alpha 0.75), and self adjustment of diuretic (Chronbach alpha 0.81). During reproducibility testing 91% to 98% of questions were answered equally. Patients from Heart failure clinic scored significantly better than patients receiving usual care (7.9 (1.3) vs. 5.7 (2.2), p<0.001). Patient knowledge questionnaire is a valid and reliable tool to measure knowledge of heart failure patients.

  13. Validity of portfolio assessment: which qualities determine ratings?

    NARCIS (Netherlands)

    Driessen, E.W.; Overeem, K.; Tartwijk, J. van; Vleuten, C.P.M. van der; Muijtjens, A.M.M.

    2006-01-01

    The portfolio is becoming increasingly accepted as a valuable tool for learning and assessment. The validity of portfolio assessment, however, may suffer from bias due to irrelevant qualities, such as lay-out and writing style. We examined the possible effects of such qualities in a portfolio

  14. Validity of clinical outcome measures to evaluate ankle range of motion during the weight-bearing lunge test.

    Science.gov (United States)

    Hall, Emily A; Docherty, Carrie L

    2017-07-01

    To determine the concurrent validity of standard clinical outcome measures compared to laboratory outcome measure while performing the weight-bearing lunge test (WBLT). Cross-sectional study. Fifty participants performed the WBLT to determine dorsiflexion ROM using four different measurement techniques: dorsiflexion angle with digital inclinometer at 15cm distal to the tibial tuberosity (°), dorsiflexion angle with inclinometer at tibial tuberosity (°), maximum lunge distance (cm), and dorsiflexion angle using a 2D motion capture system (°). Outcome measures were recorded concurrently during each trial. To establish concurrent validity, Pearson product-moment correlation coefficients (r) were conducted, comparing each dependent variable to the 2D motion capture analysis (identified as the reference standard). A higher correlation indicates strong concurrent validity. There was a high correlation between each measurement technique and the reference standard. Specifically the correlation between the inclinometer placement at 15cm below the tibial tuberosity (44.9°±5.5°) and the motion capture angle (27.0°±6.0°) was r=0.76 (p=0.001), between the inclinometer placement at the tibial tuberosity angle (39.0°±4.6°) and the motion capture angle was r=0.71 (p=0.001), and between the distance from the wall clinical measure (10.3±3.0cm) to the motion capture angle was r=0.74 (p=0.001). This study determined that the clinical measures used during the WBLT have a high correlation with the reference standard for assessing dorsiflexion range of motion. Therefore, obtaining maximum lunge distance and inclinometer angles are both valid assessments during the weight-bearing lunge test. Copyright © 2016 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  15. Validation of the Danish version of the McGill Ingestive Skills Assessment using classical test theory and the Rasch model

    DEFF Research Database (Denmark)

    Hansen, Tina; Lambert, Heather C; Faber, Jens

    2012-01-01

    Purpose: The study aimed to validate the Danish version of the Canadian the "McGill Ingestive Skills Assessment" (MISA-DK) for measuring dysphagia in frail elders. Method: One-hundred and ten consecutive older medical patients were recruited to the study. Reliability was assessed by internal...... consistency (Chronbach's alpha). External construct validity (convergent and known-groups validity) was evaluated against theoretical constructs assessing the complex concept of ingestive skills. Internal construct validity was tested using Rasch analysis. Results: High internal consistency reliability...... with Chronbach's alpha of 0.77-0.95 was evident. External construct validity was supported by expected high correlations with most of the constructs related to ingestive skills (r(s)¿=¿0.53 to r(s)¿=¿0.66). The MISA-DK discriminated significantly between known-groups. Fit to the Rasch model (x(2) (df)¿=¿12 (12...

  16. Content Validation and Evaluation of an Endovascular Teamwork Assessment Tool.

    Science.gov (United States)

    Hull, L; Bicknell, C; Patel, K; Vyas, R; Van Herzeele, I; Sevdalis, N; Rudarakanchana, N

    2016-07-01

    To modify, content validate, and evaluate a teamwork assessment tool for use in endovascular surgery. A multistage, multimethod study was conducted. Stage 1 included expert review and modification of the existing Observational Teamwork Assessment for Surgery (OTAS) tool. Stage 2 included identification of additional exemplar behaviours contributing to effective teamwork and enhanced patient safety in endovascular surgery (using real-time observation, focus groups, and semistructured interviews of multidisciplinary teams). Stage 3 included content validation of exemplar behaviours using expert consensus according to established psychometric recommendations and evaluation of structure, content, feasibility, and usability of the Endovascular Observational Teamwork Assessment Tool (Endo-OTAS) by an expert multidisciplinary panel. Stage 4 included final team expert review of exemplars. OTAS core team behaviours were maintained (communication, coordination, cooperation, leadership team monitoring). Of the 114 OTAS behavioural exemplars, 19 were modified, four removed, and 39 additional endovascular-specific behaviours identified. Content validation of these 153 exemplar behaviours showed that 113/153 (73.9%) reached the predetermined Item-Content Validity Index rating for teamwork and/or patient safety. After expert team review, 140/153 (91.5%) exemplars were deemed to warrant inclusion in the tool. More than 90% of the expert panel agreed that Endo-OTAS is an appropriate teamwork assessment tool with observable behaviours. Some concerns were noted about the time required to conduct observations and provide performance feedback. Endo-OTAS is a novel teamwork assessment tool, with evidence for content validity and relevance to endovascular teams. Endo-OTAS enables systematic objective assessment of the quality of team performance during endovascular procedures. Copyright © 2016. Published by Elsevier Ltd.

  17. Patient-Reported Outcome Measures in Dysphagia: A Systematic Review of Instrument Development and Validation

    Science.gov (United States)

    Patel, Dhyanesh A.; Sharda, Rohit; Hovis, Kristen L.; Nichols, Erin E.; Sathe, Nila; Penson, David F.; Feurer, Irene D.; McPheeters, Melissa L.; Vaezi, Michael F.; Francis, David O.

    2017-01-01

    Objective Patient-reported outcome (PRO) measures are commonly used to capture patient experience with dysphagia and to evaluate treatment effectiveness. Inappropriate application can lead to distorted results in clinical studies. A systematic review of the literature on dysphagia-related PRO measures was performed to 1) identify all currently available measures and 2) to evaluate each for the presence of important measurement properties that would affect their applicability. Design MEDLINE via the PubMed interface, the Cumulative Index of Nursing and Allied Health Literature, and the Health and Psychosocial Instrument database were searched using relevant vocabulary terms and key terms related to PRO measures and dysphagia. Three independent investigators performed abstract and full text reviews. Each study meeting criteria was evaluated using an 18-item checklist developed a priori that assessed multiple domains: 1) conceptual model, 2) content validity, 3) reliability, 4) construct validity, 6) scoring and interpretation, and 7) burden and presentation. Results Of 4950 abstracts reviewed, a total of 34 dysphagia-related PRO measures (publication year 1987 – 2014) met criteria for extraction and analysis. Several PRO measures were of high quality (MADS for achalasia, SWAL-QOL and SSQ for oropharyngeal dysphagia, PROMIS-GI for general dysphagia, EORTC-QLQ-OG25 for esophageal cancer, ROMP-swallowing for Parkinson’s disease, DSQ-EoE for eosinophilic esophagitis, and SOAL for total laryngectomy-related dysphagia). In all, 17 met at least one criterion per domain. Thematic deficiencies in current measures were evident including: 1) direct patient involvement in content development, 2) empirically justified dimensionality, 3) demonstrable responsiveness to change, 4) plan for interpreting missing responses, and 5) literacy level assessment. Conclusion This is the first comprehensive systematic review assessing developmental properties of all available dysphagia

  18. Content validity and its estimation

    Directory of Open Access Journals (Sweden)

    Yaghmale F

    2003-04-01

    Full Text Available Background: Measuring content validity of instruments are important. This type of validity can help to ensure construct validity and give confidence to the readers and researchers about instruments. content validity refers to the degree that the instrument covers the content that it is supposed to measure. For content validity two judgments are necessary: the measurable extent of each item for defining the traits and the set of items that represents all aspects of the traits. Purpose: To develop a content valid scale for assessing experience with computer usage. Methods: First a review of 2 volumes of International Journal of Nursing Studies, was conducted with onlyI article out of 13 which documented content validity did so by a 4-point content validity index (CV! and the judgment of 3 experts. Then a scale with 38 items was developed. The experts were asked to rate each item based on relevance, clarity, simplicity and ambiguity on the four-point scale. Content Validity Index (CVI for each item was determined. Result: Of 38 items, those with CVIover 0.75 remained and the rest were discarded reSulting to 25-item scale. Conclusion: Although documenting content validity of an instrument may seem expensive in terms of time and human resources, its importance warrants greater attention when a valid assessment instrument is to be developed. Keywords: Content Validity, Measuring Content Validity

  19. The Validity of Subjective Performance Measures

    DEFF Research Database (Denmark)

    Meier, Kenneth J.; Winter, Søren C.; O'Toole, Laurence J.

    2015-01-01

    to provide, and are highly policy specific rendering generalization difficult. But are perceptual performance measures valid, and do they generate unbiased findings? We examine these questions in a comparative study of middle managers in schools in Texas and Denmark. The findings are remarkably similar...

  20. Validity and Reliability of Assessing Body Composition Using a Mobile Application.

    Science.gov (United States)

    Macdonald, Elizabeth Z; Vehrs, Pat R; Fellingham, Gilbert W; Eggett, Dennis; George, James D; Hager, Ronald

    2017-12-01

    The purpose of this study was to determine the validity and reliability of the LeanScreen (LS) mobile application that estimates percent body fat (%BF) using estimates of circumferences from photographs. The %BF of 148 weight-stable adults was estimated once using dual-energy x-ray absorptiometry (DXA). Each of two administrators assessed the %BF of each subject twice using the LS app and manually measured circumferences. A mixed-model ANOVA and Bland-Altman analyses were used to compare the estimates of %BF obtained from each method. Interrater and intrarater reliabilities values were determined using multiple measurements taken by each of the two administrators. The LS app and manually measured circumferences significantly underestimated (P < 0.05) the %BF determined using DXA by an average of -3.26 and -4.82 %BF, respectively. The LS app (6.99 %BF) and manually measured circumferences (6.76 %BF) had large limits of agreement. All interrater and intrarater reliability coefficients of estimates of %BF using the LS app and manually measured circumferences exceeded 0.99. The estimates of %BF from manually measured circumferences and the LS app were highly reliable. However, these field measures are not currently recommended for the assessment of body composition because of significant bias and large limits of agreements.

  1. The reliability and validity of fatigue measures during short-duration maximal-intensity intermittent cycling.

    Science.gov (United States)

    Glaister, Mark; Stone, Michael H; Stewart, Andrew M; Hughes, Michael; Moir, Gavin L

    2004-08-01

    The purpose of the present study was to assess the reliability and validity of fatigue measures, as derived from 4 separate formulae, during tests of repeat sprint ability. On separate days over a 3-week period, 2 groups of 7 recreationally active men completed 6 trials of 1 of 2 maximal (20 x 5 seconds) intermittent cycling tests with contrasting recovery periods (10 or 30 seconds). All trials were conducted on a friction-braked cycle ergometer, and fatigue scores were derived from measures of mean power output for each sprint. Apart from formula 1, which calculated fatigue from the percentage difference in mean power output between the first and last sprint, all remaining formulae produced fatigue scores that showed a reasonably good level of test-retest reliability in both intermittent test protocols (intraclass correlation range: 0.78-0.86; 95% likely range of true values: 0.54-0.97). Although between-protocol differences in the magnitude of the fatigue scores suggested good construct validity, within-protocol differences highlighted limitations with each formula. Overall, the results support the use of the percentage decrement score as the most valid and reliable measure of fatigue during brief maximal intermittent work.

  2. The cultural validation of two scales to assess social stigma in leprosy.

    Science.gov (United States)

    Peters, Ruth M H; Dadun; Van Brakel, Wim H; Zweekhorst, Marjolein B M; Damayanti, Rita; Bunders, Joske F G; Irwanto

    2014-01-01

    Stigma plays in an important role in the lives of persons affected by neglected tropical diseases, and assessment of stigma is important to document this. The aim of this study is to test the cross-cultural validity of the Community Stigma Scale (EMIC-CSS) and the Social Distance Scale (SDS) in the field of leprosy in Cirebon District, Indonesia. Cultural equivalence was tested by assessing the conceptual, item, semantic, operational and measurement equivalence of these instruments. A qualitative exploratory study was conducted to increase our understanding of the concept of stigma in Cirebon District. A process of translation, discussions, trainings and a pilot study followed. A sample of 259 community members was selected through convenience sampling and 67 repeated measures were obtained to assess the psychometric measurement properties. The aspects and items in the SDS and EMIC-CSS seem equally relevant and important in the target culture. The response scales were adapted to ensure that meaning is transferred accurately and no changes to the scale format (e.g. lay out, statements or questions) of both scales were made. A positive correlation was found between the EMIC-CSS and the SDS total scores (r=0.41). Cronbach's alphas of 0.83 and 0.87 were found for the EMIC-CSS and SDS. The exploratory factor analysis indicated for both scales an adequate fit as unidimensional scale. A standard error of measurement of 2.38 was found in the EMIC-CSS and of 1.78 in the SDS. The test-retest reliability coefficient was respectively, 0.84 and 0.75. No floor or ceiling effects were found. According to current international standards, our findings indicate that the EMIC-CSS and the SDS have adequate cultural validity to assess social stigma in leprosy in the Bahasa Indonesia-speaking population of Cirebon District. We believe the scales can be further improved, for instance, by adding, changing and rephrasing certain items. Finally, we provide suggestions for use with other

  3. The Cultural Validation of Two Scales to Assess Social Stigma in Leprosy

    Science.gov (United States)

    Peters, Ruth M. H.; Dadun; Van Brakel, Wim H.; Zweekhorst, Marjolein B. M.; Damayanti, Rita; Bunders, Joske F. G.; Irwanto

    2014-01-01

    Background Stigma plays in an important role in the lives of persons affected by neglected tropical diseases, and assessment of stigma is important to document this. The aim of this study is to test the cross-cultural validity of the Community Stigma Scale (EMIC-CSS) and the Social Distance Scale (SDS) in the field of leprosy in Cirebon District, Indonesia. Methodology/principle findings Cultural equivalence was tested by assessing the conceptual, item, semantic, operational and measurement equivalence of these instruments. A qualitative exploratory study was conducted to increase our understanding of the concept of stigma in Cirebon District. A process of translation, discussions, trainings and a pilot study followed. A sample of 259 community members was selected through convenience sampling and 67 repeated measures were obtained to assess the psychometric measurement properties. The aspects and items in the SDS and EMIC-CSS seem equally relevant and important in the target culture. The response scales were adapted to ensure that meaning is transferred accurately and no changes to the scale format (e.g. lay out, statements or questions) of both scales were made. A positive correlation was found between the EMIC-CSS and the SDS total scores (r = 0.41). Cronbach's alphas of 0.83 and 0.87 were found for the EMIC-CSS and SDS. The exploratory factor analysis indicated for both scales an adequate fit as unidimensional scale. A standard error of measurement of 2.38 was found in the EMIC-CSS and of 1.78 in the SDS. The test-retest reliability coefficient was respectively, 0.84 and 0.75. No floor or ceiling effects were found. Conclusions/significance According to current international standards, our findings indicate that the EMIC-CSS and the SDS have adequate cultural validity to assess social stigma in leprosy in the Bahasa Indonesia-speaking population of Cirebon District. We believe the scales can be further improved, for instance, by adding, changing and

  4. Validity of assessing child feeding with virtual reality.

    Science.gov (United States)

    Persky, Susan; Goldring, Megan R; Turner, Sara A; Cohen, Rachel W; Kistler, William D

    2018-04-01

    Assessment of parents' child feeding behavior is challenging, and there is need for additional methodological approaches. Virtual reality technology allows for the creation of behavioral measures, and its implementation overcomes several limitations of existing methods. This report evaluates the validity and usability of the Virtual Reality (VR) Buffet among a sample of 52 parents of children aged 3-7. Participants served a meal of pasta and apple juice in both a virtual setting and real-world setting (counterbalanced and separated by a distractor task). They then created another meal for their child, this time choosing from the full set of food options in the VR Buffet. Finally, participants completed a food estimation task followed by a questionnaire, which assessed their perceptions of the VR Buffet. Results revealed that the amount of virtual pasta served by parents correlated significantly with the amount of real pasta they served, r s  = 0.613, p < .0001, as did served amounts of virtual and real apple juice, r s  = 0.822, p < .0001. Furthermore, parents' perception of the calorie content of chosen foods was significantly correlated with observed calorie content (r s  = 0.438, p = .002), and parents agreed that they would feed the meal they created to their child (M = 4.43, SD = 0.82 on a 1-5 scale). The data presented here demonstrate that parent behavior in the VR Buffet is highly related to real-world behavior, and that the tool is well-rated by parents. Given the data presented and the potential benefits of the abundant behavioral data the VR Buffet can provide, we conclude that it is a valid and needed addition to the array of tools for assessing feeding behavior. Published by Elsevier Ltd.

  5. The Development and Validation of an Alternative Assessment to Measure Changes in Understanding of the Longleaf Pine Ecosystem

    Science.gov (United States)

    Dentzau, Michael W.; Martínez, Alejandro José Gallard

    2016-01-01

    A drawing assessment to gauge changes in fourth grade students' understanding of the essential components of the longleaf pine ecosystem was developed to support an out-of-school environmental education program. Pre- and post-attendance drawings were scored with a rubric that was determined to have content validity and reliability among users. In…

  6. Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

    Science.gov (United States)

    Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

    2014-09-04

    There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scales-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. The study findings suggest the IADCQ has appropriate psychometric characteristics as a

  7. Patient-Reported Measures of Narcolepsy: The Need for Better Assessment.

    Science.gov (United States)

    Kallweit, Ulf; Schmidt, Markus; Bassetti, Claudio L

    2017-05-15

    Narcolepsy, a chronic disorder of the central nervous system, is clinically characterized by a symptom pentad that includes excessive daytime sleepiness, cataplexy, sleep paralysis, hypnopompic/hypnagogic hallucinations, and disrupted nighttime sleep. Ideally, screening and diagnosis instruments that assist physicians in evaluating a patient for type 1 or type 2 narcolepsy would be brief, easy for patients to understand and physicians to score, and would identify or rule out the need for electrophysiological testing. A search of the literature was conducted to review patient-reported measures used for the assessment of narcolepsy, mainly in clinical trials, with the goal of summarizing existing scales and identifying areas that may require additional screening questions and clinical practice scales. Of the seven scales reviewed, the Epworth Sleepiness Scale continues to be an important outcome measure to screen adults for excessive daytime sleepiness, which may be associated with narcolepsy. Several narcolepsy-specific scales have demonstrated utility, such as the Ullanlinna Narcolepsy Scale, Swiss Narcolepsy Scale, and Narcolepsy Symptom Assessment Questionnaire, but further validation is required. Although the narcolepsy-specific scales currently in use may identify type 1 narcolepsy, there are no validated questionnaires to identify type 2 narcolepsy. Thus, there remains a need for short, easily understood, and well-validated instruments that can be readily used in clinical practice to distinguish narcolepsy subtypes, as well as other hypersomnias, and for assessing symptoms of these conditions during treatment. © 2017 American Academy of Sleep Medicine

  8. The reliability and validity of radiographic measurements for determining the three-dimensional position of the talus in varus and valgus osteoarthritic ankles

    Energy Technology Data Exchange (ETDEWEB)

    Nosewicz, Tomasz L. [Kantonsspital Liestal, Department of Orthopaedic Surgery and Traumatology, Liestal (Switzerland); Academic Medical Center, Department of Orthopaedic Surgery, Meibergdreef 9, AZ, Amsterdam (Netherlands); Knupp, Markus; Bolliger, Lilianna; Hintermann, Beat [Kantonsspital Liestal, Department of Orthopaedic Surgery and Traumatology, Liestal (Switzerland)

    2012-12-15

    To assess the most accurate radiographic method to determine talar three-dimensional position in varus and valgus osteoarthritic ankles, we evaluated the reliability and validity of different radiographic measurements. Nine radiographic measurements were performed blindly on weight-bearing mortise, sagittal, and horizontal radiographs of 33 varus and 33 valgus feet (63 patients). Intra- and interobserver reliability was determined with the intraclass coefficient (ICC). Discriminant validity of measurements between varus and valgus feet was assessed with effect size (ES). Convergent validity (Pearson's r) was evaluated by correlating measurements to the dichotomized varus and valgus groups. Obtained measurements in both groups were finally compared with each other and with 30 control feet. Reliability was excellent (ICC > 0.80) in all but two measurements. Whereas frontal plane validity was excellent (ES and r > 0.80), horizontal and sagittal measurements showed poor to moderate validity (ES and r between 0.00 and 0.60). Four measurements were significantly different among all groups (p < 0.05). Talar positional tendency was found towards dorsiflexion or endorotation in the varus group and towards plantarflexion or exorotation in the valgus group. The frontal tibiotalar surface angle, sagittal talocalcaneal inclination angle, and horizontal talometatarsal I angle showed the best reliability, validity, and difference among the groups. The frontal tibiotalar surface angle, sagittal talocalcaneal inclination angle, and horizontal talometatarsal I angle accurately determine talar three-dimensional radiographic position in weight-bearing varus and valgus osteoarthritic ankles. Careful radiographic evaluation is important, as these deformities affect talar position in all three planes. (orig.)

  9. The reliability and validity of radiographic measurements for determining the three-dimensional position of the talus in varus and valgus osteoarthritic ankles

    International Nuclear Information System (INIS)

    Nosewicz, Tomasz L.; Knupp, Markus; Bolliger, Lilianna; Hintermann, Beat

    2012-01-01

    To assess the most accurate radiographic method to determine talar three-dimensional position in varus and valgus osteoarthritic ankles, we evaluated the reliability and validity of different radiographic measurements. Nine radiographic measurements were performed blindly on weight-bearing mortise, sagittal, and horizontal radiographs of 33 varus and 33 valgus feet (63 patients). Intra- and interobserver reliability was determined with the intraclass coefficient (ICC). Discriminant validity of measurements between varus and valgus feet was assessed with effect size (ES). Convergent validity (Pearson's r) was evaluated by correlating measurements to the dichotomized varus and valgus groups. Obtained measurements in both groups were finally compared with each other and with 30 control feet. Reliability was excellent (ICC > 0.80) in all but two measurements. Whereas frontal plane validity was excellent (ES and r > 0.80), horizontal and sagittal measurements showed poor to moderate validity (ES and r between 0.00 and 0.60). Four measurements were significantly different among all groups (p < 0.05). Talar positional tendency was found towards dorsiflexion or endorotation in the varus group and towards plantarflexion or exorotation in the valgus group. The frontal tibiotalar surface angle, sagittal talocalcaneal inclination angle, and horizontal talometatarsal I angle showed the best reliability, validity, and difference among the groups. The frontal tibiotalar surface angle, sagittal talocalcaneal inclination angle, and horizontal talometatarsal I angle accurately determine talar three-dimensional radiographic position in weight-bearing varus and valgus osteoarthritic ankles. Careful radiographic evaluation is important, as these deformities affect talar position in all three planes. (orig.)

  10. The bogus taste test: Validity as a measure of laboratory food intake.

    Science.gov (United States)

    Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A; Kemps, Eva; Higgs, Suzanne; Jones, Andrew

    2017-09-01

    Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The 'bogus' taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food intake. We assessed whether the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. We examined construct validity by testing whether participant sex, hunger and liking of taste test food were associated with the amount of food consumed in the taste test. In addition, we also examined whether BMI (body mass index), trait measures of dietary restraint and over-eating in response to palatable food cues were associated with food consumption. Results indicated that the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. Factors that were reliably associated with increased consumption during the taste test were being male, have a higher baseline hunger, liking of the taste test food and a greater tendency to overeat in response to palatable food cues, whereas trait dietary restraint and BMI were not. These results indicate that the bogus taste test is likely to be a valid measure of food intake and can be used to identify factors that have a causal effect on food intake. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.

  11. Validation of Patient Reported Outcomes Measurement Information System (PROMIS) Computer Adaptive Tests (CATs) in the Surgical Treatment of Lumbar Spinal Stenosis.

    Science.gov (United States)

    Patel, Alpesh A; Dodwad, Shah-Nawaz M; Boody, Barrett S; Bhatt, Surabhi; Savage, Jason W; Hsu, Wellington K; Rothrock, Nan E

    2018-03-19

    Prospective, cohort study. Demonstrate validity of PROMIS physical function, pain interference, and pain behavior computer adaptive tests (CATs) in surgically treated lumbar stenosis patients. There has been increasing attention given to patient reported outcomes associated with spinal interventions. Historical patient outcome measures have inadequate validation, demonstrate floor/ceiling effects, and infrequently used due to time constraints. PROMIS is an adaptive, responsive NIH assessment tool that measures patient-reported health status. 98 consecutive patients were surgically treated for lumbar spinal stenosis and were assessed using PROMIS CATs, ODI, ZCQ and SF-12. Prior lumbar surgery, history of scoliosis, cancer, trauma, or infection were excluded. Completion time, preoperative assessment, 6 week and 3 month postoperative scores were collected. At baseline, 49%, 79%, and 81% of patients had PROMIS PB, PI, and PF scores greater than 1 SD worse than the general population. 50.6% were categorized as severely disabled, crippled, or bed bound by ODI. PROMIS CATs demonstrated convergent validity through moderate to high correlations with legacy measures (r = 0.35-0.73). PROMIS CATs demonstrated known groups validity when stratified by ODI levels of disability. ODI improvements of at least 10 points on average had changes in PROMIS scores in the expected direction (PI = -12.98, PB = -9.74, PF = 7.53). PROMIS CATs demonstrated comparable responsiveness to change when evaluated against legacy measures. PROMIS PB and PI decreased 6.66 and 9.62 and PROMIS PF increased 6.8 points between baseline and 3-months post-op (p validity, known groups validity, and responsiveness for surgically treated patients with lumbar stenosis to detect change over time and are more efficient than legacy instruments. 2.

  12. Measuring financial toxicity as a clinically relevant patient-reported outcome: The validation of the COmprehensive Score for financial Toxicity (COST).

    Science.gov (United States)

    de Souza, Jonas A; Yap, Bonnie J; Wroblewski, Kristen; Blinder, Victoria; Araújo, Fabiana S; Hlubocky, Fay J; Nicholas, Lauren H; O'Connor, Jeremy M; Brockstein, Bruce; Ratain, Mark J; Daugherty, Christopher K; Cella, David

    2017-02-01

    Cancer and its treatment lead to increased financial distress for patients. To the authors' knowledge, to date, no standardized patient-reported outcome measure has been validated to assess this distress. Patients with AJCC Stage IV solid tumors receiving chemotherapy for at least 2 months were recruited. Financial toxicity was measured by the COmprehensive Score for financial Toxicity (COST) measure. The authors collected data regarding patient characteristics, clinical trial participation, health care use, willingness to discuss costs, psychological distress (Brief Profile of Mood States [POMS]), and health-related quality of life (HRQOL) as measured by the Functional Assessment of Cancer Therapy: General (FACT-G) and the European Organization for Research and Treatment of Cancer (EORTC) QOL questionnaires. Test-retest reliability, internal consistency, and validity of the COST measure were assessed using standard-scale construction techniques. Associations between the resulting factors and other variables were assessed using multivariable analyses. A total of 375 patients with advanced cancer were approached, 233 of whom (62.1%) agreed to participate. The COST measure demonstrated high internal consistency and test-retest reliability. Factor analyses revealed a coherent, single, latent variable (financial toxicity). COST values were found to be correlated with income (correlation coefficient [r] = 0.28; Pfinancial toxicity were race (P = .04), employment status (Pcosts was not found to be associated with the degree of financial distress (P = .49). The COST measure demonstrated reliability and validity in measuring financial toxicity. Its correlation with HRQOL indicates that financial toxicity is a clinically relevant patient-centered outcome. Cancer 2017;123:476-484. © 2016 American Cancer Society. © 2016 The Authors. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.

  13. Assessing the empirical validity of alternative multi-attribute utility measures in the maternity context

    Directory of Open Access Journals (Sweden)

    Morrell Jane

    2009-05-01

    Full Text Available Abstract Background Multi-attribute utility measures are preference-based health-related quality of life measures that have been developed to inform economic evaluations of health care interventions. The objective of this study was to compare the empirical validity of two multi-attribute utility measures (EQ-5D and SF-6D based on hypothetical preferences in a large maternity population in England. Methods Women who participated in a randomised controlled trial of additional postnatal support provided by trained community support workers represented the study population for this investigation. The women were asked to complete the EQ-5D descriptive system (which defines health-related quality of life in terms of five dimensions: mobility, self care, usual activities, pain/discomfort and anxiety/depression and the SF-36 (which defines health-related quality of life, using 36 items, across eight dimensions: physical functioning, role limitations (physical, social functioning, bodily pain, general health, mental health, vitality and role limitations (emotional at six months postpartum. Their responses were converted into utility scores using the York A1 tariff set and the SF-6D utility algorithm, respectively. One-way analysis of variance was used to test the hypothetically-constructed preference rule that each set of utility scores differs significantly by self-reported health status (categorised as excellent, very good, good, fair or poor. The degree to which EQ-5D and SF-6D utility scores reflected alternative dichotomous configurations of self-reported health status and the Edinburgh Postnatal Depression Scale score was tested using the relative efficiency statistic and receiver operating characteristic (ROC curves. Results The mean utility score for the EQ-5D was 0.861 (95% CI: 0.844, 0.877, whilst the mean utility score for the SF-6D was 0.809 (95% CI: 0.796, 0.822, representing a mean difference in utility score of 0.052 (95% CI: 0.040, 0

  14. Validity of Devices That Assess Body Temperature During Outdoor Exercise in the Heat

    Science.gov (United States)

    Casa, Douglas J; Becker, Shannon M; Ganio, Matthew S; Brown, Christopher M; Yeargin, Susan W; Roti, Melissa W; Siegler, Jason; Blowers, Julie A; Glaviano, Neal R; Huggins, Robert A; Armstrong, Lawrence E; Maresh, Carl M

    2007-01-01

    Context: Rectal temperature is recommended by the National Athletic Trainers' Association as the criterion standard for recognizing exertional heat stroke, but other body sites commonly are used to measure temperature. Few authors have assessed the validity of the thermometers that measure body temperature at these sites in athletic settings. Objective: To assess the validity of commonly used temperature devices at various body sites during outdoor exercise in the heat. Design: Observational field study. Setting: Outdoor athletic facilities. Patients or Other Participants: Fifteen men and 10 women (age = 26.5 ± 5.3 years, height = 174.3 ± 11.1 cm, mass = 72.73 ± 15.95 kg, body fat = 16.2 ± 5.5%). Intervention(s): We simultaneously tested inexpensive and expensive devices orally and in the axillary region, along with measures of aural, gastrointestinal, forehead, temporal, and rectal temperatures. Temporal temperature was measured according to the instruction manual and a modified method observed in medical tents at local road races. We also measured forehead temperatures directly on the athletic field (other measures occurred in a covered pavilion) where solar radiation was greater. Rectal temperature was the criterion standard used to assess the validity of all other devices. Subjects' temperatures were measured before exercise, every 60 minutes during 180 minutes of exercise, and every 20 minutes for 60 minutes of postexercise recovery. Temperature devices were considered invalid if the mean bias (average difference between rectal temperature and device temperature) was greater than ±0.27°C (±0.5°F). Main Outcome Measure(s): Temperature from each device at each site and time point. Results: Mean bias for the following temperatures was greater than the allowed limit of ±0.27°C (±0.5°F): temperature obtained via expensive oral device (−1.20°C [−2.17°F]), inexpensive oral device (−1.67°C [−3.00°F]), expensive axillary device (−2.58°C [−4

  15. The web-buffet--development and validation of an online tool to measure food choice.

    Science.gov (United States)

    Bucher, Tamara; Keller, Carmen

    2015-08-01

    To date, no data exist on the agreement of food choice measured using an online tool with subsequent actual consumption. This needs to be shown before food choice, measured by means of an online tool, is used as a dependent variable to examine intake in the general population. A 'web-buffet' was developed to assess food choice. Choice was measured as planned meal composition from photographic material; respondents chose preferred foods and proportions for a main meal (out of a possible 144 combinations) online and the validity was assessed by comparison of a meal composed from a web-buffet with actual food intake 24-48 h later. Furthermore, correlations of food preferences, energy needs and health interest with meals chosen from the web-buffet were analysed. Students: n 106 (Study I), n 32 (Study II). Meals chosen from the web-buffet (mean = 2998 kJ, SD = 471 kJ) agreed with actual consumption (rs = 0.63, P choice in the web-buffet agrees sufficiently well with actual intake to measure food choice as a dependent variable in online surveys. However, we found an average underestimation of subsequent consumption. High correlations of preferences with chosen amounts and an inverse association of health interest with total energy further indicate the validity of the tool. Applications in behavioural nutrition research are discussed.

  16. Reliability and Validity of the Greek Migraine Disability Assessment (MIDAS) Questionnaire.

    Science.gov (United States)

    Oikonomidi, Theodora; Vikelis, Michail; Artemiadis, Artemios; Chrousos, George P; Darviri, Christina

    2018-03-01

    The Migraine Disability Assessment (MIDAS) Questionnaire is a reliable and valid instrument for migraine-related disability. Such a tool is needed to quantify migraine-related disability in the Greek population. This validation study aims to assess the test-retest reliability, internal consistency, item discriminant and convergent validity of the Greek translation of the MIDAS. Adults diagnosed with migraine completed the MIDAS Questionnaire on two occasions 3 weeks apart to assess reliability, and completed the RAND-36 to assess validity. Participants (n = 152) had a median MIDAS score of 24 and mostly severe disability (58% were grade IV). The test-retest reliability analysis (N = 59) revealed excellent reliability for the total score. Internal consistency was α = 0.71 for initial and α = 0.82 for retest completion. For item discriminant validity, the correlations between each question and the total score were significant, with high correlations for questions 2-5 (range 0.67 ≤ r ≤ 0.79; p MIDAS score tended to have better wellbeing. Psychometric properties are comparable with those of other published validation studies of the MIDAS and the original. Findings on question 1 show that missing work/school days may be closely related with increased affect issues. The Greek version of the MIDAS Questionnaire has good reliability and validity. This study allowed for cross-cultural comparability of research findings.

  17. Validity and Reliability of a Digital Inclinometer to Assess Knee Joint Position Sense in a Closed Kinetic Chain.

    Science.gov (United States)

    Romero-Franco, Natalia; Montaño-Munuera, Juan Antonio; Jiménez-Reyes, Pedro

    2017-01-01

    Knee joint position sense (JPS) is a key parameter for optimum performance in many sports but is frequently negatively affected by injuries and/or fatigue during training sessions. Although evaluation of JPS may provide key information to reduce the risk of injury, it often requires expensive and/or complex tools that make monitoring proprioceptive deterioration difficult. To analyze the validity and reliability of a digital inclinometer to measure knee JPS in a closed kinetic chain (CKC). The validity and intertester and intratester reliability of a digital inclinometer for measuring knee JPS were assessed. Biomechanics laboratory. 10 athletes (5 men and 5 women; 26.2 ± 1.3 y, 71.7 ± 12.4 kg; 1.75 ± 0.09 m; 23.5 ± 3.9 kg/m 2 ). Knee JPS was measured in a CKC. Absolute angular error (AAE) of knee JPS in a CKC. Intraclass correlation coefficient (ICC) and standard error of the mean (SEM) were calculated to determine the validity and reliability of the inclinometer. Data showed that the inclinometer had a high level of validity compared with an isokinetic dynamometer (ICC = 1.0, SEM = 1.39, p AutoCAD video analysis, inclinometer validity was very high (ICC = 0.980, SEM = 3.46, p < 0.001) for measuring AAE during knee JPS in a CKC. In addition, the intertester reliability of the inclinometer for obtaining AAE was very high (ICC = .994, SEM = 1.67, p < 0.001). The inclinometer provides a valid and reliable method for assessing knee JPS in a CKC. Health and sports professionals could take advantage of this tool to monitor proprioceptive deterioration in athletes.

  18. Reliability, validity, and minimal detectable change of the push-off test scores in assessing upper extremity weight-bearing ability.

    Science.gov (United States)

    Mehta, Saurabh P; George, Hannah R; Goering, Christian A; Shafer, Danielle R; Koester, Alan; Novotny, Steven

    2017-11-01

    Clinical measurement study. The push-off test (POT) was recently conceived and found to be reliable and valid for assessing weight bearing through injured wrist or elbow. However, further research with larger sample can lend credence to the preliminary findings supporting the use of the POT. This study examined the interrater reliability, construct validity, and measurement error for the POT in patients with wrist conditions. Participants with musculoskeletal (MSK) wrist conditions were recruited. The performance on the POT, grip isometric strength of wrist extensors was assessed. The shortened version of the Disabilities of the Arm, Shoulder and Hand and numeric pain rating scale were completed. The intraclass correlation coefficient assessed interrater reliability of the POT. Pearson correlation coefficients (r) examined the concurrent relationships between the POT and other measures. The standard error of measurement and the minimal detectable change at 90% confidence interval were assessed as measurement error and index of true change for the POT. A total of 50 participants with different elbow or wrist conditions (age: 48.1 ± 16.6 years) were included in this study. The results of this study strongly supported the interrater reliability (intraclass correlation coefficient: 0.96 and 0.93 for the affected and unaffected sides, respectively) of the POT in patients with wrist MSK conditions. The POT showed convergent relationships with the grip strength on the injured side (r = 0.89) and the wrist extensor strength (r = 0.7). The POT showed smaller standard error of measurement (1.9 kg). The minimal detectable change at 90% confidence interval for the POT was 4.4 kg for the sample. This study provides additional evidence to support the reliability and validity of the POT. This is the first study that provides the values for the measurement error and true change on the POT scores in patients with wrist MSK conditions. Further research should examine the

  19. The validity and reliability of the Socioeconomic Status Instrument for assessing prostate cancer patients.

    Science.gov (United States)

    Cyrus-David, Mfon

    2010-08-01

    Because of the lack of consistency in the associations of the socioeconomic status (SES) of prostate cancer (PC) patients from diverse racial and ethnic backgrounds with PC health outcomes, I created the Socioeconomic Status Instrument (SESI) from the Demographic and Health Access components of the Behavioral Risk Factor Surveillance System 2004 Questionnaires and the socioeconomic indices of the subjects' residential counties to better assess the SES of PC patients. The SESI was tested on 220 consecutive subjects with pathologically confirmed PC at the Veterans Affairs Medical Center in Houston, TX. A team that included an epidemiologist, a validation statistician/health services research scientist, and PC survivors assessed the content validity of the SESI. The construct validity of the SESI was assessed with factor analysis by extracting and analyzing 5 principal components based on the subjects' individual responses on the assessment: county socioeconomic characteristics, individual socioeconomic characteristics, financial distress, increased domestic burden with limited earnings, and affluence. The internal consistency reliability of the SESI was assessed with Cronbach's alpha coefficients. Based on the reviews of the SESI, all of the initial 10 items were retained. The correlations between individual responses on the SESI were similar to the results of previous studies. The 5 principal components that I assessed accounted for 71.5% of the variance. Factor loadings ranged from 0.66 to 0.98 and communalities ranged from 0.55 to 0.94. County socioeconomic characteristics accounted for 22.6% of the variance, whereas individual socioeconomic characteristics accounted for 14.6% of the variance. The overall Cronbach's alpha coefficient was 0.78. The SESI is valid and reliable. Accurate measurements of the SES of PC patients would provide better guidance for future research and care deliveries.

  20. Design, validation, and reliability of survey to measure female athlete triad knowledge among coaches

    Directory of Open Access Journals (Sweden)

    Jillian E. Frideres

    2015-06-01

    Full Text Available The purpose of this study was to design and to test the validity and reliability of an instrument to evaluate coaches' knowledge about the female athlete triad syndrome and their confidence in this knowledge. The instrument collects information regarding: knowledge of the syndrome, components, prevention and intervention; confidence of the coaches in their answers; and coach's characteristics (gender, degree held, years of experience in coaching females, continuing education participation specific to the syndrome and its components, and sport coached. The process of designing the questionnaire and testing the validity and reliability of it was done in four phases: a design and development of the instrument, b content validity, c instrument reliability, and d concurrent validity. The results show that the instrument is suitable for measuring coaches' female athlete triad knowledge. The instrument can contribute to assessing the coaches' knowledge level in relation to this topic.

  1. Validity and reliability of eating disorder assessments used with athletes: A review

    Directory of Open Access Journals (Sweden)

    Zachary Pope

    2015-09-01

    Conclusion: Only seven studies calculated validity coefficients within the study whereas 47 cited the validity coefficient. Twenty-six calculated a reliability coefficient whereas 47 cited the reliability of the ED measures. Four studies found validity evidence for the EAT, EDI, BULIT-R, QEDD, and EDE-Q in an athlete population. Few studies reviewed calculated validity and reliability coefficients of ED measures. Cross-validation of these measures in athlete populations is clearly needed.

  2. Assessment of Young English Language Learners in Arizona: Questioning the Validity of the State Measure of English Proficiency

    Science.gov (United States)

    Garcia, Eugene E.; Lawton, Kerry; Diniz de Figueiredo, Eduardo H.

    2010-01-01

    This study analyzes the Arizona policy of utilizing a single assessment of English proficiency to determine if students should be exited from the ELL program, which is ostensibly designed to make it possible for them to succeed in the mainstream classroom without any further language support. The study examines the predictive validity of this…

  3. Measuring patient activation in the Netherlands: translation and validation of the American short form Patient Activation Measure (PAM13

    Directory of Open Access Journals (Sweden)

    Rademakers Jany

    2012-07-01

    Full Text Available Abstract Background The American short form Patient Activation Measure (PAM is a 13-item instrument which assesses patient (or consumer self-reported knowledge, skills and confidence for self-management of one’s health or chronic condition. In this study the PAM was translated into a Dutch version; psychometric properties of the Dutch version were established and the instrument was validated in a panel of chronically ill patients. Methods The translation was done according to WHO guidelines. The PAM 13-Dutch was sent to 4178 members of the Dutch National Panel of people with Chronic illness or Disability (NPCD in April 2010 (study A and again to a sub sample of this group (N = 973 in June 2010 (study B. Internal consistency, test-retest reliability and cross-validation with the SBSQ-D (a measure for Health literacy were computed. The Dutch results were compared to similar Danish and American data. Results The psychometric properties of the PAM 13-Dutch were generally good. The level of internal consistency is good (α = 0.88 and item-rest correlations are moderate to strong. The Dutch mean PAM score (61.3 is comparable to the American (61.9 and lower than the Danish (64.2. The test-retest reliability was moderate. The association with Health literacy was weak to moderate. Conclusions The PAM-13 Dutch is a reliable instrument to measure patient activation. More research is needed into the validity of the Patient Activation Measure, especially with respect to a more comprehensive measure of Health literacy.

  4. Validation of the MOS Social Support Survey 6-item (MOS-SSS-6) measure with two large population-based samples of Australian women.

    Science.gov (United States)

    Holden, Libby; Lee, Christina; Hockey, Richard; Ware, Robert S; Dobson, Annette J

    2014-12-01

    This study aimed to validate a 6-item 1-factor global measure of social support developed from the Medical Outcomes Study Social Support Survey (MOS-SSS) for use in large epidemiological studies. Data were obtained from two large population-based samples of participants in the Australian Longitudinal Study on Women's Health. The two cohorts were aged 53-58 and 28-33 years at data collection (N = 10,616 and 8,977, respectively). Items selected for the 6-item 1-factor measure were derived from the factor structure obtained from unpublished work using an earlier wave of data from one of these cohorts. Descriptive statistics, including polychoric correlations, were used to describe the abbreviated scale. Cronbach's alpha was used to assess internal consistency and confirmatory factor analysis to assess scale validity. Concurrent validity was assessed using correlations between the new 6-item version and established 19-item version, and other concurrent variables. In both cohorts, the new 6-item 1-factor measure showed strong internal consistency and scale reliability. It had excellent goodness-of-fit indices, similar to those of the established 19-item measure. Both versions correlated similarly with concurrent measures. The 6-item 1-factor MOS-SSS measures global functional social support with fewer items than the established 19-item measure.

  5. Measuring the Value of New Drugs: Validity and Reliability of 4 Value Assessment Frameworks in the Oncology Setting.

    Science.gov (United States)

    Bentley, Tanya G K; Cohen, Joshua T; Elkin, Elena B; Huynh, Julie; Mukherjea, Arnab; Neville, Thanh H; Mei, Matthew; Copher, Ronda; Knoth, Russell; Popescu, Ioana; Lee, Jackie; Zambrano, Jenelle M; Broder, Michael S

    2017-06-01

    Several organizations have developed frameworks to systematically assess the value of new drugs. To evaluate the convergent validity and interrater reliability of 4 value frameworks to understand the extent to which these tools can facilitate value-based treatment decisions in oncology. Eight panelists used the American Society of Clinical Oncology (ASCO), European Society for Medical Oncology (ESMO), Institute for Clinical and Economic Review (ICER), and National Comprehensive Cancer Network (NCCN) frameworks to conduct value assessments of 15 drugs for advanced lung and breast cancers and castration-refractory prostate cancer. Panelists received instructions and published clinical data required to complete the assessments, assigning each drug a numeric or letter score. Kendall's Coefficient of Concordance for Ranks (Kendall's W) was used to measure convergent validity by cancer type among the 4 frameworks. Intraclass correlation coefficients (ICCs) were used to measure interrater reliability for each framework across cancers. Panelists were surveyed on their experiences. Kendall's W across all 4 frameworks for breast, lung, and prostate cancer drugs was 0.560 (P= 0.010), 0.562 (P = 0.010), and 0.920 (P fair to excellent, increasing with clinical benefit subdomain concordance and simplicity of drug trial data. Interrater reliability, highest for ASCO and ESMO, improved with clarity of instructions and specificity of score definitions. Continued use, analyses, and refinements of these frameworks will bring us closer to the ultimate goal of using value-based treatment decisions to improve patient care and outcomes. This work was funded by Eisai Inc. Copher and Knoth are employees of Eisai Inc. Bentley, Lee, Zambrano, and Broder are employees of Partnership for Health Analytic Research, a health services research company paid by Eisai Inc. to conduct this research. For this study, Cohen, Huynh, and Neville report fees from Partnership for Health Analytic Research

  6. Validation of an instrumented dummy to assess mechanical aspects of discomfort during load carriage.

    Directory of Open Access Journals (Sweden)

    Patrick D Wettenschwiler

    Full Text Available Due to the increasing load in backpacks and other load carriage systems over the last decades, load carriage system designs have to be adapted accordingly to minimize discomfort and to reduce the risk of injury. As subject studies are labor-intensive and include further challenges such as intra-subject and inter-subject variability, we aimed to validate an instrumented dummy as an objective laboratory tool to assess the mechanical aspects of discomfort. The validation of the instrumented dummy was conducted by comparison with a recent subject study. The mechanical parameters that characterize the static and dynamic interaction between backpack and body during different backpack settings were compared. The second aim was to investigate whether high predictive power (coefficient of determination R2>0.5 in assessing the discomfort of load carriage systems could be reached using the instrumented dummy. Measurements were conducted under static conditions, simulating upright standing, and dynamic conditions, simulating level walking. Twelve different configurations of a typical load carriage system, a commercially available backpack with a hip belt, were assessed. The mechanical parameters were measured in the shoulder and the hip region of the dummy and consisted of average pressure, peak pressure, strap force and relative motion between the system and the body. The twelve configurations consisted of three different weights (15kg, 20kg, and 25kg, combined with four different hip belt tensions (30N, 60N, 90N, and 120N. Through the significant (p<0.05 correlation of the mechanical parameters measured on the dummy with the corresponding values of the subject study, the dummy was validated for all static measurements and for dynamic measurements in the hip region to accurately simulate the interaction between the human body and the load carriage system. Multiple linear regressions with the mechanical parameters measured on the dummy as independent

  7. Validity of the patient-reported Clinical Global Impression of Change as a measure of treatment response in men with premature ejaculation.

    Science.gov (United States)

    Althof, Stanley E; Brock, Gerald B; Rosen, Raymond C; Rowland, David L; Aquilina, Joseph W; Rothman, Margaret; Tesfaye, Fisseha; Bull, Scott

    2010-06-01

    The Clinical Global Impression of Change (CGIC) measures have high utility in clinical practice. However, it is unknown whether the CGIC is valued for assessing premature ejaculation (PE) symptoms and/or the relationship between CGIC and other validated PE patient-reported measures. The study aims to assess the validity of the patient-reported CGIC measure in men with PE and to examine the relationship between CGIC ratings and assessments of control, satisfaction, personal distress, and interpersonal difficulty. Data from a randomized, double-blind, 24-week phase 3 trial in 1,162 men with PE who received dapoxetine (30 mg or 60 mg) or placebo on demand provided the basis for the analysis. Patients were ≥18 years, in a stable monogamous relationship for ≥6 months, met the Diagnostic and Statistical Manual of Mental Disorders-Fourth Edition-Text Revision criteria for PE for ≥6 months, and had an intravaginal ejaculatory latency time (IELT) ≤2 minutes in ≥75% of intercourse episodes. The CGIC asked patients to rate improvement or worsening of their PE compared with the start of the study using a 7-point response scale; other patient-reported measures were control over ejaculation, satisfaction with sexual intercourse, interpersonal difficulty, and personal distress related to ejaculation. Stopwatch-measured IELT was recorded. Associations between CGIC and change in other measures at study end point were assessed. The magnitude of IELT increased for each category of improvement on the CGIC: 1.63, 4.03, and 7.15 minutes for slightly better, better, and much better, respectively. Higher CGIC ratings were correlated with greater improvement in control (r = 0.73), satisfaction (r = 0.62), greater reduction in distress (r = -0.52), and interpersonal difficulty (r = -0.39). Total variance accounted for was 57.4%: control (48.7%), satisfaction (4.5%), IELT (2.8%), and distress (1.15%). The analyses support the validity of the CGIC measure in men with PE. The CGIC

  8. A concise, content valid, gender invariant measure of workplace incivility.

    Science.gov (United States)

    Matthews, Russell A; Ritter, Kelsey-Jo

    2016-07-01

    The authors present a short, valid, gender invariant measure of workplace incivility that should have a high degree of utility in a variety of research designs, especially those concerned with reducing participant burden such as experience sampling and multiwave longitudinal designs. Given ongoing concerns about the psychometric properties of workplace mistreatment constructs, they validated a 4-item measure of experienced incivility based on series of 3 independent field studies (N = 2,636). In addition to retaining items on the basis of employee rated conceptual alignment (i.e., judgmental criteria) with a standard incivility definition (i.e., ambiguous intent to harm), items were also chosen based on external criteria in terms of their ability to explain incremental variance in outcomes of interest (e.g., role overload, interpersonal deviance). Items with large systematic relationships with other mistreatment constructs (i.e., abusive supervision, supervisor undermining) were excluded. In turn, the authors demonstrated that the 4-item measure is gender invariant, a critical issue that has received limited attention in the literature to date. They also experimentally investigated the effect of recall window (2 weeks, 1 month, 1 year) and found a differential pattern of effect sizes for various outcomes of interest. A fourth independent field study was conducted as a practical application of the measure within a longitudinal framework. An autoregressive model examining experienced incivility and counterproductive work behaviors was tested. Data was collected from a sample of 278 respondents at 3 time points with 1 month between assessments. Implications of these findings are discussed. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  9. DMM assessments of attachment and adaptation: Procedures, validity and utility.

    Science.gov (United States)

    Farnfield, Steve; Hautamäki, Airi; Nørbech, Peder; Sahhar, Nicola

    2010-07-01

    This article gives a brief over view of the Dynamic-Maturational Model of attachment and adaptation (DMM; Crittenden, 2008) together with the various DMM assessments of attachment that have been developed for specific stages of development. Each assessment is discussed in terms of procedure, outcomes, validity, advantages and limitations, comparable procedures and areas for further research and validation. The aims are twofold: to provide an introduction to DMM theory and its application that underlie the articles in this issue of CCPP; and to provide researchers and clinicians with a guide to DMM assessments.

  10. Validity and reliability of a low-cost digital dynamometer for measuring isometric strength of lower limb.

    Science.gov (United States)

    Romero-Franco, Natalia; Jiménez-Reyes, Pedro; Montaño-Munuera, Juan A

    2017-11-01

    Lower limb isometric strength is a key parameter to monitor the training process or recognise muscle weakness and injury risk. However, valid and reliable methods to evaluate it often require high-cost tools. The aim of this study was to analyse the concurrent validity and reliability of a low-cost digital dynamometer for measuring isometric strength in lower limb. Eleven physically active and healthy participants performed maximal isometric strength for: flexion and extension of ankle, flexion and extension of knee, flexion, extension, adduction, abduction, internal and external rotation of hip. Data obtained by the digital dynamometer were compared with the isokinetic dynamometer to examine its concurrent validity. Data obtained by the digital dynamometer from 2 different evaluators and 2 different sessions were compared to examine its inter-rater and intra-rater reliability. Intra-class correlation (ICC) for validity was excellent in every movement (ICC > 0.9). Intra and inter-tester reliability was excellent for all the movements assessed (ICC > 0.75). The low-cost digital dynamometer demonstrated strong concurrent validity and excellent intra and inter-tester reliability for assessing isometric strength in the main lower limb movements.

  11. Broadband IR Measurements for Modis Validation

    Science.gov (United States)

    Jessup, Andrew T.

    2003-01-01

    The primary objective of this research was the development and deployment of autonomous shipboard systems for infrared measurement of ocean surface skin temperature (SST). The focus was on demonstrating long-term, all-weather capability and supplying calibrated skin SST to the MODIS Ocean Science Team (MOCEAN). A secondary objective was to investigate and account for environmental factors that affect in situ measurements of SST for validation of satellite products. We developed and extensively deployed the Calibrated, InfraRed, In situ Measurement System, or CIRIMS, for at-sea validation of satellite-derived SST. The design goals included autonomous operation at sea for up to 6 months and an accuracy of +/- 0.1 C. We used commercially available infrared pyrometers and a precision blackbody housed in a temperature-controlled enclosure. The sensors are calibrated at regular interval using a cylindro-cone target immersed in a temperature-controlled water bath, which allows the calibration points to follow the ocean surface temperature. An upward-looking pyrometer measures sky radiance in order to correct for the non-unity emissivity of water, which can introduce an error of up to 0.5 C. One of the most challenging aspects of the design was protection against the marine environment. A wide range of design strategies to provide accurate, all-weather measurements were investigated. The CIRIMS uses an infrared transparent window to completely protect the sensor and calibration blackbody from the marine environment. In order to evaluate the performance of this approach, the design incorporates the ability to make measurements with and without the window in the optical path.

  12. The Assessment of reliability and validity of Persian Version of the Endometriosis Health Profile (EHP-30

    Directory of Open Access Journals (Sweden)

    Marzieh Nojomi

    2011-06-01

    Full Text Available Background: The Endometriosis Health Profile-30 (EHP-30 is a disease-specific questionnaire to measure the health-related quality of life in patients with endometriosis. The aim of this study was to evaluate the validity and reliability of the Persian version of Endometriosis Health Profile (EHP-30 in women with endometriosis referring to three Gynecology Clinics in Tehran, Iran. Methods: One hundred women (20 to 50 years old with surgically confirmed endometriosis recruited from three outpatient Gynecology Clinics affiliated to the Iran University of Medical Sciences. All 100 patients were asked to complete EHP-30 questionnaire while referring to the Clinics. The findings were analyzed using descriptive statistics, internal reliability consistency, construct validity (using short form-36, which had already been validated in Iran, factor analysis (with principle component analysis method, and item total correlation to assess the validity and reliability of the questionnaire. Results: The internal consistency reliability of the questionnaire was high (Cronbach’s α ranged between 0.80 and 0.93 for core, and 0.78 and 0.90 for modular parts. All items were loaded on their own factors except item 17 (feeling aggressive or violent and item 18 (feeling unwell, which were loaded on pain and social support domains, respectively. Construct validity of EHP-30, established by using SF-36, indicates good correlations in several similar scales of these two questionnaires. Conclusion: The findings of the study demonstrate that Persian version of EHP-30 is a valid and reliable measure to assess the quality of life in women with endometriosis

  13. Validity of hip-mounted uniaxial accelerometry with heart-rate monitoring vs. triaxial accelerometry in the assessment of free-living energy expenditure in young children: the IDEFICS Validation Study.

    Science.gov (United States)

    Ojiambo, Robert; Konstabel, Kenn; Veidebaum, Toomas; Reilly, John; Verbestel, Vera; Huybrechts, Inge; Sioen, Isabelle; Casajús, José A; Moreno, Luis A; Vicente-Rodriguez, German; Bammann, Karin; Tubic, Bojan M; Marild, Staffan; Westerterp, Klaas; Pitsiladis, Yannis P

    2012-11-01

    One of the aims of Identification and Prevention of Dietary- and Lifestyle-Induced Health Effects in Children and Infants (IDEFICS) validation study is to validate field measures of physical activity (PA) and energy expenditure (EE) in young children. This study compared the validity of uniaxial accelerometry with heart-rate (HR) monitoring vs. triaxial accelerometry against doubly labeled water (DLW) criterion method for assessment of free-living EE in young children. Forty-nine European children (25 female, 24 male) aged 4-10 yr (mean age: 6.9 ± 1.5 yr) were assessed by uniaxial ActiTrainer with HR, uniaxial 3DNX, and triaxial 3DNX accelerometry. Total energy expenditure (TEE) was estimated using DLW over a 1-wk period. The longitudinal axis of both devices and triaxial 3DNX counts per minute (CPM) were significantly (P hip-mounted uniaxial and triaxial accelerometers for assessing PA and EE is similar.

  14. Reliable and Valid Assessment of Point-of-care Ultrasonography

    DEFF Research Database (Denmark)

    Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt

    2015-01-01

    physicians' OSAUS scores with diagnostic accuracy. RESULTS: The generalizability coefficient was high (0.81) and a D-study demonstrated that 1 assessor and 5 cases would result in similar reliability. The construct validity of the OSAUS scale was supported by a significant difference in the mean scores......OBJECTIVE: To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. BACKGROUND: POC US is increasingly used by clinicians and is an essential part of the management of acute surgical...... conditions. However, the quality of performance is highly operator-dependent. Therefore, reliable and valid assessment of trainees' ultrasonography competence is needed to ensure patient safety. METHODS: Twenty-four physicians, representing novices, intermediates, and experts in POC US, scanned 4 different...

  15. Toward a Measure of Accountability in Nursing: A Three-Stage Validation Study.

    Science.gov (United States)

    Drach-Zahavy, Anat; Leonenko, Marina; Srulovici, Einav

    2018-06-04

    To develop and psychometrically evaluate a three-dimensional questionnaire suitable for evaluating personal and organizational accountability in nurses. Accountability is defined as a three-dimensional value, directing professionals to take responsibility for their decisions and actions, to be willing to explain them (transparency) and to be judged according to society's accepted values (answerability). Despite the relatively clear definition, measurement of accountability lags well behind. Existing self-report questionnaires do not fully capture the complexity of the concept; nor do they capture the different sources of accountability (e.g., personal accountability, organizational accountability). A three-stage measure development. Data were collected during 2015-2016. In Phase 1, an initial database of items (N = 74) was developed, based on literature review and qualitative study, establishing face and content validity. In Phase 2, the face, content, construct and criterion-related validity of the initial questionnaires (19 items for personal and organizational accountability questionnaire) was established with a sample of 229 nurses. In Phase 3, the final questionnaires (19 items each) were validated with a new sample of 329 nurses and established construct validity. The final version of the instruments comprised 19 items, suitable for assessing personal and organizational accountability. The questionnaire referred to the dimensions of responsibility, transparency and answerability. The findings established the instrument's content, construct and criterion-related validity, as well as good internal reliability. The questionnaire portrays accountability in nursing, by capturing nurses' subjective perceptions of accountability dimensions (responsibility, transparency, answerability), as demonstrated by personal and organizational values. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  16. Translation, adaptation and validation of the American short form Patient Activation Measure (PAM13) in a Danish version.

    Science.gov (United States)

    Maindal, Helle Terkildsen; Sokolowski, Ineta; Vedsted, Peter

    2009-06-29

    The Patient Activation Measure (PAM) is a measure that assesses patient knowledge, skill, and confidence for self-management. This study validates the Danish translation of the 13-item Patient Activation Measure (PAM13) in a Danish population with dysglycaemia. 358 people with screen-detected dysglycaemia participating in a primary care health education study responded to PAM13. The PAM13 was translated into Danish by a standardised forward-backward translation. Data quality was assessed by mean, median, item response, missing values, floor and ceiling effects, internal consistency (Cronbach's alpha and average inter-item correlation) and item-rest correlations. Scale properties were assessed by Rasch Rating Scale models. The item response was high with a small number of missing values (0.8-4.2%). Floor effect was small (range 0.6-3.6%), but the ceiling effect was above 15% for all items (range 18.6-62.7%). The alpha-coefficient was 0.89 and the average inter-item correlation 0.38. The Danish version formed a unidimensional, probabilistic Guttman-like scale explaining 43.2% of the variance. We did however, find a different item sequence compared to the original scale. A Danish version of PAM13 with acceptable validity and reliability is now available. Further development should focus on single items, response categories in relation to ceiling effects and further validation of reproducibility and responsiveness.

  17. Creating and validating GIS measures of urban design for health research.

    Science.gov (United States)

    Purciel, Marnie; Neckerman, Kathryn M; Lovasi, Gina S; Quinn, James W; Weiss, Christopher; Bader, Michael D M; Ewing, Reid; Rundle, Andrew

    2009-12-01

    Studies relating urban design to health have been impeded by the unfeasibility of conducting field observations across large areas and the lack of validated objective measures of urban design. This study describes measures for five dimensions of urban design - imageability, enclosure, human scale, transparency, and complexity - created using public geographic information systems (GIS) data from the US Census and city and state government. GIS measures were validated for a sample of 588 New York City block faces using a well-documented field observation protocol. Correlations between GIS and observed measures ranged from 0.28 to 0.89. Results show valid urban design measures can be constructed from digital sources.

  18. Modelling the pre-assessment learning effects of assessment: evidence in the validity chain.

    Science.gov (United States)

    Cilliers, Francois J; Schuwirth, Lambert W T; van der Vleuten, Cees P M

    2012-11-01

    We previously developed a model of the pre-assessment learning effects of consequential assessment and started to validate it. The model comprises assessment factors, mechanism factors and learning effects. The purpose of this study was to continue the validation process. For stringency, we focused on a subset of assessment factor-learning effect associations that featured least commonly in a baseline qualitative study. Our aims were to determine whether these uncommon associations were operational in a broader but similar population to that in which the model was initially derived. A cross-sectional survey of 361 senior medical students at one medical school was undertaken using a purpose-made questionnaire based on a grounded theory and comprising pairs of written situational tests. In each pair, the manifestation of an assessment factor was varied. The frequencies at which learning effects were selected were compared for each item pair, using an adjusted alpha to assign significance. The frequencies at which mechanism factors were selected were calculated. There were significant differences in the learning effect selected between the two scenarios of an item pair for 13 of this subset of 21 uncommon associations, even when a p-value of value. For a subset of uncommon associations in the model, the role of most assessment factor-learning effect associations and the mechanism factors involved were supported in a broader but similar population to that in which the model was derived. Although model validation is an ongoing process, these results move the model one step closer to the stage of usefully informing interventions. Results illustrate how factors not typically included in studies of the learning effects of assessment could confound the results of interventions aimed at using assessment to influence learning. © Blackwell Publishing Ltd 2012.

  19. Measuring the suffering of end-stage dementia: reliability and validity of the Mini-Suffering State Examination.

    Science.gov (United States)

    Aminoff, Bechor Z; Purits, Elena; Noy, Shlomo; Adunsky, Abraham

    2004-01-01

    Assessment of suffering is extremely important in dying end-stage dementia patients (ESDP). We have developed and examined the reliability and validity of the Mini-Suffering State Examination (MSSE), in 103 consecutive bedridden ESDP. Main outcome measures included inter-observer reliability and concurrent validity. Reliability of the MSSE questionnaire was satisfactory, with Cronbach alpha values of 0.735 and 0.718 for the two physicians (Ph-1, Ph-2), respectively. The kappa agreement coefficient was 0.791. There was a high agreement for seven items (kappa 0.882-0.972) and a substantial agreement for the other three items (kappa 0.621-0.682) of the MSSE. MSSE was validated versus the comfort assessment in dying with dementia (CAD-EOLD) scale and resulted in a significant Pearson correlation (r=-0.796, P<0.001). We conclude that the MSSE scale is a reliable and valid clinical tool, recommended for evaluating the severity of the patient's condition and the level of suffering of ESDP. Use of MSSE may improve medical management and facilitate communication between patients and caregivers.

  20. Validation of Navigation Ultrasound for Clavicular Length Measurement

    DEFF Research Database (Denmark)

    Høj, Anders Thorsmark; Villa, Chiara; Christensen, Ole M.

    2017-01-01

    interval): approximately ± 7.5 mm, Pearson's correlation R: 0.948-0.974). Navigation ultrasound can measure clavicular length with an intra-rater reliability matching that of 3-D rendered computed tomography scans and with high validity. Its use could spread to other fields requiring accurate...... of 52.5 (range: 21-78 y) were included. Navigation ultrasound exhibited high reliability (intra-class correlation coefficient: 0.942-0.997, standard error of the mean: 0.7-2.9 mm, minimal detectable change: 2.3-8.1 mm) and validity (measurement error: 1.3%-1.8%, limits of agreement (95% confidence...

  1. Validation of the Portuguese version of the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS-P) in clinical and non-clinical samples.

    Science.gov (United States)

    Curcio, Cristiane Schumann Silva; Lucchetti, Giancarlo; Moreira-Almeida, Alexander

    2015-04-01

    Despite Brazil's high levels of religious involvement, there is a scarcity of validated religiousness/spirituality (R/S) measures in Portuguese, particularly multidimensional ones. This study presents the validation of the Portuguese version of the "Brief Multidimensional Measure in Religiousness and Spirituality" (BMMRS) within the Brazilian context. Inpatients (262) and caregivers (389) at two hospitals of Brazil answered the BMMRS, the DUREL-p, and a sociodemographic questionnaire. The internal and convergent validity and test-retest reliability for major dimensions were good. Discriminant validity was high (except for the Forgiveness dimension). The Portuguese version of the BMMRS is a reliable and valid instrument to assess multiple R/S dimensions in clinical and non-clinical samples.

  2. Validation of a tool for assessing the quality of pharmaceutical services

    Directory of Open Access Journals (Sweden)

    Cosendey Marly Aparecida E.

    2003-01-01

    Full Text Available This paper presents the validation process for a tool assessing basic pharmaceutical services through an analysis of the implementation of a Basic Pharmaceuticals Distribution Program by the Brazilian Federal government. The process began with the drafting of a theoretical model, based on a state-of-the-art review and allowing the selection of various conceptual dimensions and respective criteria that best represented the construct. The second step involved weighting indicators for the construction of quality scores. Three models were tested for ranking implementation levels, and seven simulations were conducted, determining the score most closely reflecting the selected indicators in two different matrices. The objective was to select the most coherent and consistent version between implementation levels and expected outcomes, while simultaneously enhancing validity of chosen criteria. Testing of the various models and the results obtained showed that augmenting the validity of the study was possible without altering data. This endeavor is justified in understanding the scope and limitations of these measurements and of the choices involved in issues concerning their weighting and interpretation.

  3. Development and validation of an instrument to assess future orientation and resilience in adolescence.

    Science.gov (United States)

    Di Maggio, Ilaria; Ginevra, Maria Cristina; Nota, Laura; Soresi, Salvatore

    2016-08-01

    The study is aimed at providing the development and initial validation of the Design My Future (DMF), which may be administered in career counseling and research activities to assess adolescents' future orientation and resilience. Two studies with two independent samples of Italian adolescents were conducted to examine psychometric requisites of DMF. Specifically, in the first study, after developing items and examined the content validity, the factorial structure, reliability and discriminant validity of the DMF were tested. In the second study, the measurement invariance across gender, conducing a sequence of nested CFA models, was evaluated. Results showed good psychometric support for the instrument with Italian adolescents. Copyright © 2016 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.

  4. Validation of an instrument to assess evidence-based practice knowledge, attitudes, access, and confidence in the dental environment.

    Science.gov (United States)

    Hendricson, William D; Rugh, John D; Hatch, John P; Stark, Debra L; Deahl, Thomas; Wallmann, Elizabeth R

    2011-02-01

    This article reports the validation of an assessment instrument designed to measure the outcomes of training in evidence-based practice (EBP) in the context of dentistry. Four EBP dimensions are measured by this instrument: 1) understanding of EBP concepts, 2) attitudes about EBP, 3) evidence-accessing methods, and 4) confidence in critical appraisal. The instrument-the Knowledge, Attitudes, Access, and Confidence Evaluation (KACE)-has four scales, with a total of thirty-five items: EBP knowledge (ten items), EBP attitudes (ten), accessing evidence (nine), and confidence (six). Four elements of validity were assessed: consistency of items within the KACE scales (extent to which items within a scale measure the same dimension), discrimination (capacity to detect differences between individuals with different training or experience), responsiveness (capacity to detect the effects of education on trainees), and test-retest reliability. Internal consistency of scales was assessed by analyzing responses of second-year dental students, dental residents, and dental faculty members using Cronbach coefficient alpha, a statistical measure of reliability. Discriminative validity was assessed by comparing KACE scores for the three groups. Responsiveness was assessed by comparing pre- and post-training responses for dental students and residents. To measure test-retest reliability, the full KACE was completed twice by a class of freshman dental students seventeen days apart, and the knowledge scale was completed twice by sixteen faculty members fourteen days apart. Item-to-scale consistency ranged from 0.21 to 0.78 for knowledge, 0.57 to 0.83 for attitude, 0.70 to 0.84 for accessing evidence, and 0.87 to 0.94 for confidence. For discrimination, ANOVA and post hoc testing by the Tukey-Kramer method revealed significant score differences among students, residents, and faculty members consistent with education and experience levels. For responsiveness to training, dental students

  5. Validation of a Novel 3-Dimensional Sonographic Method for Assessing Gastric Accommodation in Healthy Adults

    NARCIS (Netherlands)

    Buisman, Wijnand J; van Herwaarden-Lindeboom, MYA; Mauritz, Femke A; El Ouamari, Mourad; Hausken, Trygve; Olafsdottir, Edda J; van der Zee, David C; Gilja, Odd Helge

    OBJECTIVES: A novel automated 3-dimensional (3D) sonographic method has been developed for measuring gastric volumes. This study aimed to validate and assess the reliability of this novel 3D sonographic method compared to the reference standard in 3D gastric sonography: freehand magneto-based 3D

  6. Reliability and validity of the Repeatable Battery for the Assessment of Neuropsychological Status in community-dwelling elderly

    Science.gov (United States)

    Cheng, Yan; Wu, Wenyuan; Wang, Jiaqi; Feng, Wei; Li, Chunbo

    2011-01-01

    Introduction The Repeatable Battery for the Assessment of Neuropsychological Status (RBANS) is a widely used screening instrument in neuropsychological assessment and is a brief, individually administered measure. The present study aims to assess the reliability and validity of the Chinese version of the RBANS in community-dwelling elderly. Material and methods All subjects come from the community-dwelling elderly in Shanghai, China. They completed a questionnaire concerning demographic information, the mini-mental state examination (MMSE) and the Chinese version of the RBANS. To test for internal consistency, Cronbach's α was calculated for all six RBANS indices. Correlations between each of the RBANS and MMSE subtests were conducted to measure the concurrent validity. A confirmatory factor analysis (CFA) was conducted to test the construct validity. Results The final sample of participants included 236 community-dwelling elderly. The mean total score on the RBANS was 86.02 (±14.19). The RBANS total score showed strong internal consistency (r = 0.806), and the coefficient α value for each of the RBANS scales ranged from 0.142 to 0.727. The total RBANS score was highly correlated with that of the MMSE (r = 0.594, pvalidity in a community-dwelling elderly sample. It may be a useful screening instrument for conducting cognitive assessments in community-dwelling elderly. PMID:22291831

  7. Measurement of Dietary Restraint: Validity Tests of Four Questionnaires

    Science.gov (United States)

    Williamson, Donald A.; Martin, Corby K.; York-Crowe, Emily; Anton, Stephen D.; Redman, Leanne M.; Han, Hongmei; Ravussin, Eric

    2007-01-01

    This study tested the validity of four measures of dietary restraint: Dutch Eating Behavior Questionnaire, Eating Inventory (EI), Revised Restraint Scale (RS), and the Current Dieting Questionnaire. Dietary restraint has been implicated as a determinant of overeating and binge eating. Conflicting findings have been attributed to different methods for measuring dietary restraint. The validity of four self-report measures of dietary restraint and dieting behavior was tested using: 1) factor analysis, 2) changes in dietary restraint in a randomized controlled trial of different methods to achieve calorie restriction, and 3) correlation of changes in dietary restraint with an objective measure of energy balance, calculated from the changes in fat mass and fat-free mass over a six-month dietary intervention. Scores from all four questionnaires, measured at baseline, formed a dietary restraint factor, but the RS also loaded on a binge eating factor. Based on change scores, the EI Restraint scale was the only measure that correlated significantly with energy balance expressed as a percentage of energy require d for weight maintenance. These findings suggest that that, of the four questionnaires tested, the EI Restraint scale was the most valid measure of the intent to diet and actual caloric restriction. PMID:17101191

  8. Self-administered structured food record for measuring individual energy and nutrient intake in large cohorts: Design and validation.

    Science.gov (United States)

    García, Silvia M; González, Claudio; Rucci, Enzo; Ambrosino, Cintia; Vidal, Julia; Fantuzzi, Gabriel; Prestes, Mariana; Kronsbein, Peter

    2018-06-05

    Several instruments developed to assess dietary intake of groups or populations have strengths and weaknesses that affect their specific application. No self-administered, closed-ended dietary survey was previously used in Argentina to assess current food and nutrient intake on a daily basis. To design and validate a self-administered, structured food record (NutriQuid, NQ) representative of the adult Argentine population's food consumption pattern to measure individual energy and nutrient intake. Records were loaded onto a database using software that checks a regional nutrition information system (SARA program), automatically quantifying energy and nutrient intake. NQ validation included two phases: (1) NQ construct validity comparing records kept simultaneously by healthy volunteers (45-75 years) and a nutritionist who provided meals (reference), and (2) verification of whether NQ reflected target population consumption (calories and nutrients), week consumption differences, respondent acceptability, and ease of data entry/analysis. Data analysis included descriptive statistics, repeated measures ANOVA, intraclass correlation coefficient, nonparametric regression, and cross-classification into quintiles. The first validation (study group vs. reference) showed an underestimation (10%) of carbohydrate, fat, and energy intake. Second validation: 109 volunteers (91% response) completed the NQ for seven consecutive days. Record completion took about 9min/day, and data entry 3-6min. Mean calorie intake was 2240±119kcal/day (42% carbohydrates, 17% protein, and 41% fat). Intake significantly increased in the weekend. NQ is a simple and efficient tool to assess dietary intake in large samples. Copyright © 2018 SEEN y SED. Publicado por Elsevier España, S.L.U. All rights reserved.

  9. Reliability and Validity of Three Instruments (DSM-IV, CPGI, and PPGM) in the Assessment of Problem Gambling in South Korea.

    Science.gov (United States)

    Back, Ki-Joon; Williams, Robert J; Lee, Choong-Ki

    2015-09-01

    Most research on the assessment, epidemiology, and treatment of problem gambling has occurred in Western jurisdictions. This potentially limits the cross-cultural validity of problem gambling assessment instruments as well as etiological models of problem gambling. The primary objective of the present research was to investigate the reliability and validity of three problem gambling assessment instruments within a South Korean context. A total of 4,330 South Korean adults participated in a comprehensive assessment of their gambling behavior that included the administration of the DSM-IV criteria for pathological gambling (NODS), the Canadian Problem Gambling Index (CPGI), and the Problem and Pathological Gambling Measure (PPGM). Cronbach alpha showed that all three instruments had good internal consistency. Concurrent validity was established by the significant associations observed between scores on the instruments and measures of gambling involvement (number of gambling formats engaged in; frequency of gambling; and gambling expenditure). Most importantly, kappa statistics showed that all instruments have satisfactory classification accuracy against clinical assessment of problem gambling conducted by South Korean clinicians (NODS κ = .66; PPGM κ = .62; CPGI κ = .51). These results confirm that Western-derived operationalizations of problem gambling have applicability in a South Korean setting.

  10. Validity and applicability of a video-based animated tool to assess mobility in elderly Latin American populations.

    Science.gov (United States)

    Guerra, Ricardo Oliveira; Oliveira, Bruna Silva; Alvarado, Beatriz Eugenia; Curcio, Carmen Lucia; Rejeski, W Jack; Marsh, Anthony P; Ip, Edward H; Barnard, Ryan T; Guralnik, Jack M; Zunzunegui, Maria Victoria

    2014-10-01

    To assess the reliability and the validity of Portuguese- and Spanish-translated versions of the video-based short-form Mobility Assessment Tool in assessing self-reported mobility, and to provide evidence for the applicability of these videos in elderly Latin American populations as a complement to physical performance measures. The sample consisted of 300 elderly participants (150 from Brazil, 150 from Colombia) recruited at neighborhood social centers. Mobility was assessed with the Mobility Assessment Tool, and compared with the Short Physical Performance Battery score and self-reported functional limitations. Reliability was calculated using intraclass correlation coefficients. Multiple linear regression analyses were used to assess associations among mobility assessment tools and health, and sociodemographic variables. A significant gradient of increasing Mobility Assessment Tool score with better physical function was observed for both self-reported and objective measures, and in each city. Associations between self-reported mobility and health were strong, and significant. Mobility Assessment Tool scores were lower in women at both sites. Intraclass correlation coefficients of the Mobility Assessment Tool were 0.94 (95% confidence interval 0.90-0.97) in Brazil and 0.81 (95% confidence interval 0.66-0.91) in Colombia. Mobility Assessment Tool scores were lower in Manizales than in Natal after adjustment by Short Physical Performance Battery, self-rated health and sex. These results provide evidence for high reliability and good validity of the Mobility Assessment Tool in its Spanish and Portuguese versions used in Latin American populations. In addition, the Mobility Assessment Tool can detect mobility differences related to environmental features that cannot be captured by objective performance measures. © 2013 Japan Geriatrics Society.

  11. Validation of a digital photographic method for assessment of dietary quality of school lunch sandwiches brought from home

    DEFF Research Database (Denmark)

    Sabinsky, Marianne; Toft, Ulla; Andersen, Klaus K

    2013-01-01

    Background: It is a challenge to assess children’s dietary intake. The digital photographic method (DPM) may be an objective method that can overcome some of these challenges. Objective: The aim of this study was to evaluate the validity and reliability of a DPM to assess the quality of dietary....... The lunches were photographed using a standardised DPM. From the digital images, the dietary components were estimated by a trained image analyst using weights or household measures and the dietary quality was assessed using a validated Meal Index of Dietary Quality (Meal IQ). The dietary components...... and the Meal IQ obtained from the digital images were validated against the objective weighed foods of the school lunch sandwiches. To determine interrater reliability, the digital images were evaluated by a second image analyst. Results: Correlation coefficients between the DPM and the weighed foods ranged...

  12. Development and validation of a new assessment tool for suturing skills in medical students.

    Science.gov (United States)

    Sundhagen, Henriette Pisani; Almeland, Stian Kreken; Hansson, Emma

    2018-01-01

    In recent years, emphasis has been put on that medical student should demonstrate pre-practice/pre-registration core procedural skills to ensure patient safety. Nonetheless, the formal teaching and training of basic suturing skills to medical students have received relatively little attention and there is no standard for what should be tested and how. The aim of this study was to develop and validate, using scientific methods, a tool for assessment of medical students' suturing skills, measuring both micro- and macrosurgical qualities. A tool was constructed and content, construct, concurrent validity, and inter-rater, inter-item, inter-test reliability were tested. Three groups were included: students with no training in suturing skills, students who have had training, plastic surgery. The results show promising reliability and validity when assessing novice medical students' suturing skills. Further studies are needed on implementation of the instrument. Moreover, how the instrument can be used to give formative feedback, evaluate if a required standard is met and for curriculum development needs further investigation.Level of Evidence: Not ratable.

  13. Validation of the PedsQL Epilepsy Module: A pediatric epilepsy-specific health-related quality of life measure.

    Science.gov (United States)

    Modi, Avani C; Junger, Katherine F; Mara, Constance A; Kellermann, Tanja; Barrett, Lauren; Wagner, Janelle; Mucci, Grace A; Bailey, Laurie; Almane, Dace; Guilfoyle, Shanna M; Urso, Lauryn; Hater, Brooke; Hustzi, Heather; Smith, Gigi; Herrmann, Bruce; Perry, M Scott; Zupanc, Mary; Varni, James W

    2017-11-01

    To validate a brief and reliable epilepsy-specific, health-related quality of life (HRQOL) measure in children with various seizure types, treatments, and demographic characteristics. This national validation study was conducted across five epilepsy centers in the United States. Youth 5-18 years and caregivers of youth 2-18 years diagnosed with epilepsy completed the PedsQL Epilepsy Module and additional questionnaires to establish reliability and validity of the epilepsy-specific HRQOL instrument. Demographic and medical data were collected through chart reviews. Factor analysis was conducted, and internal consistency (Cronbach's alphas), test-retest reliability, and construct validity were assessed. Questionnaires were analyzed from 430 children with epilepsy (M age = 9.9 years; range 2-18 years; 46% female; 62% white: non-Hispanic; 76% monotherapy, 54% active seizures) and their caregivers. The final PedsQL Epilepsy Module is a 29-item measure with five subscales (i.e., Impact, Cognitive, Sleep, Executive Functioning, and Mood/Behavior) with parallel child and caregiver reports. Internal consistency coefficients ranged from 0.70-0.94. Construct validity and convergence was demonstrated in several ways, including strong relationships with seizure outcomes, antiepileptic drug (AED) side effects, and well-established measures of executive, cognitive, and emotional/behavioral functioning. The PedsQL Epilepsy Module is a reliable measure of HRQOL with strong evidence of its validity across the epilepsy spectrum in both clinical and research settings. Wiley Periodicals, Inc. © 2017 International League Against Epilepsy.

  14. Assessing Psychological Insulin Resistance in Type 2 Diabetes: a Critical Comparison of Measures.

    Science.gov (United States)

    Holmes-Truscott, E; Pouwer, F; Speight, J

    2017-07-01

    This study aims to examine the operationalisation of 'psychological insulin resistance' (PIR) among people with type 2 diabetes and to identify and critique relevant measures. PIR has been operationalised as (1) the assessment of attitudes or beliefs about insulin therapy and (2) hypothetical or actual resistance, or unwillingness, to use to insulin. Five validated PIR questionnaires were identified. None was fully comprehensive of all aspects of PIR, and the rigour and reporting of questionnaire development and psychometric validation varied considerably between measures. Assessment of PIR should focus on the identification of negative and positive attitudes towards insulin use. Actual or hypothetical insulin refusal may be better conceptualised as a potential consequence of PIR, as its assessment overlooks the attitudes that may prevent insulin use. This paper provides guidance on the selection of questionnaires for clinical or research purpose and the development of new, or improvement of existing, questionnaires.

  15. The Child Dental Control Assessment (CDCA) in youth: reliability, validity and cross-cultural differences.

    Science.gov (United States)

    Coolidge, T; Heima, M; Heaton, L J; Nakai, Y; Höskuldsson, O; Smith, T A; Weinstein, P; Milgrom, P

    2005-03-01

    The Child Dental Control Assessment (CDCA) measures children's preferred control strategies in the dental situation. Three studies are reported, assessing aspects of this instrument in youths from the USA, Japan and Australia. In particular, measurements were made as to the reliability and validity of this instrument in this age group in the three cultures, as well as comparing some results across cultures. These studies used a questionnaire design. Questionnaires (including the CDCA and other measures) were given to youths aged 11-15 in the three cultures. In one culture, youths received the questionnaire twice, to compute test-retest reliability. The measure's reliability and validity were similar to those of other measures. The CDCA behaves similarly to the Revised Iowa Dental Control Index (R-IDCI). Youths in all three cultures showed similar responses, although the Japanese were less likely to endorse items. Internal reliability of the scale ranged from 0.74 to 0.85. Test- retest reliability was 0.74. Participants in the High Desire/Low Predicted classification on the R-IDCI scored higher on the CDCA (t (73) = 2.9, p < .01). In the Japanese and Australian samples the correlation between CDCA and dental fear was 0.29-0.33 (p < .001). The Australian and USA samples scored significantly higher than the Japanese sample (overall F(2,1544) = 383.98, p < .001, followed by Tukey's HSD, p < .001). These results provide evidence for the reliability and validity of the CDCA in youth. It appears to measure the discrepancy between Desired and Predicted Control identified in the Revised Iowa Dental Control Index (R-IDCI). Responses of the youth in all three cultures were similar, indicating common dental control preferences for individuals of this age. However, consistent with cultural values, Japanese youth were less likely to endorse the control strategies. These results underline the need to develop culturally-specific, as well as situationally-specific control measures.

  16. Validation of a global assessment of arthroscopic skills in a cadaveric knee model.

    Science.gov (United States)

    Slade Shantz, Jesse A; Leiter, Jeff R; Collins, John B; MacDonald, Peter B

    2013-01-01

    The purpose of this study was to determine whether a global assessment of arthroscopic skills was valid for blinded assessment of cadaveric diagnostic knee arthroscopy. A global skills assessment for arthroscopy was created using a published theory of the development of expertise. Faculty surgeons, fellows, and residents were consented and enrolled in this institutional review board-approved validation study. All participants were oriented to the equipment and procedures for diagnostic arthroscopy of the knee. After reviewing the anatomic structures to be visualized, participants were allowed 10 minutes to complete a diagnostic arthroscopy of the knee. The hands and arthroscopic view were recorded during this attempt. Resident participants completed a second filmed diagnostic arthroscopy 1 week after the initial attempt. Five blinded reviewers watched the synchronized videos and assessed arthroscopic skills with a procedure-specific checklist and the newly developed global skills assessment. The agreement between reviewers was determined by intraclass correlation coefficient. Internal consistency was determined with Cronbach's α. Test-retest reliability was measured by correlating repeated arthroscopies by residents. The ability of the global assessment to discriminate skill levels was determined with between-group Mann-Whitney U tests. The agreement between global assessment scores was strong (I.C.C. = 0.80, 95% C.I. 0.68-0.92). The internal consistency of evaluations was excellent (Cronbach's α = 0.97), and the test-retest reliability was strong (r = 0.52). The global assessment score was shown to be able to discriminate between skill levels by an analysis of variance indicating the difference in means among the various levels of training (P Assessment of Arthroscopic Skills is a useful adjunct to arthroscopic educators and learners and could be used for in-training evaluations. The Objective Assessment of Arthroscopic Skills is an instrument that can be

  17. Validity and reliability of using photography for measuring knee range of motion: a methodological study

    Directory of Open Access Journals (Sweden)

    Adie Sam

    2011-04-01

    Full Text Available Abstract Background The clinimetric properties of knee goniometry are essential to appreciate in light of its extensive use in the orthopaedic and rehabilitative communities. Intra-observer reliability is thought to be satisfactory, but the validity and inter-rater reliability of knee goniometry often demonstrate unacceptable levels of variation. This study tests the validity and reliability of measuring knee range of motion using goniometry and photographic records. Methods Design: Methodology study assessing the validity and reliability of one method ('Marker Method' which uses a skin marker over the greater trochanter and another method ('Line of Femur Method' which requires estimation of the line of femur. Setting: Radiology and orthopaedic departments of two teaching hospitals. Participants: 31 volunteers (13 arthritic and 18 healthy subjects. Knee range of motion was measured radiographically and photographically using a goniometer. Three assessors were assessed for reliability and validity. Main outcomes: Agreement between methods and within raters was assessed using concordance correlation coefficient (CCCs. Agreement between raters was assessed using intra-class correlation coefficients (ICCs. 95% limits of agreement for the mean difference for all paired comparisons were computed. Results Validity (referenced to radiographs: Each method for all 3 raters yielded very high CCCs for flexion (0.975 to 0.988, and moderate to substantial CCCs for extension angles (0.478 to 0.678. The mean differences and 95% limits of agreement were narrower for flexion than they were for extension. Intra-rater reliability: For flexion and extension, very high CCCs were attained for all 3 raters for both methods with slightly greater CCCs seen for flexion (CCCs varied from 0.981 to 0.998. Inter-rater reliability: For both methods, very high ICCs (min to max: 0.891 to 0.995 were obtained for flexion and extension. Slightly higher coefficients were obtained

  18. Assessment of sedentary behaviors and transport-related activities by questionnaire: a validation study

    Directory of Open Access Journals (Sweden)

    Keitly Mensah

    2016-08-01

    Full Text Available Abstract Background Comprehensive assessment of sedentary behavior (SB and physical activity (PA, including transport-related activities (TRA, is required to design innovative PA promotion strategies. There are few validated instruments that simultaneously assess the different components of human movement according to their context of practice (e.g. work, transport, leisure. We examined test-retest reliability and validity of the Sedentary, Transportation and Activity Questionnaire (STAQ, a newly developed questionnaire dedicated to assessing context-specific SB, TRA and PA. Methods Ninety six subjects (51 women kept a contextualized activity-logbook and wore a hip accelerometer (Actigraph GT3X + TM for a 7-day or 14-day period, at the end of which they completed the STAQ. Activity-energy expenditure was measured in a subgroup of 45 subjects using the double labeled water (DLW method. Test-retest reliability was assessed using intra-class-coefficients (ICC in a subgroup of 32 subjects who filled the questionnaire twice one month apart. Accelerometry was annotated using the logbook to obtain total and context-specific objective estimates of SB. Spearman correlations, Bland-Altman plots and ICC were used to analyze validity with logbook, accelerometry and DLW data validity criteria. Results Test-retest reliability was fair for total sitting time (ICC = 0.52, good to excellent for work sitting time (ICC = 0.71, transport-related walking (ICC = 0.61 and car use (ICC = 0.67, and leisure screen-related SB (ICC = 0.64-0.79, but poor for total sitting time during leisure and transport-related contexts. For validity, compared to accelerometry, significant correlations were found for STAQ estimates of total (r = 0.54 and context-specific sitting times with stronger correlations for work sitting time (r = 0.88, and screen times (TV/DVD viewing: r = 0.46; other screens: r = 0.42 than for transport (r = 0.35 or

  19. Assessment of sedentary behaviors and transport-related activities by questionnaire: a validation study.

    Science.gov (United States)

    Mensah, Keitly; Maire, Aurélia; Oppert, Jean-Michel; Dugas, Julien; Charreire, Hélène; Weber, Christiane; Simon, Chantal; Nazare, Julie-Anne

    2016-08-09

    Comprehensive assessment of sedentary behavior (SB) and physical activity (PA), including transport-related activities (TRA), is required to design innovative PA promotion strategies. There are few validated instruments that simultaneously assess the different components of human movement according to their context of practice (e.g. work, transport, leisure). We examined test-retest reliability and validity of the Sedentary, Transportation and Activity Questionnaire (STAQ), a newly developed questionnaire dedicated to assessing context-specific SB, TRA and PA. Ninety six subjects (51 women) kept a contextualized activity-logbook and wore a hip accelerometer (Actigraph GT3X + (TM)) for a 7-day or 14-day period, at the end of which they completed the STAQ. Activity-energy expenditure was measured in a subgroup of 45 subjects using the double labeled water (DLW) method. Test-retest reliability was assessed using intra-class-coefficients (ICC) in a subgroup of 32 subjects who filled the questionnaire twice one month apart. Accelerometry was annotated using the logbook to obtain total and context-specific objective estimates of SB. Spearman correlations, Bland-Altman plots and ICC were used to analyze validity with logbook, accelerometry and DLW data validity criteria. Test-retest reliability was fair for total sitting time (ICC = 0.52), good to excellent for work sitting time (ICC = 0.71), transport-related walking (ICC = 0.61) and car use (ICC = 0.67), and leisure screen-related SB (ICC = 0.64-0.79), but poor for total sitting time during leisure and transport-related contexts. For validity, compared to accelerometry, significant correlations were found for STAQ estimates of total (r = 0.54) and context-specific sitting times with stronger correlations for work sitting time (r = 0.88), and screen times (TV/DVD viewing: r = 0.46; other screens: r = 0.42) than for transport (r = 0.35) or leisure-related sitting-times (r

  20. Validation of a clinical assessment tool for spinal anaesthesia.

    LENUS (Irish Health Repository)

    Breen, D

    2011-07-01

    There is a need for a procedure-specific means of assessment of clinical performance in anaesthesia. The aim of this study was to devise a tool for assessing the performance of spinal anaesthesia, which has both content and construct validity.

  1. Validation of an early childhood caries risk assessment tool in a low-income Hispanic population.

    Science.gov (United States)

    Custodio-Lumsden, Christie L; Wolf, Randi L; Contento, Isobel R; Basch, Charles E; Zybert, Patricia A; Koch, Pamela A; Edelstein, Burton L

    2016-03-01

    There is a recognized need for valid risk assessment tools for use by both dental and nondental personnel to identify young children at risk for, or with, precavitated stages of early childhood caries (i.e., early stage decalcifications or white spot lesions).The aim of this study is to establish concurrent criterion validity of "MySmileBuddy" (MSB), a novel technology-assisted ECC risk assessment and behavioral intervention tool against four measures of ECC activity: semi-quantitative assays of salivary mutans streptococci levels, visible quantity of dental plaque, visual evidence of enamel decalcifications, and cavitation status (none, ECC, severe ECC). One hundred eight children 2-6 years of age presenting to a pediatric dental clinic were recruited from a predominantly Spanish-speaking, low-income, urban population. All children received a comprehensive oral examination and saliva culture for assessment of ECC indicators. Their caregivers completed the iPad-based MSB assessment in its entirety (15-20 minutes). MSB calculated both diet and comprehensive ECC risk scores. Associations between all variables were determined using ordinal logistic regression. MSB diet risk scores were significantly positively associated with salivary mutans (P valid risk assessment tool for identifying children with early precursors of cavitations but does not add value in identifying children with extant lesions. © 2015 American Association of Public Health Dentistry.

  2. Construct validity of adolescents' self-reported big five personality traits: importance of conceptual breadth and initial validation of a short measure.

    Science.gov (United States)

    Morizot, Julien

    2014-10-01

    While there are a number of short personality trait measures that have been validated for use with adults, few are specifically validated for use with adolescents. To trust such measures, it must be demonstrated that they have adequate construct validity. According to the view of construct validity as a unifying form of validity requiring the integration of different complementary sources of information, this article reports the evaluation of content, factor, convergent, and criterion validities as well as reliability of adolescents' self-reported personality traits. Moreover, this study sought to address an inherent potential limitation of short personality trait measures, namely their limited conceptual breadth. In this study, starting with items from a known measure, after the language-level was adjusted for use with adolescents, items tapping fundamental primary traits were added to determine the impact of added conceptual breadth on the psychometric properties of the scales. The resulting new measure was named the Big Five Personality Trait Short Questionnaire (BFPTSQ). A group of expert judges considered the items to have adequate content validity. Using data from a community sample of early adolescents, the results confirmed the factor validity of the Big Five structure in adolescence as well as its measurement invariance across genders. More important, the added items did improve the convergent and criterion validities of the scales, but did not negatively affect their reliability. This study supports the construct validity of adolescents' self-reported personality traits and points to the importance of conceptual breadth in short personality measures. © The Author(s) 2014.

  3. The Premature Ejaculation Profile: validation of self-reported outcome measures for research and practice.

    Science.gov (United States)

    Patrick, Donald L; Giuliano, François; Ho, Kai Fai; Gagnon, Dennis D; McNulty, Pauline; Rothman, Margaret

    2009-02-01

    To evaluate the reliability and validity of the Premature Ejaculation Profile (PEP), a self-reported outcome instrument for evaluating domains of PE and its treatment, comprised of four single-item measures, a profile, and an index score. Data were from men participating in observational studies in the USA (PE, 207 men; non-PE, 1380) and Europe (PE, 201; non-PE, 914) and from men with PE (1238) participating in a phase III randomized, placebo-controlled clinical trial of dapoxetine. The PEP contains four measures: perceived control over ejaculation, personal distress related to ejaculation, satisfaction with sexual intercourse, and interpersonal difficulty related to ejaculation, each assessed on five-point response scales. Test-retest reliability, known-groups validity, and ability to detect a patient-reported global impression of change (PGI) in condition were evaluated for the individual PEP measures and a PEP index score (the mean of all four measures). Profile analysis was conducted using multivariate analysis of variance. All PEP measures showed acceptable reliability (intraclass correlation coefficients ranged from 0.66 to 0.83) and mean scores for all measures differed significantly between PE and non-PE groups (P measures. The PEP profiles of men with and without PE differed significantly (P measure for use in monitoring outcomes of men with PE.

  4. Application of validity theory and methodology to patient-reported outcome measures (PROMs): building an argument for validity.

    Science.gov (United States)

    Hawkins, Melanie; Elsworth, Gerald R; Osborne, Richard H

    2018-07-01

    Data from subjective patient-reported outcome measures (PROMs) are now being used in the health sector to make or support decisions about individuals, groups and populations. Contemporary validity theorists define validity not as a statistical property of the test but as the extent to which empirical evidence supports the interpretation of test scores for an intended use. However, validity testing theory and methodology are rarely evident in the PROM validation literature. Application of this theory and methodology would provide structure for comprehensive validation planning to support improved PROM development and sound arguments for the validity of PROM score interpretation and use in each new context. This paper proposes the application of contemporary validity theory and methodology to PROM validity testing. The validity testing principles will be applied to a hypothetical case study with a focus on the interpretation and use of scores from a translated PROM that measures health literacy (the Health Literacy Questionnaire or HLQ). Although robust psychometric properties of a PROM are a pre-condition to its use, a PROM's validity lies in the sound argument that a network of empirical evidence supports the intended interpretation and use of PROM scores for decision making in a particular context. The health sector is yet to apply contemporary theory and methodology to PROM development and validation. The theoretical and methodological processes in this paper are offered as an advancement of the theory and practice of PROM validity testing in the health sector.

  5. Validation of an instrument to assess toddler feeding practices of Latino mothers.

    Science.gov (United States)

    Chaidez, Virginia; Kaiser, Lucia L

    2011-08-01

    This paper describes qualitative and quantitative aspects of testing a 34-item Toddler-Feeding Questionnaire (TFQ), designed for use in Latino families, and the associations between feeding practices and toddler dietary outcomes. Qualitative methods included review by an expert panel for content validity and cognitive testing of the tool to assess face validity. Quantitative analyses included use of exploratory factor analysis for construct validity; Pearson's correlations for test-retest reliability; Cronbach's alpha (α) for internal reliability; and multivariate regression for investigating relationships between feeding practices and toddler diet and anthropometry. Interviews were conducted using a convenience sample of 94 Latino mother and toddler dyads obtained largely through the Supplemental Nutrition Program for Women, Infants and Children (WIC). Data collection included household characteristics, self-reported early-infant feeding practices, the toddler's dietary intake, and anthropometric measurements. Factor analysis suggests the TFQ contains three subscales: indulgent; authoritative; and environmental influences. The TFQ demonstrated acceptable reliability for most measures. As hypothesized, indulgent practices in Latino toddlers were associated with increased energy consumption and higher intakes of total fat, saturated fat, and sweetened beverages. This tool may be useful in future research exploring the relationship of toddler feeding practices to nutritional outcomes in Latino families. Copyright © 2011 Elsevier Ltd. All rights reserved.

  6. The Work-Family Conflict Scale (WAFCS): development and initial validation of a self-report measure of work-family conflict for use with parents.

    Science.gov (United States)

    Haslam, Divna; Filus, Ania; Morawska, Alina; Sanders, Matthew R; Fletcher, Renee

    2015-06-01

    This paper outlines the development and validation of the Work-Family Conflict Scale (WAFCS) designed to measure work-to-family conflict (WFC) and family-to-work conflict (FWC) for use with parents of young children. An expert informant and consumer feedback approach was utilised to develop and refine 20 items, which were subjected to a rigorous validation process using two separate samples of parents of 2-12 year old children (n = 305 and n = 264). As a result of statistical analyses several items were dropped resulting in a brief 10-item scale comprising two subscales assessing theoretically distinct but related constructs: FWC (five items) and WFC (five items). Analyses revealed both subscales have good internal consistency, construct validity as well as concurrent and predictive validity. The results indicate the WAFCS is a promising brief measure for the assessment of work-family conflict in parents. Benefits of the measure as well as potential uses are discussed.

  7. Latency-Based and Psychophysiological Measures of Sexual Interest Show Convergent and Concurrent Validity.

    Science.gov (United States)

    Ó Ciardha, Caoilte; Attard-Johnson, Janice; Bindemann, Markus

    2018-04-01

    Latency-based measures of sexual interest require additional evidence of validity, as do newer pupil dilation approaches. A total of 102 community men completed six latency-based measures of sexual interest. Pupillary responses were recorded during three of these tasks and in an additional task where no participant response was required. For adult stimuli, there was a high degree of intercorrelation between measures, suggesting that tasks may be measuring the same underlying construct (convergent validity). In addition to being correlated with one another, measures also predicted participants' self-reported sexual interest, demonstrating concurrent validity (i.e., the ability of a task to predict a more validated, simultaneously recorded, measure). Latency-based and pupillometric approaches also showed preliminary evidence of concurrent validity in predicting both self-reported interest in child molestation and viewing pornographic material containing children. Taken together, the study findings build on the evidence base for the validity of latency-based and pupillometric measures of sexual interest.

  8. Towards Validating Risk Indicators Based on Measurement Theory (Extended version)

    NARCIS (Netherlands)

    Morali, A.; Wieringa, Roelf J.

    Due to the lack of quantitative information and for cost-efficiency, most risk assessment methods use partially ordered values (e.g. high, medium, low) as risk indicators. In practice it is common to validate risk indicators by asking stakeholders whether they make sense. This way of validation is

  9. Validation of the self-assessment teamwork tool (SATT) in a cohort of nursing and medical students.

    Science.gov (United States)

    Roper, Lucinda; Shulruf, Boaz; Jorm, Christine; Currie, Jane; Gordon, Christopher J

    2018-02-09

    Poor teamwork has been implicated in medical error and teamwork training has been shown to improve patient care. Simulation is an effective educational method for teamwork training. Post-simulation reflection aims to promote learning and we have previously developed a self-assessment teamwork tool (SATT) for health students to measure teamwork performance. This study aimed to evaluate the psychometric properties of a revised self-assessment teamwork tool. The tool was tested in 257 medical and nursing students after their participation in one of several mass casualty simulations. Using exploratory and confirmatory factor analysis, the revised self-assessment teamwork tool was shown to have strong construct validity, high reliability, and the construct demonstrated invariance across groups (Medicine & Nursing). The modified SATT was shown to be a reliable and valid student self-assessment tool. The SATT is a quick and practical method of guiding students' reflection on important teamwork skills.

  10. The Online Social Support Scale: Measure development and validation.

    Science.gov (United States)

    Nick, Elizabeth A; Cole, David A; Cho, Sun-Joo; Smith, Darcy K; Carter, T Grace; Zelkowitz, Rachel L

    2018-05-21

    A new measure, the Online Social Support Scale, was developed based on previous theory, research, and measurement of in-person social support. It includes four subscales: Esteem/Emotional Support, Social Companionship, Informational Support, and Instrumental Support. In college and community samples, factor analytic and item response theory results suggest that subtypes of in-person social support also pertain in the online world. Evidence of reliability, convergent validity, and discriminant validity provide excellent psychometric support for the measure. Construct validity accrues to the measure vis-à-vis support for three hypotheses: (a) Various broad types of Internet platforms for social interactions are differentially associated with online social support and online victimization; (b) similar to in-person social support, online social support offsets the adverse effect of negative life events on self-esteem and depression-related outcome; and (c) online social support counteracts the effects of online victimization in much the same way that in-person friends in one social niche counterbalance rejection in other social niches. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  11. The Malay version of the Early Childhood Oral Health Impact Scale (Malay-ECOHIS)--assessing validity and reliability.

    Science.gov (United States)

    Hashim, Azlina N; Yusof, Zamros Y M; Esa, Rashidah

    2015-11-25

    The Early Childhood Oral Health Impact Scale (ECOHIS) is used to assess oral impacts on the quality of life of preschool aged children and their families. The objective of this study was to perform a cross-cultural adaptation of the ECOHIS into Malay and assess its psychometric properties. The cross-cultural adaptation of ECOHIS into Malay comprised of translating the ECOHIS into the Malay language (Malay-ECOHIS) by experts followed by face validation of the Malay-ECOHIS by a group of mothers. The Malay-ECOHIS was back translated into English and this was compared with the original ECOHIS. Minor changes were made to the Malay-ECOHIS before it was finalised. The Malay-ECOHIS' psychometric properties were assessed in terms of construct, convergent and discriminant validity as well as internal and test-retest reliability based on two separate studies involving 127 parents of 4-6 year old preschool children followed by oral examinations of 860 preschool children from 25 kindergartens from two districts in Selangor state, Malaysia. Non-parametric statistics were used to assess the relationships between the Malay-ECOHIS and the subjective and clinical outcome measures. The Cronbach's alpha was 0.83 and the weighted Kappa was 0.95 (intraclass correlation = 0.94). The Malay-ECOHIS demonstrated significant associations with different subjective and normative measures, i.e. levels of oral health satisfaction, perceived oral health status, perceived oral health need, toothache experience, pattern of dental attendance, and caries status of preschool children. These significant associations supported its construct, convergent and discriminant validity as well as internal and test-retest reliability. This study showed that the Malay-ECOHIS is a valid and reliable instrument to assess the negative impacts of oral disorders/conditions on the quality of life of 4-6 year old preschool children and their families in Malaysia.

  12. Development and validation of an instrument to measure nurse educator perceived confidence in clinical teaching.

    Science.gov (United States)

    Nguyen, Van N B; Forbes, Helen; Mohebbi, Mohammadreza; Duke, Maxine

    2017-12-01

    Teaching nursing in clinical environments is considered complex and multi-faceted. Little is known about the role of the clinical nurse educator, specifically the challenges related to transition from clinician, or in some cases, from newly-graduated nurse to that of clinical nurse educator, as occurs in developing countries. Confidence in the clinical educator role has been associated with successful transition and the development of role competence. There is currently no valid and reliable instrument to measure clinical nurse educator confidence. This study was conducted to develop and psychometrically test an instrument to measure perceived confidence among clinical nurse educators. A multi-phase, multi-setting survey design was used. A total of 468 surveys were distributed, and 363 were returned. Data were analyzed using exploratory and confirmatory factor analyses. The instrument was successfully tested and modified in phase 1, and factorial validity was subsequently confirmed in phase 2. There was strong evidence of internal consistency, reliability, content, and convergent validity of the Clinical Nurse Educator Skill Acquisition Assessment instrument. The resulting instrument is applicable in similar contexts due to its rigorous development and validation process. © 2017 The Authors. Nursing & Health Sciences published by John Wiley & Sons Australia, Ltd.

  13. Development of a quality-assessment tool for experimental bruxism studies: reliability and validity.

    Science.gov (United States)

    Dawson, Andreas; Raphael, Karen G; Glaros, Alan; Axelsson, Susanna; Arima, Taro; Ernberg, Malin; Farella, Mauro; Lobbezoo, Frank; Manfredini, Daniele; Michelotti, Ambra; Svensson, Peter; List, Thomas

    2013-01-01

    To combine empirical evidence and expert opinion in a formal consensus method in order to develop a quality-assessment tool for experimental bruxism studies in systematic reviews. Tool development comprised five steps: (1) preliminary decisions, (2) item generation, (3) face-validity assessment, (4) reliability and discriminitive validity assessment, and (5) instrument refinement. The kappa value and phi-coefficient were calculated to assess inter-observer reliability and discriminative ability, respectively. Following preliminary decisions and a literature review, a list of 52 items to be considered for inclusion in the tool was compiled. Eleven experts were invited to join a Delphi panel and 10 accepted. Four Delphi rounds reduced the preliminary tool-Quality-Assessment Tool for Experimental Bruxism Studies (Qu-ATEBS)- to 8 items: study aim, study sample, control condition or group, study design, experimental bruxism task, statistics, interpretation of results, and conflict of interest statement. Consensus among the Delphi panelists yielded good face validity. Inter-observer reliability was acceptable (k = 0.77). Discriminative validity was excellent (phi coefficient 1.0; P reviews of experimental bruxism studies, exhibits face validity, excellent discriminative validity, and acceptable inter-observer reliability. Development of quality assessment tools for many other topics in the orofacial pain literature is needed and may follow the described procedure.

  14. Enhancing rigour in the validation of patient reported outcome measures (PROMs: bridging linguistic and psychometric testing

    Directory of Open Access Journals (Sweden)

    Roberts Gwerfyl

    2012-06-01

    Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115 and patients with depression (n = 37 concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96; item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94 and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002; and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.

  15. Measurement properties of questionnaires assessing participation in children and adolescents with a disability: a systematic review.

    Science.gov (United States)

    Rainey, Linda; van Nispen, Ruth; van der Zee, Carlijn; van Rens, Ger

    2014-12-01

    To critically appraise the measurement properties of questionnaires measuring participation in children and adolescents (0-18 years) with a disability. Bibliographic databases were searched for studies evaluating the measurement properties of self-report or parent-report questionnaires measuring participation in children and adolescents (0-18 years) with a disability. The methodological quality of the included studies and the results of the measurement properties were evaluated using a checklist developed on consensus-based standards. The search strategy identified 3,977 unique publications, of which 22 were selected; these articles evaluated the development and measurement properties of eight different questionnaires. The Child and Adolescent Scale of Participation was evaluated most extensively, generally showing moderate positive results on content validity, internal consistency, reliability and construct validity. The remaining questionnaires also demonstrated positive results. However, at least 50 % of the measurement properties per questionnaire were not (or only poorly) assessed. Studies of high methodological quality, using modern statistical methods, are needed to accurately assess the measurement properties of currently available questionnaires. Moreover, consensus is required on the definition of the construct 'participation' to determine content validity and to enable meaningful interpretation of outcomes.

  16. The validity of the variable "NICU admission" as an outcome measure for neonatal morbidity: a retrospective study

    NARCIS (Netherlands)

    Wiegerinck, Melanie M. J.; Danhof, Nora A.; van Kaam, Anton H.; Tamminga, Pieter; Mol, Ben Willem J.

    2014-01-01

    To determine whether "neonatal intensive care unit (NICU) admission" is a valid surrogate outcome measure to assess neonatal condition in clinical studies. Retrospective study. Tertiary hospital in the Netherlands. Neonates admitted to NICU during a 10-year period. Inclusion was restricted to

  17. Does assessing project work enhance the validity of qualifications? The case of GCSE coursework

    Directory of Open Access Journals (Sweden)

    Victoria Crisp

    2009-03-01

    Full Text Available This paper begins by describing current views on validity and how certain assessment forms, such as school-based project work, may enhance validity. It then touches on debates about the dependability of assessment by teachers. GCSEs and GCSE coursework are then described along with the reasons for the inclusion of coursework in many GCSEs. Crooks, Kane and Cohen’s (1996 chain model of eight linked stages of validity enquiry is then used as a structure within which to consider the validity of project work assessments, and specifically GCSE coursework assessment, drawing on the available literature. Strengths for validity include the ability to assess objectives that are difficult to test in written examinations, promoting additional skills such as critical thinking, creativity and independent thinking, and improving motivation. Possible threats to validity include the potential for internet and other types of plagiarism, tasks becoming overly structured and formulaic thus reducing the positive impact on learning, and the potentially heavy workload for teachers and students. The paper concludes by describing current policy changes in the UK with regard to GCSE coursework and relates this to strong and weak validity links for project work as a mode of assessment.

  18. Measuring data quality for ongoing improvement a data quality assessment framework

    CERN Document Server

    Sebastian-Coleman, Laura

    2013-01-01

    The Data Quality Assessment Framework shows you how to measure and monitor data quality, ensuring quality over time. You'll start with general concepts of measurement and work your way through a detailed framework of more than three dozen measurement types related to five objective dimensions of quality: completeness, timeliness, consistency, validity, and integrity. Ongoing measurement, rather than one time activities will help your organization reach a new level of data quality. This plain-language approach to measuring data can be understood by both business and IT and provides pra

  19. Development and validation of a measure of food choice values.

    Science.gov (United States)

    Lyerly, Jordan E; Reeve, Charlie L

    2015-06-01

    Food choice values (FCVs) are factors that individuals consider when deciding which foods to purchase and/or consume. Given the potentially important implications for health, it is critical for researchers to have access to a validated measure of FCV. Though there is an existing measure of FCV, this measure was developed 20 years ago and recent research suggests additional FCVs exist that are not included in this measure. A series of four studies was conducted to develop a new expanded measure of FCV. An eight-factor model of FCV was supported and confirmed. In aggregate, results from the four studies indicate that the measure is content valid, and has internally consistent scales that also demonstrated acceptable temporal stability and convergent validity. In addition, the eight scales of the measures were independent of social desirability, met criteria for measurement invariance across income groups, and predicted dietary intake. The development of this new measure of FCV may be useful for researchers examining FCVs (FCVs) in the future, as well as for use in intervention and prevention efforts targeting dietary choices. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Protocol: validation of the INCODE barometer to measure the innovation compe-tence through the Rasch Measurement Theory

    Directory of Open Access Journals (Sweden)

    Lidia Sanchez

    2017-06-01

    Full Text Available This communication presents a protocol in order to show the different phases that must be followed in order to validate the INCODE barometer, which is used to measure the innovation competence, with Rasch Measurement Theory. Five phases are stated: dimensionality analysis, individual reliability and validity analysis of ítems and persons, global reliability and validity analysis, and cathegory analysis.