WorldWideScience

Sample records for reliable confidence measures

  1. Sample size planning for composite reliability coefficients: accuracy in parameter estimation via narrow confidence intervals.

    Science.gov (United States)

    Terry, Leann; Kelley, Ken

    2012-11-01

    Composite measures play an important role in psychology and related disciplines. Composite measures almost always have error. Correspondingly, it is important to understand the reliability of the scores from any particular composite measure. However, the point estimates of the reliability of composite measures are fallible and thus all such point estimates should be accompanied by a confidence interval. When confidence intervals are wide, there is much uncertainty in the population value of the reliability coefficient. Given the importance of reporting confidence intervals for estimates of reliability, coupled with the undesirability of wide confidence intervals, we develop methods that allow researchers to plan sample size in order to obtain narrow confidence intervals for population reliability coefficients. We first discuss composite reliability coefficients and then provide a discussion on confidence interval formation for the corresponding population value. Using the accuracy in parameter estimation approach, we develop two methods to obtain accurate estimates of reliability by planning sample size. The first method provides a way to plan sample size so that the expected confidence interval width for the population reliability coefficient is sufficiently narrow. The second method ensures that the confidence interval width will be sufficiently narrow with some desired degree of assurance (e.g., 99% assurance that the 95% confidence interval for the population reliability coefficient will be less than W units wide). The effectiveness of our methods was verified with Monte Carlo simulation studies. We demonstrate how to easily implement the methods with easy-to-use and freely available software. ©2011 The British Psychological Society.

  2. Methodology for building confidence measures

    Science.gov (United States)

    Bramson, Aaron L.

    2004-04-01

    This paper presents a generalized methodology for propagating known or estimated levels of individual source document truth reliability to determine the confidence level of a combined output. Initial document certainty levels are augmented by (i) combining the reliability measures of multiply sources, (ii) incorporating the truth reinforcement of related elements, and (iii) incorporating the importance of the individual elements for determining the probability of truth for the whole. The result is a measure of confidence in system output based on the establishing of links among the truth values of inputs. This methodology was developed for application to a multi-component situation awareness tool under development at the Air Force Research Laboratory in Rome, New York. Determining how improvements in data quality and the variety of documents collected affect the probability of a correct situational detection helps optimize the performance of the tool overall.

  3. Probabilistic confidence for decisions based on uncertain reliability estimates

    Science.gov (United States)

    Reid, Stuart G.

    2013-05-01

    Reliability assessments are commonly carried out to provide a rational basis for risk-informed decisions concerning the design or maintenance of engineering systems and structures. However, calculated reliabilities and associated probabilities of failure often have significant uncertainties associated with the possible estimation errors relative to the 'true' failure probabilities. For uncertain probabilities of failure, a measure of 'probabilistic confidence' has been proposed to reflect the concern that uncertainty about the true probability of failure could result in a system or structure that is unsafe and could subsequently fail. The paper describes how the concept of probabilistic confidence can be applied to evaluate and appropriately limit the probabilities of failure attributable to particular uncertainties such as design errors that may critically affect the dependability of risk-acceptance decisions. This approach is illustrated with regard to the dependability of structural design processes based on prototype testing with uncertainties attributable to sampling variability.

  4. A systematic review of maternal confidence for physiologic birth: characteristics of prenatal care and confidence measurement.

    Science.gov (United States)

    Avery, Melissa D; Saftner, Melissa A; Larson, Bridget; Weinfurter, Elizabeth V

    2014-01-01

    Because a focus on physiologic labor and birth has reemerged in recent years, care providers have the opportunity in the prenatal period to help women increase confidence in their ability to give birth without unnecessary interventions. However, most research has only examined support for women during labor. The purpose of this systematic review was to examine the research literature for information about prenatal care approaches that increase women's confidence for physiologic labor and birth and tools to measure that confidence. Studies were reviewed that explored any element of a pregnant woman's interaction with her prenatal care provider that helped build confidence in her ability to labor and give birth. Timing of interaction with pregnant women included during pregnancy, labor and birth, and the postpartum period. In addition, we looked for studies that developed a measure of women's confidence related to labor and birth. Outcome measures included confidence or similar concepts, descriptions of components of prenatal care contributing to maternal confidence for birth, and reliability and validity of tools measuring confidence. The search of MEDLINE, CINAHL, PsycINFO, and Scopus databases provided a total of 893 citations. After removing duplicates and articles that did not meet inclusion criteria, 6 articles were included in the review. Three relate to women's confidence for labor during the prenatal period, and 3 describe tools to measure women's confidence for birth. Research about enhancing women's confidence for labor and birth was limited to qualitative studies. Results suggest that women desire information during pregnancy and want to use that information to participate in care decisions in a relationship with a trusted provider. Further research is needed to develop interventions to help midwives and physicians enhance women's confidence in their ability to give birth and to develop a tool to measure confidence for use during prenatal care. © 2014 by

  5. Measures of differences in reliability

    International Nuclear Information System (INIS)

    Doksum, K.A.

    1975-01-01

    Measures of differences in reliability of two systems are considered in the scale model, location-scale model, and a nonparametric model. In each model, estimates and confidence intervals are given and some of their properties discussed

  6. MEASUREMENT: ACCOUNTING FOR RELIABILITY IN PERFORMANCE ESTIMATES.

    Science.gov (United States)

    Waterman, Brian; Sutter, Robert; Burroughs, Thomas; Dunagan, W Claiborne

    2014-01-01

    When evaluating physician performance measures, physician leaders are faced with the quandary of determining whether departures from expected physician performance measurements represent a true signal or random error. This uncertainty impedes the physician leader's ability and confidence to take appropriate performance improvement actions based on physician performance measurements. Incorporating reliability adjustment into physician performance measurement is a valuable way of reducing the impact of random error in the measurements, such as those caused by small sample sizes. Consequently, the physician executive has more confidence that the results represent true performance and is positioned to make better physician performance improvement decisions. Applying reliability adjustment to physician-level performance data is relatively new. As others have noted previously, it's important to keep in mind that reliability adjustment adds significant complexity to the production, interpretation and utilization of results. Furthermore, the methods explored in this case study only scratch the surface of the range of available Bayesian methods that can be used for reliability adjustment; further study is needed to test and compare these methods in practice and to examine important extensions for handling specialty-specific concerns (e.g., average case volumes, which have been shown to be important in cardiac surgery outcomes). Moreover, it's important to note that the provider group average as a basis for shrinkage is one of several possible choices that could be employed in practice and deserves further exploration in future research. With these caveats, our results demonstrate that incorporating reliability adjustment into physician performance measurements is feasible and can notably reduce the incidence of "real" signals relative to what one would expect to see using more traditional approaches. A physician leader who is interested in catalyzing performance improvement

  7. Social Information Is Integrated into Value and Confidence Judgments According to Its Reliability.

    Science.gov (United States)

    De Martino, Benedetto; Bobadilla-Suarez, Sebastian; Nouguchi, Takao; Sharot, Tali; Love, Bradley C

    2017-06-21

    How much we like something, whether it be a bottle of wine or a new film, is affected by the opinions of others. However, the social information that we receive can be contradictory and vary in its reliability. Here, we tested whether the brain incorporates these statistics when judging value and confidence. Participants provided value judgments about consumer goods in the presence of online reviews. We found that participants updated their initial value and confidence judgments in a Bayesian fashion, taking into account both the uncertainty of their initial beliefs and the reliability of the social information. Activity in dorsomedial prefrontal cortex tracked the degree of belief update. Analogous to how lower-level perceptual information is integrated, we found that the human brain integrates social information according to its reliability when judging value and confidence. SIGNIFICANCE STATEMENT The field of perceptual decision making has shown that the sensory system integrates different sources of information according to their respective reliability, as predicted by a Bayesian inference scheme. In this work, we hypothesized that a similar coding scheme is implemented by the human brain to process social signals and guide complex, value-based decisions. We provide experimental evidence that the human prefrontal cortex's activity is consistent with a Bayesian computation that integrates social information that differs in reliability and that this integration affects the neural representation of value and confidence. Copyright © 2017 De Martino et al.

  8. The short version of the Activities-specific Balance Confidence (ABC) scale: its validity, reliability, and relationship to balance impairment and falls in older adults.

    Science.gov (United States)

    Schepens, Stacey; Goldberg, Allon; Wallace, Melissa

    2010-01-01

    A shortened version of the ABC 16-item scale (ABC-16), the ABC-6, has been proposed as an alternative balance confidence measure. We investigated whether the ABC-6 is a valid and reliable measure of balance confidence and examined its relationship to balance impairment and falls in older adults. Thirty-five community-dwelling older adults completed the ABC-16, including the 6 questions of the ABC-6. They also completed the following clinical balance tests: unipedal stance time (UST), functional reach (FR), Timed Up and Go (TUG), and maximum step length (MSL). Participants reported 12-month falls history. Balance confidence on the ABC-6 was significantly lower than on the ABC-16, however scores were highly correlated. Fallers reported lower balance confidence than non-fallers as measured by the ABC-6 scale, but confidence did not differ between the groups with the ABC-16. The ABC-6 significantly correlated with all balance tests assessed and number of falls. The ABC-16 significantly correlated with all balance tests assessed, but not with number of falls. Test-retest reliability for the ABC-16 and ABC-6 was good to excellent. The ABC-6 is a valid and reliable measure of balance confidence in community-dwelling older adults, and shows stronger relationships to falls than does the ABC-16. The ABC-6 may be a more useful balance confidence assessment tool than the ABC-16. Copyright 2009 Elsevier Ireland Ltd. All rights reserved.

  9. Development, validity and reliability testing of the East Midlands Evaluation Tool (EMET) for measuring impacts on trainees' confidence and competence following end of life care training.

    Science.gov (United States)

    Whittaker, B; Parry, R; Bird, L; Watson, S; Faull, C

    2017-02-02

    To develop, test and validate a versatile questionnaire, the East Midlands Evaluation Tool (EMET), for measuring effects of end of life care training events on trainees' self-reported confidence and competence. A paper-based questionnaire was designed on the basis of the English Department of Health's core competences for end of life care, with sections for completion pretraining, immediately post-training and also for longer term follow-up. Preliminary versions were field tested at 55 training events delivered by 13 organisations to 1793 trainees working in diverse health and social care backgrounds. Iterative rounds of development aimed to maximise relevance to events and trainees. Internal consistency was assessed by calculating interitem correlations on questionnaire responses during field testing. Content validity was assessed via qualitative content analysis of (1) responses to questionnaires completed by field tester trainers and (2) field notes from a workshop with a separate cohort of experienced trainers. Test-retest reliability was assessed via repeat administration to a cohort of student nurses. The EMET comprises 27 items with Likert-scaled responses supplemented with questions seeking free-text responses. It measures changes in self-assessed confidence and competence on 5 subscales: communication skills; assessment and care planning; symptom management; advance care planning; overarching values and knowledge. Test-retest reliability was found to be good, as was internal consistency: the questions successfully assess different aspects of the same underlying concept. The EMET provides a time-efficient, reliable and flexible means of evaluating effects of training on self-reported confidence and competence in the key elements of end of life care. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.

  10. A method of bias correction for maximal reliability with dichotomous measures.

    Science.gov (United States)

    Penev, Spiridon; Raykov, Tenko

    2010-02-01

    This paper is concerned with the reliability of weighted combinations of a given set of dichotomous measures. Maximal reliability for such measures has been discussed in the past, but the pertinent estimator exhibits a considerable bias and mean squared error for moderate sample sizes. We examine this bias, propose a procedure for bias correction, and develop a more accurate asymptotic confidence interval for the resulting estimator. In most empirically relevant cases, the bias correction and mean squared error correction can be performed simultaneously. We propose an approximate (asymptotic) confidence interval for the maximal reliability coefficient, discuss the implementation of this estimator, and investigate the mean squared error of the associated asymptotic approximation. We illustrate the proposed methods using a numerical example.

  11. Confidence Estimation of Reliability Indices of the System with Elements Duplication and Recovery

    Directory of Open Access Journals (Sweden)

    I. V. Pavlov

    2017-01-01

    Full Text Available The article considers a problem to estimate a confidence interval of the main reliability indices such as availability rate, mean time between failures, and operative availability (in the stationary state for the model of the system with duplication and independent recovery of elements.Presents a solution of the problem for a situation that often arises in practice, when there are unknown exact values of the reliability parameters of the elements, and only test data of the system or its individual parts (elements, subsystems for reliability are known. It should be noted that the problems of the confidence estimate of reliability indices of the complex systems based on the testing results of their individual elements are fairly common function in engineering practice when designing and running the various engineering systems. The available papers consider this problem, mainly, for non-recovery systems.Describes a solution of this problem for the important particular case when the system elements are duplicated by the reserved elements, and the elements that have failed in the course of system operation are recovered (regardless of the state of other elements.An approximate solution of this problem is obtained for the case of high reliability or "fast recovery" of elements on the assumption that the average recovery time of elements is small as compared to the average time between failures.

  12. An assessment of the precision and confidence of aquatic eddy correlation measurements

    DEFF Research Database (Denmark)

    Donis, Daphne; Holtappels, Moritz; Noss, Christian

    2015-01-01

    facility with well-constrained hydrodynamics. These observations are used to review data processing procedures and to recommend improved deployment methods, thus improving the precision, reliability, and confidence of EC measurements. Specifically, this study demonstrates that 1) the alignment of the time...... series based on maximum cross correlation improved the precision of EC flux estimations; 2) an oxygen sensor with a response time of

  13. Using the Reliability Theory for Assessing the Decision Confidence Probability for Comparative Life Cycle Assessments.

    Science.gov (United States)

    Wei, Wei; Larrey-Lassalle, Pyrène; Faure, Thierry; Dumoulin, Nicolas; Roux, Philippe; Mathias, Jean-Denis

    2016-03-01

    Comparative decision making process is widely used to identify which option (system, product, service, etc.) has smaller environmental footprints and for providing recommendations that help stakeholders take future decisions. However, the uncertainty problem complicates the comparison and the decision making. Probability-based decision support in LCA is a way to help stakeholders in their decision-making process. It calculates the decision confidence probability which expresses the probability of a option to have a smaller environmental impact than the one of another option. Here we apply the reliability theory to approximate the decision confidence probability. We compare the traditional Monte Carlo method with a reliability method called FORM method. The Monte Carlo method needs high computational time to calculate the decision confidence probability. The FORM method enables us to approximate the decision confidence probability with fewer simulations than the Monte Carlo method by approximating the response surface. Moreover, the FORM method calculates the associated importance factors that correspond to a sensitivity analysis in relation to the probability. The importance factors allow stakeholders to determine which factors influence their decision. Our results clearly show that the reliability method provides additional useful information to stakeholders as well as it reduces the computational time.

  14. How do regulators measure public confidence?

    International Nuclear Information System (INIS)

    Schmitt, A.; Besenyei, E.

    2006-01-01

    The conclusions and recommendations of this session can be summarized this way. - There are some important elements of confidence: visibility, satisfaction, credibility and reputation. The latter can consist of trust, positive image and knowledge of the role the organisation plays. A good reputation is hard to achieve but easy to lose. - There is a need to define what public confidence is and what to measure. The difficulty is that confidence is a matter of perception of the public, so what we try to measure is the perception. - It is controversial how to take into account the results of confidence measurement because of the influence of the context. It is not an exact science, results should be examined cautiously and surveys should be conducted frequently, at least every two years. - Different experiences were explained: - Quantitative surveys - among the general public or more specific groups like the media; - Qualitative research - with test groups and small panels; - Semi-quantitative studies - among stakeholders who have regular contracts with the regulatory body. It is not clear if the results should be shared with the public or just with other authorities and governmental organisations. - Efforts are needed to increase visibility, which is a prerequisite for confidence. - A practical example of organizing an emergency exercise and an information campaign without taking into account the real concerns of the people was given to show how public confidence can be decreased. - We learned about a new method - the so-called socio-drama - which addresses another issue also connected to confidence - the notion of understanding between stakeholders around a nuclear site. It is another way of looking at confidence in a more restricted group. (authors)

  15. Development and validation of an instrument to measure nurse educator perceived confidence in clinical teaching.

    Science.gov (United States)

    Nguyen, Van N B; Forbes, Helen; Mohebbi, Mohammadreza; Duke, Maxine

    2017-12-01

    Teaching nursing in clinical environments is considered complex and multi-faceted. Little is known about the role of the clinical nurse educator, specifically the challenges related to transition from clinician, or in some cases, from newly-graduated nurse to that of clinical nurse educator, as occurs in developing countries. Confidence in the clinical educator role has been associated with successful transition and the development of role competence. There is currently no valid and reliable instrument to measure clinical nurse educator confidence. This study was conducted to develop and psychometrically test an instrument to measure perceived confidence among clinical nurse educators. A multi-phase, multi-setting survey design was used. A total of 468 surveys were distributed, and 363 were returned. Data were analyzed using exploratory and confirmatory factor analyses. The instrument was successfully tested and modified in phase 1, and factorial validity was subsequently confirmed in phase 2. There was strong evidence of internal consistency, reliability, content, and convergent validity of the Clinical Nurse Educator Skill Acquisition Assessment instrument. The resulting instrument is applicable in similar contexts due to its rigorous development and validation process. © 2017 The Authors. Nursing & Health Sciences published by John Wiley & Sons Australia, Ltd.

  16. The short version of the Activities-specific Balance Confidence (ABC) scale: Its validity, reliability, and relationship to balance impairment and falls in older adults

    OpenAIRE

    Schepens, Stacey; Goldberg, Allon; Wallace, Melissa

    2009-01-01

    A shortened version of the ABC 16-item scale (ABC-16), the ABC-6, has been proposed as an alternative balance confidence measure. We investigated whether the ABC-6 is a valid and reliable measure of balance confidence and examined its relationship to balance impairment and falls in older adults. Thirty-five community-dwelling older adults completed the ABC-16, including the six questions of the ABC-6. They also completed the following clinical balance tests: unipedal stance time (UST), functi...

  17. Alternative confidence measure for local matching stereo algorithms

    CSIR Research Space (South Africa)

    Ndhlovu, T

    2009-11-01

    Full Text Available The authors present a confidence measure applied to individual disparity estimates in local matching stereo correspondence algorithms. It aims at identifying textureless areas, where most local matching algorithms fail. The confidence measure works...

  18. Confidence bounds of recurrence-based complexity measures

    International Nuclear Information System (INIS)

    Schinkel, Stefan; Marwan, N.; Dimigen, O.; Kurths, J.

    2009-01-01

    In the recent past, recurrence quantification analysis (RQA) has gained an increasing interest in various research areas. The complexity measures the RQA provides have been useful in describing and analysing a broad range of data. It is known to be rather robust to noise and nonstationarities. Yet, one key question in empirical research concerns the confidence bounds of measured data. In the present Letter we suggest a method for estimating the confidence bounds of recurrence-based complexity measures. We study the applicability of the suggested method with model and real-life data.

  19. Inter-rater reliability of shoulder measurements in middle-aged women.

    Science.gov (United States)

    De Groef, A; Van Kampen, M; Vervloesem, N; Clabau, E; Christiaens, M-R; Neven, P; Geraerts, I; Struyf, F; Devoogdt, N

    2017-06-01

    To investigate inter-rater reliability of a set of shoulder measurements including inclinometry [shoulder range of motion (ROM)], acromion-table distance and pectoralis minor muscle length (static scapular positioning), upward rotation with two inclinometers (scapular kinematics) and pain pressure thresholds (muscle tenderness) in middle-aged women. Observational study. Thirty symptom-free middle-aged women (first cohort) were measured by two raters. All measurements with an intraclass correlation coefficient (ICC) below 0.75 were retested after an additional training period in a second cohort of 30 symptom-free middle-aged women. Inter-rater reliability of all variables was measured with the ICC (95% confidence interval) and standard error of measurement (SEM). Acromion-table distance (ICC=0.91, SEM 0.22 to 0.28% of body length), pectoralis minor muscle length (ICC=0.91, SEM 0.16% of body length), pain pressure thresholds (ICC=0.78 to 0.85, SEM 0.39 to 0.70kg) and abduction ROM (ICC=0.77, SEM 5°) showed good to excellent inter-rater reliability in the first cohort. After an additional training period, forward flexion ROM showed good inter-rater reliability (ICC=0.83, SEM 5°), scapular upward rotation in resting position showed moderate reliability (ICC=0.52, SEM 2°), and other scaption angles showed weak reliability (ICC=0.26 to 0.43, SEM 3 to 8°). In a battery of clinical tools to evaluate factors contributing to shoulder pain, static scapular positioning and pressure pain thresholds were found to have good to excellent inter-rater reliability in middle-aged women. Additional training is recommended for measurements with a gravity inclinometer. Copyright © 2016 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.

  20. The reliability and validity of three questionnaires: The Student Satisfaction and Self-Confidence in Learning Scale, Simulation Design Scale, and Educational Practices Questionnaire.

    Science.gov (United States)

    Unver, Vesile; Basak, Tulay; Watts, Penni; Gaioso, Vanessa; Moss, Jacqueline; Tastan, Sevinc; Iyigun, Emine; Tosun, Nuran

    2017-02-01

    The purpose of this study was to adapt the "Student Satisfaction and Self-Confidence in Learning Scale" (SCLS), "Simulation Design Scale" (SDS), and "Educational Practices Questionnaire" (EPQ) developed by Jeffries and Rizzolo into Turkish and establish the reliability and the validity of these translated scales. A sample of 87 nursing students participated in this study. These scales were cross-culturally adapted through a process including translation, comparison with original version, back translation, and pretesting. Construct validity was evaluated by factor analysis, and criterion validity was evaluated using the Perceived Learning Scale, Patient Intervention Self-confidence/Competency Scale, and Educational Belief Scale. Cronbach's alpha values were found as 0.77-0.85 for SCLS, 0.73-0.86 for SDS, and 0.61-0.86 for EPQ. The results of this study show that the Turkish versions of all scales are validated and reliable measurement tools.

  1. A Study on the Reliability of Sasang Constitutional Body Trunk Measurement

    Directory of Open Access Journals (Sweden)

    Eunsu Jang

    2012-01-01

    Full Text Available Objective. Body trunk measurement for human plays an important diagnostic role not only in conventional medicine but also in Sasang constitutional medicine (SCM. The Sasang constitutional body trunk measurement (SCBTM consists of the 5-widths and the 8-circumferences which are standard locations currently employed in the SCM society. This study suggests to what extent a comprehensive training can improve the reliability of the SCBTM. Methods. We recruited 10 male subjects and 5 male observers with no experience of anthropometric measurement. We conducted measurements twice before and after a comprehensive training. Relative technical error of measurement (%TEMs was produced to assess intra and inter observer reliabilities. Results. Post-training intra-observer %TEMs of the SCBTM were 0.27% to 1.85% reduced from 0.27% to 6.26% in pre-training, respectively. Post-training inter-observer %TEMs of those were 0.56% to 1.66% reduced from 1.00% to 9.60% in pre-training, respectively. Post-training % total TEMs which represent the whole reliability were 0.68% to 2.18% reduced from maximum value of 10.18%. Conclusion. A comprehensive training makes the SCBTM more reliable, hence giving a sufficiently confident diagnostic tool. It is strongly recommended to give a comprehensive training in advance to take the SCBTM.

  2. A study on the reliability of sasang constitutional body trunk measurement.

    Science.gov (United States)

    Jang, Eunsu; Kim, Jong Yeol; Lee, Haejung; Kim, Honggie; Baek, Younghwa; Lee, Siwoo

    2012-01-01

    Objective. Body trunk measurement for human plays an important diagnostic role not only in conventional medicine but also in Sasang constitutional medicine (SCM). The Sasang constitutional body trunk measurement (SCBTM) consists of the 5-widths and the 8-circumferences which are standard locations currently employed in the SCM society. This study suggests to what extent a comprehensive training can improve the reliability of the SCBTM. Methods. We recruited 10 male subjects and 5 male observers with no experience of anthropometric measurement. We conducted measurements twice before and after a comprehensive training. Relative technical error of measurement (%TEMs) was produced to assess intra and inter observer reliabilities. Results. Post-training intra-observer %TEMs of the SCBTM were 0.27% to 1.85% reduced from 0.27% to 6.26% in pre-training, respectively. Post-training inter-observer %TEMs of those were 0.56% to 1.66% reduced from 1.00% to 9.60% in pre-training, respectively. Post-training % total TEMs which represent the whole reliability were 0.68% to 2.18% reduced from maximum value of 10.18%. Conclusion. A comprehensive training makes the SCBTM more reliable, hence giving a sufficiently confident diagnostic tool. It is strongly recommended to give a comprehensive training in advance to take the SCBTM.

  3. Pneumothorax size measurements on digital chest radiographs: Intra- and inter- rater reliability.

    Science.gov (United States)

    Thelle, Andreas; Gjerdevik, Miriam; Grydeland, Thomas; Skorge, Trude D; Wentzel-Larsen, Tore; Bakke, Per S

    2015-10-01

    Detailed and reliable methods may be important for discussions on the importance of pneumothorax size in clinical decision-making. Rhea's method is widely used to estimate pneumothorax size in percent based on chest X-rays (CXRs) from three measure points. Choi's addendum is used for anterioposterior projections. The aim of this study was to examine the intrarater and interrater reliability of the Rhea and Choi method using digital CXR in the ward based PACS monitors. Three physicians examined a retrospective series of 80 digital CXRs showing pneumothorax, using Rhea and Choi's method, then repeated in a random order two weeks later. We used the analysis of variance technique by Eliasziw et al. to assess the intrarater and interrater reliability in altogether 480 estimations of pneumothorax size. Estimated pneumothorax sizes ranged between 5% and 100%. The intrarater reliability coefficient was 0.98 (95% one-sided lower-limit confidence interval C 0.96), and the interrater reliability coefficient was 0.95 (95% one-sided lower-limit confidence interval 0.93). This study has shown that the Rhea and Choi method for calculating pneumothorax size has high intrarater and interrater reliability. These results are valid across gender, side of pneumothorax and whether the patient is diagnosed with primary or secondary pneumothorax. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  4. Test-retest reliability of behavioral measures of impulsive choice, impulsive action, and inattention.

    Science.gov (United States)

    Weafer, Jessica; Baggott, Matthew J; de Wit, Harriet

    2013-12-01

    Behavioral measures of impulsivity are widely used in substance abuse research, yet relatively little attention has been devoted to establishing their psychometric properties, especially their reliability over repeated administration. The current study examined the test-retest reliability of a battery of standardized behavioral impulsivity tasks, including measures of impulsive choice (i.e., delay discounting, probability discounting, and the Balloon Analogue Risk Task), impulsive action (i.e., the stop signal task, the go/no-go task, and commission errors on the continuous performance task), and inattention (i.e., attention lapses on a simple reaction time task and omission errors on the continuous performance task). Healthy adults (n = 128) performed the battery on two separate occasions. Reliability estimates for the individual tasks ranged from moderate to high, with Pearson correlations within the specific impulsivity domains as follows: impulsive choice (r range: .76-.89, ps reliable measures and thus can be confidently used to assess various facets of impulsivity as intermediate phenotypes for drug abuse.

  5. Reliability of Eustachian tube function measurements in a hypobaric and hyperbaric pressure chamber.

    Science.gov (United States)

    Meyer, M F; Jansen, S; Mordkovich, O; Hüttenbrink, K-B; Beutner, D

    2017-12-01

    Measurement of the Eustachian tube (ET) function is a challenge. The demand for a precise and meaningful diagnostic tool increases-especially because more and more operative therapies are being offered without objective evidence. The measurement of the ET function by continuous impedance recording in a pressure chamber is an established method, although the reliability of the measurements is still unclear. Twenty-five participants (50 ears) were exposed to phases of compression and decompression in a hypo- and hyperbaric pressure chamber. The ET function reflecting parameters-ET opening pressure (ETOP), ET opening duration (ETOD) and ET opening frequency (ETOF)-were determined under exactly the same preconditions three times in a row. The intraclass correlation coefficient (ICC) and Bland and Altman plot were used to assess test-retest reliability. ICCs revealed a high correlation for ETOP and ETOF in phases of decompression (passive equalisation) as well as ETOD and ETOP in phases of compression (active induced equalisation). Very high correlation could be shown for ETOD in decompression and ETOF in compression phases. The Bland and Altman graphs could show that measurements provide results within a 95 % confidence interval in compression and decompression phases. We conclude that measurements in a pressure chamber are a very valuable tool in terms of estimating the ET opening and closing function. Measurements show some variance comparing participants, but provide reliable results within a 95 % confidence interval in retest. This study is the basis for enabling efficacy measurements of ET treatment modalities. © 2017 John Wiley & Sons Ltd.

  6. Design, validation, and reliability of survey to measure female athlete triad knowledge among coaches

    Directory of Open Access Journals (Sweden)

    Jillian E. Frideres

    2015-06-01

    Full Text Available The purpose of this study was to design and to test the validity and reliability of an instrument to evaluate coaches' knowledge about the female athlete triad syndrome and their confidence in this knowledge. The instrument collects information regarding: knowledge of the syndrome, components, prevention and intervention; confidence of the coaches in their answers; and coach's characteristics (gender, degree held, years of experience in coaching females, continuing education participation specific to the syndrome and its components, and sport coached. The process of designing the questionnaire and testing the validity and reliability of it was done in four phases: a design and development of the instrument, b content validity, c instrument reliability, and d concurrent validity. The results show that the instrument is suitable for measuring coaches' female athlete triad knowledge. The instrument can contribute to assessing the coaches' knowledge level in relation to this topic.

  7. Confidence mediates the sex difference in mental rotation performance.

    Science.gov (United States)

    Estes, Zachary; Felker, Sydney

    2012-06-01

    On tasks that require the mental rotation of 3-dimensional figures, males typically exhibit higher accuracy than females. Using the most common measure of mental rotation (i.e., the Mental Rotations Test), we investigated whether individual variability in confidence mediates this sex difference in mental rotation performance. In each of four experiments, the sex difference was reliably elicited and eliminated by controlling or manipulating participants' confidence. Specifically, confidence predicted performance within and between sexes (Experiment 1), rendering confidence irrelevant to the task reliably eliminated the sex difference in performance (Experiments 2 and 3), and manipulating confidence significantly affected performance (Experiment 4). Thus, confidence mediates the sex difference in mental rotation performance and hence the sex difference appears to be a difference of performance rather than ability. Results are discussed in relation to other potential mediators and mechanisms, such as gender roles, sex stereotypes, spatial experience, rotation strategies, working memory, and spatial attention.

  8. Psychometric properties of the communication Confidence Rating Scale for Aphasia (CCRSA): phase 1.

    Science.gov (United States)

    Cherney, Leora R; Babbitt, Edna M; Semik, Patrick; Heinemann, Allen W

    2011-01-01

    Confidence is a construct that has not been explored previously in aphasia research. We developed the Communication Confidence Rating Scale for Aphasia (CCRSA) to assess confidence in communicating in a variety of activities and evaluated its psychometric properties using rating scale (Rasch) analysis. The CCRSA was administered to 21 individuals with aphasia before and after participation in a computer-based language therapy study. Person reliability of the 8-item CCRSA was .77. The 5-category rating scale demonstrated monotonic increases in average measures from low to high ratings. However, one item ("I follow news, sports, stories on TV/movies") misfit the construct defined by the other items (mean square infit = 1.69, item-measure correlation = .41). Deleting this item improved reliability to .79; the 7 remaining items demonstrated excellent fit to the underlying construct, although there was a modest ceiling effect in this sample. Pre- to posttreatment changes on the 7-item CCRSA measure were statistically significant using a paired samples t test. Findings support the reliability and sensitivity of the CCRSA in assessing participants' self-report of communication confidence. Further evaluation of communication confidence is required with larger and more diverse samples.

  9. The Nutrition Literacy Assessment Instrument is a Valid and Reliable Measure of Nutrition Literacy in Adults with Chronic Disease.

    Science.gov (United States)

    Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K

    2018-03-01

    To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.

  10. OSS reliability measurement and assessment

    CERN Document Server

    Yamada, Shigeru

    2016-01-01

    This book analyses quantitative open source software (OSS) reliability assessment and its applications, focusing on three major topic areas: the Fundamentals of OSS Quality/Reliability Measurement and Assessment; the Practical Applications of OSS Reliability Modelling; and Recent Developments in OSS Reliability Modelling. Offering an ideal reference guide for graduate students and researchers in reliability for open source software (OSS) and modelling, the book introduces several methods of reliability assessment for OSS including component-oriented reliability analysis based on analytic hierarchy process (AHP), analytic network process (ANP), and non-homogeneous Poisson process (NHPP) models, the stochastic differential equation models and hazard rate models. These measurement and management technologies are essential to producing and maintaining quality/reliable systems using OSS.

  11. Reliability of radiographic measurements for acute distal radius fractures

    International Nuclear Information System (INIS)

    Watson, Narelle J.; Asadollahi, Saeed; Parrish, Frank; Ridgway, Jacqueline; Tran, Phong; Keating, Jennifer L.

    2016-01-01

    The management of distal radial fractures is guided by the interpretation of radiographic findings. The aim of this investigation was to determine the intra- and inter-observer reliability of eight traditionally reported anatomic radiographic parameters in adults with an acute distal radius fracture. Five observers participated. All were routinely involved in making treatment decisions based on distal radius fracture radiographs. Observers performed independent repeated measurements on 30 radiographs for eight anatomical parameters: dorsal shift (mm), intra-articular gap (mm), intra-articular step (mm), palmar tilt (degrees), radial angle (degrees), radial height (mm), radial shift (mm), ulnar variance (mm). Intraclass correlation coefficients (ICCs) and the magnitude of retest errors were calculated. Measurement reliability was summarised as high (ICC > 0.80), moderate (0.60–0.80) or low (<0.60). Intra-observer reliability was high for dorsal shift and palmar tilt; moderate for radial angle, radial height, ulnar variance and radial shift; and low for intra-articular gap and step. Inter-observer reliability was high for palmar tilt; moderate for dorsal shift, ulnar variance, radial angle and radial height; and low for radial shift, intra-articular gap and step. Error magnitude (95 % confidence interval) was within 1–2 mm for intra-articular gap and step, 2–4 mm for ulnar variance, 4–6 mm for radial shift, dorsal shift and radial height, and 6–8° for radial angle and palmar tilt. Based on previous reports of critical values for palmar tilt, ulnar variance and radial angle, error margins appear small enough for measurements to be useful in guiding treatment decisions. Our findings indicate that clinicians cannot reliably measure values ≤1 mm for intra-articular gap and step when interpreting radiographic parameters using the standardised methods investigated in this study. As a guide for treatment selection, palmar tilt, ulnar variance and radial angle

  12. Reliability of measuring abductor hallucis muscle parameters using two different diagnostic ultrasound machines

    Directory of Open Access Journals (Sweden)

    Cameron Alyse FM

    2009-11-01

    Full Text Available Abstract Background Diagnostic ultrasound provides a method of analysing soft tissue structures of the musculoskeletal system effectively and reliably. The aim of this study was to evaluate within and between session reliability of measuring muscle dorso-plantar thickness, medio-lateral length and cross-sectional area, of the abductor hallucis muscle using two different ultrasound machines, a higher end Philips HD11 Ultrasound machine and clinically orientated Chison 8300 Deluxe Digital Portable Ultrasound System. Methods The abductor hallucis muscle of both the left and right feet of thirty asymptomatic participants was imaged and then measured using both ultrasound machines. Interclass correlation coefficients (ICC with 95% confidence intervals (CI were used to calculate both within and between session intra-tester reliability. Standard error of the measurement (SEM calculations were undertaken to assess difference between the actual measured score across trials and the smallest real difference (SRD was calculated from the SEM to indicate the degree of change that would exceed the expected trial to trial variability. Results The ICCs, SEM and SRD for dorso-plantar thickness and medial-lateral length were shown to have excellent to high within and between-session reliability for both ultrasound machines. The between-session reliability indices for cross-sectional area were acceptable for both ultrasound machines. Conclusion The results of the current study suggest that regardless of the type ultrasound machine, intra-tester reliability for the measurement the abductor hallucis muscle parameters is very high.

  13. Reliability and validity of the German short version of the Activities specific Balance Confidence (ABC-D6) scale in older adults.

    Science.gov (United States)

    Schott, Nadja

    2014-01-01

    The Activities specific Balance Confidence (ABC) is a questionnaire which was developed to assess falls-associated self-efficacy. The aim of this study was to evaluate reliability and validity of the German abbreviated 6-item version of the ABC scores in community-dwelling older people. The study sample included 384 subjects (age 71.1 ± 9.7). In order to determine the psychometric properties, reliability and validity were assessed through administration of the German adaptation of the ABC-D16 to participants twice, 10 days apart, and comparison of the ABC-D16 and the ABC-D6 with functional measures of balance and mobility (one-leg stance; 10 m walk; TUG; Fullerton Advanced Balance Scale (FAB)), physical activity (Physical Activity Scale for the Elderly (PASE)), physical fitness (30s arm curl, 30s chair stand, 6 min walk), cognition (Trail-Making-Test (TMT)), falls status, and quality of life (SF36). Factor analyses suggested a 1-factor solution for the ABC-D6 scale (explained variance 79.8%). Internal consistency (.95) and test-retest reliability (.98) for the ABC-D6 scores were excellent. Scores on the ABC-D6 were significantly lower than on the ABC-D16, but ABC-D16 and ABC-D6 scores were highly correlated (.94). There was an increasing difference in the ABC-scores between men and women with increasing age. Fallers reported lower balance confidence than non-fallers. The ABC-D6 score significantly correlated with functional measures of balance and mobility, physical activity, physical fitness, cognition, and quality of life (-.698valid instrument to asses falls-associated self-efficacy and may be used in future research projects and clinical trials. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  14. Radiographic measurement reliability of lumbar lordosis in ankylosing spondylitis.

    Science.gov (United States)

    Lee, Jung Sub; Goh, Tae Sik; Park, Shi Hwan; Lee, Hong Seok; Suh, Kuen Tak

    2013-04-01

    Intraobserver and interobserver reliabilities of the several different methods to measure lumbar lordosis have been reported. However, it has not been studied sofar in patients with ankylosing spondylitis (AS). We evaluated the inter and intraobserver reliabilities of six specific measures of global lumbar lordosis in patients with AS. Ninety-one consecutive patients with AS who met the most recently modified New York criteria were enrolled and underwent anteroposterior and lateral radiographs of whole spine. The radiographs were divided into non-ankylosis (no bony bridge in the lumbar spine), incomplete ankylosis (lumbar spines were partially connected by bony bridge) and complete ankylosis groups to evaluate the reliability of the Cobb L1-S1, Cobb L1-L5, centroid, posterior tangent L1-S1, posterior tangent L1-L5, and TRALL methods. The radiographs were composed of 39 non-ankylosis, 27 incomplete ankylosis and 25 complete ankylosis. Intra- and inter-class correlation coefficients (ICCs) of all six methods were generally high. The ICCs were all ≥0.77 (excellent) for the six radiographic methods in the combined group. However, a comparison of the ICCs, 95 % confidence intervals and mean absolute difference (MAD) between groups with varying degrees of ankylosis showed that the reliability of the lordosis measurements decreased in proportion to the severity of ankylosis. The Cobb L1-S1, Cobb L1-L5 and posterior tangent L1-S1 method demonstrated higher ICCs for both inter and intraobserver comparisons and the other methods showed lower ICCs in all groups. The intraobserver MAD was similar in the Cobb L1-S1 and Cobb L1-L5 (2.7°-4.3°), but the other methods showed higher intraobserver MAD. Interobserver MAD of Cobb L1-L5 only showed low in all group. These results are the first to provide a reliability analysis of different global lumbar lordosis measurement methods in AS. The findings in this study demonstrated that the Cobb L1-L5 method is reliable for measuring

  15. Stress Rupture Life Reliability Measures for Composite Overwrapped Pressure Vessels

    Science.gov (United States)

    Murthy, Pappu L. N.; Thesken, John C.; Phoenix, S. Leigh; Grimes-Ledesma, Lorie

    2007-01-01

    Composite Overwrapped Pressure Vessels (COPVs) are often used for storing pressurant gases onboard spacecraft. Kevlar (DuPont), glass, carbon and other more recent fibers have all been used as overwraps. Due to the fact that overwraps are subjected to sustained loads for an extended period during a mission, stress rupture failure is a major concern. It is therefore important to ascertain the reliability of these vessels by analysis, since the testing of each flight design cannot be completed on a practical time scale. The present paper examines specifically a Weibull statistics based stress rupture model and considers the various uncertainties associated with the model parameters. The paper also examines several reliability estimate measures that would be of use for the purpose of recertification and for qualifying flight worthiness of these vessels. Specifically, deterministic values for a point estimate, mean estimate and 90/95 percent confidence estimates of the reliability are all examined for a typical flight quality vessel under constant stress. The mean and the 90/95 percent confidence estimates are computed using Monte-Carlo simulation techniques by assuming distribution statistics of model parameters based also on simulation and on the available data, especially the sample sizes represented in the data. The data for the stress rupture model are obtained from the Lawrence Livermore National Laboratories (LLNL) stress rupture testing program, carried out for the past 35 years. Deterministic as well as probabilistic sensitivities are examined.

  16. A Reliable Measure of Information Security Awareness and the Identification of Bias in Responses

    Directory of Open Access Journals (Sweden)

    Agata McCormac

    2017-11-01

    Full Text Available The Human Aspects of Information Security Questionnaire (HAIS-Q is designed to measure Information Security Awareness. More specifically, the tool measures an individual’s knowledge, attitude, and self-reported behaviour relating to information security in the workplace. This paper reports on the reliability of the HAIS-Q, including test-retest reliability and internal consistency. The paper also assesses the reliability of three preliminary over-claiming items, designed specifically to complement the HAIS-Q, and identify those individuals who provide socially desirable responses. A total of 197 working Australians completed two iterations of the HAIS-Q and the over-claiming items, approximately 4 weeks apart. Results of the analysis showed that the HAIS-Q was externally reliable and internally consistent. Therefore, the HAIS-Q can be used to reliably measure information security awareness. Reliability testing on the preliminary over-claiming items was not as robust and further development is required and recommended. The implications of these findings mean that organisations can confidently use the HAIS-Q to not only measure the current state of employee information security awareness within their organisation, but they can also measure the effectiveness and impacts of training interventions, information security awareness programs and campaigns. The influence of cultural changes and the effect of security incidents can also be assessed.

  17. Reliability of measuring hip abductor strength following total knee arthroplasty using a hand-held dynamometer.

    Science.gov (United States)

    Schache, Margaret B; McClelland, Jodie A; Webster, Kate E

    2016-01-01

    To investigate the test-retest reliability of measuring hip abductor strength in patients with total knee arthroplasty (TKA) using a hand-held dynamometer (HHD) with two different types of resistance: belt and manual resistance. Test-retest reliability of 30 subjects (17 female, 13 male, 71.9 ± 7.4 years old), 9.2 ± 2.7 days post TKA was measured using belt and therapist resistance. Retest reliability was calculated with intra-class coefficients (ICC3,1) and 95% confidence intervals (CI) for both the group average and the individual scores. A paired t-test assessed whether a difference existed between the belt and therapist methods of resistance. ICCs were 0.82 and 0.80 for the belt and therapist resisted methods, respectively. Hip abductor strength increases of 8 N (14%) for belt resisted and 14 N (17%) for therapist resisted measurements of the group average exceeded the 95% CI and may represent real change. For individuals, hip abductor strength increases of 33 N (72%) (belt resisted) and 57 N (79%) (therapist resisted) could be interpreted as real change. Hip abductor strength can be reliably measured using HHD in the clinical setting with the described protocol. Belt resistance demonstrated slightly higher test-retest reliability. Reliable measurement of hip abductor muscle strength in patients with TKA is important to ensure deficiencies are addressed in rehabilitation programs and function is maximized. Hip abductor strength can be reliably measured with a hand-held dynamometer in the clinical setting using manual or belt resistance.

  18. Measuring older adults' sedentary time: reliability, validity, and responsiveness.

    Science.gov (United States)

    Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville

    2011-11-01

    With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is

  19. Test-Retest Reliability of Measurements of Hand-Grip Strength Obtained by Dynamometry from Older Adults: A Systematic Review of Research in the PubMed Database.

    Science.gov (United States)

    Bohannon, R W

    2017-01-01

    A systematic review was performed to summarize literature describing the test-retest reliability of grip strength measures obtained from older adults. Relevant literature was identified via a PubMed search. Seventeen articles were deemed appropriate based on inclusion and exclusion criteria. The relative test-retest reliability of grip strength measures obtained by dynamometry was good to excellent (intra-class correlation coefficients > 0.80) in all but 3 studies, which involved older adults with severe dementia. Absolute reliability, as indicated by summary statistics such as the minimum detectable change (95%), was more variable. As a percentage, that change ranged from 14.5% to 98.5%. Consequently, clinicians can be confident in the relative reliability of grip strength measures obtained from at risk older adults. However, relatively large percentage changes in grip strength may be necessary to conclude with confidence that a real change has occurred over time in some populations.

  20. The reliability of commonly used electrophysiology measures.

    Science.gov (United States)

    Brown, K E; Lohse, K R; Mayer, I M S; Strigaro, G; Desikan, M; Casula, E P; Meunier, S; Popa, T; Lamy, J-C; Odish, O; Leavitt, B R; Durr, A; Roos, R A C; Tabrizi, S J; Rothwell, J C; Boyd, L A; Orth, M

    Electrophysiological measures can help understand brain function both in healthy individuals and in the context of a disease. Given the amount of information that can be extracted from these measures and their frequent use, it is essential to know more about their inherent reliability. To understand the reliability of electrophysiology measures in healthy individuals. We hypothesized that measures of threshold and latency would be the most reliable and least susceptible to methodological differences between study sites. Somatosensory evoked potentials from 112 control participants; long-latency reflexes, transcranial magnetic stimulation with resting and active motor thresholds, motor evoked potential latencies, input/output curves, and short-latency sensory afferent inhibition and facilitation from 84 controls were collected at 3 visits over 24 months at 4 Track-On HD study sites. Reliability was assessed using intra-class correlation coefficients for absolute agreement, and the effects of reliability on statistical power are demonstrated for different sample sizes and study designs. Measures quantifying latencies, thresholds, and evoked responses at high stimulator intensities had the highest reliability, and required the smallest sample sizes to adequately power a study. Very few between-site differences were detected. Reliability and susceptibility to between-site differences should be evaluated for electrophysiological measures before including them in study designs. Levels of reliability vary substantially across electrophysiological measures, though there are few between-site differences. To address this, reliability should be used in conjunction with theoretical calculations to inform sample size and ensure studies are adequately powered to detect true change in measures of interest. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Reliability and responsiveness of a goniometric device for measuring the range of motion in the dart-throwing motion plane.

    Science.gov (United States)

    Kasubuchi, Kenji; Dohi, Yoshihiro; Fujita, Hiroyuki; Fukumoto, Takahiko

    2018-02-26

    Dart-throwing motion (DTM) is an important component of wrist function and, consequently, has the potential to become an evaluation tool in rehabilitation. However, no measurement method is currently available to reliably measure range of motion (ROM) of the wrist in the DTM plane. To determine the reliability and responsiveness of a goniometric device to measure wrist ROM in the DTM plane. ROM of the wrist in the DTM plane was measured in 70 healthy participants. The intra-class correlation coefficient (ICC) was used to evaluate the relative reliability of measurement, and a Bland-Altman analysis conducted to establish its absolute reliability, including the 95% limits of agreement (95% LOA). The standard error of the measurement (SEM) and minimal detectable change at the 95% confidence level (MDC 95 ) were calculated as measures of responsiveness. The intra-rater ICC was 0.87, and an inter-rater ICC of 0.71. There was no evidence of a fixed or proportional bias. For intra- and inter-rater reliability, 95% LOA ranged from -13.83 to 11.12 and from -17.75 to 16.19, respectively. The SEM and MDC 95 were 4.5° and 12.4°, respectively, for intra-rater reliability, and 6.0° and 16.6°, respectively, for inter-rater reliability. The ROM of the wrist in the DTM plane was measured with fair-to-good reliability and responsiveness and, therefore, has the potential to become an evaluation tool for rehabilitation.

  2. A Systematic Review of Statistical Methods Used to Test for Reliability of Medical Instruments Measuring Continuous Variables

    Directory of Open Access Journals (Sweden)

    Rafdzah Zaki

    2013-06-01

    Full Text Available   Objective(s: Reliability measures precision or the extent to which test results can be replicated. This is the first ever systematic review to identify statistical methods used to measure reliability of equipment measuring continuous variables. This studyalso aims to highlight the inappropriate statistical method used in the reliability analysis and its implication in the medical practice.   Materials and Methods: In 2010, five electronic databases were searched between 2007 and 2009 to look for reliability studies. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and finally 42 fitted the inclusion criteria. Results: The Intra-class Correlation Coefficient (ICC is the most popular method with 25 (60% studies having used this method followed by the comparing means (8 or 19%. Out of 25 studies using the ICC, only 7 (28% reported the confidence intervals and types of ICC used. Most studies (71% also tested the agreement of instruments. Conclusion: This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.

  3. A clinical tool to measure plagiocephaly in infants using a flexicurve: a reliability study

    Directory of Open Access Journals (Sweden)

    Leung A

    2013-10-01

    Full Text Available Amy Leung,1 Pauline Watter,2 John Gavranich3 1Department of Physiotherapy, Royal Children's Hospital, Brisbane, Australia; 2Physiotherapy Division, University of Queensland, Brisbane, Australia; 3Child and Family Health Services, West Moreton Health Service District, Ipswich, Australia Purpose: There has been an increasing incidence of infants presenting with plagiocephaly in the last two decades. A practical, economical, and reliable clinical plagiocephaly measure is essential to assess progression and intervention outcomes. This study investigated the reliability of a modified cranial vault asymmetry index using a flexible curve in infants. Measurement: A flexicurve was molded to the infant's head and its shape maintained as it was placed onto paper to trace the head shape. Using a small modification of Loveday and De Chaplain's procedure to measure a cranial vault asymmetry index, a pair of diagonals were drawn at 30° through the midpoint of the central line to their intersection with the traced head outline. The difference in length of the paired diagonals was divided by the short diameter then multiplied by 100%, yielding the modified cranial vault-asymmetry index. Patients and methods: Infants referred to a community health physiotherapist for assessment due to suspected abnormal head shape were included. To explore intrarater reliability, 34 infants aged 3–14 months were measured twice (T1/T1′ at the beginning, and 21 of these remeasured twice at the end (T2/T2′ of their physiotherapy sessions. Test–retest reliability used matched-average data (T1/T1′ and (T2/T2′ from 21 infants. To explore interrater reliability, 18 healthy infants aged 2–6 months were recruited. Each infant was measured once by each rater. Results: For intrarater reliability, the intraclass correlation coefficient with 54 degrees of freedom (ICCdf54 was 0.868 (95% confidence interval [CI] 0.783–0.921; for test–retest reliability, ICCdf20 = 0.958 (95

  4. The confidence in diabetes self-care scale

    DEFF Research Database (Denmark)

    Van Der Ven, Nicole C W; Weinger, Katie; Yi, Joyce

    2003-01-01

    evaluated in Dutch (n = 151) and U.S. (n = 190) outpatients with type 1 diabetes. In addition to the CIDS scale, assessment included HbA(1c), emotional distress, fear of hypoglycemia, self-esteem, anxiety, depression, and self-care behavior. The Dutch sample completed additional measures on perceived burden......OBJECTIVE: To examine psychometric properties of the Confidence in Diabetes Self-Care (CIDS) scale, a newly developed instrument assessing diabetes-specific self-efficacy in Dutch and U.S. patients with type 1 diabetes. RESEARCH DESIGN AND METHODS: Reliability and validity of the CIDS scale were...... and importance of self-care. Test-retest reliability was established in a second Dutch sample (n = 62). RESULTS: Internal consistency (Cronbach's alpha = 0.86 for Dutch patients and 0.90 U.S. patients) and test-retest reliability (Spearman's r = 0.85, P

  5. Clinimetric properties of the Tinetti Mobility Test, Four Square Step Test, Activities-specific Balance Confidence Scale, and spatiotemporal gait measures in individuals with Huntington's disease.

    Science.gov (United States)

    Kloos, Anne D; Fritz, Nora E; Kostyk, Sandra K; Young, Gregory S; Kegelmeyer, Deb A

    2014-09-01

    Individuals with Huntington's disease (HD) experience balance and gait problems that lead to falls. Clinicians currently have very little information about the reliability and validity of outcome measures to determine the efficacy of interventions that aim to reduce balance and gait impairments in HD. This study examined the reliability and concurrent validity of spatiotemporal gait measures, the Tinetti Mobility Test (TMT), Four Square Step Test (FSST), and Activities-specific Balance Confidence (ABC) Scale in individuals with HD. Participants with HD [n = 20; mean age ± SD=50.9 ± 13.7; 7 male] were tested on spatiotemporal gait measures and the TMT, FSST, and ABC Scale before and after a six week period to determine test-retest reliability and minimal detectable change (MDC) values. Linear relationships between gait and clinical measures were estimated using Pearson's correlation coefficients. Spatiotemporal gait measures, the TMT total and the FSST showed good to excellent test-retest reliability (ICC > 0.75). MDC values were 0.30 m/s and 0.17 m/s for velocity in forward and backward walking respectively, four points for the TMT, and 3s for the FSST. The TMT and FSST were highly correlated with most spatiotemporal measures. The ABC Scale demonstrated lower reliability and less concurrent validity than other measures. The high test-retest reliability over a six week period and concurrent validity between the TMT, FSST, and spatiotemporal gait measures suggest that the TMT and FSST may be useful outcome measures for future intervention studies in ambulatory individuals with HD. Copyright © 2014 Elsevier B.V. All rights reserved.

  6. Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

    Science.gov (United States)

    Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

    2018-03-02

    There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.

  7. ImageJ: A Free, Easy, and Reliable Method to Measure Leg Ulcers Using Digital Pictures.

    Science.gov (United States)

    Aragón-Sánchez, Javier; Quintana-Marrero, Yurena; Aragón-Hernández, Cristina; Hernández-Herero, María José

    2017-12-01

    Wound measurement to document the healing course of chronic leg ulcers has an important role in the management of these patients. Digital cameras in smartphones are readily available and easy to use, and taking pictures of wounds is becoming a routine in specialized departments. Analyzing digital pictures with appropriate software provides clinicians a quick, clean, and easy-to-use tool for measuring wound area. A set of 25 digital pictures of plain foot and leg ulcers was the basis of this study. Photographs were taken placing a ruler next to the wound in parallel with the healthy skin with the iPhone 6S (Apple Inc, Cupertino, CA), which has a camera of 12 megapixels using the flash. The digital photographs were visualized with ImageJ 1.45s freeware (National Institutes of Health, Rockville, MD; http://imagej.net/ImageJ ). Wound area measurement was carried out by 4 raters: head of the department, wound care nurse, physician, and medical student. We assessed intra- and interrater reliability using the interclass correlation coefficient. To determine intraobserver reliability, 2 of the raters repeated the measurement of the set 1 week after the first reading. The interrater model displayed an interclass correlation coefficient of 0.99 with 95% confidence interval of 0.999 to 1.000, showing excellent reliability. The intrarater model of both examiners showed excellent reliability. In conclusion, analyzing digital images of leg ulcers with ImageJ estimates wound area with excellent reliability. This method provides a free, rapid, and accurate way to measure wounds and could routinely be used to document wound healing in daily clinical practice.

  8. Development of a reliable, valid measure to assess parents' and teachers' understanding of postural care for children with physical disabilities: the (UKC PostCarD) questionnaire.

    Science.gov (United States)

    Hotham, S; Hutton, E; Hamilton-West, K E

    2015-11-01

    Previous research has highlighted lack of knowledge, understanding and confidence among parents and teachers responsible for the postural care of children with physical disability. Interventions designed to improve these qualities require a reliable and validated tool to assess pre- and post-intervention levels. Currently, however, no validated measure of postural care confidence (i.e. self-efficacy) exists. Hence, the aim of this research was to develop a reliable and valid questionnaire to assess parents' and teachers' confidence, alongside knowledge and understanding of postural care - the Understanding Knowledge and Confidence in providing POSTural CARe for children with Disabilities (UKC PostCarD) questionnaire. Items were developed by a multidisciplinary team and designed to map onto the content of 'An A-to-Z of Postural Care'. Parents, teachers and therapists assessed items for face validity. Scale reliability was then assessed using Cronbach's alpha and known-group validity was assessed by comparing scores of an 'expert' group (physiotherapists and occupational therapists) with those of a 'non-expert' group (with no formal training in postural care). The total scale and all three subscales (understanding and knowledge, confidence and concerns) demonstrated adequate reliability (α > 0.83) and subscale correlations formed a logical pattern (understanding and knowledge correlated positively with confidence and negatively with concerns). Experts' (n = 111) scores were higher than non-experts' (n = 79) for the total scale and all subscales (P children with disabilities. © 2015 John Wiley & Sons Ltd.

  9. Confidence building measures at sea:opportunities for India and Pakistan.

    Energy Technology Data Exchange (ETDEWEB)

    Vohra, Ravi Bhushan Rear Admiral (; ); Ansari, Hasan Masood Rear Admiral (; )

    2003-12-01

    The sea presents unique possibilities for implementing confidence building measures (CBMs) between India and Pakistan that are currently not available along the contentious land borders surrounding Jammu and Kashmir. This is due to the nature of maritime issues, the common military culture of naval forces, and a less contentious history of maritime interaction between the two nations. Maritime issues of mutual concern provide a strong foundation for more far-reaching future CBMs on land, while addressing pressing security, economic, and humanitarian needs at sea in the near-term. Although Indian and Pakistani maritime forces currently have stronger opportunities to cooperate with one another than their counterparts on land, reliable mechanisms to alleviate tension or promote operational coordination remain non-existent. Therefore, possible maritime CBMs, as well as pragmatic mechanisms to initiate and sustain cooperation, require serious examination. This report reflects the unique joint research undertaking of two retired Senior Naval Officers from both India and Pakistan, sponsored by the Cooperative Monitoring Center of the International Security Center at Sandia National Laboratories. Research focuses on technology as a valuable tool to facilitate confidence building between states having a low level of initial trust. Technical CBMs not only increase transparency, but also provide standardized, scientific means of interacting on politically difficult problems. Admirals Vohra and Ansari introduce technology as a mechanism to facilitate consistent forms of cooperation and initiate discussion in the maritime realm. They present technical CBMs capable of being acted upon as well as high-level political recommendations regarding the following issues: (1) Delimitation of the maritime boundary between India and Pakistan and its relationship to the Sir Creek dispute; (2) Restoration of full shipping links and the security of ports and cargos; (3) Fishing within

  10. The Validity and Reliability Characteristics of the M-BACK Questionnaire to Assess the Barriers, Attitudes, Confidence, and Knowledge of Mental Health Staff Regarding Metabolic Health of Mental Health Service Users

    Directory of Open Access Journals (Sweden)

    Andrew Watkins

    2017-12-01

    Full Text Available BackgroundAddressing the burden of poor physical health and the subsequent gap in life expectancy experienced by people with mental illness is a major priority in mental health services. To equip mental health staff with the competence to deliver evidence-based interventions, targeted staff training regarding metabolic health is required. In order to evaluate the effectiveness of staff training regarding metabolic health, we aimed to develop a succinct measure to determine the barriers, attitudes, confidence, and knowledge of health practitioners through the development and test–retest reliability of the Metabolic-Barriers, Attitudes, Confidence, and Knowledge Questionnaire (M-BACK.MethodsThe M-BACK questionnaire was developed to evaluate the impact of specialized training in metabolic health care for mental health nurses. Content of the M-BACK was developed from a literature review and refined by an expert review panel and validated via a piloting process. To determine the test–retest reliability of the M-BACK, 31 nursing students recruited from the University of Notre Dame, Sydney completed the questionnaire on two separate occasions, 7 days apart. Intraclass correlation coefficients (ICCs were calculated for the total score, as well as each of the four domains.ResultsPilot testing was undertaken with a sample of 106 mental health nurses with a mean age 48.2, ranging from 24 to 63 years of age, who participated in six training courses. Questionnaire development resulted in a 16-item instrument, with each item is scored on a five-point Likert scale ranging from “strongly disagree” to “strongly agree.” Test–retest reliability of the M-BACK was completed by 30 of 31 nursing students recruited, ICCs ranged from 0.62 to 0.96.ConclusionThe M-BACK is a reliable measure of the key elements of practitioner perceptions of barriers, and their knowledge, attitudes, and confidence regarding metabolic monitoring in people with mental

  11. The theory of confidence-building measures

    International Nuclear Information System (INIS)

    Darilek, R.E.

    1992-01-01

    This paper discusses the theory of Confidence-Building Measures (CBMs) in two ways. First, it employs a top-down, deductively oriented approach to explain CBM theory in terms of the arms control goals and objectives to be achieved, the types of measures to be employed, and the problems or limitations likely to be encountered when applying CBMs to conventional or nuclear forces. The chapter as a whole asks how various types of CBMs might function during a political - military escalation from peacetime to a crisis and beyond (i.e. including conflict), as well as how they might operate in a de-escalatory environment. In pursuit of these overarching issues, the second section of the chapter raises a fundamental but complicating question: how might the next all-out war actually come aoubt - by unpremeditated escalation resulting from misunderstanding or miscalculation, or by premeditation resulting in a surprise attack? The second section of the paper addresses this question, explores its various implications for CBMs, and suggests the potential contribution of different types of CBMs toward successful resolution of the issues involved

  12. Building, measuring and improving public confidence in the nuclear regulator

    International Nuclear Information System (INIS)

    2006-01-01

    An important factor for public confidence in the nuclear regulator is the general public trust of the government and its representatives, which is clearly not the same in all countries. Likewise, cultural differences between countries can be considerable, and similar means of communication between government authorities and the public may not be universally effective. Nevertheless, this workshop identified a number of common principles for the communication of nuclear regulatory decisions that can be recommended to all regulators. They have been cited in particular for their ability to help build, measure and/or improve overall public confidence in the nuclear regulator. (author)

  13. Evaluating test-retest reliability in patient-reported outcome measures for older people: A systematic review.

    Science.gov (United States)

    Park, Myung Sook; Kang, Kyung Ja; Jang, Sun Joo; Lee, Joo Yun; Chang, Sun Ju

    2018-03-01

    This study aimed to evaluate the components of test-retest reliability including time interval, sample size, and statistical methods used in patient-reported outcome measures in older people and to provide suggestions on the methodology for calculating test-retest reliability for patient-reported outcomes in older people. This was a systematic literature review. MEDLINE, Embase, CINAHL, and PsycINFO were searched from January 1, 2000 to August 10, 2017 by an information specialist. This systematic review was guided by both the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and the guideline for systematic review published by the National Evidence-based Healthcare Collaborating Agency in Korea. The methodological quality was assessed by the Consensus-based Standards for the selection of health Measurement Instruments checklist box B. Ninety-five out of 12,641 studies were selected for the analysis. The median time interval for test-retest reliability was 14days, and the ratio of sample size for test-retest reliability to the number of items in each measure ranged from 1:1 to 1:4. The most frequently used statistical methods for continuous scores was intraclass correlation coefficients (ICCs). Among the 63 studies that used ICCs, 21 studies presented models for ICC calculations and 30 studies reported 95% confidence intervals of the ICCs. Additional analyses using 17 studies that reported a strong ICC (>0.09) showed that the mean time interval was 12.88days and the mean ratio of the number of items to sample size was 1:5.37. When researchers plan to assess the test-retest reliability of patient-reported outcome measures for older people, they need to consider an adequate time interval of approximately 13days and the sample size of about 5 times the number of items. Particularly, statistical methods should not only be selected based on the types of scores of the patient-reported outcome measures, but should also be described clearly in

  14. A fuzzy logic algorithm to assign confidence levels to heart and respiratory rate time series

    International Nuclear Information System (INIS)

    Liu, J; McKenna, T M; Gribok, A; Reifman, J; Beidleman, B A; Tharion, W J

    2008-01-01

    We have developed a fuzzy logic-based algorithm to qualify the reliability of heart rate (HR) and respiratory rate (RR) vital-sign time-series data by assigning a confidence level to the data points while they are measured as a continuous data stream. The algorithm's membership functions are derived from physiology-based performance limits and mass-assignment-based data-driven characteristics of the signals. The assigned confidence levels are based on the reliability of each HR and RR measurement as well as the relationship between them. The algorithm was tested on HR and RR data collected from subjects undertaking a range of physical activities, and it showed acceptable performance in detecting four types of faults that result in low-confidence data points (receiver operating characteristic areas under the curve ranged from 0.67 (SD 0.04) to 0.83 (SD 0.03), mean and standard deviation (SD) over all faults). The algorithm is sensitive to noise in the raw HR and RR data and will flag many data points as low confidence if the data are noisy; prior processing of the data to reduce noise allows identification of only the most substantial faults. Depending on how HR and RR data are processed, the algorithm can be applied as a tool to evaluate sensor performance or to qualify HR and RR time-series data in terms of their reliability before use in automated decision-assist systems

  15. Reliability and reproducibility analysis of the Cobb angle and assessing sagittal plane by computer-assisted and manual measurement tools.

    Science.gov (United States)

    Wu, Weifei; Liang, Jie; Du, Yuanli; Tan, Xiaoyi; Xiang, Xuanping; Wang, Wanhong; Ru, Neng; Le, Jinbo

    2014-02-06

    Although many studies on reliability and reproducibility of measurement have been performed on coronal Cobb angle, few results about reliability and reproducibility are reported on sagittal alignment measurement including the pelvis. We usually use SurgimapSpine software to measure the Cobb angle in our studies; however, there are no reports till date on its reliability and reproducible measurements. Sixty-eight standard standing posteroanterior whole-spine radiographs were reviewed. Three examiners carried out the measurements independently under the settings of manual measurement on X-ray radiographies and SurgimapSpine software on the computer. Parameters measured included pelvic incidence, sacral slope, pelvic tilt, Lumbar lordosis (LL), thoracic kyphosis, and coronal Cobb angle. SPSS 16.0 software was used for statistical analyses. The means, standard deviations, intraclass and interclass correlation coefficient (ICC), and 95% confidence intervals (CI) were calculated. There was no notable difference between the two tools (P = 0.21) for the coronal Cobb angle. In the sagittal plane parameters, the ICC of intraobserver reliability for the manual measures varied from 0.65 (T2-T5 angle) to 0.95 (LL angle). Further, for SurgimapSpine tool, the ICC ranged from 0.75 to 0.98. No significant difference in intraobserver reliability was found between the two measurements (P > 0.05). As for the interobserver reliability, measurements with SurgimapSpine tool had better ICC (0.71 to 0.98 vs 0.59 to 0.96) and Pearson's coefficient (0.76 to 0.99 vs 0.60 to 0.97). The reliability of SurgimapSpine measures was significantly higher in all parameters except for the coronal Cobb angle where the difference was not significant (P > 0.05). Although the differences between the two methods are very small, the results of this study indicate that the SurgimapSpine measurement is an equivalent measuring tool to the traditional manual in coronal Cobb angle, but is advantageous in spino

  16. Large Sample Confidence Intervals for Item Response Theory Reliability Coefficients

    Science.gov (United States)

    Andersson, Björn; Xin, Tao

    2018-01-01

    In applications of item response theory (IRT), an estimate of the reliability of the ability estimates or sum scores is often reported. However, analytical expressions for the standard errors of the estimators of the reliability coefficients are not available in the literature and therefore the variability associated with the estimated reliability…

  17. Measuring Confidence Levels of Male and Female Students in Open Access Enabling Courses

    Science.gov (United States)

    Atherton, Mirella

    2015-01-01

    The study of confidence was undertaken at the University of Newcastle with students selecting science courses at two campuses. The students were enrolled in open access programs and aimed to gain access to undergraduate studies in various disciplines at University. The "third person effect" was used to measure the confidence levels of…

  18. 用Delta法估计多维测验合成信度的置信区间%Estimating the Confidence Interval of Composite Reliability of a Multidimensional Test With the Delta Method

    Institute of Scientific and Technical Information of China (English)

    叶宝娟; 温忠麟

    2012-01-01

    Reliability is very important in evaluating the quality of a test. Based on the confirmatory factor analysis, composite reliabili- ty is a good index to estimate the test reliability for general applications. As is well known, point estimate contains limited information a- bout a population parameter and cannot indicate how far it can be from the population parameter. The confidence interval of the parame- ter can provide more information. In evaluating the quality of a test, the confidence interval of composite reliability has received atten- tion in recent years. There are three approaches to estimating the confidence interval of composite reliability of an unidimensional test: the Bootstrap method, the Delta method, and the direct use of the standard error of a software output (e. g. , LISREL). The Bootstrap method pro- vides empirical results of the standard error, and is the most credible method. But it needs data simulation techniques, and its computa- tion process is rather complex. The Delta method computes the standard error of composite reliability by approximate calculation. It is simpler than the Bootstrap method. The LISREL software can directly prompt the standard error, and it is the easiest among the three methods. By simulation study, it had been found that the interval estimates obtained by the Delta method and the Bootstrap method were almost identical, whereas the results obtained by LISREL and by the Bootstrap method were substantially different ( Ye & Wen, 2011 ). The Delta method is recommended when the confidence interval of composite reliability of a unidimensional test is estimated, because the Delta method is simpler than the Bootstrap method. There was little research about how to compute the confidence interval of composite reliability of a multidimensional test. We de- duced a formula by using the Delta method for computing the standard error of composite reliability of a multidimensional test. Based on the standard error, the

  19. Limits of reliability for the measurement of integral count

    International Nuclear Information System (INIS)

    Erbeszkorn, L.

    1979-01-01

    A method is presented for exact and approximate calculation of reliability limits of measured nuclear integral count. The formulae are applicable in measuring conditions which assure the Poisson distribution of the counts. The coefficients of the approximate formulae for 90, 95, 98 and 99 per cent reliability levels are given. The exact reliability limits for 90 per cent reliability level are calculated up to 80 integral counts. (R.J.)

  20. Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

    Science.gov (United States)

    Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

    2014-01-01

    This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.

  1. Impact of a critical care postgraduate certificate course on nurses' self-reported competence and confidence: A quasi-experimental study.

    Science.gov (United States)

    Baxter, Rebecca; Edvardsson, David

    2018-06-01

    Postgraduate education is said to support the development of nurses' professional competence and confidence, essential to the delivery of safe and effective care. However, there is a shortness of empirical evidence to demonstrate an increase to nurses' self-reported confidence and competence on completion of critical care postgraduate certificate-level education. To explore the impact of a critical care postgraduate certificate course on nurses' self-reported competence and confidence. To explore the psychometric properties and performance of the Critical Care Competence and Confidence Questionnaire. A quasi-experimental pre/post-test design. A total population sample of nurses completing a critical care postgraduate certificate course at an Australian University. The Critical Care Competence and Confidence Questionnaire was developed for this study to measure nurses' self-reported competence and confidence at baseline and follow up. Descriptive and inferential statistics were used to explore sample characteristics and changes between baseline and follow-up. Reliability of the questionnaire was explored using Cronbach's Alpha and item-total correlations. There was a statistically significant increase in competence and confidence between baseline and follow-up across all questionnaire domains. Satisfactory reliability estimates were found for the questionnaire. Completion of a critical care postgraduate certificate course significantly increased nurses' perceived competence and confidence. The Critical Care Competence and Confidence Questionnaire was found to be psychometrically sound for measuring nurses' self-reported competence and confidence. Copyright © 2018 Elsevier Ltd. All rights reserved.

  2. Generalized Confidence Intervals and Fiducial Intervals for Some Epidemiological Measures

    Directory of Open Access Journals (Sweden)

    Ionut Bebu

    2016-06-01

    Full Text Available For binary outcome data from epidemiological studies, this article investigates the interval estimation of several measures of interest in the absence or presence of categorical covariates. When covariates are present, the logistic regression model as well as the log-binomial model are investigated. The measures considered include the common odds ratio (OR from several studies, the number needed to treat (NNT, and the prevalence ratio. For each parameter, confidence intervals are constructed using the concepts of generalized pivotal quantities and fiducial quantities. Numerical results show that the confidence intervals so obtained exhibit satisfactory performance in terms of maintaining the coverage probabilities even when the sample sizes are not large. An appealing feature of the proposed solutions is that they are not based on maximization of the likelihood, and hence are free from convergence issues associated with the numerical calculation of the maximum likelihood estimators, especially in the context of the log-binomial model. The results are illustrated with a number of examples. The overall conclusion is that the proposed methodologies based on generalized pivotal quantities and fiducial quantities provide an accurate and unified approach for the interval estimation of the various epidemiological measures in the context of binary outcome data with or without covariates.

  3. How to Measure the Onset of Babbling Reliably?

    Science.gov (United States)

    Molemans, Inge; van den Berg, Renate; van Severen, Lieve; Gillis, Steven

    2012-01-01

    Various measures for identifying the onset of babbling have been proposed in the literature, but a formal definition of the exact procedure and a thorough validation of the sample size required for reliably establishing babbling onset is lacking. In this paper the reliability of five commonly used measures is assessed using a large longitudinal…

  4. A double-loop adaptive sampling approach for sensitivity-free dynamic reliability analysis

    International Nuclear Information System (INIS)

    Wang, Zequn; Wang, Pingfeng

    2015-01-01

    Dynamic reliability measures reliability of an engineered system considering time-variant operation condition and component deterioration. Due to high computational costs, conducting dynamic reliability analysis at an early system design stage remains challenging. This paper presents a confidence-based meta-modeling approach, referred to as double-loop adaptive sampling (DLAS), for efficient sensitivity-free dynamic reliability analysis. The DLAS builds a Gaussian process (GP) model sequentially to approximate extreme system responses over time, so that Monte Carlo simulation (MCS) can be employed directly to estimate dynamic reliability. A generic confidence measure is developed to evaluate the accuracy of dynamic reliability estimation while using the MCS approach based on developed GP models. A double-loop adaptive sampling scheme is developed to efficiently update the GP model in a sequential manner, by considering system input variables and time concurrently in two sampling loops. The model updating process using the developed sampling scheme can be terminated once the user defined confidence target is satisfied. The developed DLAS approach eliminates computationally expensive sensitivity analysis process, thus substantially improves the efficiency of dynamic reliability analysis. Three case studies are used to demonstrate the efficacy of DLAS for dynamic reliability analysis. - Highlights: • Developed a novel adaptive sampling approach for dynamic reliability analysis. • POD Developed a new metric to quantify the accuracy of dynamic reliability estimation. • Developed a new sequential sampling scheme to efficiently update surrogate models. • Three case studies were used to demonstrate the efficacy of the new approach. • Case study results showed substantially enhanced efficiency with high accuracy

  5. The Reliability of Anthropometric Measurements Used Preoperatively in Aesthetic Breast Surgery.

    Science.gov (United States)

    Isaac, Kathryn V; Murphy, Blake D; Beber, Brett; Brown, Mitchell

    2016-04-01

    Patient outcomes in aesthetic breast surgery are highly dependent on breast measurements used in preoperative planning. The purpose of this study is to determine the reliability of anthropometric breast measurements. Four raters measured 28 women using 7 measurements: sternal notch to nipple distance (Sn-N), nipple to midline (N-M), nipple to inframammary-fold distance under maximal stretch (N-IMF), breast base width (BW), soft tissue pinch thickness of the upper pole (STPT:UP), STPT at the inframammary fold (STPT:IMF), and anterior pull skin stretch (APSS). Reliability was assessed using intra-class correlation coefficients (ICCs). Inter-rater reliability was excellent for Sn-N, N-M, and BW (ICC = 0.94, 0.90, and 0.76, respectively) and was good for N-IMF (ICC = 0.70). The STPT:UP, STPT:IMF, and APSS measurements were not reliable between raters (ICC reliability was excellent for Sn-N, N-M, and BW for all raters (all ICC > 0.75). The N-IMF intra-rater reliability was excellent in senior raters (ICC > 0.75) and good in junior raters (ICC > 0.6). The STPT:UP, STPT:IMF, and APSS measurements showed fair or poor reliability for most raters (ICC reliable. Dynamic measurements including APSS, STPT:UP, and STUP:IMF are unreliable. N-IMF is the only reliable dynamic measurement, and its reliability improves with increasing clinical experience. The variable reliability of preoperative measurements must be considered in the planning of aesthetic breast surgery. 4 Diagnostic. © 2015 The American Society for Aesthetic Plastic Surgery, Inc. Reprints and permission: journals.permissions@oup.com.

  6. Computationally efficient SVM multi-class image recognition with confidence measures

    International Nuclear Information System (INIS)

    Makili, Lazaro; Vega, Jesus; Dormido-Canto, Sebastian; Pastor, Ignacio; Murari, Andrea

    2011-01-01

    Typically, machine learning methods produce non-qualified estimates, i.e. the accuracy and reliability of the predictions are not provided. Transductive predictors are very recent classifiers able to provide, simultaneously with the prediction, a couple of values (confidence and credibility) to reflect the quality of the prediction. Usually, a drawback of the transductive techniques for huge datasets and large dimensionality is the high computational time. To overcome this issue, a more efficient classifier has been used in a multi-class image classification problem in the TJ-II stellarator database. It is based on the creation of a hash function to generate several 'one versus the rest' classifiers for every class. By using Support Vector Machines as the underlying classifier, a comparison between the pure transductive approach and the new method has been performed. In both cases, the success rates are high and the computation time with the new method is up to 0.4 times the old one.

  7. Reliability and Validity of Computerized Force Platform Measures of Balance Function in Healthy Older Adults.

    Science.gov (United States)

    Harro, Cathy C; Garascia, Chelsea

    2018-01-10

    Postural control declines with aging and is an independent risk factor for falls in older adults. Objective examination of balance function is warranted to direct fall prevention strategies. Force platform (FP) systems provide quantitative measures of postural control and analysis of different aspects of balance. The purpose of this study was to examine the reliability and validity of FP measures in healthy older adults. This study enrolled 46 healthy elderly adults, mean age 67.67 (5.1) years, who had no history of falls. They were assessed on 3 standardized tests on the NeuroCom Equitest FP system: limits of stability (LOS), motor control test (MCT), and sensory organization test (SOT). The test battery was administered twice within a 10-day period for test-retest reliability; intraclass correlation coefficients (ICCs), standard error of measurement (SEM), and minimal detectable change based on a 95% confidence interval (MDC95) were calculated. FP measures were compared with criterion clinical balance (Mini-BESTest and Functional Gait Assessment) and gait (10-m walk and 6-minute walk) measures to examine concurrent validity using Pearson correlation coefficients. Multiple linear regression analysis examined whether age and activity level were associated with FP performance. The α level was set at P point excursion measures all demonstrated excellent test-retest reliability (ICC = 0.90, 0.85, and 0.77, respectively), whereas moderate to good reliability was found for SOT vestibular ratio score (ICC = 0.71). There was large variability in performance in this healthy elderly cohort, resulting in relatively large MDC95 for these measures, especially for the LOS test. Fair correlations were found between LOS end point excursion and clinical balance and gait measures (r = 0.31-0.49), and between MCT average latency and gait measures only (r = -0.32). No correlations were found between SOT measures and clinical balance and gait measures. Age was only marginally

  8. Measuring physical activity in young people with cerebral palsy: validity and reliability of the ActivPAL™ monitor.

    Science.gov (United States)

    Bania, Theofani

    2014-09-01

    We determined the criterion validity and the retest reliability of the ΑctivPAL™ monitor in young people with diplegic cerebral palsy (CP). Activity monitor data were compared with the criterion of video recording for 10 participants. For the retest reliability, activity monitor data were collected from 24 participants on two occasions. Participants had to have diplegic CP and be between 14 and 22 years of age. They also had to be of Gross Motor Function Classification System level II or III. Outcomes were time spent in standing, number of steps (physical activity) and time spent in sitting (sedentary behaviour). For criterion validity, coefficients of determination were all high (r(2)  ≥ 0.96), and limits of group agreement were relatively narrow, but limits of agreement for individuals were narrow only for number of steps (≥5.5%). Relative reliability was high for number of steps (intraclass correlation coefficient = 0.87) and moderate for time spent in sitting and lying, and time spent in standing (intraclass correlation coefficients = 0.60-0.66). For groups, changes of up to 7% could be due to measurement error with 95% confidence, but for individuals, changes as high as 68% could be due to measurement error. The results support the criterion validity and the retest reliability of the ActivPAL™ to measure physical activity and sedentary behaviour in groups of young people with diplegic CP but not in individuals. Copyright © 2014 John Wiley & Sons, Ltd.

  9. Test-retest reliability for aerodynamic measures of voice.

    Science.gov (United States)

    Awan, Shaheen N; Novaleski, Carolyn K; Yingling, Julie R

    2013-11-01

    The purpose of this study was to investigate the intrasubject reliability of aerodynamic characteristics of the voice within typical/normal speakers across testing sessions using the Phonatory Aerodynamic System (PAS 6600; KayPENTAX, Montvale, NJ). Participants were 60 healthy young adults (30 males and 30 females) between the ages 18 and 31 years with perceptually typical voice. Participants were tested using the PAS 6600 (Phonatory Aerodynamic System) on two separate days with approximately 1 week between each session at approximately the same time of day. Four PAS protocols were conducted (vital capacity, maximum sustained phonation, comfortable sustained phonation, and voicing efficiency) and measures of expiratory volume, maximum phonation time, mean expiratory airflow (during vowel production) and target airflow (obtained via syllable repetition), peak air pressure, aerodynamic power, aerodynamic resistance, and aerodynamic efficiency were obtained during each testing session. Associated acoustic measures of vocal intensity and frequency were also collected. All phonations were elicited at comfortable pitch and loudness. All aerodynamic and associated variables evaluated in this study showed useable test-retest reliability (ie, intraclass correlation coefficients [ICCs] ≥ 0.60). A high degree of mean test-retest reliability was found across all subjects for aerodynamic and associated acoustic measurements of vital capacity, maximum sustained phonation, glottal resistance, and vocal intensity (all with ICCs > 0.75). Although strong ICCs were observed for measures of glottal power and mean expiratory airflow in males, weaker overall results for these measures (ICC range: 0.60-0.67) were observed in females subjects and sizable coefficients of variation were observed for measures of power, resistance, and efficiency in both men and women. Differences in degree of reliability from measure to measure were revealed in greater detail using methods such as ICCs and

  10. Psychometric testing on the NLN Student Satisfaction and Self-Confidence in Learning, Simulation Design Scale, and Educational Practices Questionnaire using a sample of pre-licensure novice nurses.

    Science.gov (United States)

    Franklin, Ashley E; Burns, Paulette; Lee, Christopher S

    2014-10-01

    In 2006, the National League for Nursing published three measures related to novice nurses' beliefs about self-confidence, scenario design, and educational practices associated with simulation. Despite the extensive use of these measures, little is known about their reliability and validity. The psychometric properties of the Student Satisfaction and Self-Confidence in Learning Scale, Simulation Design Scale, and Educational Practices Questionnaire were studied among a sample of 2200 surveys completed by novice nurses from a liberal arts university in the southern United States. Psychometric tests included item analysis, confirmatory and exploratory factor analyses in randomly-split subsamples, concordant and discordant validity, and internal consistency. All three measures have sufficient reliability and validity to be used in education research. There is room for improvement in content validity with the Student Satisfaction and Self-Confidence in Learning and Simulation Design Scale. This work provides robust evidence to ensure that judgments made about self-confidence after simulation, simulation design and educational practices are valid and reliable. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Reliability of infrared thermometric measurements of skin temperature in the hand.

    Science.gov (United States)

    Packham, Tara L; Fok, Diana; Frederiksen, Karen; Thabane, Lehana; Buckley, Norman

    2012-01-01

    Clinical measurement study. Skin temperature asymmetries (STAs) are used in the diagnosis of complex regional pain syndrome (CRPS), but little evidence exists for reliability of the equipment and methods. This study examined the reliability of an inexpensive infrared (IR) thermometer and measurement points in the hand for the study of STA. ST was measured three times at five points on both hands with an IR thermometer by two raters in 20 volunteers (12 normals and 8 CRPS). ST measurement results using IR thermometers support inter-rater reliability: intraclass correlation coefficient (ICC) estimate for single measures 0.80; all ST measurement points were also highly reliable (ICC single measures, 0.83-0.91). The equipment demonstrated excellent reliability, with little difference in the reliability of the five measurement sites. These preliminary findings support their use in future CRPS research. Not applicable. Copyright © 2012 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

  12. TWO CRITERIA FOR GOOD MEASUREMENTS IN RESEARCH: VALIDITY AND RELIABILITY

    Directory of Open Access Journals (Sweden)

    Haradhan Kumar Mohajan

    2017-12-01

    Full Text Available Reliability and validity are two most important and fundamental features in the evaluation of any measurement instrument or toll for a good research. The purpose of this research is to discuss the validity and reliability of measurement instruments that are used in research. Validity concerns what an instrument measures, and how well it does so. Reliability concerns the faith that one can have in the data obtained from use of an instrument, that is, the degree to which any measuring tool controls for random error. An attempt has been taken here to review the reliability and validity, and threat to them in some details.

  13. A computer program (COSTUM) to calculate confidence intervals for in situ stress measurements. V. 1

    International Nuclear Information System (INIS)

    Dzik, E.J.; Walker, J.R.; Martin, C.D.

    1989-03-01

    The state of in situ stress is one of the parameters required both for the design and analysis of underground excavations and for the evaluation of numerical models used to simulate underground conditions. To account for the variability and uncertainty of in situ stress measurements, it is desirable to apply confidence limits to measured stresses. Several measurements of the state of stress along a borehole are often made to estimate the average state of stress at a point. Since stress is a tensor, calculating the mean stress and confidence limits using scalar techniques is inappropriate as well as incorrect. A computer program has been written to calculate and present the mean principle stresses and the confidence limits for the magnitudes and directions of the mean principle stresses. This report describes the computer program, COSTUM

  14. Reliability of the Brazilian Portuguese version of the Gross Motor Function Measure in children with cerebral palsy

    Science.gov (United States)

    Almeida, Kênnea M.; Albuquerque, Karolina A.; Ferreira, Marina L.; Aguiar, Stéphany K. B.; Mancini, Marisa C.

    2016-01-01

    OBJECTIVE: To test the intra- and interrater reliability of the Brazilian Portuguese version of the 66-item Gross Motor Function Measure (GMFM-66). METHOD: The sample included 48 children with cerebral palsy (CP), ranging from 2-17 years old, classified at levels I to IV of the Gross Motor Function Classification System (GMFCS) and four child rehabilitation examiners. A main examiner evaluated all children using the GMFM-66 and video-recorded the assessments. The other examiners watched the video recordings and scored them independently for the assessment of interrater reliability. For the intrarater reliability evaluation, the main examiner watched the video recordings one month after the evaluation and re-scored each child. We calculated reliability by using intraclass correlation coefficients (ICC) with their respective 95% confidence intervals. RESULTS: Excellent test reliability was documented. The intrarater reliability of the total sample was ICC=0.99 (95% CI 0.98-0.99), and the interrater reliability was ICC=0.97 (95% CI 0.95-0.98). The reliability across GMFCS levels ranged from ICC=0.92 (95% CI 0.72-0.98) to ICC=0.99 (95% CI 0.99-0.99); the lowest value was the interrater reliability for the GMFCS IV group. Reliability in the five GMFM dimensions varied from ICC=0.95 (95% CI 0.93-0.97) to ICC=0.99 (95% CI 0.99-0.99). CONCLUSION: The Brazilian Portuguese version of the GMFM-66 showed excellent intra- and interrater reliability when used in Brazilian children with CP levels GMFCS I to IV. PMID:26786081

  15. The reliability, validity, and feasibility of physical activity measurement in adults with traumatic brain injury: an observational study.

    Science.gov (United States)

    Hassett, Leanne; Moseley, Anne; Harmer, Alison; van der Ploeg, Hidde P

    2015-01-01

    To determine the reliability and validity of the Physical Activity Scale for Individuals with a Physical Disability (PASIPD) in adults with severe traumatic brain injury (TBI) and estimate the proportion of the sample participants who fail to meet the World Health Organization guidelines for physical activity. A single-center observational study recruited a convenience sample of 30 community-based ambulant adults with severe TBI. Participants completed the PASIPD on 2 occasions, 1 week apart, and wore an accelerometer (ActiGraph GT3X; ActiGraph LLC, Pensacola, Florida) for the 7 days between these 2 assessments. The PASIPD test-retest reliability was substantial (intraclass correlation coefficient = 0.85; 95% confidence interval, 0.70-0.92), and the correlation with the accelerometer ranged from too low to be meaningful (R = 0.09) to moderate (R = 0.57). From device-based measurement of physical activity, 56% of participants failed to meet the World Health Organization physical activity guidelines. The PASIPD is a reliable measure of the type of physical activity people with severe TBI participate in, but it is not a valid measure of the amount of moderate to vigorous physical activity in which they engage. Accelerometers should be used to quantify moderate to vigorous physical activity in people with TBI.

  16. Developing safety performance functions incorporating reliability-based risk measures.

    Science.gov (United States)

    Ibrahim, Shewkar El-Bassiouni; Sayed, Tarek

    2011-11-01

    Current geometric design guides provide deterministic standards where the safety margin of the design output is generally unknown and there is little knowledge of the safety implications of deviating from these standards. Several studies have advocated probabilistic geometric design where reliability analysis can be used to account for the uncertainty in the design parameters and to provide a risk measure of the implication of deviation from design standards. However, there is currently no link between measures of design reliability and the quantification of safety using collision frequency. The analysis presented in this paper attempts to bridge this gap by incorporating a reliability-based quantitative risk measure such as the probability of non-compliance (P(nc)) in safety performance functions (SPFs). Establishing this link will allow admitting reliability-based design into traditional benefit-cost analysis and should lead to a wider application of the reliability technique in road design. The present application is concerned with the design of horizontal curves, where the limit state function is defined in terms of the available (supply) and stopping (demand) sight distances. A comprehensive collision and geometric design database of two-lane rural highways is used to investigate the effect of the probability of non-compliance on safety. The reliability analysis was carried out using the First Order Reliability Method (FORM). Two Negative Binomial (NB) SPFs were developed to compare models with and without the reliability-based risk measures. It was found that models incorporating the P(nc) provided a better fit to the data set than the traditional (without risk) NB SPFs for total, injury and fatality (I+F) and property damage only (PDO) collisions. Copyright © 2011 Elsevier Ltd. All rights reserved.

  17. Surveying the impact of satisfaction and e-reliability on customers' loyalty in e-purchase process: a case in Pars Khodro co

    Directory of Open Access Journals (Sweden)

    Vahid Qaemi

    2012-10-01

    Full Text Available Today, customer return issue in e-purchase process is considered as important topic in companies' marketing and managerial decision making. In this paper, we present an empirical study on measuring the impact of e-loyalty for an Iranian auto-industry called Pars Khodro co. The proposed study measures reliability, responsiveness, design, security/privacy as independent variables, e-confidence and e-satisfaction as mediator variable, and e-loyalty as dependent variable. The preliminary results show that effectiveness of e-satisfaction and e-confidence on loyalty and effectiveness of e-confidence on e-satisfaction are in high level. Reliability/Fulfillment and security variables on e-confidence have significant impacts, and effectiveness level of reliability/Fulfillment and responsiveness and website design on e-satisfaction is high. The results indicate that there is no significant relationship between responsiveness and e-confidence.

  18. Regional inversion of CO2 ecosystem fluxes from atmospheric measurements. Reliability of the uncertainty estimates

    Energy Technology Data Exchange (ETDEWEB)

    Broquet, G.; Chevallier, F.; Breon, F.M.; Yver, C.; Ciais, P.; Ramonet, M.; Schmidt, M. [Laboratoire des Sciences du Climat et de l' Environnement, CEA-CNRS-UVSQ, UMR8212, IPSL, Gif-sur-Yvette (France); Alemanno, M. [Servizio Meteorologico dell' Aeronautica Militare Italiana, Centro Aeronautica Militare di Montagna, Monte Cimone/Sestola (Italy); Apadula, F. [Research on Energy Systems, RSE, Environment and Sustainable Development Department, Milano (Italy); Hammer, S. [Universitaet Heidelberg, Institut fuer Umweltphysik, Heidelberg (Germany); Haszpra, L. [Hungarian Meteorological Service, Budapest (Hungary); Meinhardt, F. [Federal Environmental Agency, Kirchzarten (Germany); Necki, J. [AGH University of Science and Technology, Krakow (Poland); Piacentino, S. [ENEA, Laboratory for Earth Observations and Analyses, Palermo (Italy); Thompson, R.L. [Max Planck Institute for Biogeochemistry, Jena (Germany); Vermeulen, A.T. [Energy research Centre of the Netherlands ECN, EEE-EA, Petten (Netherlands)

    2013-07-01

    The Bayesian framework of CO2 flux inversions permits estimates of the retrieved flux uncertainties. Here, the reliability of these theoretical estimates is studied through a comparison against the misfits between the inverted fluxes and independent measurements of the CO2 Net Ecosystem Exchange (NEE) made by the eddy covariance technique at local (few hectares) scale. Regional inversions at 0.5{sup 0} resolution are applied for the western European domain where {approx}50 eddy covariance sites are operated. These inversions are conducted for the period 2002-2007. They use a mesoscale atmospheric transport model, a prior estimate of the NEE from a terrestrial ecosystem model and rely on the variational assimilation of in situ continuous measurements of CO2 atmospheric mole fractions. Averaged over monthly periods and over the whole domain, the misfits are in good agreement with the theoretical uncertainties for prior and inverted NEE, and pass the chi-square test for the variance at the 30% and 5% significance levels respectively, despite the scale mismatch and the independence between the prior (respectively inverted) NEE and the flux measurements. The theoretical uncertainty reduction for the monthly NEE at the measurement sites is 53% while the inversion decreases the standard deviation of the misfits by 38 %. These results build confidence in the NEE estimates at the European/monthly scales and in their theoretical uncertainty from the regional inverse modelling system. However, the uncertainties at the monthly (respectively annual) scale remain larger than the amplitude of the inter-annual variability of monthly (respectively annual) fluxes, so that this study does not engender confidence in the inter-annual variations. The uncertainties at the monthly scale are significantly smaller than the seasonal variations. The seasonal cycle of the inverted fluxes is thus reliable. In particular, the CO2 sink period over the European continent likely ends later than

  19. Does interaction matter? Testing whether a confidence heuristic can replace interaction in collective decision-making.

    Science.gov (United States)

    Bang, Dan; Fusaroli, Riccardo; Tylén, Kristian; Olsen, Karsten; Latham, Peter E; Lau, Jennifer Y F; Roepstorff, Andreas; Rees, Geraint; Frith, Chris D; Bahrami, Bahador

    2014-05-01

    In a range of contexts, individuals arrive at collective decisions by sharing confidence in their judgements. This tendency to evaluate the reliability of information by the confidence with which it is expressed has been termed the 'confidence heuristic'. We tested two ways of implementing the confidence heuristic in the context of a collective perceptual decision-making task: either directly, by opting for the judgement made with higher confidence, or indirectly, by opting for the faster judgement, exploiting an inverse correlation between confidence and reaction time. We found that the success of these heuristics depends on how similar individuals are in terms of the reliability of their judgements and, more importantly, that for dissimilar individuals such heuristics are dramatically inferior to interaction. Interaction allows individuals to alleviate, but not fully resolve, differences in the reliability of their judgements. We discuss the implications of these findings for models of confidence and collective decision-making. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  20. A general model of confidence building: analysis and implications

    International Nuclear Information System (INIS)

    Kilgour, D.M.

    1998-01-01

    For more than two decades, security approaches in Europe have included confidence building. Many have argued that Confidence-Building Measures (CBMS) played an essential role in the enormous transformations that took place there. Thus, it is hardly,surprising that CBMs have been proposed as measures to reduce tensions and transform security relationships elsewhere in the world. The move toward wider application of CBMs has strengthened recently, as conventional military, diplomatic, and humanitarian approaches seem to have failed to address problems associated with peace-building and peace support operations. There is, however, a serious problem. We don't really know why, or even how, CBMs work. Consequently, we have no reliable way to design CBMs that would be appropriate in substance, form, and timing for regions culturally, geographically, and militarily different from Europe. Lacking a solid understanding of confidence building, we are handicapped in our efforts to extend its successes to the domain of peace building and peace support. To paraphrase Macintosh, if we don't know how CBMs succeeded in the past, then we are unlikely to be good at maintaining, improving, or extending them. The specific aim of this project is to step into this gap, using the methods of game theory to clarify some aspects of the underlying logic of confidence building. Formal decision models will be shown to contribute new and valuable insights that will assist in the design of CBMs to contribute to new problems and in new arenas. (author)

  1. Use of a tibial accelerometer to measure ground reaction force in running: A reliability and validity comparison with force plates.

    Science.gov (United States)

    Raper, Damian P; Witchalls, Jeremy; Philips, Elissa J; Knight, Emma; Drew, Michael K; Waddington, Gordon

    2018-01-01

    The use of microsensor technologies to conduct research and implement interventions in sports and exercise medicine has increased recently. The objective of this paper was to determine the validity and reliability of the ViPerform as a measure of load compared to vertical ground reaction force (GRF) as measured by force plates. Absolute reliability assessment, with concurrent validity. 10 professional triathletes ran 10 trials over force plates with the ViPerform mounted on the mid portion of the medial tibia. Calculated vertical ground reaction force data from the ViPerform was matched to the same stride on the force plate. Bland-Altman (BA) plot of comparative measure of agreement was used to assess the relationship between the calculated load from the accelerometer and the force plates. Reliability was calculated by intra-class correlation coefficients (ICC) with 95% confidence intervals. BA plot indicates minimal agreement between the measures derived from the force plate and ViPerform, with variation at an individual participant plot level. Reliability was excellent (ICC=0.877; 95% CI=0.825-0.917) in calculating the same vertical GRF in a repeated trial. Standard error of measure (SEM) equalled 99.83 units (95% CI=82.10-119.09), which, in turn, gave a minimum detectable change (MDC) value of 276.72 units (95% CI=227.32-330.07). The ViPerform does not calculate absolute values of vertical GRF similar to those measured by a force plate. It does provide a valid and reliable calculation of an athlete's lower limb load at constant velocity. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  2. Measuring Passenger Travel Time Reliability using Smartcard Data

    NARCIS (Netherlands)

    Bagherian, M.; Cats, O.; van Oort, N.; Hickman, M

    2016-01-01

    Service reliability is a key performance measure for transit agencies in increasing their service quality and thus ridership. Conventional reliability metrics are established based on vehicle movements and thus do not adequately reflect passenger’s experience. In the past few years, the growing

  3. Reliability importance measures and their calculation

    International Nuclear Information System (INIS)

    Andsten, R.; Vaurio, J.K.

    1989-01-01

    The importance of a component to the system reliability or availability and to the system failure rate can be measured by a number of importance measures. Such measures can be used to guide the system design improvement actions as well as the diagnostic and repair actions. This report develops relationships between several importance measures, illustrates their meaning with interpretations and applications, and describes the computer program called IMPO that calculates importance measures when the system minimum cat sets and component parameters are given. A user's manual is included with illustrative examples

  4. Confidence Measurement in the Light of Signal Detection Theory

    Directory of Open Access Journals (Sweden)

    Sébastien eMassoni

    2014-12-01

    Full Text Available We compare three alternative methods for eliciting retrospective confidence in the context of a simple perceptual task: the Simple Confidence Rating (a direct report on a numerical scale, the Quadratic Scoring Rule (a post-wagering procedure and the Matching Probability (a generalization of the no-loss gambling method. We systematically compare the results obtained with these three rules to the theoretical confidence levels that can be inferred from performance in the perceptual task using Signal Detection Theory. We find that the Matching Probability provides better results in that respect. We conclude that Matching Probability is particularly well suited for studies of confidence that use Signal Detection Theory as a theoretical framework.

  5. Variance misperception explains illusions of confidence in simple perceptual decisions

    NARCIS (Netherlands)

    Zylberberg, Ariel; Roelfsema, Pieter R.; Sigman, Mariano

    2014-01-01

    Confidence in a perceptual decision is a judgment about the quality of the sensory evidence. The quality of the evidence depends not only on its strength ('signal') but critically on its reliability ('noise'), but the separate contribution of these quantities to the formation of confidence judgments

  6. Intrarater and interrater reliability for measurements in videofluoroscopy of swallowing

    International Nuclear Information System (INIS)

    Baijens, Laura; Barikroo, Ali; Pilz, Walmari

    2013-01-01

    Objective: Intrarater and interrater reliability is crucial to the quality of diagnostic and therapy-effect studies. This paper reports on a systematic review of studies on intrarater and interrater reliability for measurements in videofluoroscopy of swallowing. The aim of this review was to summarize and qualitatively analyze published studies on that topic. Materials and methods: Those published up to March 2013 were found through a comprehensive electronic database search using PubMed, Embase, and The Cochrane Library. Two reviewers independently assessed the studies using strict inclusion criteria. Results: Nineteen studies were included and then qualitatively analyzed. In several of these, methodological problems were found. Moreover, intrarater and interrater reliability varied with the measure applied. A meta-analysis was not carried out as studies were not of sufficient quality to warrant doing so. Conclusion: In order to achieve reliable measurements in videofluoroscopy of swallowing, it is recommended that raters use well-defined guidelines for the levels of ordinal visuoperceptual variables. Furthermore, in order to make the measurements reliable (intrarater and interrater) it is recommended that, following protocolled pre-experimental training, the raters should have maximum consensus about the definition of the measured variables

  7. Intrarater and interrater reliability for measurements in videofluoroscopy of swallowing

    Energy Technology Data Exchange (ETDEWEB)

    Baijens, Laura, E-mail: laura.baijens@mumc.nl [Department of Otorhinolaryngology, Head and Neck Surgery, Maastricht University Medical Center, Maastricht (Netherlands); Barikroo, Ali, E-mail: a.Barikroo@ufl.edu [Swallowing Research Laboratory, Department of Speech, Language and Hearing Sciences, College of Public Health and Health Professions, University of Florida, Gainesville, FL (United States); Pilz, Walmari, E-mail: walmari.pilz@mumc.nl [Department of Otorhinolaryngology, Head and Neck Surgery, Maastricht University Medical Center, Maastricht (Netherlands)

    2013-10-01

    Objective: Intrarater and interrater reliability is crucial to the quality of diagnostic and therapy-effect studies. This paper reports on a systematic review of studies on intrarater and interrater reliability for measurements in videofluoroscopy of swallowing. The aim of this review was to summarize and qualitatively analyze published studies on that topic. Materials and methods: Those published up to March 2013 were found through a comprehensive electronic database search using PubMed, Embase, and The Cochrane Library. Two reviewers independently assessed the studies using strict inclusion criteria. Results: Nineteen studies were included and then qualitatively analyzed. In several of these, methodological problems were found. Moreover, intrarater and interrater reliability varied with the measure applied. A meta-analysis was not carried out as studies were not of sufficient quality to warrant doing so. Conclusion: In order to achieve reliable measurements in videofluoroscopy of swallowing, it is recommended that raters use well-defined guidelines for the levels of ordinal visuoperceptual variables. Furthermore, in order to make the measurements reliable (intrarater and interrater) it is recommended that, following protocolled pre-experimental training, the raters should have maximum consensus about the definition of the measured variables.

  8. Reliability of radiographic measurement of lateral capitellohumeral angle in healthy children.

    Science.gov (United States)

    Hasegawa, Masaki; Suzuki, Taku; Kuroiwa, Takashi; Oka, Yusuke; Maeda, Atsushi; Takeda, Hiroki; Shizu, Kanae; Tsuji, Takashi; Suzuki, Katsuji; Yamada, Harumoto

    2018-04-01

    This retrospective cohort study was designed to validate the reliability of measurement of the lateral capitellohumeral angle (LCHA), an index of sagittal angulation of the elbow, in healthy children. The results were compared to the Baumann angle (BA), which is a similar concept to LCHA.Sixty-two radiographs of the elbow in healthy children (range, 2-11 years) were reviewed by 6 examiners at 2 sessions. The mean value and reliability of the measurement of LCHA and BA were assessed. Intraobserver reliability and interobserver reliability were calculated using intraclass correlation coefficients (ICCs).The mean LCHA value was 45° (range, 22° to 70°) and the mean BA was 71° (range, 56° to 86°). The ICCs for intraobserver reliability of the LCHA measurements were almost perfect for 2 examiners, substantial for 3 examiners, and moderate for 1 examiner with a mean value of 0.77 (range, 0.57-0.95). For BA measurements, the ICCs were almost perfect for 1 examiner and substantial for 5 examiners with a mean value of 0.74 (range, 0.66-0.83). The ICCs for interobserver reliability between the first and second measurements were both moderate for LCHA (0.56 and 0.51) and for BA (0.52 and 0.50).LCHA showed almost the same reliability in measurement as BA, which is the gold standard assessment for coronal alignment of the elbow. LCHA showed moderate-to-good reliability in the evaluation of sagittal plane elbow alignment.

  9. Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity

    Directory of Open Access Journals (Sweden)

    Adebisi Bisi-Balogun

    2016-04-01

    Full Text Available This study aimed to determine the relative and absolute reliability of ultrasound (US measurements of the thickness and echogenicity of the plantar fascia (PF at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.

  10. Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity.

    Science.gov (United States)

    Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

    2016-04-13

    This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.

  11. Bootstrap resampling: a powerful method of assessing confidence intervals for doses from experimental data

    International Nuclear Information System (INIS)

    Iwi, G.; Millard, R.K.; Palmer, A.M.; Preece, A.W.; Saunders, M.

    1999-01-01

    Bootstrap resampling provides a versatile and reliable statistical method for estimating the accuracy of quantities which are calculated from experimental data. It is an empirically based method, in which large numbers of simulated datasets are generated by computer from existing measurements, so that approximate confidence intervals of the derived quantities may be obtained by direct numerical evaluation. A simple introduction to the method is given via a detailed example of estimating 95% confidence intervals for cumulated activity in the thyroid following injection of 99m Tc-sodium pertechnetate using activity-time data from 23 subjects. The application of the approach to estimating confidence limits for the self-dose to the kidney following injection of 99m Tc-DTPA organ imaging agent based on uptake data from 19 subjects is also illustrated. Results are then given for estimates of doses to the foetus following administration of 99m Tc-sodium pertechnetate for clinical reasons during pregnancy, averaged over 25 subjects. The bootstrap method is well suited for applications in radiation dosimetry including uncertainty, reliability and sensitivity analysis of dose coefficients in biokinetic models, but it can also be applied in a wide range of other biomedical situations. (author)

  12. Intra- and interobserver reliability of quantitative ultrasound measurement of the plantar fascia.

    Science.gov (United States)

    Rathleff, Michael Skovdal; Moelgaard, Carsten; Lykkegaard Olesen, Jens

    2011-01-01

    To determine intra- and interobserver reliability and measurement precision of sonographic assessment of plantar fascia thickness when using one, the mean of two, or the mean of three measurements. Two experienced observers scanned 20 healthy subjects twice with 60 minutes between test and retest. A GE LOGIQe ultrasound scanner was used in the study. The built-in software in the scanner was used to measure the thickness of the plantar fascia (PF). Reliability was calculated using intraclass correlation coefficient (ICC) and limits of agreement (LOA). Intraobserver reliability (ICC) using one measurement was 0.50 for one observer and 0.52 for the other, and using the mean of three measurements intraobserver reliability increased up to 0.77 and 0.67, respectively. Interobserver reliability (ICC) when using one measurement was 0.62 and increased to 0.82 when using the average of three measurements. LOA showed that when using the average of three measurements, LOA decreased to 0.6 mm, corresponding to 17.5% of the mean thickness of the PF. The results showed that reliability increases when using the mean of three measurements compared with one. Limits of agreement based on intratester reliability shows that changes in thickness that are larger than 0.6 mm can be considered actual changes in thickness and not a result of measurement error. Copyright © 2011 Wiley Periodicals, Inc.

  13. Reliability of Wearable Inertial Measurement Units to Measure Physical Activity in Team Handball.

    Science.gov (United States)

    Luteberget, Live S; Holme, Benjamin R; Spencer, Matt

    2018-04-01

    To assess the reliability and sensitivity of commercially available inertial measurement units to measure physical activity in team handball. Twenty-two handball players were instrumented with 2 inertial measurement units (OptimEye S5; Catapult Sports, Melbourne, Australia) taped together. They participated in either a laboratory assessment (n = 10) consisting of 7 team handball-specific tasks or field assessment (n = 12) conducted in 12 training sessions. Variables, including PlayerLoad™ and inertial movement analysis (IMA) magnitude and counts, were extracted from the manufacturers' software. IMA counts were divided into intensity bands of low (1.5-2.5 m·s -1 ), medium (2.5-3.5 m·s -1 ), high (>3.5 m·s -1 ), medium/high (>2.5 m·s -1 ), and total (>1.5 m·s -1 ). Reliability between devices and sensitivity was established using coefficient of variation (CV) and smallest worthwhile difference (SWD). Laboratory assessment: IMA magnitude showed a good reliability (CV = 3.1%) in well-controlled tasks. CV increased (4.4-6.7%) in more-complex tasks. Field assessment: Total IMA counts (CV = 1.8% and SWD = 2.5%), PlayerLoad (CV = 0.9% and SWD = 2.1%), and their associated variables (CV = 0.4-1.7%) showed a good reliability, well below the SWD. However, the CV of IMA increased when categorized into intensity bands (2.9-5.6%). The reliability of IMA counts was good when data were displayed as total, high, or medium/high counts. A good reliability for PlayerLoad and associated variables was evident. The CV of the previously mentioned variables was well below the SWD, suggesting that OptimEye's inertial measurement unit and its software are sensitive for use in team handball.

  14. Precision of lumbar intervertebral measurements: does a computer-assisted technique improve reliability?

    Science.gov (United States)

    Pearson, Adam M; Spratt, Kevin F; Genuario, James; McGough, William; Kosman, Katherine; Lurie, Jon; Sengupta, Dilip K

    2011-04-01

    Comparison of intra- and interobserver reliability of digitized manual and computer-assisted intervertebral motion measurements and classification of "instability." To determine if computer-assisted measurement of lumbar intervertebral motion on flexion-extension radiographs improves reliability compared with digitized manual measurements. Many studies have questioned the reliability of manual intervertebral measurements, although few have compared the reliability of computer-assisted and manual measurements on lumbar flexion-extension radiographs. Intervertebral rotation, anterior-posterior (AP) translation, and change in anterior and posterior disc height were measured with a digitized manual technique by three physicians and by three other observers using computer-assisted quantitative motion analysis (QMA) software. Each observer measured 30 sets of digital flexion-extension radiographs (L1-S1) twice. Shrout-Fleiss intraclass correlation coefficients for intra- and interobserver reliabilities were computed. The stability of each level was also classified (instability defined as >4 mm AP translation or 10° rotation), and the intra- and interobserver reliabilities of the two methods were compared using adjusted percent agreement (APA). Intraobserver reliability intraclass correlation coefficients were substantially higher for the QMA technique THAN the digitized manual technique across all measurements: rotation 0.997 versus 0.870, AP translation 0.959 versus 0.557, change in anterior disc height 0.962 versus 0.770, and change in posterior disc height 0.951 versus 0.283. The same pattern was observed for interobserver reliability (rotation 0.962 vs. 0.693, AP translation 0.862 vs. 0.151, change in anterior disc height 0.862 vs. 0.373, and change in posterior disc height 0.730 vs. 0.300). The QMA technique was also more reliable for the classification of "instability." Intraobserver APAs ranged from 87 to 97% for QMA versus 60% to 73% for digitized manual

  15. Determination and Interpretation of Characteristic Limits for Radioactivity Measurements: Decision Threshhold, Detection Limit and Limits of the Confidence Interval

    International Nuclear Information System (INIS)

    2017-01-01

    Since 2004, the environment programme of the IAEA has included activities aimed at developing a set of procedures for analytical measurements of radionuclides in food and the environment. Reliable, comparable and fit for purpose results are essential for any analytical measurement. Guidelines and national and international standards for laboratory practices to fulfil quality assurance requirements are extremely important when performing such measurements. The guidelines and standards should be comprehensive, clearly formulated and readily available to both the analyst and the customer. ISO 11929:2010 is the international standard on the determination of the characteristic limits (decision threshold, detection limit and limits of the confidence interval) for measuring ionizing radiation. For nuclear analytical laboratories involved in the measurement of radioactivity in food and the environment, robust determination of the characteristic limits of radioanalytical techniques is essential with regard to national and international regulations on permitted levels of radioactivity. However, characteristic limits defined in ISO 11929:2010 are complex, and the correct application of the standard in laboratories requires a full understanding of various concepts. This publication provides additional information to Member States in the understanding of the terminology, definitions and concepts in ISO 11929:2010, thus facilitating its implementation in Member State laboratories.

  16. Validation of an instrument to assess evidence-based practice knowledge, attitudes, access, and confidence in the dental environment.

    Science.gov (United States)

    Hendricson, William D; Rugh, John D; Hatch, John P; Stark, Debra L; Deahl, Thomas; Wallmann, Elizabeth R

    2011-02-01

    This article reports the validation of an assessment instrument designed to measure the outcomes of training in evidence-based practice (EBP) in the context of dentistry. Four EBP dimensions are measured by this instrument: 1) understanding of EBP concepts, 2) attitudes about EBP, 3) evidence-accessing methods, and 4) confidence in critical appraisal. The instrument-the Knowledge, Attitudes, Access, and Confidence Evaluation (KACE)-has four scales, with a total of thirty-five items: EBP knowledge (ten items), EBP attitudes (ten), accessing evidence (nine), and confidence (six). Four elements of validity were assessed: consistency of items within the KACE scales (extent to which items within a scale measure the same dimension), discrimination (capacity to detect differences between individuals with different training or experience), responsiveness (capacity to detect the effects of education on trainees), and test-retest reliability. Internal consistency of scales was assessed by analyzing responses of second-year dental students, dental residents, and dental faculty members using Cronbach coefficient alpha, a statistical measure of reliability. Discriminative validity was assessed by comparing KACE scores for the three groups. Responsiveness was assessed by comparing pre- and post-training responses for dental students and residents. To measure test-retest reliability, the full KACE was completed twice by a class of freshman dental students seventeen days apart, and the knowledge scale was completed twice by sixteen faculty members fourteen days apart. Item-to-scale consistency ranged from 0.21 to 0.78 for knowledge, 0.57 to 0.83 for attitude, 0.70 to 0.84 for accessing evidence, and 0.87 to 0.94 for confidence. For discrimination, ANOVA and post hoc testing by the Tukey-Kramer method revealed significant score differences among students, residents, and faculty members consistent with education and experience levels. For responsiveness to training, dental students

  17. Reliability of reflectance measures in passive filters

    Science.gov (United States)

    Saldiva de André, Carmen Diva; Afonso de André, Paulo; Rocha, Francisco Marcelo; Saldiva, Paulo Hilário Nascimento; Carvalho de Oliveira, Regiani; Singer, Julio M.

    2014-08-01

    Measurements of optical reflectance in passive filters impregnated with a reactive chemical solution may be transformed to ozone concentrations via a calibration curve and constitute a low cost alternative for environmental monitoring, mainly to estimate human exposure. Given the possibility of errors caused by exposure bias, it is common to consider sets of m filters exposed during a certain period to estimate the latent reflectance on n different sample occasions at a certain location. Mixed models with sample occasions as random effects are useful to analyze data obtained under such setups. The intra-class correlation coefficient of the mean of the m measurements is an indicator of the reliability of the latent reflectance estimates. Our objective is to determine m in order to obtain a pre-specified reliability of the estimates, taking possible outliers into account. To illustrate the procedure, we consider an experiment conducted at the Laboratory of Experimental Air Pollution, University of São Paulo, Brazil (LPAE/FMUSP), where sets of m = 3 filters were exposed during 7 days on n = 9 different occasions at a certain location. The results show that the reliability of the latent reflectance estimates for each occasion obtained under homoskedasticity is km = 0.74. A residual analysis suggests that the within-occasion variance for two of the occasions should be different from the others. A refined model with two within-occasion variance components was considered, yielding km = 0.56 for these occasions and km = 0.87 for the remaining ones. To guarantee that all estimates have a reliability of at least 80% we require measurements on m = 10 filters on each occasion.

  18. Quantitative measurement of hypertrophic scar: interrater reliability and concurrent validity.

    Science.gov (United States)

    Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo

    2008-01-01

    Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit ( congruent with0.50). The individual subscales of the mVSS fell well below the acceptable level (0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars shows high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.

  19. Pocket Handbook on Reliability

    Science.gov (United States)

    1975-09-01

    exponencial distributions Weibull distribution, -xtimating reliability, confidence intervals, relia- bility growth, 0. P- curves, Bayesian analysis. 20 A S...introduction for those not familiar with reliability and a good refresher for those who are currently working in the area. LEWIS NERI, CHIEF...includes one or both of the following objectives: a) prediction of the current system reliability, b) projection on the system reliability for someI future

  20. Measuring Passenger Travel Time Reliability Using Smart Card Data

    NARCIS (Netherlands)

    Bagherian, M.; Cats, O.; van Oort, N.; Hickman, M

    2016-01-01

    Service reliability is a key performance measure for transit agencies in increasing their service quality and thus ridership. Conventional reliability metrics are established based on vehicle movements and thus do not adequately reflect passenger’s experience. In the past few years, the growing

  1. Measuring time and risk preferences: Reliability, stability, domain specificity

    NARCIS (Netherlands)

    Wölbert, E.M.; Riedl, A.M.

    2013-01-01

    To accurately predict behavior economists need reliable measures of individual time preferences and attitudes toward risk and typically need to assume stability of these characteristics over time and across decision domains. We test the reliability of two choice tasks for eliciting discount rates,

  2. Self-confidence and metacognitive processes

    Directory of Open Access Journals (Sweden)

    Kleitman Sabina

    2005-01-01

    Full Text Available This paper examines the status of Self-confidence trait. Two studies strongly suggest that Self-confidence is a component of metacognition. In the first study, participants (N=132 were administered measures of Self-concept, a newly devised Memory and Reasoning Competence Inventory (MARCI, and a Verbal Reasoning Test (VRT. The results indicate a significant relationship between confidence ratings on the VRT and the Reasoning component of MARCI. The second study (N=296 employed an extensive battery of cognitive tests and several metacognitive measures. Results indicate the presence of robust Self-confidence and Metacognitive Awareness factors, and a significant correlation between them. Self-confidence taps not only processes linked to performance on items that have correct answers, but also beliefs about events that may never occur.

  3. Automated reliability assessment for spectroscopic redshift measurements

    Science.gov (United States)

    Jamal, S.; Le Brun, V.; Le Fèvre, O.; Vibert, D.; Schmitt, A.; Surace, C.; Copin, Y.; Garilli, B.; Moresco, M.; Pozzetti, L.

    2018-03-01

    Context. Future large-scale surveys, such as the ESA Euclid mission, will produce a large set of galaxy redshifts (≥106) that will require fully automated data-processing pipelines to analyze the data, extract crucial information and ensure that all requirements are met. A fundamental element in these pipelines is to associate to each galaxy redshift measurement a quality, or reliability, estimate. Aim. In this work, we introduce a new approach to automate the spectroscopic redshift reliability assessment based on machine learning (ML) and characteristics of the redshift probability density function. Methods: We propose to rephrase the spectroscopic redshift estimation into a Bayesian framework, in order to incorporate all sources of information and uncertainties related to the redshift estimation process and produce a redshift posterior probability density function (PDF). To automate the assessment of a reliability flag, we exploit key features in the redshift posterior PDF and machine learning algorithms. Results: As a working example, public data from the VIMOS VLT Deep Survey is exploited to present and test this new methodology. We first tried to reproduce the existing reliability flags using supervised classification in order to describe different types of redshift PDFs, but due to the subjective definition of these flags (classification accuracy 58%), we soon opted for a new homogeneous partitioning of the data into distinct clusters via unsupervised classification. After assessing the accuracy of the new clusters via resubstitution and test predictions (classification accuracy 98%), we projected unlabeled data from preliminary mock simulations for the Euclid space mission into this mapping to predict their redshift reliability labels. Conclusions: Through the development of a methodology in which a system can build its own experience to assess the quality of a parameter, we are able to set a preliminary basis of an automated reliability assessment for

  4. Test-retest reliability of spatial and temporal gait parameters in children with cerebral palsy as measured by an electronic walkway.

    Science.gov (United States)

    Sorsdahl, Anne Brit; Moe-Nilssen, Rolf; Strand, Liv Inger

    2008-01-01

    The purpose of this study was to examine test-retest reliability of seven selected temporal and spatial gait parameters and asymmetry measures in children with cerebral palsy. Seventeen children with CP between 3 and 13 years of age walked at three different speeds across an electronic walkway of 5.2m. The tests were repeated after approximately 25 min. The scores were normalized to a walking speed of 1.1m/s to avoid the confounding effect of gait speed on speed dependent gait parameters. Intraclass correlation coefficients (ICC(1,1) and ICC(3,1)) with 95% confidence intervals, within-subject standard deviation (S(w)) and smallest detectable difference (SDD) were calculated. The relative reliability of cadence, step length, stride length and single stance time was high to excellent (ICC(1,1) between 0.73 and 0.95), while it was poor for step width (ICC(1,1)=0.27 and 0.35). The relative reliability for two calculated asymmetry measures were high for the step length index (ICC(1,1)=0.82) and moderate for the single stance time index (ICC(1,1)=0.49). The absolute reliability values for all gait parameters are reported. Five of seven gait parameters measured by an electronic walkway and normalized to a common walking speed, appear to be highly repeatable in a short-term time span in children with CP who were able to walk without assistive walking devices, provided sufficient cognitive function.

  5. Memory judgements: the contribution of detail and emotion to assessments of believability and reliability.

    Science.gov (United States)

    Justice, Lucy V; Smith, Harriet M J

    2018-06-06

    In legal settings, jury members, police, and legal professionals often have to make judgements about witnesses' or victims' memories of events. Without a scientific understanding of memory, (often erroneous) beliefs are used to make decisions. Evaluation of the literature identified two prevalent beliefs that could influence judgements: (1) memory operates like a video recorder therefore, accounts that are detailed are more believable than those containing vague descriptions, and (2) memories recalled with congruent emotion are more believable than those recalled with incongruent emotion. A 2 (emotionality: emotional, non-emotional) × 2 (detail: high, low) factorial design was generated. In line with previous research, participants made believability judgements (Experiment 1) but uniquely, participants were also asked to judge the reliability of the rememberer's recall (Experiment 2). Self-reported confidence, personality measures, and political orientation were also recorded. Believability judgements did not vary as a function of detail or emotion but detailed accounts were judged as more reliable than vague accounts. Confidence and believability were positively correlated, whereas the confidence-reliability relationship was more complex. Personality and political measures were independent of judgements of both constructs. Our results suggest that believability and reliability are distinct constructs and should be examined as such in future research.

  6. A comparison of manual anthropometric measurements with Kinect-based scanned measurements in terms of precision and reliability.

    Science.gov (United States)

    Bragança, Sara; Arezes, Pedro; Carvalho, Miguel; Ashdown, Susan P; Castellucci, Ignacio; Leão, Celina

    2018-01-01

    Collecting anthropometric data for real-life applications demands a high degree of precision and reliability. It is important to test new equipment that will be used for data collectionOBJECTIVE:Compare two anthropometric data gathering techniques - manual methods and a Kinect-based 3D body scanner - to understand which of them gives more precise and reliable results. The data was collected using a measuring tape and a Kinect-based 3D body scanner. It was evaluated in terms of precision by considering the regular and relative Technical Error of Measurement and in terms of reliability by using the Intraclass Correlation Coefficient, Reliability Coefficient, Standard Error of Measurement and Coefficient of Variation. The results obtained showed that both methods presented better results for reliability than for precision. Both methods showed relatively good results for these two variables, however, manual methods had better results for some body measurements. Despite being considered sufficiently precise and reliable for certain applications (e.g. apparel industry), the 3D scanner tested showed, for almost every anthropometric measurement, a different result than the manual technique. Many companies design their products based on data obtained from 3D scanners, hence, understanding the precision and reliability of the equipment used is essential to obtain feasible results.

  7. Reliability of impedance cardiography in measuring central haemodynamics

    DEFF Research Database (Denmark)

    Mehlsen, J; Bonde, J; Stadeager, C

    1991-01-01

    The purpose of the study described here was to investigate the reliability of impedance cardiography (IC) in measuring cardiac output (CO) and central blood volume. Absolute values and changes in these variables obtained by impedance cardiography and by isotope- or thermodilution techniques were...... suitable for repeated measurements in studies on the haemodynamic effects of physiological or pharmacological intervention. Impedance cardiography is sufficiently reliable for comparison of absolute values of CO between different groups of patients. We cannot recommend impedance cardiography...... healthy subjects and in 25 unmedicated patients with ischaemic heart disease. We obtained significant correlations between absolute values (y = 0.68x + 1.48) and changes (y = 1.00x + 0.0003) in CO measured by IC and isotope- or thermodilution. IC significantly overestimated absolute values of CO (P less...

  8. Reliability of EEG Interactions Differs between Measures and Is Specific for Neurological Diseases

    Directory of Open Access Journals (Sweden)

    Yvonne Höller

    2017-07-01

    Full Text Available Alterations of interaction (connectivity of the EEG reflect pathological processes in patients with neurologic disorders. Nevertheless, it is questionable whether these patterns are reliable over time in different measures of interaction and whether this reliability of the measures is the same across different patient populations. In order to address this topic we examined 22 patients with mild cognitive impairment, five patients with subjective cognitive complaints, six patients with right-lateralized temporal lobe epilepsy, seven patients with left lateralized temporal lobe epilepsy, and 20 healthy controls. We calculated 14 measures of interaction from two EEG-recordings separated by 2 weeks. In order to characterize test-retest reliability, we correlated these measures for each group and compared the correlations between measures and between groups. We found that both measures of interaction as well as groups differed from each other in terms of reliability. The strongest correlation coefficients were found for spectrum, coherence, and full frequency directed transfer function (average rho > 0.9. In the delta (2–4 Hz range, reliability was lower for mild cognitive impairment compared to healthy controls and left lateralized temporal lobe epilepsy. In the beta (13–30 Hz, gamma (31–80 Hz, and high gamma (81–125 Hz frequency ranges we found decreased reliability in subjective cognitive complaints compared to mild cognitive impairment. In the gamma and high gamma range we found increased reliability in left lateralized temporal lobe epilepsy patients compared to healthy controls. Our results emphasize the importance of documenting reliability of measures of interaction, which may vary considerably between measures, but also between patient populations. We suggest that studies claiming clinical usefulness of measures of interaction should provide information on the reliability of the results. In addition, differences between patient

  9. Reliability of goniometry in Labrador Retrievers.

    Science.gov (United States)

    Jaegger, Gayle; Marcellin-Little, Denis J; Levine, David

    2002-07-01

    To evaluate the reliability of goniometry by comparing goniometric measurements with radiographic measurements and evaluate the effects of sedation on range of joint motion. 16 healthy adult Labrador Retrievers. 3 investigators blindly and independently measured range of motion of the carpus, elbow, shoulder, tarsus, stifle, and hip joints of 16 Labrador Retrievers in triplicate before and after dogs were sedated. Radiographs of all joints in maximal flexion and extension were made during under sedation. Goniometric measurements were compared with radiographic measurements. The influence of sedation and the intra- and intertester variability were evaluated; 95% confidence intervals for all ranges of motion were determined. Results of goniometric and radiographic measurements were not significantly different. Results of measurements made by the 3 investigators were not significantly different. Multiple measurements made by 1 investigator varied from 1 to 6 degrees (median, 3 degrees) depending on the joint. Sedation did not influence the range of motion of the evaluated joints. Goniometry is a reliable and objective method for determining range of motion of joints in healthy Labrador Retrievers.

  10. Brain networks for confidence weighting and hierarchical inference during probabilistic learning.

    Science.gov (United States)

    Meyniel, Florent; Dehaene, Stanislas

    2017-05-09

    Learning is difficult when the world fluctuates randomly and ceaselessly. Classical learning algorithms, such as the delta rule with constant learning rate, are not optimal. Mathematically, the optimal learning rule requires weighting prior knowledge and incoming evidence according to their respective reliabilities. This "confidence weighting" implies the maintenance of an accurate estimate of the reliability of what has been learned. Here, using fMRI and an ideal-observer analysis, we demonstrate that the brain's learning algorithm relies on confidence weighting. While in the fMRI scanner, human adults attempted to learn the transition probabilities underlying an auditory or visual sequence, and reported their confidence in those estimates. They knew that these transition probabilities could change simultaneously at unpredicted moments, and therefore that the learning problem was inherently hierarchical. Subjective confidence reports tightly followed the predictions derived from the ideal observer. In particular, subjects managed to attach distinct levels of confidence to each learned transition probability, as required by Bayes-optimal inference. Distinct brain areas tracked the likelihood of new observations given current predictions, and the confidence in those predictions. Both signals were combined in the right inferior frontal gyrus, where they operated in agreement with the confidence-weighting model. This brain region also presented signatures of a hierarchical process that disentangles distinct sources of uncertainty. Together, our results provide evidence that the sense of confidence is an essential ingredient of probabilistic learning in the human brain, and that the right inferior frontal gyrus hosts a confidence-based statistical learning algorithm for auditory and visual sequences.

  11. Brain networks for confidence weighting and hierarchical inference during probabilistic learning

    Science.gov (United States)

    Meyniel, Florent; Dehaene, Stanislas

    2017-01-01

    Learning is difficult when the world fluctuates randomly and ceaselessly. Classical learning algorithms, such as the delta rule with constant learning rate, are not optimal. Mathematically, the optimal learning rule requires weighting prior knowledge and incoming evidence according to their respective reliabilities. This “confidence weighting” implies the maintenance of an accurate estimate of the reliability of what has been learned. Here, using fMRI and an ideal-observer analysis, we demonstrate that the brain’s learning algorithm relies on confidence weighting. While in the fMRI scanner, human adults attempted to learn the transition probabilities underlying an auditory or visual sequence, and reported their confidence in those estimates. They knew that these transition probabilities could change simultaneously at unpredicted moments, and therefore that the learning problem was inherently hierarchical. Subjective confidence reports tightly followed the predictions derived from the ideal observer. In particular, subjects managed to attach distinct levels of confidence to each learned transition probability, as required by Bayes-optimal inference. Distinct brain areas tracked the likelihood of new observations given current predictions, and the confidence in those predictions. Both signals were combined in the right inferior frontal gyrus, where they operated in agreement with the confidence-weighting model. This brain region also presented signatures of a hierarchical process that disentangles distinct sources of uncertainty. Together, our results provide evidence that the sense of confidence is an essential ingredient of probabilistic learning in the human brain, and that the right inferior frontal gyrus hosts a confidence-based statistical learning algorithm for auditory and visual sequences. PMID:28439014

  12. 2017 NREL Photovoltaic Reliability Workshop

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, Sarah [National Renewable Energy Laboratory (NREL), Golden, CO (United States)

    2017-08-15

    NREL's Photovoltaic (PV) Reliability Workshop (PVRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology -- both critical goals for moving PV technologies deeper into the electricity marketplace.

  13. Reliability and relationship of radiographic measurements in hallux valgus.

    Science.gov (United States)

    Lee, Kyoung Min; Ahn, Soyeon; Chung, Chin Youb; Sung, Ki Hyuk; Park, Moon Seok

    2012-09-01

    Although various radiographic measurements have been developed and used for evaluating hallux valgus, not all are universally believed to be necessary and their relationships have not been clearly established. Determining which are related could provide some insight into which might be useful and which would not. We investigated the reliability of eight radiographic measurements used to evaluate hallux valgus, and determined which were correlated and which predicted the hallux valgus angle. We determined eight radiographic indices for 732 patients (mean age, 51 years; SD, 17 years; 107 males and 625 females) with hallux valgus: hallux valgus angle, intermetatarsal angle, hallux interphalangeal angle, distal metatarsal articular angle, proximal phalangeal articular angle, simplified metatarsus adductus angle, first metatarsal protrusion distance, and sesamoid rotation angle. Intraobserver and interobserver reliabilities of each radiographic measurement were analyzed on 36 feet from 36 randomly selected patients. Correlations among the radiographic measurements were analyzed. Radiographic measurements predicting hallux valgus angle were evaluated using multiple regression analysis. Hallux valgus angle had the highest reliability, whereas the distal metatarsal articular angle and simplified metatarsus adductus angle had the lowest. Distal metatarsal articular angle, intermetatarsal angle, and sesamoid rotation angle had the highest correlations with hallux valgus angle. Distal metatarsal articular angle correlated with sesamoid rotation angle. The intermetatarsal angle, interphalangeal angle, distal metatarsal articular angle, first metatarsal protrusion distance, sesamoid rotation angle, and metatarsus adductus angle predicted the hallux valgus angle. We suggest using hallux valgus angle, intermetatarsal angle, interphalangeal angle, sesamoid rotation angle, and first metatarsal protrusion distance considering their reliability and prediction of the deformity.

  14. A high confidence, manually validated human blood plasma protein reference set

    DEFF Research Database (Denmark)

    Schenk, Susann; Schoenhals, Gary J; de Souza, Gustavo

    2008-01-01

    BACKGROUND: The immense diagnostic potential of human plasma has prompted great interest and effort in cataloging its contents, exemplified by the Human Proteome Organization (HUPO) Plasma Proteome Project (PPP) pilot project. Due to challenges in obtaining a reliable blood plasma protein list......-trap-Fourier transform (LTQ-FT) and a linear ion trap-Orbitrap (LTQ-Orbitrap) for mass spectrometry (MS) analysis. Both instruments allow the measurement of peptide masses in the low ppm range. Furthermore, we employed a statistical score that allows database peptide identification searching using the products of two...... consecutive stages of tandem mass spectrometry (MS3). The combination of MS3 with very high mass accuracy in the parent peptide allows peptide identification with orders of magnitude more confidence than that typically achieved. RESULTS: Herein we established a high confidence set of 697 blood plasma proteins...

  15. RELIABILITY OF ANKLE-FOOT MORPHOLOGY, MOBILITY, STRENGTH, AND MOTOR PERFORMANCE MEASURES.

    Science.gov (United States)

    Fraser, John J; Koldenhoven, Rachel M; Saliba, Susan A; Hertel, Jay

    2017-12-01

    Assessment of foot posture, morphology, intersegmental mobility, strength and motor control of the ankle-foot complex are commonly used clinically, but measurement properties of many assessments are unclear. To determine test-retest and inter-rater reliability, standard error of measurement, and minimal detectable change of morphology, joint excursion and play, strength, and motor control of the ankle-foot complex. Reliability study. 24 healthy, recreationally-active young adults without history of ankle-foot injury were assessed by two clinicians on two occasions, three to ten days apart. Measurement properties were assessed for foot morphology (foot posture index, total and truncated length, width, arch height), joint excursion (weight-bearing dorsiflexion, rearfoot and hallux goniometry, forefoot inclinometry, 1 st metatarsal displacement) and joint play, strength (handheld dynamometry), and motor control rating during intrinsic foot muscle (IFM) exercises. Clinician order was randomized using a Latin Square. The clinicians performed independent examinations and did not confer on the findings for the duration of the study. Test-retest and inter-tester reliability and agreement was assessed using intraclass correlation coefficients (ICC 2,k ) and weighted kappa ( K w ). Test-retest reliability ICC were as follows: morphology: .80-1.00, joint excursion: .58-.97, joint play: -.67-.84, strength: .67-.92, IFM motor rating: K W -.01-.71. Inter-rater reliability ICC were as follows: morphology: .81-1.00, joint excursion: .32-.97, joint play: -1.06-1.00, strength: .53-.90, and IFM motor rating: K w .02-.56. Measures of ankle-foot posture, morphology, joint excursion, and strength demonstrated fair to excellent test-retest and inter-rater reliability. Test-retest reliability for rating of perceived difficulty and motor performance was good to excellent for short-foot, toe-spread-out, and hallux exercises and poor to fair for lesser toe extension. Joint play measures had

  16. Reliable and valid assessment of performance in thoracoscopy

    DEFF Research Database (Denmark)

    Konge, Lars; Lehnert, Per; Hansen, Henrik Jessen

    2012-01-01

    BACKGROUND: As we move toward competency-based education in medicine, we have lagged in developing competency-based evaluation methods. In the era of minimally invasive surgery, there is a need for a reliable and valid tool dedicated to measure competence in video-assisted thoracoscopic surgery....... The purpose of this study is to create such an assessment tool, and to explore its reliability and validity. METHODS: An expert group of physicians created an assessment tool consisting of 10 items rated on a five-point rating scale. The following factors were included: economy and confidence of movement...

  17. Measuring the cortical silent period can increase diagnostic confidence for amyotrophic lateral sclerosis.

    NARCIS (Netherlands)

    Schelhaas, H.J.; Arts, I.M.P.; Overeem, S.; Houtman, C.J.; Janssen, H.; Kleine, B.U.; Munneke, M.; Zwarts, M.J.

    2007-01-01

    We evaluated a modified measurement of the cortical silent period (CSP) as a simple procedure to add further confidence in the diagnostic work-up for ALS. Thirty-seven consecutive patients with a suspicion of having ALS were included together with 25 healthy volunteers, and followed until a final

  18. Reliability of cervical lordosis measurement techniques on long-cassette radiographs.

    Science.gov (United States)

    Janusz, Piotr; Tyrakowski, Marcin; Yu, Hailong; Siemionow, Kris

    2016-11-01

    Lateral radiographs are commonly used to assess cervical sagittal alignment. Three assessment methods have been described and are commonly utilized in clinical practice. These methods are described for perfect lateral cervical radiographs, however in everyday practice radiograph quality varies. The aim of this study was to compare the reliability and reproducibility of 3 cervical lordosis (CL) measurement methods. Forty-four standing lateral radiographs were randomly chosen from a lateral long-cassette radiograph database. Measurements of CL were performed with: Cobb method C2-C7 (CM), C2-C7 posterior tangent method (PTM), sum of posterior tangent method for each segment (SPTM). Three independent orthopaedic surgeons measured CL using the three methods on 44 lateral radiographs. One researcher used the three methods to measured CL three times at 4-week time intervals. Agreement between the methods as well as their intra- and interobserver reliability were tested and quantified by intraclass correlation coefficient (ICC) and median error for a single measurement (SEM). ICC of 0.75 or more reflected an excellent agreement/reliability. The results were compared with repeated ANOVA test, with p  0.05). All three methods appeared to be highly reliable. Although, high agreement between all measurement methods was shown, we do not recommend using Cobb measurement method interchangeably with PTM or SPTM within a single study as this could lead to error, whereas, such a comparison between tangent methods can be considered.

  19. Measuring reliability under epistemic uncertainty: Review on non-probabilistic reliability metrics

    Directory of Open Access Journals (Sweden)

    Kang Rui

    2016-06-01

    Full Text Available In this paper, a systematic review of non-probabilistic reliability metrics is conducted to assist the selection of appropriate reliability metrics to model the influence of epistemic uncertainty. Five frequently used non-probabilistic reliability metrics are critically reviewed, i.e., evidence-theory-based reliability metrics, interval-analysis-based reliability metrics, fuzzy-interval-analysis-based reliability metrics, possibility-theory-based reliability metrics (posbist reliability and uncertainty-theory-based reliability metrics (belief reliability. It is pointed out that a qualified reliability metric that is able to consider the effect of epistemic uncertainty needs to (1 compensate the conservatism in the estimations of the component-level reliability metrics caused by epistemic uncertainty, and (2 satisfy the duality axiom, otherwise it might lead to paradoxical and confusing results in engineering applications. The five commonly used non-probabilistic reliability metrics are compared in terms of these two properties, and the comparison can serve as a basis for the selection of the appropriate reliability metrics.

  20. Machine learning classification with confidence: application of transductive conformal predictors to MRI-based diagnostic and prognostic markers in depression.

    Science.gov (United States)

    Nouretdinov, Ilia; Costafreda, Sergi G; Gammerman, Alexander; Chervonenkis, Alexey; Vovk, Vladimir; Vapnik, Vladimir; Fu, Cynthia H Y

    2011-05-15

    There is rapidly accumulating evidence that the application of machine learning classification to neuroimaging measurements may be valuable for the development of diagnostic and prognostic prediction tools in psychiatry. However, current methods do not produce a measure of the reliability of the predictions. Knowing the risk of the error associated with a given prediction is essential for the development of neuroimaging-based clinical tools. We propose a general probabilistic classification method to produce measures of confidence for magnetic resonance imaging (MRI) data. We describe the application of transductive conformal predictor (TCP) to MRI images. TCP generates the most likely prediction and a valid measure of confidence, as well as the set of all possible predictions for a given confidence level. We present the theoretical motivation for TCP, and we have applied TCP to structural and functional MRI data in patients and healthy controls to investigate diagnostic and prognostic prediction in depression. We verify that TCP predictions are as accurate as those obtained with more standard machine learning methods, such as support vector machine, while providing the additional benefit of a valid measure of confidence for each prediction. Copyright © 2010 Elsevier Inc. All rights reserved.

  1. Use of and confidence in administering outcome measures among clinical prosthetists: Results from a national survey and mixed-methods training program.

    Science.gov (United States)

    Gaunaurd, Ignacio; Spaulding, Susan E; Amtmann, Dagmar; Salem, Rana; Gailey, Robert; Morgan, Sara J; Hafner, Brian J

    2015-08-01

    Outcome measures can be used in prosthetic practices to evaluate interventions, inform decision making, monitor progress, document outcomes, and justify services. Strategies to enhance prosthetists' ability to use outcome measures are needed to facilitate their adoption in routine practice. To assess prosthetists' use of outcome measures and evaluate the effects of training on their confidence in administering performance-based measures. Cross-sectional and single-group pretest-posttest survey. Seventy-nine certified prosthetists (mean of 16.0 years of clinical experience) were surveyed about their experiences with 20 standardized outcome measures. Prosthetists were formally trained by the investigators to administer the Timed Up and Go and Amputee Mobility Predictor. Prosthetists' confidence in administering the Timed Up and Go and Amputee Mobility Predictor was measured before and after training. The majority of prosthetists (62%) were classified as non-routine outcome measure users. Confidence administering the Timed Up and Go and Amputee Mobility Predictor prior to training was low-to-moderate across the study sample. Training significantly (p measures. Interactive training resulted in a statistically significant increase of prosthetists' confidence in administering the Timed Up and Go and Amputee Mobility Predictor and may facilitate use of outcome measures in clinical practice. Frequency of outcome measure use in the care of persons with limb loss has not been studied. Study results suggest that prosthetists may not regularly use standardized outcome measures and report limited confidence in administering them. Training enhances confidence and may encourage use of outcome measures in clinical practice. © The International Society for Prosthetics and Orthotics 2014.

  2. Inter-Rater Reliability of Cyclotorsion Measurements Using Fundus Photography.

    Science.gov (United States)

    Dysli, Muriel; Kanku, Madeleine; Traber, Ghislaine L

    2018-04-01

    The foveo-papillary angle (FPA) on fundus photographs is the accepted standard for the measurement of ocular cyclotorsion. We assessed the inter-rater reliability of this method in healthy subjects and in patients with trochlear nerve palsies. In this methodological study, fundus photographs of healthy subjects and of patients with trochlear nerve palsies were made with a fundus camera (Zeiss Fundus Camera FF 450 plus, Jena, Germany). Three independent observers measured the FPA on the fundus photographs of all subjects in synedra View (synedra View 16, Version 16.0.0.11, Innsbruck, Austria). One hundred and four eyes of 52 subjects (26 healthy controls and 26 patients) were assessed. The mean FPA of the healthy controls was 5.80 degrees (°) [± 0.44 standard error of the mean (SEM)] compared to 11.55° (± 0.80 SEM) for patients with trochlear nerve palsies. The inter-rater reliability of all measured FPAs showed an intraclass correlation coefficient (ICC) of 0.98 (95% CI 0.97 - 0.98). The inter-rater reliability of objective cyclotorsion measurements using fundus photographs was very high. Georg Thieme Verlag KG Stuttgart · New York.

  3. Inter-arch digital model vs. manual cast measurements: Accuracy and reliability.

    Science.gov (United States)

    Kiviahde, Heikki; Bukovac, Lea; Jussila, Päivi; Pesonen, Paula; Sipilä, Kirsi; Raustia, Aune; Pirttiniemi, Pertti

    2017-06-28

    The purpose of this study was to evaluate the accuracy and reliability of inter-arch measurements using digital dental models and conventional dental casts. Thirty sets of dental casts with permanent dentition were examined. Manual measurements were done with a digital caliper directly on the dental casts, and digital measurements were made on 3D models by two independent examiners. Intra-class correlation coefficients (ICC), a paired sample t-test or Wilcoxon signed-rank test, and Bland-Altman plots were used to evaluate intra- and inter-examiner error and to determine the accuracy and reliability of the measurements. The ICC values were generally good for manual and excellent for digital measurements. The Bland-Altman plots of all the measurements showed good agreement between the manual and digital methods and excellent inter-examiner agreement using the digital method. Inter-arch occlusal measurements on digital models are accurate and reliable and are superior to manual measurements.

  4. Reliability and validity of the de Morton Mobility Index in individuals with sub-acute stroke.

    Science.gov (United States)

    Braun, Tobias; Marks, Detlef; Thiel, Christian; Grüneberg, Christian

    2018-02-04

    To establish the validity and reliability of the de Morton Mobility Index (DEMMI) in patients with sub-acute stroke. This cross-sectional study was performed in a neurological rehabilitation hospital. We assessed unidimensionality, construct validity, internal consistency reliability, inter-rater reliability, minimal detectable change and possible floor and ceiling effects of the DEMMI in adult patients with sub-acute stroke. The study included a total sample of 121 patients with sub-acute stroke. We analysed validity (n = 109) and reliability (n = 51) in two sub-samples. Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 12.37, p = 0.577). All hypotheses on construct validity were confirmed. Internal consistency reliability (Cronbach's alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.95; 95% confidence interval: 0.92-0.97) were excellent. The minimal detectable change with 90% confidence was 13 points. No floor or ceiling effects were evident. These results indicate unidimensionality, sufficient internal consistency reliability, inter-rater reliability, and construct validity of the DEMMI in patients with a sub-acute stroke. Advantages of the DEMMI in clinical application are the short administration time, no need for special equipment and interval level data. The de Morton Mobility Index, therefore, may be a useful performance-based bedside test to measure mobility in individuals with a sub-acute stroke across the whole mobility spectrum. Implications for Rehabilitation The de Morton Mobility Index (DEMMI) is an unidimensional measurement instrument of mobility in individuals with sub-acute stroke. The DEMMI has excellent internal consistency and inter-rater reliability, and sufficient construct validity. The minimal detectable change of the DEMMI with 90% confidence in stroke rehabilitation is 13 points. The lack of any floor or ceiling effects on hospital admission indicates

  5. Evaluating Measures of Optimism and Sport Confidence

    Science.gov (United States)

    Fogarty, Gerard J.; Perera, Harsha N.; Furst, Andrea J.; Thomas, Patrick R.

    2016-01-01

    The psychometric properties of the Life Orientation Test-Revised (LOT-R), the Sport Confidence Inventory (SCI), and the Carolina SCI (CSCI) were examined in a study involving 260 athletes. The study aimed to test the dimensional structure, convergent and divergent validity, and invariance over competition level of scores generated by these…

  6. Current Developments in Measuring Academic Behavioural Confidence

    Science.gov (United States)

    Sander, Paul

    2009-01-01

    Using published findings and by further analyses of existing data, the structure, validity and utility of the Academic Behavioural Confidence scale (ABC) is critically considered. Validity is primarily assessed through the scale's relationship with other existing scales as well as by looking for predicted differences. The utility of the ABC scale…

  7. TEST-RETEST RELIABILITY OF HAND GRIP STRENGTH MEASUREMENT USING A JAMAR HAND DYNAMOMETER IN PATIENTS WITH ACUTE AND CHRONIC CERVICAL RADICULOPATHY

    Directory of Open Access Journals (Sweden)

    Ejazi G

    2017-12-01

    Full Text Available Background: To evaluate the test-retest reliability of Jamar hand held dynamometer for measuring handgrip strength (HGS in patients with acute and chronic cervical radiculopathy and to find out the difference in measurement of the handgrip strength between acute and chronic cervical radiculopathy. Methods: A prospective, observational and non-experimental, the comparative study design was used. A sample of 72 subjects (37 women and 35 men suffering from cervical radiculopathy were divided into two groups i.e., Group A(acute and Group B(chronic, handgrip strength was measured using Jamar hand held dynamometer on two occasions by the same rater with an interval of 7-days. Data collection was based on standard guidelines of American Society of Hand Therapists. Three gripping trials (measured in Kg with patient’s arm in standardized arm position were recorded. The data was analyzed from the mean score obtained from the sample. Result: One-way Analysis of Variance(ANOVA was used to evaluate test-retest reliability and Tukey-Kramer Multiple Comparison Test used to find the difference between handgrip strength among acute and chronic Cervical radiculopathy cases. Greater P-value (>0.05 in both testing session, as well as 95% of the confidence interval, shows the reliability of the instrument and lesser p-value (0.05 in female subjects shows no significant difference in handgrip strength between the two groups. Conclusion: Excellent test-retest reliability for hand grip strength measurement was measured in patients with acute and chronic cervical radiculopathy shows that the equipment could be used as an assessment tool for this patient and significant difference exists among male handgrip strength between acute and chronic cervical radiculopathy cases whereas no difference exists among female handgrip strength between acute and chronic cervical radiculopathy cases.

  8. Measurement-based reliability/performability models

    Science.gov (United States)

    Hsueh, Mei-Chen

    1987-01-01

    Measurement-based models based on real error-data collected on a multiprocessor system are described. Model development from the raw error-data to the estimation of cumulative reward is also described. A workload/reliability model is developed based on low-level error and resource usage data collected on an IBM 3081 system during its normal operation in order to evaluate the resource usage/error/recovery process in a large mainframe system. Thus, both normal and erroneous behavior of the system are modeled. The results provide an understanding of the different types of errors and recovery processes. The measured data show that the holding times in key operational and error states are not simple exponentials and that a semi-Markov process is necessary to model the system behavior. A sensitivity analysis is performed to investigate the significance of using a semi-Markov process, as opposed to a Markov process, to model the measured system.

  9. Confidence-building measures in the Asia-Pacific region

    International Nuclear Information System (INIS)

    Qin Huasun

    1991-01-01

    The regional confidence-building, security and disarmament issues in the Asia-Pacific region, and in particular, support to non-proliferation regime and establishing nuclear-weapon-free zones are reviewed

  10. Nanoscale deformation measurements for reliability assessment of material interfaces

    Science.gov (United States)

    Keller, Jürgen; Gollhardt, Astrid; Vogel, Dietmar; Michel, Bernd

    2006-03-01

    With the development and application of micro/nano electronic mechanical systems (MEMS, NEMS) for a variety of market segments new reliability issues will arise. The understanding of material interfaces is the key for a successful design for reliability of MEMS/NEMS and sensor systems. Furthermore in the field of BIOMEMS newly developed advanced materials and well known engineering materials are combined despite of fully developed reliability concepts for such devices and components. In addition the increasing interface-to volume ratio in highly integrated systems and nanoparticle filled materials are challenges for experimental reliability evaluation. New strategies for reliability assessment on the submicron scale are essential to fulfil the needs of future devices. In this paper a nanoscale resolution experimental method for the measurement of thermo-mechanical deformation at material interfaces is introduced. The determination of displacement fields is based on scanning probe microscopy (SPM) data. In-situ SPM scans of the analyzed object (i.e. material interface) are carried out at different thermo-mechanical load states. The obtained images are compared by grayscale cross correlation algorithms. This allows the tracking of local image patterns of the analyzed surface structure. The measurement results are full-field displacement fields with nanometer resolution. With the obtained data the mixed mode type of loading at material interfaces can be analyzed with highest resolution for future needs in micro system and nanotechnology.

  11. Conformal prediction for reliable machine learning theory, adaptations and applications

    CERN Document Server

    Balasubramanian, Vineeth; Vovk, Vladimir

    2014-01-01

    The conformal predictions framework is a recent development in machine learning that can associate a reliable measure of confidence with a prediction in any real-world pattern recognition application, including risk-sensitive applications such as medical diagnosis, face recognition, and financial risk prediction. Conformal Predictions for Reliable Machine Learning: Theory, Adaptations and Applications captures the basic theory of the framework, demonstrates how to apply it to real-world problems, and presents several adaptations, including active learning, change detection, and anomaly detecti

  12. Intra- and inter-rater reliabilities of measurement of ultrasound imaging for muscle thickness and pennation angle of tibialis anterior muscle in stroke patients.

    Science.gov (United States)

    Cho, Ki Hun; Lee, Hwang Jae; Lee, Wan Hee

    2017-07-01

    Dysfunction of skeletal muscle has been commonly reported in stroke patients. The purpose of this study was to investigate the intra- and inter-rater reliabilities of measurement of ultrasound imaging (USI) for pennation angle (PA) and muscle thickness (MT) of tibialis anterior muscle in stroke patients. Thirty-four stroke patients (19 men) participated in this study. USI was used for measurement of PA and MT of the tibialis anterior muscles at rest and during maximum voluntary contraction (MVC). Two examiners acquired images from all participants during two separate testing sessions, seven days apart. Intra-class correlation coefficients (ICCs), confidence interval (CI), standard error of measurement, minimal detectable change, and Bland-Altman plots were used for estimation of reliability. In the intra-rater reliability between measures, for all variables (PA and MT of the paretic and non-paretic sides of tibialis anterior muscles at rest and during MVC), the ICCs ranged between 0.639 and 0.998 and the CI was within an acceptable range of 0.388-0.999. In inter-rater reliability between examiners for the two tests, for all variables, the ICCs ranged between 0.690 and 0.995 and the CI was within an acceptable range of 0.463-0.997. In addition, significant difference was observed between the paretic and non-paretic sides of the tibialis anterior muscle architecture (p stroke patients. In addition, objective and quantitative measurements of tibialis anterior muscle using USI may provide appropriate management for the walking recovery of stroke patients.

  13. Distinguishing highly confident accurate and inaccurate memory: insights about relevant and irrelevant influences on memory confidence

    OpenAIRE

    Chua, Elizabeth F.; Hannula, Deborah E.; Ranganath, Charan

    2012-01-01

    It is generally believed that accuracy and confidence in one’s memory are related, but there are many instances when they diverge. Accordingly, it is important to disentangle the factors which contribute to memory accuracy and confidence, especially those factors that contribute to confidence, but not accuracy. We used eye movements to separately measure fluent cue processing, the target recognition experience, and relative evidence assessment on recognition confidence and accuracy. Eye movem...

  14. Inferring high-confidence human protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Yu Xueping

    2012-05-01

    Full Text Available Abstract Background As numerous experimental factors drive the acquisition, identification, and interpretation of protein-protein interactions (PPIs, aggregated assemblies of human PPI data invariably contain experiment-dependent noise. Ascertaining the reliability of PPIs collected from these diverse studies and scoring them to infer high-confidence networks is a non-trivial task. Moreover, a large number of PPIs share the same number of reported occurrences, making it impossible to distinguish the reliability of these PPIs and rank-order them. For example, for the data analyzed here, we found that the majority (>83% of currently available human PPIs have been reported only once. Results In this work, we proposed an unsupervised statistical approach to score a set of diverse, experimentally identified PPIs from nine primary databases to create subsets of high-confidence human PPI networks. We evaluated this ranking method by comparing it with other methods and assessing their ability to retrieve protein associations from a number of diverse and independent reference sets. These reference sets contain known biological data that are either directly or indirectly linked to interactions between proteins. We quantified the average effect of using ranked protein interaction data to retrieve this information and showed that, when compared to randomly ranked interaction data sets, the proposed method created a larger enrichment (~134% than either ranking based on the hypergeometric test (~109% or occurrence ranking (~46%. Conclusions From our evaluations, it was clear that ranked interactions were always of value because higher-ranked PPIs had a higher likelihood of retrieving high-confidence experimental data. Reducing the noise inherent in aggregated experimental PPIs via our ranking scheme further increased the accuracy and enrichment of PPIs derived from a number of biologically relevant data sets. These results suggest that using our high-confidence

  15. A Poisson process approximation for generalized K-5 confidence regions

    Science.gov (United States)

    Arsham, H.; Miller, D. R.

    1982-01-01

    One-sided confidence regions for continuous cumulative distribution functions are constructed using empirical cumulative distribution functions and the generalized Kolmogorov-Smirnov distance. The band width of such regions becomes narrower in the right or left tail of the distribution. To avoid tedious computation of confidence levels and critical values, an approximation based on the Poisson process is introduced. This aproximation provides a conservative confidence region; moreover, the approximation error decreases monotonically to 0 as sample size increases. Critical values necessary for implementation are given. Applications are made to the areas of risk analysis, investment modeling, reliability assessment, and analysis of fault tolerant systems.

  16. Inter- and intrarater reliability of two proprioception tests using clinical applicable measurement tools in subjects with and without knee osteoarthritis.

    Science.gov (United States)

    Baert, Isabel A C; Lluch, Enrique; Struyf, Thomas; Peeters, Greta; Van Oosterwijck, Sophie; Tuynman, Joanna; Rufai, Salim; Struyf, Filip

    2018-06-01

    The therapeutic value of proprioceptive-based exercises in knee osteoarthritis (KOA) management warrants investigation of proprioceptive testing methods easily accessible in clinical practice. To estimate inter- and intrarater reliability of the knee joint position sense (KJPS) test and knee force sense (KFS) test in subjects with and without KOA. Cross-sectional test-retest design. Two blinded raters performed independently repeated measures of the KJPS and KFS test, using an analogue inclinometer and handheld dynamometer, respectively, in eight KOA patients (12 symptomatic knees) and 26 healthy controls (52 asymptomatic knees). Intraclass correlation coefficients (ICCs; model 2,1), standard error of measurement (SEM) and minimal detectable change with 95% confidence bounds (MDC 95 ) were calculated. For KJPS, results showed good to excellent test-retest agreement (ICCs 0.70-0.95 in KOA patients; ICCs 0.65-0.85 in healthy controls). A 2° measurement error (SEM 1°) was reported when measuring KJPS in multiple test positions and calculating mean repositioning error. Testing KOA patients pre and post therapy a repositioning error larger than 4° (MDC 95 ) is needed to consider true change. Measuring KFS using handheld dynamometry showed poor to fair interrater and poor to excellent intrarater reliability in subjects with and without KOA. Measuring KJPS in multiple test positions using an analogue inclinometer and calculating mean repositioning error is reliable and can be used in clinical practice. We do not recommend the use of the KFS test to clinicians. Further research is required to establish diagnostic accuracy and validity of our KJPS test in larger knee pain populations. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Reliability and concurrent validity of the iPhone® Compass application to measure thoracic rotation range of motion (ROM) in healthy participants

    Science.gov (United States)

    Schram, Ben; Cox, Alistair J.; Anderson, Sarah L.; Keogh, Justin

    2018-01-01

    Background Several water-based sports (swimming, surfing and stand up paddle boarding) require adequate thoracic mobility (specifically rotation) in order to perform the appropriate activity requirements. The measurement of thoracic spine rotation is problematic for clinicians due to a lack of convenient and reliable measurement techniques. More recently, smartphones have been used to quantify movement in various joints in the body; however, there appears to be a paucity of research using smartphones to assess thoracic spine movement. Therefore, the aim of this study is to determine the reliability (intra and inter rater) and validity of the iPhone® app (Compass) when assessing thoracic spine rotation ROM in healthy individuals. Methods A total of thirty participants were recruited for this study. Thoracic spine rotation ROM was measured using both the current clinical gold standard, a universal goniometer (UG) and the Smart Phone Compass app. Intra-rater and inter-rater reliability was determined with a Intraclass Correlation Coefficient (ICC) and associated 95% confidence intervals (CI). Validation of the Compass app in comparison to the UG was measured using Pearson’s correlation coefficient and levels of agreement were identified with Bland–Altman plots and 95% limits of agreement. Results Both the UG and Compass app measurements both had excellent reproducibility for intra-rater (ICC 0.94–0.98) and inter-rater reliability (ICC 0.72–0.89). However, the Compass app measurements had higher intra-rater reliability (ICC = 0.96 − 0.98; 95% CI [0.93–0.99]; vs. ICC = 0.94 − 0.98; 95% CI [0.88–0.99]) and inter-rater reliability (ICC = 0.87 − 0.89; 95% CI [0.74–0.95] vs. ICC = 0.72 − 0.82; 95% CI [0.21–0.94]). A strong and significant correlation was found between the UG and the Compass app, demonstrating good concurrent validity (r = 0.835, p reliable tool for measuring thoracic spine rotation which produces greater

  18. Multivoxel neurofeedback selectively modulates confidence without changing perceptual performance

    Science.gov (United States)

    Cortese, Aurelio; Amano, Kaoru; Koizumi, Ai; Kawato, Mitsuo; Lau, Hakwan

    2016-01-01

    A central controversy in metacognition studies concerns whether subjective confidence directly reflects the reliability of perceptual or cognitive processes, as suggested by normative models based on the assumption that neural computations are generally optimal. This view enjoys popularity in the computational and animal literatures, but it has also been suggested that confidence may depend on a late-stage estimation dissociable from perceptual processes. Yet, at least in humans, experimental tools have lacked the power to resolve these issues convincingly. Here, we overcome this difficulty by using the recently developed method of decoded neurofeedback (DecNef) to systematically manipulate multivoxel correlates of confidence in a frontoparietal network. Here we report that bi-directional changes in confidence do not affect perceptual accuracy. Further psychophysical analyses rule out accounts based on simple shifts in reporting strategy. Our results provide clear neuroscientific evidence for the systematic dissociation between confidence and perceptual performance, and thereby challenge current theoretical thinking. PMID:27976739

  19. Is linear distance measured by panoramic radiography reliable?

    International Nuclear Information System (INIS)

    Nishikawa, Keiichi; Wakoh, Mamoru; Sano, Tsukasa; Suehiro, Atsushi; Sekine, Hideshi; Kousuge, Yuuji

    2010-01-01

    The objective of this study was to re-examine the reliability of distance measurements on clinical panoramic radiographs by comparing them with computed tomography (CT) images, from which the most accurate distance measurement is possible. Twenty pairs of images from patients examined both with panoramic radiography and CT for dental implant treatment planning in the premolar and molar regions of the mandible were used. The vertical linear distance between the alveolar crest and the closest mandibular canal was measured by three experienced oral radiologists on both images. The distances measured on panoramic radiographs were corrected for the magnification factor at the focal plane. Double-oblique cross-sectional images were used for CT. Pearson's correlation coefficient was calculated between distances obtained from both images. The paired t test was performed for statistical comparison. Error levels with the panoramic radiograph versus the CT image were also calculated. Pearson's correlation coefficient showed a significant strong linear correlation (R=0.90; p<0.01). However, the corrected value of distance measured on panoramic radiographs tended to be too small, and a significant difference was observed (p<0.05). The error level was approximately 10% (9.6±7.3%). Distance measurement on clinical panoramic radiographs is less reliable than CT images and cannot be recommended. (author)

  20. Reliability and Inequality Measures for the Weimal Distribution ...

    African Journals Online (AJOL)

    ). This article aimed at discussing both reliability and inequality measures from the Weimal distribution. The work has derived and discussed theoretically, expressions for the survival and hazard function of the Weimal distribution. The ordinary ...

  1. Reliability measures in item response theory: manifest versus latent correlation functions.

    Science.gov (United States)

    Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel; Verbeke, Geert; De Boeck, Paul

    2015-02-01

    For item response theory (IRT) models, which belong to the class of generalized linear or non-linear mixed models, reliability at the scale of observed scores (i.e., manifest correlation) is more difficult to calculate than latent correlation based reliability, but usually of greater scientific interest. This is not least because it cannot be calculated explicitly when the logit link is used in conjunction with normal random effects. As such, approximations such as Fisher's information coefficient, Cronbach's α, or the latent correlation are calculated, allegedly because it is easy to do so. Cronbach's α has well-known and serious drawbacks, Fisher's information is not meaningful under certain circumstances, and there is an important but often overlooked difference between latent and manifest correlations. Here, manifest correlation refers to correlation between observed scores, while latent correlation refers to correlation between scores at the latent (e.g., logit or probit) scale. Thus, using one in place of the other can lead to erroneous conclusions. Taylor series based reliability measures, which are based on manifest correlation functions, are derived and a careful comparison of reliability measures based on latent correlations, Fisher's information, and exact reliability is carried out. The latent correlations are virtually always considerably higher than their manifest counterparts, Fisher's information measure shows no coherent behaviour (it is even negative in some cases), while the newly introduced Taylor series based approximations reflect the exact reliability very closely. Comparisons among the various types of correlations, for various IRT models, are made using algebraic expressions, Monte Carlo simulations, and data analysis. Given the light computational burden and the performance of Taylor series based reliability measures, their use is recommended. © 2014 The British Psychological Society.

  2. Doubly Bayesian Analysis of Confidence in Perceptual Decision-Making.

    Science.gov (United States)

    Aitchison, Laurence; Bang, Dan; Bahrami, Bahador; Latham, Peter E

    2015-10-01

    Humans stand out from other animals in that they are able to explicitly report on the reliability of their internal operations. This ability, which is known as metacognition, is typically studied by asking people to report their confidence in the correctness of some decision. However, the computations underlying confidence reports remain unclear. In this paper, we present a fully Bayesian method for directly comparing models of confidence. Using a visual two-interval forced-choice task, we tested whether confidence reports reflect heuristic computations (e.g. the magnitude of sensory data) or Bayes optimal ones (i.e. how likely a decision is to be correct given the sensory data). In a standard design in which subjects were first asked to make a decision, and only then gave their confidence, subjects were mostly Bayes optimal. In contrast, in a less-commonly used design in which subjects indicated their confidence and decision simultaneously, they were roughly equally likely to use the Bayes optimal strategy or to use a heuristic but suboptimal strategy. Our results suggest that, while people's confidence reports can reflect Bayes optimal computations, even a small unusual twist or additional element of complexity can prevent optimality.

  3. 2015 NREL Photovoltaic Module Reliability Workshops

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, Sarah [National Renewable Energy Laboratory (NREL), Golden, CO (United States)

    2017-09-14

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology--both critical goals for moving PV technologies deeper into the electricity marketplace.

  4. 2016 NREL Photovoltaic Module Reliability Workshop

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, Sarah [National Renewable Energy Laboratory (NREL), Golden, CO (United States)

    2017-09-07

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology - both critical goals for moving PV technologies deeper into the electricity marketplace.

  5. Reliability of Rehabilitative Ultrasonography to Measure Transverse Abdominis and Multifidus Muscle Dimensions

    International Nuclear Information System (INIS)

    Nabavi, Narjes; Mosallanezhad, Zahra; Haghighatkhah, Hamid Reza; Mohseni Bandpeid, Mohammad Ali

    2014-01-01

    Lumbar paraspinal muscles play an important role in providing both mobility and stability during dynamic tasks. Among paraspinal muscles, transverse abdominis and lumbar multifidus have been of particular interest as active stabilizers of the lumbar spine. These muscles may become dysfunctional in chronic low back pain (CLBP). Low back injury can result in muscle inhibition and control loss that cannot recover spontaneously, and specific exercises are required to stimulate their recovery. The purpose of this study was to test the reliability of ultrasonography to measure muscle dimensions and to present a reliable method for measuring transverse abdominis and lumbar multifidus as stabilizing muscles of the lumbar spine. Fifteen healthy participants (18-55 year olds) were evaluated by a radiologist using ultrasonography (ES500) with two probes (50mm linear 7.5 MHZ and 70 mm curvilinear 3.5 MHz). The muscle thickness of transverse abdominis and the anterior-posterior diameter and cross sectional area of the LMF were measured. To determine within and between days reliabilities, second and third measurements were repeated with half an hour and one week intervals, respectively. Intraclass correlation coefficient for left and right showed good to high reliability for the cross sectional area of lumbar multifidi (0.74 and 0.88, respectively) as well as the anterior-posterior dimensions of lumbar multifidi (0.89 and 0.91, respectively) and transverse abdomini thickness (0.73 and 0.85, respectively). Rehabilitative ultrasonography is a reliable and non-invasive instrument to measure muscle thickness. The method used in this study is a reliable way to measure lumbar stabilizing muscles

  6. Improved radiograph measurement inter-observer reliability by use of statistical shape models

    Energy Technology Data Exchange (ETDEWEB)

    Pegg, E.C., E-mail: elise.pegg@ndorms.ox.ac.uk [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom); Mellon, S.J., E-mail: stephen.mellon@ndorms.ox.ac.uk [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom); Salmon, G. [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom); Alvand, A., E-mail: abtin.alvand@ndorms.ox.ac.uk [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom); Pandit, H., E-mail: hemant.pandit@ndorms.ox.ac.uk [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom); Murray, D.W., E-mail: david.murray@ndorms.ox.ac.uk [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom); Gill, H.S., E-mail: richie.gill@ndorms.ox.ac.uk [University of Oxford, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Windmill Road, Oxford OX3 7LD (United Kingdom)

    2012-10-15

    Pre- and post-operative radiographs of patients undergoing joint arthroplasty are often examined for a variety of purposes including preoperative planning and patient assessment. This work examines the feasibility of using active shape models (ASM) to semi-automate measurements from post-operative radiographs for the specific case of the Oxford™ Unicompartmental Knee. Measurements of the proximal tibia and the position of the tibial tray were made using the ASM model and manually. Data were obtained by four observers and one observer took four sets of measurements to allow assessment of the inter- and intra-observer reliability, respectively. The parameters measured were the tibial tray angle, the tray overhang, the tray size, the sagittal cut position, the resection level and the tibial width. Results demonstrated improved reliability (average of 27% and 11.2% increase for intra- and inter-reliability, respectively) and equivalent accuracy (p > 0.05 for compared data values) for all of the measurements using the ASM model, with the exception of the tray overhang (p = 0.0001). Less time (15 s) was required to take measurements using the ASM model compared with manual measurements, which was significant. These encouraging results indicate that semi-automated measurement techniques could improve the reliability of radiographic measurements.

  7. Improved radiograph measurement inter-observer reliability by use of statistical shape models

    International Nuclear Information System (INIS)

    Pegg, E.C.; Mellon, S.J.; Salmon, G.; Alvand, A.; Pandit, H.; Murray, D.W.; Gill, H.S.

    2012-01-01

    Pre- and post-operative radiographs of patients undergoing joint arthroplasty are often examined for a variety of purposes including preoperative planning and patient assessment. This work examines the feasibility of using active shape models (ASM) to semi-automate measurements from post-operative radiographs for the specific case of the Oxford™ Unicompartmental Knee. Measurements of the proximal tibia and the position of the tibial tray were made using the ASM model and manually. Data were obtained by four observers and one observer took four sets of measurements to allow assessment of the inter- and intra-observer reliability, respectively. The parameters measured were the tibial tray angle, the tray overhang, the tray size, the sagittal cut position, the resection level and the tibial width. Results demonstrated improved reliability (average of 27% and 11.2% increase for intra- and inter-reliability, respectively) and equivalent accuracy (p > 0.05 for compared data values) for all of the measurements using the ASM model, with the exception of the tray overhang (p = 0.0001). Less time (15 s) was required to take measurements using the ASM model compared with manual measurements, which was significant. These encouraging results indicate that semi-automated measurement techniques could improve the reliability of radiographic measurements

  8. Can confidence indicators forecast the probability of expansion in Croatia?

    Directory of Open Access Journals (Sweden)

    Mirjana Čižmešija

    2016-04-01

    Full Text Available The aim of this paper is to investigate how reliable are confidence indicators in forecasting the probability of expansion. We consider three Croatian Business Survey indicators: the Industrial Confidence Indicator (ICI, the Construction Confidence Indicator (BCI and the Retail Trade Confidence Indicator (RTCI. The quarterly data, used in the research, covered the periods from 1999/Q1 to 2014/Q1. Empirical analysis consists of two parts. The non-parametric Bry-Boschan algorithm is used for distinguishing periods of expansion from the period of recession in the Croatian economy. Then, various nonlinear probit models were estimated. The models differ with respect to the regressors (confidence indicators and the time lags. The positive signs of estimated parameters suggest that the probability of expansion increases with an increase in Confidence Indicators. Based on the obtained results, the conclusion is that ICI is the most powerful predictor of the probability of expansion in Croatia.

  9. Increasing Product Confidence-Shifting Paradigms.

    Science.gov (United States)

    Phillips, Marla; Kashyap, Vishal; Cheung, Mee-Shew

    2015-01-01

    with a newfound respect for their suppliers, and it will allow manufacturers to finally address true root causes that can lead to a marked increase in product confidence. In the past decade, pharmaceutical, medical device, and food manufacturers have increased their focus on controlling and managing the performance of their suppliers in an effort to improve the confidence of the materials going into the final marketed products and to improve patient and customer confidence in final product reliability and safety. Concerned that product confidence has not improved, Xavier University launched the Integrity of Supply Initiative in 2012 with a team of industry leaders and U.S. Food and Drug Administration officials. Through this initiative, data generated has revealed that manufacturers either unknowingly increase the potential for error or can control/prevent many aspects of product confidence failure. Product confidence can be improved by shifting the focus from controlling supplier practices to controlling the practices of the manufacturers themselves. © PDA, Inc. 2015.

  10. Reliability of a Computerized Neurocognitive Test in Baseline Concussion Testing of High School Athletes.

    Science.gov (United States)

    MacDonald, James; Duerson, Drew

    2015-07-01

    Baseline assessments using computerized neurocognitive tests are frequently used in the management of sport-related concussions. Such testing is often done on an annual basis in a community setting. Reliability is a fundamental test characteristic that should be established for such tests. Our study examined the test-retest reliability of a computerized neurocognitive test in high school athletes over 1 year. Repeated measures design. Two American high schools. High school athletes (N = 117) participating in American football or soccer during the 2011-2012 and 2012-2013 academic years. All study participants completed 2 baseline computerized neurocognitive tests taken 1 year apart at their respective schools. The test measures performance on 4 cognitive tasks: identification speed (Attention), detection speed (Processing Speed), one card learning accuracy (Learning), and one back speed (Working Memory). Reliability was assessed by measuring the intraclass correlation coefficient (ICC) between the repeated measures of the 4 cognitive tasks. Pearson and Spearman correlation coefficients were calculated as a secondary outcome measure. The measure for identification speed performed best (ICC = 0.672; 95% confidence interval, 0.559-0.760) and the measure for one card learning accuracy performed worst (ICC = 0.401; 95% confidence interval, 0.237-0.542). All tests had marginal or low reliability. In a population of high school athletes, computerized neurocognitive testing performed in a community setting demonstrated low to marginal test-retest reliability on baseline assessments 1 year apart. Further investigation should focus on (1) improving the reliability of individual tasks tested, (2) controlling for external factors that might affect test performance, and (3) identifying the ideal time interval to repeat baseline testing in high school athletes. Computerized neurocognitive tests are used frequently in high school athletes, often within a model of baseline testing

  11. Reliability-guided digital image correlation for image deformation measurement

    International Nuclear Information System (INIS)

    Pan Bing

    2009-01-01

    A universally applicable reliability-guided digital image correlation (DIC) method is proposed for reliable image deformation measurement. The zero-mean normalized cross correlation (ZNCC) coefficient is used to identify the reliability of the point computed. The correlation calculation begins with a seed point and is then guided by the ZNCC coefficient. That means the neighbors of the point with the highest ZNCC coefficient in a queue for computed points will be processed first. Thus the calculation path is always along the most reliable direction, and possible error propagation of the conventional DIC method can be avoided. The proposed novel DIC method is universally applicable to the images with shadows, discontinuous areas, and deformation discontinuity. Two image pairs were used to evaluate the performance of the proposed technique, and the successful results clearly demonstrate its robustness and effectiveness

  12. The role of test-retest reliability in measuring individual and group differences in executive functioning.

    Science.gov (United States)

    Paap, Kenneth R; Sawi, Oliver

    2016-12-01

    Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. Non-proliferation and confidence-building measures in Asia and the Pacific

    International Nuclear Information System (INIS)

    1992-01-01

    In the face of improved international relations, regional and subregional issues have acquired additional urgency and importance in the field of disarmament and international security. The pursuit of regional solutions to regional problems is thus being actively encouraged by the international community. Towards this end, the United Nations Office for Disarmament Affairs is seeking to promote regional approaches to disarmament either through the united nations regional centres for peace and Disarmament or cooperation with individual Governments. Within this framework this conference was dealing with non-proliferation and confidence-building measures in Asia and the Pacific region

  14. The Reliability of a Novel Mobile 3-dimensional Wound Measurement Device.

    Science.gov (United States)

    Anghel, Ersilia L; Kumar, Anagha; Bigham, Thomas E; Maselli, Kathryn M; Steinberg, John S; Evans, Karen K; Kim, Paul J; Attinger, Christopher E

    2016-11-01

    Objective assessment of wound dimensions is essential for tracking progression and determining treatment effectiveness. A reliability study was designed to establish intrarater and interrater reliability of a novel mobile 3-dimensional wound measurement (3DWM) device. Forty-five wounds were assessed by 2 raters using a 3DWM device to obtain length, width, area, depth, and volume measurements. Wounds were also measured manually, using a disposable ruler and digital planimetry. The intraclass correlation coefficient (ICC) was used to establish intrarater and interrater reliability. High levels of intrarater and interrater agreement were observed for area, length, and width; ICC = 0.998, 0.977, 0.955 and 0.999, 0.997, 0.995, respectively. Moderate levels of intrarater (ICC = 0.888) and interrater (ICC = 0.696) agreement were observed for volume. Lastly, depth yielded an intrarater ICC of 0.360 and an interrater ICC of 0.649. Measures from the 3DWM device were highly correlated with those obtained from scaled photography for length, width, and area (ρ = 0.997, 0.988, 0.997, P device yielded correlations of ρ = 0.990, 0.987, 0.996 with P device was found to be highly reliable for measuring wound areas for a range of wound sizes and types as compared to manual measurement and digital planimetry. The depth and therefore volume measurement using the 3DWM device was found to have a lower ICC, but volume ICC alone was moderate. Overall, this device offers a mobile option for objective wound measurement in the clinical setting.

  15. Editorial: disarmament, non proliferation, confidence-building measures, armament control

    International Nuclear Information System (INIS)

    Soutou, Georges-Henri

    2015-01-01

    After having described the vicious circle existing between disarmament and security as it appeared before and during the first World War, the author deals with the specific case of nuclear disarmament as it was first addressed just after the Second World War, and was then not accepted by the Russians. He comments the political and strategical approach adopted by the Kennedy administration, notably within the context of severe crises (Berlin and Cuba). This resulted in the re-establishment of a relationship between war and policy as defined by Clausewitz, but based on a trilogy of three inseparable pairs: deterrence and armament control, armament control and non proliferation, armament control and confidence-building measures. The author shows that this trilogy has been somehow operating until the end of Cold War, and that nothing works anymore since the end of Cold War and of the bipolar world

  16. IMPROVING SEMI-GLOBAL MATCHING: COST AGGREGATION AND CONFIDENCE MEASURE

    Directory of Open Access Journals (Sweden)

    P. d’Angelo

    2016-06-01

    Full Text Available Digital elevation models are one of the basic products that can be generated from remotely sensed imagery. The Semi Global Matching (SGM algorithm is a robust and practical algorithm for dense image matching. The connection between SGM and Belief Propagation was recently developed, and based on that improvements such as correction of over-counting the data term, and a new confidence measure have been proposed. Later the MGM algorithm has been proposed, it aims at improving the regularization step of SGM, but has only been evaluated on the Middlebury stereo benchmark so far. This paper evaluates these proposed improvements on the ISPRS satellite stereo benchmark, using a Pleiades Triplet and a Cartosat-1 Stereo pair. The over-counting correction slightly improves matching density, at the expense of adding a few outliers. The MGM cost aggregation shows leads to a slight increase of accuracy.

  17. Reliability and Validity of Finger Strength and Endurance Measurements in Rock Climbing

    Science.gov (United States)

    Michailov, Michail Lubomirov; Baláš, Jirí; Tanev, Stoyan Kolev; Andonov, Hristo Stoyanov; Kodejška, Jan; Brown, Lee

    2018-01-01

    Purpose: An advanced system for the assessment of climbing-specific performance was developed and used to: (a) investigate the effect of arm fixation (AF) on construct validity evidence and reliability of climbing-specific finger-strength measurement; (b) assess reliability of finger-strength and endurance measurements; and (c) evaluate the…

  18. Sources of sport confidence, imagery type and performance among competitive athletes: the mediating role of sports confidence.

    Science.gov (United States)

    Levy, A R; Perry, J; Nicholls, A R; Larkin, D; Davies, J

    2015-01-01

    This study explored the mediating role of sport confidence upon (1) sources of sport confidence-performance relationship and (2) imagery-performance relationship. Participants were 157 competitive athletes who completed state measures of confidence level/sources, imagery type and performance within one hour after competition. Among the current sample, confirmatory factor analysis revealed appropriate support for the nine-factor SSCQ and the five-factor SIQ. Mediational analysis revealed that sport confidence had a mediating influence upon the achievement source of confidence-performance relationship. In addition, both cognitive and motivational imagery types were found to be important sources of confidence, as sport confidence mediated imagery type- performance relationship. Findings indicated that athletes who construed confidence from their own achievements and report multiple images on a more frequent basis are likely to benefit from enhanced levels of state sport confidence and subsequent performance.

  19. Evidence for a confidence-accuracy relationship in memory for same- and cross-race faces.

    Science.gov (United States)

    Nguyen, Thao B; Pezdek, Kathy; Wixted, John T

    2017-12-01

    Discrimination accuracy is usually higher for same- than for cross-race faces, a phenomenon known as the cross-race effect (CRE). According to prior research, the CRE occurs because memories for same- and cross-race faces rely on qualitatively different processes. However, according to a continuous dual-process model of recognition memory, memories that rely on qualitatively different processes do not differ in recognition accuracy when confidence is equated. Thus, although there are differences in overall same- and cross-race discrimination accuracy, confidence-specific accuracy (i.e., recognition accuracy at a particular level of confidence) may not differ. We analysed datasets from four recognition memory studies on same- and cross-race faces to test this hypothesis. Confidence ratings reliably predicted recognition accuracy when performance was above chance levels (Experiments 1, 2, and 3) but not when performance was at chance levels (Experiment 4). Furthermore, at each level of confidence, confidence-specific accuracy for same- and cross-race faces did not significantly differ when overall performance was above chance levels (Experiments 1, 2, and 3) but significantly differed when overall performance was at chance levels (Experiment 4). Thus, under certain conditions, high-confidence same-race and cross-race identifications may be equally reliable.

  20. Statistical Primer for Athletic Trainers: The Essentials of Understanding Measures of Reliability and Minimal Important Change.

    Science.gov (United States)

    Riemann, Bryan L; Lininger, Monica R

    2018-01-01

      To describe the concepts of measurement reliability and minimal important change.   All measurements have some magnitude of error. Because clinical practice involves measurement, clinicians need to understand measurement reliability. The reliability of an instrument is integral in determining if a change in patient status is meaningful.   Measurement reliability is the extent to which a test result is consistent and free of error. Three perspectives of reliability-relative reliability, systematic bias, and absolute reliability-are often reported. However, absolute reliability statistics, such as the minimal detectable difference, are most relevant to clinicians because they provide an expected error estimate. The minimal important difference is the smallest change in a treatment outcome that the patient would identify as important.   Clinicians should use absolute reliability characteristics, preferably the minimal detectable difference, to determine the extent of error around a patient's measurement. The minimal detectable difference, coupled with an appropriately estimated minimal important difference, can assist the practitioner in identifying clinically meaningful changes in patients.

  1. The reliability of a segmentation methodology for assessing intramuscular adipose tissue and other soft-tissue compartments of lower leg MRI images.

    Science.gov (United States)

    Karampatos, Sarah; Papaioannou, Alexandra; Beattie, Karen A; Maly, Monica R; Chan, Adrian; Adachi, Jonathan D; Pritchard, Janet M

    2016-04-01

    Determine the reliability of a magnetic resonance (MR) image segmentation protocol for quantifying intramuscular adipose tissue (IntraMAT), subcutaneous adipose tissue, total muscle and intermuscular adipose tissue (InterMAT) of the lower leg. Ten axial lower leg MRI slices were obtained from 21 postmenopausal women using a 1 Tesla peripheral MRI system. Images were analyzed using sliceOmatic™ software. The average cross-sectional areas of the tissues were computed for the ten slices. Intra-rater and inter-rater reliability were determined and expressed as the standard error of measurement (SEM) (absolute reliability) and intraclass coefficient (ICC) (relative reliability). Intra-rater and inter-rater reliability for IntraMAT were 0.991 (95% confidence interval [CI] 0.978-0.996, p soft tissue compartments, the ICCs were all >0.90 (p soft-tissue compartments of the lower leg. A standard operating procedure manual is provided to assist users, and SEM values can be used to estimate sample size and determine confidence in repeated measurements in future research.

  2. Measurement of the Inter-Rater Reliability Rate Is Mandatory for Improving the Quality of a Medical Database: Experience with the Paulista Lung Cancer Registry.

    Science.gov (United States)

    Lauricella, Leticia L; Costa, Priscila B; Salati, Michele; Pego-Fernandes, Paulo M; Terra, Ricardo M

    2018-06-01

    Database quality measurement should be considered a mandatory step to ensure an adequate level of confidence in data used for research and quality improvement. Several metrics have been described in the literature, but no standardized approach has been established. We aimed to describe a methodological approach applied to measure the quality and inter-rater reliability of a regional multicentric thoracic surgical database (Paulista Lung Cancer Registry). Data from the first 3 years of the Paulista Lung Cancer Registry underwent an audit process with 3 metrics: completeness, consistency, and inter-rater reliability. The first 2 methods were applied to the whole data set, and the last method was calculated using 100 cases randomized for direct auditing. Inter-rater reliability was evaluated using percentage of agreement between the data collector and auditor and through calculation of Cohen's κ and intraclass correlation. The overall completeness per section ranged from 0.88 to 1.00, and the overall consistency was 0.96. Inter-rater reliability showed many variables with high disagreement (>10%). For numerical variables, intraclass correlation was a better metric than inter-rater reliability. Cohen's κ showed that most variables had moderate to substantial agreement. The methodological approach applied to the Paulista Lung Cancer Registry showed that completeness and consistency metrics did not sufficiently reflect the real quality status of a database. The inter-rater reliability associated with κ and intraclass correlation was a better quality metric than completeness and consistency metrics because it could determine the reliability of specific variables used in research or benchmark reports. This report can be a paradigm for future studies of data quality measurement. Copyright © 2018 American College of Surgeons. Published by Elsevier Inc. All rights reserved.

  3. Reliability and concurrent validity of the iPhone® Compass application to measure thoracic rotation range of motion (ROM in healthy participants

    Directory of Open Access Journals (Sweden)

    James Furness

    2018-03-01

    Full Text Available Background Several water-based sports (swimming, surfing and stand up paddle boarding require adequate thoracic mobility (specifically rotation in order to perform the appropriate activity requirements. The measurement of thoracic spine rotation is problematic for clinicians due to a lack of convenient and reliable measurement techniques. More recently, smartphones have been used to quantify movement in various joints in the body; however, there appears to be a paucity of research using smartphones to assess thoracic spine movement. Therefore, the aim of this study is to determine the reliability (intra and inter rater and validity of the iPhone® app (Compass when assessing thoracic spine rotation ROM in healthy individuals. Methods A total of thirty participants were recruited for this study. Thoracic spine rotation ROM was measured using both the current clinical gold standard, a universal goniometer (UG and the Smart Phone Compass app. Intra-rater and inter-rater reliability was determined with a Intraclass Correlation Coefficient (ICC and associated 95% confidence intervals (CI. Validation of the Compass app in comparison to the UG was measured using Pearson’s correlation coefficient and levels of agreement were identified with Bland–Altman plots and 95% limits of agreement. Results Both the UG and Compass app measurements both had excellent reproducibility for intra-rater (ICC 0.94–0.98 and inter-rater reliability (ICC 0.72–0.89. However, the Compass app measurements had higher intra-rater reliability (ICC = 0.96 − 0.98; 95% CI [0.93–0.99]; vs. ICC = 0.94 − 0.98; 95% CI [0.88–0.99] and inter-rater reliability (ICC = 0.87 − 0.89; 95% CI [0.74–0.95] vs. ICC = 0.72 − 0.82; 95% CI [0.21–0.94]. A strong and significant correlation was found between the UG and the Compass app, demonstrating good concurrent validity (r = 0.835, p < 0.001. Levels of agreement between the two devices were 24.8° (LoA –9

  4. Semi-automated CCTV surveillance: the effects of system confidence, system accuracy and task complexity on operator vigilance, reliance and workload.

    Science.gov (United States)

    Dadashi, N; Stedmon, A W; Pridmore, T P

    2013-09-01

    Recent advances in computer vision technology have lead to the development of various automatic surveillance systems, however their effectiveness is adversely affected by many factors and they are not completely reliable. This study investigated the potential of a semi-automated surveillance system to reduce CCTV operator workload in both detection and tracking activities. A further focus of interest was the degree of user reliance on the automated system. A simulated prototype was developed which mimicked an automated system that provided different levels of system confidence information. Dependent variable measures were taken for secondary task performance, reliance and subjective workload. When the automatic component of a semi-automatic CCTV surveillance system provided reliable system confidence information to operators, workload significantly decreased and spare mental capacity significantly increased. Providing feedback about system confidence and accuracy appears to be one important way of making the status of the automated component of the surveillance system more 'visible' to users and hence more effective to use. Copyright © 2012 Elsevier Ltd and The Ergonomics Society. All rights reserved.

  5. Confidence Intervals from Realizations of Simulated Nuclear Data

    Energy Technology Data Exchange (ETDEWEB)

    Younes, W. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Ratkiewicz, A. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Ressler, J. J. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2017-09-28

    Various statistical techniques are discussed that can be used to assign a level of confidence in the prediction of models that depend on input data with known uncertainties and correlations. The particular techniques reviewed in this paper are: 1) random realizations of the input data using Monte-Carlo methods, 2) the construction of confidence intervals to assess the reliability of model predictions, and 3) resampling techniques to impose statistical constraints on the input data based on additional information. These techniques are illustrated with a calculation of the keff value, based on the 235U(n, f) and 239Pu (n, f) cross sections.

  6. Tactile acuity charts: a reliable measure of spatial acuity.

    Science.gov (United States)

    Bruns, Patrick; Camargo, Carlos J; Campanella, Humberto; Esteve, Jaume; Dinse, Hubert R; Röder, Brigitte

    2014-01-01

    For assessing tactile spatial resolution it has recently been recommended to use tactile acuity charts which follow the design principles of the Snellen letter charts for visual acuity and involve active touch. However, it is currently unknown whether acuity thresholds obtained with this newly developed psychophysical procedure are in accordance with established measures of tactile acuity that involve passive contact with fixed duration and control of contact force. Here we directly compared tactile acuity thresholds obtained with the acuity charts to traditional two-point and grating orientation thresholds in a group of young healthy adults. For this purpose, two types of charts, using either Braille-like dot patterns or embossed Landolt rings with different orientations, were adapted from previous studies. Measurements with the two types of charts were equivalent, but generally more reliable with the dot pattern chart. A comparison with the two-point and grating orientation task data showed that the test-retest reliability of the acuity chart measurements after one week was superior to that of the passive methods. Individual thresholds obtained with the acuity charts agreed reasonably with the grating orientation threshold, but less so with the two-point threshold that yielded relatively distinct acuity estimates compared to the other methods. This potentially considerable amount of mismatch between different measures of tactile acuity suggests that tactile spatial resolution is a complex entity that should ideally be measured with different methods in parallel. The simple test procedure and high reliability of the acuity charts makes them a promising complement and alternative to the traditional two-point and grating orientation thresholds.

  7. Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis.

    Science.gov (United States)

    Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

    2017-01-18

    To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.

  8. Self-report measures of prospective memory are reliable but not valid.

    Science.gov (United States)

    Uttl, Bob; Kibreab, Mekale

    2011-03-01

    Are self-report measures of prospective memory (ProM) reliable and valid? To examine this question, 240 undergraduate student volunteers completed several widely used self-report measures of ProM including the Prospective Memory Questionnaire (PMQ), the Prospective and Retrospective Memory Questionnaire (PRMQ), the Comprehensive Assessment of Prospective Memory (CAPM) questionnaire, self-reports of retrospective memory (RetM), objective measures of ProM and RetM, and measures of involvement in activities and events, memory strategies and aids use, personality and verbal intelligence. The results showed that both convergent and divergent validity of ProM self-reports are poor, even though we assessed ProM using a newly developed, reliable continuous measure. Further analyses showed that a substantial proportion of variability in ProM self-report scores was due to verbal intelligence, personality (conscientiousness, neuroticism), activities and event involvement (busyness), and use of memory strategies and aids. ProM self-reports have adequate reliability, but poor validity and should not be interpreted as reflecting ProM ability. (PsycINFO Database Record (c) 2011 APA, all rights reserved).

  9. Reliability of Two Smartphone Applications for Radiographic Measurements of Hallux Valgus Angles.

    Science.gov (United States)

    Mattos E Dinato, Mauro Cesar; Freitas, Marcio de Faria; Milano, Cristiano; Valloto, Elcio; Ninomiya, André Felipe; Pagnano, Rodrigo Gonçalves

    The objective of the present study was to assess the reliability of 2 smartphone applications compared with the traditional goniometer technique for measurement of radiographic angles in hallux valgus and the time required for analysis with the different methods. The radiographs of 31 patients (52 feet) with a diagnosis of hallux valgus were analyzed. Four observers, 2 with >10 years' experience in foot and ankle surgery and 2 in-training surgeons, measured the hallux valgus angle and intermetatarsal angle using a manual goniometer technique and 2 smartphone applications (Hallux Angles and iPinPoint). The interobserver and intermethod reliability were estimated using intraclass correlation coefficients (ICCs), and the time required for measurement of the angles among the 3 methods was compared using the Friedman test. A very good or good interobserver reliability was found among the 4 observers measuring the hallux valgus angle and intermetatarsal angle using the goniometer (ICC 0.913 and 0.821, respectively) and iPinPoint (ICC 0.866 and 0.638, respectively). Using the Hallux Angles application, a very good interobserver reliability was found for measurements of the hallux valgus angle (ICC 0.962) and intermetatarsal angle (ICC 0.935) only among the more experienced observers. The time required for the measurements was significantly shorter for the measurements using both smartphone applications compared with the goniometer method. One smartphone application (iPinPoint) was reliable for measurements of the hallux valgus angles by either experienced or nonexperienced observers. The use of these tools might save time in the evaluation of radiographic angles in the hallux valgus. Copyright © 2016 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

  10. Reliability and validity of selected measures associated with increased fall risk in females over the age of 45 years with distal radius fracture - A pilot study.

    Science.gov (United States)

    Mehta, Saurabh P; MacDermid, Joy C; Richardson, Julie; MacIntyre, Norma J; Grewal, Ruby

    2015-01-01

    Clinical measurement. This study examined test-retest reliability and convergent/divergent construct validity of selected tests and measures that assess balance impairment, fear of falling (FOF), impaired physical activity (PA), and lower extremity muscle strength (LEMS) in females >45 years of age after the distal radius fracture (DRF) population. Twenty one female participants with DRF were assessed on two occasions. Timed Up and Go, Functional Reach, and One Leg Standing tests assessed balance impairment. Shortened Falls Efficacy Scale, Activity-specific Balance Confidence scale, and Fall Risk Perception Questionnaire assessed FOF. International Physical Activity Questionnaire and Rapid Assessment of Physical Activity were administered to assess PA level. Chair stand test and isometric muscle strength testing for hip and knee assessed LEMS. Intraclass correlation coefficients (ICC) examined the test-retest reliability of the measures. Pearson correlation coefficients (r) examined concurrent relationships between the measures. The results demonstrated fair to excellent test-retest reliability (ICC between 0.50 and 0.96) and low to moderate concordance between the measures (low if r ≤ 0.4; moderate if r = 0.4-0.7). The results provide preliminary estimates of test-retest reliability and convergent/divergent construct validity of selected measures associated with increased risk for falling in the females >45 years of age after DRF. Further research directions to advance knowledge regarding fall risk assessment in DRF population have been identified. Copyright © 2015 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

  11. Time-variant reliability assessment through equivalent stochastic process transformation

    International Nuclear Information System (INIS)

    Wang, Zequn; Chen, Wei

    2016-01-01

    Time-variant reliability measures the probability that an engineering system successfully performs intended functions over a certain period of time under various sources of uncertainty. In practice, it is computationally prohibitive to propagate uncertainty in time-variant reliability assessment based on expensive or complex numerical models. This paper presents an equivalent stochastic process transformation approach for cost-effective prediction of reliability deterioration over the life cycle of an engineering system. To reduce the high dimensionality, a time-independent reliability model is developed by translating random processes and time parameters into random parameters in order to equivalently cover all potential failures that may occur during the time interval of interest. With the time-independent reliability model, an instantaneous failure surface is attained by using a Kriging-based surrogate model to identify all potential failure events. To enhance the efficacy of failure surface identification, a maximum confidence enhancement method is utilized to update the Kriging model sequentially. Then, the time-variant reliability is approximated using Monte Carlo simulations of the Kriging model where system failures over a time interval are predicted by the instantaneous failure surface. The results of two case studies demonstrate that the proposed approach is able to accurately predict the time evolution of system reliability while requiring much less computational efforts compared with the existing analytical approach. - Highlights: • Developed a new approach for time-variant reliability analysis. • Proposed a novel stochastic process transformation procedure to reduce the dimensionality. • Employed Kriging models with confidence-based adaptive sampling scheme to enhance computational efficiency. • The approach is effective for handling random process in time-variant reliability analysis. • Two case studies are used to demonstrate the efficacy

  12. THE RELIABILITY AND ACCURACY OF THE TRIPLE MEASUREMENTS OF ANALOG PROCESS VARIABLES

    Directory of Open Access Journals (Sweden)

    V. A. Anishchenko

    2017-01-01

    Full Text Available The increase in unit capacity of electric equipment as well as complication of technological processes, devices control and management of the latter in power plants and substations demonstrate the need to improve the reliability and accuracy of measurement information characterizing the state of the objects being managed. The mentioned objective is particularly important for nuclear power plants, where the price of inaccuracy of measurement responsible process variables is particularly high and the error might lead to irreparable consequences. Improving the reliability and accuracy of measurements along with the improvement of the element base is provided by methods of operational validation. These methods are based on the use of information redundancy (structural, topological, temporal. In particular, information redundancy can be achieved by the simultaneous measurement of one analog variable by two (duplication or three devices (triplication i.e., triple redundancy. The problem of operational control of the triple redundant system of measurement of electrical analog variables (currents, voltages, active and reactive power and energy is considered as a special case of signal processing by an orderly sampling on the basis of majority transformation and transformation being close to majority one. Difficulties in monitoring the reliability of measurements are associated with the two tasks. First, one needs to justify the degree of truncation of the distributions of random errors of measurements and allowable residuals of the pairwise differences of the measurement results. The second task consists in formation of the algorithm of joint processing of a set of separate measurements determined as valid. The quality of control is characterized by the reliability, which adopted the synonym of validity, and accuracy of the measuring system. Taken separately, these indicators might lead to opposite results. A compromise solution is therefore proposed

  13. Reliability of ultrasound thickness measurement of the abdominal muscles during clinical isometric endurance tests.

    Science.gov (United States)

    ShahAli, Shabnam; Arab, Amir Massoud; Talebian, Saeed; Ebrahimi, Esmaeil; Bahmani, Andia; Karimi, Noureddin; Nabavi, Hoda

    2015-07-01

    The study was designed to evaluate the intra-examiner reliability of ultrasound (US) thickness measurement of abdominal muscles activity when supine lying and during two isometric endurance tests in subjects with and without Low back pain (LBP). A total of 19 women (9 with LBP, 10 without LBP) participated in the study. Within-day reliability of the US thickness measurements at supine lying and the two isometric endurance tests were assessed in all subjects. The intra-class correlation coefficient (ICC) was used to assess the relative reliability of thickness measurement. The standard error of measurement (SEM), minimal detectable change (MDC) and the coefficient of variation (CV) were used to evaluate the absolute reliability. Results indicated high ICC scores (0.73-0.99) and also small SEM and MDC scores for within-day reliability assessment. The Bland-Altman plots of agreement in US measurement of the abdominal muscles during the two isometric endurance tests demonstrated that 95% of the observations fall between the limits of agreement for test and retest measurements. Together the results indicate high intra-tester reliability for the US measurement of the thickness of abdominal muscles in all the positions tested. According to the study's findings, US imaging can be used as a reliable method for assessment of abdominal muscles activity in supine lying and the two isometric endurance tests employed, in participants with and without LBP. Copyright © 2014 Elsevier Ltd. All rights reserved.

  14. Reliability and validity of a dual-probe personal computer-based muscle viewer for measuring the pennation angle of the medial gastrocnemius muscle in patients who have had a stroke.

    Science.gov (United States)

    Cho, Ji-Eun; Cho, Ki Hun; Yoo, Jun Sang; Lee, Su Jin; Lee, Wan-Hee

    2018-01-01

    Background A dual-probe personal computer-based muscle viewer (DPC-BMW) is advantageous in that it is relatively lightweight and easy to apply. Objective To investigate the reliability and validity of the DPC-BMW in comparison with those of a portable ultrasonography (P-US) device for measuring the pennation angle of the medial gastrocnemius (MG) muscle at rest and during contraction. Methods Twenty-four patients who had a stroke (18 men and 6 women) participated in this study. Using the DPC-BMW and P-US device, the pennation angle of the MG muscle on the affected side was randomly measured. Two examiners randomly obtained the images of all the participants in two separate test sessions, 7 days apart. Intraclass correlation coefficient (ICC), confidence interval, standard error of measurement, Bland-Altman plot, and Pearson correlation coefficient were used to estimate their reliability and validity. Results The ICC for the intrarater reliability of the MG muscle pennation angle measured using the DPC-BMW was > 0.916, indicating excellent reliability, and that for the interrater reliability ranged from 0.964 to 0.994. The P-US device also exhibited good reliability. A high correlation was found between the measurements of MG muscle pennation angle obtained using the DPC-BMW and that obtained using the P-US device (p < 0.01). Conclusion The DPC-BMW can provide clear images for accurate measurements, including measurements using dual probes. It has the advantage of rehabilitative US imaging for individuals who have had a stroke. More research studies are needed to evaluate the usefulness of the DPC-BMW in rehabilitation.

  15. Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease.

    Science.gov (United States)

    Strouwen, Carolien; Molenaar, Esther A L M; Keus, Samyra H J; Münks, Liesbeth; Bloem, Bastiaan R; Nieuwboer, Alice

    2016-08-01

    Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains largely unknown. The purpose of this study was to assess the reliability of DT outcome measures in patients with PD. A repeated-measures design was used. Patients with PD ("on" medication, Mini-Mental State Examination score ≥24) performed 2 cognitive tasks (ie, backward digit span task and auditory Stroop task) and 1 functional task (ie, mobile phone task) in combination with walking. Tasks were assessed at 2 time points (same hour) with an interval of 6 weeks. Test-retest reliability was assessed for gait while performing each secondary task (DT gait) for both cognitive tasks while walking (DT cognitive) and for the functional task while walking (DT functional). Sixty-two patients with PD (age=39-89 years, Hoehn and Yahr stages II-III) were included in the study. Intraclass correlation coefficients (ICCs) showed excellent reliability for DT gait measures, ranging between .86 and .95 when combined with the digit span task, between .86 and .95 when combined with the auditory Stroop task, and between .72 and .90 when combined with the mobile phone task. The standard error of measurements for DT gait speed varied between 0.06 and 0.08 m/s, leading to minimal detectable changes between 0.16 and 0.22 m/s. With regard to DT cognitive measures, reaction times showed good-to-excellent reliability (digit span task: ICC=.75; auditory Stroop task: ICC=.82). The results cannot be generalized to patients with advanced disease or to other DT measures. In people with PD, DT measures proved to be reliable for use in clinical studies and look promising for use in clinical practice to assess improvements after DT training. Large effects, however, are needed to obtain meaningful effect sizes.

  16. The Reliability of Lumbar Lordosis Measurements Using a Flexible-Rule.

    Science.gov (United States)

    The purpose of this study was to examine the intra-rater and intra-rater reliability of lumbar lordosis measurements taken with a flexible-rule. Two...coefficients (ICC) were used to determine the degree of agreement between measurements. The results suggest that measurements of lumbar lordosis with a

  17. Reliability technology and nuclear power

    International Nuclear Information System (INIS)

    Garrick, B.J.; Kaplan, S.

    1976-01-01

    This paper reviews some of the history and status of nuclear reliability and the evolution of this subject from art towards science. It shows that that probability theory is the appropriate and essential mathematical language of this subject. The authors emphasize that it is more useful to view probability not as a $prime$frequency$prime$, i.e., not as the result of a statistical experiment, but rather as a measure of state of confidence or a state of knowledge. They also show that the probabilistic, quantitative approach has a considerable history of application in the electric power industry in the area of power system planning. Finally, the authors show that the decision theory notion of utility provides a point of view from which risks, benefits, safety, and reliability can be viewed in a unified way thus facilitating understanding, comparison, and communication. 29 refs

  18. Reliability and validity of the AutoCAD software method in lumbar lordosis measurement.

    Science.gov (United States)

    Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe

    2011-12-01

    The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis.

  19. A test chip for automatic reliability measurements of interconnect vias

    NARCIS (Netherlands)

    Lippe, K.; Hasper, A.; Elfrink, G.W.; Niehof, J.; Kerkhoff, Hans G.

    1992-01-01

    A test circuit for electromigration reliability measurements was designed and tested. The device under test (DUT) is a via-hole chain. The test circuit permits simultaneous measurements of a number of DUTs, and a fatal error of one DUT does not influence the measurement results of the other DUTs.

  20. Using LISREL to Evaluate Measurement Models and Scale Reliability.

    Science.gov (United States)

    Fleishman, John; Benson, Jeri

    1987-01-01

    LISREL program was used to examine measurement model assumptions and to assess reliability of Coopersmith Self-Esteem Inventory for Children, Form B. Data on 722 third-sixth graders from over 70 schools in large urban school district were used. LISREL program assessed (1) nature of basic measurement model for scale, (2) scale invariance across…

  1. Tactile acuity charts: a reliable measure of spatial acuity.

    Directory of Open Access Journals (Sweden)

    Patrick Bruns

    Full Text Available For assessing tactile spatial resolution it has recently been recommended to use tactile acuity charts which follow the design principles of the Snellen letter charts for visual acuity and involve active touch. However, it is currently unknown whether acuity thresholds obtained with this newly developed psychophysical procedure are in accordance with established measures of tactile acuity that involve passive contact with fixed duration and control of contact force. Here we directly compared tactile acuity thresholds obtained with the acuity charts to traditional two-point and grating orientation thresholds in a group of young healthy adults. For this purpose, two types of charts, using either Braille-like dot patterns or embossed Landolt rings with different orientations, were adapted from previous studies. Measurements with the two types of charts were equivalent, but generally more reliable with the dot pattern chart. A comparison with the two-point and grating orientation task data showed that the test-retest reliability of the acuity chart measurements after one week was superior to that of the passive methods. Individual thresholds obtained with the acuity charts agreed reasonably with the grating orientation threshold, but less so with the two-point threshold that yielded relatively distinct acuity estimates compared to the other methods. This potentially considerable amount of mismatch between different measures of tactile acuity suggests that tactile spatial resolution is a complex entity that should ideally be measured with different methods in parallel. The simple test procedure and high reliability of the acuity charts makes them a promising complement and alternative to the traditional two-point and grating orientation thresholds.

  2. Reliability of surface EMG measurements from the suprahyoid muscle complex

    DEFF Research Database (Denmark)

    Kothari, Mohit; Stubbs, Peter William; Pedersen, Asger Roer

    2017-01-01

    of using the suprahyoid muscle complex (SMC) using surface electromyography (sEMG) to assess changes to neural pathways by determining the reliability of measurements in healthy participants over days. Methods: Seventeen healthy participants were recruited. Measurements were performed twice with one week...... on stimulus type/intensity) had significantly different MEP values between day 1 and day 2 for single pulse and paired pulse TMS. A large stimulus artefact resulted in MEP responses that could not be assessed in four participants. Conclusions: The assessment of the SMC using sEMG following TMS was poorly...... reliable for ≈50% of participants. Although using sEMG to assess swallowing musculature function is easier to perform clinically and more comfortable to patients than invasive measures, as the measurement of muscle activity using TMS is unreliable, the use of sEMG for this muscle group is not recommended...

  3. How to reliably deliver narrow individual-patient error bars for optimization of pacemaker AV or VV delay using a "pick-the-highest" strategy with haemodynamic measurements.

    Science.gov (United States)

    Francis, Darrel P

    2013-03-10

    Intuitive and easily-described, "pick-the-highest" is often recommended for quantitative optimization of AV and especially VV delay settings of biventricular pacemakers (BVP; cardiac resynchronization therapy, CRT). But reliable selection of the optimum setting is challenged by beat-to-beat physiological variation, which "pick-the-highest" combats by averaging multiple heartbeats. Optimization is not optimization unless the optimum is identified confidently. This document shows how to calculate how many heartbeats must be averaged to optimize reliably by pick-the-highest. Any reader, by conducting a few measurements, can calculate for locally-available methods (i) biological scatter between replicate measurements, and (ii) curvature of the biological response. With these, for any clinically-desired precision of optimization, the necessary number of heartbeats can be calculated. To achieve 95% confidence of getting within ±∆x of the true optimum, the number of heartbeats needed is 2(scatter/curvature)(2)/∆x(4) per setting. Applying published scatter/curvature values (which readers should re-evaluate locally) indicates that optimizing AV, even coarsely with a 40ms-wide band of precision, requires many thousand beats. For VV delay, the number approaches a million. Moreover, identifying the optimum twice as precisely requires 30-fold more beats. "Pick the highest" is quick to say but slow to do. We must not expect staff to do the impossible; nor criticise them for not doing so. Nor should we assume recommendations and published protocols are well-designed. Reliable AV or VV optimization, using "pick-the-highest" on commonly-recommended manual measurements, is unrealistic. Improving time-efficiency of the optimization process to become clinically realistic may need a curve-fitting strategy instead, with all acquired data marshalled conjointly. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  4. Recommendations for certification or measurement of reliability for reliable digital archival repositories with emphasis on access

    Directory of Open Access Journals (Sweden)

    Paula Regina Ventura Amorim Gonçalez

    2017-04-01

    Full Text Available Introduction: Considering the guidelines of ISO 16363: 2012 (Space data and information transfer systems -- Audit and certification of trustworthy digital repositories and the text of CONARQ Resolution 39 for certification of Reliable Digital Archival Repository (RDC-Arq, verify the technical recommendations should be used as the basis for a digital archival repository to be considered reliable. Objective: Identify requirements for the creation of Reliable Digital Archival Repositories with emphasis on access to information from the ISO 16363: 2012 and CONARQ Resolution 39. Methodology: For the development of the study, the methodology consisted of an exploratory, descriptive and documentary theoretical investigation, since it is based on ISO 16363: 2012 and CONARQ Resolution 39. From the perspective of the problem approach, the study is qualitative and quantitative, since the data were collected, tabulated, and analyzed from the interpretation of their contents. Results: We presented a set of Checklist Recommendations for reliability measurement and/or certification for RDC-Arq with a clipping focused on the identification of requirements with emphasis on access to information is presented. Conclusions: The right to information as well as access to reliable information is a premise for Digital Archival Repositories, so the set of recommendations is directed to archivists who work in Digital Repositories and wish to verify the requirements necessary to evaluate the reliability of the Digital Repository or still guide the information professional in collecting requirements for repository reliability certification.

  5. Reliability and mechanical design

    International Nuclear Information System (INIS)

    Lemaire, Maurice

    1997-01-01

    A lot of results in mechanical design are obtained from a modelisation of physical reality and from a numerical solution which would lead to the evaluation of needs and resources. The goal of the reliability analysis is to evaluate the confidence which it is possible to grant to the chosen design through the calculation of a probability of failure linked to the retained scenario. Two types of analysis are proposed: the sensitivity analysis and the reliability analysis. Approximate methods are applicable to problems related to reliability, availability, maintainability and safety (RAMS)

  6. Targeting Low Career Confidence Using the Career Planning Confidence Scale

    Science.gov (United States)

    McAuliffe, Garrett; Jurgens, Jill C.; Pickering, Worth; Calliotte, James; Macera, Anthony; Zerwas, Steven

    2006-01-01

    The authors describe the development and validation of a test of career planning confidence that makes possible the targeting of specific problem issues in employment counseling. The scale, developed using a rational process and the authors' experience with clients, was tested for criterion-related validity against 2 other measures. The scale…

  7. Semiconductor measurement technology: reliability technology for cardiac pacemakers 2: a workshop report, 1976

    International Nuclear Information System (INIS)

    Schafft, H.A.

    1977-01-01

    Summaries are presented of 12 invited talks on the following topics: the procurement and assurance of high reliability electronic parts, leak rate and moisture measurements, pacemaker batteries, and pacemaker leads. The workshop, second in a series, was held in response to strong interest expressed by the pacemaker community to address technical questions relevant to the enhancement and assurance of cardiac pacemaker reliability. Discussed at the workshop were a process validation wafer concept for assuring process uniformity in device chips; screen tests for assuring reliable electronic parts; reliability prediction; reliability comparison of semiconductor technologies; mechanisms of short-circuiting dendritic growths; details of helium and radioisotope leak test methods; a study to correlate package leak rates, as measured with test gasses, and actual moisture infusion; battery life prediction; microcalorimetric measurements to nondestructively evaluate batteries for pacemakers; and an engineer's and a physician's view of the present status of pacemaker leads. References are included with most of the reports

  8. Maximum-confidence discrimination among symmetric qudit states

    International Nuclear Information System (INIS)

    Jimenez, O.; Solis-Prosser, M. A.; Delgado, A.; Neves, L.

    2011-01-01

    We study the maximum-confidence (MC) measurement strategy for discriminating among nonorthogonal symmetric qudit states. Restricting to linearly dependent and equally likely pure states, we find the optimal positive operator valued measure (POVM) that maximizes our confidence in identifying each state in the set and minimizes the probability of obtaining inconclusive results. The physical realization of this POVM is completely determined and it is shown that after an inconclusive outcome, the input states may be mapped into a new set of equiprobable symmetric states, restricted, however, to a subspace of the original qudit Hilbert space. By applying the MC measurement again onto this new set, we can still gain some information about the input states, although with less confidence than before. This leads us to introduce the concept of sequential maximum-confidence (SMC) measurements, where the optimized MC strategy is iterated in as many stages as allowed by the input set, until no further information can be extracted from an inconclusive result. Within each stage of this measurement our confidence in identifying the input states is the highest possible, although it decreases from one stage to the next. In addition, the more stages we accomplish within the maximum allowed, the higher will be the probability of correct identification. We will discuss an explicit example of the optimal SMC measurement applied in the discrimination among four symmetric qutrit states and propose an optical network to implement it.

  9. Confidence assessment. Site-descriptive modelling SDM-Site Laxemar

    International Nuclear Information System (INIS)

    2009-06-01

    independent data from different disciplines. While some aspects have lower confidence this lack of confidence is handled by providing wider uncertainty ranges, bounding estimates and/or alternative models to repository engineering and long term safety assessment. It is judged that most, of the low confidence aspects have little impact on repository engineering design or for long-term safety. It may also be noted that the feedback requirements from SR-Can to the site modelling are now met in the completed site investigations, subject to levels of uncertainty that are viewed as acceptable. Only a few data points and a few types of data have been omitted from the modelling, mainly because they are judged less relevant and reliable than the data considered. Inclusion of data from outside the Laxemar subarea might have enhanced confidence in the regional model, but only at the locations of the data and these changes in confidence would have been of little significance in relation to implications for the local model area and would not, therefore, have been of any real significance to design or safety assessment. These omissions are judged to have little or no negative impact on confidence in the Laxemar subarea model. In fact, identification of unreliable data and their elimination should have a positive effect on confidence. Poor precision in the measured data is judged to have a limited impact on uncertainties in the site descriptive model, with the exceptions of interpretation and combination of borehole and outcrop fracture data and general uncertainties in sorption data

  10. Confidence assessment. Site-descriptive modelling SDM-Site Laxemar

    Energy Technology Data Exchange (ETDEWEB)

    2008-12-15

    independent data from different disciplines. While some aspects have lower confidence this lack of confidence is handled by providing wider uncertainty ranges, bounding estimates and/or alternative models to repository engineering and long term safety assessment. It is judged that most, of the low confidence aspects have little impact on repository engineering design or for long-term safety. It may also be noted that the feedback requirements from SR-Can to the site modelling are now met in the completed site investigations, subject to levels of uncertainty that are viewed as acceptable. Only a few data points and a few types of data have been omitted from the modelling, mainly because they are judged less relevant and reliable than the data considered. Inclusion of data from outside the Laxemar subarea might have enhanced confidence in the regional model, but only at the locations of the data and these changes in confidence would have been of little significance in relation to implications for the local model area and would not, therefore, have been of any real significance to design or safety assessment. These omissions are judged to have little or no negative impact on confidence in the Laxemar subarea model. In fact, identification of unreliable data and their elimination should have a positive effect on confidence. Poor precision in the measured data is judged to have a limited impact on uncertainties in the site descriptive model, with the exceptions of interpretation and combination of borehole and outcrop fracture data and general uncertainties in sorption data

  11. TEST-RETEST RELIABILITY OF THE CLOSED KINETIC CHAIN UPPER EXTREMITY STABILITY TEST (CKCUEST) IN ADOLESCENTS: RELIABILITY OF CKCUEST IN ADOLESCENTS.

    Science.gov (United States)

    de Oliveira, Valéria M A; Pitangui, Ana C R; Nascimento, Vinícius Y S; da Silva, Hítalo A; Dos Passos, Muana H P; de Araújo, Rodrigo C

    2017-02-01

    The Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) has been proposed as an option to assess upper limb function and stability; however, there are few studies that support the use of this test in adolescents. The purpose of the present study was to investigate the intersession reliability and agreement of three CKCUEST scores in adolescents and establish clinimetric values for this test. Test-retest reliability. Twenty-five healthy adolescents of both sexes were evaluated. The subjects performed two CKCUEST with an interval of one week between the tests. An intraclass correlation coefficient (ICC 3,3 ) two-way mixed model with a 95% interval of confidence was utilized to determine intersession reliability. A Bland-Altman graph was plotted to analyze the agreement between assessments. The presence of systematic error was evaluated by a one-sample t test. The difference between the evaluation and reevaluation was observed using a paired-sample t test. The level of significance was set at 0.05. Standard error of measurements and minimum detectable changes were calculated. The intersession reliability of the average touches score, normalized score, and power score were 0.68, 0.68 and 0.87, the standard error of measurement were 2.17, 1.35 and 6.49, and the minimal detectable change was 6.01, 3.74 and 17.98, respectively. The presence of systematic error (p test with moderate to excellent reliability when used with adolescents. The CKCUEST is a measurement with moderate to excellent reliability for adolescents. 2b.

  12. Test-Retest Reliability of Dual-Task Outcome Measures in People With Parkinson Disease

    NARCIS (Netherlands)

    Strouwen, C.; Molenaar, E.A.; Keus, S.H.; Munks, L.; Bloem, B.R.; Nieuwboer, A.

    2016-01-01

    BACKGROUND: Dual-task (DT) training is gaining ground as a physical therapy intervention in people with Parkinson disease (PD). Future studies evaluating the effect of such interventions need reliable outcome measures. To date, the test-retest reliability of DT measures in patients with PD remains

  13. Reliability analysis for manual radiographic measures of rotatory subluxation or lateral listhesis in adult scoliosis.

    Science.gov (United States)

    Freedman, Brett A; Horton, William C; Rhee, John M; Edwards, Charles C; Kuklo, Timothy R

    2009-03-15

    Retrospective observational study. To define the inter- and intraobserver reliability of 3 measures of rotatory subluxation (RS) in adult scoliosis (AS). RS is a hallmark of AS. To accurately track this measure, one must know its reliability. Reliability testing has not been performed. PA 36" films of 29 AS patients were collected from one surgeon's practice. Three observers on 2 separate occasions measured all levels with >or=3-mm RS (60 levels, 360 measurements) on the convexity of the involved segment using 3 different techniques-midbody (MB), endplate (EP), and centroid (C). These data were then analyzed to determine the intraclass correlation coefficient (ICC) for inter- and intraobserver reliability. The thoracolumbar/lumbar curve (average 58 degrees ) was the major curve for the majority (62%) of patients. RS at L3/4 was most common (35%). The overall inter- and intraobserver reliability was good-excellent for all methods, but the centroid method consistently had the highest ICC. ICC correlated with observer experience. Moderate-severe arthritic change (present in 55%) and poor image quality (52%) decreased ICC, but it still remained good-excellent for each measure. The reproducibility coefficient for each measure was 4 mm for MB and 2.8 mm for C and EP. MB, EP, and C are reliable techniques to measure RS even in elderly arthritic spines, but the methods inherently produce different values for a given level. The centroid method is most reliable and least influenced by experience. The EP method is easy to perform and very reliable. Spine surgeons should pick their preferred method and apply it consistently. Changes >3 mm suggest RS progression. RS may be a useful measure in addition to Cobb angle in AS. Having defined measurement reliability, the role of RS progression in surgical indications and patient outcomes can be evaluated.

  14. A new measurement of workload in Web application reliability assessment

    Directory of Open Access Journals (Sweden)

    CUI Xia

    2015-02-01

    Full Text Available Web application has been popular in various fields of social life.It becomes more and more important to study the reliability of Web application.In this paper the definition of Web application failure is firstly brought out,and then the definition of Web application reliability.By analyzing data in the IIS server logs and selecting corresponding usage and information delivery failure data,the paper study the feasibility of Web application reliability assessment from the perspective of Web software system based on IIS server logs.Because the usage for a Web site often has certain regularity,a new measurement of workload in Web application reliability assessment is raised.In this method,the unit is removed by weighted average technique;and the weights are assessed by setting objective function and optimization.Finally an experiment was raised for validation.The experiment result shows the assessment of Web application reliability base on the new workload is better.

  15. Reliability of in-Shoe Plantar Pressure Measurements in Rheumatoid Arthritis Patients

    Science.gov (United States)

    Vidmar, Gaj; Novak, Primoz

    2009-01-01

    Plantar pressures measurement is a frequently used method in rehabilitation and related research. Metric characteristics of the F-Scan system have been assessed from different standpoints and in different patients, but not its reliability in rheumatoid arthritis patients. Therefore, our objective was to assess reliability of the F-Scan plantar…

  16. Inter- and intra-observer reliability of masking in plantar pressure measurement analysis.

    Science.gov (United States)

    Deschamps, K; Birch, I; Mc Innes, J; Desloovere, K; Matricali, G A

    2009-10-01

    Plantar pressure measurement is an important tool in gait analysis. Manual placement of small masks (masking) is increasingly used to calculate plantar pressure characteristics. Little is known concerning the reliability of manual masking. The aim of this study was to determine the reliability of masking on 2D plantar pressure footprints, in a population with forefoot deformity (i.e. hallux valgus). Using a random repeated-measure design, four observers identified the third metatarsal head on a peak-pressure barefoot footprint, using a small mask. Subsequently, the location of all five metatarsal heads was identified, using the same size of masks and the same protocol. The 2D positional variation of the masks and the peak pressure (PP) and pressure time integral (PTI) values of each mask were calculated. For single-masking the lowest inter-observer reliability was found for the distal-proximal direction, causing a clear, adverse impact on the reliability of the pressure characteristics (PP and PTI). In the medial-lateral direction the inter-observer reliability could be scored as high. Intra-observer reliability was better and could be scored as high or good for both directions, with a correlated improved reliability of the pressure characteristics. Reliability of multi-masking showed a similar pattern, but overall values tended to be lower. Therefore, small sized masking in order to define pressure characteristics in the forefoot should be done with care.

  17. Reliability of routine clinical measurements of neonatal circumferences and research measurements of neonatal skinfold thicknesses: findings from the Born in Bradford study

    Science.gov (United States)

    West, Jane; Manchester, Ben; Wright, John; Lawlor, Debbie A; Waiblinger, Dagmar

    2011-01-01

    Summary West J, Manchester B, Wright J, Lawlor DA, Waiblinger D. Reliability of routine clinical measurements of neonatal circumferences and research measurements of neonatal skinfold thicknesses: findings from the Born in Bradford study. Paediatric and Perinatal Epidemiology 2011. Assessing neonatal size reliably is important for research and clinical practice. The aim of this study was to examine the reliability of routine clinical measurements of neonatal circumferences and of skinfold thicknesses assessed for research purposes. All measurements were undertaken on the same population of neonates born in a large maternity unit in Bradford, UK. Technical error of measurement (TEM), relative TEM and the coefficient of reliability are reported. Intra-observer TEMs for routine circumference measurements were all below 0.4 cm and were generally within ±2-times the mean. Inter-observer TEM ranged from 0.20 to 0.36 cm for head circumference, 0.19 to 0.39 cm for mid upper arm circumference and from 0.39 to 0.77 cm for abdominal circumference. Intra and inter-observer TEM for triceps skinfold thickness ranged from 0.22 to 0.35 mm and 0.15 to 0.54 mm, respectively. Subscapular skinfold thickness TEM values were 0.14 to 0.25 mm for intra-observer measurements and 0.17 to 0.63 mm for inter-observer measurements. Relative TEM values for routine circumferences were all below 4.00% but varied between 2.88% and 14.23% for research skinfold measurements. Reliability was mostly between 80% and 99% for routine circumference measurements and ≥70% for most research skinfold measurements. Routine clinical measurements of neonatal circumferences are reliably assessed in Bradford. Assessing skinfolds in neonates has variable reliability, but on the whole is good. The greater intra-observer, compared with inter-observer, reliability for both sets of measurements highlights the importance of having a minimal number of assessors whenever possible. PMID:21281329

  18. Reliability and validity of non-radiographic methods of thoracic kyphosis measurement: a systematic review.

    Science.gov (United States)

    Barrett, Eva; McCreesh, Karen; Lewis, Jeremy

    2014-02-01

    A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.

  19. The gothic arch: a reliable measurement for developmental dysplasia of the hip.

    Science.gov (United States)

    Herickhoff, Paul K; O'Brien, Megan K; Dolan, Lori A; Morcuende, Jose A; Peterson, Jonathan B; Weinstein, Stuart L

    2013-01-01

    The "Gothic Arch" is a radio-graphic finding on AP pelvis x-rays postulated to be predictive of hip osteoarthritis. The purpose of this study was to determine the reliability of measurement of the Gothic Arch in patients with no known hip pathology and patients with unilateral developmental dysplasia of the hip (DDH). After obtaining IRB approval, nine skeletally mature patients (18 hips) with no known hip pathology were selected to serve as the control group. The AP pelvis x-rays at skeletal maturity of eight patients (16 hips) with unilateral DDH treated with closed reduction and casting comprised the comparison group. A digitizing program was designed to measure the Gothic Arch based on landmarks identified by the user. Two pediatric orthopaedic surgeons and two orthopaedic residents completed the program on two separate occasions. Intra-and interobserver reliability were determined using intraclass cor-relation coefficients (ICC) for continuous variables. Both the unilateral DDH group and the control group demonstrated excellent inter- and intraobserver reliability (ICC >0.70) for base, height, area, and orientation of the Gothic Arch, but poor reliability (ICC Gothic Arch can be reliably measured on AP pelvis x-rays of patients with normal and dysplastic hips. III, Diagnostic study. See the Guidelines for Authors for a complete description of levels of evidence.

  20. Validity and reliability of food security measures.

    Science.gov (United States)

    Cafiero, Carlo; Melgar-Quiñonez, Hugo R; Ballard, Terri J; Kepple, Anne W

    2014-12-01

    This paper reviews some of the existing food security indicators, discussing the validity of the underlying concept and the expected reliability of measures under reasonably feasible conditions. The main objective of the paper is to raise awareness on existing trade-offs between different qualities of possible food security measurement tools that must be taken into account when such tools are proposed for practical application, especially for use within an international monitoring framework. The hope is to provide a timely, useful contribution to the process leading to the definition of a food security goal and the associated monitoring framework within the post-2015 Development Agenda. © 2014 New York Academy of Sciences.

  1. Effect of rater training on reliability and accuracy of mini-CEX scores: a randomized, controlled trial.

    Science.gov (United States)

    Cook, David A; Dupras, Denise M; Beckman, Thomas J; Thomas, Kris G; Pankratz, V Shane

    2009-01-01

    Mini-CEX scores assess resident competence. Rater training might improve mini-CEX score interrater reliability, but evidence is lacking. Evaluate a rater training workshop using interrater reliability and accuracy. Randomized trial (immediate versus delayed workshop) and single-group pre/post study (randomized groups combined). Academic medical center. Fifty-two internal medicine clinic preceptors (31 randomized and 21 additional workshop attendees). The workshop included rater error training, performance dimension training, behavioral observation training, and frame of reference training using lecture, video, and facilitated discussion. Delayed group received no intervention until after posttest. Mini-CEX ratings at baseline (just before workshop for workshop group), and four weeks later using videotaped resident-patient encounters; mini-CEX ratings of live resident-patient encounters one year preceding and one year following the workshop; rater confidence using mini-CEX. Among 31 randomized participants, interrater reliabilities in the delayed group (baseline intraclass correlation coefficient [ICC] 0.43, follow-up 0.53) and workshop group (baseline 0.40, follow-up 0.43) were not significantly different (p = 0.19). Mean ratings were similar at baseline (delayed 4.9 [95% confidence interval 4.6-5.2], workshop 4.8 [4.5-5.1]) and follow-up (delayed 5.4 [5.0-5.7], workshop 5.3 [5.0-5.6]; p = 0.88 for interaction). For the entire cohort, rater confidence (1 = not confident, 6 = very confident) improved from mean (SD) 3.8 (1.4) to 4.4 (1.0), p = 0.018. Interrater reliability for ratings of live encounters (entire cohort) was higher after the workshop (ICC 0.34) than before (ICC 0.18) but the standard error of measurement was similar for both periods. Rater training did not improve interrater reliability or accuracy of mini-CEX scores. clinicaltrials.gov identifier NCT00667940

  2. Characterizing reliability in a product/process design-assurance program

    Energy Technology Data Exchange (ETDEWEB)

    Kerscher, W.J. III [Delphi Energy and Engine Management Systems, Flint, MI (United States); Booker, J.M.; Bement, T.R.; Meyer, M.A. [Los Alamos National Lab., NM (United States)

    1997-10-01

    Over the years many advancing techniques in the area of reliability engineering have surfaced in the military sphere of influence, and one of these techniques is Reliability Growth Testing (RGT). Private industry has reviewed RGT as part of the solution to their reliability concerns, but many practical considerations have slowed its implementation. It`s objective is to demonstrate the reliability requirement of a new product with a specified confidence. This paper speaks directly to that objective but discusses a somewhat different approach to achieving it. Rather than conducting testing as a continuum and developing statistical confidence bands around the results, this Bayesian updating approach starts with a reliability estimate characterized by large uncertainty and then proceeds to reduce the uncertainty by folding in fresh information in a Bayesian framework.

  3. Insightful practice: a reliable measure for medical revalidation

    Science.gov (United States)

    Guthrie, Bruce; Sullivan, Frank M; Mercer, Stewart W; Russell, Andrew; Bruce, David A

    2012-01-01

    Background Medical revalidation decisions need to be reliable if they are to reassure on the quality and safety of professional practice. This study tested an innovative method in which general practitioners (GPs) were assessed on their reflection and response to a set of externally specified feedback. Setting and participants 60 GPs and 12 GP appraisers in the Tayside region of Scotland, UK. Methods A feedback dataset was specified as (1) GP-specific data collected by GPs themselves (patient and colleague opinion; open book self-evaluated knowledge test; complaints) and (2) Externally collected practice-level data provided to GPs (clinical quality and prescribing safety). GPs' perceptions of whether the feedback covered UK General Medical Council specified attributes of a ‘good doctor’ were examined using a mapping exercise. GPs' professionalism was examined in terms of appraiser assessment of GPs' level of insightful practice, defined as: engagement with, insight into and appropriate action on feedback data. The reliability of assessment of insightful practice and subsequent recommendations on GPs' revalidation by face-to-face and anonymous assessors were investigated using Generalisability G-theory. Main outcome measures Coverage of General Medical Council attributes by specified feedback and reliability of assessor recommendations on doctors' suitability for revalidation. Results Face-to-face assessment proved unreliable. Anonymous global assessment by three appraisers of insightful practice was highly reliable (G=0.85), as were revalidation decisions using four anonymous assessors (G=0.83). Conclusions Unlike face-to-face appraisal, anonymous assessment of insightful practice offers a valid and reliable method to decide GP revalidation. Further validity studies are needed. PMID:22653078

  4. Distinguishing highly confident accurate and inaccurate memory: insights about relevant and irrelevant influences on memory confidence.

    Science.gov (United States)

    Chua, Elizabeth F; Hannula, Deborah E; Ranganath, Charan

    2012-01-01

    It is generally believed that accuracy and confidence in one's memory are related, but there are many instances when they diverge. Accordingly it is important to disentangle the factors that contribute to memory accuracy and confidence, especially those factors that contribute to confidence, but not accuracy. We used eye movements to separately measure fluent cue processing, the target recognition experience, and relative evidence assessment on recognition confidence and accuracy. Eye movements were monitored during a face-scene associative recognition task, in which participants first saw a scene cue, followed by a forced-choice recognition test for the associated face, with confidence ratings. Eye movement indices of the target recognition experience were largely indicative of accuracy, and showed a relationship to confidence for accurate decisions. In contrast, eye movements during the scene cue raised the possibility that more fluent cue processing was related to higher confidence for both accurate and inaccurate recognition decisions. In a second experiment we manipulated cue familiarity, and therefore cue fluency. Participants showed higher confidence for cue-target associations for when the cue was more familiar, especially for incorrect responses. These results suggest that over-reliance on cue familiarity and under-reliance on the target recognition experience may lead to erroneous confidence.

  5. Efficiency criteria for high reliability measured system structures

    International Nuclear Information System (INIS)

    Sal'nikov, N.L.

    2012-01-01

    The procedures of structural redundancy are usually used to develop high reliability measured systems. To estimate efficiency of such structures the criteria to compare different systems has been developed. So it is possible to develop more exact system by inspection of redundant system data unit stochastic characteristics in accordance with the developed criteria [ru

  6. Reliability generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R).

    Science.gov (United States)

    Herrington, Hayley M; Smith, Timothy B; Feinauer, Erika; Griner, Derek

    2016-10-01

    [Correction Notice: An Erratum for this article was reported in Vol 63(5) of Journal of Counseling Psychology (see record 2016-33161-001). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected.] Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  7. Characterization of perovskite solar cells: Towards a reliable measurement protocol

    Directory of Open Access Journals (Sweden)

    Eugen Zimmermann

    2016-09-01

    Full Text Available Lead halide perovskite solar cells have shown a tremendous rise in power conversion efficiency with reported record efficiencies of over 20% making this material very promising as a low cost alternative to conventional inorganic solar cells. However, due to a differently severe “hysteretic” behaviour during current density-voltage measurements, which strongly depends on scan rate, device and measurement history, preparation method, device architecture, etc., commonly used solar cell measurements do not give reliable or even reproducible results. For the aspect of commercialization and the possibility to compare results of different devices among different laboratories, it is necessary to establish a measurement protocol which gives reproducible results. Therefore, we compare device characteristics derived from standard current density-voltage measurements with stabilized values obtained from an adaptive tracking of the maximum power point and the open circuit voltage as well as characteristics extracted from time resolved current density-voltage measurements. Our results provide insight into the challenges of a correct determination of device performance and propose a measurement protocol for a reliable characterisation which is easy to implement and has been tested on varying perovskite solar cells fabricated in different laboratories.

  8. Improving Metrological Reliability of Information-Measuring Systems Using Mathematical Modeling of Their Metrological Characteristics

    Science.gov (United States)

    Kurnosov, R. Yu; Chernyshova, T. I.; Chernyshov, V. N.

    2018-05-01

    The algorithms for improving the metrological reliability of analogue blocks of measuring channels and information-measuring systems are developed. The proposed algorithms ensure the optimum values of their metrological reliability indices for a given analogue circuit block solution.

  9. Comparison of reliability and responsiveness of patient-reported clinical outcome measures in knee osteoarthritis rehabilitation.

    Science.gov (United States)

    Williams, Valerie J; Piva, Sara R; Irrgang, James J; Crossley, Chad; Fitzgerald, G Kelley

    2012-08-01

    Secondary analysis, pretreatment-posttreatment observational study. To compare the reliability and responsiveness of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), the Knee Outcome Survey activities of daily living subscale (KOS-ADL), and the Lower Extremity Functional Scale (LEFS) in individuals with knee osteoarthritis (OA). The WOMAC is the current standard in patient-reported measures of function in patients with knee OA. The KOS-ADL and LEFS were designed for potential use in patients with knee OA. If the KOS-ADL and LEFS are to be considered viable alternatives to the WOMAC for measuring patient-reported function in individuals with knee OA, they should have measurement properties comparable to the WOMAC. It would also be important to determine whether either of these instruments may be superior to the WOMAC in terms of reliability or responsiveness in this population. Data from 168 subjects with knee OA, who participated in a rehabilitation program, were used in the analyses. Reliability and responsiveness of each outcome measure were estimated at follow-ups of 2, 6, and 12 months. Reliability was estimated by calculating the intraclass correlation coefficient (ICC2,1) for subjects who were unchanged in status from baseline at each follow-up time, based on a global rating of change score. To examine responsiveness, the standard error of the measurement, minimal detectable change, minimal clinically important difference, and the Guyatt responsiveness index were calculated for each outcome measure at each follow-up time. All 3 outcome measures demonstrated reasonable reliability and responsiveness to change. Reliability and responsiveness tended to decrease somewhat with increasing follow-up time. There were no substantial differences between outcome measures for reliability or any of the 3 measures of responsiveness at any follow-up time. The results do not indicate that one outcome measure is more reliable or responsive than

  10. Modeling reliability measurement of interface on information system: Towards the forensic of rules

    Science.gov (United States)

    Nasution, M. K. M.; Sitompul, Darwin; Harahap, Marwan

    2018-02-01

    Today almost all machines depend on the software. As a software and hardware system depends also on the rules that are the procedures for its use. If the procedure or program can be reliably characterized by involving the concept of graph, logic, and probability, then regulatory strength can also be measured accordingly. Therefore, this paper initiates an enumeration model to measure the reliability of interfaces based on the case of information systems supported by the rules of use by the relevant agencies. An enumeration model is obtained based on software reliability calculation.

  11. Reliability assessment for thickness measurements of pipe wall using probability of detection

    International Nuclear Information System (INIS)

    Nakamoto, Hiroyuki; Kojima, Fumio; Kato, Sho

    2013-01-01

    This paper proposes a reliability assessment method for thickness measurements of pipe wall using probability of detection (POD). Thicknesses of pipes are measured by qualified inspectors with ultrasonic thickness gauges. The inspection results are affected by human factors of the inspectors and include some errors, because the inspectors have different experiences and frequency of inspections. In order to ensure reliability for inspection results, first, POD evaluates experimental results of pipe-wall thickness inspection. We verify that the results have differences depending on inspectors including qualified inspectors. Second, two human factors that affect POD are indicated. Finally, it is confirmed that POD can identify the human factors and ensure reliability for pipe-wall thickness inspections. (author)

  12. Inter-expert and intra-expert reliability in sleep spindle scoring

    DEFF Research Database (Denmark)

    Wendt, Sabrina Lyngbye; Welinder, Peter; Sørensen, Helge Bjarup Dissing

    2015-01-01

    Objectives To measure the inter-expert and intra-expert agreement in sleep spindle scoring, and to quantify how many experts are needed to build a reliable dataset of sleep spindle scorings. Methods The EEG dataset was comprised of 400 randomly selected 115 s segments of stage 2 sleep from 110...... with higher reliability than the estimation of spindle duration. Reliability of sleep spindle scoring can be improved by using qualitative confidence scores, rather than a dichotomous yes/no scoring system. Conclusions We estimate that 2–3 experts are needed to build a spindle scoring dataset...... with ‘substantial’ reliability (κ: 0.61–0.8), and 4 or more experts are needed to build a dataset with ‘almost perfect’ reliability (κ: 0.81–1). Significance Spindle scoring is a critical part of sleep staging, and spindles are believed to play an important role in development, aging, and diseases of the nervous...

  13. Raven’s Progressive Matrices, manipulations of complexity and measures of accuracy, speed and confidence

    OpenAIRE

    LAZAR STANKOV; KARL SCHWEIZER

    2007-01-01

    This paper examines the effects of complexity-enhancing manipulations of two cognitive tasks – Swaps and Triplet Numbers tests (Stankov, 2000) – on their relationship with Raven’s Progressive Matrices test representing aspects of fluid intelligence. The complexity manipulations involved four treatment levels, each requiring an increasing number of components and relationships among these components. The accuracy, speed of processing, and confidence measures were decomposed into experimental a...

  14. Intra and inter-rater reliability study of pelvic floor muscle dynamometric measurements

    Directory of Open Access Journals (Sweden)

    Natalia M. Martinho

    2015-04-01

    Full Text Available OBJECTIVE: The aim of this study was to evaluate the intra and inter-rater reliability of pelvic floor muscle (PFM dynamometric measurements for maximum and average strengths, as well as endurance. METHOD: A convenience sample of 18 nulliparous women, without any urogynecological complaints, aged between 19 and 31 (mean age of 25.4±3.9 participated in this study. They were evaluated using a pelvic floor dynamometer based on load cell technology. The dynamometric evaluations were repeated in three successive sessions: two on the same day with a rest period of 30 minutes between them, and the third on the following day. All participants were evaluated twice in each session; first by examiner 1 followed by examiner 2. The vaginal dynamometry data were analyzed using three parameters: maximum strength, average strength, and endurance. The Intraclass Correlation Coefficient (ICC was applied to estimate the PFM dynamometric measurement reliability, considering a good level as being above 0.75. RESULTS: The intra and inter-raters' analyses showed good reliability for maximum strength (ICCintra-rater1=0.96, ICCintra-rater2=0.95, and ICCinter-rater=0.96, average strength (ICCintra-rater1=0.96, ICCintra-rater2=0.94, and ICCinter-rater=0.97, and endurance (ICCintra-rater1=0.88, ICCintra-rater2=0.86, and ICCinter-rater=0.92 dynamometric measurements. CONCLUSIONS: The PFM dynamometric measurements showed good intra- and inter-rater reliability for maximum strength, average strength and endurance, which demonstrates that this is a reliable device that can be used in clinical practice.

  15. Can Reliability of Multiple Component Measuring Instruments Depend on Response Option Presentation Mode?

    Science.gov (United States)

    Menold, Natalja; Raykov, Tenko

    2016-01-01

    This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…

  16. Self-confidence, overconfidence and prenatal testosterone exposure : Evidence from the lab

    NARCIS (Netherlands)

    Dalton, Patricio S.; Ghosal, Sayantan

    2018-01-01

    This paper examines whether foetal testosterone exposure predicts the extent of confidence and over-confidence in own absolute ability in adulthood. To study this question, we elicited incentive-compatible measures of confidence and over-confidence in the lab and correlate them with measures of

  17. Reliable Portfolio Selection Problem in Fuzzy Environment: An mλ Measure Based Approach

    Directory of Open Access Journals (Sweden)

    Yuan Feng

    2017-04-01

    Full Text Available This paper investigates a fuzzy portfolio selection problem with guaranteed reliability, in which the fuzzy variables are used to capture the uncertain returns of different securities. To effectively handle the fuzziness in a mathematical way, a new expected value operator and variance of fuzzy variables are defined based on the m λ measure that is a linear combination of the possibility measure and necessity measure to balance the pessimism and optimism in the decision-making process. To formulate the reliable portfolio selection problem, we particularly adopt the expected total return and standard variance of the total return to evaluate the reliability of the investment strategies, producing three risk-guaranteed reliable portfolio selection models. To solve the proposed models, an effective genetic algorithm is designed to generate the approximate optimal solution to the considered problem. Finally, the numerical examples are given to show the performance of the proposed models and algorithm.

  18. Relating measurement invariance, cross-level invariance, and multilevel reliability

    NARCIS (Netherlands)

    Jak, S.; Jorgensen, T.D.

    2017-01-01

    Data often have a nested, multilevel structure, for example when data are collected from children in classrooms. This kind of data complicate the evaluation of reliability and measurement invariance, because several properties can be evaluated at both the individual level and the cluster level, as

  19. Reliability of the "Ten Test" for assessment of discriminative sensation in hand trauma.

    Science.gov (United States)

    Berger, Michael J; Regan, William R; Seal, Alex; Bristol, Sean G

    2016-10-01

    "Ten Test" (TT) is a bedside measure of discriminative sensation, whereby the magnitude of abnormal sensation to moving light touch is normalized to an area of normal sensation on an 11-point Likert scale (0-10). The purposes of this study were to determine reliability parameters of the TT in a cohort of patients presenting to a hand trauma clinic with subjectively altered sensation post-injury and to compare the reliability of TT to that of the Weinstein Enhanced Sensory Test (WEST). Study participants (n = 29, mean age = 37 ± 12) comprised patients presenting to an outpatient hand trauma clinic with recent hand trauma and self reported abnormal sensation. Participants underwent TT and WEST by two separate raters on the same day. Interrater reliability, response stability and responsiveness of each test were determined by the intraclass correlation coefficient (ICC: 2, 1), standard error of measurement (SEM) with 95% confidence intervals (CI) and minimal detectable difference score, with 95% CI (MDD95), respectively. The TT displayed excellent interrater reliability (ICC = 0.95, 95% CI 0.89-0.97) compared to good reliability for WEST (ICC = 0.78, 95% CI 0.58-0.89). The range of true scores expected with 95% confidence based on the SEM (i.e. response stability), was ±1.1 for TT and ±1.1 for WEST. MDD95 scores reflecting test responsiveness were 1.5 and 1.6 for TT and WEST, respectively. The TT displayed excellent reliability parameters in this patient population. Reliability parameters were stronger for TT compared to WEST. These results provide support for the use of TT as a component of the sensory exam in hand trauma. Copyright © 2016 British Association of Plastic, Reconstructive and Aesthetic Surgeons. Published by Elsevier Ltd. All rights reserved.

  20. Inference on the reliability of Weibull distribution with multiply Type-I censored data

    International Nuclear Information System (INIS)

    Jia, Xiang; Wang, Dong; Jiang, Ping; Guo, Bo

    2016-01-01

    In this paper, we focus on the reliability of Weibull distribution under multiply Type-I censoring, which is a general form of Type-I censoring. In multiply Type-I censoring in this study, all units in the life testing experiment are terminated at different times. Reliability estimation with the maximum likelihood estimate of Weibull parameters is conducted. With the delta method and Fisher information, we propose a confidence interval for reliability and compare it with the bias-corrected and accelerated bootstrap confidence interval. Furthermore, a scenario involving a few expert judgments of reliability is considered. A method is developed to generate extended estimations of reliability according to the original judgments and transform them to estimations of Weibull parameters. With Bayes theory and the Monte Carlo Markov Chain method, a posterior sample is obtained to compute the Bayes estimate and credible interval for reliability. Monte Carlo simulation demonstrates that the proposed confidence interval outperforms the bootstrap one. The Bayes estimate and credible interval for reliability are both satisfactory. Finally, a real example is analyzed to illustrate the application of the proposed methods. - Highlights: • We focus on reliability of Weibull distribution under multiply Type-I censoring. • The proposed confidence interval for the reliability is superior after comparison. • The Bayes estimates with a few expert judgements on reliability are satisfactory. • We specify the cases where the MLEs do not exist and present methods to remedy it. • The distribution of estimate of reliability should be used for accurate estimate.

  1. Reliability of 4-meter and 10-meter walk tests after lower extremity surgery.

    Science.gov (United States)

    Unver, Bayram; Baris, Refik Hilmi; Yuksel, Ertugrul; Cekmece, Senol; Kalkan, Serpil; Karatosun, Vasfi

    2017-12-01

    To investigate the test-retest reliability of the 4-meter walk test (4 MWT) and 10-meter walk test (10 MWT) in patients undergoing lower extremity surgery during inpatient rehabilitation. In all, 102 patients with total hip arthroplasty (THA), total knee arthroplasty (TKA), lower extremity fracture (LEF) and soft tissue operation were recruited. Patients performed two 4 MWT and two 10 MWT trials on the same day. The same researcher performed all the measurements to avoid inter-rater variability. The 4 MWT and 10 MWT were shown to have excellent test-retest reliability. The ICCs for the 4 MWT and 10 MWT were found as 0.94 and 0.95, respectively. The SEMs for the 4 MWT and 10 MWT were 2.0 and 5.5 seconds, respectively. The smallest real difference at the 95% confidence level (SRD95) was 5.5 seconds for the 4 MWT and 12.2 seconds for 10 MWT and SRD95 percentage was 31.2 for the 4 MWT and 28.5 for the 10 MWT. Both the 4 MWT and the 10 MWT have excellent reliability in patients undergoing lower extremity surgery such as TKA, THA, LEF and soft tissue operation during inpatient rehabilitation. Clinicians and researchers can be confident that changes above the SRD95s for the different patient groups, for both sexes and with regard to weight-bearing status, represent a real clinical change in rehabilitation process. Implications for Rehabilitation The 4 MWT and the 10 MWT are simple methods and were also shown to be reliable measurement methods in many patient groups. This study illustrates that the test-retest reliability of the 4 MWT and 10 MWT are excellent in patients undergoing lower extremity surgery during inpatient rehabilitation (ICC: 0.94 for 4 MWT, ICC: 0.95 for 10 MWT). Clinicians and researchers can be confident that changes above the SRD95s for the different patient groups, for both sexes and with regard to weight-bearing status represent a real clinical change in rehabilitation process.

  2. Reliability in individual monitoring service.

    Science.gov (United States)

    Mod Ali, N

    2011-03-01

    As a laboratory certified to ISO 9001:2008 and accredited to ISO/IEC 17025, the Secondary Standard Dosimetry Laboratory (SSDL)-Nuclear Malaysia has incorporated an overall comprehensive system for technical and quality management in promoting a reliable individual monitoring service (IMS). Faster identification and resolution of issues regarding dosemeter preparation and issuing of reports, personnel enhancement, improved customer satisfaction and overall efficiency of laboratory activities are all results of the implementation of an effective quality system. Review of these measures and responses to observed trends provide continuous improvement of the system. By having these mechanisms, reliability of the IMS can be assured in the promotion of safe behaviour at all levels of the workforce utilising ionising radiation facilities. Upgradation of in the reporting program through a web-based e-SSDL marks a major improvement in Nuclear Malaysia's IMS reliability on the whole. The system is a vital step in providing a user friendly and effective occupational exposure evaluation program in the country. It provides a higher level of confidence in the results generated for occupational dose monitoring of the IMS, thus, enhances the status of the radiation protection framework of the country.

  3. Reliability and Minimum Detectable Change of Temporal-Spatial, Kinematic, and Dynamic Stability Measures during Perturbed Gait.

    Directory of Open Access Journals (Sweden)

    Christopher A Rábago

    Full Text Available Temporal-spatial, kinematic variability, and dynamic stability measures collected during perturbation-based assessment paradigms are often used to identify dysfunction associated with gait instability. However, it remains unclear which measures are most reliable for detecting and tracking responses to perturbations. This study systematically determined the between-session reliability and minimum detectable change values of temporal-spatial, kinematic variability, and dynamic stability measures during three types of perturbed gait. Twenty young healthy adults completed two identical testing sessions two weeks apart, comprised of an unperturbed and three perturbed (cognitive, physical, and visual walking conditions in a virtual reality environment. Within each session, perturbation responses were compared to unperturbed walking using paired t-tests. Between-session reliability and minimum detectable change values were also calculated for each measure and condition. All temporal-spatial, kinematic variability and dynamic stability measures demonstrated fair to excellent between-session reliability. Minimal detectable change values, normalized to mean values ranged from 1-50%. Step width mean and variability measures demonstrated the greatest response to perturbations with excellent between-session reliability and low minimum detectable change values. Orbital stability measures demonstrated specificity to perturbation direction and sensitivity with excellent between-session reliability and low minimum detectable change values. We observed substantially greater between-session reliability and lower minimum detectable change values for local stability measures than previously described which may be the result of averaging across trials within a session and using velocity versus acceleration data for reconstruction of state spaces. Across all perturbation types, temporal-spatial, orbital and local measures were the most reliable measures with the

  4. Measuring walking within and outside the neighborhood in Chinese elders: reliability and validity

    Directory of Open Access Journals (Sweden)

    Cerin Ester

    2011-11-01

    Full Text Available Abstract Background Walking is a preferred, prevalent and recommended activity for aging populations and is influenced by the neighborhood built environment. To study this influence it is necessary to differentiate whether walking occurs within or outside of the neighborhood. The Neighborhood Physical Activity Questionnaire (NPAQ collects information on setting-specific physical activity, including walking, inside and outside one's neighborhood. While the NPAQ has shown to be a reliable measure in adults, its reliability in older adults is unknown. Additionally its validity and the influence of type of neighborhood on reliability and validity have yet to be explored. Methods The NPAQ walking component was adapted for Chinese speaking elders (NWQ-CS. Ninety-six Chinese elders, stratified by social economic status and neighborhood walkability, wore an accelerometer and completed a log of walks for 7 days. Following the collection of valid data the NWQ-CS was interviewer-administered. Fourteen to 20 days (average of 17 days later the NWQ-CS was re-administered. Test-retest reliability and validity of the NWQ-CS were assessed. Results Reliability and validity estimates did not differ with type of neighborhood. NWQ-CS measures of walking showed moderate to excellent reliability. Reliability was generally higher for estimates of weekly frequency than minutes of walking. Total weekly minutes of walking were moderately related to all accelerometry measures. Moderate-to-strong associations were found between the NWQ-CS and log-of-walks variables. The NWQ-CS yielded statistically significantly lower mean values of total walking, weekly minutes of walking for transportation and weekly frequency of walking for transportation outside the neighborhood than the log-of-walks. Conclusions The NWQ-CS showed measurement invariance across types of neighborhoods. It is a valid measure of walking for recreation and frequency of walking for transport. However, it may

  5. Techniques, processes, and measures for software safety and reliability

    International Nuclear Information System (INIS)

    Sparkman, D.

    1992-01-01

    The purpose of this report is to provide a detailed survey of current recommended practices and measurement techniques for the development of reliable and safe software-based systems. This report is intended to assist the United States Nuclear Reaction Regulation (NRR) in determining the importance and maturity of the available techniques and in assessing the relevance of individual standards for application to instrumentation and control systems in nuclear power generating stations. Lawrence Livermore National Laboratory (LLNL) provides technical support for the Instrumentation and Control System Branch (ICSB) of NRRin advanced instrumentation and control systems, distributed digital systems, software reliability, and the application of verificafion and validafion for the development of software

  6. Reliability of biceps femoris and semitendinosus muscle architecture measurements obtained with ultrasonography

    Directory of Open Access Journals (Sweden)

    Viviane Bastos de Oliveira

    Full Text Available Introduction Currently, little attention is given to the muscle architecture reliability studies of the hamstring using a robust statistical. Our purpose was to determine the reliability of ultrasound measurements of muscle thickness, fascicle length and pennation angle of the biceps femoris and semitendinosus muscles, including heteroskedasticity and internal consistency analyses. Methods Two images of biceps femoris and semitendinosus at 50% of the thigh length were acquired from 21 volunteers, in two visits. The parameters were measured three times in each image, and for each muscle. The reliability was analyzed by the intraclass correlation coefficient (ICC and Cronbach’s alpha (αCronbach. The relative standard error of the measurements (%SEM were calculated and Bland-Altman plots were generated. Results All parameters presented excellent ICC for the three repeated measurements (ICC from 0.93 ‒ 0.99 and moderate to excellent reliability intraday (ICC from 0.70 ‒ 0.95 for both muscles. The present study indicates that ultrasound is a reliable tool to estimate the biceps femoris fascicle length (ICC = 0.97, αCronbach = 0.98, %SEM = 7.86 and semitendinosus (ICC = 0.90, αCronbach = 0.95, %SEM = 7.55, as well as the biceps femoris muscle thickness (ICC = 0.89, αCronbach = 0.94, %SEM = 10.23 and semitendinosus muscle thickness (ICC = 0.87, αCronbach = 0.93, %SEM = 1.35. At last, biceps femoris pennation angle (ICC = 0.93, αCronbach = 0.96 and %SEM = 4.36 and semitendinosus (ICC = 0.96, αCronbach = 0.98 and %SEM = 4.25 also had good repeatability. Conclusion Ultrasonography show good repeatability in estimating of muscle architecture parameters.

  7. Evaluation of error bands and confidence limits for thermal measurements in the CFTL bundle

    International Nuclear Information System (INIS)

    Childs, K.W.; Sanders, J.P.; Conklin, J.C.

    1979-01-01

    Surface cladding temperatures for the fuel rod simulators in the Core Flow Test Loop (CFTL) must be inferred from a measurement at a thermocouple junction within the rod. This step requires the evaluation of the thermal field within the rod based on known parameters such as heat generation rate, dimensional tolerances, thermal properties, and contact coefficients. Uncertainties in the surface temperature can be evaluated by assigning error bands to each of the parameters used in the calculation. A statistical method has been employed to establish the confidence limits for the surface temperature from a combination of the standard deviations of the important parameters. This method indicates that for a CFTL fuel rod simulator with a total power of 38 kW and a ratio of maximum to average axial power of 1.21, the 95% confidence limit for the calculated surface temperature is +- 45 0 C at the midpoint of the rod

  8. Performance of classification confidence measures in dynamic classifier systems

    Czech Academy of Sciences Publication Activity Database

    Štefka, D.; Holeňa, Martin

    2013-01-01

    Roč. 23, č. 4 (2013), s. 299-319 ISSN 1210-0552 R&D Projects: GA ČR GA13-17187S Institutional support: RVO:67985807 Keywords : classifier combining * dynamic classifier systems * classification confidence Subject RIV: IN - Informatics, Computer Science Impact factor: 0.412, year: 2013

  9. Reliability of smartphone-based gait measurements for quantification of physical activity/inactivity levels.

    Science.gov (United States)

    Ebara, Takeshi; Azuma, Ryohei; Shoji, Naoto; Matsukawa, Tsuyoshi; Yamada, Yasuyuki; Akiyama, Tomohiro; Kurihara, Takahiro; Yamada, Shota

    2017-11-25

    Objective measurements using built-in smartphone sensors that can measure physical activity/inactivity in daily working life have the potential to provide a new approach to assessing workers' health effects. The aim of this study was to elucidate the characteristics and reliability of built-in step counting sensors on smartphones for development of an easy-to-use objective measurement tool that can be applied in ergonomics or epidemiological research. To evaluate the reliability of step counting sensors embedded in seven major smartphone models, the 6-minute walk test was conducted and the following analyses of sensor precision and accuracy were performed: 1) relationship between actual step count and step count detected by sensors, 2) reliability between smartphones of the same model, and 3) false detection rates when sitting during office work, while riding the subway, and driving. On five of the seven models, the inter-class correlations coefficient (ICC (3,1) ) showed high reliability with a range of 0.956-0.993. The other two models, however, had ranges of 0.443-0.504 and the relative error ratios of the sensor-detected step count to the actual step count were ±48.7%-49.4%. The level of agreement between the same models was ICC (3,1) : 0.992-0.998. The false detection rates differed between the sitting conditions. These results suggest the need for appropriate regulation of step counts measured by sensors, through means such as correction or calibration with a predictive model formula, in order to obtain the highly reliable measurement results that are sought in scientific investigation.

  10. Inter-rater and intra-rater reliability of a clinical protocol for measuring turnout in collegiate dancers.

    Science.gov (United States)

    Greene, Amanda; Lasner, Andrea; Deu, Rajwinder; Oliphant, Seth; Johnson, Kenneth

    2018-02-02

    Reliable methods of measuring turnout in dancers and comparing active turnout (used in class) with functional (uncompensated) turnout are needed. Authors have suggested measurement techniques but there is no clinically useful, easily reproducible technique with established inter-rater and intra-rater reliability. We adapted a technique based on previous research, which is easily reproducible. We hypothesized excellent inter-rater and intra-rater reliability between experienced physical therapists (PTs) and a briefly trained faculty member from a university's department of dance. Thirty-two participants were recruited from the same dance department. Dancers' active and functional turnout was measured by each rater. We found that our technique for measuring active and functional turnout has excellent inter-rater and intra-rater reliability when performed by two experienced PTs and by one briefly trained university-level dance faculty member. For active turnout, inter-rater reliability was 0.78 among all raters and 0.82 among only the PT raters; intra-rater reliability was 0.82 among all raters and 0.85 among only the PT raters. For functional turnout, inter-rater reliability was 0.86 among all raters and 0.88 among only the PT raters; intra-rater reliability was 0.87 among all raters and 0.88 among only the PT raters. The measurement technique described provides a standardized protocol with excellent inter-rater and intra-rater reliability when performed by experienced PTs or by a briefly trained university-level dance faculty member.

  11. Psychometric Properties of Persian Translated Version of Activities-specific Balance Confidence Scale (ABC in Arak Community-dwelling Older Adults

    Directory of Open Access Journals (Sweden)

    Daryoush Khajavi

    2017-11-01

    Full Text Available Abstract Background: Balance deficiency, falls and fear of fall are important problems that can resulted in reversed health outcomes including decreased quality of life. The purpose of this study was surveying factor structure, validation, and reliability determination of Persian translated version of Activities-specific Balance Confidence scale in community-dwelling older adults of Arak city. Materials and Methods: Research method was descriptive in form of psychometry. The statistic population was older adults of Arak in year 2012 and 308 subjects with mean age 69.38 years were selected availably. Data were collected by Persian translated version of Activities-specific Balance Confidence that is a 16-item scale and evaluates balance confidence in activities of daily living. Data were analyzed by Exploratory Factor Analysis. Test-retest and internal reliability were calculated by Pearson correlation coefficient and Chronbach’s Alpha. Data were analyzed with SPSS-16. Results: The findings resulted in extraction of one factor with eigenvalue over one that explained 82.89% of total variance. Test-retest reliability between 1 to 4 weeks and internal reliability (Chronbach’s alpha were 0.82 and 0.98, respectively. Gutmann split-half correlation coefficient and intra-class correlation coefficient were calculated 95% and 85%, respectively. Conclusion: Persian translated version of Activities-specific Balance Confidence (ABC-F is a valid and reliable tool for Iranian community-dwelling older adults that can be used in clinical and research purpose.

  12. Reliability of Phase Velocity Measurements of Flexural Acoustic Waves in the Human Tibia In-Vivo.

    Science.gov (United States)

    Vogl, Florian; Schnüriger, Karin; Gerber, Hans; Taylor, William R

    2016-01-01

    Axial-transmission acoustics have shown to be a promising technique to measure individual bone properties and detect bone pathologies. With the ultimate goal being the in-vivo application of such systems, quantification of the key aspects governing the reliability is crucial to bring this method towards clinical use. This work presents a systematic reliability study quantifying the sources of variability and their magnitudes of in-vivo measurements using axial-transmission acoustics. 42 healthy subjects were measured by an experienced operator twice per week, over a four-month period, resulting in over 150000 wave measurements. In a complementary study to assess the influence of different operators performing the measurements, 10 novice operators were trained, and each measured 5 subjects on a single occasion, using the same measurement protocol as in the first part of the study. The estimated standard error for the measurement protocol used to collect the study data was ∼ 17 m/s (∼ 4% of the grand mean) and the index of dependability, as a measure of reliability, was Φ = 0.81. It was shown that the method is suitable for multi-operator use and that the reliability can be improved efficiently by additional measurements with device repositioning, while additional measurements without repositioning cannot improve the reliability substantially. Phase velocity values were found to be significantly higher in males than in females (p < 10-5) and an intra-class correlation coefficient of r = 0.70 was found between the legs of each subject. The high reliability of this non-invasive approach and its intrinsic sensitivity to mechanical properties opens perspectives for the rapid and inexpensive clinical assessment of bone pathologies, as well as for monitoring programmes without any radiation exposure for the patient.

  13. Financial Literacy, Confidence and Financial Advice Seeking

    NARCIS (Netherlands)

    Kramer, Marc M.

    2016-01-01

    We find that people with higher confidence in their own financial literacy are less likely to seek financial advice, but no relation between objective measures of literacy and advice seeking. The negative association between confidence and advice seeking is more pronounced among wealthy households.

  14. Photovoltaic Module Reliability Workshop 2010: February 18-19, 2010

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, J.

    2013-11-01

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology--both critical goals for moving PV technologies deeper into the electricity marketplace.

  15. Photovoltaic Module Reliability Workshop 2011: February 16-17, 2011

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, S.

    2013-11-01

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology--both critical goals for moving PV technologies deeper into the electricity marketplace.

  16. Photovoltaic Module Reliability Workshop 2013: February 26-27, 2013

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, S.

    2013-10-01

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology--both critical goals for moving PV technologies deeper into the electricity marketplace.

  17. Photovoltaic Module Reliability Workshop 2014: February 25-26, 2014

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, S.

    2014-02-01

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology--both critical goals for moving PV technologies deeper into the electricity marketplace.

  18. Integration of multiple biological features yields high confidence human protein interactome.

    Science.gov (United States)

    Karagoz, Kubra; Sevimoglu, Tuba; Arga, Kazim Yalcin

    2016-08-21

    The biological function of a protein is usually determined by its physical interaction with other proteins. Protein-protein interactions (PPIs) are identified through various experimental methods and are stored in curated databases. The noisiness of the existing PPI data is evident, and it is essential that a more reliable data is generated. Furthermore, the selection of a set of PPIs at different confidence levels might be necessary for many studies. Although different methodologies were introduced to evaluate the confidence scores for binary interactions, a highly reliable, almost complete PPI network of Homo sapiens is not proposed yet. The quality and coverage of human protein interactome need to be improved to be used in various disciplines, especially in biomedicine. In the present work, we propose an unsupervised statistical approach to assign confidence scores to PPIs of H. sapiens. To achieve this goal PPI data from six different databases were collected and a total of 295,288 non-redundant interactions between 15,950 proteins were acquired. The present scoring system included the context information that was assigned to PPIs derived from eight biological attributes. A high confidence network, which included 147,923 binary interactions between 13,213 proteins, had scores greater than the cutoff value of 0.80, for which sensitivity, specificity, and coverage were 94.5%, 80.9%, and 82.8%, respectively. We compared the present scoring method with others for evaluation. Reducing the noise inherent in experimental PPIs via our scoring scheme increased the accuracy significantly. As it was demonstrated through the assessment of process and cancer subnetworks, this study allows researchers to construct and analyze context-specific networks via valid PPI sets and one can easily achieve subnetworks around proteins of interest at a specified confidence level. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Reliability of the Q Force; a mobile instrument for measuring isometric quadriceps muscle strength.

    Science.gov (United States)

    Douma, K W; Regterschot, G R H; Krijnen, W P; Slager, G E C; van der Schans, C P; Zijlstra, W

    2016-01-01

    The ability to generate muscle strength is a pre-requisite for all human movement. Decreased quadriceps muscle strength is frequently observed in older adults and is associated with a decreased performance and activity limitations. To quantify the quadriceps muscle strength and to monitor changes over time, instruments and procedures with a sufficient reliability are needed. The Q Force is an innovative mobile muscle strength measurement instrument suitable to measure in various degrees of extension. Measurements between 110 and 130° extension present the highest values and the most significant increase after training. The objective of this study is to determine the test-retest reliability of muscle strength measurements by the Q Force in older adults in 110° extension. Forty-one healthy older adults, 13 males and 28 females were included in the study. Mean (SD) age was 81.9 (4.89) years. Isometric muscle strength of the Quadriceps muscle was assessed with the Q Force at 110° of knee extension. Participants were measured at two sessions with a three to eight day interval between sessions. To determine relative reliability, the intraclass correlation coefficient (ICC) was calculated. To determine absolute reliability, Bland and Altman Limits of Agreement (LOA) were calculated and t-tests were performed. Relative reliability of the Q Force is good to excellent as all ICC coefficients are higher than 0.75. Generally a large 95 % LOA, reflecting only moderate absolute reliability, is found as exemplified for the peak torque left leg of -18.6 N to 33.8 N and the right leg of -9.2 N to 26.4 N was between 15.7 and 23.6 Newton representing 25.2 % to 39.9 % of the size of the mean. Small systematic differences in mean were found between measurement session 1 and 2. The present study shows that the Q Force has excellent relative test-retest reliability, but limited absolute test-retest reliability. Since the Q Force is relatively cheap and mobile it is suitable for

  20. Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps.

    Science.gov (United States)

    Powell, Adam C; Torous, John; Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

    2016-02-10

    There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff's alpha was calculated for each of the measures and reported by app category and in aggregate. The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. Patients and clinicians should consider conducting their own assessments of apps, in conjunction with

  1. Reliability and accuracy analysis of a new semiautomatic radiographic measurement software in adult scoliosis.

    Science.gov (United States)

    Aubin, Carl-Eric; Bellefleur, Christian; Joncas, Julie; de Lanauze, Dominic; Kadoury, Samuel; Blanke, Kathy; Parent, Stefan; Labelle, Hubert

    2011-05-20

    Radiographic software measurement analysis in adult scoliosis. To assess the accuracy as well as the intra- and interobserver reliability of measuring different indices on preoperative adult scoliosis radiographs using a novel measurement software that includes a calibration procedure and semiautomatic features to facilitate the measurement process. Scoliosis requires a careful radiographic evaluation to assess the deformity. Manual and computer radiographic process measures have been studied extensively to determine the reliability and reproducibility in adolescent idiopathic scoliosis. Most studies rely on comparing given measurements, which are repeated by the same user or by an expert user. A given measure with a small intra- or interobserver error might be deemed as good repeatability, but all measurements might not be truly accurate because the ground-truth value is often unknown. Thorough accuracy assessment of radiographic measures is necessary to assess scoliotic deformities, compare these measures at different stages or to permit valid multicenter studies. Thirty-four sets of adult scoliosis digital radiographs were measured two times by three independent observers using a novel radiographic measurement software that includes semiautomatic features to facilitate the measurement process. Twenty different measures taken from the Spinal Deformity Study Group radiographic measurement manual were performed on the coronal and sagittal images. Intra- and intermeasurer reliability for each measure was assessed. The accuracy of the measurement software was also assessed using a physical spine model in six different scoliotic configurations as a true reference. The majority of the measures demonstrated good to excellent intra- and intermeasurer reliability, except for sacral obliquity. The standard variation of all the measures was very small: ≤ 4.2° for Cobb angles, ≤ 4.2° for the kyphosis, ≤ 5.7° for the lordosis, ≤ 3.9° for the pelvic angles, and

  2. A Reliable Method to Measure Lip Height Using Photogrammetry in Unilateral Cleft Lip Patients.

    Science.gov (United States)

    van der Zeeuw, Frederique; Murabit, Amera; Volcano, Johnny; Torensma, Bart; Patel, Brijesh; Hay, Norman; Thorburn, Guy; Morris, Paul; Sommerlad, Brian; Gnarra, Maria; van der Horst, Chantal; Kangesu, Loshan

    2015-09-01

    There is still no reliable tool to determine the outcome of the repaired unilateral cleft lip (UCL). The aim of this study was therefore to develop an accurate, reliable tool to measure vertical lip height from photographs. The authors measured the vertical height of the cutaneous and vermilion parts of the lip in 72 anterior-posterior view photographs of 17 patients with repairs to a UCL. Points on the lip's white roll and vermillion were marked on both the cleft and the noncleft sides on each image. Two new concepts were tested. First, photographs were standardized using the horizontal (medial to lateral) eye fissure width (EFW) for calibration. Second, the authors tested the interpupillary line (IPL) and the alar base line (ABL) for their reliability as horizontal lines of reference. Measurements were taken by 2 independent researchers, at 2 different time points each. Overall 2304 data points were obtained and analyzed. Results showed that the method was very effective in measuring the height of the lip on the cleft side with the noncleft side. When using the IPL, inter- and intra-rater reliability was 0.99 to 1.0, with the ABL it varied from 0.91 to 0.99 with one exception at 0.84. The IPL was easier to define because in some subjects the overhanging nasal tip obscured the alar base and gave more consistent measurements possibly because the reconstructed alar base was sometimes indistinct. However, measurements from the IPL can only give the percentage difference between the left and right sides of the lip, whereas those from the ABL can also give exact measurements. Patient examples were given that show how the measurements correlate with clinical assessment. The authors propose this method of photogrammetry with the innovative use of the IPL as a reliable horizontal plane and use of the EFW for calibration as a useful and reliable tool to assess the outcome of UCL repair.

  3. Translation, reliability, and clinical utility of the Melbourne Assessment 2.

    Science.gov (United States)

    Gerber, Corinna N; Plebani, Anael; Labruyère, Rob

    2017-10-12

    The aims were to (i) provide a German translation of the Melbourne Assessment 2 (MA2), a quantitative test to measure unilateral upper limb function in children with neurological disabilities and (ii) to evaluate its reliability and aspects of clinical utility. After its translation into German and approval of the back translation by the original authors, the MA2 was performed and videotaped twice with 30 children with neuromotor disorders. For each participant, two raters scored the video of the first test for inter-rater reliability. To determine test-retest reliability, one rater additionally scored the video of the second test while the other rater repeated the scoring of the first video to evaluate intra-rater reliability. Time needed for rater training, test administration, and scoring was recorded. The four subscale scores showed excellent intra-, inter-rater, and test-retest reliability with intraclass correlation coefficients of 0.90-1.00 (95%-confidence intervals 0.78-1.00). Score items revealed substantial to almost perfect intra-rater reliability (weighted kappa k w  = 0.66-1.00) for the more affected side. Score item inter-rater and test-retest reliability of the same extremity were, with one exception, moderate to almost perfect (k w  = 0.42-0.97; k w  = 0.40-0.89). Furthermore, the MA2 was feasible and acceptable for patients and clinicians. The MA2 showed excellent subscale and moderate to almost perfect score item reliability. Implications for Rehabilitation There is a lack of high-quality studies about psychometric properties of upper limb measurement tools in the neuropediatric population. The Melbourne Assessment 2 is a promising tool for reliable measurement of unilateral upper limb movement quality in the neuropediatric population. The Melbourne Assessment 2 is acceptable and practicable to therapists and patients for routine use in clinical care.

  4. Reliability measures in managing GI bleeding.

    Science.gov (United States)

    Sonnenberg, Amnon

    2012-06-01

    Multiple procedures and devices are used in a complex interplay to diagnose and treat GI bleeding. To model how a large variety of diagnostic and therapeutic components interact in the successful management of GI bleeding. The analysis uses the concept of reliability block diagrams from probability theory to model management outcome. Separate components of the management process are arranged in a serial or parallel fashion. If the outcome depends on the function of each component individually, such components are modeled to be arranged in series. If components complement each other and can mutually compensate for each of their failures, such components are arranged in a parallel fashion. General endoscopy practice. Patients with GI bleeding of unknown etiology. All available endoscopic and radiographic means to diagnose and treat GI bleeding. Process reliability in achieving hemostasis. Serial arrangements tend to reduce process reliability, whereas parallel arrangements increase it. Whenever possible, serial components should be bridged and complemented by additional alternative (parallel) routes of operation. Parallel components with low individual reliability can still contribute to overall process reliability as long as they function independently of other pre-existing alternatives. Probability of success associated with individual components is partly unknown. Modeling management of GI bleeding by a reliability block diagram provides a useful tool in assessing the impact of individual endoscopic techniques and administrative structures on the overall outcome. Copyright © 2012 American Society for Gastrointestinal Endoscopy. Published by Mosby, Inc. All rights reserved.

  5. Increasing Confidence and Ability in Implementing Kangaroo Mother Care Method Among Young Mothers.

    Science.gov (United States)

    Kenanga Purbasary, Eleni; Rustina, Yeni; Budiarti, Tri

    Mothers giving birth to low birth weight babies (LBWBs) have low confidence in caring for their babies because they are often still young and may lack the knowledge, experience, and ability to care for the baby. This research aims to determine the effect of education about kangaroo mother care (KMC) on the confidence and ability of young mothers to implement KMC. The research methodology used was a controlled-random experimental approach with pre- and post-test equivalent groups of 13 mothers and their LBWBs in the intervention group and 13 mothers and their LBWBs in the control group. Data were collected via an instrument measuring young mothers' confidence, the validity and reliability of which have been tested with a resulting r value of .941, and an observation sheet on KMC implementation. After conducting the education, the confidence score of young mothers and their ability to perform KMC increased meaningfully. The score of confidence of young mothers before education was 37 (p = .1555: and the ability score for KMC Implementation before education was 9 (p = .1555). The median score of confidence of young mothers after education in the intervention group was 87 and in the control group was 50 (p = .001, 95% CI 60.36-75.56), and ability median score for KMC implementation after education in the intervention group was 16 and in the control group was 12 (p = .001, 95% CI 1.50-1.88). KMC education should be conducted gradually, and it is necessary to involve the family, in order for KMC implementation to continue at home. A family visit can be done for LBWBs to evaluate the ability of the young mothers to implement KMC.

  6. Feasibility and Inter-Rater Reliability of Physical Performance Measures in Acutely Admitted Older Medical Patients

    DEFF Research Database (Denmark)

    Bodilsen, Ann Christine; Juul-Larsen, Helle Gybel; Petersen, Janne

    2015-01-01

    OBJECTIVE: Physical performance measures can be used to predict functional decline and increased dependency in older persons. However, few studies have assessed the feasibility or reliability of such measures in hospitalized older patients. Here we assessed the feasibility and inter-rater reliabi......OBJECTIVE: Physical performance measures can be used to predict functional decline and increased dependency in older persons. However, few studies have assessed the feasibility or reliability of such measures in hospitalized older patients. Here we assessed the feasibility and inter......-rater reliability of four simple measures of physical performance in acutely admitted older medical patients. DESIGN: During the first 24 hours of hospitalization, the following were assessed twice by different raters in 52 (≥ 65 years) patients admitted for acute medical illness: isometric hand grip strength, 4......, and 30-s chair stand were 8%, 7%, and 18%, and the SRD95% values were 22%, 17%, and 49%. CONCLUSION: In acutely admitted older medical patients, grip strength, gait speed, and the Cumulated Ambulation Score measurements were feasible and showed high inter-rater reliability when administered by different...

  7. Toward a Common Language for Measuring Patient Mobility in the Hospital: Reliability and Construct Validity of Interprofessional Mobility Measures.

    Science.gov (United States)

    Hoyer, Erik H; Young, Daniel L; Klein, Lisa M; Kreif, Julie; Shumock, Kara; Hiser, Stephanie; Friedman, Michael; Lavezza, Annette; Jette, Alan; Chan, Kitty S; Needham, Dale M

    2018-02-01

    The lack of common language among interprofessional inpatient clinical teams is an important barrier to achieving inpatient mobilization. In The Johns Hopkins Hospital, the Activity Measure for Post-Acute Care (AM-PAC) Inpatient Mobility Short Form (IMSF), also called "6-Clicks," and the Johns Hopkins Highest Level of Mobility (JH-HLM) are part of routine clinical practice. The measurement characteristics of these tools when used by both nurses and physical therapists for interprofessional communication or assessment are unknown. The purposes of this study were to evaluate the reliability and minimal detectable change of AM-PAC IMSF and JH-HLM when completed by nurses and physical therapists and to evaluate the construct validity of both measures when used by nurses. A prospective evaluation of a convenience sample was used. The test-retest reliability and the interrater reliability of AM-PAC IMSF and JH-HLM for inpatients in the neuroscience department (n = 118) of an academic medical center were evaluated. Each participant was independently scored twice by a team of 2 nurses and 1 physical therapist; a total of 4 physical therapists and 8 nurses participated in reliability testing. In a separate inpatient study protocol (n = 69), construct validity was evaluated via an assessment of convergent validity with other measures of function (grip strength, Katz Activities of Daily Living Scale, 2-minute walk test, 5-times sit-to-stand test) used by 5 nurses. The test-retest reliability values (intraclass correlation coefficients) for physical therapists and nurses were 0.91 and 0.97, respectively, for AM-PAC IMSF and 0.94 and 0.95, respectively, for JH-HLM. The interrater reliability values (intraclass correlation coefficients) between physical therapists and nurses were 0.96 for AM-PAC IMSF and 0.99 for JH-HLM. Construct validity (Spearman correlations) ranged from 0.25 between JH-HLM and right-hand grip strength to 0.80 between AM-PAC IMSF and the Katz Activities of

  8. Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 2. Technical Report #1201

    Science.gov (United States)

    Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald

    2012-01-01

    In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

  9. Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

    Science.gov (United States)

    Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

  10. Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

    Science.gov (United States)

    Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

  11. Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

    Science.gov (United States)

    Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

    2012-01-01

    In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

  12. Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 6. Technical Report #1205

    Science.gov (United States)

    Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald

    2012-01-01

    In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

  13. Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

    Science.gov (United States)

    Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald

    2012-01-01

    In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

  14. Night-to-night arousal variability and interscorer reliability of arousal measurements.

    Science.gov (United States)

    Loredo, J S; Clausen, J L; Ancoli-Israel, S; Dimsdale, J E

    1999-11-01

    Measurement of arousals from sleep is clinically important, however, their definition is not well standardized, and little data exist on reliability. The purpose of this study is to determine factors that affect arousal scoring reliability and night-to-night arousal variability. The night-to-night arousal variability and interscorer reliability was assessed in 20 subjects with and without obstructive sleep apnea undergoing attended polysomnography during two consecutive nights. Five definitions of arousal were studied, assessing duration of electroencephalographic (EEG) frequency changes, increases in electromyographic (EMG) activity and leg movement, association with respiratory events, as well as the American Sleep Disorders Association (ASDA) definition of arousals. NA. NA. NA. Interscorer reliability varied with the definition of arousal and ranged from an Intraclass correlation (ICC) of 0.19 to 0.92. Arousals that included increases in EMG activity or leg movement had the greatest reliability, especially when associated with respiratory events (ICC 0.76 to 0.92). The ASDA arousal definition had high interscorer reliability (ICC 0.84). Reliability was lowest for arousals consisting of EEG changes lasting <3 seconds (ICC 0.19 to 0.37). The within subjects night-to-night arousal variability was low for all arousal definitions In a heterogeneous population, interscorer arousal reliability is enhanced by increases in EMG activity, leg movements, and respiratory events and decreased by short duration EEG arousals. The arousal index night-to-night variability was low for all definitions.

  15. Reliability of rehabilitative ultrasonographic imaging for muscle thickness measurement of the rhomboid major.

    Science.gov (United States)

    Jeong, Ju Ri; Ko, Young Jun; Ha, Hyun Geun; Lee, Wan Hee

    2016-03-01

    This study was to establish inter-rater and intrarater reliability of the rehabilitative ultrasonographic imaging (RUSI) technique for muscle thickness measurement of the rhomboid major at rest and with the shoulder abducted to 90°. Twenty-four young adults (eight men, 16 women; right-handed; mean age [±SD], 24·4 years [±2·6]) with no history of neck, shoulder, or arm pain were recruited. Rhomboid major muscle images were obtained in the resting position and with shoulder in 90° abduction using an ultrasonography system with a 7·5-MHz linear transducer. In these two positions, the examiners found the site at which the transducer could be placed. Two examiners obtained the images of all participants in three test sessions at random. Intraclass correlation coefficients (ICC) were used to estimate reliability. All ICCs (95% CI) were >0·75, ranging from 0·93 to 0·98, which indicates good reliability. The ICCs for inter-rater reliability ranged from 0·75 to 0·94. For the absolute value of the difference in the intra-examiner reliability between the right and left ratios, the ICCs ranged from 0·58 to 0·91. In this study, the intra- and interexaminer reliability of muscle thickness measurements of the rhomboid major were good. Therefore, we suggest that muscle thickness measurements of the rhomboid major obtained with the RUSI technique would be useful for clinical rehabilitative assessment. © 2014 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.

  16. Low-Budget Instrumentation of a Conventional Leg Press to Measure Reliable Isometric-Strength Capacity.

    Science.gov (United States)

    Baur, Heiner; Groppa, Alessia Severina; Limacher, Regula; Radlinger, Lorenz

    2016-02-02

    Maximum strength and rate of force development (RFD) are 2 important strength characteristics for everyday tasks and athletic performance. Measurements of both parameters must be reliable. Expensive isokinetic devices with isometric modes are often used. The possibility of cost-effective measurements in a practical setting would facilitate quality control. The purpose of this study was to assess the reliability of measurements of maximum isometric strength (Fmax) and RFD on a conventional leg press. Sixteen subjects (23 ± 2 y, 1.68 ± 0.05 m, 59 ± 5 kg) were tested twice within 1 session. After warm-up, subjects performed 2 times 5 trials eliciting maximum voluntary isometric contractions on an instrumented leg press (1- and 2-legged randomized). Fmax (N) and RFD (N/s) were extracted from force-time curves. Reliability was determined for Fmax and RFD by calculating the intraclass correlation coefficient (ICC), the test-retest variability (TRV), and the bias and limits of agreement. Reliability measures revealed good to excellent ICCs of .80-.93. TRV showed mean differences between measurement sessions of 0.4-6.9%. The systematic error was low compared with the absolute mean values (Fmax 5-6%, RFD 1-4%). The implementation of a force transducer into a conventional leg press provides a viable procedure to assess Fmax and RFD. Both performance parameters can be assessed with good to excellent reliability allowing quality control of interventions.

  17. Optimal number of tests to achieve and validate product reliability

    International Nuclear Information System (INIS)

    Ahmed, Hussam; Chateauneuf, Alaa

    2014-01-01

    The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces

  18. Reliability and measurement error of sagittal spinal motion parameters in 220 patients with chronic low back pain using a three-dimensional measurement device.

    Science.gov (United States)

    Mieritz, Rune M; Bronfort, Gert; Jakobsen, Markus D; Aagaard, Per; Hartvigsen, Jan

    2014-09-01

    A basic premise for any instrument measuring spinal motion is that reliable outcomes can be obtained on a relevant sample under standardized conditions. The purpose of this study was to assess the overall reliability and measurement error of regional spinal sagittal plane motion in patients with chronic low back pain (LBP), and then to evaluate the influence of body mass index, examiner, gender, stability of pain, and pain distribution on reliability and measurement error. This study comprises a test-retest design separated by 7 to 14 days. The patient cohort consisted of 220 individuals with chronic LBP. Kinematics of the lumbar spine were sampled during standardized spinal extension-flexion testing using a 6-df instrumented spatial linkage system. Test-retest reliability and measurement error were evaluated using interclass correlation coefficients (ICC(1,1)) and Bland-Altman limits of agreement (LOAs). The overall test-retest reliability (ICC(1,1)) for various motion parameters ranged from 0.51 to 0.70, and relatively wide LOAs were observed for all parameters. Reliability measures in patient subgroups (ICC(1,1)) ranged between 0.34 and 0.77. In general, greater (ICC(1,1)) coefficients and smaller LOAs were found in subgroups with patients examined by the same examiner, patients with a stable pain level, patients with a body mass index less than below 30 kg/m(2), patients who were men, and patients in the Quebec Task Force classifications Group 1. This study shows that sagittal plane kinematic data from patients with chronic LBP may be sufficiently reliable in measurements of groups of patients. However, because of the large LOAs, this test procedure appears unusable at the individual patient level. Furthermore, reliability and measurement error varies substantially among subgroups of patients. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. The Impact of Automation Reliability and Operator Fatigue on Performance and Reliance

    Science.gov (United States)

    2016-09-23

    Cummings et al., 2007). Automation designed to assist operators in overload situations may promote operator disengagement during periods of low...Calhoun et al., 2011). This testbed offers several tasks designed to emulate the cognitive demands that an operator managing multiple UAVs is likely...reliable (Cronbach’s α = 0.94) measure of affective and cognitive components of trust in automation. Items gauge confidence in an automation and

  20. Communication confidence in persons with aphasia.

    Science.gov (United States)

    Babbitt, Edna M; Cherney, Leora R

    2010-01-01

    Communication confidence is a construct that has not been explored in the aphasia literature. Recently, national and international organizations have endorsed broader assessment methods that address quality of life and include participation, activity, and impairment domains as well as psychosocial areas. Individuals with aphasia encounter difficulties in all these areas on a daily basis in living with a communication disorder. Improvements are often reflected in narratives that are not typically included in standard assessments. This article illustrates how a new instrument measuring communication confidence might fit into a broad assessment framework and discusses the interaction of communication confidence, autonomy, and self-determination for individuals living with aphasia.

  1. The reliability of repeated TMS measures in older adults and in patients with subacute and chronic stroke

    Directory of Open Access Journals (Sweden)

    Heidi M. Schambra

    2015-09-01

    Full Text Available The reliability of transcranial magnetic stimulation (TMS measures in healthy older adults and stroke patients has been insufficiently characterized. We determined whether common TMS measures could reliably evaluate change in individuals and in groups using the smallest detectable change (SDC, or could tell subjects apart using the intraclass correlation coefficient (ICC. We used a single-rater test-retest design in older healthy, subacute stroke, and chronic stroke subjects. At twice daily sessions on two consecutive days, we recorded resting motor threshold, test stimulus intensity, recruitment curves, short-interval intracortical inhibition and facilitation, and long-interval intracortical inhibition. Using variances estimated from a random effects model, we calculated the SDC and ICC for each TMS measure. For all TMS measures in all groups, SDCs for single subjects were large; only with modest group sizes did the SDCs become low. Thus, while these TMS measures cannot be reliably used as a biomarker to detect individual change, they can reliably detect change exceeding measurement noise in moderate-sized groups. For several of the TMS measures, ICCs were universally high, suggesting that they can reliably discriminate between subjects. Though most TMS measures have sufficient reliability in particular contexts, work establishing their validity, responsiveness, and clinical relevance is still needed.

  2. Reliable intraocular pressure measurement using automated radio-wave telemetry.

    Science.gov (United States)

    Paschalis, Eleftherios I; Cade, Fabiano; Melki, Samir; Pasquale, Louis R; Dohlman, Claes H; Ciolino, Joseph B

    2014-01-01

    To present an autonomous intraocular pressure (IOP) measurement technique using a wireless implantable transducer (WIT) and a motion sensor. The WIT optical aid was implanted within the ciliary sulcus of a normotensive rabbit eye after extracapsular clear lens extraction. An autonomous wireless data system (AWDS) comprising of a WIT and an external antenna aided by a motion sensor provided continuous IOP readings. The sensitivity of the technique was determined by the ability to detect IOP changes resulting from the administration of latanoprost 0.005% or dorzolamide 2%, while the reliability was determined by the agreement between baseline and vehicle (saline) IOP. On average, 12 diurnal and 205 nocturnal IOP measurements were performed with latanoprost, and 26 diurnal and 205 nocturnal measurements with dorzolamide. No difference was found between mean baseline IOP (13.08±2.2 mmHg) and mean vehicle IOP (13.27±2.1 mmHg) (P=0.45), suggesting good measurement reliability. Both antiglaucoma medications caused significant IOP reduction compared to baseline; latanoprost reduced mean IOP by 10% (1.3±3.54 mmHg; P<0.001), and dorzolamide by 5% (0.62±2.22 mmHg; P<0.001). Use of latanoprost resulted in an overall twofold higher IOP reduction compared to dorzolamide (P<0.001). Repeatability was ±1.8 mmHg, assessed by the variability of consecutive IOP measurements performed in a short period of time (≤1 minute), during which the IOP is not expected to change. IOP measurements in conscious rabbits obtained without the need for human interactions using the AWDS are feasible and provide reproducible results.

  3. Reliable intraocular pressure measurement using automated radio-wave telemetry

    Science.gov (United States)

    Paschalis, Eleftherios I; Cade, Fabiano; Melki, Samir; Pasquale, Louis R; Dohlman, Claes H; Ciolino, Joseph B

    2014-01-01

    Purpose To present an autonomous intraocular pressure (IOP) measurement technique using a wireless implantable transducer (WIT) and a motion sensor. Methods The WIT optical aid was implanted within the ciliary sulcus of a normotensive rabbit eye after extracapsular clear lens extraction. An autonomous wireless data system (AWDS) comprising of a WIT and an external antenna aided by a motion sensor provided continuous IOP readings. The sensitivity of the technique was determined by the ability to detect IOP changes resulting from the administration of latanoprost 0.005% or dorzolamide 2%, while the reliability was determined by the agreement between baseline and vehicle (saline) IOP. Results On average, 12 diurnal and 205 nocturnal IOP measurements were performed with latanoprost, and 26 diurnal and 205 nocturnal measurements with dorzolamide. No difference was found between mean baseline IOP (13.08±2.2 mmHg) and mean vehicle IOP (13.27±2.1 mmHg) (P=0.45), suggesting good measurement reliability. Both antiglaucoma medications caused significant IOP reduction compared to baseline; latanoprost reduced mean IOP by 10% (1.3±3.54 mmHg; P<0.001), and dorzolamide by 5% (0.62±2.22 mmHg; P<0.001). Use of latanoprost resulted in an overall twofold higher IOP reduction compared to dorzolamide (P<0.001). Repeatability was ±1.8 mmHg, assessed by the variability of consecutive IOP measurements performed in a short period of time (≤1 minute), during which the IOP is not expected to change. Conclusion IOP measurements in conscious rabbits obtained without the need for human interactions using the AWDS are feasible and provide reproducible results. PMID:24531415

  4. Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

    Science.gov (United States)

    Sanders, James L; Williams, Robert J

    2016-01-01

    Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.

  5. High inter-rater reliability, agreement, and convergent validity of Constant score in patients with clavicle fractures

    DEFF Research Database (Denmark)

    Ban, Ilija; Troelsen, Anders; Kristensen, Morten Tange

    2016-01-01

    BACKGROUND: The Constant score (CS) has been the primary endpoint in most studies on clavicle fractures. However, the CS was not developed to assess patients with clavicle fractures. Our aim was to examine inter-rater reliability and agreement of the CS in patients with clavicle fractures...... standardized CS assessment at a mean of 6.8 weeks (SD, 1.0 weeks) after injury. Reliability and agreement of the CS were determined by 2 raters. The interclass correlation coefficient (ICC2,1), standard error of measurement, minimal detectable change, Cronbach α coefficient, and Pearson correlation coefficient...... were estimated. RESULTS: Inter-rater reliability of the total CS was excellent (interclass correlation coefficient, 0.94; 95% confidence interval, 0.88-0.97), with no systematic difference between the 2 raters (P = .75). The standard error of measurement (measurement error at the group level) was 4...

  6. Comprehensive Plan for Public Confidence in Nuclear Regulator

    International Nuclear Information System (INIS)

    Choi, Kwang Sik; Choi, Young Sung; Kim, Ho ki

    2008-01-01

    Public confidence in nuclear regulator has been discussed internationally. Public trust or confidence is needed for achieving regulatory goal of assuring nuclear safety to the level that is acceptable by the public or providing public ease for nuclear safety. In Korea, public ease or public confidence has been suggested as major policy goal in the 'Nuclear regulatory policy direction' annually announced. This paper reviews theory of trust, its definitions and defines nuclear safety regulation, elements of public trust or public confidence developed based on the study conducted so far. Public ease model developed and 10 measures for ensuring public confidence are also presented and future study directions are suggested

  7. The reliability and validity of hand-held refractometry water content measures of hydrogel lenses.

    Science.gov (United States)

    Nichols, Jason J; Mitchell, G Lynn; Good, Gregory W

    2003-06-01

    To investigate within- and between-examiner reliability and validity of hand-held refractometry water content measures of hydrogel lenses. Nineteen lenses of various nominal water contents were examined by two examiners on two occasions separated by 1 hour. An Atago N2 hand-held refractometer was used for all water content measures. Lenses were presented in a random order to each examiner by a third party, and examiners were masked to any potential lens identifiers. Intraclass correlation coefficients (ICC), 95% limits of agreement, and Wilcoxon signed rank test were used to characterize the within- and between-examiner reliability and validity of lens water content measures. Within-examiner reliability was excellent (ICC, 0.97; 95% limits of agreement, -3.6% to +5.7%), and the inter-visit mean difference of 1.1 +/- 2.4% was not biased (p = 0.08). Between-examiner reliability was also excellent (ICC, 0.98; 95% limits of agreement, -4.1% to +3.9%). The mean difference between examiners was -0.1 +/- 2.1% (p = 0.83). The mean difference between the nominally reported water content and our water content measures was -2.1 +/- 1.7% (p refractometry and is material dependent. Therefore, investigators may need to account for bias when measuring hydrogel lens water content via hand-held refractometry.

  8. Measuring awareness of financial skills: reliability and validity of a new measure.

    Science.gov (United States)

    Cramer, K; Tuokko, H A; Mateer, C A; Hultsch, D F

    2004-03-01

    This paper examines the psychometric properties of a three-part (participant, informant, and performance) Measure for assessing Awareness of Financial Skills (MAFS). The MAFS was administered to 10 seniors with dementia and 25 well-functioning seniors, and their informants. Measures of cognitive functioning, social desirability, neuroticism, and perceived control were administered to each participant to allow for an assessment of validity. Internal consistency estimates for the participant and informant questionnaires were found to be 0.92 and 0.97, respectively. Convergent validity analysis indicated that performance on this measure was related to level of cognitive functioning, with higher level of unawareness associated with decreased cognitive ability. Discriminant validity analysis showed that performance on this measure was not related to social desirability or neuroticism. This study provides evidence that the MAFS is a reliable and valid tool for assessing awareness of financial skills in older adults.

  9. Political factors in the development and implementation of technology-based confidence-building measures

    International Nuclear Information System (INIS)

    Steinberg, G.M.

    1989-01-01

    The second half of the 20th century has been characterized by the continuous development and improvement of weapons of mass destruction, including strategic bombers, missiles, chemical and biological agents, and of course, a variety of nuclear weapons. In contrast to the massive change in military capabilities brought about by the rapid development of science and technology, international relations is still dominated by relations between sovereign nation states and characterized by distrust and narrow interests. At the same time that scientific developments created the foundation for the nuclear arms race, however the scientific and technical community has also sought some antidotes. Technology-based confidence building measures (TBCBMS), designed to reduce international conflict and to prevent nuclear war, have been proposed by scientists from the US and the USSR. These TBCBMS have taken a number of forms such as cooperative research and development programs, joint panels and meetings of professional societies, and specially dedicated international forums. These have provided a meeting ground for the exchange of views among scientists from many different countries. In addition, a number of more direct forms of TBCBMS, such as satellite-based observation systems and IAEA nuclear safeguards, have national technical means of verification. More recently, there have been a number of proposals to apply many of these technologies to verification of conventional force reduction, arms control, and other confidence-building measures in context of regional conflicts in the Third World. An International Satellite Monitoring Agency has bee proposed to develop space-based technologies such as observation satellites to increase stability and prevent the outbreak of accidental war in regional contexts such as the Middle East

  10. Reliability and Accuracy of Brain Volume Measurement on MR Imaging

    DEFF Research Database (Denmark)

    Yamagchii, Kechiro; Lassen, Anders; Ring, Poul

    1998-01-01

    Yamaguchi, K., Lassen, A. And Ring, P. Reliability and Accuracy of Brain Volume Measurement on MR Imaging. Abstract at ESMRMB98 European Society for Magnetic Resonance in Medicine and Biology, Geneva, Sept 17-20, 1998 Danish Research Center for Magnetic Resonance, Hvidovre University Hospital...

  11. Reliability of the craniocervical posture assessment: visual and angular measurements using photographs and radiographs.

    Science.gov (United States)

    Gadotti, Inae C; Armijo-Olivo, Susan; Silveira, Anelise; Magee, David

    2013-01-01

    The purposes of this study were to determine the intrarater and interrater reliability of the craniocervical posture in a sagittal view using quantitative measurements on photographs and radiographs and to determine the agreement of the visual assessment of posture between raters. One photograph and 1 radiograph of the sagittal craniocervical posture were simultaneously taken from 39 healthy female subjects. Three angles were measured on the photographs and 10 angles on the radiographs of 22 subjects using Alcimage software (Alcimage; Uberlândia, MG, Brazil). Two repeated measurements were performed by 2 raters. The measurements were compared within and between raters to test the intrarater and interrater reliability, respectively. Intraclass correlation coefficient and SEM were used. κ Agreement was calculated for the visual assessment of 39 subjects using photographs and radiographs between 2 raters. Good to excellent intrarater and interrater intraclass correlation coefficient values were found on both photographs and radiographs. Interrater SEM was large and clinically significant for cervical lordosis photogrammetry and for 1 angle measuring cervical lordosis on radiographs. Interrater κ agreement for the visual assessment using photographs was poor (κ = 0.37). The raters were reliable to measure angles in photographs and radiographs to quantify craniocervical posture with exception of 2 angles measuring lordosis of the cervical spine when compared between raters. The visual assessment of posture between raters was not reliable. © 2013. Published by National University of Health Sciences All rights reserved.

  12. Validity and test-retest reliability of manual goniometers for measuring passive hip range of motion in femoroacetabular impingement patients.

    Directory of Open Access Journals (Sweden)

    Nussbaumer Silvio

    2010-08-01

    Full Text Available Abstract Background The aims of this study were to evaluate the construct validity (known group, concurrent validity (criterion based and test-retest (intra-rater reliability of manual goniometers to measure passive hip range of motion (ROM in femoroacetabular impingement patients and healthy controls. Methods Passive hip flexion, abduction, adduction, internal and external rotation ROMs were simultaneously measured with a conventional goniometer and an electromagnetic tracking system (ETS on two different testing sessions. A total of 15 patients and 15 sex- and age-matched healthy controls participated in the study. Results The goniometer provided greater hip ROM values compared to the ETS (range 2.0-18.9 degrees; P P Conclusions The present study suggests that goniometer-based assessments considerably overestimate hip joint ROM by measuring intersegmental angles (e.g., thigh flexion on trunk for hip flexion rather than true hip ROM. It is likely that uncontrolled pelvic rotation and tilt due to difficulties in placing the goniometer properly and in performing the anatomically correct ROM contribute to the overrating of the arc of these motions. Nevertheless, conventional manual goniometers can be used with confidence for longitudinal assessments in the clinic.

  13. Reliability Measure Model for Assistive Care Loop Framework Using Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Venki Balasubramanian

    2010-01-01

    Full Text Available Body area wireless sensor networks (BAWSNs are time-critical systems that rely on the collective data of a group of sensor nodes. Reliable data received at the sink is based on the collective data provided by all the source sensor nodes and not on individual data. Unlike conventional reliability, the definition of retransmission is inapplicable in a BAWSN and would only lead to an elapsed data arrival that is not acceptable for time-critical application. Time-driven applications require high data reliability to maintain detection and responses. Hence, the transmission reliability for the BAWSN should be based on the critical time. In this paper, we develop a theoretical model to measure a BAWSN's transmission reliability, based on the critical time. The proposed model is evaluated through simulation and then compared with the experimental results conducted in our existing Active Care Loop Framework (ACLF. We further show the effect of the sink buffer in transmission reliability after a detailed study of various other co-existing parameters.

  14. Photovoltaic Module Reliability Workshop 2012: February 28 - March 1, 2012

    Energy Technology Data Exchange (ETDEWEB)

    Kurtz, S.

    2013-11-01

    NREL's Photovoltaic (PV) Module Reliability Workshop (PVMRW) brings together PV reliability experts to share information, leading to the improvement of PV module reliability. Such improvement reduces the cost of solar electricity and promotes investor confidence in the technology--both critical goals for moving PV technologies deeper into the electricity marketplace.

  15. Reliability of isometric lower-extremity muscle strength measurements in children with cerebral palsy: implications for measurement design

    NARCIS (Netherlands)

    Willemse, Lydia; Brehm, Merel A.; Scholtes, Vanessa A.; Jansen, Laura; Woudenberg-Vos, Hester; Dallmeijer, Annet J.

    2013-01-01

    Children with cerebral palsy (CP) typically show muscle weakness of the lower extremities, which can be measured with the use of handheld dynamometry (HHD). The purposes of this study were: (1) to determine test-retest reliability and measurement error of isometric lower-extremity strength

  16. The Berg Balance Scale has high intra- and inter-rater reliability but absolute reliability varies across the scale: a systematic review.

    Science.gov (United States)

    Downs, Stephen; Marquez, Jodie; Chiarelli, Pauline

    2013-06-01

    What is the intra-rater and inter-rater relative reliability of the Berg Balance Scale? What is the absolute reliability of the Berg Balance Scale? Does the absolute reliability of the Berg Balance Scale vary across the scale? Systematic review with meta-analysis of reliability studies. Any clinical population that has undergone assessment with the Berg Balance Scale. Relative intra-rater reliability, relative inter-rater reliability, and absolute reliability. Eleven studies involving 668 participants were included in the review. The relative intrarater reliability of the Berg Balance Scale was high, with a pooled estimate of 0.98 (95% CI 0.97 to 0.99). Relative inter-rater reliability was also high, with a pooled estimate of 0.97 (95% CI 0.96 to 0.98). A ceiling effect of the Berg Balance Scale was evident for some participants. In the analysis of absolute reliability, all of the relevant studies had an average score of 20 or above on the 0 to 56 point Berg Balance Scale. The absolute reliability across this part of the scale, as measured by the minimal detectable change with 95% confidence, varied between 2.8 points and 6.6 points. The Berg Balance Scale has a higher absolute reliability when close to 56 points due to the ceiling effect. We identified no data that estimated the absolute reliability of the Berg Balance Scale among participants with a mean score below 20 out of 56. The Berg Balance Scale has acceptable reliability, although it might not detect modest, clinically important changes in balance in individual subjects. The review was only able to comment on the absolute reliability of the Berg Balance Scale among people with moderately poor to normal balance. Copyright © 2013 Australian Physiotherapy Association. Published by .. All rights reserved.

  17. The Relationship Between Eyewitness Confidence and Identification Accuracy: A New Synthesis.

    Science.gov (United States)

    Wixted, John T; Wells, Gary L

    2017-05-01

    The U.S. legal system increasingly accepts the idea that the confidence expressed by an eyewitness who identified a suspect from a lineup provides little information as to the accuracy of that identification. There was a time when this pessimistic assessment was entirely reasonable because of the questionable eyewitness-identification procedures that police commonly employed. However, after more than 30 years of eyewitness-identification research, our understanding of how to properly conduct a lineup has evolved considerably, and the time seems ripe to ask how eyewitness confidence informs accuracy under more pristine testing conditions (e.g., initial, uncontaminated memory tests using fair lineups, with no lineup administrator influence, and with an immediate confidence statement). Under those conditions, mock-crime studies and police department field studies have consistently shown that, for adults, (a) confidence and accuracy are strongly related and (b) high-confidence suspect identifications are remarkably accurate. However, when certain non-pristine testing conditions prevail (e.g., when unfair lineups are used), the accuracy of even a high-confidence suspect ID is seriously compromised. Unfortunately, some jurisdictions have not yet made reforms that would create pristine testing conditions and, hence, our conclusions about the reliability of high-confidence identifications cannot yet be applied to those jurisdictions. However, understanding the information value of eyewitness confidence under pristine testing conditions can help the criminal justice system to simultaneously achieve both of its main objectives: to exonerate the innocent (by better appreciating that initial, low-confidence suspect identifications are error prone) and to convict the guilty (by better appreciating that initial, high-confidence suspect identifications are surprisingly accurate under proper testing conditions).

  18. Reliability of Instruments Measuring At-Risk and Problem Gambling Among Young Individuals

    DEFF Research Database (Denmark)

    Edgren, Robert; Castrén, Sari; Mäkelä, Marjukka

    2016-01-01

    This review aims to clarify which instruments measuring at-risk and problem gambling (ARPG) among youth are reliable and valid in light of reported estimates of internal consistency, classification accuracy, and psychometric properties. A systematic search was conducted in PubMed, Medline, and Psyc......Info covering the years 2009–2015. In total, 50 original research articles fulfilled the inclusion criteria: target age under 29 years, using an instrument designed for youth, and reporting a reliability estimate. Articles were evaluated with the revised Quality Assessment of Diagnostic Accuracy Studies tool....... Reliability estimates were reported for five ARPG instruments. Most studies (66%) evaluated the South Oaks Gambling Screen Revised for Adolescents. The Gambling Addictive Behavior Scale for Adolescents was the only novel instrument. In general, the evaluation of instrument reliability was superficial. Despite...

  19. Smartphone photography utilized to measure wrist range of motion.

    Science.gov (United States)

    Wagner, Eric R; Conti Mica, Megan; Shin, Alexander Y

    2018-02-01

    The purpose was to determine if smartphone photography is a reliable tool in measuring wrist movement. Smartphones were used to take digital photos of both wrists in 32 normal participants (64 wrists) at extremes of wrist motion. The smartphone measurements were compared with clinical goniometry measurements. There was a very high correlation between the clinical goniometry and smartphone measurements, as the concordance coefficients were high for radial deviation, ulnar deviation, wrist extension and wrist flexion. The Pearson coefficients also demonstrated the high precision of the smartphone measurements. The Bland-Altman plots demonstrated 29-31 of 32 smartphone measurements were within the 95% confidence interval of the clinical measurements for all positions of the wrists. There was high reliability between the photography taken by the volunteer and researcher, as well as high inter-observer reliability. Smartphone digital photography is a reliable and accurate tool for measuring wrist range of motion. II.

  20. Myth of the Master Detective: Reliability of Interpretations for Kaufman's "Intelligent Testing" Approach to the WISC-III.

    Science.gov (United States)

    Macmann, Gregg M.; Barnett, David W.

    1997-01-01

    Used computer simulation to examine the reliability of interpretations for Kaufman's "intelligent testing" approach to the Wechsler Intelligence Scale for Children (3rd ed.) (WISC-III). Findings indicate that factor index-score differences and other measures could not be interpreted with confidence. Argues that limitations of IQ testing…

  1. Developing information-space Confidence Building Measures (CBMs) between India and Pakistan

    Energy Technology Data Exchange (ETDEWEB)

    Yamin, Tughral

    2014-06-01

    The Internet has changed the world in ways hitherto unknown. The international financial system, air, land and maritime transport systems are all digitally linked. Similarly most militaries are fully or partially networked. This has not only sped up the decision making processes at all levels, it has also rendered these systems vulnerable to cyber-attacks. Cyber-warfare is now recognized as the most potent form of non-kinetic war fighting. In order to prevent large scale network-attacks, cyber-powers are simultaneously spending a lot of time, money and effort to erect redundant cyber-defenses and enhancing their offensive cyber capabilities. Difficulties in creating a stable environment in information-space stem from differing national perceptions regarding the freedom of the Internet, application of international law and problems associated with attribution. This paper discusses a range of Confidence Building Measures that can be created between India and Pakistan in information-space to control malicious cyber behavior and avert an inadvertent war.

  2. Reliability of Using Motion Sensors to Measure Children’s Physical Activity Levels in Exergaming

    Directory of Open Access Journals (Sweden)

    Nan Zeng

    2018-05-01

    Full Text Available Objectives: This study examined the reliability of two objective measurement tools in assessing children’s physical activity (PA levels in an exergaming setting. Methods: A total of 377 children (190 girls, Mage = 8.39, SD = 1.55 attended the 30-min exergaming class every other day for 18 weeks. Children’s PA levels were concurrently measured by NL-1000 pedometer and ActiGraph GT3X accelerometer, while children’s steps per min and time engaged in sedentary, light, and moderate-to-vigorous PA were estimated, respectively. Results: The results of intraclass correlation coefficient (ICC indicated a low degree of reliability (single measures ICC = 0.03 in accelerometers. ANOVA did detect a possible learning effect for 27 classes (p < 0.01, and the single measures ICC was 0.20 for pedometers. Moreover, there was no significant positive relationship between steps per min and time spent in moderate-to-vigorous physical activity (MVPA. Finally, only 1.3% variance was explained by pedometer as a predictor using Hierarchical Linear Modeling to further explore the relationship between pedometer and accelerometer data. Conclusions: The NL-1000 pedometers and ActiGraph GT3X accelerometers have low reliability in assessing elementary school children’s PA levels during exergaming. More research is warranted in determining the reliable and accurate measurement information regarding the use of modern devices in exergaming setting.

  3. Is consumer confidence an indicator of JSE performance?

    OpenAIRE

    Kamini Solanki; Yudhvir Seetharam

    2014-01-01

    While most studies examine the impact of business confidence on market performance, we instead focus on the consumer because consumer spending habits are a natural extension of trading activity on the equity market. This particular study examines investor sentiment as measured by the Consumer Confidence Index in South Africa and its effect on the Johannesburg Stock Exchange (JSE). We employ Granger causality tests to investigate the relationship across time between the Consumer Confidence Ind...

  4. Supersonic shear imaging provides a reliable measurement of resting muscle shear elastic modulus

    International Nuclear Information System (INIS)

    Lacourpaille, Lilian; Hug, François; Bouillard, Killian; Nordez, Antoine; Hogrel, Jean-Yves

    2012-01-01

    The aim of the present study was to assess the reliability of shear elastic modulus measurements performed using supersonic shear imaging (SSI) in nine resting muscles (i.e. gastrocnemius medialis, tibialis anterior, vastus lateralis, rectus femoris, triceps brachii, biceps brachii, brachioradialis, adductor pollicis obliquus and abductor digiti minimi) of different architectures and typologies. Thirty healthy subjects were randomly assigned to the intra-session reliability (n = 20), inter-day reliability (n = 21) and the inter-observer reliability (n = 16) experiments. Muscle shear elastic modulus ranged from 2.99 (gastrocnemius medialis) to 4.50 kPa (adductor digiti minimi and tibialis anterior). On the whole, very good reliability was observed, with a coefficient of variation (CV) ranging from 4.6% to 8%, except for the inter-operator reliability of adductor pollicis obliquus (CV = 11.5%). The intraclass correlation coefficients were good (0.871 ± 0.045 for the intra-session reliability, 0.815 ± 0.065 for the inter-day reliability and 0.709 ± 0.141 for the inter-observer reliability). Both the reliability and the ease of use of SSI make it a potentially interesting technique that would be of benefit to fundamental, applied and clinical research projects that need an accurate assessment of muscle mechanical properties. (note)

  5. The development and validation of measures to assess cooking skills and food skills.

    Science.gov (United States)

    Lavelle, Fiona; McGowan, Laura; Hollywood, Lynsey; Surgenor, Dawn; McCloat, Amanda; Mooney, Elaine; Caraher, Martin; Raats, Monique; Dean, Moira

    2017-09-02

    With the increase use of convenience food and eating outside the home environment being linked to the obesity epidemic, the need to assess and monitor individuals cooking and food skills is key to help intervene where necessary to promote the usage of these skills. Therefore, this research aimed to develop and validate a measure for cooking skills and one for food skills, that are clearly described, relatable, user-friendly, suitable for different types of studies, and applicable across all sociodemographic levels. Two measures were developed in light of the literature and expert opinion and piloted for clarity and ease of use. Following this, four studies were undertaken across different cohorts (including a sample of students, both 'Food preparation novices' and 'Experienced food preparers', and a nationally representative sample) to assess temporal stability, psychometrics, internal consistency reliability and construct validity of both measures. Analysis included T-tests, Pearson's correlations, factor analysis, and Cronbach's alphas, with a significance level of 0.05. Both measures were found to have a significant level of temporal stability (P cooking skills confidence measure ranged from 0.78 to 0.93 across all cohorts. The food skills confidence measure's Cronbach's alpha's ranged from 0.85 to 0.94. The two measures also showed a high discriminate validity as there were significant differences (P cooking skills confidence and P cooking skills confidence measure and the food skills confidence measure have been shown to have a very satisfactory reliability, validity and are consistent over time. Their user-friendly applicability make both measures highly suitable for large scale cross-sectional, longitudinal and intervention studies to assess or monitor cooking and food skills levels and confidence.

  6. The reliability of prayer-based self-efficacy scale to assess self-confidence of Muslims with low back pain.

    Science.gov (United States)

    Al-Obaidi, Saud; Wall, James C; Mulekar, Madhuri S; Al-Mutairie, Rebecca

    2012-06-01

    Low back pain (LBP) may challenge an individual's self-confidence to perform usual daily activities such as Islamic daily prayer. Existing self-efficacy scales may not be appropriate to assess individual's self-confidence to perform Islamic prayers. This study aimed to develop a scale to assess self-confidence to prepare and perform Islamic prayer in the presence of LBP, the Islamic Prayer-based Self-efficacy Scale (IpbSeS), and to determine its consistency. The IpbSeS consists of three parts: pre-prayer preparation, getting to and from the mosque, and positions and movements during prayer. On a scale of 0 to 6, 0 indicates 'not at all confident' and 6 'fully confident'. Sixty individuals with LBP gave their responses on two different visits. Pain intensity was assessed by the Visual Analogue Scale (VAS), and the pain intensity changes were assessed using a seven-point global patient rating scale. Descriptive statistics, Pearson's correlation coefficient, Wilcoxon test and t-test were used in the analysis (alpha set at 0.05). VAS scores did not differ significantly between visits. No association was found between VAS and age (r = 0.039, p = 0.77) and between VAS and body mass index (BMI; r = 0.06, p = 0. 67). All 28 questions have consistent responses on two visits (0.75 ≤ r ≤ 0.99, p Muslims in the presence of LBP to pray. Copyright © 2011 John Wiley & Sons, Ltd.

  7. How to measure distinct components of visual attention fast and reliably

    DEFF Research Database (Denmark)

    Vangkilde, Signe Allerup; Kyllingsbæk, Søren; Habekost, Thomas

    2009-01-01

    Measuring different attentional processes in a fast and reliable way is important in both clinical and experimental settings. However, most tests of visual attention are either lengthy or lack sensitivity, specificity, and reliability. To address this we developed a ten minute test procedure...... for the Swedish Betula-project, a longitudinal study investigating changes in cognitive functions over the adult life span (Nilsson et al., 2004). The test consists of a computer-based letter recognition task with stimulus displays of varied durations followed by pattern masks or a blank screen. The temporal...

  8. Reliability and responsiveness of algometry for measuring pressure pain threshold in patients with knee osteoarthritis.

    Science.gov (United States)

    Mutlu, Ebru Kaya; Ozdincler, Arzu Razak

    2015-06-01

    [Purpose] This study aimed to establish the intrarater reliability and responsiveness of a clinically available algometer in patients with knee osteoarthritis as well as to determine the minimum-detectable-change and standard error of measurement of testing to facilitate clinical interpretation of temporal changes. [Subjects] Seventy-three patients with knee osteoarthritis were included. [Methods] Pressure pain threshold measured by algometry was evaluated 3 times at 2-min intervals over 2 clinically relevant sites-mediolateral to the medial femoral tubercle (distal) and lateral to the medial malleolus (local)-on the same day. Intrarater reliability was estimated by intraclass correlation coefficients. The minimum-detectable-change and standard error of measurement were calculated. As a measure of responsiveness, the effect size was calculated for the results at baseline and after treatment. [Results] The intrarater reliability was almost perfect (intraclass correlation coefficient = 0.93-0.97). The standard error of measurement and minimum-detectable-change were 0.70-0.66 and 1.62-1.53, respectively. The pressure pain threshold over the distal site was inadequately responsive in knee osteoarthritis, but the local site was responsive. The effect size was 0.70. [Conclusion] Algometry is reliable and responsive to assess measures of pressure pain threshold for evaluating pain patients with knee osteoarthritis.

  9. Confidence assessment. Site descriptive modelling SDM-Site Forsmark

    International Nuclear Information System (INIS)

    2008-09-01

    distribution and size-intensity models for fractures at repository depth can only be reduced by data from underground, i.e. from fracture mapping of tunnel walls etc. Specifically it will be necessary to carry out statistical modelling of fractures in a DFN study at depth during construction work on the access ramp and shafts. Uncertainties in stress magnitude will be reduced by observations and measurements of deformation with back analysis during the construction phase. Underground mapping data from deposition tunnels will allow fore a division of the fine-grained granitoid into different rock types. This will enable thermal optimisation of the repository. The next step in confidence building would be to predict conditions and impacts from underground tunnels. Tunnel data will provide information about the fracture size distribution at the relevant depths. The underground excavations will also provide possibilities for short-range interference tests at relevant depth. Uncertainties in understanding chemical processes may be reduced by assessing results from underground monitoring (groundwater chemistry; fracture minerals etc) of the effects of drawdown and inflows during excavation. The hydrogeological DFN fitting parameters for fractures within the repository volume can only be properly constrained by mapping of flowing or potentially open fracture statistics in tunnels. Surface outcrop statistics are not relevant for properties at repository depth. During underground investigations, the flowing fracture frequencies in tunnels and investigations of couplings between rock mechanical properties and fracture transmissivities may give clues to the extent of in-plane flow channelling which will lead to more reliable models for transport from the repository volume, particularly close to deposition holes where the most important retention and retardation of any released radionuclides may occur in the rock barrier

  10. Magnetic resonance imaging of shoulders with idiopathic adhesive capsulitis: reliability of measures

    Energy Technology Data Exchange (ETDEWEB)

    Lefevre-Colau, Marie-Martine; Fayad, Fouad; Rannou, Francois; Demaille-Wlodyka, Samantha; Mayoux-Benhamou, Marie-Anne; Poiraudeau, Serge; Revel, Michel [Universite Rene Descartes, Department of Physical and Rehabilitation Medicine, Hopital Cochin (AP-HP), Paris (France); Drape, Jean-Luc; Diche, Thierry; Minvielle, Francois [Hopital Cochin (AP-HP), Department of Radiology B, Paris (France); Fermanian, Jacques [Universite Rene Descartes, Department of Biostatistics, Hopital Necker (AP-HP), Paris (France)

    2005-12-01

    The magnetic resonance imaging (MRI) findings in idiopathic adhesive capsulitis (AC) were compared with those of contralateral healthy shoulders and the reliability of measures assessed. Twenty-six consecutive patients (26 AC and 14 healthy shoulders) were prospectively assessed. The main measurements were thickness of the joint capsule and synovial membrane in the axillary recess and rotator interval in T1-weighted spin-echo sequence enhanced with intravenous (IV) gadolinium chelate (Gd-chelate). Reliability was studied by use of the intraclass correlation coefficient (ICC). The mean thickness of the axillary recess on the coronal plane was 9.0{+-}2.2 mm in AC shoulders and 0.4{+-}0.7 mm in healthy shoulders. The mean thickness of the rotator interval on the sagittal plane was 8.4{+-}2.8 in AC shoulders and 0.6{+-}0.8 mm in healthy shoulders. Interobserver reliability was good for the axillary recess, with ICC values of 0.84 for the coronal plane, and good for the rotator interval, with ICC values of 0.80 for the sagittal plane. MRI with IV Gd-chelate injection can show, with acceptable reliability, signal and thickness abnormalities of the shoulder joint capsule and synovial membrane in AC. (orig.)

  11. Magnetic resonance imaging of shoulders with idiopathic adhesive capsulitis: reliability of measures

    International Nuclear Information System (INIS)

    Lefevre-Colau, Marie-Martine; Fayad, Fouad; Rannou, Francois; Demaille-Wlodyka, Samantha; Mayoux-Benhamou, Marie-Anne; Poiraudeau, Serge; Revel, Michel; Drape, Jean-Luc; Diche, Thierry; Minvielle, Francois; Fermanian, Jacques

    2005-01-01

    The magnetic resonance imaging (MRI) findings in idiopathic adhesive capsulitis (AC) were compared with those of contralateral healthy shoulders and the reliability of measures assessed. Twenty-six consecutive patients (26 AC and 14 healthy shoulders) were prospectively assessed. The main measurements were thickness of the joint capsule and synovial membrane in the axillary recess and rotator interval in T1-weighted spin-echo sequence enhanced with intravenous (IV) gadolinium chelate (Gd-chelate). Reliability was studied by use of the intraclass correlation coefficient (ICC). The mean thickness of the axillary recess on the coronal plane was 9.0±2.2 mm in AC shoulders and 0.4±0.7 mm in healthy shoulders. The mean thickness of the rotator interval on the sagittal plane was 8.4±2.8 in AC shoulders and 0.6±0.8 mm in healthy shoulders. Interobserver reliability was good for the axillary recess, with ICC values of 0.84 for the coronal plane, and good for the rotator interval, with ICC values of 0.80 for the sagittal plane. MRI with IV Gd-chelate injection can show, with acceptable reliability, signal and thickness abnormalities of the shoulder joint capsule and synovial membrane in AC. (orig.)

  12. Validity and reliability of a structured-light 3D scanner and an ultrasound imaging system for measurements of facial skin thickness.

    Science.gov (United States)

    Lee, Kang-Woo; Kim, Sang-Hwan; Gil, Young-Chun; Hu, Kyung-Seok; Kim, Hee-Jin

    2017-10-01

    Three-dimensional (3 D)-scanning-based morphological studies of the face are commonly included in various clinical procedures. This study evaluated validity and reliability of a 3 D scanning system by comparing the ultrasound (US) imaging system versus the direct measurement of facial skin. The facial skin thickness at 19 landmarks was measured using the three different methods in 10 embalmed adult Korean cadavers. Skin thickness was first measured using the ultrasound device, then 3 D scanning of the facial skin surface was performed. After the skin on the left half of face was gently dissected, deviating slightly right of the midline, to separate it from the subcutaneous layer, and the harvested facial skin's thickness was measured directly using neck calipers. The dissected specimen was then scanned again, then the scanned images of undissected and dissected faces were superimposed using Morpheus Plastic Solution (version 3.0) software. Finally, the facial skin thickness was calculated from the superimposed images. The ICC value for the correlations between the 3 D scanning system and direct measurement showed excellent reliability (0.849, 95% confidence interval = 0.799-0.887). Bland-Altman analysis showed a good level of agreement between the 3 D scanning system and direct measurement (bias = 0.49 ± 0.49 mm, mean±SD). These results demonstrate that the 3 D scanning system precisely reflects structural changes before and after skin dissection. Therefore, an in-depth morphological study using this 3 D scanning system could provide depth data about the main anatomical structures of face, thereby providing crucial anatomical knowledge for utilization in various clinical applications. Clin. Anat. 30:878-886, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  13. Reliability of ultrasound for measurement of selected foot structures.

    Science.gov (United States)

    Crofts, G; Angin, S; Mickle, K J; Hill, S; Nester, C J

    2014-01-01

    Understanding the relationship between the lower leg muscles, foot structures and function is essential to explain how disease or injury may relate to changes in foot function and clinical pathology. The aim of this study was to investigate the inter-operator reliability of an ultrasound protocol to quantify features of: rear, mid and forefoot sections of the plantar fascia (PF); flexor hallucis brevis (FHB); flexor digitorum brevis (FDB); abductor hallucis (AbH); flexor digitorum longus (FDL); flexor hallucis longus (FHL); tibialis anterior (TA); and peroneus longus and brevis (PER). A sample of 6 females and 4 males (mean age 29.1 ± 7.2 years, mean BMI 25.5 ± 4.8) was recruited from a university student and staff population. Scans were obtained using a portable Venue 40 musculoskeletal ultrasound system (GE Healthcare UK) with a 5-13 MHz wideband linear array probe with a 12.7 mm × 47.1mm footprint by two operators in the same scanning session. Intraclass Correlation Coefficients (ICC) values for muscle thickness (ICC range 0.90-0.97), plantar fascia thickness (ICC range 0.94-0.98) and cross sectional muscle measurements (ICC range 0.91-0.98) revealed excellent inter-operator reliability. The limits of agreement, relative to structure size, ranged from 9.0% to 17.5% for muscle thickness, 11.0-18.0% for plantar fascia, and 11.0-26.0% for cross sectional area measurements. The ultrasound protocol implemented in this work has been shown to be reliable. It therefore offers the opportunity to quantify the structures concerned and better understand their contributions to foot function. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.

  14. Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

    Science.gov (United States)

    Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

    2018-01-01

    The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.

  15. Psychometric properties of the Confidence and Trust in Delivery Questionnaire (CTDQ: a pilot study

    Directory of Open Access Journals (Sweden)

    Jeschke Elke

    2012-09-01

    Full Text Available Abstract Background Assessing expecting mother’s opinions prior to birth draws a comprehensive picture for the caregivers about their emotional state and their expectations. Some questionnaires to cover these aspects do exist. This study aims to present the psychometric properties of a new instrument, the Confidence and Trust in Delivery Questionnaire (CDTQ a short but reliable a self-report instrument that focuses on confidence and trust as meaningful dimensions for expectant mothers. Methods A pilot validation study of 221 women 6 weeks before childbirth was conducted in Germany between October 2007 and June 2008. To detect structural relations between the items, factor and reliability analyses were applied to the CTDQ items. Factor analysis was performed by means of principal components analysis and varimax rotation. Internal reliability was assessed by Cronbach’s alpha. External validation was performed using the sense of coherence (SOC scale. Results The CTDQ comprises of 11 items. We found a 4-factor structure. The internal consistency of the whole item pool (Cronbach’s α = 0.79 and the 4 subscales [confidence in labor (α = 0.82; partner’s support (α = 0.62; trust in medical competency (α = 0.68; being informed (α = 0.60] can be regarded as sufficient or even excellent. The 4 factors explained 69.6% of total variance. Except for a high intercorrelation (0.70 between “partner’s support” and “trust in medical competence”, the subscales show low intercorrelations, indicating an adequate independence of the respective subscales. Regarding the external validity we found minor respective moderate correlations with the SOC scale. Conclusions Our data suggest that the CTDQ is a useful instrument to assess confidence and trust in delivery. With 4 clinically relevant dimensions, the CTDQ is now open for further studies in the field of labor.

  16. How to measure wisdom: content, reliability, and validity of five measures

    Science.gov (United States)

    Glück, Judith; König, Susanne; Naschenweng, Katja; Redzanowski, Uwe; Dorner, Lara; Straßer, Irene; Wiedermann, Wolfgang

    2013-01-01

    Wisdom is a field of growing interest both inside and outside academic psychology, and researchers are increasingly interested in using measures of wisdom in their work. However, wisdom is a highly complex construct, and its various operationalizations are based on quite different definitions. Which measure a researcher chooses for a particular research project may have a strong influence on the results. This study compares four well-established measures of wisdom—the Self-Assessed Wisdom Scale (Webster, 2003, 2007), the Three-Dimensional Wisdom Scale (Ardelt, 2003), the Adult Self-Transcendence Inventory (Levenson et al., 2005), and the Berlin Wisdom Paradigm (Baltes and Smith, 1990; Baltes and Staudinger, 2000)—with respect to content, reliability, factorial structure, and construct validity (relationships to wisdom nomination, interview-based wisdom ratings, and correlates of wisdom). The sample consisted of 47 wisdom nominees and 123 control participants. While none of the measures performed “better” than the others by absolute standards, recommendations are given for researchers to select the most suitable measure for their substantive interests. In addition, a “Brief Wisdom Screening Scale” is introduced that contains those 20 items from the three self-report scales that were most highly correlated with the common factor across the scales. PMID:23874310

  17. Reliability and Measurement Error of Tensiomyography to Assess Mechanical Muscle Function: A Systematic Review.

    Science.gov (United States)

    Martín-Rodríguez, Saúl; Loturco, Irineu; Hunter, Angus M; Rodríguez-Ruiz, David; Munguia-Izquierdo, Diego

    2017-12-01

    Martín-Rodríguez, S, Loturco, I, Hunter, AM, Rodríguez-Ruiz, D, and Munguia-Izquierdo, D. Reliability and measurement error of tensiomyography to assess mechanical muscle function: A systematic review. J Strength Cond Res 31(12): 3524-3536, 2017-Interest in studying mechanical skeletal muscle function through tensiomyography (TMG) has increased in recent years. This systematic review aimed to (a) report the reliability and measurement error of all TMG parameters (i.e., maximum radial displacement of the muscle belly [Dm], contraction time [Tc], delay time [Td], half-relaxation time [½ Tr], and sustained contraction time [Ts]) and (b) to provide critical reflection on how to perform accurate and appropriate measurements for informing clinicians, exercise professionals, and researchers. A comprehensive literature search was performed of the Pubmed, Scopus, Science Direct, and Cochrane databases up to July 2017. Eight studies were included in this systematic review. Meta-analysis could not be performed because of the low quality of the evidence of some studies evaluated. Overall, the review of the 9 studies involving 158 participants revealed high relative reliability (intraclass correlation coefficient [ICC]) for Dm (0.91-0.99); moderate-to-high ICC for Ts (0.80-0.96), Tc (0.70-0.98), and ½ Tr (0.77-0.93); and low-to-high ICC for Td (0.60-0.98), independently of the evaluated muscles. In addition, absolute reliability (coefficient of variation [CV]) was low for all TMG parameters except for ½ Tr (CV = >20%), whereas measurement error indexes were high for this parameter. In conclusion, this study indicates that 3 of the TMG parameters (Dm, Td, and Tc) are highly reliable, whereas ½ Tr demonstrate insufficient reliability, and thus should not be used in future studies.

  18. The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals

    Energy Technology Data Exchange (ETDEWEB)

    Fuller, Joel T.; Buckley, Jonathan D.; Tsiros, Margarita D.; Thewlis, Dominic [University of South Australia, Alliance for Research in Exercise, Nutrition and Activity (ARENA), Sansom Institute for Health Research, GPO Box 2471, Adelaide, South Australia (Australia); Archer, Jane [University of South Australia, Medical Radiation, School of Health Sciences, Adelaide (Australia)

    2016-01-15

    To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5 %; limits of agreement [LOA] ≤4.2 %). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2 %; LOA ≤11.9 %). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2 %; LOA ≤23.6 %) and fourth metatarsals (ICC ≥0.67; CV ≤9.6 %; LOA ≤27.5 %). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals. (orig.)

  19. The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals

    International Nuclear Information System (INIS)

    Fuller, Joel T.; Buckley, Jonathan D.; Tsiros, Margarita D.; Thewlis, Dominic; Archer, Jane

    2016-01-01

    To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5 %; limits of agreement [LOA] ≤4.2 %). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2 %; LOA ≤11.9 %). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2 %; LOA ≤23.6 %) and fourth metatarsals (ICC ≥0.67; CV ≤9.6 %; LOA ≤27.5 %). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals. (orig.)

  20. The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals.

    Science.gov (United States)

    Fuller, Joel T; Archer, Jane; Buckley, Jonathan D; Tsiros, Margarita D; Thewlis, Dominic

    2016-01-01

    To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5%; limits of agreement [LOA] ≤4.2%). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2%; LOA ≤11.9%). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2%; LOA ≤23.6%) and fourth metatarsals (ICC ≥0.67; CV ≤9.6%; LOA ≤27.5%). BMD was greatest in the first and second metatarsals (P Reliable measurements of BMD were achieved for the first, second and fifth metatarsals.

  1. Research on reliability measures of the main transformer and GIS equipment manufacturing process

    International Nuclear Information System (INIS)

    Wu Honglong

    2014-01-01

    Based on the accidents of the main transformer GIS equipment and the accidents of the high voltage switch equipment, combined with the main transformer switch equipment maintenance experience and electrical theory, the reliability measures of the main transformer GIS equipment during manufacturing stage are studied and improved. Six successful reliability measures are identified: 1) design properly and check the ability of transformer for anti short circuit; 2) choose mature and reliable main transformer HV bushing; 3) choose GIS switch operation mechanism of high quality and reliability; 4) ensure that the insulation margin through tests piece by piece on withstand voltage and partial discharge of the GIS equipment insulation; 5) take test measures such as GIS conductor, shell polishing witness process and full form lightning impulse, to find out and eliminate the defects of abnormal electric field distribution; 6) Anti VFTO design for the main transformer connected with GIS with the voltage of 500 kV should be considered, and its anti VFTO ability to meet the safe operation under VFTO requirements should be checked. This paper proposed 2 new measures: 1) the main transformer insulation material quality standard is determined not only by its high dielectric strength, but more importantly by the homogeneous dielectric electric strength. Insulating Materials with a high and also uniform dielectric strength should be chosen. 2) During the silver-coating stage of the GIS equipment conductor, QC group activities should be organized to ensure that the plating layer quality, and the current lap surface DC resistance measurements should be supervised and witnessed to ensure the quality of the conductor contact surface. These measures are verified in Fuqing project of GIS main transformer equipment manufacturing process, and their effectiveness is proven. (author)

  2. Building and strengthening confidence and security in Asia

    International Nuclear Information System (INIS)

    Corden, P.S.

    1992-01-01

    This paper presents a few thoughts on the question of building and strengthening confidence and security in Asia, in particular in the area centred on the Korean peninsula. This question includes the process of establishing and implementing confidence- and security-building measures, some of which might involve States other than North and South Korea. The development of CSBMs has now been well established in Europe, and there are encouraging signs that such measures are taking hold in other areas of the world, including in Korea. Consequently there is a fairly rich mine of information, precedent and experience from which to draw in focusing on the particular subject at hand. In these remarks the concept of confidence- and security-building is briefly addressed and measures are examined that have proven useful in other circumstances and review some possibilities that appear of interest in the present context

  3. Reliability and Validity of Selected PROMIS Measures in People with Rheumatoid Arthritis.

    Directory of Open Access Journals (Sweden)

    Susan J Bartlett

    Full Text Available To evaluate the reliability and validity of 11 PROMIS measures to assess symptoms and impacts identified as important by people with rheumatoid arthritis (RA.Consecutive patients (N = 177 in an observational study completed PROMIS computer adapted tests (CATs and a short form (SF assessing pain, fatigue, physical function, mood, sleep, and participation. We assessed test-test reliability and internal consistency using correlation and Cronbach's alpha. We assessed convergent validity by examining Pearson correlations between PROMIS measures and existing measures of similar domains and known groups validity by comparing scores across disease activity levels using ANOVA.Participants were mostly female (82% and white (83% with mean (SD age of 56 (13 years; 24% had ≤ high school, 29% had RA ≤ 5 years with 13% ≤ 2 years, and 22% were disabled. PROMIS Physical Function, Pain Interference and Fatigue instruments correlated moderately to strongly (rho's ≥ 0.68 with corresponding PROs. Test-retest reliability ranged from .725-.883, and Cronbach's alpha from .906-.991. A dose-response relationship with disease activity was evident in Physical Function with similar trends in other scales except Anger.These data provide preliminary evidence of reliability and construct validity of PROMIS CATs to assess RA symptoms and impacts, and feasibility of use in clinical care. PROMIS instruments captured the experiences of RA patients across the broad continuum of RA symptoms and function, especially at low disease activity levels. Future research is needed to evaluate performance in relevant subgroups, assess responsiveness and identify clinically meaningful changes.

  4. Assessment of intra-interobserver reliability of the sonographic optic nerve sheath diameter measurement

    Directory of Open Access Journals (Sweden)

    Tuba Cimilli Ozturk

    2015-08-01

    Full Text Available Diagnosis and measuring the level of increase in intracranial pressure (ICP is critical, especially for the management of trauma patients in the emergency department and intensive care unit. However, measurements are operator-dependent as in all of the sonographic diagnoses. The aim of this study is to assess the operator variations in the measurement of optic nerve sheath diameter (ONSD. There were four emergency medicine specialists involved in the study. Each had at least 1 year of experience of ultrasound scans and performed at least 25 prior ocular scans examining the ONSD. Two measurements were made 1 week apart from both axial and longitudinal planes. Sixty healthy adults were involved in the study and every investigator obtained four measurements from each. Intra-interobserver reliabilities were tested. The investigators performed 60 ocular ultrasounds on individual healthy adults and obtained two measurements in axial and longitudinal planes 1 week apart. Therefore, 960 measurements were analyzed. The levels of compatibilities for most of the measurements were found at acceptable levels statistically. However, it is not possible to say that there was a perfect compatibility among the sonographers according to the previously conducted reliability studies of ultrasound measurements. According to our results, it is hard to say that sonographic measurement of the ONSD is a highly reliable method both in longitudinal and transverse planes.

  5. Test-Retest Reliability of Isokinetic Knee Strength Measurements in Children Aged 8 to 10 Years.

    Science.gov (United States)

    Fagher, Kristina; Fritzson, Annelie; Drake, Anna Maria

    Isokinetic dynamometry is a useful tool to objectively assess muscle strength of children and adults in athletic and rehabilitative settings. This study examined test-retest reliability of isokinetic knee strength measurements in children aged 8 to 10 years and defined limits for the minimum difference (MD) in strength that indicates a clinically important change. Isokinetic knee strength measurements (using the Biodex System 4) in children will provide reliable results. Descriptive laboratory study. In 22 healthy children, 5 maximal concentric (CON) knee extensor (KE) and knee flexor (KF) contractions at 2 angular velocities (60 deg/s and 180 deg/s) and 5 maximal eccentric (ECC) KE/KF contractions at 60 deg/s were assessed 7 days apart. The intraclass correlation coefficient (ICC 2.1 ) was used to examine relative reliability, and the MD was calculated on the basis of standard error of measurement. ICCs for CON KE/KF peak torque measurements were fair to excellent (range, 0.49-0.81). The MD% values for CON KE and KF ranged from 31% to 37% at 60 deg/s and from 34% to 39% at 180 deg/s. ICCs in the ECC mode were good (range, 0.60-0.70), but associated MD% values were high (>50%). There was no systematic error for CON KE/KF and ECC KE strength measurements at 60 deg/s, but systematic error was found for all other measurements. The dynamometer provides a reliable analysis of isokinetic CON knee strength measurements at 60 deg/s in children aged 8 to 10 years. Measurements at 180 deg/s and in the ECC mode were not reliable, indicating a need for more familiarization prior to testing. The MD values may help clinicians to determine whether a change in knee strength is due to error or intervention.

  6. Reliability of Pressure Ulcer Rates: How Precisely Can We Differentiate Among Hospital Units, and Does the Standard Signal‐Noise Reliability Measure Reflect This Precision?

    Science.gov (United States)

    Cramer, Emily

    2016-01-01

    Abstract Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital‐acquired pressure ulcer rates and evaluate a standard signal‐noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step‐down, medical, surgical, and medical‐surgical nursing units from 1,299 US hospitals were analyzed. Using beta‐binomial models, we estimated between‐unit variability (signal) and within‐unit variability (noise) in annual unit pressure ulcer rates. Signal‐noise reliability was computed as the ratio of between‐unit variability to the total of between‐ and within‐unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal‐noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal‐noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc. PMID:27223598

  7. Accounting for measurement reliability to improve the quality of inference in dental microhardness research: a worked example.

    Science.gov (United States)

    Sever, Ivan; Klaric, Eva; Tarle, Zrinka

    2016-07-01

    Dental microhardness experiments are influenced by unobserved factors related to the varying tooth characteristics that affect measurement reproducibility. This paper explores the appropriate analytical tools for modeling different sources of unobserved variability to reduce the biases encountered and increase the validity of microhardness studies. The enamel microhardness of human third molars was measured by Vickers diamond. The effects of five bleaching agents-10, 16, and 30 % carbamide peroxide, and 25 and 38 % hydrogen peroxide-were examined, as well as the effect of artificial saliva and amorphous calcium phosphate. To account for both between- and within-tooth heterogeneity in evaluating treatment effects, the statistical analysis was performed in the mixed-effects framework, which also included the appropriate weighting procedure to adjust for confounding. The results were compared to those of the standard ANOVA model usually applied. The weighted mixed-effects model produced the parameter estimates of different magnitude and significance than the standard ANOVA model. The results of the former model were more intuitive, with more precise estimates and better fit. Confounding could seriously bias the study outcomes, highlighting the need for more robust statistical procedures in dental research that account for the measurement reliability. The presented framework is more flexible and informative than existing analytical techniques and may improve the quality of inference in dental research. Reported results could be misleading if underlying heterogeneity of microhardness measurements is not taken into account. The confidence in treatment outcomes could be increased by applying the framework presented.

  8. Reliability

    OpenAIRE

    Condon, David; Revelle, William

    2017-01-01

    Separating the signal in a test from the irrelevant noise is a challenge for all measurement. Low test reliability limits test validity, attenuates important relationships, and can lead to regression artifacts. Multiple approaches to the assessment and improvement of reliability are discussed. The advantages and disadvantages of several different approaches to reliability are considered. Practical advice on how to assess reliability using open source software is provided.

  9. PV Systems Reliability Final Technical Report.

    Energy Technology Data Exchange (ETDEWEB)

    Lavrova, Olga [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Flicker, Jack David [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Johnson, Jay [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Armijo, Kenneth Miguel [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Gonzalez, Sigifredo [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Schindelholz, Eric John [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Sorensen, Neil R. [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Yang, Benjamin Bing-Yeh [Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)

    2015-12-01

    The continued exponential growth of photovoltaic technologies paves a path to a solar-powered world, but requires continued progress toward low-cost, high-reliability, high-performance photovoltaic (PV) systems. High reliability is an essential element in achieving low-cost solar electricity by reducing operation and maintenance (O&M) costs and extending system lifetime and availability, but these attributes are difficult to verify at the time of installation. Utilities, financiers, homeowners, and planners are demanding this information in order to evaluate their financial risk as a prerequisite to large investments. Reliability research and development (R&D) is needed to build market confidence by improving product reliability and by improving predictions of system availability, O&M cost, and lifetime. This project is focused on understanding, predicting, and improving the reliability of PV systems. The two areas being pursued include PV arc-fault and ground fault issues, and inverter reliability.

  10. Measuring the airway in 3 dimensions: a reliability and accuracy study.

    Science.gov (United States)

    El, Hakan; Palomo, Juan Martin

    2010-04-01

    The aim of the study was to compare the reliability and accuracy of 3 commercially available digital imaging and communications in medicine (DICOM) viewers for measuring upper airway volumes. Thirty cone-beam computed tomography scans were randomly selected, and the upper airway volumes were calculated for both oropharynx and nasal passage. Dolphin3D (version 11, Dolphin Imaging & Management Solutions, Chatsworth, Calif), InVivoDental (version 4.0.70, Anatomage, San Jose, Calif), and OnDemand3D (version 1.0.1.8407, CyberMed, Seoul, Korea) were compared with a previously tested manual segmentation program called OrthoSegment (OS) (developed at the Department of Orthodontics at Case Western Reserve University, Cleveland, Ohio). The measurements were repeated after 2 weeks, and the ICC was used for the reliability tests. All commercially available programs were compared with the OS program by using regression analysis. The Pearson correlation was used to evaluate the correlation between the OS and the automatic segmentation programs. The reliability was high for all programs. The highest correlation found was between the OS and Dolphin3D for the oropharynx, and between the OS and InVivoDental for nasal passage volume. A high correlation was found for all programs, but the results also showed statistically significant differences compared with the OS program. The programs also had inconsistencies among themselves. The 3 commercially available DICOM viewers are highly reliable in their airway volume calculations and showed high correlation of results but poor accuracy, suggesting systematic errors. Copyright 2010 American Association of Orthodontists. Published by Mosby, Inc. All rights reserved.

  11. "Reliability of the Norwegian version of the short physical performance battery in older people with and without dementia".

    Science.gov (United States)

    Olsen, Cecilie Fromholt; Bergland, Astrid

    2017-06-09

    The purpose of the study was to establish the test-retest reliability of the Norwegian version of the Short Physical Performance Battery (SPPB). This was a cross- sectional reliability study. A convenience sample of 61 older adults with a mean age of 88.4(8.1) was tested by two different physiotherapists at two time points. The mean time interval between tests was 2.5 days. The Intraclass Correlation Coefficient model 3.1 (ICC, 3.1) with 95% confidence intervals as well as the weighted Kappa (K) were used as measures of relative reliability. The Standard Error of Measurement (SEM) and Minimal Detectable Change (MDC) were used to measure absolute reliability. The results were also analyzed for a subgroup of 24 older people with dementia. The ICC reflected high relative reliability for the SPPB summary score and the 4 m walk test (4mwt), both for the total sample (ICC = 0.92, and 0.91 respectively)) and for the subgroup with dementia (ICC = 0.84 and 0.90 respectively). Furthermore, weighted Ks for the SPPB subscales were 0.64 for the chair stand, 0.80 for gait and 0.52 for balance for the total sample and almost identical for the subgroup with dementia. MDC-values at the 95% confidence intervals (MDC95) were calculated at 0.8 for the total score of SPPB and 0.39 m/s for the 4mwt in the total sample. For the subgroup with dementia MDC95 was 1.88 for the total score of SPPB and 0.28 m/s for 4mwt. The SPPB total score and the timed walking test showed overall high relative and absolute reliability for the total sample indicating that the Norwegian version of the SPPB is reliable when used by trained physiotherapists with older people. The reliability of the Norwegian SPPB in older people with dementia seems high, but due to a small sample size this needs further investigation.

  12. Reliability of agriculture universal joint shafts based on temperature measuring in universal joint bearing assemblies

    Directory of Open Access Journals (Sweden)

    Аleksandar Asonja

    2015-03-01

    Full Text Available This paper presents a research into reliability calculations of agriculture double universal joint shafts based on temperature measuring in cardan-type universal joint bearing assemblies. Special laboratory equipment was developed for this research which is presented in the paper. The objective of this research was to test the real life span of universal joint shafts in the laboratory and in field, to obtain the results which can be used to improve the reliability of universal joint shafts. If the presented research were used along with maintenance measures recommended in the paper and with proper use, the level of reliability of the shafts would be 2.1 times higher. The presented results of the research showed that needle bearings, i.e. bearing assemblies of the joints, are the most critical elements on universal joint shafts and are possible causes of their lower reliability. The second universal joint is the part with the lowest reliability in the observed technical system.

  13. Harmonization process and reliability assessment of anthropometric measurements in the elderly EXERNET multi-centre study.

    Directory of Open Access Journals (Sweden)

    Alba Gómez-Cabello

    Full Text Available BACKGROUND: The elderly EXERNET multi-centre study aims to collect normative anthropometric data for old functionally independent adults living in Spain. PURPOSE: To describe the standardization process and reliability of the anthropometric measurements carried out in the pilot study and during the final workshop, examining both intra- and inter-rater errors for measurements. MATERIALS AND METHODS: A total of 98 elderly from five different regions participated in the intra-rater error assessment, and 10 different seniors living in the city of Toledo (Spain participated in the inter-rater assessment. We examined both intra- and inter-rater errors for heights and circumferences. RESULTS: For height, intra-rater technical errors of measurement (TEMs were smaller than 0.25 cm. For circumferences and knee height, TEMs were smaller than 1 cm, except for waist circumference in the city of Cáceres. Reliability for heights and circumferences was greater than 98% in all cases. Inter-rater TEMs were 0.61 cm for height, 0.75 cm for knee-height and ranged between 2.70 and 3.09 cm for the circumferences measured. Inter-rater reliabilities for anthropometric measurements were always higher than 90%. CONCLUSION: The harmonization process, including the workshop and pilot study, guarantee the quality of the anthropometric measurements in the elderly EXERNET multi-centre study. High reliability and low TEM may be expected when assessing anthropometry in elderly population.

  14. Operator adaptation to changes in system reliability under adaptable automation.

    Science.gov (United States)

    Chavaillaz, Alain; Sauer, Juergen

    2017-09-01

    This experiment examined how operators coped with a change in system reliability between training and testing. Forty participants were trained for 3 h on a complex process control simulation modelling six levels of automation (LOA). In training, participants either experienced a high- (100%) or low-reliability system (50%). The impact of training experience on operator behaviour was examined during a 2.5 h testing session, in which participants either experienced a high- (100%) or low-reliability system (60%). The results showed that most operators did not often switch between LOA. Most chose an LOA that relieved them of most tasks but maintained their decision authority. Training experience did not have a strong impact on the outcome measures (e.g. performance, complacency). Low system reliability led to decreased performance and self-confidence. Furthermore, complacency was observed under high system reliability. Overall, the findings suggest benefits of adaptable automation because it accommodates different operator preferences for LOA. Practitioner Summary: The present research shows that operators can adapt to changes in system reliability between training and testing sessions. Furthermore, it provides evidence that each operator has his/her preferred automation level. Since this preference varies strongly between operators, adaptable automation seems to be suitable to accommodate these large differences.

  15. Validity and Reliability of Dynamic Visual Acuity (DVA) Measurement During Walking

    Science.gov (United States)

    Deshpande, Nandini; Peters, Brian T.; Bloomberg, Jacob J.

    2014-01-01

    DVA is primarily subserved by the vestibulo-ocular reflex mechanism. Individuals with vestibular hypofunction commonly experience highly debilitating illusory movement or blurring of visual images during daily activities possibly, due to impaired DVA. Even without pathologies, gradual age-related morphological deterioration is evident in all components of the vestibular system. We examined the construct validity to detect age-related differences and test-retest reliability of DVA measurements performed during walking. METHODS: Healthy adults were recruited into 3 groups: 1. young (20-39years, n=18), 2. middle-aged (40-59years, n=14), and 3. older adults (60-80years, n=15). Randomly selected seven participants from each group (n=21) participated in retesting. Participants were excluded if they had a history of vestibular or neuromuscular pathologies, dizziness/vertigo or >1 falls in the past year. Older persons with MMSE scores reliability. RESULTS: The three age groups were not different in their height, weight and normal walking speed (p>0.05). The post hoc analyses for DVA measurements demonstrated that each group was significantly different from the other two groups for Near as well as FarDVA (preliability. FarDVA at 0.8 m/s and 1.0 m/s demonstrated good test-retest reliability (ICCs 0.71 and 0.77, respectively).

  16. The Reliability of Isometer 2 Device in Measuring of Cervical Flexor and Extensor Muscles Strength

    Directory of Open Access Journals (Sweden)

    Asghar Reza Soltan-Zadeh

    2006-07-01

    Full Text Available Objective: The strength of a group of muscles can be measured by muscle strength test, employing a force measuring instrument. In order to monitor the effectiveness of a therapeutic or training programs we need a reliable technique which is also accurate in repeated measurements. The purpose of this study was to examine the reliability of an isometric neck muscle force measurement device.  Materials & Methods: Thirty seven healthy non athlete subjects (18 males and 19 females, aged 18-25 participated in this analytical study. The maximal isometric contractions of the neck extensor and flexor muscles were measured in different times and different days and by two different testers. A new sensitive “load cell” was applied to our previously designed neck muscle force measurement apparatus. Results: The results of the inter-trail, test retest, and inter rater reliability (0.86 < ICC < 0.98 , 2.2< Sw <5.1 N indicated that the neck muscle force measurements were highly repeatable and less variable between measurements. There were no statistically significant differences in neck muscle force measurements, between times, between days and between retsters. Maximum isometric contractions were significantly higher in males than in the females (p < 0.001. Women’s neck muscle strengths were 30.8% and 46.1% of men in cervical extension and cervical flexion. Conclusion: In this study we used a new model (Isometer 2 of our previous apparatus (Isometer. The isometric strength of neck flexor and extensor muscles which was measured by Isometer 2 appeared to be a reliable and useful method for measuring the force of the neck extensor and flexor muscles.

  17. Fatigue is a reliable, sensitive and unique outcome measure in rheumatoid arthritis.

    LENUS (Irish Health Repository)

    Minnock, Patricia

    2009-12-01

    Fatigue is an important symptom in patients with RA. Measurement of fatigue in clinical trials and in clinical practice requires scales that are reproducible, sensitive to change and practical. This study examined the reliability and sensitivity to change of fatigue and its relative independence as an outcome measure in RA.

  18. Using the confidence interval confidently.

    Science.gov (United States)

    Hazra, Avijit

    2017-10-01

    Biomedical research is seldom done with entire populations but rather with samples drawn from a population. Although we work with samples, our goal is to describe and draw inferences regarding the underlying population. It is possible to use a sample statistic and estimates of error in the sample to get a fair idea of the population parameter, not as a single value, but as a range of values. This range is the confidence interval (CI) which is estimated on the basis of a desired confidence level. Calculation of the CI of a sample statistic takes the general form: CI = Point estimate ± Margin of error, where the margin of error is given by the product of a critical value (z) derived from the standard normal curve and the standard error of point estimate. Calculation of the standard error varies depending on whether the sample statistic of interest is a mean, proportion, odds ratio (OR), and so on. The factors affecting the width of the CI include the desired confidence level, the sample size and the variability in the sample. Although the 95% CI is most often used in biomedical research, a CI can be calculated for any level of confidence. A 99% CI will be wider than 95% CI for the same sample. Conflict between clinical importance and statistical significance is an important issue in biomedical research. Clinical importance is best inferred by looking at the effect size, that is how much is the actual change or difference. However, statistical significance in terms of P only suggests whether there is any difference in probability terms. Use of the CI supplements the P value by providing an estimate of actual clinical effect. Of late, clinical trials are being designed specifically as superiority, non-inferiority or equivalence studies. The conclusions from these alternative trial designs are based on CI values rather than the P value from intergroup comparison.

  19. Reliability and validity of an internet-based questionnaire measuring lifetime physical activity.

    Science.gov (United States)

    De Vera, Mary A; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek

    2010-11-15

    Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005-2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity.

  20. The reliability of a severity rating scale to measure stuttering in an unfamiliar language.

    Science.gov (United States)

    Hoffman, Laura; Wilson, Linda; Copley, Anna; Hewat, Sally; Lim, Valerie

    2014-06-01

    With increasing multiculturalism, speech-language pathologists (SLPs) are likely to work with stuttering clients from linguistic backgrounds that differ from their own. No research to date has estimated SLPs' reliability when measuring severity of stuttering in an unfamiliar language. Therefore, this study was undertaken to estimate the reliability of SLPs' use of a 9-point severity rating (SR) scale, to measure severity of stuttering in a language that was different from their own. Twenty-six Australian SLPs rated 20 speech samples (10 Australian English [AE] and 10 Mandarin) of adults who stutter using a 9-point SR scale on two separate occasions. Judges showed poor agreement when using the scale to measure stuttering in Mandarin samples. Results also indicated that 50% of individual judges were unable to reliably measure the severity of stuttering in AE. The results highlight the need for (a) SLPs to develop intra- and inter-judge agreement when using the 9-point SR scale to measure severity of stuttering in their native language (in this case AE) and in unfamiliar languages; and (b) research into the development and evaluation of practice and/or training packages to assist SLPs to do so.

  1. Relating measurement invariance, cross-level invariance, and multilevel reliability

    OpenAIRE

    Jak, S.; Jorgensen, T.D.

    2017-01-01

    Data often have a nested, multilevel structure, for example when data are collected from children in classrooms. This kind of data complicate the evaluation of reliability and measurement invariance, because several properties can be evaluated at both the individual level and the cluster level, as well as across levels. For example, cross-level invariance implies equal factor loadings across levels, which is needed to give latent variables at the two levels a similar interpretation. Reliabili...

  2. Animal Spirits and Extreme Confidence: No Guts, No Glory?

    NARCIS (Netherlands)

    M.G. Douwens-Zonneveld (Mariska)

    2012-01-01

    textabstractThis study investigates to what extent extreme confidence of either management or security analysts may impact financial or operating performance. We construct a multidimensional degree of company confidence measure from a wide range of corporate decisions. We empirically test this

  3. Reliable intraocular pressure measurement using automated radio-wave telemetry

    Directory of Open Access Journals (Sweden)

    Paschalis EI

    2014-01-01

    Full Text Available Eleftherios I Paschalis,* Fabiano Cade,* Samir Melki, Louis R Pasquale, Claes H Dohlman, Joseph B CiolinoMassachusetts Eye and Ear Infirmary, Harvard Medical School, Boston, MA, USA*These authors contributed equally to this workPurpose: To present an autonomous intraocular pressure (IOP measurement technique using a wireless implantable transducer (WIT and a motion sensor.Methods: The WIT optical aid was implanted within the ciliary sulcus of a normotensive rabbit eye after extracapsular clear lens extraction. An autonomous wireless data system (AWDS comprising of a WIT and an external antenna aided by a motion sensor provided continuous IOP readings. The sensitivity of the technique was determined by the ability to detect IOP changes resulting from the administration of latanoprost 0.005% or dorzolamide 2%, while the reliability was determined by the agreement between baseline and vehicle (saline IOP.Results: On average, 12 diurnal and 205 nocturnal IOP measurements were performed with latanoprost, and 26 diurnal and 205 nocturnal measurements with dorzolamide. No difference was found between mean baseline IOP (13.08±2.2 mmHg and mean vehicle IOP (13.27±2.1 mmHg (P=0.45, suggesting good measurement reliability. Both antiglaucoma medications caused significant IOP reduction compared to baseline; latanoprost reduced mean IOP by 10% (1.3±3.54 mmHg; P<0.001, and dorzolamide by 5% (0.62±2.22 mmHg; P<0.001. Use of latanoprost resulted in an overall twofold higher IOP reduction compared to dorzolamide (P<0.001. Repeatability was ±1.8 mmHg, assessed by the variability of consecutive IOP measurements performed in a short period of time (≤1 minute, during which the IOP is not expected to change.Conclusion: IOP measurements in conscious rabbits obtained without the need for human interactions using the AWDS are feasible and provide reproducible results.Keywords: IOP, pressure transducer, wireless, MEMS, implant, intraocular

  4. Reliability, validity, and minimal detectable change of the push-off test scores in assessing upper extremity weight-bearing ability.

    Science.gov (United States)

    Mehta, Saurabh P; George, Hannah R; Goering, Christian A; Shafer, Danielle R; Koester, Alan; Novotny, Steven

    2017-11-01

    Clinical measurement study. The push-off test (POT) was recently conceived and found to be reliable and valid for assessing weight bearing through injured wrist or elbow. However, further research with larger sample can lend credence to the preliminary findings supporting the use of the POT. This study examined the interrater reliability, construct validity, and measurement error for the POT in patients with wrist conditions. Participants with musculoskeletal (MSK) wrist conditions were recruited. The performance on the POT, grip isometric strength of wrist extensors was assessed. The shortened version of the Disabilities of the Arm, Shoulder and Hand and numeric pain rating scale were completed. The intraclass correlation coefficient assessed interrater reliability of the POT. Pearson correlation coefficients (r) examined the concurrent relationships between the POT and other measures. The standard error of measurement and the minimal detectable change at 90% confidence interval were assessed as measurement error and index of true change for the POT. A total of 50 participants with different elbow or wrist conditions (age: 48.1 ± 16.6 years) were included in this study. The results of this study strongly supported the interrater reliability (intraclass correlation coefficient: 0.96 and 0.93 for the affected and unaffected sides, respectively) of the POT in patients with wrist MSK conditions. The POT showed convergent relationships with the grip strength on the injured side (r = 0.89) and the wrist extensor strength (r = 0.7). The POT showed smaller standard error of measurement (1.9 kg). The minimal detectable change at 90% confidence interval for the POT was 4.4 kg for the sample. This study provides additional evidence to support the reliability and validity of the POT. This is the first study that provides the values for the measurement error and true change on the POT scores in patients with wrist MSK conditions. Further research should examine the

  5. The Reliability and Validity of Discrete and Continuous Measures of Psychopathology: A Quantitative Review

    Science.gov (United States)

    Markon, Kristian E.; Chmielewski, Michael; Miller, Christopher J.

    2011-01-01

    In 2 meta-analyses involving 58 studies and 59,575 participants, we quantitatively summarized the relative reliability and validity of continuous (i.e., dimensional) and discrete (i.e., categorical) measures of psychopathology. Overall, results suggest an expected 15% increase in reliability and 37% increase in validity through adoption of a…

  6. Strength Measurements in Acute Hamstring Injuries: Intertester Reliability and Prognostic Value of Handheld Dynamometry.

    Science.gov (United States)

    Reurink, Gustaaf; Goudswaard, Gert Jan; Moen, Maarten H; Tol, Johannes L; Verhaar, Jan A N; Weir, Adam

    2016-08-01

    Study Design Cohort study, repeated measures. Background Although hamstring strength measurements are used for assessing prognosis and monitoring recovery after hamstring injury, their actual clinical relevance has not been established. Handheld dynamometry (HHD) is a commonly used method of measuring muscle strength. The reliability of HHD has not been determined in athletes with acute hamstring injuries. Objectives To determine the intertester reliability and the prognostic value of hamstring HHD strength measurement in acute hamstring injuries. Methods We measured knee flexion strength with HHD in 75 athletes at 2 visits, at baseline (within 5 days of hamstring injury) and follow-up (5 to 7 days after the baseline measurement). We assessed isometric hamstring strength in 15° and 90° of knee flexion. Reliability analysis testing was performed by 2 testers independently at the follow-up visit. We recorded the time needed to return to play (RTP) up to 6 months following baseline. Results The intraclass correlation coefficients of the strength measurements in injured hamstrings were between 0.75 and 0.83. There was a statistically significant but weak correlation between the time to RTP and the strength deficit at 15° of knee flexion measured at baseline (Spearman r = 0.25, P = .045) and at the follow-up visit (Spearman r = 0.26, P = .034). Up to 7% of the variance in time to RTP is explained by this strength deficit. None of the other strength variables were significantly correlated with time to RTP. Conclusion Hamstring strength can be reliably measured with HHD in athletes with acute hamstring injuries. The prognostic value of strength measurements is limited, as there is only a weak association between the time to RTP and hamstring strength deficit after acute injury. Level of Evidence Prognosis, level 4. J Orthop Sports Phys Ther 2016;46(8):689-696. Epub 12 May 2016. doi:10.2519/jospt.2016.6363.

  7. Reliability and Validity of the Turkish Version of the Voice-Related Quality of Life Measure.

    Science.gov (United States)

    Tezcaner, Zahide Çiler; Aksoy, Songül

    2017-03-01

    This study aims to test the validity and reliability of the Turkish version of the Voice-Related Quality of Life (V-RQOL) questionnaire. This is a nonrandomized, prospective study with control group. The questionnaire was administered to 249 individuals-130 with vocal complaint and 119 without-with a mean age of 37.8 ± 12.3 years. The Turkish version of the Voice Handicap Index (VHI) and perceptual voice evaluation measures were also administered at 2-14 days for retest reliability. The instrument was submitted to validity and reliability evaluation. The V-RQOL measure showed a strong internal consistency and test-retest reliability; the Cronbach's alpha coefficient for the overall V-RQOL was 0.969, the physical functioning domain was 0.949, and the social-emotional domain was 0.940. In the test-retest reliability test, the overall V-RQOL was found to be 0.989. The construct validity of the V-RQOL was determined based on the strength and direction of its relation to the VHI and the perceptual voice evaluation measure. The higher the VHI level, the lower the physical functioning, social-emotional, and overall score levels of the V-RQOL (r = -0.927, r = -0.912, r = -0.944, respectively; P reliability and validity and may play a crucial role in evaluating Turkish-speaking patients with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.

  8. Reliability of BOD POD Measurements Remains High After a Short-Duration Low-Carbohydrate Diet.

    Science.gov (United States)

    Greer, Beau Kjerulf; Edsall, Kathleen M; Greer, Anna E

    2016-04-01

    The purpose of the current study was to determine whether expected changes in body weight via a 3-day low-carbohydrate (LC) diet will disrupt the reliability of air displacement plethysmography measurements via BOD POD. Twenty-four subjects recorded their typical diets for 3 days before BOD POD and 7-site skinfold analyses. Subjects were matched for lean body mass and divided into low-CHO (LC) and control (CON) groups. The LC group was given instruction intended to prevent more than 50 grams/day of carbohydrate consumption for 3 consecutive days, and the CON group replicated their previously recorded diet. Body composition measurements were repeated after dietary intervention. Test-retest reliability measures were significant (p fat percentage in both the LC and the CON groups (rs = .993 and .965, respectively). Likewise, skinfold analysis for body fat percentage reliability was high in both groups (rs = .996 and .997, respectively). There were significant differences between 1st and 2nd BOD POD measurements for body mass (72.9 ± 13.3 vs. 72.1 ± 13.0 kg [M ± SD]) and body volume (69.0 ± 12.7-68.1 ± 12.2 L) in the LC group (p .05) in BOD POD-determined body fat percentage, lean body mass, or fat mass between the 1st and 2nd trial in either group. Body composition measures via BOD POD and 7-site skinfolds remain reliable after 3 days of an LC diet despite significant decreases in body mass.

  9. Reliability and validity of the Pragmatics Observational Measure (POM): a new observational measure of pragmatic language for children.

    Science.gov (United States)

    Cordier, Reinie; Munro, Natalie; Wilkes-Gillan, Sarah; Speyer, Renée; Pearce, Wendy M

    2014-07-01

    There is a need for a reliable and valid assessment of childhood pragmatic language skills during peer-peer interactions. This study aimed to evaluate the psychometric properties of a newly developed pragmatic assessment, the Pragmatic Observational Measure (POM). The psychometric properties of the POM were investigated from observational data of two studies - study 1 involved 342 children aged 5-11 years (108 children with ADHD; 108 typically developing playmates; 126 children in the control group), and study 2 involved 9 children with ADHD who attended a 7-week play-based intervention. The psychometric properties of the POM were determined based on the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) taxonomy of psychometric properties and definitions for health-related outcomes; the Pragmatic Protocol was used as the reference tool against which the POM was evaluated. The POM demonstrated sound psychometric properties in all the reliability, validity and interpretability criteria against which it was assessed. The findings showed that the POM is a reliable and valid measure of pragmatic language skills of children with ADHD between the age of 5 and 11 years and has clinical utility in identifying children with pragmatic language difficulty. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.

  10. A Reliability Test of a Complex System Based on Empirical Likelihood

    OpenAIRE

    Zhou, Yan; Fu, Liya; Zhang, Jun; Hui, Yongchang

    2016-01-01

    To analyze the reliability of a complex system described by minimal paths, an empirical likelihood method is proposed to solve the reliability test problem when the subsystem distributions are unknown. Furthermore, we provide a reliability test statistic of the complex system and extract the limit distribution of the test statistic. Therefore, we can obtain the confidence interval for reliability and make statistical inferences. The simulation studies also demonstrate the theorem results.

  11. Reliability, reference values and predictor variables of the ulnar sensory nerve in disease free adults.

    Science.gov (United States)

    Ruediger, T M; Allison, S C; Moore, J M; Wainner, R S

    2014-09-01

    The purposes of this descriptive and exploratory study were to examine electrophysiological measures of ulnar sensory nerve function in disease free adults to determine reliability, determine reference values computed with appropriate statistical methods, and examine predictive ability of anthropometric variables. Antidromic sensory nerve conduction studies of the ulnar nerve using surface electrodes were performed on 100 volunteers. Reference values were computed from optimally transformed data. Reliability was computed from 30 subjects. Multiple linear regression models were constructed from four predictor variables. Reliability was greater than 0.85 for all paired measures. Responses were elicited in all subjects; reference values for sensory nerve action potential (SNAP) amplitude from above elbow stimulation are 3.3 μV and decrement across-elbow less than 46%. No single predictor variable accounted for more than 15% of the variance in the response. Electrophysiologic measures of the ulnar sensory nerve are reliable. Absent SNAP responses are inconsistent with disease free individuals. Reference values recommended in this report are based on appropriate transformations of non-normally distributed data. No strong statistical model of prediction could be derived from the limited set of predictor variables. Reliability analyses combined with relatively low level of measurement error suggest that ulnar sensory reference values may be used with confidence. Copyright © 2014 Elsevier Masson SAS. All rights reserved.

  12. Reliability of lower limb alignment measures using an established landmark-based method with a customized computer software program

    Science.gov (United States)

    Sled, Elizabeth A.; Sheehy, Lisa M.; Felson, David T.; Costigan, Patrick A.; Lam, Miu; Cooke, T. Derek V.

    2010-01-01

    The objective of the study was to evaluate the reliability of frontal plane lower limb alignment measures using a landmark-based method by (1) comparing inter- and intra-reader reliability between measurements of alignment obtained manually with those using a computer program, and (2) determining inter- and intra-reader reliability of computer-assisted alignment measures from full-limb radiographs. An established method for measuring alignment was used, involving selection of 10 femoral and tibial bone landmarks. 1) To compare manual and computer methods, we used digital images and matching paper copies of five alignment patterns simulating healthy and malaligned limbs drawn using AutoCAD. Seven readers were trained in each system. Paper copies were measured manually and repeat measurements were performed daily for 3 days, followed by a similar routine with the digital images using the computer. 2) To examine the reliability of computer-assisted measures from full-limb radiographs, 100 images (200 limbs) were selected as a random sample from 1,500 full-limb digital radiographs which were part of the Multicenter Osteoarthritis (MOST) Study. Three trained readers used the software program to measure alignment twice from the batch of 100 images, with two or more weeks between batch handling. Manual and computer measures of alignment showed excellent agreement (intraclass correlations [ICCs] 0.977 – 0.999 for computer analysis; 0.820 – 0.995 for manual measures). The computer program applied to full-limb radiographs produced alignment measurements with high inter- and intra-reader reliability (ICCs 0.839 – 0.998). In conclusion, alignment measures using a bone landmark-based approach and a computer program were highly reliable between multiple readers. PMID:19882339

  13. Reliability of a single objective measure in assessing sleepiness.

    Science.gov (United States)

    Sunwoo, Bernie Y; Jackson, Nicholas; Maislin, Greg; Gurubhagavatula, Indira; George, Charles F; Pack, Allan I

    2012-01-01

    To evaluate reliability of single objective tests in assessing sleepiness. Subjects who completed polysomnography underwent a 4-nap multiple sleep latency test (MSLT) the following day. Prior to each nap opportunity on MSLT, subjects performed the psychomotor vigilance test (PVT) and divided attention driving task (DADT). Results of single versus multiple test administrations were compared using the intraclass correlation coefficient (ICC) and adjusted for test administration order effects to explore time of day effects. Measures were explored as continuous and binary (i.e., impaired or not impaired). Community-based sample evaluated at a tertiary, university-based sleep center. 372 adult commercial vehicle operators oversampled for increased obstructive sleep apnea risk. N/A. AS CONTINUOUS MEASURES, ICC WERE AS FOLLOWS: MSLT 0.45, PVT median response time 0.69, PVT number of lapses 0.51, 10-min DADT tracking error 0.87, 20-min DADT tracking error 0.90. Based on binary outcomes, ICC were: MSLT 0.63, PVT number of lapses 0.85, 10-min DADT 0.95, 20-min DADT 0.96. Statistically significant time of day effects were seen in both the MSLT and PVT but not the DADT. Correlation between ESS and different objective tests was strongest for MSLT, range [-0.270 to -0.195] and persisted across all time points. Single DADT and PVT administrations are reliable measures of sleepiness. A single MSLT administration can reasonably discriminate individuals with MSL < 8 minutes. These results support the use of a single administration of some objective tests of sleepiness when performed under controlled conditions in routine clinical care.

  14. Inter-rater reliability of kinesthetic measurements with the KINARM robotic exoskeleton.

    Science.gov (United States)

    Semrau, Jennifer A; Herter, Troy M; Scott, Stephen H; Dukelow, Sean P

    2017-05-22

    Kinesthesia (sense of limb movement) has been extremely difficult to measure objectively, especially in individuals who have survived a stroke. The development of valid and reliable measurements for proprioception is important to developing a better understanding of proprioceptive impairments after stroke and their impact on the ability to perform daily activities. We recently developed a robotic task to evaluate kinesthetic deficits after stroke and found that the majority (~60%) of stroke survivors exhibit significant deficits in kinesthesia within the first 10 days post-stroke. Here we aim to determine the inter-rater reliability of this robotic kinesthetic matching task. Twenty-five neurologically intact control subjects and 15 individuals with first-time stroke were evaluated on a robotic kinesthetic matching task (KIN). Subjects sat in a robotic exoskeleton with their arms supported against gravity. In the KIN task, the robot moved the subjects' stroke-affected arm at a preset speed, direction and distance. As soon as subjects felt the robot begin to move their affected arm, they matched the robot movement with the unaffected arm. Subjects were tested in two sessions on the KIN task: initial session and then a second session (within an average of 18.2 ± 13.8 h of the initial session for stroke subjects), which were supervised by different technicians. The task was performed both with and without the use of vision in both sessions. We evaluated intra-class correlations of spatial and temporal parameters derived from the KIN task to determine the reliability of the robotic task. We evaluated 8 spatial and temporal parameters that quantify kinesthetic behavior. We found that the parameters exhibited moderate to high intra-class correlations between the initial and retest conditions (Range, r-value = [0.53-0.97]). The robotic KIN task exhibited good inter-rater reliability. This validates the KIN task as a reliable, objective method for quantifying

  15. Assessing Confidence in Performance Assessments Using an Evidence Support Logic Methodology: An Application of Tesla

    International Nuclear Information System (INIS)

    Egan, M.; Paulley, A.; Lehman, L.; Lowe, J.; Rochette, E.; Baker, St.

    2009-01-01

    The assessment of uncertainties and their implications is a key requirement when undertaking performance assessment (PA) of radioactive waste facilities. Decisions based on the outcome of such assessments become translated into judgments about confidence in the information they provide. This confidence, in turn, depends on uncertainties in the underlying evidence. Even if there is a large amount of information supporting an assessment, it may be only partially relevant, incomplete or less than completely reliable. In order to develop a measure of confidence in the outcome, sources of uncertainty need to be identified and adequately addressed in the development of the PA, or in any overarching strategic decision-making processes. This paper describes a trial application of the technique of Evidence Support Logic (ESL), which has been designed for application in support of 'high stakes' decisions, where important aspects of system performance are subject to uncertainty. The aims of ESL are to identify the amount of uncertainty or conflict associated with evidence relating to a particular decision, and to guide understanding of how evidence combines to support confidence in judgments. Elicitation techniques are used to enable participants in the process to develop a logical hypothesis model that best represents the relationships between different sources of evidence to the proposition under examination. The aim is to identify key areas of subjectivity and other sources of potential bias in the use of evidence (whether for or against the proposition) to support judgments of confidence. Propagation algorithms are used to investigate the overall implications of the logic according to the strength of the underlying evidence and associated uncertainties. (authors)

  16. Computational area measurement of orbital floor fractures: Reliability, accuracy and rapidity

    International Nuclear Information System (INIS)

    Schouman, Thomas; Courvoisier, Delphine S.; Imholz, Benoit; Van Issum, Christopher; Scolozzi, Paolo

    2012-01-01

    Objective: To evaluate the reliability, accuracy and rapidity of a specific computational method for assessing the orbital floor fracture area on a CT scan. Method: A computer assessment of the area of the fracture, as well as that of the total orbital floor, was determined on CT scans taken from ten patients. The ratio of the fracture's area to the orbital floor area was also calculated. The test–retest precision of measurement calculations was estimated using the Intraclass Correlation Coefficient (ICC) and Dahlberg's formula to assess the agreement across observers and across measures. The time needed for the complete assessment was also evaluated. Results: The Intraclass Correlation Coefficient across observers was 0.92 [0.85;0.96], and the precision of the measures across observers was 4.9%, according to Dahlberg's formula .The mean time needed to make one measurement was 2 min and 39 s (range, 1 min and 32 s to 4 min and 37 s). Conclusion: This study demonstrated that (1) the area of the orbital floor fracture can be rapidly and reliably assessed by using a specific computer system directly on CT scan images; (2) this method has the potential of being routinely used to standardize the post-traumatic evaluation of orbital fractures

  17. Reliability of one-repetition maximum performance in people with chronic heart failure.

    Science.gov (United States)

    Ellis, Rachel; Holland, Anne E; Dodd, Karen; Shields, Nora

    2018-02-24

    Evaluate intra-rater and inter-rater reliability of the one-repetition maximum strength test in people with chronic heart failure. Intra-rater and inter-rater reliability study. A public tertiary hospital in northern metropolitan Melbourne. Twenty-four participants (nine female, mean age 71.8 ± 13.1 years) with mild to moderate heart failure of any aetiology. Lower limb strength was assessed by determining the maximum weight that could be lifted using a leg press. Intra-rater reliability was tested by one assessor on two separate occasions . Inter-rater reliability was tested by two assessors in random order. Intra-class correlation coefficients and 95% confidence intervals were calculated. Bland and Altman analyses were also conducted, including calculation of mean differences between measures ([Formula: see text]) and limits of agreement . Ten intra-rater and 21 inter-rater assessments were completed. Excellent intra-rater (intra-class correlation coefficient 2,1 0.96) and inter-rater (intra-class correlation coefficient 2,1 0.93) reliability was found. Intra-rater assessment showed less variability (mean difference 4.5 kg, limits of agreement -8.11 to 17.11 kg) than inter-rater agreement (mean difference -3.81 kg, limits of agreement -23.39 to 15.77 kg). One-repetition maximum determined using a leg press is a reliable measure in people with heart failure. Given its smaller limits of agreement, intra-rater testing is recommended. Implications for Rehabilitation Using a leg press to determine a one-repetition maximum we were able to demonstrate excellent inter-rater and intra-rater reliability using an intra-class correlation coefficient. The Bland and Altman levels of agreement were wide for inter-rater reliability and so we recommend using one assessor if measuring change in strength within an individual over time.

  18. Reliability of measuring pelvic floor elevation with a diagnostic ultrasonic imaging device

    OpenAIRE

    Ubukata, Hitomi; Maruyama, Hitoshi; Huo, Ming

    2015-01-01

    [Purpose] The purpose of this study was to investigate the reliability of measuring the amount of pelvic floor elevation during pelvic and abdominal muscle contraction with a diagnostic ultrasonic imaging device. [Subjects] The study group comprised 11 healthy women without urinary incontinence or previous birth experience. [Methods] We measured the displacement elevation of the bladder base during contraction of the abdominal and pelvic floor muscles was measured using a diagnostic ultrasoni...

  19. Lifetime Estimation of Electrolytic Capacitors in Fuel Cell Power Converter at Various Confidence Levels

    DEFF Research Database (Denmark)

    Zhou, Dao; Wang, Huai; Blaabjerg, Frede

    2016-01-01

    DC capacitors in power electronic converters are a major constraint on improvement of the power density and the reliability. In this paper, according to the degradation data of tested capacitors, the lifetime model of the component is analyzed at various confidence levels. Then, the mission profile...... based lifetime expectancy of the individual capacitor and the capacitor bank is estimated in a fuel cell backup power converter operating in both standby mode and operation mode. The lifetime prediction of the capacitor banks at different confidence levels is also obtained....

  20. Reliability of pelvic floor measurements on three- and four-dimensional ultrasound during and after first pregnancy: implications for training.

    Science.gov (United States)

    van Veelen, G A; Schweitzer, K J; van der Vaart, C H

    2013-11-01

    To evaluate the reliability of measurements of the levator hiatus and levator-urethra gap (LUG) using three/four-dimensional (3D/4D) transperineal ultrasound in women during their first pregnancy and 6 months postpartum, and to assess the learning process for these measurements. An inexperienced observer was taught to perform measurements of the levator hiatus and LUG by an experienced observer. After training, 3D/4D ultrasound volume datasets of 40 women in the first trimester were analyzed by these two observers. Another training session then took place and both observers repeated the analyses of the same volume datasets. Finally, analyses of 40 volume datasets of the women 6 months postpartum were performed by both observers. Intra- and interobserver reliability were determined by intraclass correlation coefficients (ICC) with 95% CIs. For levator hiatal measurements, in the women during their first pregnancy the interobserver reliability was substantial to almost perfect after both the first and second training session (ICC, 0.62-0.83 and 0.71-0.89, respectively, for anteroposterior diameter, transverse diameter and area at rest, on contraction and on Valsalva) and the intraobserver reliability was substantial to almost perfect for both observers. For these measurements performed once the women had delivered, interobserver reliability was moderate to almost perfect. For LUG measurements performed during pregnancy, interobserver reliability was slight to moderate after the first training session (ICC, 0.14-0.54), but improved after the second training session (ICC, 0.38-0.71), and intraobserver reliability was moderate to substantial for the experienced observer and slight to moderate for the inexperienced observer. For these measurements performed when the women had delivered, interobserver reliability was fair to moderate. The levator hiatus and LUG can be measured reliably using 3D/4D ultrasound in primigravid and primiparous women. The technique to measure

  1. Reliability measures for indexed semi-Markov chains applied to wind energy production

    International Nuclear Information System (INIS)

    D'Amico, Guglielmo; Petroni, Filippo; Prattico, Flavio

    2015-01-01

    The computation of the dependability measures is a crucial point in many engineering problems as well as in the planning and development of a wind farm. In this paper we address the issue of energy production by wind turbines by using an indexed semi-Markov chain as a model of wind speed. We present the mathematical model, the data and technical characteristics of a commercial wind turbine (Aircon HAWT-10kW). We show how to compute some of the main dependability measures such as reliability, availability and maintainability functions. We compare the results of the model with real energy production obtained from data available in the Lastem station (Italy) and sampled every 10 min. - Highlights: • Semi-Markov models. • Time series generation of wind speed. • Computation of availability, reliability and maintainability.

  2. Cyber Physical Systems for User Reliability Measurements in a Sharing Economy Environment.

    Science.gov (United States)

    Seo, Aria; Jeong, Junho; Kim, Yeichang

    2017-08-13

    As the sharing economic market grows, the number of users is also increasing but many problems arise in terms of reliability between providers and users in the processing of services. The existing methods provide shared economic systems that judge the reliability of the provider from the viewpoint of the user. In this paper, we have developed a system for establishing mutual trust between providers and users in a shared economic environment to solve existing problems. In order to implement a system that can measure and control users' situation in a shared economic environment, we analyzed the necessary factors in a cyber physical system (CPS). In addition, a user measurement system based on a CPS structure in a sharing economic environment is implemented through analysis of the factors to consider when constructing a CPS.

  3. Reliability of capturing foot parameters using digital scanning and the neutral suspension casting technique

    Science.gov (United States)

    2011-01-01

    Background A clinical study was conducted to determine the intra and inter-rater reliability of digital scanning and the neutral suspension casting technique to measure six foot parameters. The neutral suspension casting technique is a commonly utilised method for obtaining a negative impression of the foot prior to orthotic fabrication. Digital scanning offers an alternative to the traditional plaster of Paris techniques. Methods Twenty one healthy participants volunteered to take part in the study. Six casts and six digital scans were obtained from each participant by two raters of differing clinical experience. The foot parameters chosen for investigation were cast length (mm), forefoot width (mm), rearfoot width (mm), medial arch height (mm), lateral arch height (mm) and forefoot to rearfoot alignment (degrees). Intraclass correlation coefficients (ICC) with 95% confidence intervals (CI) were calculated to determine the intra and inter-rater reliability. Measurement error was assessed through the calculation of the standard error of the measurement (SEM) and smallest real difference (SRD). Results ICC values for all foot parameters using digital scanning ranged between 0.81-0.99 for both intra and inter-rater reliability. For neutral suspension casting technique inter-rater reliability values ranged from 0.57-0.99 and intra-rater reliability values ranging from 0.36-0.99 for rater 1 and 0.49-0.99 for rater 2. Conclusions The findings of this study indicate that digital scanning is a reliable technique, irrespective of clinical experience, with reduced measurement variability in all foot parameters investigated when compared to neutral suspension casting. PMID:21375757

  4. Effects of postidentification feedback on eyewitness identification and nonidentification confidence.

    Science.gov (United States)

    Semmler, Carolyn; Brewer, Neil; Wells, Gary L

    2004-04-01

    Two experiments investigated new dimensions of the effect of confirming feedback on eyewitness identification confidence using target-absent and target-present lineups and (previously unused) unbiased witness instructions (i.e., "offender not present" option highlighted). In Experiment 1, participants viewed a crime video and were later asked to try to identify the thief from an 8-person target-absent photo array. Feedback inflated witness confidence for both mistaken identifications and correct lineup rejections. With target-present lineups in Experiment 2, feedback inflated confidence for correct and mistaken identifications and lineup rejections. Although feedback had no influence on the confidence-accuracy correlation, it produced clear overconfidence. Confidence inflation varied with the confidence measure reference point (i.e., retrospective vs. current confidence) and identification response latency.

  5. Confidant Relations in Italy

    Directory of Open Access Journals (Sweden)

    Jenny Isaacs

    2015-02-01

    Full Text Available Confidants are often described as the individuals with whom we choose to disclose personal, intimate matters. The presence of a confidant is associated with both mental and physical health benefits. In this study, 135 Italian adults responded to a structured questionnaire that asked if they had a confidant, and if so, to describe various features of the relationship. The vast majority of participants (91% reported the presence of a confidant and regarded this relationship as personally important, high in mutuality and trust, and involving minimal lying. Confidants were significantly more likely to be of the opposite sex. Participants overall were significantly more likely to choose a spouse or other family member as their confidant, rather than someone outside of the family network. Familial confidants were generally seen as closer, and of greater value, than non-familial confidants. These findings are discussed within the context of Italian culture.

  6. Study on highly reliable digital communication technology of reactor nuclear measuring equipment

    International Nuclear Information System (INIS)

    Gu Pengfei; Huang Xiaojin

    2007-01-01

    To meet the need of highly reliable of reactor nuclear measuring equipment, in allusion to the idiographic request of nuclear measuring equipment, the actual technical development and the application in industrial field, we design a kind of redundancy communication net based on PROFIBUS, and a kind of communication interface module based on redundancy PROFIBUS communication, which link the nuclear measuring equipment and PROFIBUS communication net, and also lay a foundation for advanced research. (authors)

  7. Reliability and validity in measurement of true humeral retroversion by a three-dimensional cylinder fitting method.

    Science.gov (United States)

    Saka, Masayuki; Yamauchi, Hiroki; Hoshi, Kenji; Yoshioka, Toru; Hamada, Hidetoshi; Gamada, Kazuyoshi

    2015-05-01

    Humeral retroversion is defined as the orientation of the humeral head relative to the distal humerus. Because none of the previous methods used to measure humeral retroversion strictly follow this definition, values obtained by these techniques vary and may be biased by morphologic variations of the humerus. The purpose of this study was 2-fold: to validate a method to define the axis of the distal humerus with a virtual cylinder and to establish the reliability of 3-dimensional (3D) measurement of humeral retroversion by this cylinder fitting method. Humeral retroversion in 14 baseball players (28 humeri) was measured by the 3D cylinder fitting method. The root mean square error was calculated to compare values obtained by a single tester and by 2 different testers using the embedded coordinate system. To establish the reliability, intraclass correlation coefficient (ICC) and precision (standard error of measurement [SEM]) were calculated. The root mean square errors for the humeral coordinate system were reliability and precision of the 3D measurement of retroversion yielded an intratester ICC of 0.99 (SEM, 1.0°) and intertester ICC of 0.96 (SEM, 2.8°). The error in measurements obtained by a distal humerus cylinder fitting method was small enough not to affect retroversion measurement. The 3D measurement of retroversion by this method provides excellent intratester and intertester reliability. Copyright © 2015 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.

  8. Reliability and reference values of two clinical measurements of dynamic and static knee position in healthy children

    DEFF Research Database (Denmark)

    Ortqvist, Maria; Moström, Eva B; Roos, Ewa M.

    2011-01-01

    PURPOSE: The purposes of this study were to evaluate reliability of the Single-limb mini squat test (a dynamic measure of medio-lateral knee position) and the Quadriceps-angle (Q-angle) (a static measure of medio-lateral knee position), present paediatric reference values of the Q......-angle measurements was found. Reference values for the Q-angle (mean 13.5° (1.9)-15.3° (2.8)) varies with age and gender. No associations were found between dynamic and static measures. CONCLUSIONS: The Single-limb mini squat test showed a moderate reliability and the Q-angle showed a fair to moderate reliability......-angle, and evaluate the association between the tests. METHODS: Two hundred and forty-six healthy children (9-16 years) were included (intra/inter-rater reliability for Q-angle (n = 37/85) and for Single-limb mini squat test (n = 33/28)). Dynamic medio-lateral knee position was assessed by the Single-limb mini squat...

  9. Reliability of widefield capillary microscopy to measure nailfold capillary density in systemic sclerosis.

    Science.gov (United States)

    Hudson, M; Masetto, A; Steele, R; Arthurs, E; Baron, M

    2010-01-01

    To determine intra- and inter-observer reliability of widefield microscopy to measure nailfold capillary density in patients with systemic sclerosis (SSc). Five SSc patients were examined with a STEMV-8 Zeiss biomicroscope with 50x magnification. The nailfold of the second, third, fourth and fifth fingers of both hands of each patient were photographed twice by each of two observers, once in the morning and again in the afternoon (total of 32 pictures). Two raters reviewed the photographs to produce capillary density readings. Intra- and inter-rater reliability of the readings were computed using intra-class correlations (ICC). Additional analyses were undertaken to determine the impact of other sources of variability in the data, namely patient, finger, technician and time. Intra-and inter-rater reliability were substantial (ICC 0.72-0.84) when raters were reading the same photographs or photographs taken at the same time of day. Agreement was only fair between morning and afternoon density readings (ICC 0.30-0.37). Patients, individual fingers and technician accounted for a large part of the variability in the data (combined variance component of 7.69 out of the total 12.23). The coefficient of variation of widefield microscopy was 24%. Although intra- and inter-rater reliability of nailfold capillary density measurements using widefield microscopy are good, proper standardisation of the conditions under which capillaroscopy is done and better imaging of nailfold capillary abnormalities should be considered if nailfold capillary density is to be used as an outcome measure in multi-centre clinical trials in SSc.

  10. Determining Reliability and Validity of the Persian Version of Software Usability Measurements Inventory (SUMI) Questionnaire

    OpenAIRE

    seyed abolfazl zakerian; Roya Azizi; Mehdi Rahgozar

    2013-01-01

    The term usability refers to a special index for success of an operating system. This study aimed to determine the reliability and validity of the Software Usability Measurements Inventory (SUMI) questionnaire as one of the valid and common questionnaires about usability evaluation. The back translation method was used to translate the questionnaire from English to Persian back to English. Moreover, repeatability or test-retest reliability was practically used to determine the reliability of ...

  11. How “consistent” is “consistent”? A clinician-based assessment of the reliability of expressions used by radiologists to communicate diagnostic confidence

    International Nuclear Information System (INIS)

    Rosenkrantz, A.B.; Kiritsy, M.; Kim, S.

    2014-01-01

    Aim: To evaluate the degree of variability in clinicians' interpretation of expressions used by radiologists to communicate their level of diagnostic confidence within radiological reports. Materials and methods: Clinicians were solicited to complete a prospective survey asking them to select the approximate perceived level of certainty, expressed as a percentage, associated with 20 expressions used by radiologists to communicate their level of diagnostic confidence within radiological reports. The median and inter-decile range (IDR) were computed for each expression, with a smaller IDR indicating greater reproducibility. Clinicians were also asked questions regarding their attitudes about radiologists' communication of diagnostic confidence. Results: Forty-nine surveys were completed. Median confidence associated with the expressions ranged from 10–90%. Reproducibility of the expressions was variable, as IDR ranged from 15–53%, although a median IDR of 40% indicated overall poor reproducibility. Expressions with relatively higher reproducibility included “most likely”, “likely”, and “unlikely” (IDR 15–20%), whereas expressions with relatively lower reproducibility included “compatible with”, “suspicious for”, “possibly,” and “can be seen in the setting of” (IDR ≥45%). Only 20% of clinicians agreed or strongly agreed that radiologists consistently use such expressions within their reports. Fifty-five percent of clinicians preferred that diagnostic confidence be communicated as a percentage rather than as a textual expression. Conclusion: There was poor reproducibility in clinicians' interpretations of many expressions used by radiologists to communicate their level of diagnostic confidence. Use of percentages to convey diagnostic confidence within reports may mitigate this source of ambiguity in radiologists' communication with clinicians. - Highlights: • Clinicians recorded certainty associated with

  12. Evidence for validity and reliability of a french version of the FAAM

    Directory of Open Access Journals (Sweden)

    Ballabeni Pierluigi

    2011-02-01

    Full Text Available Abstract Background The Foot and Ankle Ability Measure (FAAM is a self reported questionnaire for patients with foot and ankle disorders available in English, German, and Persian. This study plans to translate the FAAM from English to French (FAAM-F and assess the validity and reliability of this new version. Methods The FAAM-F Activities of Daily Living (ADL and sports subscales were completed by 105 French-speaking patients (average age 50.5 years presenting various chronic foot and ankle disorders. Convergent and divergent validity was assessed by Pearson's correlation coefficients between the FAAM-F subscales and the SF-36 scales: Physical Functioning (PF, Physical Component Summary (PCS, Mental Health (MH and Mental Component Summary (MCS. Internal consistency was calculated by Cronbach's Alpha (CA. To assess test re-test reliability, 22 patients filled out the questionnaire a second time to estimate minimal detectable changes (MDC and intraclass correlation coefficients (ICC. Results Correlations for FAAM-F ADL subscale were 0.85 with PF, 0.81 with PCS, 0.26 with MH, 0.37 with MCS. Correlations for FAAM-F Sports subscale were 0.72 with PF, 0.72 with PCS, 0.21 with MH, 0.29 with MCS. CA estimates were 0.97 for both subscales. Respectively for the ADL and Sports subscales, ICC were 0.97 and 0.94, errors for a single measure were 8 and 10 points at 95% confidence and the MDC values at 95% confidence were 7 and 18 points. Conclusion The FAAM-F is valid and reliable for the self-assessment of physical function in French-speaking patients with a wide range of chronic foot and ankle disorders.

  13. The reliability of the Adelaide in-shoe foot model.

    Science.gov (United States)

    Bishop, Chris; Hillier, Susan; Thewlis, Dominic

    2017-07-01

    Understanding the biomechanics of the foot is essential for many areas of research and clinical practice such as orthotic interventions and footwear development. Despite the widespread attention paid to the biomechanics of the foot during gait, what largely remains unknown is how the foot moves inside the shoe. This study investigated the reliability of the Adelaide In-Shoe Foot Model, which was designed to quantify in-shoe foot kinematics and kinetics during walking. Intra-rater reliability was assessed in 30 participants over five walking trials whilst wearing shoes during two data collection sessions, separated by one week. Sufficient reliability for use was interpreted as a coefficient of multiple correlation and intra-class correlation coefficient of >0.61. Inter-rater reliability was investigated separately in a second sample of 10 adults by two researchers with experience in applying markers for the purpose of motion analysis. The results indicated good consistency in waveform estimation for most kinematic and kinetic data, as well as good inter-and intra-rater reliability. The exception is the peak medial ground reaction force, the minimum abduction angle and the peak abduction/adduction external hindfoot joint moments which resulted in less than acceptable repeatability. Based on our results, the Adelaide in-shoe foot model can be used with confidence for 24 commonly measured biomechanical variables during shod walking. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Reliability and validity of Champion's Health Belief Model Scale for breast cancer screening among Malaysian women.

    Science.gov (United States)

    Parsa, P; Kandiah, M; Mohd Nasir, M T; Hejar, A R; Nor Afiah, M Z

    2008-11-01

    Breast cancer is the leading cause of cancer deaths in Malaysian women, and the use of breast self-examination (BSE), clinical breast examination (CBE) and mammography remain low in Malaysia. Therefore, there is a need to develop a valid and reliable tool to measure the beliefs that influence breast cancer screening practices. The Champion's Health Belief Model Scale (CHBMS) is a valid and reliable tool to measure beliefs about breast cancer and screening methods in the Western culture. The purpose of this study was to translate the use of CHBMS into the Malaysian context and validate the scale among Malaysian women. A random sample of 425 women teachers was taken from 24 secondary schools in Selangor state, Malaysia. The CHBMS was translated into the Malay language, validated by an expert's panel, back translated, and pretested. Analyses included descriptive statistics of all the study variables, reliability estimates, and construct validity using factor analysis. The mean age of the respondents was 37.2 (standard deviation 7.1) years. Factor analysis yielded ten factors for BSE with eigenvalue greater than 1 (four factors more than the original): confidence 1 (ability to differentiate normal and abnormal changes in the breasts), barriers to BSE, susceptibility for breast cancer, benefits of BSE, health motivation 1 (general health), seriousness 1 (fear of breast cancer), confidence 2 (ability to detect size of lumps), seriousness 2 (fear of long-term effects of breast cancer), health motivation 2 (preventive health practice), and confidence 3 (ability to perform BSE correctly). For CBE and mammography scales, seven factors each were identified. Factors for CBE scale include susceptibility, health motivation 1, benefits of CBE, seriousness 1, barriers of CBE, seriousness 2 and health motivation 2. For mammography the scale includes benefits of mammography, susceptibility, health motivation 1, seriousness 1, barriers to mammography seriousness 2 and health

  15. Measurement Error, Reliability, and Minimum Detectable Change in the Mini-Mental State Examination, Montreal Cognitive Assessment, and Color Trails Test among Community Living Middle-Aged and Older Adults.

    Science.gov (United States)

    Feeney, Joanne; Savva, George M; O'Regan, Claire; King-Kallimanis, Bellinda; Cronin, Hilary; Kenny, Rose Anne

    2016-05-31

    Knowing the reliability of cognitive tests, particularly those commonly used in clinical practice, is important in order to interpret the clinical significance of a change in performance or a low score on a single test. To report the intra-class correlation (ICC), standard error of measurement (SEM) and minimum detectable change (MDC) for the Mini-Mental State Examination (MMSE), Montreal Cognitive Assessment (MoCA), and Color Trails Test (CTT) among community dwelling older adults. 130 participants aged 55 and older without severe cognitive impairment underwent two cognitive assessments between two and four months apart. Half the group changed rater between assessments and half changed time of day. Mean (standard deviation) MMSE was 28.1 (2.1) at baseline and 28.4 (2.1) at repeat. Mean (SD) MoCA increased from 24.8 (3.6) to 25.2 (3.6). There was a rater effect on CTT, but not on the MMSE or MoCA. The SEM of the MMSE was 1.0, leading to an MDC (based on a 95% confidence interval) of 3 points. The SEM of the MoCA was 1.5, implying an MDC95 of 4 points. MoCA (ICC = 0.81) was more reliable than MMSE (ICC = 0.75), but all tests examined showed substantial within-patient variation. An individual's score would have to change by greater than or equal to 3 points on the MMSE and 4 points on the MoCA for the rater to be confident that the change was not due to measurement error. This has important implications for epidemiologists and clinicians in dementia screening and diagnosis.

  16. On reliability and maintenance modelling of ageing equipment in electric power systems

    International Nuclear Information System (INIS)

    Lindquist, Tommie

    2008-04-01

    Maintenance optimisation is essential to achieve cost-efficiency, availability and reliability of supply in electric power systems. The process of maintenance optimisation requires information about the costs of preventive and corrective maintenance, as well as the costs of failures borne by both electricity suppliers and customers. To calculate expected costs, information is needed about equipment reliability characteristics and the way in which maintenance affects equipment reliability. The aim of this Ph.D. work has been to develop equipment reliability models taking the effect of maintenance into account. The research has focussed on the interrelated areas of condition estimation, reliability modelling and maintenance modelling, which have been investigated in a number of case studies. In the area of condition estimation two methods to quantitatively estimate the condition of disconnector contacts have been developed, which utilise results from infrared thermography inspections and contact resistance measurements. The accuracy of these methods were investigated in two case studies. Reliability models have been developed and implemented for SF6 circuit-breakers, disconnector contacts and XLPE cables in three separate case studies. These models were formulated using both empirical and physical modelling approaches. To improve confidence in such models a Bayesian statistical method incorporating information from the equipment design process was also developed. This method was illustrated in a case study of SF6 circuit-breaker operating rods. Methods for quantifying the effect of maintenance on equipment condition and reliability have been investigated in case studies on disconnector contacts and SF6 circuit-breakers. The input required by these methods are condition measurements and historical failure and maintenance data, respectively. This research has demonstrated that the effect of maintenance on power system equipment may be quantified using available data

  17. Application-Driven Reliability Measures and Evaluation Tool for Fault-Tolerant Real-Time Systems

    National Research Council Canada - National Science Library

    Krishna, C

    2001-01-01

    .... The measure combines graphic-theoretic concepts in evaluating the underlying reliability of the network and other means to evaluate the ability of the network to support interprocessor traffic...

  18. Measurement methods to assess diastasis of the rectus abdominis muscle (DRAM): A systematic review of their measurement properties and meta-analytic reliability generalisation.

    Science.gov (United States)

    van de Water, A T M; Benjamin, D R

    2016-02-01

    Systematic literature review. Diastasis of the rectus abdominis muscle (DRAM) has been linked with low back pain, abdominal and pelvic dysfunction. Measurement is used to either screen or to monitor DRAM width. Determining which methods are suitable for screening and monitoring DRAM is of clinical value. To identify the best methods to screen for DRAM presence and monitor DRAM width. AMED, Embase, Medline, PubMed and CINAHL databases were searched for measurement property studies of DRAM measurement methods. Population characteristics, measurement methods/procedures and measurement information were extracted from included studies. Quality of all studies was evaluated using 'quality rating criteria'. When possible, reliability generalisation was conducted to provide combined reliability estimations. Thirteen studies evaluated measurement properties of the 'finger width'-method, tape measure, calipers, ultrasound, CT and MRI. Ultrasound was most evaluated. Methodological quality of these studies varied widely. Pearson's correlations of r = 0.66-0.79 were found between calipers and ultrasound measurements. Calipers and ultrasound had Intraclass Correlation Coefficients (ICC) of 0.78-0.97 for test-retest, inter- and intra-rater reliability. The 'finger width'-method had weighted Kappa's of 0.73-0.77 for test-retest reliability, but moderate agreement (63%; weighted Kappa = 0.53) between raters. Comparing calipers and ultrasound, low measurement error was found (above the umbilicus), and the methods had good agreement (83%; weighted Kappa = 0.66) for discriminative purposes. The available information support ultrasound and calipers as adequate methods to assess DRAM. For other methods limited measurement information of low to moderate quality is available and further evaluation of their measurement properties is required. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Reliability of corneal dynamic scheimpflug analyser measurements in virgin and post-PRK eyes.

    Directory of Open Access Journals (Sweden)

    Xiangjun Chen

    Full Text Available PURPOSE: To determine the measurement reliability of CorVis ST, a dynamic Scheimpflug analyser, in virgin and post-photorefractive keratectomy (PRK eyes and compare the results between these two groups. METHODS: Forty virgin eyes and 42 post-PRK eyes underwent CorVis ST measurements performed by two technicians. Repeatability was evaluated by comparing three consecutive measurements by technician A. Reproducibility was determined by comparing the first measurement by technician A with one performed by technician B. Intraobserver and interobserver intraclass correlation coefficients (ICCs were calculated. Univariate analysis of covariance (ANCOVA was used to compare measured parameters between virgin and post-PRK eyes. RESULTS: The intraocular pressure (IOP, central corneal thickness (CCT and 1st applanation time demonstrated good intraobserver repeatability and interobserver reproducibility (ICC ≧ 0.90 in virgin and post-PRK eyes. The deformation amplitude showed a good or close to good repeatability and reproducibility in both groups (ICC ≧ 0.88. The CCT correlated positively with 1st applanation time (r = 0.437 and 0.483, respectively, p<0.05 and negatively with deformation amplitude (r = -0.384 and -0.375, respectively, p<0.05 in both groups. Compared to post-PRK eyes, virgin eyes showed longer 1st applanation time (7.29 ± 0.21 vs. 6.96 ± 0.17 ms, p<0.05 and lower deformation amplitude (1.06 ± 0.07 vs. 1.17 ± 0.08 mm, p < 0.05. CONCLUSIONS: CorVis ST demonstrated reliable measurements for CCT, IOP, and 1st applanation time, as well as relatively reliable measurement for deformation amplitude in both virgin and post-PRK eyes. There were differences in 1st applanation time and deformation amplitude between virgin and post-PRK eyes, which may reflect corneal biomechanical changes occurring after the surgery in the latter.

  20. Comparison of physical impairment, functional, and psychosocial measures based on fear of reinjury/lack of confidence and return-to-sport status after ACL reconstruction.

    Science.gov (United States)

    Lentz, Trevor A; Zeppieri, Giorgio; George, Steven Z; Tillman, Susan M; Moser, Michael W; Farmer, Kevin W; Chmielewski, Terese L

    2015-02-01

    Fear of reinjury and lack of confidence influence return-to-sport outcomes after anterior cruciate ligament (ACL) reconstruction. The physical, psychosocial, and functional recovery of patients reporting fear of reinjury or lack of confidence as their primary barrier to resuming sports participation is unknown. To compare physical impairment, functional, and psychosocial measures between subgroups based on return-to-sport status and fear of reinjury/lack of confidence in the return-to-sport stage and to determine the association of physical impairment and psychosocial measures with function for each subgroup at 6 months and 1 year after surgery. Case-control study; Level of evidence, 3. Physical impairment (quadriceps index [QI], quadriceps strength/body weight [QSBW], hamstring:quadriceps strength ratio [HQ ratio], pain intensity), self-report of function (International Knee Documentation Committee [IKDC]), and psychosocial (Tampa Scale for Kinesiophobia-shortened form [TSK-11]) measures were collected at 6 months and 1 year after surgery in 73 patients with ACL reconstruction. At 1 year, subjects were divided into "return-to-sport" (YRTS) or "not return-to-sport" (NRTS) subgroups based on their self-reported return to preinjury sport status. Patients in the NRTS subgroup were subcategorized as NRTS-Fear/Confidence if fear of reinjury/lack of confidence was the primary reason for not returning to sports, and all others were categorized as NRTS-Other. A total of 46 subjects were assigned to YRTS, 13 to NRTS-Other, and 14 to NRTS-Fear/Confidence. Compared with the YRTS subgroup, the NRTS-Fear/Confidence subgroup was older and had lower QSBW, lower IKDC score, and higher TSK-11 score at 6 months and 1 year; however, they had similar pain levels. In the NRTS-Fear/Confidence subgroup, the IKDC score was associated with QSBW and pain at 6 months and QSBW, QI, pain, and TSK-11 scores at 1 year. Elevated pain-related fear of movement/reinjury, quadriceps weakness, and

  1. Determining Reliability of a Dual-Task Functional Mobility Protocol for Individuals With Lower Extremity Amputation.

    Science.gov (United States)

    Hunter, Susan W; Frengopoulos, Courtney; Holmes, Jeff; Viana, Ricardo; Payne, Michael W

    2018-04-01

    To determine the relative and absolute reliability of a dual-task functional mobility assessment. Cross-sectional study. Academic rehabilitation hospital. Individuals (N=60) with lower extremity amputation attending an outpatient amputee clinic (mean age, 58.21±12.59y; 18, 80% male) who were stratified into 3 groups: (1) transtibial amputation of vascular etiology (n=20); (2) transtibial amputation of nonvascular etiology (n=20); and (3) transfemoral or bilateral amputation of any etiology (n=20). Not applicable. Time to complete the L Test measured functional mobility under single- and dual-task conditions. The addition of a cognitive task (serial subtractions by 3's) created dual-task conditions. Single-task performance on the cognitive task was also reported. Intraclass correlation coefficients (ICCs) measured relative reliability; SEM and minimal detectable change with a 95% confidence interval (MDC 95 ) measured absolute reliability. Bland-Altman plots measured agreement between assessments. Relative reliability results were excellent for all 3 groups. Values for the dual-task L Test for those with transtibial amputation of vascular etiology (n=20; mean age, 60.36±7.84y; 19, 90% men) were ICC=.98 (95% confidence interval [CI], .94-.99), SEM=1.36 seconds, and MDC 95 =3.76 seconds; for those with transtibial amputation of nonvascular etiology (n=20; mean age, 55.85±14.08y; 17, 85% men), values were ICC=.93 (95% CI, .80-.98), SEM=1.34 seconds, and MDC 95 =3.71 seconds; and for those with transfemoral or bilateral amputation (n=20; mean age, 58.21±14.88y; 13, 65% men), values were ICC=.998 (95% CI, .996-.999), SEM=1.03 seconds, and MDC 95 =2.85 seconds. Bland-Altman plots indicated that assessments did not vary systematically for each group. This dual-task assessment protocol achieved approved levels of relative reliability values for the 3 groups tested. This protocol may be used clinically or in research settings to assess the interaction between cognition

  2. Intra-rater reliability of hallux flexor strength measures using the Nintendo Wii Balance Board.

    Science.gov (United States)

    Quek, June; Treleaven, Julia; Brauer, Sandra G; O'Leary, Shaun; Clark, Ross A

    2015-01-01

    The purpose of this study was to investigate the intra-rater reliability of a new method in combination with the Nintendo Wii Balance Board (NWBB) to measure the strength of hallux flexor muscle. Thirty healthy individuals (age: 34.9 ± 12.9 years, height: 170.4 ± 10.5 cm, weight: 69.3 ± 15.3 kg, female = 15) participated. Repeated testing was completed within 7 days. Participants performed strength testing in sitting using a wooden platform in combination with the NWBB. This new method was set up to selectively recruit an intrinsic muscle of the foot, specifically the flexor hallucis brevis muscle. Statistical analysis was performed using intra-class coefficients and ordinary least product analysis. To estimate measurement error, standard error of measurement (SEM), minimal detectable change (MDC) and percentage error were calculated. Results indicate excellent intra-rater reliability (ICC = 0.982, CI = 0.96-0.99) with an absence of systematic bias. SEM, MDC and percentage error value were 0.5, 1.4 and 12 % respectively. This study demonstrates that a new method in combination with the NWBB application is reliable to measure hallux flexor strength and has potential to be used for future research and clinical application.

  3. Quantitative measurement of hypertrophic scar: intrarater reliability, sensitivity, and specificity.

    Science.gov (United States)

    Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo

    2008-01-01

    The comparison of scar evaluation over time requires measurement tools with acceptable intrarater reliability and the ability to discriminate skin characteristics of interest. The objective of this study was to evaluate the intrarater reliability and sensitivity and specificity of the Cutometer, the Mexameter, and the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar (donor sites), and hypertrophic scar (HSc). A single investigator evaluated four tissue types (severe HSc, less severe HSc, donor site, and normal skin) in 30 burn survivors with all four measurement tools. The intraclass correlation coefficient (ICC) for the Cutometer was acceptable (> or =0.75) for the maximum deformation measure for the donor site and normal skin (>0.78) but was below the acceptable range for the HSc sites and all other parameters. The ICC for the Mexameter erythema (>0.75) and melanin index (>0.89) and the DermaScan C total thickness measurement (>0.82) were acceptable for all sites. The ICC for the total of the height, pliability, and vascularity subscales of the mVSS was acceptable (0.81) for normal scar but below the acceptable range for the scar sites. The DermaScan C was clearly able to discriminate HSc from normal scar and normal skin based on the total thickness measure. The Cutometer was less discriminating but was still able to discriminate HSc from normal scar and normal skin. The Mexameter erythema index was not a good discriminator of HSc and normal scar. Receiver operating characteristic curves were generated to establish the best cutoff point for the DermaScan C total thickness and the Cutometer maximum deformation, which were 2.034 and 0.387 mm, respectively. This study showed that although the Cutometer, the DermaScan C, and the Mexameter have measurement properties that make them attractive substitutes for the mVSS, caution must be used when interpreting results since the Cutometer has a ceiling effect when

  4. The minimum sit-to-stand height test: reliability, responsiveness and relationship to leg muscle strength.

    Science.gov (United States)

    Schurr, Karl; Sherrington, Catherine; Wallbank, Geraldine; Pamphlett, Patricia; Olivetti, Lynette

    2012-07-01

    To determine the reliability of the minimum sit-to-stand height test, its responsiveness and its relationship to leg muscle strength among rehabilitation unit inpatients and outpatients. Reliability study using two measurers and two test occasions. Secondary analysis of data from two clinical trials. Inpatient and outpatient rehabilitation services in three public hospitals. Eighteen hospital patients and five others participated in the reliability study. Seventy-two rehabilitation unit inpatients and 80 outpatients participated in the clinical trials. The minimum sit-to-stand height test was assessed using a standard procedure. For the reliability study, a second tester repeated the minimum sit-to-stand height test on the same day. In the inpatient clinical trial the measures were repeated two weeks later. In the outpatient trial the measures were repeated five weeks later. Knee extensor muscle strength was assessed in the clinical trials using a hand-held dynamometer. The reliability for the minimum sit-to-stand height test was excellent (intraclass correlation coefficient (ICC) 0.91, 95% confidence interval (CI) 0.81-0.96). The standard error of measurement was 34 mm. Responsiveness was moderate in the inpatient trial (effect size: 0.53) but small in the outpatient trial (effect size: 0.16). A small proportion (8-17%) of variability in minimum sit-to-stand height test was explained by knee extensor muscle strength. The minimum sit-to-stand height test has excellent reliability and moderate responsiveness in an inpatient rehabilitation setting. Responsiveness in an outpatient rehabilitation setting requires further investigation. Performance is influenced by factors other than knee extensor muscle strength.

  5. Effects of confidence and anxiety on flow state in competition.

    Science.gov (United States)

    Koehn, Stefan

    2013-01-01

    Confidence and anxiety are important variables that underlie the experience of flow in sport. Specifically, research has indicated that confidence displays a positive relationship and anxiety a negative relationship with flow. The aim of this study was to assess potential direct and indirect effects of confidence and anxiety dimensions on flow state in tennis competition. A sample of 59 junior tennis players completed measures of Competitive State Anxiety Inventory-2d and Flow State Scale-2. Following predictive analysis, results showed significant positive correlations between confidence (intensity and direction) and anxiety symptoms (only directional perceptions) with flow state. Standard multiple regression analysis indicated confidence as the only significant predictor of flow. The results confirmed a protective function of confidence against debilitating anxiety interpretations, but there were no significant interaction effects between confidence and anxiety on flow state.

  6. Measuring the Confidence of 8th Grade Taiwanese Students' Knowledge of Acids and Bases

    Science.gov (United States)

    Jack, Brady Michael; Liu, Chia-Ju; Chiu, Houn-Lin; Tsai, Chun-Yen

    2012-01-01

    The present study investigated whether gender differences were present on the confidence judgments made by 8th grade Taiwanese students on the accuracy of their responses to acid-base test items. A total of 147 (76 male, 71 female) students provided item-specific confidence judgments during a test of their knowledge of acids and bases. Using the…

  7. [Reliability study in the measurement of the cusp inclination angle of a chairside digital model].

    Science.gov (United States)

    Xinggang, Liu; Xiaoxian, Chen

    2018-02-01

    This study aims to evaluate the reliability of the software Picpick in the measurement of the cusp inclination angle of a digital model. Twenty-one trimmed models were used as experimental objects. The chairside digital impression was then used for the acquisition of 3D digital models, and the software Picpick was employed for the measurement of the cusp inclination of these models. The measurements were repeated three times, and the results were compared with a gold standard, which was a manually measured experimental model cusp angle. The intraclass correlation coefficient (ICC) was calculated. The paired t test value of the two measurement methods was 0.91. The ICCs between the two measurement methods and three repeated measurements were greater than 0.9. The digital model achieved a smaller coefficient of variation (9.9%). The software Picpick is reliable in measuring the cusp inclination of a digital model.

  8. Reliability of perceived neighbourhood conditions and the effects of measurement error on self-rated health across urban and rural neighbourhoods.

    Science.gov (United States)

    Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario

    2012-04-01

    Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.

  9. Quantitative outcome measures for systemic sclerosis-related Microangiopathy - Reliability of image acquisition in Nailfold Capillaroscopy.

    Science.gov (United States)

    Dinsdale, Graham; Moore, Tonia; O'Leary, Neil; Berks, Michael; Roberts, Christopher; Manning, Joanne; Allen, John; Anderson, Marina; Cutolo, Maurizio; Hesselstrand, Roger; Howell, Kevin; Pizzorni, Carmen; Smith, Vanessa; Sulli, Alberto; Wildt, Marie; Taylor, Christopher; Murray, Andrea; Herrick, Ariane L

    2017-09-01

    Nailfold capillaroscopic parameters hold increasing promise as outcome measures for clinical trials in systemic sclerosis (SSc). Their inclusion as outcomes would often naturally require capillaroscopy images to be captured at several time points during any one study. Our objective was to assess repeatability of image acquisition (which has been little studied), as well as of measurement. 41 patients (26 with SSc, 15 with primary Raynaud's phenomenon) and 10 healthy controls returned for repeat high-magnification (300×) videocapillaroscopy mosaic imaging of 10 digits one week after initial imaging (as part of a larger study of reliability). Images were assessed in a random order by an expert blinded observer and 4 outcome measures extracted: (1) overall image grade and then (where possible) distal vessel locations were marked, allowing (2) vessel density (across the whole nailfold) to be calculated (3) apex width measurement and (4) giant vessel count. Intra-rater, intra-visit and intra-rater inter-visit (baseline vs. 1week) reliability were examined in 475 and 392 images respectively. A linear, mixed-effects model was used to estimate variance components, from which intra-class correlation coefficients (ICCs) were determined. Intra-visit and inter-visit reliability estimates (ICCs) were (respectively): overall image grade, 0.97 and 0.90; vessel density, 0.92 and 0.65; mean vessel width, 0.91 and 0.79; presence of giant capillary, 0.68 and 0.56. These estimates were conditional on each parameter being measurable. Within-operator image analysis and acquisition are reproducible. Quantitative nailfold capillaroscopy, at least with a single observer, provides reliable outcome measures for clinical studies including randomised controlled trials. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Increasing the reliability of the fluid/crystallized difference score from the Kaufman Adolescent and Adult Intelligence Test with reliable component analysis.

    Science.gov (United States)

    Caruso, J C

    2001-06-01

    The unreliability of difference scores is a well documented phenomenon in the social sciences and has led researchers and practitioners to interpret differences cautiously, if at all. In the case of the Kaufman Adult and Adolescent Intelligence Test (KAIT), the unreliability of the difference between the Fluid IQ and the Crystallized IQ is due to the high correlation between the two scales. The consequences of the lack of precision with which differences are identified are wide confidence intervals and unpowerful significance tests (i.e., large differences are required to be declared statistically significant). Reliable component analysis (RCA) was performed on the subtests of the KAIT in order to address these problems. RCA is a new data reduction technique that results in uncorrelated component scores with maximum proportions of reliable variance. Results indicate that the scores defined by RCA have discriminant and convergent validity (with respect to the equally weighted scores) and that differences between the scores, derived from a single testing session, were more reliable than differences derived from equal weighting for each age group (11-14 years, 15-34 years, 35-85+ years). This reliability advantage results in narrower confidence intervals around difference scores and smaller differences required for statistical significance.

  11. Validation of Navigation Ultrasound for Clavicular Length Measurement

    DEFF Research Database (Denmark)

    Høj, Anders Thorsmark; Villa, Chiara; Christensen, Ole M.

    2017-01-01

    interval): approximately ± 7.5 mm, Pearson's correlation R: 0.948-0.974). Navigation ultrasound can measure clavicular length with an intra-rater reliability matching that of 3-D rendered computed tomography scans and with high validity. Its use could spread to other fields requiring accurate...... of 52.5 (range: 21-78 y) were included. Navigation ultrasound exhibited high reliability (intra-class correlation coefficient: 0.942-0.997, standard error of the mean: 0.7-2.9 mm, minimal detectable change: 2.3-8.1 mm) and validity (measurement error: 1.3%-1.8%, limits of agreement (95% confidence...

  12. System Reliability Engineering

    International Nuclear Information System (INIS)

    Lim, Tae Jin

    2005-02-01

    This book tells of reliability engineering, which includes quality and reliability, reliability data, importance of reliability engineering, reliability and measure, the poisson process like goodness of fit test and the poisson arrival model, reliability estimation like exponential distribution, reliability of systems, availability, preventive maintenance such as replacement policies, minimal repair policy, shock models, spares, group maintenance and periodic inspection, analysis of common cause failure, and analysis model of repair effect.

  13. Statistical equivalence and test-retest reliability of delay and probability discounting using real and hypothetical rewards.

    Science.gov (United States)

    Matusiewicz, Alexis K; Carter, Anne E; Landes, Reid D; Yi, Richard

    2013-11-01

    Delay discounting (DD) and probability discounting (PD) refer to the reduction in the subjective value of outcomes as a function of delay and uncertainty, respectively. Elevated measures of discounting are associated with a variety of maladaptive behaviors, and confidence in the validity of these measures is imperative. The present research examined (1) the statistical equivalence of discounting measures when rewards were hypothetical or real, and (2) their 1-week reliability. While previous research has partially explored these issues using the low threshold of nonsignificant difference, the present study fully addressed this issue using the more-compelling threshold of statistical equivalence. DD and PD measures were collected from 28 healthy adults using real and hypothetical $50 rewards during each of two experimental sessions, one week apart. Analyses using area-under-the-curve measures revealed a general pattern of statistical equivalence, indicating equivalence of real/hypothetical conditions as well as 1-week reliability. Exceptions are identified and discussed. Copyright © 2013 Elsevier B.V. All rights reserved.

  14. Between-day reliability of a method for non-invasive estimation of muscle composition.

    Science.gov (United States)

    Simunič, Boštjan

    2012-08-01

    Tensiomyography is a method for valid and non-invasive estimation of skeletal muscle fibre type composition. The validity of selected temporal tensiomyographic measures has been well established recently; there is, however, no evidence regarding the method's between-day reliability. Therefore it is the aim of this paper to establish the between-day repeatability of tensiomyographic measures in three skeletal muscles. For three consecutive days, 10 healthy male volunteers (mean±SD: age 24.6 ± 3.0 years; height 177.9 ± 3.9 cm; weight 72.4 ± 5.2 kg) were examined in a supine position. Four temporal measures (delay, contraction, sustain, and half-relaxation time) and maximal amplitude were extracted from the displacement-time tensiomyogram. A reliability analysis was performed with calculations of bias, random error, coefficient of variation (CV), standard error of measurement, and intra-class correlation coefficient (ICC) with a 95% confidence interval. An analysis of ICC demonstrated excellent agreement (ICC were over 0.94 in 14 out of 15 tested parameters). However, lower CV was observed in half-relaxation time, presumably because of the specifics of the parameter definition itself. These data indicate that for the three muscles tested, tensiomyographic measurements were reproducible across consecutive test days. Furthermore, we indicated the most possible origin of the lowest reliability detected in half-relaxation time. Copyright © 2012 Elsevier Ltd. All rights reserved.

  15. Confidence-Based Learning in Investment Analysis

    Science.gov (United States)

    Serradell-Lopez, Enric; Lara-Navarra, Pablo; Castillo-Merino, David; González-González, Inés

    The aim of this study is to determine the effectiveness of using multiple choice tests in subjects related to the administration and business management. To this end we used a multiple-choice test with specific questions to verify the extent of knowledge gained and the confidence and trust in the answers. The tests were performed in a group of 200 students at the bachelor's degree in Business Administration and Management. The analysis made have been implemented in one subject of the scope of investment analysis and measured the level of knowledge gained and the degree of trust and security in the responses at two different times of the course. The measurements have been taken into account different levels of difficulty in the questions asked and the time spent by students to complete the test. The results confirm that students are generally able to obtain more knowledge along the way and get increases in the degree of trust and confidence in the answers. It is confirmed as the difficulty level of the questions set a priori by the heads of the subjects are related to levels of security and confidence in the answers. It is estimated that the improvement in the skills learned is viewed favourably by businesses and are especially important for job placement of students.

  16. Inertial Measurement Units for Clinical Movement Analysis: Reliability and Concurrent Validity

    Directory of Open Access Journals (Sweden)

    Mohammad Al-Amri

    2018-02-01

    Full Text Available The aim of this study was to investigate the reliability and concurrent validity of a commercially available Xsens MVN BIOMECH inertial-sensor-based motion capture system during clinically relevant functional activities. A clinician with no prior experience of motion capture technologies and an experienced clinical movement scientist each assessed 26 healthy participants within each of two sessions using a camera-based motion capture system and the MVN BIOMECH system. Participants performed overground walking, squatting, and jumping. Sessions were separated by 4 ± 3 days. Reliability was evaluated using intraclass correlation coefficient and standard error of measurement, and validity was evaluated using the coefficient of multiple correlation and the linear fit method. Day-to-day reliability was generally fair-to-excellent in all three planes for hip, knee, and ankle joint angles in all three tasks. Within-day (between-rater reliability was fair-to-excellent in all three planes during walking and squatting, and poor-to-high during jumping. Validity was excellent in the sagittal plane for hip, knee, and ankle joint angles in all three tasks and acceptable in frontal and transverse planes in squat and jump activity across joints. Our results suggest that the MVN BIOMECH system can be used by a clinician to quantify lower-limb joint angles in clinically relevant movements.

  17. The reliability and validity of fatigue measures during short-duration maximal-intensity intermittent cycling.

    Science.gov (United States)

    Glaister, Mark; Stone, Michael H; Stewart, Andrew M; Hughes, Michael; Moir, Gavin L

    2004-08-01

    The purpose of the present study was to assess the reliability and validity of fatigue measures, as derived from 4 separate formulae, during tests of repeat sprint ability. On separate days over a 3-week period, 2 groups of 7 recreationally active men completed 6 trials of 1 of 2 maximal (20 x 5 seconds) intermittent cycling tests with contrasting recovery periods (10 or 30 seconds). All trials were conducted on a friction-braked cycle ergometer, and fatigue scores were derived from measures of mean power output for each sprint. Apart from formula 1, which calculated fatigue from the percentage difference in mean power output between the first and last sprint, all remaining formulae produced fatigue scores that showed a reasonably good level of test-retest reliability in both intermittent test protocols (intraclass correlation range: 0.78-0.86; 95% likely range of true values: 0.54-0.97). Although between-protocol differences in the magnitude of the fatigue scores suggested good construct validity, within-protocol differences highlighted limitations with each formula. Overall, the results support the use of the percentage decrement score as the most valid and reliable measure of fatigue during brief maximal intermittent work.

  18. Reliability and Validity of an Internet-based Questionnaire Measuring Lifetime Physical Activity

    OpenAIRE

    De Vera, Mary A.; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek

    2010-01-01

    Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005–2006. Reliability was examined u...

  19. The use of adaptable automation: Effects of extended skill lay-off and changes in system reliability.

    Science.gov (United States)

    Sauer, Juergen; Chavaillaz, Alain

    2017-01-01

    This experiment aimed to examine how skill lay-off and system reliability would affect operator behaviour in a simulated work environment under wide-range and large-choice adaptable automation comprising six different levels. Twenty-four participants were tested twice during a 2-hr testing session, with the second session taking place 8 months after the first. In the middle of the second testing session, system reliability changed. The results showed that after the retention interval trust increased and self-confidence decreased. Complacency was unaffected by the lay-off period. Diagnostic speed slowed down after the retention interval but diagnostic accuracy was maintained. No difference between experimental conditions was found for automation management behaviour (i.e. level of automation chosen and frequency of switching between levels). There were few effects of system reliability. Overall, the findings showed that subjective measures were more sensitive to the impact of skill lay-off than objective behavioural measures. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Confidence Intervals Verification for Simulated Error Rate Performance of Wireless Communication System

    KAUST Repository

    Smadi, Mahmoud A.

    2012-12-06

    In this paper, we derived an efficient simulation method to evaluate the error rate of wireless communication system. Coherent binary phase-shift keying system is considered with imperfect channel phase recovery. The results presented demonstrate the system performance under very realistic Nakagami-m fading and additive white Gaussian noise channel. On the other hand, the accuracy of the obtained results is verified through running the simulation under a good confidence interval reliability of 95 %. We see that as the number of simulation runs N increases, the simulated error rate becomes closer to the actual one and the confidence interval difference reduces. Hence our results are expected to be of significant practical use for such scenarios. © 2012 Springer Science+Business Media New York.

  1. Assessing the Reliability of Curriculum-Based Measurement: An Application of Latent Growth Modeling

    Science.gov (United States)

    Yeo, Seungsoo; Kim, Dong-Il; Branum-Martin, Lee; Wayman, Miya Miura; Espin, Christine A.

    2012-01-01

    The purpose of this study was to demonstrate the use of Latent Growth Modeling (LGM) as a method for estimating reliability of Curriculum-Based Measurement (CBM) progress-monitoring data. The LGM approach permits the error associated with each measure to differ at each time point, thus providing an alternative method for examining of the…

  2. Reliability, precision, and gender differences in knee internal/external rotation proprioception measurements.

    Science.gov (United States)

    Nagai, Takashi; Sell, Timothy C; Abt, John P; Lephart, Scott M

    2012-11-01

    To develop and assess the reliability and precision of knee internal/external rotation (IR/ER) threshold to detect passive motion (TTDPM) and determine if gender differences exist. Test-retest for the reliability/precision and cross-sectional for gender comparisons. University neuromuscular and human performance research laboratory. Ten subjects for the reliability and precision aim. Twenty subjects (10 males and 10 females) for gender comparisons. All TTDPM tests were performed using a multi-mode dynamometer. Subjects performed TTDPM at two knee positions (near IR or ER end-range). Intraclass correlation coefficient (ICC (3,k)) and standard error of measurement (SEM) were used to evaluate the reliability and precision. Independent t-tests were used to compare genders. TTDPM toward IR and ER at two knee positions. Intrasession and intersession reliability and precision were good (ICC=0.68-0.86; SEM=0.22°-0.37°). Females had significantly diminished TTDPM toward IR at IR-test position (males: 0.77°±0.14°, females: 1.18°±0.46°, p=0.021) and TTDPM toward IR at the ER-test position (males: 0.87°±0.13°, females: 1.36°±0.58°, p=0.026). No other significant gender differences were found (p>0.05). The current IR/ER TTDPM methods are reliable and accurate for the test-retest or cross-section research design. Gender differences were found toward IR where the ACL acts as the secondary restraint. Copyright © 2011 Elsevier Ltd. All rights reserved.

  3. Reliability of length measurements collected by community nurses and health volunteers in rural growth monitoring and promotion services.

    Science.gov (United States)

    Laar, Matilda E; Marquis, Grace S; Lartey, Anna; Gray-Donald, Katherine

    2018-02-17

    Length measurements are important in growth, monitoring and promotion (GMP) for the surveillance of a child's weight-for-length and length-for-age. These two indices provide an indication of a child's risk of becoming wasted or stunted, and are more informative about a child's growth than the widely used weight-for-age index (underweight). Although the introduction of length measurements in GMP is recommended by the World Health Organization, concerns about the reliability of length measurements collected in rural outreach settings have been expressed by stakeholders. Our aim was to describe the reliability and challenges associated with community health personnel measuring length for rural outreach GMP activities. Two reliability studies (A and B), using 10 children less than 24 months each, were conducted in the GMP services of a rural district in Ghana. Fifteen nurses and 15 health volunteers (HV) with no prior experience in length measurements were trained. Intra- and inter-observer technical error of measurement (TEM), average bias from expert anthropometrist, and coefficient of reliability (R) of length measurements were assessed and compared across sessions. Observations and interviews were used to understand the ability and experiences of health personnel with measuring length at outreach GMP. Inter-observer TEM was larger than intra-observer TEM for both nurses and HV at both sessions and was unacceptably (compared to error standards) high in both groups at both time points. Average biases from expert's measurements were within acceptable limits, however, both groups tended to underestimate length measurements. The R for lengths collected by nurses (92.3%) was higher at session B compared to that of HV (87.5%). Length measurements taken by nurses and HV, and those taken by an experienced anthropometrist at GMP sessions were of moderate agreement (kappa = 0.53, p reliability of length measurements improved after two refresher trainings for nurses but

  4. Reliability, precision, and measurement in the context of data from ability tests, surveys, and assessments

    International Nuclear Information System (INIS)

    Fisher, W P Jr; Elbaum, B; Coulter, A

    2010-01-01

    Reliability coefficients indicate the proportion of total variance attributable to differences among measures separated along a quantitative continuum by a testing, survey, or assessment instrument. Reliability is usually considered to be influenced by both the internal consistency of a data set and the number of items, though textbooks and research papers rarely evaluate the extent to which these factors independently affect the data in question. Probabilistic formulations of the requirements for unidimensional measurement separate consistency from error by modelling individual response processes instead of group-level variation. The utility of this separation is illustrated via analyses of small sets of simulated data, and of subsets of data from a 78-item survey of over 2,500 parents of children with disabilities. Measurement reliability ultimately concerns the structural invariance specified in models requiring sufficient statistics, parameter separation, unidimensionality, and other qualities that historically have made quantification simple, practical, and convenient for end users. The paper concludes with suggestions for a research program aimed at focusing measurement research more on the calibration and wide dissemination of tools applicable to individuals, and less on the statistical study of inter-variable relations in large data sets.

  5. The reliability of measuring pain distribution and location using body pain diagrams in patients with acute whiplash-associated disorders.

    Science.gov (United States)

    Southerst, Danielle; Stupar, Maja; Côté, Pierre; Mior, Silvano; Stern, Paula

    2013-09-01

    The objective of this study was to measure the interexaminer reliability of scoring pain distribution using paper and electronic body pain diagrams in patients with acute whiplash-associated disorder and to assess the intermethod reliability of measuring pain distribution and location using paper and electronic diagrams. We conducted an interexaminer reliability study on 80 participants recruited from a randomized controlled trial on the conservative management of acute grade I/II whiplash-associated disorder. Participants were assessed for inclusion/exclusion criteria by an experienced clinician. As part of the baseline assessment, participants independently completed paper and electronic pain diagrams. Diagrams were scored independently by 2 examiners using the body region method. Interexaminer and intermethod reliability was computed using intraclass correlation coefficients (ICCs) for pain distribution and κ coefficient for pain location. We used Bland-Altman plots to compute limits of agreement. The interexaminer reliability was ICC = 0.925 for paper and ICC = 0.997 for the electronic body pain diagram. The intermethod reliability for measuring pain distribution ranged from ICC = 0.63 to ICC = 0.93. For pain location, the intermethod reliability varied from κ = 0.23 (posterior neck) to κ = 0.90 (right side of the face). We found good to excellent interexaminer reliability for scoring 2 versions of the body pain diagram. Pain distribution and pain location were reliably and consistently measured on body pain diagrams using paper and electronic methods; therefore, clinicians and researchers may choose either medium when using body pain diagrams. Copyright © 2013 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.

  6. Reliability of Composite Dichotomous Measurements

    Czech Academy of Sciences Publication Activity Database

    Martinková, Patrícia; Zvára, Karel

    2010-01-01

    Roč. 6, č. 2 (2010), s. 103-109 ISSN 1801-5603 R&D Projects: GA MŠk(CZ) 1M06014 Institutional research plan: CEZ:AV0Z10300504 Keywords : reliability * binary data * logistic regression * Cronbach alpha * Rasch model * myocardial perfusion diagnosis Subject RIV: BB - Applied Statistics, Operational Research http://www.ejbi.cz/articles/201012/65/1.html

  7. Sensitivity to mental effort and test-retest reliability of heart rate variability measures in healthy seniors.

    Science.gov (United States)

    Mukherjee, Shalini; Yadav, Rajeev; Yung, Iris; Zajdel, Daniel P; Oken, Barry S

    2011-10-01

    To determine (1) whether heart rate variability (HRV) was a sensitive and reliable measure in mental effort tasks carried out by healthy seniors and (2) whether non-linear approaches to HRV analysis, in addition to traditional time and frequency domain approaches were useful to study such effects. Forty healthy seniors performed two visual working memory tasks requiring different levels of mental effort, while ECG was recorded. They underwent the same tasks and recordings 2 weeks later. Traditional and 13 non-linear indices of HRV including Poincaré, entropy and detrended fluctuation analysis (DFA) were determined. Time domain, especially mean R-R interval (RRI), frequency domain and, among non-linear parameters - Poincaré and DFA were the most reliable indices. Mean RRI, time domain and Poincaré were also the most sensitive to different mental effort task loads and had the largest effect size. Overall, linear measures were the most sensitive and reliable indices to mental effort. In non-linear measures, Poincaré was the most reliable and sensitive, suggesting possible usefulness as an independent marker in cognitive function tasks in healthy seniors. A large number of HRV parameters was both reliable as well as sensitive indices of mental effort, although the simple linear methods were the most sensitive. Copyright © 2011 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

  8. Measurement of transplanted pancreatic volume using computed tomography: reliability by intra- and inter-observer variability

    International Nuclear Information System (INIS)

    Lundqvist, Eva; Segelsjoe, Monica; Magnusson, Anders; Andersson, Anna; Biglarnia, Ali-Reza

    2012-01-01

    Background Unlike other solid organ transplants, pancreas allografts can undergo a substantial decrease in baseline volume after transplantation. This phenomenon has not been well characterized, as there are insufficient data on reliable and reproducible volume assessments. We hypothesized that characterization of pancreatic volume by means of computed tomography (CT) could be a useful method for clinical follow-up in pancreas transplant patients. Purpose To evaluate the feasibility and reliability of pancreatic volume assessment using CT scan in transplanted patients. Material and Methods CT examinations were performed on 21 consecutive patients undergoing pancreas transplantation. Volume measurements were carried out by two observers tracing the pancreatic contours in all slices. The observers performed the measurements twice for each patient. Differences in volume measurement were used to evaluate intra- and inter-observer variability. Results The intra-observer variability for the pancreatic volume measurements of Observers 1 and 2 was found to be in almost perfect agreement, with an intraclass correlation coefficient (ICC) of 0.90 (0.77-0.96) and 0.99 (0.98-1.0), respectively. Regarding inter-observer validity, the ICCs for the first and second measurements were 0.90 (range, 0.77-0.96) and 0.95 (range, 0.85-0.98), respectively. Conclusion CT volumetry is a reliable and reproducible method for measurement of transplanted pancreatic volume

  9. Measurement of transplanted pancreatic volume using computed tomography: reliability by intra- and inter-observer variability

    Energy Technology Data Exchange (ETDEWEB)

    Lundqvist, Eva; Segelsjoe, Monica; Magnusson, Anders [Uppsala Univ., Dept. of Radiology, Oncology and Radiation Science, Section of Radiology, Uppsala (Sweden)], E-mail: eva.lundqvist.8954@student.uu.se; Andersson, Anna; Biglarnia, Ali-Reza [Dept. of Surgical Sciences, Section of Transplantation Surgery, Uppsala Univ. Hospital, Uppsala (Sweden)

    2012-11-15

    Background Unlike other solid organ transplants, pancreas allografts can undergo a substantial decrease in baseline volume after transplantation. This phenomenon has not been well characterized, as there are insufficient data on reliable and reproducible volume assessments. We hypothesized that characterization of pancreatic volume by means of computed tomography (CT) could be a useful method for clinical follow-up in pancreas transplant patients. Purpose To evaluate the feasibility and reliability of pancreatic volume assessment using CT scan in transplanted patients. Material and Methods CT examinations were performed on 21 consecutive patients undergoing pancreas transplantation. Volume measurements were carried out by two observers tracing the pancreatic contours in all slices. The observers performed the measurements twice for each patient. Differences in volume measurement were used to evaluate intra- and inter-observer variability. Results The intra-observer variability for the pancreatic volume measurements of Observers 1 and 2 was found to be in almost perfect agreement, with an intraclass correlation coefficient (ICC) of 0.90 (0.77-0.96) and 0.99 (0.98-1.0), respectively. Regarding inter-observer validity, the ICCs for the first and second measurements were 0.90 (range, 0.77-0.96) and 0.95 (range, 0.85-0.98), respectively. Conclusion CT volumetry is a reliable and reproducible method for measurement of transplanted pancreatic volume.

  10. We will be champions: Leaders' confidence in 'us' inspires team members' team confidence and performance.

    Science.gov (United States)

    Fransen, K; Steffens, N K; Haslam, S A; Vanbeselaere, N; Vande Broek, G; Boen, F

    2016-12-01

    The present research examines the impact of leaders' confidence in their team on the team confidence and performance of their teammates. In an experiment involving newly assembled soccer teams, we manipulated the team confidence expressed by the team leader (high vs neutral vs low) and assessed team members' responses and performance as they unfolded during a competition (i.e., in a first baseline session and a second test session). Our findings pointed to team confidence contagion such that when the leader had expressed high (rather than neutral or low) team confidence, team members perceived their team to be more efficacious and were more confident in the team's ability to win. Moreover, leaders' team confidence affected individual and team performance such that teams led by a highly confident leader performed better than those led by a less confident leader. Finally, the results supported a hypothesized mediational model in showing that the effect of leaders' confidence on team members' team confidence and performance was mediated by the leader's perceived identity leadership and members' team identification. In conclusion, the findings of this experiment suggest that leaders' team confidence can enhance members' team confidence and performance by fostering members' identification with the team. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  11. Reliability of three-dimensional sonographic measurements in early pregnancy using virtual reality

    NARCIS (Netherlands)

    C.M. Verwoerd-Dikkeboom (Christine); A.H.J. Koning (Anton); W.C.J. Hop (Wim); M. Rousian (Melek); P.J. van der Spek (Peter); N. Exalto (Niek); R.P.M. Steegers-Theunissen (Régine)

    2008-01-01

    textabstractObjective: To establish the reliability of three-dimensional (3D) ultrasound measurements in early pregnancy using a virtual reality system (the Barco I-Space). Methods: The study included 28 pregnancies with gestational ages ranging from 6 to 14 (median, 10) weeks. 3D volumes were

  12. Reliability-Based Marginal Cost Pricing Problem Case with Both Demand Uncertainty and Travelers’ Perception Errors

    Directory of Open Access Journals (Sweden)

    Shaopeng Zhong

    2013-01-01

    Full Text Available Focusing on the first-best marginal cost pricing (MCP in a stochastic network with both travel demand uncertainty and stochastic perception errors within the travelers’ route choice decision processes, this paper develops a perceived risk-based stochastic network marginal cost pricing (PRSN-MCP model. Numerical examples based on an integrated method combining the moment analysis approach, the fitting distribution method, and the reliability measures are also provided to demonstrate the importance and properties of the proposed model. The main finding is that ignoring the effect of travel time reliability and travelers’ perception errors may significantly reduce the performance of the first-best MCP tolls, especially under high travelers’ confidence and network congestion levels. The analysis result could also enhance our understanding of (1 the effect of stochastic perception error (SPE on the perceived travel time distribution and the components of road toll; (2 the effect of road toll on the actual travel time distribution and its reliability measures; (3 the effect of road toll on the total network travel time distribution and its statistics; and (4 the effect of travel demand level and the value of reliability (VoR level on the components of road toll.

  13. Accuracy and reliability of three-dimensional surface reconstruction measurement

    International Nuclear Information System (INIS)

    Mizukami, Chikashi; Yamamoto, Etsuo; Ohmura, Masaki; Oiki, Hiroyuki; Tsuji, Jun; Muneta, Yuki; Tanabe, Makito; Hakuba, Nobuhiro; Azemoto, Syougo.

    1993-01-01

    We are using a new three-dimensional (3-D) surface reconstruction system to measure the temporal bones. This system offers the advantage of observation of the external aperture of the vestibular aqueduct and the porus acusticus internus in living subjects. However, its accuracy has not been confirmed. To investigate the accuracy of this new system, we measured the length of an in situ ceramic ossicular replacement prosthesis (CORP) of known length of 6.0 mm using 3-D surface reconstruction, conventional plain X-ray and polytomography. The CORP was scanned in the axial, sagittal and oblique directions. The mean measured length obtained with the 3-D surface reconstruction images was 5.94±0.21 on vertical scans, 5.91±0.27 on horizontal scans, and 6.01±0.25 on oblique scans. There were no significant differences among the measured lengths obtained in the three directions. Therefore, this 3-D surface reconstruction measurement system is considered to be reliable. Conversely, the mean measured length obtained by plain X-ray was 7.98±0.20, and by polytomography it was 7.94±0.23. These conventional methods have the inherent disadvantage of magnification of size which consequently requires correction. (author)

  14. Reliability of ultrasonographic measurements in suspected patients of developmental dysplasia of the hip and correlation with the acetabular index

    Directory of Open Access Journals (Sweden)

    Cem Copuroglu

    2011-01-01

    Full Text Available Background: Ultrasonography is accepted as a useful imaging modality in the early detection of developmental dysplasia of the hip (DDH. Early detection and early treatment of DDH prevents hip dislocation and related physical, social, economic, and psychological problems. The purpose of this study was to evaluate the reliability of ultrasonographic and roentgenographic measurements measured by seven different observers. Materials and Methods: The alpha angles of 66 hips in 33 patients were measured using the Graf method by seven different observers. Acetabular index degrees on plane roentgenograms were measured in order to assess the correlation between the ultrasonographic alpha angle and the radiographic acetabular index, which both show the bony acetabular depth, retrospectively. Results: The interclass correlation coefficient, measuring the interobserver reliability, was high and statistically significant for the ultrasonographic measurements. There was a negative correlation between the alpha angle and the acetabular index. Conclusions: Ultrasonography, when applied properly, is a reliable technique between different observers, in the diagnosis and follow up of DDH. When assessed concomitantly with the roentgenographic measurements, the results are reliable and statistically meaningful.

  15. A scale for consumer confidence in the safety of food

    NARCIS (Netherlands)

    Jonge, de J.; Trijp, van J.C.M.; Lans, van der I.A.; Renes, R.J.; Frewer, L.J.

    2008-01-01

    The aim of this study was to develop and validate a scale to measure general consumer confidence in the safety of food. Results from exploratory and confirmatory analyses indicate that general consumer confidence in the safety of food consists of two distinct dimensions, optimism and pessimism,

  16. A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace.

    Science.gov (United States)

    Seguí, María del Mar; Cabrero-García, Julio; Crespo, Ana; Verdú, José; Ronda, Elena

    2015-06-01

    To design and validate a questionnaire to measure visual symptoms related to exposure to computers in the workplace. Our computer vision syndrome questionnaire (CVS-Q) was based on a literature review and validated through discussion with experts and performance of a pretest, pilot test, and retest. Content validity was evaluated by occupational health, optometry, and ophthalmology experts. Rasch analysis was used in the psychometric evaluation of the questionnaire. Criterion validity was determined by calculating the sensitivity and specificity, receiver operator characteristic curve, and cutoff point. Test-retest repeatability was tested using the intraclass correlation coefficient (ICC) and concordance by Cohen's kappa (κ). The CVS-Q was developed with wide consensus among experts and was well accepted by the target group. It assesses the frequency and intensity of 16 symptoms using a single rating scale (symptom severity) that fits the Rasch rating scale model well. The questionnaire has sensitivity and specificity over 70% and achieved good test-retest repeatability both for the scores obtained [ICC = 0.802; 95% confidence interval (CI): 0.673, 0.884] and CVS classification (κ = 0.612; 95% CI: 0.384, 0.839). The CVS-Q has acceptable psychometric properties, making it a valid and reliable tool to control the visual health of computer workers, and can potentially be used in clinical trials and outcome research. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Raising Confident Kids

    Science.gov (United States)

    ... First Aid & Safety Doctors & Hospitals Videos Recipes for Kids Kids site Sitio para niños How the Body ... Videos for Educators Search English Español Raising Confident Kids KidsHealth / For Parents / Raising Confident Kids What's in ...

  18. Intrarater reliability of hand held dynamometry in measuring lower extremity isometric strength using a portable stabilization device.

    Science.gov (United States)

    Jackson, Steven M; Cheng, M Samuel; Smith, A Russell; Kolber, Morey J

    2017-02-01

    Hand held dynamometry (HHD) is a more objective way to quantify muscle force production (MP) compared to traditional manual muscle testing. HHD reliability can be negatively impacted by both the strength of the tester and the subject particularly in the lower extremities due to larger muscle groups. The primary aim of this investigation was to assess intrarater reliability of HHD with use of a portable stabilization device for lower extremity MP in an athletic population. Isometric lower extremity strength was measured for bilateral lower extremities including hip abductors, external rotators, adductors, knee extensors, and ankle plantar flexors was measured in a sample of healthy recreational runners (8 male, 7 females, = 30 limbs) training for a marathon. These measurements were assessed using an intrasession intrarater reliability design. Intraclass correlation coefficients (ICC) were calculated using 3,1 model based on the single rater design. The standard error of measurement (SEM) for each muscle group was also calculated. ICC were excellent ranging from ICC (3,1) = 0.93-0.98 with standard error of measurements ranging from 0.58 to 17.2 N. This study establishes the use of a HHD with a portable stabilization device as demonstrating good reliability within testers for measuring lower extremity muscle performance in an active healthy population. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Hypsarrhythmia assessment exhibits poor interrater reliability: a threat to clinical trial validity.

    Science.gov (United States)

    Hussain, Shaun A; Kwong, Grace; Millichap, John J; Mytinger, John R; Ryan, Nicole; Matsumoto, Joyce H; Wu, Joyce Y; Lerner, Jason T; Sankar, Raman

    2015-01-01

    Hypsarrhythmia is the classic interictal electroencephalographic pattern associated with infantile spasms, and characterized by high voltage, disorganization, and multifocal independent epileptiform discharges. Given this seemingly simple definition, one might expect excellent interrater reliability (IRR) in the identification of this pattern. Alternatively, it may be argued that assessments of voltage and disorganization are fairly subjective, and thus quite challenging in borderline cases. We sought to test the IRR of hypsarrhythmia assessment in a systematic fashion. Six blinded pediatric electroencephalographers from four centers reviewed 22 electroencephalography (EEG) samples from patients with infantile spasms. Each sample was 5 min in duration and included only wakefulness. Raters determined if each EEG was abnormal and if hypsarrhythmia was present/absent, and characterized relevant features: voltage, organization, epileptiform discharges, slowing, interictal attenuations, symmetry, and synchrony. In addition, raters indicated their level of confidence for each assessment. Multirater kappa statistics (κ) were calculated for the assessment of hypsarrhythmia and each feature. Although IRR was favorable in determining whether a study was normal or abnormal (κ=0.89), reliability was unfavorable for assessment of hypsarrhythmia (κ=0.40), modified hypsarrhythmia (κ=0.47), high voltage (κ=0.37), disorganization (κ=0.22), multifocal epileptiform discharges (κ=0.68), interictal voltage attenuations (κ=0.21), slowing (κ=0.20), asymmetry (κ=0.26), and asynchrony (κ=0.08). Despite generally unsatisfactory interrater agreement, raters consistently reported high confidence in assessments. This study contradicts the view that hypsarrhythmia assessment is straightforward. Even small variability in the identification of hypsarrhythmia has potentially deleterious consequences for clinical care, as its presence or absence impacts decisions to pursue high-risk and

  20. The reliability of the Extra Load Index as a measure of relative load carriage economy.

    Science.gov (United States)

    Hudson, Sean; Cooke, Carlton; Lloyd, Ray

    2017-09-01

    The aim of this study was to measure the reliability of the extra load index (ELI) as a method for assessing relative load carriage economy. Seventeen volunteers (12 males, 5 females) performed walking trials at 3 km·h -1 , 6 km·h -1 and a self-selected speed. Trial conditions were repeated 7 days later to assess test-retest reliability. Trials involved four 4-minute periods of walking, each separated by 5 min of rest. The initial stage was performed unloaded followed in a randomised order by a second unloaded period and walking with backpacks of 7 and 20 kg. Results show ELI values did not differ significantly between trials for any of the speeds (p = 0.46) with either of the additional loads (p = 0.297). The systematic bias, limits of agreement and coefficients of variation were small in all trial conditions. We conclude the ELI appears to be a reliable measure of relative load carriage economy. Practitioner Summary: This paper demonstrates that the ELI is a reliable measure of load carriage economy at a range of walking speeds with both a light and heavy load. The ELI, therefore, represents a useful tool for comparing the relative economy associated with different load carriage systems.

  1. Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

    Directory of Open Access Journals (Sweden)

    Formiga MF

    2018-05-01

    Full Text Available Magno F Formiga,1,2 Kathryn E Roach,1 Isabel Vital,3 Gisel Urdaneta,3 Kira Balestrini,3 Rafael A Calderon-Candelario,3,4 Michael A Campos,3,4,* Lawrence P Cahalin1,* 1Department of Physical Therapy, University of Miami Miller School of Medicine, Coral Gables, FL, USA; 2CAPES Foundation, Ministry of Education of Brazil, Brasilia, Brazil; 3Pulmonary Section, Miami Veterans Administration Medical Center, Miami, FL, USA; 4Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, University of Miami Miller School of Medicine, Miami, FL, USA *These authors contributed equally to this work Purpose: The Test of Incremental Respiratory Endurance (TIRE provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP over time. The integration of MIP over inspiratory duration (ID provides the sustained maximal inspiratory pressure (SMIP. Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods: Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results: All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP

  2. Validity and intra-rater reliability of an android phone application to measure cervical range-of-motion.

    Science.gov (United States)

    Quek, June; Brauer, Sandra G; Treleaven, Julia; Pua, Yong-Hao; Mentiplay, Benjamin; Clark, Ross Allan

    2014-04-17

    Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1-7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant's head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone.The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine.

  3. Vaccination Confidence and Parental Refusal/Delay of Early Childhood Vaccines.

    Directory of Open Access Journals (Sweden)

    Melissa B Gilkey

    Full Text Available To support efforts to address parental hesitancy towards early childhood vaccination, we sought to validate the Vaccination Confidence Scale using data from a large, population-based sample of U.S. parents.We used weighted data from 9,354 parents who completed the 2011 National Immunization Survey. Parents reported on the immunization history of a 19- to 35-month-old child in their households. Healthcare providers then verified children's vaccination status for vaccines including measles, mumps, and rubella (MMR, varicella, and seasonal flu. We used separate multivariable logistic regression models to assess associations between parents' mean scores on the 8-item Vaccination Confidence Scale and vaccine refusal, vaccine delay, and vaccination status.A substantial minority of parents reported a history of vaccine refusal (15% or delay (27%. Vaccination confidence was negatively associated with refusal of any vaccine (odds ratio [OR] = 0.58, 95% confidence interval [CI], 0.54-0.63 as well as refusal of MMR, varicella, and flu vaccines specifically. Negative associations between vaccination confidence and measures of vaccine delay were more moderate, including delay of any vaccine (OR = 0.81, 95% CI, 0.76-0.86. Vaccination confidence was positively associated with having received vaccines, including MMR (OR = 1.53, 95% CI, 1.40-1.68, varicella (OR = 1.54, 95% CI, 1.42-1.66, and flu vaccines (OR = 1.32, 95% CI, 1.23-1.42.Vaccination confidence was consistently associated with early childhood vaccination behavior across multiple vaccine types. Our findings support expanding the application of the Vaccination Confidence Scale to measure vaccination beliefs among parents of young children.

  4. On the Reliability of Implicit and Explicit Memory Measures.

    Science.gov (United States)

    Buchner, Axel; Wippich, Werner

    2000-01-01

    Studied the reliability of implicit and explicit memory tests in experiments involving these tests. Results with 168, 84, 120, and 128 undergraduates show that methodological artifacts may cause implicit memory tests to have lower reliability than explicit memory tests, but that implicit tests need not necessarily be less reliable. (SLD)

  5. Reliability of the Danish version of the McGill ingestive skills assessment for observation-based measures during meals

    DEFF Research Database (Denmark)

    Hansen, Tina; Lambert, Heather C; Faber, Jens

    2012-01-01

    To establish measurement equivalence in terms of reliability of the Danish version of the Canadian McGill ingestive skills assessment (MISA) for use by occupational therapists.......To establish measurement equivalence in terms of reliability of the Danish version of the Canadian McGill ingestive skills assessment (MISA) for use by occupational therapists....

  6. Test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy.

    Science.gov (United States)

    Savva, Christos; Giakas, Giannis; Efstathiou, Michalis; Karagiannis, Christos

    2014-01-01

    The purpose of this study was to evaluate the test-retest reliability of handgrip strength measurement using a hydraulic hand dynamometer in patients with cervical radiculopathy (CR). A convenience sample of 19 participants (14 men and 5 women; mean ± SD age, 50.5 ± 12 years) with CR was measured using a Jamar hydraulic hand dynamometer by the same rater on 2 different testing sessions with an interval of 7 days between sessions. Data collection procedures followed standardized grip strength testing guidelines established by the American Society of Hand Therapists. During the repeated measures, patients were advised to rest their upper limb in the standardized arm position and encouraged to exert 3 maximum gripping efforts. The mean value of the 3 efforts (measured in kilogram force [Kgf]) was used for data analysis. The intraclass correlation coefficient, SEM, and the Bland-Altman plot were used to estimate test-retest reliability and measurement precision. Grip strength measurement in CR demonstrated an intraclass correlation coefficient of 0.976, suggesting excellent test-retest reliability. The small SEM in both testing sessions (SEM1, 2.41 Kgf; SEM2, 2.51 Kgf) as well as the narrow width of the 95% limits of agreements (95% limits of agreement, -4.9 to 4.4 Kgf) in the Bland-Altman plot reflected precise measurements of grip strength in both occasions. Excellent test-retest reliability for grip strength measurement was measured in patients with CR, demonstrating that a hydraulic hand dynamometer could be used as an outcome measure for these patients. Copyright © 2014 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.

  7. Accuracy and reliability of facial soft tissue depth measurements using cone beam computer tomography

    NARCIS (Netherlands)

    Fourie, Zacharias; Damstra, Janalt; Gerrits, Pieter; Ren, Yijin

    2010-01-01

    It is important to have accurate and reliable measurements of soft tissue thickness for specific landmarks of the face and scalp when producing a facial reconstruction. In the past several methods have been created to measure facial soft tissue thickness (FSTT) in cadavers and in the living. The

  8. Reliability analysis of a phaser measurement unit using a generalized fuzzy lambda-tau(GFLT) technique.

    Science.gov (United States)

    Komal

    2018-05-01

    Nowadays power consumption is increasing day-by-day. To fulfill failure free power requirement, planning and implementation of an effective and reliable power management system is essential. Phasor measurement unit(PMU) is one of the key device in wide area measurement and control systems. The reliable performance of PMU assures failure free power supply for any power system. So, the purpose of the present study is to analyse the reliability of a PMU used for controllability and observability of power systems utilizing available uncertain data. In this paper, a generalized fuzzy lambda-tau (GFLT) technique has been proposed for this purpose. In GFLT, system components' uncertain failure and repair rates are fuzzified using fuzzy numbers having different shapes such as triangular, normal, cauchy, sharp gamma and trapezoidal. To select a suitable fuzzy number for quantifying data uncertainty, system experts' opinion have been considered. The GFLT technique applies fault tree, lambda-tau method, fuzzified data using different membership functions, alpha-cut based fuzzy arithmetic operations to compute some important reliability indices. Furthermore, in this study ranking of critical components of the system using RAM-Index and sensitivity analysis have also been performed. The developed technique may be helpful to improve system performance significantly and can be applied to analyse fuzzy reliability of other engineering systems. Copyright © 2018 ISA. Published by Elsevier Ltd. All rights reserved.

  9. Reliability and reproducibility of disc-foveal angle measurements by non-mydriatic fundus photography.

    Science.gov (United States)

    Le Jeune, Caroline; Chebli, Fayçal; Leon, Lorette; Anthoine, Emmanuelle; Weber, Michel; Péchereau, Alain; Lebranchu, Pierre

    2018-01-01

    Abnormal torsion could be associated with cyclovertical strabismus, but torsion measurements are not reliable in children. To assess an objective fundus torsion evaluation in a paediatric population, we used Non-Mydriatic Fundus photography (NMFP) in healthy and cyclovertical strabismus patients to evaluate the disc-foveal angle over time and observers. We used a retrospective set of NMFP including 24 A or V-pattern strabismus and 27 age-matched normal children (mean age 6.4 and 6.7 years respectively), taken during 2 distinct follow-up consultations (separated by 251 and 479 days respectively). Each disc-foveal angle measurement (from which the ocular torsion can be assessed) was performed by 5 different observers, using graphical software and based on reproducible fundus anatomical marks. Statistical analysis was performed with a multivariate ANOVA using group, time and observers as factors, in addition to intraclass coefficient correlation (ICC) to assess measurement reproducibility. A significant difference of disc-foveal angle measures was observed between groups (p0,97). Abnormal amount of objective torsion could be associated with alphabet-pattern strabismus. Disc-foveal angle evaluation by NMFP in a children population appears as a non-invasive, reliable and reproducible method.

  10. Inter-rater reliability of measures to characterize the tobacco retail environment in Mexico

    Directory of Open Access Journals (Sweden)

    Marissa G Hall

    2015-11-01

    Full Text Available Objective. To evaluate the inter-rater reliability of a data collection instrument to assess the tobacco retail environ- ment in Mexico, after major marketing regulations were implemented. Materials and methods. In 2013, two data collectors independently evaluated 21 stores in two census tracts, through a data collection instrument that assessed the presence of price promotions, whether single cigarettes were sold, the number of visible advertisements, the pre- sence of signage prohibiting the sale of cigarettes to minors, and characteristics of cigarette pack displays. We evaluated the inter-rater reliability of the collected data, through the calculation of metrics such as intraclass correlation coefficient, percent agreement, Cohen’s kappa and Krippendorff’s alpha. Results. Most measures demonstrated substantial or perfect inter-rater reliability. Conclusions. Our results indicate the potential utility of the data collection instrument for future point-of-sale research.

  11. Reliability and Validity of a New Method for Isometric Back Extensor Strength Evaluation Using A Hand-Held Dynamometer.

    Science.gov (United States)

    Park, Hee-Won; Baek, Sora; Kim, Hong Young; Park, Jung-Gyoo; Kang, Eun Kyoung

    2017-10-01

    To investigate the reliability and validity of a new method for isometric back extensor strength measurement using a portable dynamometer. A chair equipped with a small portable dynamometer was designed (Power Track II Commander Muscle Tester). A total of 15 men (mean age, 34.8±7.5 years) and 15 women (mean age, 33.1±5.5 years) with no current back problems or previous history of back surgery were recruited. Subjects were asked to push the back of the chair while seated, and their isometric back extensor strength was measured by the portable dynamometer. Test-retest reliability was assessed with intraclass correlation coefficient (ICC). For the validity assessment, isometric back extensor strength of all subjects was measured by a widely used physical performance evaluation instrument, BTE PrimusRS system. The limit of agreement (LoA) from the Bland-Altman plot was evaluated between two methods. The test-retest reliability was excellent (ICC=0.82; 95% confidence interval, 0.65-0.91). The Bland-Altman plots demonstrated acceptable agreement between the two methods: the lower 95% LoA was -63.1 N and the upper 95% LoA was 61.1 N. This study shows that isometric back extensor strength measurement using a portable dynamometer has good reliability and validity.

  12. The navicular position test - a reliable measure of the navicular bone position during rest and loading

    DEFF Research Database (Denmark)

    Spörndly-Nees, Søren; Dåsberg, Brian; Nielsen, Rasmus Oestergaard

    2011-01-01

    .08 degrees, ICC = 0.91. Discussion: The present data support The Navicular Position Test as a reliable test of the navicular bone position during rest and loading measured in a simple test set-up. Conclusion: The Navicular Position Test was shown to have a high intraday-, intra- and inter-tester reliability...

  13. The Turkish version of the Physical Activity Scale for the Elderly (PASE): its cultural adaptation, validation, and reliability.

    Science.gov (United States)

    Ayvat, Ender; Kilinç, Muhammed; Kirdi, Nuray

    2017-06-12

    This study aimed to describe the cultural adaptation of the Turkish Physical Activity Scale for the Elderly (PASE) and to examine the reliability and validity of the scale in older Turkish adults. Eighty elderly people were recruited for the study. The assessments included the PASE, the International Physical Activity Questionnaire (IPAQ), the Short Physical Performance Battery and Short Form-36 Quality of Life Questionnaire (SF-36), and the Mini Mental State Test. Outcome measures were conducted twice within a week (test-retest) for reliability. Cronbach's α coefficient was 0.714 for the initial evaluation. The intraclass correlation coefficient for the test-retest reliability was 0.995 with a 95% confidence interval of 0.993-0.997. A high level of positive correlation (0.742, P reliable and valid scale for the fields of research and practice.

  14. Life-Space Assessment questionnaire: Novel measurement properties for Brazilian community-dwelling older adults.

    Science.gov (United States)

    Simões, Maria do Socorro Mp; Garcia, Isabel Ff; Costa, Lucíola da Cm; Lunardi, Adriana C

    2018-05-01

    The Life-Space Assessment (LSA) assesses mobility from the spaces that older adults go, and how often and how independent they move. Despite its increased use, LSA measurement properties remain unclear. The aim of the present study was to analyze the content validity, reliability, construct validity and interpretability of the LSA for Brazilian community-dwelling older adults. In this clinimetric study we analyzed the measurement properties (content validity, reliability, construct validity and interpretability) of the LSA administered to 80 Brazilian community-dwelling older adults. Reliability was analyzed by Cronbach's alpha (internal consistency), intraclass correlation coefficients and 95% confidence interval (reproducibility), and standard error of measurement (measurement error). Construct validity was analyzed by Pearson's correlations between the LSA and accelerometry (time in inactivity and moderate-to-vigorous activities), and interpretability was analyzed by determination of the minimal detectable change, and floor and ceiling effects. The LSA met the criteria for content validity. The Cronbach's alpha was 0.92, intraclass correlation coefficient was 0.97 (95% confidence interval 0.95-0.98) and standard error of measurement was 4.12. The LSA showed convergence with accelerometry (negative correlation with time in inactivity and positive correlation with time in moderate to vigorous activities), the minimal detectable change was 0.36 and we observed no floor or ceiling effects. The LSA showed adequate reliability, validity and interpretability for life-space mobility assessment of Brazilian community-dwelling older adults. Geriatr Gerontol Int 2018; 18: 783-789. © 2018 Japan Geriatrics Society.

  15. The Development of the Redox Concept Inventory as a Measure of Students' Symbolic and Particulate Redox Understandings and Confidence

    Science.gov (United States)

    Brandriet, Alexandra R.; Bretz, Stacey Lowery

    2014-01-01

    This article describes the development of the Redox Concept Inventory (ROXCI) as a measure of students' understandings and confidence of both the symbolic and particulate domains of oxidation-reduction (redox) reactions. The ROXCI was created using a mixed-methods design in which the items were developed based upon themes that emerged from…

  16. Reliability of Ultrasonographic Measurement of Cervical Multifidus Muscle Dimensions during Isometric Contraction of Neck Muscles

    Directory of Open Access Journals (Sweden)

    Somayeh Amiri Arimi

    2012-07-01

    Full Text Available Background and Aim: Cervical multifidus is considered as one of the most important neck stabilizers. Weakness and muscular atrophy of this muscle were seen in patients with chronic neck pain. Ultrasonographic imaging is a non-invasive and feasible technique that commonly used to record such changes and measure muscle dimensions. Therefore, the aim of this study was to evaluate the reliability of ultrasonographic measurement of cervical multifidus muscle’s dimensions during isometric contraction of neck muscles. Materials and Method: Ten subjects (5 patients with chronic neck pain and 5 healthy subjects were recruited in this study. Cervical multifidus muscle’s dimensions were measured at the level of forth cervical vertebrae. Ultrasonographic measurement of cervical multifidus muscle at rest, 50% and 100% of maximal voluntary contraction (MVC were performed by one examiner within 1 week interval. The dimensions of cervical multifidus muscle including cross-sectional area (CSA, anterior posterior dimension (APD, and lateral dimension (LD were measured. Intraclass correlation coefficients (ICC, standard error of measurement (SEM and minimal detectable change (MDC were computed for data analysis.Results: The between days reliability of maximum strength of neck muscles and multifidus muscle dimensions at rest, 50% and 100% of MVC of neck muscles were good to excellent (ICC=0.75-0.99.Conclusion: The results of this study showed that ultrasonographic measuring of cervical multifidus muscle’s dimensions during isometric contraction of neck muscles at the level of C4 in females with chronic neck pain and healthy subjects is a reliable and repeatable method.

  17. Generalizability Theory Reliability of Written Expression Curriculum-Based Measurement in Universal Screening

    Science.gov (United States)

    Keller-Margulis, Milena A.; Mercer, Sterett H.; Thomas, Erin L.

    2016-01-01

    The purpose of this study was to examine the reliability of written expression curriculum-based measurement (WE-CBM) in the context of universal screening from a generalizability theory framework. Students in second through fifth grade (n = 145) participated in the study. The sample included 54% female students, 49% White students, 23% African…

  18. Establishing Reliability and Construct Validity for an Instrument to Measure Environmental Connectedness

    Science.gov (United States)

    Beery, Thomas H.

    2013-01-01

    The purpose of this preliminary study is to establish a reliable and valid measure of environmental connectedness (EC) to allow for further exploration of the Swedish Outdoor Recreation in Change national survey data. The Nordic concept of friluftsliv (nature-based outdoor recreation) and the environmental psychology concept of EC are explored to…

  19. Accuracy and reliability of linear cephalometric measurements from cone-beam computed tomography scans of a dry human skull.

    Science.gov (United States)

    Berco, Mauricio; Rigali, Paul H; Miner, R Matthew; DeLuca, Stephelynn; Anderson, Nina K; Will, Leslie A

    2009-07-01

    The purpose of this study was to determine the accuracy and reliability of 3-dimensional craniofacial measurements obtained from cone-beam computed tomography (CBCT) scans of a dry human skull. Seventeen landmarks were identified on the skull. CBCT scans were then obtained, with 2 skull orientations during scanning. Twenty-nine interlandmark linear measurements were made directly on the skull and compared with the same measurements made on the CBCT scans. All measurements were made by 2 operators on 4 separate occasions. The method errors were 0.19, 0.21, and 0.19 mm in the x-, y- and z-axes, respectively. Repeated measures analysis of variance (ANOVA) showed no significant intraoperator or interoperator differences. The mean measurement error was -0.01 mm (SD, 0.129 mm). Five measurement errors were found to be statistically significantly different; however, all measurement errors were below the known voxel size and clinically insignificant. No differences were found in the measurements from the 2 CBCT scan orientations of the skull. CBCT allows for clinically accurate and reliable 3-dimensional linear measurements of the craniofacial complex. Moreover, skull orientation during CBCT scanning does not affect the accuracy or the reliability of these measurements.

  20. Intra-observer and interobserver reliability ofOne Leg Stand Test as a measure of posturalbalance in low back pain patients

    DEFF Research Database (Denmark)

    Maribo, Thomas; Iversen, Elena; Andersen, Niels Trolle

    2009-01-01

    Objective: To determine the absolute and relative reliability of intra-observer and interobserver To determine the absolute and relative reliability of intra-observer and interobserver measurements of postural balance using the One Leg Stand Test in patients with low back pain. Patients and methods...... to stand for the maximum time, and no further analysis was done. Eyes closed: intra-observer reliability was tested in 21 patients; absolute reliability showed a standard error of the measurement (SEM) of 2.48 s and a minimal detectable change (MDC) of 6.88. The relative reliability was acceptable...... with an intra class correlation coefficient (ICC) of 0.86. Interobserver reliability was tested in 27 patients; absolute reliability showed a SEM of 1.42 s and a MDC of 3.95. The relative reliability was acceptable with an ICC of 0.91. Conclusions: The One Leg Stand Test can be used to test postural balance...

  1. Pattern of alveolar bone loss and reliability of measurements with the radiographic technique

    International Nuclear Information System (INIS)

    Rise, J.; Albandar, J.M.

    1988-01-01

    The purposes of this paper were to study the pattern of bone loss among different teeth at the individual level and to study the effect of using different aggregated units of analysis on measurement error. Bone loss was assessed in standardized periapical radiographs from 293 subjects (18-68 years), and the mean bone loss score for each tooth type was calculated. These were then correlated by means of factor analysis to study the bone loss pattern. Reliability (measurement error) was studied by the internal consistency and the test-retest methods. The pattern of bone loss showed a unidimensional pattern, indicating that any tooth will work equally well as a dependent variable for epidemiologic descriptive purposes. However, a more thorough analysis also showed a multidimensional pattern in terms of four dimensions, which correspond to four tooth groups: incisors, upper premolars, lower premolars and molars. The four dimensions accounted for 80% of the toal variance. The multidimensional pattern may be important for the modeling of bone loss; thus different models may explain the four dimension (indices) used as dependent variables. The reliability (internal consistency) of the four indices was satisfactory. By the test-retest method, reliability was higher when the more aggregated unit (the individual) was used

  2. Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

    Science.gov (United States)

    Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

    2018-01-01

    The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.

  3. Diverse interpretations of confidence building

    International Nuclear Information System (INIS)

    Macintosh, J.

    1998-01-01

    This paper explores the variety of operational understandings associated with the term 'confidence building'. Collectively, these understandings constitute what should be thought of as a 'family' of confidence building approaches. This unacknowledged and generally unappreciated proliferation of operational understandings that function under the rubric of confidence building appears to be an impediment to effective policy. The paper's objective is to analyze these different understandings, stressing the important differences in their underlying assumptions. In the process, the paper underlines the need for the international community to clarify its collective thinking about what it means when it speaks of 'confidence building'. Without enhanced clarity, it will be unnecessarily difficult to employ the confidence building approach effectively due to the lack of consistent objectives and common operating assumptions. Although it is not the intention of this paper to promote a particular account of confidence building, dissecting existing operational understandings should help to identify whether there are fundamental elements that define what might be termed 'authentic' confidence building. Implicit here is the view that some operational understandings of confidence building may diverge too far from consensus models to count as meaningful members of the confidence building family. (author)

  4. Chinese Management Research Needs Self-Confidence but not Over-confidence

    DEFF Research Database (Denmark)

    Li, Xin; Ma, Li

    2018-01-01

    Chinese management research aims to contribute to global management knowledge by offering rigorous and innovative theories and practical recommendations both for managing in China and outside. However, two seemingly opposite directions that researchers are taking could prove detrimental......-confidence, limiting theoretical innovation and practical relevance. Yet going in the other direction of overly indigenous research reflects over-confidence, often isolating the Chinese management research from the mainstream academia and at times, even becoming anti-science. A more integrated approach of conducting...... to the healthy development of Chinese management research. We argue that the two directions share a common ground that lies in the mindset regarding the confidence in the work on and from China. One direction of simply following the American mainstream on academic rigor demonstrates a lack of self...

  5. Reliability and validity of the Performance Recorder 1 for measuring isometric knee flexor and extensor strength.

    Science.gov (United States)

    Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L

    2013-11-01

    Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n  =  15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC  =  0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC  =  0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC  =  0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC  =  0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC  =  0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.

  6. Sensitivity, reliability and the effects of diurnal variation on a test battery of field usable upper limb fatigue measures.

    Science.gov (United States)

    Yung, Marcus; Wells, Richard P

    2017-07-01

    Fatigue has been linked to deficits in production quality and productivity and, if of long duration, work-related musculoskeletal disorders. It may thus be a useful risk indicator and design and evaluation tool. However, there is limited information on the test-retest reliability, the sensitivity and the effects of diurnal fluctuation on field usable fatigue measures. This study reports on an evaluation of 11 measurement tools and their 14 parameters. Eight measures were found to have test-retest ICC values greater than 0.8. Four measures were particularly responsive during an intermittent fatiguing condition. However, two responsive measures demonstrated rhythmic behaviour, with significant time effects from 08:00 to mid-afternoon and early evening. Action tremor, muscle mechanomyography and perceived fatigue were found to be most reliable and most responsive; but additional analytical considerations might be required when interpreting daylong responses of MMG and action tremor. Practitioner Summary: This paper presents findings from test-retest and daylong reliability and responsiveness evaluations of 11 fatigue measures. This paper suggests that action tremor, muscle mechanomyography and perceived fatigue were most reliable and most responsive. However, mechanomyography and action tremor may be susceptible to diurnal changes.

  7. Application of nonparametric statistics to material strength/reliability assessment

    International Nuclear Information System (INIS)

    Arai, Taketoshi

    1992-01-01

    An advanced material technology requires data base on a wide variety of material behavior which need to be established experimentally. It may often happen that experiments are practically limited in terms of reproducibility or a range of test parameters. Statistical methods can be applied to understanding uncertainties in such a quantitative manner as required from the reliability point of view. Statistical assessment involves determinations of a most probable value and the maximum and/or minimum value as one-sided or two-sided confidence limit. A scatter of test data can be approximated by a theoretical distribution only if the goodness of fit satisfies a test criterion. Alternatively, nonparametric statistics (NPS) or distribution-free statistics can be applied. Mathematical procedures by NPS are well established for dealing with most reliability problems. They handle only order statistics of a sample. Mathematical formulas and some applications to engineering assessments are described. They include confidence limits of median, population coverage of sample, required minimum number of a sample, and confidence limits of fracture probability. These applications demonstrate that a nonparametric statistical estimation is useful in logical decision making in the case a large uncertainty exists. (author)

  8. Reliability of cervical lordosis and global sagittal spinal balance measurements in adolescent idiopathic scoliosis.

    Science.gov (United States)

    Vidal, Christophe; Ilharreborde, Brice; Azoulay, Robin; Sebag, Guy; Mazda, Keyvan

    2013-06-01

    Radiological reproducibility study. To assess intra and interobserver reliability of radiographic measurements for global sagittal balance parameters and sagittal spine curves, including cervical spine. Sagittal spine balance in adolescent idiopathic scoliosis (AIS) is a main issue and many studies have been reported, showing that coronal and sagittal deformities often involve sagittal cervical unbalance. Global sagittal balance aims to obtain a horizontal gaze and gravity line at top of hips when subject is in a static position, involving adjustment of each spine curvature in the sagittal plane. To our knowledge, no study did use a methodologically validated imaging analysis tool able to appreciate sagittal spine contours and distances in AIS and especially in the cervical region. Lateral full-spine low-dose EOS radiographs were performed in 75 patients divided in three groups (control subjects, AIS, operated AIS). Three observers digitally analyzed twice each radiograph and 11 sagittal measures were collected for each image. Reliability was assessed calculating intraobserver Pearson's r correlation coefficient, interobserver intra-class correlation coefficient (ICC) completed with a two-by-two Bland-Altman plot analysis. This measurement method has shown excellent intra and interobserver reliability in all parameters, sagittal curvatures, pelvic parameters and global sagittal balance. This study validated a simple and efficient tool in AIS sagittal contour analysis. It defined new relevant landmarks allowing to characterize cervical segmental curvatures and cervical involvement in global balance.

  9. Understanding public confidence in government to prevent terrorist attacks.

    Energy Technology Data Exchange (ETDEWEB)

    Baldwin, T. E.; Ramaprasad, A,; Samsa, M. E.; Decision and Information Sciences; Univ. of Illinois at Chicago

    2008-04-02

    A primary goal of terrorism is to instill a sense of fear and vulnerability in a population and to erode its confidence in government and law enforcement agencies to protect citizens against future attacks. In recognition of its importance, the Department of Homeland Security includes public confidence as one of the principal metrics used to assess the consequences of terrorist attacks. Hence, a detailed understanding of the variations in public confidence among individuals, terrorist event types, and as a function of time is critical to developing this metric. In this exploratory study, a questionnaire was designed, tested, and administered to small groups of individuals to measure public confidence in the ability of federal, state, and local governments and their public safety agencies to prevent acts of terrorism. Data was collected from three groups before and after they watched mock television news broadcasts portraying a smallpox attack, a series of suicide bomber attacks, a refinery explosion attack, and cyber intrusions on financial institutions, resulting in identity theft. Our findings are: (a) although the aggregate confidence level is low, there are optimists and pessimists; (b) the subjects are discriminating in interpreting the nature of a terrorist attack, the time horizon, and its impact; (c) confidence recovery after a terrorist event has an incubation period; and (d) the patterns of recovery of confidence of the optimists and the pessimists are different. These findings can affect the strategy and policies to manage public confidence after a terrorist event.

  10. The idiosyncratic nature of confidence.

    Science.gov (United States)

    Navajas, Joaquin; Hindocha, Chandni; Foda, Hebah; Keramati, Mehdi; Latham, Peter E; Bahrami, Bahador

    2017-11-01

    Confidence is the 'feeling of knowing' that accompanies decision making. Bayesian theory proposes that confidence is a function solely of the perceived probability of being correct. Empirical research has suggested, however, that different individuals may perform different computations to estimate confidence from uncertain evidence. To test this hypothesis, we collected confidence reports in a task where subjects made categorical decisions about the mean of a sequence. We found that for most individuals, confidence did indeed reflect the perceived probability of being correct. However, in approximately half of them, confidence also reflected a different probabilistic quantity: the perceived uncertainty in the estimated variable. We found that the contribution of both quantities was stable over weeks. We also observed that the influence of the perceived probability of being correct was stable across two tasks, one perceptual and one cognitive. Overall, our findings provide a computational interpretation of individual differences in human confidence.

  11. Reliability and validity of the brief multidimensional measure of religiousness/spirituality among adolescents.

    Science.gov (United States)

    Harris, Sion Kim; Sherritt, Lon R; Holder, David W; Kulig, John; Shrier, Lydia A; Knight, John R

    2008-12-01

    Developed for use in health research, the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) consists of brief measures of a broad range of religiousness and spirituality (R/S) dimensions. It has established psychometric properties among adults, but little is known about its appropriateness for use with adolescents. We assessed the psychometric properties of the BMMRS among adolescents. We recruited a racially diverse (85% non-White) sample of 305 adolescents aged 12-18 years (median 16 yrs, IQR 14-17) from 3 urban medical clinics; 93 completed a retest 1 week later. We assessed internal consistency and test-retest reliability. We assessed construct validity by examining how well the measures discriminated groups expected to differ based on self-reported religious preference, and how they related to a hypothesized correlate, depressive symptoms. Religious preference was categorized into "No religion/Atheist" (11%), "Don't know/Confused" (9%), or "Named a religion" (80%). Responses to multi-item measures were generally internally consistent (alpha > or = 0.70 for 12/16 measures) and stable over 1 week (intraclass correlation coefficients > or = 0.70 for 14/16). Forgiveness, Negative R/S Coping, and Commitment items showed lower internal cohesiveness. Scores on most measures were higher (p Atheist" group. Forgiveness, Commitment, and Anticipated Support from members of one's congregation were inversely correlated with depressive symptoms, while BMMRS measures assessing negative R/S experiences (Negative R/S Coping, Negative Interactions with others in congregation, Loss in Faith) were positively correlated with depressive symptoms. These findings suggest that most BMMRS measures are reliable and valid for use among adolescents.

  12. Practical reliability and uncertainty quantification in complex systems : final report.

    Energy Technology Data Exchange (ETDEWEB)

    Grace, Matthew D.; Ringland, James T.; Marzouk, Youssef M. (Massachusetts Institute of Technology, Cambridge, MA); Boggs, Paul T.; Zurn, Rena M.; Diegert, Kathleen V. (Sandia National Laboratories, Albuquerque, NM); Pebay, Philippe Pierre; Red-Horse, John Robert (Sandia National Laboratories, Albuquerque, NM)

    2009-09-01

    The purpose of this project was to investigate the use of Bayesian methods for the estimation of the reliability of complex systems. The goals were to find methods for dealing with continuous data, rather than simple pass/fail data; to avoid assumptions of specific probability distributions, especially Gaussian, or normal, distributions; to compute not only an estimate of the reliability of the system, but also a measure of the confidence in that estimate; to develop procedures to address time-dependent or aging aspects in such systems, and to use these models and results to derive optimal testing strategies. The system is assumed to be a system of systems, i.e., a system with discrete components that are themselves systems. Furthermore, the system is 'engineered' in the sense that each node is designed to do something and that we have a mathematical description of that process. In the time-dependent case, the assumption is that we have a general, nonlinear, time-dependent function describing the process. The major results of the project are described in this report. In summary, we developed a sophisticated mathematical framework based on modern probability theory and Bayesian analysis. This framework encompasses all aspects of epistemic uncertainty and easily incorporates steady-state and time-dependent systems. Based on Markov chain, Monte Carlo methods, we devised a computational strategy for general probability density estimation in the steady-state case. This enabled us to compute a distribution of the reliability from which many questions, including confidence, could be addressed. We then extended this to the time domain and implemented procedures to estimate the reliability over time, including the use of the method to predict the reliability at a future time. Finally, we used certain aspects of Bayesian decision analysis to create a novel method for determining an optimal testing strategy, e.g., we can estimate the 'best' location to

  13. Isometric and isokinetic muscle strength in the upper extremity can be reliably measured in persons with chronic stroke.

    Science.gov (United States)

    Ekstrand, Elisabeth; Lexell, Jan; Brogårdh, Christina

    2015-09-01

    To evaluate the test-retest reliability of isometric and isokinetic muscle strength measurements in the upper extremity after stroke. A test-retest design. Forty-five persons with mild to moderate paresis in the upper extremity > 6 months post-stroke. Isometric arm strength (shoulder abduction, elbow flexion), isokinetic arm strength (elbow extension/flexion) and isometric grip strength were measured with electronic dynamometers. Reliability was evaluated with intra-class correlation coefficients (ICC), changes in the mean, standard error of measurements (SEM) and smallest real differences (SRD). Reliability was high (ICCs: 0.92-0.97). The absolute and relative (%) SEM ranged from 2.7 Nm (5.6%) to 3.0 Nm (9.4%) for isometric arm strength, 2.6 Nm (7.4%) to 2.9 Nm (12.6%) for isokinetic arm strength, and 22.3 N (7.6%) to 26.4 N (9.2%) for grip strength. The absolute and relative (%) SRD ranged from 7.5 Nm (15.5%) to 8.4 Nm (26.1%) for isometric arm strength, 7.1 Nm (20.6%) to 8.0 Nm (34.8%) for isokinetic arm strength, and 61.8 N (21.0%) to 73.3 N (25.6%) for grip strength. Muscle strength in the upper extremity can be reliably measured in persons with chronic stroke. Isometric measurements yield smaller measurement errors than isokinetic measurements and might be preferred, but the choice depends on the research question.

  14. Is radiographic measurement of bony landmarks reliable for lateral meniscal sizing?

    Science.gov (United States)

    Yoon, Jung-Ro; Kim, Taik-Seon; Lim, Hong-Chul; Lim, Hyung-Tae; Yang, Jae-Hyuk

    2011-03-01

    The accuracy of meniscal measurement methods is still in debate. The authors' protocol for radiologic measurements will provide reproducible bony landmarks, and this measurement method of the lateral tibial plateau will correlate with the actual anatomic value. Controlled laboratory study. Twenty-five samples of fresh lateral meniscus with attached proximal tibia were obtained during total knee arthroplasty. Each sample was obtained without damage to the meniscus and bony attachment sites. The inclusion criterion was mild to moderate osteoarthritis in patients with mechanical axis deviation of less than 15°. Knees with lateral compartment osteoarthritic change or injured or degenerated menisci were excluded. For the lateral tibial plateau length measurements, the radiographic beam was angled 10° caudally at neutral rotation, which allowed differentiation of the lateral plateau cortical margins from the medial plateau. The transition points were identified and used for length measurement. The values of length were then compared with the conventional Pollard method and the anatomic values. The width measurement was done according to Pollard's protocol. For each knee, the percentage deviation from the anatomic dimension was recorded. Intraobserver error and interobserver error were calculated. The deviation of the authors' radiographic length measurements from anatomic dimensions was 1.4 ± 1.1 mm. The deviation of Pollard's radiographic length measurements was 4.1 ± 2.0 mm. With respect to accuracy-which represents the frequency of measurements that fall within 10% of measurements-the accuracy of authors' length was 98%, whereas for Pollard's method it was 40%. There was a good correlation between anatomic meniscal dimensions and each radiologic plateau dimensions for lateral meniscal width (R(2) = .790) and the authors' lateral meniscal length (R(2) = .823) and fair correlation for Pollard's lateral meniscal length (R(2) = .660). The reliability of each

  15. Ultrasound evaluation of the abductor hallucis muscle: Reliability study

    Directory of Open Access Journals (Sweden)

    Hing Wayne A

    2008-09-01

    Full Text Available Abstract Background The Abductor hallucis muscle (AbdH plays an integral role during gait and is often affected in pathological foot conditions. The aim of this study was to evaluate the within and between-session intra-tester reliability using diagnostic ultrasound of the dorso-plantar thickness, medio-lateral width and cross-sectional area, of the AbdH in asymptomatic adults. Methods The AbdH muscles of thirty asymptomatic subjects were imaged and then measured using a Philips HD11 Ultrasound machine. Interclass correlation coefficients (ICC with 95% confidence intervals (CI were used to calculate both within and between session intra-tester reliability. Results The within-session reliability results demonstrated for dorso-plantar thickness an ICC of 0.97 (95% CI: 0.99–0.99; medio-lateral width an ICC: of 0.97 (95% CI: 0.92–0.97 and cross-sectional area an ICC of 0.98 (95% CI: 0.98–0.99. Between-session reliability results demonstrated for dorso-plantar thickness an ICC of 0.97 (95% CI: 0.95 to 0.98; medio-lateral width an ICC of 0.94 (95% CI 0.90 to 0.96 and for cross-sectional area an ICC of 0.79 (95% CI 0.65 to 0.88. Conclusion Diagnostic ultrasound has the potential to be a reliable tool for evaluating the AbdH muscle in asymptomatic subjects. Subsequent studies may be conducted to provide a better understanding of the AbdH function in foot and ankle pathologies.

  16. Reliability of Heterochromatic Flicker Photometry in Measuring Macular Pigment Optical Density among Preadolescent Children

    Directory of Open Access Journals (Sweden)

    Sasha M. McCorkle

    2015-10-01

    Full Text Available Macular pigment optical density (MPOD—assessed using customized heterochromatic flicker photometry (cHFP—is related to better cognition and brain lutein among adults. However, the reliability of MPOD assessed by cHFP has not been investigated in children. We assessed inter-session reliability of MPOD using modified cHFP. 7–10-year-olds (n = 66 underwent cHFP over 2 visits using 11 examiners. Reliability was also assessed in a subsample (n = 46 with only 2 examiners. Among all participants, there was no significant difference between the two sessions (p = 0.59—session 1: 0.61 ± 0.28; session 2: 0.62 ± 0.27. There was no significant difference in the MPOD of boys vs. girls (p = 0.56. There was a significant correlation between sessions (Y = 0.52x + 0.31; R2 = 0.29, p ≤ 0.005, with a reliability of 0.70 (Cronbach’s α. Among the subsample with 2 examiners, there was a significant correlation between sessions (Y = 0.54x + 0.31; R2 = 0.32, p < 0.005, with a reliability of 0.72 (Cronbach’s α. In conclusion, there is moderate reliability for modified cHFP to measure MPOD in preadolescents. These findings provide support for future studies aiming to conduct noninvasive assessments of retinal xanthophylls and study their association with cognition during childhood.

  17. Improving the reliability of fishery predictions under climate change

    DEFF Research Database (Denmark)

    Brander, Keith

    2015-01-01

    The increasing number of publications assessing impacts of climate change on marine ecosystems and fisheries attests to rising scientific and public interest. A selection of recent papers, dealing more with biological than social and economic aspects, is reviewed here, with particular attention...... to the reliability of projections of climate impacts on future fishery yields. The 2014 Intergovernmental Panel on Climate Change (IPCC) report expresses high confidence in projections that mid- and high-latitude fish catch potential will increase by 2050 and medium confidence that low-latitude catch potential...... understanding of climate impacts, such as how to improve coupled models from physics to fish and how to strengthen confidence in analysis of time series...

  18. Level of Self-confidence among Female Students of Hail University in Saudi Arabia in Relationship with some Variables

    Directory of Open Access Journals (Sweden)

    Wedad Mohammad Saleh Alkferi

    2017-11-01

    Full Text Available This study aimed to detect the level of self-confidence among female students of Hail University, and whether there were significant differences at the level of the students' self-confidence attributed to the variables of age and specialization. The study sample, which was randomly selected, consisted of 802 students from various disciplines at the university (medicine, engineering, psychology, and Islamic culture enrolled for the second semester of the academic year (2015/2016. To achieve the objectives of the study a confidence Scale developed by Kawasmeh and Farah (1996. The scale was checked for its validity and reliability. The statistical package SPSS was used to extract the results. Results of the study revealed a low level of self-confidence for the students of the university, whereas there were no statistically significant differences due to the variables of age and specialization. Keywords: Self-confidence, Students of Hail University, Saudi Arabia, Some variables.

  19. Validity and Reliability of a New Device (WIMU®) for Measuring Hamstring Muscle Extensibility.

    Science.gov (United States)

    Muyor, José M

    2017-09-01

    The aims of the current study were 1) to evaluate the validity of the WIMU ® system for measuring hamstring muscle extensibility in the passive straight leg raise (PSLR) test using an inclinometer for the criterion and 2) to determine the test-retest reliability of the WIMU ® system to measure hamstring muscle extensibility during the PSLR test. 55 subjects were evaluated on 2 separate occasions. Data from a Unilever inclinometer and WIMU ® system were collected simultaneously. Intraclass correlation coefficients (ICCs) for the validity were very high (0.983-1); a very low systematic bias (-0.21°--0.42°), random error (0.05°-0.04°) and standard error of the estimate (0.43°-0.34°) were observed (left-right leg, respectively) between the 2 devices (inclinometer and the WIMU ® system). The R 2 between the devices was 0.999 (p<0.001) in both the left and right legs. The test-retest reliability of the WIMU ® system was excellent, with ICCs ranging from 0.972-0.995, low coefficients of variation (0.01%), and a low standard error of the estimate (0.19-0.31°). The WIMU ® system showed strong concurrent validity and excellent test-retest reliability for the evaluation of hamstring muscle extensibility in the PSLR test. © Georg Thieme Verlag KG Stuttgart · New York.

  20. Test–Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study

    Directory of Open Access Journals (Sweden)

    Clare E. Palmer

    2018-01-01

    Full Text Available Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington’s disease (HD and Parkinson’s disease (PD across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test–retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test–retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test–retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT from the combined-Simon-Stop-Signal task showed moderate test–retest reliability, however, the combined SSRT interference effect showed poor test–retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.

  1. Test-Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study.

    Science.gov (United States)

    Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina

    2017-01-01

    Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability, however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.

  2. The use of ground reflecting boards in measuring wind turbine noise

    International Nuclear Information System (INIS)

    Henderson, A.R.; Mackinnon, A.; Benson, I.M.

    1992-01-01

    This paper gives an account of an experimental programme to assess the ground microphone measurement technique which can potentially increase the accuracy, reliability and confidence in wind turbine noise emission measurements. It shows that a 1 m diameter circular board can achieve acceptable accuracy and, since it is significantly more practical to use, could readily be adopted for international standards. (author)

  3. Reliability of a new method for measuring coronal trunk imbalance, the axis-line-angle technique.

    Science.gov (United States)

    Zhang, Rui-Fang; Liu, Kun; Wang, Xue; Liu, Qian; He, Jia-Wei; Wang, Xiang-Yang; Yan, Zhi-Han

    2015-12-01

    Accurate determination of the extent of trunk imbalance in the coronal plane plays a key role in an evaluation of patients with trunk imbalance, such as patients with adolescent idiopathic scoliosis. An established, widely used practice in evaluating trunk imbalance is to drop a plumb line from the C7 vertebra to a key reference axis, the central sacral vertical line (CSVL) in full-spine standing anterioposterior radiographs, and measuring the distance between them, the C7-CSVL. However, measuring the CSVL is subject to intraobserver differences, is error-prone, and is of poor reliability. Therefore, the development of a different way to measure trunk imbalance is needed. This study aimed to describe a new method to measure coronal trunk imbalance, the axis-line-angle technique (ALAT), which measures the angle at the intersection between the C7 plumb line and an axis line drawn from the vertebral centroid of the C7 to the middle of the superior border of the symphysis pubis, and to compare the reliability of the ALAT with that of the C7-CSVL. A prospective study at a university hospital was used. The patient sample consisted of sixty-nine consecutively enrolled men and women patients, aged 10-18 years, who had trunk imbalance defined as C7-CSVL longer than 20 mm on computed full-spine standing anterioposterior radiographs. Data were analyzed to determine the correlation between C7-CSVL and ALAT measurements and to determine intraobserver and interobserver reliabilities. Using a picture archiving and communication system, three radiologists independently evaluated trunk imbalance on the 69 computed radiographs by measuring the C7-CSVL and by measuring the angle determined by the ALAT. Data were analyzed to determine the correlations between the two measures of trunk imbalance, and to determine intraobserver and interobserver reliabilities of each of them. Overall results from the measurements by the C7-CSVL and the ALAT were significantly moderately correlated

  4. Reliability of thermal-hydraulic passive safety systems

    International Nuclear Information System (INIS)

    D'Auria, F.; Araneo, D.; Pierro, F.; Galassi, G.

    2014-01-01

    The scholar will be informed of reliability concepts applied to passive system adopted for nuclear reactors. Namely, for classical components and systems the failure concept is associated with malfunction of breaking of hardware. In the case of passive systems the failure is associated with phenomena. A method for studying the reliability of passive systems is discussed and is applied. The paper deals with the description of the REPAS (Reliability Evaluation of Passive Safety System) methodology developed by University of Pisa (UNIPI) and with results from its application. The general objective of the REPAS methodology is to characterize the performance of a passive system in order to increase the confidence toward its operation and to compare the performances of active and passive systems and the performances of different passive systems

  5. Is a sphygmomanometer a valid and reliable tool to measure the isometric strength of hip muscles? A systematic review.

    Science.gov (United States)

    Toohey, Liam Anthony; De Noronha, Marcos; Taylor, Carolyn; Thomas, James

    2015-02-01

    Muscle strength measurement is a key component of physiotherapists' assessment and is frequently used as an outcome measure. A sphygmomanometer is an instrument commonly used to measure blood pressure that can be potentially used as a tool to assess isometric muscle strength. To systematically review the evidence on the reliability and validity of a sphygmomanometer for measuring isometric strength of hip muscles. A literature search was conducted across four databases. Studies were eligible if they presented data on reliability and/or validity, used a sphygmomanometer to measure isometric muscle strength of the hip region, and were peer reviewed. The individual studies were evaluated for quality using a standardized critical appraisal tool. A total of 644 articles were screened for eligibility, with five articles chosen for inclusion. The use of a sphygmomanometer to objectively assess isometric muscle strength of the hip muscles appears to be reliable with intraclass correlation coefficient values ranging from 0.66 to 0.94 in elderly and young populations. No studies were identified that have assessed the validity of a sphygmomanometer. The sphygmomanometer appears to be reliable for assessment of isometric muscle strength around the hip joint, but further research is warranted to establish its validity.

  6. Test-Retest Reliability of Diffusion Tensor Imaging in Huntington's Disease.

    Science.gov (United States)

    Cole, James H; Farmer, Ruth E; Rees, Elin M; Johnson, Hans J; Frost, Chris; Scahill, Rachael I; Hobbs, Nicola Z

    2014-03-21

    Diffusion tensor imaging (DTI) has shown microstructural abnormalities in patients with Huntington's Disease (HD) and work is underway to characterise how these abnormalities change with disease progression. Using methods that will be applied in longitudinal research, we sought to establish the reliability of DTI in early HD patients and controls. Test-retest reliability, quantified using the intraclass correlation coefficient (ICC), was assessed using region-of-interest (ROI)-based white matter atlas and voxelwise approaches on repeat scan data from 22 participants (10 early HD, 12 controls). T1 data was used to generate further ROIs for analysis in a reduced sample of 18 participants. The results suggest that fractional anisotropy (FA) and other diffusivity metrics are generally highly reliable, with ICCs indicating considerably lower within-subject compared to between-subject variability in both HD patients and controls. Where ICC was low, particularly for the diffusivity measures in the caudate and putamen, this was partly influenced by outliers. The analysis suggests that the specific DTI methods used here are appropriate for cross-sectional research in HD, and give confidence that they can also be applied longitudinally, although this requires further investigation. An important caveat for DTI studies is that test-retest reliability may not be evenly distributed throughout the brain whereby highly anisotropic white matter regions tended to show lower relative within-subject variability than other white or grey matter regions.

  7. ESTIMATING RELIABILITY OF DISTURBANCES IN SATELLITE TIME SERIES DATA BASED ON STATISTICAL ANALYSIS

    Directory of Open Access Journals (Sweden)

    Z.-G. Zhou

    2016-06-01

    Full Text Available Normally, the status of land cover is inherently dynamic and changing continuously on temporal scale. However, disturbances or abnormal changes of land cover — caused by such as forest fire, flood, deforestation, and plant diseases — occur worldwide at unknown times and locations. Timely detection and characterization of these disturbances is of importance for land cover monitoring. Recently, many time-series-analysis methods have been developed for near real-time or online disturbance detection, using satellite image time series. However, the detection results were only labelled with “Change/ No change” by most of the present methods, while few methods focus on estimating reliability (or confidence level of the detected disturbances in image time series. To this end, this paper propose a statistical analysis method for estimating reliability of disturbances in new available remote sensing image time series, through analysis of full temporal information laid in time series data. The method consists of three main steps. (1 Segmenting and modelling of historical time series data based on Breaks for Additive Seasonal and Trend (BFAST. (2 Forecasting and detecting disturbances in new time series data. (3 Estimating reliability of each detected disturbance using statistical analysis based on Confidence Interval (CI and Confidence Levels (CL. The method was validated by estimating reliability of disturbance regions caused by a recent severe flooding occurred around the border of Russia and China. Results demonstrated that the method can estimate reliability of disturbances detected in satellite image with estimation error less than 5% and overall accuracy up to 90%.

  8. Reliability of new software in measuring cervical multifidus diameters and shoulder muscle strength in a synchronized way; an ultrasonographic study

    Directory of Open Access Journals (Sweden)

    Leila Rahnama

    2015-08-01

    Full Text Available OBJECTIVES: This study was conducted with the purpose of evaluating the inter-session reliability of new software to measure the diameters of the cervical multifidus muscle (CMM, both at rest and during isometric contractions of the shoulder abductors in subjects with neck pain and in healthy individuals.METHOD: In the present study, the reliability of measuring the diameters of the CMM with the Sonosynch software was evaluated by using 24 participants, including 12 subjects with chronic neck pain and 12 healthy individuals. The anterior-posterior diameter (APD and the lateral diameter (LD of the CMM were measured in a resting state and then repeated during isometric contraction of the shoulder abductors. Measurements were taken on separate occasions 3 to 7 days apart in order to determine inter-session reliability. Intraclass correlation coefficient (ICC, standard error of measurement (SEM, and smallest detectable difference (SDD were used to evaluate the relative and absolute reliability, respectively.RESULTS: The Sonosynch software has shown to be highly reliable in measuring the diameters of the CMM both in healthy subjects and in those with neck pain. The ICCs 95% CI for APD ranged from 0.84 to 0.94 in subjects with neck pain and from 0.86 to 0.94 in healthy subjects. For LD, the ICC 95% CI ranged from 0.64 to 0.95 in subjects with neck pain and from 0.82 to 0.92 in healthy subjects.CONCLUSIONS: Ultrasonographic measurement of the diameters of the CMM using Sonosynch has proved to be reliable especially for APD in healthy subjects as well as subjects with neck pain.

  9. Measuring reliable change in cognition using the Edinburgh Cognitive and Behavioural ALS Screen (ECAS).

    Science.gov (United States)

    Crockford, Christopher; Newton, Judith; Lonergan, Katie; Madden, Caoifa; Mays, Iain; O'Sullivan, Meabhdh; Costello, Emmet; Pinto-Grau, Marta; Vajda, Alice; Heverin, Mark; Pender, Niall; Al-Chalabi, Ammar; Hardiman, Orla; Abrahams, Sharon

    2018-02-01

    Cognitive impairment affects approximately 50% of people with amyotrophic lateral sclerosis (ALS). Research has indicated that impairment may worsen with disease progression. The Edinburgh Cognitive and Behavioural ALS Screen (ECAS) was designed to measure neuropsychological functioning in ALS, with its alternate forms (ECAS-A, B, and C) allowing for serial assessment over time. The aim of the present study was to establish reliable change scores for the alternate forms of the ECAS, and to explore practice effects and test-retest reliability of the ECAS's alternate forms. Eighty healthy participants were recruited, with 57 completing two and 51 completing three assessments. Participants were administered alternate versions of the ECAS serially (A-B-C) at four-month intervals. Intra-class correlation analysis was employed to explore test-retest reliability, while analysis of variance was used to examine the presence of practice effects. Reliable change indices (RCI) and regression-based methods were utilized to establish change scores for the ECAS alternate forms. Test-retest reliability was excellent for ALS Specific, ALS Non-Specific, and ECAS Total scores of the combined ECAS A, B, and C (all > .90). No significant practice effects were observed over the three testing sessions. RCI and regression-based methods produced similar change scores. The alternate forms of the ECAS possess excellent test-retest reliability in a healthy control sample, with no significant practice effects. The use of conservative RCI scores is recommended. Therefore, a change of ≥8, ≥4, and ≥9 for ALS Specific, ALS Non-Specific, and ECAS Total score is required for reliable change.

  10. Validity and test-retest reliability of a novel simple back extensor muscle strength test.

    Science.gov (United States)

    Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

    2017-01-01

    To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r  = 0.824, p  strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p  strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p  strength ( p  strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.

  11. Reliability of the Star Excursion Balance Test and Two New Similar Protocols to Measure Trunk Postural Control.

    Science.gov (United States)

    López-Plaza, Diego; Juan-Recio, Casto; Barbado, David; Ruiz-Pérez, Iñaki; Vera-Garcia, Francisco J

    2018-05-18

    Although the Star Excursion Balance test (SEBT) has shown a good intrasession reliability, the intersession reliability of this test has not been deeply studied. Furthermore, there is an evident high influence of the lower limbs in the performance of the SEBT, so even if it has been used to measure core stability, it is possibly not the most suitable measurement. The aims of this study were to (1) to assess the absolute and relative between-session reliability of the SEBT and 2 novel variations of this test to assess trunk postural control while sitting, ie, the Star Excursion Sitting Test (SEST) and the Star Excursion Timing Test (SETT); and (2) to analyze the relationships between these 3 test scores. Correlational and reliability test-retest study. Controlled laboratory environment. Twenty-seven physically active men (age: 24.54 ± 3.05 years). Relative and absolute reliability of the SEBT, SEST, and SETT were calculated through the intraclass correlation coefficient (ICC) and standard error of measurement (SEM), respectively. A Pearson correlation analysis was carried out between the variables of the 3 tests. Maximum normalized reach distances were assessed for different SEBT and SEST directions. In addition, composite indexes were calculated for SEBT, SEST, and SETT. The SEBT (dominant leg: ICC = 0.87 [0.73-0.94], SEM = 2.12 [1.66-2.93]; nondominant leg: ICC = 0.74 [0.50-0.87], SEM = 3.23 [2.54-4.45]), SEST (ICC = 0.85 [0.68-0.92], SEM = 1.27 [1.03-1.80]), and SETT (ICC = 0.61 [0.30-0.80], SEM = 2.31 [1.82-3.17]) composite indexes showed moderate-to-high 1-month reliability. A learning effect was detected for some SEBT and SEST directions and for SEST and SETT composite indexes. No significant correlations were found between SEBT and its 2 variations (r ≤ .366; P > .05). A significant correlation was found between the SEST and SETT composite indexes (r = .520; P > .01). SEBT, SEST, and SETT are reliable field protocols to measure postural control. However

  12. Reliability of anthropometric measurements in young male and female artistic gymnasts.

    Science.gov (United States)

    Siatras, Theophanis; Skaperda, Malamati; Mameletzi, Dimitra

    2010-12-01

    Body dimensions and body composition of children participating in artistic activities, such as gymnastics and many types of dancing, are important factors in performance improvement. The present study aimed to determine the reliability of a series of selected anthropometric measurements in young male and female gymnasts. Segment lengths, body breadths, circumferences, and skinfold thickness were measured in 20 young gymnasts by the same experienced examiner, using portable and easy-to-use instruments. All parameters were measured twice (test-retest) under the same conditions within a week's period. The high intra-class correlation coefficient (ICC) values ranging from 0.87 to 0.99, as well as the low coefficient of variation (CV) values (artistic gymnasts. Therefore, these measurements could contribute to further research in this field of investigation, helping to monitor young artistic gymnasts' growth status and identify specific characteristics for increased performance in this sport.

  13. Intra-rater reliability of cervical sensory motor function and cervical reconstruction test in healthy subjects

    Directory of Open Access Journals (Sweden)

    Hatamvand S

    2016-07-01

    Full Text Available Impairment of cervicocephalic and head joint position sense has an important role in the recurrent and chronic of cervicocephalic pain. The various tools are suggested for evaluating the cervicocephalic joint position sense. Although reconstruction of cervical angle is a clinical criterion for measuring the cervicocephalic proprioception, the reliability of this method has not been completely accepted. The purpose of this study was to evaluate intra-rater reliability of cervical sensory motor function and cervical reconstruction test in healthy subjects. twenty four healthy subjects (25.70±6.08 y through simple non-probability sampling participated in this single-group repeatedmeasures reliability study. Participants were asked to relocate the neck, as accurately as possible, after full active cervical flexion, extension and rotation to the left and right sides. Five trials were performed for each movement. Laser pointer was used in head of patient. The distance between zero spot and joint position which patient had been reconstructed, was measured by centimeter. Intra-class correlation Coefficient (ICCs and Pearson's correlation coefficient test was used to determine intra-rater reliability of variables. The results showed that intra-class correlation Coefficient (ICCs values with 95% confidence interval (CI and the standard error of the measurement (SEM were good to excellent agreement for a single investigator between measurement occasions. Intra-class correlation Coefficient (ICCs values were obtained for flexion movement (ICCs:0.75, good, extension movement (ICCs:0.81, very good, right rotation (ICCs:0.64, good and left rotation (ICCs:0.64, good. The cervicocephalic relocation test to neutral head position by laser pointer is a reliable method to measure cervical sensory motor function. Therefore, it can be used for evaluating cervicocephalic proprioception of patient with cervicocephalic pain.

  14. Reliability of Measuring Lumbar Lordosis, Flexion and Extension Using Dual Inclinometer in Healthy Subjects and Patients with Non-Specific Chronic Low Back Pain

    Directory of Open Access Journals (Sweden)

    Samira Garmabi

    2012-07-01

    Full Text Available Objective: Accurate assessment of lumbar range of motion is of great value for both evaluating lumbar functions and monitoring treatment progress. Recent research indicates that there is no general consensus on the most valid and reliable method of measuring spinal range of motion. The purpose of this study was to determine the intra-rater reliability of lumbar flexion and extension measurements (within-day and between-days using the dual inclinometer technique.   Materials & Methods: Lumbar flexion and extension of 22 women (14 healthy and 8 with low back pain, were measured by the same examiner on three occasions. The first two measurements were taken with half an hour apart on the first occasion to assess the within-day reliability and the third measurement was taken one week later to assess the between-days reliability.  Results: Within-day lumbar lordosis, flexion and extension measurements using dual inclinometer technique were shown to be very reliable with high Intraclass Correlation Coefficients (ICC values (ICC were 98%, 77% and 69% for lordosis, flexion and extension measurements, respectively in healthy subjects and 94%, 95% and 69% for lordosis, flexion and extension measurements, respectively in patients group. Between-Days measurements also demonstrated high reliability with the high values of ICC (ICC were 96%, 70% and 67% for lordosis, flexion and extension measurements, in healthy subjects and 91%, 71% and 66% for lordosis, flexion and extension measurements, respectively in patients group. Conclusion: The results indicated that, the dual inclinometer technique appears to be a highly reliable method for measuring lumbar lordosis, flexion and extension and can be used as a reliable tool in the assessment of lumbar range of motion and monitoring therapeutic interventions.

  15. A study of operational and testing reliability in software reliability analysis

    International Nuclear Information System (INIS)

    Yang, B.; Xie, M.

    2000-01-01

    Software reliability is an important aspect of any complex equipment today. Software reliability is usually estimated based on reliability models such as nonhomogeneous Poisson process (NHPP) models. Software systems are improving in testing phase, while it normally does not change in operational phase. Depending on whether the reliability is to be predicted for testing phase or operation phase, different measure should be used. In this paper, two different reliability concepts, namely, the operational reliability and the testing reliability, are clarified and studied in detail. These concepts have been mixed up or even misused in some existing literature. Using different reliability concept will lead to different reliability values obtained and it will further lead to different reliability-based decisions made. The difference of the estimated reliabilities is studied and the effect on the optimal release time is investigated

  16. Exploration of analysis methods for diagnostic imaging tests: problems with ROC AUC and confidence scores in CT colonography.

    Science.gov (United States)

    Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G

    2014-01-01

    Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.

  17. Test-Retest Reliability of Handgrip Strength as an Outcome Measure in Patients With Symptoms of Shoulder Impingement Syndrome.

    Science.gov (United States)

    Savva, Christos; Mougiaris, Paraskevas; Xadjimichael, Christoforos; Karagiannis, Christos; Efstathiou, Michalis

    The purpose of this study was to investigate the degree of test-retest reliability of grip strength measurement using a hand dynamometer in patients with shoulder impingement syndrome. A total of 19 patients (10 women and 9 men; mean ± standard deviation age, 33.2 ± 12.9 years; range 18-59 years) with shoulder impingement syndrome were measured using a hand dynamometer by the same data collector in 2 different testing sessions with a 7-day interval. During each session, patients were encouraged to exert 3 maximal isometric contractions on the affected hand and the mean value of the 3 efforts (measured in kilogram-force [Kgf]) was used for data analysis. The intraclass correlation coefficient (ICC 2,1 ) as well as the standard error of measurement (SEM) and Bland-Altman plot were used to estimate the degree of test-retest reliability and the measurement error, respectively. Grip strength data analysis revealed an ICC 2,1 score of 0.94, which, based on the Shrout classification, is considered as excellent test-retest reliability of grip strength measurement. The small values of SEMs reported in both sessions (SEM 1 , 2.55 Kgf; SEM 2 , 2.39 Kgf) and the small width of the 95% limits of agreement in the Bland-Altman plot (ranging from -7.39 Kgf to 7.03 Kgf) reflected the measurement precision and the narrow variation of the differences during the 2 testing sessions. Results from this study identified excellent test-retest reliability of grip strength measurement in shoulder impingement syndrome, indicating its potential use as an outcome measure in clinical practice. Copyright © 2018. Published by Elsevier Inc.

  18. Nearest unlike neighbor (NUN): an aid to decision confidence estimation

    Science.gov (United States)

    Dasarathy, Belur V.

    1995-09-01

    The concept of nearest unlike neighbor (NUN), proposed and explored previously in the design of nearest neighbor (NN) based decision systems, is further exploited in this study to develop a measure of confidence in the decisions made by NN-based decision systems. This measure of confidence, on the basis of comparison with a user-defined threshold, may be used to determine the acceptability of the decision provided by the NN-based decision system. The concepts, associated methodology, and some illustrative numerical examples using the now classical Iris data to bring out the ease of implementation and effectiveness of the proposed innovations are presented.

  19. Free release measurement of radioactive waste on the basis of the Bayes theory

    International Nuclear Information System (INIS)

    Sokcic-Kostic, M.; Langer, F.; Schultheis, R.

    2013-01-01

    The application of Bayesian theory in the evaluation of the free release measurements requires complex co-ordination between experiment and analysis. The algorithms are more complex compared to those used in the frequentist data analysis and partly to those of the Monte Carlo methods. The user can get an objective treatment of parameters of the measurement error and - as a result - a reliable indication of confidence intervals. For release measurement, the upper limit of the confidence interval must be compared with the limit given by the Radiation Protection Regulations (StrlSchV) to decide on a possible release of the material under test. (orig.)

  20. Test-retest reliability and minimal detectable change of two simplified 3-point balance measures in patients with stroke.

    Science.gov (United States)

    Chen, Yi-Miau; Huang, Yi-Jing; Huang, Chien-Yu; Lin, Gong-Hong; Liaw, Lih-Jiun; Lee, Shih-Chieh; Hsieh, Ching-Lin

    2017-10-01

    The 3-point Berg Balance Scale (BBS-3P) and 3-point Postural Assessment Scale for Stroke Patients (PASS-3P) were simplified from the BBS and PASS to overcome the complex scoring systems. The BBS-3P and PASS-3P were more feasible in busy clinical practice and showed similarly sound validity and responsiveness to the original measures. However, the reliability of the BBS-3P and PASS-3P is unknown limiting their utility and the interpretability of scores. We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BBS-3P and PASS-3P in patients with stroke. Cross-sectional study. The rehabilitation departments of a medical center and a community hospital. A total of 51 chronic stroke patients (64.7% male). Both balance measures were administered twice 7 days apart. The test-retest reliability of both the BBS-3P and PASS-3P were examined by intraclass correlation coefficients (ICC). The MDC and its percentage over the total score (MDC%) of each measure was calculated for examining the random measurement errors. The ICC values of the BBS-3P and PASS-3P were 0.99 and 0.97, respectively. The MDC% (MDC) of the BBS-3P and PASS-3P were 9.1% (5.1 points) and 8.4% (3.0 points), respectively, indicating that both measures had small and acceptable random measurement errors. Our results showed that both the BBS-3P and the PASS-3P had good test-retest reliability, with small and acceptable random measurement error. These two simplified 3-level balance measures can provide reliable results over time. Our findings support the repeated administration of the BBS-3P and PASS-3P to monitor the balance of patients with stroke. The MDC values can help clinicians and researchers interpret the change scores more precisely.

  1. Reliability of the discrete choice experiment at the input and output level in patients with rheumatoid arthritis

    DEFF Research Database (Denmark)

    Skjoldborg, Ulla Slothuus; Lauridsen, Jørgen; Junker, Peter

    2009-01-01

    OBJECTIVES: To investigate the issue of conjoint reliability over time. METHODS: A discrete choice experiment was applied using scenarios that describe the effect of treating rheumatoid arthritis patients with TNF-alpha inhibitors, a novel class of highly effective, but expensive antirheumatic...... agents. Respondents participated in three face-to-face interviews over a period of 4 months. Reliability was measured both at the input level, where the consistency of matches made by respondents to the Discrete Choice Experiment (DCE) question between replications was determined, and at the output level...... and the final choice in survey 3. Output level: The confidence intervals for WTP figures in surveys 1 and 2 and 1 and 3 were overlapping, implying that the DCE was reliable at the output level over time. CONCLUSION: The proportion of consistent responses was higher than would be expected by chance. Conjoint...

  2. Intra-observer reproducibility and interobserver reliability of the radiographic parameters in the Spinal Deformity Study Group's AIS Radiographic Measurement Manual.

    Science.gov (United States)

    Dang, Natasha Radhika; Moreau, Marc J; Hill, Douglas L; Mahood, James K; Raso, James

    2005-05-01

    Retrospective cross-sectional assessment of the reproducibility and reliability of radiographic parameters. To measure the intra-examiner and interexaminer reproducibility and reliability of salient radiographic features. The management and treatment of adolescent idiopathic scoliosis (AIS) depends on accurate and reproducible radiographic measurements of the deformity. Ten sets of radiographs were randomly selected from a sample of patients with AIS, with initial curves between 20 degrees and 45 degrees. Fourteen measures of the deformity were measured from posteroanterior and lateral radiographs by 2 examiners, and were repeated 5 times at intervals of 3-5 days. Intra-examiner and interexaminer differences were examined. The parameters include measures of curve size, spinal imbalance, sagittal kyphosis and alignment, maximum apical vertebral rotation, T1 tilt, spondylolysis/spondylolisthesis, and skeletal age. Intra-examiner reproducibility was generally excellent for parameters measured from the posteroanterior radiographs but only fair to good for parameters from the lateral radiographs, in which some landmarks were not clearly visible. Of the 13 parameters observed, 7 had excellent interobserver reliability. The measurements from the lateral radiograph were less reproducible and reliable and, thus, may not add value to the assessment of AIS. Taking additional measures encourages a systematic and comprehensive assessment of spinal radiographs.

  3. Test-Retest Reliability of Self-Reported Sexual Health Measures among US Hispanic Adolescents

    Science.gov (United States)

    Jerman, Petra; Berglas, Nancy F.; Rohrbach, Louise A.; Constantine, Norman A.

    2016-01-01

    Objective: Although Hispanic adolescents in the USA are often the focus of sexual health interventions, their response to survey measures has rarely been assessed within evaluation studies. This study documents the test-retest reliability of a wide range of self-reported sexual health values, attitudes, knowledge and behaviours among Hispanic…

  4. Quantifying frontal plane knee motion during single limb squats: reliability and validity of 2-dimensional measures.

    Science.gov (United States)

    Gwynne, Craig R; Curran, Sarah A

    2014-12-01

    Clinical assessment of lower limb kinematics during dynamic tasks may identify individuals who demonstrate abnormal movement patterns that may lead to etiology of exacerbation of knee conditions such as patellofemoral joint (PFJt) pain. The purpose of this study was to determine the reliability, validity and associated measurement error of a clinically appropriate two-dimensional (2-D) procedure of quantifying frontal plane knee alignment during single limb squats. Nine female and nine male recreationally active subjects with no history of PFJt pain had frontal plane limb alignment assessed using three-dimensional (3-D) motion analysis and digital video cameras (2-D analysis) while performing single limb squats. The association between 2-D and 3-D measures was quantified using Pearson's product correlation coefficients. Intraclass correlation coefficients (ICCs) were determined for within- and between-session reliability of 2-D data and standard error of measurement (SEM) was used to establish measurement error. Frontal plane limb alignment assessed with 2-D analysis demonstrated good correlation compared with 3-D methods (r = 0.64 to 0.78, p < 0.001). Within-session (0.86) and between-session ICCs (0.74) demonstrated good reliability for 2-D measures and SEM scores ranged from 2° to 4°. 2-D measures have good consistency and may provide a valid measure of lower limb alignment when compared to existing 3-D methods. Assessment of lower limb kinematics using 2-D methods may be an accurate and clinically useful alternative to 3-D motion analysis when identifying individuals who demonstrate abnormal movement patterns associated with PFJt pain. 2b.

  5. Confidence, Visual Research, and the Aesthetic Function

    Directory of Open Access Journals (Sweden)

    Stan Ruecker

    2007-05-01

    Full Text Available The goal of this article is to identify and describe one of the primary purposes of aesthetic quality in the design of computer interfaces and visualization tools. We suggest that humanists can derive advantages in visual research by acknowledging by their efforts to advance aesthetic quality that a significant function of aesthetics in this context is to inspire the user’s confidence. This confidence typically serves to create a sense of trust in the provider of the interface or tool. In turn, this increased trust may result in an increased willingness to engage with the object, on the basis that it demonstrates an attention to detail that promises to reward increased engagement. In addition to confidence, the aesthetic may also contribute to a heightened degree of satisfaction with having spent time using or investigating the object. In the realm of interface design and visualization research, we propose that these aesthetic functions have implications not only for the quality of interactions, but also for the results of the standard measures of performance and preference.

  6. Inter-Rater Reliability and Downstream Financial Implications of Electrocardiography Screening in Young Athletes.

    Science.gov (United States)

    Dhutia, Harshil; Malhotra, Aneil; Yeo, Tee Joo; Ster, Irina Chis; Gabus, Vincent; Steriotis, Alexandros; Dores, Helder; Mellor, Greg; García-Corrales, Carmen; Ensam, Bode; Jayalapan, Viknesh; Ezzat, Vivienne Anne; Finocchiaro, Gherardo; Gati, Sabiha; Papadakis, Michael; Tome-Esteban, Maria; Sharma, Sanjay

    2017-08-01

    Preparticipation screening for cardiovascular disease in young athletes with electrocardiography is endorsed by the European Society of Cardiology and several major sporting organizations. One of the concerns of the ECG as a screening test in young athletes relates to the potential for variation in interpretation. We investigated the degree of variation in ECG interpretation in athletes and its financial impact among cardiologists of differing experience. Eight cardiologists (4 with experience in screening athletes) each reported 400 ECGs of consecutively screened young athletes according to the 2010 European Society of Cardiology recommendations, Seattle criteria, and refined criteria. Cohen κ coefficient was used to calculate interobserver reliability. Cardiologists proposed secondary investigations after ECG interpretation, the costs of which were based on the UK National Health Service tariffs. Inexperienced cardiologists were more likely to classify an ECG as abnormal compared with experienced cardiologists (odds ratio, 1.44; 95% confidence interval, 1.03-2.02). Modification of ECG interpretation criteria improved interobserver reliability for categorizing an ECG as abnormal from poor (2010 European Society of Cardiology recommendations; κ=0.15) to moderate (refined criteria; κ=0.41) among inexperienced cardiologists; however, interobserver reliability was moderate for all 3 criteria among experienced cardiologists (κ=0.40-0.53). Inexperienced cardiologists were more likely to refer athletes for further evaluation compared with experienced cardiologists (odds ratio, 4.74; 95% confidence interval, 3.50-6.43) with poorer interobserver reliability (κ=0.22 versus κ=0.47). Interobserver reliability for secondary investigations after ECG interpretation ranged from poor to fair among inexperienced cardiologists (κ=0.15-0.30) and fair to moderate among experienced cardiologists (κ=0.21-0.46). The cost of cardiovascular evaluation per athlete was $175 (95

  7. Factors influencing the reliability of non-electric detonating circuit in underground uranium mines and preventive measures of misfiring

    International Nuclear Information System (INIS)

    Li Qin

    2010-01-01

    Characteristics of non-electric detonating circuit are introduced. The main factors influencing the reliability of non-electric detonating circuit are described. Taking an underground blasting of a uranium mine for example, the reliability of various kinds of detonating network system is calculated using the reliability theory and numerical analysis method. The reasons that cause the misfiring in non-electric detonating circuit system are analyzed, and preventive measures are put forward.(authors)

  8. Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

    Science.gov (United States)

    Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

    2016-04-01

    The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.

  9. Reliability of tensiomyography and myotonometry in detecting mechanical and contractile characteristics of the lumbar erector spinae in healthy volunteers.

    Science.gov (United States)

    Lohr, Christine; Braumann, Klaus-Michael; Reer, Ruediger; Schroeder, Jan; Schmidt, Tobias

    2018-04-20

    Tensiomyography™ (TMG) and MyotonPRO ® (MMT) are two non-invasive devices for monitoring muscle contractile and mechanical characteristics. This study aimed to evaluate the test-retest reliability of TMG and MMT parameters for measuring (TMG:) muscle displacement (D m ), contraction time (T c ), and velocity (V c ) and (MMT:) frequency (F), stiffness (S), and decrement (D) of the erector spinae muscles (ES) in healthy adults. A particular focus was set on the establishment of reliability measures for the previously barely evaluated secondary TMG parameter V c . Twenty-four subjects (13 female and 11 male, mean ± SD, 38.0 ± 12.0 years) were measured using TMG and MMT over 2 consecutive days. Absolute and relative reliability was calculated by standard error of measurement (SEM, SEM%), Minimum detectable change (MDC, MDC%), coefficient of variation (CV%) and intraclass correlation coefficient (ICC, 3.1) with a 95% confidence interval (CI). The ICCs for all variables and test-retest intervals ranged from 0.75 to 0.99 indicating a good to excellent relative reliability for both TMG and MMT, demonstrating the lowest values for TMG T c and between-day MMT D (ICC TMG parameter (ICC > 0.95, CV TMG V c could be established successfully. Its further applicability needs to be confirmed in future studies. MMT was found to be more reliable on repeated testing than the two other TMG parameters D m and T c .

  10. Feasibility and test-retest reliability of measuring lower‑limb strength in young children with cerebral palsy.

    Science.gov (United States)

    Van Vulpen, L F; De Groot, S; Becher, J G; De Wolf, G S; Dallmeijer, A J

    2013-12-01

    Quantifying leg muscle strength in young children with cerebral palsy (CP) is essential for identifying muscle groups for treatment and for monitoring progress. To study the feasibility, intratester reliability and the optimal test design (number of test occasions and repetitions) of measuring lower-limb strength with handheld dynamometry (HHD) and dynamic ankle plantar flexor strength with the standing heel-rise (SH) test in 3-10 year aged children with CP. Test-retest design. Rehabilitation centre, special needs school for children with disabilities, and university medical centre. Knee extensor, hip abductor and calf muscle strength was assessed in 20 ambulatory children with spastic CP (3-5 years [N.=10] and 6-10 years [N.=10]) on two test occasions. Intraclass correlation coefficients (ICC) and Smallest Detectable Differences (SDD) were calculated to determine the optimal test design for detecting changes in strength. All isometric strength tests had acceptable SDDs (9-30%), when taking the mean values of 2-3 test occasions (separate days) and 2-3 repetitions. The one-leg SH test had large SDDs (40-128% for younger group, 23-48% for older group). Isometric strength (improvements) can only be measured reliably with HHD in young children with CP when the average values over at least 2 test occasions are taken. Reliability of the SH test is not sufficient for measuring individual changes in dynamic muscle strength in the younger children. Results of this study can be used to determine the optimal number of test occasions and repetitions for reliable HHD measurements depending on expected changes, muscle group and age in 3-10 year old children with CP.

  11. Reliability of hand-held dynamometry for measurement of lower limb muscle strength in children with Duchenne and Becker muscular dystrophy

    Directory of Open Access Journals (Sweden)

    Wei SHI

    2015-05-01

    Full Text Available Objective To determine the reliability of hand-held dynamometry (HHD for lower limb isometric muscle strength measurement in children with Duchenne and Becker muscular dystrophy (DMD/BMD.  Methods A total of 21 children [20 males and one female; mean age was (7.88 ± 2.87 years, ranging between 3.96-14.09 years; mean age at diagnosis was (5.88 ± 2.88 years, ranging between 1.35-12.89 years; mean height was (120.64 ± 16.30 cm, ranging between 97-153 cm; mean body weight was (24.62 ± 9.05 kg, ranging between 14-50 kg] with DMD (19/21 and BMD (2/21 were involved from Rehabilitation Center of Children's Hospital of Fudan University. The muscle strength of hip, knee and ankle was measured by HHD under standardized test methods. The test-retest results were compared to determine the inter-test reliability, and the results among testers were compared to determine the inter-tester reliability.  Results HHD showed fine inter-tester reliability (ICC = 0.762-0.978 and inter-test reliability (ICC = 0.690-0.938 in measuring lower limb muscle strength of children with DMD/BMD. Results also showed relatively poor reliability in distal muscle groups (foot plantar flexion and dorsiflexion.  Conclusions HHD, showing fine inter-tester and inter-test reliability in measuring the lower limb muscle strength of children with DMD/BMD, can be used in monitoring muscle strength changing and assessing effects of clinical interventions. DOI: 10.3969/j.issn.1672-6731.2015.05.009

  12. Validity and reliability of using photography for measuring knee range of motion: a methodological study

    Directory of Open Access Journals (Sweden)

    Adie Sam

    2011-04-01

    Full Text Available Abstract Background The clinimetric properties of knee goniometry are essential to appreciate in light of its extensive use in the orthopaedic and rehabilitative communities. Intra-observer reliability is thought to be satisfactory, but the validity and inter-rater reliability of knee goniometry often demonstrate unacceptable levels of variation. This study tests the validity and reliability of measuring knee range of motion using goniometry and photographic records. Methods Design: Methodology study assessing the validity and reliability of one method ('Marker Method' which uses a skin marker over the greater trochanter and another method ('Line of Femur Method' which requires estimation of the line of femur. Setting: Radiology and orthopaedic departments of two teaching hospitals. Participants: 31 volunteers (13 arthritic and 18 healthy subjects. Knee range of motion was measured radiographically and photographically using a goniometer. Three assessors were assessed for reliability and validity. Main outcomes: Agreement between methods and within raters was assessed using concordance correlation coefficient (CCCs. Agreement between raters was assessed using intra-class correlation coefficients (ICCs. 95% limits of agreement for the mean difference for all paired comparisons were computed. Results Validity (referenced to radiographs: Each method for all 3 raters yielded very high CCCs for flexion (0.975 to 0.988, and moderate to substantial CCCs for extension angles (0.478 to 0.678. The mean differences and 95% limits of agreement were narrower for flexion than they were for extension. Intra-rater reliability: For flexion and extension, very high CCCs were attained for all 3 raters for both methods with slightly greater CCCs seen for flexion (CCCs varied from 0.981 to 0.998. Inter-rater reliability: For both methods, very high ICCs (min to max: 0.891 to 0.995 were obtained for flexion and extension. Slightly higher coefficients were obtained

  13. Validity and reliability of an instrumented leg-extension machine for measuring isometric muscle strength of the knee extensors.

    Science.gov (United States)

    Ruschel, Caroline; Haupenthal, Alessandro; Jacomel, Gabriel Fernandes; Fontana, Heiliane de Brito; Santos, Daniela Pacheco dos; Scoz, Robson Dias; Roesler, Helio

    2015-05-20

    Isometric muscle strength of knee extensors has been assessed for estimating performance, evaluating progress during physical training, and investigating the relationship between isometric and dynamic/functional performance. To assess the validity and reliability of an adapted leg-extension machine for measuring isometric knee extensor force. Validity (concurrent approach) and reliability (test and test-retest approach) study. University laboratory. 70 healthy men and women aged between 20 and 30 y (39 in the validity study and 31 in the reliability study). Intraclass correlation coefficient (ICC) values calculated for the maximum voluntary isometric torque of knee extensors at 30°, 60°, and 90°, measured with the prototype and with an isokinetic dynamometer (ICC2,1, validity study) and measured with the prototype in test and retest sessions, scheduled from 48 h to 72 h apart (ICC1,1, reliability study). In the validity analysis, the prototype showed good agreement for measurements at 30° (ICC2,1 = .75, SEM = 18.2 Nm) and excellent agreement for measurements at 60° (ICC2,1 = .93, SEM = 9.6 Nm) and at 90° (ICC2,1 = .94, SEM = 8.9 Nm). Regarding the reliability analysis, between-days' ICC1,1 were good to excellent, ranging from .88 to .93. Standard error of measurement and minimal detectable difference based on test-retest ranged from 11.7 Nm to 18.1 Nm and 32.5 Nm to 50.1 Nm, respectively, for the 3 analyzed knee angles. The analysis of validity and repeatability of the prototype for measuring isometric muscle strength has shown to be good or excellent, depending on the knee joint angle analyzed. The new instrument, which presents a relative low cost and easiness of transportation when compared with an isokinetic dynamometer, is valid and provides consistent data concerning isometric strength of knee extensors and, for this reason, can be used for practical, clinical, and research purposes.

  14. Reliability of a Cryoscopic Micro-Osmometer Using 15-µL Plasma Samples to Measure Hydration Status in Varied Environmental Conditions

    Science.gov (United States)

    Scanlan, Aaron T.; Richter-Stretton, Gina L.; Madueno, Maria C.; Borges, Nattai R.; Fenning, Andrew S.

    2017-01-01

    Measurement of plasma osmolality (P[subscript osm]) remains popular for assessing hydration status in exercise science. However, a controlled reliability assessment of micro-osmometry using small sample volumes to measure Posm remains to be performed. This study aimed to examine the reliability of a cryoscopic micro-osmometer requiring 15-µL…

  15. The reliability of running economy expressed as oxygen cost and energy cost in trained distance runners.

    Science.gov (United States)

    Shaw, Andrew J; Ingham, Stephen A; Fudge, Barry W; Folland, Jonathan P

    2013-12-01

    This study assessed the between-test reliability of oxygen cost (OC) and energy cost (EC) in distance runners, and contrasted it with the smallest worthwhile change (SWC) of these measures. OC and EC displayed similar levels of within-subject variation (typical error < 3.85%). However, the typical error (2.75% vs 2.74%) was greater than the SWC (1.38% vs 1.71%) for both OC and EC, respectively, indicating insufficient sensitivity to confidently detect small, but meaningful, changes in OC and EC.

  16. Confidence-building in the Asia-Pacific region. Report of working group II

    International Nuclear Information System (INIS)

    Cotton, J.

    1992-01-01

    Detailed presentations of South and North Korea offers positive evaluation concerning bilateral agreements, which aim both reconciliation between the two states and denuclearization of the Korean peninsula. Consideration was given to confidence building measures in the Asia-Pacific region as a whole as well as to the progress made in introducing such measures in various Subregions of Asia-Pacific. The concept of confidence building actually implies a two-part agenda, particular procedures and general process

  17. Reliability, validity and minimal detectable change of the Mini-BESTest in Greek participants with chronic stroke.

    Science.gov (United States)

    Lampropoulou, Sofia I; Billis, Evdokia; Gedikoglou, Ingrid A; Michailidou, Christina; Nowicky, Alexander V; Skrinou, Dimitra; Michailidi, Fotini; Chandrinou, Danae; Meligkoni, Margarita

    2018-02-23

    This study aimed to investigate the psychometric characteristics of reliability, validity and ability to detect change of a newly developed balance assessment tool, the Mini-BESTest, in Greek patients with stroke. A prospective, observational design study with test-retest measures was conducted. A convenience sample of 21 Greek patients with chronic stroke (14 male, 7 female; age of 63 ± 16 years) was recruited. Two independent examiners administered the scale, for the inter-rater reliability, twice within 10 days for the test-retest reliability. Bland Altman Analysis for repeated measures assessed the absolute reliability and the Standard Error of Measurement (SEM) and the Minimum Detectable Change at 95% confidence interval (MDC 95% ) were established. The Greek Mini-BESTest (Mini-BESTest GR ) was correlated with the Greek Berg Balance Scale (BBS GR ) for assessing the concurrent validity and with the Timed Up and Go (TUG), the Functional Reach Test (FRT) and the Greek Falls Efficacy Scale-International (FES-I GR ) for the convergent validity. The Mini-BESTestGR demonstrated excellent inter-rater reliability (ICC (95%CI) = 0.997 (0.995-0.999, SEM = 0.46) with the scores of two raters within the limits of agreement (mean dif  = -0.143 ± 0.727, p > 0.05) and test-retest reliability (ICC (95%CI) = 0.966 (0.926-0.988), SEM = 1.53). Additionally, the Mini-BESTest GR yielded very strong to moderate correlations with BBS GR (r = 0.924, p reliability and the equally good validity of the Mini-BESTest GR , strongly support its utility in Greek people with chronic stroke. Its ability to identify clinically meaningful changes and falls risk need further investigation.

  18. Measuring disability: a systematic review of the validity and reliability of the Global Activity Limitations Indicator (GALI).

    Science.gov (United States)

    Van Oyen, Herman; Bogaert, Petronille; Yokota, Renata T C; Berger, Nicolas

    2018-01-01

    GALI or Global Activity Limitation Indicator is a global survey instrument measuring participation restriction. GALI is the measure underlying the European indicator Healthy Life Years (HLY). Gali has a substantial policy use within the EU and its Member States. The objective of current paper is to bring together what is known from published manuscripts on the validity and the reliability of GALI. Following the PRISMA guidelines, two search strategies (PUBMED, Google Scholar) were combined to identify manuscripts published in English with publication date 2000 or beyond. Articles were classified as reliability studies, concurrent or predictive validity studies, in national or international populations. Four cross-sectional studies (of which 2 international) studied how GALI relates to other health measures (concurrent validity). A dose-response effect by GALI severity level on the association with the other health status measures was observed in the national studies. The 2 international studies (SHARE, EHIS) concluded that the odds of reporting participation restriction was higher in subjects with self-reported or observed functional limitations. In SHARE, the size of the Odds Ratio's (ORs) in the different countries was homogeneous, while in EHIS the size of the ORs varied more strongly. For the predictive validity, subjects were followed over time (4 studies of which one international). GALI proved, both in national and international data, to be a consistent predictor of future health outcomes both in terms of mortality and health care expenditure. As predictors of mortality, the two distinct health concepts, self-rated health and GALI, acted independently and complementary of each other. The one reliability study identified reported a sufficient reliability of GALI. GALI as inclusive one question instrument fits all conceptual characteristics specified for a global measure on participation restriction. In none of the studies, included in the review, there was

  19. Sonographic measurements of the achilles tendon, plantar fascia, and heel fat pad are reliable

    DEFF Research Database (Denmark)

    Johannsen, Finn E; Jensen, Signe; Stallknecht, Sandra E

    2016-01-01

    PURPOSE: To determine intra- and interobserver reliability and precision of sonographic (US) scanning in measuring thickness of the Achilles tendon, plantar fascia, and heel fat pad in patients with heel pain. METHODS: Seventeen consecutive patients referred with heel pain were included. Two...

  20. Developing Reliable Life Support for Mars

    Science.gov (United States)

    Jones, Harry W.

    2017-01-01

    A human mission to Mars will require highly reliable life support systems. Mars life support systems may recycle water and oxygen using systems similar to those on the International Space Station (ISS). However, achieving sufficient reliability is less difficult for ISS than it will be for Mars. If an ISS system has a serious failure, it is possible to provide spare parts, or directly supply water or oxygen, or if necessary bring the crew back to Earth. Life support for Mars must be designed, tested, and improved as needed to achieve high demonstrated reliability. A quantitative reliability goal should be established and used to guide development t. The designers should select reliable components and minimize interface and integration problems. In theory a system can achieve the component-limited reliability, but testing often reveal unexpected failures due to design mistakes or flawed components. Testing should extend long enough to detect any unexpected failure modes and to verify the expected reliability. Iterated redesign and retest may be required to achieve the reliability goal. If the reliability is less than required, it may be improved by providing spare components or redundant systems. The number of spares required to achieve a given reliability goal depends on the component failure rate. If the failure rate is under estimated, the number of spares will be insufficient and the system may fail. If the design is likely to have undiscovered design or component problems, it is advisable to use dissimilar redundancy, even though this multiplies the design and development cost. In the ideal case, a human tended closed system operational test should be conducted to gain confidence in operations, maintenance, and repair. The difficulty in achieving high reliability in unproven complex systems may require the use of simpler, more mature, intrinsically higher reliability systems. The limitations of budget, schedule, and technology may suggest accepting lower and

  1. Registered nurse leadership style and confidence in delegation.

    Science.gov (United States)

    Saccomano, Scott J; Pinto-Zipp, Genevieve

    2011-05-01

      Leadership and confidence in delegation are two important explanatory constructs of nursing practice. The relationship between these constructs, however, is not clearly understood. To be successful in their roles as leaders, regardless of their experience, registered nurses (RNs) need to understand how to best delegate. The present study explored and described the relationship between RN leadership styles, demographic variables and confidence in delegation in a community teaching hospital. Utilizing a cross-sectional survey design, RNs employed in one acute care hospital completed questionnaires that measured leadership style [Path-Goal Leadership Questionnaire (PGLQ)] and confidence in delegating patient care tasks [Confidence and Intent to Delegate Scale (CIDS)]. Contrary to expectations, the data did not confirm a relationship between confidence in delegating tasks to unlicensed assistive personnel (UAPs) and leadership style. Nurses who were diploma or associate degree prepared were initially less confident in delegating tasks to UAPs as compared with RNs holding a bachelor's degree or higher. Further, after 5 years of clinical nursing experience, nurses with less educational experience reported more confidence in delegating tasks as compared with RNs with more educational experience. The lack of a relationship between leadership style and confidence in delegating patient care tasks were discussed in terms of the PGLQ classification criteria and hospital unit differences. As suggested by the significant two-way interaction between educational preparation and clinical nursing experience, changes in the nurse's confidence in delegating patient care tasks to UAPs was a dynamic changing variable that resulted from the interplay between amount of educational preparation and years of clinical nursing experience in this population of nurses. Clearly, generalizability of these findings to nurses outside the US is questionable, thus nurse managers must be familiar

  2. Reliability and Correlation of Static and Dynamic Foot Arch Measurement in a Healthy Pediatric Population.

    Science.gov (United States)

    Scholz, Timo; Zech, Astrid; Wegscheider, Karl; Lezius, Susanne; Braumann, Klaus-Michael; Sehner, Susanne; Hollander, Karsten

    2017-09-01

    Measurement of the medial longitudinal foot arch in children is a controversial topic, as there are many different methods without a definite standard procedure. The purpose of this study was to 1) investigate intraday and interrater reliability regarding dynamic arch index and static arch height, 2) explore the correlation between both arch indices, and 3) examine the variation of the medial longitudinal arch at two different times of the day. Eighty-six children (mean ± SD age, 8.9 ± 1.9 years) participated in the study. Dynamic footprint data were captured with a pedobarographic platform. For static arch measurements, a specially constructed caliper was used to assess heel-to-toe length and dorsum height. A mixed model was established to determine reliability and variation. Reliability was found to be excellent for the static arch height index in sitting (intraday, 0.90; interrater, 0.80) and standing positions (0.88 and 0.85) and for the dynamic arch index (both 1.00). There was poor correlation between static and dynamic assessment of the medial longitudinal arch (standing dynamic arch index, r = -0.138; sitting dynamic arch index, r = -0.070). Static measurements were found to be significantly influenced by the time of day (P body mass index (P mind. For clinical purposes, static and dynamic arch data should be interpreted separately.

  3. Opportunities for measuring wheelchair kinematics in match settings; reliability of a three inertial sensor configuration.

    Science.gov (United States)

    van der Slikke, R M A; Berger, M A M; Bregman, D J J; Lagerberg, A H; Veeger, H E J

    2015-09-18

    Knowledge of wheelchair kinematics during a match is prerequisite for performance improvement in wheelchair basketball. Unfortunately, no measurement system providing key kinematic outcomes proved to be reliable in competition. In this study, the reliability of estimated wheelchair kinematics based on a three inertial measurement unit (IMU) configuration was assessed in wheelchair basketball match-like conditions. Twenty participants performed a series of tests reflecting different motion aspects of wheelchair basketball. During the tests wheelchair kinematics were simultaneously measured using IMUs on wheels and frame, and a 24-camera optical motion analysis system serving as gold standard. Results showed only small deviations of the IMU method compared to the gold standard, once a newly developed skid correction algorithm was applied. Calculated Root Mean Square Errors (RMSE) showed good estimates for frame displacement (RMSE≤0.05 m) and speed (RMSE≤0.1m/s), except for three truly vigorous tests. Estimates of frame rotation in the horizontal plane (RMSE0.90), rotational speed (ICC>0.99) and IRC (ICC> 0.90) showed high correlations between IMU data and gold standard. IMU based estimation of wheelchair kinematics provided reliable results, except for brief moments of wheel skidding in truly vigorous tests. The IMU method is believed to enable prospective research in wheelchair basketball match conditions and contribute to individual support of athletes in everyday sports practice. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Using VIIRS Day/Night Band to Measure Electricity Supply Reliability: Preliminary Results from Maharashtra, India

    Directory of Open Access Journals (Sweden)

    Michael L. Mann

    2016-08-01

    Full Text Available Unreliable electricity supplies are common in developing countries and impose large socio-economic costs, yet precise information on electricity reliability is typically unavailable. This paper presents preliminary results from a machine-learning approach for using satellite imagery of nighttime lights to develop estimates of electricity reliability for western India at a finer spatial scale. We use data from the Visible Infrared Imaging Radiometer Suite (VIIRS onboard the Suomi National Polar Partnership (SNPP satellite together with newly-available data from networked household voltage meters. Our results point to the possibilities of this approach as well as areas for refinement. With currently available training data, we find a limited ability to detect individual outages identified by household-level measurements of electricity voltage. This is likely due to the relatively small number of individual outages observed in our preliminary data. However, we find that the approach can estimate electricity reliability rates for individual locations fairly well, with the predicted versus actual regression yielding an R2 > 0.5. We also find that, despite the after midnight overpass time of the SNPP satellite, the reliability estimates derived are representative of daytime reliability.

  5. Reliability of segmental accelerations measured using a new wireless gait analysis system.

    Science.gov (United States)

    Kavanagh, Justin J; Morrison, Steven; James, Daniel A; Barrett, Rod

    2006-01-01

    The purpose of this study was to determine the inter- and intra-examiner reliability, and stride-to-stride reliability, of an accelerometer-based gait analysis system which measured 3D accelerations of the upper and lower body during self-selected slow, preferred and fast walking speeds. Eight subjects attended two testing sessions in which accelerometers were attached to the head, neck, lower trunk, and right shank. In the initial testing session, two different examiners attached the accelerometers and performed the same testing procedures. A single examiner repeated the procedure in a subsequent testing session. All data were collected using a new wireless gait analysis system, which features near real-time data transmission via a Bluetooth network. Reliability for each testing condition (4 locations, 3 directions, 3 speeds) was quantified using a waveform similarity statistic known as the coefficient of multiple determination (CMD). CMD's ranged from 0.60 to 0.98 across all test conditions and were not significantly different for inter-examiner (0.86), intra-examiner (0.87), and stride-to-stride reliability (0.86). The highest repeatability for the effect of location, direction and walking speed were for the shank segment (0.94), the vertical direction (0.91) and the fast walking speed (0.91), respectively. Overall, these results indicate that a high degree of waveform repeatability was obtained using a new gait system under test-retest conditions involving single and dual examiners. Furthermore, differences in acceleration waveform repeatability associated with the reapplication of accelerometers were small in relation to normal motor variability.

  6. Reliability of Reagent Strips for Semi-quantitative Measurement of Glucosuria in a Neonatal Intensive Care Setting

    Directory of Open Access Journals (Sweden)

    Jolita Bekhof

    2014-12-01

    Conclusion: The reliability of the semi-quantitative measurement of glucosuria in newborn infants using reagent strips is good, even under the conditions of a NICU. Changes in the rating of reagent strips of more than one category are most likely to be beyond measurement error.

  7. Modeling and Forecasting (Un)Reliable Realized Covariances for More Reliable Financial Decisions

    DEFF Research Database (Denmark)

    Bollerslev, Tim; Patton, Andrew J.; Quaedvlieg, Rogier

    We propose a new framework for modeling and forecasting common financial risks based on (un)reliable realized covariance measures constructed from high-frequency intraday data. Our new approach explicitly incorporates the effect of measurement errors and time-varying attenuation biases into the c......We propose a new framework for modeling and forecasting common financial risks based on (un)reliable realized covariance measures constructed from high-frequency intraday data. Our new approach explicitly incorporates the effect of measurement errors and time-varying attenuation biases...

  8. Using Wireless Pedometers to Measure Children’s Physical Activity: How Reliable is the Fitbit Zip?

    Directory of Open Access Journals (Sweden)

    Tingting Xu

    2017-07-01

    Full Text Available The purpose of this study is to examine the reliability of wireless pedometers in measuring elementary school children’s physical activity. Activity measurement using a wireless pedometer Fitbit ZipTM was compared to activity measurement using Yamax Digi-WalkerTM SW701 for a group of randomly selected 25 children in Grades 3, 4, and 5. Fitbit ZipTM wireless pedometers were found to have an appropriate degree (Nunnally & Bernstein, 1994 of accuracy and reliability compared to the Yamax Digi-WalkerTM SW701 pedometer. The Fitbit ZipTM wireless pedometer collected more step counts than the Yamax Digi-WalkerTM SW701 pedometer; however, the difference was not statistically significant. Participants reported that they preferred wearing the Fitbit ZipTM to the Yamax Digi-WalkerTM SW701 because the Fitbit ZipTM was more comfortable to wear and less likely to fall off. Participants also reported being more motivated to move while wearing the Fitbit ZipTM.

  9. Method of administration of PROMIS scales did not significantly impact score level, reliability, or validity

    DEFF Research Database (Denmark)

    Bjorner, Jakob B; Rose, Matthias; Gandek, Barbara

    2014-01-01

    OBJECTIVES: To test the impact of the method of administration (MOA) on score level, reliability, and validity of scales developed in the Patient Reported Outcomes Measurement Information System (PROMIS). STUDY DESIGN AND SETTING: Two nonoverlapping parallel forms each containing eight items from......, no significant mode differences were found and all confidence intervals were within the prespecified minimal important difference of 0.2 standard deviation. Parallel-forms reliabilities were very high (ICC = 0.85-0.93). Only one across-mode ICC was significantly lower than the same-mode ICC. Tests of validity...... questionnaire (PQ), personal digital assistant (PDA), or personal computer (PC) and a second form by PC, in the same administration. Method equivalence was evaluated through analyses of difference scores, intraclass correlations (ICCs), and convergent/discriminant validity. RESULTS: In difference score analyses...

  10. The effect of the spatial positioning of items on the reliability of questionnaires measuring affect

    Directory of Open Access Journals (Sweden)

    Leigh Leo

    2016-08-01

    Full Text Available Orientation: Extant research has shown that the relationship between spatial location and affect may have pervasive effects on evaluation. In particular, experimental findings on embodied cognition indicate that a person is spatially orientated to position what is positive at the top and what is negative at the bottom (vertical spatial orientation, and to a lesser extent, to position what is positive on the left and what is negative on the right (horizontal spatial orientation. It is therefore hypothesised, that when there is congruence between a respondent’s spatial orientation (related to affect and the spatial positioning (layout of a questionnaire, the reliability will be higher than in the case of incongruence. Research purpose: The principal objective of the two studies reported here was to ascertain the extent to which congruence between a respondent’s spatial orientation (related to affect and the layout of the questionnaire (spatial positioning of questionnaire items may impact on the reliability of a questionnaire measuring affect. Motivation for the study: The spatial position of items on a questionnaire measuring affect may indirectly impact on the reliability of the questionnaire. Research approach, design and method: In both studies, a controlled experimental research design was conducted using a sample of university students (n = 1825. Major findings: In both experiments, evidence was found to support the hypothesis that greater congruence between a respondent’s spatial orientation (related to affect and the spatial positioning (layout of a questionnaire leads to higher reliability on a questionnaire measuring affect. Practical implications: These findings may serve to create awareness of the influence of the spatial positioning of items as a confounding variable in questionnaire design. Contribution/value-add: Overall, this research complements previous studies by confirming the metaphorical representation of affect and

  11. Validity of the Mexican version of the combined Foot Care Confidence / Foot-Care Behavior scale for diabetes

    Directory of Open Access Journals (Sweden)

    Jaime A. García-Inzunza

    2015-07-01

    Full Text Available OBJECTIVE: To 1 translate / transculturally adapt the original (English-language combined Foot Care Confidence Scale / Foot-Care Behavior instrument (FCCS-FCB to produce a Mexican-Spanish version and 2 determine its validity and reliability in a population with diabetes in Tijuana, Mexico. METHODS: The original FCCS-FCB was translated (and back-translated, the content validated (by a group of health professional experts, and the instrument applied to 304 patients 23-78 years old in diabetes support groups in Tijuana, Mexico. Internal consistency for the study constructs ("self-efficacy," and risk / preventive foot self-care behaviors was measured using Cronbach's alpha. The constructs were validated using principal component factor analysis. RESULTS: The Cronbach's alpha values for internal consistency were 0.782 for self-efficacy and 0.505 for behaviors. Based on the analysis, two factors explained 49.1% of the total variance for self-efficacy, and six factors explained 57.7% of the total variance for behaviors. The results were consistent with those for the original (English version of the FCCS-FCB. CONCLUSIONS: The Mexican version of the FCCS-FCB is a reliable and valid instrument recommended for use with Mexican-Spanish-speaking patients with diabetes.

  12. An analysis of confidence limit calculations used in AAPM Task Group No. 119

    International Nuclear Information System (INIS)

    Knill, Cory; Snyder, Michael

    2011-01-01

    Purpose: The report issued by AAPM Task Group No. 119 outlined a procedure for evaluating the effectiveness of IMRT commissioning. The procedure involves measuring gamma pass-rate indices for IMRT plans of standard phantoms and determining if the results fall within a confidence limit set by assuming normally distributed data. As stated in the TG report, the assumption of normally distributed gamma pass rates is a convenient approximation for commissioning purposes, but may not accurately describe the data. Here the authors attempt to better describe gamma pass-rate data by fitting it to different distributions. The authors then calculate updated confidence limits using those distributions and compare them to those derived using TG No. 119 method. Methods: Gamma pass-rate data from 111 head and neck patients are fitted using the TG No. 119 normal distribution, a truncated normal distribution, and a Weibull distribution. Confidence limits to 95% are calculated for each and compared. A more general analysis of the expected differences between the TG No. 119 method of determining confidence limits and a more time-consuming curve fitting method is performed. Results: The TG No. 119 standard normal distribution does not fit the measured data. However, due to the small range of measured data points, the inaccuracy of the fit has only a small effect on the final value of the confidence limits. The confidence limits for the 111 patient plans are within 0.1% of each other for all distributions. The maximum expected difference in confidence limits, calculated using TG No. 119's approximation and a truncated distribution, is 1.2%. Conclusions: A three-parameter Weibull probability distribution more accurately fits the clinical gamma index pass-rate data than the normal distribution adopted by TG No. 119. However, the sensitivity of the confidence limit on distribution fit is low outside of exceptional circumstances.

  13. Interobserver reliability when using the Van Herick method to measure anterior chamber depth

    Directory of Open Access Journals (Sweden)

    Ahmed Javed

    2017-01-01

    Conclusion: The Van Herick score has a good interobserver reliability for Grades 1 and 4; however, Grades 2 and 3 require further tests such as gonioscopy or ocular coherence tomography. Temporal and nasal scores demonstrated good agreement; therefore, if the nasal score cannot be measured due to nasal bridge size, the temporal can be used as an approximation.

  14. Validity and reliability of GPS and LPS for measuring distances covered and sprint mechanical properties in team sports.

    Science.gov (United States)

    Hoppe, Matthias W; Baumgart, Christian; Polglaze, Ted; Freiwald, Jürgen

    2018-01-01

    This study aimed to investigate the validity and reliability of global (GPS) and local (LPS) positioning systems for measuring distances covered and sprint mechanical properties in team sports. Here, we evaluated two recently released 18 Hz GPS and 20 Hz LPS technologies together with one established 10 Hz GPS technology. Six male athletes (age: 27±2 years; VO2max: 48.8±4.7 ml/min/kg) performed outdoors on 10 trials of a team sport-specific circuit that was equipped with double-light timing gates. The circuit included various walking, jogging, and sprinting sections that were performed either in straight-lines or with changes of direction. During the circuit, athletes wore two devices of each positioning system. From the reported and filtered velocity data, the distances covered and sprint mechanical properties (i.e., the theoretical maximal horizontal velocity, force, and power output) were computed. The sprint mechanical properties were modeled via an inverse dynamic approach applied to the center of mass. The validity was determined by comparing the measured and criterion data via the typical error of estimate (TEE), whereas the reliability was examined by comparing the two devices of each technology (i.e., the between-device reliability) via the coefficient of variation (CV). Outliers due to measurement errors were statistically identified and excluded from validity and reliability analyses. The 18 Hz GPS showed better validity and reliability for determining the distances covered (TEE: 1.6-8.0%; CV: 1.1-5.1%) and sprint mechanical properties (TEE: 4.5-14.3%; CV: 3.1-7.5%) than the 10 Hz GPS (TEE: 3.0-12.9%; CV: 2.5-13.0% and TEE: 4.1-23.1%; CV: 3.3-20.0%). However, the 20 Hz LPS demonstrated superior validity and reliability overall (TEE: 1.0-6.0%; CV: 0.7-5.0% and TEE: 2.1-9.2%; CV: 1.6-7.3%). For the 10 Hz GPS, 18 Hz GPS, and 20 Hz LPS, the relative loss of data sets due to measurement errors was 10.0%, 20.0%, and 15.8%, respectively. This study shows that

  15. Confident failures: Lapses of working memory reveal a metacognitive blind spot.

    Science.gov (United States)

    Adam, Kirsten C S; Vogel, Edward K

    2017-07-01

    Working memory performance fluctuates dramatically from trial to trial. On many trials, performance is no better than chance. Here, we assessed participants' awareness of working memory failures. We used a whole-report visual working memory task to quantify both trial-by-trial performance and trial-by-trial subjective ratings of inattention to the task. In Experiment 1 (N = 41), participants were probed for task-unrelated thoughts immediately following 20% of trials. In Experiment 2 (N = 30), participants gave a rating of their attentional state following 25% of trials. Finally, in Experiments 3a (N = 44) and 3b (N = 34), participants reported confidence of every response using a simple mouse-click judgment. Attention-state ratings and off-task thoughts predicted the number of items correctly identified on each trial, replicating previous findings that subjective measures of attention state predict working memory performance. However, participants correctly identified failures on only around 28% of failure trials. Across experiments, participants' metacognitive judgments reliably predicted variation in working memory performance but consistently and severely underestimated the extent of failures. Further, individual differences in metacognitive accuracy correlated with overall working memory performance, suggesting that metacognitive monitoring may be key to working memory success.

  16. The reliability of the Associate Platinum digital foot scanner in measuring previously developed footprint characteristics: a technical note.

    Science.gov (United States)

    Papuga, M Owen; Burke, Jeanmarie R

    2011-02-01

    An ink pad and paper, pressure-sensitive platforms, and photography have previously been used to collect footprint data used in clinical assessment. Digital scanners have been widely used more recently to collect such data. The purpose of this study was to evaluate the intra- and interrater reliability of a flatbed digital image scanning technology to capture footprint data. This study used a repeated-measures design on 32 (16 male 16 female) healthy subjects. The following measured indices of footprint were recorded from 2-dimensional images of the plantar surface of the foot recorded with an Associate Platinum (Foot Levelers Inc, Roanoke, VA) digital foot scanner: Staheli index, Chippaux-Smirak index, arch angle, and arch index. Intraclass correlation coefficient (ICC) values were calculated to evaluate intrarater, interday, and interclinician reliability. The ICC values for intrarater reliability were greater than or equal to .817, indicating an excellent level of reproducibility in assessing the collected images. Analyses of variance revealed that there were no significant differences between raters for each index (P > .05). The ICC values also indicated excellent reliability (.881-.971) between days and clinicians in all but one of the indices of footprint, arch angle (.689), with good reliability between clinicians. The full-factorial analysis of variance model did not reveal any interaction effects (P > .05), which indicated that indices of footprint were not changing across days and clinicians. Scanning technology used in this study demonstrated good intra- and interrater reliability measurements of footprint indices, as demonstrated by high ICC values. Copyright © 2011 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.

  17. Confiabilidade da medida de espessuras musculares pela ultrassonografia Reliability of muscle thickness measurements using ultrasound

    Directory of Open Access Journals (Sweden)

    Paulo Sergio Chagas Gomes

    2010-02-01

    Full Text Available OBJETIVO: Determinar a confiabilidade das medidas de espessuras dos músculos flexores e extensores do cotovelo e joelho pela ultrassonografia (US, quantificando o erro típico associado a essas medidas (ETM. MÉTODOS: A confiabilidade (duas medidas interdias foi determinada em 15 voluntários aparentemente saudáveis (oito mulheres, 33,9 ± 11,4 anos, 76 ± 21kg, 170 ± 10cm. As imagens da musculatura flexora (FC e extensora do cotovelo (EC e flexora (FJ e extensora do joelho (EJ foram obtidas pela US bidimensional no modo B, utilizando transdutor de 7,5MHz. As espessuras do tecido muscular compreendidas entre as interfaces com o osso e com o tecido adiposo foram medidas em sítios anatômicos identificados e registrados para ser repetidos na segunda medida. RESULTADOS: A ANOVA não identificou diferenças significativas entre as medidas repetidas. Os coeficientes de correlação intraclasse foram FC = 0,970, EC = 0,971, FJ = 0,555 e EJ = 0,929 (P PURPOSE: To determine the reliability of muscle thickness measurements of elbow and knee flexors and extensors using ultrasound, and to quantify the typical error associated to the measurements (TEM. METHODS: The test-retest reliability was determined in 15 apparently healthy volunteers (8 women, 34 ± 11 years, 76 ± 21 kg, 170 ± 10 cm. The images of elbow flexors (EF and extensors (EE and knee flexors (KF and extensors (KE were obtained using a two dimensional mode B ultrasound instrument with a 7.5 MHz transducer. Muscle thickness between the adipose tissue and bone interfaces were measured at anatomical landmarks previously identified and recorded to assure the exact site for the retest. RESULTS: ANOVA did not identify any significant differences between the repeated measurements. Intraclass correlation coefficients (ICC of each pair of measure were EF = 0.970, EE = 0.971, KF = 0.555 e KE = 0.929 (P < 0.05 for all. The coefficients of variation were 3.9 %, 6.1 %, 6.6 % e 4.6 %, and TEM 1.3 mm, 1

  18. Reliability data banks

    International Nuclear Information System (INIS)

    Cannon, A.G.; Bendell, A.

    1991-01-01

    Following an introductory chapter on Reliability, what is it, why it is needed, how it is achieved and measured, the principles of reliability data bases and analysis methodologies are the subject of the next two chapters. Achievements due to the development of data banks are mentioned for different industries in the next chapter, FACTS, a comprehensive information system for industrial safety and reliability data collection in process plants are covered next. CREDO, the Central Reliability Data Organization is described in the next chapter and is indexed separately, as is the chapter on DANTE, the fabrication reliability Data analysis system. Reliability data banks at Electricite de France and IAEA's experience in compiling a generic component reliability data base are also separately indexed. The European reliability data system, ERDS, and the development of a large data bank come next. The last three chapters look at 'Reliability data banks, - friend foe or a waste of time'? and future developments. (UK)

  19. The Reliability of Electronic Health Record Data Used for Obstetrical Research.

    Science.gov (United States)

    Altman, Molly R; Colorafi, Karen; Daratha, Kenn B

    2018-01-01

    Hospital electronic health record (EHR) data are increasingly being called upon for research purposes, yet only recently has it been tested to examine its reliability. Studies that have examined reliability of EHR data for research purposes have varied widely in methods used and field of inquiry, with little reporting of the reliability of perinatal and obstetric variables in the current literature. To assess the reliability of data extracted from a commercially available inpatient EHR as compared with manually abstracted data for common attributes used in obstetrical research. Data extracted through automated EHR reports for 3,250 women who delivered a live infant at a large hospital in the Pacific Northwest were compared with manual chart abstraction for the following perinatal measures: delivery method, labor induction, labor augmentation, cervical ripening, vertex presentation, and postpartum hemorrhage. Almost perfect agreement was observed for all four modes of delivery (vacuum assisted: kappa = 0.92; 95% confidence interval [CI] = 0.88-0.95, forceps assisted: kappa = 0.90; 95%CI = 0.76-1.00, cesarean delivery: kappa = 0.91; 95%CI = 0.90-0.93, and spontaneous vaginal delivery: kappa = 0.91; 95%CI = 0.90-0.93). Cervical ripening demonstrated substantial agreement (kappa = 0.77; 95%CI = 0.73-0.80); labor induction (kappa = 0.65; 95%CI = 0.62-0.68) and augmentation (kappa = 0.54; 95%CI = 0.49-0.58) demonstrated moderate agreement between the two data sources. Vertex presentation (kappa = 0.35; 95%CI = 0.31-0.40) and post-partum hemorrhage (kappa = 0.21; 95%CI = 0.13-0.28) demonstrated fair agreement. Our study demonstrates variability in the reliability of obstetrical data collected and reported through the EHR. While delivery method was satisfactorily reliable in our sample, other examined perinatal measures were less so when compared with manual chart abstraction. The use of multiple

  20. Estimating the reliability of eyewitness identifications from police lineups.

    Science.gov (United States)

    Wixted, John T; Mickes, Laura; Dunn, John C; Clark, Steven E; Wells, William

    2016-01-12

    Laboratory-based mock crime studies have often been interpreted to mean that (i) eyewitness confidence in an identification made from a lineup is a weak indicator of accuracy and (ii) sequential lineups are diagnostically superior to traditional simultaneous lineups. Largely as a result, juries are increasingly encouraged to disregard eyewitness confidence, and up to 30% of law enforcement agencies in the United States have adopted the sequential procedure. We conducted a field study of actual eyewitnesses who were assigned to simultaneous or sequential photo lineups in the Houston Police Department over a 1-y period. Identifications were made using a three-point confidence scale, and a signal detection model was used to analyze and interpret the results. Our findings suggest that (i) confidence in an eyewitness identification from a fair lineup is a highly reliable indicator of accuracy and (ii) if there is any difference in diagnostic accuracy between the two lineup formats, it likely favors the simultaneous procedure.