conservativeness level , the conservative probability of failure obtained from Section 4 must be maintained. The mathematical formulation of conservative model... CIE 2015 August 2-5, 2015, Boston, Massachusetts, USA [DRAFT] DETC2015-46982 DEVELOPMENT OF A CONSERVATIVE MODEL VALIDATION APPROACH FOR RELIABLE...PDF and a probability of failure are selected from these predicted output PDFs at a user-specified conservativeness level for validation. For
Previous research funded by Florida Department of Transportation (FDOT) developed a method for estimating : travel time reliability for arterials. This method was not initially implemented or validated using field data. This : project evaluated and r...
Travel time reliability (TTR) has been proposed as : a better measure of a facilitys performance than : a statistical measure like peak hour demand. TTR : is based on more information about average traffic : flows and longer time periods, thus inc...
Jacobsen, Stine Lindahl
The paper will present a phd study concerning reliability and validity of music therapy assessment model “Assessment of Parenting Competences” (APC) in the area of families with emotionally neglected children. This study had a multiple strategy design with a philosophical base of critical realism...... and pragmatism. The fixed design for this study was a between and within groups design in testing the APCs reliability and validity. The two different groups were parents with neglected children and parents with non-neglected children. The flexible design had a multiple case study strategy specifically...... with interplay of turns between parent and child as the case under study comparing clinical and non-clinical groups and looking for differences in patterns of interaction. The flexible design informed the fixed design and led to further valuable statistical analysis. The presenter will provide an overview...
Gustavo Adolfo Watanabe-Kanno
Full Text Available The aim of this study was to determine the reproducibility, reliability and validity of measurements in digital models compared to plaster models. Fifteen pairs of plaster models were obtained from orthodontic patients with permanent dentition before treatment. These were digitized to be evaluated with the program Cécile3 v2.554.2 beta. Two examiners measured three times the mesiodistal width of all the teeth present, intercanine, interpremolar and intermolar distances, overjet and overbite. The plaster models were measured using a digital vernier. The t-Student test for paired samples and interclass correlation coefficient (ICC were used for statistical analysis. The ICC of the digital models were 0.84 ± 0.15 (intra-examiner and 0.80 ± 0.19 (inter-examiner. The average mean difference of the digital models was 0.23 ± 0.14 and 0.24 ± 0.11 for each examiner, respectively. When the two types of measurements were compared, the values obtained from the digital models were lower than those obtained from the plaster models (p < 0.05, although the differences were considered clinically insignificant (differences < 0.1 mm. The Cécile digital models are a clinically acceptable alternative for use in Orthodontics.
Wouda, Jan C.; Zandbelt, Linda C.; Smets, Ellen M. A.; van de Wiel, Harry B. M.
Establish the inter-rater reliability and the concept, convergent and construct validity of an instrument for assessing the competency of physicians in patient education. Three raters assessed the quality of patient education in 30 outpatient consultations with the CELI instrument. This instrument
Wouda, Jan C.; Zandbelt, Linda C.; Smets, Ellen M. A.; van de Wiel, Harry B. M.
Objective: Establish the inter-rater reliability and the concept, convergent and construct validity of an instrument for assessing the competency of physicians in patient education. Methods: Three raters assessed the quality of patient education in 30 outpatient consultations with the CELI
Alexandre dit Sandretto, Julien
Model-Predictive Control (MPC) is one of the most advanced control technique nowadays. Indeed,MPC approaches are well known for their robustness and stability properties. Nevertheless, NonlinearModel-Predictive Control (NMPC), the extension of MPC in the nonlinear world, still poses challenging theoretical, computationaland implementation issues. By the help of validated simulation, which can handle nonlinear models, a new algorithmfor a robust by-construction control strategy based on NMPC i...
Stanley, Leanne M.; Edwards, Michael C.
The purpose of this article is to highlight the distinction between the reliability of test scores and the fit of psychometric measurement models, reminding readers why it is important to consider both when evaluating whether test scores are valid for a proposed interpretation and/or use. It is often the case that an investigator judges both the…
Czarnota, Judith; Hey, Jeremias; Fuhrmann, Robert
The purpose of this work was to determine the reliability and validity of measurements performed on digital models with a desktop scanner and analysis software in comparison with measurements performed manually on conventional plaster casts. A total of 20 pairs of plaster casts reflecting the intraoral conditions of 20 fully dentate individuals were digitized using a three-dimensional scanner (D700; 3Shape). A series of defined parameters were measured both on the resultant digital models with analysis software (Ortho Analyzer; 3Shape) and on the original plaster casts with a digital caliper (Digimatic CD-15DCX; Mitutoyo). Both measurement series were repeated twice and analyzed for intrarater reliability based on intraclass correlation coefficients (ICCs). The results from the digital models were evaluated for their validity against the casts by calculating mean-value differences and associated 95 % limits of agreement (Bland-Altman method). Statistically significant differences were identified via a paired t test. Significant differences were obtained for 16 of 24 tooth-width measurements, for 2 of 5 sites of contact-point displacement in the mandibular anterior segment, for overbite, for maxillary intermolar distance, for Little's irregularity index, and for the summation indices of maxillary and mandibular incisor width. Overall, however, both the mean differences between the results obtained on the digital models versus on the plaster casts and the dispersion ranges associated with these differences suggest that the deviations incurred by the digital measuring technique are not clinically significant. Digital models are adequately reproducible and valid to be employed for routine measurements in orthodontic practice.
Guan, Jianmin; McBride, Ron; Xiang, Ping
Although empirical research in academic areas provides support for both a 3-factor as well as a 4-factor achievement goal model, both models were proposed and tested with a collegiate sample. Little is known about the generalizability of either model with high school level samples. This study was designed to examine whether the 3-factor model…
Hopkins, Kenneth D.; And Others
Study was designed to provide some indication as to whether the gain in test reliability from confidence scoring results from relevant or irrelevant sources. If the increase in reliability is the result of a gambling response style, it is conceivable that validity could actually decrease even though reliability is increased. (Authors/CB)
Hoffman, William M.; Riley, Matthew E.; Spencer, Benjamin W.
In nuclear light water reactors (LWRs), the reactor coolant, core and shroud are contained within a massive, thick walled steel vessel known as a reactor pressure vessel (RPV). Given the tremendous size of these structures, RPVs typically contain a large population of pre-existing flaws introduced in the manufacturing process. After many years of operation, irradiation-induced embrittlement makes these vessels increasingly susceptible to fracture initiation at the locations of the pre-existing flaws. Because of the uncertainty in the loading conditions, flaw characteristics and material properties, probabilistic methods are widely accepted and used in assessing RPV integrity. The Fracture Analysis of Vessels – Oak Ridge (FAVOR) computer program developed by researchers at Oak Ridge National Laboratory is widely used for this purpose. This program can be used in order to perform deterministic and probabilistic risk-informed analyses of the structural integrity of an RPV subjected to a range of thermal-hydraulic events. FAVOR uses a one-dimensional representation of the global response of the RPV, which is appropriate for the beltline region, which experiences the most embrittlement, and employs an influence coefficient technique to rapidly compute stress intensity factors for axis-aligned surface-breaking flaws. The Grizzly code is currently under development at Idaho National Laboratory (INL) to be used as a general multiphysics simulation tool to study a variety of degradation mechanisms in nuclear power plant components. The first application of Grizzly has been to study fracture in embrittled RPVs. Grizzly can be used to model the thermo-mechanical response of an RPV under transient conditions observed in a pressurized thermal shock (PTS) scenario. The global response of the vessel provides boundary conditions for local 3D models of the material in the vicinity of a flaw. Fracture domain integrals are computed to obtain stress intensity factors, which can in
Wiranto, Matthew G.; Engelbrecht, W. Petrie; Nolthenius, Heleen E. Tutein; van der Meer, W. Joerd; Ren, Yijin
INTRODUCTION: Digital 3-dimensional models are widely used for orthodontic diagnosis. The aim of this study was to assess the validity, reliability, and reproducibility of digital models obtained from the Lava Chairside Oral scanner (3M ESPE, Seefeld, Germany) and cone-beam computed tomography scans
Aven, Terje; Heide, Bjornar
In this paper we investigate to what extent risk analysis meets the scientific quality requirements of reliability and validity. We distinguish between two types of approaches within risk analysis, relative frequency-based approaches and Bayesian approaches. The former category includes both traditional statistical inference methods and the so-called probability of frequency approach. Depending on the risk analysis approach, the aim of the analysis is different, the results are presented in different ways and consequently the meaning of the concepts reliability and validity are not the same.
Yasin, Siti Munira; Taib, Khairul Mizan; Zaki, Rafdzah Ahmad
The transtheoretical model (TTM) has been used as one of the major constructs in developing effective cognitive behavioural interventions for smoking cessation and relapse prevention, in Western societies. This study aimed to examine the reliability and construct validity of the translated Bahasa Malaysia version of TTM questionnaire among adult smokers in Klang Valley, Malaysia. The sample consisted of 40 smokers from four different worksites in Klang Valley. A 26-item TTM questionnaire was administered, and a similar set one week later. The questionnaire consisted of three measures; decisional balance, temptations and impact of smoking. Construct validity was measured by factor analysis and the reliability by Cronbach' s alpha (internal consistency) and test-retest correlation. Results revealed that Cronbach' s alpha coefficients for the items were: decisional balance (0.84; 0.74) and temptations (0.89; 0.54; 0.85). The values for test retest correlation were all above 0.4. In addition, factor analysis suggested two meaningful common factors for decisional balance and three for temptations. This is consistent with the original construct of the TTM questionnaire. Overall results demonstrated that construct validity and reliability were acceptable for all items. In conclusion, the Bahasa Malaysia version of TTM questionnaire is a reliable and valid tool in ass.
Özpınar, Saliha; Taner, Şafak; Yıldırım, Gülay; Mahleç Anar, Ceyda; Altıparmak, Osman; Baydur, Hakan
In many health conditions, stigma is receiving increasing attention. Public stigmatization toward social illness can affect particularly the patients and family memberships to help seeking behavior and treatment. This study, the aim of this study was to investigate the validity and reliability of the Turkish "Stigma of Tuberculosis Scale " which was developed to evaluate of perception of stigma with tuberkulosis patient. This methodological study was conducted with 150 with tuberculosis disease people who above 18 age and without known psychological and mental disability. In the study, "Stigma of Tuberculosis Scale" was used as data collection tool. During the study, language equivalence, content validity, reliability and construct validity of the scale was performed. The data was assessed by using mean, median, standard deviation, Spearman Correlation, Cronbach Alpha coefficient and confirmatory factor analysis. The mean age of study participants was 45.6 ± 16.1 (20 - 85). years. Spearman correlation coefficient of the scale for test-retest reliability was 0.853 and the Cronbach Alpha coefficient was 0.95. According to fit indexes of confirmatory factor analysis [x2/SD= 481.38/230= 2.09; RMSEA= 0.087; GFI= 0.776; CFI= 0.776; NNFI= 0.875] it was found that one factors were appropriate for the scale. The adoption of the translated "Stigma of Tuberculosis Scale"in Turkey is found reliable and valid to evaluate of perception of stigma with tuberkulosis patient.
Doctor, S.R.; Deffenbaugh, J.D.; Good, M.S.; Green, E.R.; Heasler, P.G.; Hutton, P.H.; Reid, L.D.; Simonen, F.A.; Spanner, J.C.; Vo, T.V.
This paper reports on progress for three programs: (1) evaluation and improvement in nondestructive examination reliability for inservice inspection of light water reactors (LWR) (NDE Reliability Program), (2) field validation acceptance, and training for advanced NDE technology, and (3) evaluation of computer-based NDE techniques and regional support of inspection activities. The NDE Reliability Program objectives are to quantify the reliability of inservice inspection techniques for LWR primary system components through independent research and establish means for obtaining improvements in the reliability of inservice inspections. The areas of significant progress will be described concerning ASME Code activities, re-analysis of the PISC-II data, the equipment interaction matrix study, new inspection criteria, and PISC-III. The objectives of the second program are to develop field procedures for the AE and SAFT-UT techniques, perform field validation testing of these techniques, provide training in the techniques for NRC headquarters and regional staff, and work with the ASME Code for the use of these advanced technologies. The final program's objective is to evaluate the reliability and accuracy of interpretation of results from computer-based ultrasonic inservice inspection systems, and to develop guidelines for NRC staff to monitor and evaluate the effectiveness of inservice inspections conducted on nuclear power reactors. This program started in the last quarter of FY89, and the extent of the program was to prepare a work plan for presentation to and approval from a technical advisory group of NRC staff
Pigford, T.H.; Chambre, P.L.
The objective of predicting long-term performance should be to make reliable determinations of whether the prediction falls within the criteria for acceptable performance. Establishing reliable predictions of long-term performance of a waste repository requires emphasis on valid theories to predict performance. The validation process must establish the validity of the theory, the parameters used in applying the theory, the arithmetic of calculations, and the interpretation of results; but validation of such performance predictions is not possible unless there are clear criteria for acceptable performance. Validation programs should emphasize identification of the substantive issues of prediction that need to be resolved. Examples relevant to waste package performance are predicting the life of waste containers and the time distribution of container failures, establishing the criteria for defining container failure, validating theories for time-dependent waste dissolution that depend on details of the repository environment, and determining the extent of congruent dissolution of radionuclides in the UO 2 matrix of spent fuel. Prediction and validation should go hand in hand and should be done and reviewed frequently, as essential tools for the programs to design and develop repositories. 29 refs
This report includes three papers as follows: : 1. Guo F., Rakha H., and Park S. (2010), "A Multi-state Travel Time Reliability Model," : Transportation Research Record: Journal of the Transportation Research Board, n 2188, : pp. 46-54. : 2. Park S.,...
Bakker, Leonhard A; Schröder, Carin D; van Es, Michael A; Westers, Paul; Visser-Meily, Johanna M A; van den Berg, Leonard H
The amyotrophic lateral sclerosis functional rating scale-revised (ALSFRS-R) is a widely used primary outcome measure in amyotrophic lateral sclerosis (ALS) clinical practice and clinical trials. ALSFRS-R items cannot, however, validly be summed to obtain a total score, but constitute domain scores reflecting a profile of disease severity. Currently, there are different measurement models for estimating domain scores. The objective of the present study is, therefore, to derive the measurement model that best fits the data for a valid and uniform estimation of ALSFRS-R domain scores. Data from 1556 patients with ALS were obtained from a population-based register in The Netherlands. A random split of the sample provided a calibration and validation set. Measurement models of the ALSFRS-R were investigated using both exploratory factor analyses and confirmatory factor analyses. The measurement model with a four-factor structure (i.e., bulbar, fine motor, gross motor, and respiratory function), with correlated factors and cross-loading items on dressing and hygiene and turning in bed and adjusting bed clothes on both motor function scales, provided the best fit to the data in both sets. Correlation between factors ranged from weak to modest, confirming that the ALSFRS-R constitutes a profile of four clinically relevant domain scores rather than a total score that expresses disease severity. The internal consistency of the four domain scores was satisfactory. Our revision of the measurement model may allow for a more adequate estimation of disease severity and disease progression in epidemiological studies and clinical trials.
Full Text Available Background: Today it is virtually impossible to operate alone on the international level in the logistics business. This promotes the establishment and development of new integrated business entities - logistic operators. However, such cooperation within a supply chain creates also many problems related to the supply chain reliability as well as the optimization of the supplies planning. The aim of this paper was to develop and formulate the mathematical model and algorithms to find the optimum plan of supplies by using economic criterion and the model for the probability evaluating of non-failure operation of supply chain. Methods: The mathematical model and algorithms to find the optimum plan of supplies were developed and formulated by using economic criterion and the model for the probability evaluating of non-failure operation of supply chain. Results and conclusions: The problem of ensuring failure-free performance of goods supply channel analyzed in the paper is characteristic of distributed network systems that make active use of business process outsourcing technologies. The complex planning problem occurring in such systems that requires taking into account the consumer's requirements for failure-free performance in terms of supply volumes and correctness can be reduced to a relatively simple linear programming problem through logical analysis of the structures. The sequence of the operations, which should be taken into account during the process of the supply planning with the supplier's functional reliability, was presented.
Zeeman, Jacqueline M; McLaughlin, Jacqueline E; Cox, Wendy C
With increased emphasis placed on non-academic skills in the workplace, a need exists to identify an admissions process that evaluates these skills. This study assessed the validity and reliability of an application review process involving three dedicated application reviewers in a multi-stage admissions model. A multi-stage admissions model was utilized during the 2014-2015 admissions cycle. After advancing through the academic review, each application was independently reviewed by two dedicated application reviewers utilizing a six-construct rubric (written communication, extracurricular and community service activities, leadership experience, pharmacy career appreciation, research experience, and resiliency). Rubric scores were extrapolated to a three-tier ranking to select candidates for on-site interviews. Kappa statistics were used to assess interrater reliability. A three-facet Many-Facet Rasch Model (MFRM) determined reviewer severity, candidate suitability, and rubric construct difficulty. The kappa statistic for candidates' tier rank score (n = 388 candidates) was 0.692 with a perfect agreement frequency of 84.3%. There was substantial interrater reliability between reviewers for the tier ranking (kappa: 0.654-0.710). Highest construct agreement occurred in written communication (kappa: 0.924-0.984). A three-facet MFRM analysis explained 36.9% of variance in the ratings, with 0.06% reflecting application reviewer scoring patterns (i.e., severity or leniency), 22.8% reflecting candidate suitability, and 14.1% reflecting construct difficulty. Utilization of dedicated application reviewers and a defined tiered rubric provided a valid and reliable method to effectively evaluate candidates during the application review process. These analyses provide insight into opportunities for improving the application review process among schools and colleges of pharmacy. Copyright © 2017 Elsevier Inc. All rights reserved.
Full Text Available Negative attitudes of mothers towards their infant is conceptualized as postpartum bonding disorder, which leads to serious health problems in perinatal health care. However, its measurement still remains to be standardized. Our aim was to examine and confirm the psychometric properties of the Postnatal Bonding Questionnaire (PBQ in Japanese mothers. We distributed a set of questionnaires to community mothers and studied 392 mothers who returned the questionnaires at 1 month after childbirth. Our model was compared with three other models derived from previous studies. In a randomly halved sample, an exploratory factor analysis yielded a three-factor structure: Anger and Restrictedness, Lack of Affection, and Rejection and Fear. This factor structure was cross-validated by a confirmatory factor analysis using the other halved sample. The three subscales showed satisfactory internal consistency. The three PBQ subscale scores were correlated with depression and psychological abuse scores. Their test–retest reliability between day 5 and 1 month after childbirth was measured by intraclass correlation coefficients between 0.76 and 0.83. The Akaike Information Criteria of our model was better than the original four-factor model of Brockington. The present study indicates that the PBQ is a reliable and valid measure of bonding difficulties of Japanese mothers with neonates.
Buri, Hilary M; Daly, Jeanette M; Jogerst, Gerald J
(a) To identify reliable and valid questions that identify elder abuse, (b) to assess the reliability and validity of extant self-reported elder abuse screens in a high-risk elderly population, and (c) to describe difficulties of completing and interpreting screens in a high-need elderly population. All elders referred to research-trained social workers in a community service agency were asked to participate. Of the 70 elders asked, 49 participated, 44 completed the first questionnaire, and 32 completed the duplicate second questionnaire. A research assistant administered the telephone questionnaires. Twenty-nine (42%) persons were judged abused, 12 (17%) had abuse reported, and 4 (6%) had abuse substantiated. The elder abuse screen instruments were not found to be predictive of assessed abuse or as predictors of reported abuse; the measures tended toward being inversely predictive. Two questions regarding harm and taking of belongings were significantly different for the assessed abused group. In this small group of high-need community-dwelling elders, the screens were not effective in discriminating between abused and nonabused groups. Better instruments are needed to assess for elder abuse.
Dimitrov, Nikolay Krasimirov; Friis-Hansen, Peter; Berggreen, Christian
Reliability analysis of fiber-reinforced composite structures is a relatively unexplored field, and it is therefore expected that engineers and researchers trying to apply such an approach will meet certain challenges until more knowledge is accumulated. While doing the analyses included in the p......Reliability analysis of fiber-reinforced composite structures is a relatively unexplored field, and it is therefore expected that engineers and researchers trying to apply such an approach will meet certain challenges until more knowledge is accumulated. While doing the analyses included...
components. Thus, models of reliability should be developed and applied in order to quantify the residual life of the components. Damage models based on physics of failure combined with stochastic models describing the uncertain parameters are imperative for development of cost-optimal decision tools...... for Operation & Maintenance planning. Concentrating efforts on development of such models, this research is focused on reliability modeling of Wind Turbine critical subsystems (especially the power converter system). For reliability assessment of these components, structural reliability methods are applied....... Further, reliability modeling of load sharing systems is considered and a theoretical model is proposed based on sequential order statistics and structural systems reliability methods. Procedures for reliability estimation are detailed and presented in a collection of research papers....
Shi, Wenzhong; Zhang, Xiaokang; Hao, Ming; Shao, Pan; Cai, Liping; Lyu, Xuzhe
Validation of land cover products is a fundamental task prior to data applications. Current validation schemes and methods are, however, suited only for assessing classification accuracy and disregard the reliability of land cover products. The reliability evaluation of land cover products should be undertaken to provide reliable land cover information. In addition, the lack of high-quality reference data often constrains validation and affects the reliability results of land cover products. ...
Carey, Mark A; Laird, Daniel E; Murray, Keith A; Stevenson, John R
The purposes of this study were to establish the validity of a digital goniometer (DG) prototype, determine the inter- and intra-rater reliability of the DG as compared to the universal goniometer (UG), and evaluate and describe the clinical usability of the DG as a measurement tool. Eighteen healthy patient models and 5 physical therapists volunteered to participate. Reliability testing of both the UG and DG was determined from the measurements of 5 therapists who performed 2 randomized, repeated measures of 5 joint motions on each of 6 patient models during 3 data collection sessions. To determine the validity of the DG, the investigators measured 8 randomly selected, computer-aided design (CAD) angles during each of the 3 data collection sessions. Physical therapist participants completed pre-/post-surveys to assess perceptions of clinical usability of the DG. No statistically significant differences were found between devices. Comparison of the DG and UG demonstrated no difference for intra- or inter-rater reliability, with the DG yielding higher inter-rater ICC values for each of the 5 motions measured. The DG has adequate concurrent criterion-related validity as a tool for assessment of joint ROM and equivalent inter- and intra-rater reliability to the UG. User surveys indicated that several of the novel features of the DG contributed to a higher likelihood that the device would be utilized by clinicians.
Bates-Jensen, B M; Vredevoe, D L; Brecht, M L
There is not yet a universal system for describing the status of pressure sores. The purpose of this study was to assess the validity and reliability of an instrument developed by the researchers for evaluation of pressure sores, the Pressure Sore Status Tool (PSST). This study was part of a larger study, which included development of a theoretical model for creation of items for the PSST. A nine-member expert judge panel established content validity of items on the instrument. Data were analyzed using a content of validity index (average index for tool = .91) and judges' comments were used to modify two items on the PSST. Two Enterostomal Therapy (ET) nurses independently used the revised tool to rate 20 pressure sores on ten adult medical-surgical patients, at two observation times. Interrater reliability was established at r = .91 for first observation and r = .92 for the second observation (p < .001). Intrarater reliability was r = .99 for rater one and r = .96 for rater two (p < .001). Future research will focus on refinement and further reliability testing of the instrument.
Haradhan Kumar Mohajan
Reliability and validity are the two most important and fundamental features in the evaluation of any measurement instrument or tool for a good research. The purpose of this research is to discuss the validity and reliability of measurement instruments that are used in research. Validity concerns what an instrument measures, and how well it does so. Reliability concerns the faith that one can have in the data obtained from the use of an instrument, that is, the degree to which any measuring t...
Reitz, Orin Edward
A cross sectional correlational mailed survey was used to assess the validity and reliability of the Job Embeddedness (JE) instrument for predicting nurse retention. A regression model using the variables of JE and age were predictive of nurse retention in a sample of Registered Nurses (RNs) working in long term care (LTC) settings. This study supports the multi-dimensionality, validity and reliability of the underlying JE construct. However a different factor structure was suggested that better fit the data. Long term care organizations (LTC) may benefit from a strategy of increasing the embeddedness of their existing employees in order to increase nurse retention. Job embbededness could be used as a foundation to provide effective nurse retention strategies in LTC settings. Copyright © 2014 Elsevier Inc. All rights reserved.
Haradhan Kumar Mohajan
Full Text Available Reliability and validity are two most important and fundamental features in the evaluation of any measurement instrument or toll for a good research. The purpose of this research is to discuss the validity and reliability of measurement instruments that are used in research. Validity concerns what an instrument measures, and how well it does so. Reliability concerns the faith that one can have in the data obtained from use of an instrument, that is, the degree to which any measuring tool controls for random error. An attempt has been taken here to review the reliability and validity, and threat to them in some details.
Olesen, Jes; Jansen-Olesen, Inger
The pharmaceutical industry shows a decreasing interest in the development of drugs for migraine. One of the reasons for this could be the lack of reliable animal models for studying the effect of acute and prophylactic migraine drugs. The infusion of glyceryl trinitrate (GTN) is the best validated...... and most studied human migraine model. Several attempts have been made to transfer this model to animals. The different variants of this model are discussed as well as other recent models....
Nielsen, Julie Bøjstrup; Kyvsgaard, Julie Nyholm; Sildorf, Stine Møller
and diabetes-specific validated multinational DISABKIDS® questionnaires into Danish, and then determine their validity and reliability. Methods: The questionnaires were translated using a validated translation procedure and completed by 99 children and adolescents from our diabetes-department; all diagnosed......Background: Type 1 Diabetes (T1D) has a negative impact on psychological and overall well-being. Screening for Health-related Quality of Life (HrQoL) and addressing HrQoL issues in the clinic leads to improved well-being and metabolic outcomes. The aim of this study was to translate the generic...
Russell, D W
In this article I evaluated the psychometric properties of the UCLA Loneliness Scale (Version 3). Using data from prior studies of college students, nurses, teachers, and the elderly, analyses of the reliability, validity, and factor structure of this new version of the UCLA Loneliness Scale were conducted. Results indicated that the measure was highly reliable, both in terms of internal consistency (coefficient alpha ranging from .89 to .94) and test-retest reliability over a 1-year period (r = .73). Convergent validity for the scale was indicated by significant correlations with other measures of loneliness. Construct validity was supported by significant relations with measures of the adequacy of the individual's interpersonal relationships, and by correlations between loneliness and measures of health and well-being. Confirmatory factor analyses indicated that a model incorporating a global bipolar loneliness factor along with two method factor reflecting direction of item wording provided a very good fit to the data across samples. Implications of these results for future measurement research on loneliness are discussed.
The aim of this research is to develop the Mobbing Scale and examine its validity and reliability. The sample of the study consisted of 515 persons from Sakarya and Bursa. In this study, construct validity, internal consistency, test-retest reliability, and item analysis of the scale were examined. As a result of factor analysis for construct…
Stockbrugger, Barry A.; Haennel, Robert G.
Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…
Background: Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented however there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even ...
Badjadi, Nour El Imane
The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Osadebe, P. U.
The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
Alkhamra, Rana A.; Al-Jazi, Aya B.
Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…
The paper considers the state of operator modelling in reliability analysis. Operator models are needed in reliability analysis because operators are needed in process control systems. HRA methods must therefore be able to account both for human performance variability and for the dynamics of the interaction. A selected set of first generation HRA approaches is briefly described in terms of the operator model they use, their classification principle, and the actual method they propose. In addition, two examples of second generation methods are also considered. It is concluded that first generation HRA methods generally have very simplistic operator models, either referring to the time-reliability relationship or to elementary information processing concepts. It is argued that second generation HRA methods must recognise that cognition is embedded in a context, and be able to account for that in the way human reliability is analysed and assessed
Admiraal, W.; Hoeksma, M.; van de Kamp, M.-T.; van Duin, G.
The richness and complexity of video portfolios endanger both the reliability and validity of the assessment of teacher competencies. In a post-graduate teacher education program, the assessment of video portfolios was evaluated for its reliability, construct validity, and consequential validity.
Robbins, Mandy; Francis, Leslie J; Bradford, Amanda
A sample of 16 male and 30 female undergraduates completed the Greer and Francis Scale of Rejection of Christianity. The data support the internal consistency reliability and construct validity of the scale for this sample.
Validity and reliability of the simplified Therapeutic Intervention Scoring System in intensive care units of a public sector hospital in Johannesburg. ... and reliability of TISS-28 and hence its feasibility for use in South African ICUs. Recommendations for nursing education, practice, management and research are proposed.
Karpienko, Katarzyna; Wróbel, Maciej S.; UrniaŻ, Rafał
Reliability and validity of measurements is of utmost importance when assessing measuring capability of instruments developed for research. In order to perform an experiment which is legitimate, used instruments must be both reliable and valid. Reliability estimates the degree of precision of measurement, the extent to which a measurement is internally consistent. Validity is the usefulness of an instrument to perform accurate measurements of quantities it was designed to measure. Statistical analysis for reliability and validity control of low-coherence interferometry method for refractive index measurements of biological fluids is presented. The low-coherence interferometer is sensitive to optical path difference between interfering beams. This difference depends on the refractive index of measured material. To assess the validity and reliability of proposed method for blood measurements, the statistical analysis of the method was performed on several substances with known refractive indices. Analysis of low-coherence interferograms considered the mean distances between fringes. Performed statistical analysis for validity and reliability consisted of Grubb's test for outliers, Shapiro-Wilk test for normal distribution, T-Student test, standard deviation, coefficient of determination and r-Pearson correlation. Overall the tests proved high statistical significance of measurement method with confidence level measurement method.
Full Text Available In this paper, I respond to the challenge raised against contemporary experimental neurobiology according to which the field is in a state of crisis because of the multiple experimental protocols employed in different laboratories and strengthening their reliability that presumably preclude the validity of neurobiological knowledge. I provide an alternative account of experimentation in neurobiology which makes sense of its experimental practices. I argue that maintaining a multiplicity of experimental protocols and strengthening their reliability are well justified and they foster rather than preclude the validity of neurobiological knowledge. Thus, their presence indicates thriving rather than crisis of experimental neurobiology.
Cost reductions for offshore wind turbines are a substantial requirement in order to make offshore wind energy more competitive compared to other energy supply methods. During the 20 – 25 years of wind turbines useful life, Operation & Maintenance costs are typically estimated to be a quarter...... the actions should be made and the type of actions requires knowledge on the accumulated damage or degradation state of the wind turbine components. For offshore wind turbines, the action times could be extended due to weather restrictions and result in damage or degradation increase of the remaining...... for Operation & Maintenance planning. Concentrating efforts on development of such models, this research is focused on reliability modeling of Wind Turbine critical subsystems (especially the power converter system). For reliability assessment of these components, structural reliability methods are applied...
Ruffino, Amy Goetz; Mistrett, Susan G.; Tomita, Machiko; Hajare, Poonam
The Universal Design for Play (UDP) Tool is an instrument designed to evaluate the presence of universal design (UD) features in toys. This study evaluated its psychometric properties, including content validity, construct validity, and test-retest reliability. The UDP tool was designed to assist in selecting toys most appropriate for children…
Tissue, Camden M; Velleman, Paul F; Stegink-Jansen, Caroline W; Aaron, Dorit Haenosh; Winthrop, Barbara G; Gogola, Gloria R
Clinical measurement study. The Functional Dexterity Test (FDT) has not been validated in children. To determine reliability and validity of the FDT in a pediatric population. Intraclass Correlation Coefficients (ICCs) were used to calculate interrater and test-retest reliability in typically developing children. Pearson correlation coefficients were used to compare FDT speed with the Jebsen-Taylor Hand Function Test (JHFT) and with 2 activities of daily living tasks to establish validity in children with congenital hand differences. The FDT demonstrated excellent interrater (ICC, 0.99) and test-retest (ICC, 0.90) reliability. Pearson correlation coefficients exceeded 0.67 for JHFT subsets of fine dexterity and were all less than 0.66 for JHFT subsets of gross grasp. Correlations with the activities of daily living tasks were good to excellent. FDT speeds in TD children exceeded those of children with congenital hand differences (P validity. Children with congenital hand differences are often treated early in life, making it important to reliably assess hand function of these young children to distinguish developmental change from changes due to interventions. The FDT can reliably measure functional progress over time, help clinicians monitor the efficacy of treatment, and provide families realistic feedback on their child's progress. The FDT is a valid and reliable instrument for the measurement of fine motor dexterity in children. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Martinková, Patrícia; Zvára, K.
Roč. 43, č. 3 (2007), s. 315-326 ISSN 0023-5954 R&D Projects: GA MŠk(CZ) 1M06014 Institutional research plan: CEZ:AV0Z10300504 Keywords : Cronbach's alpha * Rasch model * reliability Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.552, year: 2007 http://dml.cz/handle/10338.dmlcz/135776
Saiz de Bustamante, Amalio; Saiz de Bustamante, Barbara
The multinomial-exponential reliability function (MERF) was developed during a detailed study of the software failure/correction processes. Later on MERF was approximated by a much simpler exponential reliability function (EARF), which keeps most of MERF mathematical properties, so the two functions together makes up a single reliability model. The reliability model MERF/EARF considers the software failure process as a non-homogeneous Poisson process (NHPP), and the repair (correction) process, a multinomial distribution. The model supposes that both processes are statistically independent. The paper discusses the model's theoretical basis, its mathematical properties and its application to software reliability. Nevertheless it is foreseen model applications to inspection and maintenance of physical systems. The paper includes a complete numerical example of the model application to a software reliability analysis
Full Text Available Abstract Background Wolfram syndrome (WFS is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS. Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease. WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age. Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93, moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91 and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, ps>.76, ps=-.86, p=.001. The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83. Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS.
Nguyen, Chau; Foster, Erin R; Paciorkowski, Alexander R; Viehoever, Amy; Considine, Colleen; Bondurant, Aidena; Marshall, Bess A; Hershey, Tamara
Wolfram syndrome (WFS) is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS) to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS). A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease). WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age). Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. The WURS had high inter-rater reliability (ICCs>.93), moderate to high internal consistency reliability (Cronbach's α = 0.78-0.91) and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, pScale and reports of mood and behavior (rs>.76, pvalidity (Scale-Content Validity Index=0.83). These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS.
The primary objective for the Model Validation Status Review was to perform a one-time evaluation of model validation associated with the analysis/model reports (AMRs) containing model input to total-system performance assessment (TSPA) for the Yucca Mountain site recommendation (SR). This review was performed in response to Corrective Action Request BSC-01-C-01 (Clark 2001, Krisha 2001) pursuant to Quality Assurance review findings of an adverse trend in model validation deficiency. The review findings in this report provide the following information which defines the extent of model validation deficiency and the corrective action needed: (1) AMRs that contain or support models are identified, and conversely, for each model the supporting documentation is identified. (2) The use for each model is determined based on whether the output is used directly for TSPA-SR, or for screening (exclusion) of features, events, and processes (FEPs), and the nature of the model output. (3) Two approaches are used to evaluate the extent to which the validation for each model is compliant with AP-3.10Q (Analyses and Models). The approaches differ in regard to whether model validation is achieved within individual AMRs as originally intended, or whether model validation could be readily achieved by incorporating information from other sources. (4) Recommendations are presented for changes to the AMRs, and additional model development activities or data collection, that will remedy model validation review findings, in support of licensing activities. The Model Validation Status Review emphasized those AMRs that support TSPA-SR (CRWMS M and O 2000bl and 2000bm). A series of workshops and teleconferences was held to discuss and integrate the review findings. The review encompassed 125 AMRs (Table 1) plus certain other supporting documents and data needed to assess model validity. The AMRs were grouped in 21 model areas representing the modeling of processes affecting the natural and
The primary objective for the Model Validation Status Review was to perform a one-time evaluation of model validation associated with the analysis/model reports (AMRs) containing model input to total-system performance assessment (TSPA) for the Yucca Mountain site recommendation (SR). This review was performed in response to Corrective Action Request BSC-01-C-01 (Clark 2001, Krisha 2001) pursuant to Quality Assurance review findings of an adverse trend in model validation deficiency. The review findings in this report provide the following information which defines the extent of model validation deficiency and the corrective action needed: (1) AMRs that contain or support models are identified, and conversely, for each model the supporting documentation is identified. (2) The use for each model is determined based on whether the output is used directly for TSPA-SR, or for screening (exclusion) of features, events, and processes (FEPs), and the nature of the model output. (3) Two approaches are used to evaluate the extent to which the validation for each model is compliant with AP-3.10Q (Analyses and Models). The approaches differ in regard to whether model validation is achieved within individual AMRs as originally intended, or whether model validation could be readily achieved by incorporating information from other sources. (4) Recommendations are presented for changes to the AMRs, and additional model development activities or data collection, that will remedy model validation review findings, in support of licensing activities. The Model Validation Status Review emphasized those AMRs that support TSPA-SR (CRWMS M&O 2000bl and 2000bm). A series of workshops and teleconferences was held to discuss and integrate the review findings. The review encompassed 125 AMRs (Table 1) plus certain other supporting documents and data needed to assess model validity. The AMRs were grouped in 21 model areas representing the modeling of processes affecting the natural and
Glenn, Catherine R; Klonsky, E David
Although the DSM-IV suggests that dysfunctional personality patterns can be traced back to adolescence, there is continued debate about whether borderline personality disorder (BPD) can be reliably and validly diagnosed before age 18. The current study examined the reliability and validity of BPD in a large sample of adolescent psychiatric patients. BPD and Axis I disorders were assessed with validated structured interviews and a series of clinical, emotion, and personality correlates were assessed with validated self-report questionnaires. Consistent with previous studies in adolescent clinical samples, approximately 30% of patients in the current sample met criteria for BPD. The nine BPD criteria demonstrated good internal consistency, equivalent to rates reported in adult samples. In addition, BPD was related to greater clinical severity and impairment as indexed by strong associations with all major Axis I disorders, as well as with dimensional measures of depression, anxiety, difficulties with emotion regulation, and impulsiveness. Notably, reliability and validity remained satisfactory even when analyses were limited to younger adolescents between the ages of 12 and 14. Overall, findings suggest that BPD can be reliably and validly diagnosed in adolescents as young as 12-14 years old.
Konge, Lars; Larsen, Klaus Richter; Clementsen, Paul
Background: There have been several attempts to systematically assess performance in bronchoscopy. Earlier validation studies have used bronchoscopy simulators, not real-life performance in patients. Objectives: The aim of this study was to explore the reliability and validity of an assessment tool...... aspects of the assessment, but post hoc tests showed different discriminative abilities. Conclusions: This new tool for assessing clinical bronchoscopy performance has a high interrater reliability. One rater assessing performance of 3 bronchoscopies ensures sufficient reliability. The assessment tool......: The interrater reliability was high, with Cronbach's a = 0.86. Assessment of 3 bronchoscopies by a single rater had a generalizability coefficient of 0.84. The correlation between experience and performance was good (Pearson correlation = 0.76). There were significant differences between the groups for all...
Conway, Michelle R; Marshall, Mallory R; Schlaff, Rebecca A; Pfeiffer, Karin A; Pivarnik, James M
Current physical activity (PA) recommendations for women experiencing a normal pregnancy reflect recent research showing numerous health benefits for mother and offspring. However, few studies have evaluated PA devices' reliability and validity during pregnancy, because anatomical and physiological changes throughout gestation could affect an instrument's accuracy. This study aimed to determine the reliability and validity of PA devices worn on the hip, ankle, and triceps during pregnancy and postpartum. Thirty-three women performed six activities of daily living and one treadmill walk at approximately 21 and 32 wk of pregnancy, and 12 wk postpartum. There were two visits at each time period, 1 wk apart. Energy expenditure (oxygen consumption) was measured by using indirect calorimetry (IC; criterion measure), whereas PA was quantified by using accelerometers and pedometers placed at the right hip and ankle and left triceps. Interclass reliability and monitor validity compared with IC in relative (mL·kg·min) terms were calculated using Pearson correlation. Both multitrial and single-trial intraclass reliabilities (ICC) were estimated using ANOVA to assess monitor reliability at each time period. SEM values were calculated in relative terms for each time period. The reliability of the devices was moderate/strong because 66% of the Pearson correlations were between 0.6 and 1.0. Multitrial ICC values were largely in the moderate/strong range because 38% of the ICC values were between 0.6 and 0.79 and 50% were between 0.8 and 1.0. The SEM values for each device between visits ranged from 7% to 23% of the mean values. Comparison between IC and devices showed that 40% and 46% of the validity coefficients were between 0.4 and 0.59 and between 0.6 and 0.79, respectively. PA devices show moderate/strong reliability and moderate validity for measuring PA during pregnancy and postpartum.
Rehman, Muniza; Pedersen, Stig Andur
In philosophy of science, the interest for computational models and simulations has increased heavily during the past decades. Different positions regarding the validity of models have emerged but the views have not succeeded in capturing the diversity of validation methods. The wide variety...
Konge, Lars; Lehnert, Per; Hansen, Henrik Jessen
BACKGROUND: As we move toward competency-based education in medicine, we have lagged in developing competency-based evaluation methods. In the era of minimally invasive surgery, there is a need for a reliable and valid tool dedicated to measure competence in video-assisted thoracoscopic surgery....... The purpose of this study is to create such an assessment tool, and to explore its reliability and validity. METHODS: An expert group of physicians created an assessment tool consisting of 10 items rated on a five-point rating scale. The following factors were included: economy and confidence of movement...
Fleury, Agnès; Romo, Matthew L.; Abraham, Ronaldo; Fandiño, Jaime; Durán, Juan C.; Cárdenas, Graciela; Moncayo, Jorge; Leite Rodrigues, Cleonísio; San‐Juan, Daniel; Serrano‐Dueñas, Marcos; Takayanagui, Oswaldo; Sander, Josemir W.
Objective The diagnosis of neurocysticercosis (NCC) remains problematic because of the heterogeneity of its clinical, immunological, and imaging characteristics. Our aim was to develop and assess a new set of diagnostic criteria for NCC, which might allow for the accurate detection of, and differentiation between, parenchymal and extraparenchymal disease. Methods A group of Latin American NCC experts developed by consensus a new set of diagnostic criteria for NCC. A multicenter, retrospective study was then conducted to validate it. The reference standard for diagnosis of active NCC was the disappearance or reduction of cysts after anthelmintic treatment. In total, three pairs of independent neurologists blinded to the diagnosis evaluated 93 cases (with NCC) and 93 controls (without NCC) using the new diagnostic criteria. Mixed‐effects logistic regression models were used to estimate sensitivity and specificity. Results Inter‐rater reliability (kappa) of diagnosis among evaluators was 0.60. For diagnosis of NCC versus no NCC, the new criteria had a sensitivity of 93.2% and specificity of 81.4%. For parenchymal NCC, the new criteria had a sensitivity of 89.8% and specificity of 80.7% and for extraparenchymal NCC, the new criteria had a sensitivity of 65.9% and specificity of 94.9%. Interpretation These criteria have acceptable reliability and validity and could be a new tool for clinicians and researchers. An advantage of the new criteria is that they consider parasite location (ie, parenchymal or extraparenchymal), which is an important factor determining the clinical, immunological, and radiological presentation of the disease, and importantly, its treatment and prognosis. Ann Neurol 2016;80:434–442 PMID:27438337
Full Text Available Objective: Discomfort Intolerance Scale was developed by Norman B. Schmidt et al. to assess the individual differences of capacity to withstand physical perturbations or uncomfortable bodily states (2006. The aim of this study is to investigate the validity and reliability of Discomfort Intolerance Scale-Turkish Version (RDÖ. Method: From two different universities, total of 225 students (male=167, female=58 were participated in this study. In order to determine the criterion validity, Beck Anxiety Inventory (BAI and State-Trait Anxiety Inventory (STAI were used. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Barlett test had been performed. To assess the test-retest reliability the scale was re-applied to 54 participants 6 weeks later. Results: To assess construct validity of DIS, factor analyses were performed using varimax principal components analysis with varimax rotation. The factor analysis resulted in two factors named “discomfort (in tolerance” and “discomfort avoidance”. The Cronbach’s alpha coefficient for the entire scale, discomfort-(intolerance subscale, discomfortavoidance subscale were, .592, .670, .600 respectively. Correlations between two factors of DIS, discomfort intolerance and discomfort avoidance, and Trait Anxiety Inventory of STAI (State-Trait Anxiety Inventory were statistically significant at the level of 0.05. Test-retest reliability was statistically significant at the level of 0.01. Conclusion: Analysis demonstrated that DIS had a satisfactory level of reliability and validity in Turkish university students.
Minner, Daphne Diane
The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating
Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa
Interventions to counter the influence of television food advertising amongst children are important. Thus, reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: email@example.com.
Ahmed, Hussam; Chateauneuf, Alaa
The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces
Limviriyakul, Siriporn; Supavanich, Walop
Demonstrate the reliability and validity of the Tinnitus Handicap Inventory Thai Version (THI-T), a self-report measure of tinnitus. A cross-sectional psychometric validation study was used to determine internal consistency reliability and validity of the Tinnitus Handicap Inventory Thai Version at the Otoneurology clinic at Tertiary care center The cross-cultural adaptation of the Tinnitus Handicapped Inventory English version (Newman et al, 1996) was translated into Thai version following the steps indicated by Guillemin et al. The reliability was constructed by using Cronbach's coefficient alpha. The validity was analyzed by the correlation between Tinnitus Handicap Inventory Thai version and the 36-items short form health survey and visual analog scale using Spearman and Pearson test. The result showed good internal consistency reliabilities of total, functional, emotional, and catastrophic scale (a = 0.902, 0.804, 0.831 and 0.661, respectively) of Tinnitus Handicap Inventory Thai Version. Spearman correlation showed the significant correlation of Tinnitus Handicap Inventory to 36-items short form health survey and visual analog scale. Tinnitus Handicap Inventory Thai Version will be a vigorous tool in evaluating tinnitus patients as well as monitoring the progress of their symptoms.
Heineman, Kirsten R.; Middelburg, Karin J.; Bos, Arend F.; Eidhof, Lieke; La Bastide-van Gemert, Sacha; van den Heuvel, Edwin R.; Hadders-Algra, Mijna
The Infant Motor Profile (IMP) is a qualitative assessment of motor behaviour in infancy. It consists of five domains: movement variation, variability, fluency, symmetry, and performance. The aim of this study was to assess interobserver reliability and concurrent validity of the IMP with the
van der Wulp, I.
Reliability and validity of triage systems is important because this can affect patient safety. In this thesis, these aspects of two emergency department (ED) triage systems were studied as well as methodological aspects in these types of studies. The consistency, reproducibility, and criterion
Dr. Martijn de Groot; W.P. Krijnen; Thea J.M. Kooiman; Dr. C.P. van der Schans; Siska R. Sprenger; Manon L. Dontje
Background: Activity trackers can potentially stimulate users to increase their physical activity behavior. The aim of this study was to examine the reliability and validity of ten consumer activity trackers for measuring step count in both laboratory and free-living conditions. Method: Healthy
Todsen, Tobias; Tolsgaard, Martin Grønnebæk; Olsen, Beth Härstedt
OBJECTIVE: To explore the reliability and validity of the Objective Structured Assessment of Ultrasound Skills (OSAUS) scale for point-of-care ultrasonography (POC US) performance. BACKGROUND: POC US is increasingly used by clinicians and is an essential part of the management of acute surgical...
Kenyon, Dorry; Van Duzer, Carol
Ensuring that language tests for adult English language learners are appropriate, valid, and reliable is a challenge. Performance-based assessments are complex to develop and implement. Yet, because the focus of assessment, both in the National Reporting System for Adult Education (NRS) descriptors and in the Department of Education's definition…
Heineman, Kirsten R.; Middelburg, Karin J.; Bos, Arend F.; Eidhof, Lieke; La Bastide-Van Gemert, Sacha; Van den Heuvel, Edwin R.; Hadders-Algra, Mijna
Aim The Infant Motor Profile (IMP) is a qualitative assessment of motor behaviour in infancy. It consists of five domains: movement variation, variability, fluency, symmetry, and performance. The aim of this study was to assess interobserver reliability and concurrent validity of the IMP with the
The evaluation of consumer response to sport sponsorship is limited in the academic literature. This research was aimed to conduct a dimensionality, validity and reliability study of the Speed and Thompson Sponsorship Questionnaire in Turkey (2000). Eight hundred and fifty-two (852) university students participated in the ...
The aim of this study was to develop a Turkish version of the “Leisure Negotiation Strategies Scale” for university students and to examine its validity and reliability. The Leisure Negotiation Strategies Scale contains 31 items, which are expressed on a 5-point Likert scale. The scale was tested in two separate samples ...
Automated Body Reaction Test (ABRT) is a new device for skills and physical assessment instrument to measure ability on react, move quickly and accurately in accordance with stimulus. A total of 474 subjects aged 7-17 years old were randomly selected for the construct validity (n=330) and reliability (n=144). The ABRT ...
Massatti, Richard R.; Vonk, M. Elizabeth; Gregoire, Thomas K.
The present study provides information on the reliability and validity of the Transracial Adoption Parenting Scale (TAPS), a multidimensional 36-item Likert-type scale that measures cultural competence among transracial adoptive (TRA) parents. The TAPS was theoretically developed and refined through feedback from experts in TRA adoption. A…
Hop, M.; Moues, C.; Bogomolova, K.; Nieuwenhuis, M.; Oen, I.; Middelkoop, E.; Breederveld, R.; de Baar, M.
Objective: The aim of this study was to examine the reliability and validity of using photographs of burns to assess both burn size and depth. Method: Fifty randomly selected photographs taken on day 0-1 post burn were assessed by seven burn experts and eight referring physicians. Inter-rater
Kim, Bryan S. K.; Soliz, Alicia; Orellana, Blanca; Alamilla, Saul G.
This article describes the development of the Latino/a Values Scale (35 items, 14 reverse-worded). Evidence of reliability and validity are presented on the basis of three studies. An examination of the factor structure of the items suggests the presence of the following dimensions: cultural pride, simpatia, familismo, and espiritismo. (Contains 4…
Pavuluri, Mani N.; Henry, David B.; Devineni, Bhargavi; Carbray, Julie A.; Birmaher, Boris
Objective: To develop a reliable and valid parent-report screening instrument for mania, based on DSM-IV symptoms. Method: A 21-item Child Mania Rating Scale-Parent version (CMRS-P) was completed by parents of 150 children (42.3% female) ages 10.3 plus or minus 2.9 years (healthy controls = 50; bipolar disorder = 50;…
Yocum, Allison; McCoy, Sarah Westcott; Bjornson, Kristie F.; Mullens, Pamela; Burton, Gay Naganuma
A standardized protocol for a pediatric heel-rise test was developed and reliability and validity are reported. Fifty-seven children developing typically (CDT) and 34 children with plantar flexion weakness performed three tests: unilateral heel rise, vertical jump, and force measurement using handheld dynamometry. Intraclass correlation…
Background The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. Methods We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson’s correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach’s alpha coefficient; the intraclass (ICC) and Pearson’s correlation coefficients were used for test-retest reliability. Results One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson’s correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson’s correlation coefficient was 0.89 and ICC was 0.90. Conclusions The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality. PMID:23327598
In facilitating cross-cultural study in the field of psychology and Logotherapy, the reliability and validity of the logotest which measures inner meaning fulfillment was carried out among 885 University of Ibadan students, 439 males and 434 females, aged between 15 and 60 years old with mean X age of 6.0. Data analyses ...
Background: Brain fag is an indigenous psychopathology or culture-bound syndrome formally documented in Nigeria in the 1960's by Raymond Prince. Objective: The need for a factorial examination of the scale to ensure factorial validity and also to examine the reliability of this screening scale. Methods: Two hundred thirty ...
Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel
The purpose of this study was to investigate validity and reliability of the test of creative imagination. This study was conducted with the participation of 1000 children, aged between 9-14 and were studying in six primary schools in the city center of Denizli Province, chosen by cluster ratio sampling. In the study, it was revealed that the…
Resnick, Barbara; Galik, Elizabeth; Dorsey, Susan; Scheve, Ann; Gutkin, Susan
Objective: The purpose of this study was to test reliability and validity of the Physical Resilience Scale. Methods: A single-group repeated measure design was used and 130 older adults from three different housing sites participated. Participants completed the Physical Resilience Scale, Hardy-Gill Resilience Scale, 14-item Resilience Scale,…
McMullen, Tara; Resnick, Barbara
To establish the reliability and validity of the Rosenberg Self-Esteem Scale (RSES) when used with nursing assistants (NAs). Testing the RSES used baseline data from a randomized controlled trial testing the Res-Care Intervention. Female NAs were recruited from nursing homes (n = 508). Validity testing for the positive and negative subscales of the RSES was based on confirmatory factor analysis (CFA) using structural equation modeling and Rasch analysis. Estimates of reliability were based on Rasch analysis and the person separation index. Evidence supports the reliability and validity of the RSES in NAs although we recommend minor revisions to the measure for subsequent use. Establishing reliable and valid measures of self-esteem in NAs will facilitate testing of interventions to strengthen workplace self-esteem, job satisfaction, and retention.
Due, Ulla; Ottesen, Marianne
offered pelvic floor muscle examination and instruction by a specialist physiotherapist. In relation to that, a non-validated questionnaire about anal and urinary incontinence was to be answered six months after childbirth. Method. The original questionnaire was revised and a pilot test was performed...... main questions but one. Two questions needed further explanation. Seven women made minor errors. Conclusion. The validated Danish questionnaire has a good construct, content and face validity. It is a well accepted, reliable, simple and clinically relevant screening tool. It reveals physical problems...
Reis, Rodrigo Siqueira; Hino, Adriano Akira Ferreira; Añez, Ciro Romélio Rodriguez
The perceived stress scale (PSS-10) reliability and validity were evaluated in Brazilian adults. A two-stage translation procedure was employed to achieve a Portuguese version. Participants were 793 Brazilian university teachers. The exploratory factor analysis showed two factors with eigenvalues greater than 1.0 (56.8% of variance). The Cronbach's alpha coefficients were 0.83 (Factor 1), 0.77 (Factor 2) and 0.87 (Total Score). The test-retest reliability scores were 0.83 (Factor 1), 0.68 (Factor 2) and 0.86 (Total Score). PSS-10 and perceived health correlations ranged from -0.22 to -0.35. The PSS-10 showed an adequate reliability and validity supporting its use in this population.
Meng, Zhaoli; Zheng, Yun; Wang, Kai; Kong, Xiudan; Tao, Yong; Xu, Ke; Liu, Guanjian
Objectives The Tinnitus Handicap Inventory (THI) is a commonly used self-reporting tinnitus questionnaire. We undertook this study to determine the reliability and validity of the Chinese-Mandarin version of the Tinnitus Handicap Inventory (THI-CM) for measuring tinnitus-related handicaps. Methods We tested the test-retest reliability, internal reliability, and construct validity of the THI-CM. Two-hundred patients seeking treatment for primary or secondary tinnitus in Southwest China were asked to complete THI-CM prior to clinical evaluation. Patients were evaluated by a clinician using standard methods, and 40 patients were asked to complete THI-CM a second time 14±3 days after the initial interview. Results The test-retest reliability of THI-CM was high (Pearson correlation, 0.98), as was the internal reliability (Cronbach's α, 0.93). Factor analysis indicated that THI-CM has a unifactorial structure. Conclusion The THI-CM version is reliable. The total score in THI-CM can be used to measure tinnitus-related handicaps in Mandarin-speaking populations. PMID:22468196
Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto
Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Full Text Available Validity and Reliability of Agoraphobic Cognitions Questionnaire-Turkish Version Objective: The aim of this study is to investigate the validity and reliability of Agoraphobic Cognitions Questionnaire -Turkish Version (ACQ. Method: ACQ was administered to 92 patients with agoraphobia or panic disorder with agoraphobia. BSQ Turkish version completed by translation, back-translation and pilot assessment. Reliability of ACQ was analyzed by test-retest correlation, split-half technique, Cronbach’s alpha coefficient. Construct validity was evaluated by factor analysis after the Kaiser-Meyer-Olkin (KMO and Bartlett test had been performed. Principal component analysis and varimax rotation used for factor analysis. Results: 64% of patients evaluated in the study were female and 36% were male. Age interval was between 18 and 58, mean age was 31.5±10.4. The Cronbach’s alpha coefficient was 0.91. Analysis of test-retest evaluations revealed that there were statistically significant correlations ranging between 24% and 84% concerning questionnaire components. In analysis performed by split-half method reliability coefficients of half questionnaires were found as 0.77 and 0.91. Again Spearmen-Brown coefficient was found as 0.87 by the same analysis. To assess construct validity of ACQ, factor analysis was performed and two basic factors found. These two factors explained 57.6% of the total variance. (Factor 1: 34.6%, Factor 2: 23% Conclusion: Our findings support that ACQ-Turkish version had a satisfactory level of reliability and validity
Kimberlin, Carole L; Winterstein, Almut G
Issues related to the validity and reliability of measurement instruments used in research are reviewed. Key indicators of the quality of a measuring instrument are the reliability and validity of the measures. The process of developing and validating an instrument is in large part focused on reducing error in the measurement process. Reliability estimates evaluate the stability of measures, internal consistency of measurement instruments, and interrater reliability of instrument scores. Validity is the extent to which the interpretations of the results of a test are warranted, which depends on the particular use the test is intended to serve. The responsiveness of the measure to change is of interest in many of the applications in health care where improvement in outcomes as a result of treatment is a primary goal of research. Several issues may affect the accuracy of data collected, such as those related to self-report and secondary data sources. Self-report of patients or subjects is required for many of the measurements conducted in health care, but self-reports of behavior are particularly subject to problems with social desirability biases. Data that were originally gathered for a different purpose are often used to answer a research question, which can affect the applicability to the study at hand. In health care and social science research, many of the variables of interest and outcomes that are important are abstract concepts known as theoretical constructs. Using tests or instruments that are valid and reliable to measure such constructs is a crucial component of research quality.
Due, Ulla; Ottesen, Marianne
Objective. To revise, validate and test for reliability an anal sphincter rupture questionnaire in relation to construct, content and face validity. Setting and background. Since 1996 women with anal sphincter rupture (ASR) at one of the public university hospitals in Copenhagen, Denmark have been...... among health care personnel, followed by an expert panel discussion. Ten women were interviewed about their understanding and attitude toward answering the questionnaire and it was pre-tested on 52 women with ASR. The questionnaire was revised five times during the validation process. The final version...... was test-retested among 20 women with ASR. Results. The questionnaire revealed typical problems after ASR. The length, use of language, and the way the questionnaire was constructed, was deemed appropriate. The reliability test showed good to very good correlation (Kappa values from 0.733 to 0.923) in all...
Carlsen, C G; Lindorff Larsen, Karen; Funch-Jensen, P
of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. METHODS: Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia...... repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. RESULTS: The internal consistency of the items was high...... a significant difference between the three groups which indicates construct validity, p skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment...
Lund, Rikke; Nielsen, Lene Snabe; Henriksen, Pia Wichmann
OBJECTIVE: The aim of the present article is to describe the face and content validity as well as reliability of the Copenhagen Social Relations Questionnaire (CSRQ). METHOD: The face and content validity test was based on focus group discussions and individual interviews with 31 informants....... Another 94 men and women participated in an 8-day test-retest analysis. RESULTS: Informants generally expressed that the questions and response categories were relevant and easy to understand. Themes on structure of social relations, social support, and negative aspects of social relations emerged clearly...... from the interviews. Two additional themes not covered by CSRQ on dynamics and reciprocity of social relations were identified. DISCUSSION: CSRQ holds satisfactory face and content validity as well as reliability, and is suitable for measuring structure and function of social relations including...
Ulozienė, Ingrida; Balnytė, Renata; Alzbutienė, Giedrė; Arechvo, Irina; Vaitkus, Antanas; Šileikaitė, Milda; Šaferis, Viktoras; Ulozas, Virgilijus
The aim of this study was to determine the reliability and validity of the Lithuanian version of the Tinnitus Handicap Inventory (THI), a self-report measure of perceived tinnitus handicap. A cross-sectional psychometric validation study was performed in the University Hospital. A total of 248 subjects reporting chronic tinnitus as their primary complaint or secondary to hearing loss were encluded in the study and filled in the Lithuanian version of THI. For assessment of construct validity a subgroup of 55 participants completed the Lithuanian version of the Hospital Anxiety and Depression Scale as a measure of self-perceived levels of anxiety and depression. Test-retest and internal consistency reliability as well as construct validity were calculated. The Lithuanian version of the THI and its subscales showed a robust internal consistency reliability (Cronbach's alpha=0.93) comparable to the original version. Statistically significant correlations were observed between the Lithuanian translation of the THI and the measures of self-perceived levels of anxiety and depression using HADS. Confirmatory factor analysis demonstrated that the three subscales of the THI Lithuanian version corresponded to three different factors, which strongly correlated between themselves. The results suggest that the Lithuanian version of THI maintains its original validity and may serve as reliable and valid measure of general tinnitus related distress that can be used in a clinical setting to quantify the impact of tinnitus on daily living. Copyright © 2016 The Lithuanian University of Health Sciences. Production and hosting by Elsevier Urban & Partner Sp. z o.o. All rights reserved.
Napier, B.A.; Gilbert, R.O.; Simpson, J.C.; Ramsdell, J.V. Jr.; Thiede, M.E.; Walters, W.H.
The Hanford Environmental Dose Reconstruction (HEDR) Project has developed a set of computational ''tools'' for estimating the possible radiation dose that individuals may have received from past Hanford Site operations. This document describes the planned activities to ''validate'' these tools. In the sense of the HEDR Project, ''validation'' is a process carried out by comparing computational model predictions with field observations and experimental measurements that are independent of those used to develop the model
Noormohammadpour, Pardis; Hosseini Khezri, Alireza; Farahbakhsh, Farzin; Mansournia, Mohammad Ali; Smuck, Matthew; Kordi, Ramin
The purpose of this study was to evaluate validity and reliability of a new proposed questionnaire for assessment of functional disability in athletes with low back pain (LBP). Validity and reliability study. Elite athletes participating in different fields of sports. Participants were 165 male and female athletes (between 12 and 50 years old) with LBP. Athlete Disability Index (ADI) Questionnaire which is developed by the authors for assessing LBP-related disability in athletes, Oswestry Disability Index (ODI), and the Roland-Morris Disability Questionnaire (RDQ). Self-reported responses were collected regarding LBP-related disability through ADI, ODI, and RDQ. The test-retest reliability was strong, and intraclass correlation value ranged between 0.74 and 0.94. The Cronbach alpha coefficient value of 0.91 (P visual analog scale was r = 0.626 (P disability levels were mild in the large majority of subjects (91.5% and 86.0%, respectively). Alternatively, disability assessments by the ADI did not cluster at the mild level and ranged more broadly from mild to very high. The ADI is a reliable and valid instrument for assessing disability in athletes with LBP. Compared with the available LBP disability questionnaires used in the general population, ADI can more precisely stratify the disability levels of athletes due to LBP.
Nordness, Philip D.; Epstein, Michael H.; Cullinan, Douglas; Pierce, Corey D.
The Emotional and Behavioral Screener (EBS) is a universal screening instrument designed to identify students whose excessive problem behaviors put them at risk of the education disability category of emotional disturbance (ED). This article reports findings from three studies that address the reliability and validity of the EBS. Studies 1 and 2…
Downing Michael G
Full Text Available Abstract Background The Palliative Performance Scale (PPS was first introduced in1996 as a new tool for measurement of performance status in palliative care. PPS has been used in many countries and has been translated into other languages. Methods This study evaluated the reliability and validity of PPS. A web-based, case scenarios study with a test-retest format was used to determine reliability. Fifty-three participants were recruited and randomly divided into two groups, each evaluating 11 cases at two time points. The validity study was based on the content validation of 15 palliative care experts conducted over telephone interviews, with discussion on five themes: PPS as clinical assessment tool, the usefulness of PPS, PPS scores affecting decision making, the problems in using PPS, and the adequacy of PPS instruction. Results The intraclass correlation coefficients for absolute agreement were 0.959 and 0.964 for Group 1, at Time-1 and Time-2; 0.951 and 0.931 for Group 2, at Time-1 and Time-2 respectively. Results showed that the participants were consistent in their scoring over the two times, with a mean Cohen's kappa of 0.67 for Group 1 and 0.71 for Group 2. In the validity study, all experts agreed that PPS is a valuable clinical assessment tool in palliative care. Many of them have already incorporated PPS as part of their practice standard. Conclusion The results of the reliability study demonstrated that PPS is a reliable tool. The validity study found that most experts did not feel a need to further modify PPS and, only two experts requested that some performance status measures be defined more clearly. Areas of PPS use include prognostication, disease monitoring, care planning, hospital resource allocation, clinical teaching and research. PPS is also a good communication tool between palliative care workers.
Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl
To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P volleyball players.
model structure suggested by University of Lund the WP4 leader. This particular model structure has the advantages that it fits better into the control design frame work used by WP3-4 compared to the model structures previously developed in WP2. The different model structures are first summarised....... Then issues dealing with optimal experimental design is considered. Finally the parameters are estimated in the chosen static and dynamic models and a validation is performed. Two of the static models, one of them the additive model, explains the data well. In case of dynamic models the suggested additive...
Due, Ulla; Ottesen, Marianne
Objective. To revise, validate and test for reliability an anal sphincter rupture questionnaire in relation to construct, content and face validity. Setting and background. Since 1996 women with anal sphincter rupture (ASR) at one of the public university hospitals in Copenhagen, Denmark have been...... among health care personnel, followed by an expert panel discussion. Ten women were interviewed about their understanding and attitude toward answering the questionnaire and it was pre-tested on 52 women with ASR. The questionnaire was revised five times during the validation process. The final version...... offered pelvic floor muscle examination and instruction by a specialist physiotherapist. In relation to that, a non-validated questionnaire about anal and urinary incontinence was to be answered six months after childbirth. Method. The original questionnaire was revised and a pilot test was performed...
Zhang Ruoxue; Mahadevan, Sankaran
This paper develops a methodology to assess the reliability computation model validity using the concept of Bayesian hypothesis testing, by comparing the model prediction and experimental observation, when there is only one computational model available to evaluate system behavior. Time-independent and time-dependent problems are investigated, with consideration of both cases: with and without statistical uncertainty in the model. The case of time-independent failure probability prediction with no statistical uncertainty is a straightforward application of Bayesian hypothesis testing. However, for the life prediction (time-dependent reliability) problem, a new methodology is developed in this paper to make the same Bayesian hypothesis testing concept applicable. With the existence of statistical uncertainty in the model, in addition to the application of a predictor estimator of the Bayes factor, the uncertainty in the Bayes factor is explicitly quantified through treating it as a random variable and calculating the probability that it exceeds a specified value. The developed method provides a rational criterion to decision-makers for the acceptance or rejection of the computational model
Parker, Matthew O
Current advances in the ability to assay adult aquatic vertebrate behaviour are potentially very useful to aquatic toxicologists wishing to characterise the effects of pollutants on behaviour, cognition or neurodevelopment. This review considers two specific challenges faced by researchers wishing to exploit these technologies: maximising reliability and validity. It will suggest two behavioural procedures, with the potential for automation and high-throughput implementation, which can be used to measure social cohesion and anxiety, two areas of interest in behavioural aquatic toxicology. In addition, the review will make recommendations about how these procedures (and others) could be carried out to maximise reliability and validity. Copyright © 2015 Elsevier B.V. All rights reserved.
Full Text Available Abstract Background Clinicians need a practical, objective test of postural control that is sensitive to mild neurological disease, shows experimental and clinical validity, and has good test-retest reliability. We developed an instrumented test of postural sway (ISway using a body-worn accelerometer to offer an objective and practical measure of postural control. Methods We conducted two separate studies with two groups of subjects. Study I: sensitivity and experimental concurrent validity. Thirteen subjects with early, untreated Parkinson’s disease (PD and 12 age-matched control subjects (CTR were tested in the laboratory, to compare sway from force-plate COP and inertial sensors. Study II: test-retest reliability and clinical concurrent validity. A different set of 17 early-to-moderate, treated PD (tested ON medication, and 17 age-matched CTR subjects were tested in the clinic to compare clinical balance tests with sway from inertial sensors. For reliability, the sensor was removed, subjects rested for 30 min, and the protocol was repeated. Thirteen sway measures (7 time-domain, 5 frequency-domain measures, and JERK were computed from the 2D time series acceleration (ACC data to determine the best metrics for a clinical balance test. Results Both center of pressure (COP and ACC measures differentiated sway between CTR and untreated PD. JERK and time-domain measures showed the best test-retest reliability (JERK ICC was 0.86 in PD and 0.87 in CTR; time-domain measures ICC ranged from 0.55 to 0.84 in PD and from 0.60 to 0.89 in CTR. JERK, all but one time-domain measure, and one frequency measure were significantly correlated with the clinical postural stability score (r ranged from 0.50 to 0.63, 0.01 Conclusions Based on these results, we recommend a subset of the most sensitive, reliable, and valid ISway measures to characterize posture control in PD: 1 JERK, 2 RMS amplitude and mean velocity from the time-domain measures, and 3 centroidal
Schindler, Antonio; Mozzanica, Francesco; Monzani, Anna; Ceriani, Eleonora; Atac, Murat; Jukic-Peladic, Nikolina; Venturini, Claudia; Orlandoni, Paolo
We sought to evaluate the reliability and validity of the Italian EAT-10 (Italian Eating Assessment Tool; I-EAT-10). The study consisted of 4 phases: item generation, internal consistency and reliability analysis, normative data generation, and validity analysis. Discussion of the EAT-10 with 30 patients and its back-translation were accomplished. The recruited population included 172 patients (40 with dysphonia and 132 with dysphagia) and 269 asymptomatic subjects for testing of internal consistency, and 94 patients with dysphagia and 158 asymptomatic subjects for test-retest reliability analysis. Normative data were gathered from the 269 subjects. The scores of patients and asymptomatic subjects were compared. The I-EAT-10 and flexible endoscopic evaluation of swallowing (FEES) scores in 94 patients were correlated. The I-EAT-10 scores made before and after successful swallowing rehabilitation in 38 patients were compared. Excellent internal consistency (Cronbach's alpha values of 0.90 and 0.93) and strong test-retest reliability (intraclass correlation coefficients of 0.95 and 0.98) were found in patients and asymptomatic subjects. The I-EAT-10 mean (+/- SD) score of the normal cohort was 0.6 +/- 1.1. The asymptomatic subjects and dysphonic patients scored lower than the dysphagic patients on the Kruskal-Wallis test (p = 0.001). The I-EAT-10 and FEES scores were mildly correlated. The mean I-EAT-10 score improved from 9.8 +/- 10.3 to 5.8 +/- 6.7 after swallowing rehabilitation (p = 0.04). The I-EAT-10 is a reliable, valid, symptom-specific outcome tool.
Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo
Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit ( congruent with0.50). The individual subscales of the mVSS fell well below the acceptable level (0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars shows high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.
Manuel V. Garnacho-Castaño
Full Text Available The objectives of the study were to determine the validity and reliability of peak velocity (PV, average velocity (AV, peak power (PP and average power (AP measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain during two resistance exercises, bench press (BP and full back squat (BS, performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2. Intraclass correlation coefficients (ICCs indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W. Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W. Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP make this device a useful tool for monitoring resistance training.
Braun, Tobias; Marks, Detlef; Thiel, Christian; Grüneberg, Christian
To establish the validity and reliability of the de Morton Mobility Index (DEMMI) in patients with sub-acute stroke. This cross-sectional study was performed in a neurological rehabilitation hospital. We assessed unidimensionality, construct validity, internal consistency reliability, inter-rater reliability, minimal detectable change and possible floor and ceiling effects of the DEMMI in adult patients with sub-acute stroke. The study included a total sample of 121 patients with sub-acute stroke. We analysed validity (n = 109) and reliability (n = 51) in two sub-samples. Rasch analysis indicated unidimensionality with an overall fit to the model (chi-square = 12.37, p = 0.577). All hypotheses on construct validity were confirmed. Internal consistency reliability (Cronbach's alpha = 0.94) and inter-rater reliability (intraclass correlation coefficient = 0.95; 95% confidence interval: 0.92-0.97) were excellent. The minimal detectable change with 90% confidence was 13 points. No floor or ceiling effects were evident. These results indicate unidimensionality, sufficient internal consistency reliability, inter-rater reliability, and construct validity of the DEMMI in patients with a sub-acute stroke. Advantages of the DEMMI in clinical application are the short administration time, no need for special equipment and interval level data. The de Morton Mobility Index, therefore, may be a useful performance-based bedside test to measure mobility in individuals with a sub-acute stroke across the whole mobility spectrum. Implications for Rehabilitation The de Morton Mobility Index (DEMMI) is an unidimensional measurement instrument of mobility in individuals with sub-acute stroke. The DEMMI has excellent internal consistency and inter-rater reliability, and sufficient construct validity. The minimal detectable change of the DEMMI with 90% confidence in stroke rehabilitation is 13 points. The lack of any floor or ceiling effects on hospital admission indicates
Denizoğlu, İsmail İlter; Şahin, Mustafa; Kazancıoğlu, Alper; Dağdelen, Zibelhan; Akdeniz, Serap; Oğuz, Haldun; Kılıç, Mehmet Akif; Yücedağ, Aslı; Öğüt, Mehmet Fatih
This study aims to constitute a valid and reliable Turkish version of the original Singing Voice Handicap Index. An authorized committee assessed the reliability and validity of the content, scope, and language of the original Singing Voice Handicap Index which underwent a back translation process. The Turkish version of the questionnaire was answered twice with a 7 to 10-day interval by two singing voice groups with or without singing voice problems. The reliability and validity analyses were performed based on these answers. Of a total of 123 individuals (64 females, 59 males; mean age 26.2±7.3 years), 81 were without a voice pathology and 42 were with a voice pathology. The total Cronbach's alpha coefficient was 0.917. The item-total correlations ranged between 0.51 and 0.89. The weighted kappa values of test-retest correlation values of the items were 0.82-0.91. The Cronbach's alpha values of two part of the questionnaire based on the split-half method were 0.89 and 0.84. The mean total scale scores were 21.8±18.5 and 53.6±28.9 in normal and pathology groups, respectively and there was a statistically significant difference in scores between these two groups (p=0.000). The Turkish version of the Singing Voice Handicap Index is a valid and reliable scale which can be used in the evaluation of voice problems of Turkish-speaking singing voice users.
Yazici Sayin, Yazile
To report the translation and adaptation process from English to Turkish and the psychometric estimates of the validity and reliability of The Organ Donation Attitude Scale Turkish. Its aim (1) is to provide data about and (2) to assess Turkish people's attitudes and volunteerism towards organ donation. Lack of donors is a significant problem for organ transplantation worldwide. Attitudes about organ donation and volunteerism are important factors in the lack of donors. To collect survey data from Turkish participants, a cross-sectional design was used: the classical measurement method. The Organ Donation Attitude Scale was translated from English to Turkish and back-translated into English. The analysis included a total of 892 Turkish participants. The validity of the scale was confirmed by exploratory factor analysis and criterion-relation validity testing. A test-retest procedure was implemented for the reliability of the scale over time. The Organ Donation Attitude Scale consists of three relatively independent components: humanity and moral conviction, fears of medical neglect and fears of bodily mutilation. Internal consistency of these three components resulted in acceptable Cronbach's α levels. Positive correlation occurred between the volunteerism score and positive attitude about organ donation. The correlation between volunteerism score and negative attitude about organ donation was negative. Fears of bodily mutilation were most significantly related to unwillingness to commit to organ donation. The test-retest correlation coefficients proves that the Organ Donation Attitude Scale were reliable over time. The Organ Donation Attitude Scale Turkish version is both a reliable and valid instrument that can be useful in measuring positive and negative attitudes of Turkish people about organ donation. With the Organ Donation Attitude Scale, researchers in Turkey will be able to ascertain important data on volunteerism and attitudes towards organ donation
Gonzalez-Cardero, Eduardo; Infante-Cossio, Pedro; Cayuela, Aurelio; Acosta-Feria, Manuel; Gutierrez-Perez, Jose-Luis
To adapt to Spanish the facial disability index (FDI) described by VanSwearingen and Brach in 1995 and to assess its reliability and validity in patients with facial nerve paresis after parotidectomy. The present study was conducted in two different stages: a) cross-cultural adaptation of the questionnaire and b) cross-sectional study of a control group of 79 Spanish-speaking patients who suffered facial paresis after superficial parotidectomy with facial nerve preservation. The cross-cultural adaptation process comprised the following stages: (I) initial translation, (II) synthesis of the translated document, (III) retro-translation, (IV) review by a board of experts, (V) pilot study of the pre-final draft and (VI) analysis of the pilot study and final draft. The reliability and internal consistency of every one of the rating scales included in the FDI (Cronbach's alpha coefficient) was 0.83 for the complete scale and 0.77 and 0.82 for the physical and the social well-being subscales. The analysis of the factorial validity of the main components of the adapted FDI yielded similar results to the original questionnaire. Bivariate correlations between FDI and House-Brackmann scale were positive. The variance percentage was calculated for all FDI components. The FDI questionnaire is a specific instrument for assessing facial neuromuscular dysfunction which becomes a useful tool in order to determine quality of life in patients with facial nerve paralysis. Spanish adapted FDI is equivalent to the original questionnaire and shows similar reliability and validity. The proven reproducibility, reliability and validity of this questionnaire make it a useful additional tool for evaluating the impact of facial nerve paralysis in Spanish-speaking patients.
Park, Dae-Sung; Lee, GyuChang
A balance test provides important information such as the standard to judge an individual's functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment software using the Nintendo Wii Balance Board, investigated its reliability and validity, and compared it with a laboratory-grade force platform. Twenty healthy adults participated in our study. The participants participated in the test for inter-rater reliability, intra-rater reliability, and concurrent validity. The tests were performed with balance assessment software using the Nintendo Wii balance board and a laboratory-grade force platform. Data such as Center of Pressure (COP) path length and COP velocity were acquired from the assessment systems. The inter-rater reliability, the intra-rater reliability, and concurrent validity were analyzed by an intraclass correlation coefficient (ICC) value and a standard error of measurement (SEM). The inter-rater reliability (ICC: 0.89-0.79, SEM in path length: 7.14-1.90, SEM in velocity: 0.74-0.07), intra-rater reliability (ICC: 0.92-0.70, SEM in path length: 7.59-2.04, SEM in velocity: 0.80-0.07), and concurrent validity (ICC: 0.87-0.73, SEM in path length: 5.94-0.32, SEM in velocity: 0.62-0.08) were high in terms of COP path length and COP velocity. The balance assessment software incorporating the Nintendo Wii balance board was used in our study and was found to be a reliable assessment device. In clinical settings, the device can be remarkably inexpensive, portable, and convenient for the balance assessment.
Hill, C.; Robinson, L.
Mammographers currently score their own images according to criteria set out by Regional Quality Assurance. The criteria used are based on the ‘Perfect, Good, Moderate, Inadequate’ (PGMI) marking criteria established by the National Health Service Breast Screening Programme (NHSBSP) in their Quality Assurance Guidelines of 2006 1 . This document discusses the validity and reliability of the current mammography image assessment scheme. Commencing with a critical review of the literature this document sets out to highlight problems with the national approach to the use of marking schemes. The findings suggest that ‘PGMI’ scheme is flawed in terms of reliability and validity and is not universally applied across the UK. There also appear to be differences in schemes used by trainees and qualified mammographers. Initial recommendations are to be made in collaboration with colleagues within the National Health Service Breast Screening Programme (NHSBSP), Higher Education Centres, College of Radiographers and the Royal College of Radiologists in order to identify a mammography image appraisal scheme that is fit for purpose. - Highlights: • Currently no robust evidence based marking tools in use for the assessment of images in mammography. • Is current system valid, reliable and robust? • How can the current image assessment tool be improved? • Should students and qualified mammographers use the same tool? • What marking criteria are available for image assessment?
Cetin, Fatma Cosar; Sezer, Ayse; Merih, Yeliz Dogan
OBJECTIVE: The objective of this study is to investigate the validity and the reliability of Birth Satisfaction Scale (BSS) and to adapt it into the Turkish language. This scale is used for measuring maternal satisfaction with birth in order to evaluate women’s birth perceptions. METHODS: In this study there were 150 women who attended to inpatient postpartum clinic. The participants filled in an information form and the BSS questionnaire forms. The properties of the scale were tested by conducting reliability and validation analyses. RESULTS: BSS entails 30 Likert-type questions. It was developed by Hollins Martin and Fleming. Total scale scores ranged between 30–150 points. Higher scores from the scale mean increases in birth satisfaction. Three overarching themes were identified in Scale: service provision (home assessment, birth environment, support, relationships with health care professionals); personal attributes (ability to cope during labour, feeling in control, childbirth preparation, relationship with baby); and stress experienced during labour (distress, obstetric injuries, receiving sufficient medical care, obstetric intervention, pain, prolonged labour and baby’s health). Cronbach’s alfa coefficient was 0.62. CONCLUSION: According to the present study, BSS entails 30 Likert-type questions and evaluates women’s birth perceptions. The Turkish version of BSS has been proven to be a valid and a reliable scale. PMID:28058355
Full Text Available Keeping the arrow velocity constant during consecutive shots and responding to "clicker's fall" are considered to be an important feature of archery performance. A specially designed device called an archery chronometer was developed to measure the reaction time of an archer to clicker's fall, arrow velocity, and external factors that may affect arrow velocity. The purposes of this study were to test (1 the validity of Clicker Reaction Time (CRT measurer, and (2 the reliability of CRT in accordance with the Flying Time (FT/Average Speed (AS, temperature (TEMP, wind speed (WS and wind direction (WD measurements. 20 elite archers participated in this study. The Reaction Time (RT, which was derived from EMG values and CRT from the archery chronometer were correlated to test the validity of the CRT measurer. The test re-test method was applied to test the reliability of archery chronometer. CRT scores were related with RT scores (r = .787, p < 0.01. The archery chronometer was valid in terms of predicting reaction time. The device was found to be reliable in measuring CRT, AS, FT, WS, WD, and TEMP. It was concluded that archery chronometer could be used for technical evaluation and enhancing ones shooting technique in archery.
Espinoza-Venegas, Maritza; Sanhueza-Alvarado, Olivia; Ramírez-Elizondo, Noé; Sáez-Carrillo, Katia
The current study aimed to validate the construct and reliability of an emotional intelligence scale. The Trait Meta-Mood Scale-24 was applied to 349 nursing students. The process included content validation, which involved expert reviews, pilot testing, measurements of reliability using Cronbach's alpha, and factor analysis to corroborate the validity of the theoretical model's construct. Adequate Cronbach coefficients were obtained for all three dimensions, and factor analysis confirmed the scale's dimensions (perception, comprehension, and regulation). The Trait Meta-Mood Scale is a reliable and valid tool to measure the emotional intelligence of nursing students. Its use allows for accurate determinations of individuals' abilities to interpret and manage emotions. At the same time, this new construct is of potential importance for measurements in nursing leadership; educational, organizational, and personal improvements; and the establishment of effective relationships with patients.
Condon, David; Revelle, William
Separating the signal in a test from the irrelevant noise is a challenge for all measurement. Low test reliability limits test validity, attenuates important relationships, and can lead to regression artifacts. Multiple approaches to the assessment and improvement of reliability are discussed. The advantages and disadvantages of several different approaches to reliability are considered. Practical advice on how to assess reliability using open source software is provided.
Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob; Troelsen, Anders
The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately strong correlations with the physical subscores of the Medical Outcomes Study 36-item Short-Form Health Survey (r = .70 to .75; p questionnaire (r = .71; p validity. For study and follow-up purposes, the ATRS seems reliable for comparisons of groups of patients. Its usability is limited for repeated assessment of individual patients. The development of analysis guidelines would be desirable. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Ganestam, Ann; Barfod, Kristoffer; Klit, Jakob
strong correlations with the physical subscores of the Medical Outcomes Study 36-item Short-Form Health Survey (r = .70 to .75; p questionnaire (r = .71; p ...The best treatment of acute Achilles tendon rupture remains debated. Patient-reported outcome measures have become cornerstones in treatment evaluations. The Achilles tendon total rupture score (ATRS) has been developed for this purpose but requires additional validation. The purpose of the present...... study was to validate a Danish translation of the ATRS. The ATRS was translated into Danish according to internationally adopted standards. Of 142 patients, 90 with previous rupture of the Achilles tendon participated in the validity study and 52 in the reliability study. The ATRS showed moderately...
Anderson Kathryn L
Full Text Available Abstract The Quality of Life Scale (QOLS, created originally by American psychologist John Flanagan in the 1970's, has been adapted for use in chronic illness groups. This paper reviews the development and psychometric testing of the QOLS. A descriptive review of the published literature was undertaken and findings summarized in the frequently asked questions format. Reliability, content and construct validity testing has been performed on the QOLS and a number of translations have been made. The QOLS has low to moderate correlations with physical health status and disease measures. However, content validity analysis indicates that the instrument measures domains that diverse patient groups with chronic illness define as quality of life. The QOLS is a valid instrument for measuring quality of life across patient groups and cultures and is conceptually distinct from health status or other causal indicators of quality of life.
Byoung-Hee, Lee; Min-Yong, Kim; Young-Bae, Park; Young-Jae, Park
To develop and validate a Seven Emotions Impairment questionnaire (SEIQ), to define an optimum cut-off point for the SEIQ, and to examine whether SEI was predictive of Phlegm and Blood Stasis (BS). Two hundred outpatients and 75 college students were asked to complete the SEIQ, the Profile of Mood States (POMS), Phlegm Pattern Questionnaire (PPQ), and BS Questionnaire (BSQ). Twelve clinicians determined whether the outpatients exhibited SEI. SEIQ data were used to examine the internal consistency and determine validity for the outpatients. SEIQ, POMS, PPQ, and BSQ data were used to examine concurrent validity and predictability of SEI for Phlegm and BS in the college students. Total SEIQ scores and the clinicians' diagnoses of the outpatients were considered to define an optimum cut-off score for the SEIQ. The 18-item SEIQ had satisfactory internal consistency (α = 0.905) and concurrent validity. In the construct validity test, four factors (chest-anxiety, fatigue-depression, working-family-troubles, and sleep-memory) were identified. In the receiver operator characteristic curve curve analysis, the sensitivity, specificity, and area under the curve of the SEIQ were 67.2%, 72.1%, and 73%, respectively. The optimum cut-off score was defined as nine points. SEIQ scores were strongly predictive of Phlegm and BS (β = 0.862 and 0.673, respectively). Based on our results, we concluded that the SEIQ is a reliable and valid instrument for evaluating SEI, and is strongly predictive of Phlegm and BS.
Heineman, Kirsten R; Middelburg, Karin J; Bos, Arend F; Eidhof, Lieke; La Bastide-Van Gemert, Sacha; Van Den Heuvel, Edwin R; Hadders-Algra, Mijna
The Infant Motor Profile (IMP) is a qualitative assessment of motor behaviour in infancy. It consists of five domains: movement variation, variability, fluency, symmetry, and performance. The aim of this study was to assess interobserver reliability and concurrent validity of the IMP with the Alberta Infant Motor Scale (AIMS) and an age-specific neurological examination. Fifty-nine preterm infants (25 females, 34 males; median gestational age 29.7wks, median birthweight 1285g) and 146 term infants (74 females, 72 males; median gestational age 40.1wks, birthweight 3500g) were included. Assessments were performed at corrected ages of 4, 6, 10, 12, and 18 months and consisted of the IMP, AIMS, and an age-specific neurological examination. Interobserver reliability was investigated on a sample of 25 video recordings. Non-parametric statistics were used to analyse the data. Interobserver reliability was high (intraclass correlation coefficient 0.95). At all ages, AIMS scores correlated weakly to fairly with total IMP scores (Spearman's ρ 0.36-0.55), but moderately to strongly with scores on the performance domain of the IMP (Spearman's ρ 0.47-0.84). A clear relation was found between total IMP score and outcome of the neurological examination (Kruskal-Wallis p<0.001 at all ages). Interobserver reliability of the IMP is good. Concurrent validity with the AIMS is best for the IMP performance domain. Concurrent validity with age-specific neurological examination is very good. © The Authors. Developmental Medicine & Child Neurology © 2013 Mac Keith Press.
Hedrick, Valisa E.; Comber, Dana L.; Estabrooks, Paul A.; Savla, Jyoti; Davy, Brenda M.
Consumption of energy-containing beverages may lead to weight gain, yet research investigating this issue is limited. An easily-administered beverage intake assessment tool could facilitate research on this topic. The purpose of this cross-sectional investigation was to determine the validity and reliability of a self-administered beverage intake questionnaire (BEVQ), which estimates mean daily intake of beverages consumed (g, kcals) across 19 beverage categories. Participants (n=105; aged 39±2 yrs) underwent assessments of height, weight, body mass index, and dietary intake using 4-day food intake records (FIR) from June, 2008-June, 2009. The BEVQ was completed at two additional visits (BEVQ1, BEVQ2). Urine samples were collected to objectively determine total fluid intake and encourage accurate self-reporting. Validity was assessed by comparing BEVQ1 with FIR results; reliability was assessed by comparing BEVQ1 and BEVQ2. Analyses included descriptive statistics, bivariate correlations, paired samples t-tests, and independent samples t-tests. Self-reported water and total beverage intake (g) were not different between the BEVQ1 and FIR (mean difference: 129±77g [P=0.096] and 61±106g [P=0.567], respectively). Total beverage and sugar-sweetened beverage (SSB) energy intake were significantly different, although mean differences were small (63 and 44 kcal, respectively). Daily consumption (g) of water (r=0.53), total beverages (r=0.46), and SSB (r=0.49) determined by the BEVQ1 were correlated with reported intake determined by the FIR, as was energy from total beverages (r=0.61) and SSB (r=0.59) (all P<0.001). Reliability was demonstrated, with correlations (P<0.001) detected between BEVQ1 and BEVQ2 results. The BEVQ is a valid, reliable, and rapid self-administered dietary assessment tool. PMID:20656099
Full Text Available John C Sieverdes,1 Eric E Wickel,2 Gregory A Hand,3 Marco Bergamin,4 Robert R Moran,5 Steven N Blair3,51Medical University of South Carolina, College of Nursing and Medicine, Charleson, SC, 2University of Tulsa, Exercise and Sport Science, Tulsa, OK, 3University of South Carolina, Department of Exercise Science, Division of Health Aspects of Physical Activity, Arnold School of Public Health, Columbia, SC, USA; 4University of Padova, Department of Medicine, Sports Medicine Division, Padova, Italy; 5University of South Carolina, Department of Epidemiology and Biostatistics, Arnold School of Public Health, Columbia, SC, USABackground: This study evaluated the reliability and criterion validity of the Mywellness Key accelerometer (MWK using treadmill protocols and indirect calorimetry.Methods: Twenty-five participants completed two four-stage 20-minute treadmill protocols while wearing two MWK accelerometers. Reliability was assessed using raw counts. Validity was assessed by comparing the estimated VO2 calculated from the MWK with values from respiratory gas exchange.Results: Good overall and point estimates of reliability were found for the MWK (all intraclass correlations > 0.93. Generalizability theory coefficients showed lower values for running speed (0.70 versus walking speed (all > 0.84, with the majority of the overall percentage of variability derived from the participant (68%–88% of the total 100%. Acceptable validity was found overall (Pearson’s r = 0.895–0.902, P < 0.0001, with an overall mean absolute error of 16.22% and a coefficient of variance of 16.92%. Bland-Altman plots showed an overestimation of energy expenditure during the running speed, but total kilocalories were underestimated during the protocol by approximately 10%.Conclusion: Good validity was found during light and moderate walking, while running was slightly overestimated. The MWK may be useful for clinicians and researchers interested in promotion or assessment
Eric Swanson, MD
Full Text Available Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Sporis, Goran; Jukic, Igor; Milanovic, Luka; Vucetic, Vlatko
The purpose of this study was to evaluate the reliability and factorial validity of agility tests used in soccer. One hundred fifty (n = 150), elite, male, junior soccer players, members of the First Junior League Team, volunteered to participate in the study. The slalom test (ST) sprint 4 x 5 m (S4 x 5) and sprint 9-3-6-3-6-9 m with 180 degree turns (S180 degrees) tests had a greater reliability coefficient (alpha = 0.992, 0.979, and 0.976), whereas the within-subject variation ranged between 2.9 and 5.6%. The mentioned 6 agility tests resulted in the extraction of 2 significant components. The S4 x 5 test had the lowest correlation coefficient with the first component (r = 0.38), whereas the correlation coefficients of the other 5 agility tests were higher than 0.63. The T-test (TT) showed statistically significant differences between the defenders and midfielders (p agility tests used in this study, the SBF, TT, and S180 degrees are the most reliable and valid tests for estimating the agility of soccer players. According to the results of the study, the TT proved to be the most appropriate for estimating the agility of defenders, the SBF, and S180 degrees for estimating the agility of midfielders, whereas the S4 x 5 test can be used for estimating the agility of attackers.
Vargas, J.V.C. [Federal University of Parana (UFPR), Curitiba, PR (Brazil). Dept. of Mechanical Engineering], E-mail: firstname.lastname@example.org; Ordonez, J.C.; Martins, L.S. [Florida State University, Tallahassee, FL (United States). Center for Advanced Power Systems], Emails: email@example.com, firstname.lastname@example.org
In this paper, a simplified and comprehensive PEMFC mathematical model introduced in previous studies is experimentally validated. Numerical results are obtained for an existing set of commercial unit PEM fuel cells. The model accounts for pressure drops in the gas channels, and for temperature gradients with respect to space in the flow direction, that are investigated by direct infrared imaging, showing that even at low current operation such gradients are present in fuel cell operation, and therefore should be considered by a PEMFC model, since large coolant flow rates are limited due to induced high pressure drops in the cooling channels. The computed polarization and power curves are directly compared to the experimentally measured ones with good qualitative and quantitative agreement. The combination of accuracy and low computational time allow for the future utilization of the model as a reliable tool for PEMFC simulation, control, design and optimization purposes. (author)
Szucs, Kimberly A; Brown, Elena V Donoso
[Purpose] Measurement of posture is important for those with a clinical diagnosis as well as researchers aiming to understand the impact of faulty postures on the development of musculoskeletal disorders. A reliable, cost-effective and low tech posture measure may be beneficial for research and clinical applications. The purpose of this study was to determine rater reliability and construct validity of a posture screening mobile application in healthy young adults. [Subjects and Methods] Pictures of subjects were taken in three standing positions. Two raters independently digitized the static standing posture image twice. The app calculated posture variables, including sagittal and coronal plane translations and angulations. Intra- and inter-rater reliability were calculated using the appropriate ICC models for complete agreement. Construct validity was determined through comparison of known groups using repeated measures ANOVA. [Results] Intra-rater reliability ranged from 0.71 to 0.99. Inter-rater reliability was good to excellent for all translations. ICCs were stronger for translations versus angulations. The construct validity analysis found that the app was able to detect the change in the four variables selected. [Conclusion] The posture mobile application has demonstrated strong rater reliability and preliminary evidence of construct validity. This application may have utility in clinical and research settings.
Zurlo, Maria Clelia; Pes, Daniela; Capasso, Roberto
This study analyses the psychometric properties of the Italian version of the Teacher Stress Questionnaire elaborated in England by Travers and Cooper in 1996. This Italian survey was completed by 863 teachers randomly drawn from a cross-section of Italian school levels. The construct validity of the questionnaire was verified by factor analysis and by measuring the internal consistency of the single scales. All dimensions measured by the Teacher Stress Questionnaire were compared for sample subgroups of all teacher levels. Several meaningful and reliable factors emerged from the factor analysis of the scales. The internal consistency of each scale (Cronbach's alpha) revealed satisfactory values. Teachers' age and school level were determining factors for all dimensions of stress explored. The Italian version of the Teacher Stress Questionnaire showed satisfactory psychometric properties and constitutes a useful and reliable measure to analyse stress in Italian schools.
Pham, H.; Pham, M.
This report presents the results of the first phase of the ongoing EG&G Idaho, Inc. Software Reliability Research Program. The program is studying the existing software reliability models and proposes a state-of-the-art software reliability model that is relevant to the nuclear reactor control environment. This report consists of three parts: (1) summaries of the literature review of existing software reliability and fault tolerant software reliability models and their related issues, (2) proposed technique for software reliability enhancement, and (3) general discussion and future research. The development of this proposed state-of-the-art software reliability model will be performed in the second place. 407 refs., 4 figs., 2 tabs.
Pham, H.; Pham, M.
This report presents the results of the first phase of the ongoing EG G Idaho, Inc. Software Reliability Research Program. The program is studying the existing software reliability models and proposes a state-of-the-art software reliability model that is relevant to the nuclear reactor control environment. This report consists of three parts: (1) summaries of the literature review of existing software reliability and fault tolerant software reliability models and their related issues, (2) proposed technique for software reliability enhancement, and (3) general discussion and future research. The development of this proposed state-of-the-art software reliability model will be performed in the second place. 407 refs., 4 figs., 2 tabs.
Liu, Kena; Liu, Shaofeng; Zhou, Zhou; Ren, Qinyi; Zhong, Jie; Luo, Renzhong; Qin, Huabiao; Zhang, Siyi; Ge, Pingjiang
To evaluate the reliability and validity of the Chinese version of pediatric voice handicap index (pVHI). The original English version-pVHI was translated into Chinese. Parents of 52 children with voice dysphonia and 43 children with no history or symptoms of voice problems were asked to fill the Chinese pVHI questionnaires twice with an interval of 2 weeks. GRB (Grade, Roughness, Breathiness) scale was used for perceptual assessment by two otolaryngologists and one speech pathologist for each child's voice. The internal consistency was assessed using Cronbach's alpha coefficient. Pearson's correlation coefficient was used to evaluate the test-retest reliability. The Kendall's coefficient of concordance W was used to assess the consistency of GRB scores of 3 voice specialists. The nonparametric Mann-Whitney test was used to assess the differences between the dysphonia group and controls. The correlation between pVHI and GRB scores were assessed using Pearson's correlation coefficient. The internal consistency of total score and three subscales scores of Chinese pVHI were 0.788-0.944. The test-retest reliability was 0.631-0.887(P < .001). The pVHI scores of control group significantly were lower than the pathological group (P = .000). The GRB scores of 3 voice specialists have an excellent consistency (W = 0.694-0.807, P = .000). The pVHI scores positively correlated with GRB assessment (P < .01). The Chinese version of pVHI had a good reliability and validity. It can be applicable and useful supplementary tool for evaluating parents' perception of their children's dysphonia. Copyright © 2017. Published by Elsevier B.V.
Singh, C.; Patton, A. D.; Kim, Y.; Wagner, H.
This paper presents a methodology for the reliability evaluation of Space Station power system. The two options considered are the photovoltaic system and the solar dynamic system. Reliability models for both of these options are described along with the methodology for calculating the reliability indices.
Wikstrom, Erik A
Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Descriptive laboratory study. Sports medicine research laboratory. Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Participants completed a single-limb-stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with scores ranging from fair (ICC = 0.74) to poor (ICC = 0.29). Wii Fit balance activity scores had poor concurrent validity relative to COP outcomes and SEBT
The purpose of this study was to explore the potential of "invisible-gap filling" items primarily as an in-house achievement measure of reading-oriented courses and secondarily as a more general overall-ability measure. More specifically, it compared multiple-matching "invisible-gap filling" items and their "visible" counterparts in terms of item facility, item discrimination, test reliability, and test validity.Eighty-eight Japanese university 1st year students took a 25-item invisible-gap f...
The purpose of this study is to develop, and test the validity and reliability of a scale for the use of researchers to determine the accreditation standards of open and distance education based on the views of administrators, teachers, staff and students. This research was designed according to the general descriptive survey model since it aims…
Lichtenberg, Peter A; Teresi, Jeanne A; Ocepek-Welikson, Katja; Eimicke, Joseph P
The scarcity of empirically validated assessment instruments continues to impede the work of professionals in a number of fields, including medicine, finance, and estate planning; adult protective services; and criminal justice-and, more importantly, it impedes their ability to effectively assist and, in some case, protect their clients. Other professionals (e.g. legal, financial, medical, mental health services) are in a position to prevent financial exploitation and would benefit from access to new instruments. The Lichtenberg Financial Decision Screening Scale (LFDSS) was introduced in 2016, along with evidence for its convergent validity (Lichtenberg et al., 2016). Using a sample of 213 participants, this study investigated the internal consistency of the LFDSS and its criterion validity based on ratings by professionals using the scale. Results demonstrate that the LFDSS has excellent internal consistency and clinical utility properties. This paper provides support for use of the LFDSS as a reliable and valid instrument. The LFDSS and instructions for its use are included in the article, along with information about online tools and support.
Belafsky, Peter C; Mouadeb, Debbie A; Rees, Catherine J; Pryor, Jan C; Postma, Gregory N; Allen, Jacqueline; Leonard, Rebecca J
The Eating Assessment Tool is a self-administered, symptom-specific outcome instrument for dysphagia. The purpose of this study was to assess the validity and reliability of the 10-item Eating Assessment Tool (EAT-10). The investigation consisted of 4 phases: 1) line-item generation, 2) line-item reduction and reliability, 3) normative data generation, and 4) validity analysis. All data were collected prospectively. Internal consistency was assessed with the Cronbach alpha. Test-retest reliability was evaluated with the Pearson product moment correlation coefficient. Normative data were obtained by administering the instrument to a community cohort of healthy volunteers. Validity was assessed by administering the instrument before and after dysphagia treatment and by evaluating survey differences between normal persons and those with known diagnoses. A total of 629 surveys were administered to 482 patients. The internal consistency (Cronbach alpha) of the final instrument was 0.960. The test-retest intra-item correlation coefficients ranged from 0.72 to 0.91. The mean (+/- SD) EAT-10 score of the normal cohort was 0.40 +/- 1.01. The mean EAT-10 score was 23.58 +/- 13.18 for patients with esophageal dysphagia, 23.10 +/- 12.22 for those with oropharyngeal dysphagia, 9.19 +/- 12.60 for those with voice disorders, 22.42 +/- 14.06 for those with head and neck cancer, and 11.71 +/- 9.61 for those with reflux. The patients with oropharyngeal and esophageal dysphagia and a history of head and neck cancer had a significantly higher EAT-10 score than did those with reflux or voice disorders (p EAT-10 score of the patients with dysphagia improved from 19.87 +/- 10.5 to 5.2 +/- 7.4 after treatment (p EAT-10 has displayed excellent internal consistency, test-retest reproducibility, and criterion-based validity. The normative data suggest that an EAT-10 score of 3 or higher is abnormal. The instrument may be utilized to document the initial dysphagia severity and monitor the
Akenhead, Richard; French, Duncan; Thompson, Kevin G; Hayes, Philip R
To examine the validity and inter-unit reliability of 10 Hz GPS for measuring instantaneous velocity during maximal accelerations. Experimental. Two 10 Hz GPS devices secured to a sliding platform mounted on a custom built monorail were towed whilst sprinting maximally over 10 m. Displacement of GPS devices was measured using a laser sampling at 2000 Hz, from which velocity and mean acceleration were derived. Velocity data was pooled into acceleration thresholds according to mean acceleration. Agreement between laser and GPS measures of instantaneous velocity within each acceleration threshold was examined using least squares linear regression and Bland-Altman limits of agreement (LOA). Inter-unit reliability was expressed as typical error (TE) and a Pearson correlation coefficient. Mean bias ± 95% LOA during accelerations of 0-0.99 ms(-2) was 0.12 ± 0.27 ms(-1), decreasing to -0.40 ± 0.67 ms(-1) during accelerations >4 ms(-2). Standard error of the estimate ± 95% CI (SEE) increased from 0.12 ± 0.02 ms(-1) during accelerations of 0-0.99 ms(-2) to 0.32 ± 0.06 ms(-1) during accelerations >4 ms(-2). TE increased from 0.05 ± 0.01 to 0.12 ± 0.01 ms(-1) during accelerations of 0-0.99 ms(-2) and >4 ms(-2) respectively. The validity and reliability of 10 Hz GPS for the measurement of instantaneous velocity has been shown to be inversely related to acceleration. Those using 10 Hz GPS should be aware that during accelerations of over 4 ms(-2), accuracy is compromised. Copyright © 2013 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
In educational research that calls itself empirical, the relationship between validity and reliability is that of trade-off: the stronger the bases for validity, the weaker the bases for reliability (and vice versa). Validity and reliability are widely regarded as basic criteria for evaluating research; however, there are ethical implications of…
Full Text Available Objectives: The aim of this study was to evaluate the psychometric features of the Persian version of the Autism Behavior Checklist (ABC. Method:The International Quality of Life Assessment (IQOLA approach was used to translate the English ABC into Persian. A total sample of 184 parents of children including 114 children with autism disorder (mean age =7.21, SD =1.65 and 70 typically developing children (mean age = 6.82, SD =1.75 completed the ABC. Internal consistency, test-retest reliability, concurrent and discriminant validity, and cut-off score were assessed. Results: The results of this study revealed that the Persian version of the ABC has an acceptable degree of internal consistency (.73. Test–retest comparisons using interclass correlation confirmed the instrument’s time stability (.83. The instrument’s concurrent validity with Gilliam Autism Rating Scale (GARS was verified; the correlation between total scores was .94. In the discriminant validity, the autism group had significantly higher scores compared to the normal group. Receiver Operating Characteristic (ROC analysis revealed that individuals with total scores below 25 are less likely to be in the autism group. Conclusion:The Persian version of the ABC can be used as an initial screening tool in clinical contexts.
Conclusion: Only seven studies calculated validity coefficients within the study whereas 47 cited the validity coefficient. Twenty-six calculated a reliability coefficient whereas 47 cited the reliability of the ED measures. Four studies found validity evidence for the EAT, EDI, BULIT-R, QEDD, and EDE-Q in an athlete population. Few studies reviewed calculated validity and reliability coefficients of ED measures. Cross-validation of these measures in athlete populations is clearly needed.
Veder, Laura; Pullens, Bas; Timmerman, Marieke; Hoeve, Hans; Joosten, Koen; Hakkesteegt, Marieke
The pediatric voice handicap index (pVHI) has been developed to provide a better insight into the parents' perception of their child's voice related quality of life. The purpose of the present study was to validate the Dutch pVHI by evaluating its internal consistency and reliability. Furthermore, we determined the optimal cut-off point for a normal pVHI score. All items of the English pVHI were translated into Dutch. Parents of children in our dysphonic and control group were asked to fill out the questionnaire. For the test re-test analysis we used a different study group who filled out the pVHI twice as part of a large follow up study. Internal consistency was analyzed through Cronbach's α coefficient. The test-retest reliability was assessed by determining Pearson's correlation coefficient. Mann-Whitney test was used to compare the scores of the questionnaire of the control group with the dysphonic group. By calculating receiver operating characteristic (ROC) curves, sensitivity and specificity we were able to set a cut-off point. We obtained data from 122 asymptomatic children and from 79 dysphonic children. The scores of the questionnaire significantly differed between both groups. The internal consistency showed an overall Cronbach α coefficient of 0.96 and an excellent test-retest reliability of the total pVHI questionnaire with a Pearson's correlation coefficient of 0.90. A cut-off point for the total pVHI questionnaire was set at 7 points with a specificity of 85% and sensitivity of 100%. A cut-off point for the VAS score was set at 13 with a specificity of 93% and sensitivity of 97%. The Dutch pVHI is a valid and reliable tool for the assessment of children with voice problems. By setting a cut-off point for the score of the total pVHI questionnaire of 7 points and the VAS score of 13, the pVHI might be used as a screening tool to assess dysphonic complaints and the pVHI might be a useful and complementary tool to identify children with dysphonia
Full Text Available The purpose of this study is to develop a scale unique to our culture, concerning individual instrument performance anxiety of the students who are getting instrument training in the Department of Music Education. In the study, the descriptive research model is used and qualitative research techniques are utilized. The study population consists of the students attending the 23 universities which has Music Education Department. The sample of the study consists of 438 girls and 312 boys, totally 750 students who are studying in the Department of Music Education of randomly selected 10 universities. As a result of the explanatory and confirmatory factor analyses that were performed, a one-dimensional structure consisting of 14 items was obtained. Also, t-scores and the coefficient scores of total item correlation concerning the distinguishing power of the items, the difference in the scores of the set of lower and upper 27% was calculated, and it was observed that the items are distinguishing as a result of both analyses. Of the scale, Cronbach's alpha coefficient of internal consistency was calculated as .94, and test-retest reliability coefficient was calculated as .93. As a result, a valid and reliable assessment and evaluation instrument that measures the exam performance anxiety of the students studying in the Department of Music Education, has been developed.Extended AbstractsIntroductionAnxiety is a universal phenomenon which people experience once or a few times during lives. It was accepted as concern for the future or as an unpleasant emotional experience regarding probable hitches of the events (Di Tomasso & Gosch, 2002.In general, the occasions on which negative feelings are experienced cause anxiety to arise (Baltaş and Baltaş, 2000. People also feel anxious in dangerous situations. Anxiety may lead a person to be creative, while it may have hindering characteristics. Anxiety is that an individual considers him
Lillo-Bevia, José R; Pallarés, Jesús G
To validate the new drive indoor trainer Hammer designed by Cycleops®. Eleven cyclists performed 44 randomized and counterbalanced graded exercise tests (100-500W), at 70, 85 and 100 rev.min -1 cadences, in seated and standing positions, on 3 different Hammer units, while a scientific SRM system continuously recorded cadence and power output data. No significant differences were detected between the three Hammer devices and the SRM for any workload, cadence, or pedalling condition (P value between 1.00 and 0.350), except for some minor differences (P 0.03 and 0.04) found in the Hammer 1 at low workloads, and for Hammer 2 and 3 at high workloads, all in seated position. Strong ICCs were found between the power output values recorded by the Hammers and the SRM (≥0.996; P=0.001), independently from the cadence condition and seated position. Bland-Altman analysis revealed low Bias (-5.5-3.8) and low SD of Bias (2.5-5.3) for all testing conditions, except marginal values found for the Hammer 1 at high cadences and seated position (9.6±6.6). High absolute reliability values were detected for the 3 Hammers (150-500W; CV<1.2%; SEM<2.1). This new Cycleops trainer is a valid and reliable device to drive and measure power output in cyclists, providing an alternative to larger and more expensive laboratory ergometers, and allowing cyclists to use their own bicycle.
REENA JHARANIYA; M.AHFAZ KHAN
In electrical power network, transformer is one of the most important electrical equipment in power system, which running status is directly concerned with the reliability of power system. Reliability of a power system is considerably influenced by its equipments. Power transformers are one of the most critical and expensive equipments of a power system and their proper functions are vital for the substations and utilities .Therefore, reliability model of power transformer is very important i...
Palumbo, Daniel L.; Nicol, David M.
An effort has been underway for several years at NASA's Langley Research Center to extend the capability of Markov modeling techniques for reliability analysis to the designers of highly reliable avionic systems. This effort has been focused in the areas of increased model abstraction and increased computational capability. The reliability model generator (RMG), a software tool which uses as input a graphical, object-oriented block diagram of the system, is discussed. RMG uses an automated failure modes-effects analysis algorithm to produce the reliability model from the graphical description. Also considered is the ASSURE software tool, a parallel processing program which uses the ASSIST modeling language and SURE semi-Markov solution technique. An executable failure modes-effects analysis is used by ASSURE. The successful combination of the power of graphical representation, automated model generation, and parallel computation leads to the conclusion that large system architectures can now be analyzed.
Full Text Available Objective: (i To develop the Pharmacy Value-Added Services Questionnaire (PVASQ using emerging themes generated from interviews. (ii To establish reliability and validity of questionnaire instrument. Methods: Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT to adopt pharmacy value-added services is predicted by attitudes (ATT, subjective norms (SN, perceived behavioral control (PBC, knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25 was assessed by administrating the questionnaire instrument twice at an interval of one week apart. Internal consistency was measured by Cronbach’s alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC. Confirmatory Factor Analysis, CFA (N=410 was conducted to assess construct validity of the PVASQ. Results: The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest reliability was good. The overall Cronbach’ s alpha (N=25 is 0.912 and 0.908 for the two time points. The result of CFA (N=410 showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. Conclusions: This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients’ intention to adopt pharmacy value-added services to collect partial
Tan, Christine L; Hassali, Mohamed A; Saleem, Fahad; Shafie, Asrul A; Aljadhey, Hisham; Gan, Vincent B
(i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of questionnaire instrument. Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administrating the questionnaire instrument twice at an interval of one week apart. Internal consistency was measured by Cronbach's alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory Factor Analysis, CFA (N=410) was conducted to assess construct validity of the PVASQ. The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach' s alpha (N=25) is 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients' intention to adopt pharmacy value-added services to collect partial medicine supply.
Examination of the reliability and validity of research tools is an important process in nursing research. As all scales are developed on the basis of a specific set of concepts or application to a specific population, great care must be taken in the translation and handling of questionnaires developed in different languages or, in particular, by people from different cultures. Only after passing through an integral process of repetitive translation and reliability/validity examinations can they be regarded as accurate research tools and be widely utilized. This study exercises Brislin's revised Translation Model to overcome cross-cultural differences, and also uses a new research tool developed overseas, the Diabetes Management Self-Efficacy Scale (DMSES), as an example for explaining the two-stage design of the translation and reliability/validity process. The First Stage is the translation stage, which involves forward translation, consensus meetings, backward translation, and review by potential users. The Second Stage involves psychometric analysis, which includes validity and reliability testing. Such careful processes should reveal the problems encountered when adopting foreign scales for domestic use, and thereby increase the "nativeness" and adaptability of these scales.
Chang, Yaw-Wen; Chang, Ying-Hsue; Pan, Yu-Ling; Kao, Tung-Wei; Kao, Senyeong
A comprehensive fall risk assessment can provide information for effective prevention and intervention measures and reduce falls among hospitalized elderly people. The purpose of this study was to develop a Chinese version of an inpatient fall risk assessment tool and evaluate its validity and reliability.This study employed the Falls Risk for Hospitalised Older People (FRHOP) assessment to construct a FRHOP-Taiwan Version (Tw-FRHOP) through forward, synthesized, and backward translation. A face validation was conducted by 5 clinical nurses and a content validation was conducted by 5 specialists using the content validity index (CVI) to validate the proposed model. Thirty hospitalized older adults in an internal care unit were selected for an interrater reliability assessment, conducted separately by specialists in 4 disciplines (i.e., nurses, physicians, occupational therapists, and physiotherapists) by using Cohen kappa statistic and intraclass correlation coefficients (ICCs). Specifically, the assessment rating developed in the Tw-FRHOP was compared with the Morse Fall Scale (MFS), St. Thomas Risk Assessment Tool in Falling Elderly Inpatients (STRATIFY), and the Hendrich II Fall Risk Model (HIIFRM) for criterion validation.According to the analysis results, the CVI was 0.94, and the indexes of criterion-related validity for the FRHOP-Taiwan Version, MFS, STRATIFY, and HIIFRM were 0.49, 0.63, and 0.54 (all P fall risk assessment that can serve as a satisfactorily valid and reliable reference tool for medical personnel with full professional training, as well as inpatient fall prevention interventions for multidisciplinary teams in hospitals.
Park, Dae-Sung; Lee, GyuChang
Background A balance test provides important information such as the standard to judge an individual’s functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment softwar...
Mohammad, Ashiru Hamza; Al Sadat, Nabilla; Loh, Siew Yim; Chinna, Karuthan
Background: Social support has been identified as one of the key factors for enhancing the quality of life after stroke. However, a scientific tool that is valid for evaluating social support among stroke survivors in Nigeria has not been developed so far. Objectives: The objective of this study was to examine the psychometric properties of the Hausa translated versions of the Multidimensional Scale of Perceived Social Support (MSPSS) and to validate it for future use in clinical research in Nigeria. Patients and Methods: This cross-sectional study was conducted on a sample of 140 adult stroke survivors attending rehabilitation services at stroke referral centers in Kano, Nigeria. The psychometric analysis of the Hausa-MSPSS involved face, content, criterion and construct validity tests, as well as internal and test-retest reliability. Confirmatory factor analysis was performed to assess validity of the three-factor (Family, Friends and Significant others) model for the Hausa-MSPSS. Results: Based on expert panel, clinicians’ review and patients’ feedback, the 12 item Hausa-MSPSS had sufficient face, content and criterion validity. In reliability analysis, the Cronbach’s alpha was 0.781. In test-retest reliability analysis, the minimum kappa value was 0.673. In Confirmatory factor analysis, the three-factor model was not acceptable. An alternative two-factor model with Family and Friends, as the two domain was found to be acceptable (Chi-square/df 0.9, RMSEA < 0.08). The average variances extracted for the two constructs were 0.5 and 0.7, respectively. The R-squared value between the two constructs was 0.397. Cross validity tested using 1000 bootstrap resamples gave satisfactory results (P = 0.079). Conclusions: The 11 item Hausa-MSPSS index is valid for the assessment of perceived social support among stroke survivors in Nigeria. PMID:25838933
García-López, Luis M.; Gutiérrez, David; Pastor, Juan Carlos; Romo, Vicente
The aim of this study is to develop a scale to measure primary and secondary school teachers' perception of a competence-based curriculum model in Spain. After reviewing the literature, we designed an initial questionnaire with 37 five-point Likert-type items. This was then reviewed by a panel of experts. A pilot test was conducted with 100…
Thirugnana Murthy, D.; Murali, N.; Sridevi, T.; Satya Murty, S.A.V.; Velusamy, K.
The demand for complex software systems has increased more rapidly than the ability to design, implement, test, and maintain them, and the reliability of software systems has become a major concern for our, modern society.Software failures have impaired several high visibility programs in space, telecommunications, defense and health industries. Besides the costs involved, it setback the projects. The ways of quantifying it and using it for improvement and control of the software development and maintenance process. This paper discusses need for systematic approaches for measuring and assuring software reliability which is a major share of project development resources. It covers the reliability models with the concern on 'Reliability Growth'. It includes data collection on reliability, statistical estimation and prediction, metrics and attributes of product architecture, design, software development, and the operational environment. Besides its use for operational decisions like deployment, it includes guiding software architecture, development, testing and verification and validation. (author)
Janssen, Ellen M; Marshall, Deborah A; Hauber, A Brett; Bridges, John F P
The recent endorsement of discrete-choice experiments (DCEs) and other stated-preference methods by regulatory and health technology assessment (HTA) agencies has placed a greater focus on demonstrating the validity and reliability of preference results. Areas covered: We present a practical overview of tests of validity and reliability that have been applied in the health DCE literature and explore other study qualities of DCEs. From the published literature, we identify a variety of methods to assess the validity and reliability of DCEs. We conceptualize these methods to create a conceptual model with four domains: measurement validity, measurement reliability, choice validity, and choice reliability. Each domain consists of three categories that can be assessed using one to four procedures (for a total of 24 tests). We present how these tests have been applied in the literature and direct readers to applications of these tests in the health DCE literature. Based on a stakeholder engagement exercise, we consider the importance of study characteristics beyond traditional concepts of validity and reliability. Expert commentary: We discuss study design considerations to assess the validity and reliability of a DCE, consider limitations to the current application of tests, and discuss future work to consider the quality of DCEs in healthcare.
Juliana Alvares Duarte Bonini Campos
Full Text Available The aim of this study was to assess the validity and reliability of the Fonseca Anamnestic Index (IAF, used to assess the severity of temporomandibular disorders, applied to Brazilian women. We used a probabilistic sampling design. The participants were 700 women over 18 years of age, living in the city of Araraquara (SP. The IAF questionnaire was applied by telephone interviews. We conducted Confirmatory Factor Analysis (CFA using Chi-Square Over Degrees of Freedom (χ2/df, Comparative Fit Index (CFI, Tucker-Lewis Index (TLI, and Root Mean Square Error of Approximation (RMSEA as goodness of fit indices. We calculated the convergent validity, the average variance extracted (AVE and the composite reliability (CR. Internal consistency was assessed by Cronbach's alpha coefficient (α.The factorial weights of questions 8 and 10 were below the adequate values. Thus, we refined the original model and these questions were excluded. The resulting factorial model showed appropriate goodness of fit to the sample (χ2/df = 3.319, CFI = 0.978, TLI = 0.967, RMSEA = 0.058. The convergent validity (AVE = 0.513, CR = 0.878 and internal consistency (α = 0.745 were adequate. The reduced IAF version showed adequate validity and reliability in a sample of Brazilian women.
Nicholas A. Petrunoff
Full Text Available Background. The purpose of this study was to assess the (previously untested reliability and validity of survey questions commonly used to assess travel mode and travel time. Methods. Sixty-five respondents from a staff survey of travel behaviour conducted in a south-western Sydney hospital agreed to complete a travel diary for a week, wear an accelerometer over the same period, and twice complete an online travel survey an average of 21 days apart. The agreement in travel modes between the self-reported online survey and travel diary was examined with the kappa statistic. Spearman’s correlation coefficient was used to examine agreement of travel time from home to workplace measured between the self-reported online survey and four-day travel diary. Moderate-to-vigorous physical activity (MVPA time of active and nonactive travellers was compared by t-test. Results. There was substantial agreement between travel modes (K=0.62, P<0.0001 and a moderate correlation for travel time (ρ=0.75, P<0.0001 reported in the travel diary and online survey. There was a high level of agreement for travel mode (K=0.82, P<0.0001 and travel time (ρ=0.83, P<0.0001 between the two travel surveys. Accelerometer data indicated that for active travellers, 16% of the journey-to-work time is MVPA, compared with 6% for car drivers. Active travellers were significantly more active across the whole workday. Conclusions. The survey question “How did you travel to work this week? If you used more than one transport mode specify the one you used for the longest (distance portion of your journey” is reliable over 21 days and agrees well with a travel diary.
Full Text Available The present study examined the Active Listening Attitude Scale (ALAS validity and reliability in a sample of 3955 Greek educators. The sample was randomly split and an exploratory factor analysis (EFA was conducted in the even subsample to evaluate the scale’s construct validity. A confirmatory factor analysis (CFA was performed in the odd subsample to confirm the three-factor model identified by the EFA. The chi square test (χ2 of the model was significant (p < 0.05, due to the large sample size. The root mean square error of approximation (RMSEA, the comparative fit index (CFI and the goodness of fit index (GFI values were 0.079, 0.969 and 0.960, respectively, further supporting the fit of the three-factor model. Cronbach’s alpha coefficient was used to test internal consistency reliability and was satisfactory exceeding 0.72 for ALAS subscales. The intercorrelations of the three subscales were all positive and significant (p < 0.001, ranging from 0.20 to 0.42. Student’s t-tests and the computation of effect sizes revealed that women scored higher on Listening Skill and Conversation Opportunity, while principals and participants trained on mental health promotion scored higher on all three subscales. The analyses confirmed the three-factor model of ALAS and demonstrated its validity and reliability in measuring Greek teachers’ active listening attitudes.
Full Text Available Backround and design. Internalized stigma involves endorsing negative feelings and beliefs such as insignificance, shame and withdrawal triggered by applying these negative stereotypes to one self. Internalized Stigma Scale has not been applied to psoriasis patients. We aimed to evaluate the reliability and validity of Internalized Stigma Scale in psoriasis patients. Materials and Methods. 100 consecutive, volunteer psoriasis patients (48 female, 52 male; aged, 40.59±15.44 years were enrolled in the study. PASI and BSA were evaluated by physician (A.B.. Patients responded contemporaneously to Psoriasis Internalized Stigma Scale (PISS, DQoL, and Perceived Health Status (PHS, single-item self-rated general health question, of which Likert scores 1, 2, and 3 were classified as “from fair to very poor”, and 4, 5 as “good”. Results. Cronbach's alpha coefficient of PISS subscales was 0.83 for alienation, 0.70 for stereotype endorsement, 0.70 for perceived discrimination, 0.84 for social withdrawal and 0.68 for stigma resistance. The same value was 0.89 for the total scale. PISS and DQoL scores mean values were 58.8±12.6 and 10.0±9.4, respectively. PISS was significantly correlated with the patients' DQoL scores (r=,726, p=0,001. PISS was also significantly correlated with disease duration (r=,209, p=0,047. There was no any significant relationship between PASI or BSA and PISS. Mean DQoL scores in patients reporting their PHS as “from fair to very poor” and “good” were 12.1±7.3 and 5.0±4.3, respectively. Mean values of PISS in patients reporting their PHS as “from fair to very poor” was significantly increased compared with patients reporting their PHS as “good” (p=0.001. Conclusion. PISS can be used as a reliable and valid tool in assesing internalized stigmatization in psoriasis patients. Our results indicate a high level of stigmatization in psoriasis patients. Low DQoL scores show a correlation with increased levels of
Chiba, Hiromi; Nagamitsu, Shinichiro; Sakurai, Rieko; Mukai, Takayo; Shintou, Hiroko; Koyanagi, Kenshi; Yamashita, Yushiro; Kakuma, Tatsuyuki; Uchimura, Naohisa; Matsuishi, Toyojiro
Eating disorders (ED) are serious psychosomatic disorders that commonly occur in girls during adolescence. An increase in earlier onset ED has recently been suggested. Therefore, accurate assessment of eating attitudes in children is a necessary part of school mental health. The 26-item Children's Eating Attitudes Test (ChEAT-26) is widely used internationally to assess abnormal eating attitudes. The present study aimed to validate the Japanese version of the ChEAT-26. Participants were 7076 school children (aged 10-15years) from large, medium-sized, and small cities, and 44 children with anorexia nervosa. We examined the average ChEAT-26 score by participant attributes, including sex, age, geographical region, and school style. Factor analysis of the ChEAT-26 content was performed with varimax rotation. The optimal cut-off point was evaluated using receiver operating characteristic (ROC) analysis. The mean ChEAT-26 score was 7.94 for girls and 5.86 for boys. The mean score was significantly higher in children from larger cities than small cities, and was higher with increasing age, and private schools. Five factors explained 31.4% of the variance. The Cronbach's alpha was 0.81 for the scale. The area under the ROC curve was 0.83; sensitivity was 0.69 and specificity was 0.93 for a cut-off score of 18. The Japanese version of the ChEAT-26 is a reliable and valid psychometric tool that may be useful in the triage and assessment of children with anorexia nervosa. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Frank, Guido K W; Favaro, Angela; Marsh, Rachel; Ehrlich, Stefan; Lawson, Elizabeth A
Human brain imaging can help improve our understanding of mechanisms underlying brain function and how they drive behavior in health and disease. Such knowledge may eventually help us to devise better treatments for psychiatric disorders. However, the brain imaging literature in psychiatry and especially eating disorders has been inconsistent, and studies are often difficult to replicate. The extent or severity of extremes of eating and state of illness, which are often associated with differences in, for instance hormonal status, comorbidity, and medication use, commonly differ between studies and likely add to variation across study results. Those effects are in addition to the well-described problems arising from differences in task designs, data quality control procedures, image data preprocessing and analysis or statistical thresholds applied across studies. Which of those factors are most relevant to improve reproducibility is still a question for debate and further research. Here we propose guidelines for brain imaging research in eating disorders to acquire valid results that are more reliable and clinically useful. © 2018 Wiley Periodicals, Inc.
Borges-Yáñez, S Aída; Maupomé, Gerardo; Jiménez-García, Gustavo
To compare the extent and severity index (ESI) with a gold standard represented by actual readings of loss of attachment on six sites around all teeth present (excluding third molars). Five standardized dentists (kappa=0.6) examined 712 subjects > or =20 years of age at a dental school (1993-1995). Sensitivity, specificity, positive and negative predictive values, and true and apparent prevalence were established. True severe periodontitis prevalence was 95.8%. ESI underestimated the severity (0.1 mm), extent (4%), and prevalence (16%) of periodontitis. The severity, as established by ESI, coincided 23.4% with the gold standard. ESI failed to identify 16.7% of subjects with severe periodontal disease, but specificity and positive predictive value were very high. The underestimation of severe periodontitis through ESI may lead to inadequate recommendations for further treatment. Accurately identifying subjects with severe periodontitis requires a full-mouth examination. Because the ESI relies on measurements taken on only 28 periodontal sites to estimate the periodontitis status of the entire mouth, the validity and reliability of ESI may be modified by the prevalence of severe periodontal disease and the distribution of disease according to age and operational definitions. Copyright Blackwell Munksgaard, 2004.
Full Text Available Objective: To translate the Perceived Stress Scale (versions PSS-4, -10 and -14 and to assess its psychometric properties in a sample of general Greek population. Methods: 941 individuals completed anonymously questionnaires comprising of PSS, the Depression Anxiety and Stress scale (DASS-21 version, and a list of stress-related symptoms. Psychometric properties of PSS were investigated by confirmatory factor analysis (construct validity, Cronbach’s alpha (reliability, and by investigating relations with the DASS-21 scores and the number of symptoms, across individuals’ characteristics. The two-factor structure of PSS-10 and PSS-14 was confirmed in our analysis. We found satisfactory Cronbach’s alpha values (0.82 for the full scale for PSS-14 and PSS-10 and marginal satisfactory values for PSS-4 (0.69. PSS score exhibited high correlation coefficients with DASS-21 subscales scores, meaning stress (r = 0.64, depression (r = 0.61, and anxiety (r = 0.54. Women reported significantly more stress compared to men and divorced or widows compared to married or singled only. A strong significant (p < 0.001 positive correlation between the stress score and the number of self-reported symptoms was also noted. Conclusions: The Greek versions of the PSS-14 and PSS-10 exhibited satisfactory psychometric properties and their use for research and health care practice is warranted.
Andreou, Eleni; Alexopoulos, Evangelos C; Lionis, Christos; Varvogli, Liza; Gnardellis, Charalambos; Chrousos, George P; Darviri, Christina
To translate the Perceived Stress Scale (versions PSS-4, -10 and -14) and to assess its psychometric properties in a sample of general Greek population. 941 individuals completed anonymously questionnaires comprising of PSS, the Depression Anxiety and Stress scale (DASS-21 version), and a list of stress-related symptoms. Psychometric properties of PSS were investigated by confirmatory factor analysis (construct validity), Cronbach's alpha (reliability), and by investigating relations with the DASS-21 scores and the number of symptoms, across individuals' characteristics. The two-factor structure of PSS-10 and PSS-14 was confirmed in our analysis. We found satisfactory Cronbach's alpha values (0.82 for the full scale) for PSS-14 and PSS-10 and marginal satisfactory values for PSS-4 (0.69). PSS score exhibited high correlation coefficients with DASS-21 subscales scores, meaning stress (r = 0.64), depression (r = 0.61), and anxiety (r = 0.54). Women reported significantly more stress compared to men and divorced or widows compared to married or singled only. A strong significant (p stress score and the number of self-reported symptoms was also noted. The Greek versions of the PSS-14 and PSS-10 exhibited satisfactory psychometric properties and their use for research and health care practice is warranted.
Dogan, Tayfun; Cetin, Bayram
The purpose of the present study was to investigate the reliability and validity of the Turkish version of the Tromso Social Intelligence Scale (TSIS) developed by Silvera, Martinussen, and Dahl (2001). 719 students from Sakarya University participated in the study. Construct validity and criterion related validity and reliability were assessed.…
Sher, Kenneth J.; And Others
Interrelated analyses were conducted with more than 4,000 college students to examine the reliability and validity of the Tridimensional Personality Questionnaire (TPQ) and to develop and validate a short version of the scale. Results provide moderate support for the reliability and validity of both the TPQ and the short form. (SLD)
Ay, Zuhal Yetkin; Gündogar, Duru; Büyüköztürk, Sener; Gadbury-Amyot, Cynthia
The aims of the present study were to 1) investigate patient anxiety as it relates to dental hygiene procedures and 2) conduct validity and reliability testing of the Turkish version of the Dental Hygiene Fear Survey (DHFS) to test its psychometric properties in the Turkish culture. The study consisted of 200 periodontal patients. The patients completed the DHFS, the Modified Dental Anxiety Scale (MDAS), and a questionnaire designed to elicit sociodemographic and clinical characteristics. Reliability was estimated by Cronbach's alpha. Validity was examined by correlating the DHFS with measures found in the literature that correlate with dental fear and anxiety. Construct validity was determined using exploratory factor analysis with a direct oblimin (oblique) rotation, which revealed a model with three factors accounting for 62.8 of the variance when item 11, which has the lowest item total correlation, was omitted. The Cronbach alpha coefficients of the subscales in the fifteen-item scale (generalized anxiety, dental hygiene specific anxiety, and catastrophe) were 0.83, 0.86, and 0.79, respectively. The overall reliability of the DHFS was 0.91. Significant negative correlations were found between total DHFS scores and age (pvalid and reliable instrument and may be included in assessments designed to measure fear related to dental hygiene treatment procedures.
Full Text Available Aiming to resolve the problems of a variety of uncertainty variables that coexist in the engineering structure reliability analysis, a new hybrid reliability index to evaluate structural hybrid reliability, based on the random–fuzzy–interval model, is proposed in this article. The convergent solving method is also presented. First, the truncated probability reliability model, the fuzzy random reliability model, and the non-probabilistic interval reliability model are introduced. Then, the new hybrid reliability index definition is presented based on the random–fuzzy–interval model. Furthermore, the calculation flowchart of the hybrid reliability index is presented and it is solved using the modified limit-step length iterative algorithm, which ensures convergence. And the validity of convergent algorithm for the hybrid reliability model is verified through the calculation examples in literature. In the end, a numerical example is demonstrated to show that the hybrid reliability index is applicable for the wear reliability assessment of mechanisms, where truncated random variables, fuzzy random variables, and interval variables coexist. The demonstration also shows the good convergence of the iterative algorithm proposed in this article.
Anagnostopoulos, Fotios; Dimitrakaki, Christine; Niakas, Dimitris; Tountas, Yannis
The purpose was to assess the validity and reliability of health belief model variables (perceived benefits, perceived barriers, and self-efficacy beliefs) that are often the target of mammography screening interventions. Based on Champion's health belief model scales, questionnaire data were obtained from a nationally representative sample of 408 Greek women, 40 years of age or older and with no personal history of cancer. Using confirmatory factor analysis, we found all items to load significantly on their respective factors. Standardized factor loadings ranged from 0.55 to 0.88. An analysis of localized areas of respecification of a three-factor model revealed that measurement error correlations between a few items existed. Known-groups and criterion-related validity was supported for all scales, while internal consistency reliability of the scales was satisfactory, with Cronbach's alpha ranging from 0.71 to 0.88. The Greek version of the scales measuring perceived benefits, perceived barriers, and self-efficacy beliefs is a reliable and valid tool when used to assess health beliefs for mammography screening among women. Implications for intervention strategies aimed at increasing the likelihood of mammography utilization are considered. Copyright © 2012 John Wiley & Sons, Ltd.
Leticia de Matos Malavasi
Full Text Available The lack of adherence to practice physical activities urges several researchers to ind answers for this matter. Among these researches, it is investigated how or what motivates people to perform any type of physical activity. Besides that, the environmental conditions are an important reason to establish a healthier lifestyle among individuals. In Brazil, the amount of validated scales about environmental barriers for physical activity in communities is restricted. The validation and the cultural adaptation of these instruments are important not only to compare with studies from other countries, but mainly for planning public politics to improve the adherence to practice physical activities. Thus, the present research aimed to analyze the validity and reliability of the Brazilian version of the Neighborhood EnvironmentWalkability Scale (NEWS. The methodological procedures were structured in three stages. The first stage had the following procedures: translation of NEWS and back-translation by bilingual specialists. The second stage was the adaptation of NEWS to the Brazilian reality through a pilot study and with reliability. The third stage, together with a professional urban panel indicating which neighborhoods had better or worse mobility, it was accomplished a application of the NEWS questionnaire to assure construct validation. The sample of this research were separated in two parts, 75persons for the reliability; and for the validity of the questionnaire 200 residents from the four neighborhoods pointed by the specialists of the city of Florianópolis (SC. Through the NEWS the subjects answered questions about the neighborhoods regarding: type of residences, stores and trade proximity, perception of access to these places, streets characteristics, facilities to walk and ride bicycle, and safety related to traffic and crimes. The statistical analysis was made in the SPSS 11.0 version for the intra-class correlation and reliability for the
Ananda, M.M.A.; Singh, A.K.; Flueck, J.A.
The Weibull distribution is widely used in reliability literature as a distribution of time to failure, as it allows for both increasing failure rate (IFR) and decreasing failure rate (DFR) models. It has also been used to develop models for an engineered barrier system (EBS), which is known to be one of the key components in a deep geological repository for high level radioactive waste (HLW). The EBS failure time can more realistically be modelled by an IFR distribution, since the failure rate for the EBS is not expected to decrease with time. In this paper, we use an IFR distribution to develop a reliability model for the EBS
Le Guilly, Thibaut; Olsen, Petur; Ravn, Anders Peter
This chapter presents a process to design and validate models of reactive systems in the form of communicating timed automata. The models are extended with faults associated with probabilities of occurrence. This enables a fault tree analysis of the system using minimal cut sets that are automati......This chapter presents a process to design and validate models of reactive systems in the form of communicating timed automata. The models are extended with faults associated with probabilities of occurrence. This enables a fault tree analysis of the system using minimal cut sets...... that are automatically generated. The stochastic information on the faults is used to estimate the reliability of the fault affected system. The reliability is given with respect to properties of the system state space. We illustrate the process on a concrete example using the Uppaal model checker for validating...... the ideal system model and the fault modeling. Then the statistical version of the tool, UppaalSMC, is used to find reliability estimates....
Mann, Joanna; Henley, William; O'Mahen, Heather; Ford, Tamsin
Depression could be considered to be on a continuum with well-being and some have argued that it is important to measure well-being as well as distress. The Everyday Feelings Questionnaire was designed to measure both these aspects. Its validity has been assessed in a nonclinical population. This project aims to assess the validity and reliability of the EFQ in a clinical population. The EFQ was completed by 105 clients within a mental health clinical setting. The following aspects of the EFQ were explored: its internal structure, concurrent validity, re-test reliability and internal consistency. The EFQ had good internal consistency and correlated highly with other measures of anxiety and depression. The correlation between total EFQ scores on the two occasions was reasonable and there was no effect of time during completion. A Bland-Altman plot showed no obvious pattern between the difference between EFQ scores and the mean score. A one factor model showed a moderate fit to the data. This study does not explore the acceptability or sensitivity to change of the EFQ, and a larger sample size would be needed to extend the analysis conducted. The EFQ is a valid and reliable measure when used in this clinical population. Copyright © 2012 Elsevier B.V. All rights reserved.
Full Text Available Cybercrime is a growing and worrisome problem, particularly when it involves minors. Cyber aggression among adolescents in particular can result in negative legal and psychological consequences for people involved. Therefore, it is important to have instruments to detect these incidents early and understand the problem to propose effective measures for prevention and treatment. This paper aims to design a new self-report, the Cyber-Aggression Questionnaire for Adolescents (CYBA, to evaluate the extentto which the respondent conducts aggressions through a mobile phone or the internet and analyse the factorial and criterion validity and reliability of their scores in a sample of adolescents from Asturias, Spain. The CYBA was administered to 3,148 youth aged between 12 and 18 years old along with three self-reports to measure aggression at school, impulsivity, and empathy. Regarding factorial validity, the model that best represents the structure of the CYBA consists of three factors (Impersonation, Visual sexual Cyber-aggression, and Verbal Cyber-aggression and Exclusion and four additional indicators of Visual Cyber-aggression–Teasing/Happy Slapping. Regarding criterion validity, the score on the CYBA correlates positively with aggression at school and impulsivity and negatively with empathy. That is the way cyber-aggression correlates with these three variables, according to previous empirical evidence. The reliability of the scores on each item and factor of the CYBA are adequate. Therefore, the CYBA offers a valid and reliable measure of cyber-aggression in adolescents.
Wissink, I B; Colonnesi, C; Stams, G J J M; Hoeve, M; Asscher, J J; Noom, M J; Polderman, N; Kellaert-Knol, M G
The Attachment Insecurity Screening Inventory (AISI) 2-5 years is a parent-report questionnaire for assessing attachment insecurity in preschoolers. Validity and reliability of the AISI 2-5 years were examined in a general sample ( n = 429) and in a clinical sample ( n = 71). Confirmatory factor analysis (CFA) confirmed a three-factor model of avoidant, ambivalent/resistant and disorganized attachment, and one higher-order factor of total attachment insecurity. Multi-group CFA indicated measurement invariance across mothers and fathers, and across the general and clinical population sample. Reliability coefficients were generally found to be good. We found partial support for convergent validity in associations between AISI-scores and observed attachment (AQS). Concurrent validity was supported by associations between AISI-scores and observed parental sensitivity (MBQS) and parent-reported psychopathology (SDQ). Finally, the AISI discriminated well between children from the general and from the clinical sample. We argue that both research and practice could benefit from the AISI as there is now a prospect of quickly, reliably and validly screening for attachment insecurity in pre-school aged children. Based on this information, help can be offered timely and, subsequently, the prevention of attachment related problems of children can be strengthened.
Negahban, Hossein; Mazaheri, Masood; Salavati, Mahyar; Sohani, Soheil Mansour; Askari, Marjan; Fanian, Hossein; Parnianpour, Mohamad
The aims of this study were to culturally adapt and validate the Persian version of Foot and Ankle Outcome Score (FAOS) and present data on its psychometric properties for patients with different foot and ankle problems. The Persian version of FAOS was developed after a standard forward-backward translation and cultural adaptation process. The sample included 93 patients with foot and ankle disorders who were asked to complete two questionnaires: FAOS and Short-Form 36 Health Survey (SF-36). To determine test-retest reliability, 60 randomly chosen patients completed the FAOS again 2 to 6 days after the first administration. Test-retest reliability and internal consistency were assessed using intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. To evaluate convergent and divergent validity of FAOS compared to similar and dissimilar concepts of SF-36, the Spearman's rank correlation was used. Dimensionality was determined by assessing item-subscale correlation corrected for overlap. The results of test-retest reliability show that all the FAOS subscales have a very high ICC, ranging from 0.92 to 0.96. The minimum Cronbach's alpha level of 0.70 was exceeded by most subscales. The Spearman's correlation coefficient for convergent construct validity fell within 0.32 to 0.58 for the main hypotheses presented a priori between FAOS and SF-36 subscales. For dimensionality, the minimum Spearman's correlation coefficient of 0.40 was exceeded by most items. In conclusion, the results of our study show that the Persian version of FAOS seems to be suitable for Iranian patients with various foot and ankle problems especially lateral ankle sprain. Future studies are needed to establish stronger psychometric properties for patients with different foot and ankle problems.
Keller, Ted; Munson, John C.; Schneidewind, Norman; Stark, George
Panel Session at the International Symposium on Software Reliability Engineering 1993, Saturday: 6 November 1993, 0830-1000 and 1030-1200 The NASA Space Shuttle on-board software is one of the nation’s most safety-critical software systems. The process which produces this software has been rated at maturity level five. Among the quality assurance methods that are used to ensure the software is free of safetycritical faults is the use of reliability modelling and predi...
Tornga, Shawn Robert [Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
When modeling system performance of space based detection systems it is important to consider spacecraft reliability. As space vehicles age the components become prone to failure for a variety of reasons such as radiation damage. Additionally, some vehicles may lose the ability to maneuver once they exhaust fuel supplies. Typically failure is divided into two categories: engineering mistakes and technology surprise. This document will report on a method of simulating space vehicle reliability in the DIORAMA framework.
Ruland, Cornelia M; Bakken, Suzanne; Røislien, Jo
Recently there has been a proliferation of interactive tailored patient assessment (ITPA) tools. However, evidence of the reliability and validity of these instruments is often missing, which makes their value in research studies questionable. Because several of the common methods to evaluate instrument reliability and validity are not applicable to interactive tailored patient assessments, informatics researchers may benefit from some guidance on which methods of reliability and validity ass...
Cil Akinci, Ayse; Pinar, Rukiye
To investigate the validity and reliability of the Caregiver Burden Scale in family members who provide primary care for haemodialysis patients. In Turkey, there is a need for a multi-dimensional instrument to evaluate the caregiver burden in people who provide care for patients with chronic diseases. A methodological study. The study sample consisted of 161 family members who provide primary care for haemodialysis patients. The forward-backward translation method was used to develop the Turkish Caregiver Burden Scale. The reliability was based on internal consistency investigated by Cronbach's alpha and item-total correlation. The factorial construct validity of the scale was tested with confirmatory factor analysis. By means of convergent and divergent validity, correlation between Caregiver Burden Scale and 36-Item Short Form Health Survey (SF-36) and correlation between Caregiver Burden Scale and the Maslach Burnout Scale were investigated. Cronbach's alpha and item-total correlations results suggested that there was good internal reliability. We found five underlying factors similar to original Scale's five-factor solution. The confirmatory factor analysis five-factor model represented an acceptable fit. Factor loadings were significant, with standardised loadings ranging from 0·43-0·81. By means of divergent validity, all sub-dimension scores and the total score of the Caregiver Burden Scale were negatively correlated with the SF-36, whereas there was a positive correlation with the emotional exhaustion and depersonalisation subscales of the Maslach Burnout Scale as expected. These results suggest that the Caregiver Burden Scale is a reliable and valid instrument which can be used with confidence in Turkish caregivers for haemodialysis patients to screen caregiver burden. The burden experienced by people who provide care for patients with chronic diseases can be evaluated with the Caregiver Burden Scale. Additionally, the Caregiver Burden Scale can be used
Morales, Alexandra; Reis, Sibília; Espada, José P; Orgilés, Mireia
The Short Health Anxiety Inventory is a brief instrument to assess health anxiety widely used across countries; however, no validated version is available for Portuguese-speaking population. Factorial structure, reliability, and equivalency factor with the Spanish version were analyzed with Portuguese adolescents aged 14-18 years. A Portuguese adolescent cohort ( N = 629) and a comparative Spanish adolescent cohort ( N = 1502) were evaluated. The original two-factor version was the best fitting model for the Portuguese version. The reliability was excellent. Complete measurement invariance across both countries was supported. The Portuguese version of the Short Health Anxiety Inventory is a valid screening inventory to assess health anxiety in adolescents.
Britt Karin Støen Utvær
Full Text Available Self-determination theory (SDT distinguishes types of motivation according to types of self-regulation along a continuum of internalisation. Types of motivation vary in quality and outcomes and are frequently used in research as predictors of educational outcomes such as learning, performance, engagement, and persistence. The Academic Motivation Scale (AMS, which is based on the SDT, has not previously been evaluated in Norway. In response, by using correlation and confirmatory factor analysis, we examined the dimensionality, reliability, and construct validity of the AMS among vocational health and social care students. Our hypothesised 7-factor model demonstrated the best fit, while the AMS demonstrated good reliability and construct validity in the sample of students. However, some improvements remain necessary. In predicting the rate of school completion among students on vocational tracks, amotivation and identified regulation appeared to be more powerful as intrinsic motivational variables.
Full Text Available Purpose of this study was to investigate the reliability and validity of the Turkish form of the Attitude Towards Mathematics Instrument Short Form developed by Lim and Chapman (2013. In this study, data gathered from 310 elementary students were utilized for Exploratory and Confirmatory Factor Analysis to determine the structure of factor loading. The factor loading among the sub-scales were different from original. Confirmatory Factor analysis revealed that the model was acceptable. There were three sub-scales, value, self-confidence, enjoyment and motivation. Cronbach’s alpha coefficient for the overall instrument was calculated as .84, respectively. The adapted instrument includes three sub-scales: value (α=.91, self-confidence (α=.86, enjoyment and motivation (α=.82. Turkish adaptation of the questionnaire is valid and reliable and appropriate to use in Turkish culture.
Loeding, B L; Greenan, J P
The study examined the validity and reliability of four assessments, with three instruments per domain. Domains included generalizable mathematics, communication, interpersonal relations, and reasoning skills. Participants were deaf, legally blind, or visually impaired students enrolled in vocational classes at residential secondary schools. The researchers estimated the internal consistency reliability, test-retest reliability, and construct validity correlations of three subinstruments: student self-ratings, teacher ratings, and performance assessments. The data suggest that these instruments are highly internally consistent measures of generalizable vocational skills. Four performance assessments have high-to-moderate test-retest reliability estimates, and were generally considered to possess acceptable validity and reliability.
Riou, Marie-Ève; Rioux, François; Lamothe, Gilles; Doucet, Éric
The aim of this study was to validate the performance and reliability of results obtained from a classification model that measures time spent performing activities in confined (CE) and unrestricted (UE) environments. In CE, participants wore a pair of biaxial and/or triaxial accelerometers while performing pre-determined training activities classified as variants of lying down, dynamic standing, sitting, walking and running on two separate days. A classification model trained with activities performed in a specific order during the first day was developed to validate the activities performed in a random order on the second day (CE) and over 24 hours on a separate day (UE). The performance of the classification model was validated against triaxial accelerometers using six (x, y and step counts for arm and thigh) or eight (same as six features plus z axis) features. The reliability of the classification model was tested in both environments using six features. Results revealed an overall accuracy of 94% in CE and 90% in UE. The sensitivity in CE and UE was 94% and 95% for lying down, 88% and 80% for dynamic standing, 97% and 89% for sitting, 96% and 78% for walking and 90% and 64% for running, respectively. No significant differences were noted between performances obtained with six or eight features. Results were highly reproducible in both environments. The results obtained from the classification model were accurate and reproducible, and highlight the potential use of this approach in research to quantify the time spent performing different activities.
Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E
Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.
Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J
To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; pteaching and learning (SoTL) articles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
Liaghatdar, Mohammad Javad; Jafari, Ebrahim; Abedi, Mohammad Reza; Samiee, Fatemeh
The aim of the present study was to examine the internal reliability, content validity, construct validity, and concurrent validity of the Persian translation/Farsi version of the Oxford Happiness Inventory. The Oxford Happiness Inventory and Fordyce Happiness Inventory were completed by a sample of 727 Iranian university students. Findings confirmed the internal reliability, construct, and concurrent validity of the Oxford Happiness Inventory. Thus, it can be recommended for use as a trait-measure of happiness among Iranian samples.
Kozine, Igor; Krymsky, Victor
The application of imprecise reliability models is often hindered by the rapid growth in imprecision that occurs when many components constitute a system and by the fact that time to failure is bounded from above. The latter results in the necessity to explicitly introduce an upper bound on time...... to failure which is in reality a rather arbitrary value. The practical meaning of the models of this kind is brought to question. We suggest an approach that overcomes the issue of having to impose an upper bound on time to failure and makes the calculated lower and upper reliability measures more precise...
Coutts, Aaron J; Duffield, Rob
There is limited information regarding the validity and reliability of global positioning system (GPS) devices for measuring movement during team sports. The aim of this study was to assess the validity and intra-model reliability of different GPS devices for quantifying high-intensity, intermittent exercise performance. Two moderately trained males each completed eight bouts of a standard circuit that consisted of six laps around a 128.5-m course involving intermittent exercise. Distance and speed were collected concurrently at 1-Hz using six GPS devices (2 SPI-10, 2 SPI Elite and 2 WiSPI, GPSports, Canberra, Australia). Performance measures were: (1) total distance covered for each bout and each lap; (2) high-intensity running distance (>14.4 km h(-1), HIR); very high-intensity running distance (>20 km h(-1), VHIR) during each bout. Peak speed was also measured during a 20-m sprint at the start of each lap of the circuit (N=192). Actual distance was measured using a measuring tape. Mean (+/-SD) circuit total distance was significantly different between each of the GPS devices (PGPS devices. These results show that the GPS devices have an acceptable level of accuracy and reliability for total distance and peak speeds during high-intensity, intermittent exercise, but may not be provide reliable measures for higher intensity activities. Copyright (c) 2008 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Capio, Catherine M; Eguia, Kathlynne F; Simons, Johan
This study aimed to examine aspects of validity and reliability of the Test of Gross Motor Development-2 (TGMD-2) in Filipino children with intellectual disability. Content and construct validity were verified, as well as inter-rater and intra-rater reliability. Two paediatric physiotherapists tested 81 children with intellectual disability (mean age = 9.29 ± 2.71 years) on locomotor and object control skills. Analysis of covariance, confirmatory factor analysis and analysis of variance were used to test validity, while Cronbach's alpha, intra-class correlation coefficients (ICC) and Bland-Altman plots were used to examine reliability. Age was a significant predictor of locomotor and object control scores (P = 0.004). The data fit the hypothesised two-factor model with fit indices as follows: χ(2) = 33.525, DF = 34, P = 0.491, χ(2)/DF = 0.986. As hypothesised, gender was a significant predictor for object control skills (P = 0.038). Participants' mean scores were significantly below mastery (locomotor, P intellectual disability.
Hadziabdic, Emina; Safipour, Jalal; Bachrach-Lindström, Margareta; Hultsjö, Sally
Nearly 20 % of the Swedish population is foreign-born. Increased exposure of patients from diverse cultures means there is an urgent need to address their unique requirements and provide optimal health care to a diverse population. Nursing schools thus have an important goal of educating nurses to ensure they are culturally competent. Culturally competent care improves safety and equity for patients. To measure cultural awareness among nursing students in Sweden, the aim of this study was to translate, adapt and test the validity and reliability of the Swedish version of a cultural awareness scale which has not previously been tested. A total of 158 nursing students from three universities in Sweden completed the 36-item questionnaire on cultural awareness. Verification of face and content validity and a translation/reverse translation process were first carried out. The results indicate that one item (no 13) caused weak reliability and validity, and therefore it was removed. The reliability test result (with 35 items) showed Cronbach's Alpha ranged from 0.60 to 0.87. The Model ChiSq group fit for five factors was 50.44 (31.27-77.06; Df = 5; p cultural settings.
Moghadam, Manije; Salavati, Mahyar; Sahaf, Robab; Rassouli, Maryam; Moghadam, Mojgan; Kamrani, Ahmad Ali Akbari
After forward-backward translation, the LSS was administered to 334 Persian speaking, cognitively healthy elderly aged 60 years and over recruited through convenience sampling. To analyze the validity of the model's constructs and the relationships between the constructs, a confirmatory factor analysis followed by PLS analysis was performed. The Construct validity was further investigated by calculating the correlations between the LSS and the "Short Form Health Survey" (SF-36) subscales measuring similar and dissimilar constructs. The LSS was re-administered to 50 participants a month later to assess the reliability. For the eight-factor model of the life satisfaction construct, adequate goodness of fit between the hypothesized model and the model derived from the sample data was attained (positive and statistically significant beta coefficients, good R-squares and acceptable GoF). Construct validity was supported by convergent and discriminant validity, and correlations between the LSS and SF-36 subscales. Minimum Intraclass Correlation Coefficient level of 0.60 was exceeded by all subscales. Minimum level of reliability indices (Cronbach's α, composite reliability and indicator reliability) was exceeded by all subscales. The Persian-version of the Life Satisfaction Scale is a reliable and valid instrument, with psychometric properties which are consistent with the original version.
Michalopoulos, Lynn T M; Murray, Laura K; Kane, Jeremy C; Skavenski van Wyk, Stephanie; Chomba, Elwyn; Cohen, Judith; Imasiku, Mwiya; Semrau, Katherine; Unick, Jay; Bolton, Paul A
The aim of the current study is to test the validity and reliability of the Shame Questionnaire among traumatized girls in Lusaka, Zambia. The Shame Questionnaire was validated through both classical test and item response theory methods. Internal reliability, criterion validity and construct validity were examined among a sample of 325 female children living in Zambia. Sub-analyses were conducted to examine differences in construct validity among girls who reported sexual abuse and girls who did not. All girls in the sample were sexually abused, but only 61.5% endorsed or reported that sexual abuse had occurred. Internal consistency was very good among the sample with alpha = .87. Criterion validity was demonstrated through a significant difference of mean Shame Questionnaire scores between girls who experienced 0-1 trauma events and more than one traumatic event, with higher mean Shame Questionnaire scores among girls who had more than one traumatic event (p = .004 for 0-1 compared to 2 and 3 events and p = .016 for 0-1 compared to 4+ events). Girls who reported a history of witnessing or experiencing physical abuse had a significantly higher mean Shame Questionnaire score than girls who did not report a history of witnessing or experiencing physical abuse (pShame Questionnaire score between girls who reported a sexual abuse history and girls who did not. Exploratory factor analysis indicated a two-factor model of the Shame Questionnaire, with an experience of shame dimension and an active outcomes of shame dimension. Item response theory analysis indicated adequate overall item fit. Results also indicate potential differences in construct validity between girls who did and did not endorse sexual abuse. This study suggests the general utility of the Shame Questionnaire among Zambian girls and demonstrates the need for more psychometric studies in low and middle income countries.
O'Hare, L; Santin, O; Winter, K; McGuinness, C
There is a growing impetus across the research, policy and practice communities for children and young people to participate in decisions that affect their lives. Furthermore, there is a dearth of general instruments that measure children and young people's views on their participation in decision-making. This paper presents the reliability and validity of the Child and Adolescent Participation in Decision-Making Questionnaire (CAP-DMQ) and specifically looks at a population of looked-after children, where a lack of participation in decision-making is an acute issue. The participants were 151 looked after children and adolescents between 10-23 years of age who completed the 10 item CAP-DMQ. Of the participants 113 were in receipt of an advocacy service that had an aim of increasing participation in decision-making with the remaining participants not having received this service. The results showed that the CAP-DMQ had good reliability (Cronbach's alpha = 0.94) and showed promising uni-dimensional construct validity through an exploratory factor analysis. The items in the CAP-DMQ also demonstrated good content validity by overlapping with prominent models of child and adolescent participation (Lundy 2007) and decision-making (Halpern 2014). A regression analysis showed that age and gender were not significant predictors of CAP-DMQ scores but receipt of advocacy was a significant predictor of scores (effect size d = 0.88), thus showing appropriate discriminant criterion validity. Overall, the CAP-DMQ showed good reliability and validity. Therefore, the measure has excellent promise for theoretical investigation in the area of child and adolescent participation in decision-making and equally shows empirical promise for use as a measure in evaluating services, which have increasing the participation of children and adolescents in decision-making as an intended outcome. © 2016 John Wiley & Sons Ltd.
Park, Yu Kyung; Ju, Hyeon Ok; Na, Hunjoo
The Perinatal Post-Traumatic Stress Disorder Questionnaire (PPQ) was designed to measure post-traumatic symptoms related to childbirth and symptoms during postnatal period. The purpose of this study was to develop a translated Korean version of the PPQ and to evaluate reliability and validity of the Korean PPQ. Participants were 196 mothers at one to 18 months after giving childbirth and data were collected through e-mails. The PPQ was translated into Korean using translation guideline from World Health Organization. For this study Cronbach's alpha and split-half reliability were used to evaluate the reliability of the PPQ. Exploratory Factor Analysis (EFA), Confirmatory Factor Analysis (CFA), and known-group validity were conducted to examine construct validity. Correlations of the PPQ with Impact of Event Scale (IES), Beck Depression Inventory II (BDI-II), and Beck Anxiety Inventory (BAI) were used to test a criterion validity of the PPQ. Cronbach's alpha and Spearman-Brown split-half correlation coefficient were 0.91 and 0.77, respectively. EFA identified a 3-factor solution including arousal, avoidance, and intrusion factors and CFA revealed the strongest support for the 3-factor model. The correlations of the PPQ with IES, BDI-II, and BAI were .99, .60, and .72, respectively, pointing to criterion validity of a high level. The Korean version PPQ is a useful tool for screening and assessing mothers' experiencing emotional distress related to child birth and during the postnatal period. The PPQ also reflects Post Traumatic Stress Disorder's diagnostic standards well.
Henderson, Ronald W.; Rankin, Richard J.
The reliability and predictive validity of the WPPSI was investigated with a sample of lower SES Mexican-American children. The results revealed that the WPPSI has high reliability, as determined by internal consistency procedures, but validity coefficients were not significant. The WPPSI may be inappropriate for special program placement purposes…
Nederhof, Esther; Brink, Michel S.; Lemmink, Koen A. P. M.
The purpose of the present study was to investigate the cross-cultural validity of the Recovery Stress Questionnaire for Athletes (RESTQ-sport) by analysing reliability and validity of a Dutch translation. Two studies were performed to assess test-retest reliability with a one week interval,
Smith, Jack E.; Hakel, Milton D.
Examined are questions pertinent to the use of the Position Analysis Questionnaire: Who can use the PAQ reliably and validly? Must one rely on trained job analysts? Can people having no direct contact with the job use the PAQ reliably and validly? Do response biases influence PAQ responses? (Author/KC)
Boonstra, Anne M.; Schiphorst Preuper, Henrica R.; Reneman, Michiel F.; Posthumus, Jitze B.; Stewart, Roy E.
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional
Markon, Kristian E.; Chmielewski, Michael; Miller, Christopher J.
In 2 meta-analyses involving 58 studies and 59,575 participants, we quantitatively summarized the relative reliability and validity of continuous (i.e., dimensional) and discrete (i.e., categorical) measures of psychopathology. Overall, results suggest an expected 15% increase in reliability and 37% increase in validity through adoption of a…
Mehmet Emrah Karadere
Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that Reasoning with Inductive Argument Test supports reliability and validity in Turkish population. [JCBPR 2013; 2(3.000: 156-161
de Groot, Sonja; Balvers, Inge J.M.; Kouwenhoven, Sanne M.; Janssen, Thomas W.J.
The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
De Groot, Sonja; Balvers, Inge J. M.; Kouwenhoven, Sanne M.; Janssen, Thomas W. J.
The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
Watt, Torquil; Hegedus, Laszlo; Grønvold, Mogens
Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated....
Bhat, Mehraj A.
This paper is based on the construction and evaluation of reliability and validity of reasoning ability test at secondary school students. In this paper an attempt was made to evaluate validity, reliability and to determine the appropriate standards to interpret the results of reasoning ability test. The test includes 45 items to measure six types…
de Boer, M.K.; Castelein, Stynke; Bous, Johan; van den Heuvel, Edwin R.; Wiersma, Durk; Schoevers, Robert A.; Knegtering, Henderikus
The aim of this study is to describe the psychometric properties of the Antipsychotics and Sexual Functioning Questionnaire (ASFQ). Internal reliability, test-retest reliability, inter-rater reliability, validity and sensitivity to change were calculated in a sample of 30 patients with schizophrenia
Full Text Available The Problem Solving Inventory (PSI is designed to measure adults’ perceptions of problem-solving ability. The presented study aimed to translate it and assess its reliability and validity in a nationwide sample of 3668 Greek educators. In order to evaluate internal consistency reliability, Cronbach’s alpha coefficient was used. The scale’s construct validity was examined by a confirmatory factor analysis (CFA and by investigating its correlation with the Internality, Powerful others and Chance Multidimensional Locus of Control Scale (IPC LOC Scale, the Rosenberg Self-Esteem Scale (RSES and demographic information. Internal consistency reliability was satisfactory with Cronbach’s alphas ranging from 0.79 to 0.91 for all PSI scales. CFA confirmed that the bi-level model fitted the data well. The root mean square error of approximation (RMSEA, the comparative fit index (CFI and the goodness of fit index (GFI values were 0.030, 0.97 and 0.96, respectively, further confirming the bi-level model and the three-factors construct of the PSI. Intercorrelations and correlation coefficients between the PSI, the IPC LOC Scale and the RSES were significant. Age, sex, and working experience differences were found. In conclusion, the Greek version of the PSI was found to have satisfactory psychometric properties and therefore, it can be used to evaluate Greek teachers’ perceptions of their problem-solving skills.
Jang, Moon Jung; Lee, Eun Nam
To investigate the validity and reliability of the Korean Version of the Assault Response Questionnaire (ARQ-K) measuring the intensity of reaction to victimization of emergency nurses in Korea. An internal consistency reliability and construct validity using exploratory and confirmatory factor analysis were conducted using SPSS WIN (20.0) and AMOS (20.0). Survey data were collected from 321 nurses who worked in 3 levels - wide regional emergency centers, regional emergency centers, appointed emergency centers - of emergency care facilities in Busan, Korea. The Cronbach's alpha values regarding internal consistency were .77~.93 for the subscales of ARQ-K. Factor loadings of the 26 items on the four subscales ranged from .59 to .84. The four-subscale model was validated by confirmatory factor analysis (χ²/df=3.85, pKorean Version of the Assault Response Questionnaire is a valid and reliable instrument to assess nurses' reaction to victimization of emergency nurses in Korea.
seyed abolfazl zakerian; Roya Azizi; Mehdi Rahgozar
The term usability refers to a special index for success of an operating system. This study aimed to determine the reliability and validity of the Software Usability Measurements Inventory (SUMI) questionnaire as one of the valid and common questionnaires about usability evaluation. The back translation method was used to translate the questionnaire from English to Persian back to English. Moreover, repeatability or test-retest reliability was practically used to determine the reliability of ...
Adie Sam; Ko Victoria; Naylor Justine M; Gaskin Clive; Walker Richard; Harris Ian A; Mittal Rajat
Abstract Background The clinimetric properties of knee goniometry are essential to appreciate in light of its extensive use in the orthopaedic and rehabilitative communities. Intra-observer reliability is thought to be satisfactory, but the validity and inter-rater reliability of knee goniometry often demonstrate unacceptable levels of variation. This study tests the validity and reliability of measuring knee range of motion using goniometry and photographic records. Methods Design: Methodolo...
Zhang, Tingting; Yin, Anchun; Sun, Xiaohong; Liu, Qigui; Song, Guirong; Li, Lianhong
To develop psychosocial adaptation scale for Parkinson's disease (PD) in Chinese population and evaluate its reliability and validity. The items were designed by literature review, expert consultation and semi-structured interview. The methods of corrected item-total correlation, discrimination analysis and exploratory factor analysis were used for items selection. 427 valid scales from PD patients were collected in the study to test the reliability and validity. The scale incorporated six dimensions: anxiety, self-esteem, attitude, self-acceptance, self-efficacy and social support, a total of 32 items. The scale possessed good internal consistency. The test-retest correlation coefficient was 0.99 and average content validation rate was 0.97. The Hoehn and Yahr stage were correlated with total score of the scale. The psychosocial adaptation scale in this study showed good reliability and validity, it can be used as a reliable and valid instrument to evaluate the psychosocial adaptation of PD objectively and effectively.
Isaksson, Ulf; Hajdarević, Senada; Jutterström, Lena; Hörnsten, Åsa
The Melbourne Decision-Making Questionnaire (MDMQ) is an attempt to capture and measure coping strategies that people use. The instrument had not previously been translated into Swedish. The aim of this study was to evaluate validity and reliability of the Swedish version of the MDMQ. A Swedish translation was performed and back-translated. A group of five pilot readers evaluated content validity. The translated questionnaire was tested among 735 patients, healthcare workers, healthcare students and teachers. A parallel analysis (PA), exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were performed. An initial EFA with a four-factor solution showed a low concordance with the original 22-item four-factor model with a very low Cronbach's alpha in one of the dimensions. However, a second EFA with a three-factor solution showed a good model fit for the Swedish translation of the Melbourne Decision-Making Questionnaire (MDMQ-S) with a satisfactory Cronbach's alpha. A CFA showed a goodness of fit after deleting six items. After testing the MDMQ-S, we found support for validity and reliability of the instrument. We found the 16-item version of MDMQ-S to be satisfactory concerning the subscales vigilance, procrastination and buck-passing. However, we found no support that the hypervigilance dimension could be measured by the MDMQ-S. © 2013 Nordic College of Caring Science.
Full Text Available Purpose This study aims to demonstrate whether the functionalities between the widely used questionnaire in other countries and the Korean-translated version show similarity. Also, it intends to verify their reliability and validity. Methods The original questionnaire was first developed by professor Noe at the University of Minnesota with 29 items named as “mentoring functions” to identify the participants’ psychological and career-development functions. Using the Korean-translated version of the original questionnaire, the study was conducted on total 288 Kyung Hee University medical students, ranging from the first-year students to the third-year students on December 2015. In order to investigate if the survey form could be applied to the Korean participants, exploratory factor analysis and confirmatory factor analysis were conducted, using varimax rotation. Cronbach α statistics and the figures of standardized regression weights were analyzed respectively to indicate construct, convergent, and discriminant validities. Results The result of exploratory factor analysis shows there are a total of three functions, including the additional “friendship function” (Eigen value, 1.152; significant level if higher than 1.0 with its high emphasis. The result of confirmatory factor analysis also demonstrates the similarity (p=0.000. As the whole reliability scale of the three functions is significantly high (Cronbach α , 0.971, each reliability scales of the three functions are shown to be high (0.814–0.955 as well. Conclusion The model, with its three functions, proved the significant statistics regarding the reliability and validity. According to this ground, the conclusion is that the adapted questionnaire used in this study could be applied to Korean medical students.
Das, Jacqueline; de Ruiter, Corine; Doreleijers, Theo; Hillege, Sanne
The present study examines the reliability and construct validity of the Dutch version of the Psychopathy Check List: Youth Version (PCL:YV) in a sample of male adolescents admitted to a secure juvenile justice treatment institution (N = 98). Hare's four-factor model is used to examine reliability and validity of the separate dimensions of…
The invited papers focused on the following issues: 1. The different technical and scientific approaches to the problem of how to guarantees or demonstrate the reliability of NDE: a. Application of established prescriptive standards, b. Probabilities of Detection (PDO) and False Alarm (PFA) from blind trials, c. POD and PFA from signal statistics, d. Modeling, e. ''Technical Justification''; 2. The dissimilar validation/qualification concepts used in different industries in Europe and North America: a. Nuclear Power Generation, b. Aerospace Industry, c. Offcshore Industry and d. Service Companies
National Aeronautics and Space Administration — The innovation is the systematic "building-block" validation of CFD/turbulence models employing a GUI driven CFD code (RPFM) and existing as well as new data sets to...
Full Text Available The primary objective of the study was to quantitatively test the DART model, which despite being one of the most popular representations of co-creation concept was so far studied almost solely with qualitative methods. To this end, the researchers developed a multiple measurement scale and employed it in interviewing managers. The statistical evidence for adequacy of the model was obtained through CFA with AMOS software. The findings suggest that the DART model may not be an accurate representation of co-creation practices in companies. From the data analysis it was evident that the building blocks of DART had too much of conceptual overlap to be an effective framework for quantitative analysis. It was also implied that the phenomenon of co-creation is so rich and multifaceted that it may be more adequately captured by a measurement model where co-creation is conceived as a third-level factor with two layers of intermediate latent variables.
Scott, Macfarlane T U; Scott, Tannath J; Kelly, Vincent G
The use of global positioning systems (GPS) has increased dramatically over the last decade. Using signals from orbiting satellites, the GPS receiver calculates the exact position of the device and the speed at which the device is moving. Within team sports GPS devices are used to quantify the external load experienced by an athlete, allowing coaches to better manage trainings loads and potentially identify athletes who are overreaching or overtraining. This review aims to collate all studies that have tested either (or both) the validity or reliability of GPS devices in a team sport setting, with a particular focus on (a) measurements of distance, speed, velocities, and accelerations across all sampling rates and (b) accelerometers, player/body load and impacts in accelerometer-integrated GPS devices. A comprehensive search of the online libraries identified 22 articles that fit search criteria. The literature suggests that all GPS units, regardless of sampling rate, are capable of tracking athlete's distance during team sport movements with adequate intraunit reliability. One Hertz and 5Hz GPS units have limitations in their reporting of distance during high-intensity running, velocity measures, and short linear running (particularly those involving changes of direction), although these limitations seem to be overcome during measures recorded during team sport movements. Ten Hertz GPS devices seem the most valid and reliable to date across linear and team sport simulated running, overcoming many limitations of earlier models, whereas the increase to 15Hz GPS devices have had no additional benefit.
All three of these instruments do not involve high costs, do not require high technical skills, mobile, save time, and are suitable for use in large populations. Because all three instruments can estimate the percentage of body fat, but it is important to identify the most appropriate instruments and have high reliability. Hence, this ...
This article discusses the use of assessment by teachers to replace external marking. It shows how professional participation and moderation can provide reliability in summative assessment, even in public examinations for older students. It draws on historical experiences of assessment for A-level English literature.
M. van Veen (Mirjam); H.A. Moll (Henriëtte)
textabstractBackground. Triage in paediatric emergency care is an important tool to prioritize seriously ill children. Triage can also be used to identify patients who do not need urgent care and who can safely wait. The aim of this review was to provide an overview of the literature on reliability
Powers, Stephen; And Others
Spanish speaking first graders were administered the Artes de Lenguage (ADL)--a Spanish, criterion-referenced, language arts test. Reliability analyses indicated the adequacy of three of the four subscales (Phonetic Analysis, Vocabulary Development, Comprehension Skills, and General Skills). A principal factors analysis of the intercorrelation…
Cetin, Bayram; Yaman, Erkan; Peker, Adem
The purpose of this study is to develop a reliable and valid scale, which determines cyber victimization and bullying behaviors of high school students. Research group consisted of 404 students (250 male, 154 male) in Sakarya, in 2009-2010 academic years. In the study sample, mean age is 16.68. Content validity and face validity of the scale was…
Dedecius, Kamil; Sečkárová, Vladimíra
Roč. 19, č. 5 (2013), s. 471-482 ISSN 1387-3954 R&D Projects: GA MŠk 7D12004 Grant - others:GA MŠk(CZ) SVV-265315 Keywords : Bayesian modelling * Sensor network * Reliability Subject RIV: BD - Theory of Information Impact factor: 0.984, year: 2013 http://library.utia.cas.cz/separaty/2013/AS/dedecius-0392551.pdf
Watt, Torquil; Hegedüs, Laszlo; Groenvold, Mogens
Background Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test......-retest reliability should be evaluated. Aim To investigate clinical ('known-groups') validity and test-retest reliability of the Danish version of the ThyPRO. Methods For each of the 13 ThyPRO scales, we defined groups expected to have high versus low scores ('known-groups'). The clinical validity (known......-groups validity) was evaluated by whether the ThyPRO scales could detect expected differences in a cross-sectional study of 907 thyroid patients. Test-retest reliability was evaluated by intra-class correlations of two responses to the ThyPRO 2 weeks apart in a subsample of 87 stable patients. Results On all 13...
Herrera-Kiengelher, L; Zepeda-Zaragoza, J; Austria-Corrales, F; Vázquez-Zarate, V M
Patient Safety is a major public health problem worldwide and is responsibility of all those involved in health care. Establishing a Safety Culture has proved to be a factor that favors the integration of work teams, communication and construction of clear procedures in various organizations. Promote a culture of safety depends on several factors, such as organization, work unit and staff. Objective assessment of these factors will help to identify areas for improvement and establish strategic lines of action. [corrected] To adapt, validate and calibrate the questionnaire Culture of Quality in Health Services (CQHS) in Mexican population. A cross with a stratified representative sample of 522 health workers. The questionnaire was translated and adapted from Singer's. Content was validated by experts, internal consistency, confirmatory factorial validity and item calibration with Samejima's Graded Response Model. Convergent and divergent construct validity was confirmed from the CQHS, item calibration showed that the questionnaire is able to discriminate between patients and represent different levels of the hypothesized dimensions with greater accuracy and lower standard error. The CQHS is a valid and reliable instrument to assess patient safety culture in hospitals in Mexico. Copyright © 2013 SECA. Published by Elsevier Espana. All rights reserved.
Blanche, Erna Imperatore; Bodison, Stefanie; Chang, Megan C.; Reinoso, Gustavo
OBJECTIVE We developed an observational tool, the Comprehensive Observations of Proprioception (COP), for identifying proprioceptive processing issues in children with developmental disabilities. METHOD Development of the COP underwent three phases. First, we developed items representing proprioceptive functions on the basis of an extensive literature review and consultation with occupational therapists. We then established interrater reliability and content, construct, and criterion validity. Finally, we completed a factor analysis of COP ratings of 130 children with known developmental disabilities. RESULTS Adequate validity and reliability were established. Factor analysis revealed a four-factor model that explained the underlying structure of the measure as it was hypothesized. CONCLUSION The COP is a valid criterion-referenced short observational tool that structures the clinician’s observations by linking a child’s behaviors to areas identified in the literature as relevant to proprioceptive processing. It takes 15 min to administer and can be used in a variety of contexts, such as the home, clinic, and school. PMID:23106989
Our daily lives can be maintained by the high-technology systems. Computer systems are typical examples of such systems. We can enjoy our modern lives by using many computer systems. Much more importantly, we have to maintain such systems without failure, but cannot predict when such systems will fail and how to fix such systems without delay. A stochastic process is a set of outcomes of a random experiment indexed by time, and is one of the key tools needed to analyze the future behavior quantitatively. Reliability and maintainability technologies are of great interest and importance to the maintenance of such systems. Many mathematical models have been and will be proposed to describe reliability and maintainability systems by using the stochastic processes. The theme of this book is "Stochastic Models in Reliability and Main tainability. " This book consists of 12 chapters on the theme above from the different viewpoints of stochastic modeling. Chapter 1 is devoted to "Renewal Processes," under which cla...
Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen
Objective Examination of in-hospital suicide risk levels concerning their validity and their reliability. Method The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. Results For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m = .54, p validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). Conclusion In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Maroufizadeh, Saman; Omani Samani, Reza; Amini, Payam; Navid, Behnaz
This study examined psychometric properties of the Levenson's Locus of Control Scale among Iranian infertile patients. In all, 312 infertile patients completed the Levenson's Locus of Control Scale and Hospital Anxiety and Depression Scale. The confirmatory factor analysis indicated that the original three-factor model of Levenson's Locus of Control Scale was adequate ( χ 2 / df = 2.139; goodness-of-fit index = 0.88; root mean square error of approximation = 0.061; and standardized root mean square residual = 0.076). The Cronbach's alpha of the subscales ranged from 0.56 to 0.67. The Levenson's Locus of Control Scale subscales significantly correlated with anxiety and depression, showing an acceptable convergent validity. In conclusion, the Levenson's Locus of Control Scale has adequate reliability and validity and can be used to measure locus of control orientation in Iranian infertile patients.
Full Text Available Abstract Background This study aimed to investigate the reliability and construct validity of the International Classification of Functioning, Disability and Health (ICF Comprehensive Core Set for osteoarthritis (OA in order to test its possible use as a measuring tool for functioning. Methods 100 patients with OA (84 F, 16 M; mean age 63 yr completed forms including demographic and clinical information besides the Short Form (36 Health Survey (SF-36® and the Western Ontario and McMaster Universities Index of Osteoarthritis (WOMAC. The ICF Comprehensive Core Set for OA was filled by health professionals. The internal construct validities of "Body Functions-Body structures" (BF-BS, "Activity" (A, "Participation" (P and "Environmental Factors" (EF domains were tested by Rasch analysis and reliability by internal consistency and person separation index (PSI. External construct validity was evaluated by correlating the Rasch transformed scores with SF-36 and WOMAC. Results In each scale, some items showing disordered thresholds were rescored, testlets were created to overcome the problem of local dependency and items that did not fit to the Rasch model were deleted. The internal construct validity of the four scales (BF-BS 16 items, A 8 items, P 7 items, EF 13 items were good [mean item fit (SD 0.138 (0.921, 0.216 (1.237, 0.759 (0.986 and -0.079 (2.200; person item fit (SD -0.147 (0.652, -0.241 (0.894, -0.310 (1.187 and -0.491 (1.173 respectively], indicating a single underlying construct for each scale. The scales were free of differential item functioning (DIF for age, gender, years of education and duration of disease. Reliabilities of the BF-BS, A, P, and EF scales were good with Cronbach's alphas of 0.79, 0.86, 0.88, and 0.83 and PSI's of 0.76, 0.86, 0.87, and 0.71, respectively. Rasch scores of BF-BS, A, and P showed moderate correlations with SF-36 and WOMAC scores where the EF had significant but weak correlations only with SF36-Social
P. K. Kapur
Full Text Available Several software reliability growth models (SRGMs have been developed by software developers in tracking and measuring the growth of reliability. As the size of software system is large and the number of faults detected during the testing phase becomes large, so the change of the number of faults that are detected and removed through each debugging becomes sufficiently small compared with the initial fault content at the beginning of the testing phase. In such a situation, we can model the software fault detection process as a stochastic process with continuous state space. In this paper, we propose a new software reliability growth model based on Itô type of stochastic differential equation. We consider an SDE-based generalized Erlang model with logistic error detection function. The model is estimated and validated on real-life data sets cited in literature to show its flexibility. The proposed model integrated with the concept of stochastic differential equation performs comparatively better than the existing NHPP-based models.
The Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA) were used to determine the validity of the measurement instrument. In the EFA analysis, the structure with 27 items and five factors was reached as in the original form. The two-step analyses carried out in the scope of the CFA revealed that both ...
Kane, Michael; Case, Susan
The scores on two distinct tests (e.g., essay and objective) are often combined into a composite score, which is used to make decisions. The validity of the observed composite can sometimes be evaluated relative to a separate criterion. In cases where no criterion is available, the observed composite has generally been evaluated in terms of its…
They completed this 15 item self-rated instrument that assesses patient satisfaction with services using a 5 point response format. Results:The internal consistency for the scale was high ( a=0.91), and item total correlations ranged between 0.33 to 0.70. Its convergent validity was supported by significant correlations of all ...
Mentiplay, Benjamin F; Perraton, Luke G; Bower, Kelly J; Adair, Brooke; Pua, Yong-Hao; Williams, Gavin P; McGaw, Rebekah; Clark, Ross A
Hand-held dynamometry (HHD) has never previously been used to examine isometric muscle power. Rate of force development (RFD) is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power. 30 healthy young adults (age: 23±5 yrs, male: 15) were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2) and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC), Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change. Comparison of RFD methods revealed that a peak 200 ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70) for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70) for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31-0.79). Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability and validity of these variables in clinical
Benjamin F Mentiplay
Full Text Available Hand-held dynamometry (HHD has never previously been used to examine isometric muscle power. Rate of force development (RFD is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power.30 healthy young adults (age: 23±5 yrs, male: 15 were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2 and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC, Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change.Comparison of RFD methods revealed that a peak 200 ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70 for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70 for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31-0.79.Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability and validity of these variables in
Riddle, Daniel L; Jiranek, William A; Hull, Jason R
Most orthopedic surgeons do not routinely use radiographic classification systems to grade the extent of joint space narrowing in patients considered for total knee arthroplasty. The authors compared the validity and reliability of radiographic measures of tibiofemoral osteoarthritis by 2 experienced and 2 inexperienced orthopedic surgeons on individuals who subsequently underwent total knee arthroplasty. The Kellgren-Lawrence and the Osteoarthritis Research Society International classification systems were used by all surgeons to score the radiographs in 116 individuals in the Osteoarthritis Initiative, a federally funded cohort study of individuals with or at risk of knee osteoarthritis. Validity was judged based on comparison with the criterion centrally adjudicated consensus measures obtained by Osteoarthritis Initiative investigators. Weighted kappa, a chance corrected agreement index, was used to describe validity and reliability. Validity and intrarater reliability were substantial to almost perfect for 1 experienced and 1 inexperienced surgeon, with weighted kappas ranging from 0.76 to 0.96 for the surgical knees. The other experienced and inexperienced surgeons demonstrated moderate to substantial validity, with weighted kappas ranging from 0.43 to 0.70 and lower intrarater reliability. Interrater reliability was generally less than intrarater reliability. With minimal training, some surgeons can obtain valid and reliable measurements of knee osteoarthritis status in individuals who eventually undergo total knee arthroplasty. Measurement quality does not appear to be dependent on extent of surgeon experience. Some surgeons require additional training to become proficient in the radiographic classification systems, and future research should examine this issue. Copyright 2013, SLACK Incorporated.
Students' opinions continue to be a significant factor in the evaluation of teaching in higher education institutions. The purpose of this study was to psychometrically assess short students evaluation of teaching (SET) forms using the UAE University form as a model. The study evaluated the form validity, reliability, the overall question, and…
Smith, K.; Wood, E.; Santhanagopalan, S.; Kim, G. H.; Neubauer, J.; Pesaran, A.
Models describing battery degradation physics are needed to more accurately understand how battery usage and next-generation battery designs can be optimized for performance and lifetime. Such lifetime models may also reduce the cost of battery aging experiments and shorten the time required to validate battery lifetime. Models for chemical degradation and mechanical stress are reviewed. Experimental analysis of aging data from a commercial iron-phosphate lithium-ion (Li-ion) cell elucidates the relative importance of several mechanical stress-induced degradation mechanisms.
Skinner, T. C.; Howells, L.; Greene, S.
Aims: This article reports on the development and validity of a Diabetes-specific Illness Representations Questionnaire (DIRQ) to assess all five dimensions of an individual's perception of diabetes, for adolescents with Type 1 diabetes mellitus. Methods: There were two development studies. Study 1...... consist of two subscales, perceived threat and perceived impact, and provide further support for the distinction between treatment effectiveness to control diabetes and treatment effectiveness to prevent complications. Along with the validation studies, the results indicate that the questionnaire scales......: participants (n = 115) completed a questionnaire assessing perceptions of the consequences of diabetes and the effectiveness of treatment, along with a questionnaire assessing self-care. Study 2: participants (n = 79) completed a questionnaire assessing their identity, timeline and causal beliefs, along...
Full Text Available Hardiness, comprising feelings of commitment, control and challenge, is most frequently measured with the Dispositional Resiliency Scale (DRS, but little work has been done with the brief 15-item version.To examine the factor structure, reliability, validity, and item functioning for gender invariance of the 15-item DRS with 525 collegiate athletes from a wide range of sports. Convergent and divergent validity were examined through relationships with mental toughness, grit and competitive anxiety. Participants completed measures of mental toughness, optimism, grit, competitive anxiety, and the DRS-15.Confirmatory Factor Analysis revealed a poor fit for the three-factor hardiness model, and subsequent exploratory factor analysis yielded a four-factor model with better fit than the three-factor structure. Additionally, several items appear to be biased towards males or females. The fourth factor may be unique to the collegiate athlete population, and related to perceived lack of control in future life directions. Convergent and divergent validity were supported through correlations of DRS scores with related measures.The four-factor model should be tested with different samples to determine if these changes should be adapted when using the DRS-15 in collegiate athletics or other settings.
Chorong Park, MSN, RN
Conclusion: The K-HES had acceptable validity and reliability. The brevity and ease of administration of the K-HES makes it a suitable tool for evaluating empowerment-based education programs targeted towards older populations.
Boer, Y.A. de; Ende, C.H.M. van den; Eygendaal, D.; Jolie, I.M.M.; Hazes, J.M.W.; Rozing, P.M.
OBJECTIVES: (1) To investigate the measurement characteristics of the Hospital for Special Surgery (HSS) and Mayo Clinic elbow assessment instruments, utilizing methodological criteria including feasibility, reliability, validity, and discriminative ability; and (2) to develop an efficient and
Betül Tosun, RN, PhD
Conclusions: The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems.
M. Reijman (Max); J.M.W. Hazes (Mieke); H.A.P. Pols (Huib); R.M.D. Bernsen (Roos); B.W. Koes (Bart); S.M. Bierma-Zeinstra (Sita)
textabstractOBJECTIVES: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the
Tolsgaard, M G; Ringsted, C; Dreisler, E
OBJECTIVES: To explore the reliability and validity of a recently developed instrument for assessment of ultrasound operator competence, the Objective Structured Assessment of Ultrasound Skills (OSAUS). METHODS: Three groups of 10 doctors with different levels of ultrasound experience in obstetrics...
Gamze Sarikoc, PhD, RN
Conclusion: Results showed that the SNSI had a satisfactory level of reliability and validity in nursing students in Turkey. Multicenter studies including nursing students from different nursing schools are recommended for the SNSI to be generalized.
Rikkert, Marcel G M Olde; Tona, Klodiana Daphne; Janssen, Lieneke
New staging systems of dementia require adaptation of disease management programs and adequate staging instruments. Therefore, we systematically reviewed the literature on validity and reliability of clinically applicable, multidomain, and dementia staging instruments. A total of 23 articles desc...
Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe
The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis.
Ringsted, C; Lippert, F; Hesselfeldt, R
Cardiac Arrest Simulation Test (CASTest) scenarios for the assessments according to guidelines 2005. AIMS: To analyse the reliability and validity of the individual sub-tests provided by ERC and to find a combination of MCQ and CASTest that provides a reliable and valid single effect measure of ALS...... that possessed high reliability, equality of test sets, and ability to discriminate between the two groups of supposedly different ALS competence. CONCLUSIONS: ERC sub-tests of ALS competence possess sufficient reliability and validity. A combined ALS score with equal weighting of one MCQ and one CASTest can......-rater agreement and generalisability across skills scenarios were estimated. Validity was studied in terms of equality of test difficulty and ability to discriminate performance between the groups. RESULTS: Inter-rater agreement on checklist scores were generally high, Intraclass Correlation Coefficients between...
Conclusion: Considering that Validity and Reliability factors of the questionnaire were be appropriate, it can be recommended that NIOSH Generic Job Stress Questionnaire (GJSQ can be used as a Valid and Reliable questionnaire for job stress evaluation in Iran.
Tennis is a competitive sport attracting millions of players and fans worldwide. During a competition, the physical component crucially affects the final result of a match. In field sports such as soccer physical demand data are collected using the global positioning system (GPS). There is question regarding the validity and reliability of using GPS technology for court sports such as tennis. The purpose of this study is to determine the validity and reliability of GPS to determine distances...
Yangzong, Ciren; Lerkiatbundit, Sanguan; Luobu, Ouzhu; Cui, Chaoying; Liabsuetrakul, Tippawan; Kangzhuo, Baima; Quzong, Deji; Zhandui, Luobu; Zhen, Pu; Chongsuvivatwong, Virasakdi
Parenting style experienced during childhood has profound effects on children's futures. Scales developed in other countries have never been validated in the Tibetan context. The present study aimed to examine the construct validity and reliability of a Tibetan translation of the 23-item short form of the Egna Minnen Beträffande Uppfostran [One's Memories of Upbringing] (s-EMBU) and to test the correlation between the parenting styles of fathers and mothers. A cross-sectional study was conducted in a sample of 847 students aged 12-21 years from Lhasa, Tibet, during September and October 2015 with a participation rate of 97.7%. The Tibetan translation of self-completed s-EMBU was administered. Confirmatory factor analysis was employed to test the scale's validity on the first half of the sample and was then cross-validated with the second half of the sample. The final model consisted of six factors: three (rejection, emotional warmth, and overprotection) for each parent, equality constrained on factor loadings, factor correlations, and error variance between father and mother. Father-mother correlation coefficients ranged from 0.81 to 0.86, and the level of consistency ranged from 0.62 to 0.82. Thus, the slightly modified s-EMBU is suitable for use in the Tibetan culture where both the father and the mother have consistent parenting styles.
Yangzong, Ciren; Lerkiatbundit, Sanguan; Luobu, Ouzhu; Cui, Chaoying; Liabsuetrakul, Tippawan; Kangzhuo, Baima; Quzong, Deji; Zhandui, Luobu; Zhen, Pu; Chongsuvivatwong, Virasakdi
Parenting style experienced during childhood has profound effects on children’s futures. Scales developed in other countries have never been validated in the Tibetan context. The present study aimed to examine the construct validity and reliability of a Tibetan translation of the 23-item short form of the Egna Minnen Beträffande Uppfostran [One’s Memories of Upbringing] (s-EMBU) and to test the correlation between the parenting styles of fathers and mothers. A cross-sectional study was conducted in a sample of 847 students aged 12–21 years from Lhasa, Tibet, during September and October 2015 with a participation rate of 97.7%. The Tibetan translation of self-completed s-EMBU was administered. Confirmatory factor analysis was employed to test the scale’s validity on the first half of the sample and was then cross-validated with the second half of the sample. The final model consisted of six factors: three (rejection, emotional warmth, and overprotection) for each parent, equality constrained on factor loadings, factor correlations, and error variance between father and mother. Father–mother correlation coefficients ranged from 0.81 to 0.86, and the level of consistency ranged from 0.62 to 0.82. Thus, the slightly modified s-EMBU is suitable for use in the Tibetan culture where both the father and the mother have consistent parenting styles. PMID:28053560
Dotti, A; Lazzari, R
The Italian from of the short, 26-item Eating Attitudes Test (EAT-26) has been administered to 1277 Roman high-school students, mostly females, as a screening device. Ninety-five students with a total score > 20 and 40 students with a low score, were randomly selected, interviewed and diagnosed. The EAT-26 proved to be more sensitive to the presence of an eating disorder than to a specific clinical entity. Item analysis performed on the EAT-26 variables showed satisfactory reliability coefficients. Factor analysis using an oblique rotation was similar to that obtained by Garner et al. (1). Factor analysis with an orthogonal rotation (Cattell's screen test) identified five factors. Results suggested that the EAT-26 isolates cases at risk of clinical spectrum eating disorders.
Keywords: test construction; nutrition knowledge; dietary fat; validity; reliability .... processing are also easily,20,23 objectively20,22 and reliably20 done. An item ... pepper added)a. Butternut soup (no cream added). Orange juice (no sugar added). Skim milk (fat free). 5. Which one of the following contains the most fat?b.
Romano, Jeanine; Kromrey, Jeffrey D.
The purpose of this study was to examine the potential impact of selected methodological factors on the validity of conclusions from reliability generalization (RG) studies. The study focused on four factors; (1) missing data in the primary studies; (2) transformation of sample reliability estimates; (3) use of sample weights for estimating mean…
Hartingsveldt, M.J. van; Cup, E.H.C.; Groot, I.J.M. de; Nijhuis-Van der Sanden, M.W.
BACKGROUND/AIM: This study examined the reliability and convergent validity of the Writing Readiness Inventory Tool in Context, a measurement evaluating writing readiness in kindergarten children (aged from five to six years). METHODS: Test-retest reliability was established with 59 children,
Fikkers, K.M.; Taylor Piotrowski, J.; Valkenburg, P.M.
This study evaluated whether common self-report measures of television and game violence exposure represent reliable and valid measurement tools. Three self-report measures—direct estimates, user-rated favorites, and agency-rated favorites—were assessed in terms of test-retest reliability, criterion
Reliable and valid evaluation of hand strength is important for determining the effectiveness of treatment strategies. Muscular grip strength is objectively measured by various types of dynamometers but some instruments are costly, time-consuming to use, or have questionable reliability. The various dynamometers that ...
Conclusion: The tool designed to assess bag-mask ventilation and tracheal intubation skills in anesthesia trainees demonstrated excellent inter-rater reliability, fair test-retest reliability, and good construct validity. The authors recommend its use for formative and summative assessment of junior anesthesia trainees.
Deming, Mary P.; And Others
Compares the reliability and validity of the 10 subtests of the Learning and Study Strategies Inventory (LASSI) obtained from 99 developmental college students with norming values obtained in an earlier study. Indicates that 9 out of 10 subtest reliability coefficients approached but did not equal the earlier study. (HB)
Schoppen, Tanneke; Boonstra, Antje; Groothoff, JW; de Vries, J; Goeken, LNH; Eisma, Willem
Objective: To determine the interrater and interrater reliability and the validity of the Timed "up and go" test as a measure for physical mobility in elderly patients with an amputation of the lower extremity. Design: To test interrater reliability, the test was performed for two observers at
Rathus, Spencer A.; Siegel, Larry J.
Self-concept questionnaire was shown to have high test-retest reliability, but only fair to moderate split-half (odd-even) reliability. Validity was adequate. The scale will serve as a heuristic device for family counselors who require a rapid assessment of a child's self-esteem. (Author)
IJmker, S.; Leijssen, J.N.M.; Blatter, B.M.; Beek, A.J. van der; Mechelen, W. van; Bongers, P.M.
This study evaluated the test–retest reliability and the validity of self-reported duration of computer use at work. Test–retest reliability was studied among 81 employees of a research department of a university medical center. The employees filled out a web-based questionnaire twice with an
Brody, Michelle L.; And Others
Examined reliability and validity of binge eating disorder (BED), proposed for inclusion in Diagnostic and Statistical Manual of Mental Disorders (DSM), fourth edition. Interrater reliability of BED diagnosis compared favorably with that of most diagnoses in DSM revised third edition. Study comparing obese individuals with and without BED and…
Rae, James R.; Olson, Kristina R.
The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
The Rey Visual Design Learning Test (Rey, 1964, in Spreen & Strauss, 1991) assesses immediate memory span, new learning and recognition for non-verbal material. Three studies are presented that focused on the reliability and validity of the RVDLT in primary school children. Test-retest reliability
Van Helvoort, Jos; Brand-Gruwel, Saskia; Huysmans, Frank; Sjoer, Ellen
The purpose of this paper is to measure reliability and validity of the Scoring Rubric for Information Literacy (Van Helvoort, 2010).Design/methodology/approach Percentages of agreement and Intraclass Correlation were used to describe interrater reliability. For the determination of construct
Thoomes-de Graaf, Marloes; Scholten-Peeters, Gwendolijne G M; Duijn, Edwin; Karel, Yasmaine; Koes, Bart W; Verhagen, Arianne P
To evaluate the reliability and validity of the Dutch Shoulder Pain and Disability Index (SPADI-D). The SPADI is recommended and frequently used. However, the validity and reliability of the SPADI-D are unknown. The study population consisted of patients consulting a physical therapist for shoulder pain. We assessed construct validity, using known groups, convergent validity (SDQ) and divergent validity (EQ5D) for which the mean difference or Spearman correlations coefficients were calculated. The factor structure was assessed using principal component factor analysis, and we calculated Cronbach's alpha and the ICC to assess the reliability. A total of 356 patients and a randomly selected group of 74 subjects for the reliability analysis were included. There was a significant difference between extreme groups (a high/low level of pain and work absence/presence) in SPADI score. The correlation between the SPADI and the SDQ was 0.69, with the EQ5D mobility-item 0.25 and with the depression-item 0.14. The SPADI consisted of one factor according to principal component factor analysis, which showed high internal consistency (Cronbach's alpha = 0.94 for the total score), and the test-retest reliability was good (ICC = 0.89). The Dutch SPADI is a valid and reliable questionnaire for patients in primary care in assessing functional disability.
Uğurlu, Nezihe; Bayar, Banu; Bayar, Kiliçhan; Göktaş, Atilla; Karakaya, Ilkim Çitak; Polat, Hatice
The aim of the current study was to translate the Hung Postpartum Stress Scale into Turkish and test the reliability and validity of the Turkish version of the scale. The translation process of the Hung Postpartum Stress Scale into Turkish and the testing of reliability and validity of the newly developed scale on postpartum stress in women are described. The questionnaire was translated using a back-translation technique. Expert review of internal consistency reliability, content validity, factor analysis of construct validity and criterion-related validity were examined. Cronbach's alpha for the Turkish version of the Hung Postpartum Stress Scale was 0.931, indicating very good reliability. Factor analyses resulted in a two factor scale structure: maternal concerns and acceptability of the newborn by the family. Analyses also indicated good test-re-test reliability for the Turkish version of the Hung Postpartum Stress Scale (P design of a Turkish version of the Hung Postpartum Stress Scale. The newly developed scale proved to be reliable and valid and will be a valuable instrument for women's healthcare professionals. © 2012 The Authors. Journal of Obstetrics and Gynaecology Research © 2012 Japan Society of Obstetrics and Gynecology.
Zhong, Yan; Xu, Tianqiu; Dong, Ruijuan; Lyu, Jing; Liu, Bo; Chen, Xueqing
The aim of this study was to investigate the reliability and validity of the Infant-toddler Meaningful Auditory Integration Scale (IT-MAIS), Meaningful Auditory Integration Scale (MAIS), and Meaningful Use of Speech Scale (MUSS). IT-MAIS, MAIS and MUSS were divided into 3 sub dimensions. 300 children with cochlear implants (CI) were included in the investigation. To assess test-retest reliability of these questionnaires, 30 children were selected randomly to be evaluated at a two-week interval indicated that there were no significant changes between test and retest. Furthermore random test analysis by different evaluators was also administered to 30 users. Reliability test: Test-retest reliability of the three scales was proved to be satisfactory. All domains had correlation coefficients that exceeded 0.750(P Reliability between evaluators of the three scales were considered to be satisfactory. All domains had correlation coefficients that exceeded 0.750(P Validity test: The evaluation of content validity by expert review showed the questionnaire had good content validity; The correlation coefficients between the overall scores of the three scales and their three domains were 0.699-0.978(P validity. IT-MAIS, MAIS, MUSS scales have good reliability and validity, and can be used to measure the outcome for children with cochlear implants hearing and speech evaluation. Copyright © 2017 Elsevier B.V. All rights reserved.
Ahmad Ali Eslami
Full Text Available Background: The main purpose of this study was to assess the factorial validity and reliability of the Iranian versions of the personality and behavior system scales (49 items of the AHDQ (The Adolescent Health and Development Questionnaire and interrelations among them based on Jessor′s PBT (Problem Behavior Theory. Methods: A multi-staged approach was employed. The cross-cultural adaptation was performed according to the internationally recommended methodology, using the following guidelines: translation, back-translation, revision by a committee, and pretest. After modifying and identifying of the best items, a cross-sectional study was conducted to assess the psychometric properties of Persian version using calibration and validation samples of adolescents. Also 113 of them completed it again two weeks later for stability. Results: The findings of the exploratory factor analysis suggested that the 7-factor solution with low self concept, emotional distress, general delinquency, cigarette, hookah, alcohol, and hard drugs use provided a better fitting model. The a range for these identified factors was 0.69 to 0.94, the ICC range was 0.73 to 0.93, and there was a significant difference in mean scores for these instruments in compare between the male normative and detention adolescents. The first and second-order measurement models testing found good model fit for the 7-factor model. Conclusions: Factor analyses provided support of existence internalizing and externalizing problem behavior syndrome. With those qualifications, this model can be applied for studies among Persian adolescents.
Öksüz, Çigdem; Alemdaroglu, Ipek; Kilinç, Muhammed; Abaoğlu, Hatice; Demirci, Cevher; Karahan, Sevilay; Yilmaz, Oznur; Yildirim, Sibel Aksu
This study was performed to examine the reliability and validity of the Turkish version of ABILHAND-Kids questionnaire which assesses manual functions of children with neuromuscular diseases (NMDs). A cross sectional survey study design and Rasch analysis were used to assess the reliability and validity of the Turkish version of scale. Ninety-three children with different neuromuscular disorders and their parents were included in the study. The scale was applied to the parents with face-to-face interview twice; on their first visit and after an interval of 15 days. The test-retest reliability was assessed with intraclass correlation coefficient (ICC), and internal consistency of the multi-item subscales by calculating Cronbach alpha values. Brooke Upper Extremity Functional Classification (BUEFC) and Wee-Functional Independency Measurement (Wee-FIM) were correlated to determine the construct validity. The ICC value for the test/retest reliability was 0.94. The internal consistency was 0.81. Floor (1.1%) and ceiling (11.8%) effects were not significant. There were moderate correlations between the Turkish version of ABILHAND-Kids and Wee-FIM (0.67) and BUEFC (-0.37). Rasch analysis indicated good item ﬁt, unidimensionality, and model ﬁt. The Turkish version of ABILHAND-Kids questionnaire was found to be a reliable and valid scale for the assessment of the manual ability of children with NMDs.
Gance-Cleveland, Bonnie; Schmiege, Sarah; Aldrich, Heather; Stevens, Carol; Scheller, Makena
Reliable and valid instruments to measure counseling effectiveness related to eating, activity, and inactivity are lacking. HeartSmartKids (HSK; HeartSmartKids, LLC, Boulder, Colorado) is a bilingual decision-support technology that can be used to assess and counsel families on cardiovascular risk factors in children. The purpose of this study was to establish preliminary test-retest reliability and convergent validity of HSK questions relative to a previously validated questionnaire, HABITS. Data were collected from children (ages 9-14 years) during an intervention study. A multitrait/multimethod correlation matrix framework was used to examine the test-retest reliability (n = 35) and convergent validity (n = 103) of HSK relative to HABITS. HSK had moderate/strong test-retest reliability and good convergent validity between HSK and HABITS. This study provides preliminary test-retest reliability and convergent validity of a useful clinical tool that can be used to quickly assess children's nutrition (e.g., fruit/vegetable consumption, snacks, sugar-sweetened beverages, breakfast) and activity habits to guide behavior change counseling. Additional work should be done validating HSK with objective measures. Copyright © 2018 National Association of Pediatric Nurse Practitioners. Published by Elsevier Inc. All rights reserved.
Cano-Cappellacci, Marcelo; Leyton, Fernanda Aleitte; Carreño, Joshua Durán
ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2) for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries. PMID:26815160
Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2 for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
Cano-Cappellacci, Marcelo; Leyton, Fernanda Aleitte; Carreño, Joshua Durán
To validate a Spanish version of the Test of Gross Motor Development (TGMD-2) for the Chilean population. Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
Full Text Available The main purpose of the present study was to examine the validation and reliability of the Korean version of the Sport Anxiety Scale (SAS-2Kr by evaluating its factorial invariance across gender. A total of 303 Korean collegiate athletes (198 males and 105 females from 9 sports participated in the study, and they completed the demographic questionnaire and the SAS-2Kr containing 15 items to measure multidimensional trait anxiety and individual differences in the cognitive and somatic anxiety experienced by athletes. The results of this study indicated that the construct validity in the SAS-2Kr was well established in that the values of the standardized factor loadings, composite reliability, and average variance extracted values were above the recommended cutoff points. The multiple-sample confirmatory factor analyses showed the SAS-2Kr could be generalizable across gender in college samples. The results also indicated that the SAS-2Kr supported the original 3-factor model of SAS-2 in English consisting of somatic anxiety, worry, and concentration disruption, and thus this study provides useful information for researchers to understand the athletes’ tendency to experience anxiety reactions in sport situations. Suggestions for future research on competitive trait anxiety are provided in the discussion section.
Corrigan, Patrick W; Michaels, Patrick J; Vega, Eduardo; Gause, Michael; Watson, Amy C; Rüsch, Nicolas
The internalization of public stigma by persons with serious mental illnesses may lead to self-stigma, which harms self-esteem, self-efficacy, and empowerment. Previous research has evaluated a hierarchical model that distinguishes among stereotype awareness, agreement, application to self, and harm to self with the 40-item Self-Stigma of Mental Illness Scale (SSMIS). This study addressed SSMIS critiques (too long, contains offensive items that discourages test completion) by strategically omitting half of the original scale's items. Here we report reliability and validity of the 20-item short form (SSMIS-SF) based on data from three previous studies. Retained items were rated less offensive by a sample of consumers. Results indicated adequate internal consistencies for each subscale. Repeated measures ANOVAs showed subscale means progressively diminished from awareness to harm. In support of its validity, the harm subscale was found to be inversely and significantly related to self-esteem, self-efficacy, empowerment, and hope. After controlling for level of depression, these relationships remained significant with the exception of the relation between empowerment and harm SSMIS-SF subscale. Future research with the SSMIS-SF should evaluate its sensitivity to change and its stability through test-rest reliability. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Full Text Available Marco Mula, Stefano Pini, Simona Calugi, Matteo Preve, Matteo Masini, Ilaria Giovannini, Ciro Conversano, Paola Rucci, Giovanni B CassanoDepartment of Psychiatry, Neurobiology, Pharmacology and Biotechnologies, University of Pisa, ItalyAbstract: This study evaluates the validity and reliability of a new instrument developed to assess symptoms of depersonalization: the Structured Clinical Interview for the Depersonalization-Derealization Spectrum (SCI-DER. The instrument is based on a spectrum model that emphasizes soft-signs, sub-threshold syndromes as well as clinical and subsyndromal manifestations. Items of the interview include, in addition to DSM-IV criteria for depersonalization, a number of features derived from clinical experience and from a review of phenomenological descriptions. Study participants included 258 consecutive patients with mood and anxiety disorders, 16.7% bipolar I disorder, 18.6% bipolar II disorder, 32.9% major depression, 22.1% panic disorder, 4.7% obsessive compulsive disorder, and 1.5% generalized anxiety disorder; 2.7% patients were also diagnosed with depersonalization disorder. A comparison group of 42 unselected controls was enrolled at the same site. The SCI-DER showed excellent reliability and good concurrent validity with the Dissociative Experiences Scale. It significantly discriminated subjects with any diagnosis of mood and anxiety disorders from controls and subjects with depersonalization disorder from controls. The hypothesized structure of the instrument was confirmed empirically.Keywords: depersonalization, derealization, mood disorders, anxiety disorders
Gassman, Esther E.; Kallemeyn, Nicole A.; DeVries, Nicole A.; Shivanna, Kiran H.; Powell, Stephanie M.; Magnotta, Vincent A.; Ramme, Austin J.; Adams, Brian D.; Grosland, Nicole M.
The objective was to develop tools for automating the identification of bony structures, to assess the reliability of this technique against manual raters, and to validate the resulting regions of interest against physical surface scans obtained from the same specimen. Artificial intelligence-based algorithms have been used for image segmentation, specifically artificial neural networks (ANNs). For this study, an ANN was created and trained to identify the phalanges of the human hand. The relative overlap between the ANN and a manual tracer was 0.87, 0.82, and 0.76, for the proximal, middle, and distal index phalanx bones respectively. Compared with the physical surface scans, the ANN-generated surface representations differed on average by 0.35 mm, 0.29 mm, and 0.40 mm for the proximal, middle, and distal phalanges respectively. Furthermore, the ANN proved to segment the structures in less than one-tenth of the time required by a manual rater. The ANN has proven to be a reliable and valid means of segmenting the phalanx bones from CT images. Employing automated methods such as the ANN for segmentation, eliminates the likelihood of rater drift and inter-rater variability. Automated methods also decrease the amount of time and manual effort required to extract the data of interest, thereby making the feasibility of patient-specific modeling a reality. (orig.)
Maria C. Richaud
Full Text Available Empathy is a basic socio-emotional process of human development that involves the ability to perceive, share, and understand the emotional states of others. This process is essential to successful social functioning. However, despite its significance, empathy has been difficult to define and measure, particularly when incorporating both its emotional and cognitive aspects. The purpose of this study was to develop an Empathy Questionnaire for children aged 9–12 years based on a model of social cognitive neuroscience and to analyze its construct validity and reliability. This questionnaire aimed to integrate the following aspects: emotional contagion, self-other awareness, perspective-taking, emotional regulation, and empathic action. Three studies were conducted. Study 1 evaluated the discriminative power of the items and studied the underlying structure of the instrument using exploratory factor analysis. In Study 2, confirmatory factor analysis was performed to test the model obtained. Finally, the goal of Study 3 was to analyze the convergent and discriminant validity of the questionnaire and the internal consistency of its dimensions. The final version of the instrument contained 15 items that operationalized the previously listed dimensions. The results of the 3 studies indicated that the questionnaire had good validity and reliability. This study has important implications for research and clinical practice. Given its simplicity and brevity, this new self-report scale may work well as a screening method to evaluate the key psychological issues underlying numerous child behaviors that predict the success or failure of social relationships, individual quality of life, and mental well-being.
Saad, Karen Ruggeri; Colombo, Alexandra S; João, Silvia M Amado
The purpose of this study was to investigate the reliability and validity of photogrammetry in measuring the lateral spinal inclination angles. Forty subjects (32 female and 8 males) with a mean age of 23.4 +/- 11.2 years had their scoliosis evaluated by radiographs of their trunk, determined by the Cobb angle method, and by photogrammetry. The statistical methods used included Cronbach alpha, Pearson/Spearman correlation coefficients, and regression analyses. The Cronbach alpha values showed that the photogrammetric measures showed high internal consistency, which indicated that the sample was bias free. The radiograph method showed to be more precise with intrarater reliabilities of 0.936, 0.975, and 0.945 for the thoracic, lumbar, and thoracolumbar curves, respectively, and interrater reliabilities of 0.942 and 0.879 for the angular measures of the thoracic and thoracolumbar segments, respectively. The regression analyses revealed a high determination coefficient although limited to the adjusted linear model between the radiographic and photographic measures. It was found that with more severe scoliosis, the lateral curve measures obtained with the photogrammetry were for the thoracic and lumbar regions (R = 0.619 and 0.551). The photogrammetric measures were found to be reproducible in this study and could be used as supplementary information to decrease the number of radiographs necessary for the monitoring of scoliosis.
Full Text Available Background Imagery is often beneficial not only in gaining a psychological advantage when competing but also in building self-esteem and self-confidence. The aim of this study was to examine the reliability and validity of the Polish adaptation of the Movement Imagery Questionnaire-3 (MIQ-3, consisting of 12 questions measuring 3 dimensions: visual internal imagery, visual external imagery and kinesthetic imagery. Participants and procedure A sample of athletes (N = 276 – 102 women, 174 men (M = 21.25, SD = 6.35 of various disciplines (football, volleyball, karate, swimming, etc. with different sport experience (from recreation to the national team filled in the MIQ-3 questionnaire in the Polish language. Results The results of the confirmatory factor analysis (CFA with maximum likelihood confirmed that the established three-factor model reflects well the relationships observed in the respondents’ answers (satisfactory value of RMSEA below the recommended value of .05 (RMSEA = .04 and a high value of the index above .90 CFI (CFI = .93. Reliability indicators (composite reliability – CR observed for individual factors indicated a very high internal consistency (external visual imagery = .75, internal visual imagery = .79 and kinesthetic imagery = .82. The results indicated that good stability and internal consistency were maintained over a 3-week period. In addition, analyses were examined across age, level of experience and gender. Conclusions The results of the present study support the psychometric properties of the Polish adaptation of the MIQ-3.
Wongpakaran, Tinakon; Wongpakaran, Nahathai; Ruktrakul, Ruk
This study examines the Thai version of the Multidimensional Scale of Perceived Social Support (MSPSS) for its psychometric properties. In total 462 participants were recruited - 310 medical students from Chiang Mai University and 152 psychiatric patients, and they completed the Thai version of the MSPSS, the State Trait Anxiety Inventory (STAI), the Rosenberg Self-Esteem Scale (RSES) and the Thai Depression Inventory (TDI). Test-retest reliability was conducted over a four week period. Factor analysis produced three-factor solutions for both patient (PG) and student groups (SG), and overall the model demonstrated adequate fit indices. The mean total score and the sub-scale score for the SG were statistically higher than those in the PG, except for 'Significant Others'. The internal consistency of the scale was good, with a Cronbach's alpha of 0.91 for the SG and 0.87 for the PG. After a four week retest for reliability exercise, the intra-class correlation coefficient (ICC) was found to be 0.84. The Thai-MSPSS was found to have a negative correlation with the STAI and the TDI, but was positively correlated with the RSES. The Thai MSPSS is a reliable and valid instrument to use.
The Entrepreneurial Attitude Orientation (EAO) scale is a multidimensional self-report measure of attitudes toward entrepreneurship. Few studies have tested the validity and reliability of the EAO scale in different social and cultural situations. The present study examined the generalizability of the EAO scale in a Chinese context using a two-wave survey. Exploratory factor analysis with the first wave data revealed a four-factor solution consistent with the original scale with 5 items removed. Confirmatory factor analysis of the secondary wave data verified the hypothesized model with a group of parameters and an overall satisfactory fit. The analysis indicated that the Cronbach's a of the four subscales were similar to the values reported by the developers of the scale. However, the results of the test-retest r of the four subscales were lower than the original reports. The findings generally support the generalizability of the four-dimensional model of the EAO. Further research questions are discussed.
Research purpose: The primary goal of the study was to validate the Organisational Citizenship Behaviour Scale (OCBS developed by Podsakoff, Mackenzie, Moorman and Fetter (1990 on a South African sample. Motivation for the study: Organisational citizenship behaviour is one of the important workplace outcomes. A psychometrically sound instrument is therefore required. Research design, approach and method: The sample consisted of 503 employees from the educational sector in the Eastern and Western Cape Provinces of South Africa. The OCBS was used to measure organisational citizenship behaviour. Main findings: High levels of reliability were found for the OCBS sub-scales. The first and second-order measurement models of the OCBS showed good fit. A competing one-factor model did not show good model fit. In terms of discriminant validity four of the five subdimensions correlated highly. Practical/managerial implications: Although the OCBS demonstrated some sound reliability coefficients and reasonable construct validity, the discriminant validity of four of the subscales raise some questions which future studies should confirm. The use of the instrument should help to continue to measure the much-needed extra-role behaviours that mirror an employee’s interest in the success of the organisation. Contribution/value-add: The study contributes to the requirements of the Employment Equity Act (No. 55 of 1998 and the Amended Employment Equity Act of South Africa (Republic of South Africa, 1998; 2014. This promotes the use of reliable and valid instruments in South Africa by confirming the psychometric properties of the OCBS.
Doell, P.; Lehner, B.; Kaspar, F.; Vassolo, S.
due to the precipitation mea- surement errors. Even though the explicit modeling of wetlands and lakes leads to a much improved modeling of both the vertical water balance and the lateral transport of water, not enough information is included in WGHM to accurately capture the hy- drology of these water bodies. Certainly, the reliability of model results is highest at the locations at which WGHM was calibrated. The validation indicates that reliability for cells inside calibrated basins is satisfactory if the basin is relatively homogeneous. Analyses of the few available stations outside of calibrated basins indicate a reason- ably high model reliability, particularly in humid regions.
Suzuki, Eiko; Kanoya, Yuka; Katsuki, Takeshi; Sato, Chifumi
To verify the reliability and validity of a Japanese version of the Rathus Assertiveness Schedule in novice nurses to contribute to nursing management. An adequate scale is needed to measure the assertiveness and the effect of assertion training for Japanese nurses and to compare them with those in other countries. Rathus Assertiveness Schedule was adapted to Japanese with back-translation and its validity was examined in 989 novice nurses. The Japanese version showed a high coefficient of reliability in a split-half reliability test (r=0.76; PRathus Assertiveness Schedule. The Japanese version of Rathus Assertiveness Schedule was verified.
The purpose of this study was to determine the test-retest reliability and concurrent validity of the short form (Form B) of the Coopersmith Self-Esteem Inventory. Criterion measures for validity included: (1) sociometric measures; (2) teacher's popularity ranking; and, (3) self-esteem rating. (Author/LMO)
Biasutti, Michele; Frate, Sara
This article describes the development and validation of the Attitudes toward Sustainable Development scale, a quantitative 20-item scale that measures Italian university students' attitudes toward sustainable development. A total of 484 undergraduate students completed the questionnaire. The validity and reliability of the scale was statistically…
Vanbellingen, Tim; Nyffeler, Thomas; Nef, Tobias; Kwakkel, Gert; Bohlhalter, Stephan; van Wegen, Erwin E.H.
Background Patients with Parkinson's disease exhibit disturbed dexterity. Validated self-reported outcomes for dexterity in Parkinson's disease are lacking. The aim of this study was to investigate the reliability, content and construct validity of a new Dexterity Questionnaire 24. Methods One
Branscum, Paul; Sharma, Manoj; Kaye, Gail; Succop, Paul
Objective: The objective of this study was to report the construct validity and internal consistency reliability of the Food Behavior Checklist modified for children (FBC-MC), with low-income, Youth Expanded Food and Nutrition Education Program (EFNEP)-eligible children. Methods: Using a cross-sectional research design, construct validity was…
Newgent, Rebecca A.; Parr, Patricia E.; Newman, Isadore; Higgins, Kristin K.
This investigation was conducted to estimate the reliability and validity of scores on the Riso-Hudson Enneagram Type Indicator (D. R. Riso & R. Hudson, 1999a). Results of 287 participants were analyzed. Alpha suggests an adequate degree of internal consistency. Evidence provides mixed support for construct validity using correlational and…
Boerebach, Benjamin C. M.; Arah, Onyebuchi A.; Busch, Olivier R. C.; Lombarts, Kiki M. J. M. H.
BACKGROUND: In surgical education, there is a need for educational performance evaluation tools that yield reliable and valid data. This paper describes the development and validation of robust evaluation tools that provide surgeons with insight into their clinical teaching performance. We
M. Reijman (Max); J.M.W. Hazes (Mieke); H.A.P. Pols (Huib); R.M.D. Bernsen (Roos); B.W. Koes (Bart); S.M. Bierma-Zeinstra (Sita)
textabstractObjectives: To compare the reliability and validity in a large open population of three frequently used radiological definitions of hip osteoarthritis (OA): Kellgren and Lawrence grade, minimal joint space (MJS), and Croft grade; and to investigate whether the validity of the three
Objective. We sought to determine the validity and reliability of a self-report physical activity questionnaire (PAQ) measuring physical activity/inactivity in South African schoolgirls of different ethnic origins. Methods. Construct validity of the PAQ was tested against physical activity energy expenditure estimated from an ...
Watt, Torquil; Hegedus, Laszlo; Grønvold, Mogens
Appropriate scale validity and internal consistency reliability have recently been documented for the new thyroid-specific quality of life (QoL) patient-reported outcome (PRO) measure for benign thyroid disorders, the ThyPRO. However, before clinical use, clinical validity and test...
Dere, Zeynep; Ömeroglu, Esra
This study, Creative Behavior Observation Form was developed to assess creativity of the children. While the study group on the reliability and validity of Creative Behavior Observation Form was being developed, 257 children in total who were at the ages of 5-6 were used as samples with stratified sampling method. Content Validity Index (CVI) and…
van Raaij, W. Fred; Pruyn, Adriaan T.H.
A control and attribution model of service production and evaluation is proposed. Service production consists of the stages specification (input), realization (throughput), and outcome (output). Customers may exercise control over all three stages of the service. Critical factors of service
Zhang, C; Yang, G P; Li, Z; Li, X N; Li, Y; Hu, J; Zhang, F Y; Zhang, X J
Objective: To assess the reliability and validity of the Chinese version on Alcohol Use Disorders Identification Test (AUDIT) among medical students in China and to provide correct way of application on the recommended scales. Methods: An E-questionnaire was developed and sent to medical students in five different colleges. Students were all active volunteers to accept the testings. Cronbach's α and split-half reliability were calculated to evaluate the reliability of AUDIT while content, contract, discriminant and convergent validity were performed to measure the validity of the scales. Results: The overall Cronbach's α of AUDIT was 0.782 and the split-half reliability was 0.711. Data showed that the domain Cronbach's α and split-half reliability were 0.796 and 0.794 for hazardous alcohol use, 0.561 and 0.623 for dependence symptoms, and 0.647 and 0.640 for harmful alcohol use. Results also showed that the content validity index on the levels of items I-CVI) were from 0.83 to 1.00, the content validity index of scale level (S-CVI/UA) was 0.90, content validity index of average scale level (S-CVI/Ave) was 0.99 and the content validity ratios (CVR) were from 0.80 to 1.00. The simplified version of AUDIT supported a presupposed three-factor structure which could explain 61.175% of the total variance revealed through exploratory factor analysis. AUDIT semed to have good convergent and discriminant validity, with the success rate of calibration experiment as 100%. Conclusion: AUDIT showed good reliability and validity among medical students in China thus worth for promotion on its use.
Leal, Bruna Estima; Gonçalves, Márcia Aparecida; Lisboa, Liseane Gonçalves; Linné, Larissa Martins Schmitz; Tavares, Michelle Gonçalves de Souza; Yamaguti, Wellington Pereira; Paulin, Elaine
Fluoroscopy is considered the most accurate method to evaluate the diaphragm, yet most existing methods for measuring diaphragmatic mobility using fluoroscopy are complex. To assess the validity and reliability of a new evaluation method of diaphragmatic motion using fluoroscopy by digital radiography of healthy adults. Twenty-six adults were evaluated, according to the parameters: anthropometry and pulmonary function test. The evaluation of diaphragm mobility by means of fluoroscopy by digital radiography method was randomly conducted by two raters (A and B). The Pearson correlation coefficient and the intraclass correlation coefficient (ICC) were used to assess the concurrent validity. The inter-rater and intra-rater reliability of the measurement of diaphragmatic motion was determined using ICC and a confidence interval of 95%. There was a relationship in the assessment of the concurrent validity. There was good inter-rater reliability for right hemidiaphragm mobility and moderate reliability for left hemidiaphragm in the first assessment. In the second assessment, there was good reliability for the mobility of both hemidiaphragms. There was good intra-rater reliability in the mobility of both hemidiaphragms for raters A and B. The evaluation of diaphragmatic motion using fluoroscopy by digital radiography proved to be a valid and reliable method of healthy adults.
Køster, B; Søndergaard, J; Nielsen, J B; Olsen, A; Bentzen, J
An important feature of questionnaire validation is reliability. To be able to measure a given concept by questionnaire validly, the reliability needs to be high. The objectives of this study were to examine reliability of attitude and knowledge and behavioral consistency of sunburn in a developed questionnaire for monitoring and evaluating population sun-related behavior. Sun related behavior, attitude and knowledge was measured weekly by a questionnaire in the summer of 2013 among 664 Danes. Reliability was tested in a test-retest design. Consistency of behavioral information was tested similarly in a questionnaire adapted to measure behavior throughout the summer. The response rates for questionnaire 1, 2 and 3 were high and the drop out was not dependent on demographic characteristic. There was at least 73% agreement between sunburns in the measurement week and the entire summer, and a possible sunburn underestimation in questionnaires summarizing the entire summer. The participants underestimated their outdoor exposure in the evaluation covering the entire summer as compared to the measurement week. The reliability of scales measuring attitude and knowledge was high for majority of scales, while consistency in protection behavior was low. To our knowledge, this is the first study to report reliability for a completely validated questionnaire on sun-related behavior in a national random population based sample. Further, we show that attitude and knowledge questions confirmed their validity with good reliability, while consistency of protection behavior in general and in a week's measurement was low.
Verma, Ajai; Samuel, Asir John; Aranha, Vencita Priyanka
To estimate test-retest reliability, inter-rater reliability and validation of a method for measuring balance by stepping over four squares, four square step test (FSST) in children with Down syndrome (DS). A sample of 13 children with DS was recruited for the cross-sectional study. They were asked to perform FSST and the time taken was noted. For estimating test-retest reliability, FSST was done by principal investigator twice and Inter-rater reliability was assessed by principal investigator and second investigator with an interval of 7 days. FRT was used as the criterion dynamic balance measure to validate FSST. The data was tabulated and analysed statistically. The test-retest reliability and inter-rater reliability of FSST are ICC(1,1) = 0.70 (0.64-0.90) and ICC(2,1) = 0.78 (0.62-0.91) respectively. The Bland-Altman limits of agreement were also satisfied. Concurrent validity between FRT and FSST with Spearman's ρ = -0.58 (-0.86 to -0.13). FSST has moderate to good concurrent validity and good reliability among the children with Down syndrome.
Jeng, S F; Yau, K I; Chen, L C; Hsiao, S F
The goal of this study was to examine the reliability and validity of measurements obtained with the Alberta Infant Motor Scale (AIMS) for evaluation of preterm infants in Taiwan. Two independent groups of preterm infants were used to investigate the reliability (n=45) and validity (n=41) for the AIMS. In the reliability study, the AIMS was administered to the infants by a physical therapist, and infant performance was videotaped. The performance was then rescored by the same therapist and by 2 other therapists to examine the intrarater and interrater reliability. In the validity study, the AIMS and the Bayley Motor Scale were administered to the infants at 6 and 12 months of age to examine criterion-related validity. Intraclass correlation coefficients (ICCs) for intrarater and interrater reliability of measurements obtained with the AIMS were high (ICC=.97-.99). The AIMS scores correlated with the Bayley Motor Scale scores at 6 and 12 months (r=.78 and.90), although the AIMS scores at 6 months were only moderately predictive of the motor function at 12 months (r=.56). The results suggest that measurements obtained with the AIMS have acceptable reliability and concurrent validity but limited predictive value for evaluating preterm Taiwanese infants.
Barks, Lelia; Garvan, Cynthia; Crane, Barbara; Peterson, Matthew; Brown, Lisa M; Powell-Cope, Gail
Earlier, we reported development and initial testing of a rapid, low burden measure of wheelchair seated posture (the Seated Posture Scale [SPS]) for research and clinical use. The purpose of this study was to test the intrarater reliability, interrater reliability, and criterion related validity of the SPS. We used a descriptive design to test and develop the instrument. We used the method described by Walter, Donner, and Eliasziw to determine 46 participants were needed, from two Veterans Administration nursing homes or CLCs (community living centers). Using a digital goniometer to score rapidly and with two trained raters, we scored individual participants simultaneously and again in succession. For criterion related concurrent validity, we also scored each participant with Section 2 of the Seated Postural Control Measure for Adults. Intrarater reliability, interrater reliability, and criterion related, concurrent validity were assessed using kappa statistics for individual instrument items (and corresponding 95% confidence intervals where appropriate) and intraclass correlation coefficients (ICC) for total scores. The intrarater intraclass correlation coefficient (ICC) was .995; interrater ICC was .80; interrater reliability kappas ranged from -.03 to .80. Criterion-related, concurrent validity kappas ranged from .13 to .91. ICC for total scores was .85. The SPS has sufficient preliminary validity and reliability to support measurement of wheelchair-seated posture in outcomes research and clinical use. In the future, rehabilitation nurses may use the SPS to improve management of wheelchair seated posture, to improve clinical outcomes, particularly for those who do not reposition themselves.
Gouveia, Valdiney V; Barbosa, Genário Alves; Oliveira Andrade, Edson de; Carneiro, Mauro Brandão
The 12-item General Health Questionnaire (GHQ-12) is a widely used screening instrument. One- and two-factor structures have been identified in some countries. In Brazil, the best factor structure is still unclear. This study aimed at knowing its factorial validity and reliability, and testing the one-factor and two-factor models. The participants were 7,512 Brazilian physicians. They answered the GHQ-12 and demographic questions. Unrotated (one-factor) and rotated (two-factor) structures of the GHQ-12 were extracted by principal component analysis. Confirmatory factor analyses (ML) were used to compare the one- and two-factor solutions. The two-factor model fitted the data better than the one-factor one. Those two factors were depression and social dysfunction, and they showed themselves to be directly correlated to one another. They also showed adequate reliability coefficients. The two-factor model is remarkably adequate, showing better fit indices, although it is acceptable to admit a common factor, which could be defined as psychological distress.
Clifford, Merci S; Hamer, Peter; Phillips, Michael; Wood, Fiona M; Edgar, Dale W
Upper limb (UL) burns can result in significant loss of strength and physical function. The aim of this study was to establish the reliability and validity of grip strength dynamometry (GSD) for measuring burn-affected UL strength over time. A retrospective sample of adult participants (n=89) with UL burns was obtained from Royal Perth Hospital. Data were compiled from assessments conducted at discharge, one, three, six and 12 months afer burn. Within-session reliability and validity was examined through multivariable analyses. GSD demonstrated within-session reliability for all investigated timepoints (ICC's≥0.87, pdynamometry is a reliable and valid outcome measure for measuring burn-affected UL strength from one month to one year after burn. Copyright © 2013 Elsevier Ltd and ISBI. All rights reserved.
Full Text Available BACKGROUND: Cognitive impairment in patients with schizophrenia is a core symptom of this disease. The computerized CogState Battery (CSB has been used to detect seven of the most common cognitive domains in schizophrenia. The aim of this study was to examine the reliability and validity of the Chinese version of the CSB (CSB-C, in Chinese patients with schizophrenia. METHODOLOGY/PRINCIPAL FINDINGS: Sixty Chinese patients with schizophrenia and 58 age, sex, and education matched healthy controls were enrolled. All subjects completed the CSB-C and the Repeated Battery for the Assessment of Neuropsychological Status (RBANS. To examine the test-retest reliability of CSB-C, we tested 33 healthy controls twice, at a one month interval. The Cronbach α value of CSB-C in patients was 0.81. The test-retest correlation coefficients of the Two Back Task, Gronton Maze Learning Task, Social Emotional Cognition Task, and Continuous Paired Association Learning Task were between 0.39 and 0.62 (p<0.01 in healthy controls. The composite scores and all subscores for the CSB-C in patients were significantly (p<0.01 lower than those of healthy controls. Furthermore, composite scores for patients on the RBANS were also significantly lower than those of healthy controls. Interestingly, there was a positive correlation (r = 0.544, p<0.001 between the composite scores on CSB-C and RBANS for patients. Additionally, in the attention and memory cognitive domains, corresponding subsets from the two batteries correlated significantly (p<0.05. Moreover, factor analysis showed a two-factor model, consisting of speed, memory and reasoning. CONCLUSIONS/SIGNIFICANCE: The CSB-C shows good reliability and validity in measuring the broad cognitive domains of schizophrenia in affected Chinese patients. Therefore, the CSB-C can be used as a cognitive battery, to assess the therapeutic effects of potential cognitive-enhancing agents in this cohort.
Full Text Available The 14-item Chalder Fatigue Scale (CFS is widely used, while the 11-item version is seldom to be found in current research in mainland China. The objectives of the present study is to compare the reliability and construct validity between these two versions and to confirm which may be better for the mainland Chinese setting. Based on a cross-sectional health survey with a constructive questionnaire, 1887 individuals aged 18 years or above were selected. Socio-demographic, health-related, gynecological data were collected, and 11-item and 14-item Chalder Fatigue Scale (CFS were used to assess fatigue. Confirmatory factor analysis and exploratory structural equation modeling (ESEM were performed to test the fit of models of the two versions. Confirmatory factor analysis of the two versions of CFS did not support the two-factor theorized models. In addition, a three-factor ESEM model of the 11-item version, but not the 14-item version, showed better factor structure and fitness than the other models examined. Both the versions had good internal consistency reliability and a satisfactory internal consistency (Ω = 0.78–0.96, omega coefficient indicates the internal consistency reliability was obtained from the optimal model. This study provided evidence for satisfactory reliability and structural validity for the three-factor model of the 11-item version, which was proven to be superior to the 14-item version for this data.
Uggioni, Paula Lazzarin; Salay, Elisabete
Validated and reliable instruments for measuring consumer attitudes regarding food quality certifications are lacking, but the measurement of consumer attitude could be an important tool for understanding consumer behavior. Thus the objective of this study was to develop an instrument for measuring consumer attitudes regarding private food safety certifications for commercial restaurants. To this end, the following steps were carried out: development of the interview items; complete pilot testing; item analyses (influence of social desirability and total-item correlation); reliability test (internal consistency and test-retest); and validity assessment (content and discriminative validity and exploratory and confirmatory factor analysis). The subjects, all over the age of 18 and drawn from six non-probabilistic samples (n=7-350) in the city of Campinas, Brazil, were all subjected to an interview. The final scale included 24 items and had a Cronbach's alpha coefficient of 0.79 and a content validation coefficient of 0.99, both within acceptable limits. The confirmatory factor analysis validated a model with five factors and the final instrument discriminated reasonably well between the groups and showed satisfactory reproducibility (r=0.955). Furthermore, the scale validity and reliability were satisfactory, suggesting it could also be applied to future studies. Copyright Â© 2011 Elsevier Ltd. All rights reserved.
Jillian E. Frideres
Full Text Available The purpose of this study was to design and to test the validity and reliability of an instrument to evaluate coaches' knowledge about the female athlete triad syndrome and their confidence in this knowledge. The instrument collects information regarding: knowledge of the syndrome, components, prevention and intervention; confidence of the coaches in their answers; and coach's characteristics (gender, degree held, years of experience in coaching females, continuing education participation specific to the syndrome and its components, and sport coached. The process of designing the questionnaire and testing the validity and reliability of it was done in four phases: a design and development of the instrument, b content validity, c instrument reliability, and d concurrent validity. The results show that the instrument is suitable for measuring coaches' female athlete triad knowledge. The instrument can contribute to assessing the coaches' knowledge level in relation to this topic.
Schneider, Eleanor; Rosenblum, Sara
This article describes the development, reliability, and validity of My Child's Play (MCP), a parent questionnaire designed to evaluate the play of children ages 3-9 yr. The first phase of the study determined the questionnaire's content and face validity. Subsequently, the internal reliability consistency and construct and concurrent validity were demonstrated using 334 completed questionnaires. The MCP showed good internal consistency (α = .86). The factor analysis revealed four distinct factors with acceptable levels of internal reliability (Cronbach's αs = .63-.81) and gender- and age-related differences in play characteristics; both findings attest to the tool's construct validity. Significant correlations (r = .33, p child's play. Copyright © 2014 by the American Occupational Therapy Association, Inc.
Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara
This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
San Isidro, M. J.
It is presented a methodology on empirical validation about any detailed simulation model. This king of validation it is always related with an experimental case. The empirical validation has a residual sense, because the conclusions are based on comparisons between simulated outputs and experimental measurements. This methodology will guide us to detect the fails of the simulation model. Furthermore, it can be used a guide in the design of posterior experiments. Three steps can be well differentiated: Sensitivity analysis. It can be made with a DSA, differential sensitivity analysis, and with a MCSA, Monte-Carlo sensitivity analysis. Looking the optimal domains of the input parameters. It has been developed a procedure based on the Monte-Carlo methods and Cluster techniques, to find the optimal domains of these parameters. Residual analysis. This analysis has been made on the time domain and on the frequency domain, it has been used the correlation analysis and spectral analysis. As application of this methodology, it is presented the validation carried out on a thermal simulation model on buildings, Esp., studying the behavior of building components on a Test Cell of LECE of CIEMAT. (Author) 17 refs
Zhang, Jinwu; Liu, Nian
The Level of Service Inventory-Revised (LSI-R) is an instrument used world-wide for offender risk/need assessment, and the predominant samples for the LSI-R literature were Caucasian participants. This study is the first attempt to examine the reliability and validity of a Chinese version of the LSI-R with 305 probationers (269 males, 36 females) in Guangzhou, China. Factor analysis was conducted using principal component analysis. The internal consistency was estimated by Cronbach's alpha for the total and subscales. Under the cross-sectional design, technical violation (TV) was used as a dependent variable for bivariate correlations and binary logistic regression model to investigate the concurrent validity of the Chinese version of LSI-R, controlling the age. Results offered support to this instrument by sound internal consistency. Significant gender differences were apparent on the total scores, and subscales of Criminal History, Family/Marital, and Companions. Bivariate correlations and binary logistic regression analyses supported the concurrent validity of the LSI-R total score for technical violations, and the subscales of Criminal History and Education/Employment were significantly associated with technical violations, which are consistent with the LSI-R literature. Overall, the LSI-R, which has been translated into Chinese, is a reliable risk/need assessment instrument for technical violations for probationers in Guangzhou, China. Further studies with follow-up data for different offender groups are needed. Discussions about the results, implications, limitations, and further research were presented. © The Author(s) 2014.
Full Text Available Background Imagery is an effective performance enhancement technique. Imagery has been described previously in a range of psychological domains. Measuring imagery is critical in research and practice in sport. Self-report questionnaires are the most regularly used method. The aim of the present study was to examine reliability and validity characteristics of the Imagination in Sport Questionnaire (Kwestionariusz Wyobraźni w Sporcie – KWS. Participants and procedure Five and hundred eight (N = 326 – study I; N = 182 – study II Polish athletes completed questionnaires (169 male, 156 female – study I; 139 male, 43 female – study II, aged between 12 and 57 years (M = 22.08, SD = 8.18 – study I; age 19-24, M = 20.46, SD = 1.1 – study II, at different competitive levels and recruited from various sports disciplines. Results Results indicated the maintained good stability and internal consistency over a 3-week period. Results of confirmatory factor analysis suggested that the 7-factor structure of the KWS resulted in acceptable model fit indices (NC = 2416.63, df = 1203, GFI = 0.944, AGFI = 0.944, CFI = 0.786, RMSEA = 0.056, p (RMSEA < 0.05 = 0.002 – first study; NC = 2234.39, df = 1203, GFI = 0.673, AGFI = = 0.640, CFI = 0.691, RMSEA = 0.069, p (RMSEA < 0.05 = = 0.000 – second study. Concurrent validity was supported by examination of the relationships between the KWS subscales and the SIAM (Sport Imagery Ability Measure in Polish adaptation. In addition, differences in athletes’ imagery ability were examined across competitive levels, and in relation to both gender and age. Conclusions Overall, the results supported the reliability and construct validity of the KWS.
Johnson, G.; Lawrence, D.; Yu, H.
The objective of this project is to develop a method to predict the potential reliability of software to be used in a digital system instrumentation and control system. The reliability prediction is to make use of existing measures of software reliability such as those described in IEEE Std 982 and 982.2. This prediction must be of sufficient accuracy to provide a value for uncertainty that could be used in a nuclear power plant probabilistic risk assessment (PRA). For the purposes of the project, reliability was defined to be the probability that the digital system will successfully perform its intended safety function (for the distribution of conditions under which it is expected to respond) upon demand with no unintended functions that might affect system safety. The ultimate objective is to use the identified measures to develop a method for predicting the potential quantitative reliability of a digital system. The reliability prediction models proposed in this report are conceptual in nature. That is, possible prediction techniques are proposed and trial models are built, but in order to become a useful tool for predicting reliability, the models must be tested, modified according to the results, and validated. Using methods outlined by this project, models could be constructed to develop reliability estimates for elements of software systems. This would require careful review and refinement of the models, development of model parameters from actual experience data or expert elicitation, and careful validation. By combining these reliability estimates (generated from the validated models for the constituent parts) in structural software models, the reliability of the software system could then be predicted. Modeling digital system reliability will also require that methods be developed for combining reliability estimates for hardware and software. System structural models must also be developed in order to predict system reliability based upon the reliability
Casartelli, Nicola; Müller, Roland; Maffiuletti, Nicola A
The aim of the present study was to verify the validity and reliability of the Myotest accelerometric system (Myotest SA, Sion, Switzerland) for the assessment of vertical jump height. Forty-four male basketball players (age range: 9-25 years) performed series of squat, countermovement and repeated jumps during 2 identical test sessions separated by 2-15 days. Flight height was simultaneously quantified with the Myotest system and validated photoelectric cells (Optojump). Two calculation methods were used to estimate the jump height from Myotest recordings: flight time (Myotest-T) and vertical takeoff velocity (Myotest-V). Concurrent validity was investigated comparing Myotest-T and Myotest-V to the criterion method (Optojump), and test-retest reliability was also examined. As regards validity, Myotest-T overestimated jumping height compared to Optojump (p 0.98), that is, excellent validity. Myotest-V overestimated jumping height compared to Optojump (p 12 cm), high limits of agreement ratios (>36%), and low ICCs (9 cm). In conclusion, Myotest-T is a valid and reliable method for the assessment of vertical jump height, and its use is legitimate for field-based evaluations, whereas Myotest-V is neither valid nor reliable.
Hunter, N.F. [Los Alamos National Lab., NM (United States). Engineering Science and Analysis Div.; Barney, P.; Paez, T.L. [Sandia National Labs., Albuquerque, NM (United States). Experimental Structural Dynamics Dept.; Ferregut, C.; Perez, L. [Univ. of Texas, El Paso, TX (United States). Dept. of Civil Engineering
It is common practice in structural dynamics to develop mathematical models for system behavior, and the authors are now capable of developing stochastic models, i.e., models whose parameters are random variables. Such models have random characteristics that are meant to simulate the randomness in characteristics of experimentally observed systems. This paper suggests a formal statistical procedure for the validation of mathematical models of stochastic systems when data taken during operation of the stochastic system are available. The statistical characteristics of the experimental system are obtained using the bootstrap, a technique for the statistical analysis of non-Gaussian data. The authors propose a procedure to determine whether or not a mathematical model is an acceptable model of a stochastic system with regard to user-specified measures of system behavior. A numerical example is presented to demonstrate the application of the technique.
Liu, Z.; Merwade, V.
Uncertainty analysis is an indispensable part of modeling the hydrology and hydrodynamics of non-idealized environmental systems. Compared to reliance on prediction from one model simulation, using on ensemble of predictions that consider uncertainty from different sources is more reliable. In this study, Bayesian model averaging (BMA) is applied to Black River watershed in Arkansas and Missouri by combining multi-model simulations to get reliable deterministic water stage and probabilistic inundation extent predictions. The simulation ensemble is generated from 81 LISFLOOD-FP subgrid model configurations that include uncertainty from channel shape, channel width, channel roughness and discharge. Model simulation outputs are trained with observed water stage data during one flood event, and BMA prediction ability is validated for another flood event. Results from this study indicate that BMA does not always outperform all members in the ensemble, but it provides relatively robust deterministic flood stage predictions across the basin. Station based BMA (BMA_S) water stage prediction has better performance than global based BMA (BMA_G) prediction which is superior to the ensemble mean prediction. Additionally, high-frequency flood inundation extent (probability greater than 60%) in BMA_G probabilistic map is more accurate than the probabilistic flood inundation extent based on equal weights.
Herbert, A.W.; Hodgkinson, D.P.; Jackson, C.P.; Lever, D.A.; Robinson, P.C.
The numerical accuracy of the computer models for groundwater flow and radionuclide transport that are to be used in repository safety assessment must be tested, and their ability to describe experimental data assessed: they must be verified and validated respectively. Also appropriate ways to use the codes in performance assessments, taking into account uncertainties in present data and future conditions, must be studied. These objectives are being met by participation in international exercises, by developing bench-mark problems, and by analysing experiments. In particular the project has funded participation in the HYDROCOIN project for groundwater flow models, the Natural Analogues Working Group, and the INTRAVAL project for geosphere models. (author)
Hoppe, Matthias W; Baumgart, Christian; Polglaze, Ted; Freiwald, Jürgen
This study aimed to investigate the validity and reliability of global (GPS) and local (LPS) positioning systems for measuring distances covered and sprint mechanical properties in team sports. Here, we evaluated two recently released 18 Hz GPS and 20 Hz LPS technologies together with one established 10 Hz GPS technology. Six male athletes (age: 27±2 years; VO2max: 48.8±4.7 ml/min/kg) performed outdoors on 10 trials of a team sport-specific circuit that was equipped with double-light timing gates. The circuit included various walking, jogging, and sprinting sections that were performed either in straight-lines or with changes of direction. During the circuit, athletes wore two devices of each positioning system. From the reported and filtered velocity data, the distances covered and sprint mechanical properties (i.e., the theoretical maximal horizontal velocity, force, and power output) were computed. The sprint mechanical properties were modeled via an inverse dynamic approach applied to the center of mass. The validity was determined by comparing the measured and criterion data via the typical error of estimate (TEE), whereas the reliability was examined by comparing the two devices of each technology (i.e., the between-device reliability) via the coefficient of variation (CV). Outliers due to measurement errors were statistically identified and excluded from validity and reliability analyses. The 18 Hz GPS showed better validity and reliability for determining the distances covered (TEE: 1.6-8.0%; CV: 1.1-5.1%) and sprint mechanical properties (TEE: 4.5-14.3%; CV: 3.1-7.5%) than the 10 Hz GPS (TEE: 3.0-12.9%; CV: 2.5-13.0% and TEE: 4.1-23.1%; CV: 3.3-20.0%). However, the 20 Hz LPS demonstrated superior validity and reliability overall (TEE: 1.0-6.0%; CV: 0.7-5.0% and TEE: 2.1-9.2%; CV: 1.6-7.3%). For the 10 Hz GPS, 18 Hz GPS, and 20 Hz LPS, the relative loss of data sets due to measurement errors was 10.0%, 20.0%, and 15.8%, respectively. This study shows that
Palm, Peter; Josephson, Malin; Mathiassen, Svend Erik; Kjellberg, Katarina
We evaluated the intra- and inter-observer reliability and criterion validity of an observation protocol, developed in an iterative process involving practicing ergonomists, for assessment of working technique during cash register work for the purpose of preventing upper extremity symptoms. Two ergonomists independently assessed 17 15-min videos of cash register work on two occasions each, as a basis for examining reliability. Criterion validity was assessed by comparing these assessments with meticulous video-based analyses by researchers. Intra-observer reliability was acceptable (i.e. proportional agreement >0.7 and kappa >0.4) for 10/10 questions. Inter-observer reliability was acceptable for only 3/10 questions. An acceptable inter-observer reliability combined with an acceptable criterion validity was obtained only for one working technique aspect, 'Quality of movements'. Thus, major elements of the cashiers' working technique could not be assessed with an acceptable accuracy from short periods of observations by one observer, such as often desired by practitioners. Practitioner Summary: We examined an observation protocol for assessing working technique in cash register work. It was feasible in use, but inter-observer reliability and criterion validity were generally not acceptable when working technique aspects were assessed from short periods of work. We recommend the protocol to be used for educational purposes only.
Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H
Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (pTest-retest reliability was (ICC3,3) = 0.953 (pvalidity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Yildirim, Yasemin; Tokem, Yasemin; Bozkurt, Nilufer; Fadiloglu, Cicek; Uyar, Meltem; Uslu, Ruchan
The Memorial Symptom Assessment Scale (MSAS) is a multidimensional tool developed to evaluate measure the prevalence, characteristics and distress of common symptoms related to cancer. A validated Turkish version has now become available. The aim of this study was to evaluate its reliability and validity. One hundred-twenty patients were included into this study. The MSAS, The Rotterdam Symptom Checklist (RSCL), and Karnofsky Performance Status Scale (KPSS) were used for data collection. Content and criterion validities were examined. Reliability analyses of the MSAS were performed using internal consistency reliability and test-retest reliability. The most frequently reported symptom (90%) was problems with sexual interest or activity. Item-total correlations ranged between 0.03 and 0.64. There was a high correlation between total MSAS and the RSCL (r=0.875, ptest - re-test reliability was 0.78. The MSAS for cancer patients was determined to be a valid and reliable instrument for the use in the Turkish population. It is recommended that the MSAS-Turkish version can be used as a tool for comprehensive symptom assessment in planning nursing care for cancer patients.
Huang, X N; Zhang, Y; Feng, W W; Wang, H S; Cao, B; Zhang, B; Yang, Y F; Wang, H M; Zheng, Y; Jin, X M; Jia, M X; Zou, X B; Zhao, C X; Robert, J; Jing, Jin
Objective: To evaluate the reliability and validity of warning signs checklist developed by the National Health and Family Planning Commission of the People's Republic of China (NHFPC), so as to determine the screening effectiveness of warning signs on developmental problems of early childhood. Method: Stratified random sampling method was used to assess the reliability and validity of checklist of warning sign and 2 110 children 0 to 6 years of age(1 513 low-risk subjects and 597 high-risk subjects) were recruited from 11 provinces of China. The reliability evaluation for the warning signs included the test-retest reliability and interrater reliability. With the use of Age and Stage Questionnaire (ASQ) and Gesell Development Diagnosis Scale (GESELL) as the criterion scales, criterion validity was assessed by determining the correlation and consistency between the screening results of warning signs and the criterion scales. Result: In terms of the warning signs, the screening positive rates at different ages ranged from 10.8%(21/141) to 26.2%(51/137). The median (interquartile) testing time for each subject was 1(0.6) minute. Both the test-retest reliability and interrater reliability of warning signs reached 0.7 or above, indicating that the stability was good. In terms of validity assessment, there was remarkable consistency between ASQ and warning signs, with the Kappa value of 0.63. With the use of GESELL as criterion, it was determined that the sensitivity of warning signs in children with suspected developmental delay was 82.2%, and the specificity was 77.7%. The overall Youden index was 0.6. Conclusion: The reliability and validity of warning signs checklist for screening early childhood developmental problems have met the basic requirements of psychological screening scales, with the characteristics of short testing time and easy operation. Thus, this warning signs checklist can be used for screening psychological and behavioral problems of early childhood
Full Text Available Background: The purpose of this study was to evaluate the validity and reliability on the Persian translation of the Modifiable Activity Questionnaire (MAQ in a sample of Tehranian adolescents. Methods: Of a total of 52 subjects, a sub-sample of 40 participations (55.0% boys was used to assess the reliability and the validity of the physical activity questionnaire. The reliability of the two MAQs was calculated by intraclass correlation coefficients, and validation was evaluated using Pearson correlation coefficients to compare data between mean of the two MAQs and mean of four physical activity records. Results: Intraclass correlation coefficient was calculated to assess the reliability between two MAQs and the results of leisure time physical activity over the past year were 0.97. Pearson correlation coefficients between mean of two MAQs and mean of four physical activity records were 0.49 (P < 0.001, for leisure time physical activities. Conclusions: High reliability and relatively moderate validity were found for the Persian translation of the MAQ in a Tehranian adolescent population. Further studies with large sample size are suggested to assess the validity more precisely.
Marco Aurelio Lumertz Saffi
Full Text Available Using a sample of patients with coronary artery disease, this methodological study aimed to conduct a cross-cultural adaptation and validation of a questionnaire on knowledge of cardiovascular risk factors (Q-FARCS, lifestyle changes, and treatment adherence for use in Brazil. The questionnaire has three scales: general knowledge of risk factors (RFs; specific knowledge of these RFs; and lifestyle changes achieved. Cross-cultural adaptation included translation, synthesis, back-translation, expert committee review, and pretesting. Face and content validity, reliability, and construct validity were measured. Cronbach’s alpha for the total sample (n = 240 was 0.75. Assessment of psychometric properties revealed adequate face and content validity, and the construct revealed seven components. It was concluded that the Brazilian version of Q-FARCS had adequate reliability and validity for the assessment of knowledge of cardiovascular RFs.
Full Text Available Ciren Yangzong,1,2 Sanguan Lerkiatbundit,3 Ouzhu Luobu,4 Chaoying Cui,4 Tippawan Liabsuetrakul,2 Baima Kangzhuo,4 Deji Quzong,4 Luobu Zhandui,5 Pu Zhen,6 Virasakdi Chongsuvivatwong2 1Department of Preventive Medicine, Medical College, Tibet University, Lhasa, Tibet Autonomous Region, People’s Republic of China; 2Epidemiology Unit, Faculty of Medicine, 3Department of Pharmacy Administration, Faculty of Pharmaceutical Science, Prince of Songkla University, Songkhla, Thailand; 4High Altitude Research Centre, 5Experimental Centre, 6Department of Pharmaceutical Science, Medical College, Tibet University, Lhasa, Tibet Autonomous Region, People’s Republic of China Abstract: Parenting style experienced during childhood has profound effects on children’s futures. Scales developed in other countries have never been validated in the Tibetan context. The present study aimed to examine the construct validity and reliability of a Tibetan translation of the 23-item short form of the Egna Minnen Beträffande Uppfostran [One’s Memories of Upbringing] (s-EMBU and to test the correlation between the parenting styles of fathers and mothers. A cross-sectional study was conducted in a sample of 847 students aged 12–21 years from Lhasa, Tibet, during September and October 2015 with a participation rate of 97.7%. The Tibetan translation of self-completed s-EMBU was administered. Confirmatory factor analysis was employed to test the scale’s validity on the first half of the sample and was then cross-validated with the second half of the sample. The final model consisted of six factors: three (rejection, emotional warmth, and overprotection for each parent, equality constrained on factor loadings, factor correlations, and error variance between father and mother. Father–mother correlation coefficients ranged from 0.81 to 0.86, and the level of consistency ranged from 0.62 to 0.82. Thus, the slightly modified s-EMBU is suitable for use in the Tibetan culture
Helmerhorst Hendrik JF
Full Text Available Abstract Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA and in particular by physical activity questionnaires (PAQs remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument.
Physical inactivity is one of the four leading risk factors for global mortality. Accurate measurement of physical activity (PA) and in particular by physical activity questionnaires (PAQs) remains a challenge. The aim of this paper is to provide an updated systematic review of the reliability and validity characteristics of existing and more recently developed PAQs and to quantitatively compare the performance between existing and newly developed PAQs. A literature search of electronic databases was performed for studies assessing reliability and validity data of PAQs using an objective criterion measurement of PA between January 1997 and December 2011. Articles meeting the inclusion criteria were screened and data were extracted to provide a systematic overview of measurement properties. Due to differences in reported outcomes and criterion methods a quantitative meta-analysis was not possible. In total, 31 studies testing 34 newly developed PAQs, and 65 studies examining 96 existing PAQs were included. Very few PAQs showed good results on both reliability and validity. Median reliability correlation coefficients were 0.62–0.71 for existing, and 0.74–0.76 for new PAQs. Median validity coefficients ranged from 0.30–0.39 for existing, and from 0.25–0.41 for new PAQs. Although the majority of PAQs appear to have acceptable reliability, the validity is moderate at best. Newly developed PAQs do not appear to perform substantially better than existing PAQs in terms of reliability and validity. Future PAQ studies should include measures of absolute validity and the error structure of the instrument. PMID:22938557
Preti, Antonio; Sheehan, David V; Coric, Vladimir; Distinto, Marco; Pitanti, Mirko; Vacca, Irene; Siddi, Alessandra; Masala, Carmelo; Petretto, Donatella Rita
The Sheehan Suicidality Tracking Scale (S-STS) is a patient self-report or clinician-administered rating scale that tracks spontaneous and treatment-emergent suicidal ideation and behaviors. This study set out to evaluate the reliability, convergent and divergent validity of the S-STS in a sample of college students, a population with a high risk of completed and attempted suicide. Cross-sectional, survey design. Participants (303 undergraduate students; males: 42%) completed several measures assessing psychological distress (General Health Questionnaire; GHQ); self-esteem (Rosenberg Self Esteem Scale; RSES); social support (Modified Social Support Survey; MOSSS); and suicidal behavior, including ideation and attempts (S-STS). Both internal consistency and test-retest stability were excellent for the S-STS-global score. The S-STS subscale on suicide ideation also showed good reliability, while the subscale on suicidal behavior showed some inconsistency at retest. Convergent and divergent validity of S-STS was confirmed. All S-STS items loaded on a single factor, which had an excellent fit for the unidimensional model, thus justifying the use of the S-STS as a screening tool. In a mediation model, self-esteem and social support explained 45% of the effects of psychological distress on suicide ideation and behavior as measured by the S-STS-global score. This study provided promising evidence on the convergent, divergent, internal consistency and test-retest stability of the Sheehan Suicidality Tracking Scale. The cross-sectional design and lack of measures of hopelessness and helplessness prevent any conclusion about the links of suicidal behavior with self-esteem and social support. Copyright © 2013 Elsevier Inc. All rights reserved.
Guo, Xinying; Wu, Xinjuan; Guo, Aimin; Zhao, Yanwei
Condyloma acuminata (CA) is a sexually transmitted disease that affects quality of life (QOL). CECA10 is an English-language questionnaire for assessing QOL in patients with CA, but there is no equivalent in China. This study aimed to develop a validated and reliable Chinese version of CECA10.The Chinese CECA10 was developed from the English version by forward translation, back translation, comparison with the original, cultural adjustments, and a pre-test (5 patients). The Chinese CECA10 and EuroQol Five Dimensions Three Level Questionnaire (EQ-5D-3L) was administered to patients with CA. Content validity (item/scale content validity indexes, I-CVI/S-CVI), test-retest reliability (intraclass coefficient, ICC), internal consistency (Cronbach α), criterion validity (comparison with the Dermatology Life Quality Index, DLQL, using Spearman correlation analysis), construct validity (exploratory factor analysis), and discriminant validity (between subgroups based on number of warts, number of recurrences, or number of sites involved) were assessed.The Chinese CECA10 had good test-retest reliability (ICC = 0.98, P validity (I-CVI = 1 for all items), and criterion validity (r = -0.50, P 0.4. Discriminant validity was not high. The mean CECA10 and EQ-VAS scores of 211 patients with CA (28.19 ± 7.16 years; 139 males) were 34.56 ± 19.01 and 64.64 ± 19.28, respectively.The Chinese CECA10 has good reliability and validity for evaluating the QOL of Chinese patients with CA.
Campo-Arias, Adalberto; Lafaurie, María Mercedes; Gaitán-Duarte, Hernando G
There are several scales to quantify homophobia in different populations. However, the reliability and validity of these instruments among Colombian students are unknown. Consequently, this work is intended to assess reliability (inner consistency) as well as the validity of the Scale for Homophobia in Medicine students from a private university in Bogotá (Colombia). Methodological study with 199 Medicine students from 1st to 5th semester that filled out the Homophobia Scale form, the general welfare questionnaire, the Attitude Towards Gays and Lesbians Scale (ATGL), WHO-5 (divergent validity) and the Francis Scale of Attitude Toward Christianity (nomologic validity). Pearson's correlations were computed, the Cronbach's alfa coefficient, the omega coefficient (construct's reliability) and confirmatory factorial analysis. The Scale for Homophobia showed an alpha Cronbach coefficient of 0,785, an omega coefficient of 0,790 and a Pearson correlation with the ATGL of 0,844; with WHO-5, -0,059; and a Francis Scale of Attitude Toward Christianity, 0,187. The Scale toward Homophobia exhibited a relevant factor of 44,7% of the total variance. The Scale for Homophobia showed acceptable reliability and validity. New studies should investigate the stability of the scale and the nomologic validity regarding other constructs. Copyright © 2012 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Dawson, Andreas; Raphael, Karen G; Glaros, Alan; Axelsson, Susanna; Arima, Taro; Ernberg, Malin; Farella, Mauro; Lobbezoo, Frank; Manfredini, Daniele; Michelotti, Ambra; Svensson, Peter; List, Thomas
To combine empirical evidence and expert opinion in a formal consensus method in order to develop a quality-assessment tool for experimental bruxism studies in systematic reviews. Tool development comprised five steps: (1) preliminary decisions, (2) item generation, (3) face-validity assessment, (4) reliability and discriminitive validity assessment, and (5) instrument refinement. The kappa value and phi-coefficient were calculated to assess inter-observer reliability and discriminative ability, respectively. Following preliminary decisions and a literature review, a list of 52 items to be considered for inclusion in the tool was compiled. Eleven experts were invited to join a Delphi panel and 10 accepted. Four Delphi rounds reduced the preliminary tool-Quality-Assessment Tool for Experimental Bruxism Studies (Qu-ATEBS)- to 8 items: study aim, study sample, control condition or group, study design, experimental bruxism task, statistics, interpretation of results, and conflict of interest statement. Consensus among the Delphi panelists yielded good face validity. Inter-observer reliability was acceptable (k = 0.77). Discriminative validity was excellent (phi coefficient 1.0; P reviews of experimental bruxism studies, exhibits face validity, excellent discriminative validity, and acceptable inter-observer reliability. Development of quality assessment tools for many other topics in the orofacial pain literature is needed and may follow the described procedure.
Loudon, Kirsty; Zwarenstein, Merrick; Sullivan, Frank M; Donnan, Peter T; Gágyor, Ildikó; Hobbelen, Hans J S M; Althabe, Fernando; Krishnan, Jerry A; Treweek, Shaun
PRagmatic Explanatory Continuum Indicator Summary (PRECIS)-2 is a tool that could improve design insight for trialists. Our aim was to validate the PRECIS-2 tool, unlike its predecessor, testing the discriminant validity and interrater reliability. Over 80 international trialists, methodologists, clinicians, and policymakers created PRECIS-2 helping to ensure face validity and content validity. The interrater reliability of PRECIS-2 was measured using 19 experienced trialists who used PRECIS-2 to score a diverse sample of 15 randomized controlled trial protocols. Discriminant validity was tested with two raters to independently determine if the trial protocols were more pragmatic or more explanatory, with scores from the 19 raters for the 15 trials as predictors of pragmatism. Interrater reliability was generally good, with seven of nine domains having an intraclass correlation coefficient over 0.65. Flexibility (adherence) and recruitment had wide confidence intervals, but raters found these difficult to rate and wanted more information. Each of the nine PRECIS-2 domains could be used to differentiate between trials taking more pragmatic or more explanatory approaches with better than chance discrimination for all domains. We have assessed the validity and reliability of PRECIS-2. An elaboration study and web site provide guidance to help future users of the tool which is continuing to be tested by trial teams, systematic reviewers, and funders. Copyright © 2017 Elsevier Inc. All rights reserved.
Frattini, Paolo; Valagussa, Andrea; Zenoni, Stefania; Crosta, Giovanni B.
actual blocks, (2) the percentage of trajectories passing through the buffer of the actual rockfall path, (3) the mean distance between the location of arrest of each simulated blocks and the location of the nearest actual blocks; (4) the mean distance between the location of detachment of each simulated block and the location of detachment of the actual block located closer to the arrest position. By applying the four measures to the case studies, we observed that all measures are able to represent the model performance for validation purposes. However, the third measure is more simple and reliable than the others, and seems to be optimal for model calibration, especially when using a parameter estimation and optimization modelling software for automated calibration.
Küçükgüçlü, Özlem; Mert, Hatice; Akpınar, Burcu
To assess the reliability and validity of a Turkish version of Kogan's Attitudes Toward Older People Scale. To explore nursing students' attitudes towards older people, standardised, well-tested instruments are needed. Kogan's Attitudes Toward Older People is a comprehensive instrument for assessment of attitudes toward older adults and has been validated in several languages. However, a validated Turkish version has not been available until now. The study used psychometric testing to establish reliability and validity of the Turkish version of Kogan's Attitudes Toward Older People Scale. A convenience sample of 237 nursing students in Turkey was used to collect data regarding attitudes towards older people. Content validity, construct validity, internal consistency and stability reliability of the Kogan's Attitudes Toward Older People were assessed. Kogan's Attitudes Toward Older People scores were between 74-224. All of the 34 items were found to have significant item-to-total correlations (p student population. The Cronbach's alpha was 0·89 for the total scale (0·82 for negative, 0·85 for positive). In addition, test-retest correlation was 0·83 (negative subscale 0·77, positive subscale 0·73) (p Attitudes Toward Older People was found to be a reliable and valid tool for assessing Turkish nursing students' attitudes toward older adults. This study provided evidence that the Kogan's Attitudes Toward Older People is a reliable and valid instrument for assessing Turkish nursing student's positive and negative attitudes toward older adults. It is easy and practical to use for both informants and investigators and acceptable for Turkish Culture. © 2011 Blackwell Publishing Ltd.
Roldán-Merino, J; Lluch-Canut, M T; Casas, I; Sanromà-Ortíz, M; Ferré-Grau, C; Sequeira, C; Falcó-Pegueroles, A; Soares, D; Puig-Llobet, M
the subscales and the overall scale. Confirmatory factor analysis showed that the model proposed for the factors fit the data satisfactorily. Discussion This scale is a valid and reliable instrument for evaluating positive mental health in university students. Implications for Practice A good questionnaire to measure positive mental health in university students is useful not only to promote mental health but also to strengthen the curricula of future professionals. © 2017 John Wiley & Sons Ltd.
Selim, Abeer A; Ramadan, Fatma H; El-Gueneidy, Mervat M; Gaafer, Maha M
While there is widespread use of OSCE in general nursing specialties, psychiatric nursing has been slow to adopt this evaluation method and it has only recently been introduced to psychiatric nursing education. The main aim of the present study is to test the first application, validity and reliability of the OSCE in undergraduate psychiatric nursing education. OSCE was developed to assess undergraduate psychiatric nursing students' clinical skills. The students' evaluation of the OSCE process was obtained after the completion of each OSCE circuit. The psychiatric nursing OSCE proved to be a reliable and valid method in assessing psychiatric nursing clinical competencies. In general, the students perceived OSCE as a positive experience and stressful on the other hand. OSCE is a reliable and valid method of assessing the students' psychiatric nursing competency skills. It has been shown to have many advantages over traditional methods of assessment and has the ability to objectively assess psychiatric nursing skills. Copyright © 2011 Elsevier Ltd. All rights reserved.
Kang, Deying; Wu, Yuxia; Hu, Dan; Hong, Qi; Wang, Jialiang; Zhang, Xin
Objective. The aim of this study is to measure the reliability and external validity of AMSTAR by applying it to a sample of TCM systematic reviews. Study Design and Methods. We tested the agreement, reliability, construct validity, and feasibility of AMSTAR through comparisons with OQAQ. Statistical analyses were performed by using SPSS 13.0. Results. A random of sample with 41 TCM systematic reviews was selected from a database. The interrater agreement of the individual items of AMSTAR was moderate with a mean kappa of 0.50 (95% CI: 0.26, 0.73). The ICC for AMSTAR against OQAQ (total score of 9 items, excluding item 10) was 0.87 (95% CI: 0.76, 0.93). Conclusions. Although there is room for improvement on few items, the new tool is reliable, valid, and easy to use for methodological quality assessment of systematic reviews on TCM.
Full Text Available Objective. The aim of this study is to measure the reliability and external validity of AMSTAR by applying it to a sample of TCM systematic reviews. Study Design and Methods. We tested the agreement, reliability, construct validity, and feasibility of AMSTAR through comparisons with OQAQ. Statistical analyses were performed by using SPSS 13.0. Results. A random of sample with 41 TCM systematic reviews was selected from a database. The interrater agreement of the individual items of AMSTAR was moderate with a mean kappa of 0.50 (95% CI: 0.26, 0.73. The ICC for AMSTAR against OQAQ (total score of 9 items, excluding item 10 was 0.87 (95% CI: 0.76, 0.93. Conclusions. Although there is room for improvement on few items, the new tool is reliable, valid, and easy to use for methodological quality assessment of systematic reviews on TCM.
Morales-Monforte, Erica; Bagur-Calafat, Caridad; Suc-Lerin, Neus; Fornaguera-Martí, Montserrat; Cazorla-Sánchez, Engracia; Girabent-Farrés, Montserrat
Validity and reliability of the cross-cultural adaptive translation of the Alberta Infant Motor Scale (AIMS), to monitor gross motor development in infants from 0 to 18 months of age, were evaluated. A cross-cultural translation was used to generate a Spanish version of the AIMS. Fifty infants at risk or with diagnosis of motor delay, 0-18 months of age, participated in this study. Two independent physical therapists scored infants on the AIMS. Concurrent validity was tested using the AIMS and the Bayley Scales of Infant and Toddler Development - III (Bayley - III). Reliability and the internal consistency were high (ICCs ranged from 0.94 to 1.00 and KR-20 ranged from 0.90 to 0.98, respectively). AIMS and Bayley - III scores correlated strongly (r = 0.97). The Spanish version of the AIMS presented excellent validity and reliability. Further studies are suggested in order to assess the AIMS in preterm babies.
To provide reliability and validity data on a computerised dental anxiety scale (the Smiley Faces Program or SFP) for children using faces as a response set. Five schools across Derby, Newcastle-upon-Tyne and Shropshire. The SFP was completed by 464 children (mean age 10.8 years). For validity purposes, 241 children also completed two other children's dental anxiety measures (the Modified Child Dental Anxiety Scale and the Dental Fear Survey). For test-retest purposes 100 of the children completed the SFP two weeks later. The SFP demonstrated good internal consistency (alpha = 0.8), test-retest reliability was good and significant correlations were found between the SFP and the other measures. The children rated the local anaesthetic and drill as the most anxiety-provoking items. This study suggests that the SFP is a valid and reliable measure for assessing children's trait dental anxiety and may help encourage dentists to formally assess dental anxiety.
Full Text Available The main objective of this study is to develop a valid and reliable scale for identifying digital citizenship perceptions of young people in the most common age groups. The study was conducted as a survey study. The study group of this study is composed of 438 people in Turkey who are among 16-24 age group with the highest rate of internet use in Turkey. An exploratory factor analysis was performed to determine the validity of the scale and the item discrimination powers were calculated. The total variance of the scale was determined that the scale had 8-factor structure and was found to be 49,70%. The internal consistency level was also calculated to determine the reliability of the scale. As a result, it can be said that this scale is a valid and reliable scale that can be used to determine the digital citizenship perceptions of young people.
Full Text Available The subject of this paper is the level of accuracy i.e. validity and reliability of subjective well-being indicators in sociological research, in case of life satisfaction. First, the relationship between subjective and objective indicators of well-being is presented. Second, the concept of life satisfaction is defined, and the findings of research related to the validity (convergent, discriminant, and predictive and reliability (test-retest, and internal consistency of life satisfaction measures are presented. Third, these findings were reassessed using data originating from large international surveys, which have not been used for this purpose yet. The results in this paper generally contribute to, in quite a large extent existing agreement within the scientific literature about the satisfactory level of validity and reliability of life satisfaction measures i.e. subjective indicators of well-being.
Bartlett, Susan J; Barbic, Skye P; Bykerk, Vivian P
OBJECTIVE: The Outcome Measures in Rheumatology (OMERACT) Rheumatoid Arthritis (RA) Flare Group was established to develop a reliable way to identify and measure RA flares in randomized controlled trials (RCT). Here, we summarized the development and field testing of the RA Flare Questionnaire (RA......-FQ), and the voting results at OMERACT 2016. METHODS: Classic and modern psychometric methods were used to assess reliability, validity, sensitivity, factor structure, scoring, and thresholds. Interviews with patients and clinicians also assessed content validity, utility, and meaningfulness of RA-FQ scores. RESULTS......: People with RA in observational trials in Canada (n = 896) and France (n = 138), and an RCT in the Netherlands (n = 178) completed 5 items (11-point numerical rating scale) representing RA Flare core domains. There was moderate to high evidence of reliability, content and construct validity...
Boonstra, Anne M; Schiphorst Preuper, Henrica R; Reneman, Michiel F; Posthumus, Jitze B; Stewart, Roy E
To determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients was the objective of the study. For the reliability study a test-retest design and for the validity study a cross-sectional design was used. A general rehabilitation centre and a university rehabilitation centre was the setting for the study. The study population consisted of patients over 18 years of age, suffering from chronic musculoskeletal pain; 52 patients in the reliability study, 344 patients in the validity study. Main outcome measures were as follows. Reliability study: Spearman's correlation coefficients (rho values) of the test and retest data of the VAS for disability; validity study: rho values of the VAS disability scores with the scores on four domains of the Short-Form Health Survey (SF-36) and VAS pain scores, and with Roland-Morris Disability Questionnaire scores in chronic low back pain patients. Results were as follows: in the reliability study rho values varied from 0.60 to 0.77; and in the validity study rho values of VAS disability scores with SF-36 domain scores varied from 0.16 to 0.51, with Roland-Morris Disability Questionnaire scores from 0.38 to 0.43 and with VAS pain scores from 0.76 to 0.84. The conclusion of the study was that the reliability of the VAS for disability is moderate to good. Because of a weak correlation with other disability instruments and a strong correlation with the VAS for pain, however, its validity is questionable.
Francine Guimarães Gonçalves
Full Text Available Abstract The Revised Olweus Bully/Victim Questionnaire (OBVQ is among the few bullying assessment instruments with well-established psychometric properties in different countries. Nevertheless, the psychometric properties of the Brazilian version (Questionário de Bullying de Olweus - QBO have not been determined. We aimed at verifying the construct validity and reliability of the bully and victim scales of the QBO. To achieve that goal, the victim and bully scales were assessed using polytomous item response theory (IRT. The best fit was obtained with a generalized partial credit model that is capable of measuring the specific discriminating power for each item in these scales. The QBO was administered to 703 public school students (mean age: 13 years; standard deviation = 1.58. Based on IRT analysis, the number of response categories in each item was reduced from four to three. Cronbach reliability scores were satisfactory: α = 0.85 (victim scale and α = 0.87 (bully scale. In this study, hurtful comments, persecution, or threats had high power to discriminate victims and bullies. For both QBO scales, higher severity parameters were observed for direct bullying items. The results also show that the construct of both QBO scales measures the same construct proposed for the overall instrument. Thus, the QBO can be administered to different Brazilian populations to assess the main characteristics of bullying: repetition of behavior over time and intentionally acting to humiliate, threaten, or harm somebody.
Full Text Available In this study, it is intended to develop a scale unique to our culture, concerning individual instrument performance anxiety of the students who are getting instrument training in the Department of Music Education. In the study, the descriptive research model is used and qualitative research techniques are utilized. The study population consists of the students attending the 23 universities which has Music Education Department. The sample of the study consists of 438 girls and 312 boys, totally 750 students who are studying in the Department of Music Education of randomly selected 10 universities. As a result of the explanatory and confirmatory factor analyses that were performed, a onedimensional structure consisting of 14 items was obtained. Also, t-scores and the coefficient scores of total item correlation concerning the distinguishing power of the items, the difference in the scores of the set of lower and upper 27% was calculated, and it was observed that the items are distinguishing as a result of both analyses. Of the scale, Cronbach's alpha coefficient of internal consistency was calculated as .94, and test-retest reliability coefficient was calculated as .93. As a result, a valid and reliable assessment and evaluation instrument that measures the exam performance anxiety of the students studying in the Department of Music Education, has been developed.
Kanehara, Akiko; Kotake, Risa; Miyamoto, Yuki; Kumakura, Yousuke; Morita, Kentaro; Ishiura, Tomoko; Shimizu, Kimiko; Fujieda, Yumiko; Ando, Shuntaro; Kondo, Shinsuke; Kasai, Kiyoto
Personal recovery is increasingly recognised as an important outcome measure in mental health services. This study aimed to develop a Japanese version of the Questionnaire about the Process of Recovery (QPR-J) and test its validity and reliability. The study comprised two stages that employed the cross-sectional and prospective cohort designs, respectively. We translated the questionnaire using a standard translation/back-translation method. Convergent validity was examined by calculating Pearson's correlation coefficients with scores on the Recovery Assessment Scale (RAS) and the Short-Form-8 Health Survey (SF-8). An exploratory factor analysis (EFA) was conducted to examine factorial validity. We used intraclass correlation and Cronbach's alpha to examine the test-retest and internal consistency reliability of the QPR-J's 22-item full scale, 17-item intrapersonal and 5-item interpersonal subscales. We conducted an EFA along with a confirmatory factor analysis (CFA). Data were obtained from 197 users of mental health services (mean age: 42.0 years; 61.9% female; 49.2% diagnosed with schizophrenia). The QPR-J showed adequate convergent validity, exhibiting significant, positive correlations with the RAS and SF-8 scores. The QPR-J's full version, subscales, showed excellent test-retest and internal consistency reliability, with the exception of acceptable but relatively low internal consistency reliability for the interpersonal subscale. Based on the results of the CFA and EFA, we adopted the factor structure extracted from the original 2-factor model based on the present CFA. The QPR-J is an adequately valid and reliable measure of the process of recovery among Japanese users with mental health services.
Shen, Wang-Qin; Chen, Hong-Lin; Hu, Yan
Self-directed learning is crucial to the professional development of nursing students, and which enables them to expand the knowledge and enhance the quality of their practice. A validated self-directed learning instrument is important not only in assessing the individual's self-directed learning level, but also in evaluating the effectiveness of teaching or learning methods. The aim of this study is to evaluate the validity and reliability of the SDLI in mainland Chinese nursing students. A cross-sectional design with convenience sampling was used to recruit participants from three nursing schools. The mainland Chinese version of SDLI was tested with respect to validity and reliability in 1,499 nursing students, and another 30 nursing students were invited to evaluate the test-retest reliability of the scale in 7 days interval. Explorary factor analysis identified a four-factor structure, accounting for 56.101% of the total variance. The confirmatory factor analysis showed a good overall fit of this four-factor model. Convergent validity was supported by the highly positive Pearson's correlation between SDLI score and SRSSDL score (r = .876, p = .000). Cronbach's alpha for internal consistency of overall scale was .916, and 4 dimensions were between .755-.825.The test-retest reliability of overall scale was .850, and 4 dimensions were between .708-.821. The intraclass correlation coefficient (ICC) of overall scale was .916, and 4 dimensions were .822-.889. This study indicates that the SDLI is a valid and reliable instrument for assessing self-directed learning in mainland Chinese nursing students. Nurse educators could use such knowledge to develop their roles and plan to support nursing students in becoming self-directed learners and lifelong learner.
Panczyk, Mariusz; Stachacz, Grzegorz; Gałązkowski, Robert; Gotlib, Joanna
In the interest of preservation of high degree of objectivity of information about students' educational outcomes, a system of assessment needs to meet criteria of appropriate reliability and validity. Analysis of reliability and validity of the system of assessment of students' educational outcomes for courses followed by an examination and covered by a curriculum in Medical Rescue at Medical University of Warsaw (MU W). A retrospective study enrolling a group of 421 students of eight subsequent full education cycles. Detailed data concerning grades for fourteen courses followed by an examination in the entire course of studies were collected. Reliability (Cronbach's alpha coefficient) and criteria validity (Spearman's rank correlation) were assessed. Internal consistency was estimated using a multiple regression model. The levels of assessment reliability for the general university, pre-clinical, and clinical scopes amounted to alpha: 0.42, 0.53, and 0.70, respectively. The strongest positive correlations between the results of pre-clinical and clinical trainings were found for the Anatomy course (r ≈ 0.30). Only in the case of the Pharmacology course it was found that students' achievements in this field were significantly correlated with all other courses of clinical training. The influence of educational outcomes in particular areas of clinical training on the final grade for the entire course of studies was diverse (β regression between 0.04 and 0.11). While the Pharmacology course had the strongest impact on final results, the Surgery course had the least influence on students' final grades (β = 0.04). 1. Sufficient reliability of the system of assessment of educational outcomes in Medical Rescue showed good precision and repeatability of assessment. 2. A low level of validity was caused by a failure to keep the appropriateness of the assessment of educational outcomes in several clinical courses. 3. Prognostic and diagnostic validity of methods used for
Arunakul, Marut; Arunakul, Preeyaphan; Suesiritumrong, Chakhrist; Angthong, Chayanin; Chernchujit, Bancha
Self-administered questionnaires have become an important aspect for clinical outcome assessment of foot and ankle-related problems. The Foot and Ankle Ability Measure (FAAM) subjective form is a region-specific questionnaire that is widely used and has sufficient validity and reliability from previous studies. Translate the original English version of FAAM into a Thai version and evaluate the validity and reliability of Thai FAAM in patients with foot and ankle-related problems. The FAAM subjective form was translated into Thai using forward-backward translation protocol. Afterward, reliability and validity were tested. Following responses from 60 consecutive patients on two questionnaires, the Thai FAAM subjective form and the short form (SF)-36, were used. The validity was tested by correlating the scores from both questionnaires. The reliability was adopted by measuring the test-retest reliability and internal consistency. Thai FAAM score including activity of daily life (ADL) and Sport subscale demonstrated the sufficient correlations with physical functioning (PF) and physical composite score (PCS) domains of the SF-36 (statistically significant with p foot and ankle-related problems.
Strand, Edythe A; McCauley, Rebecca J; Weigand, Stephen D; Stoeckel, Ruth E; Baas, Becky S
In this article, the authors report reliability and validity evidence for the Dynamic Evaluation of Motor Speech Skill (DEMSS), a new test that uses dynamic assessment to aid in the differential diagnosis of childhood apraxia of speech (CAS). Participants were 81 children between 36 and 79 months of age who were referred to the Mayo Clinic for diagnosis of speech sound disorders. Children were given the DEMSS and a standard speech and language test battery as part of routine evaluations. Subsequently, intrajudge, interjudge, and test-retest reliability were evaluated for a subset of participants. Construct validity was explored for all 81 participants through the use of agglomerative cluster analysis, sensitivity measures, and likelihood ratios. The mean percentage of agreement for 171 judgments was 89% for test-retest reliability, 89% for intrajudge reliability, and 91% for interjudge reliability. Agglomerative hierarchical cluster analysis showed that total DEMSS scores largely differentiated clusters of children with CAS vs. mild CAS vs. other speech disorders. Positive and negative likelihood ratios and measures of sensitivity and specificity suggested that the DEMSS does not overdiagnose CAS but sometimes fails to identify children with CAS. The value of the DEMSS in differential diagnosis of severe speech impairments was supported on the basis of evidence of reliability and validity.
Full Text Available OBJECTIVE: Gülhane Aphasia Test-2 (GAT-2 has been developed to show the presence of a language disorder ‘aphasia’ and to give the clinician implications for the accompanying speech disorders such as apraxia and dysarthria. OBJECTIVE: The aim of the study was to report standardization, validity and reliability study of GAT-2. METHODS: : 10 healthy individuals were tested initially for the pilot study. 134 healthy individual was included to the standardization study and 30 individuals with aphasia and 11 individuals with right brain injury was included to the validation study. The inter group GAT-2 score differentiations and the effects of age, years of education, sex variances were observed. GAT-2 cut-off scores were calculated by the scores of healthy individuals. GAT-2 test-retest reliability and inter-observer reliability was calculated. RESULTS: Healthy individuals’ GAT-2 scores were significantly different from the GAT-2 scores of aphasic patients, but not from right brain injured patients’. Healthy individuals’ GAT-2 scores were not affected from the sex, age variances but from years of education, so cut-off scores were calculated by this variance. GAT-2 scores of aphasic patients were not affected from age, sex and years of education. Test-retest and inter-observer reliability and internal consistency results showed that GAT-2 is a highly reliable aphasia screening test. CONCLUSION: GAT-2 was found to be a standardized, highly reliable and a valid aphasia test for Turkish stroke patients with aphasia
Bayani, Ali Asghar
The internal consistency, test-retest reliability, and construct validity of the Farsi version of the Depression Anxiety Stress Scales were examined, with a sample of 306 undergraduate students (123 men, 183 women) ranging from 18 to 51 years of age (M age = 25.4, SD = 6.1). Participants completed the Satisfaction with Life Scale, Rosenberg Self-esteem Scale, and the Depression Anxiety Stress Scales. The findings confirmed the preliminary reliabilities and preliminary construct validity of the Farsi translation of the Depression Anxiety Stress Scales.
Apolzan, John W; Myers, Candice A; Cowley, Amanda D; Brady, Heather; Hsia, Daniel S; Stewart, Tiffany M; Redman, Leanne M; Martin, Corby K
Mindfulness is theorized to affect the eating behavior and weight of pregnant women, yet no measure has been validated during pregnancy. This study qualitatively and quantitatively evaluated the reliability and validity of the Mindful Eating Questionnaire (MEQ) in overweight and obese pregnant women. Participants completed focus groups and cognitive interviews. The MEQ was administered twice to measure test-retest reliability. The Eating Inventory (EI) and Mindful Attention Awareness Scale (MAAS) were administered to assess convergent validity, and the Neighborhood Environment Walkability Scale (NEWS) assessed discriminant validity. Participants were 20 ± 8 weeks gestation (mean ± SD), 30 ± 2 years old, and 55% were obese. The MEQ total score had good test-retest reliability (r = .85). The total score internal consistency reliability was poor (Cronbach's α = .56). The external cues subscale (ECS) was not internally consistent (α = .31). Other subscales ranged from α = .59-.68. When the ECS was excluded, the MEQ total score internal consistency was acceptable (α = .62). Convergent validity was supported by the MEQ total score (with and without ECS) correlating significantly with the MAAS and the EI disinhibition and hunger subscales. Discriminant validity of the MEQ was supported by the MEQ and NEWS total scores and subscales not being significantly correlated. The quantitative results were supported by the qualitative context and content analysis. With the exception of the ECS, the MEQ's reliability and validity was supported in pregnant women, and most of the subscales were more robust in pregnant women than in the original sample of healthy adults. The MEQ's use with overweight and obese pregnant women is supported. Copyright © 2016 Elsevier Ltd. All rights reserved.
Pakpour, Amir H; Yekaninejad, Mir S; Zarei, Fatemeh; Hashemi, Fariba; Steele, Michael M; Varni, James W
The primary objective of the study was to translate and evaluate the psychometric properties of the Pediatric Quality of Life Inventory™ (PedsQL™) Oral Health Scale in over 1000 Iranian children. A standard forward and backward translation procedure was used to convert the US English dialect version of the PedsQL™ Oral Health Scale into the Iranian language (Persian). The Iranian version of the PedsQL™ Oral Health Scale, in combination with the PedsQL™ 4.0 Generic Core Scales, was then subsequently administered to 1053 Iranian children and 1026 parents. The reliability of the PedsQL™ Oral Health Scale was evaluated using internal consistency and test-retest methods. Known-groups discriminant validity, exploratory factor analysis (EFA) of the Oral Health and the four Generic Core Scales combined, and confirmatory factor analysis (CFA) of the Oral Health Scale alone were conducted. The Benjamini-Hochberg procedure was used to correct P-values for multiple comparisons. Good to excellent internal consistency and test-retest reliabilities were demonstrated. The PedsQL™ Oral Health Scale demonstrated discriminant validity for subgroups of children across different decayed, missing and filled teeth (DMFT) index categories and gender. The EFA supported the a priori factor model of the combined five scales. The CFA analysis confirmed the unidimensional factor structure of the Oral Health Scale. The PedsQL™ Oral Health Scale demonstrated excellent psychometric properties in combination with the PedsQL™ 4.0 Generic Core Scales. These five scales combined can be utilized to assess the multidimensional oral-health-related quality of life of Iranian children. 2011 The Authors. International Journal of Paediatric Dentistry © 2011 BSPD, IAPD and Blackwell Publishing Ltd.
Full Text Available The Chinese Healthy Eating Index (CHEI is a measuring instrument of diet quality in accordance with the Dietary Guidelines for Chinese (DGC-2016. The objective of the study was to evaluate the validity and reliability of the CHEI. Data from 12,473 adults from the China Health and Nutrition Survey (CHNS-2011, including 3-day–24-h dietary recalls were used in this study. The CHEI was assessed by four exemplary menus developed by the DGC-2016, the general linear models, the independent t-test and the Mann–Whitney U-test, the Spearman’s correlation analysis, the principal components analysis (PCA, the Cronbach’s coefficient, and the Pearson correlation with nutrient intakes. A higher CHEI score was linked with lower exposure to known risk factors of Chinese diets. The CHEI scored nearly perfect for exemplary menus for adult men (99.8, adult women (99.7, and the healthy elderly (99.1, but not for young children (91.2. The CHEI was able to distinguish the difference in diet quality between smokers and non-smokers (P < 0.0001, people with higher and lower education levels (P < 0.0001, and people living in urban and rural areas (P < 0.0001. Low correlations with energy intake for the CHEI total and component scores (|r| < 0.34, P < 0.01 supported the index assessed diet quality independently of diet quantity. The PCA indicated that underlying multiple dimensions compose the CHEI, and Cronbach’s coefficient α was 0.22. Components of dairy, fruits and cooking oils had the greatest impact on the total score. People with a higher CHEI score had not only a higher absolute intake of nutrients (P < 0.001, but also a more nutrient-dense diet (P < 0.001. Our findings support the validity and reliability of the CHEI when using the 3-day–24-h recalls.
D'Angelo, Daniela; Mastroianni, Chiara; Artico, Marco; Biagioli, Valentina; Latina, Roberto; Guarda, Michela; Piredda, Michela; De Marinis, Maria Grazia
Patients suffering from advanced disease face different care transitions. The transition from acute to palliative care is challenging and may lead to the discontinuity of care. Family caregivers become important sources of information, as patients begin to experience difficulties in coping with emotional transition events. The Care Transition Measure was developed to evaluate care transitions as experienced by the elderly. It has never been used in palliative care. The aim of this study was to test the validity and reliability of a modified version of the Palliative Care Transition Measure, specifically the Palliative Care Transition Measure for Caregivers (PCTM-C). The study included two main phases. Phase I focused on the construction of a modified version of the Palliative Care Transition Measure through two focus groups and by computing the content validity index. Phase II focused on testing the psychometric properties of the PCTM-C on 272 family caregivers through confirmatory factor analysis. Result The content validity index for each of the items was higher than 0.80, whereas that for the scale was 0.95. The model tested with confirmatory factor analysis fitted the data well and confirmed that the transition measures referred to communication, integrated care and a trusting-relationship, and therefore the core dimensions of continuity according to existing conceptual models. The internal consistency was high (Cronbach's alpha = 0.94). Significance of results The PCTM-C proved to be a suitable measure of the quality of such transitions. It may be used in clinical practice as a continuity quality indicator and has the potential to guide interventions to enhance family caregivers' experience of care continuity.
Gomutbutra, Patama; Aramrat, Apinun; Sattapansri, Worapoj; Chutima, Siam; Tooprakai, Dusida; Sakarinkul, Pokin; Sangkhasilapin, Yaowapa
The Assessment Chronic Illness Care (ACIC), developed in the United States, is a quality-improvement tool used to help organization evaluate the strengths and weaknesses of their delivery of care for chronic illness in six areas, community linkages, self-management support, decision support, delivery system design, information systems, and organization of care. These areas of care are influenced by the Chronic Care Model. The questionnaire scale ranges from 0 to 11. Translate in Thai language and validate the ACIC as a practical tool to measure the quality of chronic illness care in Thailand. In a cross-sectional study, the content validity was examined by public health experts. The original ACIC was translated into Thai with permission from The MacColl Institute for Healthcare Innovation at Group Health's Center for Health Studies. The translation process followed the World Health Organization (WHO) process of translation and adaptation of instruments, including forward translation, expert panel and synthesis of the translation, back translation, pre-testing, and cognitive interviewing. The pre-testing was done by distributing the questionnaire to a sample of 12 organizations with cognitive interviewing, followed by revision and finalization of the questionnaire. The reliability and validity of the translated version was then examined by distributing the questionnaire to 172 organizations (84 district hospitals and 88 community health center primary care units within the upper northern part of Thailand)focusing on care of cerebrovascular disease. The response rate was approximately 70% or 120 organizations. The results from these organizations 'self-assessment showed that the Thai version of ACIC achieved good levels of reliability and validity, with the range of Cronbach's alpha coefficients being 0.846 to 0.972 in each aspect of ACIC. However ACIC inablility to detect statistical significant difference in score for each dimension though the self
Li, Tong; Jia, Qingxuan; Chen, Gang; Sun, Hanxu
Motion reliability as a criterion can reflect the accuracy of manipulator in completing operations. Since path planning task takes a significant role in operations of manipulator, the motion reliability evaluation of path planning task is discussed in the paper. First, a modeling method for motion reliability is proposed by taking factors related to position accuracy of manipulator into account. In the model, multidimensional integral for PDF is carried out to calculate motion reliability. Co...
Isabelle Ottenvall Hammar
Full Text Available In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA-Older persons (IPA-O, showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation.
Danielle Fabiana Cucolo
Full Text Available ABSTRACT Objectives: to verify the reliability and construct validity estimates of the "Assessment of nursing care product" scale (APROCENF and its applicability. Methods: this validation study included a sample of 40 (inter-rater reliability and 172 (construct validity assessments performed by nurses at the end of the work shift at nine inpatient services of a teaching hospital in the Brazilian Southeast. The data were collected between February and September/2014 with interruptions. Cronbach's alpha and Spearman's correlation coefficients were calculated, as well as the intraclass correlation and the weighted kappa index (inter-rater reliability. Exploratory factor analysis was used with principal component extraction and varimax rotation (construct validity. Results: the internal consistency revealed an alpha coefficient of 0.85, item-item correlation ranging between 0.13 and 0.61 and item-total correlation between 0.43 and 0.69. Inter-rater equivalence was obtained and all items evidenced significant factor loadings. Conclusion: this research evidenced the reliability and construct validity of the scale to assess the nursing care product. Its application in nursing practice permits identifying improvements needed in the production process, contributing to management and care decisions.
Sanders, James L; Williams, Robert J
Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Pedersen, Claus Bang; McHorney, Colleen A; Larsen, Lotte Seiding; Lophaven, Katja Wendicke; Moeller, Anders Holmen; Reaney, Matthew
The single-item Psoriasis Itch VAS was developed to measure itch intensity within the last 24 hours in psoriasis vulgaris to assess treatment benefit. Its psychometric properties were explored in two trials. Data from two randomized, parallel-group phase 3 trials with subjects suffering from psoriasis vulgaris on the body (n = 426, 463) were analyzed. Cross-sectional distributional properties and construct validity of the Psoriasis Itch VAS as well as longitudinal test-retest reliability and sensitivity to change of the Psoriasis Itch VAS were investigated. All statistical tests were two-tailed. Across both trials, acceptable distributional properties were observed. Convergent-validity correlations between the Psoriasis Itch VAS and other patient-reported and clinician-reported outcomes provided strong endorsement for construct validity as did tests of known-groups validity. Longitudinal measurement properties, involving test-retest reliability and sensitivity to change, also offered evidence for the measurement integrity of the Psoriasis Itch VAS. Results from the assessment of validity, reliability, and sensitivity to change support the use of the Psoriasis Itch VAS to measure itch intensity in psoriasis vulgaris. Data from two trials provided evidence that the Psoriasis Itch VAS is well-defined and reliable for measuring itch in psoriasis vulgaris to assess treatment benefit (i.e. therapeutic response).
García García, J A; Landa Petralanda, V; Trigueros Manzano, M C; Gaminde Inda, I
To adapt into Spanish the Texas Revised Inventory of Grief (TRIG) of Faschinbaguer et al (1977, 1981) and to examine its reliability and validity. Two stages: a) cross-culture adaptation of a questionnaire, and b) cross-sectional study of reliability and validity. Primary care teams in Vizcaya, north of Spain. 118 people who had been widowed between 3 months and 3 years before the study. Cross-culture adaptation: the TRIG was translated from American English, then back-translated and finally compared with the version in American Spanish by Grabowski & Frantz (1993). Reliability: the Cronbach's alpha coefficients (internal consistency) of the 2 TRIG scales were 0.75 and 0.86. Factorial validity: all items except one threw saturation >0.40 on the 2 factors extracted from the varimax rotation. Validity by hypothesis: the TRIG scales showed differences (PGrief Experiences of Sanders et al (1985) and García García et al (2001). In the evaluation of grief due to the death of the spouse, the TRIG adapted to Spanish is as reliable and valid as the original version.
Hammar, Isabelle Ottenvall; Ekelund, Christina; Wilhelmson, Katarina; Eklund, Kajsa
In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA) assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA- Older persons (IPA-O), showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation. PMID:26973949
Miles, Scott B.
"What makes for a good model?" In various forms, this question is a question that, undoubtedly, many people, businesses, and institutions ponder with regards to their particular domain of modeling. One particular domain that is wrestling with this question is the multidisciplinary field of environmental modeling. Examples of environmental models range from models of contaminated ground water flow to the economic impact of natural disasters, such as earthquakes. One of the distinguishing claims of the field is the relevancy of environmental modeling to policy and environment-related decision-making in general. A pervasive view by both scientists and decision-makers is that a "good" model is one that is an accurate predictor. Thus, determining whether a model is "accurate" or "correct" is done by comparing model output to empirical observations. The expected outcome of this process, usually referred to as "validation" or "ground truthing," is a stamp on the model in question of "valid" or "not valid" that serves to indicate whether or not the model will be reliable before it is put into service in a decision-making context. In this paper, I begin by elaborating on the prevailing view of model validation and why this view must change. Drawing from concepts coming out of the studies of science and technology, I go on to propose a contextual view of validity that can overcome the problems associated with "ground truthing" models as an indicator of model goodness. The problem of how we talk about and determine model validity has much to do about how we perceive the utility of environmental models. In the remainder of the paper, I argue that we should adopt ideas of pragmatism in judging what makes for a good model and, in turn, developing good models. From such a perspective of model goodness, good environmental models should facilitate communication, convey—not bury or "eliminate"—uncertainties, and, thus, afford the active building of consensus decisions, instead
Jørgensen, René; Ris Hansen, Inge; Falla, Deborah
-retest reliability in people with and without chronic neck pain. Moreover, construct and between-group discriminative validity of the tests were examined. METHODS: Twenty-one participants with chronic neck pain and 21 asymptomatic participants were included. Intra- and inter-reliability were evaluated for the Cranio......-Cervical Flexion Test (CCFT), Range of Movement (ROM), Joint Position Error (JPE), Gaze Stability (GS), Smooth Pursuit Neck Torsion Test (SPNTT), and neuromuscular control of the Deep Cervical Extensors (DCE). Test-retest reliability was assessed for Postural Control (SWAY) and Pressure Pain Threshold (PPT) over...... neck pain....
Oosterkamp, B.C.M.; van der Meer, W.J.; Rutenfrans, M.; Dijkstra, P.U.
Objective: To assess the reliability and validity of measurements performed on three-dimensional virtual models of neonatal bilateral cleft lip and palate patients, compared with measurements performed on plaster cast models. Materials and Methods: Ten high-quality plaster cast models of bilateral
The main purpose of this document is to review and outline the current state-of-the-art of the Human Reliability Assessment (HRA) used for quantitative assessment of nuclear power plants safe and economical operation. Another objective is to consider Human Performance Indicators (HPI) which can alert plant manager and regulator to departures from states of normal and acceptable operation. These two objectives are met in the three sections of this report. The first objective has been divided into two areas, based on the location of the human actions being considered. That is, the modelling and data collection associated with control room actions are addressed first in chapter 1 while actions outside the control room (including maintenance) are addressed in chapter 2. Both chapters 1 and 2 present a brief outline of the current status of HRA for these areas, and major outstanding issues. Chapter 3 discusses HPI. Such performance indicators can signal, at various levels, changes in factors which influence human performance. The final section of this report consists of papers presented by the participants of the Technical Committee Meeting. A separate abstract was prepared for each of these papers. Refs, figs and tabs
A probabilistic methodology for safety system technical specification evaluation was developed. The method for Surveillance Test Interval (S.T.I.) evaluation basically means an optimization of S.T.I. of most important system's periodically tested components. For Allowed Outage Time (A.O.T.) calculations, the method uses system reliability time-dependent models (A computer code called FRANTIC III). A new approximation, which was called Independent Minimal Cut Sets (A.C.I.), to compute system unavailability was also developed. This approximation is better than Rare Event Approximation (A.E.R.) and the extra computing cost is neglectible. A.C.I. was joined to FRANTIC III to replace A.E.R. on future applications. The case study evaluations verified that this methodology provides a useful probabilistic assessment of surveillance test intervals and allowed outage times for many plant components. The studied system is a typical configuration of nuclear power plant safety systems (two of three logic). Because of the good results, these procedures will be used by the Argentine nuclear regulatory authorities in evaluation of technical specification of Atucha I and Embalse nuclear power plant safety systems. (Author) [es
Full Text Available BACKGROUND: Modeling plays a critical role in mitigating impacts of seasonal influenza epidemics. Complex simulation models are currently at the forefront of evaluating optimal mitigation strategies at multiple scales and levels of organization. Given their evaluative role, these models remain limited in their ability to predict and forecast future epidemics leading some researchers and public-health practitioners to question their usefulness. The objective of this study is to evaluate the predictive ability of an existing complex simulation model of influenza spread. METHODS AND FINDINGS: We used extensive data on past epidemics to demonstrate the process of predictive validation. This involved generalizing an individual-based model for influenza spread and fitting it to laboratory-confirmed influenza infection data from a single observed epidemic (1998-1999. Next, we used the fitted model and modified two of its parameters based on data on real-world perturbations (vaccination coverage by age group and strain type. Simulating epidemics under these changes allowed us to estimate the deviation/error between the expected epidemic curve under perturbation and observed epidemics taking place from 1999 to 2006. Our model was able to forecast absolute intensity and epidemic peak week several weeks earlier with reasonable reliability and depended on the method of forecasting-static or dynamic. CONCLUSIONS: Good predictive ability of influenza epidemics is critical for implementing mitigation strategies in an effective and timely manner. Through the process of predictive validation applied to a current complex simulation model of influenza spread, we provided users of the model (e.g. public-health officials and policy-makers with quantitative metrics and practical recommendations on mitigating impacts of seasonal influenza epidemics. This methodology may be applied to other models of communicable infectious diseases to test and potentially improve
Hyder, Ayaz; Buckeridge, David L.; Leung, Brian
Background Modeling plays a critical role in mitigating impacts of seasonal influenza epidemics. Complex simulation models are currently at the forefront of evaluating optimal mitigation strategies at multiple scales and levels of organization. Given their evaluative role, these models remain limited in their ability to predict and forecast future epidemics leading some researchers and public-health practitioners to question their usefulness. The objective of this study is to evaluate the predictive ability of an existing complex simulation model of influenza spread. Methods and Findings We used extensive data on past epidemics to demonstrate the process of predictive validation. This involved generalizing an individual-based model for influenza spread and fitting it to laboratory-confirmed influenza infection data from a single observed epidemic (1998–1999). Next, we used the fitted model and modified two of its parameters based on data on real-world perturbations (vaccination coverage by age group and strain type). Simulating epidemics under these changes allowed us to estimate the deviation/error between the expected epidemic curve under perturbation and observed epidemics taking place from 1999 to 2006. Our model was able to forecast absolute intensity and epidemic peak week several weeks earlier with reasonable reliability and depended on the method of forecasting-static or dynamic. Conclusions Good predictive ability of influenza epidemics is critical for implementing mitigation strategies in an effective and timely manner. Through the process of predictive validation applied to a current complex simulation model of influenza spread, we provided users of the model (e.g. public-health officials and policy-makers) with quantitative metrics and practical recommendations on mitigating impacts of seasonal influenza epidemics. This methodology may be applied to other models of communicable infectious diseases to test and potentially improve their predictive
Celiker, Reyhan; Altan, Lale; Rezvani, Aylin; Aktas, Ilknur; Tastekin, Nurettin; Dursun, Erbil; Dursun, Nigar; Sar?kaya, Selda; Ozdolap, Senay; Akgun, Kenan; Zateri, Coskun; Birtane, Murat
[Purpose] An easy-to-use, psychometrically validated screening tool for fibromyalgia is needed. This study aims to evaluate the reliability and validity of the Turkish version of the Fibromyalgia Rapid Screening Tool by correlating it with 2013 American College of Rheumatology alternative diagnostic criteria and the Hospital Anxiety and Depression Scale. [Subjects and Methods] Subjects were 269 Physical Medicine and Rehabilitation clinic outpatients. Patients completed a questionnaire includi...
Nualnong Wongtongkam; Paul Russell Ward; Andrew Day; Anthony Harold Winefield
In Thailand physical violence among male adolescents is considered a significant public health issue, although there has been little published research into the aetiology and functions of violence in Thai youth. Research in this area has been hampered by a lack of psychometrically sound tools that have been validated to assess problem behaviours in Asian youth. The purpose of this paper is to provide validity and reliability data on an instrument to measure violence in Thai youth. In this stu...
Leu, Shang-Herng; Chou, Jen-Yu; Lee, Pei-Chin; Cheng, Hsiu-Chu; Shao, Wen-Chuan; Hsien, Wan-Lin; Huang, Chieh-Liang; Chen, Vincent Chin-Hung
This study aims to evaluate the validity and reliability of the Chinese version of the Sheehan Disability Scale (SDS-C) among depression patients in Taiwan. To validate the SDS-C, two samples were selected: Sample 1 (n = 78) consisted of patients diagnosed with Depressive Disorder, while sample 2 (n = 63) consisted of a nondepressive population. Sample 1 was evaluated with SDS twice with 7-14 days interval to measure test-retest reliability. Both samples were also evaluated with The Centre for Epidemiologic Studies Depression Scale (CESD), Family APGAR (adaptation, partnership, growth, affection, resolve) score, the Global Assessment of Functioning (GAF), and brief questionnaire of the World Health Organization Quality of Life instrument (WHOQOL-BREF) and by a psychiatrist. The two samples were combined to measure validity. In concurrent validity, SDS-C scores were significantly correlated with the total score of GAF. The discriminative validity was carried out by comparing the score of SDS-C between subjects with and without current depressive disorder (t = 12.32, df = 139, P 11 (sensitivity: 82.1, specificity: 93.7). The area under the curve was 0.93 (±0.02). For the factor validity, principal components analysis showed that a single factor solution was the best solution accounting for 88.9% of the variance. Regarding the construct validity, SDS-C scores were significantly correlated with CESD, WHOQOL-BREF, and the Family APGAR score. For test-retest reliability, the intra-class correlation coefficient for the total SDS-C score was 0.89 (95% confidence interval 0.82-0.93) and internal consistency Cronbach's alpha was 0.94. The SDS-C was found to be a valid and reliable instrument for current depressive disorder patients in Taiwan. © 2015 Wiley Publishing Asia Pty Ltd.
Cano-Cappellacci, Marcelo; Leyton, Fernanda Aleitte; Carre?o, Joshua Dur?n
ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2) for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence ...
DUNN, JAMES A.
A SERIES OF STUDIES SOUGHT TO ASCERTAIN (1) INTER- AND INTRA-RATER RELIABILITY FOR THE HARRIS-GOODENOUGH DRAW-A-MAN TEST, (2) DRAW-A-MAN (DAM) VALIDITY, AND DRAW-A-WOMAN (DAW) VALIDITY. THE DRAWINGS OF 72 STUDENTS IN AN UPPER MIDDLE CLASS WERE SCORED INDEPENDENTLY BY TWO SELF-TAUGHT SCORERS, ONE OF WHOM ALSO RESCORED THE TEST A WEEK LATER. THE…
Emmanuel, Andy; Clow, Sheila E.
Background Validating a questionnaire/instrument (whether developed or adapted) before proceeding to the field for data collection is important. This article presents the modification of an Irish questionnaire for a Nigerian setting. The validation process and reliability testing of this questionnaire (which was used in assessing previous breastfeeding practices and breastfeeding intentions of pregnant women in English and Hausa languages) were also presented. Method Five experts in the field...
Pazira, Parvin; Rostami Haji-Abadi, Mahdi; Zolaktaf, Vahid; Sabahi, Mohammadfarzan; Pazira, Toomaj
In relation to statistical analysis, studies to determine the validity, reliability, objectivity and precision of new measuring devices are usually incomplete, due in part to using only correlation coefficient and ignoring the data dispersion. The aim of this study was to demonstrate the best way to determine the validity, reliability, objectivity and accuracy of an electro-inclinometer or other measuring devices. Another purpose of this study is to answer the question of whether reliability and objectivity represent accuracy of measuring devices. The validity of an electro-inclinometer was examined by mechanical and geometric methods. The objectivity and reliability of the device was assessed by calculating Cronbach's alpha for repeated measurements by three raters and by measurements on the same person by mechanical goniometer and the electro-inclinometer. Measurements were performed on "hip flexion with the extended knee" and "shoulder abduction with the extended elbow." The raters measured every angle three times within an interval of two hours. The three-way ANOVA was used to determine accuracy. The results of mechanical and geometric analysis showed that validity of the electro-inclinometer was 1.00 and level of error was less than one degree. Objectivity and reliability of electro-inclinometer was 0.999, while objectivity of mechanical goniometer was in the range of 0.802 to 0.966 and the reliability was 0.760 to 0.961. For hip flexion, the difference between raters in joints angle measurement by electro-inclinometer and mechanical goniometer was 1.74 and 16.33 degree (Preliability are acceptable, the results showed that measurement error was very high in the mechanical goniometer. Therefore, it can be concluded that objectivity and reliability alone cannot determine the accuracy of a device and it is preferable to use other statistical methods to compare and evaluate the accuracy of these two devices.
Murray, Nicholas; Salvatore, Anthony; Powell, Douglas; Reed-Jones, Rebecca
An estimated 300 000 sport-related concussion injuries occur in the United States annually. Approximately 30% of individuals with concussions experience balance disturbances. Common methods of balance assessment include the Clinical Test of Sensory Organization and Balance (CTSIB), the Sensory Organization Test (SOT), the Balance Error Scoring System (BESS), and the Romberg test; however, the National Collegiate Athletic Association recommended the Wii Fit as an alternative measure of balance in athletes with a concussion. A central concern regarding the implementation of the Wii Fit is whether it is reliable and valid for measuring balance disturbance in athletes with concussion. To examine the reliability and validity evidence for the CTSIB, SOT, BESS, Romberg test, and Wii Fit for detecting balance disturbance in athletes with a concussion. Literature considered for review included publications with reliability and validity data for the assessments of balance (CTSIB, SOT, BESS, Romberg test, and Wii Fit) from PubMed, PsycINFO, and CINAHL. We identified 63 relevant articles for consideration in the review. Of the 63 articles, 28 were considered appropriate for inclusion and 35 were excluded. No current reliability or validity information supports the use of the CTSIB, SOT, Romberg test, or Wii Fit for balance assessment in athletes with a concussion. The BESS demonstrated moderate to high reliability (interclass correlation coefficient = 0.87) and low to moderate validity (sensitivity = 34%, specificity = 87%). However, the Romberg test and Wii Fit have been shown to be reliable tools in the assessment of balance in Parkinson patients. The BESS can evaluate balance problems after a concussion. However, it lacks the ability to detect balance problems after the third day of recovery. Further investigation is needed to establish the use of the CTSIB, SOT, Romberg test, and Wii Fit for assessing balance in athletes with concussions.
Martinez, Bruna Reclusa; Staboli, Isabela Maschk; Kamonseki, Danilo Harudy; Budiman-Mak, Elly; Yi, Liu Chiao
To evaluate the validity and reliability of the Foot Function Index (FFI) in its Brazilian Portuguese version. The validity and reliability of the FFI were tested in 50 volunteers, with plantar fasciitis, metatarsalgia and chronic ankle sprain. The FFI validity process used the Short Form-36 (SF-36) and Foot and Ankle Outcome Score (FAOS) questionnaires. The correlation between FFI, SF-36 and FAOS was done using the Pearson's linear coefficient. The inter and intra-evaluator reliability was ascertained by means of the intraclass correlation coefficient (ICC) and the internal consistency by means of Cronbach's alpha coefficient. The scores were used to assess the standard error measurement (SEM), minimal detectable change (MDC) and ceiling floor and effects. The validity process showed that there were correlations between FFI and the "pain" and "social aspects" subscales of SF-36 and all subscales of FAOS, except for "other symptoms". The Brazilian-Portuguese version of FFI showed excellent intra and interevaluator correlations, with an ICC range of 0.99-0.97 and score reliability that was considered highly satisfactory, with Cronbach's alpha range of 0.80-0.61. The SEMs for inter and intra-evaluator reliability were 1.32 and 1.08, respectively. The MDC was 2.42 (90 % confidence interval). No ceiling and floor effect were detected. The Brazilian-Portuguese version of the FFI questionnaire was found to be a valid and reliable instrument for foot function evaluation, and can be used both in scientific settings and in clinical practice.
.../validate new diagnostic technologies and their associated claims as well as factors that affect the quality...] Food and Drug Administration/American Glaucoma Society Workshop on the Validity, Reliability, and... entitled ``FDA/American Glaucoma Society (AGS) Workshop on the Validity, Reliability, and Usability of...
Bahram Saleh Sedghpour
Full Text Available Objectives: The aim of the present study was to reconstruct determining validity, and score The Stanford Diagnostic Reading Test fourth edition (SDRT4 in the sixth grade students. Methods: The population of the study was all sixth grades of the 19 educational districts from Tehran, 571 students (255 boys and 316 girls were selected by using a random multi-cluster sampling. The data were analyzed. The techniques were item analysis (difficulty index, discriminative index, and loop techniques. Validity, translation validity, content validity, and construct validity (factorial analysis, and reliability (Kuder-Richardson. Results: The exploratory factor analysis determined five factors: declarative knowledge, inferential knowledge, procedural knowledge and visualization knowledge. The reliability of the Stanford diagnostic Reading Test’s subtests by computing the Kuder-Richardson coefficient were 0.778, 0.732 and 0.748 for comprehension subtest, vocabulary subtest and scanning subtest in order. Discussion: By considering the results of present study, SDRT4 has good reliability and validity and can appropriately diagnose the reading disabled students in the sixth grade.
Full Text Available This study aimed to translate MIDAS questionnaire from English into Persian and determine its content validity and reliability. MIDAS was translated and validated on a sample (N = 110 of Iranian adult population. The participants were both male and female with the age range of 17-57. They were at different educational levels and from different ethnic groups in Iran. A translating team, consisting of five members, bilingual in English and Persian and familiar with multiple intelligences (MI theory and practice, were involved in translating and determining content validity, which included the processes of forward translation, back-translation, review, final proof-reading, and testing. The statistical analyses of inter-scale correlation were performed using the Cronbach's alpha coefficient. In an intra-class correlation, the Cronbach's alpha was high for all of the questions. Translation and content validity of MIDAS questionnaire was completed by a proper process leading to high reliability and validity. The results suggest that Persian MIDAS (P-MIDAS could serve as a valid and reliable instrument for measuring Iranian adults MIs.
Yamato, Tie Parma; Maher, Chris; Koes, Bart; Moseley, Anne
The Physiotherapy Evidence Database (PEDro) scale has been widely used to investigate methodological quality in physiotherapy randomized controlled trials; however, its validity has not been tested for pharmaceutical trials. The aim of this study was to investigate the validity and interrater reliability of the PEDro scale for pharmaceutical trials. The reliability was also examined for the Cochrane Back and Neck (CBN) Group risk of bias tool. This is a secondary analysis of data from a previous study. We considered randomized placebo controlled trials evaluating any pain medication for chronic spinal pain or osteoarthritis. Convergent validity was evaluated by correlating the PEDro score with the summary score of the CBN risk of bias tool. The construct validity was tested using a linear regression analysis to determine the degree to which the total PEDro score is associated with treatment effect sizes, journal impact factor, and the summary score for the CBN risk of bias tool. The interrater reliability was estimated using the Prevalence and Bias Adjusted Kappa coefficient and 95% confidence interval (CI) for the PEDro scale and CBN risk of bias tool. Fifty-three trials were included, with 91 treatment effect sizes included in the analyses. The correlation between PEDro scale and CBN risk of bias tool was 0.83 (95% CI 0.76-0.88) after adjusting for reliability, indicating strong convergence. The PEDro score was inversely associated with effect sizes, significantly associated with the summary score for the CBN risk of bias tool, and not associated with the journal impact factor. The interrater reliability for each item of the PEDro scale and CBN risk of bias tool was at least substantial for most items (>0.60). The intraclass correlation coefficient for the PEDro score was 0.80 (95% CI 0.68-0.88), and for the CBN, risk of bias tool was 0.81 (95% CI 0.69-0.88). There was evidence for the convergent and construct validity for the PEDro scale when used to evaluate
Park, Jinse; Koh, Seong-Beom; Kim, Hee Jin; Oh, Eungseok; Kim, Joong-Seok; Yun, Ji Young; Kwon, Do-Young; Kim, Younsoo; Kim, Ji Seon; Kwon, Kyum-Yil; Park, Jeong-Ho; Youn, Jinyoung; Jang, Wooyoung
Postural instability and gait disturbance are the cardinal symptoms associated with falling among patients with Parkinson's disease (PD). The Tinetti mobility test (TMT) is a well-established measurement tool used to predict falls among elderly people. However, the TMT has not been established or widely used among PD patients in Korea. The purpose of this study was to evaluate the reliability and validity of the Korean version of the TMT for PD patients. Twenty-four patients diagnosed with PD were enrolled in this study. For the interrater reliability test, thirteen clinicians scored the TMT after watching a video clip. We also used the test-retest method to determine intrarater reliability. For concurrent validation, the unified Parkinson's disease rating scale, Hoehn and Yahr staging, Berg Balance Scale, Timed-Up and Go test, 10-m walk test, and gait analysis by three-dimensional motion capture were also used. We analyzed receiver operating characteristic curve to predict falling. The interrater reliability and intrarater reliability of the Korean Tinetti balance scale were 0.97 and 0.98, respectively. The interrater reliability and intra-rater reliability of the Korean Tinetti gait scale were 0.94 and 0.96, respectively. The Korean TMT scores were significantly correlated with the other clinical scales and three-dimensional motion capture. The cutoff values for predicting falling were 14 points (balance subscale) and 10 points (gait subscale). We found that the Korean version of the TMT showed excellent validity and reliability for gait and balance and had high sensitivity and specificity for predicting falls among patients with PD.
DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M
The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no
Adams, Emma J; Goad, Mary; Sahlqvist, Shannon; Bull, Fiona C; Cooper, Ashley R; Ogilvie, David
No current validated survey instrument allows a comprehensive assessment of both physical activity and travel behaviours for use in interdisciplinary research on walking and cycling. This study reports on the test-retest reliability and validity of physical activity measures in the transport and physical activity questionnaire (TPAQ). The TPAQ assesses time spent in different domains of physical activity and using different modes of transport for five journey purposes. Test-retest reliability of eight physical activity summary variables was assessed using intra-class correlation coefficients (ICC) and Kappa scores for continuous and categorical variables respectively. In a separate study, the validity of three survey-reported physical activity summary variables was assessed by computing Spearman correlation coefficients using accelerometer-derived reference measures. The Bland-Altman technique was used to determine the absolute validity of survey-reported time spent in moderate-to-vigorous physical activity (MVPA). In the reliability study, ICC for time spent in different domains of physical activity ranged from fair to substantial for walking for transport (ICC = 0.59), cycling for transport (ICC = 0.61), walking for recreation (ICC = 0.48), cycling for recreation (ICC = 0.35), moderate leisure-time physical activity (ICC = 0.47), vigorous leisure-time physical activity (ICC = 0.63), and total physical activity (ICC = 0.56). The proportion of participants estimated to meet physical activity guidelines showed acceptable reliability (k = 0.60). In the validity study, comparison of survey-reported and accelerometer-derived time spent in physical activity showed strong agreement for vigorous physical activity (r = 0.72, ptravel behaviours and may be suitable for wider use. Its physical activity summary measures have comparable reliability and validity to those of similar existing questionnaires.
Full Text Available Background: The Endometriosis Health Profile-30 (EHP-30 is a disease-specific questionnaire to measure the health-related quality of life in patients with endometriosis. The aim of this study was to evaluate the validity and reliability of the Persian version of Endometriosis Health Profile (EHP-30 in women with endometriosis referring to three Gynecology Clinics in Tehran, Iran. Methods: One hundred women (20 to 50 years old with surgically confirmed endometriosis recruited from three outpatient Gynecology Clinics affiliated to the Iran University of Medical Sciences. All 100 patients were asked to complete EHP-30 questionnaire while referring to the Clinics. The findings were analyzed using descriptive statistics, internal reliability consistency, construct validity (using short form-36, which had already been validated in Iran, factor analysis (with principle component analysis method, and item total correlation to assess the validity and reliability of the questionnaire. Results: The internal consistency reliability of the questionnaire was high (Cronbach’s α ranged between 0.80 and 0.93 for core, and 0.78 and 0.90 for modular parts. All items were loaded on their own factors except item 17 (feeling aggressive or violent and item 18 (feeling unwell, which were loaded on pain and social support domains, respectively. Construct validity of EHP-30, established by using SF-36, indicates good correlations in several similar scales of these two questionnaires. Conclusion: The findings of the study demonstrate that Persian version of EHP-30 is a valid and reliable measure to assess the quality of life in women with endometriosis
Boerebach, Benjamin C M; Arah, Onyebuchi A; Busch, Olivier R C; Lombarts, Kiki M J M H
In surgical education, there is a need for educational performance evaluation tools that yield reliable and valid data. This paper describes the development and validation of robust evaluation tools that provide surgeons with insight into their clinical teaching performance. We investigated (1) the reliability and validity of 2 tools for evaluating the teaching performance of attending surgeons in residency training programs, and (2) whether surgeons' self evaluation correlated with the residents' evaluation of those surgeons. We surveyed 343 surgeons and 320 residents as part of a multicenter prospective cohort study of faculty teaching performance in residency training programs. The reliability and validity of the SETQ (System for Evaluation Teaching Qualities) tools were studied using standard psychometric techniques. We then estimated the correlations between residents' and surgeons' evaluations. The response rate was 87% among surgeons and 84% among residents, yielding 2625 residents' evaluations and 302 self evaluations. The SETQ tools yielded reliable and valid data on 5 domains of surgical teaching performance, namely, learning climate, professional attitude towards residents, communication of goals, evaluation of residents, and feedback. The correlations between surgeons' self and residents' evaluations were low, with coefficients ranging from 0.03 for evaluation of residents to 0.18 for communication of goals. The SETQ tools for the evaluation of surgeons' teaching performance appear to yield reliable and valid data. The lack of strong correlations between surgeons' self and residents' evaluations suggest the need for using external feedback sources in informed self evaluation of surgeons. Copyright © 2012 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Emmanuel, Andy; Clow, Sheila E
Validating a questionnaire/instrument (whether developed or adapted) before proceeding to the field for data collection is important. This article presents the modification of an Irish questionnaire for a Nigerian setting. The validation process and reliability testing of this questionnaire (which was used in assessing previous breastfeeding practices and breastfeeding intentions of pregnant women in English and Hausa languages) were also presented. Five experts in the field of breastfeeding and infant feeding voluntarily and independently evaluated the instrument. The experts evaluated the various items of the questionnaire based on relevance, clarity, simplicity and ambiguity on a Likert scale of 4. The analysis was performed to determine the content validity index (CVI).Two language experts performed the translation and back-translation. Ten pregnant women completed questionnaires which were evaluated for internal consistency. Two other pregnant women completed the questionnaire twice at an interval of two weeks to test the reliability. SPSS version 21 was used to calculate the coefficient of reliability. The content validity index was high (0.94 for relevance, clarity and ambiguity and 0.96 for simplicity). The analysis suggested that four of the seventy one items should be removed. Cronbach's Alpha was 0.81, while the reliability coefficient was 0.76. The emerged validated questionnaire was translated from English to Hausa, then, back-translated into English and compared for accuracy. The final instrument is reliable and valid for data collection on breastfeeding in Nigeria among English and Hausa speakers. Therefore, the instrument is recommended for use in assessing breastfeeding intention and practices in Nigeria.
Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael
Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
Gill Dawn P
Full Text Available Abstract Background Single-item physical activity questions provide a quick approximation of physical activity levels. While recall questionnaires provide a more detailed picture of an individual's level of physical activity, single-item questions may be more appropriate in certain situations. The aim of this study was to evaluate two single-item physical activity questions (one absolute question and one relative question for test-retest reliability, convergent validity, and discriminant validity, in a sample of older adults. Methods Data was obtained from the Project to Prevent Falls in Veterans, a fall risk-factor screening and modification trial. One question measured absolute physical activity (seldom, moderately, vigorously active and one measured relative physical activity (more, about as, less active than peers. Test-retest reliability was examined using weighted Kappa statistics (κ in a sample of 43 subjects. Validity was assessed using correlation coefficients (r in participants who received clinical assessments (n = 159. Results The absolute physical activity question was more reliable than the relative physical activity question (κ = 0.75 vs. κ = 0.56. Convergent validity, however, was stronger for the relative physical activity question (r = 0.28 to 0.57 vs. r = 0.10 to 0.33. Discriminant validity was similar for both questions. For the relative physical activity question, there was moderate agreement when this question was re-administered seven days later, fair to moderate/good associations when compared with indicators of physical function, and little to no associations when compared with measures hypothesized to be theoretically not related to physical activity. Conclusions The relative physical activity question had the best combination of test-retest reliability, convergent validity and discriminant validity. In studies requiring a measure of physical activity, where physical activity is not the primary focus and more
Dong, Lijuan; Liu, Na; Tian, Xiaoyu; Qiao, Xiaoxia; Gobbens, Robbert J J; Kane, Robert L; Wang, Cuili
To translate the Tilburg Frailty Indicator (TFI) into Chinese and assess its reliability and validity. A sample of 917 community-dwelling older people, aged ≥60 years, in a Chinese city was included between August 2015 and March 2016. Construct validity was assessed using alternative measures corresponding to the TFI items, including self-rated health status (SRH), unintentional weight loss, walking speed, timed-up-and-go tests (TUGT), making telephone calls, grip strength, exhaustion, Short Portable Mental Status Questionnaire (SPMSQ), Geriatric Depression scale (GDS-15), emotional role, Adaptability Partnership Growth Affection and Resolve scale (APGAR) and Social Support Rating Scale (SSRS). Fried's phenotype and frailty index were measured to evaluate criterion validity. Adverse health outcomes (ADL and IADL disability, healthcare utilization, GDS-15, SSRS) were used to assess predictive (concurrent) validity. The internal consistency reliability was good (Cronbach's α=0.71). The test-retest reliability was strong (r=0.88). Kappa coefficients showed agreements between the TFI items and corresponding alternative measures. Alternative measures correlated as expected with the three domains of TFI, with an exclusion that alternative psychological measures had similar correlations with psychological and physical domains of the TFI. The Chinese TFI had excellent criterion validity with the AUCs regarding physical phenotype and frailty index of 0.87 and 0.86, respectively. The predictive (concurrent) validities of the adverse health outcomes and healthcare utilization were acceptable (AUCs: 0.65-0.83). The Chinese TFI has good validity and reliability as an integral instrument to measure frailty of older people living in the community in China. Copyright © 2017 Elsevier B.V. All rights reserved.
Lull, James T.; Cappella, Joseph N.
This paper introduces a new measure of attitude, the "attitude pie," which attempts to separate true neutrality from degree of indifference while providing an evaluation score on the issue being judged. A large reliability-validity study involving ten attitude traits, four different methods, and over one hundred subjects at three different times…
Feroz, Abu H.M.; Islam, Nazrul; ten Klooster, Peter M.; Hasan, Mahmud; Rasker, Johannes J.; Haq, Syed A.
Objective To develop a culturally adapted Bengali version of the Short Form-36 (SF-36) Health Survey and to test its acceptability, reliability, and validity in patients with rheumatoid arthritis (RA). Study Design and Setting The US English SF-36 was translated into Bengali after established
Sayin, Ayfer; Sahin, Mustafa Yasar
The present study aimed to provide a Turkish adaptation of the Organizational Justice in Sport Scale and perform reliability and validity studies. Answers provided by 260 participants who work as football, male basketball and female basketball coaches in National Collegiate Athletic Association (NCAA) were analysed using the original scale that…
Selby-Harrington, Maija L.; And Others
Summarizes the principles of instrument validity and reliability and identifies deviations from these principles in a random sample of 55 research studies published in 1989 in 5 refereed nursing journals targeted toward practicing clinicians. Provides documentation, justification, and suggestions for nursing educators, journal editors, and…
Lerdpornkulrat, Thanita; Poondej, Chanut; Koul, Ravinder
This study aimed to translate the information-seeking behavior scale from English to Thai, and to ascertain the construct reliability and validity of the scale. Data were collected from 664 undergraduate students in Thailand. The descriptive statistics were explored to see the extent to which various information sources are being used by…
Antonio Paulo Nassar Junior
Full Text Available CONTEXT AND OBJECTIVE: Sedation scales are used to guide sedation protocols in intensive care units (ICUs. However, no sedation scale in Portuguese has ever been evaluated. The aim of this study was to evaluate the validity and reliability of Portuguese translations of four sedation-agitation scales, among critically ill patients: Glasgow Coma Score, Ramsay, Richmond Agitation-Sedation Scale (RASS and Sedation-Agitation Scale (SAS. DESIGN AND SETTING: Validation study in two mixed ICUs of a university hospital. METHODS: All scales were applied to 29 patients by four different critical care team members (nurse, physiotherapist, senior critical care physician and critical care resident. We tested each scale for interrater reliability and for validity, by correlations between them. Interrater agreement was measured using weighted kappa (k and correlations used Spearman's test. RESULTS: 136 observations were made on 29 patients. All scales had at least substantial agreement (weighted k 0.68-0.90. RASS (weighted k 0.82-0.87 and SAS (weighted k 0.83-0.90 had the best agreement. All scales had a good and significant correlation with each other. CONCLUSIONS: All scales demonstrated good interrater reliability and were comparable. RASS and SAS showed the best correlations and the best agreement results in all professional categories. All these characteristics make RASS and SAS good scales for use at the bedside, to evaluate sedation-agitation among critically ill patients in terms of validity, reliability and applicability.
Conclusion: Diabetes Self-Efficacy Scale is a valid and reliable instrument for determining the self-efficacy of patients and providing a proper care. It can be suggested to investigate and evaluate the consistency of the scale by applying it to broader sample groups representing different socioeconomic levels.
Pizzola, L.; Martos, Z.; Pfisterer, K.; Groot, de C.P.G.M.; Keller, H.
Mealtime satisfaction is an important component of quality of life (QOL) in residential care, yet there currently is no self-administered tool described in the literature. The purpose of this study is to investigate internal and test–retest reliability, and construct validity of a mealtime
Demir, Numan; Serel Arslan, Selen; İnal, Özgü; Karaduman, A Ayşe
The purpose of this study was to test the reliability and validity of the Turkish Eating Assessment Tool (T-EAT-10) among patients with swallowing disorders. One hundred and five patients completed the T-EAT-10 and Functional Oral Intake Scale (FOIS). The internal consistency, test-retest reliability, and criterion validity of T-EAT-10 were investigated. The internal consistency was assessed using Cronbach's alpha. Intraclass correlation coefficient (ICC) value with 95 % confidence intervals was calculated for test-retest reliability. The criterion validity of the T-EAT-10 was determined by assessing the correlation between T-EAT-10 and FOIS. All the patients in the study completed the T-EAT-10 without assistance. The mean time to complete the instrument was 1.8 ± 0.9 min. The internal consistency of the T-EAT-10 was found to be high with 0.90 Cronbach's alpha for test and 0.91 Cronbach's alpha for retest reproducibility. No difference between the test and retest scores of the T-EAT-10 was found (p = 0.14). A negative, moderate correlation between T-EAT-10 and FOIS was detected (r = -0.365, p EAT-10 is a reliable and valid symptom-specific outcome tool for dysphagia in adult Turkish patients. It can be used in clinical practice and research.
Gutiérrez-Vilahú, Lourdes; Massó-Ortigosa, Núria; Costa-Tutusaus, Lluís; Guerra-Balic, Myriam
Several sophisticated methods of footprint analysis currently exist. However, it is sometimes useful to apply standard measurement methods of recognized evidence with an easy and quick application. We sought to assess the reliability and validity of a new method of footprint assessment in a healthy population using Photoshop CS5 software (Adobe Systems Inc, San Jose, California). Forty-two footprints, corresponding to 21 healthy individuals (11 men with a mean ± SD age of 20.45 ± 2.16 years and 10 women with a mean ± SD age of 20.00 ± 1.70 years) were analyzed. Footprints were recorded in static bipedal standing position using optical podography and digital photography. Three trials for each participant were performed. The Hernández-Corvo, Chippaux-Smirak, and Staheli indices and the Clarke angle were calculated by manual method and by computerized method using Photoshop CS5 software. Test-retest was used to determine reliability. Validity was obtained by intraclass correlation coefficient (ICC). The reliability test for all of the indices showed high values (ICC, 0.98-0.99). Moreover, the validity test clearly showed no difference between techniques (ICC, 0.99-1). The reliability and validity of a method to measure, assess, and record the podometric indices using Photoshop CS5 software has been demonstrated. This provides a quick and accurate tool useful for the digital recording of morphostatic foot study parameters and their control.
Coenen, P.; Formanoy, M.; Douwes, M.; Bosch, T.; Kraker, H. de
Exposure to mechanical vibrations at work (e.g., due to handling powered tools) is a potential occupational risk as it may cause upper extremity complaints. However, reliable and valid assessment methods for vibration exposure at work are lacking. Measuring hand-arm vibration objectively is often
Perkins, Rose J. Merlino
"Women's Mental Health Questionnaire" (W-MHQ) assesses females' adult mental health concerns, and examines their associations with specified father-daughter childhood relationships. Presented are W-MHQ item and scale development, and psychometric findings drawn from factor analyses, reliability assessments, and validation processes. For…
This study evaluated the reliability, validity, and educational usefulness of a comprehensive, multidisciplinary Objective Structured Clinical Examination (OSCE) in dental education. The OSCE was administered to dental students at the Columbia University College of Dental Medicine (CDM) before they entered clinical training. Participants in this…
To facilitate cross-cultural research in the psychology of religion, the reliability and validity of the 7-item short form of the Francis Scale of Attitude toward Christianity was examined among a sample of 453 young people aged between 12-19 years old from standards six, seven, eight, nine and ten attending a secondary ...
Ruschival, M. Lena; Way, John Gilbert
The purpose of the study was to determine whether WPPSI is a valid and reliable instrument for the measurement of the intelligence of children whose scores fall within the upper range as measured by the Stanford-Binet. The results of this study indicated that there was only a moderate relationship between the two. (Author)
Kramp, Kelvin H.; van Det, Marc J.; Veeger, Nic J. G. M.; Pierie, Jean-Pierre E. N.
Background There is no widely used method to evaluate procedure-specific laparoscopic skills. The first aim of this study was to develop a procedure-based assessment method. The second aim was to compare its validity, reliability and feasibility with currently available global rating scales (GRSs).
Leeuwen, R.R.; Tiesinga, L.J.; Middel, L.J.; Post, D.; Jochemsen, H.
Aim. This study contributes to the development of a valid and reliable instrument, the spiritual care competence scale, as an instrument to assess nurses’ competencies in providing spiritual care. Background. Measuring these competencies and their development is important and the construction of a
van Leeuwen, Rene; Tiesinga, Lucas J.; Middel, Berrie; Post, Doeke; Jochemsen, Henk
Aim. This study contributes to the development of a valid and reliable instrument, the spiritual care competence scale, as an instrument to assess nurses' competencies in providing spiritual care. Background. Measuring these competencies and their development is important and the construction of a
Basha, Ertan; Kaya, Mehmet
The purpose of this study is to examine validity and reliability of the Albanian version of the Depression, Anxiety and Stress Scale (DASS), which is developed by Lovibond and Lovibond (1995). The sample of this study is consisted of 555 subjects who were living in Kosovo. The results of confirmatory factor analysis indicated 42 items loaded on…
Determination of students' attitudes towards online cooperative learning is an important issue, which has not been studied adequately. In the literature, there are few scales to measure the attitude towards online cooperative learning for which validity and reliability have been proven. The main purpose of this study is to develop an attitude…
Arseven, Zeynep; Kiliç, Abdurrahman; Sahin, Seyma
In the present study, it is aimed to develop a valid and reliable scale for determining value-eroding behaviors of teachers, hence their values of judgment. The items of the "Value-eroding Teacher Behaviors Scale" were designed in the form of 5-point likert type rating scale. The exploratory factor analysis (EFA) was conducted to…
Kenyon, Dorry; Van Duzer, Carol
This document examines the concepts of validity, reliability, and appropriateness from a language testing perspective as they apply to the following four assessment issues raised by the National Reporting System (NRS): (1) What type of language assessment seems to be required by the NRS: proficiency or achievement?; (2) Knowing that the NRS…
Demaray, Michelle Kilpatrick; Summers, Kelly Hodgson; Jenkins, Lyndsay N.; Becker, Lisa Davidson
The current study further establishes the reliability and validity of the Bullying Participant Behaviors Questionnaire (BPBQ), a self-report survey that allows for an examination of participation in various bullying participant role behaviors including bully, assistant to the bully, victim, defender of the victim, and outsider. The study included…
Harris, Larry P.; Wolf, Steven R.
The article focuses on the controversy over norm-referenced v criterion-referenced measures (CRM) in assessment of learning disorders. The authors contend that while the reliability of CRMs is generally indisputable, the validity of measures designed from local curricula is still dependent on the intuitive judgments of teachers. (Author/SBH)
Fokkema, Tryntsje; Kooiman, Thea J. M.; Krijnen, Wim P.; Van der Schans, Cees P.; De Groot, Martijn
Purpose: To examine the test-retest reliability and validity of ten activity trackers for step counting at three different walking speeds. Methods: Thirty-one healthy participants walked twice on a treadmill for 30 min while wearing 10 activity trackers (Polar Loop, Garmin Vivosmart, Fitbit Charge
Almeida, Kênnea Martins; Dutra, Maria Virginia Peixoto; Mello, Rosane Reis de; Reis, Ana Beatriz Rodrigues; Martins, Priscila Silveira
To verify the concurrent validity and interobserver reliability of the Alberta Infant Motor Scale (AIMS) in premature infants followed-up at the outpatient clinic of Instituto Fernandes Figueira, Fundação Oswaldo Cruz (IFF/Fiocruz), in Rio de Janeiro, Brazil. A total of 88 premature infants were enrolled at the follow-up clinic at IFF/Fiocruz, between February and December of 2006. For the concurrent validity study, 46 infants were assessed at either 6 (n = 26) or 12 (n = 20) months' corrected age using the AIMS and the second edition of the Bayley Scales of Infant Development, by two different observers, and applying Pearson's correlation coefficient to analyze the results. For the reliability study, 42 infants between 0 and 18 months were assessed using the Alberta Infant Motor Scale, by two different observers and the results analyzed using the intraclass correlation coefficient. The concurrent validity study found a high level of correlation between the two scales (r = 0.95) and one that was statistically significant (p < 0.01) for the entire population of infants, with higher values at 12 months (r = 0.89) than at 6 months (r = 0.74). The interobserver reliability study found satisfactory intraclass correlation coefficients at all ages tested, varying from 0.76 to 0.99. The AIMS is a valid and reliable instrument for the evaluation of motor development in high-risk infants within the Brazilian public health system.
Bech, B; Lönn, L; Falkenberg, M
with clinical experience (R = -0.53, P fluoroscopy time and amount of contrast fluid did not correlate. Conclusions The construct validity and reliability of assessment with the SAVE scale was high when applied to performances in a simulation setting with advanced realism. No ceiling effect...
Dilmac, Bulent; Aricak, Osman Tolga; Cesur, Sevim
The purpose of the present study is to examine the initial psychometric properties of the Values Scale for adults. While developing the first stage of the Values Scale, open-ended data on the values held by 216 university students were obtained. During the second stage, the validity and reliability studies of the 60-item Values Scale obtained by…
Wissink, I.B.|info:eu-repo/dai/nl/293037817; Stams, G.J.J.M.; Colonnesi, C.; Asscher, J.J.|info:eu-repo/dai/nl/288661834; Hoeve, M.; Noom, M.J.; Polderman, Nelleke; Kellaert-Knol, Marijke G.
The Attachment Insecurity Screening Inventory (AISI) 2–5 years is a parent-report questionnaire for assessing attachment insecurity in preschoolers. Validity and reliability of the AISI 2–5 years were examined in a general sample (n = 429) and in a clinical sample (n = 71). Confirmatory factor
Wissink, I.B.; Colonnesi, C.; Stams, G.J.J.M.; Hoeve, M.; Asscher, J.J.; Noom, M.J.; Polderman, N.; Kellaert-Knol, M.G.
The Attachment Insecurity Screening Inventory (AISI) 2-5 years is a parent-report questionnaire for assessing attachment insecurity in preschoolers. Validity and reliability of the AISI 2-5 years were examined in a general sample (n = 429) and in a clinical sample (n = 71). Confirmatory factor
Sharkey, Deirdre Jennielle
The purpose of this study was to determine the validity and reliability of the Cultural Awareness and Beliefs Inventory for Higher Education (CABIHE). The CABIHE consists of thirty-nine items, with thirty that measure professors' cultural awareness and beliefs on a Likert-type four-point scale. In addition, this study examined the extent to which…
Validation and reliability of the Chichewa translation of the EQ-5D quality of life questionnaire in adults with orthopaedic injuries in Malawi. Linda Chokotho, Nyengo Mkandawire, Devin Conway, Hao-Hua Wu, David D. Shearer, Geir Hallan, Jan-Erik Gjertsen, Sven Young, Brian C. Lau ...
The purpose of this study is to develop a valid and reliable measurement tool to determine the social media addictions of secondary school, high school and university students. 998 students participated in the study. 476 students from secondary schools, high schools and universities participated in the first application during which the…
Benítez-Porres, Javier; López-Fernández, Iván; Raya, Juan Francisco; Álvarez Carnero, Sabrina; Alvero-Cruz, José Ramón; Álvarez Carnero, Elvis
Background: Physical activity (PA) assessment by questionnaire is a cornerstone in the field of sport epidemiology studies. The Physical Activity Questionnaire for Children (PAQ-C) has been used widely to assess PA in healthy school populations. The aim of this study was to evaluate the reliability and validity of the PAQ-C questionnaire in…
Hooghe, L.; Bakker, R.; Brigevich, A.; de Vries, C.; Edwards, E.; Marks, G.; Rovny, J.; Steenbergen, M.; Vachudova, M.
This research note reports on the 2002 and 2006 Chapel Hill expert surveys (CHES), which measure national party positioning on European integration, ideology, and several European Union (EU) and non-EU policies. The reliability of expert judgments is examined and the CHES data are cross-validated
Strand, Edythe A.; McCauley, Rebecca J.; Weigand, Stephen D.; Stoeckel, Ruth E.; Baas, Becky S.
Purpose: In this article, the authors report reliability and validity evidence for the Dynamic Evaluation of Motor Speech Skill (DEMSS), a new test that uses dynamic assessment to aid in the differential diagnosis of childhood apraxia of speech (CAS). Method: Participants were 81 children between 36 and 79 months of age who were referred to the…
Lagarde, Marloes L J; Kamalski, DMA; Van Den Engel-Hoek, Lenie
Objective: To systematically review the available evidence for the reliability and validity of cervical auscultation in diagnosing the several aspects of dysphagia in adults and children suffering from dysphagia. Data sources: Medline (PubMed), Embase and the Cochrane Library databases. Review
Katz, Noomi; Karpin, Hanah; Lak, Arit; Furman, Tania; Hartman-Maeir, Adina
A study assessed the reliability and validity of the Activity Card Sort (ACS) within different adult groups (n=263): healthy adults, healthy older adults, Alzheimer's caregivers, multiple sclerosis patients, and stroke survivors. Found that the ACS had high internal consistency for daily living and social-cultural activities and a lower…
Bear, George G.; Holst, Bruna; Lisboa, Carolina; Chen, Dandan; Yang, Chunyan; Chen, Fang Fang
This study presents evidence of the validity and reliability of scores for the newly developed Brazilian Portuguese version of the Delaware School Climate Survey-Student (Brazilian DSCS-S). The sample consisted of 378 students, grades 5 through 9, attending four private and three public schools in southern Brazil. Confirmatory factor analyses…
Levpušcek, Melita Puklek; Inglés, Candido J.; Marzo, Juan C.; García-Fernández, Jose M.
The purpose of this study was to examine the reliability and validity of the School Anxiety Inventory (SAI) using a sample of 646 Slovenian adolescents (48% boys), ranging in age from 12 to 19 years. Single confirmatory factor analyses replicated the correlated four-factor structure of scores on the SAI for anxiety-provoking school situations…
Woodburn, Jim; Sutcliffe, Nick
The Objective Structured Clinical Examination (OSCE), initially developed for undergraduate medical education, has been adapted for assessment of clinical skills in podiatry students. A 12-month pilot study found the test had relatively low levels of reliability, high construct and criterion validity, and good stability of performance over time.…
Jennings, Denise; Cormack, Stuart; Coutts, Aaron J; Boyd, Luke; Aughey, Robert J
To assess the validity and reliability of distance data measured by global positioning system (GPS) units sampling at 1 and 5 Hz during movement patterns common to team sports. Twenty elite Australian Football players each wearing two GPS devices (MinimaxX, Catapult, Australia) completed straight line movements (10, 20, 40 m) at various speeds (walk, jog, stride, sprint), changes of direction (COD) courses of two different frequencies (gradual and tight), and a team sport running simulation circuit. Position and speed data were collected by the GPS devices at 1 and 5 Hz. Distance validity was assessed using the standard error of the estimate (±90% confidence intervals [CI]). Reliability was estimated using typical error (TE) ± 90% CI (expressed as coefficient of variation [CV]). Measurement accuracy decreased as speed of locomotion increased in both straight line and the COD courses. Difference between criterion and GPS measured distance ranged from 9.0% to 32.4%. A higher sampling rate improved validity regardless of distance and locomotion in the straight line, COD and simulated running circuit trials. The reliability improved as distance traveled increased but decreased as speed increased. Total distance over the simulated running circuit exhibited the lowest variation (CV 3.6%) while sprinting over 10 m demonstrated the highest (CV 77.2% at 1 Hz). Current GPS systems maybe limited for assessment of short, high speed straight line running and efforts involving change of direction. An increased sample rate improves validity and reliability of GPS devices.
Kerimova, Melek; Gunuc, Selim
The purpose of the present paper was to adapt Gunuc and Kayri's (2010) "Internet Addiction Scale," with show validity and reliability for many various sampling groups, into the Azerbaijani language. Another objective of the study is to determine the prevalence of Internet addiction among Azerbaijani adolescents and youth, which…
van der Meer, Wicher J.; Dijkstra, Pieter U.; Visser, Anita; Vissink, Arjan; Ren, Yijin
Volume changes in facial morphology can be assessed using the 3dMD DSP400 stereo-optical 3-dimensional scanner, which uses visible light and has a short scanning time. Its reliability and validity have not to our knowledge been investigated for the assessment of facial swelling. Our aim therefore
Hauser, Peter C.; Lukomski, Jennifer; Samar, Vince
This study investigated the reliability and validity of the Behavior Rating Inventory of Executive Functions-Adult Form (BRIEF-A) when used with deaf college students. The BRIEF-A was administered to 176 deaf and 184 hearing students of whom 25 deaf students and 56 hearing students self-identified as having an Attention Deficit Hyperactivity…
Tsuchiya, Kenji J.; Matsumoto, Kaori; Yagi, Atsuko; Inada, Naoko; Kuroda, Miho; Inokuchi, Eiko; Koyama, Tomonori; Kamio, Yoko; Tsujii, Masatsugu; Sakai, Saeko; Mohri, Ikuko; Taniike, Masako; Iwanaga, Ryoichiro; Ogasahara, Kei; Miyachi, Taishi; Nakajima, Shunji; Tani, Iori; Ohnishi, Masafumi; Inoue, Masahiko; Nomura, Kazuyo; Hagiwara, Taku; Uchiyama, Tokio; Ichikawa, Hironobu; Kobayashi, Shuji; Miyamoto, Ken; Nakamura, Kazuhiko; Suzuki, Katsuaki; Mori, Norio; Takei, Nori
To examine the inter-rater reliability of Autism Diagnostic Interview-Revised, Japanese Version (ADI-R-JV), the authors recruited 51 individuals aged 3-19 years, interviewed by two independent raters. Subsequently, to assess the discriminant and diagnostic validity of ADI-R-JV, the authors investigated 317 individuals aged 2-19 years, who were…
Gittings, Paul; Salet, Myrthe; Burrows, Sally; Ruettermann, Mike; Wood, Fiona M; Edgar, Dale
Small burns are common and can cause disproportionate levels of disability. The ability to measure muscle impairment and consequent functional disability is a necessity during rehabilitation of patients. This study aimed to determine the reliability and validity of grip and muscle strength dynamometry in patients with unhealed, minor burn wounds. Grip and muscle strength were assessed three times on each side. Assessment occurred at presentation for the initial injury and again every other day (or every 5 days beyond 10 days post injury) until discharge from the service. Reliability was assessed using intraclass correlation. Minimum detectable differences were calculated for each muscle group. Validity was assessed using regression analysis, incorporating appropriate burn severity measures and patient demographics. Thirty patients with TBSA ≤15% were assessed. Both grip and muscle strength demonstrated very good reliability (intraclass correlation coefficient: 0.85-0.96). Minimum detectable differences ranged from 3.8 to 8.0 kg. Validity of both forms of dynamometry was confirmed through associations with gender for all muscle groups (P dynamometry are reliable and valid assessments of strength and are applicable for clinical use in patients who have unhealed, minor burn wounds.
Development of a valid and reliable test for higher-educated young adults measuring dietary fibre food source and health-disease association ... Public health nutrition education accentuates fibre con- sumption to reduce chronic disease ..... who found that South African university students seemed to have above-average ...
Beery, Thomas H.
The purpose of this preliminary study is to establish a reliable and valid measure of environmental connectedness (EC) to allow for further exploration of the Swedish Outdoor Recreation in Change national survey data. The Nordic concept of friluftsliv (nature-based outdoor recreation) and the environmental psychology concept of EC are explored to…
Damen, H.R.; Veerman, J.W.; Vermulst, A.A.; Nieuwhoff, R.; Meyer, R.E. de; Scholte, R.H.J.
In this study, the construct validity and reliability of the Empowerment questionnaire (EMPO) that was developed in Dutch youth care was examined. The 12-item EMPO focuses on measuring parental empowerment in raising their children. The three components of psychological empowerment (intrapersonal,
Ramada, Jose M.; Serra, Consol; Amick, Benjamin C.; Abma, Femke I.; Castano, Juan R.; Pidemunt, Gemma; Bultmann, Ute; Delclos, George L.
Purpose Recently, the cross-cultural adaptation of the Work Role Functioning Questionnaire to Spanish was carried out, achieving satisfactory psychometric properties. Now we examined the reliability and validity of the adapted [Work role functioning questionnaire-Spanish version (WRFQ-SpV)] in a
Ersoy, Mehmet Akif; Varan, Azmi
The aim of this study was to evaluate the reliability and validity of the Turkish version of the Internalized Stigma of Mental Illness Scale (ISMI) in patients with psychiatric disorders. The study included 203 patients diagnosed with various psychiatric disorders in a psychiatry outpatient clinic of a university hospital. The reliability of the scale was assessed by investigation of its internal consistency and split-half reliability. The convergent validity of the scale was demonstrated by the relationship between the Turkish form of the ISMI and various criteria scales. Cronbach's alpha value was 0.93 for the entire scale and ranged between 0.63 and 0.87 for the 5 subscales of the ISMI. In terms of convergent validity, the total score of the Turkish ISMI significantly correlated with the Beck Depression Inventory, Rosenberg Self-Esteem Scale, Sociotropy-Autonomy Scale, Brief Symptom Inventory, Multidimensional Scale of Perceived Social Support, Clinical Global Impression Scale, and Global Assessment of Functioning Scale scores. All values were in the expected direction. In the light of the findings, it was concluded that the Turkish version of ISMI could be used as a reliable and valid tool in assessing internalized stigma of the Turkish psychiatric patients.
Hager, Erin R.; Treuth, Margarita S.; Gormely, Candice; Epps, LaShawna; Snitker, Soren; Black, Maureen M.
Purpose: Ankle accelerometry allows for 24-hr data collection and improves data volume/integrity versus hip accelerometry. Using Actical ankle accelerometry, the purpose of this study was to (a) develop sensitive/specific thresholds, (b) examine validity/reliability, (c) compare new thresholds with those of the manufacturer, and (d) examine…
Christensen, Catie; Haddad, Amanda; Maus, Elizabeth
To evaluate interrater reliability and concurrent validity of the 50-ft walk test (FWT) for children with idiopathic toe walking (ITW). Thirty children, 6 to 13 years old, with ITW participated. During the 50-FWT, an accelerometer counted total steps. A physical therapist counted the number of toe-walking steps. The number of toe-walking steps was divided by the total steps to calculate a toe-walking percentage. Interrater reliability was assessed by correlating the toe-walking percentage obtained by 2 raters using an intraclass correlation coefficient. Concurrent validity was evaluated by correlating the toe-walking percentage calculated by the GAITRite and therapist using a Spearman ρ. There was excellent interrater reliability and concurrent validity. Experience level did not impact the therapist's ability to identify a toe-walking step. The 50-FWT demonstrated excellent interrater reliability and concurrent validity. It can be used to obtain a percentage of toe walking in children 6-13 years of age with ITW.
Erlich, Richard J.; Russ-Eft, Darlene F.
The validity and reliability of three instruments, the "Counselor Rubric for Gauging Student Understanding of Academic Planning," micro-analytic questions, and the "Student Survey for Understanding Academic Planning," all based on social cognitive theory, were tested as means to assess self-efficacy and self-regulated learning in college academic…
Yalçin, Mehmet Tufan; Eres, Figen
The aim of this study is to develop a valid and reliable measurement tool that can determine the instructional capacity, according to teacher opinions. In the academic year of 2016-2017, 1011 teachers working in the public high schools and vocational technical schools in Ankara participated in the study. The total number of items on the scale was…
van Saane, N.; Sluiter, J. K.; Verbeek, J. H. A. M.; Frings-Dresen, M. H. W.
Background Although job satisfaction research has been carried out for decades, no recent overview of job satisfaction instruments and their quality is available. Aim The aim of this systematic review is to select job satisfaction instruments of adequate reliability and validity for use as
Galetzka, Mirjam; Verhoeven, J.W.M.; Pruyn, Adriaan T.H.
The purpose of this research is to add to our understanding of the antecedents of customer satisfaction by examining the effects of service reliability (Is the service “correctly” produced?) and service validity (Is the “correct” service produced?) of search, experience and credence services.
Thompson, Bruce; Cook, Colleen
Research libraries are increasingly supplementing collection counts with perceptions of service quality as indices of status and productivity. The present study was undertaken to explore the reliability and validity of scores from the SERVQUAL measurement protocol (A. Parasuraman and others, 1991), which has previously been used in this type of…
Peterson, Anne C.; And Others
The Self-Image Questionnaire for Young Adolescents (SIQYA), an adaptation of the Offer Self-Image Questionnaire (OSIQ), designed to measure aspects of self-image among young adolescents, was administered to two groups of sixth graders. The development of the SIQYA is described and reliability and validity results are presented. (EGS)
The eButton takes frontal images at 4 second intervals throughout the day. A three-dimensional (3D) manually administered wire mesh procedure has been developed to quantify portion sizes from the two-dimensional (2D) images. This paper reports a test of the interrater reliability and validity of use...
Jeong, Byounglock; Yoo, Eunyoung; Jung, Minye; Kang, Daehyuk; Park, Soyeon; Park, Soo Hyun
Background: The aim of this study was to establish the psychometric properties of the Korean Behaviour Problems Inventory (BPI-01). Methods: Fifty-five allied professionals completed a content validity questionnaire about the Korean BPI-01. To examine reliability, 52 individuals with intellectual disabilities residing in a medium-sized residential…
de Greef, M.H.; Sprenger, S.R.; Elzenga, C.T.; Popkema, D.Y.; Bennekers, J.H.; Niemeijer, M.G.; Middel, B.; Mook, G.A.
This study examined the reliability and validity of a 12-min. walking test for coronary heart disease patients. CHD patients (28 men, 18 women) were recruited out of 86 CHD patients of the Martini Hospital Groningen, The Netherlands. 46 CHD patients (age M=66.0 yr., SD=6.8) participated in the
Lumyongsatien, Jate; Yangsakul, Waralak; Bunnag, Chaweewan; Hopkins, Claire; Tantilipikorn, Pongsakorn
Chronic rhinosinusitis (CRS) is one of common health conditions that affects patients' health-related quality of life. Our purpose is to assess the reliability and validity of Thai-version of Sino-Nasal Outcome Test 22 in chronic rhinosinusitis. Permission for translation of SNOT-22 from English language to Thai language was obtained from the developer. The translation process was done based on the international standard of translation method. A total of 80 subjects were recruited into the study and divided into two groups comprising of 50 patients with chronic rhinosinusitis and 30 healthy volunteers. Cronbach's α and Intraclass correlation coefficient were evaluated for its reliability. Validity test was evaluated against VAS score, SF-36 (Thai version) questionnaire and CT scan (based on Lund-Mackay score). Responsiveness was assessed between pre-operative and post-operative scores in 34 patients. The Thai version of SNOT-22 showed good reliability according to high value of Cronbach's α coefficient ( r = 0.929) and intraclass correlation coefficient ( r = 0.935). It also showed good validity by its ability to differential the patients with chronic rhinosinusitis from normal ( p reliability and validity, suitable for evaluation of chronic rhinosinusitis symptoms together with severity of the disease and response to treatment. Thai clinical trials registry TCTR20170320003. Date of registration 20/03/2017 (retrospectively registered).
Rae, James R; Olson, Kristina R
The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research-lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Feyzioglu, Burak; Demirdag, Baris; Akyildiz, Murat; Altun, Eralp
Science process skills are claimed to enable an individual to improve their own life visions and give a scientific view/literacy as a standard of their understanding about the nature of science. The main purpose of this study was to develop a test for measuring a valid, reliable and practical test for Science Process Skills (SPS) in secondary…
Pekdogan, Serpil; Ulutas, Ilkay
The purpose of this study is to develop a valid and reliable data collection tool to assess the decision-making skills of children at the age of 5 to 6. The study group is composed of 300 children attending independent pre-schools located in the central district of Amasya province and their parents. In the study, four-factor and 29-item…
Naylor, Justine M; Ko, Victoria; Adie, Sam; Gaskin, Clive; Walker, Richard; Harris, Ian A; Mittal, Rajat
The clinimetric properties of knee goniometry are essential to appreciate in light of its extensive use in the orthopaedic and rehabilitative communities. Intra-observer reliability is thought to be satisfactory, but the validity and inter-rater reliability of knee goniometry often demonstrate unacceptable levels of variation. This study tests the validity and reliability of measuring knee range of motion using goniometry and photographic records. Methodology study assessing the validity and reliability of one method ('Marker Method') which uses a skin marker over the greater trochanter and another method ('Line of Femur Method') which requires estimation of the line of femur. Radiology and orthopaedic departments of two teaching hospitals. 31 volunteers (13 arthritic and 18 healthy subjects). Knee range of motion was measured radiographically and photographically using a goniometer. Three assessors were assessed for reliability and validity. Agreement between methods and within raters was assessed using concordance correlation coefficient (CCCs). Agreement between raters was assessed using intra-class correlation coefficients (ICCs). 95% limits of agreement for the mean difference for all paired comparisons were computed. Validity (referenced to radiographs): Each method for all 3 raters yielded very high CCCs for flexion (0.975 to 0.988), and moderate to substantial CCCs for extension angles (0.478 to 0.678). The mean differences and 95% limits of agreement were narrower for flexion than they were for extension. Intra-rater reliability: For flexion and extension, very high CCCs were attained for all 3 raters for both methods with slightly greater CCCs seen for flexion (CCCs varied from 0.981 to 0.998). Inter-rater reliability: For both methods, very high ICCs (min to max: 0.891 to 0.995) were obtained for flexion and extension. Slightly higher coefficients were obtained for flexion compared to extension, and with the Marker compared to the Line of Femur Method
Ricardo Franco de Lima
Full Text Available Abstract This paper aimed to verify evidences of validity and reliability of Luria-Nebraska Test for Children (TLN-C, in Portuguese. Three hundred eighty-seven students aged 6–13 years old, with learning difficulties, comprised the study. They were assessed with the Wechsler Intelligence Scale for Children (WISC-III and TLN-C; and effect of age differences, as well as accuracy rating by internal consistency were investigated. Age effects were found for all subtests and in the general score, except for receptive speech subtest, even when total IQ effect was controlled. Reliability analysis had satisfactory results (0.79. The TLN-C showed evidences of validity and reliability. Receptive speech subtest requires revision.
Verite, B.; Villain, B.; Venturini, V.; Hugonnard, S.; Bryla, P.
Electricite de France (EDF) has recently developed its OMF-Structures method, designed to optimize preventive maintenance of passive structures such as pipes and support, based on risk. In particular, reliability performances of components need to be determined; it is a two-step process, consisting of a qualitative sort followed by a quantitative evaluation, involving two types of models. Initially, degradation models are widely used to exclude some components from the field of preventive maintenance. The reliability of the remaining components is then evaluated by means of quantitative reliability models. The results are then included in a risk indicator that is used to directly optimize preventive maintenance tasks. (author)
Lane, Mariella M; Czyzewski, Danita I; Chumpitazi, Bruno P; Shulman, Robert J
This study sought to: evaluate the ability of children to reliably use a modified Bristol Stool Form Scale for Children (mBSFS-C), evaluate criterion-related validity of the mBSFS-C, and identify the lower age limit for mBSFS-C use. The mBSFS-C comprises 5 stool form types described and depicted in drawings. Children 3 to 18 years old rated stool form for 10 stool photographs. Because of low reliability when stool form descriptors were not read aloud (n = 119), a subsequent sample of children (n = 191) rated photographs with descriptors read. Intraclass correlation coefficients for descriptor-unread versus -read samples were 0.62 and 0.79, respectively. Children were increasingly reliable with age. Percentage of correct ratings varied with stool form type, but generally increased with age. With descriptors unread, children 8 years and older demonstrated acceptable interobserver reliability, with >78% of ratings correct. With descriptors read, children 6 years and older demonstrated acceptable reliability, with >80% of ratings correct. The mBSFS-C is reliable and valid for use by children, with age 6 years being the lower limit for scale use with descriptors read and age 8 years being the lower limit without descriptors read. We anticipate that the mBSFS-C can be effectively used in pediatric clinical and research settings. Copyright © 2011 Mosby, Inc. All rights reserved.
Eraydın, Ayten; Akarsu, Mesut; Derviş Hakim, Gözde; Keskinoğlu, Pembe; Ellidokuz, Hülya
Chronic liver diseases have been shown to adversely affect the quality of life. Standardized tools for patient assessment are of great importance for treatment and follow-up of these patients. In this study, we aimed to determine the validity and reliability of the Liver Symptom Index 2.0 (LDSI 2.0) for Turkish society, for use in other studies and in daily clinical practice. A total of 308 patients with chronic liver disease attending to the outpatient liver clinic of the Department of Gastroenterology, Faculty of Medicine, Dokuz Eylül University between September 2011 and May 2012 were included in this study. A sociodemographic data questionnaire, the LDSI 2.0 comprising 24 items, and the Short Form-36 (SF-36) were completed by the participating patients. After 6 weeks, these tools were re-administered to a total of 115 patients. After obtaining the required permissions, LDSI 2.0 was translated into Turkish using the translation/re-translation method. Of the 308 participants, 160 (51.9%) were male and 184 (43.1%) were female, with an average age of 48.67±13.31 years. Of all cases, 70.5% had viral hepatitis. The average Child-Pugh score was 5.9±1.2, and the average Model For End-Stage Liver Disease (MELD) score was 10.2±3.2. The assessment tool comprised the following sub-items: itching, joint pain, abdominal pain, sleepiness, worry, appetite, depression, fear, jaundice, memory, personality, financial status, use of time, sexual desire, and sexual activity. For more than 50% of the patients, worry (68.8%), depression (65.3%), joint pain (62.3%), itch (56.5%), sleepiness (54.2%), memory problems (53.6), and sexual problems (50%) were present. The internal coefficient of consistency (Cronbach alpha coefficient) was 0.908, which indicates a very high level of consistency. The correlation coefficient for the intraobserver test/re-test reliability was 0.746 (pvalidity between each sub-item of the tool and sub-items of SF-36 was assessed using Spearman's correlation
Steeves, Jeremy A; Tyo, Brian M; Connolly, Christopher P; Gregory, Douglas A; Stark, Nyle A; Bassett, David R
This study compared the validity of a new Omron HJ-303 piezoelectric pedometer and 2 other pedometers (Sportline Traq and Yamax SW200). To examine the effect of speed, 60 subjects walked on a treadmill at 2, 3, and 4 mph. Twenty subjects also ran at 6, 7, and 8 mph. To test lifestyle activities, 60 subjects performed front-back-side-side stepping, elliptical machine and stair climbing/descending. Twenty others performed ballroom dancing. Sixty participants completed 5 100-step trials while wearing 5 different sets of the devices tested device reliability. Actual steps were determined using a hand tally counter. Significant differences existed among pedometers (P < .05). For walking, the Omron pedometers were the most valid. The Sportline overestimated and the Yamax underestimated steps (P < .05). Worn on the waist or in the backpack, the Omron device and Sportline were valid for running. The Omron was valid for 3 activities (elliptical machine, ascending and descending stairs). The Sportline overestimated all of these activities, and Yamax was only valid for descending stairs. The Omron andYamax were both valid and reliable in the 100-step trials. The Omron HJ-303, worn on the waist, appeared to be the most valid of the 3 pedometers.
Zischg, Andreas Paul; Mosimann, Markus; Bernet, Daniel Benjamin; Röthlisberger, Veronika
Flood impact modelling requires reliable models for the simulation of flood processes. In recent years, flood inundation models have been remarkably improved and widely used for flood hazard simulation, flood exposure and loss analyses. In this study, we validate a 2D inundation model for the purpose of flood exposure analysis at the river reach scale. We validate the BASEMENT simulation model with insurance claims using conventional validation metrics. The flood model is established on the basis of available topographic data in a high spatial resolution for four test cases. The validation metrics were calculated with two different datasets; a dataset of event documentations reporting flooded areas and a dataset of insurance claims. The model fit relating to insurance claims is in three out of four test cases slightly lower than the model fit computed on the basis of the observed inundation areas. This comparison between two independent validation data sets suggests that validation metrics using insurance claims can be compared to conventional validation data, such as the flooded area. However, a validation on the basis of insurance claims might be more conservative in cases where model errors are more pronounced in areas with a high density of values at risk.
da Silva, José Vitor; Baptista, Makilim Nunes
To assess validity and reliability of the Vitor Quality of Life Scale for the Elderly (VITOR QLSE). A sociodemographic questionnaire, a mental status questionnaire, the VITOR QLSE, the World Health Organization Quality of Life Instrument old module (WHOQOL-OLD), and the Baptista Depression Scale adult version (EBADEP-A) were administered to a non-probabilistic sample of 617 elderly persons living in Brazil. Exploratory factor analysis was performed to reduce the 70 items of the first version of the VITOR QLSE. Construct validity was then evaluated; the VITOR QLSE was tested against the WHOQOL-OLD and EBADEP-A. One hundred and ninety-two randomly selected participants completed the instrument again 7-15 days after the first interview, providing a test-retest reliability estimate. Exploratory factor analysis reduced the 70 items to 48 items grouped into six domains: autonomy and psychological, environment, physical independence, family, health, and social domains. The total explained variance was 39.46 %. Cronbach's alpha for overall reliability was 0.93 and ranged from 0.79 for health to 0.90 for physical independence. Pearson's correlation coefficient (r) was 0.76 for test-retest reliability, and 0.56 and -0.57 for the correlation of VITOR QLSE scores with those of the WHOQOL-OLD and EBADEP-A, respectively. The final version of the VITOR QLSE has 48 items grouped into six domains and shows adequate validity and reliability. The rapid aging of the population and reduced number of instruments in the Latin America, and more specifically in Brazil, assessing quality of life in the elderly justify the development of a valid and reliable tool.
Cabanas-Sánchez, Verónica; Martínez-Gómez, David; Esteban-Cornejo, Irene; Castro-Piñero, José; Conde-Caveda, Julio; Veiga, Óscar L
To develop a questionnaire able to assess time spent by youth in a wide range of leisure-time sedentary behaviors (SB) and evaluate its test-retest reliability and criterion validity. Cross-sectional observational. The reliability sample included 194 youth, aged 10-18 years, who completed the questionnaire twice, separated by one-week interval. The validity study comprised 1207 participants aged 8-18 years. Participants wore an accelerometer for 7 consecutive days. The questionnaire was designed to assess the amount of time spent in twelve different SB during weekdays and weekends, separately. In order to avoid usual phenomenon of time over reporting, values were adjusted to real available leisure-time (LT) for each participant. Reliability was assessed by using Intraclass Correlation Coefficients (ICC) and weighted (quadratic) kappa (k), and validity was assessed by using Pearson correlation and Bland-Altman plots. The reliability of questionnaire showed a moderate-to-substantial agreement for the most (91%) of items (k=0.43-0.74; ICC=0.41-0.79) with three items (4%) reaching an almost perfect agreement (ICC=0.82-0.83). Only 'sitting and talking' evidenced fair-to-moderate reliability (k=0.27-0.39; ICC=0.34-0.46). The relationship between average sedentary time assessed by the questionnaire and accelerometry was moderate (r=0.36; pquestionnaire and accelerometer sedentary time for average day (r=0.05; p=0.11) but Bland-Altman plots suggest moderate discrepancies between both methods of SB measurement (mean=19.86; limits of agreement=-280.04 to 319.76). The questionnaire showed moderate to good test-retest reliability and a moderate level of validity for assessing SB in youth, similar or slightly better to previously published in this population. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Mehmet Emrah Karadere
Full Text Available Reasoning with Inductive Argument Test:A Study of Validity and Reliability Objective: The aim of our study is to research reliability and validity and to evaluate the usability of Turkish version of Reasoning with Inductive Argument Test (RIAT in Turkish healty population. Method: 51 healty volunteers who work in Ankara Dıskapi Yildirim Beyazit Research and Training Hospital participated in this study. Reasoning with Inductive Argument Test (RIAT was translated into Turkish by three clinical good knowledge of English. Participants were given a sociodemographic data form, and RIAT were performed by clinicians. To test the reliability of the Turkish version of RIAT, Cronbach’s alpha coefficient was calculated and the halving method was used for the test. Results: The internal consistency of the Reasoning with Inductive Argument Test (RIAT items, Cronbach’s alpha internal consistency coefficient measurements of 0.73 was found to be statistically significant. Spearman-Brown coefficient that determines the reliability of the whole test r=0.74 was found. Kurtosis values of all the items was below 1.5 and the percentages in the second evaluation were mainly lower. At the same time, both change in belief between self produced RIAT options and given RIAT options (p=0.02, z=-2296 as well as changes in beliefs between related and unrelated items for Obsessive Compulsive Disorder (OCD difference (p=0.03, z=-2.199 were significant. Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that ‘Reasoning with Inductive Argument Test’ supports reliability and validity in Turkish population.
Reis, Cláudia Tartaglia; Laguardia, Josué; Vasconcelos, Ana Glória Godoi; Martins, Mônica
The evaluation of the culture of patient safety in hospitals is nowadays considered as a management too, since it helps to identify problem areas and provide valuable information for planning improvements. This study explored the reliability and validity of the Brazilian version of the Hospital Survey on Patient Safety Culture, an instrument that evaluates characteristics of patient safety culture among hospital staff. The reliability of the instrument was evaluated by analyzing the internal consistency of each dimension. The validity of the tool was carried out by means of exploratory and confirmatory factor analysis. The sample was made up of 322 questionnaires that were collected in two Brazilian hospitals in 2012. Cronbach's alpha ranged from 0.52 to 0.91 for the different dimensions, with the exception of two, for which it was much lower. After excluding four items, the exploratory factor analysis presented adjusted indices that were appropriate for a 10 factor model.
Naruse, Takashi; Sakai, Mahiro; Nagata, Satoko
The Relational Coordination Scale is used to measure teamwork with specific subjects about specific tasks. We translated the scale into Japanese (J-RCS) and examined its reliability and validity. We translated the scale into Japanese. In July 2013, we conducted an internet survey for 124 home visiting nurses of 11 agencies in Shiga prefecture. The J-RCS items asked about teamwork with physicians, care managers, and colleague nurses. Eighty-one home visiting nurses (65.3%) answered the survey, and 70 (56.5% of the responders) answered the re-test survey. Cronbach's alpha coefficients of J-RCS were 0.770 to 0.859, and the intraclass correlation coefficients for test-retest reliability were 0.673 to 0.830 (P<0.001). In the confirmatory factor analysis, only J-RCS ［physicians］ met the well fit indices and showed factorial validity. Concurrent validity was supported by correlations of J-RCS and the degree of confidence about collaboration (r=0.559 to 0.585, P<0.001) and social support (r=0.549 to 0.789, P<0.001). While issues such as factorial validity and the concentration of responses on some scale items will be a topic for further research, the J-RCS has been shown to have an acceptable level of reliability and validity.
Full Text Available Abstract: Introduction: The eHEALS is an 8-item measure of eHealth literacy developed to measure consumers’ combined knowledge, comfort, and perceived skills at finding, evaluating, and applying electronic health information to health problems. The current study aims to measure validity and reliability of the Iranian version of eHEALS questionnaire in a population context. Materials & Methods: A cross-sectional study was done on 525 youths people who has been chosen randomly in Iran, Yazd. We determined content validity, construct validity and predictive validity of the translated questionnaire. Principal components factor analysis was used to determine the theoretical fit of the measures with the data. The internal consistency of the translated questionnaire was evaluated using Cronbach α coefficient. The results were analyzed in SPSSv16. Results: The principal component analysis (PCA produced a single factor solution (70.48% of variance with factor loading ranging from 0.723 to 0.862. The internal consistency of the scale was sufficient (alpha=0.88 , P<0.001 and the test-retest coefficients for the items were reliable (r= 0.96, P<0.001. Discussion: The results of the study showed that the items in the translated questionnaire were equivalent to the original scale .The version of the eHEALS questionnaire showed both good reliability and validity for the screening of eHealth literacy of Iranian people.
Ricardo Andrés Cartes-Velásquez
Full Text Available The clinical paradigm of evidence-based medicine requires a foundation of good quality research upon which clinical and epidemiological decisions can be based. Several instruments have been designed to assess research quality and validated, though most have limitations. The MINCIR scale was designed to determine the methodological quality (MQ of clinical research, its psychometric properties for dental research involving large-scale evaluations have not yet been determined. The aim of this study was to determine the validity and reliability of the MINCIR scale for assessment of the MQ of dental therapy studies published in journals indexed in Institute for Scientific Information (ISI databases. A validation study was performed on a sample of 99 articles from four representative ISI dental journals. Criterion validity was determined in relation to level of evidence (LoE classification as described by the Oxford Center for Evidence-Based Medicine (OCEBM ranking system, reliability was determined by calculation of intra-class correlation coefficient (ICC values, and internal consistency was determined by calculation of Cronbach’s alpha. Very good inter-observer reliability (ICC = 0.93, excellent temporal stability (ICC = 0.97, good internal consistency (Cronbach’s alpha = 0.77, and a strong (inverse correlation with OCEBM LoEs (-0.807; p < .0001 were obtained. These results indicate that the MINCIR scale has adequate psychometric properties and therefore is a valid option for use in the assessment of MQ in dental therapy research articles.
Huh, Jung-Wook; Eun, Il-Soo; Ko, Young-Chul; Park, Man-Jun; Hwang, Keum-Min; Park, Sook-Hyun; Park, Tae-Hong; Park, Joon-Hyung
Although translated versions of the Foot Function Index (FFI) in several languages are available, the absence of a Korean version has precluded comparing the data from Korea with the data from other countries using the FFI. We, therefore, evaluated the reliability and validity of the adapted Korean version of the FFI. We translated the English version of the FFI into Korean and back into English. We mailed the Korean version of the visual analog scale, FFI, and the previously validated Medical Outcomes Study Short-Form 36-item questionnaire (SF-36) to 121 patients with treated foot complaints. To evaluate the test-retest reliability and internal consistency, we used the intraclass correlation coefficient and Cronbach's α, respectively. We also evaluated the concurrent and construct validity of Korean version of the FFI by comparing the visual analog scale and SF-36. Cronbach's α was 0.91 and 0.95 for the pain and disability subscales, respectively. The reproducibility was good, and a strong correlation between the FFI and the SF-36 and visual analog scale with related content was observed, indicating good construct validity.The Korean version of the FFI is a reliable and valid questionnaire for the self-assessment of pain and disability in Korean patients with foot complaints. Copyright © 2016 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Jesus-Ribeiro, Joana; Vieira, Elsa; Ferreira, Pedro; Januário, Cristina; Freire, António
Parkinson's disease has a significant impact in quality of life, which can be assessed with 39-item Parkinson's Disease Questionnaire and Parkinson's Disease Quality of Life Questionnaire. This study aimed to evaluate the reliability and validity of these scales in Portuguese patients. Reliability was assessed through internal consistency (Cronbach's alpha) and reproducibility (intraclass correlation coefficient). Regarding construct validity, we performed one-way analysis of variance across different groups according to modified Hoehn and Yahr scale. For criterion validity, we compared both scales with each other and with the Short Form 36-item Health Survey. In a total of 100 patients with Parkinson's disease, Cronbach's alpha ranged for 39-item Parkinson's Disease Questionnaire between 0.66 - 0.98, and for Parkinson's Disease Quality of Life Questionnaire, between 0.78 - 0.98. Intraclass correlation coefficient for 39-item Parkinson's Disease Questionnaire ranged between 0.49 - 0.96, and for Parkinson's Disease Quality of Life Questionnaire, ranged between 0.65 - 0.96. Both scales showed, in general, capacity to discriminate differences among patients in the different stages of disease. The scales presented moderate to strong magnitude correlations with some Short Form 36-item Health Survey domains. Cronbach's alpha coefficients for most domains were satisfactory. Overall, it has been demonstrated good reproducibility, as well as construct and criterion validity. The Portuguese versions of both scales showed to be valid and reliable.
Park, Ik Keun; Park, Un Su; Kim, Hyun Mook; Park, Yoon Won; Kang, Suk Chull; Choi, Young Hwan; Lee, Jin Ho
Ultrasonic inspection system is consisted of the operator, equipment and procedure. The reliability of results in ultrasonic inspection is affected by its ability. Furthermore, the reliability of nondestructive testing is influenced by the inspection environment, other materials and types of defect. Therefore, it is very difficult to estimate the reliability of NDT due to various factors. In this study, the probability of detection, used logistic probability model and Monte Carlo simulation, estimated the reliability of ultrasonic inspection. The utility of the NDT reliability assessment is verified by the analysis of the data from round robin test applied these models
Opdam, K T M; Zwiers, R; Wiegerinck, J I; Kleipool, A E B; Haverlag, R; Goslings, J C; van Dijk, C N
Patient-reported outcome measures (PROMs) have become a cornerstone for the evaluation of the effectiveness of treatment. The Achilles tendon Total Rupture Score (ATRS) is a PROM for outcome and assessment of an Achilles tendon rupture. The aim of this study was to translate the ATRS to Dutch and evaluate its reliability and validity in the Dutch population. A forward-backward translation procedure was performed according to the guidelines of cross-cultural adaptation process. The Dutch ATRS was evaluated for reliability and validity in patients treated for a total Achilles tendon rupture from 1 January 2012 to 31 December 2014 in one teaching hospital and one academic hospital. Reliability was assessed by the intraclass correlation coefficients (ICC), Cronbach's alpha and minimal detectable change (MDC). We assessed construct validity by calculation of Spearman's rho correlation coefficient with domains of the Foot and Ankle Outcome Score (FAOS), Victorian Institute of Sports Assessment-Achilles questionnaire (VISA-A) and Numeric Rating Scale (NRS) for pain in rest and during running. The Dutch ATRS had a good test-retest reliability (ICC = 0.852) and a high internal consistency (Cronbach's alpha = 0.96). MDC was 30.2 at individual level and 3.5 at group level. Construct validity was supported by 75 % of the hypothesized correlations. The Dutch ATRS had a strong correlation with NRS for pain during running (r = -0.746) and all the five subscales of the Dutch FAOS (r = 0.724-0.867). There was a moderate correlation with the VISA-A-NL (r = 0.691) and NRS for pain in rest (r = -0.580). The Dutch ATRS shows an adequate reliability and validity and can be used in the Dutch population for measuring the outcome of treatment of a total Achilles tendon rupture and for research purposes. Diagnostic study, Level I.
Montilla-Ibáñez, Alharilla; Martínez-Amat, Antonio; Lomas-Vega, Rafael; Cruz-Díaz, David; Torre-Cruz, Manuel J De la; Casuso-Pérez, Rafael; Hita-Contreras, Fidel
To examine the reliability and validity of the Spanish version of the Activities-specific Balance Confidence scale (ABC-S), and its ability to discriminate between patients with and without a history of falls among a Spanish population with vestibular disorders. A total of 84 participants completed the ABC-S. Internal consistency, test-retest reliability and construct validity (exploratory factor analysis) were analysed. Concurrent validity was evaluated using the 12-item Short Form Health Survey (SF-12) and the Dizziness Handicap Inventory (DHI). To determine the accuracy of the ABC total score in discriminating patients with and without a history of falls, a receiver operating characteristic (ROC) curve analysis was performed. The ABC-S showed excellent internal consistency (Cronbach's α = 0.916) and substantial test-retest reliability (ICC = 0.86, 95% CI: 0.74-0.93), with standard error and minimal detectable change values of 8.64 and 16.94, respectively. Factor analyses suggested a three-factor structure (explained variance was 62.24%). The ABC total score significantly correlated with the physical component summary score of the SF-12 and with the DHI-P, DHI-E, DHI-F and DHI total scores (p ABC-S was significantly able to discriminate between participants with and without a history of falls (p ABC-S is a valid and reliable instrument, suitable to assess balance confidence in Spanish patients with vestibular disorders. Implications for Rehabilitation The Spanish version of the ABC scale is a valid and reliable measure of balance confidence in patients with vestibular disorders. In persons with vestibular disorders, the Spanish version of the ABC scale has shown the ability to discriminate between patients with and without a history of falls in the last year.
Kaux, Jean-François; Delvaux, François; Oppong-Kyei, Julian; Dardenne, Nadia; Beaudart, Charlotte; Buckinx, Fanny; Croisier, Jean-Louis; Forthomme, Bénédicte; Crielaard, Jean-Michel; Bruyère, Oliver
The Victorian Institute of Sport Assessment - Achilles tendinopathy questionnaire (VISA-A) evaluates the clinical severity of Achilles tendinopathy. The aim of this study was to translate the VISA-A into French and to study the reliability and validity of this French version, the VISA-AF. The VISA-A was translated into French to produce the VISA-AF using a validated methodology in six steps. Thereafter, several psychometric properties of this French version such as test-retest reliability, internal consistency, construct validity and floor and ceiling effects were evaluated. Therefore, we recruited 116 subjects, distributed into 3 groups: pathological patients (n = 31), at-risk athletes (n = 63) and healthy people (n = 22). The final version of the VISA-AF was approved by an expert committee. On a scale ranging from 0 to 100, the average scores of the VISA-AF obtained were 59 (± 18) for the pathological group, 99 (± 1) for the healthy group and 94 (± 7) for the at-risk group. The VISA-AF shows excellent reliability, low correlations with the discriminant subscales of the SF-36 and moderate correlations with the convergent subscales of the SF-36. The French version of the VISA-A is equivalent to its original version and is a reliable and valid questionnaire for French-speaking patients with Achilles tendinopathy. Implication for Rehabilitation The VISA-AF questionnaire is a reliable translation of the original VISA-A, from English into French, which is one of the most widespread languages in the world. The VISA-AF questionnaire is now a valid instrument that can be used by clinicians and researchers to assess the severity of pain and disability of French-speaking subjects with Achilles tendinopathy. The VISA-AF is a questionnaire to assess the severity of Achilles tendinopathy symptoms but is not a diagnostic tool.
Brindle, Richard A; Ebaugh, D David; Milner, Clare E
Side-lying hip abductor strength tests are commonly used to evaluate muscle strength. In a 'break' test the tester applies sufficient force to lower the limb to the table while the patient resists. The peak force is postulated to occur while the leg is lowering, thus representing the participant's eccentric muscle strength. However, it is unclear whether peak force occurs before or after the leg begins to lower. To determine intra-rater reliability and construct validity of a hip abductor eccentric strength test. Intra-rater reliability and construct validity study. Twenty healthy adults (26 ±6 years; 1.66 ±0.06 m; 62.2 ±8.0 kg) made two visits to the laboratory at least one week apart. During the hip abductor eccentric strength test, a hand-held dynamometer recorded peak force and time to peak force and limb position was recorded via a motion capture system. Intra-rater reliability was determined using intra-class correlation (ICC), standard error of measurement (SEM), and minimal detectable difference (MDD). Construct validity was assessed by determining if peak force occurred after the start of the lowering phase using a one-sample t-test. The hip abductor eccentric strength test had substantial intra-rater reliability (ICC( 3,3 ) = 0.88; 95% confidence interval: 0.65-0.95), SEM of 0.9%BWh, and a MDD of 2.5%BWh. Construct validity was established as peak force occurred 2.1s (±0.6s; range 0.7s to 3.7s) after the start of the lowering phase of the test (p ≤ 0.001). The hip abductor eccentric strength test is a valid and reliable measure of eccentric muscle strength. This test may be used clinically to assess changes in eccentric muscle strength over time.
Gusi, N; Perez-Sousa, M A; Gozalo-Delgado, M; Olivares, P R
A proxy version of the EQ-5D-Y, a questionnaire to evaluate the Health Related Quality of Life (HRQoL) in children and adolescents, has recently been developed. There are currently no data on the validity and reliability of this tool. The objective of this study was to analyze the validity and reliability of the EQ-5D-Y proxy version. A core set of self-report tools, including the Spanish version of the EQ-5D-Y were administered to a group of Spanish children and adolescents drawn from the general population. A similar core set of internationally standardized proxy tools, including the EQ-5D-Y proxy version were administered to their parents. Test-retest reliability was determined, and correlations with other generic measurements of HRQoL were calculated. Additionally, known group validity was examined by comparing groups with a priori expected differences in HRQoL. The agreement between the self-report and proxy version responses was also calculated. A total of 477 children and adolescents and their parents participated in the study. One week later, 158 participants completed the EQ-5D-Y/EQ-5D-Y proxy to facilitate reliability analysis. Agreement between the test-retest scores was higher than 88% for EQ-5D-Y self-report, and proxy version. Correlations with other health measurements showed similar convergent validity to that observed in the international EQ-5D-Y. Agreement between the self-report and proxy versions ranged from 72.9% to 97.1%. The results provide preliminary evidence of the reliability and validity of the EQ-5D-Y proxy version. Copyright © 2013 Asociación Española de Pediatría. Published by Elsevier Espana. All rights reserved.
Davies, K; Bulsara, M; Ramelet, A S; Monteroso, L
To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide 'inexperienced' paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse 'experts' developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. 'Experienced' (n=12) and 'inexperienced' (n=14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the 'experts' (n=9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with 'experienced' and 'inexperienced' PIC nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by 'experts' (n=9), 'inexperienced' (n=14) and 'experienced' (n=12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the 'inexperienced' nurse to guide decision-making related to endotracheal tube suction. This article is protected by copyright
Craig, Cora Lynn; Russell, Storm J; Cameron, Christine
This investigation assessed the reliability and criterion validity of the Physical Activity Monitor, a telephone-interview adaptation of the Minnesota Leisure Time Physical Activity Questionnaire (MLTPAQ), which is currently used to assess trends in the Canadian population. A sample of 512 people aged 18 yr and older was selected by random-digit dialing for telephone interviewing in the reliability study. The Monitor questions were administered twice, 3 wk apart. For the criterion validity study, a sample of 148 people aged 18-69 yr was selected at random from households. Participants completed the Monitor questions by telephone and an in-home step test to estimate maximum oxygen uptake. Another random sample of individuals aged 18-69 yr participated in a comparison study of the Monitor against the 1988 Campbell's Survey of Well-Being (CSWB) instrument. All studies were conducted in the vicinity of Toronto, Ontario. Spearman correlations controlling for age and sex were calculated as a measure of association for the reliability, validity, and comparison studies. Validity estimates were further adjusted for body mass index and physical activity demands of work and chores. The Monitor instrument produced reliable estimates of total energy expenditure (P=0.90, Pvalidity of 0.36 (Pinstruments was 0.77 (Ptest-retest reliability and criterion validity. The research also demonstrated that for the purpose of population monitoring a change in data collection mode-telephone interview versus self-administration in households-can yield reasonably comparable estimates from two adaptations of the MLTPAQ.