validity testing showed: Topics by WorldWideScience.org

Sample records for validity testing showed

Convergent validity test, construct validity test and external validity test of the David Liberman algorithm

Directory of Open Access Journals (Sweden)

David Maldavsky

2013-08-01

The author first presents a complement to a previous test of convergent validity, then a construct validity test, and finally an external validity test of the David Liberman algorithm (DLA). The first part of the paper focuses on a complementary aspect, the differential sensitivity of the DLA (1) in an external comparison (to other methods), and (2) in an internal comparison (between two ways of using the same method, the DLA). The construct validity test presents the concepts underlying the DLA, their operationalization, and some corrections emerging from several empirical studies we carried out. The external validity test examines the possibility of investigating a single case and its relation to the investigation of a larger sample.
Continuous validation of ASTEC containment models and regression testing

International Nuclear Information System (INIS)

Nowack, Holger; Reinke, Nils; Sonnenkalb, Martin

2014-01-01

The focus of the ASTEC (Accident Source Term Evaluation Code) development at GRS is primarily on the containment module CPA (Containment Part of ASTEC), whose modelling is to a large extent based on the GRS containment code COCOSYS (COntainment COde SYStem). Validation is usually understood as the approval of the modelling capabilities by calculations of appropriate experiments done by external users different from the code developers. During the development process of ASTEC CPA, bugs and unintended side effects may occur, which leads to changes in the results of the initially conducted validation. Due to the involvement of a considerable number of developers in the coding of ASTEC modules, validation of the code alone, even if executed repeatedly, is not sufficient. Therefore, a regression testing procedure has been implemented in order to ensure that the initially obtained validation results are still valid with succeeding code versions. Within the regression testing procedure, calculations of experiments and plant sequences are performed with the same input deck but applying two different code versions. For every test-case the up-to-date code version is compared to the preceding one on the basis of physical parameters deemed to be characteristic for the test-case under consideration. In the case of post-calculations of experiments also a comparison to experimental data is carried out. Three validation cases from the regression testing procedure are presented within this paper. The very good post-calculation of the HDR E11.1 experiment shows the high quality modelling of thermal-hydraulics in ASTEC CPA. Aerosol behaviour is validated on the BMC VANAM M3 experiment, and the results show also a very good agreement with experimental data. Finally, iodine behaviour is checked in the validation test-case of the THAI IOD-11 experiment. Within this test-case, the comparison of the ASTEC versions V2.0r1 and V2.0r2 shows how an error was detected by the regression testing
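The version-to-version comparison described in this abstract can be sketched as follows. This is only an illustration, not the GRS procedure: the parameter names, the numerical values, and the 5% relative tolerance are all hypothetical.

```python
from typing import Dict, List


def compare_versions(
    previous: Dict[str, float],
    current: Dict[str, float],
    rel_tol: float = 0.05,  # hypothetical tolerance, not taken from the paper
) -> List[str]:
    """Return the characteristic parameters that drifted by more than
    rel_tol between the preceding and the up-to-date code version."""
    flagged = []
    for name, old in previous.items():
        new = current[name]
        # Relative deviation; use an absolute deviation when old == 0.
        scale = abs(old) if old != 0 else 1.0
        if abs(new - old) / scale > rel_tol:
            flagged.append(name)
    return flagged


# Hypothetical results of one test case, same input deck, two code versions.
v_prev = {"peak_pressure_bar": 2.50, "iodine_gas_fraction": 0.010}
v_curr = {"peak_pressure_bar": 2.52, "iodine_gas_fraction": 0.016}

print(compare_versions(v_prev, v_curr))
```

In a fuller setup, the same characteristic parameters would also be compared against experimental data for post-calculations of experiments, as the abstract notes.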
Effort, symptom validity testing, performance validity testing and traumatic brain injury.

Science.gov (United States)

Bigler, Erin D

2014-01-01

To understand the neurocognitive effects of brain injury, valid neuropsychological test findings are paramount. This review examines the research on what has been referred to as symptom validity testing (SVT). A score above a designated cut-score signifies a 'passing' SVT performance, which is likely the best indicator of valid neuropsychological test findings. Likewise, performance substantially below the cut-point that nears chance or is at chance signifies invalid test performance. Significantly below-chance performance is the sine qua non neuropsychological indicator for malingering. However, the interpretative problems with SVT performance below the cut-point yet far above chance are substantial, as pointed out in this review. This intermediate, border-zone performance on SVT measures is where substantial interpretative challenges exist. Case studies are used to highlight the many areas where additional research is needed. Historical perspectives are reviewed along with the neurobiology of effort. Reasons why the term performance validity testing (PVT) may be better than the term SVT are reviewed. Advances in neuroimaging techniques may be key in better understanding the meaning of border-zone SVT failure. The review demonstrates the problems with rigidity in interpretation with established cut-scores. A better understanding of how certain types of neurological, neuropsychiatric and/or even test conditions may affect SVT performance is needed.
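The three zones described in this abstract (passing, border zone, significantly below chance) can be illustrated with a one-sided binomial test. The sketch below assumes a two-alternative forced-choice task; the 50-item length, the cut-score of 45, the 0.05 significance level, and the choice to count a score equal to the cut-score as passing are hypothetical and are not taken from the review.

```python
from math import comb


def below_chance_p(correct: int, n_items: int, p_chance: float = 0.5) -> float:
    """Probability of scoring `correct` or fewer out of `n_items`
    by guessing alone (one-sided binomial tail)."""
    return sum(
        comb(n_items, k) * p_chance**k * (1 - p_chance) ** (n_items - k)
        for k in range(correct + 1)
    )


def classify(correct: int, n_items: int, cut_score: int, alpha: float = 0.05) -> str:
    """Assign a score to one of the three zones described in the abstract."""
    if correct >= cut_score:  # assumption: reaching the cut-score counts as passing
        return "pass"
    if below_chance_p(correct, n_items) < alpha:
        return "significantly below chance"
    return "below cut-score, not below chance"


# Hypothetical 50-item test with a cut-score of 45.
for score in (47, 40, 22, 15):
    print(score, classify(score, n_items=50, cut_score=45))
```

Scores such as 40 or 22 fall in the border zone: they fail the cut-score but cannot be distinguished statistically from, or lie above, guessing.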
Validity of the Eating Attitude Test among Exercisers.

Science.gov (United States)

Lane, Helen J; Lane, Andrew M; Matheson, Hilary

2004-12-01

Theory testing and construct measurement are inextricably linked. To date, no published research has looked at the factorial validity of an existing eating attitude inventory for use with exercisers. The Eating Attitude Test (EAT) is a 26-item measure that yields a single index of disordered eating attitudes. The original factor analysis showed three interrelated factors: Dieting behavior (13-items), oral control (7-items), and bulimia nervosa-food preoccupation (6-items). The primary purpose of the study was to examine the factorial validity of the EAT among a sample of exercisers. The second purpose was to investigate relationships between eating attitudes scores and selected psychological constructs. In stage one, 598 regular exercisers completed the EAT. Confirmatory factor analysis (CFA) was used to test the single-factor, a three-factor model, and a four-factor model, which distinguished bulimia from food pre-occupation. CFA of the single-factor model (RCFI = 0.66, RMSEA = 0.10), the three-factor-model (RCFI = 0.74; RMSEA = 0.09) showed poor model fit. There was marginal fit for the 4-factor model (RCFI = 0.91, RMSEA = 0.06). Results indicated five-items showed poor factor loadings. After these 5-items were discarded, the three models were re-analyzed. CFA results indicated that the single-factor model (RCFI = 0.76, RMSEA = 0.10) and three-factor model (RCFI = 0.82, RMSEA = 0.08) showed poor fit. CFA results for the four-factor model showed acceptable fit indices (RCFI = 0.98, RMSEA = 0.06). Stage two explored relationships between EAT scores, mood, self-esteem, and motivational indices toward exercise in terms of self-determination, enjoyment and competence. Correlation results indicated that depressed mood scores positively correlated with bulimia and dieting scores. Further, dieting was inversely related with self-determination toward exercising. Collectively, findings suggest that a 21-item four-factor model shows promising validity coefficients among
Validity evidence based on test content.

Science.gov (United States)

Sireci, Stephen; Faulkner-Bond, Molly

2014-01-01

Validity evidence based on test content is one of the five forms of validity evidence stipulated in the Standards for Educational and Psychological Testing developed by the American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. In this paper, we describe the logic and theory underlying such evidence and describe traditional and modern methods for gathering and analyzing content validity data. A comprehensive review of the literature and of the aforementioned Standards is presented. For educational tests and other assessments targeting knowledge and skill possessed by examinees, validity evidence based on test content is necessary for building a validity argument to support the use of a test for a particular purpose. By following the methods described in this article, practitioners have a wide arsenal of tools available for determining how well the content of an assessment is congruent with and appropriate for the specific testing purposes.
Mollusc reproductive toxicity tests - Development and validation of test guidelines

DEFF Research Database (Denmark)

Ducrot, Virginie; Holbech, Henrik; Kinnberg, Karin Lund

The Organisation for Economic Cooperation and Development is promoting the development and validation of mollusc toxicity tests within its test guidelines programme, eventually aiming for the standardization of mollusc apical toxicity tests. Through collaborative work between academia, industry and stakeholders, this study aims to develop innovative partial life-cycle tests on the reproduction of the freshwater gastropods Potamopyrgus antipodarum and Lymnaea stagnalis, which are relevant candidate species for the standardization of mollusc apical toxicity tests assessing reprotoxic effects of chemicals. Draft standard operating procedures (SOPs) have been designed based upon literature and expert knowledge from project partners. Pre-validation studies have been implemented to validate the proposed test conditions and identify issues in performing the SOPs and analyzing test results. Pre-validation work...
Development and Validation of a Test for Bulimia.

Science.gov (United States)

Smith, Marcia C.; Thelen, Mark H.

1984-01-01

Developed the Bulimia Test (BULIT) based on responses of clinically identified females (N=18) and normal female college students (N=119) to preliminary test items. Results showed that the BULIT provided an objective, reliable, and valid measure by which to identify individuals with symptoms of bulimia. (Instrument is appended.) (LLL)
Predictive validity of the Biomedical Admissions Test: an evaluation and case study.

Science.gov (United States)

McManus, I C; Ferguson, Eamonn; Wakeford, Richard; Powis, David; James, David

2011-01-01

There has been an increase in the use of pre-admission selection tests for medicine. Such tests need to show good psychometric properties. Here, we use a paper by Emery and Bell [2009. The predictive validity of the Biomedical Admissions Test for pre-clinical examination performance. Med Educ 43:557-564] as a case study to evaluate and comment on the reporting of psychometric data in the field of medical student selection (and the comments apply to many papers in the field). We highlight pitfalls when reliability data are not presented, how simple zero-order associations can lead to inaccurate conclusions about the predictive validity of a test, and how biases need to be explored and reported. We show with BMAT that it is the knowledge part of the test which does all the predictive work. We show that without evidence of incremental validity it is difficult to assess the value of any selection tests for medicine.
Validation of SSC using the FFTF natural-circulation tests

International Nuclear Information System (INIS)

Horak, W.C.; Guppy, J.G.; Kennett, R.J.

1982-01-01

As part of the Super System Code (SSC) validation program, the 100% power FFTF natural circulation test has been simulated using SSC. A detailed 19 channel, 2 loop model was used in SSC. Comparisons showed SSC calculations to be in good agreement with the Fast Flux Test Facility (FFTF) test data. Simulation of the test was obtained in real time.
The Validation of NAA Method Used as Test Method in Serpong NAA Laboratory

International Nuclear Information System (INIS)

Rina-Mulyaningsih, Th.

2004-01-01

The NAA method is a non-standard testing method. A testing laboratory shall validate the method it uses to ensure and confirm that it is suitable for the application. The NAA methods have been validated with respect to accuracy, precision, repeatability and selectivity. NIST 1573a Tomato Leaves, NIES 10C unpolished rice flour and standard elements were used in this testing program. Testing with NIST 1573a showed that the elements Na, Zn, Al and Mn met the acceptance criteria for accuracy and precision, whereas Co was rejected. Testing with NIES 10C showed that Na and Zn met the acceptance criteria for accuracy and precision, but Mn was rejected. The selectivity test showed that the quantity value is between 0.1 and 2.5 μg, depending on the element. (author)
Validity and Reliability Testing of an e-learning Questionnaire for Chemistry Instruction

Science.gov (United States)

Guspatni, G.; Kurniawati, Y.

2018-04-01

The aim of this paper is to examine the validity and reliability of a questionnaire used to evaluate e-learning implementation in chemistry instruction. Forty-eight questionnaires were filled in by students who had studied chemistry through an e-learning system. The questionnaire consisted of 20 indicators evaluating students’ perception of using e-learning. Parametric testing was done as the data were assumed to follow a normal distribution. Item validity of the questionnaire was examined through item-total correlation using Pearson’s formula, while its reliability was assessed with Cronbach’s alpha formula. Moreover, convergent validity was assessed to see whether the indicators building a factor theoretically had the same underlying construct. Validity testing revealed 19 valid indicators, while reliability testing revealed a Cronbach’s alpha value of .886. Factor analysis showed that the questionnaire consisted of five factors, and each of them had indicators building the same construct. This article shows the importance of factor analysis in obtaining a construct-valid questionnaire before it is used as a research instrument.
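The two statistics named in this abstract, Pearson item-total correlation and Cronbach's alpha, can be computed as in the sketch below. The response matrix is hypothetical (5 respondents, 4 items) and the item-total correlation shown is the uncorrected form, in which each item is included in the total; the abstract does not say which form the authors used.

```python
from statistics import mean, variance
from typing import List


def pearson(x: List[float], y: List[float]) -> float:
    """Pearson product-moment correlation of two equal-length lists."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def item_total_correlations(scores: List[List[float]]) -> List[float]:
    """Correlate each item (column) with the respondents' total scores."""
    totals = [sum(row) for row in scores]
    n_items = len(scores[0])
    return [pearson([row[j] for row in scores], totals) for j in range(n_items)]


def cronbach_alpha(scores: List[List[float]]) -> float:
    """alpha = k/(k-1) * (1 - sum of item variances / variance of totals)."""
    k = len(scores[0])
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical Likert-type responses: rows are respondents, columns are items.
data = [
    [4, 5, 4, 5],
    [3, 4, 3, 4],
    [5, 5, 4, 5],
    [2, 3, 2, 3],
    [4, 4, 5, 4],
]

print([round(r, 3) for r in item_total_correlations(data)])
print(round(cronbach_alpha(data), 3))
```

An item would typically be judged valid when its item-total correlation exceeds the critical value of r for the sample size, a threshold that has to be looked up separately.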
Reasoning with Inductive Argument Test: A Study of Validity and Reliability

Directory of Open Access Journals (Sweden)

Mehmet Emrah Karadere

2013-12-01

Conclusion: The preliminary data obtained from the study of the reliability and validity of the scale support the reliability and validity of the Reasoning with Inductive Argument Test in the Turkish population. [JCBPR 2013; 2(3): 156-161]
Differential Weighting of Items to Improve University Admission Test Validity

Directory of Open Access Journals (Sweden)

Eduardo Backhoff Escudero

2001-05-01

This paper gives an evaluation of different ways to increase university admission test criterion-related validity, by differentially weighting test items. We compared four methods of weighting multiple-choice items of the Basic Skills and Knowledge Examination (EXHCOBA): (1) punishing incorrect responses by a constant factor, (2) weighting incorrect responses, considering the levels of error, (3) weighting correct responses, considering the item’s difficulty, based on the Classic Measurement Theory, and (4) weighting correct responses, considering the item’s difficulty, based on the Item Response Theory. Results show that none of these methods increased the instrument’s predictive validity, although they did improve its concurrent validity. It was concluded that it is appropriate to score the test by simply adding up correct responses.
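To make the scoring alternatives concrete, the sketch below contrasts simple number-right scoring with method (1), a constant penalty for incorrect responses, and with a classical difficulty weighting in the spirit of method (3). The penalty of 0.25, the weight 1 - p (where p is the proportion of examinees answering the item correctly), and all data are hypothetical; the paper's actual weights are not given in the abstract.

```python
from typing import List, Optional


def number_right(responses: List[Optional[str]], key: List[str]) -> int:
    """Simple scoring: count the correct responses."""
    return sum(r == k for r, k in zip(responses, key))


def penalized(responses: List[Optional[str]], key: List[str],
              penalty: float = 0.25) -> float:
    """Constant penalty per incorrect response; omissions (None) score 0."""
    score = 0.0
    for r, k in zip(responses, key):
        if r is None:
            continue
        score += 1.0 if r == k else -penalty
    return score


def difficulty_weighted(responses: List[Optional[str]], key: List[str],
                        p_values: List[float]) -> float:
    """Weight each correct response by 1 - p, so harder items earn more."""
    return sum(1.0 - p for r, k, p in zip(responses, key, p_values) if r == k)


key = ["A", "C", "B", "D"]
responses = ["A", "B", None, "D"]   # one wrong answer, one omission
p_values = [0.9, 0.5, 0.4, 0.2]     # hypothetical proportions correct

print(number_right(responses, key))
print(penalized(responses, key))
print(difficulty_weighted(responses, key, p_values))
```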
Thyroid-specific questions on work ability showed known-groups validity among Danes with thyroid diseases.

Science.gov (United States)

Nexo, Mette Andersen; Watt, Torquil; Bonnema, Steen Joop; Hegedüs, Laszlo; Rasmussen, Åse Krogh; Feldt-Rasmussen, Ulla; Bjorner, Jakob Bue

2015-07-01

We aimed to identify the best approach to work ability assessment in patients with thyroid disease by evaluating the factor structure, measurement equivalence, known-groups validity, and predictive validity of a broad set of work ability items. Based on the literature and interviews with thyroid patients, 24 work ability items were selected from previous questionnaires, revised, or developed anew. Items were tested among 632 patients with thyroid disease (non-toxic goiter, toxic nodular goiter, Graves' disease (with or without orbitopathy), autoimmune hypothyroidism, and other thyroid diseases), 391 of which had participated in a study 5 years previously. Responses to select items were compared to general population data. We used confirmatory factor analyses for categorical data, logistic regression analyses and tests of differential item function, and head-to-head comparisons of relative validity in distinguishing known groups. Although all work ability items loaded on a common factor, the optimal factor solution included five factors: role physical, role emotional, thyroid-specific limitations, work limitations (without disease attribution), and work performance. The scale on thyroid-specific limitations showed the most power in distinguishing clinical groups and time since diagnosis. A global single item proved useful for comparisons with the general population, and a thyroid-specific item predicted labor market exclusion within the next 5 years (OR 5.0, 95 % CI 2.7-9.1). Items on work limitations with attribution to thyroid disease were most effective in detecting impact on work ability and showed good predictive validity. Generic work ability items remain useful for general population comparisons.
Optimal number of tests to achieve and validate product reliability

International Nuclear Information System (INIS)

Ahmed, Hussam; Chateauneuf, Alaa

2014-01-01

The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces
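For orientation, the number of tests needed to demonstrate a reliability target at a given confidence level is often estimated with the classical zero-failure (success-run) relation n = ln(1 - C) / ln(R). The sketch below implements that textbook relation only; it is not the coupled cost optimization proposed in the paper, and the targets used are hypothetical.

```python
from math import ceil, log


def success_run_tests(reliability: float, confidence: float) -> int:
    """Smallest number of consecutive failure-free tests n such that
    reliability**n <= 1 - confidence (zero-failure demonstration)."""
    if not (0.0 < reliability < 1.0 and 0.0 < confidence < 1.0):
        raise ValueError("reliability and confidence must lie strictly in (0, 1)")
    return ceil(log(1.0 - confidence) / log(reliability))


# Hypothetical demonstration targets (reliability, confidence).
for r, c in [(0.90, 0.90), (0.95, 0.90), (0.99, 0.95)]:
    print(r, c, success_run_tests(r, c))
```

The steep growth of n as the reliability target approaches 1 is one reason a trade-off between design cost and testing cost, as studied in the paper, becomes relevant.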
Test re-test reliability and construct validity of the star-track test of manual dexterity

DEFF Research Database (Denmark)

Kildebro, Niels; Amirian, Ilda; Gögenur, Ismail

2015-01-01

Objectives. We wished to determine test re-test reliability and construct validity of the star-track test of manual dexterity. Design. Test re-test reliability was examined in a controlled study. Construct validity was tested in a blinded randomized crossover study. Setting. The study was performed at a university hospital in Denmark. Participants. A total of 11 subjects for test re-test and 20 subjects for the construct validity study were included. All subjects were healthy volunteers. Intervention. The test re-test trial had two measurements with 2 days pause in between. The interventions in the construct validity study included baseline measurement, intervention 1: fatigue, intervention 2: stress, and intervention 3: fatigue and stress. There was a 2 day pause between each intervention. Main outcome measure. An integrated measure of completion time and number of errors was used. Results. All...
VALIDITY OF THE EATING ATTITUDE TEST AMONG EXERCISERS

Directory of Open Access Journals (Sweden)

Hilary Matheson

2004-12-01

Theory testing and construct measurement are inextricably linked. To date, no published research has looked at the factorial validity of an existing eating attitude inventory for use with exercisers. The Eating Attitude Test (EAT) is a 26-item measure that yields a single index of disordered eating attitudes. The original factor analysis showed three interrelated factors: Dieting behavior (13-items), oral control (7-items), and bulimia nervosa-food preoccupation (6-items). The primary purpose of the study was to examine the factorial validity of the EAT among a sample of exercisers. The second purpose was to investigate relationships between eating attitudes scores and selected psychological constructs. In stage one, 598 regular exercisers completed the EAT. Confirmatory factor analysis (CFA) was used to test the single-factor, a three-factor model, and a four-factor model, which distinguished bulimia from food pre-occupation. CFA of the single-factor model (RCFI = 0.66, RMSEA = 0.10) and the three-factor model (RCFI = 0.74; RMSEA = 0.09) showed poor model fit. There was marginal fit for the 4-factor model (RCFI = 0.91, RMSEA = 0.06). Results indicated five-items showed poor factor loadings. After these 5-items were discarded, the three models were re-analyzed. CFA results indicated that the single-factor model (RCFI = 0.76, RMSEA = 0.10) and three-factor model (RCFI = 0.82, RMSEA = 0.08) showed poor fit. CFA results for the four-factor model showed acceptable fit indices (RCFI = 0.98, RMSEA = 0.06). Stage two explored relationships between EAT scores, mood, self-esteem, and motivational indices toward exercise in terms of self-determination, enjoyment and competence. Correlation results indicated that depressed mood scores positively correlated with bulimia and dieting scores. Further, dieting was inversely related with self-determination toward exercising. Collectively, findings suggest that a 21-item four-factor model shows promising validity coefficients
Impact on participation and autonomy: test of validity and reliability for older persons

Directory of Open Access Journals (Sweden)

Isabelle Ottenvall Hammar

2014-10-01

In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA) assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps: first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a test-retest reliability analysis of an adjusted version. The validity was tested with focus groups and individual interviews with persons aged 77-88 years, and the reliability with persons aged 70-99 years. The validity test showed that IPA-S is valid for older persons, but that it was too extensive and the phrasing of the items needed adjustments. The test-retest reliability analysis of the adjusted questionnaire, IPA-Older persons (IPA-O), showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation.
Validation of Symptom Validity Tests Using a "Child-model" of Adult Cognitive Impairments

NARCIS (Netherlands)

Rienstra, A.; Spaan, P. E. J.; Schmand, B.

2010-01-01

Validation studies of symptom validity tests (SVTs) in children are uncommon. However, since children's cognitive abilities are not yet fully developed, their performance may provide additional support for the validity of these measures in adult populations. Four SVTs, the Test of Memory Malingering
Construct Validity of Neuropsychological Tests in Schizophrenia.

Science.gov (United States)

Allen, Daniel N.; Aldarondo, Felito; Goldstein, Gerald; Huegel, Stephen G.; Gilbertson, Mark; van Kammen, Daniel P.

1998-01-01

The construct validity of neuropsychological tests in patients with schizophrenia was studied with 39 patients who were evaluated with a battery of six tests assessing attention, memory, and abstract reasoning abilities. Results support the construct validity of the neuropsychological tests in patients with schizophrenia. (SLD)

The validation of language tests

African Journals Online (AJOL)

KATEVG

Stellenbosch Papers in Linguistics, Vol. ... validation is necessary because of the major impact which test results can have on the many ... Messick (1989: 20) introduces his much-quoted progressive matrix (cf. table 1), which ... argue that current accounts of validity only superficially address theories of measurement.
Valid methods: the quality assurance of test method development, validation, approval, and transfer for veterinary testing laboratories.

Science.gov (United States)

Wiegers, Ann L

2003-07-01

Third-party accreditation is a valuable tool to demonstrate a laboratory's competence to conduct testing. Accreditation, internationally and in the United States, has been discussed previously. However, accreditation is only 1 part of establishing data credibility. A validated test method is the first component of a valid measurement system. Validation is defined as confirmation by examination and the provision of objective evidence that the particular requirements for a specific intended use are fulfilled. The international and national standard ISO/IEC 17025 recognizes the importance of validated methods and requires that laboratory-developed methods or methods adopted by the laboratory be appropriate for the intended use. Validated methods are therefore required and their use agreed to by the client (i.e., end users of the test results such as veterinarians, animal health programs, and owners). ISO/IEC 17025 also requires that the introduction of methods developed by the laboratory for its own use be a planned activity conducted by qualified personnel with adequate resources. This article discusses considerations and recommendations for the conduct of veterinary diagnostic test method development, validation, evaluation, approval, and transfer to the user laboratory in the ISO/IEC 17025 environment. These recommendations are based on those of nationally and internationally accepted standards and guidelines, as well as those of reputable and experienced technical bodies. They are also based on the author's experience in the evaluation of method development and transfer projects, validation data, and the implementation of quality management systems in the area of method development.
Independent validation of the MMPI-2-RF Somatic/Cognitive and Validity scales in TBI Litigants tested for effort.

Science.gov (United States)

Youngjohn, James R; Wershba, Rebecca; Stevenson, Matthew; Sturgeon, John; Thomas, Michael L

2011-04-01

The MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008) is replacing the MMPI-2 as the most widely used personality test in neuropsychological assessment, but additional validation studies are needed. Our study examines MMPI-2-RF Validity scales and the newly created Somatic/Cognitive scales in a recently reported sample of 82 traumatic brain injury (TBI) litigants who either passed or failed effort tests (Thomas & Youngjohn, 2009). The restructured Validity scales FBS-r (restructured symptom validity), F-r (restructured infrequent responses), and the newly created Fs (infrequent somatic responses) were not significant predictors of TBI severity. FBS-r was significantly related to passing or failing effort tests, and Fs and F-r showed non-significant trends in the same direction. Elevations on the Somatic/Cognitive scales profile (MLS-malaise, GIC-gastrointestinal complaints, HPC-head pain complaints, NUC-neurological complaints, and COG-cognitive complaints) were significant predictors of effort test failure. Additionally, HPC had the anticipated paradoxical inverse relationship with head injury severity. The Somatic/Cognitive scales as a group were better predictors of effort test failure than the RF Validity scales, which was an unexpected finding. MLS arose as the single best predictor of effort test failure of all RF Validity and Somatic/Cognitive scales. Item overlap analysis revealed that all MLS items are included in the original MMPI-2 Hy scale, making MLS essentially a subscale of Hy. This study validates the MMPI-2-RF as an effective tool for use in neuropsychological assessment of TBI litigants.
Technique for unit testing of safety software verification and validation

International Nuclear Information System (INIS)

Li Duo; Zhang Liangju; Feng Junting

2008-01-01

The key issue arising from digitalization of the reactor protection system for nuclear power plants is how to carry out verification and validation (V and V), to demonstrate and confirm that the software that performs reactor safety functions is safe and reliable. One of the most important processes for software V and V is unit testing, which verifies and validates the software coding based on concept design for consistency, correctness and completeness during software development. The paper presents a preliminary study on the technique for unit testing of safety software V and V, focusing on such aspects as how to confirm test completeness, how to establish a test platform, how to develop test cases and how to carry out unit testing. The technique discussed here was successfully used in the work of unit testing on safety software of a digital reactor protection system. (authors)
15 CFR 995.27 - Format validation software testing.

Science.gov (United States)

2010-01-01

... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying... specification. These tests may be combined with testing of the conversion software. ... 15 Commerce and Foreign Trade 3 2010-01-01 2010-01-01 false Format validation software testing...
Validation of the Information/Communications Technology Literacy Test

Science.gov (United States)

2016-10-01

Technical Report 1360. Validation of the Information/Communications Technology Literacy Test. D. Matthew Trippe, Human Resources Research... Title and subtitle: Validation of the Information/Communications Technology Literacy Test. Contract or grant number: W91WAS-09-D-0013... validate a measure of cyber aptitude, the Information/Communications Technology Literacy Test (ICTL), in predicting trainee performance in Information
Validation of the Stroke Specific Quality of Life Scale (SS-QOL): test of reliability and validity of the Danish version (SS-QOL-DK).

Science.gov (United States)

Muus, Ingrid; Williams, Linda S; Ringsberg, Karin C

2007-07-01

To test the reliability and validity of the Danish version of the Stroke Specific Quality of Life Scale version 2.0 (SS-QOL-DK), an instrument for evaluation of health-related quality of life. A correlational study. A stroke unit that provides acute care and rehabilitation for stroke patients in Frederiksborg County, Denmark. One hundred and fifty-two stroke survivors participated; 24 of these performed test-retest. Questionnaires were sent out and returned by mail. A subsequent telephone interview assessed functional level and missing items. Test-retest was measured using Spearman's r, internal consistency was estimated using Cronbach's alpha, and evaluation of floor and ceiling values in proportion of minimum and maximum scores. Construct validity was assessed by comparing patients' scores on the SS-QOL-DK with those obtained by other test methods: Beck's Depression Index, the General Health Survey Short Form 36 (SF-36), the Barthel Index and the National Institutes of Health Stroke Scale, evaluating shared variance using coefficient of determination, r². Comparing groups with known scores assessed known-group validity. Convergent and discriminant validity were assessed. Test-retest of SS-QOL-DK showed excellent stability, Spearman's r = 0.65-0.99. Internal consistency for all domains showed Cronbach's alpha = 0.81-0.94. Missing items rate was 1.0%. Most SS-QOL-DK domains showed moderately shared variance with similar domains of other test methods, r² = 0.03-0.62. Groups with known differences showed statistically significant difference in scores. Item-to-scale correlation coefficients of 0.37-0.88 supported convergent validity. SS-QOL-DK is a reliable and valid instrument for measuring self-reported health-related quality of life on group level among people with mild to moderate stroke.
Test-driven verification/validation of model transformations

Institute of Scientific and Technical Information of China (English)

László LENGYEL; Hassan CHARAF

2015-01-01

Why is it important to verify/validate model transformations? The motivation is to improve the quality of the transformations, and therefore the quality of the generated software artifacts. Verified/validated model transformations make it possible to ensure certain properties of the generated software artifacts. In this way, verification/validation methods can guarantee different requirements stated by the actual domain against the generated/modified/optimized software products. For example, a verified/validated model transformation can ensure the preservation of certain properties during the model-to-model transformation. This paper emphasizes the necessity of methods that make model transformation verified/validated, discusses the different scenarios of model transformation verification and validation, and introduces the principles of a novel test-driven method for verifying/validating model transformations. We provide a solution that makes it possible to automatically generate test input models for model transformations. Furthermore, we collect and discuss the actual open issues in the field of verification/validation of model transformations.
Development and psychometric validation of the verbal affective memory test

DEFF Research Database (Denmark)

Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S

2015-01-01

We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non... Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.
Validity and reliability of the NAB Naming Test.

Science.gov (United States)

Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto

2016-05-01

Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.
Validation of the German version of the Ford Insomnia Response to Stress Test.

Science.gov (United States)

Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

2018-06-01

The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P ... Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.
Testing ESL pragmatics development and validation of a web-based assessment battery

CERN Document Server

Roever, Carsten

2014-01-01

Although second language learners' pragmatic competence (their ability to use language in context) is an essential part of their general communicative competence, it has not been a part of second language tests. This book helps fill this gap by describing the development and validation of a web-based test of ESL pragmalinguistics. The instrument assesses learners' knowledge of routine formulae, speech acts, and implicature in 36 multiple-choice and brief-response items. The test's quantitative and qualitative validation with 300 learners showed high reliability and provided strong evidence of
Test validation of nuclear and fossil fuel control operators

International Nuclear Information System (INIS)

Moffie, D.J.

1976-01-01

To establish job relatedness, one must go through a procedure of concurrent and predictive validation. For concurrent validity a group of employees is tested and the test scores are related to performance concurrently or during the same time period. For predictive validity, individuals are tested but the results of these tests are not used at the time of employment. The tests are sealed and scored at a later date, and then related to job performance. Job performance data include ratings by supervisors, actual job performance indices, turnover, absenteeism, progress in training, etc. The testing guidelines also stipulate that content and construct validity can be used
Validation and test report

DEFF Research Database (Denmark)

Pedersen, Jens Meldgaard; Andersen, T. Bull

2012-01-01

. As a consequence of extensive movement artefacts seen during dynamic contractions, the following validation and test report consists of a report that investigates the physiological responses to a static contraction in a standing and a supine position. Eight subjects performed static contractions of the ankle...
The validity of the Michigan Alcoholism Screening Test (MAST)

DEFF Research Database (Denmark)

Storgaard, H; Nielsen, S D; Gluud, C

1994-01-01

This review examines the validity of the Michigan Alcoholism Screening Test (MAST) as a screening instrument for alcohol problems. Studies that compare the MAST-questionnaire with other defined diagnostic criteria of alcohol problems were retrieved through MEDLINE and a cross-bibliographic check....... A total of 20 validity studies were included. The studies varied considerably regarding the prevalence of alcohol problems, the diagnostic criteria, and the examined patient categories. The MAST compared with other diagnostic criteria of alcohol problems gave validity measures with the following span...... and the specificities show substantial variations. The variables that seem to have the largest influence on the PVpos seem to be the prevalence of alcohol problems, the diagnostic method against which the MAST-questionnaire is validated, and the populations on which the MAST is applied. The MAST should in the future...
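The dependence of the positive predictive value (PVpos) on prevalence noted above follows from Bayes' rule. The sketch below illustrates this with hypothetical sensitivity, specificity and prevalence values; none of the numbers come from the MAST studies, and the function name is an assumption.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """PPV = P(condition | positive test), from Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_pos = (1 - specificity) * (1 - prevalence)
    return true_pos / (true_pos + false_pos)


# Hypothetical screening test: same accuracy, two different populations.
for prevalence in (0.5, 0.1):
    ppv = positive_predictive_value(0.9, 0.9, prevalence)
    print(f"prevalence={prevalence:.0%}  PPV={ppv:.2f}")
```

With identical sensitivity and specificity, the same instrument yields a much lower PPV in a low-prevalence population, which is one reason validity figures from different patient categories are hard to compare.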
IP validation in remote microelectronics testing

Science.gov (United States)

Osseiran, Adam; Eshraghian, Kamran; Lachowicz, Stefan; Zhao, Xiaoli; Jeffery, Roger; Robins, Michael

2004-03-01

This paper presents the test and validation of FPGA based IP using the concept of remote testing. It demonstrates how a virtual tester environment based on a powerful, networked Integrated Circuit testing facility, aimed to complement the emerging Australian microelectronics based research and development, can be employed to perform the tasks beyond the standard IC test. IC testing in production consists in verifying the tested products and eliminating defective parts. Defects could have a number of different causes, including process defects, process migration and IP design and implementation errors. One of the challenges in semiconductor testing is that while current fault models are used to represent likely faults (stuck-at, delay, etc.) in a global context, they do not account for all possible defects. Research in this field keeps growing but the high cost of ATE is preventing a large community from accessing test and verification equipment to validate innovative IP designs. For these reasons a world class networked IC teletest facility has been established in Australia under the support of the Commonwealth government. The facility is based on a state-of-the-art semiconductor tester operating as a virtual centre spanning Australia and accessible internationally. Through a novel approach the teletest network provides virtual access to the tester on which the DUT has previously been placed. The tester software is then accessible as if the designer is sitting next to the tester. This paper presents the approach used to test and validate FPGA based IPs using this remote test approach.
Construct Validity of the Nepalese School Leaving English Reading Test

Science.gov (United States)

Dawadi, Saraswati; Shrestha, Prithvi N.

2018-01-01

There has been a steady interest in investigating the validity of language tests in the last decades. Despite numerous studies on construct validity in language testing, there are not many studies examining the construct validity of a reading test. This paper reports on a study that explored the construct validity of the English reading test in…
Validation test case generation based on safety analysis ontology

International Nuclear Information System (INIS)

Fan, Chin-Feng; Wang, Wen-Shing

2012-01-01

Highlights: ► Current practice in validation test case generation for nuclear system is mainly ad hoc. ► This study designs a systematic approach to generate validation test cases from a Safety Analysis Report. ► It is based on a domain-specific ontology. ► Test coverage criteria have been defined and satisfied. ► A computerized toolset has been implemented to assist the proposed approach. - Abstract: Validation tests in the current nuclear industry practice are typically performed in an ad hoc fashion. This study presents a systematic and objective method of generating validation test cases from a Safety Analysis Report (SAR). A domain-specific ontology was designed and used to mark up a SAR; relevant information was then extracted from the marked-up document for use in automatically generating validation test cases that satisfy the proposed test coverage criteria; namely, single parameter coverage, use case coverage, abnormal condition coverage, and scenario coverage. The novelty of this technique is its systematic rather than ad hoc test case generation from a SAR to achieve high test coverage.
Validation testing of safety-critical software

International Nuclear Information System (INIS)

Kim, Hang Bae; Han, Jae Bok

1995-01-01

A software engineering process has been developed for the design of safety-critical software for the Wolsung 2/3/4 project to satisfy the requirements of the regulatory body. Within this process, this paper describes in detail the validation testing performed to ensure that the software with its hardware, developed by the design group, satisfies the requirements of the functional specification prepared by the independent functional group. To perform the tests, a test facility and test software were developed and an actual safety system computer was connected. Three kinds of test cases, i.e., functional tests, performance tests and self-check tests, were programmed and run to verify each functional specification. Test failures were fed back to the design group to revise the software, and test results were analyzed and documented in the report submitted to the regulatory body. The test methodology and procedure proved efficient and satisfactory for performing systematic and automatic tests. The test results were also acceptable and verified that the software acts as specified in the program functional specification. This methodology can be applied to the validation of other safety-critical software. 2 figs., 2 tabs., 14 refs. (Author)

Validation of new CFD release by Ground-Coupled Heat Transfer Test Cases

Directory of Open Access Journals (Sweden)

Sehnalek Stanislav

2017-01-01

Full Text Available This article presents a validation of ANSYS Fluent against IEA BESTEST Task 34. The article starts with an overview of the topic and then describes the steady-state cases used for validation. The implementation of these cases in CFD is described next. The article concludes with a presentation of the simulated results and a comparison with those from simulation software already validated by the IEA. This validation shows high correlation with an older version of the tested ANSYS software as well as with other major software. The paper ends with a discussion and an outline of future research.
DTU PMU Laboratory Development - Testing and Validation

DEFF Research Database (Denmark)

Garcia-Valle, Rodrigo; Yang, Guang-Ya; Martin, Kenneth E.

2010-01-01

This is a report of the results of phasor measurement unit (PMU) laboratory development and testing done at the Centre for Electric Technology (CET), Technical University of Denmark (DTU). Analysis of the PMU performance first required the development of tools to convert the DTU PMU data into IEEE...... standard, and the validation is done for the DTU-PMU via a validated commercial PMU. The commercial PMU has been tested from the authors' previous efforts, where the response can be expected to follow known patterns and provide confirmation about the test system to confirm the design and settings....... In a nutshell, having 2 PMUs that observe same signals provides validation of the operation and flags questionable results with more certainty. Moreover, the performance and accuracy of the DTU-PMU is tested acquiring good and precise results, when compared with a commercial phasor measurement device, PMU-1....
Validating the Interpretations and Uses of Test Scores

Science.gov (United States)

Kane, Michael T.

2013-01-01

To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Validity of selected cardiovascular field-based test among Malaysian ...

African Journals Online (AJOL)

In response to the emerging obesity problem among Malaysians, this research was formulated to validate published tests among healthy female adults. The selected tests, namely the 20 meter multi-stage shuttle run, 2.4 km run test, 1 mile walk test and Harvard Step test, were correlated with a laboratory test (Bruce protocol) to find the criterion validity ...
Validation of the Arabic Version of the Internet Gaming Disorder-20 Test.

Science.gov (United States)

Hawi, Nazir S; Samaha, Maya

2017-04-01

In recent years, researchers have been trying to shed light on gaming addiction and its association with different psychiatric disorders and psychological determinants. The latest edition of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) included in its Section 3 Internet Gaming Disorder (IGD) as a condition for further empirical study and proposed nine criteria for the diagnosis of IGD. The 20-item Internet Gaming Disorder (IGD-20) Test was developed as a valid and reliable tool to assess gaming addiction based on the nine criteria set by the DSM-5. The aim of this study is to validate an Arabic version of the IGD-20 Test. The Arabic version of IGD-20 will not only help in identifying Arabic-speaking pathological gamers but also stimulate cross-cultural studies that could contribute to an area in need of more research for insight and treatment. After a process of translation and back-translation and with the participation of a sizable sample of Arabic-speaking adolescents, the present study conducted a psychometric validation of the IGD-20 Test. Our confirmatory factor analysis showed the validity of the Arabic version of the IGD-20 Test. The one-factor model of the Arabic IGD-20 Test had very good psychometric properties, and it fitted the sample data extremely well. In addition, correlation analysis between the IGD-20 Test and the daily duration of gameplay on weekdays and weekends revealed significant positive relationships that supported criterion-related validity. Thus, the Arabic version of the IGD-20 Test is a valid and reliable measure of IGD among Arabic-speaking populations.
Convergent and diagnostic validity of STAVUX, a word and pseudoword spelling test for adults.

Science.gov (United States)

Östberg, Per; Backlund, Charlotte; Lindström, Emma

2016-10-01

Few comprehensive spelling tests are available in Swedish, and none have been validated in adults with reading and writing disorders. The recently developed STAVUX test includes word and pseudoword spelling subtests with high internal consistency and adult norms stratified by education. This study evaluated the convergent and diagnostic validity of STAVUX in adults with dyslexia. Forty-six adults, 23 with dyslexia and 23 controls, took STAVUX together with a standard word-decoding test and a self-rated measure of spelling skills. STAVUX subtest scores showed moderate to strong correlations with word-decoding scores and predicted self-rated spelling skills. Word and pseudoword subtest scores both predicted dyslexia status. Receiver-operating characteristic (ROC) analysis showed excellent diagnostic discriminability. Sensitivity was 91% and specificity 96%. In conclusion, the results of this study support the convergent and diagnostic validity of STAVUX.
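Sensitivity and specificity are simple ratios over the classification counts. The sketch below uses counts that are an assumption: with 23 participants per group, 21 correctly identified dyslexia cases and 22 correctly identified controls would be consistent with the reported 91% and 96%, but the abstract does not state the underlying counts.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from confusion-matrix counts."""
    sensitivity = tp / (tp + fn)   # share of true cases flagged by the test
    specificity = tn / (tn + fp)   # share of controls correctly cleared
    return sensitivity, specificity


# Assumed counts for illustration only (23 cases, 23 controls).
sens, spec = sensitivity_specificity(tp=21, fn=2, tn=22, fp=1)
print(f"sensitivity={sens:.0%}  specificity={spec:.0%}")
```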
Validation of the Vanderbilt Holistic Face Processing Test

OpenAIRE

Wang, Chao-Chih; Ross, David A.; Gauthier, Isabel; Richler, Jennifer J.

2016-01-01

The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the ...
[Reliability and validity of the Chinese version on Alcohol Use Disorders Identification Test].

Science.gov (United States)

Zhang, C; Yang, G P; Li, Z; Li, X N; Li, Y; Hu, J; Zhang, F Y; Zhang, X J

2017-08-10

Objective: To assess the reliability and validity of the Chinese version of the Alcohol Use Disorders Identification Test (AUDIT) among medical students in China and to provide guidance on the correct application of the recommended scales. Methods: An e-questionnaire was developed and sent to medical students in five different colleges. All students participated as volunteers. Cronbach's α and split-half reliability were calculated to evaluate the reliability of AUDIT, while content, construct, discriminant and convergent validity were assessed to measure the validity of the scales. Results: The overall Cronbach's α of AUDIT was 0.782 and the split-half reliability was 0.711. The domain-level Cronbach's α and split-half reliability were 0.796 and 0.794 for hazardous alcohol use, 0.561 and 0.623 for dependence symptoms, and 0.647 and 0.640 for harmful alcohol use. The item-level content validity indices (I-CVI) ranged from 0.83 to 1.00, the scale-level content validity index by universal agreement (S-CVI/UA) was 0.90, the average scale-level content validity index (S-CVI/Ave) was 0.99, and the content validity ratios (CVR) ranged from 0.80 to 1.00. Exploratory factor analysis showed that the simplified version of AUDIT supported a presupposed three-factor structure that explained 61.175% of the total variance. AUDIT seemed to have good convergent and discriminant validity, with a calibration experiment success rate of 100%. Conclusion: AUDIT showed good reliability and validity among medical students in China and is thus worth promoting for use.
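The content validity indices named above are commonly derived from expert relevance ratings on a 4-point scale: the I-CVI is the share of experts rating an item 3 or 4, the S-CVI/Ave is the mean I-CVI, and the S-CVI/UA is the share of items on which all experts agree. The sketch below assumes that convention; the ratings and the number of experts are hypothetical and are not taken from the study.

```python
def content_validity_indices(ratings):
    """ratings[i][j] is the relevance rating (1-4) of item i by expert j."""
    i_cvi = [sum(r >= 3 for r in item) / len(item) for item in ratings]
    s_cvi_ave = sum(i_cvi) / len(i_cvi)                       # mean I-CVI
    s_cvi_ua = sum(v == 1.0 for v in i_cvi) / len(i_cvi)      # universal agreement
    return i_cvi, s_cvi_ave, s_cvi_ua


# Hypothetical ratings: 2 items, 6 experts each.
ratings = [
    [4, 4, 4, 3, 4, 4],   # all experts rate the item relevant
    [4, 4, 3, 3, 4, 2],   # one expert rates the item not relevant
]
i_cvi, s_cvi_ave, s_cvi_ua = content_validity_indices(ratings)
print([round(v, 2) for v in i_cvi], round(s_cvi_ave, 2), s_cvi_ua)
```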
Independent verification and validation testing of the FLASH computer code, Version 3.0

International Nuclear Information System (INIS)

Martian, P.; Chung, J.N.

1992-06-01

Independent testing of the FLASH computer code, Version 3.0, was conducted to determine if the code is ready for use in hydrological and environmental studies at various Department of Energy sites. This report describes the technical basis, approach, and results of this testing. Verification tests and validation tests were used to determine the operational status of the FLASH computer code. These tests were specifically designed to test: correctness of the FORTRAN coding, computational accuracy, and suitability for simulating actual hydrologic conditions. This testing was performed using a structured evaluation protocol which consisted of: blind testing, independent applications, and graduated difficulty of test cases. Both quantitative and qualitative testing was performed through evaluating relative root mean square values and graphical comparisons of the numerical, analytical, and experimental data. Four verification tests were used to check the computational accuracy and correctness of the FORTRAN coding, and three validation tests were used to check the suitability for simulating actual conditions. These test cases ranged in complexity from simple 1-D saturated flow to 2-D variably saturated problems. The verification tests showed excellent quantitative agreement between the FLASH results and analytical solutions. The validation tests showed good qualitative agreement with the experimental data. Based on the results of this testing, it was concluded that the FLASH code is a versatile and powerful two-dimensional analysis tool for fluid flow. In conclusion, all aspects of the code that were tested, except for the unit gradient bottom boundary condition, were found to be fully operational and ready for use in hydrological and environmental studies
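A relative root mean square measure of the kind mentioned above compares numerical output against an analytical or experimental reference. The abstract does not give the report's exact definition, so the sketch below assumes one common form (RMS of the differences divided by RMS of the reference); the function name and sample values are hypothetical.

```python
import math


def relative_rms_error(simulated, reference):
    """RMS of (simulated - reference), normalized by RMS of the reference."""
    if len(simulated) != len(reference):
        raise ValueError("series must have the same length")
    num = sum((s - r) ** 2 for s, r in zip(simulated, reference))
    den = sum(r ** 2 for r in reference)
    return math.sqrt(num / den)


# Hypothetical values: code output versus an analytical solution.
analytical = [1.0, 2.0]
numerical = [1.1, 2.2]
print(f"relative RMS error = {relative_rms_error(numerical, analytical):.3f}")
```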
Wave Tank Testing and Model Validation of an Autonomous Wave Energy Converter

Directory of Open Access Journals (Sweden)

Bret Bosma

2015-08-01

Full Text Available A key component in bringing ocean wave energy converters from concept to commercialization is the building and testing of scaled prototypes to provide model validation. A one quarter scale prototype of an autonomous two body heaving point absorber was modeled, built, and tested for this work. Wave tank testing results are compared with two hydrodynamic and system models—implemented in both ANSYS AQWA and MATLAB/Simulink—and show model validation over certain regions of operation. This work will serve as a guide for future developers of wave energy converter devices, providing insight in taking their design from concept to prototype stage.
Testing and Validation of the Dynamic Inertia Measurement Method

Science.gov (United States)

Chin, Alexander W.; Herrera, Claudia Y.; Spivey, Natalie D.; Fladung, William A.; Cloutier, David

2015-01-01

The Dynamic Inertia Measurement (DIM) method uses a ground vibration test setup to determine the mass properties of an object using information from frequency response functions. Most conventional mass properties testing involves using spin tables or pendulum-based swing tests, which for large aerospace vehicles becomes increasingly difficult and time-consuming, and therefore expensive, to perform. The DIM method has been validated on small test articles but has not been successfully proven on large aerospace vehicles. In response, the National Aeronautics and Space Administration Armstrong Flight Research Center (Edwards, California) conducted mass properties testing on an "iron bird" test article that is comparable in mass and scale to a fighter-type aircraft. The simple two-I-beam design of the "iron bird" was selected to ensure accurate analytical mass properties. Traditional swing testing was also performed to compare the level of effort, amount of resources, and quality of data with the DIM method. The DIM test showed favorable results for the center of gravity and moments of inertia; however, the products of inertia showed disagreement with analytical predictions.
Test Method Facet and the Construct Validity of Listening Comprehension Tests

Directory of Open Access Journals (Sweden)

Roya Khoii

2010-05-01

Full Text Available The assessment of listening abilities is one of the least understood, least developed and, yet, one of the most important areas of language testing and assessment. It is particularly important because of its potential wash-back effects on classroom practices. Given the fact that listening tests play a great role in assessing the language proficiency of students, they are expected to enjoy a high level of construct validity. The present study was dedicated to investigating the construct validity of three different test formats, namely, multiple-choice, gap filling on summary (also called listening summary cloze), and fill-in-the-blank, used to evaluate the listening comprehension of EFL learners. In order to achieve the purpose of the study, three passages with relatively similar readability levels were used for the construction of 9 listening tests, that is, each appeared in three formats. Following a counter-balanced design, the tests were administered to 91 homogeneous EFL learners divided into three groups. The statistical analysis of the results revealed that the multiple-choice test enjoyed the highest level of construct validity. Moreover, a repeated-measures one-way ANOVA demonstrated that the fill-in-the-blank task was the most difficult and the multiple-choice test the easiest for the participants.
Reasoning with Inductive Argument Test: A Study of Validity and Reliability

Directory of Open Access Journals (Sweden)

Mehmet Emrah Karadere

2013-11-01

Full Text Available Objective: The aim of our study is to examine the reliability and validity, and to evaluate the usability, of the Turkish version of the Reasoning with Inductive Argument Test (RIAT) in a healthy Turkish population. Method: Fifty-one healthy volunteers who work at Ankara Dıskapi Yildirim Beyazit Research and Training Hospital participated in this study. The RIAT was translated into Turkish by three clinicians with good knowledge of English. Participants were given a sociodemographic data form, and the RIAT was administered by clinicians. To test the reliability of the Turkish version of the RIAT, Cronbach's alpha coefficient was calculated and the split-half method was used. Results: The Cronbach's alpha internal consistency coefficient of the RIAT items was 0.73, which was statistically significant. The Spearman-Brown coefficient, which determines the reliability of the whole test, was r = 0.74. Kurtosis values of all items were below 1.5, and the percentages in the second evaluation were mainly lower. In addition, both the change in belief between self-produced and given RIAT options (p = 0.02, z = -2.296) and the change in belief between items related and unrelated to Obsessive Compulsive Disorder (OCD) (p = 0.03, z = -2.199) were significant. Conclusion: The preliminary data obtained from this study of the reliability and validity of the scale support the reliability and validity of the 'Reasoning with Inductive Argument Test' in the Turkish population.
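Split-half reliability with the Spearman-Brown correction, as used above, correlates scores on two halves of a test and then steps the correlation up to full test length via r_sb = 2r / (1 + r). The sketch below assumes an odd/even item split, which the abstract does not specify; the function names and toy scores are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def spearman_brown(r_half):
    """Step a half-test correlation up to full-test reliability."""
    return 2 * r_half / (1 + r_half)


def split_half_reliability(scores):
    """Odd/even split-half reliability for rows of item scores."""
    odd = [sum(row[0::2]) for row in scores]    # items 1, 3, 5, ...
    even = [sum(row[1::2]) for row in scores]   # items 2, 4, 6, ...
    return spearman_brown(pearson_r(odd, even))


# A half-test correlation of 0.5 corresponds to a full-test value of about 0.67.
print(round(spearman_brown(0.5), 2))
```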
Test of Gross Motor Development : Expert Validity, confirmatory validity and internal consistence

Directory of Open Access Journals (Sweden)

Nadia Cristina Valentini

2008-12-01

Full Text Available The Test of Gross Motor Development (TGMD-2) is an instrument used to evaluate children's level of motor development. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by experts and the confirmatory factorial validity and the internal consistency by means of test-retest of the Portuguese TGMD-2. A cross-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionals and 587 children, from 27 schools (kindergarten and elementary), from 3 to 10 years old (51.1% boys and 48.9% girls). Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated that the Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices of confirmatory factorial validity (χ2/df = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tucker and Lewis's Index of Fit = 0.83) and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88). The Portuguese TGMD-2 demonstrated validity and reliability for the sample investigated.
Test rig overview for validation and reliability testing of shutdown system software

International Nuclear Information System (INIS)

Zhao, M.; McDonald, A.; Dick, P.

2007-01-01

The test rig for Validation and Reliability Testing of shutdown system software has been upgraded from the AECL Windows-based test rig previously used for CANDU6 stations. It includes a Virtual Trip Computer, which is a software simulation of the functional specification of the trip computer, and a real-time trip computer simulator in a separate chassis, which is used during the preparation of trip computer test cases before the actual trip computers are available. This allows preparation work for Validation and Reliability Testing to be performed in advance of delivery of actual trip computers to maintain a project schedule. (author)
AULA virtual reality test as an attention measure: convergent validity with Conners' Continuous Performance Test.

Science.gov (United States)

Díaz-Orueta, Unai; Garcia-López, Cristina; Crespo-Eguílaz, Nerea; Sánchez-Carpintero, Rocío; Climent, Gema; Narbona, Juan

2014-01-01

The majority of neuropsychological tests used to evaluate attention processes in children lack ecological validity. The AULA Nesplora (AULA) is a continuous performance test, developed in a virtual setting, very similar to a school classroom. The aim of the present study is to analyze the convergent validity between the AULA and the Continuous Performance Test (CPT) of Conners. The AULA and CPT were administered correlatively to 57 children, aged 6-16 years (26.3% female) with average cognitive ability (IQ mean = 100.56, SD = 10.38) who had a diagnosis of attention deficit/hyperactivity disorder (ADHD) according to DSM-IV-TR criteria. Spearman correlations analyses were conducted among the different variables. Significant correlations were observed between both tests in all the analyzed variables (omissions, commissions, reaction time, and variability of reaction time), including for those measures of the AULA based on different sensorial modalities, presentation of distractors, and task paradigms. Hence, convergent validity between both tests was confirmed. Moreover, the AULA showed differences by gender and correlation to Perceptual Reasoning and Working Memory indexes of the WISC-IV, supporting the relevance of IQ measures in the understanding of cognitive performance in ADHD. In addition, the AULA (but not Conners' CPT) was able to differentiate between ADHD children with and without pharmacological treatment for a wide range of measures related to inattention, impulsivity, processing speed, motor activity, and quality of attention focus. Additional measures and advantages of the AULA versus Conners' CPT are discussed.
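
The convergent validity analysis above rests on Spearman correlations between paired scores from two tests. A minimal sketch of that statistic follows, using only the standard library; the function names and any data passed in are illustrative assumptions, not the study's code or data. It ranks each variable (averaging tied ranks) and then computes a Pearson correlation on the ranks.

```python
from statistics import mean


def ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = avg_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))
```

Because only ranks enter the calculation, any monotonic relationship between two measures (for example, omission counts on two attention tests) yields a coefficient of 1 or -1 regardless of scale.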
Development and Validation of a Persian Version of Dichotic Emotional Word Test

Directory of Open Access Journals (Sweden)

Atefe Davudazde

2016-03-01

Introduction: Emotional words, in comparison with neutral words, have different hemispheric specialization. It is assumed that the right hemisphere has a role in processing every kind of emotional word. The objective of the present study was to develop a Persian version of the dichotic emotional word test and to evaluate its validity among adult Persian speakers. Materials and Methods: The study was conducted on 60 adults of both genders, aged 18-30 years, with normal hearing and no history of neurological disorders. The developed test included eight main lists, each with several dichotic emotional/neutral pairs of words. Participants were asked to recall as many words from each list as they could after listening to them. A content validity index was used to analyze the validity of the test. Results: The mean content validity index score was 0.94. In the left ear, emotional words were remembered more than neutral ones (P=0.007), while in the right ear, neutral words were remembered more (P=0.009). There were no significant differences between male and female scores. Conclusion: The dichotic emotional word test has high content validity. The ability to remember emotional words better in the left ear supports the dominant role of the right hemisphere in emotional word perception.
Development and validation of a theoretical test in basic laparoscopy

DEFF Research Database (Denmark)

Strandbygaard, Jeanett; Maagaard, Mathilde; Larsen, Christian Rifbjerg

2013-01-01

for first-year residents in obstetrics and gynecology. This study therefore aimed to develop and validate a framework for a theoretical knowledge test, a multiple-choice test, in basic theory related to laparoscopy. METHODS: The content of the multiple-choice test was determined by conducting informal...... conversational interviews with experts in laparoscopy. The subsequent relevance of the test questions was evaluated using the Delphi method involving regional chief physicians. Construct validity was tested by comparing test results from three groups with expected different clinical competence and knowledge.......001). Internal consistency (Cronbach's alpha) was 0.82. There was no evidence of differential item functioning between the three groups tested. CONCLUSIONS: A newly developed knowledge test in basic laparoscopy proved to have content and construct validity. The formula for the development and validation...
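
The internal consistency figure reported above is Cronbach's alpha. As a hedged illustration of how that coefficient is usually computed, the sketch below applies the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and input layout are assumptions made for this example; the study's item-level data are not available here.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])  # number of items
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

Alpha approaches 1 when items rise and fall together across respondents and drops toward 0 when items are unrelated.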

Dynamic testing in schizophrenia: does training change the construct validity of a test?

Science.gov (United States)

Wiedl, Karl H; Schöttke, Henning; Green, Michael F; Nuechterlein, Keith H

2004-01-01

Dynamic testing typically involves specific interventions for a test to assess the extent to which test performance can be modified, beyond level of baseline (static) performance. This study used a dynamic version of the Wisconsin Card Sorting Test (WCST) that is based on cognitive remediation techniques within a test-training-test procedure. From results of previous studies with schizophrenia patients, we concluded that the dynamic and static versions of the WCST should have different construct validity. This hypothesis was tested by examining the patterns of correlations with measures of executive functioning, secondary verbal memory, and verbal intelligence. Results demonstrated a specific construct validity of WCST dynamic (i.e., posttest) scores as an index of problem solving (Tower of Hanoi) and secondary verbal memory and learning (Auditory Verbal Learning Test), whereas the impact of general verbal capacity and selective attention (Verbal IQ, Stroop Test) was reduced. It is concluded that the construct validity of the test changes with dynamic administration and that this difference helps to explain why the dynamic version of the WCST predicts functional outcome better than the static version.
Methodology for testing and validating knowledge bases

Science.gov (United States)

Krishnamurthy, C.; Padalkar, S.; Sztipanovits, J.; Purves, B. R.

1987-01-01

A test and validation toolset developed for artificial intelligence programs is described. The basic premises of this method are: (1) knowledge bases have a strongly declarative character and represent mostly structural information about different domains, (2) the conditions for integrity, consistency, and correctness can be transformed into structural properties of knowledge bases, and (3) structural information and structural properties can be uniformly represented by graphs and checked by graph algorithms. The interactive test and validation environment have been implemented on a SUN workstation.
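
Premise (3) above says structural properties of a knowledge base can be checked with graph algorithms. One possible instance, offered only as a sketch and not as the toolset's actual method, is detecting circular dependencies among rules. The graph encoding (a dict mapping each rule to the rules it depends on) and the rule names are hypothetical.

```python
def find_cycle(graph):
    """Return one dependency cycle as a list of nodes, or None if acyclic.

    graph maps each node to an iterable of nodes it depends on.
    Uses depth-first search with three node states.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current path, finished
    color = {node: WHITE for node in graph}
    path = []

    def visit(node):
        color[node] = GREY
        path.append(node)
        for nxt in graph.get(node, ()):
            state = color.get(nxt, WHITE)
            if state == GREY:
                # nxt is already on the current path: a cycle closes here.
                return path[path.index(nxt):] + [nxt]
            if state == WHITE:
                found = visit(nxt)
                if found:
                    return found
        path.pop()
        color[node] = BLACK
        return None

    for node in list(graph):
        if color[node] == WHITE:
            found = visit(node)
            if found:
                return found
    return None


# Hypothetical rule base: rule_a depends on rule_b, and so on.
rules = {"rule_a": ["rule_b"], "rule_b": ["rule_c"], "rule_c": ["rule_a"]}
cycle = find_cycle(rules)
```

An integrity condition such as "no rule may depend on itself, directly or indirectly" then reduces to `find_cycle(rules) is None`.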
DTU PMU Laboratory Development - Testing and Validation

OpenAIRE

Garcia-Valle, Rodrigo; Yang, Guang-Ya; Martin, Kenneth E.; Nielsen, Arne Hejde; Østergaard, Jacob

2010-01-01

This is a report of the results of phasor measurement unit (PMU) laboratory development and testing done at the Centre for Electric Technology (CET), Technical University of Denmark (DTU). Analysis of the PMU performance first required the development of tools to convert the DTU PMU data into IEEE standard, and the validation is done for the DTU-PMU via a validated commercial PMU. The commercial PMU has been tested from the authors' previous efforts, where the response can be expected to foll...
Validation of Cardiovascular Parameters during NASA's Functional Task Test

Science.gov (United States)

Arzeno, N. M.; Stenger, M. B.; Bloomberg, J. J.; Platts, S. H.

2009-01-01

Microgravity exposure causes physiological deconditioning and impairs crewmember task performance. The Functional Task Test (FTT) is designed to correlate these physiological changes to performance in a series of operationally-relevant tasks. One of these, the Recovery from Fall/Stand Test (RFST), tests both the ability to recover from a prone position and cardiovascular responses to orthostasis. PURPOSE: Three minutes were chosen for the duration of this test, yet it is unknown if this is long enough to induce cardiovascular responses similar to the operational 5 min stand test. The purpose of this study was to determine the validity and reliability of heart rate variability (HRV) analysis of a 3 min stand and to examine the effect of spaceflight on these measures. METHODS: To determine the validity of using 3 vs. 5 min of standing to assess HRV, ECG was collected from 7 healthy subjects who participated in a 6 min RFST. Mean R-R interval (RR) and spectral HRV were measured in minutes 0-3 and 0-5 following the heart rate transient due to standing. Significant differences between the segments were determined by a paired t-test. To determine the reliability of the 3-min stand test, 13 healthy subjects completed 3 trials of the FTT on separate days, including the RFST with a 3 min stand. Analysis of variance (ANOVA) was performed on the HRV measures. One crewmember completed the FTT before a 14-day mission, on landing day (R+0) and one (R+1) day after returning to Earth. RESULTS VALIDITY: HRV measures reflecting autonomic activity were not significantly different during the 0-3 and 0-5 min segments. RELIABILITY: The average coefficient of variation for RR, systolic (SBP) and diastolic blood pressures during the RFST were less than 8% for the 3 sessions. ANOVA results yielded a greater inter-subject variability (p0.05) for HRV in the RFST. SPACEFLIGHT: Lower RR and higher SBP were observed on R+0 in rest and stand. On R+1, both RR and SBP trended towards preflight
Validating a Spanish Developmental Spelling Test.

Science.gov (United States)

Ferroli, Lou; Krajenta, Marilyn

The creation and validation of a Spanish version of an English developmental spelling test (DST) is described. An introductory section reviews related literature on the rationale for and construction of DSTs, spelling development in the early grades, and Spanish-English bilingual education. Differences between the English and Spanish test versions…
Reliability, Validity and Factor Structure of Drug Abuse Screening Test

Directory of Open Access Journals (Sweden)

Sayed Hadi Sayed Alitabar

2016-05-01

Background and Objective: Given the increase in substance use in the country, more research on this phenomenon is necessary. This study investigates the validity, reliability, and confirmatory factor structure of the Drug Abuse Screening Test (DAST). Materials and Methods: The sample consisted of 381 patients (143 women and 238 men) selected by multi-stage cluster sampling from areas 2, 6, and 12 of Tehran; in each area, 6 drug rehabilitation centers were randomly selected. The DAST was used as the instrument. Divergent and convergent validity of the scale were assessed with the Problems Assessment for Substance Using Psychiatric Patients (PASUPP) and the Relapse Prediction Scale (RPS). Results: The factor structure of the DAST was confirmed using confirmatory factor analysis. The DAST had good internal consistency (Cronbach's alpha) and test-retest reliability within one week (0.9, 0.8). The scale was also positively correlated with the PASUPP and the RPS (P<0.01). Conclusion: The overall results showed that the Drug Abuse Screening Test is valid in Iranian society. As a self-report scale, it is a useful tool for research purposes and in the field of addiction.
Test-retest reliability and predictive validity of the Implicit Association Test in children.

Science.gov (United States)

Rae, James R; Olson, Kristina R

2018-02-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research-lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Testing the Predictive Validity of the Hendrich II Fall Risk Model.

Science.gov (United States)

Jung, Hyesil; Park, Hyeoun-Ae

2018-03-01

Cumulative data on patient fall risk have been compiled in electronic medical records systems, and it is possible to test the validity of fall-risk assessment tools using these data between the times of admission and occurrence of a fall. The Hendrich II Fall Risk Model scores assessed during three time points of hospital stays were extracted and used for testing the predictive validity: (a) upon admission, (b) when the maximum fall-risk score from admission to falling or discharge, and (c) immediately before falling or discharge. Predictive validity was examined using seven predictive indicators. In addition, logistic regression analysis was used to identify factors that significantly affect the occurrence of a fall. Among the different time points, the maximum fall-risk score assessed between admission and falling or discharge showed the best predictive performance. Confusion or disorientation and having a poor ability to rise from a sitting position were significant risk factors for a fall.
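
The abstract above does not name its seven predictive indicators. As an assumption for illustration, the sketch below computes six indicators commonly reported for screening tools from a 2x2 table of predicted risk against observed falls. The function name and the counts in the usage line are hypothetical and are not the study's results.

```python
def predictive_indicators(tp, fp, fn, tn):
    """Common predictive-validity indicators from a 2x2 confusion table.

    tp: flagged high risk and fell      fp: flagged high risk, did not fall
    fn: flagged low risk but fell       tn: flagged low risk, did not fall
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),  # positive predictive value
        "npv": tn / (tn + fn),  # negative predictive value
        "accuracy": (tp + tn) / (tp + fp + fn + tn),
        "youden_j": sensitivity + specificity - 1,
    }


# Hypothetical counts, for illustration only.
indicators = predictive_indicators(tp=40, fp=60, fn=10, tn=90)
```

Computing the same table at each assessment time point (admission, maximum score, last score before the event) would allow the kind of comparison the abstract describes.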
Educational testing validity and reliability in pharmacy and medical education literature.

Science.gov (United States)

Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J

2013-12-16

To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; particles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.
The Sandia MEMS Passive Shock Sensor : FY08 testing for functionality, model validation, and technology readiness.

Energy Technology Data Exchange (ETDEWEB)

Walraven, Jeremy Allen; Blecke, Jill; Baker, Michael Sean; Clemens, Rebecca C.; Mitchell, John Anthony; Brake, Matthew Robert; Epp, David S.; Wittwer, Jonathan W.

2008-10-01

This report summarizes the functional, model validation, and technology readiness testing of the Sandia MEMS Passive Shock Sensor in FY08. Functional testing of a large number of revision 4 parts showed robust and consistent performance. Model validation testing helped tune the models to match data well and identified several areas for future investigation related to high frequency sensitivity and thermal effects. Finally, technology readiness testing demonstrated the integrated elements of the sensor under realistic environments.
Latency-Based and Psychophysiological Measures of Sexual Interest Show Convergent and Concurrent Validity.

Science.gov (United States)

Ó Ciardha, Caoilte; Attard-Johnson, Janice; Bindemann, Markus

2018-04-01

Latency-based measures of sexual interest require additional evidence of validity, as do newer pupil dilation approaches. A total of 102 community men completed six latency-based measures of sexual interest. Pupillary responses were recorded during three of these tasks and in an additional task where no participant response was required. For adult stimuli, there was a high degree of intercorrelation between measures, suggesting that tasks may be measuring the same underlying construct (convergent validity). In addition to being correlated with one another, measures also predicted participants' self-reported sexual interest, demonstrating concurrent validity (i.e., the ability of a task to predict a more validated, simultaneously recorded, measure). Latency-based and pupillometric approaches also showed preliminary evidence of concurrent validity in predicting both self-reported interest in child molestation and viewing pornographic material containing children. Taken together, the study findings build on the evidence base for the validity of latency-based and pupillometric measures of sexual interest.
Software test and validation of wireless sensor nodes used in nuclear power plant

International Nuclear Information System (INIS)

Deng Changjian; Chen Dongyi; Zhang Heng

2015-01-01

The software test and validation of wireless sensor nodes is one of the key approaches to improve or guarantee the reliability of wireless network application in nuclear power plants (NPPs). At first, to validate the software test, some concepts are defined quantitatively, for example the robustness of software, the reliability of software, and the security of software. Then the development tools and simulators of discrete event drive operating system are compared, in order to present robustness, reliability and security of software test approach based on input-output function. Some simple preliminary test results are given to show that different development software can obtain almost same measurement and communication results although the software of special application may be different than normal application. (author)
Cultural Adaptation of the Portuguese Version of the "Sniffin' Sticks" Smell Test: Reliability, Validity, and Normative Data.

Science.gov (United States)

Ribeiro, João Carlos; Simões, João; Silva, Filipe; Silva, Eduardo D; Hummel, Cornelia; Hummel, Thomas; Paiva, António

2016-01-01

The cross-cultural adaptation and validation of the Sniffin`Sticks test for the Portuguese population is described. Over 270 people participated in four experiments. In Experiment 1, 67 participants rated the familiarity of presented odors and seven descriptors of the original test were adapted to a Portuguese context. In Experiment 2, the Portuguese version of Sniffin`Sticks test was administered to 203 healthy participants. Older age, male gender and active smoking status were confirmed as confounding factors. The third experiment showed the validity of the Portuguese version of Sniffin`Sticks test in discriminating healthy controls from patients with olfactory dysfunction. In Experiment 4, the test-retest reliability for both the composite score (r71 = 0.86) and the identification test (r71 = 0.62) was established (pPortuguese version of Sniffin`Sticks test is provided, showing good validity and reliability and effectively distinguishing patients from healthy controls with high sensitivity and specificity. The Portuguese version of Sniffin`Sticks test identification test is a clinically suitable screening tool in routine outpatient Portuguese settings.
Safe and secure South Africa. Vehicle landmine protection validation testing

CSIR Research Space (South Africa)

Reinecke, JD

2008-11-01

The objective of this paper is to provide an overview of vehicle landmine protection validation testing in South Africa. A short history of validation test standards is given, followed by a summary of current open test standards in general use...
Validating safeguards effectiveness given inherently limited test data

International Nuclear Information System (INIS)

Sicherman, A.

1987-01-01

A key issue in designing and evaluating nuclear safeguards systems is how to validate safeguards effectiveness against a spectrum of potential threats. Safeguards effectiveness is measured by a performance indicator such as the probability of defeating an adversary attempting a malevolent act. Effectiveness validation means a testing program that provides sufficient evidence that the performance indicator is at an acceptable level. Traditional statistical techniques can support such a testing program when numerous independent system trials are possible. However, within the safeguards environment, many situations arise for which traditional statistical approaches may be neither feasible nor appropriate. Such situations can occur, for example, when there are obvious constraints on the number of possible tests due to operational impacts and testing costs. Furthermore, these tests are usually simulations (e.g., staged force-on-force exercises) rather than actual tests, and the system is often modified after each test. Under such circumstances, it is difficult to make and justify inferences about system performance by using traditional statistical techniques. In this paper, the authors discuss several alternative quantitative techniques for validating system effectiveness. The techniques include: (1) minimizing the number of required tests using sequential testing; (2) combining data from models, inspections, and exercises using Bayesian statistics to improve inferences about system performance; and (3) using reliability growth and scenario modeling to help specify which safeguards elements and scenarios to test.
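
Technique (2) above, combining information sources with Bayesian statistics, can be illustrated with a conjugate Beta-Binomial update. This is a minimal sketch under stated assumptions, not the authors' procedure: prior belief about the probability of defeating an adversary (for example, from a model) is encoded as a Beta distribution, and a small number of exercise outcomes updates it. All parameter values are hypothetical.

```python
def update_beta(prior_a, prior_b, successes, trials):
    """Posterior Beta parameters after observing binomial test outcomes."""
    if not 0 <= successes <= trials:
        raise ValueError("successes must be between 0 and trials")
    return prior_a + successes, prior_b + (trials - successes)


def beta_mean(a, b):
    """Mean of a Beta(a, b) distribution."""
    return a / (a + b)


# Hypothetical prior from modeling: Beta(8, 2), i.e. prior mean 0.8.
# Hypothetical exercises: the system defeated the adversary in 4 of 5 runs.
post_a, post_b = update_beta(8, 2, successes=4, trials=5)
posterior_mean = beta_mean(post_a, post_b)
```

With only five exercises, the prior still carries most of the weight, which is the point of the approach when tests are scarce; it also means the choice of prior must be justified separately.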
Certification Testing as an Illustration of Argument-Based Validation

Science.gov (United States)

Kane, Michael

2004-01-01

The theories of validity developed over the past 60 years are quite sophisticated, but the methodology of validity is not generally very effective. The validity evidence for major testing programs is typically much weaker than the evidence for more technical characteristics such as reliability. In addition, most validation efforts have a strong…
Development and Validation of a Theoretical Test in Endosonography for Pulmonary Diseases

DEFF Research Database (Denmark)

Savran, Mona M; Clementsen, Paul Frost; Annema, Jouke T

2014-01-01

evidence for this test. METHODS: Initially, 78 questions were constructed after informal conversational interviews with 4 international experts in endosonography. The clarity and content validity of the questions were tested using a Delphi-like approach. Construct validity was explored by administering......BACKGROUND: Theoretical testing provides the necessary foundation to perform technical skills. Additionally, testing improves the retention of knowledge. OBJECTIVES: The aims of this study were to develop a multiple-choice test in endosonography for pulmonary diseases and to gather validity...... consistently than the novices (p = 0.037) and the intermediates (p Validity evidence was gathered, and the test demonstrated content and construct validity....
COMMUNICATIVE VALIDITY OF THE NEW CET-4 LISTENING COMPREHENSION TEST IN CHINA

Directory of Open Access Journals (Sweden)

Chao Wang

2014-07-01

Abstract: Based on the major dimensions of a communicative language test proposed by Bachman, this paper investigates the validity of the new CET-4 listening subtest in China from a communicative point of view. The study uses both qualitative and quantitative methods. The qualitative part is a material analysis of the CET-4 testing syllabus and eight new CET-4 listening comprehension tests. Students' scores on two tests and the questionnaires are analyzed quantitatively. The analysis finds that the new CET-4 listening subtest has high validity and can measure test-takers' listening ability in real communication. First, the new CET-4 listening subtest is reliable. Second, the seven listening skills tested in this subtest can measure the communicative language ability required in the testing syllabus. The intra-correlation analysis shows that each part of the new CET-4 listening subtest focuses on different language abilities related to listening. Third, the authenticity of the new CET-4 listening subtest reaches a satisfactory level. The materials chosen in the test cover various topics and genres. Speakers' pronunciation, tone, and speed are in accordance with real situations. However, some shortcomings exist in the test design and should be addressed. For example, its limited item types cannot represent the task types found in real life, and the actual input is too idealized to be authentic. Keywords: Communicative language ability, communicative language testing, listening comprehension, test validity
The development and validation of a test of science critical thinking for fifth graders.

Science.gov (United States)

Mapeala, Ruslan; Siew, Nyet Moi

2015-01-01

The paper described the development and validation of the Test of Science Critical Thinking (TSCT) to measure the three critical thinking skill constructs: comparing and contrasting, sequencing, and identifying cause and effect. The initial TSCT consisted of 55 multiple choice test items, each of which required participants to select a correct response and a correct choice of critical thinking used for their response. Data were obtained from a purposive sampling of 30 fifth graders in a pilot study carried out in a primary school in Sabah, Malaysia. Students underwent the sessions of teaching and learning activities for 9 weeks using the Thinking Maps-aided Problem-Based Learning Module before they answered the TSCT test. Analyses were conducted to check on difficulty index (p) and discrimination index (d), internal consistency reliability, content validity, and face validity. Analysis of the test-retest reliability data was conducted separately for a group of fifth graders with similar ability. Findings of the pilot study showed that out of initial 55 administered items, only 30 items with relatively good difficulty index (p) ranged from 0.40 to 0.60 and with good discrimination index (d) ranged within 0.20-1.00 were selected. The Kuder-Richardson reliability value was found to be appropriate and relatively high with 0.70, 0.73 and 0.92 for identifying cause and effect, sequencing, and comparing and contrasting respectively. The content validity index obtained from three expert judgments equalled or exceeded 0.95. In addition, test-retest reliability showed good, statistically significant correlations ([Formula: see text]). From the above results, the selected 30-item TSCT was found to have sufficient reliability and validity and would therefore represent a useful tool for measuring critical thinking ability among fifth graders in primary science.
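
The item analysis above relies on a difficulty index (p), a discrimination index (d), and Kuder-Richardson reliability. The sketch below shows one common way to compute them for dichotomously scored items, assuming an upper/lower group split for d and the KR-20 formula; the abstract does not state which variants were used, and the function names are illustrative.

```python
from statistics import pvariance


def item_difficulty(responses):
    """Proportion of examinees answering each item correctly (p)."""
    n = len(responses)
    k = len(responses[0])
    return [sum(row[j] for row in responses) / n for j in range(k)]


def item_discrimination(responses, fraction=0.27):
    """Upper-group minus lower-group proportion correct for each item (d)."""
    ranked = sorted(responses, key=sum, reverse=True)
    g = max(1, round(len(ranked) * fraction))  # size of each extreme group
    upper, lower = ranked[:g], ranked[-g:]
    k = len(responses[0])
    return [
        (sum(r[j] for r in upper) - sum(r[j] for r in lower)) / g
        for j in range(k)
    ]


def kr20(responses):
    """Kuder-Richardson formula 20 for 0/1 scored items."""
    k = len(responses[0])
    p = item_difficulty(responses)
    total_var = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(pi * (1 - pi) for pi in p) / total_var)
```

Each row of `responses` is one examinee's list of 0/1 item scores. Items could then be retained when p and d fall inside chosen bounds, such as the 0.40-0.60 and 0.20-1.00 ranges the abstract reports.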
Standardization, Validity and Reliability Study of Gülhane Aphasia Test-2 (GAT-2)

Directory of Open Access Journals (Sweden)

İlknur Maviş

2007-04-01

The Gülhane Aphasia Test-2 (GAT-2) was developed to detect the presence of the language disorder aphasia and to give the clinician indications of accompanying speech disorders such as apraxia and dysarthria. OBJECTIVE: The aim of the study was to report the standardization, validity, and reliability study of the GAT-2. METHODS: Ten healthy individuals were tested initially for the pilot study. The standardization study included 134 healthy individuals, and the validation study included 30 individuals with aphasia and 11 individuals with right brain injury. Between-group differences in GAT-2 scores and the effects of age, years of education, and sex were examined. GAT-2 cut-off scores were calculated from the scores of healthy individuals. Test-retest reliability and inter-observer reliability of the GAT-2 were calculated. RESULTS: Healthy individuals' GAT-2 scores were significantly different from those of aphasic patients, but not from those of right brain injured patients. Healthy individuals' GAT-2 scores were not affected by sex or age but were affected by years of education, so cut-off scores were calculated according to this variable. GAT-2 scores of aphasic patients were not affected by age, sex, or years of education. Test-retest reliability, inter-observer reliability, and internal consistency results showed that the GAT-2 is a highly reliable aphasia screening test. CONCLUSION: The GAT-2 was found to be a standardized, highly reliable, and valid aphasia test for Turkish stroke patients with aphasia.

VALIDITY OF THE MODIFIED CONCONI TEST FOR DETERMINING VENTILATORY THRESHOLD DURING ON-WATER ROWING

Directory of Open Access Journals (Sweden)

Jorge Villamil Cabo

2011-12-01

The objectives of this study were to design a field test based on the Conconi protocol to determine the ventilatory threshold of rowers and to test its reliability and validity. A group of sixteen oarsmen completed a modified Conconi test for on-water rowing. The reliability of heart rate threshold detection was evaluated using the heart rate breaking point in the Conconi test and retest. The heart rate threshold was detected in 88.8% of cases in the test-retest. The validity of the modified Conconi test was evaluated by comparing the heart rate threshold data with those obtained in a ventilatory threshold test (VT2). No significant differences were found between the heart rate threshold and the ventilatory threshold for the intensity parameters heart rate (HR), oxygen consumption (VO2), stroke rate (SR), and speed (S): 170.9 ± 6.8 vs. 169.3 ± 6.4 beats·min⁻¹; 42.0 ± 8.6 vs. 43.5 ± 8.3 ml·kg⁻¹·min⁻¹; 25.8 ± 3.3 vs. 27.0 ± 3.2 strokes·min⁻¹; and 14.4 ± 0.8 vs. 14.6 ± 0.8 km·h⁻¹. The differences in averages obtained in the Conconi test-retest were small, with a low standard error of the mean. The reliability data for the Conconi test-retest showed low coefficients of variation (CV) and high intraclass correlation coefficients (ICC). The total errors for the Conconi test-retest were low for the measured variables (1.31 HR, 0.87 VO2, 0.65 SR, and 0.1 S). Bland-Altman analysis of validity showed strong concordance for the analyzed variables. We conclude that the modified Conconi test for on-water rowing is a valid and reliable method for the determination of the second ventilatory threshold (VT2).
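
The Bland-Altman analysis mentioned above compares two measurement methods through the mean of their paired differences (bias) and the 95% limits of agreement, bias ± 1.96 × SD of the differences. The sketch below is a generic illustration of that calculation; the paired heart rate values in the usage lines are invented for the example and are not the study's data.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical paired threshold heart rates (beats per minute).
field_test = [170, 172, 168, 175]
lab_test = [169, 170, 169, 173]
bias, lower, upper = bland_altman(field_test, lab_test)
```

Narrow limits of agreement centered near zero are what "strong concordance" usually means in this kind of comparison.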
Five-Kilometers Time Trial: Preliminary Validation of a Short Test for Cycling Performance Evaluation.

Science.gov (United States)

Dantas, Jose Luiz; Pereira, Gleber; Nakamura, Fabio Yuzo

2015-09-01

The five-kilometer time trial (TT5km) has been used to assess aerobic endurance performance without further investigation of its validity. This study aimed to perform a preliminary validation of the TT5km to rank well-trained cyclists based on aerobic endurance fitness and assess changes of the aerobic endurance performance. After the incremental test, 20 cyclists (age = 31.3 ± 7.9 years; body mass index = 22.7 ± 1.5 kg/m(2); maximal aerobic power = 360.5 ± 49.5 W) performed the TT5km twice, collecting performance (time to complete, absolute and relative power output, average speed) and physiological responses (heart rate and electromyography activity). The validation criteria were pacing strategy, absolute and relative reliability, validity, and sensitivity. Sensitivity index was obtained from the ratio between the smallest worthwhile change and typical error. The TT5km showed high absolute (coefficient of variation 0.95) reliability of performance variables, whereas it presented low reliability of physiological responses. The TT5km performance variables were highly correlated with the aerobic endurance indices obtained from incremental test (r > 0.70). These variables showed adequate sensitivity index (> 1). TT5km is a valid test to rank the aerobic endurance fitness of well-trained cyclists and to differentiate changes on aerobic endurance performance. Coaches can detect performance changes through either absolute (± 17.7 W) or relative power output (± 0.3 W.kg(-1)), the time to complete the test (± 13.4 s) and the average speed (± 1.0 km.h(-1)). Furthermore, TT5km performance can also be used to rank the athletes according to their aerobic endurance fitness.
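The sensitivity index described in this abstract is a simple ratio, sketched below. The function name and the numeric inputs are hypothetical; only the definition (smallest worthwhile change divided by typical error) and the "greater than 1" reading come from the abstract.

```python
def sensitivity_index(smallest_worthwhile_change: float, typical_error: float) -> float:
    """Ratio of the smallest worthwhile change to the typical error."""
    if typical_error <= 0:
        raise ValueError("typical_error must be positive")
    return smallest_worthwhile_change / typical_error


# Hypothetical values in watts (not taken from the study).
swc_watts = 6.0
te_watts = 4.0

index = sensitivity_index(swc_watts, te_watts)
# An index above 1 means a worthwhile change exceeds the measurement noise.
print(f"sensitivity index = {index:.2f}, adequate = {index > 1}")
```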
Microscale validation of 4-aminoantipyrine test method for quantifying phenolic compounds in microbial culture

International Nuclear Information System (INIS)

Justiz Mendoza, Ibrahin; Aguilera Rodriguez, Isabel; Perez Portuondo, Irasema

2014-01-01

Microscale validation of test methods is currently of great importance because of its economic and environmental advantages, and it is a prerequisite for the provision of services and for assuring the quality of the results delivered to the customer. This paper addresses the microscale validation of the 4-aminoantipyrine spectrophotometric method for the quantification of phenolic compounds in culture medium. The parameters linearity, precision, regression, accuracy, detection limits, quantification limits and robustness were evaluated, in addition to a comparison test with a non-standardized method for determining polyphenols (Folin-Ciocalteu). The results showed that both methods are feasible for determining phenols
Coverage of the Test of Memory Malingering, Victoria Symptom Validity Test, and Word Memory Test on the Internet: is test security threatened?

Science.gov (United States)

Bauer, Lyndsey; McCaffrey, Robert J

2006-01-01

In forensic neuropsychological settings, maintaining test security has become critically important, especially in regard to symptom validity tests (SVTs). Coaching, which can entail providing patients or litigants with information about the cognitive sequelae of head injury, or teaching them test-taking strategies to avoid detection of symptom dissimulation has been examined experimentally in many research studies. Emerging evidence supports that coaching strategies affect psychological and neuropsychological test performance to differing degrees depending on the coaching paradigm and the tests administered. The present study sought to examine Internet coverage of SVTs because it is potentially another source of coaching, or information that is readily available. Google searches were performed on the Test of Memory Malingering, the Victoria Symptom Validity Test, and the Word Memory Test. Results indicated that there is a variable amount of information available about each test that could threaten test security and validity should inappropriately interested parties find it. Steps that could be taken to improve this situation and limitations to this exploration are discussed.
Validation of Clinical Testing for Warfarin Sensitivity

Science.gov (United States)

Langley, Michael R.; Booker, Jessica K.; Evans, James P.; McLeod, Howard L.; Weck, Karen E.

2009-01-01

Responses to warfarin (Coumadin) anticoagulation therapy are affected by genetic variability in both the CYP2C9 and VKORC1 genes. Validation of pharmacogenetic testing for warfarin responses includes demonstration of analytical validity of testing platforms and of the clinical validity of testing. We compared four platforms for determining the relevant single nucleotide polymorphisms (SNPs) in both CYP2C9 and VKORC1 that are associated with warfarin sensitivity (Third Wave Invader Plus, ParagonDx/Cepheid Smart Cycler, Idaho Technology LightCycler, and AutoGenomics Infiniti). Each method was examined for accuracy, cost, and turnaround time. All genotyping methods demonstrated greater than 95% accuracy for identifying the relevant SNPs (CYP2C9 *2 and *3; VKORC1 −1639 or 1173). The ParagonDx and Idaho Technology assays had the shortest turnaround and hands-on times. The Third Wave assay was readily scalable to higher test volumes but had the longest hands-on time. The AutoGenomics assay interrogated the largest number of SNPs but had the longest turnaround time. Four published warfarin-dosing algorithms (Washington University, UCSF, Louisville, and Newcastle) were compared for accuracy for predicting warfarin dose in a retrospective analysis of a local patient population on long-term, stable warfarin therapy. The predicted doses from both the Washington University and UCSF algorithms demonstrated the best correlation with actual warfarin doses. PMID:19324988
Validating High-Stakes Testing Programs.

Science.gov (United States)

Kane, Michael

2002-01-01

Makes the point that the interpretations and use of high-stakes test scores rely on policy assumptions about what should be taught and the content standards and performance standards that should be applied. The assumptions built into an assessment need to be subjected to scrutiny and criticism if a strong case is to be made for the validity of the…
Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

Science.gov (United States)

Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

2015-12-01

To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.
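The relationship between the SEM and SDC figures reported above can be sketched as follows. This assumes the widely used 95% convention, SDC = 1.96 × √2 × SEM; the abstract does not state which formula the authors applied, so the function below is an illustration rather than their method.

```python
import math


def smallest_detectable_change(sem: float, z: float = 1.96) -> float:
    """SDC under the assumed convention SDC = z * sqrt(2) * SEM."""
    if sem < 0:
        raise ValueError("sem must be non-negative")
    return z * math.sqrt(2) * sem


# SEM values as reported in the abstract (already rounded to one decimal).
for name, sem in [("PSST", 5.5), ("POSS", 6.8)]:
    print(f"{name}: SEM = {sem}, SDC = {smallest_detectable_change(sem):.2f}")
```

Under this assumption the computed values fall within about 0.1 of the reported SDCs (15.3 and 18.8); the small gap would be expected from rounding of the SEM inputs.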
Validity and Reliability of the Arabic Token Test for Children

Science.gov (United States)

Alkhamra, Rana A.; Al-Jazi, Aya B.

2016-01-01

Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…
Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

Science.gov (United States)

Badjadi, Nour El Imane

2013-01-01

The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…
Construction of Valid and Reliable Test for Assessment of Students

Science.gov (United States)

Osadebe, P. U.

2015-01-01

The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…
POLYGON - A NEW FUNDAMENTAL MOVEMENT SKILLS TEST FOR 8 YEAR OLD CHILDREN: CONSTRUCTION AND VALIDATION

Directory of Open Access Journals (Sweden)

Frane Zuvela

2011-03-01

Full Text Available Inadequately adopted fundamental movement skills (FMS) in early childhood may have a negative impact on the motor performance in later life (Gallahue and Ozmun, 2005). The need for an efficient FMS testing in Physical Education was recognized. The aim of this paper was to construct and validate a new FMS test for 8 year old children. Ninety-five 8 year old children were used for the testing. A total of 24 new FMS tasks were constructed and only the best representatives of movement areas entered into the final test product - FMS-POLYGON. The ICC showed high values for all 24 tasks (0.83-0.97) and the factorial analysis revealed the best representatives of each movement area that entered the FMS-POLYGON: tossing and catching the volleyball against a wall, running across obstacles, carrying the medicine balls, and straight running. The ICC for the FMS-POLYGON showed a very high result (0.98) and, therefore, confirmed the test's intra-rater reliability. Concurrent validity was tested with the use of the "Test of Gross Motor Development" (TGMD-2). Correlation analysis between the newly constructed FMS-POLYGON and the TGMD-2 revealed the coefficient of -0.82 which indicates a high correlation. In conclusion, the new test for FMS assessment proved to be a reliable and valid instrument for 8 year old children. Application of this test in schools is justified and could play an important factor in physical education and sport practice.
Testing the Predictive Validity and Construct of Pathological Video Game Use

Science.gov (United States)

Groves, Christopher L.; Gentile, Douglas; Tapscott, Ryan L.; Lynch, Paul J.

2015-01-01

Three studies assessed the construct of pathological video game use and tested its predictive validity. Replicating previous research, Study 1 produced evidence of convergent validity in 8th and 9th graders (N = 607) classified as pathological gamers. Study 2 replicated and extended the findings of Study 1 with college undergraduates (N = 504). Predictive validity was established in Study 3 by measuring cue reactivity to video games in college undergraduates (N = 254), such that pathological gamers were more emotionally reactive to and provided higher subjective appraisals of video games than non-pathological gamers and non-gamers. The three studies converged to show that pathological video game use seems similar to other addictions in its patterns of correlations with other constructs. Conceptual and definitional aspects of Internet Gaming Disorder are discussed. PMID:26694472
Testing the Predictive Validity and Construct of Pathological Video Game Use

Directory of Open Access Journals (Sweden)

Christopher L. Groves

2015-12-01

Full Text Available Three studies assessed the construct of pathological video game use and tested its predictive validity. Replicating previous research, Study 1 produced evidence of convergent validity in 8th and 9th graders (N = 607) classified as pathological gamers. Study 2 replicated and extended the findings of Study 1 with college undergraduates (N = 504). Predictive validity was established in Study 3 by measuring cue reactivity to video games in college undergraduates (N = 254), such that pathological gamers were more emotionally reactive to and provided higher subjective appraisals of video games than non-pathological gamers and non-gamers. The three studies converged to show that pathological video game use seems similar to other addictions in its patterns of correlations with other constructs. Conceptual and definitional aspects of Internet Gaming Disorder are discussed.
Development of a test rig and its application for validation and reliability testing of safety-critical software

Energy Technology Data Exchange (ETDEWEB)

Thai, N D; McDonald, A M [Atomic Energy of Canada Ltd., Mississauga, ON (Canada)

1996-12-31

This paper describes a versatile test rig developed by AECL for functional testing of safety-critical software used in the process trip computers of the Wolsong CANDU stations. The description covers the hardware and software aspects of the test rig, the test language and its interpreter, and other major testing software utilities such as the test oracle, sampler and profiler. The paper also discusses the application of the rig in the final stages of testing of the process trip computer software, namely validation and reliability tests. It shows how random test cases are generated, test scripts prepared and automatically run on the test rig. The versatility of the rig is further demonstrated in other types of testing such as sub-system tests, verification of the test oracle, testing of newly-developed test script, self-test and calibration. (author). 5 tabs., 10 figs.
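The idea of running randomly generated test cases against a test oracle can be sketched in a few lines. Everything here is hypothetical: the setpoint, the stand-in trip function and the oracle are invented for illustration and do not describe the AECL rig, its test language or the actual trip computer logic.

```python
import random

TRIP_SETPOINT = 100.0  # hypothetical setpoint, arbitrary units


def trip_under_test(signal: float) -> bool:
    """Stand-in for the software being tested."""
    return signal >= TRIP_SETPOINT


def oracle(signal: float) -> bool:
    """Independently written reference that states the expected output."""
    return not (signal < TRIP_SETPOINT)


def run_random_tests(n_cases: int, seed: int = 0) -> list:
    """Generate random inputs and return those where the outputs disagree."""
    rng = random.Random(seed)  # seeded so a failing run can be reproduced
    failures = []
    for _ in range(n_cases):
        signal = rng.uniform(0.0, 200.0)
        if trip_under_test(signal) != oracle(signal):
            failures.append(signal)
    return failures


failures = run_random_tests(1000)
print(f"{len(failures)} disagreements in 1000 random cases")
```

Seeding the generator is a design choice that keeps a random campaign repeatable, which matters when a disagreement has to be investigated later.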
Development of a test rig and its application for validation and reliability testing of safety-critical software

International Nuclear Information System (INIS)

Thai, N.D.; McDonald, A.M.

1995-01-01

This paper describes a versatile test rig developed by AECL for functional testing of safety-critical software used in the process trip computers of the Wolsong CANDU stations. The description covers the hardware and software aspects of the test rig, the test language and its interpreter, and other major testing software utilities such as the test oracle, sampler and profiler. The paper also discusses the application of the rig in the final stages of testing of the process trip computer software, namely validation and reliability tests. It shows how random test cases are generated, test scripts prepared and automatically run on the test rig. The versatility of the rig is further demonstrated in other types of testing such as sub-system tests, verification of the test oracle, testing of newly-developed test script, self-test and calibration. (author). 5 tabs., 10 figs
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

Science.gov (United States)

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (Test possessed low CV and significant (pTest possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
BENDER GESTALT VISUALMOTOR TEST AND CARAS TEST: A EXAM OF CONSTRUCT VALIDITY

Directory of Open Access Journals (Sweden)

Cesar Merino Soto

2011-12-01

Full Text Available Research with new versions of the Bender Gestalt Test (TGB) has hardly attracted the attention of researchers in the Hispanic world, considering that this test is one of the most widely used psychological assessments. This study evaluates the construct validity of the modified version of the TGB for children, relative to sustained attention assessed by the Caras Test. Both tests were applied to 90 children, aged between 5 and 8, under standardized conditions. The results indicate that the shared variance between the two measures is zero, even when correlations disattenuated for measurement error were applied; also, no non-linear patterns were detected between the two variables. These correlations were consistent in the total sample and among subgroups of children. We discuss these results with respect to the limits of validity of this modified version of the TGB in the Spanish language.
Vertical jumping tests in volleyball: reliability, validity, and playing-position specifics.

Science.gov (United States)

Sattler, Tine; Sekulic, Damir; Hadzic, Vedran; Uljevic, Ognjen; Dervisevic, Edvin

2012-06-01

Vertical jumping is known to be important in volleyball, and jumping performance tests are frequently studied for their reliability and validity. However, most studies concerning jumping in volleyball have dealt with standard rather than sport-specific jumping procedures and tests. The aims of this study, therefore, were (a) to determine the reliability and factorial validity of 2 volleyball-specific jumping tests, the block jump (BJ) test and the attack jump (AJ) test, relative to 2 frequently used and systematically validated jumping tests, the countermovement jump test and the squat jump test and (b) to establish volleyball position-specific differences in the jumping tests and simple anthropometric indices (body height [BH], body weight, and body mass index [BMI]). The BJ was performed from a defensive volleyball position, with the hands positioned in front of the chest. During an AJ, the players used a 2- to 3-step approach and performed a drop jump with an arm swing followed by a quick vertical jump. A total of 95 high-level volleyball players (all men) participated in this study. The reliability of the jumping tests ranged from 0.97 to 0.99 for Cronbach's alpha coefficients, from 0.93 to 0.97 for interitem correlation coefficients and from 2.1 to 2.8 for coefficients of variation. The highest reliability was found for the specific jumping tests. The factor analysis extracted one significant component, and all of the tests were highly intercorrelated. The analysis of variance with post hoc analysis showed significant differences between 5 playing positions in some of the jumping tests. In general, receivers had a greater jumping capacity, followed by libero players. The differences in jumping capacities should be emphasized vis-a-vis differences in the anthropometric measures of players, where middle hitters had higher BH and body weight, followed by opposite hitters and receivers, with no differences in the BMI between positions.
POLYGON - A New Fundamental Movement Skills Test for 8 Year Old Children: Construction and Validation.

Science.gov (United States)

Zuvela, Frane; Bozanic, Ana; Miletic, Durdica

2011-01-01

Inadequately adopted fundamental movement skills (FMS) in early childhood may have a negative impact on the motor performance in later life (Gallahue and Ozmun, 2005). The need for an efficient FMS testing in Physical Education was recognized. The aim of this paper was to construct and validate a new FMS test for 8 year old children. Ninety-five 8 year old children were used for the testing. A total of 24 new FMS tasks were constructed and only the best representatives of movement areas entered into the final test product - FMS-POLYGON. The ICC showed high values for all 24 tasks (0.83-0.97) and the factorial analysis revealed the best representatives of each movement area that entered the FMS-POLYGON: tossing and catching the volleyball against a wall, running across obstacles, carrying the medicine balls, and straight running. The ICC for the FMS-POLYGON showed a very high result (0.98) and, therefore, confirmed the test's intra-rater reliability. Concurrent validity was tested with the use of the "Test of Gross Motor Development" (TGMD-2). Correlation analysis between the newly constructed FMS-POLYGON and the TGMD-2 revealed the coefficient of -0.82 which indicates a high correlation. In conclusion, the new test for FMS assessment proved to be a reliable and valid instrument for 8 year old children. Application of this test in schools is justified and could play an important factor in physical education and sport practice. Key points: All 21 newly constructed tasks demonstrated high intra-rater reliability (0.83-0.97) in FMS assessment. High reliability was also noted in the FMS-POLYGON test (0.98). A high correlation was found between the FMS-POLYGON and TGMD-2 which is a confirmation of the new test's concurrent validity. The research resolved the problem of long and detailed FMS assessment by adding a new dimension using quick and effective norm-referenced approach but also covering all the most important movement areas. New and validated test can be of great use
[Validation of three screening tests used for early detection of cervical cancer].

Science.gov (United States)

Rodriguez-Reyes, Esperanza Rosalba; Cerda-Flores, Ricardo M; Quiñones-Pérez, Juan M; Cortés-Gutiérrez, Elva I

2008-01-01

To evaluate the validity (sensitivity, specificity, and accuracy) of three screening methods used in the early detection of cervical carcinoma against the histopathological diagnosis. A selected sample of 107 women who attended the Opportune Detection of Cervicouterine Cancer Program at the Hospital de Zona 46, Instituto Mexicano del Seguro Social, in Durango during 2003 was included. The Papanicolaou test, the acetic acid test, molecular detection of human papillomavirus, and histopathological diagnosis were performed in all patients at the time of the gynecological exam. Detection and typing of human papillomavirus were performed by polymerase chain reaction (PCR) and restriction fragment length polymorphism (RFLP) analysis. Histopathological diagnosis was considered the gold standard. Validity was evaluated by the Bayesian method for diagnostic tests. The positive cases for the acetic acid test, Papanicolaou, and PCR were 47, 22, and 19, respectively. The accuracy values were 0.70, 0.80 and 0.99, respectively. Since the molecular method showed greater validity in the early detection of cervical carcinoma, we consider its implementation in suitable programs of the Opportune Detection of Cervicouterine Cancer Program in Mexico to be of vital importance. However, in order to validate this conclusion, cross-sectional studies in different regions of the country must be carried out.
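The three validity measures named in this abstract can be computed from a 2×2 table of screening results against the gold standard. The sketch below uses the standard textbook definitions rather than the Bayesian procedure the authors mention, and the counts are hypothetical; they are not the study's data.

```python
def diagnostic_validity(tp: int, fp: int, fn: int, tn: int) -> dict:
    """Sensitivity, specificity and accuracy from a 2x2 table.

    tp/fn: gold-standard positives that screened positive/negative.
    fp/tn: gold-standard negatives that screened positive/negative.
    """
    total = tp + fp + fn + tn
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one diseased and one healthy case")
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "accuracy": (tp + tn) / total,
    }


# Hypothetical counts (illustrative only).
result = diagnostic_validity(tp=40, fp=5, fn=10, tn=45)
for name, value in result.items():
    print(f"{name}: {value:.2f}")
```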

Modeling Run Test Validity: A Meta-Analytic Approach

National Research Council Canada - National Science Library

Vickers, Ross

2002-01-01

.... This study utilized data from 166 samples (N = 5,757) to test the general hypothesis that differences in testing methods could account for the cross-situational variation in validity. Only runs >2 km...
'Mechanical restraint-confounders, risk, alliance score': testing the clinical validity of a new risk assessment instrument.

Science.gov (United States)

Deichmann Nielsen, Lea; Bech, Per; Hounsgaard, Lise; Alkier Gildberg, Frederik

2017-08-01

Unstructured risk assessment, as well as confounders (underlying reasons for the patient's risk behaviour and alliance), risk behaviour, and parameters of alliance, have been identified as factors that prolong the duration of mechanical restraint among forensic mental health inpatients. To clinically validate a new, structured short-term risk assessment instrument called the Mechanical Restraint-Confounders, Risk, Alliance Score (MR-CRAS), with the intended purpose of supporting the clinicians' observation and assessment of the patient's readiness to be released from mechanical restraint. The content and layout of MR-CRAS and its user manual were evaluated using face validation by forensic mental health clinicians, content validation by an expert panel, and pilot testing within two, closed forensic mental health inpatient units. The three sub-scales (Confounders, Risk, and a parameter of Alliance) showed excellent content validity. The clinical validations also showed that MR-CRAS was perceived and experienced as a comprehensible, relevant, comprehensive, and useable risk assessment instrument. MR-CRAS contains 18 clinically valid items, and the instrument can be used to support the clinical decision-making regarding the possibility of releasing the patient from mechanical restraint. The present three studies have clinically validated a short MR-CRAS scale that is currently being psychometrically tested in a larger study.
Symptom validity testing in memory clinics: Hippocampal-memory associations and relevance for diagnosing mild cognitive impairment.

Science.gov (United States)

Rienstra, Anne; Groot, Paul F C; Spaan, Pauline E J; Majoie, Charles B L M; Nederveen, Aart J; Walstra, Gerard J M; de Jonghe, Jos F M; van Gool, Willem A; Olabarriaga, Silvia D; Korkhov, Vladimir V; Schmand, Ben

2013-01-01

Patients with mild cognitive impairment (MCI) do not always convert to dementia. In such cases, abnormal neuropsychological test results may not validly reflect cognitive symptoms due to brain disease, and the usual brain-behavior relationships may be absent. This study examined symptom validity in a memory clinic sample and its effect on the associations between hippocampal volume and memory performance. Eleven of 170 consecutive patients (6.5%; 13% of patients younger than 65 years) referred to memory clinics showed noncredible performance on symptom validity tests (SVTs, viz. Word Memory Test and Test of Memory Malingering). They were compared to a demographically matched group (n = 57) selected from the remaining patients. Hippocampal volume, measured by an automated volumetric method (Freesurfer), was correlated with scores on six verbal memory tests. The median correlation was r = .49 in the matched group. However, the relation was absent (median r = -.11) in patients who failed SVTs. Memory clinic samples may include patients who show noncredible performance, which invalidates their MCI diagnosis. This underscores the importance of applying SVTs in evaluating patients with cognitive complaints that may signify a predementia stage, especially when these patients are relatively young.
The validation of Huffaz Intelligence Test (HIT)

Science.gov (United States)

Rahim, Mohd Azrin Mohammad; Ahmad, Tahir; Awang, Siti Rahmah; Safar, Ajmain

2017-08-01

In general, a hafiz who can memorize the Quran has many specialties especially in respect to their academic performances. In this study, the theory of multiple intelligences introduced by Howard Gardner is embedded in a developed psychometric instrument, namely Huffaz Intelligence Test (HIT). This paper presents the validation and the reliability of HIT of some tahfiz students in Malaysia Islamic schools. A pilot study was conducted involving 87 huffaz who were randomly selected to answer the items in HIT. The analysis method used includes Partial Least Square (PLS) on reliability, convergence and discriminant validation. The study has validated nine intelligences. The findings also indicated that the composite reliabilities for the nine types of intelligences are greater than 0.8. Thus, the HIT is a valid and reliable instrument to measure the multiple intelligences among huffaz.
Construct validity of the Free and Cued Selective Reminding Test in older adults with memory complaints.

Science.gov (United States)

Clerici, Francesca; Ghiretti, Roberta; Di Pucchio, Alessandra; Pomati, Simone; Cucumo, Valentina; Marcone, Alessandra; Vanacore, Nicola; Mariani, Claudio; Cappa, Stefano Francesco

2017-06-01

The Free and Cued Selective Reminding Test (FCSRT) is the memory test recommended by the International Working Group on Alzheimer's disease (AD) for the detection of amnestic syndrome of the medial temporal type in prodromal AD. Assessing the construct validity and internal consistency of the Italian version of the FCSRT is thus crucial. The FCSRT was administered to 338 community-dwelling participants with memory complaints (57% females, age 74.5 ± 7.7 years), including 34 with AD, 203 with Mild Cognitive Impairment, and 101 with Subjective Memory Impairment. Internal Consistency was estimated using Cronbach's alpha coefficient. To assess convergent validity, five FCSRT scores (Immediate Free Recall, Immediate Total Recall, Delayed Free Recall, Delayed Total Recall, and Index of Sensitivity of Cueing) were correlated with three well-validated memory tests: Story Recall, Rey Auditory Verbal Learning test, and Rey Complex Figure (RCF) recall (partial correlation analysis). To assess divergent validity, a principal component analysis (an exploratory factor analysis) was performed including, in addition to the above-mentioned memory tasks, the following tests: Word Fluencies, RCF copy, Clock Drawing Test, Trail Making Test, Frontal Assessment Battery, Raven Coloured Progressive Matrices, and Stroop Colour-Word Test. Cronbach's alpha coefficients for immediate recalls (IFR and ITR) and delayed recalls (DFR and DTR) were, respectively, .84 and .81. All FCSRT scores were highly correlated with those of the three well-validated memory tests. The factor analysis showed that the FCSRT does not load on the factors saturated by non-memory tests. These findings indicate that the FCSRT has a good internal consistency and has an excellent construct validity as an episodic memory measure. © 2015 The British Psychological Society.
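Cronbach's alpha, used above to estimate internal consistency, can be sketched with the standard library alone. The formula is the usual one, α = k/(k−1) × (1 − Σ item variances / variance of total scores); the score matrix is hypothetical and unrelated to the FCSRT data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha; scores has one row per respondent, one column per item."""
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: 4 respondents x 3 items (illustrative only).
scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
]
alpha = cronbach_alpha(scores)
print(f"Cronbach's alpha = {alpha:.3f}")
```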
Testing the Validity of a Cognitive Behavioral Model for Gambling Behavior.

Science.gov (United States)

Raylu, Namrata; Oei, Tian Po S; Loo, Jasmine M Y; Tsai, Jung-Shun

2016-06-01

Currently, cognitive behavioral therapies appear to be among the most studied treatments for gambling problems, and studies show they are effective in treating gambling problems. However, cognitive behavioral models have not been widely tested using statistical means. Thus, the aim of this study was to test the validity of the pathways postulated in the cognitive behavioral theory of gambling behavior using structural equation modeling (AMOS 20). Several questionnaires assessing a range of gambling specific variables (e.g., gambling urges, cognitions and behaviors) and gambling correlates (e.g., psychological states, and coping styles) were distributed to 969 participants from the community. Results showed that negative psychological states (i.e., depression, anxiety and stress) only directly predicted gambling behavior, whereas gambling urges predicted gambling behavior directly as well as indirectly via gambling cognitions. Avoidance coping predicted gambling behavior only indirectly via gambling cognitions. Negative psychological states were significantly related to gambling cognitions as well as avoidance coping. In addition, significant gender differences were found. The results provided confirmation for the validity of the pathways postulated in the cognitive behavioral theory of gambling behavior. They also highlighted the importance of gender differences in conceptualizing gambling behavior.
Validation of Helicopter Gear Condition Indicators Using Seeded Fault Tests

Science.gov (United States)

Dempsey, Paula; Brandon, E. Bruce

2013-01-01

A "seeded fault test" in support of a rotorcraft condition based maintenance program (CBM) is an experiment in which a component is tested with a known fault while health monitoring data is collected. These tests are performed at operating conditions comparable to operating conditions the component would be exposed to while installed on the aircraft. Performance of seeded fault tests is one method used to provide evidence that a Health Usage Monitoring System (HUMS) can replace current maintenance practices required for aircraft airworthiness. Actual in-service experience of the HUMS detecting a component fault is another validation method. This paper will discuss a hybrid validation approach that combines in-service data with seeded fault tests. For this approach, existing in-service HUMS flight data from a naturally occurring component fault will be used to define a component seeded fault test. An example, using spiral bevel gears as the targeted component, will be presented. Since the U.S. Army has begun to develop standards for using seeded fault tests for HUMS validation, the hybrid approach will be mapped to the steps defined within their Aeronautical Design Standard Handbook for CBM. This paper will step through their defined processes, and identify additional steps that may be required when using component test rig fault tests to demonstrate helicopter CI performance. The discussion within this paper will provide the reader with a better appreciation for the challenges faced when defining a seeded fault test for HUMS validation.
Test anxiety and the validity of cognitive tests: A confirmatory factor analysis perspective and some empirical findings

NARCIS (Netherlands)

Wicherts, J.M.; Zand Scholten, A.

2010-01-01

The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion related validity (CRV) of tests was the topic of a recent study by
Content Validity Index and Intra- and Inter-Rater Reliability of a New Muscle Strength/Endurance Test Battery for Swedish Soldiers.

Directory of Open Access Journals (Sweden)

Helena Larsson

Full Text Available The objective of this study was to examine the content validity of commonly used muscle performance tests in military personnel and to investigate the reliability of a proposed test battery. For the content validity investigation, thirty selected tests were those described in the literature and/or commonly used in the Nordic and North Atlantic Treaty Organization (NATO) countries. Nine selected experts rated, on a four-point Likert scale, the relevance of these tests in relation to five different work tasks: lifting, carrying equipment on the body or in the hands, climbing, and digging. Thereafter, a content validity index (CVI) was calculated for each work task. The result showed excellent CVI (≥0.78) for sixteen tests, which comprised one or more of the military work tasks. Three of the tests, the functional lower-limb loading test (the Ranger test), dead-lift with kettlebells, and back extension, showed excellent content validity for four of the work tasks. For the development of a new muscle strength/endurance test battery, these three tests were further supplemented with two other tests, namely, the chins and side-bridge test. The inter-rater reliability was high (intraclass correlation coefficient, ICC2,1 0.99) for all five tests. The intra-rater reliability was good to high (ICC3,1 0.82-0.96) with an acceptable standard error of mean (SEM), except for the side-bridge test (SEM%>15). Thus, the final suggested test battery for a valid and reliable evaluation of soldiers' muscle performance comprised the following four tests: the Ranger test, dead-lift with kettlebells, chins, and back extension test. The criterion-related validity of the test battery should be further evaluated for soldiers exposed to varying physical workload.
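As a rough illustration of how a content validity index can be computed from expert ratings on a four-point scale, the sketch below uses one common convention: the CVI is the proportion of experts who rate the item 3 or 4. That convention, the function name `content_validity_index`, and the ratings are assumptions for illustration; the abstract does not state the exact scoring rule used in the study.

```python
def content_validity_index(ratings, relevant_from=3):
    """Proportion of experts whose rating counts as 'relevant'.

    Assumes a four-point scale where ratings >= relevant_from
    (3 or 4 by default) are treated as relevant.
    """
    if not ratings:
        raise ValueError("no ratings supplied")
    relevant = sum(1 for r in ratings if r >= relevant_from)
    return relevant / len(ratings)


# Hypothetical ratings from nine experts for one test and one work task.
ratings_lifting = [4, 4, 3, 4, 3, 4, 2, 4, 3]
cvi = content_validity_index(ratings_lifting)
# Compare against the 0.78 threshold after rounding to two decimals,
# because 7 of 9 experts gives 0.777..., which is usually reported as 0.78.
is_excellent = round(cvi, 2) >= 0.78
print(f"CVI = {cvi:.2f}, excellent: {is_excellent}")
```

With nine raters, this rounding choice means that at least seven experts must rate a test as relevant for it to reach the threshold quoted in the abstract.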
ASTM Validates Air Pollution Test Methods

Science.gov (United States)

Chemical and Engineering News, 1973

1973-01-01

The American Society for Testing and Materials (ASTM) has validated six basic methods for measuring pollutants in ambient air as the first part of its Project Threshold. Aim of the project is to establish nationwide consistency in measuring pollutants; determining precision, accuracy and reproducibility of 35 standard measuring methods. (BL)
Criterion and convergent validity of the Montreal cognitive assessment with screening and standardized neuropsychological testing.

Science.gov (United States)

Lam, Benjamin; Middleton, Laura E; Masellis, Mario; Stuss, Donald T; Harry, Robin D; Kiss, Alex; Black, Sandra E

2013-12-01

To compare the validity of the Montreal Cognitive Assessment (MoCA) with the criterion standard of standardized neuropsychological testing and to compare the convergent validity of the MoCA with that of existing screening tools and global measures of cognition. Cross-sectional observational study. Tertiary care hospital-based cognitive neurology subspecialty clinic. A convenience sample of 107 individuals with mild Alzheimer's disease (AD, n=75) or mild cognitive impairment (MCI, n=32) from the Sunnybrook Dementia Study. In addition to the MoCA, all participants completed the Mini-Mental State Examination (MMSE), the Mattis Dementia Rating Scale (DRS), and detailed neuropsychological testing. Convergent validity was supported, with MoCA scores correlating well with the MMSE (correlation coefficient (r)=0.66, P<…) … Criterion validity was supported, with MoCA subscores according to cognitive domain correlating well with analogous neuropsychological tests and, in the case of memory (area under the receiver operating characteristic curve (AUC)=0.86), executive (AUC=0.79), and visuospatial function (AUC=0.79), being reasonably sensitive to impairment in those domains. The MoCA is a valid assessment of cognition that shows good agreement with existing screening tools and global measures (convergent validity) and was superior to the MMSE in this regard. The MoCA domain-specific subscores align with performance on more-detailed neuropsychological tests, suggesting not only good criterion validity for the MoCA, but also that it may be useful in guiding further neuropsychological testing. © 2013, Copyright the Authors Journal compilation © 2013, The American Geriatrics Society.
An Integrated Approach to Establish Validity and Reliability of Reading Tests

Science.gov (United States)

Razi, Salim

2012-01-01

This study presents the processes of developing and establishing the reliability and validity of a reading test by administering an integrative approach, as conventional reliability and validity measures only superficially reveal the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…
Validity and Reliability of a Medicine Ball Explosive Power Test.

Science.gov (United States)

Stockbrugger, Barry A.; Haennel, Robert G.

2001-01-01

Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…
Validity of purchasing power parity for selected Latin American countries: Linear and non-linear unit root tests

Directory of Open Access Journals (Sweden)

Claudio Roberto Fóffano Vasconcelos

2016-01-01

Full Text Available The aim of this study is to examine empirically the validity of PPP in the context of unit root tests based on linear and non-linear models of the real effective exchange rate of Argentina, Brazil, Chile, Colombia, Mexico, Peru and Venezuela. For this purpose, we apply the Harvey et al. (2008) linearity test and the non-linear unit root test (Kruse, 2011). The results show that the series with linear characteristics are Argentina, Brazil, Chile, Colombia and Peru and those with non-linear characteristics are Mexico and Venezuela. The linear unit root tests indicate that the real effective exchange rate is stationary for Chile and Peru, and the non-linear unit root tests provide evidence that it is stationary for Mexico. In the period analyzed, the results show support for the validity of PPP in only three of the seven countries.
Fundamentals of endoscopic surgery: creation and validation of the hands-on test.

Science.gov (United States)

Vassiliou, Melina C; Dunkin, Brian J; Fried, Gerald M; Mellinger, John D; Trus, Thadeus; Kaneva, Pepa; Lyons, Calvin; Korndorffer, James R; Ujiki, Michael; Velanovich, Vic; Kochman, Michael L; Tsuda, Shawn; Martinez, Jose; Scott, Daniel J; Korus, Gary; Park, Adrian; Marks, Jeffrey M

2014-03-01

The Fundamentals of Endoscopic Surgery™ (FES) program consists of online materials and didactic and skills-based tests. All components were designed to measure the skills and knowledge required to perform safe flexible endoscopy. The purpose of this multicenter study was to evaluate the reliability and validity of the hands-on component of the FES examination, and to establish the pass score. Expert endoscopists identified the critical skill set required for flexible endoscopy. These skills were then modeled in a virtual reality simulator (GI Mentor™ II, Simbionix™ Ltd., Airport City, Israel) to create five tasks and metrics. Scores were designed to measure both speed and precision. Validity evidence was assessed by correlating performance with self-reported endoscopic experience (surgeons and gastroenterologists [GIs]). Internal consistency of each test task was assessed using Cronbach's alpha. Test-retest reliability was determined by having the same participant perform the test a second time and comparing their scores. Passing scores were determined by a contrasting groups methodology and use of receiver operating characteristic curves. A total of 160 participants (17% GIs) performed the simulator test. Scores on the five tasks showed good internal consistency reliability and all had significant correlations with endoscopic experience. Total FES scores correlated 0.73 with participants' level of endoscopic experience, providing evidence of their validity, and their internal consistency reliability (Cronbach's alpha) was 0.82. Test-retest reliability was assessed in 11 participants, and the intraclass correlation was 0.85. The passing score was determined and is estimated to have a sensitivity (true positive rate) of 0.81 and a 1-specificity (false positive rate) of 0.21. The FES hands-on skills test examines the basic procedural components required to perform safe flexible endoscopy. It meets rigorous standards of reliability and validity required for high
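To make the two rates quoted for the passing score concrete, the following sketch computes sensitivity and the false positive rate (1-specificity) for a candidate pass score applied to two contrasting groups. The group labels, scores, pass score and the function name `pass_score_rates` are all hypothetical; the study's actual data and cut-off procedure are not given in the abstract.

```python
def pass_score_rates(competent_scores, non_competent_scores, pass_score):
    """Return (sensitivity, false_positive_rate) for a given pass score.

    A score counts as a pass when it is >= pass_score.
    """
    true_pos = sum(1 for s in competent_scores if s >= pass_score)
    false_pos = sum(1 for s in non_competent_scores if s >= pass_score)
    sensitivity = true_pos / len(competent_scores)
    false_positive_rate = false_pos / len(non_competent_scores)
    return sensitivity, false_positive_rate


# Hypothetical total scores for two contrasting groups (illustration only).
competent = [80, 75, 90, 60, 85]
non_competent = [50, 65, 40, 72, 55]
sens, fpr = pass_score_rates(competent, non_competent, pass_score=70)
print(f"sensitivity = {sens:.2f}, false positive rate = {fpr:.2f}")
```

Evaluating this pair of rates over a range of candidate pass scores traces out a receiver operating characteristic curve, from which a cut-off can be chosen.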
ASSERT validation against the Stern Laboratories' single-phase pressure drop tests

International Nuclear Information System (INIS)

Waddington, G.M.; Kiteley, J.C.; Carver, M.B.

1995-01-01

This paper describes the preliminary validation of ASSERT-IV against the single-phase pressure drop tests from the 37-element CHF (critical heat flux) experiments conducted at Stern Laboratories, and shows how this study fits into the overall ASSERT validation plan. The effects on the pressure drop of several friction and form loss models are evaluated, including the geometry-based K-factor model. The choice of friction factor has a small effect on the predicted channel pressure drop, compared to the form loss model choice. Using the uniform K-factors of Hameed, the computed pressure drops are in excellent agreement with the experimental results from the nominal pressure tube tests. For future ASSERT applications, either Hameed's uniform K-factors or the geometry-based model using Idelchik's thick-edged orifice equation are recommended, as are the friction factor correlations of Colebrook-White, Selander, and Aly and Groeneveld. More analysis of the geometry-based K-factor model is required. (author). 23 refs., 4 tabs., 9 figs
How Can Consumers Be Sure a Genetic Test Is Valid and Useful?

Science.gov (United States)

... a genetic test is valid and useful? How can consumers be sure a genetic test is valid ... particular gene or genetic change. In other words, can the test accurately detect whether a specific genetic ...
Solar Sail Models and Test Measurements Correspondence for Validation Requirements Definition

Science.gov (United States)

Ewing, Anthony; Adams, Charles

2004-01-01

Solar sails are being developed as a mission-enabling technology in support of future NASA science missions. Current efforts have advanced solar sail technology sufficient to justify a flight validation program. A primary objective of this activity is to test and validate solar sail models that are currently under development so that they may be used with confidence in future science mission development (e.g., scalable to larger sails). Both system and model validation requirements must be defined early in the program to guide design cycles and to ensure that relevant and sufficient test data will be obtained to conduct model validation to the level required. A process of model identification, model input/output documentation, model sensitivity analyses, and test measurement correspondence is required so that decisions can be made to satisfy validation requirements within program constraints.
Are chiropractic tests for the lumbo-pelvic spine reliable and valid? A systematic critical literature review

DEFF Research Database (Denmark)

Hestbaek, L; Leboeuf-Yde, C

2000-01-01

OBJECTIVE: To systematically review the peer-reviewed literature about the reliability and validity of chiropractic tests used to determine the need for spinal manipulative therapy of the lumbo-pelvic spine, taking into account the quality of the studies. DATA SOURCES: The CHIROLARS database......-pelvic spine were included. DATA EXTRACTION: Data quality were assessed independently by the two reviewers, with a quality score based on predefined methodologic criteria. Results of the studies were then evaluated in relation to quality. DATA SYNTHESIS: None of the tests studied had been sufficiently evaluated in relation to reliability and validity. Only tests for palpation for pain had consistently acceptable results. Motion palpation of the lumbar spine might be valid but showed poor reliability, whereas motion palpation of the sacroiliac joints seemed to be slightly reliable but was not shown...
Validation of measured friction by process tests

DEFF Research Database (Denmark)

Eriksen, Morten; Henningsen, Poul; Tan, Xincai

The objective of sub-task 3.3 is to evaluate under actual process conditions the friction formulations determined by simulative testing. As regards task 3.3 the following tests have been used according to the original project plan: 1. standard ring test and 2. double cup extrusion test. The task has, however, been extended to include a number of newly developed process tests: 3. forward rod extrusion test, 4. special ring test at low normal pressure, 5. spike test (especially developed for warm and hot forging). Validation of the measured friction values in cold forming from sub-task 3.1 has been made with forward rod extrusion, and very good agreement was obtained between the measured friction values in simulative testing and process testing....

Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

Science.gov (United States)

Bhat, Mehraj A.

2014-01-01

This paper is based on the construction and evaluation of the reliability and validity of a reasoning ability test for secondary school students. In this paper an attempt was made to evaluate validity and reliability and to determine the appropriate standards to interpret the results of the reasoning ability test. The test includes 45 items to measure six types…
Migraine patients consistently show abnormal vestibular bedside tests.

Science.gov (United States)

Maranhão, Eliana Teixeira; Maranhão-Filho, Péricles; Luiz, Ronir Raggio; Vincent, Maurice Borges

2016-01-01

Migraine and vertigo are common disorders, with lifetime prevalences of 16% and 7% respectively, and co-morbidity around 3.2%. Vestibular syndromes and dizziness occur more frequently in migraine patients. We investigated bedside clinical signs indicative of vestibular dysfunction in migraineurs. To test the hypothesis that vestibulo-ocular reflex, vestibulo-spinal reflex and fall risk (FR) responses as measured by 14 bedside tests are abnormal in migraineurs without vertigo, as compared with controls. Cross-sectional study including sixty individuals - thirty migraineurs, 25 women, 19-60 y-o; and 30 gender/age healthy paired controls. Migraineurs showed a tendency to perform worse in almost all tests, albeit only the Romberg tandem test was statistically different from controls. A combination of four abnormal tests better discriminated the two groups (93.3% specificity). Migraine patients consistently showed abnormal vestibular bedside tests when compared with controls.
Computer-aided test selection and result validation-opportunities and pitfalls

DEFF Research Database (Denmark)

McNair, P; Brender, J; Talmon, J

1998-01-01

Dynamic test scheduling is concerned with pre-analytical preprocessing of the individual samples within a clinical laboratory production by means of decision algorithms. The purpose of such scheduling is to provide maximal information with minimal data production (to avoid data pollution and/or to increase cost-efficiency). Our experience shows that there is a practical limit to the extent of exploitation of the principle of dynamic test scheduling, unless it is automated in one way or the other. This paper analyses some issues of concern related to the profession of clinical biochemistry, when implementing such dynamic test scheduling within a Laboratory Information System (and/or an advanced analytical workstation). The challenge is related to 1) generation of appropriately validated decision models, and 2) mastering consequences of analytical imprecision and bias....
Validity of an Interactive Functional Reach Test.

Science.gov (United States)

Galen, Sujay S; Pardo, Vicky; Wyatt, Douglas; Diamond, Andrew; Brodith, Victor; Pavlov, Alex

2015-08-01

Videogaming platforms such as the Microsoft (Redmond, WA) Kinect(®) are increasingly being used in rehabilitation to improve balance performance and mobility. These gaming platforms do not have built-in clinical measures that offer clinically meaningful data. We have now developed software that will enable the Kinect sensor to assess a patient's balance using an interactive functional reach test (I-FRT). The aim of the study was to test the concurrent validity of the I-FRT and to establish the feasibility of implementing the I-FRT in a clinical setting. The concurrent validity of the I-FRT was tested among 20 healthy adults (mean age, 25.8±3.4 years; 14 women). The Functional Reach Test (FRT) was measured simultaneously by both the Kinect sensor using the I-FRT software and the Optotrak Certus(®) 3D motion-capture system (Northern Digital Inc., Waterloo, ON, Canada). The feasibility of implementing the I-FRT in a clinical setting was assessed by performing the I-FRT in 10 participants with mild balance impairments recruited from the outpatient physical therapy clinic (mean age, 55.8±13.5 years; four women) and obtaining their feedback using a NASA Task Load Index (NASA-TLX) questionnaire. There was moderate to good agreement between FRT measures made by the two measurement systems. The greatest agreement between the two measurement systems was found with the Kinect sensor placed at a distance of 2.5 m [intraclass correlation coefficient (2,k)=0.786; P<…] … NASA/TLX questionnaire. FRT measures made using the Kinect sensor I-FRT software provide a valid clinical measure that can be used with the gaming platforms.
Migraine patients consistently show abnormal vestibular bedside tests

Directory of Open Access Journals (Sweden)

Eliana Teixeira Maranhão

2015-01-01

Full Text Available Migraine and vertigo are common disorders, with lifetime prevalences of 16% and 7% respectively, and co-morbidity around 3.2%. Vestibular syndromes and dizziness occur more frequently in migraine patients. We investigated bedside clinical signs indicative of vestibular dysfunction in migraineurs. Objective: To test the hypothesis that vestibulo-ocular reflex, vestibulo-spinal reflex and fall risk (FR) responses as measured by 14 bedside tests are abnormal in migraineurs without vertigo, as compared with controls. Method: Cross-sectional study including sixty individuals - thirty migraineurs, 25 women, 19-60 y-o; and 30 gender/age healthy paired controls. Results: Migraineurs showed a tendency to perform worse in almost all tests, albeit only the Romberg tandem test was statistically different from controls. A combination of four abnormal tests better discriminated the two groups (93.3% specificity). Conclusion: Migraine patients consistently showed abnormal vestibular bedside tests when compared with controls.
Overview of CSNI separate effects tests validation matrix

Energy Technology Data Exchange (ETDEWEB)

Aksan, N. [Paul Scherrer Institute, Villigen (Switzerland); Auria, F.D. [Univ. of Pisa (Italy); Glaeser, H. [Gesellschaft fuer anlagen und Reaktorsicherheit, (GRS), Garching (Germany)] [and others

1995-09-01

An internationally agreed separate effects test (SET) Validation Matrix for thermal-hydraulic system codes has been established by a sub-group of the Task Group on Thermal Hydraulic System Behaviour as requested by the OECD/NEA Committee on Safety of Nuclear Installations (CSNI) Principal Working Group No. 2 on Coolant System Behaviour. The construction of such a Matrix is an attempt to collect together in a systematic way the best sets of openly available test data for code validation, assessment and improvement and also for quantitative code assessment with respect to quantification of uncertainties in the modeling of individual phenomena by the codes. The methodology that has been developed during the process of establishing the CSNI-SET validation matrix was an important outcome of the work on the SET matrix. In addition, all the choices which have been made from the 187 identified facilities covering the 67 phenomena will be investigated together with some discussions on the data base.
Performance Validity Testing in Neuropsychology: Scientific Basis and Clinical Application-A Brief Review.

Science.gov (United States)

Greher, Michael R; Wodushek, Thomas R

2017-03-01

Performance validity testing refers to neuropsychologists' methodology for determining whether neuropsychological test performances completed in the course of an evaluation are valid (ie, the results of true neurocognitive function) or invalid (ie, overly impacted by the patient's effort/engagement in testing). This determination relies upon the use of either standalone tests designed for this sole purpose, or specific scores/indicators embedded within traditional neuropsychological measures that have demonstrated this utility. In response to a greater appreciation for the critical role that performance validity issues play in neuropsychological testing and the need to measure this variable to the best of our ability, the scientific base for performance validity testing has expanded greatly over the last 20 to 30 years. As such, the majority of current day neuropsychologists in the United States use a variety of measures for the purpose of performance validity testing as part of everyday forensic and clinical practice and address this issue directly in their evaluations. The following is the first article of a 2-part series that will address the evolution of performance validity testing in the field of neuropsychology, both in terms of the science as well as the clinical application of this measurement technique. The second article of this series will review performance validity tests in terms of methods for development of these measures, and maximizing of diagnostic accuracy.
Validity and Reliability of Baseline Testing in a Standardized Environment.

Science.gov (United States)

Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur

2017-08-11

The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred-sixty one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
USFDA-GUIDELINE BASED VALIDATION OF TESTING METHOD FOR RIFAMPICIN IN INDONESIAN SERUM SPECIMEN

Directory of Open Access Journals (Sweden)

Tri Joko Raharjo

2010-06-01

Full Text Available Under a new regulation from the Indonesian FDA (Badan POM-RI), all new non-patent drugs must show bioequivalence with the originator drug prior to registration. Bioequivalence testing (BE testing) has to be performed on people who represent the population to which the drug is to be administered. BE testing needs a valid bioanalytical method for the specific drug target and population group. This research reports the specific validation of a bioanalytical method for rifampicin in Indonesian serum specimens for use in BE testing. The extraction was performed using acetonitrile, while the chromatographic separation was accomplished on an RP 18 column (250 × 4.6 mm i.d., 5 µm), with a mobile phase composed of KH2PO4 10 mM-acetonitrile (40:60, v/v), and UV detection was set at 333 nm. The method showed specificity compared to blank serum specimens, with a retention time of rifampicin at 2.1 min. The lower limit of quantification (LLOQ) was 0.06 µg/mL with a dynamic range up to 20 µg/mL (R>0.990). Precision of the method was very good, with coefficients of variation (CV) of 0.58, 7.40 and 5.56% for concentrations of 0.06, 5 and 15 µg/mL, respectively. Accuracies of the method were 3.22, 1.94 and 1.90% for concentrations of 0.06, 5 and 15 µg/mL, respectively. The average recoveries were 97.82, 95.50 and 97.31% for rifampicin concentrations of 1, 5 and 5 µg/mL, respectively. The method also showed reliable results in stability tests on freezing-thawing, short-term and long-term stability, as well as post-preparation stability. The validation results showed that the method was ready to be used for rifampicin BE testing with Indonesian subjects. Keywords: Rifampicin, Validation, USFDA-Guideline
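The precision and accuracy figures in this abstract are percentages of the kind sketched below: precision as the coefficient of variation of replicate measurements, and accuracy as the relative deviation of the mean from the nominal concentration. Reading "accuracy" as percent deviation is an assumption, since the abstract does not define it, and the replicate values and function names are hypothetical.

```python
from statistics import mean, stdev


def precision_cv_percent(replicates):
    """Coefficient of variation (%): sample SD divided by the mean."""
    return stdev(replicates) / mean(replicates) * 100


def accuracy_bias_percent(replicates, nominal):
    """Absolute deviation of the mean from the nominal value, in percent.

    Assumption: 'accuracy' is expressed as percent deviation from nominal.
    """
    return abs(mean(replicates) - nominal) / nominal * 100


# Hypothetical replicate measurements (µg/mL) at a nominal 5 µg/mL level.
replicates = [4.9, 5.1, 5.0, 5.2, 5.3]
nominal = 5.0
cv = precision_cv_percent(replicates)
bias = accuracy_bias_percent(replicates, nominal)
print(f"CV = {cv:.2f}%, deviation from nominal = {bias:.2f}%")
```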
Understanding Student Teachers’ Behavioural Intention to Use Technology: Technology Acceptance Model (TAM) Validation and Testing

Directory of Open Access Journals (Sweden)

Kung-Teck, Wong

2013-01-01

Full Text Available This study sets out to validate and test the Technology Acceptance Model (TAM) in the context of Malaysian student teachers’ integration of their technology in teaching and learning. To establish factorial validity, data collected from 302 respondents were tested against the TAM using confirmatory factor analysis (CFA), and structural equation modelling (SEM) was used for model comparison and hypotheses testing. The goodness-of-fit test of the analysis shows partial support of the applicability of the TAM in a Malaysian context. Overall, the TAM accounted for 37.3% of the variance in intention to use technology among student teachers and of the five hypotheses formulated, four are supported. Perceived usefulness is a significant influence on attitude towards computer use and behavioural intention. Perceived ease of use significantly influences perceived usefulness, and finally, behavioural intention is found to be influenced by attitude towards computer use. The findings of this research contribute to the literature by validating the TAM in the Malaysian context and provide several prominent implications for the research and practice of technology integration development.
ExEP yield modeling tool and validation test results

Science.gov (United States)

Morgan, Rhonda; Turmon, Michael; Delacroix, Christian; Savransky, Dmitry; Garrett, Daniel; Lowrance, Patrick; Liu, Xiang Cate; Nunez, Paul

2017-09-01

EXOSIMS is an open-source simulation tool for parametric modeling of the detection yield and characterization of exoplanets. EXOSIMS has been adopted by the Exoplanet Exploration Programs Standards Definition and Evaluation Team (ExSDET) as a common mechanism for comparison of exoplanet mission concept studies. To ensure trustworthiness of the tool, we developed a validation test plan that leverages the Python-language unit-test framework, utilizes integration tests for selected module interactions, and performs end-to-end crossvalidation with other yield tools. This paper presents the test methods and results, with the physics-based tests such as photometry and integration time calculation treated in detail and the functional tests treated summarily. The test case utilized a 4m unobscured telescope with an idealized coronagraph and an exoplanet population from the IPAC radial velocity (RV) exoplanet catalog. The known RV planets were set at quadrature to allow deterministic validation of the calculation of physical parameters, such as working angle, photon counts and integration time. The observing keepout region was tested by generating plots and movies of the targets and the keepout zone over a year. Although the keepout integration test required the interpretation of a user, the test revealed problems in the L2 halo orbit and the parameterization of keepout applied to some solar system bodies, which the development team was able to address. The validation testing of EXOSIMS was performed iteratively with the developers of EXOSIMS and resulted in a more robust, stable, and trustworthy tool that the exoplanet community can use to simulate exoplanet direct-detection missions from probe class, to WFIRST, up to large mission concepts such as HabEx and LUVOIR.
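The idea of fixing planets at quadrature so that derived quantities can be checked deterministically can be sketched with Python's standard `unittest` module. The function `working_angle_arcsec` below is a hypothetical stand-in and is not part of the EXOSIMS API; it assumes a circular orbit, so that the projected separation at quadrature equals the semi-major axis, and uses the small-angle relation that 1 AU at 1 pc subtends 1 arcsecond.

```python
import unittest


def working_angle_arcsec(semi_major_axis_au, distance_pc):
    """Angular separation at quadrature for a circular orbit.

    Hypothetical helper: separation in AU divided by distance in
    parsecs gives the angle in arcseconds (small-angle approximation).
    """
    return semi_major_axis_au / distance_pc


class TestWorkingAngle(unittest.TestCase):
    def test_one_au_at_ten_parsec(self):
        # 1 AU seen from 10 pc should subtend 0.1 arcsec.
        self.assertAlmostEqual(working_angle_arcsec(1.0, 10.0), 0.1)

    def test_scales_inversely_with_distance(self):
        near = working_angle_arcsec(5.2, 5.0)
        far = working_angle_arcsec(5.2, 10.0)
        self.assertAlmostEqual(near, 2 * far)


# Run the tests explicitly so the script does not exit the interpreter.
suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestWorkingAngle)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

Because the geometry is fixed, the expected values are known in closed form, which is what allows a physics calculation to be checked by an ordinary unit test rather than by inspection.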
Development and Validation of a Dissolution Test Method for ...

African Journals Online (AJOL)

Purpose: To develop and validate a dissolution test method for dissolution release of artemether and lumefantrine from tablets. Methods: A single dissolution method for evaluating the in vitro release of artemether and lumefantrine from tablets was developed and validated. The method comprised of a dissolution medium of ...
Cross-cultural validation and psychometric testing of the Norwegian version of the TeamSTEPPS® teamwork perceptions questionnaire.

Science.gov (United States)

Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise

2017-12-02

Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test-retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ 2 (df) 969.46 (546), p teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.
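For readers unfamiliar with the internal-consistency statistic reported here, the following is a minimal sketch of the standard Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the example scores are hypothetical and are not data from the study; the study's own software and settings are not stated in the abstract.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_variances = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: four respondents, three items on a 1-5 scale.
example_scores = [[4, 5, 4], [2, 3, 2], [5, 5, 4], [3, 3, 3]]
print(round(cronbach_alpha(example_scores), 3))
```

Values closer to 1 indicate that the items of a subscale vary together; the thresholds for "acceptable" are a convention of the field, not something the formula supplies.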
Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.

Science.gov (United States)

Schlaberg, Robert; Chiu, Charles Y; Miller, Steve; Procop, Gary W; Weinstock, George

2017-06-01

- Metagenomic sequencing can be used for detection of any pathogens using unbiased, shotgun next-generation sequencing (NGS), without the need for sequence-specific amplification. Proof-of-concept has been demonstrated in infectious disease outbreaks of unknown causes and in patients with suspected infections but negative results for conventional tests. Metagenomic NGS tests hold great promise to improve infectious disease diagnostics, especially in immunocompromised and critically ill patients. - To discuss challenges and provide example solutions for validating metagenomic pathogen detection tests in clinical laboratories. A summary of current regulatory requirements, largely based on prior guidance for NGS testing in constitutional genetics and oncology, is provided. - Examples from 2 separate validation studies are provided for steps from assay design, and validation of wet bench and bioinformatics protocols, to quality control and assurance. - Although laboratory and data analysis workflows are still complex, metagenomic NGS tests for infectious diseases are increasingly being validated in clinical laboratories. Many parallels exist to NGS tests in other fields. Nevertheless, specimen preparation, rapidly evolving data analysis algorithms, and incomplete reference sequence databases are idiosyncratic to the field of microbiology and often overlooked.
Validity of the Worth 4 Dot Test in Patients with Red-Green Color Vision Defect.

Science.gov (United States)

Bak, Eunoo; Yang, Hee Kyung; Hwang, Jeong-Min

2017-05-01

The Worth four dot test uses red and green glasses for binocular dissociation, and although it has been believed that patients with red-green color vision defects cannot accurately perform the Worth four dot test, this has not been validated. Therefore, the purpose of this study was to demonstrate the validity of the Worth four dot test in patients with congenital red-green color vision defects who have normal or abnormal binocular vision. A retrospective review of medical records was performed on 30 consecutive congenital red-green color vision defect patients who underwent the Worth four dot test. The type of color vision anomaly was determined by the Hardy Rand and Rittler (HRR) pseudoisochromatic plate test, Ishihara color test, anomaloscope, and/or the 100 hue test. All patients underwent a complete ophthalmologic examination. Binocular sensory status was evaluated with the Worth four dot test and Randot stereotest. The results were interpreted according to the presence of strabismus or amblyopia. Among the 30 patients, 24 had normal visual acuity without strabismus or amblyopia and 6 patients had strabismus and/or amblyopia. The 24 patients without strabismus or amblyopia all showed binocular fusional responses by seeing four dots of the Worth four dot test. Meanwhile, the six patients with strabismus or amblyopia showed various results of fusion, suppression, and diplopia. Congenital red-green color vision defect patients of different types and variable degrees of binocularity could successfully perform the Worth four dot test. They showed reliable results that were in accordance with their estimated binocular sensory status.
Validity and reliability of tests determining performance-related components of wheelchair basketball

NARCIS (Netherlands)

De Groot, Sonja; Balvers, Inge J. M.; Kouwenhoven, Sanne M.; Janssen, Thomas W. J.

2012-01-01

The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'
Evaporation over sump surface in containment studies: code validation on TOSQAN tests

International Nuclear Information System (INIS)

Malet, J.; Gelain, T.; Degrees du Lou, O.; Daru, V.

2011-01-01

During the course of a severe accident in a Nuclear Power Plant, water can be collected in the sump containment through steam condensation on walls and spray systems activation. The objective of this paper is to present code validation on evaporative sump tests performed on the TOSQAN facility. The ASTEC-CPA code is used as a lumped-parameter code and specific user-defined-functions are developed for the TONUS-CFD code. The tests are air-steam tests, as well as tests with other non-condensable gases (He, CO2 and SF6) under steady and transient conditions. The results show a good agreement between codes and experiments, indicating a good behaviour of the sump models in both codes. (author)
[Attempt for development of rapid word reading test for children--evaluation of reliability and validity].

Science.gov (United States)

Hashimoto, Ryusaku; Kashiwagi, Mitsuru; Suzuki, Shuhei

2008-09-01

We developed a rapid word reading test for examining the phonological processing ability of Japanese children. We prepared two versions of the test, version A and B. Each test has word and non-word tasks. Twenty-two healthy boys of third grade in primary schools participated in this validation study. For criterion related validity, we performed the serial Hiragana reading test, the sentence reading test, Raven's coloured progressive matrices (RCPM), the Token test for children, the Kana word dictation test, the standardized comprehension test of abstract words (SCTAW), and Trail Circle test. The reading times of the newly developed test correlated moderately or highly with those of the serial Hiragana reading test and the sentence reading test. However, the scores of the other tests (RCPM, Token test for children, Kana word dictation test, SCTAW, Trail Circle test) did not correlate with the reading time of the rapid word reading test. Test-retest reliabilities in the word tasks were more than moderate: 0.52 and 0.76 in versions A and B, while those in the non-word tasks were high: 0.91 and 0.88 in versions A and B. The correlation coefficient between versions A and B was 0.7 for the word tasks and 0.92 for the non-word tasks. This study showed that the rapid word reading test has substantial validity and reliability for testing the phonological processing ability of Japanese children. In addition, the non-word tasks were more suitable for selectively examining the speed of the grapheme to phoneme conversion process.
Validation of the Vanderbilt Holistic Face Processing Test.

Science.gov (United States)

Wang, Chao-Chih; Ross, David A; Gauthier, Isabel; Richler, Jennifer J

2016-01-01

The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is a group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1.

Validation of the Vanderbilt Holistic Face Processing Test.

Directory of Open Access Journals (Sweden)

Chao-Chih Wang

2016-11-01

The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is a group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1.
Sino-Nasal Outcome Test-22: Translation, Cross-cultural Adaptation, and Validation in Hebrew-Speaking Patients.

Science.gov (United States)

Shapira Galitz, Yael; Halperin, Doron; Bavnik, Yosef; Warman, Meir

2016-05-01

To perform the translation, cross-cultural adaptation, and validation of the Sino-Nasal Outcome Test-22 (SNOT-22) questionnaire to the Hebrew language. A single-center prospective cross-sectional study. Seventy-three chronic rhinosinusitis (CRS) patients and 73 patients without sinonasal disease filled the Hebrew version of the SNOT-22 questionnaire. Fifty-one CRS patients underwent endoscopic sinus surgery, out of which 28 filled a postoperative questionnaire. Seventy-three healthy volunteers without sinonasal disease also answered the questionnaire. Internal consistency, test-retest reproducibility, validity, and responsiveness of the questionnaire were evaluated. Questionnaire reliability was excellent, with a high internal consistency (Cronbach's alpha coefficient, 0.91-0.936) and test-retest reproducibility (Spearman's coefficient, 0.962). Mean scores for the preoperative, postoperative, and control groups were 50.44, 29.64, and 13.15, respectively (P < .0001 for CRS vs controls, P < .001 for preoperative vs postoperative), showing validity and responsiveness of the questionnaire. The Hebrew version of SNOT-22 questionnaire is a valid outcome measure for patients with CRS with or without nasal polyps. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Validity Theory: Reform Policies, Accountability Testing, and Consequences

Science.gov (United States)

Chalhoub-Deville, Micheline

2016-01-01

Educational policies such as Race to the Top in the USA affirm a central role for testing systems in government-driven reform efforts. Such reform policies are often referred to as the global education reform movement (GERM). Changes observed with the GERM style of testing demand socially engaged validity theories that include consequential…
Official Position of the American Academy of Clinical Neuropsychology Social Security Administration Policy on Validity Testing: Guidance and Recommendations for Change.

Science.gov (United States)

Chafetz, M D; Williams, M A; Ben-Porath, Y S; Bianchini, K J; Boone, K B; Kirkwood, M W; Larrabee, G J; Ord, J S

2015-01-01

The milestone publication by Slick, Sherman, and Iverson (1999) of criteria for determining malingered neurocognitive dysfunction led to extensive research on validity testing. Position statements by the National Academy of Neuropsychology and the American Academy of Clinical Neuropsychology (AACN) recommended routine validity testing in neuropsychological evaluations. Despite this widespread scientific and professional support, the Social Security Administration (SSA) continued to discourage validity testing, a stance that led to a congressional initiative for SSA to reevaluate their position. In response, SSA commissioned the Institute of Medicine (IOM) to evaluate the science concerning the validation of psychological testing. The IOM concluded that validity assessment was necessary in psychological and neuropsychological examinations (IOM, 2015). The AACN sought to provide independent expert guidance and recommendations concerning the use of validity testing in disability determinations. A panel of contributors to the science of validity testing and its application to the disability process was charged with describing why the disability process for SSA needs improvement, and indicating the necessity for validity testing in disability exams. This work showed how the determination of malingering is a probability proposition, described how different types of validity tests are appropriate, provided evidence concerning non-credible findings in children and low-functioning individuals, and discussed the appropriate evaluation of pain disorders typically seen outside of mental consultations. A scientific plan for validity assessment that additionally protects test security is needed in disability determinations and in research on classification accuracy of disability decisions.
Validation of a clinical critical thinking skills test in nursing.

Science.gov (United States)

Shin, Sujin; Jung, Dukyoo; Kim, Sungeun

2015-01-27

The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Out of the initial 30 items, 11 items were excluded after the analysis of the difficulty and discrimination parameters. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. From the above results, evidence of the response process validity was demonstrated, indicating that subjects responded as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and therefore represents a more convenient measurement of critical thinking ability.
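The abstract refers to item difficulty and discrimination parameters but does not say which item response theory model was fitted. As an illustration only, the sketch below assumes the common two-parameter logistic (2PL) model, in which the probability of a correct response is 1 / (1 + exp(-a(θ - b))) for ability θ, discrimination a, and difficulty b. The function name and parameter values are hypothetical.

```python
import math


def item_response_probability(theta, discrimination, difficulty):
    """Probability of a correct response under the 2PL model (assumed model)."""
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))


# Hypothetical items: an easy item (low difficulty) and a hard item,
# both with the same discrimination, for an examinee of average ability.
average_ability = 0.0
print(item_response_probability(average_ability, discrimination=1.2, difficulty=-1.0))
print(item_response_probability(average_ability, discrimination=1.2, difficulty=1.5))
```

Under this model, a "relatively low" difficulty means most examinees have a better than even chance of answering correctly, while a higher discrimination makes the probability rise more steeply around the difficulty point.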
Reproducibility and validity of the DynaPort KneeTest

NARCIS (Netherlands)

Mokkink, L.B.; Terwee, C.B.; Slikke, van der R.M.; Lummel, van R.C.; Benink, R.J.; Bouter, L.M.; Vet, de H.C.W.

2005-01-01

OBJECTIVE: To determine the reproducibility and validity of the DynaPort KneeTest, a performance-based test that measures quality of movement of patients undergoing total knee replacement (TKR). METHODS: A total of 92 patients with osteoarthritis (OA) of the knee performed the KneeTest twice on the
Water evaporation over sump surface in nuclear containment studies: CFD and LP codes validation on TOSQAN tests

Energy Technology Data Exchange (ETDEWEB)

Malet, J., E-mail: jeanne.malet@irsn.fr [Institut de Radioprotection et de Sûreté Nucléaire (IRSN), PSN-RES/SCA BP 68, 91192 Gif-sur-Yvette (France); Degrees du Lou, O. [Institut de Radioprotection et de Sûreté Nucléaire (IRSN), PSN-RES/SCA BP 68, 91192 Gif-sur-Yvette (France); Arts et Métiers ParisTech, DynFluid Lab. EA92, 151, boulevard de l’Hôpital, 75013 Paris (France); Gelain, T. [Institut de Radioprotection et de Sûreté Nucléaire (IRSN), PSN-RES/SCA BP 68, 91192 Gif-sur-Yvette (France)

2013-10-15

Highlights: • Simulations of evaporative TOSQAN sump tests are performed. • These tests are under air–steam gas conditions with addition of He, CO2 and SF6. • ASTEC-CPA LP and TONUS-CFD codes with UDF for sump model are used. • Validation of sump models of both codes shows good results. • The code–experiment differences are attributed to turbulent gas mixing modeling. -- Abstract: During the course of a severe accident in a Nuclear Power Plant, water can be collected in the sump containment through steam condensation on walls and spray systems activation. The objective of this paper is to present code validation on evaporative sump tests performed on the TOSQAN facility. The ASTEC-CPA code is used as a lumped-parameter code and specific user-defined-functions are developed for the TONUS-CFD code. The seven tests are air–steam tests, as well as tests with other non-condensable gases (He, CO2 and SF6) under steady and transient conditions (two depressurization tests). The results show a good agreement between codes and experiments, indicating a good behavior of the sump models in both codes. The sump model developed as User-Defined Functions (UDF) for TONUS is considered as well validated and is ‘ready-to-use’ for all CFD codes in which such UDF can be added. The remaining discrepancies between codes and experiments are caused by turbulent transport and gas mixing, especially in the presence of non-condensable gases other than air, so that code validation on this important topic for hydrogen safety analysis is still recommended.
Validating a dance-specific screening test for balance: preliminary results from multisite testing.

Science.gov (United States)

Batson, Glenna

2010-09-01

Few dance-specific screening tools adequately capture balance. The aim of this study was to administer and modify the Star Excursion Balance Test (oSEBT) to examine its utility as a balance screen for dancers. The oSEBT involves standing on one leg while lightly targeting with the opposite foot to the farthest distance along eight spokes of a star-shaped grid. This task simulates dance in the spatial pattern and movement quality of the gesturing limb. The oSEBT was validated for distance on athletes with history of ankle sprain. Thirty-three dancers (age 20.1 +/- 1.4 yrs) participated from two contemporary dance conservatories (UK and US), with or without a history of lower extremity injury. Dancers were verbally instructed (without physical demonstration) to execute the oSEBT and four modifications (mSEBT): timed (speed), timed with cognitive interference (answering questions aloud), and sensory disadvantaging (foam mat). Stepping strategies were tracked and performance strategies video-recorded. Unlike the oSEBT results, distances reached were not significant statistically (p = 0.05) or descriptively (i.e., shorter) for either group. Performance styles varied widely, despite sample homogeneity and instructions to control for strategy. Descriptive analysis of mSEBT showed an increased number of near-falls and decreased timing on the injured limb. Dancers appeared to employ variable strategies to keep balance during this test. Quantitative analysis is warranted to define balance strategies for further validation of SEBT modifications to determine its utility as a balance screening tool.
LADO as a Language Test: Issues of Validity

Science.gov (United States)

McNamara, Tim; Van Den Hazelkamp, Carolien; Verrips, Maaike

2016-01-01

This article brings together the theoretical field of language testing and the practical field of language analysis for the determination of the origin of asylum seekers. It considers what it would mean to think of language analysis as a form of language test, subject to the same validity constraints, and proposes a research agenda.
Testing ESL sociopragmatics development and validation of a web-based test battery

CERN Document Server

Roever, Carsten; Elder, Catherine

2014-01-01

Testing of second language pragmatics has grown as a research area but still suffers from a tension between construct coverage and practicality. In this book, the authors describe the development and validation of a web-based test of second language pragmatics for learners of English. The test has a sociopragmatic orientation and strives for a broad coverage of the construct by assessing learners' metapragmatic judgments as well as their ability to co-construct discourse. To ensure practicality, the test is delivered online and is scored partially automatically and partially by human raters.
The Persian version of phonological test of diagnostic evaluation articulation and phonology for Persian speaking children and investigating its validity and reliability

Directory of Open Access Journals (Sweden)

Talieh Zarifian

2014-10-01

Background and Aim: Speech and language pathologists (SLPs) often refer to phonological data as part of their assessment protocols in evaluating the communication skills of children. The aim of this study was to develop the Persian version of the phonological test in evaluating and diagnosing communication skills in Persian speaking children and to evaluate its validity and reliability. Methods: The Persian phonological test (PPT) was conducted on 387 monolingual Persian speaking boys and girls (3-6 years of age) who were selected from 12 nurseries in the northwest region of Tehran. Content validity ratio (CVR) and content validity index (CVI) were assessed by speech therapists and linguists. Correlation between speech and language pathologists' expert opinions and Persian phonological test results in children with and without phonological disorders was evaluated to investigate the Persian phonological test validity. In addition, the Persian phonological test test-retest reliability was investigated. Results: Both content validity ratio and content validity index were found to be acceptable (CVR≥94.71 and CVI=97.35). The PPT validity was confirmed by finding a good correlation between speech and language pathologists' expert opinions and Persian phonological test results (r Kappa=0.73 and r Spearman=0.76). The percent of agreement between transcription and analyzing error patterns in test-retest (ranging from 86.27%-100%) and score-rescore (ranging from 94.28%-100%) showed that the Persian phonological test had very high reliability. Conclusion: The results of this study show that the Persian phonological test seems to be a suitable tool in evaluating phonological skills of Persian speaking children in clinical settings and research projects.
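The abstract does not state how the content validity ratio was computed, and its reported values appear to be on a percentage scale. As a hedged illustration, the sketch below uses Lawshe's widely cited formula, CVR = (n_e - N/2) / (N/2), where N is the number of experts and n_e is the number rating an item "essential"; this is an assumption and may differ from the authors' procedure. The function name and panel sizes are hypothetical.

```python
def content_validity_ratio(n_essential, n_experts):
    """Lawshe's CVR for one item: ranges from -1 (no expert) to +1 (all experts)."""
    if n_experts <= 0 or not 0 <= n_essential <= n_experts:
        raise ValueError("need 0 <= n_essential <= n_experts and n_experts > 0")
    half_panel = n_experts / 2
    return (n_essential - half_panel) / half_panel


# Hypothetical panel of 10 experts rating three items.
essential_counts = [10, 8, 5]
print([content_validity_ratio(count, 10) for count in essential_counts])
```

A CVR of 0 means exactly half the panel judged the item essential; items below a panel-size-dependent critical value are usually dropped or revised.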
Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

Science.gov (United States)

Rae, James R.; Olson, Kristina R.

2018-01-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Was the Conconi test validated by sporting success, expert opinion ...

African Journals Online (AJOL)

Was the Conconi test validated by sporting success, expert opinion or good science? ... Despite scientific evidence to the contrary, a popular incremental field test for endurance athletes (Conconi Test) ...
Evidences of validity and reliability of the Luria-Nebraska Test for Children

Directory of Open Access Journals (Sweden)

Ricardo Franco de Lima

2016-01-01

This paper aimed to verify evidence of validity and reliability of the Luria-Nebraska Test for Children (TLN-C, in Portuguese). Three hundred eighty-seven students aged 6–13 years old, with learning difficulties, comprised the study. They were assessed with the Wechsler Intelligence Scale for Children (WISC-III) and TLN-C, and the effect of age differences, as well as accuracy rating by internal consistency, were investigated. Age effects were found for all subtests and in the general score, except for the receptive speech subtest, even when total IQ effect was controlled. Reliability analysis had satisfactory results (0.79). The TLN-C showed evidence of validity and reliability. The receptive speech subtest requires revision.
Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.

Science.gov (United States)

Sawers, Andrew; Hafner, Brian

2018-04-11

To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
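The area under the ROC curve used here for discriminant validity has a simple interpretation: it is the probability that a randomly chosen member of one group scores higher than a randomly chosen member of the other, with ties counted as one half. A minimal sketch of that pairwise computation follows; the function name and the scores are hypothetical and are not data from the study.

```python
def roc_auc(scores_positive, scores_negative):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    if not scores_positive or not scores_negative:
        raise ValueError("both groups must be non-empty")
    correctly_ranked = 0.0
    for positive in scores_positive:
        for negative in scores_negative:
            if positive > negative:
                correctly_ranked += 1.0
            elif positive == negative:
                correctly_ranked += 0.5  # ties count as half
    return correctly_ranked / (len(scores_positive) * len(scores_negative))


# Hypothetical balance scores: here the "positive" group is the one
# expected to score higher (for example, participants without falls).
no_fall_scores = [0.62, 0.55, 0.71, 0.48]
fall_scores = [0.40, 0.52, 0.33]
print(roc_auc(no_fall_scores, fall_scores))
```

An AUC of 0.50 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.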
The validity of upper-limb neurodynamic tests for detecting peripheral neuropathic pain.

Science.gov (United States)

Nee, Robert J; Jull, Gwendolen A; Vicenzino, Bill; Coppieters, Michel W

2012-05-01

The validity of upper-limb neurodynamic tests (ULNTs) for detecting peripheral neuropathic pain (PNP) was assessed by reviewing the evidence on plausibility, the definition of a positive test, reliability, and concurrent validity. Evidence was identified by a structured search for peer-reviewed articles published in English before May 2011. The quality of concurrent validity studies was assessed with the Quality Assessment of Diagnostic Accuracy Studies tool, where appropriate. Biomechanical and experimental pain data support the plausibility of ULNTs. Evidence suggests that a positive ULNT should at least partially reproduce the patient's symptoms and that structural differentiation should change these symptoms. Data indicate that this definition of a positive ULNT is reliable when used clinically. Limited evidence suggests that the median nerve test, but not the radial nerve test, helps determine whether a patient has cervical radiculopathy. The median nerve test does not help diagnose carpal tunnel syndrome. These findings should be interpreted cautiously, because diagnostic accuracy might have been distorted by the investigators' definitions of a positive ULNT. Furthermore, patients with PNP who presented with increased nerve mechanosensitivity rather than conduction loss might have been incorrectly classified by electrophysiological reference standards as not having PNP. The only evidence for concurrent validity of the ulnar nerve test was a case study on cubital tunnel syndrome. We recommend that researchers develop more comprehensive reference standards for PNP to accurately assess the concurrent validity of ULNTs and continue investigating the predictive validity of ULNTs for prognosis or treatment response.
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

Science.gov (United States)

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions. Psychometric properties were analyzed, including internal consistency, measured with Cronbach's alpha, and test-retest reliability at 15 days, measured with the intraclass correlation coefficient. Results: The internal consistency was a Cronbach's alpha of 0.808, evaluated as good. The test-retest reliability index as measured by the intraclass correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and its cultural adaptation to Argentinian Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.
Validation of a clinical critical thinking skills test in nursing

Directory of Open Access Journals (Sweden)

Sujin Shin

2015-01-01

Full Text Available Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Results: Out of the initial 30 items, 11 items were excluded after the analysis of difficulty and discrimination parameters. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. Conclusion: From the above results, evidence of response process validity was demonstrated, indicating that subjects responded as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and will therefore represent a more convenient measurement of critical thinking ability.
VALIDITY IN COMPUTER-BASED TESTING: A LITERATURE REVIEW OF COMPARABILITY ISSUES AND EXAMINEE PERSPECTIVES

Directory of Open Access Journals (Sweden)

Ika Kana Trisnawati

2015-05-01

Full Text Available Recent years have seen the growing popularity of Computer-Based Tests (CBTs) in various disciplines, for various purposes, although Paper-and-Pencil Based Tests (P&Ps) are still in use. However, many question whether CBTs outperform P&Ps in effectiveness or whether CBTs can be as valid a measuring tool as P&Ps. This paper compares CBTs and P&Ps and their respective examinee perspectives in order to determine whether doubts should arise about the emergence of CBTs over the classic P&Ps. Findings showed that CBTs are advantageous in that they are both efficient (reducing testing time) and effective (maintaining test reliability) compared with the P&P versions. Nevertheless, CBTs still need to have their variables well designed (e.g., study design, computer algorithm) in order for the scores to be comparable to those in the P&P tests, since score equivalence is one of the pieces of validity evidence needed in a CBT.
Validation of Alternative In Vitro Methods to Animal Testing: Concepts, Challenges, Processes and Tools.

Science.gov (United States)

Griesinger, Claudius; Desprez, Bertrand; Coecke, Sandra; Casey, Warren; Zuang, Valérie

This chapter explores the concepts, processes, tools and challenges relating to the validation of alternative methods for toxicity and safety testing. In general terms, validation is the process of assessing the appropriateness and usefulness of a tool for its intended purpose. Validation is routinely used in various contexts in science, technology, the manufacturing and services sectors. It serves to assess the fitness-for-purpose of devices, systems, software up to entire methodologies. In the area of toxicity testing, validation plays an indispensable role: "alternative approaches" are increasingly replacing animal models as predictive tools and it needs to be demonstrated that these novel methods are fit for purpose. Alternative approaches include in vitro test methods, non-testing approaches such as predictive computer models up to entire testing and assessment strategies composed of method suites, data sources and decision-aiding tools. Data generated with alternative approaches are ultimately used for decision-making on public health and the protection of the environment. It is therefore essential that the underlying methods and methodologies are thoroughly characterised, assessed and transparently documented through validation studies involving impartial actors. Importantly, validation serves as a filter to ensure that only test methods able to produce data that help to address legislative requirements (e.g. EU's REACH legislation) are accepted as official testing tools and, owing to the globalisation of markets, recognised on international level (e.g. through inclusion in OECD test guidelines). Since validation creates a credible and transparent evidence base on test methods, it provides a quality stamp, supporting companies developing and marketing alternative methods and creating considerable business opportunities. Validation of alternative methods is conducted through scientific studies assessing two key hypotheses, reliability and relevance of the

40 CFR 1045.501 - How do I run a valid emission test?

Science.gov (United States)

2010-07-01

... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test... Procedures § 1045.501 How do I run a valid emission test? (a) Applicability. This subpart is addressed to you... maximum test speed. (g) Special and alternate procedures. If you are unable to run the duty cycle...
40 CFR 1054.501 - How do I run a valid emission test?

Science.gov (United States)

2010-07-01

... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test... Procedures § 1054.501 How do I run a valid emission test? (a) Applicability. This subpart is addressed to you... provisions of 40 CFR 1065.405 describes how to prepare an engine for testing. However, you may consider...
S.E.T., CSNI Separate Effects Test Facility Validation Matrix

International Nuclear Information System (INIS)

1997-01-01

1 - Description of test facility: The SET matrix of experiments is suitable for the developmental assessment of thermal-hydraulics transient system computer codes by selecting individual tests from selected facilities, relevant to each phenomenon. Test facilities differ from one another in geometrical dimensions, geometrical configuration and operating capabilities or conditions. Correlations between SET facilities and phenomena were calculated on the basis of suitability for model validation (which means that a facility is designed in such a way as to simulate the phenomena assumed to occur in a plant and is sufficiently instrumented); limited suitability for model validation (which means that a facility is designed in such a way as to simulate the phenomena assumed to occur in a plant but has problems associated with imperfect scaling, different test fluids or insufficient instrumentation); and unsuitability for model validation. 2 - Description of test: Whereas integral experiments are usually designed to follow the behaviour of a reactor system in various off-normal or accident transients, separate effects tests focus on the behaviour of a single component, or on the characteristics of one thermal-hydraulic phenomenon. The construction of a separate effects test matrix is an attempt to collect together the best sets of openly available test data for code validation, assessment and improvement, from the wide range of experiments that have been carried out world-wide in the field of thermal hydraulics. In all, 2094 tests are included in the SET matrix
Development of an Agility Test for Badminton Players and Assessment of Its Validity and Test-Retest Reliability.

Science.gov (United States)

Loureiro, Luiz de França Bahia; de Freitas, Paulo Barbosa

2016-04-01

Badminton requires open and fast actions toward the shuttlecock, but there is no specific agility test for badminton players with specific movements. To develop an agility test that simultaneously assesses perception and motor capacity and examine the test's concurrent and construct validity and its test-retest reliability. The Badcamp agility test consists of running as fast as possible to 6 targets placed on the corners and middle points of a rectangular area (5.6 × 4.2 m) from the start position located in the center of it, following visual stimuli presented in a luminous panel. The authors recruited 43 badminton players (17-32 y old) to evaluate concurrent (with shuttle-run agility test--SRAT) and construct validity and test-retest reliability. Results revealed that Badcamp presents concurrent and construct validity, as its performance is strongly related to SRAT (ρ = 0.83, P < .001), with performance of experts being better than nonexpert players (P < .01). In addition, Badcamp is reliable, as no difference (P = .07) and a high intraclass correlation (ICC = .93) were found in the performance of the players on 2 different occasions. The findings indicate that Badcamp is an effective, valid, and reliable tool to measure agility, allowing coaches and athletic trainers to evaluate players' athletic condition and training effectiveness and possibly detect talented individuals in this sport.
Ecological validity and reliability of an age-adapted endurance field test in young male soccer players

DEFF Research Database (Denmark)

Castagna, Carlo; Krustrup, Peter; D'Ottavio, Stefano

2018-01-01

The purpose of this study was to examine the reliability and the association with relevant match activities (ecological validity) of an age-adapted field test for intermittent high-intensity endurance known as Yo-Yo intermittent recovery level 1 children test (YYIR1C) in young male soccer players......-intensity metabolic power (r=0.46) distances. Match total distance was largely associated with YYIR1C (r=0.30). The results of this study showed that YYIR1C may be considered a valid and reliable field test for assessing intermittent high-intensity endurance in young male soccer players. Due to the relevance...... performance showed an excellent relative (ICC=0.94) and a good absolute reliability (TEM as %CV=5.1%). Very large and significant associations were found between YYIR1C performance and match high-intensity activity (r=0.53). Large correlations were found between YYIR1C and match sprinting (r=0.42) and high...
Validity and Reliability Study of the Korean Tinetti Mobility Test for Parkinson's Disease.

Science.gov (United States)

Park, Jinse; Koh, Seong-Beom; Kim, Hee Jin; Oh, Eungseok; Kim, Joong-Seok; Yun, Ji Young; Kwon, Do-Young; Kim, Younsoo; Kim, Ji Seon; Kwon, Kyum-Yil; Park, Jeong-Ho; Youn, Jinyoung; Jang, Wooyoung

2018-01-01

Postural instability and gait disturbance are the cardinal symptoms associated with falling among patients with Parkinson's disease (PD). The Tinetti mobility test (TMT) is a well-established measurement tool used to predict falls among elderly people. However, the TMT has not been established or widely used among PD patients in Korea. The purpose of this study was to evaluate the reliability and validity of the Korean version of the TMT for PD patients. Twenty-four patients diagnosed with PD were enrolled in this study. For the interrater reliability test, thirteen clinicians scored the TMT after watching a video clip. We also used the test-retest method to determine intrarater reliability. For concurrent validation, the unified Parkinson's disease rating scale, Hoehn and Yahr staging, Berg Balance Scale, Timed-Up and Go test, 10-m walk test, and gait analysis by three-dimensional motion capture were also used. We analyzed receiver operating characteristic curve to predict falling. The interrater reliability and intrarater reliability of the Korean Tinetti balance scale were 0.97 and 0.98, respectively. The interrater reliability and intra-rater reliability of the Korean Tinetti gait scale were 0.94 and 0.96, respectively. The Korean TMT scores were significantly correlated with the other clinical scales and three-dimensional motion capture. The cutoff values for predicting falling were 14 points (balance subscale) and 10 points (gait subscale). We found that the Korean version of the TMT showed excellent validity and reliability for gait and balance and had high sensitivity and specificity for predicting falls among patients with PD.
[Testing reliability and validity of reduced substitutes for leadership scales(rd-SLS)].

Science.gov (United States)

Kim, Jeong-Hee

2005-10-01

This study was conducted to test the reliability and validity of the rd-SLS, developed by Podsakoff, et al. (1993), which measures 'substitutes for leadership'. The subjects were 345 nurses in 5 general hospitals. Cronbach's alpha and the Guttman split-half coefficient were used to test the reliability of the rd-SLS. Factor analysis, and the correlations of the rv-SLS and SLS with the rd-SLS, were used for convergent and discriminant validity. Cronbach's alpha was 0.76 and the Guttman split-half coefficient was 0.52. Twelve factors emerged from the factor analysis, which explained 70.4% of the total variance. This result was similar to previous study results. However, 'Indifference toward organizational rewards'-related items were classified into two factors. It was not clear that the rd-SLS consisted of 13 concepts (factors). The correlations of the rv-SLS and SLS with the rd-SLS were 0.93 and 0.87 respectively. The rd-SLS showed a moderate degree of validity and reliability. Thus, it is recommended to use the rd-SLS in general nursing organizations for screening for leadership substitutes. In addition, it is necessary to clarify the concept of organizational rewards. In a further study, the factor structure of the rd-SLS may be considered.
The design organization test: further demonstration of reliability and validity as a brief measure of visuospatial ability.

Science.gov (United States)

Killgore, William D S; Gogel, Hannah

2014-01-01

Neuropsychological assessments are frequently time-consuming and fatiguing for patients. Brief screening evaluations may reduce test duration and allow more efficient use of time by permitting greater attention toward neuropsychological domains showing probable deficits. The Design Organization Test (DOT) was initially developed as a 2-min paper-and-pencil alternative for the Block Design (BD) subtest of the Wechsler scales. Although initially validated for clinical neurologic patients, we sought to further establish the reliability and validity of this test in a healthy, more diverse population. Two alternate versions of the DOT and the Wechsler Abbreviated Scale of Intelligence (WASI) were administered to 61 healthy adult participants. The DOT showed high alternate forms reliability (r = .90-.92), and the two versions yielded equivalent levels of performance. The DOT was highly correlated with BD (r = .76-.79) and was significantly correlated with all subscales of the WASI. The DOT proved useful when used in lieu of BD in the calculation of WASI IQ scores. Findings support the reliability and validity of the DOT as a measure of visuospatial ability and suggest its potential worth as an efficient estimate of intellectual functioning in situations where lengthier tests may be inappropriate or unfeasible.
A Human Proximity Operations System test case validation approach

Science.gov (United States)

Huber, Justin; Straub, Jeremy

A Human Proximity Operations System (HPOS) poses numerous risks in a real-world environment. These risks range from mundane tasks such as avoiding walls and fixed obstacles to the critical need to keep people and processes safe in the context of the HPOS's situation-specific decision making. Validating the performance of an HPOS, which must operate in a real-world environment, is an ill-posed problem due to the complexity that is introduced by erratic (non-computer) actors. In order to prove the HPOS's usefulness, test cases must be generated to simulate possible actions of these actors, so the HPOS can be shown to be able to perform safely in the environments where it will be operated. The HPOS must demonstrate its ability to be as safe as a human, across a wide range of foreseeable circumstances. This paper evaluates the use of test cases to validate HPOS performance and utility. It considers an HPOS's safe performance in the context of a common human activity, moving through a crowded corridor, and extrapolates (based on this) to the suitability of using test cases for AI validation in other areas of prospective application.
Establishing the Test-Retest Reliability & Concurrent Validity for the Repeat Ice Skating Test (RIST) in Adolescent Male Ice Hockey Players

Science.gov (United States)

Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William

2012-01-01

In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…
Validation of the shake test for detecting freeze damage to adsorbed vaccines.

Science.gov (United States)

Kartoglu, Umit; Ozgüler, Nejat Kenan; Wolfson, Lara J; Kurzatkowski, Wiesław

2010-08-01

To determine the validity of the shake test for detecting freeze damage in aluminium-based, adsorbed, freeze-sensitive vaccines. A double-blind crossover design was used to compare the performance of the shake test conducted by trained health-care workers (HCWs) with that of phase contrast microscopy as a "gold standard". A total of 475 vials of 8 different types of World Health Organization prequalified freeze-sensitive vaccines from 10 different manufacturers were used. Vaccines were kept at 5 °C. Selected numbers of vials from each type were then exposed to -25 °C and -2 °C for 24-hour periods. There was complete concordance between HCWs and phase-contrast microscopy in identifying freeze-damaged vials and non-frozen samples. Non-frozen samples showed a fine-grain structure under phase contrast microscopy, but freeze-damaged samples showed large conglomerates of massed precipitates with amorphous, crystalline, solid and needle-like structures. Particles in the non-frozen samples measured from 1 µm (vaccines against diphtheria-tetanus-pertussis; Haemophilus influenzae type b; hepatitis B; diphtheria-tetanus-pertussis-hepatitis B) to 20 µm (diphtheria and tetanus vaccines, alone or in combination). By contrast, aggregates in the freeze-damaged samples measured up to 700 µm (diphtheria-tetanus-pertussis) and 350 µm on average. The shake test had 100% sensitivity, 100% specificity and 100% positive predictive value in this study, which confirms its validity for detecting freeze damage to aluminium-based freeze-sensitive vaccines.
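The three accuracy figures quoted in this abstract follow from a 2 × 2 table comparing the index test with the reference standard. The sketch below shows the standard definitions; the function name `diagnostic_metrics` and the counts are hypothetical, since the abstract does not report how the 475 vials were split between frozen and non-frozen.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity and positive predictive value from a 2x2 table.

    tp: test positive, reference positive   fp: test positive, reference negative
    fn: test negative, reference positive   tn: test negative, reference negative
    """
    return {
        "sensitivity": tp / (tp + fn),   # share of truly damaged vials detected
        "specificity": tn / (tn + fp),   # share of undamaged vials correctly cleared
        "ppv": tp / (tp + fp),           # share of positive calls that are correct
    }


# Hypothetical counts illustrating complete concordance with the reference.
print(diagnostic_metrics(tp=40, fp=0, fn=0, tn=60))
```

Complete concordance means no false positives and no false negatives, which is why all three values reach 100% at once.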
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan.

Science.gov (United States)

Moore, Amy Lawson; Miller, Terissa M

2018-01-01

The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
Content validity and reliability of test of gross motor development in Chilean children

Directory of Open Access Journals (Sweden)

Marcelo Cano-Cappellacci

2015-01-01

Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2) for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children aged five to 10 years, students at a primary school in Santiago, Chile, participated. The Committee of Experts carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was carried out to determine the reliability of the test in Spanish, using the intraclass correlation coefficient and the Bland-Altman method. We evaluated whether the results differed significantly when the bat was replaced with a racket, using the t-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.
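The Bland-Altman method named in this abstract summarises agreement between two sets of paired measurements as a mean difference (bias) with limits of agreement. The following is a minimal sketch of the usual calculation, assuming the conventional 1.96 × SD limits; the function name `bland_altman` and the paired scores are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias   = mean of the paired differences
    limits = bias +/- 1.96 * sample standard deviation of the differences
    """
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


# Hypothetical test and retest scores for four children.
test_scores = [10, 12, 14, 16]
retest_scores = [9, 12, 15, 16]
bias, lower, upper = bland_altman(test_scores, retest_scores)
print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

A bias near zero with narrow limits suggests that the two occasions (or raters) can be used interchangeably; how narrow is narrow enough is a judgment made in the units of the test.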
Concurrent Validity and Feasibility of Short Tests Currently Used to Measure Early Childhood Development in Large Scale Studies.

Directory of Open Access Journals (Sweden)

Marta Rubio-Codina

Full Text Available In low- and middle-income countries (LIMCs), measuring early childhood development (ECD) with standard tests in large scale surveys and evaluations of interventions is difficult and expensive. Multi-dimensional screeners and single-domain tests ('short tests') are frequently used as alternatives. However, their validity in these circumstances is unknown. We examined the feasibility, reliability, and concurrent validity of three multi-dimensional screeners (Ages and Stages Questionnaires (ASQ-3), Denver Developmental Screening Test (Denver-II), Battelle Developmental Inventory screener (BDI-2)) and two single-domain tests (MacArthur-Bates Short-Forms (SFI and SFII), WHO Motor Milestones (WHO-Motor)) in 1,311 children 6-42 months in Bogota, Colombia. The scores were compared with those on the Bayley Scales of Infant and Toddler Development (Bayley-III), taken as the 'gold standard'. The Bayley-III was given at a center by psychologists; whereas the short tests were administered in the home by interviewers, as in a survey setting. Findings indicated good internal validity of all short tests except the ASQ-3. The BDI-2 took long to administer and was expensive, while the single-domain tests were quickest and cheapest and the Denver-II and ASQ-3 were intermediate. Concurrent validity of the multi-dimensional tests' cognitive, language, and fine motor scales with the corresponding Bayley-III scale was low below 19 months. However, it increased with age, becoming moderate-to-high over 30 months. In contrast, gross motor scales' concurrence was high under 19 months and then decreased. Of the single-domain tests, the WHO-Motor had high validity with gross motor under 16 months, and the SFI and SFII expressive scales showed moderate correlations with language under 30 months. Overall, the Denver-II was the most feasible and valid multi-dimensional test and the ASQ-3 performed poorly under 31 months. By domain, gross motor development had the highest concurrence
Cross-Cultural Validation of TEMAS, a Minority Projective Test.

Science.gov (United States)

Costantino, Giuseppe; And Others

The theoretical framework and cross-cultural validation of Tell-Me-A-Story (TEMAS), a projective test developed to measure personality development in ethnic minority children, is presented. The TEMAS test consists of 23 chromatic pictures which incorporate the following characteristics: (1) representation of antithetical concepts which the…
Some Findings from Thermal-Hydraulic Validation Tests for SMART Passive Safety System

Energy Technology Data Exchange (ETDEWEB)

Park, Hyun Sik; Bae, Hwang; Ryu, Sung-Uk; Ryu, Hyobong; Shin, Yong-Cheol; Min, Kyoung-Ho; Yi, Sung-Jae [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

2014-10-15

To satisfy the domestic and international needs for nuclear safety improvement after the Fukushima accident, an effort to improve its safety has been studied, and a Passive Safety System (PSS) for SMART has been designed. In addition, an Integral Test Loop for the SMART design (SMART-ITL, or FESTA) has been constructed and it finished its commissioning tests in 2012. Consequently, a set of Design Base Accident (DBA) scenarios have been simulated using SMART-ITL. Recently, a test program to validate the performance of the SMART PSS was launched and its scaled-down test facility was additionally installed at the existing SMART-ITL facility. The acquired data will be used to validate the safety analysis code and its related models, to evaluate the performance of the SMART PSS, and to provide base data during the application phase of SDA revision and construction licensing. In this paper, some findings from the validation tests of the SMART PSS during 2013-2014 are summarized. They include a couple of SMART PSS tests using active pumps and several 1-train SMART PSS tests. From the test results it was estimated that the SMART PSS has sufficient cooling capability to deal with the SBLOCA scenario of SMART. During the SBLOCA scenario, the water layer inventory in the CMT was well stratified thermally, and the safety injection water was injected efficiently into the RPV from the initial period and cooled down the RCS properly.
Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests.

Science.gov (United States)

Rijken, Noortje H; van Engelen, Baziel G; Weerdesteyn, Vivian; Geurts, Alexander C

2015-12-01

To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). Case-control study. University medical center. Patients with various severity levels of FSHD (n=9) and healthy control subjects (n=10) were included (N=19). Not applicable. A 4-point ordinal scale was designed to grade performance on the following 4 antigravity tests: sit to stance, stance to sit, step up, and step down. In addition, the 6-minute walk test, 10-m walking test, Berg Balance Scale, and timed Up and Go test were administered as conventional tests. Construct validity was determined by linear regression analysis using the Clinical Severity Score (CSS) as the dependent variable. Interrater agreement was tested using a κ analysis. Patients with FSHD performed worse on all 4 antigravity tests compared with the controls. Stronger correlations were found within than between test categories (antigravity vs conventional). The antigravity tests revealed the highest explained variance with regard to the CSS (R²=.86, P=.014). Interrater agreement was generally good. The results of this exploratory study support the construct validity and interrater reliability of the proposed antigravity tests for the assessment of functional capacity in patients with FSHD taking into account the use of compensatory strategies. Future research should further validate these results in a larger sample of patients with FSHD. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Validity, Reliability and Standardization Study of the Language Assessment Test for Aphasia

Directory of Open Access Journals (Sweden)

Bülent Toğram

2012-09-01

Full Text Available OBJECTIVE: Aphasia assessment is the first step towards a well-founded language therapy. Language tests need to consider cultural as well as typological linguistic aspects of a given language. This study was designed to determine the standardization, validity and reliability of the Language Assessment Test for Aphasia, which consists of eight subtests including spontaneous speech and language, auditory comprehension, repetition, naming, reading, grammar, speech acts, and writing. METHODS: The test was administered to 282 healthy participants and 92 aphasic participants in age, education and gender matched groups. The validity of the test was investigated through analysis of content, structure and criterion-related validity. For reliability of the test, analyses of internal consistency, stability and equivalence reliability were conducted. The influence of variables on healthy participants' subtest scores, test score and language score was examined. According to significant differences, norms and cut-off scores based on language score were determined. RESULTS: The group with aphasia performed markedly lower than healthy participants on subtest, test and language scores. The test scores of the healthy group were mostly affected by age and educational level but not affected by gender. According to significant differences, age and educational level for both groups were determined. Considering age and educational levels, the reference values for the cut-off scores were presented. CONCLUSION: The test was found to be a highly reliable and valid aphasia test for Turkish-speaking aphasic patients either in Turkey or other Turkish communities around the world
Recent trends on Software Verification and Validation Testing

International Nuclear Information System (INIS)

Kim, Hyungtae; Jeong, Choongheui

2013-01-01

Verification and Validation (V and V) include the analysis, evaluation, review, inspection, assessment, and testing of products. Testing in particular is an important method to verify and validate software. Software V and V testing covers everything from test planning to execution. IEEE Std. 1012 is a standard on software V and V. Recently, IEEE Std. 1012-2012 was published. This standard is a major revision to IEEE Std. 1012-2004, which defines only software V and V. It expands the scope of the V and V processes to include system and hardware as well as software. This standard describes the scope of V and V testing according to integrity level. In addition, the independent V and V requirements related to software V and V testing in IEEE 7-4.3.2-2010 have been revised. This paper presents recent trends in software V and V testing by reviewing IEEE Std. 1012-2012 and IEEE 7-4.3.2-2010. There are no major changes in software V and V testing activities and tasks in IEEE 1012-2012 compared with IEEE 1012-2004. But the positions on the responsibility to perform software V and V testing have changed. In addition, IEEE 7-4.3.2-2010 newly describes the positions on responsibility to perform software V and V testing. However, the positions of these standards on V and V testing are different. For integrity levels 3 and 4, IEEE 1012-2012 basically requires that the V and V organization shall conduct all V and V testing tasks such as test plan, test design, test case, and test procedure except test execution. If V and V testing is conducted by an organization other than the V and V organization, the results of that testing shall be analyzed by the V and V organization. For safety-related software, IEEE 7-4.3.2-2010 requires that test procedures and reports shall be independently verified by the alternate organization regardless of who writes the procedures and/or conducts the tests
Reliability and Validity of the Inline Skating Skill Test

Science.gov (United States)

Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje

2016-01-01

This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience, forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters. Key points: The study evaluated the reliability and construct validity of a newly developed inline skating skill test. The evaluated test is a first protocol designed to assess specific inline skating skill. Two groups of amateur skaters with different skating proficiency repeated the skill test on four separate occasions. The results suggest that the evaluated test is reliable and valid for evaluating inline skating skill in amateur skaters. PMID:27803616
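The comparison of typical error with the smallest worthwhile change (SWC) described in this abstract can be illustrated with a minimal sketch. The abstract only states that the SWC was calculated as standard deviations × 0.2. The typical-error formula used here (standard deviation of the paired differences divided by √2) is a common convention and an assumption, not something the abstract confirms, and the course times are hypothetical.

```python
import math
import statistics


def typical_error(session_a, session_b):
    """Assumed convention: SD of the paired differences divided by sqrt(2)."""
    diffs = [b - a for a, b in zip(session_a, session_b)]
    return statistics.stdev(diffs) / math.sqrt(2)


def smallest_worthwhile_change(scores, factor=0.2):
    """SWC = factor x between-subject SD (factor 0.2 as stated in the abstract)."""
    return factor * statistics.stdev(scores)


# Hypothetical course times in seconds for five skaters in two sessions
session_1 = [40.0, 50.0, 60.0, 70.0, 80.0]
session_2 = [40.5, 49.5, 60.5, 69.5, 80.5]

te = typical_error(session_1, session_2)
swc = smallest_worthwhile_change(session_1)

print(f"typical error = {te:.2f} s, SWC = {swc:.2f} s")
# One common reading: the test can track worthwhile changes when TE < SWC
print("typical error below SWC:", te < swc)
```

Under this reading, a typical error smaller than the SWC suggests that the measurement noise is small enough to detect meaningful changes in performance.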

Development and validation of a partial life-cycle test with Potamopyrgus antipodarum

DEFF Research Database (Denmark)

Geiss, Cornelia; Holbech, Henrik; Kinnberg, Karin Lund

endpoints. The present study aims to develop and validate the partial life-cycle test on the reproduction of P. antipodarum. Here, results from two pre-validation studies of the reproduction test with the chemicals tributyltin (TBT) with nominal concentrations of 10 - 400 ng TBT-Sn/L and cadmium...
Test your memory-Turkish version (TYM-TR): reliability and validity study of a cognitive screening test.

Science.gov (United States)

Maviş, Ilknur; Özbabalik Adapinar, Belgin Demet; Yenilmez, Çinar; Aydin, Ayşe; Olgun, Engin; Bal, Cengiz

2015-01-01

The Test Your Memory (TYM) is reported to be a sensitive cognitive function assessment scale for people with dementia. The aim of the present study was to investigate the reliability and validity of an adapted Turkish version of the TYM (TYM-TR) among Turkish dementia patients. The TYM-TR was given to 59 patients with dementia aged 60+ and 336 normal controls aged 23-75+. The diagnostic utility of the TYM-TR was compared with that of the Mini-Mental State Examination (MMSE) to validate it. The internal consistency of the TYM-TR was α = 0.85. The test-retest reliability was 0.97 (P reliability and validity to distinguish dementia in the Turkish population.
Urine specimen validity test for drug abuse testing in workplace and court settings.

Science.gov (United States)

Lin, Shin-Yu; Lee, Hei-Hwa; Lee, Jong-Feng; Chen, Bai-Hsiun

2018-01-01

In recent decades, urine drug testing in the workplace has become common in many countries in the world. There have been several studies concerning the use of the urine specimen validity test (SVT) for drug abuse testing administered in the workplace. However, very few data exist concerning the urine SVT on drug abuse tests from court specimens, including dilute, substituted, adulterated, and invalid tests. We investigated 21,696 submitted urine drug test samples for SVT from workplace and court settings in southern Taiwan over 5 years. All immunoassay screen-positive urine specimen drug tests were confirmed by gas chromatography/mass spectrometry. We found that the mean 5-year prevalence of tampering (dilute, substituted, or invalid tests) in urine specimens from the workplace and court settings was 1.09% and 3.81%, respectively. The mean 5-year percentages of dilute, substituted, and invalid urine specimens from the workplace were 89.2%, 6.8%, and 4.1%, respectively. The mean 5-year percentages of dilute, substituted, and invalid urine specimens from the court were 94.8%, 1.4%, and 3.8%, respectively. No adulterated cases were found among the workplace or court samples. The most common drug identified from the workplace specimens was amphetamine, followed by opiates. The most common drug identified from the court specimens was ketamine, followed by amphetamine. We suggest that all urine specimens taken for drug testing from both the workplace and court settings need to be tested for validity. Copyright © 2017. Published by Elsevier B.V.
Known-Groups and Concurrent Validity of the Mandarin Tone Identification Test (MTIT).

Directory of Open Access Journals (Sweden)

Shufeng Zhu

Full Text Available The Mandarin Tone Identification Test (MTIT) is a new test designed to assess the tone identification abilities of children with hearing impairment (HI). Evidence for reliability and sensitivity has been reported. The present study aimed to evaluate the known-groups and concurrent validity of the MTIT. The MTIT and Mandarin Pediatric Speech Intelligibility test (MPSI) were administered in quiet and in noise conditions. The known-groups validity was evaluated by comparing the performance of the MTIT on children with two different levels of HI. The MPSI was included to evaluate the concurrent validity of the MTIT. 81 children with HI were recruited in the present study. They were Mandarin-speaking children with profound HI (mean age = 9; 0, n = 41) and with moderate to severe HI (mean age = 8; 9, n = 40). Scores on the MTIT differed between the two groups with different hearing levels, suggesting good known-groups validity. A strong relationship between tone and sentence perception both in quiet and in noise provided preliminary evidence for concurrent validity. The present study confirmed that the MTIT has good known-groups validity and provided preliminary evidence for concurrent validity. The MTIT could be used with confidence to evaluate tone identification ability in children with HI.
Validation of Linguistic and Communicative Oral Language Tests for Spanish-English Bilingual Programs.

Science.gov (United States)

Politzer, Robert L.; And Others

1983-01-01

The development, administration, and scoring of a communicative test and its validation with tests of linguistic and sociolinguistic competence in English and Spanish are reported. Correlation with measures of home language use and school achievement are also presented, and issues of test validation for bilingual programs are discussed. (MSE)
[Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].

Science.gov (United States)

Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen

2018-05-01

Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross-sectional study of 163 inpatients. A reliability check was performed by determining the interrater reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater reliability (Kendall's W = .77) of suicide risk levels was obtained. Schizophrenic disorders showed the lowest values; for personality disorders we found the highest level of interrater reliability. When examining the criterion validity, Item 9 of the BDI-II was substantially correlated with our suicide risk levels (ρ m = .54, p validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed from schizophrenic disorders, which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients undergoing psychiatric treatment, owing to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.
Brief implicit association test: Validity and utility in prediction of voting behavior

Directory of Open Access Journals (Sweden)

Pavlović Maša D.

2013-01-01

Full Text Available We employed the Brief Implicit Association Test (a recently developed short version of the IAT) to measure implicit political attitudes toward four political parties running for Serbian parliament. To test its criterion validity, we measured voting intention and actual voting behavior. In addition, we introduced political involvement as a potential moderator of the BIAT’s predictive and incremental validity. The BIAT demonstrated good internal and predictive validity, but lacked incremental validity over self-report measures. The predictive power of the BIAT was moderated by political involvement: the BIAT scores were stronger predictors of voting intention and behavior among voters highly involved in politics. [Project of the Ministry of Science of the Republic of Serbia, no. 179018]
Validity of a cross-specialty test in basic laparoscopic techniques (TABLT)

DEFF Research Database (Denmark)

Thinggaard, Ebbe; Bjerrum, Flemming; Strandbygaard, Jeanett

2015-01-01

. The aim of this study was to establish validity evidence for the Training and Assessment of Basic Laparoscopic Techniques (TABLT) test, a tablet-based training system. METHODS: Laparoscopic surgeons and trainees were recruited from departments of general surgery, gynaecology and urology. Participants...... included novice, intermediate and experienced surgeons. All participants performed the TABLT test. Performance scores were calculated based on time taken and errors made. Evidence of validity was explored using a contemporary framework of validity. RESULTS: Some 60 individuals participated. The TABLT...... was shown to be reliable, with an intraclass correlation coefficient of 0·99 (P value of 0·73 (P
Evaluating the Predictive Validity of Graduate Management Admission Test Scores

Science.gov (United States)

Sireci, Stephen G.; Talento-Miller, Eileen

2006-01-01

Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test® (GMAT®) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…
Measurement of Dietary Restraint: Validity Tests of Four Questionnaires

Science.gov (United States)

Williamson, Donald A.; Martin, Corby K.; York-Crowe, Emily; Anton, Stephen D.; Redman, Leanne M.; Han, Hongmei; Ravussin, Eric

2007-01-01

This study tested the validity of four measures of dietary restraint: Dutch Eating Behavior Questionnaire, Eating Inventory (EI), Revised Restraint Scale (RS), and the Current Dieting Questionnaire. Dietary restraint has been implicated as a determinant of overeating and binge eating. Conflicting findings have been attributed to different methods for measuring dietary restraint. The validity of four self-report measures of dietary restraint and dieting behavior was tested using: 1) factor analysis, 2) changes in dietary restraint in a randomized controlled trial of different methods to achieve calorie restriction, and 3) correlation of changes in dietary restraint with an objective measure of energy balance, calculated from the changes in fat mass and fat-free mass over a six-month dietary intervention. Scores from all four questionnaires, measured at baseline, formed a dietary restraint factor, but the RS also loaded on a binge eating factor. Based on change scores, the EI Restraint scale was the only measure that correlated significantly with energy balance expressed as a percentage of energy required for weight maintenance. These findings suggest that, of the four questionnaires tested, the EI Restraint scale was the most valid measure of the intent to diet and actual caloric restriction. PMID:17101191
40 CFR 1039.501 - How do I run a valid emission test?

Science.gov (United States)

2010-07-01

... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test? 1039.501 Section 1039.501 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Procedures § 1039.501 How do I run a valid emission test? (a) Use the equipment and procedures for...
Validation testing of a soil macronutrient sensing system

Science.gov (United States)

Rapid on-site measurements of soil macronutrients (i.e., nitrogen, phosphorus, and potassium) are needed for site-specific crop management, where fertilizer nutrient application rates are adjusted spatially based on local requirements. This study reports on validation testing of a previously develop...
Validation of new prognostic and predictive scores by sequential testing approach

International Nuclear Information System (INIS)

Nieder, Carsten; Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid

2010-01-01

Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)
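The validation criterion in this abstract (a positive predictive value above 80%) can be illustrated with a minimal sketch. The abstract does not describe the stopping rule of the sequential testing approach, so this example only shows the PPV check itself. The patient counts are hypothetical, and "true positive" is assumed here to mean a patient for whom the score predicted short survival and short survival was observed.

```python
def positive_predictive_value(true_positives, false_positives):
    """PPV = TP / (TP + FP): share of positive predictions that were correct."""
    total = true_positives + false_positives
    if total == 0:
        raise ValueError("PPV is undefined without any positive predictions")
    return true_positives / total


# Hypothetical local cohort: 13 patients predicted to have short survival,
# of whom 11 actually did (assumed counts, not taken from the study)
ppv = positive_predictive_value(true_positives=11, false_positives=2)
meets_threshold = ppv > 0.80  # threshold as stated in the abstract

print(f"PPV = {ppv:.3f}, exceeds 80%: {meets_threshold}")
```

In a sequential setting this check would presumably be repeated as each new patient is entered, but the decision boundaries would have to come from the method the abstract refers to.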
Validation of new prognostic and predictive scores by sequential testing approach

Energy Technology Data Exchange (ETDEWEB)

Nieder, Carsten [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway); Inst. of Clinical Medicine, Univ. of Tromso (Norway); Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway)

2010-03-15

Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)
Validity of the Eating Attitudes Test and the Eating Disorders Inventory in Bulimia Nervosa.

Science.gov (United States)

Gross, Janet; And Others

1986-01-01

Assessed criterion and concurrent validity of the Eating Attitudes Test and the Eating Disorder Inventory in 82 women with bulimia nervosa. Both tests demonstrated criterion validity by discriminating bulimia nervosa subjects from normals. Only weak support was found for concurrent validity within bulimia subjects. Recommends combination of…
The Ostomy Adjustment Scale: translation into Norwegian language with validation and reliability testing.

Science.gov (United States)

Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin

2014-01-01

The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. The instrument is
Multi Directional Repeated Sprint Is a Valid and Reliable Test for Assessment of Junior Handball Players

Directory of Open Access Journals (Sweden)

Amin Daneshfar

2018-04-01

Full Text Available The aim of the present study was to examine the validity and reliability of a 10 × (6 × 5 m) multi-directional repeated sprint ability test (RSM) in elite young team handball (TH) players. Participants were members of the Iranian national team (n = 20, age 16.4 ± 0.7 years, weight 82.5 ± 5.5 kg, height 184.8 ± 4.6 cm, body fat 15.4 ± 4.3%). The validity of RSM was tested against a 10 × (15 + 15 m) repeated sprint ability test (RSA), Yo-Yo Intermittent Recovery test Level 1 (Yo-Yo IR1), squat jump (SJ) and countermovement jump (CMJ). To test the reliability of RSM, the participants repeated the testing sessions of RSM and RSA 1 week later. Both RSA and RSM tests showed good to excellent reliability of the total time (TT), best time (BT), and weakest time (WT). The results of the correlation analysis showed significant inverse correlations between maximum aerobic capacity and TT in RSA (r = −0.57, p ≤ 0.05) and RSM (r = −0.76, p ≤ 0.01). There was also a significant inverse correlation between maximum aerobic capacity with fatigue index (FI) in RSA test (r = −0.64, p ≤ 0.01) and in RSM test (r = −0.53, p ≤ 0.05). BT, WT, and TT of RSA were largely-to-very largely correlated with BT (r = 0.58, p ≤ 0.01), WT (r = 0.62, p ≤ 0.01), and TT (r = 0.65, p ≤ 0.01) of RSM. BT in RSM was also correlated with FI in RSM (r = 0.88, p ≤ 0.01). In conclusion, based on the findings of the current study, the recently developed RSM test is a valid and reliable test and should be utilized for assessment of repeated sprint ability in handball players.
Multi Directional Repeated Sprint Is a Valid and Reliable Test for Assessment of Junior Handball Players

Science.gov (United States)

Daneshfar, Amin; Gahreman, Daniel E.; Koozehchian, Majid S.; Amani Shalamzari, Sadegh; Hassanzadeh Sablouei, Mozhgan; Rosemann, Thomas; Knechtle, Beat; Nikolaidis, Pantelis T.

2018-01-01

The aim of the present study was to examine the validity and reliability of a 10 × (6 × 5 m) multi-directional repeated sprint ability test (RSM) in elite young team handball (TH) players. Participants were members of the Iranian national team (n = 20, age 16.4 ± 0.7 years, weight 82.5 ± 5.5 kg, height 184.8 ± 4.6 cm, body fat 15.4 ± 4.3%). The validity of RSM was tested against a 10 × (15 + 15 m) repeated sprint ability test (RSA), Yo-Yo Intermittent Recovery test Level 1 (Yo-Yo IR1), squat jump (SJ) and countermovement jump (CMJ). To test the reliability of RSM, the participants repeated the testing sessions of RSM and RSA 1 week later. Both RSA and RSM tests showed good to excellent reliability of the total time (TT), best time (BT), and weakest time (WT). The results of the correlation analysis showed significant inverse correlations between maximum aerobic capacity and TT in RSA (r = −0.57, p ≤ 0.05) and RSM (r = −0.76, p ≤ 0.01). There was also a significant inverse correlation between maximum aerobic capacity with fatigue index (FI) in RSA test (r = −0.64, p ≤ 0.01) and in RSM test (r = −0.53, p ≤ 0.05). BT, WT, and TT of RSA were largely-to-very largely correlated with BT (r = 0.58, p ≤ 0.01), WT (r = 0.62, p ≤ 0.01), and TT (r = 0.65, p ≤ 0.01) of RSM. BT in RSM was also correlated with FI in RSM (r = 0.88, p ≤ 0.01). In conclusion, based on the findings of the current study, the recently developed RSM test is a valid and reliable test and should be utilized for assessment of repeated sprint ability in handball players. PMID:29670536
Cross-Cultural Adaptation, Validation, and Reliability Testing of the Modified Oswestry Disability Questionnaire in Persian Population with Low Back Pain.

Science.gov (United States)

Baradaran, Aslan; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza

2016-04-01

Prospective study. We aimed to validate the Persian version of the modified Oswestry disability questionnaire (MODQ) in patients with low back pain. Modified Oswestry low back pain disability questionnaire is a well-known condition-specific outcome measure that helps quantify disability in patients with lumbar syndromes. To test the validity in a pilot study, the Persian MODQ was administered to 25 individuals with low back pain. We then enrolled 200 consecutive patients with low back pain to fill the Persian MODQ as well as the short form 36 (SF-36) questionnaire. Convergent validity of the MODQ was tested using the Spearman's correlation coefficient between the MODQ and SF-36 subscales. Intraclass correlation coefficient (ICC) and Cronbach's α coefficient were measured to test the reliability between test and retest and internal consistency of all items, respectively. ICC for individual items ranged from 0.43 to 0.80 showing good reliability and reproducibility of each individual item. Cronbach's α coefficient was 0.69 showing good internal consistency across all 10 items of the Persian MODQ. Total MODQ score showed moderate to strong correlation with the eight subscales and the two domains of the SF-36. The highest correlation was between the MODQ and the physical functioning subscale of the SF-36 (r=-0.54, pPersian version of the MODQ is a valid and reliable tool for the assessment of the disability following low back pain.
Video game addiction test: validity and psychometric characteristics.

NARCIS (Netherlands)

Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de

2012-01-01

The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

Video Game Addiction Test: Validity and Psychometric Characteristics

NARCIS (Netherlands)

Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de

2012-01-01

The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study
Six-minute stepper test: a valid clinical exercise tolerance test for COPD patients

Directory of Open Access Journals (Sweden)

Grosbois JM

2016-03-01

.005. Performances on the 6MST and 6MWT were significantly improved after PR (570 vs 488 steps, P=0.001 and 448 vs 406 m, respectively; P<0.0001). Improvements of the 6MST and 6MWT after PR were significantly correlated (r=0.34; P=0.03). Conclusion: The results of this study show that the 6MST is a valid test to evaluate exercise tolerance in COPD patients. The use of this test in clinical practice appears to be particularly relevant for the assessment of patients managed by home PR. Keywords: 6-minute stepper test, 6-minute walk test, exercise tolerance, pulmonary rehabilitation, cardiopulmonary exercise testing, validity
Development and validation status of the IFMIF High Flux Test Module

International Nuclear Information System (INIS)

Arbeiter, Frederik; Abou-Sena, Ali; Chen Yuming; Dolensky, Bernhard; Heupel, Tobias; Klein, Christine; Scheel, Nicola; Schlindwein, Georg

2011-01-01

The development of the IFMIF (International Fusion Material Irradiation Facility) High Flux Test Module in the EVEDA (Engineering Validation and Engineering Design Activities) phase up to 2013 includes conceptual design, engineering analyses, as well as design and engineering validation by building of prototypes and their testing. The High Flux Test Module is the device to facilitate the irradiation of SSTT samples of RAFM steels at temperatures 250-550 deg. C and up to an accumulated irradiation damage of 150 dpa. The requirements, the current design and the performance of the module are discussed, and the development process is outlined.
Development and validation status of the IFMIF High Flux Test Module

Energy Technology Data Exchange (ETDEWEB)

Arbeiter, Frederik, E-mail: frederik.arbeiter@kit.edu [Karlsruhe Institute of Technology, Institute for Neutron Physics and Reactor Technology (KIT-INR), Karlsruhe (Germany); Abou-Sena, Ali; Chen Yuming; Dolensky, Bernhard; Heupel, Tobias; Klein, Christine; Scheel, Nicola; Schlindwein, Georg [Karlsruhe Institute of Technology, Institute for Neutron Physics and Reactor Technology (KIT-INR), Karlsruhe (Germany)

2011-10-15

The development of the IFMIF (International Fusion Material Irradiation Facility) High Flux Test Module in the EVEDA (Engineering Validation and Engineering Design Activities) phase up to 2013 includes conceptual design, engineering analyses, as well as design and engineering validation by building of prototypes and their testing. The High Flux Test Module is the device to facilitate the irradiation of SSTT samples of RAFM steels at temperatures 250-550 deg. C and up to an accumulated irradiation damage of 150 dpa. The requirements, the current design and the performance of the module are discussed, and the development process is outlined.
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan

Directory of Open Access Journals (Sweden)

Moore AL

2018-02-01

Full Text Available Amy Lawson Moore, Terissa M Miller. Gibson Institute of Cognitive Research, Colorado Springs, CO, USA. Purpose: The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. Methods: This study included 2,737 participants aged 5–85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test–retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results: Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test–retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. Conclusion: The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan. Keywords: testing, cognitive skills, memory, processing speed, visual processing, auditory processing
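Coefficient alpha, which this abstract reports as the internal consistency measure, can be sketched as follows. This is the standard Cronbach's alpha formula, α = k/(k−1) × (1 − Σ item variances / variance of total scores). The item scores are hypothetical and are not data from the study, and the abstract does not say how the authors computed their coefficients.

```python
import statistics


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items, each a list of scores
    with one entry per respondent (same respondent order in every item)."""
    k = len(items)
    if k < 2:
        raise ValueError("at least two items are required")
    # Sample variance of each item across respondents
    item_variances = [statistics.variance(item) for item in items]
    # Total score per respondent, then the variance of those totals
    totals = [sum(scores) for scores in zip(*items)]
    total_variance = statistics.variance(totals)
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: 3 items answered by 4 respondents
items = [
    [1, 2, 3, 4],
    [2, 3, 4, 5],
    [1, 3, 3, 5],
]

alpha = cronbach_alpha(items)
print(f"Cronbach's alpha = {alpha:.3f}")
```

Values closer to 1 indicate that the items vary together, which is how coefficients such as the 0.87 to 0.98 range reported above are usually read.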
The ad-libitum alcohol 'taste test': secondary analyses of potential confounds and construct validity.

Science.gov (United States)

Jones, Andrew; Button, Emily; Rose, Abigail K; Robinson, Eric; Christiansen, Paul; Di Lemma, Lisa; Field, Matt

2016-03-01

Motivation to drink alcohol can be measured in the laboratory using an ad-libitum 'taste test', in which participants rate the taste of alcoholic drinks whilst their intake is covertly monitored. Little is known about the construct validity of this paradigm. The objective of this study was to investigate variables that may compromise the validity of this paradigm and its construct validity. We re-analysed data from 12 studies from our laboratory that incorporated an ad-libitum taste test. We considered time of day and participants' awareness of the purpose of the taste test as potential confounding variables. We examined whether gender, typical alcohol consumption, subjective craving, scores on the Alcohol Use Disorders Identification Test and perceived pleasantness of the drinks predicted ad-libitum consumption (construct validity). We included 762 participants (462 female). Participant awareness and time of day were not related to ad-libitum alcohol consumption. Males drank significantly more alcohol than females (p alcohol consumption (p = 0.04), craving (p alcohol consumption. The construct validity of the taste test was supported by relationships between ad-libitum consumption and typical alcohol consumption, craving and pleasantness ratings of the drinks. The ad-libitum taste test is a valid method for the assessment of alcohol intake in the laboratory.
Validity and test-retest reliability of a novel simple back extensor muscle strength test.

Science.gov (United States)

Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

2017-01-01

To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r = 0.824, p strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p strength ( p strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.
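The abstract above reports limits of agreement and a mean bias but does not state how they were computed. A common approach is the Bland-Altman method, sketched here under that assumption; the function name and the input values are hypothetical.

```python
from statistics import mean, stdev


def limits_of_agreement(session1, session2, z=1.96):
    """Return (lower limit, mean bias, upper limit) for paired measurements.

    Assumes the Bland-Altman definition: bias +/- z * SD of the differences.
    """
    diffs = [b - a for a, b in zip(session1, session2)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias - z * sd, bias, bias + z * sd


# Hypothetical strength readings in kg from two sessions (illustrative only).
lower, bias, upper = limits_of_agreement([10.0, 20.0, 30.0], [11.0, 22.0, 30.0])
print(lower, bias, upper)
```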
Investigation of reliability, validity and normality Persian version of the California Critical Thinking Skills Test; Form B (CCTST)

Directory of Open Access Journals (Sweden)

Khallli H

2003-04-01

Full Text Available Background: To evaluate the effectiveness of present educational programs in terms of students' achievement of problem-solving, decision-making and critical thinking skills, reliable, valid and standardized instruments are needed. Purposes: To investigate the reliability, validity and norms of the CCTST Form B. The California Critical Thinking Skills Test contains 34 multiple-choice questions, each with one correct answer, in the five Critical Thinking (CT) cognitive skills domains. Methods: The translated CCTST Form B was given to 405 BSN nursing students of nursing faculties located in Tehran (Tehran, Iran and Shahid Beheshti Universities) who were selected through random sampling. In order to determine face and content validity, the test was translated and edited by Persian and English language professors and researchers. It was also confirmed by the judgments of a panel of medical education experts and psychology professors. CCTST reliability was determined with internal consistency using KR-20. The construct validity of the test was investigated with factor analysis, internal consistency and group difference. Results: The reliability coefficient of the test was 0.62. Factor analysis indicated that the CCTST is formed from 5 factors (elements), namely: Analysis, Evaluation, Inference, Inductive and Deductive Reasoning. The internal consistency method showed that all subscales had high and positive correlations with the total test score. The group difference method between nursing and philosophy students (n=50) indicated that there is a meaningful difference between nursing and philosophy students' scores (t=-4.95, p=0.0001). Score percentile norms also show that the 50th percentile corresponds to a raw score of 11, and the 95th and 5th percentiles correspond to raw scores of 17 and 6, respectively. Conclusions: The results revealed that the test is sufficiently reliable as a research tool, and all subscales measure a single construct (Critical Thinking) and are able to distinguish the
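The abstract above determines reliability with KR-20. As a minimal sketch, assuming each item is scored 1 (correct) or 0 (incorrect), the Kuder-Richardson Formula 20 can be computed as follows; the function name and the small response matrix are hypothetical and unrelated to the CCTST data.

```python
from statistics import pvariance


def kr20(responses):
    """Kuder-Richardson Formula 20 for rows of 0/1 item scores."""
    n = len(responses)
    k = len(responses[0])
    # Proportion of respondents answering each item correctly.
    p = [sum(row[i] for row in responses) / n for i in range(k)]
    sum_pq = sum(pi * (1 - pi) for pi in p)
    # Population variance of total scores, consistent with the p*q item variances.
    total_var = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum_pq / total_var)


# Hypothetical answers: 4 respondents x 3 items (illustrative only).
demo = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(kr20(demo))
```

The formula is undefined when every respondent has the same total score, because the total-score variance is then zero.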
Vancomycin-resistant enterococci: validation of susceptibility testing and in vitro activity of novel antibiotics

DEFF Research Database (Denmark)

Rathe, Mathias; Lise, Kristensen; Ellermann-Eriksen, Svend

Vancomycin-resistant enterococci: validation of susceptibility testing and in vitro activity of novel antibiotics...
Italian validation of the Purpose In Life (PIL) test and the Seeking Of Noetic Goals (SONG) test in a population of cancer patients.

Science.gov (United States)

Brunelli, C; Bianchi, E; Murru, L; Monformoso, P; Bosisio, M; Gangeri, L; Miccinesi, G; Scrignaro, M; Ripamonti, C; Borreani, C

2012-11-01

The first instruments developed to evaluate specific logotherapeutic dimensions were the Purpose In Life (PIL) and the Seeking Of Noetic Goals (SONG) tests, designed to reflect Frankl's concepts of, respectively, meaning in life attainment and will to meaning. This study aims to perform the Italian cultural adaptation and the psychometric validation of the PIL and SONG questionnaires. We administered the PIL and SONG, culturally adapted into the Italian language, to 266 cancer patients. The psychometric validation appraised construct validity, internal consistency, test-retest reliability, known-group validity, and convergent validity of the two questionnaires with respect to one another. The factorial analysis indicates that the original single-factor solution can be maintained for both instruments (proportion of variance explained by the first factor 77% and 71% for the PIL and SONG, respectively). The results show excellent internal consistency (Cronbach's alpha of 0.91 for the PIL and 0.90 for the SONG) and test-retest reliability (intraclass correlation coefficient of 0.92 for the PIL and 0.81 for the SONG). As expected, males, believers, patients nearer to the diagnosis, and patients not undergoing psychological therapy have higher PIL and lower SONG scores, while expectations for age were not confirmed. The average level for the PIL was 107.3, while for the SONG, it was 66.1, and a negative correlation (-0.47) between PIL and SONG scores indicates good convergent validity of the two instruments. Italian versions of the PIL and SONG are adequate and reliable self-report instruments for evaluating purpose in life and the motivation to find purpose for cancer patient populations.
[Comparison of the Wechsler Memory Scale-III and the Spain-Complutense Verbal Learning Test in acquired brain injury: construct validity and ecological validity].

Science.gov (United States)

Luna-Lario, P; Pena, J; Ojeda, N

2017-04-16

To perform an in-depth examination of the construct validity and the ecological validity of the Wechsler Memory Scale-III (WMS-III) and the Spain-Complutense Verbal Learning Test (TAVEC). The sample consists of 106 adults with acquired brain injury who were treated in the Area of Neuropsychology and Neuropsychiatry of the Complejo Hospitalario de Navarra and displayed memory deficit as the main sequela, measured by means of specific memory tests. The construct validity is determined by examining the tasks required in each test over the basic theoretical models, comparing the performance according to the parameters offered by the tests, contrasting the severity indices of each test and analysing their convergence. The external validity is explored through the correlation between the tests and by using regression models. According to the results obtained, both the WMS-III and the TAVEC have construct validity. The TAVEC is more sensitive and captures not only the deficits in mnemonic consolidation, but also in the executive functions involved in memory. The working memory index of the WMS-III is useful for predicting the return to work at two years after the acquired brain injury, but none of the instruments anticipates the disability and dependence at least six months after the injury. We reflect upon the construct validity of the tests and their insufficient capacity to predict functionality when the sequelae become chronic.
Development and Validation of a Food-Associated Olfactory Test (FAOT).

Science.gov (United States)

Denzer-Lippmann, Melanie Yvonne; Beauchamp, Jonathan; Freiherr, Jessica; Thuerauf, Norbert; Kornhuber, Johannes; Buettner, Andrea

2017-01-01

Olfactory tests are an important tool in human nutritional research for studying food preferences, yet comprehensive tests dedicated solely to food odors are currently lacking. Therefore, within this study, an innovative food-associated olfactory test (FAOT) system was developed. The FAOT comprises 16 odorant pens that contain representative food odors relating to different macronutrient classes. The test underwent a sensory validation based on identification rate, intensity, hedonic value, and food association scores. The accuracy of the test was further compared to the accuracy of the established Sniffin' Sticks identification test. The identification rates and intensities of this new FAOT were found to be comparable to the Sniffin' Sticks olfactory identification test. The odorant pens were also assessed chemo-analytically and were found to be chemically stable for at least 24 weeks. Overall, this new identification test for use in assessing olfaction in a food-associated context is valid both in terms of its use in sensory perception studies and its chemical stability. The FAOT is particularly suited to examinations of the sense of smell regarding food odors. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Suitability Screening Test for Marine Corps Air Traffic Controllers Phase 3: Non-cognitive Test Validation and Cognitive Test Prototype

Science.gov (United States)

2014-06-01

developed, pilot tested, and in its Beta form. Findings or Results The subset of NCAPS traits that demonstrated statistically significant prediction for...development and initial pilot testing of the Prototype Marine ATC Cognitive Test. Method The validation approach chosen for this project was a criterion... multitasking ability, and 5) inductive reasoning ability. A working memory capacity test was developed because working memory has been linked to
40 CFR 1048.501 - How do I run a valid emission test?

Science.gov (United States)

2010-07-01

... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test... § 1048.501 How do I run a valid emission test? (a) Use the equipment and procedures for spark-ignition... 86.132-96(h) and then operate the engine for 60 minutes over repeat runs of the duty cycle specified...
Unit testing, model validation, and biological simulation.

Science.gov (United States)

Sarma, Gopal P; Jacobs, Travis W; Watts, Mark D; Ghayoomie, S Vahid; Larson, Stephen D; Gerkin, Richard C

2016-01-01

The growth of the software industry has gone hand in hand with the development of tools and cultural practices for ensuring the reliability of complex pieces of software. These tools and practices are now acknowledged to be essential to the management of modern software. As computational models and methods have become increasingly common in the biological sciences, it is important to examine how these practices can accelerate biological software development and improve research quality. In this article, we give a focused case study of our experience with the practices of unit testing and test-driven development in OpenWorm, an open-science project aimed at modeling Caenorhabditis elegans. We identify and discuss the challenges of incorporating test-driven development into a heterogeneous, data-driven project, as well as the role of model validation tests, a category of tests unique to software which expresses scientific models.
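The distinction drawn above between ordinary unit tests and model validation tests can be illustrated with a toy example. This is a sketch only: the `relax` model, its parameters and the "observed" range are hypothetical and are not taken from OpenWorm or its test suite.

```python
import math
import unittest


def relax(v0, tau, t):
    """Toy model: exponential relaxation of a quantity toward zero."""
    return v0 * math.exp(-t / tau)


class UnitTests(unittest.TestCase):
    """Checks that the code does what the programmer intended."""

    def test_initial_value_is_returned_at_time_zero(self):
        self.assertEqual(relax(5.0, 2.0, 0.0), 5.0)


class ModelValidationTests(unittest.TestCase):
    """Checks that the model output agrees with (hypothetical) observations."""

    OBSERVED_RANGE = (1.5, 2.2)  # hypothetical experimental bounds

    def test_value_after_one_time_constant(self):
        predicted = relax(5.0, 2.0, 2.0)
        low, high = self.OBSERVED_RANGE
        self.assertTrue(low <= predicted <= high)
```

Saved to a file, these tests can be run with `python -m unittest`. The first class fails only when the code is wrong; the second can fail even when the code is correct, if the model itself disagrees with the data.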
Portuguese validation of the children's eating attitudes test

Directory of Open Access Journals (Sweden)

Maria Del Carmen Bento Teixeira

2012-01-01

Full Text Available BACKGROUND: The Eating Attitudes Test (EAT) is the most widely used instrument for evaluating eating disorders in adults and adolescents in a variety of cultures and samples. OBJECTIVE: The aim of this study was to analyse the psychometric properties of the Portuguese version of the Children's Eating Attitudes Test (ChEAT). METHOD: Nine hundred and fifty-six Portuguese secondary students (565 girls and 391 boys) answered the ChEAT. The test-retest reliability was obtained with data from 206 participants from the total sample who re-answered the questionnaire after 4-6 weeks. Psychometric analyses were carried out for the total sample and separately for girls and boys. RESULTS: Internal consistency and test-retest reliability were satisfactory. Principal components factorial analysis yielded four factors in the total sample, accounting for 42.35% of the total variance. Factor structure was similar in the total sample and in both genders. Factors were labelled: F1 "Fear of Getting Fat", F2 "Restrictive and Purgative Behaviours", F3 "Food Preoccupation" and F4 "Social Pressure to Eat". The concurrent validity, explored using the Contour Drawing Figure Rating Scale (CDRS), was high. DISCUSSION: The Portuguese version of the ChEAT is a valid and useful instrument for the evaluation of abnormal eating attitudes and behaviours among Portuguese adolescents.
Simulation tests for cervical nonorganic signs: a study of face validity.

Science.gov (United States)

Vernon, Howard; Proctor, Dan; Bakalovski, Dianna; Moreton, Jesse

2010-01-01

The purpose of this study was to develop and determine the face validity of additional cervical nonorganic simulation tests. Four simulation tests were either selected from the literature or newly designed: simulated sitting trunk/shoulder rotation (SR; test no. 1), active vs passive cervical rotation (CR; test no. 2), Libman's test (LT; test no. 3) of pressure over the mastoid process, and side-lying passive shoulder abduction (SA; test no. 4). Three groups, 1 without neck pain (n = 44) and 2 with neck pain (n = 43 and 27), were formed. Outcome measures consisted of questions on provocation of pain (Yes/No) and appropriateness (Yes/No) as well as measurements of cervical rotation (goniometric) and pressure pain threshold (pressure algometer). Group test responses were evaluated and scored. A threshold of acceptance was established at 80% agreement for face validity. Ranges of rotation and pressure threshold values were analyzed with the Student t test. In nonneck pain subjects, all 4 tests were rated as nonpainful and 3 were rated as "appropriate" for neck pain examination (not SR). In neck pain subjects, this test and SA were rated as nonpainful, whereas LT was rated as painful in 26% of subjects. Only CR and LT were rated as "appropriate." In neck pain subjects, passive rotations exceeded actives by 10% to 14% (P = .000). On a second round of testing with a slightly modified method, SR and SA achieved acceptable "appropriateness." Once 2 tests were slightly modified, all 4 tests were found to have acceptable face validity. Further research into the reliability of these tests as well as into the combinations of these tests is warranted. Copyright 2010 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
Validity of the American Sign Language Discrimination Test

Science.gov (United States)

Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A.

2016-01-01

American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…
Functional Literacy Tests: A Case of Anticipatory Validity?

Science.gov (United States)

Anderson, Lorin W.; Anderson, Jo Craig

1981-01-01

Development of the mathematics functional literacy test (MFLT) is described, issues of predictive and content validity are discussed, and implications for educational policy are presented. Ten basic skill areas identified by the National Council of Supervisors of Mathematics were used as the basis for the development of the MFLT. (RL)
Test-retest reliability and construct validity of the DOiT (Dutch Obesity Intervention in Teenagers) questionnaire: measuring energy balance-related behaviours in Dutch adolescents.

Science.gov (United States)

Janssen, Evelien H C; Singh, Amika S; van Nassau, Femke; Brug, Johannes; van Mechelen, Willem; Chinapaw, Mai J M

2014-02-01

Adequate assessment of energy balance-related behaviours in adolescents is essential to develop and evaluate effective obesity prevention programmes. The present study examined the test-retest reliability and construct validity of a questionnaire assessing energy balance-related behaviours in adolescents during the evaluation of the DOiT (Dutch Obesity Intervention in Teenagers) intervention. To assess test-retest reliability, adolescents filled in the questionnaire twice (n 111). To assess construct validity, the results from the first test were compared with data collected in a personal cognitive interview (n 20, independent from the reliability study). For both reliability and validity, intraclass correlation coefficients for continuous data or Cohen's kappa coefficients for categorical data were calculated as well as percentage agreement. Data were collected during school time from February to May 2010. Study participants were Dutch adolescents aged 12-14 years attending pre-vocational secondary schools. In more than three-quarters of the ninety-five questionnaire items the test-retest reliability appeared to be good to excellent. Moderate reliability was found for all other twenty-one items. Fifty-one items (of ninety-five items) showed good to excellent construct validity. Construct validity appeared moderate in twenty-three items and poor in twenty-one items. Most items with poor construct validity concerned consumption of sugar-containing beverages and high-energy snacks/sweets. Our study showed good test-retest reliability and largely moderate to good construct validity for the majority of items of the DOiT questionnaire. Items with poor construct validity (most of them found for items concerning energy intake-related behaviours) should be revised and tested again to improve the questionnaire for future use.
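The abstract above uses Cohen's kappa and percentage agreement for categorical items. A minimal sketch of both statistics for two sets of paired categorical answers follows; the function names and the answers are hypothetical, and the study's own software and any weighting scheme are not stated in the abstract.

```python
from collections import Counter


def percent_agreement(first, second):
    """Share of paired answers that are identical."""
    return sum(a == b for a, b in zip(first, second)) / len(first)


def cohens_kappa(first, second):
    """Unweighted Cohen's kappa: agreement corrected for chance."""
    n = len(first)
    observed = percent_agreement(first, second)
    counts_first, counts_second = Counter(first), Counter(second)
    categories = set(first) | set(second)
    expected = sum(counts_first[c] * counts_second[c] for c in categories) / n**2
    return (observed - expected) / (1 - expected)


# Hypothetical test and retest answers to one questionnaire item.
test_answers = ["yes", "yes", "no", "no"]
retest_answers = ["yes", "no", "no", "no"]
print(percent_agreement(test_answers, retest_answers))
print(cohens_kappa(test_answers, retest_answers))
```

Kappa is undefined when the expected agreement equals 1, for example when every answer in both rounds falls in a single category.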

Validation of a diabetes numeracy test in Arabic

OpenAIRE

Alghodaier, Hussah; Jradi, Hoda; Mohammad, Najwa Samantha; Bawazir, Amen

2017-01-01

Background: The prevalence of diabetes mellitus in Saudi Arabia is 24%, ranking it among the top ten worldwide. Diabetes education focuses on self-management and relies on numeracy skills. Poor numeracy may go unrecognized, and it is important to have an assessment tool in Arabic to measure such a skill in diabetes care. Objectives: To validate a 15-item Diabetes Numeracy Test (DNT-15) in the Arabic language as a tool to assess the numeracy skills of patients with diabetes and to test its proper...
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

OpenAIRE

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administer...
Intra-tester Reliability and Construct Validity of a Hip Abductor Eccentric Strength Test.

Science.gov (United States)

Brindle, Richard A; Ebaugh, D David; Milner, Clare E

2017-11-15

Side-lying hip abductor strength tests are commonly used to evaluate muscle strength. In a 'break' test the tester applies sufficient force to lower the limb to the table while the patient resists. The peak force is postulated to occur while the leg is lowering, thus representing the participant's eccentric muscle strength. However, it is unclear whether peak force occurs before or after the leg begins to lower. To determine intra-rater reliability and construct validity of a hip abductor eccentric strength test. Intra-rater reliability and construct validity study. Twenty healthy adults (26 ±6 years; 1.66 ±0.06 m; 62.2 ±8.0 kg) made two visits to the laboratory at least one week apart. During the hip abductor eccentric strength test, a hand-held dynamometer recorded peak force and time to peak force and limb position was recorded via a motion capture system. Intra-rater reliability was determined using intra-class correlation (ICC), standard error of measurement (SEM), and minimal detectable difference (MDD). Construct validity was assessed by determining if peak force occurred after the start of the lowering phase using a one-sample t-test. The hip abductor eccentric strength test had substantial intra-rater reliability (ICC( 3,3 ) = 0.88; 95% confidence interval: 0.65-0.95), SEM of 0.9%BWh, and a MDD of 2.5%BWh. Construct validity was established as peak force occurred 2.1s (±0.6s; range 0.7s to 3.7s) after the start of the lowering phase of the test (p ≤ 0.001). The hip abductor eccentric strength test is a valid and reliable measure of eccentric muscle strength. This test may be used clinically to assess changes in eccentric muscle strength over time.
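The abstract above reports a standard error of measurement (SEM) and a minimal detectable difference (MDD) without giving formulas. The sketch below assumes the commonly used definitions SEM = SD × sqrt(1 − ICC) and MDD = 1.96 × sqrt(2) × SEM; whether the authors used exactly these is not stated, and the function names are hypothetical. Under this assumption, the reported SEM of 0.9 %BWh gives an MDD of about 2.5 %BWh, in line with the reported value.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from the sample SD and the ICC."""
    return sd * math.sqrt(1 - icc)


def mdd_from_sem(sem, z=1.96):
    """Minimal detectable difference at the confidence level implied by z."""
    return z * math.sqrt(2) * sem


# SEM as reported in the abstract (0.9 %BWh); the formula is an assumption.
print(round(mdd_from_sem(0.9), 1))
```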
The Unified Language Testing Plan: Speaking Proficiency Test. Spanish and English Pilot Validation Studies. Report Number 1.

Science.gov (United States)

Thornton, Julie A.

This report describes one segment of the Federal Language Testing Board's Unified Language Testing Plan (ULTP), the validation of speaking proficiency tests in Spanish and English. The ULTP is a project to increase standardization of foreign language proficiency measurement and promote sharing of resources among testing programs in the federal…
Screening for cognitive impairment in older individuals. Validation study of a computer-based test.

Science.gov (United States)

Green, R C; Green, J; Harrison, J M; Kutner, M H

1994-08-01

This study examined the validity of a computer-based cognitive test that was recently designed to screen the elderly for cognitive impairment. Criterion-related validity was examined by comparing test scores of impaired patients and normal control subjects. Construct-related validity was computed through correlations between computer-based subtests and related conventional neuropsychological subtests. University center for memory disorders. Fifty-two patients with mild cognitive impairment by strict clinical criteria and 50 unimpaired, age- and education-matched control subjects. Control subjects were rigorously screened by neurological, neuropsychological, imaging, and electrophysiological criteria to identify and exclude individuals with occult abnormalities. Using a cut-off total score of 126, this computer-based instrument had a sensitivity of 0.83 and a specificity of 0.96. Using a prevalence estimate of 10%, predictive values, positive and negative, were 0.70 and 0.96, respectively. Computer-based subtests correlated significantly with conventional neuropsychological tests measuring similar cognitive domains. Thirteen (17.8%) of 73 volunteers with normal medical histories were excluded from the control group, with unsuspected abnormalities on standard neuropsychological tests, electroencephalograms, or magnetic resonance imaging scans. Computer-based testing is a valid screening methodology for the detection of mild cognitive impairment in the elderly, although this particular test has important limitations. Broader applications of computer-based testing will require extensive population-based validation. Future studies should recognize that normal control subjects without a history of disease who are typically used in validation studies may have a high incidence of unsuspected abnormalities on neurodiagnostic studies.
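The predictive values quoted above depend on the assumed prevalence as well as on sensitivity and specificity. The sketch below applies Bayes' rule to the rounded figures from the abstract; the function name is hypothetical. With these rounded inputs the positive predictive value comes out near 0.70, matching the abstract, while the negative predictive value computed this way comes out near 0.98 rather than the reported 0.96, which may reflect rounding or a different calculation in the original paper.

```python
def predictive_values(sensitivity, specificity, prevalence):
    """Return (positive predictive value, negative predictive value)."""
    true_pos = sensitivity * prevalence
    false_neg = (1 - sensitivity) * prevalence
    true_neg = specificity * (1 - prevalence)
    false_pos = (1 - specificity) * (1 - prevalence)
    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Rounded inputs taken from the abstract: sensitivity 0.83, specificity 0.96,
# prevalence estimate 10%.
ppv, npv = predictive_values(0.83, 0.96, 0.10)
print(round(ppv, 2), round(npv, 2))
```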
Experimental validation of a new heterogeneous mechanical test design

Science.gov (United States)

Aquino, J.; Campos, A. Andrade; Souto, N.; Thuillier, S.

2018-05-01

Standard material parameter identification strategies generally use an extensive number of classical tests to collect the required experimental data. However, the scientific and industrial communities have recently made a great effort to base this experimental database on heterogeneous tests. These tests can provide richer information on the material behavior, allowing the identification of a more complete set of material parameters. This is a result of the recent development of full-field measurement techniques, like digital image correlation (DIC), that can capture the heterogeneous deformation fields on the specimen surface during the test. Recently, new specimen geometries were designed to enhance the richness of the strain field and capture supplementary strain states. The butterfly specimen is an example of these new geometries, designed through a numerical optimization procedure that uses an indicator capable of evaluating the heterogeneity and the richness of strain information. However, no experimental validation has yet been performed. The aim of this work is to experimentally validate the heterogeneous butterfly mechanical test in the parameter identification framework. To this end, the DIC technique and a Finite Element Model Updating inverse strategy are used together for the parameter identification of a DC04 steel, as well as for the calculation of the indicator. The experimental tests are carried out in a universal testing machine with the ARAMIS measuring system to provide the strain states on the specimen surface. The identification strategy is accomplished with the data obtained from the experimental tests and the results are compared to a reference numerical solution.
Validation of a Paper and Pencil Test Battery for the Diagnosis of Minimal Hepatic Encephalopathy in Korea.

Science.gov (United States)

Jeong, Jae Yoon; Jun, Dae Won; Bai, Daiseg; Kim, Ji Yean; Sohn, Joo Hyun; Ahn, Sang Bong; Kim, Sang Gyune; Kim, Tae Yeob; Kim, Hyoung Su; Jeong, Soung Won; Cho, Yong Kyun; Song, Do Seon; Kim, Hee Yeon; Jung, Young Kul; Yoon, Eileen L

2017-09-01

The aim of this study was to validate a new paper and pencil test battery to diagnose minimal hepatic encephalopathy (MHE) in Korea. A new paper and pencil test battery was composed of number connection test-A (NCT-A), number connection test-B (NCT-B), digit span test (DST), and symbol digit modality test (SDMT). The norm of the new test was based on 315 healthy individuals between the ages of 20 and 70 years old. Another 63 healthy subjects (n = 31) and cirrhosis patients (n = 32) were included as a validation cohort. All participants completed the new paper and pencil test, a critical flicker frequency (CFF) test and computerized cognitive function test (visual continuous performance test [CPT]). The scores on the NCT-A and NCT-B increased but those of DST and SDMT decreased according to age. Twelve of the cirrhotic patients (37.5%) were diagnosed with MHE based on the new paper and pencil test battery. The total score of the paper and pencil test battery showed good positive correlation with the CFF (r = 0.551, P cognitive function test. Also, this score was lower in patients with MHE compared to those without MHE (P cognitive test decreased significantly in patients with MHE compared to those without MHE. Test-retest reliability was comparable. In conclusion, the new paper and pencil test battery including NCT-A, NCT-B, DST, and SDMT showed good correlation with neuropsychological tests. This new paper and pencil test battery could help to discriminate patients with impaired cognitive function in cirrhosis (registered at Clinical Research Information Service [CRIS], https://cris.nih.go.kr/cris, KCT0000955). © 2017 The Korean Academy of Medical Sciences.
Validation of RNAi Silencing Efficiency Using Gene Array Data shows 18.5% Failure Rate across 429 Independent Experiments

Directory of Open Access Journals (Sweden)

Gyöngyi Munkácsy

2016-01-01

Full Text Available No independent cross-validation of success rate for studies utilizing small interfering RNA (siRNA) for gene silencing has been completed before. To assess the influence of experimental parameters like cell line, transfection technique, validation method, and type of control, we have to validate these in a large set of studies. We utilized gene chip data published for siRNA experiments to assess success rate and to compare methods used in these experiments. We searched NCBI GEO for samples with whole transcriptome analysis before and after gene silencing and evaluated the efficiency for the target and off-target genes using the array-based expression data. Wilcoxon signed-rank test was used to assess silencing efficacy and Kruskal–Wallis tests and Spearman rank correlation were used to evaluate study parameters. Altogether, 1,643 samples representing 429 experiments published in 207 studies were evaluated. The fold change (FC) of down-regulation of the target gene was above 0.7 in 18.5% and was above 0.5 in 38.7% of experiments. Silencing efficiency was lowest in MCF7 and highest in SW480 cells (FC = 0.59 and FC = 0.30, respectively, P = 9.3E−06). Studies utilizing Western blot for validation performed better than those with quantitative polymerase chain reaction (qPCR) or microarray (FC = 0.43, FC = 0.47, and FC = 0.55, respectively, P = 2.8E−04). There was no correlation between type of control, transfection method, publication year, and silencing efficiency. Although gene silencing is a robust feature successfully cross-validated in the majority of experiments, efficiency remained insufficient in a significant proportion of studies. Selection of cell line model and validation method had the highest influence on silencing proficiency.
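The failure rate above is defined through a fold-change cutoff. As a minimal sketch, assuming FC is the ratio of target-gene expression after silencing to expression before it on a linear (not log) scale, the share of experiments exceeding a cutoff can be computed as follows; the function names and expression values are hypothetical, and the study's actual normalization pipeline is not described in the abstract.

```python
def fold_change(expr_before, expr_after):
    """Ratio of expression after silencing to expression before (linear scale)."""
    return expr_after / expr_before


def failure_rate(pairs, threshold=0.7):
    """Fraction of experiments whose fold change stays above the threshold."""
    fold_changes = [fold_change(before, after) for before, after in pairs]
    return sum(fc > threshold for fc in fold_changes) / len(fold_changes)


# Hypothetical (before, after) expression values for four experiments.
experiments = [(100.0, 30.0), (100.0, 80.0), (200.0, 100.0), (50.0, 45.0)]
print(failure_rate(experiments))
```

A fold change near 1 means the target was barely silenced, so a higher cutoff is a stricter definition of failure and flags fewer experiments.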
Exploring the Reliability and Validity of the Social-Moral Awareness Test

Science.gov (United States)

Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth

2012-01-01

Background: The aim of the study was to explore the validity of the social-moral awareness test (SMAT), a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor…
Experimental Testing Procedures and Dynamic Model Validation for Vanadium Redox Flow Battery Storage System

DEFF Research Database (Denmark)

Baccino, Francesco; Marinelli, Mattia; Nørgård, Per Bromand

2013-01-01

The paper aims at characterizing the electrochemical and thermal parameters of a 15 kW/320 kWh vanadium redox flow battery (VRB) installed in the SYSLAB test facility of the DTU Risø Campus and experimentally validating the proposed dynamic model realized in Matlab-Simulink. The adopted testing...... efficiency of the battery system. The test procedure has general validity and could also be used for other storage technologies. The storage model proposed and described is suitable for electrical studies and can represent a general model in terms of validity. Finally, the model simulation outputs...
Phase 1 Validation Testing and Simulation for the WEC-Sim Open Source Code

Science.gov (United States)

Ruehl, K.; Michelen, C.; Gunawan, B.; Bosma, B.; Simmons, A.; Lomonaco, P.

2015-12-01

WEC-Sim is an open source code to model wave energy converter performance in operational waves, developed by Sandia and NREL and funded by the US DOE. The code is a time-domain modeling tool developed in MATLAB/SIMULINK using the multibody dynamics solver SimMechanics, and solves the WEC's governing equations of motion using the Cummins time-domain impulse response formulation in 6 degrees of freedom. The WEC-Sim code has undergone verification through code-to-code comparisons; however, validation of the code has been limited to publicly available experimental data sets. While these data sets provide preliminary code validation, the experimental tests were not explicitly designed for code validation, and as a result are limited in their ability to validate the full functionality of the WEC-Sim code. Therefore, dedicated physical model tests for WEC-Sim validation have been performed. This presentation provides an overview of the WEC-Sim validation experimental wave tank tests performed at the Oregon State University's Directional Wave Basin at Hinsdale Wave Research Laboratory. Phase 1 of experimental testing was focused on device characterization and completed in Fall 2015. Phase 2 is focused on WEC performance and scheduled for Winter 2015/2016. These experimental tests were designed explicitly to validate the performance of the WEC-Sim code and its new feature additions. Upon completion, the WEC-Sim validation data set will be made publicly available to the wave energy community. For the physical model test, a controllable model of a floating wave energy converter has been designed and constructed. The instrumentation includes state-of-the-art devices to measure pressure fields, motions in 6 DOF, multi-axial load cells, torque transducers, position transducers, and encoders. The model also incorporates a fully programmable Power-Take-Off system which can be used to generate or absorb wave energy. Numerical simulations of the experiments using WEC-Sim will be
The Stick Design Test on the assessment of older adults with low formal education: evidences of construct, criterion-related and ecological validity.

Science.gov (United States)

de Paula, Jonas Jardim; Costa, Mônica Vieira; Bocardi, Matheus Bortolosso; Cortezzi, Mariana; De Moraes, Edgar Nunes; Malloy-Diniz, Leandro Fernandes

2013-12-01

The assessment of visuospatial abilities is usually performed by drawing tasks. In patients with very low formal education, the use of these tasks might be biased by their cultural background. The Stick Design Test was developed for the assessment of this population. We aim to expand the test's psychometric properties by assessing its construct, criterion-related and ecological validity in older adults with low formal education. Healthy older adults (n = 63) and Alzheimer's disease patients (n = 92) performed the Stick Design Test, Mini-Mental State Examination, Digit Span Forward and the Clock Drawing Test. Their caregivers answered measures of Personal Care and Instrumental Activities of Daily Living (ADL). Construct validity was assessed by factor analysis, convergent correlations (with the Clock Drawing Test), and divergent correlations (with Digit Span Forward); criterion-related validity by receiver operating characteristic curve analysis and binary logistic regression; and ecological validity by correlations with ADL. The test factor structure was composed of one component (R² = 64%). Significant correlations with the Clock Drawing Test and Digit Span Forward were found, and the relationship was stronger with the first measure. The test was less associated with formal education than the Clock Drawing Test. It classified about 76% of the participants correctly and had an additive effect with the Mini-Mental State Examination (84% of correct classification). The test also correlated significantly with measures of ADL, suggesting ecological validity. The Stick Design Test shows evidence of construct, criterion-related and ecological validity. It is an interesting alternative to drawing tasks for the assessment of visuospatial abilities.
Test of Creative Imagination: Validity and Reliability Study

Science.gov (United States)

Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel

2013-01-01

The purpose of this study was to investigate the validity and reliability of the Test of Creative Imagination. This study was conducted with the participation of 1000 children aged 9-14 who were studying in six primary schools in the city center of Denizli Province, chosen by cluster ratio sampling. In the study, it was revealed that the…
Reliability and validity of two isometric squat tests.

Science.gov (United States)

Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U

2002-05-01

The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p… squat and FHS test performances (r… squat and FHS test performance can be attributed to differences in the movement patterns of the tests
Validity and reliability of skill-related fitness tests for wheelchair-using youth with Spina Bifida.

NARCIS (Netherlands)

Bloemen, M.A.; Takken, T.; Backx, F.J.; Vos, M.; Kruitwagen, C.L.; Groot, J.F. de

2017-01-01

Objectives: To determine content validity of the Muscle Power Sprint Test (MPST), and construct validity and reliability of the MPST, 10x5 Meter Sprint Test (10x5MST), slalom test, and One Stroke Push Test (1SPT) in wheelchair-using youth with spina bifida (SB). Design: Clinimetric study. Setting:
Validity and Reliability of Skill-Related Fitness Tests for Wheelchair-Using Youth With Spina Bifida

NARCIS (Netherlands)

Bloemen, Manon A.; Takken, Tim; Backx, Frank J.; Vos, Marleen; Kruitwagen, Cas L.; de Groot, Janke F.

OBJECTIVE: To determine content validity of the Muscle Power Sprint Test (MPST) and construct validity and reliability of the MPST, 10x5 Meter Sprint Test (10x5MST), slalom test and one stroke push test (1SPT) in wheelchair-using youth with spina bifida (SB). DESIGN: Clinimetric study SETTING:
Validity and Reliability of Skill-Related Fitness Tests for Wheelchair-Using Youth with Spina Bifida

NARCIS (Netherlands)

Cas L.J.J. Kruitwagen; Frank J.G. Backx; Tim Takken; Janke de Groot; Marleen Vos; Manon A.T. Bloemen

2016-01-01

Objective: To determine content validity of the Muscle Power Sprint Test (MPST) and construct validity and reliability of the MPST, 10x5 Meter Sprint Test (10x5MST), slalom test and one stroke push test (1SPT) in wheelchair-using youth with spina bifida (SB). Design: Clinimetric study Setting:
Comprehension of Written Grammar Test: Reliability and Known-Groups Validity Study With Hearing and Deaf and Hard-of-Hearing Students.

Science.gov (United States)

Cannon, Joanna E; Hubley, Anita M; Millhoff, Courtney; Mazlouman, Shahla

2016-01-01

The aim of the current study was to gather validation evidence for the Comprehension of Written Grammar (CWG; Easterbrooks, 2010) receptive test of 26 grammatical structures of English print for use with children who are deaf and hard of hearing (DHH). Reliability and validity data were collected for 98 participants (49 DHH and 49 hearing) in Grades 2-6. The objectives were to: (a) examine 4-week test-retest reliability data; and (b) provide evidence of known-groups validity by examining expected differences between the groups on the CWG vocabulary pretest and main test, as well as selected structures. Results indicated excellent test-retest reliability estimates for CWG test scores. DHH participants performed statistically significantly lower on the CWG vocabulary pretest and main test than the hearing participants. Significantly lower performance by DHH participants on most expected grammatical structures (e.g., basic sentence patterns, auxiliary "be" singular/plural forms, tense, comparatives, and complementation) also provided known groups evidence. Overall, the findings of this study showed strong evidence of the reliability of scores and known group-based validity of inferences made from the CWG. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

Science.gov (United States)

Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

2017-10-23

Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was ICC(2,3) = 0.700 (p…) and test-retest reliability was ICC(3,3) = 0.953 (p…). The criterion validity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggest that the DNS-HS test was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Assessing cultural validity in standardized tests in STEM education

Science.gov (United States)

Gassant, Lunes

This quantitative ex post facto study examined how race and gender, as elements of culture, influence the development of common misconceptions among STEM students. Primary data came from a standardized test: the Digital Logic Concept Inventory (DLCI) developed by Drs. Geoffrey L. Herman, Michael C. Louis, and Craig Zilles from the University of Illinois at Urbana-Champaign. The sample consisted of a cohort of 82 STEM students recruited from three universities in Northern Louisiana. Microsoft Excel and the Statistical Package for the Social Sciences (SPSS) were used for data computation. Two key concepts, several sub-concepts, and 19 misconceptions were tested through 11 items in the DLCI. Statistical analyses based on both Classical Test Theory (Spearman, 1904) and Item Response Theory (Lord, 1952) yielded similar results: some misconceptions in the DLCI can reliably be predicted by the race or the gender of the test taker. The research is significant because it has shown that some misconceptions in a STEM discipline attracted students differently depending on their ethnic background, which points to some cultural bias in the standardized test. Therefore, the study encourages further research on cultural validity in standardized tests. With culturally valid tests, it will be possible to increase the effectiveness of targeted teaching and learning strategies for STEM students from diverse ethnic backgrounds. To some extent, this dissertation has contributed to a better understanding of the gap between high enrollment rates and low graduation rates among African American students and also among other minority students in STEM disciplines.

Translation and validation of the Malay version of the Stroke Knowledge Test.

Science.gov (United States)

Sowtali, Siti Noorkhairina; Yusoff, Dariah Mohd; Harith, Sakinah; Mohamed, Monniaty

2016-04-01

To date, there is a lack of published studies on assessment tools to evaluate the effectiveness of stroke education programs. This study developed and validated the Malay language version of the Stroke Knowledge Test research instrument. This study involved translation, validity, and reliability phases. The instrument underwent backward and forward translation of the English version into the Malay language. Nine experts reviewed the content for consistency, clarity, difficulty, and suitability for inclusion. Perceived usefulness and utilization were obtained from experts' opinions. Later, face validity assessment was conducted with 10 stroke patients to determine the appropriateness of the sentences and grammar used. A pilot study was conducted with 41 stroke patients to determine the item analysis and reliability of the translated instrument using the Kuder-Richardson 20 or Cronbach's alpha. The final Malay version of the Stroke Knowledge Test included 20 items with good content coverage, acceptable item properties, and positive expert review ratings. Psychometric investigations suggest that the Malay version of the Stroke Knowledge Test had moderate reliability, with a Kuder-Richardson 20 or Cronbach's alpha of 0.58. Improvement is required for Stroke Knowledge Test items with unacceptable difficulty indices. Overall, the average ratings of perceived usefulness and perceived utility of the instrument were both 72.7%, suggesting that reviewers were likely to use the instrument in their facilities. The Malay version of the Stroke Knowledge Test was a valid and reliable tool to assess educational needs and to evaluate stroke knowledge among participants of group-based stroke education programs in Malaysia.
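The reliability coefficient named in the abstract above can be illustrated with a minimal sketch of the textbook Kuder-Richardson 20 formula for items scored 0/1 (for such items it coincides with Cronbach's alpha). The function name `kr20` and the toy response matrices are hypothetical; this is not the authors' analysis or data.

```python
from typing import Sequence


def kr20(responses: Sequence[Sequence[int]]) -> float:
    """Kuder-Richardson 20 for dichotomous items.

    responses[i][j] is 1 if respondent i answered item j correctly, else 0.
    Hypothetical helper for illustration only.
    """
    n = len(responses)
    k = len(responses[0])
    if n < 2 or k < 2:
        raise ValueError("need at least 2 respondents and 2 items")

    # Proportion of correct answers per item (p_j); q_j = 1 - p_j.
    p = [sum(row[j] for row in responses) / n for j in range(k)]
    sum_pq = sum(pj * (1.0 - pj) for pj in p)

    # Population variance of the total scores (same convention as p_j * q_j).
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    if var_total == 0:
        raise ValueError("total scores have zero variance")

    return (k / (k - 1)) * (1.0 - sum_pq / var_total)


if __name__ == "__main__":
    # Toy data: items that always agree with each other.
    consistent = [[1, 1, 1], [0, 0, 0], [1, 1, 1], [0, 0, 0]]
    print(kr20(consistent))
```

Values near 1 indicate that the items rank respondents consistently; the 0.58 reported in the abstract sits well below that.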
Victoria Symptom Validity Test performance in children and adolescents with neurological disorders.

Science.gov (United States)

Brooks, Brian L

2012-12-01

It is becoming increasingly important to study, use, and promote the utility of measures that are designed to detect non-compliance with testing (i.e., poor effort, symptom non-validity, response bias) as part of neuropsychological assessments with children and adolescents. Several measures have evidence for use in pediatrics, but there is a paucity of published support for the Victoria Symptom Validity Test (VSVT) in this population. The purpose of this study was to examine the performance on the VSVT in a sample of pediatric patients with known neurological disorders. The sample consisted of 100 consecutively referred children and adolescents between the ages of 6 and 19 years (mean = 14.0, SD = 3.1) with various neurological diagnoses. On the VSVT total items, 95% of the sample had performance in the "valid" range, with 5% being deemed "questionable" and 0% deemed "invalid". On easy items, 97% were "valid", 2% were "questionable", and 1% was "invalid." For difficult items, 84% were "valid," 16% were "questionable," and 0% was "invalid." For those patients given two effort measures (i.e., VSVT and Test of Memory Malingering; n = 65), none was identified as having poor test-taking compliance on both measures. VSVT scores were significantly correlated with age, intelligence, processing speed, and functional ratings of daily abilities (attention, executive functioning, and adaptive functioning), but not with objective performance on measures of sustained attention, verbal memory, or visual memory. The VSVT has potential to be used in neuropsychological assessments with pediatric patients.
Development and validation of a dissolution test for lodenafil carbonate based on in vivo data.

Science.gov (United States)

Codevilla, Cristiane Franco; Castilhos, Tamara dos Santos; Cirne, Carolina Araújo; Froehlich, Pedro Eduardo; Bergold, Ana Maria

2014-04-01

Lodenafil carbonate is a phosphodiesterase type 5 inhibitor used for the treatment of erectile dysfunction. Currently, there is no dissolution test reported for lodenafil carbonate and this drug is not listed in any pharmacopoeia. The present study focused on the development and validation of a dissolution test for lodenafil carbonate tablets, using a simulated absorption profile based on in vivo data. The appropriate conditions were determined after testing sink conditions. Different conditions, such as medium, surfactant concentration and rotation speed, were evaluated. The percentage of dose absorbed was calculated by deconvolution, using the Wagner-Nelson method. According to the obtained results, the use of 0.1 M HCl + 1.5% SLS (900 mL, at 37 ± 0.5 °C) as the dissolution medium and paddles at 25 rpm were considered adequate. The samples were quantified by UV spectroscopy at 295 nm and the validation was performed according to international guidelines. The method showed specificity, linearity, accuracy and precision within the acceptable range. Kinetics of drug release was better described by the first-order model. The proposed dissolution test can be used for the routine quality control of lodenafil carbonate in tablets.
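The Wagner-Nelson method mentioned above can be sketched as follows, assuming a one-compartment model with a known first-order elimination rate constant. The formula is the textbook one, F(t) = (C(t) + ke·AUC(0..t)) / (ke·AUC(0..∞)); the function name `wagner_nelson`, the trapezoidal integration, the tail extrapolation C_last/ke, and the synthetic profile are all assumptions for illustration, not the authors' procedure or data.

```python
import math
from typing import List, Sequence


def wagner_nelson(times: Sequence[float], conc: Sequence[float], ke: float) -> List[float]:
    """Fraction of dose absorbed at each sampling time (Wagner-Nelson).

    times: sampling times, strictly increasing and starting at 0
    conc:  plasma concentrations at those times
    ke:    first-order elimination rate constant (assumed known, e.g. from
           the terminal slope of the log-concentration curve)
    """
    if len(times) != len(conc) or len(times) < 2:
        raise ValueError("times and conc must have the same length (>= 2)")
    if ke <= 0:
        raise ValueError("ke must be positive")

    # Cumulative AUC from 0 to each time point, by the trapezoidal rule.
    auc = [0.0]
    for i in range(1, len(times)):
        step = times[i] - times[i - 1]
        auc.append(auc[-1] + 0.5 * (conc[i] + conc[i - 1]) * step)

    # Extrapolate the tail beyond the last sample assuming pure elimination.
    auc_inf = auc[-1] + conc[-1] / ke

    return [(c + ke * a) / (ke * auc_inf) for c, a in zip(conc, auc)]


if __name__ == "__main__":
    # Synthetic profile with first-order absorption (ka) and elimination (ke);
    # for this model the exact fraction absorbed is 1 - exp(-ka * t).
    ka, ke = 1.0, 0.1
    times = [i * 0.5 for i in range(97)]
    conc = [ka / (ka - ke) * (math.exp(-ke * t) - math.exp(-ka * t)) for t in times]
    for t, f in list(zip(times, wagner_nelson(times, conc, ke)))[:5]:
        print(f"t={t:4.1f}  absorbed={f:.3f}  exact={1 - math.exp(-ka * t):.3f}")
```

The resulting absorption-versus-time curve is the kind of in vivo profile against which candidate dissolution conditions can be compared.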
Ecological validity of the five digit test and the oral trails test.

Science.gov (United States)

Paiva, Gabrielle Chequer de Castro; Fialho, Mariana Braga; Costa, Danielle de Souza; Paula, Jonas Jardim de

2016-01-01

Tests evaluating the attentional-executive system are widely used in clinical practice. However, the proximity of an objective cognitive test to real-world situations (ecological validity) is not frequently investigated. The present study evaluated the association between measures of the Five Digit Test (FDT) and the Oral Trails Test (OTT) and self-reported cognitive failures in everyday life as measured by the Cognitive Failures Questionnaire (CFQ). Brazilian adults from 18 to 65 years old voluntarily performed the FDT and OTT tests and reported the frequency of cognitive failures in their everyday life through the CFQ. After controlling for the age effect, the measures of controlled attentional processes were associated with cognitive failures, yet the cognitive flexibility measures of both the FDT and OTT accounted for the majority of variance in most aspects of the CFQ factors. The FDT and the OTT measures were predictive of real-world problems such as cognitive failures in everyday activities/situations.
Integration and validation testing for PhEDEx, DBS and DAS with the PhEDEx LifeCycle agent

International Nuclear Information System (INIS)

Boeser, C; Chwalek, T; Giffels, M; Kuznetsov, V; Wildish, T

2014-01-01

The ever-increasing amount of data handled by the CMS dataflow and workflow management tools poses new challenges for cross-validation among different systems within CMS experiment at LHC. To approach this problem we developed an integration test suite based on the LifeCycle agent, a tool originally conceived for stress-testing new releases of PhEDEx, the CMS data-placement tool. The LifeCycle agent provides a framework for customising the test workflow in arbitrary ways, and can scale to levels of activity well beyond those seen in normal running. This means we can run realistic performance tests at scales not likely to be seen by the experiment for some years, or with custom topologies to examine particular situations that may cause concern some time in the future. The LifeCycle agent has recently been enhanced to become a general purpose integration and validation testing tool for major CMS services. It allows cross-system integration tests of all three components to be performed in controlled environments, without interfering with production services. In this paper we discuss the design and implementation of the LifeCycle agent. We describe how it is used for small-scale debugging and validation tests, and how we extend that to large-scale tests of whole groups of sub-systems. We show how the LifeCycle agent can emulate the action of operators, physicists, or software agents external to the system under test, and how it can be scaled to large and complex systems.
Integration and validation testing for PhEDEx, DBS and DAS with the PhEDEx LifeCycle agent

Science.gov (United States)

Boeser, C.; Chwalek, T.; Giffels, M.; Kuznetsov, V.; Wildish, T.

2014-06-01

The ever-increasing amount of data handled by the CMS dataflow and workflow management tools poses new challenges for cross-validation among different systems within CMS experiment at LHC. To approach this problem we developed an integration test suite based on the LifeCycle agent, a tool originally conceived for stress-testing new releases of PhEDEx, the CMS data-placement tool. The LifeCycle agent provides a framework for customising the test workflow in arbitrary ways, and can scale to levels of activity well beyond those seen in normal running. This means we can run realistic performance tests at scales not likely to be seen by the experiment for some years, or with custom topologies to examine particular situations that may cause concern some time in the future. The LifeCycle agent has recently been enhanced to become a general purpose integration and validation testing tool for major CMS services. It allows cross-system integration tests of all three components to be performed in controlled environments, without interfering with production services. In this paper we discuss the design and implementation of the LifeCycle agent. We describe how it is used for small-scale debugging and validation tests, and how we extend that to large-scale tests of whole groups of sub-systems. We show how the LifeCycle agent can emulate the action of operators, physicists, or software agents external to the system under test, and how it can be scaled to large and complex systems.
Validation of a Spanish version of the Test Your Memory.

Science.gov (United States)

Ferrero-Arias, J; Turrión-Rojo, M Á

2016-01-01

To validate a Spanish version of the TYM, a self-administered cognitive screening test designed for the detection of Alzheimer's disease and mild cognitive defect. A cross-sectional study was conducted in a neurology outpatient clinic. The TYM was administered to individuals of 50 years or more who came to the clinic for any symptom. Their cognitive state was evaluated regardless of the outcome of the TYM. They were categorized into 3 groups: 1) cognitively normal (739), 2) with mild cognitive impairment (183), 3) with dementia (127). An analysis of items was made and the psychometric properties of the TYM were defined. There was a cross-validation, and the predictive validity of the TYM score, adjusted to the demographic variables, was determined by evaluating its performance in ROC curves. The internal consistency, interobserver reliability, and short-term and long-term test-retest reliability were adequate. The TYM correlated with the MMSE (r=0.779, P…
Use of the color trails test as an embedded measure of performance validity.

Science.gov (United States)

Henry, George K; Algina, James

2013-01-01

One hundred personal injury litigants and disability claimants referred for a forensic neuropsychological evaluation were administered both portions of the Color Trails Test (CTT) as part of a more comprehensive battery of standardized tests. Subjects who failed two or more free-standing tests of cognitive performance validity formed the Failed Performance Validity (FPV) group, while subjects who passed all free-standing performance validity measures were assigned to the Passed Performance Validity (PPV) group. A cutscore of ≥45 seconds to complete Color Trails 1 (CT1) was associated with a classification accuracy of 78%, good sensitivity (66%) and high specificity (90%), while a cutscore of ≥84 seconds to complete Color Trails 2 (CT2) was associated with a classification accuracy of 82%, good sensitivity (74%) and high specificity (90%). A CT1 cutscore of ≥58 seconds, and a CT2 cutscore ≥100 seconds was associated with 100% positive predictive power at base rates from 20 to 50%.
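How sensitivity, specificity and base rate combine into classification accuracy and positive predictive power can be illustrated with a short sketch. The function names are hypothetical, and the 50% base rate used in the example is an assumption (the abstract does not report group sizes); under that assumption the reported sensitivities and specificities reproduce the stated accuracies of 78% and 82%.

```python
def classification_accuracy(sensitivity: float, specificity: float, base_rate: float) -> float:
    """Overall proportion classified correctly at a given base rate."""
    return sensitivity * base_rate + specificity * (1.0 - base_rate)


def positive_predictive_value(sensitivity: float, specificity: float, base_rate: float) -> float:
    """Probability that a positive (failed) result is a true positive."""
    true_pos = sensitivity * base_rate
    false_pos = (1.0 - specificity) * (1.0 - base_rate)
    if true_pos + false_pos == 0:
        raise ValueError("no positive results expected; PPV is undefined")
    return true_pos / (true_pos + false_pos)


if __name__ == "__main__":
    # CT1 cutscore from the abstract: sensitivity 66%, specificity 90%.
    # The base rates below are illustrative assumptions.
    for base_rate in (0.2, 0.3, 0.4, 0.5):
        acc = classification_accuracy(0.66, 0.90, base_rate)
        ppv = positive_predictive_value(0.66, 0.90, base_rate)
        print(f"base rate {base_rate:.0%}: accuracy {acc:.2f}, PPV {ppv:.2f}")
```

Positive predictive power falls as the base rate falls unless specificity is 100%, in which case it stays at 100% for any base rate; that is the situation the more conservative cutscores in the abstract describe.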
Validation of the OECD reproduction test guideline with the New Zealand mudsnail Potamopyrgus antipodarum using trenbolone and prochloraz.

Science.gov (United States)

Geiß, Cornelia; Ruppert, Katharina; Askem, Clare; Barroso, Carlos; Faber, Daniel; Ducrot, Virginie; Holbech, Henrik; Hutchinson, Thomas H; Kajankari, Paula; Kinnberg, Karin Lund; Lagadic, Laurent; Matthiessen, Peter; Morris, Steve; Neiman, Maurine; Penttinen, Olli-Pekka; Sanchez-Marin, Paula; Teigeler, Matthias; Weltje, Lennart; Oehlmann, Jörg

2017-04-01

The Organisation for Economic Cooperation and Development (OECD) provides several standard test methods for the environmental hazard assessment of chemicals, mainly based on primary producers, arthropods, and fish. In April 2016, two new test guidelines with two mollusc species representing different reproductive strategies were approved by OECD member countries. One test guideline describes a 28-day reproduction test with the parthenogenetic New Zealand mudsnail Potamopyrgus antipodarum. The main endpoint of the test is reproduction, reflected by the embryo number in the brood pouch per female. The development of a new OECD test guideline involves several phases including inter-laboratory validation studies to demonstrate the robustness of the proposed test design and the reproducibility of the test results. Therefore, a ring test of the reproduction test with P. antipodarum was conducted including eight laboratories with the test substances trenbolone and prochloraz and results are presented here. Most laboratories could meet test validity criteria, thus demonstrating the robustness of the proposed test protocol. Trenbolone did not have an effect on the reproduction of the snails at the tested concentration range (nominal: 10-1000 ng/L). For prochloraz, laboratories produced similar EC10 and NOEC values, showing the inter-laboratory reproducibility of results. The average EC10 and NOEC values for reproduction (with coefficient of variation) were 26.2 µg/L (61.7%) and 29.7 µg/L (32.9%), respectively. This ring test shows that the mudsnail reproduction test is a well-suited tool for use in the chronic aquatic hazard and risk assessment of chemicals.
Validity, Reliability, and Sensitivity of a Volleyball Intermittent Endurance Test.

Science.gov (United States)

Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl

2017-03-01

To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes were also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P… volleyball players.
Excellent cross-cultural validity, intra-test reliability and construct validity of the Dutch Rivermead Mobility Index in patients after stroke undergoing rehabilitation

NARCIS (Netherlands)

Roorda, Leo D.; Green, John; De Kluis, Kiki R. A.; Molenaar, Ivo W.; Bagley, Pam; Smith, Jane; Geurts, Alexander C. H.

2008-01-01

Objective: To investigate the cross-cultural validity of international Dutch-English comparisons when using the Dutch Rivermead Mobility Index (RMI), and the intra-test reliability and construct validity of the Dutch RMI. Methods: Cross-cultural validity was studied in a combined data-set of Dutch
A comparison between the original and Tablet-based Symbol Digit Modalities Test in patients with schizophrenia: Test-retest agreement, random measurement error, practice effect, and ecological validity.

Science.gov (United States)

Tang, Shih-Fen; Chen, I-Hui; Chiang, Hsin-Yu; Wu, Chien-Te; Hsueh, I-Ping; Yu, Wan-Hui; Hsieh, Ching-Lin

2017-11-27

We aimed to compare the test-retest agreement, random measurement error, practice effect, and ecological validity of the original and Tablet-based Symbol Digit Modalities Test (T-SDMT) over five serial assessments, and to examine the concurrent validity of the T-SDMT in patients with schizophrenia. Sixty patients with chronic schizophrenia completed five serial assessments (one week apart) of the SDMT and T-SDMT and one assessment of the Activities of Daily Living Rating Scale III at the first time point. Both measures showed high test-retest agreement and similar levels of random measurement error over five serial assessments. Moreover, the practice effects of the two measures did not reach a plateau phase after five serial assessments in young and middle-aged participants. Nevertheless, only the practice effect of the T-SDMT became trivial after the first assessment. Like the SDMT, the T-SDMT had good ecological validity. The T-SDMT also had good concurrent validity with the SDMT. In addition, only the T-SDMT had discriminative validity to discriminate processing speed in young and middle-aged participants. Compared to the SDMT, the T-SDMT had overall slightly better psychometric properties, so it can be an alternative measure to the SDMT for assessing processing speed in patients with schizophrenia. Copyright © 2017 Elsevier B.V. All rights reserved.
Validation of a Human Papillomavirus (HPV) DNA Cervical Screening Test That Provides Expanded HPV Typing.

Science.gov (United States)

Demarco, Maria; Carter-Pokras, Olivia; Hyun, Noorie; Castle, Philip E; He, Xin; Dallal, Cher M; Chen, Jie; Gage, Julia C; Befano, Brian; Fetterman, Barbara; Lorey, Thomas; Poitras, Nancy; Raine-Bennett, Tina R; Wentzensen, Nicolas; Schiffman, Mark

2018-05-01

As cervical cancer screening shifts from cytology to human papillomavirus (HPV) testing, a major question is the clinical value of identifying individual HPV types. We aimed to validate Onclarity (Becton Dickinson Diagnostics, Sparks, MD), a nine-channel HPV test recently approved by the FDA, by assessing (i) the association of Onclarity types/channels with precancer/cancer; (ii) HPV type/channel agreement between the results of Onclarity and cobas (Roche Molecular Systems, Pleasanton, CA), another FDA-approved test; and (iii) Onclarity typing for all types/channels compared to typing results from a research assay (linear array [LA]; Roche). We compared Onclarity to histopathology, cobas, and LA. We tested a stratified random sample (n = 9,701) of discarded routine clinical specimens that had tested positive by Hybrid Capture 2 (HC2; Qiagen, Germantown, MD). A subset had already been tested by cobas and LA (n = 1,965). Cervical histopathology was ascertained from electronic health records. Hierarchical Onclarity channels showed a significant linear association with histological severity. Onclarity and cobas had excellent agreement on partial typing of HPV16, HPV18, and the other 12 types as a pool (sample-weighted kappa value of 0.83); cobas was slightly more sensitive for HPV18 and slightly less sensitive for the pooled high-risk types. Typing by Onclarity showed excellent agreement with types and groups of types identified by LA (kappa values from 0.80 for HPV39/68/35 to 0.97 for HPV16). Onclarity typing results corresponded well to histopathology and to an already validated HPV DNA test and could provide additional clinical typing if such discrimination is determined to be clinically desirable. This is a work of the U.S. Government and is not subject to copyright protection in the United States. Foreign copyrights may apply.
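The agreement statistic used above can be illustrated with a minimal sketch of plain (unweighted) Cohen's kappa between two assays. This is a simplification: the abstract reports a sample-weighted kappa, which additionally weights specimens by their sampling fractions and is not reproduced here. The function name `cohens_kappa` and the toy labels are hypothetical.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(calls_a: Sequence[Hashable], calls_b: Sequence[Hashable]) -> float:
    """Unweighted Cohen's kappa for two raters/assays on the same specimens."""
    if len(calls_a) != len(calls_b) or not calls_a:
        raise ValueError("both sequences must be non-empty and of equal length")
    n = len(calls_a)

    # Observed agreement: share of specimens given the same category by both.
    observed = sum(a == b for a, b in zip(calls_a, calls_b)) / n

    # Chance agreement from each assay's marginal category frequencies.
    counts_a, counts_b = Counter(calls_a), Counter(calls_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")

    return (observed - expected) / (1.0 - expected)


if __name__ == "__main__":
    # Toy partial-typing calls from two assays (not data from the study).
    assay_1 = ["HPV16", "HPV16", "HPV18", "other", "other", "negative"]
    assay_2 = ["HPV16", "other", "HPV18", "other", "other", "negative"]
    print(round(cohens_kappa(assay_1, assay_2), 3))
```

Kappa corrects raw percent agreement for the agreement expected by chance, which matters when one category (such as the pooled "other" types) dominates the sample.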
Reliability and criterion-related validity testing (construct) of the Endotracheal Suction Assessment Tool (ESAT©).

Science.gov (United States)

Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne

2018-05-01

To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube
Implementation of the validation testing in MPPG 5.a "Commissioning and QA of treatment planning dose calculations-megavoltage photon and electron beams".

Science.gov (United States)

Jacqmin, Dustin J; Bredfeldt, Jeremy S; Frigo, Sean P; Smilowitz, Jennifer B

2017-01-01

The AAPM Medical Physics Practice Guideline (MPPG) 5.a provides concise guidance on the commissioning and QA of beam modeling and dose calculation in radiotherapy treatment planning systems. This work discusses the implementation of the validation testing recommended in MPPG 5.a at two institutions. The two institutions worked collaboratively to create a common set of treatment fields and analysis tools to deliver and analyze the validation tests. This included the development of a novel, open-source software tool to compare scanning water tank measurements to 3D DICOM-RT Dose distributions. Dose calculation algorithms in both Pinnacle and Eclipse were tested with MPPG 5.a to validate the modeling of Varian TrueBeam linear accelerators. The validation process resulted in more than 200 water tank scans and more than 50 point measurements per institution, each of which was compared to a dose calculation from the institution's treatment planning system (TPS). Overall, the validation testing recommended in MPPG 5.a took approximately 79 person-hours for a machine with four photon and five electron energies for a single TPS. Of the 79 person-hours, 26 person-hours required time on the machine, and the remainder involved preparation and analysis. The basic photon, electron, and heterogeneity correction tests were evaluated with the tolerances in MPPG 5.a, and the tolerances were met for all tests. The MPPG 5.a evaluation criteria were used to assess the small field and IMRT/VMAT validation tests. Both institutions found the use of MPPG 5.a to be a valuable resource during the commissioning process. The validation testing in MPPG 5.a showed the strengths and limitations of the TPS models. In addition, the data collected during the validation testing is useful for routine QA of the TPS, validation of software upgrades, and commissioning of new algorithms. © 2016 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals, Inc. on behalf of
Validating a UAV artificial intelligence control system using an autonomous test case generator

Science.gov (United States)

Straub, Jeremy; Huber, Justin

2013-05-01

The validation of safety-critical applications, such as autonomous UAV operations in an environment which may include human actors, is an ill-posed problem. To gain confidence in the autonomous control technology, numerous scenarios must be considered. This paper expands upon previous work, related to autonomous testing of robotic control algorithms in a two-dimensional plane, to evaluate the suitability of similar techniques for validating artificial intelligence control in three dimensions, where a minimum level of airspeed must be maintained. The results of human-conducted testing are compared to this automated testing, in terms of error detection, speed and testing cost.
Ecological validity of the Yo-Yo SFIE2 test

DEFF Research Database (Denmark)

Krustrup, Peter; Randers, Morten Bredsgaard; Horton, J

2012-01-01

The present study investigated the movement pattern of Portuguese top-level futsal referees (n=16) during competitive games and the ecological validity of the new Yo-Yo Sideways-Forwards Intermittent Endurance level 2 test (Yo-Yo SFIE2). Total distance covered (TD), high-intensity running (HIR...
Validation of a Short Form of an Indecision Test: The Vocational Assessment Test

Science.gov (United States)

Picard, France; Frenette, Éric; Guay, Frédéric; Labrosse, Julie

2015-01-01

The purpose of this research was to validate the scores of a short form of a new instrument, "l'Épreuve de décision vocationnelle, forme scolaire" (EDV-9S; vocational assessment test), which measures six indecision-related problems (lack of self-knowledge, lack of readiness, lack of method in decision making, lack of information,…
The Smoking-Related Weight and Eating Episodes Test (SWEET): development and preliminary validation.

Science.gov (United States)

Adams, Claire E; Baillie, Lauren E; Copeland, Amy L

2011-11-01

Many smokers believe that smoking helps them to control their weight, and concerns about weight gain can interfere with smoking cessation. As researchers typically assess general weight concerns, a measure specific to smoking-related weight concerns is needed. The Smoking-related Weight and Eating Episodes Test (SWEET) was created by generating items from 4 content domains: Hunger, Craving, Overeating, and Body Image. Female undergraduate smokers (N = 280) rated their postcessation weight gain concern and completed the SWEET, Fagerström Test for Nicotine Dependence, Brief Smoking Consequences Questionnaire-Adult, Eating Attitudes Test (EAT)-26, Bulimia Test-Revised (BULIT-R), and Body Shape Questionnaire. Factor analysis of the initial items suggested a 4-factor solution, suggesting 4 subscales: Smoking to suppress appetite, smoking to prevent overeating, smoking to cope with body dissatisfaction, and withdrawal-related appetite increases. Based on these results, the SWEET subscales were revised and shortened. The resulting 10-item SWEET showed excellent internal consistency (total α = .94; mean α = .86) and evidence of validity by predicting smoking frequency, eating pathology, and body image concerns (ps < .05). Smoking frequency, eating pathology, and body image concerns were significantly predicted by the SWEET while controlling for existing measures of postcessation weight gain concern. The SWEET appears to be a reliable and valid measure of tendencies to smoke in response to body image concern and nicotine withdrawal and as a way to control appetite and overeating.
Validation Testing for Automated Solubility Measurement Equipment Final Report

Energy Technology Data Exchange (ETDEWEB)

Lachut, J. S. [Washington River Protection Solutions LLC, Richland, WA (United States)

2016-01-11

Laboratory tests have been completed to test the validity of automated solubility measurement equipment using sodium nitrate and sodium chloride solutions (see test plan WRPS-1404441, “Validation Testing for Automated Solubility Measurement Equipment”). The sodium nitrate solution results were within 2-3% of the reference values, so the experiment is considered successful using the turbidity meter. The sodium chloride test was done by sight, as the turbidity meter did not work well using sodium chloride. For example, the “clear” turbidity reading was 53 FNU at 80 °C, 107 FNU at 55 °C, and 151 FNU at 20 °C. The sodium chloride did not work because it is granular and large; as the solution was stirred, the granules stayed to the outside of the reactor and just above the stir bar level, having little impact on the turbidity meter readings as the meter was aimed at the center of the solution. Also, the turbidity meter depth has an impact. The salt tends to remain near the stir bar level. If the meter is deeper in the slurry, it will read higher turbidity, and if the meter is raised higher in the slurry, it will read lower turbidity (possibly near zero) because it reads the “clear” part of the slurry. The sodium chloride solution results, as measured by sight rather than by turbidity instrument readings, were within 5-6% of the reference values.

Six factors of adult dyslexia assessed by cognitive tests and self-report questions: Very high predictive validity

NARCIS (Netherlands)

Tamboer, P.; Vorst, H.C.M.; de Jong, P.F.

2017-01-01

The Multiple Diagnostic Digital Dyslexia Test for Adults (MDDDT-A) consists of 12 newly developed tests and self-report questions in the Dutch language. Predictive validity and construct validity were investigated and compared with validity of a standard test battery of dyslexia (STB) in a sample of
A set of pathological tests to validate new finite elements

Indian Academy of Sciences (India)


The finite element method entails several approximations. Hence it ... researchers have designed several pathological tests to validate any new finite element. The .... Three dimensional thick shell elements using a hybrid/mixed formulation.
WEC-SIM Phase 1 Validation Testing -- Numerical Modeling of Experiments: Preprint

Energy Technology Data Exchange (ETDEWEB)

Ruehl, Kelley; Michelen, Carlos; Bosma, Bret; Yu, Yi-Hsiang

2016-08-01

The Wave Energy Converter Simulator (WEC-Sim) is an open-source code jointly developed by Sandia National Laboratories and the National Renewable Energy Laboratory. It is used to model wave energy converters subjected to operational and extreme waves. For the WEC-Sim code to be beneficial to the wave energy community, code verification and physical model validation are necessary. This paper describes numerical modeling of the wave tank testing for the 1:33-scale experimental testing of the floating oscillating surge wave energy converter. The comparison between WEC-Sim and the Phase 1 experimental data set serves as code validation. This paper is a follow-up to the WEC-Sim paper on experimental testing, and describes the WEC-Sim numerical simulations for the floating oscillating surge wave energy converter.
Recommendations for elaboration, transcultural adaptation and validation process of tests in Speech, Hearing and Language Pathology.

Science.gov (United States)

Pernambuco, Leandro; Espelt, Albert; Magalhães, Hipólito Virgílio; Lima, Kenio Costa de

2017-06-08

To present a guide with recommendations for translation, adaptation, elaboration and process of validation of tests in Speech and Language Pathology. The recommendations were based on international guidelines with a focus on the elaboration, translation, cross-cultural adaptation and validation process of tests. The recommendations were grouped into two charts, one of them with procedures for translation and transcultural adaptation and the other for obtaining evidence of validity, reliability and measures of accuracy of the tests. A guide with norms for the organization and systematization of the process of elaboration, translation, cross-cultural adaptation and validation process of tests in Speech and Language Pathology was created.
Psychometric properties and convergent and predictive validity of an executive function test battery for two-year-olds

Directory of Open Access Journals (Sweden)

Hanna Mulder

2014-07-01

Executive function (EF) is an important predictor of numerous developmental outcomes, such as academic achievement and behavioral adjustment. Although a plethora of measurement instruments exists to assess executive function in children, only a few of these are suitable for toddlers, and even fewer have undergone psychometric evaluation. The present study evaluates the psychometric properties and validity of an assessment battery for measuring EF in two-year-olds. A sample of 2437 children were administered the assessment battery at a mean age of 2;4 years (SD = 0;3 years) in a large-scale field study. Measures of both hot EF (snack and gift delay tasks) and cool EF (six boxes, memory for location, and visual search task) were included. Confirmatory Factor Analyses showed that a two-factor hot and cool EF model fitted the data better than a one-factor model. Measurement invariance was supported across groups differing in age, gender, socioeconomic status (SES), home language, and test setting. Criterion and convergent validity were evaluated by examining relationships between EF and age, gender, SES, home language, and parent and teacher reports of children’s attention and inhibitory control. Predictive validity of the test battery was investigated by regressing children’s pre-academic skills and behavioral problems at age three on the latent hot and cool EF factors at age two years. The test battery showed satisfactory psychometric quality and criterion, convergent, and predictive validity. Whereas cool EF predicted both pre-academic skills and behavior problems one year later, hot EF predicted behavior problems only. These results show that EF can be assessed with psychometrically sound instruments in children as young as two years, and that EF tasks can be reliably applied in large-scale field research. The current instruments offer new opportunities for investigating EF in early childhood, and for evaluating interventions targeted at improving
Ares I-X Flight Test Validation of Control Design Tools in the Frequency-Domain

Science.gov (United States)

Johnson, Matthew; Hannan, Mike; Brandon, Jay; Derry, Stephen

2011-01-01

A major motivation of the Ares I-X flight test program was to Design for Data, in order to maximize the usefulness of the data recorded in support of Ares I modeling and validation of design and analysis tools. The Design for Data effort was intended to enable good post-flight characterizations of the flight control system, the vehicle structural dynamics, and also the aerodynamic characteristics of the vehicle. To extract the necessary data from the system during flight, a set of small predetermined Programmed Test Inputs (PTIs) was injected directly into the TVC signal. These PTIs were designed to excite the necessary vehicle dynamics while exhibiting a minimal impact on loads. The method is similar to common approaches in aircraft flight test programs, but with unique launch vehicle challenges due to rapidly changing states, short duration of flight, a tight flight envelope, and an inability to repeat any test. This paper documents the validation of the stability analysis tools against the flight data, which was performed by comparing the post-flight calculated frequency response of the vehicle to the frequency response calculated by the stability analysis tools used to design and analyze the preflight models during the control design effort. The comparison between flight day frequency response and stability tool analysis for flight of the simulated vehicle shows good agreement and provides a high level of confidence in the stability analysis tools for use in any future program. This is true for both a nominal model as well as for dispersed analysis, which shows that the flight day frequency response is enveloped by the vehicle's preflight uncertainty models.
Test of Flow Characteristics in Tubular Fuel Assembly I - Establishment of test loop and measurement validation test

International Nuclear Information System (INIS)

Park, Jong Hark; Chae, H. T.; Park, C.; Kim, H.

2005-12-01

Tubular type fuel has been developed as one of the candidates for the Advanced HANARO Reactor (AHR). It is necessary to test the flow characteristics of tubular type fuel, such as the velocity in each flow channel and the pressure drop. A hydraulic test loop to examine the hydraulic characteristics of a tubular type fuel has been designed and constructed. It consists of three parts: a) a piping loop including pump and motor, magnetic flow meter, valves, etc.; b) a test-section part where a simulated tubular type fuel is located; and c) a data acquisition system to read signals from sensors and instruments. In this report, considerations during the design and installation of the facility and the selection of data acquisition sensors and instruments are described in detail. Before the experiment to measure the flow velocities in the flow channels, preliminary tests were done to measure the coolant velocities using pitot tubes and to validate the measurement accuracy. Local velocities in the radial direction in circular tubes are measured at regular intervals of 60 degrees by three pitot tubes. The flow rate inside the circular flow channel can be obtained by integrating the velocity distribution in the radial direction. The measured flow rate was compared to that of the magnetic flow meter. The two values were in good agreement, which means that the measurement of coolant velocity using pitot tubes and the flow rate measured by the magnetic flow meter are reliable. Uncertainty analysis showed that the error of velocity measurement by pitot tube is less than ±2.21%. The hydraulic test loop can also be adapted to other fuels and systems, such as HANARO 18 and 36 fuel, the in-pile system of the FTL (Fuel Test Loop), etc.
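The flow-rate check in the record above amounts to integrating a measured velocity profile over the channel cross-section, Q = ∫ 2πr v(r) dr. The following is a minimal Python sketch of that step, assuming an axisymmetric profile in a circular channel; the function name `flow_rate` and the sample values are hypothetical and are not taken from the report.

```python
import math

def flow_rate(radii, velocities):
    """Trapezoidal estimate of Q = integral of 2*pi*r*v(r) dr.

    radii:      radial sample positions in metres, increasing from 0 to R
    velocities: axial velocity at each position in m/s (assumed axisymmetric)
    """
    if len(radii) != len(velocities) or len(radii) < 2:
        raise ValueError("need at least two matching samples")
    total = 0.0
    for i in range(len(radii) - 1):
        # Integrand 2*pi*r*v evaluated at both ends of the interval
        f0 = 2.0 * math.pi * radii[i] * velocities[i]
        f1 = 2.0 * math.pi * radii[i + 1] * velocities[i + 1]
        total += 0.5 * (f0 + f1) * (radii[i + 1] - radii[i])
    return total

# Illustrative sanity check (made-up numbers): uniform 1 m/s flow in a
# channel of radius 0.05 m should give Q = pi * R**2.
R = 0.05
radii = [R * i / 100 for i in range(101)]
velocities = [1.0] * len(radii)
q = flow_rate(radii, velocities)
```

With real pitot-tube data, the integrated value would then be compared against the magnetic flow meter reading, as the abstract describes.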
Proposal and validation of a clinical trunk control test in individuals with spinal cord injury.

Science.gov (United States)

Quinzaños, J; Villa, A R; Flores, A A; Pérez, R

2014-06-01

One of the problems that arise in spinal cord injury (SCI) is alteration in trunk control. Despite the need for standardized scales, these do not exist for evaluating trunk control in SCI. To propose and validate a trunk control test in individuals with SCI. National Institute of Rehabilitation, Mexico. The test was developed and later evaluated for reliability and criteria, content, and construct validity. We carried out 531 tests on 177 patients and found high inter- and intra-rater reliability. In terms of criterion validity, analysis of variance demonstrated a statistically significant difference in the test score of patients with adequate or inadequate trunk control according to the assessment of a group of experts. A receiver operating characteristic curve was plotted for optimizing the instrument's cutoff point, which was determined at 13 points, with a sensitivity of 98% and a specificity of 92.2%. With regard to construct validity, the correlation between the proposed test and the spinal cord independence measure (SCIM) was 0.873 (P=0.001) and that with the evolution time was 0.437 (P=0.001). For testing the hypothesis with qualitative variables, the Kruskal-Wallis test was performed, which resulted in a statistically significant difference between the scores in the proposed scale of each group defined by these variables. It was proven experimentally that the proposed trunk control test is valid and reliable. Furthermore, the test can be used for all patients with SCI despite the type and level of injury.
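Sensitivity and specificity at a score cutoff, as reported above for the 13-point threshold, can be computed from a set of scores and an expert reference classification. The sketch below is illustrative only: the function name, the rule that a score at or above the cutoff counts as "adequate trunk control", and the data are assumptions, not details from the study.

```python
def sensitivity_specificity(scores, reference_positive, cutoff):
    """Return (sensitivity, specificity) when score >= cutoff is called positive.

    reference_positive holds the expert judgement for each patient
    (True = adequate trunk control in this hypothetical example).
    Assumes both classes are present, so neither denominator is zero.
    """
    tp = fn = tn = fp = 0
    for score, truth in zip(scores, reference_positive):
        predicted = score >= cutoff
        if truth and predicted:
            tp += 1
        elif truth:
            fn += 1
        elif predicted:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)

# Made-up scores and expert ratings for eight hypothetical patients
scores = [5, 9, 12, 13, 14, 18, 20, 11]
expert = [False, False, False, True, True, True, True, True]
sens, spec = sensitivity_specificity(scores, expert, cutoff=13)
```

Repeating the calculation over every candidate cutoff yields the points of a receiver operating characteristic curve, from which an optimal threshold can be chosen.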
Validation of a clinical critical thinking skills test in nursing

OpenAIRE

Shin, Sujin; Jung, Dukyoo; Kim, Sungeun

2015-01-01

Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school stud...
Reliability and Validity of the Inline Skating Skill Test

Directory of Open Access Journals (Sweden)

Ivan Radman, Lana Ruzic, Viktoria Padovan, Vjekoslav Cigrovski, Hrvoje Podnar

2016-09-01

This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience, forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p < 0.01) to complete the test in comparison to recreational-level skaters. Moreover, moderate correlation (ρ = 0.80–0.82; all p < 0.01) was observed between the participant’s self-rating and achieved performance times. In conclusion, the proposed test is a reliable and valid method to evaluate inline skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
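The comparison of typical error with the smallest worthwhile change can be sketched in a few lines of Python. The smallest worthwhile change follows the abstract (0.2 × between-subject standard deviation); the typical error formula used here, the standard deviation of test-retest differences divided by √2, is a common convention and an assumption about how the study computed it. All data are invented for illustration.

```python
import math
import statistics

def smallest_worthwhile_change(times):
    """0.2 x between-subject standard deviation, as stated in the abstract."""
    return 0.2 * statistics.stdev(times)

def typical_error(session1, session2):
    """SD of the session-to-session differences divided by sqrt(2).

    This is an assumed, commonly used definition; the abstract does not
    state which formula the authors applied.
    """
    diffs = [b - a for a, b in zip(session1, session2)]
    return statistics.stdev(diffs) / math.sqrt(2)

# Hypothetical course times in seconds for five skaters, two sessions
session1 = [30.0, 32.0, 35.0, 40.0, 45.0]
session2 = [30.5, 31.5, 35.5, 39.5, 45.5]

swc = smallest_worthwhile_change(session1)
te = typical_error(session1, session2)
can_track_changes = te < swc  # test noise is smaller than a meaningful change
```

When the typical error is below the smallest worthwhile change, a difference of that size between sessions is more likely to reflect a real change in performance than measurement noise.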
Converting Hangar High Expansion Foam Systems to Prevent Cockpit Damage: Full-Scale Validation Tests

Science.gov (United States)

2017-09-01

AFCEC-CO-TY-TR-2018-0001 CONVERTING HANGAR HIGH EXPANSION FOAM SYSTEMS TO PREVENT COCKPIT DAMAGE: FULL-SCALE VALIDATION TESTS Gerard G...manufacturer, or otherwise does not constitute or imply its endorsement, recommendation, or approval by the United States Air Force. The views and...09-2017 Final Test Report May 2017 Converting Hangar High Expansion Foam Systems to Prevent Cockpit Damage: Full-Scale Validation Tests N00173-15-D
The bogus taste test: Validity as a measure of laboratory food intake.

Science.gov (United States)

Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A; Kemps, Eva; Higgs, Suzanne; Jones, Andrew

2017-09-01

Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The 'bogus' taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food intake. We assessed whether the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. We examined construct validity by testing whether participant sex, hunger and liking of taste test food were associated with the amount of food consumed in the taste test. In addition, we also examined whether BMI (body mass index), trait measures of dietary restraint and over-eating in response to palatable food cues were associated with food consumption. Results indicated that the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. Factors that were reliably associated with increased consumption during the taste test were being male, having higher baseline hunger, liking of the taste test food and a greater tendency to overeat in response to palatable food cues, whereas trait dietary restraint and BMI were not. These results indicate that the bogus taste test is likely to be a valid measure of food intake and can be used to identify factors that have a causal effect on food intake. Copyright © 2017 The Authors. Published by Elsevier Ltd. All rights reserved.
Development and validation of a smartphone-based digits-in-noise hearing test in South African English.

Science.gov (United States)

Potgieter, Jenni-Marí; Swanepoel, De Wet; Myburgh, Hermanus Carel; Hopper, Thomas Christopher; Smits, Cas

2015-07-01

The objective of this study was to develop and validate a smartphone-based digits-in-noise hearing test for South African English. Single digits (0-9) were recorded and spoken by a first language English female speaker. Level corrections were applied to create a set of homogeneous digits with steep speech recognition functions. A smartphone application was created to utilize 120 digit-triplets in noise as test material. An adaptive test procedure determined the speech reception threshold (SRT). Experiments were performed to determine headphone effects on the SRT and to establish normative data. Participants consisted of 40 normal-hearing subjects with thresholds ≤15 dB across the frequency spectrum (250-8000 Hz) and 186 subjects with normal-hearing in both ears, or normal-hearing in the better ear. The results show steep speech recognition functions with a slope of 20%/dB for digit-triplets presented in noise using the smartphone application. The results of five headphone types indicate that the smartphone-based hearing test is reliable and can be conducted using standard Android smartphone headphones or clinical headphones. A digits-in-noise hearing test was developed and validated for South Africa. The mean SRT and speech recognition functions correspond to previously developed telephone-based digits-in-noise tests.
Limonene hydroperoxide analogues show specific patch test reactions.

Science.gov (United States)

Christensson, Johanna Bråred; Hellsén, Staffan; Börje, Anna; Karlberg, Ann-Therese

2014-05-01

The fragrance terpene R-limonene is a very weak sensitizer, but forms allergenic oxidation products upon contact with air. The primary oxidation products of oxidized limonene, the hydroperoxides, have an important impact on the sensitizing potency of the oxidation mixture. One analogue, limonene-1-hydroperoxide, was experimentally shown to be a significantly more potent sensitizer than limonene-2-hydroperoxide in the local lymph node assay with non-pooled lymph nodes. To investigate the pattern of reactivity among consecutive dermatitis patients to two structurally closely related limonene hydroperoxides, limonene-1-hydroperoxide and limonene-2-hydroperoxide. Limonene-1-hydroperoxide, limonene-2-hydroperoxide, at 0.5% in petrolatum, and oxidized limonene 3.0% pet. were tested in 763 consecutive dermatitis patients. Of the tested materials, limonene-1-hydroperoxide gave most reactions, with 2.4% of the patients showing positive patch test reactions. Limonene-2-hydroperoxide and oxidized R-limonene gave 1.7% and 1.2% positive patch test reactions, respectively. Concomitant positive patch test reactions to other fragrance markers in the baseline series were frequently noted. The results are in accordance with the experimental studies, as limonene-1-hydroperoxide gave more positive patch test reactions in the tested patients than limonene-2-hydroperoxide. Furthermore, the results support the specificity of the allergenic activity of the limonene hydroperoxide analogues and the importance of oxidized limonene as a cause of contact allergy. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
A validity test of movie, television, and video-game ratings.

Science.gov (United States)

Walsh, D A; Gentile, D A

2001-06-01

Numerous studies have documented the potential effects on young audiences of violent content in media products, including movies, television programs, and computer and video games. Similar studies have evaluated the effects associated with sexual content and messages. Cumulatively, these effects represent a significant public health risk for increased aggressive and violent behavior, spread of sexually transmitted diseases, and pediatric pregnancy. In partial response to these risks and to public and legislative pressure, the movie, television, and gaming industries have implemented ratings systems intended to provide information about the content and appropriate audiences for different films, shows, and games. To test the validity of the current movie-, television-, and video game-rating systems. Panel study. Participants used the KidScore media evaluation tool, which evaluates films, television shows, and video games on 10 aspects, including the appropriateness of the media product for children based on age. When an entertainment industry rates a product as inappropriate for children, parent raters agree that it is inappropriate for children. However, parent raters disagree with industry usage of many of the ratings designating material suitable for children of different ages. Products rated as appropriate for adolescents are of the greatest concern. The level of disagreement varies from industry to industry and even from rating to rating. Analysis indicates that the amount of violent content and portrayals of violence are the primary markers for disagreement between parent raters and industry ratings. As 1 part of a solution to the complex public health problems posed by violent and sexually explicit media products, ratings can have value if used with caution. Parents and caregivers relying on the ratings systems to guide their children's use of media products should continue to monitor content independently. Industry ratings systems should be revised with input
Validation of a Video-based Game-Understanding Test Procedure in Badminton.

Science.gov (United States)

Blomqvist, Minna T.; Luhtanen, Pekka; Laakso, Lauri; Keskinen, Esko

2000-01-01

Reports the development and validation of video-based game-understanding tests in badminton for elementary and secondary students. The tests included different sequences that simulated actual game situations. Players had to solve tactical problems by selecting appropriate solutions and arguments for their decisions. Results suggest that the test…
Noninvasive electrical conductivity measurement by MRI: a test of its validity and the electrical conductivity characteristics of glioma.

Science.gov (United States)

Tha, Khin Khin; Katscher, Ulrich; Yamaguchi, Shigeru; Stehning, Christian; Terasaka, Shunsuke; Fujima, Noriyuki; Kudo, Kohsuke; Kazumata, Ken; Yamamoto, Toru; Van Cauteren, Marc; Shirato, Hiroki

2018-01-01

This study noninvasively examined the electrical conductivity (σ) characteristics of diffuse gliomas using MRI and tested its validity. MRI including a 3D steady-state free precession (3D SSFP) sequence was performed on 30 glioma patients. The σ maps were reconstructed from the phase images of the 3D SSFP sequence. The σ histogram metrics were extracted and compared among the contrast-enhanced (CET) and noncontrast-enhanced tumour components (NCET) and normal brain parenchyma (NP). Difference in tumour σ histogram metrics among tumour grades and correlation of σ metrics with tumour grades were tested. Validity of σ measurement using this technique was tested by correlating the mean tumour σ values measured using MRI with those measured ex vivo using a dielectric probe. Several σ histogram metrics of CET and NCET of diffuse gliomas were significantly higher than NP (Bonferroni-corrected p ≤ .045). The maximum σ of NCET showed a moderate positive correlation with tumour grade (r = .571, Bonferroni-corrected p = .018). The mean tumour σ measured using MRI showed a moderate positive correlation with the σ measured ex vivo (r = .518, p = .040). Tissue σ can be evaluated using MRI, incorporation of which may better characterise diffuse gliomas. • This study tested the validity of noninvasive electrical conductivity measurements by MRI. • This study also evaluated the electrical conductivity characteristics of diffuse glioma. • Gliomas have higher electrical conductivity values than the normal brain parenchyma. • Noninvasive electrical conductivity measurement can be helpful for better characterisation of glioma.
A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing.

Science.gov (United States)

DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M

2017-10-27

The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal component analysis were good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split-half reliability was .65. Test-retest reliability showed no
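The two reliability statistics named in the record above have standard textbook definitions, which can be sketched in Python as follows. The data layout (one list of respondent scores per item) and the example values are assumptions made for illustration; they are not AQEL data.

```python
import statistics

def cronbach_alpha(items):
    """Cronbach's alpha for a list of items.

    items: one list per item, each holding the scores of the same
    respondents in the same order (hypothetical layout).
    alpha = k/(k-1) * (1 - sum of item variances / variance of total scores)
    """
    k = len(items)
    item_variance_sum = sum(statistics.variance(item) for item in items)
    totals = [sum(responses) for responses in zip(*items)]
    return k / (k - 1) * (1 - item_variance_sum / statistics.variance(totals))

def spearman_brown(r_half):
    """Step up a split-half correlation to full-test reliability."""
    return 2 * r_half / (1 + r_half)

# Made-up scores: three perfectly consistent items, five respondents
items = [
    [1, 2, 3, 4, 5],
    [2, 3, 4, 5, 6],
    [1, 2, 3, 4, 5],
]
alpha = cronbach_alpha(items)
full_test_reliability = spearman_brown(0.5)
```

Values above .80 on both statistics are the thresholds the abstract reports for four of the five constructs.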
The Validity of Value-Added Estimates from Low-Stakes Testing Contexts: The Impact of Change in Test-Taking Motivation and Test Consequences

Science.gov (United States)

Finney, Sara J.; Sundre, Donna L.; Swain, Matthew S.; Williams, Laura M.

2016-01-01

Accountability mandates often prompt assessment of student learning gains (e.g., value-added estimates) via achievement tests. The validity of these estimates has been questioned when performance on tests is low stakes for students. To assess the effects of motivation on value-added estimates, we assigned students to one of three test consequence…
A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

Science.gov (United States)

Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

2018-04-10

To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT, were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading <.3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.

Exploring the reliability and validity of the social-moral awareness test.

Science.gov (United States)

Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth

2012-11-01

The aim of the study was to explore the validity of the social-moral awareness test (SMAT), a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor structure, reliability and discriminant validity were also assessed. Seventy-one participants with mild-moderate learning disabilities completed the two scales of the SMAT and two False Belief Tasks for Theory of Mind. Reliability of the SMAT was very good, and the scales were shown to be uni-dimensional in factor structure. There was a significant positive relationship between Theory of Mind and both SMAT scales. There is early evidence of the construct validity and reliability of the SMAT. Further assessment of the validity of the SMAT will be required. © 2012 Blackwell Publishing Ltd.
Publishing nutrition research: validity, reliability, and diagnostic test assessment in nutrition-related research.

Science.gov (United States)

Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara

2010-03-01

This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.
Performance Validity Testing in Neuropsychology: Methods for Measurement Development and Maximizing Diagnostic Accuracy.

Science.gov (United States)

Wodushek, Thomas R; Greher, Michael R

2017-05-01

In the first column in this 2-part series, Performance Validity Testing in Neuropsychology: Scientific Basis and Clinical Application-A Brief Review, the authors introduced performance validity tests (PVTs) and their function, provided a justification for why they are necessary, traced their ongoing endorsement by neuropsychological organizations, and described how they are used and interpreted by ever increasing numbers of clinical neuropsychologists. To enhance readers' understanding of these measures, this second column briefly describes common detection strategies used in PVTs as well as the typical methods used to validate new PVTs and determine cut scores for valid/invalid determinations. We provide a discussion of the latest research demonstrating how neuropsychologists can combine multiple PVTs in a single battery to improve sensitivity/specificity to invalid responding. Finally, we discuss future directions for the research and application of PVTs.
Building a Validity Argument for the Test of English as a Foreign Language™

CERN Document Server

Chapelle, Carol A; Jamieson, Joan M

2007-01-01

Building a Validity Argument for the Test of English as a Foreign Language™ is distinctive in its attempt to develop a coherent story of the rationale for a test or its revision, explain the research and development process, and provide the results of the validation process. This volume is particularly relevant for professionals and graduate students in educational measurement, applied linguistics, and second language acquisition as well as anyone interested in assessment issues.
NASA Double Asteroid Redirection Test (DART) Trajectory Validation and Robustness

Science.gov (United States)

Sarli, Bruno V.; Ozimek, Martin T.; Atchison, Justin A.; Englander, Jacob A.; Barbee, Brent W.

2017-01-01

The Double Asteroid Redirection Test (DART) mission will be the first to test the concept of a kinetic impactor. Several studies have been made on asteroid redirection and impact mitigation; however, to date no mission has tested the proposed concepts. An impact study on a representative body allows the measurement of the effects on the target's orbit and physical structure. With this goal, DART's objective is to verify the effectiveness of the kinetic impact concept for planetary defense. The spacecraft uses solar electric propulsion to escape Earth, fly by (138971) 2001 CB21 for impact rehearsal, and impact Didymos-B, the secondary body of the binary (65803) Didymos system. This work focuses on the heliocentric transfer design part of the mission with the validation of the baseline trajectory, performance comparison to other mission objectives, and assessment of the baseline robustness to missed thrust events. Results show a good performance of the selected trajectory for different mission objectives: latest possible escape date, maximum kinetic energy on impact, shortest possible time of flight, and use of an Earth swing-by. The baseline trajectory was shown to be robust to a missed thrust, with 1% of fuel margin being enough to recover the mission for failures of more than 14 days.
Validation of the Hwalek-Sengstock Elder Abuse Screening Test.

Science.gov (United States)

Neale, Anne Victoria; And Others

Elder abuse is recognized as an under-detected and under-reported social problem. Difficulties in detecting elder abuse are compounded by the lack of a standardized, psychometrically valid instrument for case finding. The development of the Hwalek-Sengstock Elder Abuse Screening Test (H-S/EAST) followed a larger effort to identify indicators and…
Construct validity and reliability of automated body reaction test ...

African Journals Online (AJOL)

The Automated Body Reaction Test (ABRT) is a new instrument for skills and physical assessment that measures the ability to react and move quickly and accurately in response to a stimulus. A total of 474 subjects aged 7-17 years were randomly selected for the construct validity (n=330) and reliability (n=144) studies. The ABRT ...
Emotion Recognition Ability Test Using JACFEE Photos: A Validity/Reliability Study of a War Veterans' Sample and Their Offspring.

Science.gov (United States)

Castro-Vale, Ivone; Severo, Milton; Carvalho, Davide; Mota-Cardoso, Rui

2015-01-01

Emotion recognition is very important for social interaction. Several mental disorders influence facial emotion recognition. War veterans and their offspring are subject to an increased risk of developing psychopathology. Emotion recognition is an important aspect that needs to be addressed in this population. To our knowledge, no test exists that is validated for use with war veterans and their offspring. The current study aimed to validate the JACFEE photo set to study facial emotion recognition in war veterans and their offspring. The JACFEE photo set was presented to 135 participants, comprised of 62 male war veterans and 73 war veterans' offspring. The participants identified the facial emotion presented from amongst the possible seven emotions that were tested for: anger, contempt, disgust, fear, happiness, sadness, and surprise. A loglinear model was used to evaluate whether the agreement between the intended and the chosen emotions was higher than the expected. Overall agreement between chosen and intended emotions was 76.3% (Cohen kappa = 0.72). The agreement ranged from 63% (sadness expressions) to 91% (happiness expressions). The reliability by emotion ranged from 0.617 to 0.843 and the overall JACFEE photo set Cronbach alpha was 0.911. The offspring showed higher agreement when compared with the veterans (RR: 41.52 vs 12.12, p < 0.001), which confirms the construct validity of the test. The JACFEE set of photos showed good validity and reliability indices, which makes it an adequate instrument for researching emotion recognition ability in the study sample of war veterans and their respective offspring.
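As a rough illustration of how an agreement rate relates to Cohen's kappa, the sketch below applies the standard formula kappa = (p_o - p_e) / (1 - p_e). It assumes that chance agreement is uniform across the seven emotions (p_e = 1/7); the abstract does not report the marginal distributions, so this is an approximation rather than the authors' computation, and the function name is hypothetical.

```python
def kappa_from_agreement(observed, expected):
    """Cohen's kappa from observed agreement p_o and chance agreement p_e."""
    return (observed - expected) / (1 - expected)


# Reported overall agreement of 76.3%; uniform chance agreement over
# seven emotion categories is an assumption, not a figure from the study.
kappa = kappa_from_agreement(0.763, 1 / 7)
print(round(kappa, 2))
```

Under that uniform-chance assumption the result is close to the kappa of 0.72 reported in the abstract.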
Emotion Recognition Ability Test Using JACFEE Photos: A Validity/Reliability Study of a War Veterans' Sample and Their Offspring.

Directory of Open Access Journals (Sweden)

Ivone Castro-Vale

Full Text Available Emotion recognition is very important for social interaction. Several mental disorders influence facial emotion recognition. War veterans and their offspring are subject to an increased risk of developing psychopathology. Emotion recognition is an important aspect that needs to be addressed in this population. To our knowledge, no test exists that is validated for use with war veterans and their offspring. The current study aimed to validate the JACFEE photo set to study facial emotion recognition in war veterans and their offspring. The JACFEE photo set was presented to 135 participants, comprised of 62 male war veterans and 73 war veterans' offspring. The participants identified the facial emotion presented from amongst the possible seven emotions that were tested for: anger, contempt, disgust, fear, happiness, sadness, and surprise. A loglinear model was used to evaluate whether the agreement between the intended and the chosen emotions was higher than the expected. Overall agreement between chosen and intended emotions was 76.3% (Cohen kappa = 0.72). The agreement ranged from 63% (sadness expressions) to 91% (happiness expressions). The reliability by emotion ranged from 0.617 to 0.843 and the overall JACFEE photo set Cronbach alpha was 0.911. The offspring showed higher agreement when compared with the veterans (RR: 41.52 vs 12.12, p < 0.001), which confirms the construct validity of the test. The JACFEE set of photos showed good validity and reliability indices, which makes it an adequate instrument for researching emotion recognition ability in the study sample of war veterans and their respective offspring.
Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

Science.gov (United States)

Mills, Tamara L; Holm, Margo B; Schmeler, Mark

2007-01-01

The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p < …; content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p < …; content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.
Evaluating abdominal core muscle fatigue: Assessment of the validity and reliability of the prone bridging test.

Science.gov (United States)

De Blaiser, C; De Ridder, R; Willems, T; Danneels, L; Vanden Bossche, L; Palmans, T; Roosen, P

2018-02-01

The aims of this study were to research the amplitude and median frequency characteristics of selected abdominal, back, and hip muscles of healthy subjects during a prone bridging endurance test, based on surface electromyography (sEMG), (a) to determine if the prone bridging test is a valid field test to measure abdominal muscle fatigue, and (b) to evaluate if the current method of administering the prone bridging test is reliable. Thirty healthy subjects participated in this experiment. The sEMG activity of seven abdominal, back, and hip muscles was bilaterally measured. Normalized median frequencies were computed from the EMG power spectra. The prone bridging tests were repeated on separate days to evaluate inter- and intratester reliability. Significant differences in normalized median frequency slope (NMF slope) values between several abdominal, back, and hip muscles could be demonstrated. Moderate-to-high correlation coefficients were shown between NMF slope values and endurance time. Multiple backward linear regression revealed that the test endurance time could only be significantly predicted by the NMF slope of the rectus abdominis. Statistical analysis showed excellent reliability (ICC=0.87-0.89). The findings of this study support the validity and reliability of the prone bridging test for evaluating abdominal muscle fatigue. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.

Science.gov (United States)

Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid

2017-11-01

Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenience sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.
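The trade-off between sensitivity and specificity at candidate cut scores can be sketched as follows. This is a minimal illustration on made-up data, not the study's data or analysis: the scores, the diagnoses, and the function name are hypothetical, and the code assumes that a score at or below the cut-off counts as screen-positive (lower scores indicating impairment).

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Treat score <= cutoff as screen-positive and compare with the reference diagnosis."""
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_condition):
        positive = score <= cutoff
        if diseased and positive:
            tp += 1
        elif diseased:
            fn += 1
        elif positive:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data: screening scores and reference diagnoses.
scores = [3, 5, 6, 7, 8, 9, 10, 6, 4, 9]
has_condition = [True, True, True, True, False, False, False, False, True, False]

for cutoff in (6, 7):
    sens, spec = sensitivity_specificity(scores, has_condition, cutoff)
    print(cutoff, sens, spec)
```

Raising the cut-off classifies more people as screen-positive, so sensitivity can only stay the same or rise while specificity can only stay the same or fall; a receiver-operating characteristic analysis scans all cut-offs in this way.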
Comprehensive validation scheme for in situ fiber optics dissolution method for pharmaceutical drug product testing.

Science.gov (United States)

Mirza, Tahseen; Liu, Qian Julie; Vivilecchia, Richard; Joshi, Yatindra

2009-03-01

There has been a growing interest during the past decade in the use of fiber optics dissolution testing. Use of this novel technology is mainly confined to research and development laboratories. It has not yet emerged as a tool for end product release testing despite its ability to generate in situ results and efficiency improvement. One potential reason may be the lack of clear validation guidelines that can be applied for the assessment of suitability of fiber optics. This article describes a comprehensive validation scheme and development of a reliable, robust, reproducible and cost-effective dissolution test using fiber optics technology. The test was successfully applied for characterizing the dissolution behavior of a 40-mg immediate-release tablet dosage form that is under development at Novartis Pharmaceuticals, East Hanover, New Jersey. The method was validated for the following parameters: linearity, precision, accuracy, specificity, and robustness. In particular, robustness was evaluated in terms of probe sampling depth and probe orientation. The in situ fiber optic method was found to be comparable to the existing manual sampling dissolution method. Finally, the fiber optic dissolution test was successfully performed by different operators on different days, to further enhance the validity of the method. The results demonstrate that the fiber optics technology can be successfully validated for end product dissolution/release testing. (c) 2008 Wiley-Liss, Inc. and the American Pharmacists Association
Intra-laboratory validation of a human cell based in vitro angiogenesis assay for testing angiogenesis modulators

Directory of Open Access Journals (Sweden)

Jertta-Riina Sarkanen

2011-01-01

Full Text Available The developed standardized human cell based in vitro angiogenesis assay was intra-laboratory validated to verify that the method is reliable and relevant for routine testing of modulators of angiogenesis e.g. pharmaceuticals and industrial chemicals. This assay is based on the earlier published method but it was improved and shown to be more sensitive and rapid than the previous assay. The performance of the assay was assessed by using 6 reference chemicals, which are widely used pharmaceuticals that inhibit angiogenesis: acetyl salicylic acid, erlotinib, 2-methoxyestradiol, levamisole, thalidomide, and anti-vascular endothelial growth factor. In the intra-laboratory validation, the sensitivity of the assay (upper and lower limits of detection and linearity of response in tubule formation), batch to batch variation in tubule formation between different Master cell bank batches, and precision as well as the reliability of the assay (reproducibility and repeatability) were tested. The pre-set acceptance criteria for the intra-laboratory validation study were met. The relevance of the assay in man was investigated by comparing the effects of reference chemicals and their concentrations to the published human data. The comparison showed a good concordance, which indicates that this human cell based angiogenesis model predicts well the effects in man and has the potential to be used to supplement and/or replace animal tests.
Development and content validity testing of a comprehensive classification of diagnoses for pediatric nurse practitioners.

Science.gov (United States)

Burns, C

1991-01-01

Pediatric nurse practitioners (PNPs) need an integrated, comprehensive classification that includes nursing, disease, and developmental diagnoses to effectively describe their practice. No such classification exists. Further, methodologic studies to help evaluate the content validity of any nursing taxonomy are unavailable. A conceptual framework was derived. Then 178 diagnoses from the North American Nursing Diagnosis Association (NANDA) 1986 list, selected diagnoses from the International Classification of Diseases, the Diagnostic and Statistical Manual, Third Revision, and others were selected. This framework identified and listed, with definitions, three domains of diagnoses: Developmental Problems, Diseases, and Daily Living Problems. The diagnoses were ranked using a 4-point scale (4 = highly related to 1 = not related) and were placed into the three domains. The rating scale was assigned by a panel of eight expert pediatric nurses. Diagnoses that were assigned to the Daily Living Problems domain were then sorted into the 11 Functional Health patterns described by Gordon (1987). Reliability was measured using proportions of agreement and Kappas. Content validity of the groups created was measured using indices of content validity and average congruency percentages. The experts used a new method to sort the diagnoses in a new way that decreased overlaps among the domains. The Developmental and Disease domains were judged reliable and valid. The Daily Living domain of nursing diagnoses showed marginally acceptable validity with acceptable reliability. Six Functional Health Patterns were judged reliable and valid, mixed results were determined for four categories, and the Coping/Stress Tolerance category was judged reliable but not valid using either test. There were considerable differences between the panel's, Gordon's (1987), and NANDA's clustering of NANDA diagnoses. This study defines the diagnostic practice of nurses from a holistic, patient
The prone bridge test: Performance, validity, and reliability among older and younger adults.

Science.gov (United States)

Bohannon, Richard W; Steffl, Michal; Glenney, Susan S; Green, Michelle; Cashwell, Leah; Prajerova, Kveta; Bunn, Jennifer

2018-04-01

The prone bridge maneuver, or plank, has been viewed as a potential alternative to curl-ups for assessing trunk muscle performance. The purpose of this study was to assess prone bridge test performance, validity, and reliability among younger and older adults. Sixty younger (20-35 years old) and 60 older (60-79 years old) participants completed this study. Groups were evenly divided by sex. Participants completed surveys regarding physical activity and abdominal exercise participation. Height, weight, body mass index (BMI), and waist circumference were measured. On two occasions, 5-9 days apart, participants held a prone bridge until volitional exhaustion or until repeated technique failure. Validity was examined using data from the first session: convergent validity by calculating correlations between survey responses, anthropometrics, and prone bridge time, known groups validity by using an ANOVA comparing bridge times of younger and older adults and of men and women. Test-retest reliability was examined by using a paired t-test to compare prone bridge times for Session 1 and Session 2. Furthermore, an intraclass correlation coefficient (ICC) was used to characterize relative reliability and minimal detectable change (MDC95%) was used to describe absolute reliability. The mean prone bridge time was 145.3 ± 71.5 s, and was positively correlated with physical activity participation (p ≤ 0.001) and negatively correlated with BMI and waist circumference (p ≤ 0.003). Younger participants had significantly longer plank times than older participants (p = 0.003). The ICC between testing sessions was 0.915. The prone bridge test is a valid and reliable measure for evaluating abdominal performance in both younger and older adults. Copyright © 2017 Elsevier Ltd. All rights reserved.
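The relationship between the ICC and the minimal detectable change can be sketched with the commonly used formulas SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM. The abstract does not state which standard deviation the authors used or what MDC value they obtained, so plugging in the reported overall SD (71.5 s) and ICC (0.915) below is an assumption made purely for illustration; the function names are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1 - icc)


def minimal_detectable_change_95(sd, icc):
    """MDC95 = 1.96 * sqrt(2) * SEM (95% confidence, two measurements)."""
    return 1.96 * math.sqrt(2) * standard_error_of_measurement(sd, icc)


# Assumed inputs taken from the abstract's summary statistics; the study's
# own MDC calculation may have used different values.
mdc95 = minimal_detectable_change_95(sd=71.5, icc=0.915)
print(f"MDC95 = {mdc95:.1f} s")
```

A change in hold time smaller than this value could not be distinguished from measurement error at the 95% level under these assumptions.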
Process-oriented tests for validation of baroclinic shallow water models: The lock-exchange problem

Science.gov (United States)

Kolar, R. L.; Kibbey, T. C. G.; Szpilka, C. M.; Dresback, K. M.; Tromble, E. M.; Toohey, I. P.; Hoggan, J. L.; Atkinson, J. H.

A first step often taken to validate prognostic baroclinic codes is a series of process-oriented tests, as those suggested by Haidvogel and Beckmann [Haidvogel, D., Beckmann, A., 1999. Numerical Ocean Circulation Modeling. Imperial College Press, London], among others. One of these tests is the so-called "lock-exchange" test or "dam break" problem, wherein water of different densities is separated by a vertical barrier, which is removed at time zero. Validation against these tests has primarily consisted of comparing the propagation speed of the wave front, as predicted by various theoretical and experimental results, to model output. In addition, inter-model comparisons of the lock-exchange test have been used to validate codes. Herein, we present a high resolution data set, taken from a laboratory-scale model, for direct and quantitative comparison of experimental and numerical results throughout the domain, not just the wave front. Data is captured every 0.2 s using high resolution digital photography, with salt concentration extracted by comparing pixel intensity of the dyed fluid against calibration standards. Two scenarios are discussed in this paper, symmetric and asymmetric mixing, depending on the proportion of dense/light water (17.5 ppt/0.0 ppt) in the experiment; the Boussinesq approximation applies to both. Front speeds, cast in terms of the dimensionless Froude number, show excellent agreement with literature-reported values. Data are also used to quantify the degree of mixing, as measured by the front thickness, which also provides an error band on the front speed. Finally, experimental results are used to validate baroclinic enhancements to the barotropic shallow water ADvanced CIRCulation (ADCIRC) model, including the effect of the vertical mixing scheme on simulation results. Based on salinity data, the model provides an average root-mean-square (rms) error of 3.43 ppt for the symmetric case and 3.74 ppt for the asymmetric case, most of which can
Construct Validity and Test-Retest Reliability of the Climbing Stairs Questionnaire in Lower-Limb Amputees

NARCIS (Netherlands)

de Laat, Fred A.; Rommers, Gerardus M.; Geertzen, Jan H.; Roorda, Leo D.

de Laat FA, Rommers GM, Geertzen JH, Roorda LD. Construct validity and test-retest reliability of the Climbing Stairs Questionnaire in lower-limb amputees. Arch Phys Med Rehabil 2010;91:1396-401. Objective: To investigate the construct validity and test-retest reliability of the Climbing Stairs
Noncredible cognitive performance at clinical evaluation of adult ADHD: An embedded validity indicator in a visuospatial working memory test.

Science.gov (United States)

Fuermaier, Anselm B M; Tucha, Oliver; Koerts, Janneke; Lange, Klaus W; Weisbrod, Matthias; Aschenbrenner, Steffen; Tucha, Lara

2017-12-01

The assessment of performance validity is an essential part of the neuropsychological evaluation of adults with attention-deficit/hyperactivity disorder (ADHD). Most available tools, however, are inaccurate regarding the identification of noncredible performance. This study describes the development of a visuospatial working memory test, including a validity indicator for noncredible cognitive performance of adults with ADHD. Visuospatial working memory of adults with ADHD (n = 48) was first compared to the test performance of healthy individuals (n = 48). Furthermore, a simulation design was performed including 252 individuals who were randomly assigned to either a control group (n = 48) or to 1 of 3 simulation groups who were requested to feign ADHD (n = 204). Additional samples of 27 adults with ADHD and 69 instructed simulators were included to cross-validate findings from the first samples. Adults with ADHD showed impaired visuospatial working memory performance of medium size as compared to healthy individuals. Simulation groups committed significantly more errors and had shorter response times as compared to patients with ADHD. Moreover, binary logistic regression analysis was carried out to derive a validity index that optimally differentiates between true and feigned ADHD. ROC analysis demonstrated high classification rates of the validity index, as shown in excellent specificity (95.8%) and adequate sensitivity (60.3%). The visuospatial working memory test as presented in this study therefore appears sensitive in indicating cognitive impairment of adults with ADHD. Furthermore, the embedded validity index revealed promising results concerning the detection of noncredible cognitive performance of adults with ADHD. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Assessment of Advanced Life Support competence when combining different test methods--reliability and validity

DEFF Research Database (Denmark)

Ringsted, C; Lippert, F; Hesselfeldt, R

2007-01-01

Cardiac Arrest Simulation Test (CASTest) scenarios for the assessments according to guidelines 2005. AIMS: To analyse the reliability and validity of the individual sub-tests provided by ERC and to find a combination of MCQ and CASTest that provides a reliable and valid single effect measure of ALS...... that possessed high reliability, equality of test sets, and ability to discriminate between the two groups of supposedly different ALS competence. CONCLUSIONS: ERC sub-tests of ALS competence possess sufficient reliability and validity. A combined ALS score with equal weighting of one MCQ and one CASTest can...... competence. METHODS: Two groups of participants were included in this randomised, controlled experimental study: a group of newly graduated doctors, who had not taken the ALS course (N=17) and a group of students, who had passed the ALS course 9 months before the study (N=16). Reliability in terms of inter...

Test of Achievement in Quantitative Economics for Secondary Schools: Construction and Validation Using Item Response Theory

Science.gov (United States)

Eleje, Lydia I.; Esomonu, Nkechi P. M.

2018-01-01

A Test to measure achievement in quantitative economics among secondary school students was developed and validated in this study. The test is made up 20 multiple choice test items constructed based on quantitative economics sub-skills. Six research questions guided the study. Preliminary validation was done by two experienced teachers in…
Validity and Reliability of Published Comprehensive Theory of Mind Tests for Normal Preschool Children: A Systematic Review

Directory of Open Access Journals (Sweden)

Seyyede Zohreh Ziatabar Ahmadi

2015-12-01

Full Text Available Objective: Theory of mind (ToM) or mindreading is an aspect of social cognition that evaluates mental states and beliefs of oneself and others. Validity and reliability are very important criteria when evaluating standard tests; and without them, these tests are not usable. The aim of this study was to systematically review the validity and reliability of published English comprehensive ToM tests developed for normal preschool children. Method: We searched MEDLINE (PubMed interface), Web of Science, Science direct, PsycINFO, and also evidence base Medicine (The Cochrane Library) databases from 1990 to June 2015. Search strategy was Latin transcription of ‘Theory of Mind’ AND test AND children. Also, we manually studied the reference lists of all final searched articles and carried out a search of their references. Inclusion criteria were as follows: Valid and reliable diagnostic ToM tests published from 1990 to June 2015 for normal preschool children; and exclusion criteria were as follows: the studies that only used ToM tests and single tasks (false belief tasks) for ToM assessment and/or had no description about structure, validity or reliability of their tests. Methodological quality of the selected articles was assessed using the Critical Appraisal Skills Programme (CASP). Result: In primary searching, we found 1237 articles in total databases. After removing duplicates and applying all inclusion and exclusion criteria, we selected 11 tests for this systematic review. Conclusion: There were a few valid, reliable and comprehensive ToM tests for normal preschool children. However, we had limitations concerning the included articles. The defined ToM tests were different in populations, tasks, mode of presentations, scoring, mode of responses, times and other variables. Also, they had various validities and reliabilities. Therefore, it is recommended that the researchers and clinicians select the ToM tests according to their psychometric
Development and validation of a theoretical test in non-anaesthesiologist-administered propofol sedation for gastrointestinal endoscopy

DEFF Research Database (Denmark)

Jensen, Jeppe Thue; Savran, Mona Meral; Møller, Ann Merete

2016-01-01

OBJECTIVE: Safety with non-anaesthesiologist-administered propofol sedation (NAAP) during gastrointestinal (GI) endoscopy is related to theoretical knowledge. A summative testing of knowledge before attempting supervised nurse-administered propofol sedation (NAPS) in the clinic is advised. The aims...... of this study were to develop a theoretical test about propofol sedation, to gather validity evidence for the test and to measure the effect of a NAPS-specific training course. MATERIAL AND METHODS: A three-phased psychometric study on multiple choice questionnaire (MCQ) test development, gathering of validity......% increase; p = 0.001 and 0.001, respectively). CONCLUSIONS: Data supported the validity of the developed MCQ test. The NAPS-specific course with pre-course testing adds theoretical knowledge to already well-prepared participants....
Establishing a 'Physician's Spiritual Well-being Scale' and testing its reliability and validity.

Science.gov (United States)

Fang, C K; Li, P Y; Lai, M L; Lin, M H; Bridge, D T; Chen, H W

2011-01-01

The purpose of this study was to develop a Physician's Spiritual Well-Being Scale (PSpWBS). The significance of a physician's spiritual well-being was explored through in-depth interviews with and qualitative data collection from focus groups. Based on the results of qualitative analysis and related literature, the PSpWBS consisting of 25 questions was established. Reliability and validity tests were performed on 177 subjects. Four domains of the PSpWBS were devised: physician's characteristics; medical practice challenges; response to changes; and overall well-being. The explainable total variance was 65.65%. Cronbach α was 0.864 when the internal consistency of the whole scale was calculated. Factor analysis showed that the internal consistency Cronbach α value for each factor was between 0.625 and 0.794 and the split-half reliability was 0.865. The scale has satisfactory reliability and validity and could serve as the basis for assessment of the spiritual well-being of a physician.
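The two internal-consistency statistics quoted in this abstract, Cronbach's α and split-half reliability, can be illustrated with a minimal sketch. The response matrix below is hypothetical, and the odd/even item split with a Spearman-Brown correction is an assumption: the abstract does not say how the halves were formed or which software was used.

```python
import math
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def pearson_r(x, y):
    """Pearson correlation between two equally long score lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def split_half_reliability(rows):
    """Odd/even split-half correlation, stepped up with the Spearman-Brown formula."""
    odd = [sum(r[0::2]) for r in rows]
    even = [sum(r[1::2]) for r in rows]
    r = pearson_r(odd, even)
    return 2 * r / (1 + r)


# Hypothetical data: 5 respondents answering 4 Likert-type items.
responses = [
    [4, 5, 4, 5],
    [3, 3, 2, 3],
    [5, 5, 5, 4],
    [2, 1, 2, 2],
    [4, 4, 3, 4],
]

print("alpha:", round(cronbach_alpha(responses), 3))
print("split-half:", round(split_half_reliability(responses), 3))
```

With real questionnaire data the same functions would be applied to the full respondent-by-item matrix, once for the whole scale and once per factor.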
The Validity and Responsiveness of Isometric Lower Body Multi-Joint Tests of Muscular Strength: a Systematic Review.

Science.gov (United States)

Drake, David; Kennedy, Rodney; Wallace, Eric

2017-12-01

Researchers and practitioners working in sports medicine and science require valid tests to determine the effectiveness of interventions and enhance understanding of mechanisms underpinning adaptation. Such decision making is influenced by the supportive evidence describing the validity of tests within current research. The objective of this study is to review the validity of lower body isometric multi-joint tests in assessing muscular strength and to determine the current level of supporting evidence. Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines were followed in a systematic fashion to search, assess and synthesize existing literature on this topic. Electronic databases such as Web of Science, CINAHL and PubMed were searched up to 18 March 2015. Potential inclusions were screened against eligibility criteria relating to types of test, measurement instrument, properties of validity assessed and population group and were required to be published in English. The Consensus-based Standards for the Selection of health Measurement Instruments (COSMIN) checklist was used to assess methodological quality and measurement property rating of included studies. Studies rated as fair or better in methodological quality were included in the best evidence synthesis. Fifty-nine studies met the eligibility criteria for quality appraisal. The ten studies that rated fair or better in methodological quality were included in the best evidence synthesis. The most frequently investigated lower body isometric multi-joint tests for validity were the isometric mid-thigh pull and isometric squat. The validity of each of these tests was strong in terms of reliability and construct validity. The evidence for responsiveness of tests was found to be moderate for the isometric squat test and unknown for the isometric mid-thigh pull. No tests using the isometric leg press met the criteria for inclusion in the best evidence synthesis. Researchers and
Understanding Student Teachers' Behavioural Intention to Use Technology: Technology Acceptance Model (TAM) Validation and Testing

Science.gov (United States)

Wong, Kung-Teck; Osman, Rosma bt; Goh, Pauline Swee Choo; Rahmat, Mohd Khairezan

2013-01-01

This study sets out to validate and test the Technology Acceptance Model (TAM) in the context of Malaysian student teachers' integration of their technology in teaching and learning. To establish factorial validity, data collected from 302 respondents were tested against the TAM using confirmatory factor analysis (CFA), and structural equation…
New Standards for the Validation of EMC Test Sites particularly above 1 GHz

Directory of Open Access Journals (Sweden)

S. Battermann

2005-01-01

Full Text Available Standards for the validation of alternative test sites with a conducting ground plane have existed for the frequency range 30-1000 MHz since the end of the eighties. Recently the procedure for fully anechoic rooms (FAR) has been included in CISPR 16 after more than 10 years of intensive discussion in standards committees (CENELEC, 2002; CISPR, 2004). But there are no standards available for the validation of alternative test sites above 1 GHz. The responsible working group (WG1) in CISPR/A has drawn up the 7th common draft (CD). A CDV will be published in spring 2005. The German standards committee VDE AK 767.4.1 participates in the drafting of the standard. All measurement procedures proposed in the last CDs have been investigated by measurements and theoretical analysis. This contribution describes the basic ideas and problems of the validation procedure of the test site. Furthermore, measurement results and numerical calculations will be presented, especially for the use of omni-directional antennas.
Naturalistic validation of an on-road driving test of older drivers.

Science.gov (United States)

Ott, Brian R; Papandonatos, George D; Davis, Jennifer D; Barco, Peggy P

2012-08-01

The objective was to compare a standardized road test to naturalistic driving by older people who may have cognitive impairment to define improvements that could potentially enhance the validity of road testing in this population. Road testing has been widely adapted as a tool to assess driving competence of older people who may be at risk for unsafe driving because of dementia; however, the validity of this approach has not been rigorously evaluated. For 2 weeks, 80 older drivers (38 healthy elders and 42 with cognitive impairment) who passed a standardized road test were video recorded in their own vehicles. Using a standardized rating scale, 4 hr of video was rated by a driving instructor. The authors examine weighting of individual road test items to form global impressions and to compare road test and naturalistic driving using factor analyses of these two assessments. The road test score was unidimensional, reflecting a major factor related to awareness of signage and traffic behavior. Naturalistic driving reflected two factors related to lane keeping as well as traffic behavior. Maintenance of proper lane is an important dimension of driving safety that appears to be relatively underemphasized during the highly supervised procedures of the standardized road test. Road testing in this population could be improved by standardized designs that emphasize lane keeping and that include self-directed driving. Additional information should be sought from observers in the community as well as crash evidence when advising older drivers who may be cognitively impaired.
Flight Testing an Iced Business Jet for Flight Simulation Model Validation

Science.gov (United States)

Ratvasky, Thomas P.; Barnhart, Billy P.; Lee, Sam; Cooper, Jon

2007-01-01

A flight test of a business jet aircraft with various ice accretions was performed to obtain data to validate flight simulation models developed through wind tunnel tests. Three types of ice accretions were tested: pre-activation roughness, runback shapes that form downstream of the thermal wing ice protection system, and a wing ice protection system failure shape. The high fidelity flight simulation models of this business jet aircraft were validated using a software tool called "Overdrive." Through comparisons of flight-extracted aerodynamic forces and moments to simulation-predicted forces and moments, the simulation models were successfully validated. Only minor adjustments in the simulation database were required to obtain adequate match, signifying the process used to develop the simulation models was successful. The simulation models were implemented in the NASA Ice Contamination Effects Flight Training Device (ICEFTD) to enable company pilots to evaluate flight characteristics of the simulation models. By and large, the pilots confirmed good similarities in the flight characteristics when compared to the real airplane. However, pilots noted pitch up tendencies at stall with the flaps extended that were not representative of the airplane and identified some differences in pilot forces. The elevator hinge moment model and implementation of the control forces on the ICEFTD were identified as a driver in the pitch ups and control force issues, and will be an area for future work.
Test-retest reliability and validity of the Sniffin' TOM odor memory test.

Science.gov (United States)

Croy, Ilona; Zehner, Cora; Larsson, Maria; Zucco, Gesualdo M; Hummel, Thomas

2015-03-01

Few attempts have been made to develop an olfactory test that captures episodic retention of olfactory information. Assessment of episodic odor memory is of particular interest in aging and in the cognitively impaired as both episodic memory deficits and olfactory loss have been targeted as reliable hallmarks of cognitive decline and impending dementia. Here, 96 healthy participants (18-92 years) and an additional 19 older people (73-82 years) with mild cognitive impairment were tested. Participants were presented with 8 common odors with intentional encoding instructions that were followed by a yes-no recognition test. After recognition completion, participants were asked to identify all odors by means of free or cued identification. A retest of the odor memory test (Sniffin' TOM = test of odor memory) took place 17 days later. The results revealed satisfactory test-retest reliability (0.70) of odor recognition memory. Both recognition and identification performance were negatively affected by age, and the effect was more pronounced among the cognitively impaired. In conclusion, the present work presents a reliable, valid, and simple test of episodic odor recognition memory that may be used in clinical groups where both episodic memory deficits and olfactory loss are prevalent preclinically such as Alzheimer's disease. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The Validity of Graduate Management Admission Test Scores: A Summary of Studies Conducted from 1997 to 2004

Science.gov (United States)

Talento-Miller, Eileen; Rudner, Lawrence M.

2008-01-01

The validity of Graduate Management Admission Test (GMAT) scores is examined by summarizing 273 studies conducted between 1997 and 2004. Each of the studies was conducted through the Validity Study Service of the test sponsor and contained identical variables and statistical methods. Validity coefficients from each of the studies were corrected…
A software prototype development of human system interfaces for human factors engineering validation tests of SMART MCR

International Nuclear Information System (INIS)

Lim, Jong Tae; Han, Kwan Ho; Yang, Seung Won

2011-02-01

An integrated system validation test bed for human factors engineering (HFE) validation tests is being developed. The goal of this study is to develop a software prototype for HFE validation of the SMART MCR design. To achieve this, prototype specifications of the software were first developed. Then software prototypes of the alarm reduction logic system, Plant Protection System, ESF-CCS, Elastic Tile Alarm Indication, and EID-based HSIs were implemented as code. Test procedures for the software prototypes were established to verify the completeness of the implemented code. The software was carefully tested according to these test procedures, and the results were documented.
Development, construct validity and test-retest reliability of a field-based wheelchair mobility performance test for wheelchair basketball

NARCIS (Netherlands)

de Witte, Annemarie M. H.; Hoozemans, Marco J. M.; Berger, Monique A. M.; van der Slikke, Rienk M. A.; van der Woude, Lucas H. V.; Veeger, Dirkjan (H. E. J)

2018-01-01

The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of
Validation of transport models for use in repository performance assessments: a view illustrated for INTRAVAL test case 1b

International Nuclear Information System (INIS)

Jackson, C.P.; Lever, D.A.; Sumner, P.J.

1991-03-01

We present our views on validation. We consider that validation is slightly different for general models and specific models. We stress the importance of presenting for review the case for (or against) a model. We outline a formal framework for validation, which helps to ensure that all the issues are addressed. Our framework includes calibration, testing predictions, comparison with alternative models, which we consider particularly important, analysis of discrepancies, presentation, consideration of implications and suggested improved experiments. We illustrate the approach by application to an INTRAVAL test case based on laboratory experiments. Three models were considered: a simple model that included the effects of advection, dispersion and equilibrium sorption, a model that also included the effects of rock-matrix diffusion, and a model with kinetic sorption. We show that the model with rock-matrix diffusion is the only one to provide a good description of the data. We stress the implications of extrapolating to larger length and time scales for repository performance assessments. (author)
Compressive strength test for cemented waste forms: validation process

International Nuclear Information System (INIS)

Haucz, Maria Judite A.; Candido, Francisco Donizete; Seles, Sandro Rogerio

2007-01-01

In the Cementation Laboratory (LABCIM) of the Development Centre of the Nuclear Technology (CNEN/CDTN-MG), hazardous/radioactive wastes are incorporated in cement to transform them into monolithic products, preventing or minimizing contaminant release to the environment. The compressive strength test, which determines the compression load necessary to rupture the cemented waste form, is important for evaluating the quality of the cemented product. In LABCIM a specific procedure was developed to determine the compressive strength of cemented waste forms based on the Brazilian Standard NBR 7215. The accreditation of this procedure is essential to assure reproducible and accurate results in the evaluation of these products. To achieve this goal the Laboratory personnel implemented technical and administrative improvements in accordance with the NBR ISO/IEC 17025 standard 'General requirements for the competence of testing and calibration laboratories'. As the developed procedure was not a standard one, ISO/IEC 17025 requires its validation, for which several methodologies exist. This paper describes the current status of the accreditation project, especially the validation process of the referred procedure and its results. (author)
Validity of the Mayer-Salovey-Caruso Emotional Intelligence Test: Youth Version-Research Edition

Science.gov (United States)

Peters, Christine; Kranzler, John H.; Rossen, Eric

2009-01-01

This study examines the criterion-related validity evidence of scores on the Mayer-Salovey-Caruso Emotional Intelligence Test: Youth Version-Research Version. The authors also investigate the relationship between scores on the MSCEIT-YV and chronological age. Results provide initial support for the construct validity of the MSCEIT-YV but also…
Flight test techniques for validating simulated nuclear electromagnetic pulse aircraft responses

Science.gov (United States)

Winebarger, R. M.; Neely, W. R., Jr.

1984-01-01

An attempt has been made to determine the effects of nuclear EM pulses (NEMPs) on aircraft systems, using a highly instrumented NASA F-106B to document the simulated NEMP environment at the Kirtland Air Force Base's Vertically Polarized Dipole test facility. Several test positions were selected so that aircraft orientation relative to the test facility would be the same in flight as when on the stationary dielectric stand, in order to validate the dielectric stand's use in flight configuration simulations. Attention is given to the flight test portions of the documentation program.
Validation of a Wave-Body Interaction Model by Experimental Tests

DEFF Research Database (Denmark)

Ferri, Francesco; Kramer, Morten; Pecher, Arthur

2013-01-01

Within the wave energy field, numerical simulation has recently gained worldwide acceptance as a useful tool alongside physical model testing. The main goal of this work is the validation of a numerical model by experimental results. The numerical model is based on a linear wave-body intera...
Validation of the Reading the Mind in the Eyes Test in a healthy Spanish sample and women with anorexia nervosa.

Science.gov (United States)

Redondo, Iratxe; Herrero-Fernández, David

2018-04-11

The aim of this study was to build a Spanish version of the Reading the Mind in the Eyes Test (RMET) including limited time of response and an integrated glossary, and to test its validity. A total of 433 university students (121 men and 350 women) and 38 anorexic women completed the RMET and other related measures of empathy and alexithymia. The results of the Parallel Analysis suggested a unidimensional structure for 19 items, which was verified through a Confirmatory Factor Analysis. As in other research, this factor had low reliability (α = .56, ρ = .59); however, regarding validity, the total score of the instrument showed positive correlations with empathy and negative correlations with alexithymia. Furthermore, healthy females scored higher on the RMET than males and than anorexic women, but no significant differences appeared between healthy men and the anorexic group. This study confirms the validity of the test and permits a relatively short and inexpensive means of administration in large samples of adults. It also suggests the need to assess and treat theory of mind in anorexic women.
Test-retest reliability, smallest real difference and concurrent validity of six different balance tests on young people with mild to moderate intellectual disability.

Science.gov (United States)

Blomqvist, Sven; Wester, Anita; Sundelin, Gunnevi; Rehn, Börje

2012-12-01

Some studies have reported that people with intellectual disability may have reduced balance ability compared with the population in general. However, none of these studies involved adolescents, and the reliability and validity of balance tests in this population are not known. The purpose of this study was to examine the reliability of six different balance tests and to investigate their concurrent validity. Test-retest reliability assessment. All subjects were recruited from a special school for people with intellectual disability in Bollnäs, Sweden. Eighty-nine adolescents (35 females and 54 males) with mild to moderate intellectual disability with a mean age of 18 years (range 16 to 20 years). All subjects followed the same test protocol on two occasions within an 11-day period. Balance test performances. Intraclass correlation coefficients greater than 0.80 were achieved for four of the balance tests: Extended Timed Up and Go Test, Modified Functional Reach Test, One-leg Stance Test and Force Platform Test. The smallest real differences ranged from 12% to 40%; less than 20% is considered to be low. Concurrent validity among these balance tests varied between no and low correlation. The results indicate that these tests could be used to evaluate changes in balance ability over time in people with mild to moderate intellectual disability. The low concurrent validity illustrates the importance of knowing more about the influence of various sensory subsystems that are significant for balance among adolescents with intellectual disability. Copyright © 2011 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
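The smallest real difference (SRD) reported above is commonly derived from the standard error of measurement (SEM), with SEM = SD × √(1 − ICC) and SRD = 1.96 × √2 × SEM. The sketch below follows that common convention, which is an assumption here: the abstract does not state which formula or SD the authors used. The scores and the ICC value are hypothetical.

```python
import math
from statistics import mean, stdev


def smallest_real_difference(test, retest, icc):
    """Return (SEM, SRD, SRD%) from two test occasions and a given ICC.

    Assumes the SD is taken over the pooled scores of both occasions.
    """
    pooled = list(test) + list(retest)
    sem = stdev(pooled) * math.sqrt(1 - icc)
    srd = 1.96 * math.sqrt(2) * sem
    srd_percent = 100 * srd / mean(pooled)
    return sem, srd, srd_percent


# Hypothetical balance-test times in seconds on two occasions.
occasion_1 = [10.0, 12.0, 14.0, 11.0, 13.0]
occasion_2 = [10.5, 11.5, 14.5, 11.0, 12.5]

sem, srd, srd_pct = smallest_real_difference(occasion_1, occasion_2, icc=0.85)
print(f"SEM={sem:.2f}  SRD={srd:.2f}  SRD%={srd_pct:.1f}")
```

A change in an individual's score smaller than the SRD cannot be distinguished from measurement noise, which is why the abstract treats an SRD below 20% of the mean as low.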

Gamma spectrometric validation of measurements test of radionuclides in food matrices

International Nuclear Information System (INIS)

Rosa, Mychelle M.L.; Custodio, Luis G.; Bonifacio, Rodrigo L.; Taddei, Maria Helena T.

2013-01-01

In a testing laboratory, the quality system encompasses a set of planned and systematic activities that ensure the traceability of an analysis, based on the standard NBR ISO/IEC 17025. With the need to analyse radionuclides in food products to meet import and export requirements, accreditation of testing under this standard becomes increasingly necessary. Gamma spectrometry is a technique used for the direct determination of radionuclides in different matrices, food among them, and allows the simultaneous determination of different radionuclides in the same sample without chemical separation. In the accreditation process, methodology validation is an important step that includes testing accuracy, traceability, linearity and recovery. This paper describes the procedures used to validate the assay for determining radionuclides in food using gamma spectrometry. These procedures were performed through analysis of a reference material certified by the International Atomic Energy Agency (IAEA Soil 327), analysis of milk powder samples doped with certified liquid standards, and the results obtained from participation in proficiency tests on the analysis of environmental samples. (author)
Measuring right-hemisphere dysfunction in children: validity of two new computer tests

NARCIS (Netherlands)

Sips, H.J.W.A.; C.E. Catsman-Berrevoets (Coriene); H.R. van Dongen (Huug); van der Werff, P.J.J.; Brooke, L.J.

1994-01-01

The validity of two new computer‐mediated tests for the detection of right‐cerebral hemisphere lesions in children–the Right‐hemisphere Dysfunction Test and the Visual Perception Test–was evaluated. Normative data were drawn from a group of 91 children (aged five to 14 years) and 14
Naturalistic Validation of an On-Road Driving Test of Older Drivers

Science.gov (United States)

Ott, Brian R.; Papandonatos, George D.; Davis, Jennifer D.; Barco, Peggy P.

2013-01-01

Objective: The objective was to compare a standardized road test to naturalistic driving by older people who may have cognitive impairment to define improvements that could potentially enhance the validity of road testing in this population. Background: Road testing has been widely adapted as a tool to assess driving competence of older people who may be at risk for unsafe driving because of dementia; however, the validity of this approach has not been rigorously evaluated. Method: For 2 weeks, 80 older drivers (38 healthy elders and 42 with cognitive impairment) who passed a standardized road test were video recorded in their own vehicles. Using a standardized rating scale, 4 hr of video was rated by a driving instructor. The authors examine weighting of individual road test items to form global impressions and to compare road test and naturalistic driving using factor analyses of these two assessments. Results: The road test score was unidimensional, reflecting a major factor related to awareness of signage and traffic behavior. Naturalistic driving reflected two factors related to lane keeping as well as traffic behavior. Conclusion: Maintenance of proper lane is an important dimension of driving safety that appears to be relatively underemphasized during the highly supervised procedures of the standardized road test. Application: Road testing in this population could be improved by standardized designs that emphasize lane keeping and that include self-directed driving. Additional information should be sought from observers in the community as well as crash evidence when advising older drivers who may be cognitively impaired. PMID:22908688
Development and testing of a cross-culturally valid instrument: food-related life style

DEFF Research Database (Denmark)

Brunsø, Karen; Grunert, Klaus G.

1995-01-01

-culturally valid way. To this end we have developed a pool of 202 items, collected data in three countries, and constructed scales based on cross-culturally stable factor patterns. We have then applied this set of scales to a fourth country, in order to further test the cross-cultural validity of the instrument....
ASSESSMENT OF SATISFACTION IN PERITONEAL EQUILIBRATION TEST: A STUDY ON THE VALIDITY AND RELIABILITY OF THE PERITONEAL EQUILIBRATION SATISFACTION SCALE

Directory of Open Access Journals (Sweden)

Eylem TOPBAŞ

2016-01-01

Full Text Available Aim: This study was designed to develop an assessment tool for determining patients’ level of satisfaction with the peritoneal equilibration test (PET) procedure. Materials and Methods: The development and validation of the Peritoneal Equilibration Test Satisfaction Scale (PETSS) was completed in two phases. Phase I focused on instrument construction and included item development and establishment of concurrent validity. Phase II included the factor analysis and psychometric assessment of the scale. Descriptive statistics and non-parametric tests were used in the statistical evaluation of the data. Results: The first version of the scale, which had a Content Validity Index value of 3.62, was composed of 20 items. The latest version of the scale, which has 14 items, explained 46% of the variance. The Cronbach’s alpha of this scale, whose item-total correlation coefficients ranged from 0.52 to 0.89, was 0.96. Psychometric assessment of the scale revealed that, except for the type of PET application, none of the demographic and clinical characteristics affected patients’ level of satisfaction during the PET application. Conclusion: This preliminary study showed that the PETSS is a valid and reliable scale that can be used to determine the satisfaction level of patients during PET application.
Automated smartphone audiometry: Validation of a word recognition test app.

Science.gov (United States)

Dewyer, Nicholas A; Jiradejvong, Patpong; Henderson Sabes, Jennifer; Limb, Charles J

2018-03-01

Develop and validate an automated smartphone word recognition test. Cross-sectional case-control diagnostic test comparison. An automated word recognition test was developed as an app for a smartphone with earphones. English-speaking adults with recent audiograms and various levels of hearing loss were recruited from an audiology clinic and were administered the smartphone word recognition test. Word recognition scores determined by the smartphone app and the gold standard speech audiometry test performed by an audiologist were compared. Test scores for 37 ears were analyzed. Word recognition scores determined by the smartphone app and audiologist testing were in agreement, with 86% of the data points within a clinically acceptable margin of error and a linear correlation value between test scores of 0.89. The WordRec automated smartphone app accurately determines word recognition scores. 3b. Laryngoscope, 128:707-712, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
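The two agreement measures in this abstract, the share of scores within an acceptable margin and the linear correlation between methods, can be sketched as follows. The paired scores and the 10-point margin are hypothetical; the abstract does not state which margin the authors considered clinically acceptable.

```python
import math


def fraction_within_margin(app_scores, reference_scores, margin):
    """Share of paired scores whose absolute difference is at most `margin`."""
    pairs = list(zip(app_scores, reference_scores))
    return sum(abs(a - b) <= margin for a, b in pairs) / len(pairs)


def pearson_r(x, y):
    """Pearson correlation between two equally long score lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical word recognition scores (percent correct) per ear.
app = [96, 88, 72, 60, 92, 40]
audiologist = [100, 84, 80, 44, 96, 48]

print("within margin:", fraction_within_margin(app, audiologist, margin=10))
print("correlation:", round(pearson_r(app, audiologist), 2))
```

Reporting both figures is useful because a high correlation alone does not rule out a systematic offset between the app and the audiologist's test.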
Preliminary Validation of a New Measure of Negative Response Bias: The Temporal Memory Sequence Test.

Science.gov (United States)

Hegedish, Omer; Kivilis, Naama; Hoofien, Dan

2015-01-01

The Temporal Memory Sequence Test (TMST) is a new measure of negative response bias (NRB) that was developed to enrich the forced-choice paradigm. The TMST does not resemble the common structure of forced-choice tests and is presented as a temporal recall memory test. The validation sample consisted of 81 participants: 21 healthy control participants, 20 coached simulators, and 40 patients with acquired brain injury (ABI). The TMST had high reliability and significantly high positive correlations with the Test of Memory Malingering and Word Memory Test effort scales. Moreover, the TMST effort scales exhibited high negative correlations with the Glasgow Coma Scale, thus validating the previously reported association between probable malingering and mild traumatic brain injury. A suggested cutoff score yielded acceptable classification rates in the ABI group as well as in the simulator and control groups. The TMST appears to be a promising measure of NRB detection, with respectable rates of reliability and construct and criterion validity.
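Classification rates at a cutoff score, as mentioned above, are usually summarised as sensitivity and specificity. The sketch below assumes that scores at or below the cutoff are flagged as suspected negative response bias; that direction, the cutoff value and all scores are hypothetical and are not taken from the TMST study.

```python
def classification_rates(simulator_scores, genuine_scores, cutoff):
    """Return (sensitivity, specificity) when scores <= cutoff are flagged."""
    true_positives = sum(s <= cutoff for s in simulator_scores)
    true_negatives = sum(s > cutoff for s in genuine_scores)
    sensitivity = true_positives / len(simulator_scores)
    specificity = true_negatives / len(genuine_scores)
    return sensitivity, specificity


# Hypothetical effort-scale scores.
simulators = [10, 12, 20, 9, 14]
genuine = [18, 25, 30, 16, 22, 15]

sens, spec = classification_rates(simulators, genuine, cutoff=15)
print(f"sensitivity={sens:.2f}  specificity={spec:.2f}")
```

In validity testing the cutoff is typically chosen to keep specificity high, so that few genuine patients are misclassified, at the cost of some sensitivity.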
Noninvasive electrical conductivity measurement by MRI. A test of its validity and the electrical conductivity characteristics of glioma

Energy Technology Data Exchange (ETDEWEB)

Tha, Khin Khin; Kudo, Kohsuke [Hokkaido University Hospital, Department of Diagnostic and Interventional Radiology, N-14, W-5, Kita-ku, Sapporo (Japan); Hokkaido University, Global Station for Quantum Medical Science and Engineering, Global Institution for Collaborative Research and Education, Sapporo (Japan); Katscher, Ulrich; Stehning, Christian [Philips Research Laboratories, Hamburg (Germany); Yamaguchi, Shigeru; Terasaka, Shunsuke; Kazumata, Ken [Faculty of Medicine, Hokkaido University, Department of Neurosurgery, Sapporo (Japan); Fujima, Noriyuki [Hokkaido University Hospital, Department of Diagnostic and Interventional Radiology, N-14, W-5, Kita-ku, Sapporo (Japan); Yamamoto, Toru [Hokkaido University, Faculty of Health Sciences, Sapporo (Japan); Van Cauteren, Marc [Clinical Science Philips Healthtech Asia Pacific, Tokyo (Japan); Shirato, Hiroki [Hokkaido University, Global Station for Quantum Medical Science and Engineering, Global Institution for Collaborative Research and Education, Sapporo (Japan); Faculty of Medicine, Hokkaido University, Department of Radiation Medicine, Sapporo (Japan)

2018-01-15

This study noninvasively examined the electrical conductivity (σ) characteristics of diffuse gliomas using MRI and tested its validity. MRI including a 3D steady-state free precession (3D SSFP) sequence was performed on 30 glioma patients. The σ maps were reconstructed from the phase images of the 3D SSFP sequence. The σ histogram metrics were extracted and compared among the contrast-enhanced (CET) and noncontrast-enhanced tumour components (NCET) and normal brain parenchyma (NP). Difference in tumour σ histogram metrics among tumour grades and correlation of σ metrics with tumour grades were tested. Validity of σ measurement using this technique was tested by correlating the mean tumour σ values measured using MRI with those measured ex vivo using a dielectric probe. Several σ histogram metrics of CET and NCET of diffuse gliomas were significantly higher than NP (Bonferroni-corrected p ≤.045). The maximum σ of NCET showed a moderate positive correlation with tumour grade (r =.571, Bonferroni-corrected p =.018). The mean tumour σ measured using MRI showed a moderate positive correlation with the σ measured ex vivo (r =.518, p =.040). Tissue σ can be evaluated using MRI, incorporation of which may better characterise diffuse gliomas. (orig.)
Validation of an Instrument to Measure High School Students' Attitudes toward Fitness Testing

Science.gov (United States)

Mercier, Kevin; Silverman, Stephen

2014-01-01

Purpose: The purpose of this investigation was to develop an instrument that has scores that are valid and reliable for measuring students' attitudes toward fitness testing. Method: The method involved the following steps: (a) an elicitation study, (b) item development, (c) a pilot study, and (d) a validation study. The pilot study included 427…
Validating WCAG versions 1.0 and 2.0 through usability testing with disabled users

DEFF Research Database (Denmark)

Rømen, Dagfinn; Svanæs, Dag

2012-01-01

... accessibility evaluations and guidelines in many countries. WCAG version 2.0 was released in 2008. This paper reports on a study that empirically validated the usefulness of using WCAG as a heuristic for website accessibility. Through controlled usability tests of two websites with disabled users (N = 7) and a control group (N = 6), it was found that only 27% of the identified website accessibility problems could have been identified through the use of WCAG 1.0. A similar analysis of conformance to WCAG 2.0 showed a marginal 5% improvement concerning identified website accessibility problems. Compensating for the low number of test subjects with confidence tests gave results that were still low (42% for WCAG 1.0 and 49% for WCAG 2.0, with 95% confidence). It is concluded from this that the application of WAI accessibility guidelines is not sufficient to guarantee website accessibility. It is recommended...
Validation of WIMS-AECL/(MULTICELL)/RFSP system by the results of phase-B test at Wolsung-II unit

Energy Technology Data Exchange (ETDEWEB)

Hong, In Seob; Min, Byung Joo; Suk, Ho Chun [Korea Atomic Energy Research Institute, Taejon (Korea)

1999-03-01

The object of this study is the validation of the WIMS-AECL lattice code, which has been proposed as a substitute for the POWDERPUFS-V (PPV) code. For the validation of this code, the WIMS-AECL/(MULTICELL)/RFSP (lattice calculation/(incremental cross section calculation)/core calculation) code system has been used for the post-simulation of the Phase-B physics test at the Wolsong-II unit. This code system had been used for the Wolsong-I and Point Lepreau reactors, but after a few modifications of the WIMS-AECL input values for Wolsong-II, the results of the WIMS-AECL/RFSP code calculations are much improved over the old ones. Most of the results show good estimation, except for the moderator temperature coefficient test. Verification of this result remains to be done as further work. 6 figs., 15 tabs. (Author)
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

Directory of Open Access Journals (Sweden)

Singh Amika S

2011-12-01

Full Text Available Abstract Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

Science.gov (United States)

2011-01-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
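The classification rule reported in this abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' analysis code: the function names and the sample responses are hypothetical, and the ICC is taken as a precomputed input because the abstract does not state which ICC form was used.

```python
def percentage_agreement(first, second):
    """Percentage of respondents giving the same answer at both measurements."""
    if len(first) != len(second) or not first:
        raise ValueError("both measurements need the same, non-zero length")
    matches = sum(a == b for a, b in zip(first, second))
    return 100.0 * matches / len(first)


def is_good_to_excellent(icc, agreement_pct):
    """Rule quoted in the abstract: ICC > .60 or percentage agreement >= 75%."""
    return icc > 0.60 or agreement_pct >= 75.0


if __name__ == "__main__":
    # Hypothetical answers of four children to one item, one week apart.
    test = [1, 2, 3, 4]
    retest = [1, 2, 3, 5]
    pct = percentage_agreement(test, retest)
    # The ICC value below is made up purely for illustration.
    print(pct, is_good_to_excellent(icc=0.50, agreement_pct=pct))
```

Only the "good to excellent" threshold is encoded, since the abstract does not give the cut-offs that separate "moderate" from "poor".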
OECD validation study to assess intra- and inter-laboratory reproducibility of the zebrafish embryo toxicity test for acute aquatic toxicity testing

NARCIS (Netherlands)

Busquet, F.; Strecker, R.; Rawlings, J.M.; Belanger, S.E.; Braunbeck, T.; Carr, G.J.; Cenijn, P.H.; Fochtman, P.; Gourmelon, A.; Hübler, N.; Kleensang, A.; Knöbel, M.; Kussatz, C.; Legler, J.; Lillicrap, A.; Martínez-Jerónimo, F.; Polleichtner, C.; Rzodeczko, H.; Salinas, E.; Schneider, K.E.; Scholz, S.; van den Brandhof, E.J.; van der Ven, L.T.; Walter-Rohde, S.; Weigt, S.; Witters, H.; Halder, M.

2014-01-01

The OECD validation study of the zebrafish embryo acute toxicity test (ZFET) for acute aquatic toxicity testing evaluated the ZFET reproducibility by testing 20 chemicals at 5 different concentrations in 3 independent runs in at least 3 laboratories. Stock solutions and test concentrations were
Validation of EncephalApp, Smartphone-Based Stroop Test, for the Diagnosis of Covert Hepatic Encephalopathy.

Science.gov (United States)

Bajaj, Jasmohan S; Heuman, Douglas M; Sterling, Richard K; Sanyal, Arun J; Siddiqui, Muhammad; Matherly, Scott; Luketic, Velimir; Stravitz, R Todd; Fuchs, Michael; Thacker, Leroy R; Gilles, HoChong; White, Melanie B; Unser, Ariel; Hovermale, James; Gavis, Edith; Noble, Nicole A; Wade, James B

2015-10-01

Detection of covert hepatic encephalopathy (CHE) is difficult, but point-of-care testing could increase rates of diagnosis. We aimed to validate the ability of the smartphone app EncephalApp, a streamlined version of Stroop App, to detect CHE. We evaluated face validity, test-retest reliability, and external validity. Patients with cirrhosis (n = 167; 38% with overt HE [OHE]; mean age, 55 years; mean Model for End-Stage Liver Disease score, 12) and controls (n = 114) were each given a paper and pencil cognitive battery (standard) along with EncephalApp. EncephalApp has Off and On states; results measured were OffTime, OnTime, OffTime+OnTime, and number of runs required to complete 5 off and on runs. Thirty-six patients with cirrhosis underwent driving simulation tests, and EncephalApp results were correlated with results. Test-retest reliability was analyzed in a subgroup of patients. The test was performed before and after transjugular intrahepatic portosystemic shunt placement, and before and after correction for hyponatremia, to determine external validity. All patients with cirrhosis performed worse on paper and pencil and EncephalApp tests than controls. Patients with cirrhosis and OHE performed worse than those without OHE. Age-dependent EncephalApp cutoffs (younger or older than 45 years) were set. An OffTime+OnTime value of >190 seconds identified all patients with CHE with an area under the receiver operator characteristic value of 0.91; the area under the receiver operator characteristic value was 0.88 for diagnosis of CHE in those without OHE. EncephalApp times correlated with crashes and illegal turns in driving simulation tests. Test-retest reliability was high (intraclass coefficient, 0.83) among 30 patients retested 1-3 months apart. OffTime+OnTime increased significantly (206 vs 255 seconds, P = .007) among 10 patients retested 33 ± 7 days after transjugular intrahepatic portosystemic shunt placement. OffTime+OnTime decreased significantly (242 vs
Development and validation of an OECD reproductive toxicity test guideline with the mudsnail Potamopyrgus antipodarum (Mollusca, Gastropoda).

Science.gov (United States)

Ruppert, Katharina; Geiß, Cornelia; Askem, Clare; Benstead, Rachel; Brown, Rebecca; Coke, Maira; Ducrot, Virginie; Egeler, Philipp; Holbech, Henrik; Hutchinson, Thomas H; Kinnberg, Karin L; Lagadic, Laurent; Le Page, Gareth; Macken, Ailbhe; Matthiessen, Peter; Ostermann, Sina; Schimera, Agnes; Schmitt, Claudia; Seeland-Fremer, Anne; Smith, Andy J; Weltje, Lennart; Oehlmann, Jörg

2017-08-01

Mollusks are known to be uniquely sensitive to a number of reproductive toxicants including some vertebrate endocrine disrupting chemicals. However, they have widely been ignored in environmental risk assessment procedures for chemicals. This study describes the validation of the Potamopyrgus antipodarum reproduction test within the OECD Conceptual Framework for Endocrine Disrupters Testing and Assessment. The number of embryos in the brood pouch and adult mortality serve as main endpoints. The experiments are conducted as static systems in beakers filled with artificial medium, which is aerated through glass pipettes. The test chemical is dispersed into the medium, and adult snails are subsequently introduced into the beakers. After 28 days the reproductive success is determined by opening the brood pouch and embryo counting. This study presents the results of two validation studies of the reproduction test with eleven laboratories and the chemicals tributyltin (TBT) with nominal concentrations ranging from 10 to 1000 ng TBT-Sn/L and cadmium with concentrations from 1.56 to 25 μg/L. The test design could be implemented by all laboratories resulting in comparable effect concentrations for the endpoint number of embryos in the brood pouch. After TBT exposure, mean EC10, EC50, NOEC and LOEC were 35.6, 127, 39.2 and 75.7 ng Sn/L, respectively. Mean effect concentrations in cadmium exposed snails were, respectively, 6.53, 14.2, 6.45 and 12.6 μg/L. The effect concentrations are in good accordance with already published data. Both validation studies show that the reproduction test with P. antipodarum is a well-suited tool to assess reproductive effects of chemicals. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reliability and validity of videotaped functional performance tests in ACL-injured subjects

DEFF Research Database (Denmark)

von Porat, Anette; Holmström, Eva; Roos, Ewa

2008-01-01

BACKGROUND AND PURPOSE: In clinical practice, visual observation is often used to determine functional impairment and to evaluate treatment following a knee injury. The aim of this study was to evaluate the reliability and validity of observational assessments of knee movement pattern quality......, crossover hop on one leg and one-leg hop. The videos were observed by four physiotherapists, and the knee movement pattern quality, a feature of the loading strategy of the lower extremity, was scored on an 11-point rating scale. To assess the criterion validity, the observational rating was correlated...... obtained between the observers' assessment and knee flexion angle, r = 0.37-0.61. The crossover hop test or one-leg hop test was ranked as the most useful test in 172 of 192 occasions (90%) when assessing knee function. CONCLUSION: The moderate to good inter-observer reliability and the moderate criterion...
Testing the Predictive Validity of the IELTS Test on Omani English Candidates’ Professional Competencies

Directory of Open Access Journals (Sweden)

Moza Abdullah Said Al-Malki

2014-09-01

Full Text Available This study investigated the relationship between IELTS testing and Omani English teacher trainees’ professional competencies by adopting a quantitative method for data collection. IELTS scores, CGPA and teaching professional competencies were collected for a total of 94 graduate freshman Omani English teachers. The results reveal a moderate significant relationship between IELTS and CGPA but a weak relationship between IELTS and teaching competencies. This study could contribute to the growing body of literature that aims to assess the construct validity of IELTS, and attempts to do so in the new terrain of teaching competencies. This study puts forward recommendations for the IELTS proficiency test in the Omani context.
1:50 Scale Testing of Three Floating Wind Turbines at MARIN and Numerical Model Validation Against Test Data

Energy Technology Data Exchange (ETDEWEB)

Dagher, Habib [Univ. of Maine, Orono, ME (United States); Viselli, Anthony [Univ. of Maine, Orono, ME (United States); Goupee, Andrew [Univ. of Maine, Orono, ME (United States); Allen, Christopher [Univ. of Maine, Orono, ME (United States)

2017-08-15

The primary goal of the basin model test program discussed herein is to properly scale and accurately capture physical data of the rigid body motions, accelerations and loads for different floating wind turbine platform technologies. The intended use for this data is for performing comparisons with predictions from various aero-hydro-servo-elastic floating wind turbine simulators for calibration and validation. Of particular interest is validating the floating offshore wind turbine simulation capabilities of NREL’s FAST open-source simulation tool. Once the validation process is complete, coupled simulators such as FAST can be used with a much greater degree of confidence in design processes for commercial development of floating offshore wind turbines. The test program subsequently described in this report was performed at MARIN (Maritime Research Institute Netherlands) in Wageningen, the Netherlands. The models considered consisted of the horizontal axis, NREL 5 MW Reference Wind Turbine (Jonkman et al., 2009) with a flexible tower affixed atop three distinct platforms: a tension leg platform (TLP), a spar-buoy modeled after the OC3 Hywind (Jonkman, 2010) and a semi-submersible. The three generic platform designs were intended to cover the spectrum of currently investigated concepts, each based on proven floating offshore structure technology. The models were tested under Froude scale wind and wave loads. The high-quality wind environments, unique to these tests, were realized in the offshore basin via a novel wind machine which exhibits negligible swirl and low turbulence intensity in the flow field. Recorded data from the floating wind turbine models included rotor torque and position, tower top and base forces and moments, mooring line tensions, six-axis platform motions and accelerations at key locations on the nacelle, tower, and platform. A large number of tests were performed ranging from simple free-decay tests to complex operating conditions with
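As a rough illustration of what Froude scaling implies for a 1:50 model, the sketch below lists the standard Froude scale factors and converts a full-scale quantity to model scale. It assumes the same fluid density at both scales (no fresh water versus salt water correction), and the function and key names are hypothetical; none of it is taken from the report itself.

```python
import math


def froude_scale_factors(scale_ratio):
    """Full-scale / model-scale ratios under Froude similitude.

    scale_ratio is the geometric ratio (50.0 for a 1:50 model).
    Assumption: identical fluid density in the basin and at full scale.
    """
    if scale_ratio <= 0:
        raise ValueError("scale_ratio must be positive")
    return {
        "length": scale_ratio,
        "time": math.sqrt(scale_ratio),
        "velocity": math.sqrt(scale_ratio),
        "acceleration": 1.0,
        "mass": scale_ratio ** 3,
        "force": scale_ratio ** 3,
        "moment": scale_ratio ** 4,
        "power": scale_ratio ** 3.5,
    }


def to_model_scale(full_scale_value, quantity, scale_ratio=50.0):
    """Convert a full-scale value to its Froude-scaled model value."""
    return full_scale_value / froude_scale_factors(scale_ratio)[quantity]


if __name__ == "__main__":
    # A 5 MW rated power and a 10 s wave period, expressed at 1:50 scale.
    print(to_model_scale(5.0e6, "power"))
    print(to_model_scale(10.0, "time"))
```

Under these assumptions the 5 MW rating corresponds to only a few watts at model scale, which gives a sense of why the quality of the basin wind environment matters so much in such tests.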
Preparation, validation and user-testing of pictogram-based patient information leaflets for tuberculosis.

Science.gov (United States)

Shrestha, Anmol; Rajesh, V; Dessai, Sneha Shamrao; Stanly, Sharon Mary; Mateti, Uday Venkat

2018-05-25

Patient education about the disease, its treatment and lifestyle changes is of paramount importance for achieving the desired therapeutic outcome. When verbal information is provided to patients, they often tend to forget it. Pictorial aids or pictograms, as they are commonly known, are tools that are widely used for imparting knowledge to patients. The aim of the study is to prepare and validate Pictogram-based Patient Information Leaflets (P-PILs) on Tuberculosis (TB). P-PILs have been prepared from tertiary, secondary and primary sources. The knowledge-based questions are prepared with respect to the P-PILs. The baseline knowledge of the volunteers and patients has been analyzed before administering the P-PILs by using the validated questionnaire. The post-knowledge of the volunteers and patients has been analyzed after administering the P-PILs (20 min) by using the same questionnaire, and the user-opinion has also been obtained at the end. The study results show that the mean scores of the overall user-testing knowledge assessment improved significantly from the pre-P-PILs administration score of 62.67 to the post-P-PILs administration score of 91. The overall user-opinion about the P-PILs has been found to be good (75%) followed by average (25%). The present study shows that there is significant improvement in the knowledge levels of the patients and volunteers after reading the validated leaflets. The Pictogram-based Patient Information Leaflets are found to be an effective educational tool for TB patients. Copyright © 2018. Published by Elsevier Ltd.

A Comparison of Validity Rates between Paper-and-Pencil and Computerized Testing with the MMPI-2

Science.gov (United States)

Blazek, Nicole L.; Forbey, Johnathan D.

2011-01-01

Although the use of computerized testing in psychopathology assessment has increased in recent years, limited research has examined the impact of this format in terms of potential differences in test validity rates. The current study explores potential differences in the rates of valid and invalid Minnesota Multiphasic Personality Inventory--2…
Validation of the Turkish Version of the Cognitive Test Anxiety Scale–Revised

Directory of Open Access Journals (Sweden)

Sati Bozkurt

2017-01-01

Full Text Available The current study explored the psychometric properties of the newly designed Turkish version of the Cognitive Test Anxiety Scale–Revised (CTAR). Results of an exploratory factor analysis revealed a unidimensional structure consistent with the conceptualized nature of cognitive test anxiety and previous examinations of the English version of the CTAR. Examination of the factor loadings revealed two items that were weakly related to the test anxiety construct and as such were prime candidates for removal. Confirmatory factor analyses were conducted to compare model fit for the 25- and 23-item versions of the measure. Results indicated that the 23-item version of the measure provided a better fit to the data, which supports the removal of the problematic items in the Turkish version of the CTAR. Additional analyses demonstrated the internal consistency, test–retest reliability, concurrent validity, and gender equivalence for responses offered on the Turkish version of the measure. Results of the analysis revealed that the 23-item Turkish version of the CTAR (T-CTAR) is a valid and reliable measure of cognitive test anxiety for use among Turkish students.
Diagnostic validity of physical examination tests for common knee disorders: An overview of systematic reviews and meta-analysis.

Science.gov (United States)

Décary, Simon; Ouellet, Philippe; Vendittoli, Pascal-André; Roy, Jean-Sébastien; Desmeules, François

2017-01-01

More evidence on diagnostic validity of physical examination tests for knee disorders is needed to lower frequently used and costly imaging tests. To conduct a systematic review of systematic reviews (SR) and meta-analyses (MA) evaluating the diagnostic validity of physical examination tests for knee disorders. A structured literature search was conducted in five databases until January 2016. Methodological quality was assessed using the AMSTAR. Seventeen reviews were included with mean AMSTAR score of 5.5 ± 2.3. Based on six SR, only the Lachman test for ACL injuries is diagnostically valid when individually performed (Likelihood ratio (LR+):10.2, LR-:0.2). Based on two SR, the Ottawa Knee Rule is a valid screening tool for knee fractures (LR-:0.05). Based on one SR, the EULAR criteria had a post-test probability of 99% for the diagnosis of knee osteoarthritis. Based on two SR, a complete physical examination performed by a trained health provider was found to be diagnostically valid for ACL, PCL and meniscal injuries as well as for cartilage lesions. When individually performed, common physical tests are rarely able to rule in or rule out a specific knee disorder, except the Lachman for ACL injuries. There is low-quality evidence concerning the validity of combining history elements and physical tests. Copyright © 2016 Elsevier Ltd. All rights reserved.
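To make the reported likelihood ratios concrete, the sketch below converts a pre-test probability into a post-test probability via odds. The pre-test probability of 0.30 is a made-up value for illustration only; the LR+ of 10.2 and LR- of 0.2 are the Lachman test figures quoted in the abstract, and the function name is hypothetical.

```python
def post_test_probability(pre_test_probability, likelihood_ratio):
    """Apply a likelihood ratio to a pre-test probability.

    Steps: probability -> odds, multiply by the LR, odds -> probability.
    """
    if not 0.0 < pre_test_probability < 1.0:
        raise ValueError("pre_test_probability must lie strictly between 0 and 1")
    if likelihood_ratio < 0.0:
        raise ValueError("likelihood_ratio must be non-negative")
    pre_odds = pre_test_probability / (1.0 - pre_test_probability)
    post_odds = pre_odds * likelihood_ratio
    return post_odds / (1.0 + post_odds)


if __name__ == "__main__":
    pre_test = 0.30  # hypothetical prevalence of ACL injury in the tested group
    print(post_test_probability(pre_test, 10.2))  # after a positive Lachman test
    print(post_test_probability(pre_test, 0.2))   # after a negative Lachman test
```

With this assumed starting point, a positive result raises the probability to roughly 0.81 and a negative result lowers it to roughly 0.08, which is the sense in which a single test can help rule a disorder in or out.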
Validation of the Brazilian version of the childhood asthma control test (c-ACT).

Science.gov (United States)

Oliveira, Suelen G; Sarria, Edgar E; Roncada, Cristian; Stein, Renato T; Pitrez, Paulo M; Mattiello, Rita

2016-04-01

Children's perception of their symptoms has proved reliable and relevant to disease management and should be considered when assessing their asthma control. The aim of the study is to validate the Brazilian Portuguese version of the Childhood Asthma Control Test (c-ACT) in children aged 4-11 years. This is a cross-sectional study in children diagnosed with asthma undergoing treatment in a pediatric pulmonology outpatient clinic in Porto Alegre, Brazil. The translation and linguistic adaptation of the instrument were performed in accordance with international recommendations for questionnaire validation. A total of 105 participants were included, aged 4-11 years. All correlations between the total score and items on the questionnaire were significant and obtained values of r ≥ 0.3, and c-ACT means showed statistically significant differences between the GINA categories (P < … ACT scores than those of the uncontrolled asthma group (controlled 22.0 ± 2.9 vs. uncontrolled 16.3 ± 5.3, P < … ACT scores than those of the uncontrolled asthma group (partially controlled 20.0 ± 4.0 vs. uncontrolled 16.3 ± 5.3, P = 0.03). Correlations between the c-ACT total score and spirometry and nitric oxide were poor (r = 0.020; P = 0.866 and r = 0.035; P = 0.753, respectively). Reliability: the α-C coefficient for the c-ACT total score was 0.677 (95% CI 0.573-0.763). Sensitivity to change had an effect size of 0.8 and an intraclass correlation coefficient of 0.598. No floor or ceiling effects were observed. The Brazilian version of the Childhood Asthma Control Test proved to be valid and reliable in children aged 4-11 years. © 2015 Wiley Periodicals, Inc.
Reliability and validity of a 20-s alternative to the wingate anaerobic test in team sport male athletes.

Directory of Open Access Journals (Sweden)

Ahmed Attia

Full Text Available The intent of this study was to evaluate relative and absolute reliability of the 20-s anaerobic test (WAnT20) versus the WAnT30 and to verify how far the various indices of the 30-s Wingate anaerobic test (WAnT30) could be predicted from the WAnT20 data in male athletes. The participants were Exercise Science majors (age: 21.5±1.6 yrs, stature: 0.183±0.08 m, body mass: 81.2±10.9 kg) who participated regularly in team sports. In Phase I, 41 participants performed duplicate WAnT20 and WAnT30 tests to assess reliability. In Phase II, 31 participants performed one trial each of the WAnT20 and WAnT30 to determine the ability of the WAnT20 to predict components of the WAnT30. In Phase III, 31 participants were used to cross-validate the prediction equations developed in Phase II. Respective intra-class correlation coefficients (ICC) for peak power output (PPO) (ICC = 0.98 and 0.95) and mean power output (MPO) (ICC = 0.98 and 0.90) did not differ significantly between WAnT20 and WAnT30. ICCs for minimal power output (POmin) and fatigue index (FI) were poor for both tests (range 0.53 to 0.76). Standard errors of the means (SEM) for PPO and MPO were less than their smallest worthwhile changes (SWC) in both tests; however, POmin and FI values were "marginal," with SEM values greater than their respective SWCs for both tests. Stepwise regression analysis showed that MPO had the highest coefficient of predictability (R = 0.97), with POmin and FI considerably lower (R = 0.71 and 0.41, respectively). Cross-validation showed insignificant bias with limits of agreement of 0.99±1.04, 6.5±92.7 W, and 1.6±9.8% between measured and predicted MPO, POmin, and FI, respectively. WAnT20 offers a reliable and valid test of leg anaerobic power in male athletes and could replace the classic WAnT30.
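The "bias with limits of agreement" figures in the cross-validation step can be illustrated with a short sketch. It assumes the conventional Bland-Altman form (mean difference ± 1.96 standard deviations of the differences), which the abstract does not spell out; the function name and the power values are hypothetical.

```python
import statistics


def limits_of_agreement(measured, predicted, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Assumption: Bland-Altman limits, bias +/- z * SD of the paired differences.
    """
    if len(measured) != len(predicted) or len(measured) < 2:
        raise ValueError("need two equally long series with at least two pairs")
    differences = [m - p for m, p in zip(measured, predicted)]
    bias = statistics.mean(differences)
    spread = z * statistics.stdev(differences)  # sample SD (n - 1)
    return bias, bias - spread, bias + spread


if __name__ == "__main__":
    # Made-up mean power outputs in watts: measured vs. predicted values.
    measured = [610.0, 655.0, 702.0, 688.0]
    predicted = [605.0, 660.0, 698.0, 690.0]
    print(limits_of_agreement(measured, predicted))
```

A bias close to zero with narrow limits indicates that the prediction can stand in for the measurement; the wide limits quoted for POmin in the abstract point the other way.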
Assessment of Technical Skills in Young Soccer Goalkeepers: Reliability and Validity of Two Goalkeeper-Specific Tests

Directory of Open Access Journals (Sweden)

Ricardo Rebelo-Gonçalves, António J. Figueiredo, Manuel J. Coelho-e-Silva, Antonio Tessitore

2016-09-01

Full Text Available The purpose of this study was to evaluate the reproducibility and validity of two new tests designed to examine goalkeeper-specific technique. Twenty-six goalkeepers (14.49 ± 2.52 years old) completed two trial sessions, each separated by one week, to evaluate the reproducibility of the Sprint-Keeper Test (S-Keeper) and the Lateral Shuffle-Keeper Test (LS-Keeper). Construct validity was assessed among forty goalkeepers (14.49 ± 1.71 years old) by competitive level (elite versus non-elite), after controlling for chronological age. All participants were examined in vertical jump (CMJ and CMJ-free arms), acceleration (5-m and 10-m sprint) and goalkeeper-specific technique. The S-Keeper requires the goalkeeper to accelerate during 3 m and dive over a stationary ball after performing a change of direction in a total distance of 10 m. The LS-Keeper involves three changes of direction and a diving save over a stationary ball, in a total distance of 12.55 m. Performance was respectively measured as total time for the right and left sides in each protocol. Bivariate correlations between repeated measures were high and significant (r = 0.835 – 0.912). Test-retest results for the S-Keeper and LS-Keeper showed good reliability (reliability coefficients > 0.88, intra-class correlation coefficient > 0.908 and coefficients of variation < 4.37%), even though participants tended to improve performance when diving to their right side (p < 0.05). Both tests were able to detect significant differences between elite and non-elite goalkeepers, particularly to the left side (p < 0.05). These findings suggest that the S-Keeper and LS-Keeper are reliable and valid tests for assessing goalkeeper-specific technique. Both protocols can be used as a practical tool to provide relevant information about the influence of several components of performance in the overall execution of a diving save, particularly movement patterns, take-off movements and possible asymmetries.
VALIDATION REPORT (PHASE 2) FOR THE FISH SEXUAL DEVELOPMENT TEST FOR THE DETECTION OF ENDOCRINE ACTIVE SUBSTANCES

DEFF Research Database (Denmark)

Holbech, Henrik; Kinnberg, Karin Lund; Petersen, Gitte

This document presents the validation report (phase 2) of the Fish Sexual Development Test (FSDT). The Fish Sexual Development Test (FSDT) covers a life-stage where sexual development is particularly sensitive to perturbation caused by endocrine active chemicals. The chemical exposure lasts for about 60 days, at the end of which endpoints of ecological relevance like the sex ratio of the exposed fish are calculated and the biomarker endpoint vitellogenin is measured in individual animals. In 2003, Denmark, on behalf of the European Nordic countries, proposed a new project to develop a Test Guideline on the fish sexual development test to the Working Group of the National Coordinators of the Test Guidelines Programme (WNT). The project was included on the Test Guidelines workplan in 2003, and extensive validation of the test method was carried out until 2009. Two validation studies were...
Applicability of U.S. Army tracer test data to model validation needs of ERDA

International Nuclear Information System (INIS)

Shearer, D.L.; Minott, D.H.

1976-06-01

This report covers the first phase of an atmospheric dispersion model validation project sponsored by the Energy Research and Development Administration (ERDA). The project will employ dispersion data generated during an extensive series of field tracer experiments that were part of a meteorological research program which was conducted by the U. S. Army Dugway Proving Ground, Utah, from the late 1950's to the early 1970's. The tests were conducted at several locations in the U. S., South America, Germany, and Norway chosen to typify the effects of certain environmental factors on atmospheric dispersion. The purpose of the Phase I work of this project was to identify applicable portions of the Army data, obtain and review that data, and make recommendations for its uses for atmospheric dispersion model validations. This report presents key information in three formats. The first is a tabular listing of the Army dispersion test reports summarizing the test data contained in each report. This listing is presented in six separate tables with each tabular list representing a different topical area that is based on model validation requirements and the nature of the Army data base. The second format for presenting key information is a series of discussions of the Army test information assigned to each of the six topical areas. These discussions relate the extent and quality of the available data, as well as its prospective use for model validation. The third format is a series of synopses for each Army test report
Video game addiction test: validity and psychometric characteristics.

Science.gov (United States)

van Rooij, Antonius J; Schoenmakers, Tim M; van den Eijnden, Regina J J M; Vermulst, Ad A; van de Mheen, Dike

2012-09-01

The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet-enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study used data (n=2,894) from a large-sample paper-and-pencil questionnaire study, conducted in 2009 in secondary schools in the Netherlands. Thus, the main source of data was a large sample of schoolchildren (aged 13-16 years). Measurements included the proposed VAT, the Compulsive Internet Use Scale, weekly hours spent on various game types, and several psychosocial variables. The VAT demonstrated excellent reliability, excellent construct validity, a one-factor model fit, and a high degree of measurement invariance across gender, ethnicity, and learning year, indicating that the scale outcomes can be compared across different subgroups with little bias. In summary, the VAT can be helpful in the further study of video game addiction, and it contributes to the debate on possible inclusion of behavioral addictions in the upcoming DSM-V.
Review of seismic tests for qualification of components and validation of methods

International Nuclear Information System (INIS)

Buland, P.; Gantenbein, F.; Gibert, R.J.; Hoffmann, A.; Queval, J.C.

1988-01-01

Seismic tests have been performed at CEA-DEMT for many years in order to demonstrate the qualification of components and to provide experimental validation of the calculation methods used for the seismic design of components. The paper presents examples of these two types of tests, a description of the existing facilities, and details about the new TAMARIS facility under construction. (author)
Review of seismic tests for qualification of components and validation of methods

Energy Technology Data Exchange (ETDEWEB)

Buland, P; Gantenbein, F; Gibert, R J; Hoffmann, A; Queval, J C [CEA-CEN SACLAY-DEMT, Gif sur Yvette-Cedex (France)]

1988-07-01

Seismic tests have been performed at CEA-DEMT for many years in order to demonstrate the qualification of components and to provide experimental validation of the calculation methods used for the seismic design of components. The paper presents examples of these two types of tests, a description of the existing facilities, and details about the new TAMARIS facility under construction. (author)
Translation and validation of the Malay version of the Stroke Knowledge Test

Directory of Open Access Journals (Sweden)

Siti Noorkhairina Sowtali

2016-04-01

Conclusions: The Malay version of the Stroke Knowledge Test was a valid and reliable tool for assessing educational needs and evaluating stroke knowledge among participants of group-based stroke education programs in Malaysia.
Prevalence of Invalid Performance on Baseline Testing for Sport-Related Concussion by Age and Validity Indicator.

Science.gov (United States)

Abeare, Christopher A; Messa, Isabelle; Zuccato, Brandon G; Merker, Bradley; Erdodi, Laszlo

2018-03-12

Estimated base rates of invalid performance on baseline testing (base rates of failure) for the management of sport-related concussion range from 6.1% to 40.0%, depending on the validity indicator used. The instability of this key measure represents a challenge in the clinical interpretation of test results that could undermine the utility of baseline testing. To determine the prevalence of invalid performance on baseline testing and to assess whether the prevalence varies as a function of age and validity indicator. This retrospective, cross-sectional study included data collected between January 1, 2012, and December 31, 2016, from a clinical referral center in the Midwestern United States. Participants included 7897 consecutively tested, equivalently proportioned male and female athletes aged 10 to 21 years, who completed baseline neurocognitive testing for the purpose of concussion management. Baseline assessment was conducted with the Immediate Postconcussion Assessment and Cognitive Testing (ImPACT), a computerized neurocognitive test designed for assessment of concussion. Base rates of failure on published ImPACT validity indicators were compared within and across age groups. Hypotheses were developed after data collection but prior to analyses. Of the 7897 study participants, 4086 (51.7%) were male, mean (SD) age was 14.71 (1.78) years, 7820 (99.0%) were primarily English speaking, and the mean (SD) educational level was 8.79 (1.68) years. The base rate of failure ranged from 6.4% to 47.6% across individual indicators. Most of the sample (55.7%) failed at least 1 of 4 validity indicators. The base rate of failure varied considerably across age groups (117 of 140 [83.6%] for those aged 10 years to 14 of 48 [29.2%] for those aged 21 years), representing a risk ratio of 2.86 (95% CI, 2.60-3.16; P indicator and the age of the examinee. The strong age association, with 3 of 4 participants aged 10 to 12 years failing validity indicators, suggests that the
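The risk ratio quoted above can be checked against the two age-group counts given in the abstract. The following is a minimal sketch, assuming the ratio is simply the quotient of the two failure proportions; the function name is illustrative and not taken from the study.

```python
def risk_ratio(events_a, total_a, events_b, total_b):
    """Ratio of the failure proportion in group A to that in group B."""
    return (events_a / total_a) / (events_b / total_b)


# Counts quoted in the abstract: 117 of 140 (aged 10) vs 14 of 48 (aged 21).
rate_age_10 = 117 / 140
rate_age_21 = 14 / 48
rr = risk_ratio(117, 140, 14, 48)
print(f"{rate_age_10:.1%} vs {rate_age_21:.1%}, risk ratio = {rr:.3f}")
```

The quotient comes out close to the reported 2.86. The abstract does not state how the confidence interval was derived, so this sketch does not attempt to reproduce it.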
Development, test-retest reliability, and construct validity of the resistance training skills battery.

Science.gov (United States)

Lubans, David R; Smith, Jordan J; Harries, Simon K; Barnett, Lisa M; Faigenbaum, Avery D

2014-05-01

The aim of this study was to describe the development and assess test-retest reliability and construct validity of the Resistance Training Skills Battery (RTSB) for adolescents. The RTSB provides an assessment of resistance training skill competency and includes 6 exercises (i.e., body weight squat, push-up, lunge, suspended row, standing overhead press, and front support with chest touches). Scoring for each skill is based on the number of performance criteria successfully demonstrated. An overall resistance training skill quotient (RTSQ) is created by adding participants' scores for the 6 skills. Participants (44 boys and 19 girls, mean age = 14.5 ± 1.2 years) completed the RTSB on 2 occasions separated by 7 days. Participants also completed the following fitness tests, which were used to create a muscular fitness score (MFS): handgrip strength, timed push-up, and standing long jump tests. Intraclass correlation (ICC), paired samples t-tests, and typical error were used to assess test-retest reliability. To assess construct validity, gender and RTSQ were entered into a regression model predicting MFS. The rank order repeatability of the RTSQ was high (ICC = 0.88). The model explained 39% of the variance in MFS (p ≤ 0.001) and RTSQ (r = 0.40, p ≤ 0.001) was a significant predictor. This study has demonstrated the construct validity and test-retest reliability of the RTSB in a sample of adolescents. The RTSB can reliably rank participants with regard to their resistance training competency and has the necessary sensitivity to detect small changes in resistance training skill proficiency.
Virtual reality myringotomy simulation with real-time deformation: development and validity testing.

Science.gov (United States)

Ho, Andrew K; Alsaffar, Hussain; Doyle, Philip C; Ladak, Hanif M; Agrawal, Sumit K

2012-08-01

Surgical simulation is becoming an increasingly common training tool in residency programs. The first objective was to implement real-time soft-tissue deformation and cutting into a virtual reality myringotomy simulator. The second objective was to test the various implemented incision algorithms to determine which most accurately represents the tympanic membrane during myringotomy. Descriptive and face-validity testing. A deformable tympanic membrane was developed, and three soft-tissue cutting algorithms were successfully implemented into the virtual reality myringotomy simulator. The algorithms included element removal, direction prediction, and Delaunay cutting. The simulator was stable and capable of running in real time on inexpensive hardware. A face-validity study was then carried out using a validated questionnaire given to eight otolaryngologists and four senior otolaryngology residents. Each participant was given an adaptation period on the simulator, was blinded to the algorithm being used, and was presented the three algorithms in a randomized order. A virtual reality myringotomy simulator with real-time soft-tissue deformation and cutting was successfully developed. The simulator was stable, ran in real time on inexpensive hardware, and incorporated haptic feedback and stereoscopic vision. The Delaunay cutting algorithm was found to be the most realistic algorithm representing the incision during myringotomy (P virtual reality myringotomy simulator is being developed and now integrates a real-time deformable tympanic membrane that appears to have face validity. Further development and validation studies are necessary before the simulator can be studied with respect to training efficacy and clinical impact. Copyright © 2012 The American Laryngological, Rhinological, and Otological Society, Inc.
Numerical Simulation and Experimental Validation of the Inflation Test of Latex Balloons

OpenAIRE

Bustos, Claudio; Herrera, Claudio García; Celentano, Diego; Chen, Daming; Cruchaga, Marcela

2016-01-01

Abstract Experiments and modeling aimed at assessing the mechanical response of latex balloons in the inflation test are presented. To this end, the hyperelastic Yeoh material model is first characterized via a tensile test and then used to simulate numerically, via finite elements, the stress-strain evolution during the inflation test. The numerical pressure-displacement curves are validated with those obtained experimentally. Moreover, this analysis is extended to a biomedical problem of an...
A Structured Clinical Interview for Kleptomania (SCI-K): preliminary validity and reliability testing.

Science.gov (United States)

Grant, Jon E; Kim, Suck Won; McCabe, James S

2006-06-01

Kleptomania presents difficulties in diagnosis for clinicians. This study aimed to develop and test a DSM-IV-based diagnostic instrument for kleptomania. To assess for current kleptomania the Structured Clinical Interview for Kleptomania (SCI-K) was administered to 112 consecutive subjects requesting psychiatric outpatient treatment for a variety of disorders. Reliability and validity were determined. Classification accuracy was examined using the longitudinal course of illness. The SCI-K demonstrated excellent test-retest (Phi coefficient = 0.956 (95% CI = 0.937, 0.970)) and inter-rater reliability (phi coefficient = 0.718 (95% CI = 0.506, 0.848)) in the diagnosis of kleptomania. Concurrent validity was observed with a self-report measure using DSM-IV kleptomania criteria (phi coefficient = 0.769 (95% CI = 0.653, 0.850)). Discriminant validity was observed with a measure of depression (point biserial coefficient = -0.020 (95% CI = -0.205, 0.166)). The SCI-K demonstrated both high sensitivity and specificity based on longitudinal assessment. The SCI-K demonstrated excellent reliability and validity in diagnosing kleptomania in subjects presenting with various psychiatric problems. These findings require replication in larger groups, including non-psychiatric populations, to examine their generalizability. Copyright (c) 2006 John Wiley & Sons, Ltd.
[Validity of AUDIT test for detection of disorders related with alcohol consumption in women].

Science.gov (United States)

Pérula-de Torres, Luis Angel; Fernández-García, José Angel; Arias-Vega, Raquel; Muriel-Palomino, María; Márquez-Rebollo, Encarnación; Ruiz-Moral, Roger

2005-11-26

Early detection of patients with alcohol problems is important in clinical practice. The AUDIT (Alcohol Use Disorders Identification Test) questionnaire is a valid tool for this purpose, especially in the male population. The objective of this study was to validate the usefulness of this questionnaire in female patients and to determine the optimal test cut-off point for the diagnosis of alcohol problems in women. A total of 414 women were recruited from 2 health centers and a specialized center for addiction treatment. The AUDIT test and a semistructured interview (SCAN, as the gold standard) were administered to all patients. Internal consistency and criterion validity were assessed. Cronbach's alpha was 0.93 (95% confidence interval [CI], 0.921-0.941). When the DSM-IV was taken as the reference, the most useful cut-off point was 6 points, with 89.6% (95% CI, 76.11-96.02) sensitivity and 95.07% (95% CI, 92.18-96.97) specificity. When the ICD-10 was taken as the reference, the sensitivity was 89.58% (95% CI, 76.56-96.10) and the specificity was 95.33% (95% CI, 92.48-97.17). The AUDIT is a questionnaire with good psychometric properties and is valid for detecting dependence and at-risk alcohol consumption in women.
Development and validation of a new cognitive screening test: The Hong Kong Brief Cognitive Test (HKBC).

Science.gov (United States)

Chiu, Helen F K; Zhong, Bao-Liang; Leung, Tony; Li, S W; Chow, Paulina; Tsoh, Joshua; Yan, Connie; Xiang, Yu-Tao; Wong, Mike

2018-07-01

To develop and examine the validity of a new brief cognitive test with less educational bias for screening cognitive impairment. A new cognitive test, Hong Kong Brief Cognitive Test (HKBC), was developed based on review of the literature, as well as the views of an expert panel. Three groups of subjects aged 65 or above were recruited after written consent: normal older people recruited in elderly centres, people with mild NCD (neurocognitive disorder), and people with major NCD. The brief cognitive test, Mini-Mental State Examination (MMSE) and Montreal Cognitive Assessment Scale (MoCA), were administered to the subjects. The performance of HKBC in differentiating subjects with major NCD, mild NCD, and normal older people was compared with the clinical diagnosis, as well as the MMSE and MoCA scores. In total, 359 subjects were recruited, with 99 normal controls, 132 subjects with major NCD, and 128 with mild NCD. The mean MMSE, MoCA, and HKBC scores showed significant differences among the 3 groups of subjects. In the receiver operating characteristic curve analysis of the HKBC in differentiating normal subjects from those with cognitive impairment (mild NCD + major NCD), the area under the curve was 0.955 with an optimal cut-off score of 21/22. The performances of MMSE and MoCA in differentiating normal from cognitively impaired subjects are slightly inferior to the HKBC. The HKBC is a brief instrument useful for screening cognitive impairment in older adults and is also useful in populations with low educational level. Copyright © 2018 John Wiley & Sons, Ltd.
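The area under the ROC curve reported above has a simple pairwise interpretation, which the sketch below illustrates. It assumes that higher scores indicate better cognition, and the scores are invented for illustration and are not study data.

```python
def roc_auc(normal_scores, impaired_scores):
    """Probability that a randomly chosen normal subject outscores a randomly
    chosen impaired subject; ties count as half (Mann-Whitney formulation)."""
    pairs = len(normal_scores) * len(impaired_scores)
    wins = sum(
        1.0 if n > i else 0.5 if n == i else 0.0
        for n in normal_scores
        for i in impaired_scores
    )
    return wins / pairs


# Hypothetical screening scores, for illustration only.
normal = [27, 25, 24, 23, 22]
impaired = [21, 19, 22, 15, 12]
example_auc = roc_auc(normal, impaired)
print(f"AUC = {example_auc:.2f}")
```

An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of the two groups.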
The modified Thomas test is not a valid measure of hip extension unless pelvic tilt is controlled

Directory of Open Access Journals (Sweden)

Andrew D. Vigotsky

2016-08-01

Full Text Available The modified Thomas test was developed to assess the presence of hip flexion contracture and to measure hip extensibility. Despite its widespread use, to the authors’ knowledge, its criterion reference validity has not yet been investigated. The purpose of this study was to assess the criterion reference validity of the modified Thomas test for measuring peak hip extension angle and hip extension deficits, as defined by the hip not being able to extend to 0°, or neutral. Twenty-nine healthy college students (age = 22.00 ± 3.80 years; height = 1.71 ± 0.09 m; body mass = 70.00 ± 15.60 kg) were recruited for this study. Bland–Altman plots revealed poor validity for the modified Thomas test’s ability to measure hip extension, which could not be explained by differences in hip flexion ability alone. The modified Thomas test displayed a sensitivity of 31.82% (95% CI [13.86–54.87]) and a specificity of 57.14% (95% CI [18.41–90.10]) for testing hip extension deficits. It appears, however, that by controlling pelvic tilt, much of this variance can be accounted for (r = 0.98). When pelvic tilt is not controlled, the modified Thomas test displays poor criterion reference validity and, as per previous studies, poor reliability. However, when pelvic tilt is controlled, the modified Thomas test appears to be a valid test for evaluating peak hip extension angle.
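The sensitivity and specificity above can be reproduced with a short calculation. The sketch below rests on two assumptions that the abstract does not state: that the underlying counts are 7 of 22 and 4 of 7 (inferred from the percentages and the sample of 29), and that the intervals are exact Clopper-Pearson intervals. All function names are illustrative.

```python
from math import comb


def binom_cdf(k, n, p):
    """P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k + 1))


def _bisect(f, target, increasing, tol=1e-10):
    """Find p in [0, 1] with f(p) == target for a monotone function f."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if (f(mid) < target) == increasing:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def clopper_pearson(x, n, alpha=0.05):
    """Exact two-sided confidence interval for a binomial proportion x/n."""
    lower = 0.0 if x == 0 else _bisect(
        lambda p: 1 - binom_cdf(x - 1, n, p), alpha / 2, increasing=True)
    upper = 1.0 if x == n else _bisect(
        lambda p: binom_cdf(x, n, p), alpha / 2, increasing=False)
    return lower, upper


# Inferred counts (an assumption): 7/22 true positives, 4/7 true negatives.
sensitivity, sens_ci = 7 / 22, clopper_pearson(7, 22)
specificity, spec_ci = 4 / 7, clopper_pearson(4, 7)
print(f"sensitivity {sensitivity:.2%}, CI {sens_ci[0]:.2%} to {sens_ci[1]:.2%}")
print(f"specificity {specificity:.2%}, CI {spec_ci[0]:.2%} to {spec_ci[1]:.2%}")
```

With only 7 subjects in the specificity denominator the interval is very wide, which is why the point estimate alone says little about the test's ability to rule out a deficit.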

Real time risk analysis of kick detection: Testing and validation

International Nuclear Information System (INIS)

Islam, Rakibul; Khan, Faisal; Venkatesan, Ramchandran

2017-01-01

Oil and gas development is moving into harsh and remote locations where the highest level of safety is required. A blowout is one of the most feared accidents in oil and gas development projects. The main objective of this paper is to test and validate the kick detection component of a blowout risk assessment model using uniquely developed experimental results. Kick detection is a major part of the blowout risk assessment model. The accuracy and timeliness of kick detection depend on the monitoring of multiple downhole parameters such as downhole pressure, fluid density, fluid conductivity and mass flow rate. In the present study these four parameters are considered in different logical combinations to assess the occurrence of a kick and the associated blowout risk. The assessed results are compared against the experimental observations. It is observed that simultaneous monitoring of mass flow rate combined with any one of the three other parameters provides the most reliable detection of a kick and of potential blowout likelihood. The current work presents the framework for a dynamic risk assessment and management model. Upon successful testing of this approach at the pilot and field levels, this approach could provide a paradigm shift in drilling safety. - Highlights: • A novel dynamic risk model of kick detection and blowout prediction. • Testing and Validation of the risk model. • Application of the dynamic risk model.
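The logical combination singled out in the abstract (mass flow rate together with any one of the other three parameters) can be written as a simple Boolean rule. This is only an illustrative sketch: the class and function names are hypothetical, and how each parameter is judged anomalous is not described in the abstract.

```python
from dataclasses import dataclass


@dataclass
class DownholeFlags:
    """True means the monitored parameter currently shows an anomaly.
    How an anomaly is decided (thresholds, trends) is left open here."""
    mass_flow_rate: bool
    pressure: bool
    density: bool
    conductivity: bool


def kick_indicated(flags):
    """Mass flow rate anomaly confirmed by at least one other parameter."""
    confirmed = flags.pressure or flags.density or flags.conductivity
    return flags.mass_flow_rate and confirmed


print(kick_indicated(DownholeFlags(True, False, True, False)))
print(kick_indicated(DownholeFlags(False, True, True, True)))
```

Requiring a second, independent indication is a common way to reduce false alarms from a single noisy sensor; whether the paper's model uses this rule for decisions or only reports it as the most reliable combination is not stated.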
Validation of a fracture mechanics approach to nuclear transportation cask design through a drop test program

International Nuclear Information System (INIS)

Sorenson, K.B.

1986-01-01

Sandia National Laboratories (SNL), under contract to the Department of Energy, is conducting a research program to develop and validate a fracture mechanics approach to cask design. A series of drop tests of a transportation cask is planned for the summer of 1986 as the method for benchmarking and, thereby, validating the fracture mechanics approach. This paper presents the drop test plan and background leading to the development of the test plan including structural analyses, material characterization, and non-destructive evaluation (NDE) techniques necessary for defining the test plan properly
The ad-libitum alcohol 'taste test': secondary analyses of potential confounds and construct validity

OpenAIRE

Jones, Andrew; Button, Emily; Rose, Abigail K.; Robinson, Eric; Christiansen, Paul; Di Lemma, Lisa; Field, Matt

2015-01-01

Rationale Motivation to drink alcohol can be measured in the laboratory using an ad-libitum 'taste test', in which participants rate the taste of alcoholic drinks whilst their intake is covertly monitored. Little is known about the construct validity of this paradigm. Objective The objective of this study was to investigate variables that may compromise the validity of this paradigm and its construct validity. Methods We re-analysed data from 12 studies from our laboratory that incorporated a...
The fish sexual development test: an OECD test guideline proposal with possible relevance for environmental risk assessment. Results from the validation programme

DEFF Research Database (Denmark)

Holbech, Henrik; Brande-Lavridsen, Nanna; Kinnberg, Karin Lund

2010-01-01

The Fish Sexual Development Test (FSDT) has gone through two validations as an OECD test guideline for the detection of endocrine active chemicals with different modes of action. The validation has been finalized on four species: Zebrafish (Danio rerio), Japanese medaka (Oryzias latipes), three s...... as a population relevant endpoint and the results of the two validation rounds will be discussed in relation to environmental risk assessment and species selection....... for histology. For all three methods, the fish parts were numbered and histology could therefore be linked to the vitellogenin concentration in individual fish. The two core endocrine relevant endpoints were vitellogenin concentrations and phenotypic sex ratio. Change in the sex ratio is presented...
Validation of a wind tunnel testing facility for blade surface pressure measurements

Energy Technology Data Exchange (ETDEWEB)

Fuglsang, P.; Antoniou, I.; Soerensen, N.N.; Madsen, H.A.

1998-04-01

This report concerns development and validation of a 2D testing facility for airfoil pressure measurements. The VELUX open jet wind tunnel was used with a test stand inserted. Reynolds numbers up to 1.3 million were achieved with an airfoil chord of 0.45 m. The aerodynamic load coefficients were found from pressure distribution measurements and the total drag coefficient was calculated from wake rake measurements. Stationary inflow as well as dynamic inflow through pitching motion was possible. Wind tunnel corrections were applied for streamline curvature and down-wash. Even though the wind tunnel is not ideal for 2D testing, the overall quality of the flow was acceptable with a uniform flow field at the test stand position and a turbulence intensity of 1 % at the inlet of the test section. Reference values for free stream static and total pressure were found upstream of the test stand. The NACA 63-215 airfoil was tested and the results were compared with measurements from FFA and NACA. The measurements agreed well except for lift coefficient values at high angles of attack and the drag coefficient values at low angles of attack, which were slightly high. Comparisons of the measured results with numerical predictions from the XFOIL code and the EllipSys2D code showed good agreement. Measurements with the airfoil in pitching motion were carried out to study the dynamic aerodynamic coefficients. Steady inflow measurements at high angles of attack were used to investigate the double stall phenomenon. (au) EFP-94; EFP-95; EFP-97. 8 tabs., 82 ills., 16 refs.
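The Reynolds number and chord quoted above imply a free-stream speed that can be estimated from the definition Re = U c / ν. The sketch below assumes a kinematic viscosity of air of about 1.5e-5 m²/s (roughly room temperature at sea level); the report itself does not give this value here, and the function name is illustrative.

```python
def freestream_speed(reynolds, chord_m, kinematic_viscosity=1.5e-5):
    """Solve Re = U * c / nu for the free-stream speed U in m/s.
    The default viscosity is an assumed value for air near 20 degrees C."""
    return reynolds * kinematic_viscosity / chord_m


speed = freestream_speed(1.3e6, 0.45)
print(f"Implied free-stream speed: {speed:.1f} m/s")
```

Under that assumption the tunnel would need a flow speed in the low 40s of metres per second to reach the stated Reynolds number with a 0.45 m chord.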
The validity of the circumduction test in elderly men and women

NARCIS (Netherlands)

Lemmink, KAPM; Kemper, HCG; de Greef, MHG; Rispens, P; Stevens, M

2003-01-01

This article focuses on the validity of the circumduction test for measuring shoulder flexibility in older adults. Participants included 137 community-dwelling older adults. Equipment consisted of a cord with a fixed handle on one end and a sliding handle on the other. The sliding handle was
Numerical Simulation and Experimental Validation of the Inflation Test of Latex Balloons

Directory of Open Access Journals (Sweden)

Claudio Bustos

Full Text Available Abstract Experiments and modeling aimed at assessing the mechanical response of latex balloons in the inflation test are presented. To this end, the hyperelastic Yeoh material model is first characterized via a tensile test and then used to simulate numerically, via finite elements, the stress-strain evolution during the inflation test. The numerical pressure-displacement curves are validated with those obtained experimentally. Moreover, this analysis is extended to a biomedical problem of an eyeball under glaucoma conditions.
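To make the tensile characterization step concrete, the sketch below evaluates the standard three-term Yeoh strain energy, W = C10 (I1 - 3) + C20 (I1 - 3)^2 + C30 (I1 - 3)^3, for an incompressible material in uniaxial tension. The coefficient values are placeholders and not the ones fitted in the paper, and the function name is illustrative.

```python
def yeoh_uniaxial_cauchy_stress(stretch, c10, c20, c30):
    """Cauchy stress in uniaxial tension for an incompressible Yeoh material.

    With I1 = stretch^2 + 2/stretch, the stress is
    2 * (stretch^2 - 1/stretch) * dW/dI1, since W depends on I1 only.
    """
    i1 = stretch**2 + 2.0 / stretch
    dw_di1 = c10 + 2.0 * c20 * (i1 - 3.0) + 3.0 * c30 * (i1 - 3.0) ** 2
    return 2.0 * (stretch**2 - 1.0 / stretch) * dw_di1


# Placeholder coefficients in MPa, chosen only to show the curve shape.
C10, C20, C30 = 0.30, -0.01, 0.001
for lam in (1.0, 1.5, 2.0, 3.0):
    sigma = yeoh_uniaxial_cauchy_stress(lam, C10, C20, C30)
    print(f"stretch {lam:.1f}: stress {sigma:.3f} MPa")
```

Fitting C10, C20 and C30 to a measured tensile curve and then reusing them in a finite element model of the inflation test is the workflow the abstract describes; the fitting itself is not shown here.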
SASSYS validation with the EBR-II shutdown heat removal tests

International Nuclear Information System (INIS)

Herzog, J.P.

1989-01-01

SASSYS is a coupled neutronic and thermal hydraulic code developed for the analysis of transients in liquid metal cooled reactors (LMRs). The code is especially suited for evaluating off-normal reactor transients -- protected (design basis) and unprotected (anticipated transient without scram) transients. Because SASSYS is heavily used in support of the IFR concept and of innovative LMR designs, such as PRISM, a strong validation base for the code must exist. Part of the validation process for SASSYS is analysis of experiments performed on operating reactors, such as the metal fueled Experimental Breeder Reactor -- II (EBR-II). During the course of a series of historic whole-plant experiments, EBR-II illustrated key safety features of metal fueled LMRs. These experiments, the Shutdown Heat Removal Tests (SHRT), culminated in unprotected loss of flow and loss of heat sink transients from full power and flow. Analysis of these and earlier SHRT experiments constitutes a vital part of SASSYS validation, because it facilitates scrutiny of specific SASSYS models and of integrated code capability. 12 refs., 11 figs
Testing Math or Testing Language? The Construct Validity of the KeyMath-Revised for Children With Intellectual Disability and Language Difficulties.

Science.gov (United States)

Rhodes, Katherine T; Branum-Martin, Lee; Morris, Robin D; Romski, MaryAnn; Sevcik, Rose A

2015-11-01

Although it is often assumed that mathematics ability alone predicts mathematics test performance, linguistic demands may also predict achievement. This study examined the role of language in mathematics assessment performance for children with intellectual disability (ID) at less severe levels, on the KeyMath-Revised Inventory (KM-R) with a sample of 264 children, in grades 2-5. Using confirmatory factor analysis, the hypothesis that the KM-R would demonstrate discriminant validity with measures of language abilities in a two-factor model was compared to two plausible alternative models. Results indicated that KM-R did not have discriminant validity with measures of children's language abilities and was a multidimensional test of both mathematics and language abilities for this population of test users. Implications are considered for test development, interpretation, and intervention.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

Science.gov (United States)

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
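The relationship between MIP, ID and SMIP described above can be illustrated with a numerical integration of a pressure-time trace. This is a minimal sketch using the trapezoidal rule on invented sample values; it is not the PrO2 device's algorithm, which the abstract does not describe, and the function name is illustrative.

```python
def pressure_time_integral(times_s, pressures):
    """Area under a sampled pressure-time curve (trapezoidal rule)."""
    if len(times_s) != len(pressures):
        raise ValueError("times and pressures must have the same length")
    area = 0.0
    for k in range(1, len(times_s)):
        dt = times_s[k] - times_s[k - 1]
        area += 0.5 * (pressures[k] + pressures[k - 1]) * dt
    return area


# Hypothetical single inspiratory effort, for illustration only.
times = [0.0, 1.0, 2.0, 3.0, 4.0]          # seconds
pressures = [0.0, 80.0, 60.0, 30.0, 0.0]   # inspiratory pressure samples

mip = max(pressures)                        # peak pressure
inspiratory_duration = times[-1] - times[0]
smip = pressure_time_integral(times, pressures)
print(mip, inspiratory_duration, smip)
```

Because SMIP combines the height and the duration of the effort, two subjects with the same MIP can differ substantially in SMIP, which is consistent with the larger effect sizes reported for SMIP and ID.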
Decentral gene expression analysis: analytical validation of the Endopredict genomic multianalyte breast cancer prognosis test

Directory of Open Access Journals (Sweden)

Kronenwett Ralf

2012-10-01

Full Text Available Abstract Background EndoPredict (EP) is a clinically validated multianalyte gene expression test to predict distant metastasis in ER-positive, HER2-negative breast cancer treated with endocrine therapy alone. The test is based on the combined analysis of 12 genes in formalin-fixed, paraffin-embedded (FFPE) tissue by reverse transcription-quantitative real-time PCR (RT-qPCR). Recently, it was shown that EP is feasible for reliable decentralized assessment of gene expression. The aim of this study was the analytical validation of the performance characteristics of the assay and its verification in a molecular-pathological routine laboratory. Methods Gene expression values to calculate the EP score were assayed by one-step RT-qPCR using RNA from FFPE tumor tissue. Limit of blank, limit of detection, linear range, and PCR efficiency were assessed for each of the 12 PCR assays using serial sample dilutions. Different breast cancer samples were used to evaluate RNA input range, precision and inter-laboratory variability. Results PCR assays were linear up to Cq values between 35.1 and 37.2. Amplification efficiencies ranged from 75% to 101%. The RNA input range without considerable change of the EP score was between 0.16 and 18.5 ng/μl. Analysis of precision (variation of day, day time, instrument, operator, reagent lots) resulted in a total noise (standard deviation) of 0.16 EP score units on a scale from 0 to 15. The major part of the total noise (SD 0.14) was caused by the replicate-to-replicate noise of the PCR assays (repeatability) and was not associated with different operating conditions (reproducibility). Performance characteristics established in the manufacturer’s laboratory were verified in a routine molecular pathology laboratory. Comparison of 10 tumor samples analyzed in two different laboratories showed a Pearson coefficient of 0.995 and a mean deviation of 0.15 score units. Conclusions The EP test showed reproducible performance
Decentral gene expression analysis: analytical validation of the Endopredict genomic multianalyte breast cancer prognosis test

International Nuclear Information System (INIS)

Kronenwett, Ralf; Brase, Jan C; Weber, Karsten E; Fisch, Karin; Müller, Berit M; Schmidt, Marcus; Filipits, Martin; Dubsky, Peter; Petry, Christoph; Dietel, Manfred; Denkert, Carsten; Bohmann, Kerstin; Prinzler, Judith; Sinn, Bruno V; Haufe, Franziska; Roth, Claudia; Averdick, Manuela; Ropers, Tanja; Windbergs, Claudia

2012-01-01

EndoPredict (EP) is a clinically validated multianalyte gene expression test to predict distant metastasis in ER-positive, HER2-negative breast cancer treated with endocrine therapy alone. The test is based on the combined analysis of 12 genes in formalin-fixed, paraffin-embedded (FFPE) tissue by reverse transcription-quantitative real-time PCR (RT-qPCR). Recently, it was shown that EP is feasible for reliable decentralized assessment of gene expression. The aim of this study was the analytical validation of the performance characteristics of the assay and its verification in a molecular-pathological routine laboratory. Gene expression values to calculate the EP score were assayed by one-step RT-qPCR using RNA from FFPE tumor tissue. Limit of blank, limit of detection, linear range, and PCR efficiency were assessed for each of the 12 PCR assays using serial sample dilutions. Different breast cancer samples were used to evaluate RNA input range, precision and inter-laboratory variability. PCR assays were linear up to Cq values between 35.1 and 37.2. Amplification efficiencies ranged from 75% to 101%. The RNA input range without considerable change of the EP score was between 0.16 and 18.5 ng/μl. Analysis of precision (variation of day, day time, instrument, operator, reagent lots) resulted in a total noise (standard deviation) of 0.16 EP score units on a scale from 0 to 15. The major part of the total noise (SD 0.14) was caused by the replicate-to-replicate noise of the PCR assays (repeatability) and was not associated with different operating conditions (reproducibility). Performance characteristics established in the manufacturer’s laboratory were verified in a routine molecular pathology laboratory. Comparison of 10 tumor samples analyzed in two different laboratories showed a Pearson coefficient of 0.995 and a mean deviation of 0.15 score units. The EP test showed reproducible performance characteristics with good precision and negligible laboratory
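Amplification efficiency from a serial dilution is conventionally derived from the slope of Cq against log10 of input amount, using E = 10^(-1/slope) - 1. The sketch below applies that standard-curve formula to invented Cq values; the abstract does not say exactly how the authors computed their efficiencies, and the function names are illustrative.

```python
def least_squares_slope(x, y):
    """Slope of the ordinary least-squares line through the points (x, y)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    return sxy / sxx


def amplification_efficiency(log10_input, cq):
    """Standard-curve efficiency: 1.0 means perfect doubling per cycle."""
    slope = least_squares_slope(log10_input, cq)
    return 10 ** (-1.0 / slope) - 1.0


# Hypothetical 10-fold dilution series, for illustration only.
log10_input = [0.0, -1.0, -2.0, -3.0]
cq_values = [20.0, 23.5, 27.0, 30.5]
efficiency = amplification_efficiency(log10_input, cq_values)
print(f"Efficiency: {efficiency:.1%}")
```

A slope of about -3.32 cycles per decade corresponds to 100% efficiency; steeper slopes, as in this example, give lower values.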
Design and Validation of a Straight-Copy Typewriting Prognostic Test Using Kinesthetic Sensitivity.

Science.gov (United States)

Olson, Norma Jean

1979-01-01

Describes the development and application of a kinesthetic sensitivity test to determine whether it is a valid and reliable measure of straight-copy typing speed and accuracy. The author states that this kinesthetic sensitivity instrument may be used as a prognostic aptitude test and recommends administration methods. (MF)
Automation Hooks Architecture for Flexible Test Orchestration - Concept Development and Validation

Science.gov (United States)

Lansdowne, C. A.; Maclean, John R.; Winton, Chris; McCartney, Pat

2011-01-01

The Automation Hooks Architecture Trade Study for Flexible Test Orchestration sought a standardized data-driven alternative to conventional automated test programming interfaces. The study recommended composing the interface using multicast DNS (mDNS/SD) service discovery, Representational State Transfer (RESTful) Web Services, and Automatic Test Markup Language (ATML). We describe additional efforts to rapidly mature the Automation Hooks Architecture candidate interface definition by validating it in a broad spectrum of applications. These activities have allowed us to further refine our concepts and provide observations directed toward objectives of economy, scalability, versatility, performance, severability, maintainability, scriptability and others.
Enhancing rigour in the validation of patient reported outcome measures (PROMs): bridging linguistic and psychometric testing

Directory of Open Access Journals (Sweden)

Roberts Gwerfyl

2012-06-01

Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs) and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II) and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115) and patients with depression (n = 37) concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96); item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94) and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002); and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.
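The internal consistency figure quoted above (Cronbach's alpha) can be illustrated with a minimal sketch. It assumes the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the tiny data set are hypothetical and are not taken from the study.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Assumes every respondent answered the same k >= 2 items and that the
    total scores are not all identical (otherwise the variance is zero).
    """
    k = len(scores[0])
    # Variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Variance of each respondent's summed score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical data: three respondents, three perfectly consistent items.
example = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
alpha = cronbach_alpha(example)
```

With perfectly consistent items the coefficient reaches its maximum of 1; real questionnaires such as the one described here fall below that.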
Development and validation of the Approach-Iron Skill Test for use in golf.

Science.gov (United States)

Robertson, Samuel John; Burnett, Angus F; Newton, Robert U

2013-01-01

The primary aim of this study was to develop and validate a golf-specific approach-iron test for use with elite and high-level amateur golfers. Elite (n=26) and high-level amateur (n=23) golfers were recruited for this study. The 'Approach-Iron Skill Test' requires players to hit a total of 27 shots. Specifically, three shots are hit at each of nine targets on a specially constructed driving range in a randomised order. A real-time launch monitor positioned behind the player measured the carry distance for each of these shots. A scoring system was developed based on the percentage error index of each shot, meaning that 81 points was the maximum score possible (with a maximum of three points per shot). Two rounds of the test were performed. For both rounds of the test, elite-level golfers scored significantly higher than their high-level amateur counterparts (56.3 ± 5.6 and 58.5 ± 4.6 points versus 46.0 ± 6.3 and 46.1 ± 6.7 points, respectively) (P<0.05). For both elite and high-level players, 95% limits of agreement statistics also indicated that the test showed good test-retest reliability (2.1 ± 7.9 and 0.2 ± 10.8, respectively). Due to the clinimetric properties of the test, we conclude that the Approach-Iron Skill Test is suitable for further examination with the players examined in this study.
Using Frankencerts for Automated Adversarial Testing of Certificate Validation in SSL/TLS Implementations.

Science.gov (United States)

Brubaker, Chad; Jana, Suman; Ray, Baishakhi; Khurshid, Sarfraz; Shmatikov, Vitaly

2014-01-01

Modern network security rests on the Secure Sockets Layer (SSL) and Transport Layer Security (TLS) protocols. Distributed systems, mobile and desktop applications, embedded devices, and all of secure Web rely on SSL/TLS for protection against network attacks. This protection critically depends on whether SSL/TLS clients correctly validate X.509 certificates presented by servers during the SSL/TLS handshake protocol. We design, implement, and apply the first methodology for large-scale testing of certificate validation logic in SSL/TLS implementations. Our first ingredient is "frankencerts," synthetic certificates that are randomly mutated from parts of real certificates and thus include unusual combinations of extensions and constraints. Our second ingredient is differential testing: if one SSL/TLS implementation accepts a certificate while another rejects the same certificate, we use the discrepancy as an oracle for finding flaws in individual implementations. Differential testing with frankencerts uncovered 208 discrepancies between popular SSL/TLS implementations such as OpenSSL, NSS, CyaSSL, GnuTLS, PolarSSL, MatrixSSL, etc. Many of them are caused by serious security vulnerabilities. For example, any server with a valid X.509 version 1 certificate can act as a rogue certificate authority and issue fake certificates for any domain, enabling man-in-the-middle attacks against MatrixSSL and GnuTLS. Several implementations also accept certificate authorities created by unauthorized issuers, as well as certificates not intended for server authentication. We also found serious vulnerabilities in how users are warned about certificate validation errors. When presented with an expired, self-signed certificate, NSS, Safari, and Chrome (on Linux) report that the certificate has expired-a low-risk, often ignored error-but not that the connection is insecure against a man-in-the-middle attack. These results demonstrate that automated adversarial testing with frankencerts
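The differential-testing idea in this abstract can be sketched in a few lines. This is a toy illustration only: certificates are plain dictionaries, the two validators are hypothetical stand-ins rather than real SSL/TLS libraries, and the synthetic inputs are enumerated exhaustively instead of being randomly mutated from real certificates as the authors describe.

```python
from itertools import product

# Hypothetical certificate fields and the values each may take.
FIELDS = {"version": [1, 3], "expired": [False, True], "is_ca": [False, True]}

def synthetic_certs():
    """Yield every combination of field values as a toy 'certificate'."""
    keys = list(FIELDS)
    for values in product(*(FIELDS[key] for key in keys)):
        yield dict(zip(keys, values))

def strict_accepts_issuer(cert):
    # Accepts an issuer only if it is an unexpired version 3 CA certificate.
    return cert["version"] == 3 and cert["is_ca"] and not cert["expired"]

def lax_accepts_issuer(cert):
    # Deliberately flawed: treats any unexpired version 1 certificate as a CA.
    return (cert["version"] == 1 or cert["is_ca"]) and not cert["expired"]

def differential_test(certs, validators):
    """Return the inputs on which the validators disagree, with verdicts."""
    discrepancies = []
    for cert in certs:
        verdicts = {name: check(cert) for name, check in validators.items()}
        if len(set(verdicts.values())) > 1:
            discrepancies.append((cert, verdicts))
    return discrepancies

found = differential_test(
    synthetic_certs(),
    {"strict": strict_accepts_issuer, "lax": lax_accepts_issuer},
)
```

Each disagreement serves as the oracle: no specification of correct behaviour is needed, only the observation that two implementations give different verdicts on the same input.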
Development of blow down and sodium-water reaction jet analysis codes-Validation by sodium-water reaction tests (SWAT-1R)

International Nuclear Information System (INIS)

Hiroshi Seino; Akikazu Kurihara; Isao Ono; Koji Jitsu

2005-01-01

Blow down analysis code (LEAP-BLOW) and sodium-water reaction jet analysis code (LEAP-JET) have been developed in order to improve the evaluation method on sodium-water reaction event in the steam generator (SG) of a sodium cooled fast breeder reactor (FBR). The validation analyses by these two codes were carried out using the data of Sodium-Water Reaction Test (SWAT-1R). The following main results have been obtained through this validation: (1) The calculational results by LEAP-BLOW such as internal pressure and water flow rate show good agreement with the results of the SWAT- 1R test. (2) The LEAP-JET code can qualitatively simulate the behavior of sodium-water reaction. However, it is found that the code has tendency to overestimate the maximum temperature of the reaction jet. (authors)
Validity and Reliability of the Clock Drawing Test in Older People

Directory of Open Access Journals (Sweden)

Massoumeh Sadeghipour Roodsari

2013-07-01

Full Text Available Objectives: Early diagnosis of cognitive disorders in order to initiate new efficient treatments in time is an important task which cannot be fulfilled without proper cognitive screening tools. The Clock Drawing Test (CDT) is a simple inexpensive cognitive screening tool which can be used in primary care settings delivering health services to older people. The aim of this study was to assess validity and reliability of the CDT in Iranian older population. Methods & Materials: In this study the CDT and Mini Mental State Examination (MMSE) were concurrently performed on 74 literate participants aged 60 and over. Participants were recruited from the clients of Iran Alzheimer’s Association (dementia patients and non-demented clients, including other patients or care givers) during a 5 month period. The CDT was performed by two trained raters using Shulman’s six-point scoring method. Using SPSS version 20, reliability was assessed measuring kappa statistics as well as ICC. Concurrent validity between CDT and MMSE was statistically analyzed by Spearman’s rank correlation coefficient. Results: Mean age of the participants was 72 years in a range of 60 to 90 years with equal numbers of male and female participants. Kappa statistics for test retest reliability was 0.554 (P<0.001). ICC for inter rater reliability was 0.964 (P<0.001). Spearman’s rank correlation coefficient for MMSE and CDT scores was 0.782, statistically significant at P<0.001. Conclusion: CDT is a valid and reliable test in literate older people that can be used as a cognitive screening tool in Iranian older population.
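A kappa statistic of the kind reported above can be computed as follows. This is a minimal sketch of unweighted Cohen's kappa for two sets of categorical ratings; the abstract does not say whether a weighted variant was used, and the function name and ratings below are hypothetical.

```python
from collections import Counter

def cohen_kappa(first, second):
    """Unweighted Cohen's kappa for two equal-length lists of category labels.

    Undefined (division by zero) when chance agreement equals 1, i.e. when
    both lists contain a single identical category throughout.
    """
    n = len(first)
    # Proportion of cases where the two ratings agree.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected by chance from the marginal category frequencies.
    counts_first, counts_second = Counter(first), Counter(second)
    categories = set(first) | set(second)
    expected = sum(counts_first[c] * counts_second[c] for c in categories) / n**2
    return (observed - expected) / (1 - expected)

# Hypothetical scores from a test session and a retest session.
session_one = [1, 1, 2, 2, 3, 3]
session_two = [1, 1, 2, 2, 3, 3]
kappa = cohen_kappa(session_one, session_two)
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it can be much lower than the percentage of identical scores.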
How'd they do it? Malingering strategies on symptom validity tests.

Science.gov (United States)

Tan, Jing Ee; Slick, Daniel J; Strauss, Esther; Hultsch, David F

2002-12-01

Twenty-five undergraduate students were instructed to feign believable impairment following a brain injury from a car accident and 27 students were told to perform like they had recovered from such an injury. Three forced-choice tests, the Test of Memory Malingering (TOMM), Victoria Symptom Validity Test (VSVT), and Word Memory Test (WMT) were given. Test-taking strategies were evaluated by means of a questionnaire given at the end of the test session. The results revealed that all the tasks differentiated between groups. Using conventional cut-scores, the WMT proved most efficient while the VSVT captured the most participants in the definitive below-chance category. Individuals instructed to feign injury were more likely to prepare prior to the experiment, with feigning of memory loss as the most frequently reported strategy. Regardless, preparation effort did not translate into believable performance on the tests.

Introduction to Large-sized Test Facility for validating Containment Integrity under Severe Accidents

International Nuclear Information System (INIS)

Na, Young Su; Hong, Seongwan; Hong, Seongho; Min, Beongtae

2014-01-01

An overall assessment of containment integrity can be conducted properly by examining the hydrogen behavior in the containment building. Under severe accidents, an amount of hydrogen gas can be generated by metal oxidation and corium-concrete interaction. Hydrogen behavior in the containment building strongly depends on complicated thermal hydraulic conditions with mixed gases and steam. The performance of a PAR can be directly affected by the thermal hydraulic conditions, steam contents, gas mixture behavior and aerosol characteristics, as well as the operation of other engineering safety systems such as a spray. The models in computer codes for a severe accident assessment can be validated based on the experiment results in a large-sized test facility. The Korea Atomic Energy Research Institute (KAERI) is now preparing a large-sized test facility to examine in detail the safety issues related to hydrogen, including the performance of safety devices such as a PAR in various severe accident situations. This paper introduces the KAERI test facility for validating the containment integrity under severe accidents. To validate the containment integrity, a large-sized test facility is necessary for simulating complicated phenomena induced by an amount of steam and gases, especially hydrogen released into the containment building under severe accidents. A pressure vessel 9.5 m in height and 3.4 m in diameter was designed at the KAERI test facility for validating containment integrity; the design was based on the THAI test facility, with its experimental safety and reliable measurement systems certified over a long time. This large-sized pressure vessel, which operates with steam and with iodine as a corrosive agent, was made of stainless steel 316L for corrosion resistance over a long operating time, and the vessel was installed at KAERI in March 2014. In the future, the control systems for temperature and pressure in a vessel will be constructed, and the measurement system
Experience with Aero- and Fluid-Dynamic Testing for Engineering and CFD Validation

Science.gov (United States)

Ross, James C.

2016-01-01

Ever since computations have been used to simulate aerodynamics, the need to ensure that the computations adequately represent real life has followed. Many experiments have been performed specifically for validation, and as computational methods have improved, so have the validation experiments. Validation is also a moving target because computational methods improve, requiring validation for the new aspect of flow physics that the computations aim to capture. Concurrently, new measurement techniques are being developed that can help capture more detailed flow features; pressure sensitive paint (PSP) and particle image velocimetry (PIV) come to mind. This paper will present various wind-tunnel tests the author has been involved with and how they were used for validation of various kinds of CFD. A particular focus is the application of advanced measurement techniques to flow fields (and geometries) that had proven to be difficult to predict computationally. Many of these difficult flow problems arose from engineering and development problems that needed to be solved for a particular vehicle or research program. In some cases the experiments required to solve the engineering problems were refined to provide valuable CFD validation data in addition to the primary engineering data. All of these experiments have provided physical insight and validation data for a wide range of aerodynamic and acoustic phenomena for vehicles ranging from tractor-trailers to crewed spacecraft.
Validity and reliability of the single-trial line drill test of anaerobic power in basketball players.

Science.gov (United States)

Fatouros, I G; Laparidis, K; Kambas, A; Chatzinikolaou, A; Techlikidou, E; Katrabasas, I; Douroudos, I; Leontsini, D; Berberidou, F; Draganidis, D; Christoforidis, C; Tsoukas, D; Kelis, S; Taxildaris, K

2011-03-01

This study evaluated the validity, reliability, and sensitivity of the single-trial line drill test (SLDT) for anaerobic power assessment. Twenty-four volunteers were assigned to either a control (C, N.=12) or an experimental (BP, N.=12 basketball players) group. SLDT's (time-to-complete) concurrent validity was evaluated against the Wingate testing (WAnT: mean [MP] and peak power [PP]) and a 30-sec vertical jump testing test (VJT: mean height and MP). Blood lactate concentration was measured at rest and immediately post-test. SLDT's reliability [test-retest intraclass correlation coefficients (ICC), coefficient of variation (CV), Bland-Altman plots] and sensitivity were determined (one-way ANOVA). Kendall's tau correlation analysis revealed correlations (Pbasketball players.
Validation and sensitivity tests on improved parametrizations of a land surface process model (LSPM) in the Po Valley

International Nuclear Information System (INIS)

Cassardo, C.; Carena, E.; Longhetto, A.

1998-01-01

The Land Surface Process Model (LSPM) has been improved with respect to the first version of 1994. The modifications have involved the parametrizations of the radiation terms and of turbulent heat fluxes. A parametrization of runoff has also been developed, in order to close the hydrologic balance. This second version of LSPM has been validated against experimental data gathered at Mottarone (Verbania, Northern Italy) during a field experiment. The results of this validation show that this new version is able to apportion the energy into sensible and latent heat fluxes. LSPM has also been submitted to a series of sensitivity tests in order to investigate the hydrological part of the model. The physical quantities selected in these sensitivity experiments have been the initial soil moisture content and the rainfall intensity. In each experiment, the model has been forced by using the observations carried out at the synoptic stations of San Pietro Capofiume (Po Valley, Italy). The observed characteristics of soil and vegetation (not involved in the sensitivity tests) have been used as initial and boundary conditions. The results of the simulation show that LSPM can reproduce well the energy, heat and water budgets and their behaviour as the selected parameters vary. A careful analysis of the LSPM output also shows the importance of identifying the effective soil type
On the limits of effort testing: symptom validity tests and severity of neurocognitive symptoms in nonlitigant patients

NARCIS (Netherlands)

Merten, Thomas; Bossink, Linda; Schmand, Ben

2007-01-01

Modern symptom validity tests (SVTs) use empirical cutoffs for decision making. However, limits to the applicability of these cutoffs may arise when severe cognitive symptoms are present. The purpose of the studies presented here was to explore these limits of applicability. In Experiment 1, a group
The bogus taste test: Validity as a measure of laboratory food intake

OpenAIRE

Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A.; Kemps, Eva; Higgs, Suzanne; Jones, Andrew

2017-01-01

Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The 'bogus' taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food inta...
Bridging the Gap Between Validation and Implementation of Non-Animal Veterinary Vaccine Potency Testing Methods

Directory of Open Access Journals (Sweden)

Alistair Currie

2011-11-01

Full Text Available In recent years, technologically advanced high-throughput techniques have been developed that replace, reduce or refine animal use in vaccine quality control tests. Following validation, these tests are slowly being accepted for use by international regulatory authorities. Because regulatory acceptance itself has not guaranteed that approved humane methods are adopted by manufacturers, various organizations have sought to foster the preferential use of validated non-animal methods by interfacing with industry and regulatory authorities. After noticing this gap between regulation and uptake by industry, we began developing a paradigm that seeks to narrow the gap and quicken implementation of new replacement, refinement or reduction guidance. A systematic analysis of our experience in promoting the transparent implementation of validated non-animal vaccine potency assays has led to the refinement of our paradigmatic process, presented here, by which interested parties can assess the local regulatory acceptance of methods that reduce animal use and integrate them into quality control testing protocols, or ensure the elimination of peripheral barriers to their use, particularly for potency and other tests carried out on production batches.
Bridging the Gap Between Validation and Implementation of Non-Animal Veterinary Vaccine Potency Testing Methods.

Science.gov (United States)

Dozier, Samantha; Brown, Jeffrey; Currie, Alistair

2011-11-29

In recent years, technologically advanced high-throughput techniques have been developed that replace, reduce or refine animal use in vaccine quality control tests. Following validation, these tests are slowly being accepted for use by international regulatory authorities. Because regulatory acceptance itself has not guaranteed that approved humane methods are adopted by manufacturers, various organizations have sought to foster the preferential use of validated non-animal methods by interfacing with industry and regulatory authorities. After noticing this gap between regulation and uptake by industry, we began developing a paradigm that seeks to narrow the gap and quicken implementation of new replacement, refinement or reduction guidance. A systematic analysis of our experience in promoting the transparent implementation of validated non-animal vaccine potency assays has led to the refinement of our paradigmatic process, presented here, by which interested parties can assess the local regulatory acceptance of methods that reduce animal use and integrate them into quality control testing protocols, or ensure the elimination of peripheral barriers to their use, particularly for potency and other tests carried out on production batches.
Testing and Validation of Computational Methods for Mass Spectrometry.

Science.gov (United States)

Gatto, Laurent; Hansen, Kasper D; Hoopmann, Michael R; Hermjakob, Henning; Kohlbacher, Oliver; Beyer, Andreas

2016-03-04

High-throughput methods based on mass spectrometry (proteomics, metabolomics, lipidomics, etc.) produce a wealth of data that cannot be analyzed without computational methods. The impact of the choice of method on the overall result of a biological study is often underappreciated, but different methods can result in very different biological findings. It is thus essential to evaluate and compare the correctness and relative performance of computational methods. The volume of the data as well as the complexity of the algorithms render unbiased comparisons challenging. This paper discusses some problems and challenges in testing and validation of computational methods. We discuss the different types of data (simulated and experimental validation data) as well as different metrics to compare methods. We also introduce a new public repository for mass spectrometric reference data sets ( http://compms.org/RefData ) that contains a collection of publicly available data sets for performance evaluation for a wide range of different methods.
[Cognition-correlation indices of gender schema: tests of validity].

Science.gov (United States)

Ishida, E

1994-02-01

Four-hundred and seventy-seven subjects evaluated a set of traits and behaviors in terms of how masculine and feminine they were and in terms of how well they represented their real and ideal self-images. Within-individual correlation coefficients between these evaluations were proposed as measures of psychological gender schemata, because they would represent the degree of matching between the subjects' gender-image and ideal/real self-images of gender-related attributes. The present study aims at examining the construct validity of these measures, by testing them against psychological variables that are known to reflect gender identity. The individual difference variables used as criteria were (a) satisfaction with one's own sex, (b) general happiness, (c) self-esteem, (d) gender-conflict, and (e) school and occupational achievement need. Correlations between the gender-schema indices and the criteria variables supported the construct validity of those measures. Advantages of the present measurement over the conventional simple trait approach, such as BSRI or PAQ, are discussed.
Identification of conductive hearing loss using air conduction tests alone: reliability and validity of an automatic test battery.

Science.gov (United States)

Convery, Elizabeth; Keidser, Gitte; Seeto, Mark; Freeston, Katrina; Zhou, Dan; Dillon, Harvey

2014-01-01

The primary objective of this study was to determine whether a combination of automatically administered pure-tone audiometry and a tone-in-noise detection task, both delivered via an air conduction (AC) pathway, could reliably and validly predict the presence of a conductive component to the hearing loss. The authors hypothesized that performance on the battery of tests would vary according to hearing loss type. A secondary objective was to evaluate the reliability and validity of a novel automatic audiometry algorithm to assess its suitability for inclusion in the test battery. Participants underwent a series of hearing assessments that were conducted in a randomized order: manual pure-tone air conduction audiometry and bone conduction audiometry; automatic pure-tone air conduction audiometry; and an automatic tone-in-noise detection task. The automatic tests were each administered twice. The ability of the automatic test battery to: (a) predict the presence of an air-bone gap (ABG); and (b) accurately measure AC hearing thresholds was assessed against the results of manual audiometry. Test-retest conditions were compared to determine the reliability of each component of the automatic test battery. Data were collected on 120 ears from normal-hearing and conductive, sensorineural, and mixed hearing-loss subgroups. Performance differences between different types of hearing loss were observed. Ears with a conductive component (conductive and mixed ears) tended to have normal signal to noise ratios (SNR) despite impaired thresholds in quiet, while ears without a conductive component (normal and sensorineural ears) demonstrated, on average, an increasing relationship between their thresholds in quiet and their achieved SNR. Using the relationship between these two measures among ears with no conductive component as a benchmark, the likelihood that an ear has a conductive component can be estimated based on the deviation from this benchmark. The sensitivity and
Validation of a Criterion Referenced Test for Young Handicapped Children: PIPER.

Science.gov (United States)

Strum, Irene; Shapiro, Madelaine

The purpose of this study was to validate the Prescriptive Instructional Program for Educational Readiness (PIPER) for utilization as a criterion referenced test (CRT) among learning disabled children. The program consisted of behavioral objectives and diagnostic and/or mastery tasks and activities for each objective in the area of gross motor…
Development of a Saudi Food Frequency Questionnaire and testing its reliability and validity.

Science.gov (United States)

Gosadi, Ibrahim M; Alatar, Abdullah A; Otayf, Mojahed M; AlJahani, Dhaherah M; Ghabbani, Hisham M; AlRajban, Waleed A; Alrsheed, Abdullah M; Al-Nasser, Khalid A

2017-06-01

To create a food frequency questionnaire specifically designed to capture the dietary habits of Saudis and test its validity and reliability. Methods: This investigation is a longitudinal, test-retest study conducted in King Saud University, Riyadh, Kingdom of Saudi Arabia between December 2015 and March 2016. A list of 140 food items was included in the questionnaire where a closed-ended and open-ended approach was used. Regarding past year food frequency consumption and 24 hours dietary recall, body weight and height were collected. Internal consistency, test-retest reliability, completeness of the food list, and criterion validity were assessed. Results: One-hundred and thirty eight participants were interviewed to complete the 24 hours dietary recall and the constructed questionnaire. Approximately 85% of the food items reported in the dietary recall were covered in the food frequency questionnaire. The association of body mass index with meats (regression coefficient: 2.28) and dairy products consumption frequency was statistically significant (regression coefficient: 2.31). A high overall reproducibility rate of the questionnaire was detected (Pearson's correlation coefficient: 0.78, p < 0.001). Conclusion: The developed questionnaire has a high reliability and reasonable validity, and is suitable for use in nutritional epidemiological investigations in Saudi Arabia.
Further examination of embedded performance validity indicators for the Conners' Continuous Performance Test and Brief Test of Attention in a large outpatient clinical sample.

Science.gov (United States)

Sharland, Michael J; Waring, Stephen C; Johnson, Brian P; Taran, Allise M; Rusin, Travis A; Pattock, Andrew M; Palcher, Jeanette A

2018-01-01

Assessing test performance validity is a standard clinical practice and although studies have examined the utility of cognitive/memory measures, few have examined attention measures as indicators of performance validity beyond the Reliable Digit Span. The current study further investigates the classification probability of embedded Performance Validity Tests (PVTs) within the Brief Test of Attention (BTA) and the Conners' Continuous Performance Test (CPT-II), in a large clinical sample. This was a retrospective study of 615 patients consecutively referred for comprehensive outpatient neuropsychological evaluation. Non-credible performance was defined two ways: failure on one or more PVTs and failure on two or more PVTs. Classification probability of the BTA and CPT-II into non-credible groups was assessed. Sensitivity, specificity, positive predictive value, and negative predictive value were derived to identify clinically relevant cut-off scores. When using failure on two or more PVTs as the indicator for non-credible responding compared to failure on one or more PVTs, highest classification probability, or area under the curve (AUC), was achieved by the BTA (AUC = .87 vs. .79). CPT-II Omission, Commission, and Total Errors exhibited higher classification probability as well. Overall, these findings corroborate previous findings, extending them to a large clinical sample. BTA and CPT-II are useful embedded performance validity indicators within a clinical battery but should not be used in isolation without other performance validity indicators.
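The four classification statistics named in this abstract follow directly from a two-by-two table of test outcome against the reference criterion. The sketch below uses the standard definitions; the function name and the counts are hypothetical and do not come from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Classification statistics from the counts of a 2x2 table.

    tp/fn: non-credible cases flagged / missed by the indicator.
    tn/fp: credible cases passed / wrongly flagged by the indicator.
    """
    return {
        "sensitivity": tp / (tp + fn),  # flagged share of true non-credible cases
        "specificity": tn / (tn + fp),  # passed share of true credible cases
        "ppv": tp / (tp + fp),          # probability a flag is correct
        "npv": tn / (tn + fn),          # probability a pass is correct
    }

# Hypothetical counts for one candidate cut-off score.
metrics = diagnostic_metrics(tp=40, fp=10, tn=90, fn=20)
```

Repeating the calculation for each candidate cut-off shows the trade-off between sensitivity and specificity that guides the choice of a clinically useful threshold.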
Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

Science.gov (United States)

Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

2015-01-01

Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
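The Bland-Altman limits of agreement used for test-retest reliability can be sketched as below, assuming the conventional 95% limits of bias ± 1.96 standard deviations of the paired differences. The function name and the measurements are hypothetical.

```python
from statistics import mean, stdev

def limits_of_agreement(test, retest):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    differences = [a - b for a, b in zip(test, retest)]
    bias = mean(differences)                   # systematic test-retest shift
    half_width = 1.96 * stdev(differences)     # spread of individual differences
    return bias, bias - half_width, bias + half_width

# Hypothetical scores for four participants measured on two occasions.
first_session = [10, 12, 14, 16]
second_session = [9, 12, 15, 16]
bias, lower, upper = limits_of_agreement(first_session, second_session)
```

A bias near zero with narrow limits indicates that repeated administrations give interchangeable results for an individual, which a correlation coefficient alone does not show.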
Statistical validation of normal tissue complication probability models.

Science.gov (United States)

Xu, Cheng-Jian; van der Schaaf, Arjen; Van't Veld, Aart A; Langendijk, Johannes A; Schilstra, Cornelis

2012-09-01

To investigate the applicability and value of double cross-validation and permutation tests as established statistical approaches in the validation of normal tissue complication probability (NTCP) models. A penalized regression method, LASSO (least absolute shrinkage and selection operator), was used to build NTCP models for xerostomia after radiation therapy treatment of head-and-neck cancer. Model assessment was based on the likelihood function and the area under the receiver operating characteristic curve. Repeated double cross-validation showed the uncertainty and instability of the NTCP models and indicated that the statistical significance of model performance can be obtained by permutation testing. Repeated double cross-validation and permutation tests are recommended to validate NTCP models before clinical use. Copyright © 2012 Elsevier Inc. All rights reserved.
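The permutation-testing idea can be illustrated with a simplified sketch. It assumes the model's predicted scores are held fixed while the outcome labels are shuffled; a full procedure of the kind the abstract describes would refit the model inside each permutation and cross-validation fold. All names and data below are hypothetical.

```python
import random

def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count half)."""
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))

def permutation_p_value(scores, labels, n_perm=2000, seed=0):
    """Estimate how often shuffled labels match or beat the observed AUC."""
    rng = random.Random(seed)
    observed = auc(scores, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if auc(scores, shuffled) >= observed:
            hits += 1
    # Add one to numerator and denominator so the estimate is never zero.
    return observed, (hits + 1) / (n_perm + 1)

# Hypothetical predicted complication probabilities and observed outcomes.
predicted = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]
outcomes = [0, 0, 0, 0, 1, 1, 1, 1]
observed_auc, p_value = permutation_p_value(predicted, outcomes)
```

The p-value is the share of label shufflings that perform at least as well as the real labels, so a small value indicates the model's discrimination is unlikely to arise by chance.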
Statistical Validation of Normal Tissue Complication Probability Models

Energy Technology Data Exchange (ETDEWEB)

Xu Chengjian, E-mail: c.j.xu@umcg.nl [Department of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen (Netherlands)]; Schaaf, Arjen van der; Veld, Aart A. van 't; Langendijk, Johannes A. [Department of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen (Netherlands)]; Schilstra, Cornelis [Department of Radiation Oncology, University of Groningen, University Medical Center Groningen, Groningen (Netherlands); Radiotherapy Institute Friesland, Leeuwarden (Netherlands)]

2012-09-01

Purpose: To investigate the applicability and value of double cross-validation and permutation tests as established statistical approaches in the validation of normal tissue complication probability (NTCP) models. Methods and Materials: A penalized regression method, LASSO (least absolute shrinkage and selection operator), was used to build NTCP models for xerostomia after radiation therapy treatment of head-and-neck cancer. Model assessment was based on the likelihood function and the area under the receiver operating characteristic curve. Results: Repeated double cross-validation showed the uncertainty and instability of the NTCP models and indicated that the statistical significance of model performance can be obtained by permutation testing. Conclusion: Repeated double cross-validation and permutation tests are recommended to validate NTCP models before clinical use.
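The idea behind the permutation test recommended above can be sketched as follows: compute a performance measure (here the area under the ROC curve), then ask how often an equally good value arises when the outcome labels are shuffled. This is a simplified sketch under stated assumptions. It permutes labels against fixed model scores, whereas a full procedure of the kind the abstract describes would presumably refit the LASSO model inside each permutation and cross-validation fold. All function names are hypothetical.

```python
import random


def auc(scores, labels):
    """Area under the ROC curve via pairwise comparison of positives and negatives."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are required")
    # A positive scored above a negative counts 1, a tie counts 0.5.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def permutation_p_value(scores, labels, n_perm=1000, seed=0):
    """Return (observed AUC, one-sided permutation p-value)."""
    rng = random.Random(seed)
    observed = auc(scores, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break any link between score and outcome
        if auc(scores, shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)
```

A small p-value indicates that the observed AUC is unlikely under the null hypothesis of no association between model output and complication status.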
Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests

NARCIS (Netherlands)

Rijken, N.H.M.; Engelen, B.G.M. van; Weerdesteyn, V.G.M.; Geurts, A.C.H.

2015-01-01

OBJECTIVE: To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). DESIGN: Case-control study. SETTING: University medical center. PARTICIPANTS: Patients with various severity levels
Validation Test Report For The CRWMS Analysis and Logistics Visually Interactive Model Version 3.0, 10074-Vtr-3.0-00

International Nuclear Information System (INIS)

Gillespie, S.

2000-01-01

This report describes the tests performed to validate the CRWMS "Analysis and Logistics Visually Interactive" Model (CALVIN) Version 3.0 (V3.0) computer code (STN: 10074-3.0-00). To validate the code, a series of test cases was developed in the CALVIN V3.0 Validation Test Plan (CRWMS M and O 1999a) that exercises the principal calculation models and options of CALVIN V3.0. Twenty-five test cases were developed: 18 logistics test cases and 7 cost test cases. These cases test the features of CALVIN in a sequential manner, so that the validation of each test case is used to demonstrate the accuracy of the input to subsequent calculations. Where necessary, the test cases utilize reduced-size data tables to make the hand calculations used to verify the results more tractable, while still adequately testing the code's capabilities. Acceptance criteria were established for the logistics and cost test cases in the Validation Test Plan (CRWMS M and O 1999a). The logistics test cases were developed to test the following CALVIN calculation models: Spent nuclear fuel (SNF) and reactivity calculations; Options for altering reactor life; Adjustment of commercial SNF (CSNF) acceptance rates for fiscal year calculations and mid-year acceptance start; Fuel selection, transportation cask loading, and shipping to the Monitored Geologic Repository (MGR); Transportation cask shipping to and storage at an Interim Storage Facility (ISF); Reactor pool allocation options; and Disposal options at the MGR. Two types of cost test cases were developed: cases to validate the detailed transportation costs, and cases to validate the costs associated with the Civilian Radioactive Waste Management System (CRWMS) Management and Operating Contractor (M and O) and Regional Servicing Contractors (RSCs). For each test case, values calculated using Microsoft Excel 97 worksheets were compared to CALVIN V3.0 scenarios with the same input data and assumptions. All of the test case results compare with
The Abbott RealTime High Risk HPV test is a clinically validated human papillomavirus assay for triage in the referral population and use in primary cervical cancer screening in women 30 years and older: a review of validation studies.

Science.gov (United States)

Poljak, Mario; Oštrbenk, Anja

2013-01-01

Human papillomavirus (HPV) testing has become an essential part of current clinical practice in the management of cervical cancer and precancerous lesions. We reviewed the most important validation studies of a next-generation real-time polymerase chain reaction-based assay, the RealTime High Risk HPV test (RealTime)(Abbott Molecular, Des Plaines, IL, USA), for triage in referral population settings and for use in primary cervical cancer screening in women 30 years and older published in peer-reviewed journals from 2009 to 2013. RealTime is designed to detect 14 high-risk HPV genotypes with concurrent distinction of HPV-16 and HPV-18 from 12 other HPV genotypes. The test was launched on the European market in January 2009 and is currently used in many laboratories worldwide for routine detection of HPV. We concisely reviewed validation studies of a next-generation real-time polymerase chain reaction (PCR)-based assay: the Abbott RealTime High Risk HPV test. Eight validation studies of RealTime in referral settings showed its consistently high absolute clinical sensitivity for both CIN2+ (range 88.3-100%) and CIN3+ (range 93.0-100%), as well as comparative clinical sensitivity relative to the currently most widely used HPV test: the Qiagen/Digene Hybrid Capture 2 HPV DNA Test (HC2). Due to the significantly different composition of the referral populations, RealTime absolute clinical specificity for CIN2+ and CIN3+ varied greatly across studies, but was comparable relative to HC2. Four validation studies of RealTime performance in cervical cancer screening settings showed its consistently high absolute clinical sensitivity for both CIN2+ and CIN3+, as well as comparative clinical sensitivity and specificity relative to HC2 and GP5+/6+ PCR. RealTime has been extensively evaluated in the last 4 years. RealTime can be considered clinically validated for triage in referral population settings and for use in primary cervical cancer screening in women 30 years and older.

OECD validation study to assess intra- and inter-laboratory reproducibility of the zebrafish embryo toxicity test for acute aquatic toxicity testing.

Science.gov (United States)

Busquet, François; Strecker, Ruben; Rawlings, Jane M; Belanger, Scott E; Braunbeck, Thomas; Carr, Gregory J; Cenijn, Peter; Fochtman, Przemyslaw; Gourmelon, Anne; Hübler, Nicole; Kleensang, André; Knöbel, Melanie; Kussatz, Carola; Legler, Juliette; Lillicrap, Adam; Martínez-Jerónimo, Fernando; Polleichtner, Christian; Rzodeczko, Helena; Salinas, Edward; Schneider, Katharina E; Scholz, Stefan; van den Brandhof, Evert-Jan; van der Ven, Leo T M; Walter-Rohde, Susanne; Weigt, Stefan; Witters, Hilda; Halder, Marlies

2014-08-01

The OECD validation study of the zebrafish embryo acute toxicity test (ZFET) for acute aquatic toxicity testing evaluated the ZFET reproducibility by testing 20 chemicals at 5 different concentrations in 3 independent runs in at least 3 laboratories. Stock solutions and test concentrations were analytically confirmed for 11 chemicals. Newly fertilised zebrafish eggs (20/concentration and control) were exposed for 96h to chemicals. Four apical endpoints were recorded daily as indicators of acute lethality: coagulation of the embryo, lack of somite formation, non-detachment of the tail bud from the yolk sac and lack of heartbeat. Results (LC50 values for 48/96h exposure) show that the ZFET is a robust method with a good intra- and inter-laboratory reproducibility (CV<30%) for most chemicals and laboratories. The reproducibility was lower (CV>30%) for some very toxic or volatile chemicals, and chemicals tested close to their limit of solubility. The ZFET is now available as OECD Test Guideline 236. Considering the high predictive capacity of the ZFET demonstrated by Belanger et al. (2013) in their retrospective analysis of acute fish toxicity and fish embryo acute toxicity data, the ZFET is ready to be considered for acute fish toxicity for regulatory purposes. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
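The abstract above expresses reproducibility as a coefficient of variation (CV) judged against a 30% mark. A minimal sketch of that calculation, assuming the CV is the sample standard deviation of replicate LC50 values divided by their mean, could look like this. The function name is hypothetical, and the study's exact statistical procedure is not given in the abstract.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """Return the CV in percent: sample standard deviation over the mean."""
    if len(values) < 2:
        raise ValueError("need at least two replicate values")
    m = mean(values)
    if m == 0:
        raise ValueError("CV is undefined for a zero mean")
    return 100.0 * stdev(values) / m
```

For example, three made-up LC50 values of 8.0, 10.0 and 12.0 mg/L from different laboratories give a CV of 20%, which would fall below the 30% mark.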
Student mathematical imagination instruments: construction, cultural adaptation and validity

Science.gov (United States)

Dwijayanti, I.; Budayasa, I. K.; Siswono, T. Y. E.

2018-03-01

Imagination has an important role as the center of students' sensorimotor activity. The purpose of this research is to construct an instrument for measuring students' mathematical imagination in understanding the concept of algebraic expression. Validity was examined using questionnaire and test techniques, and the data were analyzed descriptively. The stages were: 1) constructing the embodiment of the imagination; 2) determining the learning style questionnaire; 3) constructing the instruments; 4) translating into Indonesian and adapting the content of the learning style questionnaire to the students' culture; 5) performing content validation. The results indicate that the constructed instrument is valid by content validation and empirical validation, so it can be used with revisions. Content validation involved Indonesian linguists, English linguists and mathematics material experts. Empirical validation was done through a legibility test (10 students), which showed that the language used can generally be understood. In addition, a questionnaire trial (86 students) was analyzed using the point-biserial correlation technique, resulting in 16 valid items, with a KR-20 reliability test meeting the medium reliability criterion. The test instrument trial (32 students) found all items to be valid, with a KR-21 reliability of 0.62.
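The item analysis technique named in the abstract above, the point-biserial correlation between a dichotomous item and the total score, can be sketched as follows. This is an illustrative implementation under the usual textbook definition, not the authors' procedure. The function name and example data are hypothetical.

```python
from math import sqrt
from statistics import mean, pstdev


def point_biserial(item, totals):
    """Correlation between a 0/1 item and each respondent's total score.

    Uses r_pb = (M1 - M0) / s * sqrt(p * q), where s is the population
    standard deviation of the totals and p is the share of 1-responses.
    """
    ones = [t for x, t in zip(item, totals) if x == 1]
    zeros = [t for x, t in zip(item, totals) if x == 0]
    sd = pstdev(totals)
    if not ones or not zeros or sd == 0:
        raise ValueError("item and totals must both vary")
    p = len(ones) / len(item)
    return (mean(ones) - mean(zeros)) / sd * sqrt(p * (1 - p))
```

An item would typically be retained as valid when this coefficient exceeds a critical value for the sample size. The threshold used in the study is not stated in the abstract.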
Validation Test of Geant4 Simulation of Electron Backscattering

CERN Document Server

Kim, Sung Hun; Basaglia, Tullio; Han, Min Cheol; Hoff, Gabriela; Kim, Chan Hyeong; Saracco, Paolo

2015-01-01

Backscattering is a sensitive probe of the accuracy of electron scattering algorithms implemented in Monte Carlo codes. The capability of the Geant4 toolkit to describe realistically the fraction of electrons backscattered from a target volume is extensively and quantitatively evaluated in comparison with experimental data retrieved from the literature. The validation test covers the energy range between approximately 100 eV and 20 MeV, and concerns a wide set of target elements. Multiple and single electron scattering models implemented in Geant4, as well as preassembled selections of physics models distributed within Geant4, are analyzed with statistical methods. The evaluations concern Geant4 versions from 9.1 to 10.1. Significant evolutions are observed over the range of Geant4 versions, not always in the direction of better compatibility with experiment. Goodness-of-fit tests complemented by categorical analysis tests identify a configuration based on Geant4 Urban multiple scattering model in Geant4 vers...
Cross-cultural adaptation and validation of the sino-nasal outcome test (SNOT-22) for Spanish-speaking patients.

Science.gov (United States)

de los Santos, Gonzalo; Reyes, Pablo; del Castillo, Raúl; Fragola, Claudio; Royuela, Ana

2015-11-01

Our objective was to translate, cross-culturally adapt and validate the sino-nasal outcome test 22 (SNOT-22) for the Spanish language. SNOT-22 was translated, back translated, and a pretest trial was performed. The study included 119 individuals divided into 60 cases, who met diagnostic criteria for chronic rhinosinusitis according to the European Position Paper on Rhinosinusitis 2012, and 59 controls, who reported no sino-nasal disease. Internal consistency was evaluated with Cronbach's alpha test, reproducibility with Kappa coefficient, reliability with intraclass correlation coefficient (ICC), validity with Mann-Whitney U test and responsiveness with Wilcoxon test. In cases, Cronbach's alpha was 0.91 both before and after treatment; in controls, it was 0.90 at the first assessment and 0.88 at 3 weeks. Kappa coefficient was calculated for each item, with an average score of 0.69. ICC was also calculated for each item, with a value of 0.87 for the overall score and an average among all items of 0.71. Median score for cases was 47, and 2 for controls, finding the difference to be highly significant (Mann-Whitney U test, p internal consistency, reliability, reproducibility, validity and responsiveness necessary to be a valid instrument to be used in clinical practice.
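Cronbach's alpha, used above as the measure of internal consistency, follows from the item variances and the variance of the total score. The sketch below assumes the standard formula, alpha = k/(k-1) × (1 − Σ item variances / total variance), with sample variances throughout. The function name and data layout are hypothetical.

```python
from statistics import variance


def cronbach_alpha(scores):
    """scores: one list of item scores per respondent, all of equal length."""
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]
    # Variance of the respondents' total scores.
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores do not vary")
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

Values near 0.9, like those reported for SNOT-22, indicate that the items vary together closely. Alpha also tends to rise with the number of items.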
Validating Models of Clinical Word Recognition Tests for Spanish/English Bilinguals

Science.gov (United States)

Shi, Lu-Feng

2014-01-01

Purpose: Shi and Sánchez (2010) developed models to predict the optimal test language for evaluating Spanish/English (S/E) bilinguals' word recognition. The current study intended to validate their conclusions in a separate bilingual listener sample. Method: Seventy normal-hearing S/E bilinguals varying in language profile were included.…
Alternatives to animal testing: research, trends, validation, regulatory acceptance.

Science.gov (United States)

Huggins, Jane

2003-01-01

Current trends and issues in the development of alternatives to the use of animals in biomedical experimentation are discussed in this position paper. Eight topics are considered and include refinement of acute toxicity assays; eye corrosion/irritation alternatives; skin corrosion/irritation alternatives; contact sensitization alternatives; developmental/reproductive testing alternatives; genetic engineering (transgenic) assays; toxicogenomics; and validation of alternative methods. The discussion of refinement of acute toxicity assays is focused primarily on developments with regard to reduction of the number of animals used in the LD(50) assay. However, the substitution of humane endpoints such as clinical signs of toxicity for lethality in these assays is also evaluated. Alternative assays for eye corrosion/irritation as well as those for skin corrosion/irritation are described with particular attention paid to the outcomes, both successful and unsuccessful, of several validation efforts. Alternative assays for contact sensitization and developmental/reproductive toxicity are presented as examples of methods designed for the examination of interactions between toxins and somewhat more complex physiological systems. Moreover, genetic engineering and toxicogenomics are discussed with an eye toward the future of biological experimentation in general. The implications of gene manipulation for research animals, specifically, are also examined. Finally, validation methods are investigated as to their effectiveness, or lack thereof, and suggestions for their standardization and improvement, as well as implementation are reviewed.
Test of gross motor development-2 for Filipino children with intellectual disability: validity and reliability.

Science.gov (United States)

Capio, Catherine M; Eguia, Kathlynne F; Simons, Johan

2016-01-01

This study aimed to examine aspects of validity and reliability of the Test of Gross Motor Development-2 (TGMD-2) in Filipino children with intellectual disability. Content and construct validity were verified, as well as inter-rater and intra-rater reliability. Two paediatric physiotherapists tested 81 children with intellectual disability (mean age = 9.29 ± 2.71 years) on locomotor and object control skills. Analysis of covariance, confirmatory factor analysis and analysis of variance were used to test validity, while Cronbach's alpha, intra-class correlation coefficients (ICC) and Bland-Altman plots were used to examine reliability. Age was a significant predictor of locomotor and object control scores (P = 0.004). The data fit the hypothesised two-factor model with fit indices as follows: χ(2) = 33.525, DF = 34, P = 0.491, χ(2)/DF = 0.986. As hypothesised, gender was a significant predictor for object control skills (P = 0.038). Participants' mean scores were significantly below mastery (locomotor, P intellectual disability.
Danish VISA-A questionnaire with validation and reliability testing for Danish-speaking Achilles tendinopathy patients

DEFF Research Database (Denmark)

Iversen, J. V.; Bartels, E. M.; Jørgensen, J. E.

2016-01-01

The VISA-A questionnaire has proven to be a valid and reliable tool for assessing severity of Achilles tendinopathy (AT). The aim was to translate and cross-culturally adapt the VISA-A questionnaire for a Danish-speaking AT population, and subsequently perform validity and reliability tests...
Aerobic fitness testing in 6- to 9-year-old children: reliability and validity of a modified Yo-Yo IR1 test and the Andersen test

DEFF Research Database (Denmark)

Ahler, T; Bendiksen, Mads; Krustrup, Peter

2012-01-01

This study analysed the reliability and validity of two intermittent running tests (the Yo-Yo IR1 test and the Andersen test) as tools for estimating VO(2max) in children under the age of 10. Two groups, aged 6-7 years (grade 0, n = 18) and 8-9 years (grade 2, n = 16), carried out two repetitions...
Creation and validation of the barriers to alcohol reduction (BAR) scale using classical test theory and item response theory.

Science.gov (United States)

Kunicki, Zachary J; Schick, Melissa R; Spillane, Nichea S; Harlow, Lisa L

2018-06-01

Those who binge drink are at increased risk for alcohol-related consequences when compared to non-binge drinkers. Research shows individuals may face barriers to reducing their drinking behavior, but few measures exist to assess these barriers. This study created and validated the Barriers to Alcohol Reduction (BAR) scale. Participants were college students (n = 230) who endorsed at least one instance of past-month binge drinking (4+ drinks for women or 5+ drinks for men). Using classical test theory, exploratory structural equation modeling found a two-factor structure of personal/psychosocial barriers and perceived program barriers. The sub-factors and full scale had reasonable internal consistency (i.e., coefficient omega = 0.78 (personal/psychosocial), 0.82 (program barriers), and 0.83 (full measure)). The BAR also showed evidence for convergent validity with the Brief Young Adult Alcohol Consequences Questionnaire (r = 0.39, p Theory (IRT) analysis showed the two factors separately met the unidimensionality assumption, and provided further evidence for severity of the items on the two factors. Results suggest that the BAR measure appears reliable and valid for use in an undergraduate student population of binge drinkers. Future studies may want to re-examine this measure in a more diverse sample.
Independent validation testing of the FLAME computer code, Version 1.0

International Nuclear Information System (INIS)

Martian, P.; Chung, J.N.

1992-07-01

Independent testing of the FLAME computer code, Version 1.0, was conducted to determine if the code is ready for use in hydrological and environmental studies at Department of Energy sites. This report describes the technical basis, approach, and results of this testing. Validation tests (i.e., tests that compare field data to the computer-generated solutions) were used to determine the operational status of the FLAME computer code and were done on a qualitative basis through graphical comparisons of the experimental and numerical data. These tests were specifically designed to check: (1) correctness of the FORTRAN coding, (2) computational accuracy, and (3) suitability for simulating actual hydrologic conditions. This testing was performed using a structured evaluation protocol which consisted of: (1) independent applications, and (2) graduated difficulty of test cases. Three tests ranging in complexity from simple one-dimensional steady-state flow field problems under near-saturated conditions to two-dimensional transient flow problems with very dry initial conditions
Tests for validation of fast neutron reactors safety

International Nuclear Information System (INIS)

Nagata, T.; Yamashita, H.

2001-01-01

Japanese scientific research and design enterprises, in cooperation with industrial and power-generating corporations, are implementing a project to create a fast neutron reactor of ultimate safety. One of the basic expected results of this development is a reactor core structure that can eliminate recriticality in the course of a reactor accident involving fuel melting. One possible way to solve this problem is to include pipes, intended to direct (control) the relocation of molten fuel, in the fuel assembly (FA) structure. In developing and subsequently implementing such a design, the basic issue is to confirm experimentally the operating capacity of an FA with this structure, which is called FAIDUS. Within the EAGLE Project, preparation and conduct of out-of-pile and in-pile tests has started on the experimental base of IAE NNC RK. A sodium coolant will be used during the tests. The studies are conducted by NNC RK in cooperation with the Japanese corporations JAPC and JNC. The basic objective of the out-of-pile tests was to obtain preliminary information on fuel relocation behavior under conditions simulating an accident involving melting of a core consisting of FAIDUS FAs, which will help to clarify simulation criteria and to develop the optimum structure of the experimental channel for the reactor experiments. The basic objective of the in-pile tests was experimental confirmation of the operating capacity of the FAIDUS FA model under reactor conditions. According to the program, two tests are planned at the IGR reactor: tests for validation of fast neutron reactor safety, and out-of-pile tests at the EAGLE experimental facility without sodium coolant
Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

Science.gov (United States)

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to join the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument, with a mean intraclass correlation coefficient of 0.90. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1, with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patients' knowledge that is ready to be used in clinical practice. PMID:27995149
Validation and Refinement of Prediction Models to Estimate Exercise Capacity in Cancer Survivors Using the Steep Ramp Test

NARCIS (Netherlands)

Stuiver, Martijn M.; Kampshoff, Caroline S.; Persoon, Saskia; Groen, Wim; van Mechelen, Willem; Chinapaw, Mai J. M.; Brug, Johannes; Nollet, Frans; Kersten, Marie-José; Schep, Goof; Buffart, Laurien M.

2017-01-01

Objective: To further test the validity and clinical usefulness of the steep ramp test (SRT) in estimating exercise tolerance in cancer survivors by external validation and extension of previously published prediction models for peak oxygen consumption (Vo2(peak)) and peak power output (W-peak).
The Validity and Incremental Validity of Knowledge Tests, Low-Fidelity Simulations, and High-Fidelity Simulations for Predicting Job Performance in Advanced-Level High-Stakes Selection

Science.gov (United States)

Lievens, Filip; Patterson, Fiona

2011-01-01

In high-stakes selection among candidates with considerable domain-specific knowledge and experience, investigations of whether high-fidelity simulations (assessment centers; ACs) have incremental validity over low-fidelity simulations (situational judgment tests; SJTs) are lacking. Therefore, this article integrates research on the validity of…
The predictive validity of the BioMedical Admissions Test for pre-clinical examination performance.

Science.gov (United States)

Emery, Joanne L; Bell, John F

2009-06-01

Some medical courses in the UK have many more applicants than places and almost all applicants have the highest possible previous and predicted examination grades. The BioMedical Admissions Test (BMAT) was designed to assist in the student selection process specifically for a number of 'traditional' medical courses with clear pre-clinical and clinical phases and a strong focus on science teaching in the early years. It is intended to supplement the information provided by examination results, interviews and personal statements. This paper reports on the predictive validity of the BMAT and its predecessor, the Medical and Veterinary Admissions Test. Results from the earliest 4 years of the test (2000-2003) were matched to the pre-clinical examination results of those accepted onto the medical course at the University of Cambridge. Correlation and logistic regression analyses were performed for each cohort. Section 2 of the test ('Scientific Knowledge') correlated more strongly with examination marks than did Section 1 ('Aptitude and Skills'). It also had a stronger relationship with the probability of achieving the highest examination class. The BMAT and its predecessor demonstrate predictive validity for the pre-clinical years of the medical course at the University of Cambridge. The test identifies important differences in skills and knowledge between candidates, not shown by their previous attainment, which predict their examination performance. It is thus a valid source of additional admissions information for medical courses with a strong scientific emphasis when previous attainment is very high.
[COOP/WONCA: Reliability and validity of the test administered by telephone].

Science.gov (United States)

Pedrero-Pérez, Eduardo J; Díaz-Olalla, José Manuel

2016-01-01

The COOP/WONCA test was initially proposed as a self-report in which the answers were supported by drawings illustrating the state investigated. Subsequent studies have confirmed its usefulness as a purely verbal self-report administered face-to-face. No data have been found about its usefulness when administered by telephone interview. The aim of this study was to determine the psychometric properties of the COOP/WONCA test for measuring Health-Related Quality of Life (HRQoL) when administered by telephone, and to compare them with those previously obtained with other forms of administration. Cross-sectional study on a random sample. City of Madrid. Random sample of 802 adult subjects, representative of the adult population of Madrid, obtained by stratification from the population census. COOP/WONCA questionnaire with 9 items, included in a broader battery and administered by telephone interview. The unrestricted factor analysis points to the unifactoriality of the scale, which measures a single latent construct (HRQoL) and shows high internal consistency, not significantly different from that found with face-to-face administration, ruling out biases in the telephone modality. The COOP/WONCA test appears to be a reliable and valid measure of HRQoL, and telephone administration can be assumed not to change the results, which can reduce costs in population studies, increasing efficiency without loss of quality in the information collected. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.
Validity of a new assessment rubric for a short-answer test of clinical reasoning.

Science.gov (United States)

Yeung, Euson; Kulasagarem, Kulamakan; Woods, Nicole; Dubrowski, Adam; Hodges, Brian; Carnahan, Heather

2016-07-26

The validity of high-stakes decisions derived from assessment results is of primary concern to candidates and certifying institutions in the health professions. In the field of orthopaedic manual physical therapy (OMPT), there is a dearth of documented validity evidence to support the certification process, particularly for short-answer tests. To address this need, we examined the internal structure of the Case History Assessment Tool (CHAT), a new assessment rubric developed to appraise written responses to a short-answer test of clinical reasoning in post-graduate OMPT certification in Canada. Fourteen physical therapy students (novices) and 16 physical therapists (PT), with minimal and substantial OMPT training respectively, completed a mock examination. Four pairs of examiners (n = 8) participated in appraising written responses using the CHAT. We conducted separate generalizability studies (G studies) for all participants and also by level of OMPT training. Internal consistency was calculated for test questions with more than 2 assessment items. Decision studies were also conducted to determine optimal application of the CHAT for OMPT certification. The overall reliability of CHAT scores was found to be moderate; however, reliability estimates for the novice group suggest that the scale was incapable of accommodating the scores of novices. Internal consistency estimates indicate item redundancies for several test questions, which will require further investigation. Future validity studies should consider discriminating the clinical reasoning competence of OMPT trainees strictly at the post-graduate level. Although rater variance was low, the large variance attributed to error sources not incorporated in our G studies warrants further investigation into other threats to validity. Future examination of examiner stringency is also warranted.
Assessment of Prospective Memory – a Validity Study of Memory for Intentions Screening Test

NARCIS (Netherlands)

Bezdicek, O.; Raskin, S.A.; Altgassen, A.M.; Ruzicka, E.

2014-01-01

Aim: The goal of the present study was to validate the Czech version of the Memory for Intentions (Screening) Test (MIST, 2010). We included standardized testing material, translation of administration and scoring, and assessment of normative data for the MIST in the Czech population. Introduction:
Reliability and validity of the rey visual design learning test in primary school children

NARCIS (Netherlands)

Wilhelm, P.

2004-01-01

The Rey Visual Design Learning Test (Rey, 1964, in Spreen & Strauss, 1991) assesses immediate memory span, new learning and recognition for non-verbal material. Three studies are presented that focused on the reliability and validity of the RVDLT in primary school children. Test-retest reliability

Validation of science virtual test to assess 8th grade students' critical thinking on living things and environmental sustainability theme

Science.gov (United States)

Rusyati, Lilit; Firman, Harry

2017-05-01

This research was motivated by the importance of multiple-choice questions that capture the elements and sub-elements of critical thinking, and by the implementation of computer-based testing. The method was descriptive research profiling the validation of a science virtual test to measure junior high school students' critical thinking. The participants were 8th grade junior high school students (14 years old), with science teacher and expert validators. The instruments used to capture the necessary data were an expert judgment sheet, a legibility test sheet, and a science virtual test package in multiple-choice form with four possible answers. There are four steps to validate the science virtual test to measure students' critical thinking on the theme of "Living Things and Environmental Sustainability" in 7th grade Junior High School. These steps are analysis of core competence and basic competence based on curriculum 2013, expert judgment, legibility test and trial test (limited and large trial test). Based on the trial test, test items are classified as accepted, accepted but needing revision, or rejected. The reliability of the test is α = 0.747, categorized as 'high', which means the test instrument is reliable and highly consistent. The validity of Rxy = 0.63 means that the validity of the instrument was categorized as 'high' according to the interpretation of the Rxy (correlation) value.
[Validation of the AUDIT test for identifying risk consumption and alcohol use disorders in women].

Science.gov (United States)

Pérula de Torres, L A; Fernández-García, J A; Arias-Vega, R; Muriel-Palomino, M; Márquez-Rebollo, E; Ruiz-Moral, R

2005-11-30

To validate the AUDIT test for identifying women with excess alcohol consumption and/or dependency syndrome (DS). Descriptive study to validate a test. Two primary care centres and a county drug-dependency centre. 414 women from 18 to 75 recruited at the clinic. Interventions. Social and personal details were obtained through personal interview, their alcohol consumption was quantified and the AUDIT and MALT questionnaires were filled in. Then the semi-structured SCAN interview was conducted (gold standard; DSM-IV and ICD-10 criteria), and laboratory tests were requested (GGT, GOT, GPT, MCV). 186 patients were given a follow-up appointment three to four weeks later (retest). Intra-observer reliability was evaluated with the Kappa index, internal consistency with Cronbach's alpha, and criterion validity with sensitivity, specificity, predictive values and likelihood ratios. To evaluate the diagnostic performance of the test and the most effective cut-off point, a ROC analysis was run. 11.4% (95% CI, 8.98-13.81) were diagnosed with alcohol abuse (0.5%) or DS (10.9%). The Kappa coefficients of the AUDIT items ranged between 0.685 and 0.795 (PAUDIT is a questionnaire with good psychometric properties. It is reliable and valid for the detection of risk consumption and DS in women.
Full-Scale Structural and NDI Validation Tests of Bonded Composite Doublers for Commercial Aircraft Applications

Energy Technology Data Exchange (ETDEWEB)

Roach, D.; Walkington, P.

1999-02-01

Composite doublers, or repair patches, provide an innovative repair technique which can enhance the way aircraft are maintained. Instead of riveting multiple steel or aluminum plates to facilitate an aircraft repair, it is possible to bond a single Boron-Epoxy composite doubler to the damaged structure. Most of the concerns surrounding composite doubler technology pertain to long-term survivability, especially in the presence of non-optimum installations, and the validation of appropriate inspection procedures. This report focuses on a series of full-scale structural and nondestructive inspection (NDI) tests that were conducted to investigate the performance of Boron-Epoxy composite doublers. Full-scale tests were conducted on fuselage panels cut from retired aircraft. These full-scale tests studied stress reductions, crack mitigation, and load transfer capabilities of composite doublers using simulated flight conditions of cabin pressure and axial stress. Also, structures which modeled key aspects of aircraft structure repairs were subjected to extreme tension, shear and bending loads to examine the composite laminate's resistance to disbond and delamination flaws. Several of the structures were loaded to failure in order to determine doubler design margins. Nondestructive inspections were conducted throughout the test series in order to validate appropriate techniques on actual aircraft structure. The test results showed that a properly designed and installed composite doubler is able to enhance fatigue life, transfer load away from damaged structure, and avoid the introduction of new stress risers (i.e. eliminate global reduction in the fatigue life of the structure). Comparisons with test data obtained prior to the doubler installation revealed that stresses in the parent material can be reduced 30%-60% through the use of the composite doubler. Tests to failure demonstrated that the bondline is able to transfer plastic strains into the doubler and that
The Role of Policy Assumptions in Validating High-stakes Testing Programs.

Science.gov (United States)

Kane, Michael

L. Cronbach has made the point that for validity arguments to be convincing to diverse audiences, they need to be based on assumptions that are credible to these audiences. The interpretations and uses of high stakes test scores rely on a number of policy assumptions about what should be taught in schools, and more specifically, about the content…
[Design and validation of a questionnaire for psychosocial nursing diagnosis in Primary Care].

Science.gov (United States)

Brito-Brito, Pedro Ruymán; Rodríguez-Álvarez, Cristobalina; Sierra-López, Antonio; Rodríguez-Gómez, José Ángel; Aguirre-Jaime, Armando

2012-01-01

To develop a valid, reliable and easy-to-use questionnaire for a psychosocial nursing diagnosis. The study was performed in two phases: first phase, questionnaire design and construction; second phase, validity and reliability tests. A bank of items was constructed using the NANDA classification as a theoretical framework. Each item was assigned a Likert scale or dichotomous response. The combination of responses to the items constituted the diagnostic rules to assign up to 28 labels. A group of experts carried out the validity test for content. Other validated scales were used as reference standards for the criterion validity tests. Forty-five nurses provided the questionnaire to the patients on three separate occasions over a period of three weeks, and the other validated scales only once to 188 randomly selected patients in Primary Care centres in Tenerife (Spain). Validity tests for construct confirmed the six dimensions of the questionnaire with 91% of total variance explained. Validity tests for criterion showed a specificity of 66%-100%, and showed high correlations with the reference scales when the questionnaire was assigning nursing diagnoses. Reliability tests showed agreement of 56%-91% (PQuestionnaire for Psychosocial Nursing Diagnosis was called CdePS, and included 61 items. The CdePS is a valid, reliable and easy-to-use tool in Primary Care centres to improve the assigning of a psychosocial nursing diagnosis. Copyright © 2011 Elsevier España, S.L. All rights reserved.
The BACHD Rat Model of Huntington Disease Shows Specific Deficits in a Test Battery of Motor Function.

Science.gov (United States)

Manfré, Giuseppe; Clemensson, Erik K H; Kyriakou, Elisavet I; Clemensson, Laura E; van der Harst, Johanneke E; Homberg, Judith R; Nguyen, Huu Phuc

2017-01-01

Rationale: Huntington disease (HD) is a progressive neurodegenerative disorder characterized by motor, cognitive and neuropsychiatric symptoms. HD is usually diagnosed by the appearance of motor deficits, resulting in skilled hand use disruption, gait abnormality, muscle wasting and choreatic movements. The BACHD transgenic rat model for HD represents a well-established transgenic rodent model of HD, offering the prospect of an in-depth characterization of the motor phenotype. Objective: The present study aims to characterize different aspects of motor function in BACHD rats, combining classical paradigms with novel high-throughput behavioral phenotyping. Methods: Wild-type (WT) and transgenic animals were tested longitudinally from 2 to 12 months of age. To measure fine motor control, rats were challenged with the pasta handling test and the pellet reaching test. To evaluate gross motor function, animals were assessed by using the holding bar and the grip strength tests. Spontaneous locomotor activity and circadian rhythmicity were assessed in an automated home-cage environment, namely the PhenoTyper. We then integrated existing classical methodologies to test motor function with automated home-cage assessment of motor performance. Results: BACHD rats showed strong impairment in muscle endurance at 2 months of age. Altered circadian rhythmicity and locomotor activity were observed in transgenic animals. On the other hand, reaching behavior, forepaw dexterity and muscle strength were unaffected. Conclusions: The BACHD rat model exhibits certain features of HD patients, like muscle weakness and changes in circadian behavior. We have observed modest but clear-cut deficits in distinct motor phenotypes, thus confirming the validity of this transgenic rat model for treatment and drug discovery purposes.
Development and content validity of a screening instrument for gaming addiction in adolescents: the Gaming Addiction Identification Test (GAIT).

Science.gov (United States)

Vadlin, Sofia; Åslund, Cecilia; Nilsson, Kent W

2015-08-01

This study describes the development of a screening tool for gaming addiction in adolescents - the Gaming Addiction Identification Test (GAIT). Its development was based on the research literature on gaming and addiction. An expert panel comprising professional raters (n = 7), experiential adolescent raters (n = 10), and parent raters (n = 10) estimated the content validity of each item (I-CVI) as well as of the whole scale (S-CVI/Ave), and participated in a cognitive interview about the GAIT scale. The mean scores for both I-CVI and S-CVI/Ave ranged between 0.97 and 0.99 compared with the lowest recommended I-CVI value of 0.78 and the S-CVI/Ave value of 0.90. There were no sex differences and no differences between expert groups regarding ratings in content validity. No differences in the overall evaluation of the scale emerged in the cognitive interviews. Our conclusions were that GAIT showed good content validity in capturing gaming addiction. The GAIT needs further investigation into its psychometric properties of construct validity (convergent and divergent validity) and criterion-related validity, as well as its reliability in both clinical settings and in community settings with adolescents. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
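The two content validity indices named above can be illustrated with a short sketch. It assumes the common convention in which each rater scores an item's relevance on a 4-point scale and ratings of 3 or 4 count as "relevant"; the abstract does not state the scale used, so the threshold, the function names and the sample ratings are all hypothetical.

```python
from typing import List


def item_cvi(ratings: List[int], relevant_threshold: int = 3) -> float:
    """I-CVI: share of raters who scored the item as relevant
    (rating >= relevant_threshold, assumed 4-point scale)."""
    if not ratings:
        raise ValueError("at least one rating is required")
    return sum(r >= relevant_threshold for r in ratings) / len(ratings)


def scale_cvi_ave(ratings_by_item: List[List[int]],
                  relevant_threshold: int = 3) -> float:
    """S-CVI/Ave: mean of the item-level I-CVI values."""
    if not ratings_by_item:
        raise ValueError("at least one item is required")
    values = [item_cvi(r, relevant_threshold) for r in ratings_by_item]
    return sum(values) / len(values)


# Hypothetical ratings: 3 items, each scored by 7 raters.
example_ratings = [
    [4, 4, 3, 4, 4, 3, 4],
    [4, 3, 4, 4, 2, 4, 4],
    [3, 4, 4, 4, 4, 4, 4],
]
i_cvis = [item_cvi(r) for r in example_ratings]
s_cvi_ave = scale_cvi_ave(example_ratings)
```

Each index can then be compared with the recommended minimums quoted in the abstract (0.78 for I-CVI and 0.90 for S-CVI/Ave).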
Validation of US3D for Capsule Aerodynamics using 05-CA Wind Tunnel Test Data

Science.gov (United States)

Schwing, Alan

2012-01-01

Several comparisons of computational fluid dynamics to wind tunnel test data are shown for the purpose of code validation. The wind tunnel test, 05-CA, uses a 7.66% model of NASA's Multi-Purpose Crew Vehicle in the 11-foot test section of the Ames Unitary Plan Wind tunnel. A variety of freestream conditions over four Mach numbers and three angles of attack are considered. Test data comparisons include time-averaged integrated forces and moments, time-averaged static pressure ports on the surface, and Strouhal Number. The applicability of the US3D code to subsonic and transonic flow over a bluff body is assessed on a comprehensive data set. With close comparison, this work validates US3D for highly separated flows similar to those examined here.
The Five Digits Test in the assessment of older adults with low formal education: construct validity and reliability in a Brazilian clinical sample.

Science.gov (United States)

de Paula, Jonas Jardim; Oliveira, Thaís Dell'Oro; Querino, Emanuel Henrique Gonçalves; Malloy-Diniz, Leandro Fernandes

2017-01-01

In the assessment of older adults with very low formal education, typical tests of selective attention and inhibitory control are biased by reading abilities. In this sense, we aim to assess the psychometric characteristics of the Five Digits Test (FDT), a numerical Stroop paradigm, in older adults without cognitive disorders, with mild cognitive impairment, and with dementia. We assessed 211 Brazilian older adults with low formal education using the FDT and other cognitive measures. Construct validity and reliability were assessed by correlations and internal consistency. The FDT test had weak correlations with crystalized intelligence tests and moderate-high correlations with fluid intelligence measures and tests of global cognitive status and executive functions. The split-half coefficient of reliability showed high internal consistency (>0.900). Together, the results suggest that the FDT is a valid and reliable measure for the assessment of processing speed and executive functions in older adults with low formal education.
PIG's Speed Estimated with Pressure Transducers and Hall Effect Sensor: An Industrial Application of Sensors to Validate a Testing Laboratory.

Science.gov (United States)

Lima, Gustavo F; Freitas, Victor C G; Araújo, Renan P; Maitelli, André L; Salazar, Andrés O

2017-09-15

The pipeline inspection using a device called Pipeline Inspection Gauge (PIG) is safe and reliable when the PIG is at low speeds during inspection. We built a Testing Laboratory, containing a testing loop and supervisory system to study speed control techniques for PIGs. The objective of this work is to present and validate the Testing Laboratory, which will allow development of a speed controller for PIGs and solve an existing problem in the oil industry. The experimental methodology used throughout the project is also presented. We installed pressure transducers on pipeline outer walls to detect the PIG's movement and, with data from supervisory, calculated an average speed of 0.43 m/s. At the same time, the electronic board inside the PIG received data from odometer and calculated an average speed of 0.45 m/s. We found an error of 4.44%, which is experimentally acceptable. The results showed that it is possible to successfully build a Testing Laboratory to detect the PIG's passage and estimate its speed. The validation of the Testing Laboratory using data from the odometer and its auxiliary electronic was very successful. Lastly, we hope to develop more research in the oil industry area using this Testing Laboratory.
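The reported 4.44% error can be checked with simple arithmetic. The sketch below assumes the error is the relative difference between the two average speeds with the odometer value (0.45 m/s) as the reference; the abstract does not state the formula, and the function name is hypothetical.

```python
def relative_error_percent(estimate: float, reference: float) -> float:
    """Absolute relative error of `estimate` with respect to `reference`, in percent."""
    if reference == 0:
        raise ValueError("reference must be non-zero")
    return abs(estimate - reference) / abs(reference) * 100.0


# Average speeds quoted in the abstract (m/s).
pressure_transducer_speed = 0.43  # from the supervisory system
odometer_speed = 0.45             # from the electronic board inside the PIG

error = relative_error_percent(pressure_transducer_speed, odometer_speed)
print(f"{error:.2f}%")  # prints "4.44%"
```

Taking the pressure-transducer value as the reference instead would give about 4.65%, so the quoted figure is consistent with the odometer being treated as the reference.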
Validity of the RAST for evaluating anaerobic power performance as compared to Wingate test in cycling athletes

Directory of Open Access Journals (Sweden)

Marcos Roberto Queiroga

2013-12-01

Full Text Available The validity of the Running-based Anaerobic Sprint Test (RAST) was investigated to evaluate the anaerobic power performance in comparison to Wingate test in cycling athletes. Ten mountain-bike male cyclists (28.0±7.3 years) randomly performed Wingate Test and RAST with two trials each. After several anthropometric measurements, peak power (PP), mean power (MP) and fatigue index (FI) for RAST and Wingate Test were analyzed using Student's paired t-test, Pearson's linear correlation test (r) and Bland and Altman's plots. Results showed that, with the exception of FI (33.8±4.6% vs. 37.8±7.9%; r=0.172), significant differences were detected between the Wingate and RAST tests with regard to PP and MP. Although there was a strong correlation for PP and MP, or rather, 0.831 and 0.714 respectively, agreement of analysis between Wingate and RAST protocols was low. The above suggested that RAST was not appropriate to evaluate the performance of anaerobic power by Wingate test in cycling athletes.
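The contrast drawn above between strong correlation and low agreement is what a Bland-Altman analysis is designed to expose. The following is a minimal sketch of the usual computation (mean difference and 95% limits of agreement, taken as bias ± 1.96 standard deviations of the paired differences). The function name and the peak power values are hypothetical and are not the study's data.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def bland_altman_limits(method_a: Sequence[float],
                        method_b: Sequence[float]) -> Tuple[float, float, float]:
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the paired differences a - b; the limits of
    agreement are bias -/+ 1.96 * SD of those differences.
    """
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need two equally long series with at least 2 pairs")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical peak power values (W) for five athletes.
wingate_pp = [820.0, 905.0, 760.0, 880.0, 840.0]
rast_pp = [700.0, 790.0, 610.0, 800.0, 735.0]
bias, lower, upper = bland_altman_limits(wingate_pp, rast_pp)
```

Two methods can correlate strongly and still show a large bias or wide limits, which is why agreement is assessed separately from Pearson's r.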
Danish validation of sniffin' sticks olfactory test for threshold, discrimination, and identification

DEFF Research Database (Denmark)

Niklassen, Andreas Steenholt; Ovesen, Therese; Fernandes, Henrique

2017-01-01

to investigate external validity of international normative values to separate hyposmia from normosmia. METHODS: The study included 388 participants. The first step was a questionnaire study in which 238 adults rated their familiarity with 125 odor descriptors. In the second step, we evaluated the original...... in improvement of familiarity and rate of I, making the test valid for use in Denmark. Furthermore, the study found a large variation in T and D scores between different countries, which should be considered when using these scores to separate hyposmia and anosmia from normosmia. LEVEL OF EVIDENCE: 2b...
Validation and structural analysis of the kinematics concept test

Directory of Open Access Journals (Sweden)

A. Lichtenberger

2017-04-01

Full Text Available The kinematics concept test (KCT) is a multiple-choice test designed to evaluate students’ conceptual understanding of kinematics at the high school level. The test comprises 49 multiple-choice items about velocity and acceleration, which are based on seven kinematic concepts and which make use of three different representations. In the first part of this article we describe the development and the validation process of the KCT. We applied the KCT to 338 Swiss high school students who attended traditional teaching in kinematics. We analyzed the response data to provide the psychometric properties of the test. In the second part we present the results of a structural analysis of the test. An exploratory factor analysis of 664 student answers finally uncovered the seven kinematics concepts as factors. However, the analysis revealed a hierarchical structure of concepts. At the higher level, mathematical concepts group together, and then split up into physics concepts at the lower level. Furthermore, students who seem to understand a concept in one representation have difficulties transferring the concept to similar problems in another representation. Both results have implications for teaching kinematics. First, teaching mathematical concepts beforehand might be beneficial for learning kinematics. Second, instructions have to be designed to teach students the change between different representations.
Validation and structural analysis of the kinematics concept test

Science.gov (United States)

Lichtenberger, A.; Wagner, C.; Hofer, S. I.; Stern, E.; Vaterlaus, A.

2017-06-01

The kinematics concept test (KCT) is a multiple-choice test designed to evaluate students' conceptual understanding of kinematics at the high school level. The test comprises 49 multiple-choice items about velocity and acceleration, which are based on seven kinematic concepts and which make use of three different representations. In the first part of this article we describe the development and the validation process of the KCT. We applied the KCT to 338 Swiss high school students who attended traditional teaching in kinematics. We analyzed the response data to provide the psychometric properties of the test. In the second part we present the results of a structural analysis of the test. An exploratory factor analysis of 664 student answers finally uncovered the seven kinematics concepts as factors. However, the analysis revealed a hierarchical structure of concepts. At the higher level, mathematical concepts group together, and then split up into physics concepts at the lower level. Furthermore, students who seem to understand a concept in one representation have difficulties transferring the concept to similar problems in another representation. Both results have implications for teaching kinematics. First, teaching mathematical concepts beforehand might be beneficial for learning kinematics. Second, instructions have to be designed to teach students the change between different representations.
Reliability and Validity of Colored Progressive Matrices for 4-6 Age Children

Directory of Open Access Journals (Sweden)

Ahmet Bildiren

2017-06-01

Full Text Available In this research, the aim was to test the reliability and validity of the Colored Progressive Matrices for children between the ages of 4 and 6 from 15 schools. The sample of the study consisted of 640 kindergarten children. Test-retest and parallel forms were used for the reliability analyses. For the validity analysis, the relations between the Colored Progressive Matrices Test and the Bender Gestalt Visual Motor Sensitivity Test, WISC-R and TONI-3 tests were examined. The results showed that there was a significant relation between the test-retest results and the parallel forms in all the age groups. Validity analyses showed strong correlations between the Colored Progressive Matrices and all the other measures.
Validity of the German Version of the Continuous-Scale Physical Functional Performance 10 Test

Directory of Open Access Journals (Sweden)

Irene Härdi

2017-01-01

Full Text Available Background. The Continuous-Scale Physical Functional Performance 10 Test (CS-PFP 10) quantitatively assesses physical functional performance in older adults who have a broad range of physical functional ability. This study assessed the validity and reliability of the CS-PFP 10 German version. Methods. Forward-translations and backtranslations as well as cultural adaptions of the test were conducted. Participants were German-speaking Swiss community-dwelling adults aged 64 and older. Concurrent validity was assessed using Pearson correlation coefficients between CS-PFP 10 and gait velocity, Timed Up and Go Test, hand grip strength, SF-36 physical function domain, and Freiburger Physical Activity Questionnaire. Internal consistency was calculated by Cronbach’s alpha. Results. Backtranslation and cultural adaptions were accepted by the CS-PFP 10 developer. CS-PFP 10 total score and subscores (upper body strength, upper body flexibility, lower body strength, balance and coordination, and endurance) correlated significantly with all measures of physical function tested. Internal consistency was high (Cronbach’s alpha 0.95–0.98). Conclusion. The CS-PFP 10 German version is valid and reliable for measuring physical functional performance in German-speaking Swiss community-dwelling older adults. Quantifying physical function is essential for clinical practice and research and provides meaningful insight into physical functional performance of older adults. This trial is registered with ClinicalTrials.gov NCT01539200.
Development of a valid and reliable test to assess trauma radiograph interpretation performance

International Nuclear Information System (INIS)

Neep, M.J.; Steffens, T.; Riley, V.; Eastgate, P.; McPhail, S.M.

2017-01-01

Objectives: The purpose of this investigation was to develop and examine the preliminary validity and reliability among radiographers of a test to assess trauma radiograph interpretation performance suitable for use among health professionals. Methods: Stage 1 examined 14,159 consecutive appendicular and axial examinations from a hospital emergency department over a 12 month period to quantify a typical anatomical region case-mix of trauma radiographs. A sample of radiographic cases representative of affected anatomical regions was then developed into the Image Interpretation Test (IIT). Stage 2 involved prospective investigations of the IIT's reliability (inter-rater, intra-rater, internal consistency) and validity (concurrent) among 41 radiographers. Results: The IIT included 60 cases. The median (interquartile range) clinical experience of participants was 5 (2–10) years. Case scores were internally consistent (Cronbach's alpha = 0.90). Favourable inter-rater reliability (kappa > 0.70 for 58/60 cases, Intra-class correlation coefficient (ICC) > 0.99 for total score) and intra-rater reliability (kappa > 0.90 for 60/60 cases, ICC > 0.99 for total score) was observed. There was a positive association between radiographers' confidence in image interpretation and IIT score (coefficient = 1.52, r-squared = 0.60, p < 0.001). Conclusions: The IIT developed during this investigation included a selection of radiographic cases consistent with anatomical regions represented in an adult trauma case-mix. This study has also provided foundational preliminary evidence to support the reliability and validity of the IIT among radiographers. The findings suggest that it is possible to assess image interpretation performance of adult trauma radiographs with this test. - Highlights: • Development of an Image Interpretation Test (IIT). • Cases consistent with anatomical regions represented in a typical adult trauma case-mix. • Development of a
Familiarization, validity and smallest detectable difference of the isometric squat test in evaluating maximal strength.

Science.gov (United States)

Drake, David; Kennedy, Rodney; Wallace, Eric

2018-02-06

Isometric multi-joint tests are considered reliable and have strong relationships with 1RM performance. However, limited evidence is available for the isometric squat in terms of effects of familiarization and reliability. This study aimed to assess the effect of familiarization and the stability reliability, to determine the smallest detectable difference, and to examine the correlation of the isometric squat test with 1RM squat performance. Thirty-six strength-trained participants volunteered to take part in this study. Following three familiarization sessions, test-retest reliability was evaluated with a 48-hour window between each time point. Isometric squat peak, net and relative force were assessed. Results showed that three familiarization sessions were required and that the isometric squat had a high level of stability reliability and a smallest detectable difference of 11% for peak and relative force. Isometric strength at a knee angle of ninety degrees had a strong significant relationship with 1RM squat performance. In conclusion, the isometric squat is a valid test to assess multi-joint strength and can discriminate between strong and weak 1RM squat performance. Changes greater than 11% in peak and relative isometric squat performance should be considered as meaningful in participants who are familiar with the test.
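The abstract does not say how the smallest detectable difference was derived. One widely used approach, shown here purely as an assumption, starts from the standard error of measurement, SEM = SD * sqrt(1 - ICC), and takes SDD = 1.96 * sqrt(2) * SEM. The function names and the input values below are hypothetical.

```python
import math


def standard_error_of_measurement(sd: float, icc: float) -> float:
    """SEM = SD * sqrt(1 - ICC), with ICC a test-retest reliability in [0, 1]."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_difference(sd: float, icc: float) -> float:
    """SDD at the 95% level: 1.96 * sqrt(2) * SEM (assumed formula)."""
    return 1.96 * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical peak force data: between-subject SD of 300 N, ICC of 0.96.
sdd_newtons = smallest_detectable_difference(300.0, 0.96)
# Expressed relative to a hypothetical group mean of 2500 N.
sdd_percent = sdd_newtons / 2500.0 * 100.0
```

Expressing the result as a percentage of the mean, as in the last line, is how a threshold such as the 11% quoted above is typically reported.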
Invalid Permutation Tests

Directory of Open Access Journals (Sweden)

Mikel Aickin

2010-01-01

Full Text Available Permutation tests are often presented in a rather casual manner, in both introductory and advanced statistics textbooks. The appeal of the cleverness of the procedure seems to replace the need for a rigorous argument that it produces valid hypothesis tests. The consequence of this educational failing has been a widespread belief in a “permutation principle”, which is supposed invariably to give tests that are valid by construction, under an absolute minimum of statistical assumptions. Several lines of argument are presented here to show that the permutation principle itself can be invalid, concentrating on the Fisher-Pitman permutation test for two means. A simple counterfactual example illustrates the general problem, and a slightly more elaborate counterfactual argument is used to explain why the main mathematical proof of the validity of permutation tests is mistaken. Two modifications of the permutation test are suggested to be valid in a very modest simulation. In instances where simulation software is readily available, investigating the validity of a specific permutation test can be done easily, requiring only a minimum understanding of statistical technicalities.
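For readers unfamiliar with the procedure under discussion, the sketch below shows the textbook two-sample permutation test for a difference in means, computed exactly by enumerating every reassignment of the pooled observations. It illustrates the standard procedure the paper examines, not the modified tests the author proposes; the function name and data are hypothetical.

```python
from itertools import combinations
from statistics import mean
from typing import Sequence


def permutation_test_two_means(x: Sequence[float], y: Sequence[float]) -> float:
    """Exact two-sided permutation p-value for |mean(x) - mean(y)|.

    Enumerates all ways of choosing len(x) observations from the pooled
    sample, so it is only practical for small samples.
    """
    if not x or not y:
        raise ValueError("both samples must be non-empty")
    pooled = list(x) + list(y)
    observed = abs(mean(x) - mean(y))
    indices = range(len(pooled))
    extreme = 0
    total = 0
    for chosen in combinations(indices, len(x)):
        chosen_set = set(chosen)
        group_x = [pooled[i] for i in chosen]
        group_y = [pooled[i] for i in indices if i not in chosen_set]
        statistic = abs(mean(group_x) - mean(group_y))
        # Small tolerance guards against floating-point ties.
        if statistic >= observed - 1e-12:
            extreme += 1
        total += 1
    return extreme / total


# Hypothetical samples.
p_value = permutation_test_two_means([1, 2, 3], [4, 5, 6])
```

With these samples only the observed split and its mirror image are as extreme as the data, so 2 of the 20 possible splits count and the p-value is 0.1.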
The reliability and validity of using the urine dipstick test by patient self-assessment for urinary tract infection screening in spinal cord injury patients.

Science.gov (United States)

Duanngai, Krit; Sirasaporn, Patpiya; Ngaosinchai, Siriwan Surapaitoon

2017-01-01

The aim of this study is to evaluate the reliability of the urine dipstick test by patients' self-assessment for urinary tract infection (UTI) screening and to determine the validity of the urine dipstick test. Rehabilitation Department, Srinagarind Hospital, Thailand. A diagnostic study. This study compared the urine dipstick test (index test) with the National Institute on Disability and Rehabilitation Research (NIDRR) criteria (gold standard test) in spinal cord injury (SCI) patients. The urine dipstick test gave positive or negative results, while the NIDRR criteria classified patients as UTI or no UTI. Interrater reliability was measured with the kappa statistic, whereas the validity of the urine dipstick test was reported in terms of sensitivity, specificity, positive likelihood ratio (+LR), negative likelihood ratio (-LR), positive predictive value (PPV), and negative predictive value (NPV). Among the 56 participants, the kappa values of the urine dipstick test for leukocyte esterase, nitrite, and combined leukocyte esterase and nitrite were 0.09, 0.21, and 0.52, respectively. The nitrite urine dipstick test showed the highest sensitivity (90%). The combined leukocyte esterase and nitrite urine dipstick test gave the highest specificity (87%), PPV (60%), NPV (93%), and +LR (5.63). The interrater reliability of the combined leukocyte esterase and nitrite urine dipstick test showed moderate agreement. The combined leukocyte esterase and nitrite urine dipstick test showed a high level of both sensitivity and specificity. The combined leukocyte esterase and nitrite urine dipstick test should be promoted for patients' self-assessment for UTI screening in SCI patients.
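The validity measures listed above all derive from a 2x2 table of index test result against gold standard. The sketch below shows the standard definitions. The counts are hypothetical and are not the study's data, and the function name is an illustrative choice.

```python
from typing import Dict


def diagnostic_metrics(tp: int, fp: int, fn: int, tn: int) -> Dict[str, float]:
    """Standard accuracy measures from a 2x2 table.

    tp/fn: gold-standard positives that tested positive/negative.
    fp/tn: gold-standard negatives that tested positive/negative.
    """
    if min(tp + fn, tn + fp, tp + fp, tn + fn) == 0:
        raise ValueError("every row and column of the table must be non-empty")
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    if specificity in (0.0, 1.0):
        raise ValueError("likelihood ratios are undefined at specificity 0 or 1")
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "lr_positive": sensitivity / (1.0 - specificity),
        "lr_negative": (1.0 - sensitivity) / specificity,
    }


# Hypothetical counts for illustration only.
metrics = diagnostic_metrics(tp=45, fp=10, fn=5, tn=90)
```

Because +LR equals sensitivity divided by (1 - specificity), a specificity of 87% together with a +LR of 5.63, as reported for the combined test, implies a sensitivity of roughly 73%.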

Validity, Reliability, and Performance Determinants of a New Job-Specific Anaerobic Work Capacity Test for the Norwegian Navy Special Operations Command.

Science.gov (United States)

Angeltveit, Andreas; Paulsen, Gøran; Solberg, Paul A; Raastad, Truls

2016-02-01

Operators in Special Operation Forces (SOF) have a particularly demanding profession where physical and psychological capacities can be challenged to the extremes. The diversity of physical capacities needed depends on the mission. Consequently, tests used to monitor SOF operators' physical fitness should cover a broad range of physical capacities. Whereas tests for strength and aerobic endurance are established, there is no test for specific anaerobic work capacity described in the literature. The purpose of this study was therefore to evaluate the reliability and validity, and to identify the performance determinants, of a new test developed for testing specific anaerobic work capacity in SOF operators. Nineteen active young students were included in the concurrent validity part of the study. The students performed the evacuation (EVAC) test 3 times and the results were compared for reliability and with performance in the Wingate cycle test, 300-m sprint, and a maximal accumulated oxygen deficit (MAOD) test. In part II of the study, 21 Norwegian Navy Special Operations Command operators conducted the EVAC test, anthropometric measurements, a dual x-ray absorptiometry scan, leg press, isokinetic knee extensions, maximal oxygen uptake test, and countermovement jump (CMJ) test. The EVAC test showed good reliability after 1 familiarization trial (intraclass correlation = 0.89; coefficient of variation = 3.7%). The EVAC test correlated well with the Wingate test (r = -0.68), 300-m sprint time (r = 0.51), and 300-m mean power (W) (r = -0.67). No significant correlation was found with the MAOD test. In part II of the study, height, body mass, lean body mass, isokinetic knee extension torque, maximal oxygen uptake, and maximal power in a CMJ were significantly correlated with performance in the EVAC test. The EVAC test is a reliable and valid test for anaerobic work capacity for SOF operators, and muscle mass, leg strength, and leg power seem to be the most important determinants
Validity of the Optometry Admission Test in Predicting Performance in Schools and Colleges of Optometry.

Science.gov (United States)

Kramer, Gene A.; Johnston, JoElle

1997-01-01

A study examined the relationship between Optometry Admission Test scores and pre-optometry or undergraduate grade point average (GPA) with first and second year performance in optometry schools. The test's predictive validity was limited but significant, and comparable to those reported for other admission tests. In addition, the scores…
Analysis of RVACS test 2F-L for COMMIX validation

International Nuclear Information System (INIS)

Tzanos, C.P.; Pedersen, D.R.

1989-01-01

The RVACS test 2F-L was analyzed to support the validation of COMMIX. This test is characterized by a power input of 50 kW, natural convection in the sodium pool, forced RVACS air circulation and a heat up period of 8 hours. At the beginning of the experiment the sodium pool was isothermal. After 7.5 hours the system reached near steady state with a temperature difference between the bottom and top of the pool of 96 °C. The COMMIX predictions for the sodium pool temperatures and the air outlet temperatures were in good agreement with measurements. The maximum difference between predictions and measurements was ~12 °C. 4 refs., 5 figs
Reliability and convergent validity of the five-step test in people with chronic stroke.

Science.gov (United States)

Ng, Shamay S M; Tse, Mimi M Y; Tam, Eric W C; Lai, Cynthia Y Y

2018-01-10

(i) To estimate the intra-rater, inter-rater and test-retest reliabilities of the Five-Step Test (FST), as well as the minimum detectable change in FST completion times in people with stroke. (ii) To estimate the convergent validity of the FST with other measures of stroke-specific impairments. (iii) To identify the best cut-off times for distinguishing FST performance in people with stroke from that of healthy older adults. A cross-sectional study. University-based rehabilitation centre. Forty-eight people with stroke and 39 healthy controls. None. The FST, along with (for the stroke survivors only) scores on the Fugl-Meyer Lower Extremity Assessment (FMA-LE), the Berg Balance Scale (BBS), Limits of Stability (LOS) tests, and Activities-specific Balance Confidence (ABC) scale were tested. The FST showed excellent intra-rater (intra-class correlation coefficient; ICC = 0.866-0.905), inter-rater (ICC = 0.998), and test-retest (ICC = 0.838-0.842) reliabilities. A minimum detectable change of 9.16 s was found for the FST in people with stroke. The FST correlated significantly with the FMA-LE, BBS, and LOS results in the forward and sideways directions (r = -0.411 to -0.716, p people with stroke and healthy older adults. The FST is a reliable, easy-to-administer clinical test for assessing stroke survivors' ability to negotiate steps and stairs.
Overview of results of the first phase of validation activities for the IFMIF High Flux Test Module

International Nuclear Information System (INIS)

Arbeiter, Frederik; Chen Yuming; Dolensky, Bernhard; Freund, Jana; Heupel, Tobias; Klein, Christine; Scheel, Nicola; Schlindwein, Georg

2012-01-01

Highlights: ► Validation of computational fluid dynamics (CFD) modeling approach for application in the IFMIF High Flux Test Module. ► Fabrication of prototypes of the irradiation capsules of the IFMIF High Flux Test Module. - Abstract: The international fusion materials irradiation facility (IFMIF) is projected to create an experimentally validated database of material properties relevant for fusion reactor designs. The IFMIF High Flux Test Module is the dedicated experiment to irradiate alloys in the temperature range 250–550 °C and up to 50 displacements per atom per irradiation cycle. The High Flux Test Module is developed to maximize the specimen payload in the restricted irradiation volume, and to minimize the temperature spread within each specimen bundle. Low pressure helium mini-channel cooling is used to offer a high integration density. Due to the demanding thermo-hydraulic and mechanical conditions, the engineering design process (involving numerical neutronic, thermo-hydraulic and mechanical analyses) is supported by extensive experimental validation activities. This paper reports on the prototype manufacturing, thermo-hydraulic modeling experiments and component tests, as well as on mechanical testing. For the testing of the 1:1 prototype of the High Flux Test Module, a dedicated test facility, the Helium Loop Karlsruhe-Low Pressure (HELOKA-LP) has been taken into service.
Autism Spectrum Disorders and Self-Reports: Testing Validity and Reliability Using the NEO-PI-R

Science.gov (United States)

Hesselmark, Eva; Eriksson, Jonna M.; Westerlund, Joakim; Bejerot, Susanne

2015-01-01

Although self-reported measures are frequently used to assess adults with autism spectrum disorders (ASD), the validity of self-reports is under-researched in ASD. The core symptoms of ASD may negatively affect the psychometric properties of self-reported measures. The aim of the present study was to test the validity and reliability of…
Validity of FAA-approved color vision tests for class II and class III aeromedical screening.

Science.gov (United States)

1993-09-01

All clinical color vision tests currently used in the medical examination of pilots were studied regarding validity for prediction of performance on practical tests of ability to discriminate the aviation signal colors, red, green, and white given un...
Validation of a computer-adaptive test to evaluate generic health-related quality of life

Directory of Open Access Journals (Sweden)

Zardaín Pilar C

2010-12-01

Full Text Available Abstract Background: Health Related Quality of Life (HRQoL) is a relevant variable in the evaluation of health outcomes. Questionnaires based on Classical Test Theory typically require a large number of items to evaluate HRQoL. Computer Adaptive Testing (CAT) can be used to reduce test length while maintaining and, in some cases, improving accuracy. This study aimed at validating a CAT based on Item Response Theory (IRT) for evaluation of generic HRQoL: the CAT-Health instrument. Methods: Cross-sectional study of subjects aged over 18 attending Primary Care Centres for any reason. CAT-Health was administered along with the SF-12 Health Survey. Age, gender and a checklist of chronic conditions were also collected. CAT-Health was evaluated considering: (1) feasibility: completion time and test length; (2) content range coverage, Item Exposure Rate (IER) and test precision; and (3) construct validity: differences in the CAT-Health scores according to clinical variables and correlations between both questionnaires. Results: 396 subjects answered CAT-Health and SF-12, 67.2% females, mean age (SD) 48.6 (17.7) years. 36.9% did not report any chronic condition. Median completion time for CAT-Health was 81 seconds (IQ range = 59-118) and it increased with age (p Conclusions: Although domain-specific CATs exist for various areas of HRQoL, CAT-Health is one of the first IRT-based CATs designed to evaluate generic HRQoL and it has proven feasible, valid and efficient, when administered to a broad sample of individuals attending primary care settings.
A validated model for the 22-item Sino-Nasal Outcome Test subdomain structure in chronic rhinosinusitis.

Science.gov (United States)

Feng, Allen L; Wesely, Nicholas C; Hoehle, Lloyd P; Phillips, Katie M; Yamasaki, Alisa; Campbell, Adam P; Gregorio, Luciano L; Killeen, Thomas E; Caradonna, David S; Meier, Josh C; Gray, Stacey T; Sedaghat, Ahmad R

2017-12-01

Previous studies have identified subdomains of the 22-item Sino-Nasal Outcome Test (SNOT-22), reflecting distinct and largely independent categories of chronic rhinosinusitis (CRS) symptoms. However, no study has validated the subdomain structure of the SNOT-22. This study aims to validate the existence of underlying symptom subdomains of the SNOT-22 using confirmatory factor analysis (CFA) and to develop a subdomain model that practitioners and researchers can use to describe CRS symptomatology. A total of 800 patients with CRS were included into this cross-sectional study (400 CRS patients from Boston, MA, and 400 CRS patients from Reno, NV). Their SNOT-22 responses were analyzed using exploratory factor analysis (EFA) to determine the number of symptom subdomains. A CFA was performed to develop a validated measurement model for the underlying SNOT-22 subdomains along with various tests of validity and goodness of fit. EFA demonstrated 4 distinct factors reflecting: sleep, nasal, otologic/facial pain, and emotional symptoms (Cronbach's alpha, >0.7; Bartlett's test of sphericity, p Kaiser-Meyer-Olkin >0.90), independent of geographic locale. The corresponding CFA measurement model demonstrated excellent measures of fit (root mean square error of approximation, 0.95; Tucker-Lewis index, >0.95) and measures of construct validity (heterotrait-monotrait [HTMT] ratio, 0.7), again independent of geographic locale. The use of the 4-subdomain structure for SNOT-22 (reflecting sleep, nasal, otologic/facial pain, and emotional symptoms of CRS) was validated as the most appropriate to calculate SNOT-22 subdomain scores for patients from different geographic regions using CFA. © 2017 ARS-AAOA, LLC.
Fecal electrolyte testing for evaluation of unexplained diarrhea: Validation of body fluid test accuracy in the absence of a reference method.

Science.gov (United States)

Voskoboev, Nikolay V; Cambern, Sarah J; Hanley, Matthew M; Giesen, Callen D; Schilling, Jason J; Jannetto, Paul J; Lieske, John C; Block, Darci R

2015-11-01

Validation of tests performed on body fluids other than blood or urine can be challenging due to the lack of a reference method to confirm accuracy. The aim of this study was to evaluate alternate assessments of accuracy that laboratories can rely on to validate body fluid tests in the absence of a reference method using the example of sodium (Na(+)), potassium (K(+)), and magnesium (Mg(2+)) testing in stool fluid. Validations of fecal Na(+), K(+), and Mg(2+) were performed on the Roche cobas 6000 c501 (Roche Diagnostics) using residual stool specimens submitted for clinical testing. Spiked recovery, mixing studies, and serial dilutions were performed and % recovery of each analyte was calculated to assess accuracy. Results were confirmed by comparison to a reference method (ICP-OES, PerkinElmer). Mean recoveries for fecal electrolytes were Na(+) upon spiking=92%, mixing=104%, and dilution=105%; K(+) upon spiking=94%, mixing=96%, and dilution=100%; and Mg(2+) upon spiking=93%, mixing=98%, and dilution=100%. When autoanalyzer results were compared to reference ICP-OES results, Na(+) had a slope=0.94, intercept=4.1, and R(2)=0.99; K(+) had a slope=0.99, intercept=0.7, and R(2)=0.99; and Mg(2+) had a slope=0.91, intercept=-4.6, and R(2)=0.91. Calculated osmotic gap using both methods were highly correlated with slope=0.95, intercept=4.5, and R(2)=0.97. Acid pretreatment increased magnesium recovery from a subset of clinical specimens. A combination of mixing, spiking, and dilution recovery experiments are an acceptable surrogate for assessing accuracy in body fluid validations in the absence of a reference method. Copyright © 2015 The Canadian Society of Clinical Chemists. Published by Elsevier Inc. All rights reserved.
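The recovery calculations named in this abstract can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, concentrations are assumed to be already corrected for any volume change caused by spiking, and the osmotic gap uses the conventional 290 − 2 × (Na + K) formula, which the abstract does not spell out.

```python
def spike_recovery_percent(baseline: float, spike_added: float,
                           measured_after_spike: float) -> float:
    """Percent of the added analyte that the assay actually recovers."""
    if spike_added <= 0:
        raise ValueError("spike_added must be positive")
    return 100.0 * (measured_after_spike - baseline) / spike_added


def dilution_recovery_percent(neat: float, measured_diluted: float,
                              dilution_factor: float) -> float:
    """Percent agreement between the back-calculated diluted result and the neat result."""
    if neat <= 0:
        raise ValueError("neat concentration must be positive")
    return 100.0 * measured_diluted * dilution_factor / neat


def fecal_osmotic_gap(sodium: float, potassium: float,
                      assumed_osmolality: float = 290.0) -> float:
    """Conventional stool osmotic gap in mOsm/kg (electrolytes in mmol/L)."""
    return assumed_osmolality - 2.0 * (sodium + potassium)


if __name__ == "__main__":
    # Hypothetical values, not taken from the study.
    print(spike_recovery_percent(baseline=40.0, spike_added=50.0,
                                 measured_after_spike=86.0))
    print(dilution_recovery_percent(neat=80.0, measured_diluted=21.0,
                                    dilution_factor=4.0))
    print(fecal_osmotic_gap(sodium=30.0, potassium=75.0))
```

Recoveries close to 100% across spiking, mixing and dilution are what the abstract treats as surrogate evidence of accuracy when no reference method is available.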
Reliability, Validity and Factor Structure of Drug Abuse Screening Test

OpenAIRE

Sayed Hadi Sayed Alitabar; Mojtaba Habibi; Maryam Falahatpisheh; Musa Arvin

2016-01-01

Background and Objective: Given the increase in substance use in the country, more research on this phenomenon is necessary. This study investigates the validity, reliability and confirmatory factor structure of the Drug Abuse Screening Test (DAST). Materials and Methods: The sample consisted of 381 patients (143 women and 238 men) selected by multi-stage cluster sampling from areas 2, 6 and 12 of Tehran; in each region, 6 drug rehabilitation centers were randomly selected. T...
Advanced testing and validation centre gets electric vehicle technology to market faster

Energy Technology Data Exchange (ETDEWEB)

Astil, T.; Girard, F. [National Research Council of Canada, Vancouver, BC (Canada). Inst. for Fuel Cell Innovation

2010-07-01

The National Research Council (NRC) Institute for Fuel Cell Innovation is advancing Canada's clean energy advantage through NRC's technology cluster initiatives, which help Canadian small and medium enterprises achieve commercialization breakthroughs in key sectors. This presentation discussed the technology evaluation program (TEP) offered by the NRC Institute for Fuel Cell Innovation. The presentation discussed the TEP's mission, the advanced testing and validation centre (ATVC), previous ATVC clients, the environmental chamber, dynamometer, vibration table, electrochemical battery testing, and the electrochemical testing laboratory. The ATVC is a specialized and safe environment for objective, reliable and accurate standardized testing applications of electric vehicle technologies. It offers independent test services to external organizations, making it easier to prove that electric vehicle technologies will perform under specific operating conditions. figs.
Traditional Chinese version of the Mayer Salovey Caruso Emotional Intelligence Test (MSCEIT-TC): Its validation and application to schizophrenic individuals.

Science.gov (United States)

Mao, Wei-Chung; Chen, Li-Fen; Chi, Chia-Hsing; Lin, Ching-Hung; Kao, Yu-Chen; Hsu, Wen-Yau; Lane, Hsien-Yuan; Hsieh, Jen-Chuen

2016-09-30

Schizophrenia is an illness that impairs a person's social cognition. The Mayer Salovey Caruso Emotional Intelligence Test (MSCEIT) is the most well-known test used to measure emotional intelligence (EI), which is a major component of social cognition. Given the absence of EI ability-based scales adapted to Chinese speakers, we translated the MSCEIT into a Traditional Chinese version (MSCEIT-TC) and validated this scale for use in schizophrenia studies. The specific aims were to validate the MSCEIT-TC, to develop a norm for the MSCEIT-TC, and to use this norm to explore the EI performance of individuals with schizophrenia. We included in our study seven hundred twenty-eight healthy controls and seventy-six individuals with schizophrenia. The results suggest that the MSCEIT-TC is reliable and valid when assessing EI. The results showed good discrimination and validity when comparing the two study groups. Impairment was the greatest for two branches, Understanding and Managing Emotions, which implies that the deficits of individuals with schizophrenia involve ToM (theory of mind) tasks. Deficits involving the negative scale of schizophrenia were related to impaired performance when the MSCEIT-TC was used (in branches 2, 3, 4, and the area Strategic). Our findings suggest that the MSCEIT-TC can be used for emotional studies in healthy Chinese and in clinical settings for investigating individuals with schizophrenia. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Symptom validity testing in memory clinics: Hippocampal-memory associations and relevance for diagnosing mild cognitive impairment

NARCIS (Netherlands)

Rienstra, Anne; Groot, Paul F. C.; Spaan, Pauline E. J.; Majoie, Charles B. L. M.; Nederveen, Aart J.; Walstra, Gerard J. M.; de Jonghe, Jos F. M.; van Gool, Willem A.; Olabarriaga, Silvia D.; Korkhov, Vladimir V.; Schmand, Ben

2013-01-01

Patients with mild cognitive impairment (MCI) do not always convert to dementia. In such cases, abnormal neuropsychological test results may not validly reflect cognitive symptoms due to brain disease, and the usual brain-behavior relationships may be absent. This study examined symptom validity in
Testing instrument validity for LATE identification based on inequality moment constraints

DEFF Research Database (Denmark)

Huber, Martin; Mellace, Giovanni

2015-01-01

We derive testable implications of instrument validity in just identified treatment effect models with endogeneity and consider several tests. The identifying assumptions of the local average treatment effect allow us to both point identify and bound the mean potential outcomes (i) of the always...... takers under treatment and (ii) of the never takers under non-treatment. The point identified means must lie within their respective bounds, which provides us with four testable inequality moment constraints. Finally, we adapt our testing framework to the identification of distributional features....... A brief simulation study and an application to labor market data are also provided....
Validity of the Wechsler Test of Adult Reading (WTAR): effort considered in a clinical sample of U.S. military veterans.

Science.gov (United States)

Whitney, Kriscinda A; Shepard, Polly H; Mariner, Jennifer; Mossbarger, Brad; Herman, Steven M

2010-07-01

The current study represents an examination of the construct validity of the Wechsler Test of Adult Reading (WTAR) among a sample of U.S. military veterans referred for outpatient neuropsychological evaluation that included a measure of negative response bias, namely, the Test of Memory Malingering (TOMM). This retrospective data analysis examined the relationship between the WTAR and measures of current verbal general intellectual function and current cognitive skills. Findings showed that, among patients passing the TOMM (N = 98), WTAR scores were most highly correlated with current verbal IQ but also showed significant correlations with verbal memory and lesser, but still significant, correlations with measures of visual-spatial memory. Discriminant validity for the WTAR was also shown among the group passing the TOMM in the sense that the WTAR, which is designed to measure verbal premorbid general intellectual skill, was not as highly correlated with measures of learning and memory as was a measure of current verbal general intellectual skill. Whereas scores on most study measures did significantly differ between the groups that passed versus failed the TOMM (N = 26), scores on the WTAR did not, suggesting that the WTAR may remain robust even in the face of suboptimal effort.
The accomplishments of lithium target and test facility validation activities in the IFMIF/EVEDA phase

Science.gov (United States)

Arbeiter, Frederik; Baluc, Nadine; Favuzza, Paolo; Gröschel, Friedrich; Heidinger, Roland; Ibarra, Angel; Knaster, Juan; Kanemura, Takuji; Kondo, Hiroo; Massaut, Vincent; Saverio Nitti, Francesco; Miccichè, Gioacchino; O'hira, Shigeru; Rapisarda, David; Sugimoto, Masayoshi; Wakai, Eiichi; Yokomine, Takehiko

2018-01-01

As part of the engineering validation and engineering design activities (EVEDA) phase for the international fusion materials irradiation facility IFMIF, major elements of a lithium target facility and the test facility were designed, prototyped and validated. For the lithium target facility, the EVEDA lithium test loop was built at JAEA and used to test the stability (waves and long term) of the lithium flow in the target, work out the startup procedures, and test lithium purification and analysis. It was confirmed by experiments in the Lifus 6 plant at ENEA that lithium corrosion on ferritic martensitic steels is acceptably low. Furthermore, complex remote handling procedures for the remote maintenance of the target in the test cell environment were successfully practiced. For the test facility, two variants of a high flux test module were prototyped and tested in helium loops, demonstrating their good capabilities of maintaining the material specimens at the desired temperature with a low temperature spread. Irradiation tests were performed for heated specimen capsules and irradiation instrumentation in the BR2 reactor at SCK-CEN. The small specimen test technique, essential for obtaining material test results with limited irradiation volume, was advanced by evaluating specimen shape and test technique influences.
Translation, Validation, and Reliability of the Dutch Late-Life Function and Disability Instrument Computer Adaptive Test.

Science.gov (United States)

Arensman, Remco M; Pisters, Martijn F; de Man-van Ginkel, Janneke M; Schuurmans, Marieke J; Jette, Alan M; de Bie, Rob A

2016-09-01

Adequate and user-friendly instruments for assessing physical function and disability in older adults are vital for estimating and predicting health care needs in clinical practice. The Late-Life Function and Disability Instrument Computer Adaptive Test (LLFDI-CAT) is a promising instrument for assessing physical function and disability in gerontology research and clinical practice. The aims of this study were: (1) to translate the LLFDI-CAT to the Dutch language and (2) to investigate its validity and reliability in a sample of older adults who spoke Dutch and dwelled in the community. For the assessment of validity of the LLFDI-CAT, a cross-sectional design was used. To assess reliability, measurement of the LLFDI-CAT was repeated in the same sample. The item bank of the LLFDI-CAT was translated with a forward-backward procedure. A sample of 54 older adults completed the LLFDI-CAT, World Health Organization Disability Assessment Schedule 2.0, RAND 36-Item Short-Form Health Survey physical functioning scale (10 items), and 10-Meter Walk Test. The LLFDI-CAT was repeated in 2 to 8 days (mean=4.5 days). Pearson's r and the intraclass correlation coefficient (ICC) (2,1) were calculated to assess validity, group-level reliability, and participant-level reliability. A correlation of .74 for the LLFDI-CAT function scale and the RAND 36-Item Short-Form Health Survey physical functioning scale (10 items) was found. The correlations of the LLFDI-CAT disability scale with the World Health Organization Disability Assessment Schedule 2.0 and the 10-Meter Walk Test were -.57 and -.53, respectively. The ICC (2,1) of the LLFDI-CAT function scale was .84, with a group-level reliability score of .85. The ICC (2,1) of the LLFDI-CAT disability scale was .76, with a group-level reliability score of .81. The high percentage of women in the study and the exclusion of older adults with recent joint replacement or hospitalization limit the generalizability of the results. The Dutch LLFDI
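For readers unfamiliar with the ICC (2,1) reported in this abstract, the coefficient can be sketched from a subjects × sessions table using the two-way random-effects, absolute-agreement, single-measure form of Shrout and Fleiss. This is an illustrative sketch with a hypothetical function name; it is not the authors' analysis code, and the scores in the usage lines are invented.

```python
def icc_2_1(scores):
    """ICC(2,1) for a table of scores[subject][session].

    Uses the two-way ANOVA mean squares:
    (MSR - MSE) / (MSR + (k - 1) * MSE + k * (MSC - MSE) / n)
    """
    n = len(scores)          # number of subjects (rows)
    k = len(scores[0])       # number of sessions or raters (columns)
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need a rectangular table with at least 2 rows and 2 columns")

    grand = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


if __name__ == "__main__":
    # Invented test-retest scores for five participants.
    retest = [[52.0, 54.0], [61.0, 60.0], [47.0, 49.0], [70.0, 68.0], [58.0, 59.0]]
    print(round(icc_2_1(retest), 3))
```

Because this form penalises systematic shifts between sessions as well as random error, a constant offset between test and retest lowers the coefficient even when the ranking of participants is unchanged.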
Reliability and validity of a low load endurance strength test for upper and lower extremities in patients with fibromyalgia.

Science.gov (United States)

Munguía-Izquierdo, Diego; Legaz-Arrese, Alejandro

2012-11-01

To evaluate the reliability, standard error of the mean (SEM), clinical significant change, and known group validity of 2 assessments of endurance strength to low loads in patients with fibromyalgia syndrome (FS). Cross-sectional reliability and comparative study. University Pablo de Olavide, Seville, Spain. Middle-aged women with FS (n=95) and healthy women (n=64) matched for age, weight, and body mass index (BMI) were recruited for the study. Not applicable. The endurance strength to low loads tests of the upper and lower extremities and anthropometric measures (BMI) were used for the evaluations. The differences between the readings (tests 1 and 2) and the SDs of the differences, intraclass correlation coefficient (ICC) model (2,1), 95% confidence interval for the ICC, coefficient of repeatability, intrapatient SD, SEM, Wilcoxon signed-rank test, and Bland-Altman plots were used to examine reliability. A Mann-Whitney U test was used to analyze the differences in test values between the patient group and the control group. We hypothesized that patients with FS would have an endurance strength to low loads performance in lower and upper extremities at least twice as low as that of the healthy controls. Satisfactory test-retest reliability and SEMs were found for the lower extremity, dominant arm, and nondominant arm tests (ICC=.973-.979; P.05 for all). The Bland-Altman plots showed 95% limits of agreement for the lower extremity (4.7 to -4.5), dominant arm (3.8 to -4.4), and nondominant arm (3.9 to -4.1) tests. The endurance strength to low loads test scores for the patients with FS were 4-fold lower than for the controls in all performed tests (P<.001 for all). The endurance strength to low loads tests showed good reliability and known group validity and can be recommended for evaluating endurance strength to low loads in patients with FS. For individual evaluation, however, an improved score of at least 4 and 5 repetitions for the upper and lower extremities
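The Bland-Altman limits of agreement mentioned in this abstract are the mean test-retest difference plus or minus 1.96 standard deviations of the differences. A minimal sketch follows, assuming paired repetition counts from two sessions; the function name and the data are hypothetical and do not come from the study.

```python
import statistics


def bland_altman_limits(first, second, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements.

    bias is the mean of the paired differences (first - second); the
    limits are bias -/+ z times the sample SD of those differences.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equally long series with at least 2 pairs")
    diffs = [a - b for a, b in zip(first, second)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample SD, n - 1 in the denominator
    return bias, bias - z * sd, bias + z * sd


if __name__ == "__main__":
    # Invented repetition counts for six participants, sessions 1 and 2.
    session_1 = [22, 18, 30, 25, 27, 19]
    session_2 = [21, 20, 29, 27, 26, 20]
    bias, lower, upper = bland_altman_limits(session_1, session_2)
    print(f"bias={bias:.2f}, limits of agreement=({lower:.2f}, {upper:.2f})")
```

Limits that straddle zero roughly symmetrically, as in the intervals quoted above, indicate no systematic drift between the two sessions.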
Reliability and Validity of a New Test of Change-of-Direction Speed for Field-Based Sports: the Change-of-Direction and Acceleration Test (CODAT).

Science.gov (United States)

Lockie, Robert G; Schultz, Adrian B; Callaghan, Samuel J; Jeffriess, Matthew D; Berry, Simon P

2013-01-01

Field sport coaches must use reliable and valid tests to assess change-of-direction speed in their athletes. Few tests feature linear sprinting with acute change-of-direction maneuvers. The Change-of-Direction and Acceleration Test (CODAT) was designed to assess field sport change-of-direction speed, and includes a linear 5-meter (m) sprint, 45° and 90° cuts, 3-m sprints to the left and right, and a linear 10-m sprint. This study analyzed the reliability and validity of this test, through comparisons to 20-m sprint (0-5, 0-10, 0-20 m intervals) and Illinois agility run (IAR) performance. Eighteen Australian footballers (age = 23.83 ± 7.04 yrs; height = 1.79 ± 0.06 m; mass = 85.36 ± 13.21 kg) were recruited. Following familiarization, subjects completed the 20-m sprint, CODAT, and IAR in 2 sessions, 48 hours apart. Intra-class correlation coefficients (ICC) assessed relative reliability. Absolute reliability was analyzed through paired samples t-tests (p ≤ 0.05) determining between-session differences. Typical error (TE), coefficient of variation (CV), and differences between the TE and smallest worthwhile change (SWC), also assessed absolute reliability and test usefulness. For the validity analysis, Pearson's correlations (p ≤ 0.05) analyzed between-test relationships. Results showed no between-session differences for any test (p = 0.19-0.86). CODAT time averaged ~6 s, and the ICC and CV equaled 0.84 and 3.0%, respectively. The homogeneous sample of Australian footballers meant that the CODAT's TE (0.19 s) exceeded the usual 0.2 x standard deviation (SD) SWC (0.10 s). However, the CODAT is capable of detecting moderate performance changes (SWC calculated as 0.5 x SD = 0.25 s). There was a near perfect correlation between the CODAT and IAR (r = 0.92), and very large correlations with the 20-m sprint (r = 0.75-0.76), suggesting that the CODAT was a valid change-of-direction speed test. Due to movement specificity, the CODAT has value for field sport
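The usefulness comparison in this abstract reduces to simple arithmetic: a test can track a change of a given size only if its typical error does not exceed the smallest worthwhile change for that size. The sketch below reproduces the reported figures; the function names are hypothetical, and the between-subject SD of 0.5 s is inferred from the abstract (0.10 s / 0.2) rather than stated in it.

```python
def smallest_worthwhile_change(between_subject_sd: float, factor: float = 0.2) -> float:
    """SWC as a fraction of the between-subject SD (0.2 = small, 0.5 = moderate)."""
    return factor * between_subject_sd


def can_detect(typical_error: float, swc: float) -> bool:
    """A test is considered useful when its typical error is no larger than the SWC."""
    return typical_error <= swc


if __name__ == "__main__":
    sd = 0.5   # seconds; inferred from the reported 0.2 x SD = 0.10 s
    te = 0.19  # seconds; typical error reported in the abstract
    for label, factor in (("small", 0.2), ("moderate", 0.5)):
        swc = smallest_worthwhile_change(sd, factor)
        verdict = "detectable" if can_detect(te, swc) else "not detectable"
        print(f"{label} change: SWC = {swc:.2f} s, TE = {te:.2f} s, {verdict}")
```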

Validity evidence for the situational judgment test paradigm in emotional intelligence measurement.

Science.gov (United States)

Libbrecht, Nele; Lievens, Filip

2012-01-01

To date, various measurement approaches have been proposed to assess emotional intelligence (EI). Recently, two new EI tests have been developed based on the situational judgment test (SJT) paradigm: the Situational Test of Emotional Understanding (STEU) and the Situational Test of Emotion Management (STEM). Initial attempts have been made to examine the construct-related validity of these new tests; we extend these findings by placing the tests in a broad nomological network. To this end, 850 undergraduate students completed a personality inventory, a cognitive ability test, a self-report EI test, a performance-based EI measure, the STEU, and the STEM. The SJT-based EI tests were not strongly correlated with personality and fluid cognitive ability. Regarding their relation with existing EI measures, the tests did not capture the same construct as self-report EI measures, but corresponded rather to performance-based EI measures. Overall, these results lend support for the SJT paradigm for measuring EI as an ability.
Clinical utility and validity of minoxidil response testing in androgenetic alopecia.

Science.gov (United States)

Goren, Andy; Shapiro, Jerry; Roberts, Janet; McCoy, John; Desai, Nisha; Zarrab, Zoulikha; Pietrzak, Aldona; Lotti, Torello

2015-01-01

Clinical response to 5% topical minoxidil for the treatment of androgenetic alopecia (AGA) is typically observed after 3-6 months. Approximately 40% of patients will regrow hair. Given the prolonged treatment time required to elicit a response, a diagnostic test for ruling out nonresponders would have significant clinical utility. Two studies have previously reported that sulfotransferase enzyme activity in plucked hair follicles predicts a patient's response to topical minoxidil therapy. The aim of this study was to assess the clinical utility and validity of minoxidil response testing. In this communication, the present authors conducted an analysis of completed and ongoing studies of minoxidil response testing. The analysis confirmed the clinical utility of a sulfotransferase enzyme test in successfully ruling out 95.9% of nonresponders to topical minoxidil for the treatment of AGA. © 2014 Wiley Periodicals, Inc.
Criterion validity and reliability of a smartphone delivered sub-maximal fitness test for people with type 2 diabetes

DEFF Research Database (Denmark)

Brinklov, Cecilie Fau; Thorsen, Ida Kær; Karstoft, Kristian

2016-01-01

Background: Prevention of multi-morbidities following non-communicable diseases requires a systematic registration of adverse modifiable risk factors, including low physical fitness. The aim of the study was to establish criterion validity and reliability of a smartphone app (InterWalk) delivered....... The algorithm was validated using leave-one-out cross validation. Test-retest reliability was tested in a subset of participants (N = 10). Results: The overall VO2peak prediction of the algorithm (R2) was 0.60 and 0.45 when the smartphone was placed in the pockets of the pants and jacket, respectively (p ... calorimetry and the acceleration (vector magnitude) from the smartphone was obtained. The vector magnitude was used to predict VO2peak along with the co-variates weight, height and sex. The validity of the algorithm was tested when the smartphone was placed in the right pocket of the pants or jacket...
Reliability and Validity of the Inline Skating Skill Test.

Science.gov (United States)

Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje

2016-09-01

This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8-2.6%] - 2.2% [95% CI: 0.0-4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2-2.4%] - 2.7% [95% CI: 2.1-4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92-0.99] - 0.99 [95% CI: 0.98-1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters' performances. Competitive-level skaters needed shorter time (24.4-26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.
Recent Advances in Simulation of Eddy Current Testing of Tubes and Experimental Validations

Science.gov (United States)

Reboud, C.; Prémel, D.; Lesselier, D.; Bisiaux, B.

2007-03-01

Eddy current testing (ECT) is widely used in iron and steel industry for the inspection of tubes during manufacturing. A collaboration between CEA and the Vallourec Research Center led to the development of new numerical functionalities dedicated to the simulation of ECT of non-magnetic tubes by external probes. The achievement of experimental validations led us to the integration of these models into the CIVA platform. Modeling approach and validation results are discussed here. A new numerical scheme is also proposed in order to improve the accuracy of the model.
A framework for the testing and validation of the I and C system based on a simulator

Energy Technology Data Exchange (ETDEWEB)

Lee, Young Jun; Kwon, Kee Choon; Lee, Jang Soo [KAERI, Daejeon (Korea, Republic of)

2016-05-15

The I and C system for a nuclear power plant should be developed as a prototype or mock-up from the concept phase of the development process, and the function and performance of the computer system also have to be tested and validated. If possible, the developed prototype or mock-up could receive the signals of a normal or abnormal operation status of a nuclear power plant and generate the proper requirement output signal. Using these processes, it can be verified that the status of a plant is changed to the design state or the state needed by the plant operator. A simulation-based conformity evaluation platform is an environment that can automate the testing and validation actions. A traditional testing and validation method defines the static test requirements and extracts the input data from the defined requirement using IO signal generation devices. On the contrary, a simulation-based test method can generate the real calculated input data from a simulator and send the signals to the test devices directly. In this paper, we developed a framework that can conduct a conformity evaluation based on a simulator and implement the communication and monitoring program.
The WRAIR projectile concussive impact model of mild traumatic brain injury: re-design, testing and preclinical validation.

Science.gov (United States)

Leung, Lai Yee; Larimore, Zachary; Holmes, Larry; Cartagena, Casandra; Mountney, Andrea; Deng-Bryant, Ying; Schmid, Kara; Shear, Deborah; Tortella, Frank

2014-08-01

The WRAIR projectile concussive impact (PCI) model was developed for preclinical study of concussion. It represents a truly non-invasive closed-head injury caused by a blunt impact. The original design, however, has several drawbacks that limit the manipulation of injury parameters. The present study describes engineering advancements made to the PCI injury model including helmet material testing, projectile impact energy/head kinematics and impact location. Material testing indicated that among the tested materials, 'fiber-glass/carbon' had the lowest elastic modulus and yield stress for providing a relatively high percentage of load transfer from the projectile impact, resulting in significant hippocampal astrocyte activation. Impact energy testing of small projectiles, ranging in shape and size, showed the steel sphere produced the highest impact energy and the most consistent impact characteristics. Additional tests confirmed the steel sphere produced linear and rotational motions on the rat's head while remaining within a range that meets the criteria for mTBI. Finally, impact location testing results showed that PCI targeted at the temporoparietal surface of the rat head produced the most prominent gait abnormalities. Using the parameters defined above, pilot studies were conducted to provide initial validation of the PCI model demonstrating quantifiable and significant increases in righting reflex recovery time, axonal damage and astrocyte activation following single and multiple concussions.
Overview of results of the first phase of validation activities for the IFMIF High Flux Test Module

Energy Technology Data Exchange (ETDEWEB)

Arbeiter, Frederik, E-mail: frederik.arbeiter@kit.edu [Karlsruhe Institute of Technology, Karlsruhe (Germany); Chen Yuming; Dolensky, Bernhard; Freund, Jana; Heupel, Tobias; Klein, Christine; Scheel, Nicola; Schlindwein, Georg [Karlsruhe Institute of Technology, Karlsruhe (Germany)

2012-08-15

Highlights: Validation of computational fluid dynamics (CFD) modeling approach for application in the IFMIF High Flux Test Module. Fabrication of prototypes of the irradiation capsules of the IFMIF High Flux Test Module. Abstract: The international fusion materials irradiation facility (IFMIF) is projected to create an experimentally validated database of material properties relevant for fusion reactor designs. The IFMIF High Flux Test Module is the dedicated experiment to irradiate alloys in the temperature range 250-550 °C and up to 50 displacements per atom per irradiation cycle. The High Flux Test Module is developed to maximize the specimen payload in the restricted irradiation volume, and to minimize the temperature spread within each specimen bundle. Low pressure helium mini-channel cooling is used to offer a high integration density. Due to the demanding thermo-hydraulic and mechanical conditions, the engineering design process (involving numerical neutronic, thermo-hydraulic and mechanical analyses) is supported by extensive experimental validation activities. This paper reports on the prototype manufacturing, thermo-hydraulic modeling experiments and component tests, as well as on mechanical testing. For the testing of the 1:1 prototype of the High Flux Test Module, a dedicated test facility, the Helium Loop Karlsruhe-Low Pressure (HELOKA-LP) has been taken into service.
Italian Validation of Homophobia Scale (HS).

Science.gov (United States)

Ciocca, Giacomo; Capuano, Nicolina; Tuziak, Bogdan; Mollaioli, Daniele; Limoncin, Erika; Valsecchi, Diana; Carosa, Eleonora; Gravina, Giovanni L; Gianfrilli, Daniele; Lenzi, Andrea; Jannini, Emmanuele A

2015-09-01

The Homophobia Scale (HS) is a valid tool to assess homophobia. This test is self-reporting, composed of 25 items, which assesses a total score and three factors linked to homophobia: behavior/negative affect, affect/behavioral aggression, and negative cognition. The aim of this study was to validate the HS in the Italian context. An Italian translation of the HS was carried out by two bilingual people, after which a native English speaker translated the test back into English. A psychologist and sexologist checked the translated items from a clinical point of view. We recruited 100 subjects aged 18-65 for the Italian validation of the HS. The Pearson coefficient and Cronbach's α coefficient were computed to assess the test-retest reliability and internal consistency. A sociodemographic questionnaire covering the main information, such as age, geographic distribution, partnership status, education, religious orientation, and sex orientation, was administered together with the translated version of HS. The analysis of the internal consistency showed an overall Cronbach's α coefficient of 0.92. In the four domains, the Cronbach's α coefficient was 0.90 in behavior/negative affect, 0.94 in affect/behavioral aggression, and 0.92 in negative cognition, whereas in the total score was 0.86. The test-retest reliability showed the following results: the HS total score was r = 0.93 (P cognition was r = 0.75 (P validation of the HS revealed the use of this self-report test to have good psychometric properties. This study offers a new tool to assess homophobia. In this regard, the HS can be introduced into the clinical praxis and into programs for the prevention of homophobic behavior.
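Cronbach's α, the internal-consistency statistic reported in this abstract, can be computed from a respondent-by-item score matrix. The following is a minimal sketch using the standard formula; the function name cronbach_alpha and the toy scores are illustrative assumptions, not data or code from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    Uses the standard formula k/(k-1) * (1 - sum(item variances) / variance(totals)).
    """
    k = len(scores[0])
    # Sample variance of each item across respondents
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical scores: 5 respondents x 3 items (not from the HS study)
toy_scores = [[1, 2, 2], [3, 3, 4], [4, 5, 5], [2, 2, 3], [5, 4, 5]]
alpha = cronbach_alpha(toy_scores)
print(f"alpha = {alpha:.3f}")
```

Values close to 1 indicate that the items vary together; the 25-item HS would be handled the same way, with one row per respondent.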
Validation of the 3-day rule for stool bacterial tests in Japan.

Science.gov (United States)

Kobayashi, Masanori; Sako, Akahito; Ogami, Toshiko; Nishimura, So; Asayama, Naoki; Yada, Tomoyuki; Nagata, Naoyoshi; Sakurai, Toshiyuki; Yokoi, Chizu; Kobayakawa, Masao; Yanase, Mikio; Masaki, Naohiko; Takeshita, Nozomi; Uemura, Naomi

2014-01-01

Stool cultures are expensive and time consuming, and the positive rate of enteric pathogens in cases of nosocomial diarrhea is low. The 3-day rule, whereby clinicians order a Clostridium difficile (CD) toxin test rather than a stool culture for inpatients developing diarrhea >3 days after admission, has been well studied in Western countries. The present study sought to validate the 3-day rule in an acute care hospital setting in Japan. Stool bacterial and CD toxin test results for adult patients hospitalized in an acute care hospital in 2008 were retrospectively analyzed. Specimens collected after an initial positive test were excluded. The positive rate and cost-effectiveness of the tests were compared among three patient groups. The adult patients were divided into three groups for comparison: outpatients, patients hospitalized for ≤3 days and patients hospitalized for ≥4 days. Over the 12-month period, 1,597 stool cultures were obtained from 992 patients, and 880 CD toxin tests were performed in 529 patients. In the outpatient, inpatient ≤3 days and inpatient ≥4 days groups, the rate of positive stool cultures was 14.2%, 3.6% and 1.3% and that of positive CD toxin tests was 1.9%, 7.1% and 8.5%, respectively. The medical costs required to obtain one positive result were 9,181, 36,075 and 103,600 JPY and 43,200, 11,333 and 9,410 JPY, respectively. The 3-day rule was validated for the first time in a setting other than a Western country. Our results revealed that the "3-day rule" is also useful and cost-effective in Japan.
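The "medical costs required to obtain one positive result" can be read as the cost of a single test divided by the positive rate. The sketch below illustrates that arithmetic with the stool-culture positive rates from the abstract. The per-test cost is not reported in the abstract, so ASSUMED_UNIT_COST_JPY is a hypothetical value, chosen only so that the outputs fall in the same range as the published figures.

```python
def cost_per_positive(unit_cost_jpy, positive_rate):
    """Expected spend needed to obtain one positive result."""
    if positive_rate <= 0:
        raise ValueError("positive_rate must be greater than 0")
    return unit_cost_jpy / positive_rate


# Stool-culture positive rates reported in the abstract
stool_culture_rates = {
    "outpatient": 0.142,
    "inpatient <=3 days": 0.036,
    "inpatient >=4 days": 0.013,
}

ASSUMED_UNIT_COST_JPY = 1300  # hypothetical; not stated in the source

costs = {
    group: cost_per_positive(ASSUMED_UNIT_COST_JPY, rate)
    for group, rate in stool_culture_rates.items()
}
for group, cost in costs.items():
    print(f"{group}: {cost:,.0f} JPY per positive culture")
```

Because the cost scales with the inverse of the positive rate, a roughly tenfold drop in yield between outpatients and long-stay inpatients produces a roughly tenfold rise in cost per positive culture, which is the pattern the 3-day rule exploits.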
Preparation, validation and user-testing of pictogram-based patient information leaflets for hemodialysis patients.

Science.gov (United States)

Mateti, Uday Venkat; Nagappa, Anantha Naik; Attur, Ravindra Prabhu; Bairy, Manohar; Nagaraju, Shankar Prasad; Mallayasamy, Surulivelrajan; Vilakkathala, Rajesh; Guddattu, Vasudev; Balkrishnan, Rajesh

2015-11-01

Patient information leaflets are universally accepted resources to educate patients/users about their medications, disease and lifestyle modification. The objective of the study was to prepare, validate and perform user-testing of pictogram-based patient information leaflets (P-PILs) among hemodialysis (HD) patients. The P-PILs were prepared by referring to primary, secondary and tertiary resources. The content and pictograms of the leaflet were validated by an expert committee consisting of three nephrologists and two academic pharmacists. The Baker Able Leaflet Design was applied to develop the layout and design of the P-PILs. A quasi-experimental pre- and post-test design without a control group was used with 81 HD patients for user-testing of the P-PILs. The mean Baker Able Leaflet Design assessment score was 28 for the English version of the leaflet and 26 for the Kannada version. The overall user-testing knowledge assessment mean scores were observed to have significantly improved from 44.25 to 69.62 with p value information leaflets can be considered an effective educational tool for HD patients.
Validation of an Instrument and Testing Protocol for Measuring the Combinatorial Analysis Schema.

Science.gov (United States)

Staver, John R.; Harty, Harold

1979-01-01

Designs a testing situation to examine the presence of combinatorial analysis, to establish construct validity in the use of an instrument, Combinatorial Analysis Behavior Observation Scheme (CABOS), and to investigate the presence of the schema in young adolescents. (Author/GA)
Validation test of advanced technology for IPV nickel-hydrogen flight cells: Update

Science.gov (United States)

Smithrick, John J.; Hall, Stephen W.

1992-01-01

Individual pressure vessel (IPV) nickel-hydrogen technology was advanced at NASA Lewis and under Lewis contracts with the intention of improving cycle life and performance. One advancement was to use 26 percent potassium hydroxide (KOH) electrolyte to improve cycle life. Another advancement was to modify the state-of-the-art cell design to eliminate identified failure modes. The modified design is referred to as the advanced design. A breakthrough in the low-earth-orbit (LEO) cycle life of IPV nickel-hydrogen cells has been previously reported. The cycle life of boiler plate cells containing 26 percent KOH electrolyte was about 40,000 LEO cycles compared to 3,500 cycles for cells containing 31 percent KOH. The boiler plate test results are in the process of being validated using flight hardware and real time LEO testing at the Naval Weapons Support Center (NWSC), Crane, Indiana under a NASA Lewis Contract. An advanced 125 Ah IPV nickel-hydrogen cell was designed. The primary function of the advanced cell is to store and deliver energy for long-term, LEO spacecraft missions. The new features of this design are: (1) use of 26 percent rather than 31 percent KOH electrolyte; (2) use of a patented catalyzed wall wick; (3) use of serrated-edge separators to facilitate gaseous oxygen and hydrogen flow within the cell, while still maintaining physical contact with the wall wick for electrolyte management; and (4) use of a floating rather than a fixed stack (state-of-the-art) to accommodate nickel electrode expansion due to charge/discharge cycling. The significant improvements resulting from these innovations are: extended cycle life; enhanced thermal, electrolyte, and oxygen management; and accommodation of nickel electrode expansion. The advanced cell design is in the process of being validated using real time LEO cycle life testing at NWSC, Crane, Indiana. An update of validation test results confirming this technology is presented.
The Bulimia Test--Revised: Validation with "DSM-IV" Criteria for Bulimia Nervosa.

Science.gov (United States)

Thelen, Mark H.; And Others

1996-01-01

The Bulimia Test--Revised (BULIT-R) was given to 23 female subjects who met the criteria for bulimia in the "Diagnostic and Statistical Manual of Mental Disorders" (DSM-IV) and 124 female controls. The BULIT-R appears to be a valid instrument for identifying individuals who meet DSM-IV criteria for bulimia. (SLD)
Validity of 20-metre multi stage shuttle run test for estimation of ...

African Journals Online (AJOL)

Validity of 20-metre multi stage shuttle run test for estimation of maximum oxygen uptake in Indian male university students. P Chatterjee, AK Banerjee, P Debnath, P Bas, B Chatterjee. Abstract. No Abstract. South African Journal for Physical, Health Education, Recreation and Dance Vol. 12(4) 2006: pp. 461-467.
Bridging the Gap Between Validation and Implementation of Non-Animal Veterinary Vaccine Potency Testing Methods

Science.gov (United States)

Dozier, Samantha; Brown, Jeffrey; Currie, Alistair

2011-01-01

Simple Summary: Many vaccines are tested for quality in experiments that require the use of large numbers of animals in procedures that often cause significant pain and distress. Newer technologies have fostered the development of vaccine quality control tests that reduce or eliminate the use of animals, but the availability of these newer methods has not guaranteed their acceptance by regulators or use by manufacturers. We discuss a strategic approach that has been used to assess and ultimately increase the use of non-animal vaccine quality tests in the U.S. and U.K. Abstract: In recent years, technologically advanced high-throughput techniques have been developed that replace, reduce or refine animal use in vaccine quality control tests. Following validation, these tests are slowly being accepted for use by international regulatory authorities. Because regulatory acceptance itself has not guaranteed that approved humane methods are adopted by manufacturers, various organizations have sought to foster the preferential use of validated non-animal methods by interfacing with industry and regulatory authorities. After noticing this gap between regulation and uptake by industry, we began developing a paradigm that seeks to narrow the gap and quicken implementation of new replacement, refinement or reduction guidance. A systematic analysis of our experience in promoting the transparent implementation of validated non-animal vaccine potency assays has led to the refinement of our paradigmatic process, presented here, by which interested parties can assess the local regulatory acceptance of methods that reduce animal use and integrate them into quality control testing protocols, or ensure the elimination of peripheral barriers to their use, particularly for potency and other tests carried out on production batches. PMID:26486625
Interrater and test-retest reliability and validity of the Norwegian version of the BESTest and mini-BESTest in people with increased risk of falling.

Science.gov (United States)

Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L

2017-04-20

The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Fall Efficacy Scale-International (FES-I). It was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or Multiple Sclerosis) were assessed twice by two raters. Relative reliability was analysed with Intraclass Correlation Coefficient (ICC), and absolute reliability with standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC95 = 5.0) and test-retest reliability (rater A/rater B: ICC = 0.89/0.89, SEM = 3.9/4.3, SDC95 = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC95 = 3.3) and test-retest reliability (rater A/rater B: ICC = 0.85/0.84, SEM = 1.8/1.9, SDC95 = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho -0.51 and -0.50, p test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinic and in research.
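The abstract does not state how the smallest detectable change was derived from the standard error of measurement. A common convention is SDC95 = 1.96 × √2 × SEM, with SEM itself often taken as SD × √(1 − ICC). The sketch below assumes those conventions; the function names are illustrative.

```python
import math


def sdc95(sem):
    """Smallest detectable change at the 95% level, assuming SDC95 = 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2) * sem


def sem_from_icc(sd, icc):
    """Standard error of measurement, assuming SEM = SD * sqrt(1 - ICC)."""
    return sd * math.sqrt(1 - icc)


# Interrater SEM values reported in the abstract
bestest_interrater_sdc = sdc95(1.79)
mini_bestest_interrater_sdc = sdc95(1.19)
print(round(bestest_interrater_sdc, 1), round(mini_bestest_interrater_sdc, 1))
```

Under this assumed formula, the reported interrater SEM values of 1.79 and 1.19 give 5.0 and 3.3 after rounding, matching the SDC95 values in the abstract. The test-retest values agree to within rounding of the reported SEM.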
Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project

Directory of Open Access Journals (Sweden)

Singh Amika S

2012-08-01

Full Text Available Abstract. Background: Insight into parental energy balance-related behaviours, their determinants and parenting practices is important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10–12 year old children. Findings: We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10–12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement. All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. Of the other 29 items, construct validity was moderate for 24 and poor for 5 items. Conclusions: The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appear to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
The Category Cued Recall test in very mild Alzheimer's disease: discriminative validity and correlation with semantic memory functions.

Science.gov (United States)

Vogel, A; Mortensen, E L; Gade, A; Waldemar, G

2007-01-01

Episodic memory tests that measure cued recall may be particularly effective in the diagnosis of early Alzheimer's disease (AD) because they examine both episodic and semantic memory functions. The Category Cued Recall (CCR) test provides superordinate semantic cues at encoding and retrieval, and high discriminative validity has been claimed for this test. The aim of this study was to investigate the discriminative validity for this test when compared with the 10-word memory list from Alzheimer's Disease Assessment Scale (ADAS-cog) that measures free recall. The clinical diagnosis of AD was taken as the standard. It was also investigated whether the two episodic memory tests correlated with measures of semantic memory. The tests were administered to 35 patients with very mild AD (Mini Mental State Examination score >22) and 28 control subjects. Both tests had high sensitivity (>88%) with high specificity (>89%). One out of the five semantic memory tests was significantly correlated to performances on CCR, whereas delayed recall on the ADAS-cog memory test was significantly correlated to two semantic tests. In conclusion, the discriminative validity of the CCR test and the ADAS-cog memory test was equivalent in very mild AD. This may be because CCR did not tap more semantic processes, which are impaired in the earliest phases of AD, than a test of free recall.
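Sensitivity and specificity, the two figures used here to express discriminative validity, are simple proportions of correctly classified patients and controls. The sketch below uses the group sizes from the abstract (35 patients, 28 controls). The individual counts are not reported, so the true-positive and true-negative counts shown are hypothetical values that merely satisfy the stated ">88%" and ">89%" bounds.

```python
def sensitivity(true_pos, false_neg):
    """Proportion of patients correctly identified by the test."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Proportion of controls correctly identified as unimpaired."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts consistent with 35 patients and 28 controls
sens = sensitivity(true_pos=31, false_neg=4)
spec = specificity(true_neg=25, false_pos=3)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```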
Control and Non-Payload Communications (CNPC) Prototype Radio Validation Flight Test Report

Science.gov (United States)

Shalkhauser, Kurt A.; Ishac, Joseph A.; Iannicca, Dennis C.; Bretmersky, Steven C.; Smith, Albert E.

2017-01-01

This report provides an overview and results from the unmanned aircraft (UA) Control and Non-Payload Communications (CNPC) Generation 5 prototype radio validation flight test campaign. The radios used in the test campaign were developed under cooperative agreement NNC11AA01A between the NASA Glenn Research Center and Rockwell Collins, Inc., of Cedar Rapids, Iowa. Measurement results are presented for flight tests over hilly terrain, open water, and urban landscape, utilizing radio sets installed into a NASA aircraft and ground stations. Signal strength and frame loss measurement data are analyzed relative to time and aircraft position, specifically addressing the impact of line-of-sight terrain obstructions on CNPC data flow. Both the radio and flight test system are described.

Validation of 14C-urea breath test for diagnosis of Helicobacter pylori

International Nuclear Information System (INIS)

Mattar, Rejane; Silva, Fernando Marcuz; Alexandrino, Ana Maria; Laudanna, Antonio Atilio

1999-01-01

The aim of this study was to validate the 14C-urea breath test for use in diagnosis of Helicobacter pylori infection. Thirty H. pylori-positive patients (by histology) and thirty H. pylori-negative patients (by histology and anti-H. pylori IgG) entered the study. Fasting patients drank 5 µCi of 14C-urea in 20 ml of water. Breath samples were collected at 0, 5, 10, 15, 20 and 30 min. The difference in cpm values between the two groups was significant at all time points except time 0 (p 14C-urea breath test is highly accurate for Helicobacter pylori diagnosis. It is fast, simple and should be the non-invasive test used after treating Helicobacter pylori infection. (author)
Validity and power of association testing in family-based sampling designs: evidence for and against the common wisdom.

Science.gov (United States)

Knight, Stacey; Camp, Nicola J

2011-04-01

Current common wisdom posits that association analyses using family-based designs have inflated type 1 error rates (if relationships are ignored) and independent controls are more powerful than familial controls. We explore these suppositions. We show theoretically that family-based designs can have deflated type 1 error rates. Through simulation, we examine the validity and power of family designs for several scenarios: cases from randomly or selectively ascertained pedigrees; and familial or independent controls. Family structures considered are as follows: sibships, nuclear families, moderate-sized and extended pedigrees. Three methods were considered with the χ² test for trend: variance correction (VC), weighted (weights assigned to account for genetic similarity), and naïve (ignoring relatedness) as well as the Modified Quasi-likelihood Score (MQLS) test. Selectively ascertained pedigrees had similar levels of disease enrichment; random ascertainment had no such restriction. Data for 1,000 cases and 1,000 controls were created under the null and alternate models. The VC and MQLS methods were always valid. The naïve method was anti-conservative if independent controls were used and valid or conservative in designs with familial controls. The weighted association method was generally valid for independent controls, and was conservative for familial controls. With regard to power, independent controls were more powerful for small-to-moderate selectively ascertained pedigrees, but familial and independent controls were equivalent in the extended pedigrees and familial controls were consistently more powerful for all randomly ascertained pedigrees. These results suggest a more complex situation than previously assumed, which has important implications for study design and analysis. © 2011 Wiley-Liss, Inc.
Child abuse: validation of a questionnaire translated into Brazilian Portuguese

Directory of Open Access Journals (Sweden)

Glaucia Marengo

2013-04-01

Full Text Available This study sought to validate the Portuguese translation of a questionnaire on maltreatment of children and adolescents, developed by Russell et al. and to test its psychometric properties for use in Brazil. The original questionnaire was translated into Portuguese using a standardized forward-backward linguistic translation method. Both face and content validity were tested in a small pilot study (n = 8). In the main study, a convenience sample of 80 graduate dentistry students with different specialties, from Curitiba, PR, Brazil, were invited to complete the final Brazilian version of the questionnaire. Discriminant validity was assessed by comparing the results obtained from the questionnaire for different specialties (pediatric dentistry, for example). The respondents completed the questionnaire again after 4 weeks to evaluate test-retest reliability. The comparison of test versus retest questionnaire answers showed good agreement (kappa > 0.53, intraclass correlation > 0.84) for most questions. In regard to discriminant validity, a statistically significant difference was observed only in the experience and interest domains, in which pediatric dentists showed more experience with and interest in child abuse compared with dentists of other specialties (Mann-Whitney test, p < 0.05). The Brazilian version of the questionnaire was valid and reliable for assessing knowledge regarding child abuse by Portuguese-speaking dentists.
Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

Science.gov (United States)

Moss, Penny; Whitnell, Jasmine; Wright, Anthony

2016-01-01

Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot). Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10 cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%). Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56) years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%). Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add validity and
The timed "up and go" test : Reliability and validity in persons with unilateral lower limb amputation

NARCIS (Netherlands)

Schoppen, Tanneke; Boonstra, Antje; Groothoff, JW; de Vries, J; Goeken, LNH; Eisma, Willem

Objective: To determine the interrater and intrarater reliability and the validity of the Timed "up and go" test as a measure for physical mobility in elderly patients with an amputation of the lower extremity. Design: To test interrater reliability, the test was performed for two observers at
Microcomputer-based tests for repeated-measures: Metric properties and predictive validities

Science.gov (United States)

Kennedy, Robert S.; Baltzley, Dennis R.; Dunlap, William P.; Wilkes, Robert L.; Kuntz, Lois-Ann

1989-01-01

A menu of psychomotor and mental acuity tests was refined. Field applications of such a battery are, for example, a study of the effects of toxic agents or exotic environments on performance readiness, or the determination of fitness for duty. The key requirement of these tasks is that they be suitable for repeated-measures applications, and so questions of stability and reliability are a continuing, central focus of this work. After the initial (practice) session, seven replications of 14 microcomputer-based performance tests (32 measures) were completed by 37 subjects. Each test in the battery had previously been shown to stabilize in less than five 90-second administrations and to possess retest reliabilities greater than r = 0.707 for three minutes of testing. However, the tests had never all been administered together as a battery and they had never been self-administered. In order to provide predictive validity for intelligence measurement, the Wechsler Adult Intelligence Scale-Revised and the Wonderlic Personnel Test were obtained on the same subjects.
Concurrent and discriminant validity of the Star Excursion Balance Test for military personnel with lateral ankle sprain.

Science.gov (United States)

Bastien, Maude; Moffet, Hélène; Bouyer, Laurent; Perron, Marc; Hébert, Luc J; Leblond, Jean

2014-02-01

The Star Excursion Balance Test (SEBT) has frequently been used to measure motor control and residual functional deficits at different stages of recovery from lateral ankle sprain (LAS) in various populations. However, the validity of the measure used to characterize performance--the maximal reach distance (MRD) measured by visual estimation--is still unknown. To evaluate the concurrent validity of the MRD in the SEBT estimated visually vs the MRD measured with a 3D motion-capture system and evaluate and compare the discriminant validity of 2 MRD-normalization methods (by height or by lower-limb length) in participants with or without LAS (n = 10 per group). There is a high concurrent validity and a good degree of accuracy between the visual estimation measurement and the MRD gold-standard measurement for both groups and under all conditions. The Cohen d ratios between groups and MANOVA products were higher when computed from MRD data normalized by height. The results support the concurrent validity of visual estimation of the MRD and the use of the SEBT to evaluate motor control. Moreover, normalization of MRD data by height appears to increase the discriminant validity of this test.
Development and validation of real-time PCR tests for the identification of four Spodoptera species: Spodoptera eridania, Spodoptera frugiperda, Spodoptera littoralis, and Spodoptera litura (Lepidoptera: Noctuidae).

Science.gov (United States)

Van de Vossenberg, B T L H; Van der Straten, M J

2014-08-01

The genus Spodoptera comprises 31 species, 4 of which are listed as quarantine pests for the European Union: Spodoptera eridania (Cramer), Spodoptera frugiperda (Smith), Spodoptera littoralis (Boisduval), and Spodoptera litura (F.). In international trade, the earlier life stages (eggs and larvae) are the ones most frequently intercepted at the point of inspection, which limits the possibilities of morphological identification. To achieve rapid and reliable identification of all stages, we developed and validated four simplex real-time polymerase chain reaction identification tests based on the mitochondrial cytochrome b gene using dual-labeled hydrolysis probes. Method validation on dilutions of extracted DNA of the target organisms showed that low levels of template (up to 0.2-100 pg) can reliably be identified. No cross-reactivity was observed with 14 nontarget Spodoptera and 5 non-Spodoptera species in the specific Spodoptera tests. The tests proved to be repeatable, reproducible (both 100%), and robust. The new Spodoptera tests have proven to be suitable tools for routine identification of all life stages of S. eridania, S. frugiperda, S. littoralis, and S. litura.
Validity of the Jump-and-Reach Test in Subelite Adolescent Handball Players.

Science.gov (United States)

Muehlbauer, Thomas; Pabst, Jan; Granacher, Urs; Büsch, Dirk

2017-05-01

Muehlbauer, T, Pabst, J, Granacher, U, and Büsch, D. Validity of the jump-and-reach test in subelite adolescent handball players. J Strength Cond Res 31(5): 1282-1289, 2017. The primary purpose of this study was to examine concurrent validity of the jump-and-reach (JaR) test using the Vertec system and a criterion device (i.e., Optojump system). In separate subanalyses, we investigated the influence of gym floor condition and athletes' sex on the validity of vertical jump height. Four hundred forty subelite adolescent female (n = 222, mean age: 14 ± 1 year, age range: 13-15 years) and male (n = 218, mean age: 15 ± 1 year, age range: 14-16 years) handball players performed the JaR test in gyms with region or point elastic floors. Maximal vertical jump height was simultaneously assessed using the Vertec and the Optojump systems. In general, significantly higher jump heights were obtained for the Vertec compared with the Optojump system (11.2 cm, Δ31%, Cohen's d = 2.58). The subanalyses revealed significantly larger jump heights for the Vertec compared with the Optojump system irrespective of gym floor condition and players' sex. The association between Optojump- and Vertec-derived vertical jump heights amounted to rP = 0.84, with a coefficient of determination (R²) of 0.71. The subanalyses indicated significantly larger correlations in males (rP = 0.75, R² = 0.56) than in females (rP = 0.63, R² = 0.40). Yet, correlations were not significantly different between region (rP = 0.83, R² = 0.69) as opposed to point elastic floor (rP = 0.87, R² = 0.76). Our findings indicate that the 2 apparatuses cannot be used interchangeably. Consequently, gym floor and sex-specific regression equations were provided to estimate true (Optojump system) vertical jump height from Vertec-derived data.
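The value quoted next to each Pearson correlation in this abstract is consistent with the coefficient of determination being the square of the correlation. The following is a minimal sketch of that arithmetic check, using only the numbers quoted above; the dictionary labels and the function name are illustrative and do not come from the paper.

```python
# Pairs of (Pearson correlation rP, accompanying value) as quoted in the abstract.
# The labels are illustrative names chosen for this sketch.
reported = {
    "all players": (0.84, 0.71),
    "males": (0.75, 0.56),
    "females": (0.63, 0.40),
    "region elastic floor": (0.83, 0.69),
    "point elastic floor": (0.87, 0.76),
}


def matches_squared_correlation(r: float, quoted: float, tol: float = 0.005) -> bool:
    """Return True if `quoted` equals r squared to within two-decimal rounding."""
    return abs(r * r - quoted) <= tol


checks = {
    label: matches_squared_correlation(r, quoted)
    for label, (r, quoted) in reported.items()
}

for label, ok in checks.items():
    r, quoted = reported[label]
    print(f"{label}: rP = {r:.2f}, rP squared = {r * r:.4f}, quoted = {quoted:.2f}, consistent = {ok}")
```

Read this way, the overall association (rP = 0.84) corresponds to roughly 71% of the variance in one device's jump heights being shared with the other's.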
The development and validation of the Closed-set Mandarin Sentence (CMS) test.

Science.gov (United States)

Tao, Duo-Duo; Fu, Qian-Jie; Galvin, John J; Yu, Ya-Feng

2017-09-01

Matrix-styled sentence tests offer a closed-set paradigm that may be useful when evaluating speech intelligibility. Ideally, sentence test materials should reflect the distribution of phonemes within the target language. We developed and validated the Closed-set Mandarin Sentence (CMS) test to assess Mandarin speech intelligibility in noise. CMS test materials were selected to be familiar words and to represent the natural distribution of vowels, consonants, and lexical tones found in Mandarin Chinese. Ten key words in each of five categories (Name, Verb, Number, Color, and Fruit) were produced by a native Mandarin talker, resulting in a total of 50 words that could be combined to produce 100,000 unique sentences. Normative data were collected in 10 normal-hearing, adult Mandarin-speaking Chinese listeners using a closed-set test paradigm. Two test runs were conducted for each subject, and 20 sentences per run were randomly generated while ensuring that each word was presented only twice in each run. First, the levels of the words in each category were adjusted to produce equal intelligibility in noise. Test-retest reliability for word-in-sentence recognition was excellent according to Cronbach's alpha (0.952). After the category level adjustments, speech reception thresholds (SRTs) for sentences in noise, defined as the signal-to-noise ratio (SNR) that produced 50% correct whole sentence recognition, were adaptively measured by adjusting the SNR according to the correctness of response. The mean SRT was -7.9 (SE=0.41) and -8.1 (SE=0.34) dB for runs 1 and 2, respectively. The mean standard deviation across runs was 0.93 dB, and paired t-tests showed no significant difference between runs 1 and 2 (p=0.74) despite random sentences being generated for each run and each subject. The results suggest that the CMS provides a large stimulus set with which to repeatedly and reliably measure Mandarin-speaking listeners' speech understanding in noise using a closed-set paradigm.
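The counts in this abstract can be reproduced with a small sketch: five categories of ten words give 10^5 = 100,000 possible sentences, and a 20-sentence run uses 100 word slots, which is exactly two presentations of each of the 50 words. The code below is one hypothetical way to generate such a run, not the authors' procedure. Words are represented by placeholder indices 0-9 because the actual Mandarin word lists are not given here, and the function names are illustrative.

```python
import random
from collections import Counter

CATEGORIES = ["Name", "Verb", "Number", "Color", "Fruit"]  # from the abstract
WORDS_PER_CATEGORY = 10
SENTENCES_PER_RUN = 20
PRESENTATIONS_PER_WORD = SENTENCES_PER_RUN // WORDS_PER_CATEGORY  # 2


def count_unique_sentences() -> int:
    """One word per category, chosen independently: 10 ** 5 combinations."""
    return WORDS_PER_CATEGORY ** len(CATEGORIES)


def generate_run(rng: random.Random) -> list[tuple[int, ...]]:
    """Build one run in which every word index appears exactly twice per category.

    Assumption: each category column is shuffled independently, so a sentence
    could in principle repeat within a run; the abstract does not say whether
    the original procedure prevents that.
    """
    columns = []
    for _ in CATEGORIES:
        slots = list(range(WORDS_PER_CATEGORY)) * PRESENTATIONS_PER_WORD
        rng.shuffle(slots)
        columns.append(slots)
    # Sentence i takes the i-th slot from every category column.
    return [tuple(column[i] for column in columns) for i in range(SENTENCES_PER_RUN)]


def word_counts(run: list[tuple[int, ...]]) -> list[Counter]:
    """Count how often each word index occurs in each category position."""
    return [Counter(sentence[pos] for sentence in run) for pos in range(len(CATEGORIES))]


run = generate_run(random.Random(0))
print(count_unique_sentences())
print(run[0])
```

Shuffling a doubled word list per category satisfies the "each word twice" constraint by construction, without needing rejection sampling.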
Geant4 hadronic and electromagnetic validation tests in LHCb

CERN Document Server

Griffith, Peter Noel

2016-01-01

LHCb uses Geant4 to simulate the interactions of particles with the detector material and components. The simulation response can vary significantly due to modification of material description, of detector geometry, or of the Geant4 toolkit itself. Therefore, an extensive variety of tools have been developed to study the effects of Geant4 modification on the LHCb simulation framework and on stand-alone environments within the LHCb software infrastructure. These tools have proven to be very effective for investigating new and alternative models provided by Geant4, and also in identifying and fixing anomalous behaviours that arise from changes. The next goal is to have these validation tests run autonomously and periodically, alerting the relevant users when problems are detected. Quick and easy comparison of the results from different software versions and simulation models will be made possible through the web interface of the LHCb Performance and Regression testing system, LHCbPR.
A physical function test for use in the intensive care unit: validity, responsiveness, and predictive utility of the physical function ICU test (scored).

Science.gov (United States)

Denehy, Linda; de Morton, Natalie A; Skinner, Elizabeth H; Edbrooke, Lara; Haines, Kimberley; Warrillow, Stephen; Berney, Sue

2013-12-01

Several tests have recently been developed to measure changes in patient strength and functional outcomes in the intensive care unit (ICU). The original Physical Function ICU Test (PFIT) demonstrates reliability and sensitivity. The aims of this study were to further develop the original PFIT, to derive an interval score (the PFIT-s), and to test the clinimetric properties of the PFIT-s. A nested cohort study was conducted. One hundred forty-four and 116 participants performed the PFIT at ICU admission and discharge, respectively. Original test components were modified using principal component analysis. Rasch analysis examined the unidimensionality of the PFIT, and an interval score was derived. Correlations tested validity, and multiple regression analyses investigated predictive ability. Responsiveness was assessed using the effect size index (ESI), and the minimal clinically important difference (MCID) was calculated. The shoulder lift component was removed. Unidimensionality of combined admission and discharge PFIT-s scores was confirmed. The PFIT-s displayed moderate convergent validity with the Timed "Up & Go" Test (r=-.60), the Six-Minute Walk Test (r=.41), and the Medical Research Council (MRC) sum score (rho=.49). The ESI of the PFIT-s was 0.82, and the MCID was 1.5 points (interval scale range=0-10). A higher admission PFIT-s score was predictive of: an MRC score of ≥48, increased likelihood of discharge home, reduced likelihood of discharge to inpatient rehabilitation, and reduced acute care hospital length of stay. Scoring of sit-to-stand assistance required is subjective, and cadence cutpoints used may not be generalizable. The PFIT-s is a safe and inexpensive test of physical function with high clinical utility. It is valid, responsive to change, and predictive of key outcomes. It is recommended that the PFIT-s be adopted to test physical function in the ICU.
Reliability and construct validity of Yo-Yo tests in untrained and soccer-trained school-girls aged 9-16

DEFF Research Database (Denmark)

Póvoas, Susana C A; Castagna, Carlo; Soares, José Manuel da Costa

2016-01-01

Purpose: The reliability and construct validity of three age-adapted-intensity Yo-Yo tests were evaluated in untrained (n=67) vs. soccer-trained (n=65) 9-16-year-old school-girls. Methods: Tests were performed 7 days apart for reliability (9-11-year-old: Yo-Yo intermittent recovery level 1 children...... during test and retest. Conclusion: The Yo-Yo tests are reliable for determining intermittent-exercise capacity and %HRpeak for soccer players and untrained 9-16-year-old girls. They also possess construct validity with better performances for soccer players compared to untrained age-matched girls...
Validity of the rey visual design test in primary and secondary school children

NARCIS (Netherlands)

Wilhelm, P.; van Klink, M.; van Klink, M.

2007-01-01

The Rey Visual Design Learning Test (Rey, 1964, cited in Spreen & Strauss, 1991, Wilhelm, 2004) assesses immediate memory span, new learning, delayed recall and recognition for nonverbal material. Two studies are presented that focused on the construct validity of the RVDLT in primary and secondary
Reliability and Validity of the Hip Stability Isometric Test (HipSIT): A New Method to Assess Hip Posterolateral Muscle Strength.

Science.gov (United States)

Almeida, Gabriel Peixoto Leão; das Neves Rodrigues, Helena Larissa; de Freitas, Bruno Wesley; de Paula Lima, Pedro Olavo

2017-12-01

Study Design: Cross-sectional study. Background: The Hip Stability Isometric Test (HipSIT) evaluates the strength of the hip posterolateral stabilizers in a position that favors greater activation of the gluteus maximus and gluteus medius and lower activation of the tensor fascia lata. Objectives: To check the validity and reliability of the HipSIT and to evaluate the HipSIT in women with patellofemoral pain (PFP). Methods: The HipSIT was evaluated with a handheld dynamometer. During testing, the participants were sidelying, with their legs positioned at 45° of hip flexion and 90° of knee flexion. Participants were instructed to raise the knee of the upper leg while keeping the upper and lower heels in contact. To establish reliability and validity, 49 women were tested with the HipSIT by 2 different evaluators on day 1, and then again 7 days later. The strength of the hip extensors, abductors, and external rotators was also evaluated. Twenty women with unilateral PFP were also evaluated. Results: The HipSIT has excellent intrarater and interrater reliability. The standard error of measurement was 0.01 kgf/kg, and the minimal detectable change was 0.036 kgf/kg. The HipSIT showed good validity in isolated hip abduction, external rotation, and extension (P... strength deficits in women with PFP. J Orthop Sports Phys Ther 2017;47(12):906-913. Epub 9 Oct 2017. doi:10.2519/jospt.2017.7274.
Reliability, validity and description of timed performance of the Jebsen-Taylor Test in patients with muscular dystrophies.

Science.gov (United States)

Artilheiro, Mariana Cunha; Fávero, Francis Meire; Caromano, Fátima Aparecida; Oliveira, Acary de Souza Bulle; Carvas, Nelson; Voos, Mariana Callil; Sá, Cristina Dos Santos Cardoso de

2017-12-08

The Jebsen-Taylor Test evaluates upper limb function by measuring timed performance on everyday activities. The test is used to assess and monitor the progression of patients with Parkinson disease, cerebral palsy, stroke and brain injury. The objectives were to analyze the reliability, internal consistency and validity of the Jebsen-Taylor Test in people with Muscular Dystrophy and to describe and classify upper limb timed performance of people with Muscular Dystrophy. Fifty patients with Muscular Dystrophy were assessed. Non-dominant and dominant upper limb performances on the Jebsen-Taylor Test were filmed. Two raters evaluated timed performance for inter-rater reliability analysis. Test-retest reliability was investigated by using intraclass correlation coefficients. Internal consistency was assessed using the Cronbach alpha. Construct validity was assessed by comparing the Jebsen-Taylor Test with the Performance of Upper Limb. The internal consistency of the Jebsen-Taylor Test was good (Cronbach's α=0.98). Inter-rater reliability was very high (0.903-0.999), except for writing, with an intraclass correlation coefficient of 0.772-1.000. Strong correlations between the Jebsen-Taylor Test and the Performance of Upper Limb Module were found (rho=-0.712). The Jebsen-Taylor Test is a reliable and valid measure of timed performance for people with Muscular Dystrophy. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
Software Verification and Validation Test Report for the HEPA filter Differential Pressure Fan Interlock System

International Nuclear Information System (INIS)

ERMI, A.M.

2000-01-01

The HEPA Filter Differential Pressure Fan Interlock System PLC ladder logic software was tested using a Software Verification and Validation (V&V) Test Plan as required by the "Computer Software Quality Assurance Requirements". The purpose of this document is to report on the results of the software qualification
Bradykinesia-akinesia incoordination test: validating an online keyboard test of upper limb function.

Science.gov (United States)

Noyce, Alastair J; Nagy, Anna; Acharya, Shami; Hadavi, Shahrzad; Bestwick, Jonathan P; Fearnley, Julian; Lees, Andrew J; Giovannoni, Gavin

2014-01-01

The Bradykinesia Akinesia Incoordination (BRAIN) test is a computer keyboard-tapping task that was developed for use in assessing the effect of symptomatic treatment on motor function in Parkinson's disease (PD). An online version has now been designed for use in a wider clinical context and the research setting. Validation of the online BRAIN test was undertaken in 58 patients with Parkinson's disease (PD) and 93 age-matched, non-neurological controls. Kinesia scores (KS30, number of key taps in 30 seconds), akinesia times (AT30, mean dwell time on each key in milliseconds), incoordination scores (IS30, variance of travelling time between key presses) and dysmetria scores (DS30, accuracy of key presses) were compared between groups. These parameters were correlated against total motor scores and sub-scores from the Unified Parkinson's Disease Rating Scale (UPDRS). Mean KS30, AT30 and IS30 were significantly different between PD patients and controls (p≤0.0001). Sensitivity for 85% specificity was 50% for KS30, 40% for AT30 and 29% for IS30. KS30, AT30 and IS30 correlated significantly with UPDRS total motor scores (r = -0.53, r = 0.27 and r = 0.28 respectively) and motor UPDRS sub-scores. The reliability of KS30, AT30 and DS30 was good on repeated testing. The BRAIN test is a reliable, convenient test of upper limb motor function that can be used routinely in the outpatient clinic, at home and in clinical trials. In addition, it can be used as an objective longitudinal measurement of emerging motor dysfunction for the prediction of PD in at-risk cohorts.
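The three timing parameters defined in this abstract can be illustrated with a short sketch. It assumes a hypothetical log of (key_down_ms, key_up_ms) pairs, one per tap and ordered in time. "Travelling time" is taken to be the interval from releasing one key to pressing the next, and the population variance is used. Both choices are assumptions, since the abstract does not specify them, and the function name and log format are illustrative. DS30 is omitted because the abstract describes it only as the accuracy of key presses.

```python
from statistics import mean, pvariance


def brain_scores(taps: list[tuple[float, float]], window_ms: float = 30_000) -> dict:
    """Compute KS30, AT30 and IS30 from a hypothetical tap log.

    taps: time-ordered (key_down_ms, key_up_ms) pairs.
    Only taps that start inside the 30-second window are counted.
    """
    in_window = [(down, up) for down, up in taps if down < window_ms]

    # KS30: number of key taps in 30 seconds.
    ks30 = len(in_window)

    # AT30: mean dwell time on each key, in milliseconds.
    at30 = mean(up - down for down, up in in_window) if in_window else None

    # IS30: variance of the travelling time between consecutive key presses
    # (assumed here: release of one key to press of the next).
    travel = [
        next_down - prev_up
        for (_, prev_up), (next_down, _) in zip(in_window, in_window[1:])
    ]
    is30 = pvariance(travel) if travel else None

    return {"KS30": ks30, "AT30": at30, "IS30": is30}


# Made-up example log with four taps; the values are not study data.
example_taps = [(0, 100), (300, 420), (650, 750), (1000, 1110)]
scores = brain_scores(example_taps)
print(scores)
```

In this made-up log the dwell times are 100, 120, 100 and 110 ms and the travelling times are 200, 230 and 250 ms. Slower tapping would lower KS30, and more irregular tapping would raise IS30.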
Bradykinesia-akinesia incoordination test: validating an online keyboard test of upper limb function.

Directory of Open Access Journals (Sweden)

Alastair J Noyce

Full Text Available The Bradykinesia Akinesia Incoordination (BRAIN) test is a computer keyboard-tapping task that was developed for use in assessing the effect of symptomatic treatment on motor function in Parkinson's disease (PD). An online version has now been designed for use in a wider clinical context and the research setting. Validation of the online BRAIN test was undertaken in 58 patients with Parkinson's disease (PD) and 93 age-matched, non-neurological controls. Kinesia scores (KS30, number of key taps in 30 seconds), akinesia times (AT30, mean dwell time on each key in milliseconds), incoordination scores (IS30, variance of travelling time between key presses) and dysmetria scores (DS30, accuracy of key presses) were compared between groups. These parameters were correlated against total motor scores and sub-scores from the Unified Parkinson's Disease Rating Scale (UPDRS). Mean KS30, AT30 and IS30 were significantly different between PD patients and controls (p≤0.0001). Sensitivity for 85% specificity was 50% for KS30, 40% for AT30 and 29% for IS30. KS30, AT30 and IS30 correlated significantly with UPDRS total motor scores (r = -0.53, r = 0.27 and r = 0.28 respectively) and motor UPDRS sub-scores. The reliability of KS30, AT30 and DS30 was good on repeated testing. The BRAIN test is a reliable, convenient test of upper limb motor function that can be used routinely in the outpatient clinic, at home and in clinical trials. In addition, it can be used as an objective longitudinal measurement of emerging motor dysfunction for the prediction of PD in at-risk cohorts.
