WorldWideScience

Sample records for validity testing showed

  1. Validation and test report

    DEFF Research Database (Denmark)

    Pedersen, Jens Meldgaard; Andersen, T. Bull

    2012-01-01

    . As a consequence of extensive movement artefacts seen during dynamic contractions, the following validation and test report consists of a report that investigates the physiological responses to a static contraction in a standing and a supine position. Eight subjects performed static contractions of the ankle...

  2. Convergent validity test, construct validity test and external validity test of the David Liberman algorithm

    Directory of Open Access Journals (Sweden)

    David Maldavsky

    2013-08-01

    Full Text Available The author first exposes a complement of a previous test about convergent validity, then a construct validity test and finally an external validity test of the David Liberman algorithm.  The first part of the paper focused on a complementary aspect, the differential sensitivity of the DLA 1 in an external comparison (to other methods, and 2 in an internal comparison (between two ways of using the same method, the DLA.  The construct validity test exposes the concepts underlined to DLA, their operationalization and some corrections emerging from several empirical studies we carried out.  The external validity test examines the possibility of using the investigation of a single case and its relation with the investigation of a more extended sample.

  3. The validation of language tests

    African Journals Online (AJOL)

    KATEVG

    Stellenbosch Papers in Linguistics, Vol. ... validation is necessary because of the major impact which test results can have on the many ... Messick (1989: 20) introduces his much-quoted progressive matrix (cf. table 1), which ... argue that current accounts of validity only superficially address theories of measurement.

  4. Validation through model testing

    International Nuclear Information System (INIS)

    1995-01-01

    Geoval-94 is the third Geoval symposium arranged jointly by the OECD/NEA and the Swedish Nuclear Power Inspectorate. Earlier symposia in this series took place in 1987 and 1990. In many countries, the ongoing programmes to site and construct deep geological repositories for high and intermediate level nuclear waste are close to realization. A number of studies demonstrates the potential barrier function of the geosphere, but also that there are many unresolved issues. A key to these problems are the possibilities to gain knowledge by model testing with experiments and to increase confidence in models used for prediction. The sessions cover conclusions from the INTRAVAL-project, experiences from integrated experimental programs and underground research laboratories as well as the integration between performance assessment and site characterisation. Technical issues ranging from waste and buffer interactions with the rock to radionuclide migration in different geological media is addressed. (J.S.)

  5. Migraine patients consistently show abnormal vestibular bedside tests

    Directory of Open Access Journals (Sweden)

    Eliana Teixeira Maranhão

    2015-01-01

    Full Text Available Migraine and vertigo are common disorders, with lifetime prevalences of 16% and 7% respectively, and co-morbidity around 3.2%. Vestibular syndromes and dizziness occur more frequently in migraine patients. We investigated bedside clinical signs indicative of vestibular dysfunction in migraineurs.Objective To test the hypothesis that vestibulo-ocular reflex, vestibulo-spinal reflex and fall risk (FR responses as measured by 14 bedside tests are abnormal in migraineurs without vertigo, as compared with controls.Method Cross-sectional study including sixty individuals – thirty migraineurs, 25 women, 19-60 y-o; and 30 gender/age healthy paired controls.Results Migraineurs showed a tendency to perform worse in almost all tests, albeit only the Romberg tandem test was statistically different from controls. A combination of four abnormal tests better discriminated the two groups (93.3% specificity.Conclusion Migraine patients consistently showed abnormal vestibular bedside tests when compared with controls.

  6. Migraine patients consistently show abnormal vestibular bedside tests.

    Science.gov (United States)

    Maranhão, Eliana Teixeira; Maranhão-Filho, Péricles; Luiz, Ronir Raggio; Vincent, Maurice Borges

    2016-01-01

    Migraine and vertigo are common disorders, with lifetime prevalences of 16% and 7% respectively, and co-morbidity around 3.2%. Vestibular syndromes and dizziness occur more frequently in migraine patients. We investigated bedside clinical signs indicative of vestibular dysfunction in migraineurs. To test the hypothesis that vestibulo-ocular reflex, vestibulo-spinal reflex and fall risk (FR) responses as measured by 14 bedside tests are abnormal in migraineurs without vertigo, as compared with controls. Cross-sectional study including sixty individuals - thirty migraineurs, 25 women, 19-60 y-o; and 30 gender/age healthy paired controls. Migraineurs showed a tendency to perform worse in almost all tests, albeit only the Romberg tandem test was statistically different from controls. A combination of four abnormal tests better discriminated the two groups (93.3% specificity). Migraine patients consistently showed abnormal vestibular bedside tests when compared with controls.

  7. Construct Validity of Neuropsychological Tests in Schizophrenia.

    Science.gov (United States)

    Allen, Daniel N.; Aldarondo, Felito; Goldstein, Gerald; Huegel, Stephen G.; Gilbertson, Mark; van Kammen, Daniel P.

    1998-01-01

    The construct validity of neuropsychological tests in patients with schizophrenia was studied with 39 patients who were evaluated with a battery of six tests assessing attention, memory, and abstract reasoning abilities. Results support the construct validity of the neuropsychological tests in patients with schizophrenia. (SLD)

  8. Limonene hydroperoxide analogues show specific patch test reactions.

    Science.gov (United States)

    Christensson, Johanna Bråred; Hellsén, Staffan; Börje, Anna; Karlberg, Ann-Therese

    2014-05-01

    The fragrance terpene R-limonene is a very weak sensitizer, but forms allergenic oxidation products upon contact with air. The primary oxidation products of oxidized limonene, the hydroperoxides, have an important impact on the sensitizing potency of the oxidation mixture. One analogue, limonene-1-hydroperoxide, was experimentally shown to be a significantly more potent sensitizer than limonene-2-hydroperoxide in the local lymph node assay with non-pooled lymph nodes. To investigate the pattern of reactivity among consecutive dermatitis patients to two structurally closely related limonene hydroperoxides, limonene-1-hydroperoxide and limonene-2-hydroperoxide. Limonene-1-hydroperoxide, limonene-2-hydroperoxide, at 0.5% in petrolatum, and oxidized limonene 3.0% pet. were tested in 763 consecutive dermatitis patients. Of the tested materials, limonene-1-hydroperoxide gave most reactions, with 2.4% of the patients showing positive patch test reactions. Limonene-2-hydroperoxide and oxidized R-limonene gave 1.7% and 1.2% positive patch test reactions, respectively. Concomitant positive patch test reactions to other fragrance markers in the baseline series were frequently noted. The results are in accordance with the experimental studies, as limonene-1-hydroperoxide gave more positive patch test reactions in the tested patients than limonene-2-hydroperoxide. Furthermore, the results support the specificity of the allergenic activity of the limonene hydroperoxide analogues and the importance of oxidized limonene as a cause of contact allergy. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  9. Validity evidence based on test content.

    Science.gov (United States)

    Sireci, Stephen; Faulkner-Bond, Molly

    2014-01-01

    Validity evidence based on test content is one of the five forms of validity evidence stipulated in the Standards for Educational and Psychological Testing developed by the American Educational Research Association, American Psychological Association, and National Council on Measurement in Education. In this paper, we describe the logic and theory underlying such evidence and describe traditional and modern methods for gathering and analyzing content validity data. A comprehensive review of the literature and of the aforementioned Standards is presented. For educational tests and other assessments targeting knowledge and skill possessed by examinees, validity evidence based on test content is necessary for building a validity argument to support the use of a test for a particular purpose. By following the methods described in this article, practitioners have a wide arsenal of tools available for determining how well the content of an assessment is congruent with and appropriate for the specific testing purposes.

  10. Methodology for testing and validating knowledge bases

    Science.gov (United States)

    Krishnamurthy, C.; Padalkar, S.; Sztipanovits, J.; Purves, B. R.

    1987-01-01

    A test and validation toolset developed for artificial intelligence programs is described. The basic premises of this method are: (1) knowledge bases have a strongly declarative character and represent mostly structural information about different domains, (2) the conditions for integrity, consistency, and correctness can be transformed into structural properties of knowledge bases, and (3) structural information and structural properties can be uniformly represented by graphs and checked by graph algorithms. The interactive test and validation environment have been implemented on a SUN workstation.

  11. DTU PMU Laboratory Development - Testing and Validation

    OpenAIRE

    Garcia-Valle, Rodrigo; Yang, Guang-Ya; Martin, Kenneth E.; Nielsen, Arne Hejde; Østergaard, Jacob

    2010-01-01

    This is a report of the results of phasor measurement unit (PMU) laboratory development and testing done at the Centre for Electric Technology (CET), Technical University of Denmark (DTU). Analysis of the PMU performance first required the development of tools to convert the DTU PMU data into IEEE standard, and the validation is done for the DTU-PMU via a validated commercial PMU. The commercial PMU has been tested from the authors' previous efforts, where the response can be expected to foll...

  12. Validation of SSC using the FFTF natural-circulation tests

    International Nuclear Information System (INIS)

    Horak, W.C.; Guppy, J.G.; Kennett, R.J.

    1982-01-01

    As part of the Super System Code (SSC) validation program, the 100% power FFTF natural circulation test has been simulated using SSC. A detailed 19 channel, 2 loop model was used in SSC. Comparisons showed SSC calculations to be in good agreement with the Fast Flux Test Facility (FFTF), test data. Simulation of the test was obtained in real time

  13. Mollusc reproductive toxicity tests - Development and validation of test guidelines

    DEFF Research Database (Denmark)

    Ducrot, Virginie; Holbech, Henrik; Kinnberg, Karin Lund

    . Draft standard operating procedures (SOPs) have been designed based upon literature and expert knowledge from project partners. Pre-validation studies have been implemented to validate the proposed test conditions and identify issues in performing the SOPs and analyzing test results. Pre-validation work......The Organisation for Economic Cooperation and Development is promoting the development and validation of mollusc toxicity tests within its test guidelines programme, eventually aiming for the standardization of mollusc apical toxicity tests. Through collaborative work between academia, industry...... and stakeholders, this study aims to develop innovative partial life-cycle tests on the reproduction of the freshwater gastropods Potamopyrgus antipodarum and Lymnaea stagnalis, which are relevant candidate species for the standardization of mollusc apical toxicity tests assessing reprotoxic effects of chemicals...

  14. Validating a Spanish Developmental Spelling Test.

    Science.gov (United States)

    Ferroli, Lou; Krajenta, Marilyn

    The creation and validation of a Spanish version of an English developmental spelling test (DST) is described. An introductory section reviews related literature on the rationale for and construction of DSTs, spelling development in the early grades, and Spanish-English bilingual education. Differences between the English and Spanish test versions…

  15. DTU PMU Laboratory Development - Testing and Validation

    DEFF Research Database (Denmark)

    Garcia-Valle, Rodrigo; Yang, Guang-Ya; Martin, Kenneth E.

    2010-01-01

    This is a report of the results of phasor measurement unit (PMU) laboratory development and testing done at the Centre for Electric Technology (CET), Technical University of Denmark (DTU). Analysis of the PMU performance first required the development of tools to convert the DTU PMU data into IEEE...... standard, and the validation is done for the DTU-PMU via a validated commercial PMU. The commercial PMU has been tested from the authors' previous efforts, where the response can be expected to follow known patterns and provide confirmation about the test system to confirm the design and settings....... In a nutshell, having 2 PMUs that observe same signals provides validation of the operation and flags questionable results with more certainty. Moreover, the performance and accuracy of the DTU-PMU is tested acquiring good and precise results, when compared with a commercial phasor measurement device, PMU-1....

  16. IP validation in remote microelectronics testing

    Science.gov (United States)

    Osseiran, Adam; Eshraghian, Kamran; Lachowicz, Stefan; Zhao, Xiaoli; Jeffery, Roger; Robins, Michael

    2004-03-01

    This paper presents the test and validation of FPGA based IP using the concept of remote testing. It demonstrates how a virtual tester environment based on a powerful, networked Integrated Circuit testing facility, aimed to complement the emerging Australian microelectronics based research and development, can be employed to perform the tasks beyond the standard IC test. IC testing in production consists in verifying the tested products and eliminating defective parts. Defects could have a number of different causes, including process defects, process migration and IP design and implementation errors. One of the challenges in semiconductor testing is that while current fault models are used to represent likely faults (stuck-at, delay, etc.) in a global context, they do not account for all possible defects. Research in this field keeps growing but the high cost of ATE is preventing a large community from accessing test and verification equipment to validate innovative IP designs. For these reasons a world class networked IC teletest facility has been established in Australia under the support of the Commonwealth government. The facility is based on a state-of-the-art semiconductor tester operating as a virtual centre spanning Australia and accessible internationally. Through a novel approach the teletest network provides virtual access to the tester on which the DUT has previously been placed. The tester software is then accessible as if the designer is sitting next to the tester. This paper presents the approach used to test and validate FPGA based IPs using this remote test approach.

  17. Validation of measured friction by process tests

    DEFF Research Database (Denmark)

    Eriksen, Morten; Henningsen, Poul; Tan, Xincai

    The objective of sub-task 3.3 is to evaluate under actual process conditions the friction formulations determined by simulative testing. As regards task 3.3 the following tests have been used according to the original project plan: 1. standard ring test and 2. double cup extrusion test. The task...... has, however, been extended to include a number of new developed process tests: 3. forward rod extrusion test, 4. special ring test at low normal pressure, 5. spike test (especially developed for warm and hot forging). Validation of the measured friction values in cold forming from sub-task 3.1 has...... been made with forward rod extrusion, and very good agreement was obtained between the measured friction values in simulative testing and process testing....

  18. Validation testing of safety-critical software

    International Nuclear Information System (INIS)

    Kim, Hang Bae; Han, Jae Bok

    1995-01-01

    A software engineering process has been developed for the design of safety critical software for Wolsung 2/3/4 project to satisfy the requirements of the regulatory body. Among the process, this paper described the detail process of validation testing performed to ensure that the software with its hardware, developed by the design group, satisfies the requirements of the functional specification prepared by the independent functional group. To perform the tests, test facility and test software were developed and actual safety system computer was connected. Three kinds of test cases, i.e., functional test, performance test and self-check test, were programmed and run to verify each functional specifications. Test failures were feedback to the design group to revise the software and test results were analyzed and documented in the report to submit to the regulatory body. The test methodology and procedure were very efficient and satisfactory to perform the systematic and automatic test. The test results were also acceptable and successful to verify the software acts as specified in the program functional specification. This methodology can be applied to the validation of other safety-critical software. 2 figs., 2 tabs., 14 refs. (Author)

  19. Latency-Based and Psychophysiological Measures of Sexual Interest Show Convergent and Concurrent Validity.

    Science.gov (United States)

    Ó Ciardha, Caoilte; Attard-Johnson, Janice; Bindemann, Markus

    2018-04-01

    Latency-based measures of sexual interest require additional evidence of validity, as do newer pupil dilation approaches. A total of 102 community men completed six latency-based measures of sexual interest. Pupillary responses were recorded during three of these tasks and in an additional task where no participant response was required. For adult stimuli, there was a high degree of intercorrelation between measures, suggesting that tasks may be measuring the same underlying construct (convergent validity). In addition to being correlated with one another, measures also predicted participants' self-reported sexual interest, demonstrating concurrent validity (i.e., the ability of a task to predict a more validated, simultaneously recorded, measure). Latency-based and pupillometric approaches also showed preliminary evidence of concurrent validity in predicting both self-reported interest in child molestation and viewing pornographic material containing children. Taken together, the study findings build on the evidence base for the validity of latency-based and pupillometric measures of sexual interest.

  20. The validation of Huffaz Intelligence Test (HIT)

    Science.gov (United States)

    Rahim, Mohd Azrin Mohammad; Ahmad, Tahir; Awang, Siti Rahmah; Safar, Ajmain

    2017-08-01

    In general, a hafiz who can memorize the Quran has many specialties especially in respect to their academic performances. In this study, the theory of multiple intelligences introduced by Howard Gardner is embedded in a developed psychometric instrument, namely Huffaz Intelligence Test (HIT). This paper presents the validation and the reliability of HIT of some tahfiz students in Malaysia Islamic schools. A pilot study was conducted involving 87 huffaz who were randomly selected to answer the items in HIT. The analysis method used includes Partial Least Square (PLS) on reliability, convergence and discriminant validation. The study has validated nine intelligences. The findings also indicated that the composite reliabilities for the nine types of intelligences are greater than 0.8. Thus, the HIT is a valid and reliable instrument to measure the multiple intelligences among huffaz.

  1. ASTM Validates Air Pollution Test Methods

    Science.gov (United States)

    Chemical and Engineering News, 1973

    1973-01-01

    The American Society for Testing and Materials (ASTM) has validated six basic methods for measuring pollutants in ambient air as the first part of its Project Threshold. Aim of the project is to establish nationwide consistency in measuring pollutants; determining precision, accuracy and reproducibility of 35 standard measuring methods. (BL)

  2. Validating High-Stakes Testing Programs.

    Science.gov (United States)

    Kane, Michael

    2002-01-01

    Makes the point that the interpretations and use of high-stakes test scores rely on policy assumptions about what should be taught and the content standards and performance standards that should be applied. The assumptions built into an assessment need to be subjected to scrutiny and criticism if a strong case is to be made for the validity of the…

  3. Development and Validation of a Test for Bulimia.

    Science.gov (United States)

    Smith, Marcia C.; Thelen, Mark H.

    1984-01-01

    Developed the Bulimia Test (BULIT) based on responses of clinically identified females (N=18) and normal female college students (N=119) to preliminary test items. Results showed that the BULIT provided an objective, reliable, and valid measure by which to identify individuals with symptoms of bulimia. (Instrument is appended.) (LLL)

  4. Validation of Clinical Testing for Warfarin Sensitivity

    Science.gov (United States)

    Langley, Michael R.; Booker, Jessica K.; Evans, James P.; McLeod, Howard L.; Weck, Karen E.

    2009-01-01

    Responses to warfarin (Coumadin) anticoagulation therapy are affected by genetic variability in both the CYP2C9 and VKORC1 genes. Validation of pharmacogenetic testing for warfarin responses includes demonstration of analytical validity of testing platforms and of the clinical validity of testing. We compared four platforms for determining the relevant single nucleotide polymorphisms (SNPs) in both CYP2C9 and VKORC1 that are associated with warfarin sensitivity (Third Wave Invader Plus, ParagonDx/Cepheid Smart Cycler, Idaho Technology LightCycler, and AutoGenomics Infiniti). Each method was examined for accuracy, cost, and turnaround time. All genotyping methods demonstrated greater than 95% accuracy for identifying the relevant SNPs (CYP2C9 *2 and *3; VKORC1 −1639 or 1173). The ParagonDx and Idaho Technology assays had the shortest turnaround and hands-on times. The Third Wave assay was readily scalable to higher test volumes but had the longest hands-on time. The AutoGenomics assay interrogated the largest number of SNPs but had the longest turnaround time. Four published warfarin-dosing algorithms (Washington University, UCSF, Louisville, and Newcastle) were compared for accuracy for predicting warfarin dose in a retrospective analysis of a local patient population on long-term, stable warfarin therapy. The predicted doses from both the Washington University and UCSF algorithms demonstrated the best correlation with actual warfarin doses. PMID:19324988

  5. Continuous validation of ASTEC containment models and regression testing

    International Nuclear Information System (INIS)

    Nowack, Holger; Reinke, Nils; Sonnenkalb, Martin

    2014-01-01

    The focus of the ASTEC (Accident Source Term Evaluation Code) development at GRS is primarily on the containment module CPA (Containment Part of ASTEC), whose modelling is to a large extent based on the GRS containment code COCOSYS (COntainment COde SYStem). Validation is usually understood as the approval of the modelling capabilities by calculations of appropriate experiments done by external users different from the code developers. During the development process of ASTEC CPA, bugs and unintended side effects may occur, which leads to changes in the results of the initially conducted validation. Due to the involvement of a considerable number of developers in the coding of ASTEC modules, validation of the code alone, even if executed repeatedly, is not sufficient. Therefore, a regression testing procedure has been implemented in order to ensure that the initially obtained validation results are still valid with succeeding code versions. Within the regression testing procedure, calculations of experiments and plant sequences are performed with the same input deck but applying two different code versions. For every test-case the up-to-date code version is compared to the preceding one on the basis of physical parameters deemed to be characteristic for the test-case under consideration. In the case of post-calculations of experiments also a comparison to experimental data is carried out. Three validation cases from the regression testing procedure are presented within this paper. The very good post-calculation of the HDR E11.1 experiment shows the high quality modelling of thermal-hydraulics in ASTEC CPA. Aerosol behaviour is validated on the BMC VANAM M3 experiment, and the results show also a very good agreement with experimental data. Finally, iodine behaviour is checked in the validation test-case of the THAI IOD-11 experiment. Within this test-case, the comparison of the ASTEC versions V2.0r1 and V2.0r2 shows how an error was detected by the regression testing

  6. Construct Validity of the Nepalese School Leaving English Reading Test

    Science.gov (United States)

    Dawadi, Saraswati; Shrestha, Prithvi N.

    2018-01-01

    There has been a steady interest in investigating the validity of language tests in the last decades. Despite numerous studies on construct validity in language testing, there are not many studies examining the construct validity of a reading test. This paper reports on a study that explored the construct validity of the English reading test in…

  7. Validity of an Interactive Functional Reach Test.

    Science.gov (United States)

    Galen, Sujay S; Pardo, Vicky; Wyatt, Douglas; Diamond, Andrew; Brodith, Victor; Pavlov, Alex

    2015-08-01

    Videogaming platforms such as the Microsoft (Redmond, WA) Kinect(®) are increasingly being used in rehabilitation to improve balance performance and mobility. These gaming platforms do not have built-in clinical measures that offer clinically meaningful data. We have now developed software that will enable the Kinect sensor to assess a patient's balance using an interactive functional reach test (I-FRT). The aim of the study was to test the concurrent validity of the I-FRT and to establish the feasibility of implementing the I-FRT in a clinical setting. The concurrent validity of the I-FRT was tested among 20 healthy adults (mean age, 25.8±3.4 years; 14 women). The Functional Reach Test (FRT) was measured simultaneously by both the Kinect sensor using the I-FRT software and the Optotrak Certus(®) 3D motion-capture system (Northern Digital Inc., Waterloo, ON, Canada). The feasibility of implementing the I-FRT in a clinical setting was assessed by performing the I-FRT in 10 participants with mild balance impairments recruited from the outpatient physical therapy clinic (mean age, 55.8±13.5 years; four women) and obtaining their feedback using a NASA Task Load Index (NASA-TLX) questionnaire. There was moderate to good agreement between FRT measures made by the two measurement systems. The greatest agreement between the two measurement system was found with the Kinect sensor placed at a distance of 2.5 m [intraclass correlation coefficient (2,k)=0.786; PNASA/TLX questionnaire. FRT measures made using the Kinect sensor I-FRT software provides a valid clinical measure that can be used with the gaming platforms.

  8. Technique for unit testing of safety software verification and validation

    International Nuclear Information System (INIS)

    Li Duo; Zhang Liangju; Feng Junting

    2008-01-01

    The key issue arising from digitalization of the reactor protection system for nuclear power plant is how to carry out verification and validation (V and V), to demonstrate and confirm the software that performs reactor safety functions is safe and reliable. One of the most important processes for software V and V is unit testing, which verifies and validates the software coding based on concept design for consistency, correctness and completeness during software development. The paper shows a preliminary study on the technique for unit testing of safety software V and V, focusing on such aspects as how to confirm test completeness, how to establish test platform, how to develop test cases and how to carry out unit testing. The technique discussed here was successfully used in the work of unit testing on safety software of a digital reactor protection system. (authors)

  9. Unit testing, model validation, and biological simulation.

    Science.gov (United States)

    Sarma, Gopal P; Jacobs, Travis W; Watts, Mark D; Ghayoomie, S Vahid; Larson, Stephen D; Gerkin, Richard C

    2016-01-01

    The growth of the software industry has gone hand in hand with the development of tools and cultural practices for ensuring the reliability of complex pieces of software. These tools and practices are now acknowledged to be essential to the management of modern software. As computational models and methods have become increasingly common in the biological sciences, it is important to examine how these practices can accelerate biological software development and improve research quality. In this article, we give a focused case study of our experience with the practices of unit testing and test-driven development in OpenWorm, an open-science project aimed at modeling Caenorhabditis elegans. We identify and discuss the challenges of incorporating test-driven development into a heterogeneous, data-driven project, as well as the role of model validation tests, a category of tests unique to software which expresses scientific models.

  10. Differential Weighting of Items to Improve University Admission Test Validity

    Directory of Open Access Journals (Sweden)

    Eduardo Backhoff Escudero

    2001-05-01

    Full Text Available This paper gives an evaluation of different ways to increase university admission test criterion-related validity, by differentially weighting test items. We compared four methods of weighting multiple-choice items of the Basic Skills and Knowledge Examination (EXHCOBA: (1 punishing incorrect responses by a constant factor, (2 weighting incorrect responses, considering the levels of error, (3 weighting correct responses, considering the item’s difficulty, based on the Classic Measurement Theory, and (4 weighting correct responses, considering the item’s difficulty, based on the Item Response Theory. Results show that none of these methods increased the instrument’s predictive validity, although they did improve its concurrent validity. It was concluded that it is appropriate to score the test by simply adding up correct responses.

  11. Effort, symptom validity testing, performance validity testing and traumatic brain injury.

    Science.gov (United States)

    Bigler, Erin D

    2014-01-01

    To understand the neurocognitive effects of brain injury, valid neuropsychological test findings are paramount. This review examines the research on what has been referred to a symptom validity testing (SVT). Above a designated cut-score signifies a 'passing' SVT performance which is likely the best indicator of valid neuropsychological test findings. Likewise, substantially below cut-point performance that nears chance or is at chance signifies invalid test performance. Significantly below chance is the sine qua non neuropsychological indicator for malingering. However, the interpretative problems with SVT performance below the cut-point yet far above chance are substantial, as pointed out in this review. This intermediate, border-zone performance on SVT measures is where substantial interpretative challenges exist. Case studies are used to highlight the many areas where additional research is needed. Historical perspectives are reviewed along with the neurobiology of effort. Reasons why performance validity testing (PVT) may be better than the SVT term are reviewed. Advances in neuroimaging techniques may be key in better understanding the meaning of border zone SVT failure. The review demonstrates the problems with rigidity in interpretation with established cut-scores. A better understanding of how certain types of neurological, neuropsychiatric and/or even test conditions may affect SVT performance is needed.

  12. Thyroid-specific questions on work ability showed known-groups validity among Danes with thyroid diseases.

    Science.gov (United States)

    Nexo, Mette Andersen; Watt, Torquil; Bonnema, Steen Joop; Hegedüs, Laszlo; Rasmussen, Åse Krogh; Feldt-Rasmussen, Ulla; Bjorner, Jakob Bue

    2015-07-01

    We aimed to identify the best approach to work ability assessment in patients with thyroid disease by evaluating the factor structure, measurement equivalence, known-groups validity, and predictive validity of a broad set of work ability items. Based on the literature and interviews with thyroid patients, 24 work ability items were selected from previous questionnaires, revised, or developed anew. Items were tested among 632 patients with thyroid disease (non-toxic goiter, toxic nodular goiter, Graves' disease (with or without orbitopathy), autoimmune hypothyroidism, and other thyroid diseases), 391 of which had participated in a study 5 years previously. Responses to select items were compared to general population data. We used confirmatory factor analyses for categorical data, logistic regression analyses and tests of differential item function, and head-to-head comparisons of relative validity in distinguishing known groups. Although all work ability items loaded on a common factor, the optimal factor solution included five factors: role physical, role emotional, thyroid-specific limitations, work limitations (without disease attribution), and work performance. The scale on thyroid-specific limitations showed the most power in distinguishing clinical groups and time since diagnosis. A global single item proved useful for comparisons with the general population, and a thyroid-specific item predicted labor market exclusion within the next 5 years (OR 5.0, 95 % CI 2.7-9.1). Items on work limitations with attribution to thyroid disease were most effective in detecting impact on work ability and showed good predictive validity. Generic work ability items remain useful for general population comparisons.

  13. The validity of the Michigan Alcoholism Screening Test (MAST)

    DEFF Research Database (Denmark)

    Storgaard, H; Nielsen, S D; Gluud, C

    1994-01-01

    This review examines the validity of the Michigan Alcoholism Screening Test (MAST) as a screening instrument for alcohol problems. Studies that compare the MAST-questionnaire with other defined diagnostic criteria of alcohol problems were retrieved through MEDLINE and a cross-bibliographic check....... A total of 20 validity studies were included. The studies varied considerably regarding the prevalence of alcohol problems, the diagnostic criteria, and the examined patient categories. The MAST compared with other diagnostic criteria of alcohol problems gave validity measures with the following span...... and the specificities show substantial variations. The variables that seem to have the largest influence on the PVpos seem to be the prevalence of alcohol problems, the diagnostic method against which the MAST-questionnaire is validated, and the populations on which the MAST is applied. The MAST should in the future...

  14. Development and psychometric validation of the verbal affective memory test

    DEFF Research Database (Denmark)

    Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S

    2015-01-01

    . Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory...... psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.......We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non...

  15. Validity of selected cardiovascular field-based test among Malaysian ...

    African Journals Online (AJOL)

    Based on emerge obese problem among Malaysian, this research is formulated to validate published tests among healthy female adult. Selected test namely; 20 meter multi-stage shuttle run, 2.4km run test, 1 mile walk test and Harvard Step test were correlated with laboratory test (Bruce protocol) to find the criterion validity ...

  16. Validity of the Eating Attitude Test among Exercisers.

    Science.gov (United States)

    Lane, Helen J; Lane, Andrew M; Matheson, Hilary

    2004-12-01

    Theory testing and construct measurement are inextricably linked. To date, no published research has looked at the factorial validity of an existing eating attitude inventory for use with exercisers. The Eating Attitude Test (EAT) is a 26-item measure that yields a single index of disordered eating attitudes. The original factor analysis showed three interrelated factors: Dieting behavior (13-items), oral control (7-items), and bulimia nervosa-food preoccupation (6-items). The primary purpose of the study was to examine the factorial validity of the EAT among a sample of exercisers. The second purpose was to investigate relationships between eating attitudes scores and selected psychological constructs. In stage one, 598 regular exercisers completed the EAT. Confirmatory factor analysis (CFA) was used to test the single-factor, a three-factor model, and a four-factor model, which distinguished bulimia from food pre-occupation. CFA of the single-factor model (RCFI = 0.66, RMSEA = 0.10), the three-factor-model (RCFI = 0.74; RMSEA = 0.09) showed poor model fit. There was marginal fit for the 4-factor model (RCFI = 0.91, RMSEA = 0.06). Results indicated five-items showed poor factor loadings. After these 5-items were discarded, the three models were re-analyzed. CFA results indicated that the single-factor model (RCFI = 0.76, RMSEA = 0.10) and three-factor model (RCFI = 0.82, RMSEA = 0.08) showed poor fit. CFA results for the four-factor model showed acceptable fit indices (RCFI = 0.98, RMSEA = 0.06). Stage two explored relationships between EAT scores, mood, self-esteem, and motivational indices toward exercise in terms of self-determination, enjoyment and competence. Correlation results indicated that depressed mood scores positively correlated with bulimia and dieting scores. Further, dieting was inversely related with self-determination toward exercising. Collectively, findings suggest that a 21-item four-factor model shows promising validity coefficients among

  17. Community males show multiple-perpetrator rape proclivity: development and preliminary validation of an interest scale.

    Science.gov (United States)

    Alleyne, Emma; Gannon, Theresa A; Ó Ciardha, Caoilte; Wood, Jane L

    2014-02-01

    The literature on Multiple Perpetrator Rape (MPR) is scant; however, a significant proportion of sexual offending involves multiple perpetrators. In addition to the need for research with apprehended offenders of MPR, there is also a need to conduct research with members of the general public. Recent advances in the forensic literature have led to the development of self-report proclivity scales. These scales have enabled researchers to conduct evaluative studies sampling from members of the general public who may be perpetrators of sexual offenses and have remained undetected, or at highest risk of engaging in sexual offending. The current study describes the development and preliminary validation of the Multiple-Perpetrator Rape Interest Scale (M-PRIS), a vignette-based measure assessing community males' sexual arousal to MPR, behavioral propensity toward MPR and enjoyment of MPR. The findings show that the M-PRIS is a reliable measure of community males' sexual interest in MPR with high internal reliability and temporal stability. In a sample of university males we found that a large proportion (66%) did not emphatically reject an interest in MPR. We also found that rape-supportive cognitive distortions, antisocial attitudes, and high-risk sexual fantasies were predictors of sexual interest in MPR. We discuss these findings and the implications for further research employing proclivity measures referencing theory development and clinical practice.

  18. Validation of the Information/Communications Technology Literacy Test

    Science.gov (United States)

    2016-10-01

    Technical Report 1360 Validation of the Information /Communications Technology Literacy Test D. Matthew Trippe Human Resources Research...TITLE AND SUBTITLE Validation of the Information /Communications Technology Literacy Test 5a. CONTRACT OR GRANT NUMBER W91WAS-09-D-0013 5b...validate a measure of cyber aptitude, the Information /Communications Technology Literacy Test (ICTL), in predicting trainee performance in Information

  19. Optimal number of tests to achieve and validate product reliability

    International Nuclear Information System (INIS)

    Ahmed, Hussam; Chateauneuf, Alaa

    2014-01-01

    The reliability validation of engineering products and systems is mandatory for choosing the best cost-effective design among a series of alternatives. Decisions at early design stages have a large effect on the overall life cycle performance and cost of products. In this paper, an optimization-based formulation is proposed by coupling the costs of product design and validation testing, in order to ensure the product reliability with the minimum number of tests. This formulation addresses the question about the number of tests to be specified through reliability demonstration necessary to validate the product under appropriate confidence level. The proposed formulation takes into account the product cost, the failure cost and the testing cost. The optimization problem can be considered as a decision making system according to the hierarchy of structural reliability measures. The numerical examples show the interest of coupling design and testing parameters. - Highlights: • Coupled formulation for design and testing costs, with lifetime degradation. • Cost-effective testing optimization to achieve reliability target. • Solution procedure for nested aleatoric and epistemic variable spaces

  20. 15 CFR 995.27 - Format validation software testing.

    Science.gov (United States)

    2010-01-01

    ... of NOAA ENC Products § 995.27 Format validation software testing. Tests shall be performed verifying... specification. These tests may be combined with testing of the conversion software. ... 15 Commerce and Foreign Trade 3 2010-01-01 2010-01-01 false Format validation software testing...

  1. Certification Testing as an Illustration of Argument-Based Validation

    Science.gov (United States)

    Kane, Michael

    2004-01-01

    The theories of validity developed over the past 60 years are quite sophisticated, but the methodology of validity is not generally very effective. The validity evidence for major testing programs is typically much weaker than the evidence for more technical characteristics such as reliability. In addition, most validation efforts have a strong…

  2. Test validation of nuclear and fossil fuel control operators

    International Nuclear Information System (INIS)

    Moffie, D.J.

    1976-01-01

    To establish job relatedness, one must go through a procedure of concurrent and predictive validation. For concurrent validity a group of employees is tested and the test scores are related to performance concurrently or during the same time period. For predictive validity, individuals are tested but the results of these tests are not used at the time of employment. The tests are sealed and scored at a later date, and then related to job performance. Job performance data include ratings by supervisors, actual job performance indices, turnover, absenteeism, progress in training, etc. The testing guidelines also stipulate that content and construct validity can be used

  3. The risk of bias in systematic reviews tool showed fair reliability and good construct validity.

    Science.gov (United States)

    Bühn, Stefanie; Mathes, Tim; Prengel, Peggy; Wegewitz, Uta; Ostermann, Thomas; Robens, Sibylle; Pieper, Dawid

    2017-11-01

    There is a movement from generic quality checklists toward a more domain-based approach in critical appraisal tools. This study aimed to report on a first experience with the newly developed risk of bias in systematic reviews (ROBIS) tool and compare it with A Measurement Tool to Assess Systematic Reviews (AMSTAR), that is, the most common used tool to assess methodological quality of systematic reviews while assessing validity, reliability, and applicability. Validation study with four reviewers based on 16 systematic reviews in the field of occupational health. Interrater reliability (IRR) of all four raters was highest for domain 2 (Fleiss' kappa κ = 0.56) and lowest for domain 4 (κ = 0.04). For ROBIS, median IRR was κ = 0.52 (range 0.13-0.88) for the experienced pair of raters compared to κ = 0.32 (range 0.12-0.76) for the less experienced pair of raters. The percentage of "yes" scores of each review of ROBIS ratings was strongly correlated with the AMSTAR ratings (r s  = 0.76; P = 0.01). ROBIS has fair reliability and good construct validity to assess the risk of bias in systematic reviews. More validation studies are needed to investigate reliability and applicability, in particular. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Validation of Symptom Validity Tests Using a "Child-model" of Adult Cognitive Impairments

    NARCIS (Netherlands)

    Rienstra, A.; Spaan, P. E. J.; Schmand, B.

    2010-01-01

    Validation studies of symptom validity tests (SVTs) in children are uncommon. However, since children's cognitive abilities are not yet fully developed, their performance may provide additional support for the validity of these measures in adult populations. Four SVTs, the Test of Memory Malingering

  5. Validation of symptom validity tests using a "child-model" of adult cognitive impairments

    NARCIS (Netherlands)

    Rienstra, A.; Spaan, P.E.J.; Schmand, B.

    2010-01-01

    Validation studies of symptom validity tests (SVTs) in children are uncommon. However, since children’s cognitive abilities are not yet fully developed, their performance may provide additional support for the validity of these measures in adult populations. Four SVTs, the Test of Memory Malingering

  6. Was the Conconi test validated by sporting success, expert opinion ...

    African Journals Online (AJOL)

    Was the Conconi test validated by sporting success, expert opinion or good science? ... Open Access DOWNLOAD FULL TEXT ... Despite scientific evidence to the contrary, a popular incremental field test for endurance athletes (Conconi Test) ...

  7. [Psychometric validation of the telephone memory test].

    Science.gov (United States)

    Ortiz, T; Fernández, A; Martínez-Castillo, E; Maestú, F; Martínez-Arias, R; López-Ibor, J J

    1999-01-01

    Several pathologies (i.e. Alzheimer's disease) that courses with memory alterations, appears in a context of impaired cognitive status and mobility. In recent years, several investigations were carried out in order to design short batteries that detect those subjects under risk of dementia. Some of this batteries were also design to be administrated over the telephone, trying to overcome the accessibility limitations of this patients. In this paper we present a battery (called Autotest de Memoria) essentially composed by episodic and semantic memory tests, administered both over the telephone and face to face. This battery was employed in the cognitive assessment of healthy controls and subjects diagnosed as probable Alzheimer's disease patients. Results show the capability of this battery in order to discriminate patients and healthy controls, a great sensibility and specificity, and a nearly absolute parallelism of telephone and face to face administrations. These data led us to claim the usefulness and practicality of our so called Memoria>.

  8. Validity and Reliability of the Arabic Token Test for Children

    Science.gov (United States)

    Alkhamra, Rana A.; Al-Jazi, Aya B.

    2016-01-01

    Background: The Token Test for Children (2nd edition) (TTFC) is a measure for assessing receptive language. In this study we describe the translation process, validity and reliability of the Arabic Token Test for Children (A-TTFC). Aims: The aim of this study is to translate, validate and establish the reliability of the Arabic Token Test for…

  9. Conceptualizing Essay Tests' Reliability and Validity: From Research to Theory

    Science.gov (United States)

    Badjadi, Nour El Imane

    2013-01-01

    The current paper on writing assessment surveys the literature on the reliability and validity of essay tests. The paper aims to examine the two concepts in relationship with essay testing as well as to provide a snapshot of the current understandings of the reliability and validity of essay tests as drawn in recent research studies. Bearing in…

  10. Construction of Valid and Reliable Test for Assessment of Students

    Science.gov (United States)

    Osadebe, P. U.

    2015-01-01

    The study was carried out to construct a valid and reliable test in Economics for secondary school students. Two research questions were drawn to guide the establishment of validity and reliability for the Economics Achievement Test (EAT). It is a multiple choice objective test of five options with 100 items. A sample of 1000 students was randomly…

  11. Safe and secure South Africa. Vehicle landmine protection validation testing

    CSIR Research Space (South Africa)

    Reinecke, JD

    2008-11-01

    Full Text Available The objective of this paper is to provide an overview of vehicle landmine protection validation testing in South Africa. A short history of validation test standards is given, followed by a summary of current open test standards in general use...

  12. Validating the Interpretations and Uses of Test Scores

    Science.gov (United States)

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  13. Reliability, Validity and Factor Structure of Drug Abuse Screening Test

    Directory of Open Access Journals (Sweden)

    Sayed Hadi Sayed Alitabar

    2016-05-01

    Full Text Available Background and Objective: According to the increasing of substance use in the country, more researches about this phenomenon are necessary. This Study Investigates the Validity, Reliability and Confirmatory Factor Structure of the Drug Abuse Screening test (DAST. Materials and Methods: The Sample Consisted of 381 Patients (143 Women and 238 Men with a Multi-Stage Cluster Sampling of Areas 2, 6 and 12 of Tehran Were Selected from Each Region, 6 Randomly Selected Drug Rehabilitation Center. The DAST Was Used as Instrument. Divergent & Convergent Validity of this Scale Was Assessed with Problems Assessment for Substance Using Psychiatric Patients (PASUPP and Relapse Prediction Scale (RPS.Results: The DAST after the First Time Factor Structure of Using Confirmatory Factor Analysis Was Confirmed. The DAST Had a Good Internal Consistency (Cranach’s Alpha, and the Reliability of the Test Within a Week, 0.9, 0.8. Also this Scale Had a Positive Correlation with Problems Assessment for Substance Using Psychiatric Patients and Relapse Prediction Scale (P<0.01.Conclusion: The Overall Results Showed that the Drug Abuse Screening Test in Iranian Society Is Valid. It Can Be Said that Self-Report Scale Tool Is Useful for Research Purposes and Addiction.

  14. Testing and Validation of the Dynamic Inertia Measurement Method

    Science.gov (United States)

    Chin, Alexander W.; Herrera, Claudia Y.; Spivey, Natalie D.; Fladung, William A.; Cloutier, David

    2015-01-01

    The Dynamic Inertia Measurement (DIM) method uses a ground vibration test setup to determine the mass properties of an object using information from frequency response functions. Most conventional mass properties testing involves using spin tables or pendulum-based swing tests, which for large aerospace vehicles becomes increasingly difficult and time-consuming, and therefore expensive, to perform. The DIM method has been validated on small test articles but has not been successfully proven on large aerospace vehicles. In response, the National Aeronautics and Space Administration Armstrong Flight Research Center (Edwards, California) conducted mass properties testing on an "iron bird" test article that is comparable in mass and scale to a fighter-type aircraft. The simple two-I-beam design of the "iron bird" was selected to ensure accurate analytical mass properties. Traditional swing testing was also performed to compare the level of effort, amount of resources, and quality of data with the DIM method. The DIM test showed favorable results for the center of gravity and moments of inertia; however, the products of inertia showed disagreement with analytical predictions.

  15. Development and Validation of a Translation Test.

    Science.gov (United States)

    Ghonsooly, Behzad

    1993-01-01

    Translation testing methodology has been criticized for its subjective character. No real strides have so far been made in developing an objective translation test. In this paper, certain detailed procedures including various phases of pretesting have been performed to achieve objectivity and scorability in translation testing methodology. In…

  16. Enuresis, Firesetting, and Cruelty to Animals: Does the Ego Triad Show Predictive Validity?

    Science.gov (United States)

    Slavkin, Michael Lawrence

    2001-01-01

    The hypothesis tested in this study was that the presence of enuresis and cruelty to animals in juvenile firesetters would be significantly related to recidivistic firesetting. No relationship was found between firesetting recidivism and enuresis. However, juveniles who were identified as being cruel to animals were more likely to engage in…

  17. A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

    Science.gov (United States)

    Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

    2018-04-10

    To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading .3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.

  18. Development and validation of a theoretical test in basic laparoscopy

    DEFF Research Database (Denmark)

    Strandbygaard, Jeanett; Maagaard, Mathilde; Larsen, Christian Rifbjerg

    2013-01-01

    for first-year residents in obstetrics and gynecology. This study therefore aimed to develop and validate a framework for a theoretical knowledge test, a multiple-choice test, in basic theory related to laparoscopy. METHODS: The content of the multiple-choice test was determined by conducting informal...... conversational interviews with experts in laparoscopy. The subsequent relevance of the test questions was evaluated using the Delphi method involving regional chief physicians. Construct validity was tested by comparing test results from three groups with expected different clinical competence and knowledge.......001). Internal consistency (Cronbach's alpha) was 0.82. There was no evidence of differential item functioning between the three groups tested. CONCLUSIONS: A newly developed knowledge test in basic laparoscopy proved to have content and construct validity. The formula for the development and validation...

  19. Modeling Run Test Validity: A Meta-Analytic Approach

    National Research Council Canada - National Science Library

    Vickers, Ross

    2002-01-01

    .... This study utilized data from 166 samples (N = 5,757) to test the general hypothesis that differences in testing methods could account for the cross-situational variation in validity. Only runs >2 km...

  20. Validation of the Vanderbilt Holistic Face Processing Test

    OpenAIRE

    Wang, Chao-Chih; Ross, David A.; Gauthier, Isabel; Richler, Jennifer J.

    2016-01-01

    The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the ...

  1. Validation of the Vanderbilt Holistic Face Processing Test.

    OpenAIRE

    Chao-Chih Wang; Chao-Chih Wang; David Andrew Ross; Isabel Gauthier; Jennifer Joanna Richler

    2016-01-01

    The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the ...

  2. Validity and Reliability of a Medicine Ball Explosive Power Test.

    Science.gov (United States)

    Stockbrugger, Barry A.; Haennel, Robert G.

    2001-01-01

    Evaluated the validity and reliability of a medicine ball throw test to evaluate explosive power. Data on competitive sand volleyball players who performed a medicine ball throw and a standard countermovement jump indicated that the medicine ball throw test was a valid and reliable way to assess explosive power for an analogous total-body movement…

  3. RESEM-CA: Validation and testing

    Energy Technology Data Exchange (ETDEWEB)

    Pal, Vineeta; Carroll, William L.; Bourassa, Norman

    2002-09-01

    This report documents the results of an extended comparison of RESEM-CA energy and economic performance predictions with the recognized benchmark tool DOE2.1E to determine the validity and effectiveness of this tool for retrofit design and analysis. The analysis was a two part comparison of patterns of (1) monthly and annual energy consumption of a simple base-case building and controlled variations in it to explore the predictions of load components of each program, and (2) a simplified life-cycle cost analysis of the predicted effects of selected Energy Conservation Measures (ECMs). The study tries to analyze and/or explain the differences that were observed. On the whole, this validation study indicates that RESEM is a promising tool for retrofit analysis. As a result of this study some factors (incident solar radiation, outside air film coefficient, IR radiation) have been identified where there is a possibility of algorithmic improvements. These would have to be made in a way that does not sacrifice the speed of the tool, necessary for extensive parametric search of optimum ECM measures.

  4. Validation of Cardiovascular Parameters during NASA's Functional Task Test

    Science.gov (United States)

    Arzeno, N. M.; Stenger, M. B.; Bloomberg, J. J.; Platts, S. H.

    2009-01-01

    Microgravity exposure causes physiological deconditioning and impairs crewmember task performance. The Functional Task Test (FTT) is designed to correlate these physiological changes to performance in a series of operationally-relevant tasks. One of these, the Recovery from Fall/Stand Test (RFST), tests both the ability to recover from a prone position and cardiovascular responses to orthostasis. PURPOSE: Three minutes were chosen for the duration of this test, yet it is unknown if this is long enough to induce cardiovascular responses similar to the operational 5 min stand test. The purpose of this study was to determine the validity and reliability of heart rate variability (HRV) analysis of a 3 min stand and to examine the effect of spaceflight on these measures. METHODS: To determine the validity of using 3 vs. 5 min of standing to assess HRV, ECG was collected from 7 healthy subjects who participated in a 6 min RFST. Mean R-R interval (RR) and spectral HRV were measured in minutes 0-3 and 0-5 following the heart rate transient due to standing. Significant differences between the segments were determined by a paired t-test. To determine the reliability of the 3-min stand test, 13 healthy subjects completed 3 trials of the FTT on separate days, including the RFST with a 3 min stand. Analysis of variance (ANOVA) was performed on the HRV measures. One crewmember completed the FTT before a 14-day mission, on landing day (R+0) and one (R+1) day after returning to Earth. RESULTS VALIDITY: HRV measures reflecting autonomic activity were not significantly different during the 0-3 and 0-5 min segments. RELIABILITY: The average coefficient of variation for RR, systolic (SBP) and diastolic blood pressures during the RFST were less than 8% for the 3 sessions. ANOVA results yielded a greater inter-subject variability (p0.05) for HRV in the RFST. SPACEFLIGHT: Lower RR and higher SBP were observed on R+0 in rest and stand. On R+1, both RR and SBP trended towards preflight

  5. VALIDITY OF THE EATING ATTITUDE TEST AMONG EXERCISERS

    Directory of Open Access Journals (Sweden)

    Hilary Matheson

    2004-12-01

    Full Text Available Theory testing and construct measurement are inextricably linked. To date, no published research has looked at the factorial validity of an existing eating attitude inventory for use with exercisers. The Eating Attitude Test (EAT is a 26-item measure that yields a single index of disordered eating attitudes. The original factor analysis showed three interrelated factors: Dieting behavior (13-items, oral control (7-items, and bulimia nervosa-food preoccupation (6-items. The primary purpose of the study was to examine the factorial validity of the EAT among a sample of exercisers. The second purpose was to investigate relationships between eating attitudes scores and selected psychological constructs. In stage one, 598 regular exercisers completed the EAT. Confirmatory factor analysis (CFA was used to test the single-factor, a three-factor model, and a four-factor model, which distinguished bulimia from food pre-occupation. CFA of the single-factor model (RCFI = 0.66, RMSEA = 0.10, the three-factor-model (RCFI = 0.74; RMSEA = 0.09 showed poor model fit. There was marginal fit for the 4-factor model (RCFI = 0.91, RMSEA = 0.06. Results indicated five-items showed poor factor loadings. After these 5-items were discarded, the three models were re-analyzed. CFA results indicated that the single-factor model (RCFI = 0.76, RMSEA = 0.10 and three-factor model (RCFI = 0.82, RMSEA = 0.08 showed poor fit. CFA results for the four-factor model showed acceptable fit indices (RCFI = 0.98, RMSEA = 0.06. Stage two explored relationships between EAT scores, mood, self-esteem, and motivational indices toward exercise in terms of self-determination, enjoyment and competence. Correlation results indicated that depressed mood scores positively correlated with bulimia and dieting scores. Further, dieting was inversely related with self-determination toward exercising. Collectively, findings suggest that a 21-item four-factor model shows promising validity coefficients

  6. Validity of the ISUOG basic training test

    DEFF Research Database (Denmark)

    Hillerup, Niels Emil; Tabor, Ann; Konge, Lars

    2018-01-01

    A certain level of theoretical knowledge is required when performing basic obstetrical and gynecological ultrasound. To assess the adequacy of trainees' basic theoretical knowledge, the International Society of Ultrasound in Obstetrics and Gynecology (ISUOG) has developed a theoretical test of 49...... Multiple Choice Questionnaire (MCQ) items for their basic training courses....

  7. Validation of qualitative microbiological test methods

    NARCIS (Netherlands)

    IJzerman-Boon, Pieta C.; van den Heuvel, Edwin R.

    2015-01-01

    This paper considers a statistical model for the detection mechanism of qualitative microbiological test methods with a parameter for the detection proportion (the probability to detect a single organism) and a parameter for the false positive rate. It is demonstrated that the detection proportion

  8. Construct Validity of Physical Fitness Tests

    Science.gov (United States)

    2011-02-03

    Powers, S. K., Lawler, J., Ayers, D., & Stuart, M. K. (1991). Physiological correlates to 800 meter running performance. Journal of Sports Medicine and... isokinetic tests. Journal of Sports Medicine and Physical Fitness, 36, 169-177. *Myers, D. C., Gebhardt, D. L., Crump, C.E., & Fleishman, E. A. (1984

  9. French validation of the internet addiction test.

    Science.gov (United States)

    Khazaal, Yasser; Billieux, Joël; Thorens, Gabriel; Khan, Riaz; Louati, Youssr; Scarlatti, Elisa; Theintz, Florence; Lederrey, Jerome; Van Der Linden, Martial; Zullino, Daniele

    2008-12-01

    The main goal of the present study is to investigate the psychometric properties of a French version of the Internet Addiction Test (IAT) and to assess its relationship with both time spent on Internet and online gaming. The French version of the Young's Internet Addiction Test (IAT) was administered to a sample of 246 adults. Exploratory and confirmatory analyses were carried out. We discovered that a one-factor model of the IAT has good psychometric properties and fits the data well, which is not the case of a six-factor model as found in previous studies using exploratory methods. Correlation analysis revealed positive significant relationships between IAT scores and both the daily duration of Internet use and the fact of being an online player. In addition, younger people scored higher on the IAT. The one-factor model found in this study has to be replicated in other IAT language versions.

  10. Validation of RNAi Silencing Efficiency Using Gene Array Data shows 18.5% Failure Rate across 429 Independent Experiments

    Directory of Open Access Journals (Sweden)

    Gyöngyi Munkácsy

    2016-01-01

    Full Text Available No independent cross-validation of success rate for studies utilizing small interfering RNA (siRNA for gene silencing has been completed before. To assess the influence of experimental parameters like cell line, transfection technique, validation method, and type of control, we have to validate these in a large set of studies. We utilized gene chip data published for siRNA experiments to assess success rate and to compare methods used in these experiments. We searched NCBI GEO for samples with whole transcriptome analysis before and after gene silencing and evaluated the efficiency for the target and off-target genes using the array-based expression data. Wilcoxon signed-rank test was used to assess silencing efficacy and Kruskal–Wallis tests and Spearman rank correlation were used to evaluate study parameters. All together 1,643 samples representing 429 experiments published in 207 studies were evaluated. The fold change (FC of down-regulation of the target gene was above 0.7 in 18.5% and was above 0.5 in 38.7% of experiments. Silencing efficiency was lowest in MCF7 and highest in SW480 cells (FC = 0.59 and FC = 0.30, respectively, P = 9.3E−06. Studies utilizing Western blot for validation performed better than those with quantitative polymerase chain reaction (qPCR or microarray (FC = 0.43, FC = 0.47, and FC = 0.55, respectively, P = 2.8E−04. There was no correlation between type of control, transfection method, publication year, and silencing efficiency. Although gene silencing is a robust feature successfully cross-validated in the majority of experiments, efficiency remained insufficient in a significant proportion of studies. Selection of cell line model and validation method had the highest influence on silencing proficiency.

  11. Automated Vision Test Development and Validation

    Science.gov (United States)

    2016-11-01

    crystal display monitor (NEC Multisync, P232W) at 1920x1080 resolution. Proper calibration was confirmed using a spot photometer/colorimeter (X-Rite i1...visual input to the right and left eye was achieved using liquid crystal display shuttered glasses (NVIDIA 3D Vision 2). The stereo target (Figure 4...threshold on the automated tasks. • Subjects had a lower (better) threshold on color testing for all cone types using the OCCT due to a ceiling

  12. The Sandia MEMS Passive Shock Sensor : FY08 testing for functionality, model validation, and technology readiness.

    Energy Technology Data Exchange (ETDEWEB)

    Walraven, Jeremy Allen; Blecke, Jill; Baker, Michael Sean; Clemens, Rebecca C.; Mitchell, John Anthony; Brake, Matthew Robert; Epp, David S.; Wittwer, Jonathan W.

    2008-10-01

    This report summarizes the functional, model validation, and technology readiness testing of the Sandia MEMS Passive Shock Sensor in FY08. Functional testing of a large number of revision 4 parts showed robust and consistent performance. Model validation testing helped tune the models to match data well and identified several areas for future investigation related to high frequency sensitivity and thermal effects. Finally, technology readiness testing demonstrated the integrated elements of the sensor under realistic environments.

  13. Development and Validation of a Dissolution Test Method for ...

    African Journals Online (AJOL)

    Purpose: To develop and validate a dissolution test method for dissolution release of artemether and lumefantrine from tablets. Methods: A single dissolution method for evaluating the in vitro release of artemether and lumefantrine from tablets was developed and validated. The method comprised of a dissolution medium of ...

  14. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    Science.gov (United States)

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test[R] (GMAT[R]) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  15. AULA virtual reality test as an attention measure: convergent validity with Conners' Continuous Performance Test.

    Science.gov (United States)

    Díaz-Orueta, Unai; Garcia-López, Cristina; Crespo-Eguílaz, Nerea; Sánchez-Carpintero, Rocío; Climent, Gema; Narbona, Juan

    2014-01-01

    The majority of neuropsychological tests used to evaluate attention processes in children lack ecological validity. The AULA Nesplora (AULA) is a continuous performance test, developed in a virtual setting, very similar to a school classroom. The aim of the present study is to analyze the convergent validity between the AULA and the Continuous Performance Test (CPT) of Conners. The AULA and CPT were administered correlatively to 57 children, aged 6-16 years (26.3% female) with average cognitive ability (IQ mean = 100.56, SD = 10.38) who had a diagnosis of attention deficit/hyperactivity disorder (ADHD) according to DSM-IV-TR criteria. Spearman correlations analyses were conducted among the different variables. Significant correlations were observed between both tests in all the analyzed variables (omissions, commissions, reaction time, and variability of reaction time), including for those measures of the AULA based on different sensorial modalities, presentation of distractors, and task paradigms. Hence, convergent validity between both tests was confirmed. Moreover, the AULA showed differences by gender and correlation to Perceptual Reasoning and Working Memory indexes of the WISC-IV, supporting the relevance of IQ measures in the understanding of cognitive performance in ADHD. In addition, the AULA (but not Conners' CPT) was able to differentiate between ADHD children with and without pharmacological treatment for a wide range of measures related to inattention, impulsivity, processing speed, motor activity, and quality of attention focus. Additional measures and advantages of the AULA versus Conners' CPT are discussed.

  16. A set of pathological tests to validate new finite elements

    Indian Academy of Sciences (India)

    M. Senthilkumar (Newgen Imaging) 1461 1996 Oct 15 13:05:22

    The finite element method entails several approximations. Hence it ... researchers have designed several pathological tests to validate any new finite element. The .... Three dimensional thick shell elements using a hybrid/mixed formu- lation.

  17. Validation test case generation based on safety analysis ontology

    International Nuclear Information System (INIS)

    Fan, Chin-Feng; Wang, Wen-Shing

    2012-01-01

    Highlights: ► Current practice in validation test case generation for nuclear system is mainly ad hoc. ► This study designs a systematic approach to generate validation test cases from a Safety Analysis Report. ► It is based on a domain-specific ontology. ► Test coverage criteria have been defined and satisfied. ► A computerized toolset has been implemented to assist the proposed approach. - Abstract: Validation tests in the current nuclear industry practice are typically performed in an ad hoc fashion. This study presents a systematic and objective method of generating validation test cases from a Safety Analysis Report (SAR). A domain-specific ontology was designed and used to mark up a SAR; relevant information was then extracted from the marked-up document for use in automatically generating validation test cases that satisfy the proposed test coverage criteria; namely, single parameter coverage, use case coverage, abnormal condition coverage, and scenario coverage. The novelty of this technique is its systematic rather than ad hoc test case generation from a SAR to achieve high test coverage.

  18. Validity and reliability of the NAB Naming Test.

    Science.gov (United States)

    Sachs, Bonnie C; Rush, Beth K; Pedraza, Otto

    2016-05-01

    Confrontation naming is commonly assessed in neuropsychological practice, but few standardized measures of naming exist and those that do are susceptible to the effects of education and culture. The Neuropsychological Assessment Battery (NAB) Naming Test is a 31-item measure used to assess confrontation naming. Despite adequate psychometric information provided by the test publisher, there has been limited independent validation of the test. In this study, we investigated the convergent and discriminant validity, internal consistency, and alternate forms reliability of the NAB Naming Test in a sample of adults (Form 1: n = 247, Form 2: n = 151) clinically referred for neuropsychological evaluation. Results indicate adequate-to-good internal consistency and alternate forms reliability. We also found strong convergent validity as demonstrated by relationships with other neurocognitive measures. We found preliminary evidence that the NAB Naming Test demonstrates a more pronounced ceiling effect than other commonly used measures of naming. To our knowledge, this represents the largest published independent validation study of the NAB Naming Test in a clinical sample. Our findings suggest that the NAB Naming Test demonstrates adequate validity and reliability and merits consideration in the test arsenal of clinical neuropsychologists.

  19. Investigating CSI: portrayals of DNA testing on a forensic crime show and their potential effects.

    Science.gov (United States)

    Ley, Barbara L; Jankowski, Natalie; Brewer, Paul R

    2012-01-01

    The popularity of forensic crime shows such as CSI has fueled debate about their potential social impact. This study considers CSI's potential effects on public understandings regarding DNA testing in the context of judicial processes, the policy debates surrounding crime laboratory procedures, and the forensic science profession, as well as an effect not discussed in previous accounts: namely, the show's potential impact on public understandings of DNA and genetics more generally. To develop a theoretical foundation for research on the "CSI effect," it draws on cultivation theory, social cognitive theory, and audience reception studies. It then uses content analysis and textual analysis to illuminate how the show depicts DNA testing. The results demonstrate that CSI tends to depict DNA testing as routine, swift, useful, and reliable and that it echoes broader discourses about genetics. At times, however, the show suggests more complex ways of thinking about DNA testing and genetics.

  20. Validation of Helicopter Gear Condition Indicators Using Seeded Fault Tests

    Science.gov (United States)

    Dempsey, Paula; Brandon, E. Bruce

    2013-01-01

    A "seeded fault test" in support of a rotorcraft condition based maintenance program (CBM), is an experiment in which a component is tested with a known fault while health monitoring data is collected. These tests are performed at operating conditions comparable to operating conditions the component would be exposed to while installed on the aircraft. Performance of seeded fault tests is one method used to provide evidence that a Health Usage Monitoring System (HUMS) can replace current maintenance practices required for aircraft airworthiness. Actual in-service experience of the HUMS detecting a component fault is another validation method. This paper will discuss a hybrid validation approach that combines in service-data with seeded fault tests. For this approach, existing in-service HUMS flight data from a naturally occurring component fault will be used to define a component seeded fault test. An example, using spiral bevel gears as the targeted component, will be presented. Since the U.S. Army has begun to develop standards for using seeded fault tests for HUMS validation, the hybrid approach will be mapped to the steps defined within their Aeronautical Design Standard Handbook for CBM. This paper will step through their defined processes, and identify additional steps that may be required when using component test rig fault tests to demonstrate helicopter CI performance. The discussion within this paper will provide the reader with a better appreciation for the challenges faced when defining a seeded fault test for HUMS validation.

  1. Test-driven verification/validation of model transformations

    Institute of Scientific and Technical Information of China (English)

    László LENGYEL; Hassan CHARAF

    2015-01-01

    Why is it important to verify/validate model transformations? The motivation is to improve the quality of the trans-formations, and therefore the quality of the generated software artifacts. Verified/validated model transformations make it possible to ensure certain properties of the generated software artifacts. In this way, verification/validation methods can guarantee different requirements stated by the actual domain against the generated/modified/optimized software products. For example, a verified/ validated model transformation can ensure the preservation of certain properties during the model-to-model transformation. This paper emphasizes the necessity of methods that make model transformation verified/validated, discusses the different scenarios of model transformation verification and validation, and introduces the principles of a novel test-driven method for verifying/ validating model transformations. We provide a solution that makes it possible to automatically generate test input models for model transformations. Furthermore, we collect and discuss the actual open issues in the field of verification/validation of model transformations.

  2. The Validation of NAA Method Used as Test Method in Serpong NAA Laboratory

    International Nuclear Information System (INIS)

    Rina-Mulyaningsih, Th.

    2004-01-01

    The Validation Of NAA Method Used As Test Method In Serpong NAA Laboratory. NAA Method is a non standard testing method. The testing laboratory shall validate its using method to ensure and confirm that it is suitable with application. The validation of NAA methods have been done with the parameters of accuracy, precision, repeatability and selectivity. The NIST 1573a Tomato Leaves, NIES 10C Rice flour unpolished and standard elements were used in this testing program. The result of testing with NIST 1573a showed that the elements of Na, Zn, Al and Mn are met from acceptance criteria of accuracy and precision, whereas Co is rejected. The result of testing with NIES 10C showed that Na and Zn elements are met from acceptance criteria of accuracy and precision, but Mn element is rejected. The result of selectivity test showed that the value of quantity is between 0.1-2.5 μg, depend on the elements. (author)

  3. Valid methods: the quality assurance of test method development, validation, approval, and transfer for veterinary testing laboratories.

    Science.gov (United States)

    Wiegers, Ann L

    2003-07-01

    Third-party accreditation is a valuable tool to demonstrate a laboratory's competence to conduct testing. Accreditation, internationally and in the United States, has been discussed previously. However, accreditation is only I part of establishing data credibility. A validated test method is the first component of a valid measurement system. Validation is defined as confirmation by examination and the provision of objective evidence that the particular requirements for a specific intended use are fulfilled. The international and national standard ISO/IEC 17025 recognizes the importance of validated methods and requires that laboratory-developed methods or methods adopted by the laboratory be appropriate for the intended use. Validated methods are therefore required and their use agreed to by the client (i.e., end users of the test results such as veterinarians, animal health programs, and owners). ISO/IEC 17025 also requires that the introduction of methods developed by the laboratory for its own use be a planned activity conducted by qualified personnel with adequate resources. This article discusses considerations and recommendations for the conduct of veterinary diagnostic test method development, validation, evaluation, approval, and transfer to the user laboratory in the ISO/IEC 17025 environment. These recommendations are based on those of nationally and internationally accepted standards and guidelines, as well as those of reputable and experienced technical bodies. They are also based on the author's experience in the evaluation of method development and transfer projects, validation data, and the implementation of quality management systems in the area of method development.

  4. LADO as a Language Test: Issues of Validity

    Science.gov (United States)

    McNamara, Tim; Van Den Hazelkamp, Carolien; Verrips, Maaike

    2016-01-01

    This article brings together the theoretical field of language testing and the practical field of language analysis for the determination of the origin of asylum seekers. It considers what it would mean to think of language analysis as a form of language test, subject to the same validity constraints, and proposes a research agenda.

  5. Cross-Cultural Validation of TEMAS, a Minority Projective Test.

    Science.gov (United States)

    Costantino, Giuseppe; And Others

    The theoretical framework and cross-cultural validation of Tell-Me-A-Story (TEMAS), a projective test developed to measure personality development in ethnic minority children, is presented. The TEMAS test consists of 23 chromatic pictures which incorporate the following characteristics: (1) representation of antithetical concepts which the…

  6. Validity Theory: Reform Policies, Accountability Testing, and Consequences

    Science.gov (United States)

    Chalhoub-Deville, Micheline

    2016-01-01

    Educational policies such as Race to the Top in the USA affirm a central role for testing systems in government-driven reform efforts. Such reform policies are often referred to as the global education reform movement (GERM). Changes observed with the GERM style of testing demand socially engaged validity theories that include consequential…

  7. Reasoning with Inductive Argument Test: A Study of Validity and Reliability

    Directory of Open Access Journals (Sweden)

    Mehmet Emrah Karadere

    2013-12-01

    Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that ‘Reasoning with Inductive Argument Test’ supports reliability and validity in Turkish population. [JCBPR 2013; 2(3.000: 156-161

  8. NASA Double Asteroid Redirection Test (DART) Trajectory Validation and Robutness

    Science.gov (United States)

    Sarli, Bruno V.; Ozimek, Martin T.; Atchison, Justin A.; Englander, Jacob A.; Barbee, Brent W.

    2017-01-01

    The Double Asteroid Redirection Test (DART) mission will be the first to test the concept of a kinetic impactor. Several studies have been made on asteroid redirection and impact mitigation, however, to this date no mission tested the proposed concepts. An impact study on a representative body allows the measurement of the effects on the target's orbit and physical structure. With this goal, DART's objective is to verify the effectiveness of the kinetic impact concept for planetary defense. The spacecraft uses solar electric propulsion to escape Earth, fly by (138971) 2001 CB21 for impact rehearsal, and impact Didymos-B, the secondary body of the binary (65803) Didymos system. This work focuses on the heliocentric transfer design part of the mission with the validation of the baseline trajectory, performance comparison to other mission objectives, and assessment of the baseline robustness to missed thrust events. Results show a good performance of the selected trajectory for different mission objectives: latest possible escape date, maximum kinetic energy on impact, shortest possible time of flight, and use of an Earth swing-by. The baseline trajectory was shown to be robust to a missed thrust with 1% of fuel margin being enough to recover the mission for failures of more than 14 days.

  9. Development of a test rig and its application for validation and reliability testing of safety-critical software

    Energy Technology Data Exchange (ETDEWEB)

    Thai, N D; McDonald, A M [Atomic Energy of Canada Ltd., Mississauga, ON (Canada)

    1996-12-31

    This paper describes a versatile test rig developed by AECL for functional testing of safety-critical software used in the process trip computers of the Wolsong CANDU stations. The description covers the hardware and software aspects of the test rig, the test language and its interpreter, and other major testing software utilities such as the test oracle, sampler and profiler. The paper also discusses the application of the rig in the final stages of testing of the process trip computer software, namely validation and reliability tests. It shows how random test cases are generated, test scripts prepared and automatically run on the test rig. The versatility of the rig is further demonstrated in other types of testing such as sub-system tests, verification of the test oracle, testing of newly-developed test script, self-test and calibration. (author). 5 tabs., 10 figs.

  10. Development of a test rig and its application for validation and reliability testing of safety-critical software

    International Nuclear Information System (INIS)

    Thai, N.D.; McDonald, A.M.

    1995-01-01

    This paper describes a versatile test rig developed by AECL for functional testing of safety-critical software used in the process trip computers of the Wolsong CANDU stations. The description covers the hardware and software aspects of the test rig, the test language and its interpreter, and other major testing software utilities such as the test oracle, sampler and profiler. The paper also discusses the application of the rig in the final stages of testing of the process trip computer software, namely validation and reliability tests. It shows how random test cases are generated, test scripts prepared and automatically run on the test rig. The versatility of the rig is further demonstrated in other types of testing such as sub-system tests, verification of the test oracle, testing of newly-developed test script, self-test and calibration. (author). 5 tabs., 10 figs

  11. Predictive validity of the Biomedical Admissions Test: an evaluation and case study.

    Science.gov (United States)

    McManus, I C; Ferguson, Eamonn; Wakeford, Richard; Powis, David; James, David

    2011-01-01

    There has been an increase in the use of pre-admission selection tests for medicine. Such tests need to show good psychometric properties. Here, we use a paper by Emery and Bell [2009. The predictive validity of the Biomedical Admissions Test for pre-clinical examination performance. Med Educ 43:557-564] as a case study to evaluate and comment on the reporting of psychometric data in the field of medical student selection (and the comments apply to many papers in the field). We highlight pitfalls when reliability data are not presented, how simple zero-order associations can lead to inaccurate conclusions about the predictive validity of a test, and how biases need to be explored and reported. We show with BMAT that it is the knowledge part of the test which does all the predictive work. We show that without evidence of incremental validity it is difficult to assess the value of any selection tests for medicine.

  12. Independent validation of the MMPI-2-RF Somatic/Cognitive and Validity scales in TBI Litigants tested for effort.

    Science.gov (United States)

    Youngjohn, James R; Wershba, Rebecca; Stevenson, Matthew; Sturgeon, John; Thomas, Michael L

    2011-04-01

    The MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008) is replacing the MMPI-2 as the most widely used personality test in neuropsychological assessment, but additional validation studies are needed. Our study examines MMPI-2-RF Validity scales and the newly created Somatic/Cognitive scales in a recently reported sample of 82 traumatic brain injury (TBI) litigants who either passed or failed effort tests (Thomas & Youngjohn, 2009). The restructured Validity scales FBS-r (restructured symptom validity), F-r (restructured infrequent responses), and the newly created Fs (infrequent somatic responses) were not significant predictors of TBI severity. FBS-r was significantly related to passing or failing effort tests, and Fs and F-r showed non-significant trends in the same direction. Elevations on the Somatic/Cognitive scales profile (MLS-malaise, GIC-gastrointestinal complaints, HPC-head pain complaints, NUC-neurological complaints, and COG-cognitive complaints) were significant predictors of effort test failure. Additionally, HPC had the anticipated paradoxical inverse relationship with head injury severity. The Somatic/Cognitive scales as a group were better predictors of effort test failure than the RF Validity scales, which was an unexpected finding. MLS arose as the single best predictor of effort test failure of all RF Validity and Somatic/Cognitive scales. Item overlap analysis revealed that all MLS items are included in the original MMPI-2 Hy scale, making MLS essentially a subscale of Hy. This study validates the MMPI-2-RF as an effective tool for use in neuropsychological assessment of TBI litigants.

  13. Validation of a clinical critical thinking skills test in nursing

    OpenAIRE

    Shin, Sujin; Jung, Dukyoo; Kim, Sungeun

    2015-01-01

    Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school stud...

  14. Validating safeguards effectiveness given inherently limited test data

    International Nuclear Information System (INIS)

    Sicherman, A.

    1987-01-01

    A key issue in designing and evaluating nuclear safeguards systems is how to validate safeguards effectiveness against a spectrum of potential threats. Safeguards effectiveness is measured by a performance indicator such as the probability of defeating an adversary attempting a malevolent act. Effectiveness validation means a testing program that provides sufficient evidence that the performance indicator is at an acceptable level. Traditional statistical program when numerous independent system trials are possible. However, within the safeguards environment, many situations arise for which traditional statistical approaches may be neither feasible nor appropriate. Such situations can occur, for example, when there are obvious constraints on the number of possible tests due to operational impacts and testing costs. Furthermore, these tests are usually simulations (e.g., staged force-on-force exercises) rather than actual tests, and the system is often modified after each test. Under such circumstances, it is difficult to make and justify inferences about system performance by using traditional statistical techniques. In this paper, the authors discuss several alternative quantitative techniques for validating system effectiveness. The techniques include: (1) minimizing the number of required tests using sequential testing; (2) combining data from models inspections and exercises using Bayesian statistics to improve inferences about system performance; and (3) using reliability growth and scenario modeling to help specify which safeguards elements and scenarios to test

  15. Validation of a clinical critical thinking skills test in nursing

    Directory of Open Access Journals (Sweden)

    Sujin Shin

    2015-01-01

    Full Text Available Purpose: The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS and to subsequently validate its performance. Methods: This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Results: Out of initial 30 items, 11 items were excluded after the analysis of difficulty and discrimination parameter. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. Conclusion: From above results, evidence of the response process validity was demonstrated, indicating that subjects respondeds as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and will therefore represents a more convenient measurement of critical thinking ability.

  16. Validation of a clinical critical thinking skills test in nursing.

    Science.gov (United States)

    Shin, Sujin; Jung, Dukyoo; Kim, Sungeun

    2015-01-27

    The purpose of this study was to develop a revised version of the clinical critical thinking skills test (CCTS) and to subsequently validate its performance. This study is a secondary analysis of the CCTS. Data were obtained from a convenience sample of 284 college students in June 2011. Thirty items were analyzed using item response theory and test reliability was assessed. Test-retest reliability was measured using the results of 20 nursing college and graduate school students in July 2013. The content validity of the revised items was analyzed by calculating the degree of agreement between instrument developer intention in item development and the judgments of six experts. To analyze response process validity, qualitative data related to the response processes of nine nursing college students obtained through cognitive interviews were analyzed. Out of initial 30 items, 11 items were excluded after the analysis of difficulty and discrimination parameter. When the 19 items of the revised version of the CCTS were analyzed, levels of item difficulty were found to be relatively low and levels of discrimination were found to be appropriate or high. The degree of agreement between item developer intention and expert judgments equaled or exceeded 50%. From above results, evidence of the response process validity was demonstrated, indicating that subjects respondeds as intended by the test developer. The revised 19-item CCTS was found to have sufficient reliability and validity and will therefore represents a more convenient measurement of critical thinking ability.

  17. Validity, Reliability, and Sensitivity of a Volleyball Intermittent Endurance Test.

    Science.gov (United States)

    Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl

    2017-03-01

    To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P volleyball players.

  18. Overview of CSNI separate effects tests validation matrix

    Energy Technology Data Exchange (ETDEWEB)

    Aksan, N. [Paul Scherrer Institute, Villigen (Switzerland); Auria, F.D. [Univ. of Pisa (Italy); Glaeser, H. [Gesellschaft fuer anlagen und Reaktorsicherheit, (GRS), Garching (Germany)] [and others

    1995-09-01

    An internationally agreed separate effects test (SET) Validation Matrix for thermal-hydraulic system codes has been established by a sub-group of the Task Group on Thermal Hydraulic System Behaviour as requested by the OECD/NEA Committee on Safety of Nuclear Installations (SCNI) Principal Working Group No. 2 on Coolant System Behaviour. The construction of such a Matrix is an attempt to collect together in a systematic way the best sets of openly available test data for code validation, assessment and improvement and also for quantitative code assessment with respect to quantification of uncertainties to the modeling of individual phenomena by the codes. The methodology, that has been developed during the process of establishing CSNI-SET validation matrix, was an important outcome of the work on SET matrix. In addition, all the choices which have been made from the 187 identified facilities covering the 67 phenomena will be investigated together with some discussions on the data base.

  19. Automated smartphone audiometry: Validation of a word recognition test app.

    Science.gov (United States)

    Dewyer, Nicholas A; Jiradejvong, Patpong; Henderson Sabes, Jennifer; Limb, Charles J

    2018-03-01

    Develop and validate an automated smartphone word recognition test. Cross-sectional case-control diagnostic test comparison. An automated word recognition test was developed as an app for a smartphone with earphones. English-speaking adults with recent audiograms and various levels of hearing loss were recruited from an audiology clinic and were administered the smartphone word recognition test. Word recognition scores determined by the smartphone app and the gold standard speech audiometry test performed by an audiologist were compared. Test scores for 37 ears were analyzed. Word recognition scores determined by the smartphone app and audiologist testing were in agreement, with 86% of the data points within a clinically acceptable margin of error and a linear correlation value between test scores of 0.89. The WordRec automated smartphone app accurately determines word recognition scores. 3b. Laryngoscope, 128:707-712, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.

  20. Older people experiencing homelessness show marked impairment on tests of frontal lobe function.

    Science.gov (United States)

    Rogoz, Astrid; Burke, David

    2016-03-01

    Reported rates of mild and moderate cognitive impairment in older people experiencing homelessness range from 5-80%. The objective of this study was to determine the prevalence and characteristics of cognitive impairment in older people experiencing homelessness in the inner city of Sydney, Australia. Men and women experiencing homelessness aged 45 years and over in the inner city were screened for cognitive impairment. Participants who scored 26 or below on the mini-mental state examination and/or were impaired on any one of the clock-drawing test, the verbal fluency test and the trail-making test, part B were then assessed with a semi-structured interview, including the 21-item Depression Anxiety Stress Scale and the 12-item General Health Questionnaire. Screening of 144 men and 27 women aged between 45 years and 93 years identified cognitive impairment in 78%. Subsequently, high rates of mental and physical illness were identified, and 75% of subjects who were cognitively impaired performed poorly on frontal lobe tests. The trail-making test, part B was the most sensitive measure of frontal function. This study demonstrated that a large majority of older people experiencing homelessness, in the inner city of a high-income country, showed impairment on tests of frontal lobe function, a finding that could have significant implications for any medical or psychosocial intervention. Copyright © 2015 John Wiley & Sons, Ltd.

  1. Video game addiction test: validity and psychometric characteristics.

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  2. Video Game Addiction Test: Validity and Psychometric Characteristics

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  3. Validity of the American Sign Language Discrimination Test

    Science.gov (United States)

    Bochner, Joseph H.; Samar, Vincent J.; Hauser, Peter C.; Garrison, Wayne M.; Searls, J. Matt; Sanders, Cynthia A.

    2016-01-01

    American Sign Language (ASL) is one of the most commonly taught languages in North America. Yet, few assessment instruments for ASL proficiency have been developed, none of which have adequately demonstrated validity. We propose that the American Sign Language Discrimination Test (ASL-DT), a recently developed measure of learners' ability to…

  4. Test of Creative Imagination: Validity and Reliability Study

    Science.gov (United States)

    Gundogan, Aysun; Ari, Meziyet; Gonen, Mubeccel

    2013-01-01

    The purpose of this study was to investigate validity and reliability of the test of creative imagination. This study was conducted with the participation of 1000 children, aged between 9-14 and were studying in six primary schools in the city center of Denizli Province, chosen by cluster ratio sampling. In the study, it was revealed that the…

  5. Ecological validity of the Yo-Yo SFIE2 test

    DEFF Research Database (Denmark)

    Krustrup, Peter; Randers, Morten Bredsgaard; Horton, J

    2012-01-01

    The present study investigated the movement pattern of Portuguese top-level futsal referees (n=16) during competitive games and the ecological validity of the new Yo-Yo Sideways-Forwards Intermittent Endurance level 2 test (Yo-Yo SFIE2). Total distance covered (TD), high-intensity running (HIR...

  6. Validation of the Hwalek-Sengstock Elder Abuse Screening Test.

    Science.gov (United States)

    Neale, Anne Victoria; And Others

    Elder abuse is recognized as an under-detected and under-reported social problem. Difficulties in detecting elder abuse are compounded by the lack of a standardized, psychometrically valid instrument for case finding. The development of the Hwalek-Sengstock Elder Abuse Screening Test (H-S/EAST) followed a larger effort to identify indicators and…

  7. Validation testing of a soil macronutrient sensing system

    Science.gov (United States)

    Rapid on-site measurements of soil macronutrients (i.e., nitrogen, phosphorus, and potassium) are needed for site-specific crop management, where fertilizer nutrient application rates are adjusted spatially based on local requirements. This study reports on validation testing of a previously develop...

  8. Construct validity and reliability of automated body reaction test ...

    African Journals Online (AJOL)

    Automated Body Reaction Test (ABRT) is a new device for skills and physical assessment instrument to measure ability on react, move quickly and accurately in accordance with stimulus. A total of 474 subjects aged 7-17 years old were randomly selected for the construct validity (n=330) and reliability (n=144). The ABRT ...

  9. Functional Literacy Tests: A Case of Anticipatory Validity?

    Science.gov (United States)

    Anderson, Lorin W.; Anderson, Jo Craig

    1981-01-01

    Development of the mathematics functional literacy test (MFLT) is described, issues of predictive and content validity are discussed, and implications for educational policy are presented. Ten basic skill areas identified by the National Council of Supervisors of Mathematics were used as the basis for the development of the MFLT. (RL)

  10. Validation of new CFD release by Ground-Coupled Heat Transfer Test Cases

    Directory of Open Access Journals (Sweden)

    Sehnalek Stanislav

    2017-01-01

    Full Text Available In this article is presented validation of ANSYS Fluent with IEA BESTEST Task 34. Article stars with outlook to the topic, afterward are described steady-state cases used for validation. Thereafter is mentioned implementation of these cases on CFD. Article is concluded with presentation of the simulated results with a comparison of those from already validated simulation software by IEA. These validation shows high correlation with an older version of tested ANSYS as well as with other main software. The paper ends by discussion with an outline of future research.

  11. Validity and Reliability of Baseline Testing in a Standardized Environment.

    Science.gov (United States)

    Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur

    2017-08-11

    The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred-sixty one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  12. ExEP yield modeling tool and validation test results

    Science.gov (United States)

    Morgan, Rhonda; Turmon, Michael; Delacroix, Christian; Savransky, Dmitry; Garrett, Daniel; Lowrance, Patrick; Liu, Xiang Cate; Nunez, Paul

    2017-09-01

    EXOSIMS is an open-source simulation tool for parametric modeling of the detection yield and characterization of exoplanets. EXOSIMS has been adopted by the Exoplanet Exploration Programs Standards Definition and Evaluation Team (ExSDET) as a common mechanism for comparison of exoplanet mission concept studies. To ensure trustworthiness of the tool, we developed a validation test plan that leverages the Python-language unit-test framework, utilizes integration tests for selected module interactions, and performs end-to-end crossvalidation with other yield tools. This paper presents the test methods and results, with the physics-based tests such as photometry and integration time calculation treated in detail and the functional tests treated summarily. The test case utilized a 4m unobscured telescope with an idealized coronagraph and an exoplanet population from the IPAC radial velocity (RV) exoplanet catalog. The known RV planets were set at quadrature to allow deterministic validation of the calculation of physical parameters, such as working angle, photon counts and integration time. The observing keepout region was tested by generating plots and movies of the targets and the keepout zone over a year. Although the keepout integration test required the interpretation of a user, the test revealed problems in the L2 halo orbit and the parameterization of keepout applied to some solar system bodies, which the development team was able to address. The validation testing of EXOSIMS was performed iteratively with the developers of EXOSIMS and resulted in a more robust, stable, and trustworthy tool that the exoplanet community can use to simulate exoplanet direct-detection missions from probe class, to WFIRST, up to large mission concepts such as HabEx and LUVOIR.

  13. Test re-test reliability and construct validity of the star-track test of manual dexterity

    DEFF Research Database (Denmark)

    Kildebro, Niels; Amirian, Ilda; Gögenur, Ismail

    2015-01-01

    Objectives. We wished to determine test re-test reliability and construct validity of the star-track test of manual dexterity. Design. Test re-test reliability was examined in a controlled study. Construct validity was tested in a blinded randomized crossover study. Setting. The study was performed...... at a university hospital in Denmark. Participants. A total of 11 subjects for test re-test and 20 subjects for the construct validity study were included. All subjects were healthy volunteers. Intervention. The test re-test trial had two measurements with 2 days pause in between. The interventions...... in the construct validity study included baseline measurement, intervention 1: fatigue, intervention 2: stress, and intervention 3: fatigue and stress. There was a 2 day pause between each intervention. Main outcome measure. An integrated measure of completion time and number of errors was used. Results. All...

  14. Experimental validation of a new heterogeneous mechanical test design

    Science.gov (United States)

    Aquino, J.; Campos, A. Andrade; Souto, N.; Thuillier, S.

    2018-05-01

    Standard material parameters identification strategies generally use an extensive number of classical tests for collecting the required experimental data. However, a great effort has been made recently by the scientific and industrial communities to support this experimental database on heterogeneous tests. These tests can provide richer information on the material behavior allowing the identification of a more complete set of material parameters. This is a result of the recent development of full-field measurements techniques, like digital image correlation (DIC), that can capture the heterogeneous deformation fields on the specimen surface during the test. Recently, new specimen geometries were designed to enhance the richness of the strain field and capture supplementary strain states. The butterfly specimen is an example of these new geometries, designed through a numerical optimization procedure where an indicator capable of evaluating the heterogeneity and the richness of strain information. However, no experimental validation was yet performed. The aim of this work is to experimentally validate the heterogeneous butterfly mechanical test in the parameter identification framework. For this aim, DIC technique and a Finite Element Model Up-date inverse strategy are used together for the parameter identification of a DC04 steel, as well as the calculation of the indicator. The experimental tests are carried out in a universal testing machine with the ARAMIS measuring system to provide the strain states on the specimen surface. The identification strategy is accomplished with the data obtained from the experimental tests and the results are compared to a reference numerical solution.

  15. Guppies Show Behavioural but Not Cognitive Sex Differences in a Novel Object Recognition Test.

    Directory of Open Access Journals (Sweden)

    Tyrone Lucon-Xiccato

    Full Text Available The novel object recognition (NOR test is a widely-used paradigm to study learning and memory in rodents. NOR performance is typically measured as the preference to interact with a novel object over a familiar object based on spontaneous exploratory behaviour. In rats and mice, females usually have greater NOR ability than males. The NOR test is now available for a large number of species, including fish, but sex differences have not been properly tested outside of rodents. We compared male and female guppies (Poecilia reticulata in a NOR test to study whether sex differences exist also for fish. We focused on sex differences in both performance and behaviour of guppies during the test. In our experiment, adult guppies expressed a preference for the novel object as most rodents and other species do. When we looked at sex differences, we found the two sexes showed a similar preference for the novel object over the familiar object, suggesting that male and female guppies have similar NOR performances. Analysis of behaviour revealed that males were more inclined to swim in the proximity of the two objects than females. Further, males explored the novel object at the beginning of the experiment while females did so afterwards. These two behavioural differences are possibly due to sex differences in exploration. Even though NOR performance is not different between male and female guppies, the behavioural sex differences we found could affect the results of the experiments and should be carefully considered when assessing fish memory with the NOR test.

  16. Recent trends on Software Verification and Validation Testing

    International Nuclear Information System (INIS)

    Kim, Hyungtae; Jeong, Choongheui

    2013-01-01

    Verification and Validation (V and V) include the analysis, evaluation, review, inspection, assessment, and testing of products. Especially testing is an important method to verify and validate software. Software V and V testing covers test planning to execution. IEEE Std. 1012 is a standard on the software V and V. Recently, IEEE Std. 1012-2012 was published. This standard is a major revision to IEEE Std. 1012-2004 which defines only software V and V. It expands the scope of the V and V processes to include system and hardware as well as software. This standard describes the scope of V and V testing according to integrity level. In addition, independent V and V requirement related to software V and V testing in IEEE 7-4.3.2-2010 have been revised. This paper provides a recent trend of software V and V testing by reviewing of IEEE Std. 1012-2012 and IEEE 7-4.3.2-2010. There are no major changes of software V and V testing activities and tasks in IEEE 1012-2012 compared with IEEE 1012-2004. But the positions on the responsibility to perform software V and V testing are changed. In addition IEEE 7-4.3.2-2010 newly describes the positions on responsibility to perform Software V and V Testing. However, the positions of these standards on the V and V testing are different. For integrity level 3 and 4, IEEE 1012-2012 basically requires that V and V organization shall conduct all of V and V testing tasks such as test plan, test design, test case, and test procedure except test execution. If V and V testing is conducted by not V and V but another organization, the results of that testing shall be analyzed by the V and V organization. For safety-related software, IEEE 7-4.3.2-2010 requires that test procedures and reports shall be independently verified by the alternate organization regardless of who writes the procedures and/or conducts the tests

  17. Test of Gross Motor Development : Expert Validity, confirmatory validity and internal consistence

    Directory of Open Access Journals (Sweden)

    Nadia Cristina Valentini

    2008-12-01

    Full Text Available The Test of Gross Motor Development (TGMD-2 is an instrument used to evaluate children’s level of motordevelopment. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by expertsand the confirmatory factorial validity and the internal consistence by means of test-retest of the Portuguese TGMD-2. Across-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionalsand 587 children, from 27 schools (kindergarten and elementary from 3 to 10 years old (51.1% boys and 48.9% girls.Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated thatthe Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices ofconfirmatory factorial validity (χ2/gl = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tuckerand Lewis’s Index of Fit = 0.83 and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88. ThePortuguese TGMD-2 demonstrated validity and reliability for the sample investigated.

  18. Test of Gross Motor Development: expert validity, confirmatory validity and internal consistence

    Directory of Open Access Journals (Sweden)

    Nadia Cristina Valentini

    2008-01-01

    The Test of Gross Motor Development (TGMD-2 is an instrument used to evaluate children’s level of motor development. The objective of this study was to translate and verify the clarity and pertinence of the TGMD-2 items by experts and the confirmatory factorial validity and the internal consistence by means of test-retest of the Portuguese TGMD-2. A cross-cultural translation was used to construct the Portuguese version. The participants of this study were 7 professionals and 587 children, from 27 schools (kindergarten and elementary from 3 to 10 years old (51.1% boys and 48.9% girls. Each child was videotaped performing the test twice. The videotaped tests were then scored. The results indicated that the Portuguese version of the TGMD-2 contains clear and pertinent motor items; demonstrated satisfactory indices of confirmatory factorial validity (÷2/gl = 3.38; Goodness-of-fit Index = 0.95; Adjusted Goodness-of-fit index = 0.92 and Tucker and Lewis’s Index of Fit = 0.83 and test-retest internal consistency (locomotion r = 0.82; control of object: r = 0.88. The Portuguese TGMD-2 demonstrated validity and reliability for the sample investigated.

  19. Assessing cultural validity in standardized tests in stem education

    Science.gov (United States)

    Gassant, Lunes

    This quantitative ex post facto study examined how race and gender, as elements of culture, influence the development of common misconceptions among STEM students. Primary data came from a standardized test: the Digital Logic Concept Inventory (DLCI) developed by Drs. Geoffrey L. Herman, Michael C. Louis, and Craig Zilles from the University of Illinois at Urbana-Champaign. The sample consisted of a cohort of 82 STEM students recruited from three universities in Northern Louisiana. Microsoft Excel and the Statistical Package for the Social Sciences (SPSS) were used for data computation. Two key concepts, several sub concepts, and 19 misconceptions were tested through 11 items in the DLCI. Statistical analyses based on both the Classical Test Theory (Spearman, 1904) and the Item Response Theory (Lord, 1952) yielded similar results: some misconceptions in the DLCI can reliably be predicted by the Race or the Gender of the test taker. The research is significant because it has shown that some misconceptions in a STEM discipline attracted students with similar ethnic backgrounds differently; thus, leading to the existence of some cultural bias in the standardized test. Therefore the study encourages further research in cultural validity in standardized tests. With culturally valid tests, it will be possible to increase the effectiveness of targeted teaching and learning strategies for STEM students from diverse ethnic backgrounds. To some extent, this dissertation has contributed to understanding, better, the gap between high enrollment rates and low graduation rates among African American students and also among other minority students in STEM disciplines.

  20. Test rig overview for validation and reliability testing of shutdown system software

    International Nuclear Information System (INIS)

    Zhao, M.; McDonald, A.; Dick, P.

    2007-01-01

    The test rig for Validation and Reliability Testing of shutdown system software has been upgraded from the AECL Windows-based test rig previously used for CANDU6 stations. It includes a Virtual Trip Computer, which is a software simulation of the functional specification of the trip computer, and a real-time trip computer simulator in a separate chassis, which is used during the preparation of trip computer test cases before the actual trip computers are available. This allows preparation work for Validation and Reliability Testing to be performed in advance of delivery of actual trip computers to maintain a project schedule. (author)

  1. Validation of the Vanderbilt Holistic Face Processing Test.

    Science.gov (United States)

    Wang, Chao-Chih; Ross, David A; Gauthier, Isabel; Richler, Jennifer J

    2016-01-01

    The Vanderbilt Holistic Face Processing Test (VHPT-F) is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014). In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom), which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1.

  2. Validation of the Vanderbilt Holistic Face Processing Test.

    Directory of Open Access Journals (Sweden)

    Chao-Chih Wang

    2016-11-01

    Full Text Available The Vanderbilt Holistic Face Processing Test (VHPT-F is a new measure of holistic face processing with better psychometric properties relative to prior measures developed for group studies (Richler et al., 2014. In fields where psychologists study individual differences, validation studies are commonplace and the concurrent validity of a new measure is established by comparing it to an older measure with established validity. We follow this approach and test whether the VHPT-F measures the same construct as the composite task, which is group-based measure at the center of the large literature on holistic face processing. In Experiment 1, we found a significant correlation between holistic processing measured in the VHPT-F and the composite task. Although this correlation was small, it was comparable to the correlation between holistic processing measured in the composite task with the same faces, but different target parts (top or bottom, which represents a reasonable upper limit for correlations between the composite task and another measure of holistic processing. These results confirm the validity of the VHPT-F by demonstrating shared variance with another measure of holistic processing based on the same operational definition. These results were replicated in Experiment 2, but only when the demographic profile of our sample matched that of Experiment 1.

  3. Validation of the German version of the Ford Insomnia Response to Stress Test.

    Science.gov (United States)

    Dieck, Arne; Helbig, Susanne; Drake, Christopher L; Backhaus, Jutta

    2018-06-01

    The purpose of this study was to assess the psychometric properties of a German version of the Ford Insomnia Response to Stress Test with groups with and without sleep problems. Three studies were analysed. Data set 1 was based on an initial screening for a sleep training program (n = 393), data set 2 was based on a study to test the test-retest reliability of the Ford Insomnia Response to Stress Test (n = 284) and data set 3 was based on a study to examine the influence of competitive sport on sleep (n = 37). Data sets 1 and 2 were used to test internal consistency, factor structure, convergent validity, discriminant validity and test-retest reliability of the Ford Insomnia Response to Stress Test. Content validity was tested using data set 3. Cronbach's alpha of the Ford Insomnia Response to Stress Test was good (α = 0.80) and test-retest reliability was satisfactory (r = 0.72). Overall, the one-factor model showed the best fit. Furthermore, significant positive correlations between the Ford Insomnia Response to Stress Test and impaired sleep quality, depression and stress reactivity were in line with the expectations regarding the convergent validity. Subjects with sleep problems had significantly higher scores in the Ford Insomnia Response to Stress Test than subjects without sleep problems (P Stress Test had significantly lower sleep quality (P = 0.01), demonstrating that vulnerability for stress-induced sleep disturbances accompanies poorer sleep quality in stressful episodes. The findings show that the German version of the Ford Insomnia Response to Stress Test is a reliable and valid questionnaire to assess the vulnerability to stress-induced sleep disturbances. © 2017 European Sleep Research Society.

  4. Reliability, Validity and Factor Structure of Drug Abuse Screening Test

    OpenAIRE

    Sayed Hadi Sayed Alitabar; Mojtaba Habibi; Maryam Falahatpisheh; Musa Arvin

    2016-01-01

    Background and Objective: According to the increasing of substance use in the country, more researches about this phenomenon are necessary. This Study Investigates the Validity, Reliability and Confirmatory Factor Structure of the Drug Abuse Screening test (DAST). Materials and Methods: The Sample Consisted of 381 Patients (143 Women and 238 Men) with a Multi-Stage Cluster Sampling of Areas 2, 6 and 12 of Tehran Were Selected from Each Region, 6 Randomly Selected Drug Rehabilitation Center. T...

  5. A Human Proximity Operations System test case validation approach

    Science.gov (United States)

    Huber, Justin; Straub, Jeremy

    A Human Proximity Operations System (HPOS) poses numerous risks in a real world environment. These risks range from mundane tasks such as avoiding walls and fixed obstacles to the critical need to keep people and processes safe in the context of the HPOS's situation-specific decision making. Validating the performance of an HPOS, which must operate in a real-world environment, is an ill posed problem due to the complexity that is introduced by erratic (non-computer) actors. In order to prove the HPOS's usefulness, test cases must be generated to simulate possible actions of these actors, so the HPOS can be shown to be able perform safely in environments where it will be operated. The HPOS must demonstrate its ability to be as safe as a human, across a wide range of foreseeable circumstances. This paper evaluates the use of test cases to validate HPOS performance and utility. It considers an HPOS's safe performance in the context of a common human activity, moving through a crowded corridor, and extrapolates (based on this) to the suitability of using test cases for AI validation in other areas of prospective application.

  6. Measurement of Dietary Restraint: Validity Tests of Four Questionnaires

    Science.gov (United States)

    Williamson, Donald A.; Martin, Corby K.; York-Crowe, Emily; Anton, Stephen D.; Redman, Leanne M.; Han, Hongmei; Ravussin, Eric

    2007-01-01

    This study tested the validity of four measures of dietary restraint: Dutch Eating Behavior Questionnaire, Eating Inventory (EI), Revised Restraint Scale (RS), and the Current Dieting Questionnaire. Dietary restraint has been implicated as a determinant of overeating and binge eating. Conflicting findings have been attributed to different methods for measuring dietary restraint. The validity of four self-report measures of dietary restraint and dieting behavior was tested using: 1) factor analysis, 2) changes in dietary restraint in a randomized controlled trial of different methods to achieve calorie restriction, and 3) correlation of changes in dietary restraint with an objective measure of energy balance, calculated from the changes in fat mass and fat-free mass over a six-month dietary intervention. Scores from all four questionnaires, measured at baseline, formed a dietary restraint factor, but the RS also loaded on a binge eating factor. Based on change scores, the EI Restraint scale was the only measure that correlated significantly with energy balance expressed as a percentage of energy require d for weight maintenance. These findings suggest that that, of the four questionnaires tested, the EI Restraint scale was the most valid measure of the intent to diet and actual caloric restriction. PMID:17101191

  7. Validation of a diabetes numeracy test in Arabic

    OpenAIRE

    Alghodaier, Hussah; Jradi, Hoda; Mohammad, Najwa Samantha; Bawazir, Amen

    2017-01-01

    Background The prevalence of diabetes Mellitus in Saudi Arabia is 24%, ranking it among the top ten Worldwide. Diabetes education focuses on self-management and relies on numeracy skills. Poor numeracy may go unrecognized and it is important to have an assessment tool in Arabic to measure such a skill in diabetes care. Objectives To validate a 15-item Diabetes Numeracy Test (DNT-15) in the Arabic Language as a tool to assess the numeracy skills of patients with diabetes and to test its proper...

  8. Validation Testing for Automated Solubility Measurement Equipment Final Report

    Energy Technology Data Exchange (ETDEWEB)

    Lachut, J. S. [Washington River Protection Solutions LLC, Richland, WA (United States)

    2016-01-11

    Laboratory tests have been completed to test the validity of automated solubility measurement equipment using sodium nitrate and sodium chloride solutions (see test plan WRPS-1404441, “Validation Testing for Automated Solubility Measurement Equipment”). The sodium nitrate solution results were within 2-3% of the reference values, so the experiment is considered successful using the turbidity meter. The sodium chloride test was done by sight, as the turbidity meter did not work well using sodium chloride. For example, the “clear” turbidity reading was 53 FNU at 80 °C, 107 FNU at 55 °C, and 151 FNU at 20 °C. The sodium chloride did not work because it is granular and large; as the solution was stirred, the granules stayed to the outside of the reactor and just above the stir bar level, having little impact on the turbidity meter readings as the meter was aimed at the center of the solution. Also, the turbidity meter depth has an impact. The salt tends to remain near the stir bar level. If the meter is deeper in the slurry, it will read higher turbidity, and if the meter is raised higher in the slurry, it will read lower turbidity (possibly near zero) because it reads the “clear” part of the slurry. The sodium chloride solution results, as measured by sight rather than by turbidity instrument readings, were within 5-6% of the reference values.

  9. Wave Tank Testing and Model Validation of an Autonomous Wave Energy Converter

    Directory of Open Access Journals (Sweden)

    Bret Bosma

    2015-08-01

    Full Text Available A key component in bringing ocean wave energy converters from concept to commercialization is the building and testing of scaled prototypes to provide model validation. A one quarter scale prototype of an autonomous two body heaving point absorber was modeled, built, and tested for this work. Wave tank testing results are compared with two hydrodynamic and system models—implemented in both ANSYS AQWA and MATLAB/Simulink—and show model validation over certain regions of operation. This work will serve as a guide for future developers of wave energy converter devices, providing insight in taking their design from concept to prototype stage.

  10. Testing ESL pragmatics development and validation of a web-based assessment battery

    CERN Document Server

    Roever, Carsten

    2014-01-01

    Although second language learners' pragmatic competence (their ability to use language in context) is an essential part of their general communicative competence, it has not been a part of second language tests. This book helps fill this gap by describing the development and validation of a web-based test of ESL pragmalinguistics. The instrument assesses learners' knowledge of routine formulae, speech acts, and implicature in 36 multiple-choice and brief-response items. The test's quantitative and qualitative validation with 300 learners showed high reliability and provided strong evidence of

  11. Reliability and Validity of the Inline Skating Skill Test

    Science.gov (United States)

    Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje

    2016-01-01

    This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters. Key points Study evaluated the reliability and construct validity of a newly developed inline skating skill test. Evaluated test is a first protocol designed to assess specific inline skating skill. Two groups of amateur skaters with different skating proficiency repeated the skill test in four separate occasions. The results suggest that evaluated test is reliable and valid to evaluate inline skating skill in amateur skaters. PMID:27803616

  12. Validation of a Spanish version of the Test Your Memory.

    Science.gov (United States)

    Ferrero-Arias, J; Turrión-Rojo, M Á

    2016-01-01

    To validate a Spanish version of the TYM, a self-administered cognitive screening test designed for the detection of Alzheimer's disease and mild cognitive defect. A cross-sectional study was conducted in a neurology outpatient clinic. The TYM was administered to individuals of 50 years o more who came to the clinic for whatever the symptom. Their cognitive state was evaluated regardless of the outcome of TYM. They were categorized into 3 groups: 1) Cognitively normal (739), 2) with mild cognitive impairment (183), 3) with dementia (127). An analysis of items was made and the psychometric properties of the TYM were defined. There was a cross-validation, and the predictive validity of the TYM score, adjusted to the demographic variables, was determined by evaluating their performance in ROC curves. The internal consistency, interobserver reliability, short term and long-term test-retest reliability were adequate. The TYM correlated with the MMSE (r=0.779, Pde Neurología. Published by Elsevier España, S.L.U. All rights reserved.

  13. Portuguese validation of the children's eating attitudes test

    Directory of Open Access Journals (Sweden)

    Maria Del Carmen Bento Teixeira

    2012-01-01

    Full Text Available BACKGROUND: The Eating Attitudes Test (EAT is the most widely used instrument for evaluating eating disorders in adults and adolescents in a variety of cultures and samples. OBJECTIVE: The aim of this study was to analyse the psychometric properties of the Portuguese version of the Children's Eating Attitudes Test (ChEAT. METHOD: Nine hundred and fifty-six Portuguese secondary students (565 girls and 391 boys answered the ChEAT. The test-retest reliability was obtained with data from 206 participants from the total sample who re-answered the questionnaire after 4-6 weeks. Psychometric analyses were carried out for the total sample and separately for girls and boys. RESULTS: Internal consistency and test-retest reliability were satisfactory. Principal components factorial analysis yielded four factors in the total sample, accounting for 42.35% of the total variance. Factor structure was similar in the total sample and in both genders. Factors were labelled: F1 "Fear of Getting Fat", F2 "Restrictive and Purgative Behaviours", F3 "Food Preoccupation" and F4 "Social Pressure to Eat". The concurrent validity, explored using the Contour Drawing Figure Rating Scale (CDRS was high. DISCUSSION: The Portuguese version of the ChEAT is a valid and useful instrument for the evaluation of abnormal eating attitudes and behaviours among Portuguese adolescents.

  14. Convergent and diagnostic validity of STAVUX, a word and pseudoword spelling test for adults.

    Science.gov (United States)

    Östberg, Per; Backlund, Charlotte; Lindström, Emma

    2016-10-01

    Few comprehensive spelling tests are available in Swedish, and none have been validated in adults with reading and writing disorders. The recently developed STAVUX test includes word and pseudoword spelling subtests with high internal consistency and adult norms stratified by education. This study evaluated the convergent and diagnostic validity of STAVUX in adults with dyslexia. Forty-six adults, 23 with dyslexia and 23 controls, took STAVUX together with a standard word-decoding test and a self-rated measure of spelling skills. STAVUX subtest scores showed moderate to strong correlations with word-decoding scores and predicted self-rated spelling skills. Word and pseudoword subtest scores both predicted dyslexia status. Receiver-operating characteristic (ROC) analysis showed excellent diagnostic discriminability. Sensitivity was 91% and specificity 96%. In conclusion, the results of this study support the convergent and diagnostic validity of STAVUX.

  15. Testing and Validation of Computational Methods for Mass Spectrometry.

    Science.gov (United States)

    Gatto, Laurent; Hansen, Kasper D; Hoopmann, Michael R; Hermjakob, Henning; Kohlbacher, Oliver; Beyer, Andreas

    2016-03-04

    High-throughput methods based on mass spectrometry (proteomics, metabolomics, lipidomics, etc.) produce a wealth of data that cannot be analyzed without computational methods. The impact of the choice of method on the overall result of a biological study is often underappreciated, but different methods can result in very different biological findings. It is thus essential to evaluate and compare the correctness and relative performance of computational methods. The volume of the data as well as the complexity of the algorithms render unbiased comparisons challenging. This paper discusses some problems and challenges in testing and validation of computational methods. We discuss the different types of data (simulated and experimental validation data) as well as different metrics to compare methods. We also introduce a new public repository for mass spectrometric reference data sets ( http://compms.org/RefData ) that contains a collection of publicly available data sets for performance evaluation for a wide range of different methods.

  16. Impact on participation and autonomy: test of validity and reliability for older persons

    Directory of Open Access Journals (Sweden)

    Isabelle Ottenvall Hammar

    2014-10-01

    Full Text Available In research and healthcare it is important to measure older persons’ self-determination in order to improve their possibilities to decide for themselves in daily life. The questionnaire Impact on Participation and Autonomy (IPA assesses self-determination, but is not constructed for older persons. The aim of this study was to examine the validity and reliability of the IPA-S questionnaire for persons aged 70 years and older. The study was performed in two steps; first a validity test of the Swedish version of the questionnaire, IPA-S, followed by a reliability test-retest of an adjusted version. The validity was tested with focus groups and individual interviews on persons aged 77-88 years, and the reliability on persons aged 70-99 years. The validity test result showed that IPA-S is valid for older persons but it was too extensive and the phrasing of the items needed adjustments. The reliability test-retest on the adjusted questionnaire, IPA-Older persons (IPA-O, showed that 15 of 22 items had high agreement. IPA-O can be used to measure older persons’ self-determination in their care and rehabilitation.

  17. The BACHD Rat Model of Huntington Disease Shows Specific Deficits in a Test Battery of Motor Function.

    Science.gov (United States)

    Manfré, Giuseppe; Clemensson, Erik K H; Kyriakou, Elisavet I; Clemensson, Laura E; van der Harst, Johanneke E; Homberg, Judith R; Nguyen, Huu Phuc

    2017-01-01

    Rationale : Huntington disease (HD) is a progressive neurodegenerative disorder characterized by motor, cognitive and neuropsychiatric symptoms. HD is usually diagnosed by the appearance of motor deficits, resulting in skilled hand use disruption, gait abnormality, muscle wasting and choreatic movements. The BACHD transgenic rat model for HD represents a well-established transgenic rodent model of HD, offering the prospect of an in-depth characterization of the motor phenotype. Objective : The present study aims to characterize different aspects of motor function in BACHD rats, combining classical paradigms with novel high-throughput behavioral phenotyping. Methods : Wild-type (WT) and transgenic animals were tested longitudinally from 2 to 12 months of age. To measure fine motor control, rats were challenged with the pasta handling test and the pellet reaching test. To evaluate gross motor function, animals were assessed by using the holding bar and the grip strength tests. Spontaneous locomotor activity and circadian rhythmicity were assessed in an automated home-cage environment, namely the PhenoTyper. We then integrated existing classical methodologies to test motor function with automated home-cage assessment of motor performance. Results : BACHD rats showed strong impairment in muscle endurance at 2 months of age. Altered circadian rhythmicity and locomotor activity were observed in transgenic animals. On the other hand, reaching behavior, forepaw dexterity and muscle strength were unaffected. Conclusions : The BACHD rat model exhibits certain features of HD patients, like muscle weakness and changes in circadian behavior. We have observed modest but clear-cut deficits in distinct motor phenotypes, thus confirming the validity of this transgenic rat model for treatment and drug discovery purposes.

  18. Validation Test of Geant4 Simulation of Electron Backscattering

    CERN Document Server

    Kim, Sung Hun; Basaglia, Tullio; Han, Min Cheol; Hoff, Gabriela; Kim, Chan Hyeong; Saracco, Paolo

    2015-01-01

    Backscattering is a sensitive probe of the accuracy of electron scattering algorithms implemented in Monte Carlo codes. The capability of the Geant4 toolkit to describe realistically the fraction of electrons backscattered from a target volume is extensively and quantitatively evaluated in comparison with experimental data retrieved from the literature. The validation test covers the energy range between approximately 100 eV and 20 MeV, and concerns a wide set of target elements. Multiple and single electron scattering models implemented in Geant4, as well as preassembled selections of physics models distributed within Geant4, are analyzed with statistical methods. The evaluations concern Geant4 versions from 9.1 to 10.1. Significant evolutions are observed over the range of Geant4 versions, not always in the direction of better compatibility with experiment. Goodness-of-fit tests complemented by categorical analysis tests identify a configuration based on Geant4 Urban multiple scattering model in Geant4 vers...

  19. Reliability and validity of two isometric squat tests.

    Science.gov (United States)

    Blazevich, Anthony J; Gill, Nicholas; Newton, Robert U

    2002-05-01

    The purpose of the present study was first to examine the reliability of isometric squat (IS) and isometric forward hack squat (IFHS) tests to determine if repeated measures on the same subjects yielded reliable results. The second purpose was to examine the relation between isometric and dynamic measures of strength to assess validity. Fourteen male subjects performed maximal IS and IFHS tests on 2 occasions and 1 repetition maximum (1-RM) free-weight squat and forward hack squat (FHS) tests on 1 occasion. The 2 tests were found to be highly reliable (intraclass correlation coefficient [ICC](IS) = 0.97 and ICC(IFHS) = 1.00). There was a strong relation between average IS and 1-RM squat performance, and between IFHS and 1-RM FHS performance (r(squat) = 0.77, r(FHS) = 0.76; p squat and FHS test performances (r squat and FHS test performance can be attributed to differences in the movement patterns of the tests

  20. Construct validity of the Free and Cued Selective Reminding Test in older adults with memory complaints.

    Science.gov (United States)

    Clerici, Francesca; Ghiretti, Roberta; Di Pucchio, Alessandra; Pomati, Simone; Cucumo, Valentina; Marcone, Alessandra; Vanacore, Nicola; Mariani, Claudio; Cappa, Stefano Francesco

    2017-06-01

    The Free and Cued Selective Reminding Test (FCSRT) is the memory test recommended by the International Working Group on Alzheimer's disease (AD) for the detection of amnestic syndrome of the medial temporal type in prodromal AD. Assessing the construct validity and internal consistency of the Italian version of the FCSRT is thus crucial. The FCSRT was administered to 338 community-dwelling participants with memory complaints (57% females, age 74.5 ± 7.7 years), including 34 with AD, 203 with Mild Cognitive Impairment, and 101 with Subjective Memory Impairment. Internal Consistency was estimated using Cronbach's alpha coefficient. To assess convergent validity, five FCSRT scores (Immediate Free Recall, Immediate Total Recall, Delayed Free Recall, Delayed Total Recall, and Index of Sensitivity of Cueing) were correlated with three well-validated memory tests: Story Recall, Rey Auditory Verbal Learning test, and Rey Complex Figure (RCF) recall (partial correlation analysis). To assess divergent validity, a principal component analysis (an exploratory factor analysis) was performed including, in addition to the above-mentioned memory tasks, the following tests: Word Fluencies, RCF copy, Clock Drawing Test, Trail Making Test, Frontal Assessment Battery, Raven Coloured Progressive Matrices, and Stroop Colour-Word Test. Cronbach's alpha coefficients for immediate recalls (IFR and ITR) and delayed recalls (DFR and DTR) were, respectively, .84 and .81. All FCSRT scores were highly correlated with those of the three well-validated memory tests. The factor analysis showed that the FCSRT does not load on the factors saturated by non-memory tests. These findings indicate that the FCSRT has a good internal consistency and has an excellent construct validity as an episodic memory measure. © 2015 The British Psychological Society.

  1. Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

    Science.gov (United States)

    Rae, James R.; Olson, Kristina R.

    2018-01-01

    The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…

  2. Validity and Reliability Testing of an e-learning Questionnaire for Chemistry Instruction

    Science.gov (United States)

    Guspatni, G.; Kurniawati, Y.

    2018-04-01

    The aim of this paper is to examine validity and reliability of a questionnaire used to evaluate e-learning implementation in chemistry instruction. 48 questionnaires were filled in by students who had studied chemistry through e-learning system. The questionnaire consisted of 20 indicators evaluating students’ perception on using e-learning. Parametric testing was done as data were assumed to follow normal distribution. Item validity of the questionnaire was examined through item-total correlation using Pearson’s formula while its reliability was assessed with Cronbach’s alpha formula. Moreover, convergent validity was assessed to see whether indicators building a factor had theoretically the same underlying construct. The result of validity testing revealed 19 valid indicators while the result of reliability testing revealed Cronbach’s alpha value of .886. The result of factor analysis showed that questionnaire consisted of five factors, and each of them had indicators building the same construct. This article shows the importance of factor analysis to get a construct valid questionnaire before it is used as research instrument.

  3. Real time risk analysis of kick detection: Testing and validation

    International Nuclear Information System (INIS)

    Islam, Rakibul; Khan, Faisal; Venkatesan, Ramchandran

    2017-01-01

    Oil and gas development is moving into harsh and remote locations where the highest level of safety is required. A blowout is one of the most feared accidents in oil and gas developments projects. The main objective of this paper is to test and validate the kick detection of blowout risk assessment model using uniquely developed experimental results. Kick detection is a major part of the blowout risk assessment model. The accuracy and timeliness of kick detection are dependent on the monitoring of multiple downhole parameters such as downhole pressure, fluid density, fluid conductivity and mass flow rate. In the present study these four parameters are considered in different logical combinations to assess the occurrence of kick and associated blowout risk. The assessed results are compared against the experimental observations. It is observed that simultaneous monitoring of mass flow rate combined with any one the three parameters provides most reliable detection of kick and potential blowout likelihood. The current work presents the framework for a dynamic risk assessment and management model. Upon success testing of this approach at the pilot and field levels, this approach could provide a paradigm shift in drilling safety. - Highlights: • A novel dynamic risk model of kick detection and blowout prediction. • Testing and Validation of the risk model. • Application of the dynamic risk model.

  4. [Attempt for development of rapid word reading test for children--evaluation of reliability and validity].

    Science.gov (United States)

    Hashimoto, Ryusaku; Kashiwagi, Mitsuru; Suzuki, Shuhei

    2008-09-01

    We developed a rapid word reading test for examining the phonological processing ability of Japanese children. We prepared two versions of the test, version A and B. Each test has word and non-word tasks. Twenty-two healthy boys of third grade in primary schools participated in this validation study. For criterion related validity, we performed the serial Hiragana reading test, the sentence reading test, Raven's coloured progressive matrices (RCPM), the Token test for children, the Kana word dictation test, the standardized comprehension test of abstract words (SCTAW), and Trail Circle test. The reading times of the newly developed test correlated moderately or highly with those of the serial Hiragana reading test and the sentence reading test. However, the scores of the other tests (RCPM, Token test for children, Kana word dictation test, SCTAW, Trail Circle test) did not correlated with the reading time of the rapid word reading test. Test-retest reliabilities in the word tasks were more than moderate: 0.52 and 0.76 in versions A and B, while those in the non-word tasks were high: 0.91 and 0.88 in versions A and B. The correlation coefficient between versions A and B was 0.7 for the word tasks and 0.92 for the non-word tasks. This study showed that the rapid word reading test has substantial validity and reliability for testing the phonological processing ability of Japanese children. In addition, the non-word tasks were more suitable for selectively examining the speed of the grapheme to phoneme conversion process.

  5. [Reliability and validity of the Chinese version on Alcohol Use Disorders Identification Test].

    Science.gov (United States)

    Zhang, C; Yang, G P; Li, Z; Li, X N; Li, Y; Hu, J; Zhang, F Y; Zhang, X J

    2017-08-10

    Objective: To assess the reliability and validity of the Chinese version on Alcohol Use Disorders Identification Test (AUDIT) among medical students in China and to provide correct way of application on the recommended scales. Methods: An E-questionnaire was developed and sent to medical students in five different colleges. Students were all active volunteers to accept the testings. Cronbach's α and split-half reliability were calculated to evaluate the reliability of AUDIT while content, contract, discriminant and convergent validity were performed to measure the validity of the scales. Results: The overall Cronbach's α of AUDIT was 0.782 and the split-half reliability was 0.711. Data showed that the domain Cronbach's α and split-half reliability were 0.796 and 0.794 for hazardous alcohol use, 0.561 and 0.623 for dependence symptoms, and 0.647 and 0.640 for harmful alcohol use. Results also showed that the content validity index on the levels of items I-CVI) were from 0.83 to 1.00, the content validity index of scale level (S-CVI/UA) was 0.90, content validity index of average scale level (S-CVI/Ave) was 0.99 and the content validity ratios (CVR) were from 0.80 to 1.00. The simplified version of AUDIT supported a presupposed three-factor structure which could explain 61.175% of the total variance revealed through exploratory factor analysis. AUDIT semed to have good convergent and discriminant validity, with the success rate of calibration experiment as 100%. Conclusion: AUDIT showed good reliability and validity among medical students in China thus worth for promotion on its use.

  6. Independent verification and validation testing of the FLASH computer code, Versiion 3.0

    International Nuclear Information System (INIS)

    Martian, P.; Chung, J.N.

    1992-06-01

    Independent testing of the FLASH computer code, Version 3.0, was conducted to determine if the code is ready for use in hydrological and environmental studies at various Department of Energy sites. This report describes the technical basis, approach, and results of this testing. Verification tests, and validation tests, were used to determine the operational status of the FLASH computer code. These tests were specifically designed to test: correctness of the FORTRAN coding, computational accuracy, and suitability to simulating actual hydrologic conditions. This testing was performed using a structured evaluation protocol which consisted of: blind testing, independent applications, and graduated difficulty of test cases. Both quantitative and qualitative testing was performed through evaluating relative root mean square values and graphical comparisons of the numerical, analytical, and experimental data. Four verification test were used to check the computational accuracy and correctness of the FORTRAN coding, and three validation tests were used to check the suitability to simulating actual conditions. These tests cases ranged in complexity from simple 1-D saturated flow to 2-D variably saturated problems. The verification tests showed excellent quantitative agreement between the FLASH results and analytical solutions. The validation tests showed good qualitative agreement with the experimental data. Based on the results of this testing, it was concluded that the FLASH code is a versatile and powerful two-dimensional analysis tool for fluid flow. In conclusion, all aspects of the code that were tested, except for the unit gradient bottom boundary condition, were found to be fully operational and ready for use in hydrological and environmental studies

  7. Alternatives to animal testing: research, trends, validation, regulatory acceptance.

    Science.gov (United States)

    Huggins, Jane

    2003-01-01

    Current trends and issues in the development of alternatives to the use of animals in biomedical experimentation are discussed in this position paper. Eight topics are considered and include refinement of acute toxicity assays; eye corrosion/irritation alternatives; skin corrosion/irritation alternatives; contact sensitization alternatives; developmental/reproductive testing alternatives; genetic engineering (transgenic) assays; toxicogenomics; and validation of alternative methods. The discussion of refinement of acute toxicity assays is focused primarily on developments with regard to reduction of the number of animals used in the LD(50) assay. However, the substitution of humane endpoints such as clinical signs of toxicity for lethality in these assays is also evaluated. Alternative assays for eye corrosion/irritation as well as those for skin corrosion/irritation are described with particular attention paid to the outcomes, both successful and unsuccessful, of several validation efforts. Alternative assays for contact sensitization and developmental/reproductive toxicity are presented as examples of methods designed for the examination of interactions between toxins and somewhat more complex physiological systems. Moreover, genetic engineering and toxicogenomics are discussed with an eye toward the future of biological experimentation in general. The implications of gene manipulation for research animals, specifically, are also examined. Finally, validation methods are investigated as to their effectiveness, or lack thereof, and suggestions for their standardization and improvement, as well as implementation are reviewed.

  8. Test-retest reliability and predictive validity of the Implicit Association Test in children.

    Science.gov (United States)

    Rae, James R; Olson, Kristina R

    2018-02-01

    The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many factors simultaneously (lag-time between testing administrations, domain, etc.), it is difficult to discern what factors may explain variability in existing test-retest reliability and predictive validity estimates. Across five studies (total N = 519; ages 6- to 11-years-old), we manipulated two factors that have varied in previous developmental research-lag-time and domain. An internal meta-analysis of these studies revealed that, across three different methods of analyzing the data, mean test-retest (rs of .48, .38, and .34) and predictive validity (rs of .46, .20, and .10) effect sizes were significantly greater than zero. While lag-time did not moderate the magnitude of test-retest coefficients, whether we observed domain differences in test-retest reliability and predictive validity estimates was contingent on other factors, such as how we scored the IAT or whether we included estimates from a unique sample (i.e., a sample containing gender typical and gender diverse children). Recommendations are made for developmental researchers that utilize the IAT in their research. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  9. How Can Consumers Be Sure a Genetic Test Is Valid and Useful?

    Science.gov (United States)

    ... a genetic test is valid and useful? How can consumers be sure a genetic test is valid ... particular gene or genetic change. In other words, can the test accurately detect whether a specific genetic ...

  10. Dynamic testing in schizophrenia: does training change the construct validity of a test?

    Science.gov (United States)

    Wiedl, Karl H; Schöttke, Henning; Green, Michael F; Nuechterlein, Keith H

    2004-01-01

    Dynamic testing typically involves specific interventions for a test to assess the extent to which test performance can be modified, beyond level of baseline (static) performance. This study used a dynamic version of the Wisconsin Card Sorting Test (WCST) that is based on cognitive remediation techniques within a test-training-test procedure. From results of previous studies with schizophrenia patients, we concluded that the dynamic and static versions of the WCST should have different construct validity. This hypothesis was tested by examining the patterns of correlations with measures of executive functioning, secondary verbal memory, and verbal intelligence. Results demonstrated a specific construct validity of WCST dynamic (i.e., posttest) scores as an index of problem solving (Tower of Hanoi) and secondary verbal memory and learning (Auditory Verbal Learning Test), whereas the impact of general verbal capacity and selective attention (Verbal IQ, Stroop Test) was reduced. It is concluded that the construct validity of the test changes with dynamic administration and that this difference helps to explain why the dynamic version of the WCST predicts functional outcome better than the static version.

  11. POLYGON - A NEW FUNDAMENTAL MOVEMENT SKILLS TEST FOR 8 YEAR OLD CHILDREN: CONSTRUCTION AND VALIDATION

    Directory of Open Access Journals (Sweden)

    Frane Zuvela

    2011-03-01

    Full Text Available Inadequately adopted fundamental movement skills (FMS in early childhood may have a negative impact on the motor performance in later life (Gallahue and Ozmun, 2005. The need for an efficient FMS testing in Physical Education was recognized. The aim of this paper was to construct and validate a new FMS test for 8 year old children. Ninety-five 8 year old children were used for the testing. A total of 24 new FMS tasks were constructed and only the best representatives of movement areas entered into the final test product - FMS-POLYGON. The ICC showed high values for all 24 tasks (0.83-0.97 and the factorial analysis revealed the best representatives of each movement area that entered the FMS-POLYGON: tossing and catching the volleyball against a wall, running across obstacles, carrying the medicine balls, and straight running. The ICC for the FMS-POLYGON showed a very high result (0.98 and, therefore, confirmed the test's intra-rater reliability. Concurrent validity was tested with the use of the "Test of Gross Motor Development" (TGMD-2. Correlation analysis between the newly constructed FMS-POLYGON and the TGMD-2 revealed the coefficient of -0.82 which indicates a high correlation. In conclusion, the new test for FMS assessment proved to be a reliable and valid instrument for 8 year old children. Application of this test in schools is justified and could play an important factor in physical education and sport practice.

  12. COMMUNICATIVE VALIDITY OF THE NEW CET-4 LISTENING COMPREHENSION TEST IN CHINA

    Directory of Open Access Journals (Sweden)

    Chao Wang

    2014-07-01

    Full Text Available Abstract: Based on the major dimensions of a communicative language test that Bachman proposed, this paper aims to have an investigation on the validity of the new CET-4 listening subtest in China from a communicative point of view. Both qualitative and quantitative methods are involved in the study. Material analysis falls into qualitative study, including analysis of the CET-4 testing syllabus and eight new CET-4 listening comprehension tests. Students’ scores of two tests and the questionnaires are analyzed quantitatively. Through analysis, it is found that the new CET-4 listening subtest has a high validity and can measure test-takers’ listening ability in real communication. First, the new CET-4 listening subtest has the quality of reliability. Second, the seven listening skills tested in this subtest can measure the communicative language ability required in the testing syllabus. The intra-correlation analysis shows that each part of the new CET-4 listening subtest focuses on different language abilities related to listening. Third, the authenticity of the new CET-4 listening subtest reaches a satisfactory level. The materials chosen in the test cover various topics and genres. Speakers’ pronunciation, tone and speed are in accordance with the real situation. However, some shortcomings also exist in the test design and should be improved later. For example, its limited item types cannot represent the task types in real life, and the actual input is too ideal to be authentic.   Keywords: Communicative language ability, communicative language testing, listening comprehension, test validity

  13. Validating a dance-specific screening test for balance: preliminary results from multisite testing.

    Science.gov (United States)

    Batson, Glenna

    2010-09-01

    Few dance-specific screening tools adequately capture balance. The aim of this study was to administer and modify the Star Excursion Balance Test (oSEBT) to examine its utility as a balance screen for dancers. The oSEBT involves standing on one leg while lightly targeting with the opposite foot to the farthest distance along eight spokes of a star-shaped grid. This task simulates dance in the spatial pattern and movement quality of the gesturing limb. The oSEBT was validated for distance on athletes with history of ankle sprain. Thirty-three dancers (age 20.1 +/- 1.4 yrs) participated from two contemporary dance conservatories (UK and US), with or without a history of lower extremity injury. Dancers were verbally instructed (without physical demonstration) to execute the oSEBT and four modifications (mSEBT): timed (speed), timed with cognitive interference (answering questions aloud), and sensory disadvantaging (foam mat). Stepping strategies were tracked and performance strategies video-recorded. Unlike the oSEBT results, distances reached were not significant statistically (p = 0.05) or descriptively (i.e., shorter) for either group. Performance styles varied widely, despite sample homogeneity and instructions to control for strategy. Descriptive analysis of mSEBT showed an increased number of near-falls and decreased timing on the injured limb. Dancers appeared to employ variable strategies to keep balance during this test. Quantitative analysis is warranted to define balance strategies for further validation of SEBT modifications to determine its utility as a balance screening tool.

  14. In vitro and ex vivo testing of tenofovir shows it is effective as an HIV-1 microbicide.

    Directory of Open Access Journals (Sweden)

    Lisa C Rohan

    2010-02-01

    Full Text Available Tenofovir gel has entered into clinical trials for use as a topical microbicide to prevent HIV-1 infection but has no published data regarding pre-clinical testing using in vitro and ex vivo models. To validate our findings with on-going clinical trial results, we evaluated topical tenofovir gel for safety and efficacy. We also modeled systemic application of tenofovir for efficacy.Formulation assessment of tenofovir gel included osmolality, viscosity, in vitro release, and permeability testing. Safety was evaluated by measuring the effect on the viability of vaginal flora, PBMCs, epithelial cells, and ectocervical and colorectal explant tissues. For efficacy testing, PBMCs were cultured with tenofovir or vehicle control gels and HIV-1 representing subtypes A, B, and C. Additionally, polarized ectocervical and colorectal explant cultures were treated apically with either gel. Tenofovir was added basolaterally to simulate systemic application. All tissues were challenged with HIV-1 applied apically. Infection was assessed by measuring p24 by ELISA on collected supernatants and immunohistochemistry for ectocervical explants. Formulation testing showed the tenofovir and vehicle control gels were >10 times isosmolar. Permeability through ectocervical tissue was variable but in all cases the receptor compartment drug concentration reached levels that inhibit HIV-1 infection in vitro. The gels were non-toxic toward vaginal flora, PBMCs, or epithelial cells. A transient reduction in epithelial monolayer integrity and epithelial fracture for ectocervical and colorectal explants was noted and likely due to the hyperosmolar nature of the formulation. Tenofovir gel prevented HIV-1 infection of PBMCs regardless of HIV-1 subtype. Topical and systemic tenofovir were effective at preventing HIV-1 infection of explant cultures.These studies provide a mechanism for pre-clinical prediction of safety and efficacy of formulated microbicides. Tenofovir was effective

  15. Microscale validation of 4-aminoantipyrine test method for quantifying phenolic compounds in microbial culture

    International Nuclear Information System (INIS)

    Justiz Mendoza, Ibrahin; Aguilera Rodriguez, Isabel; Perez Portuondo, Irasema

    2014-01-01

    Validation of test methods microscale is currently of great importance due to the economic and environmental advantages possessed, which constitutes a prerequisite for the performance of services and quality assurance of the results to provide customer. This paper addresses the microscale validation of 4-aminoantipyrine spectrophotometric method for the quantification of phenolic compounds in culture medium. Parameters linearity, precision, regression, accuracy, detection limits, quantification limits and robustness were evaluated, addition to the comparison test with no standardized method for determining polyphenols (Folin Ciocalteu). The results showed that both methods are feasible for determining phenols

  16. Geant4 hadronic and electromagnetic validation tests in LHCb

    CERN Document Server

    Griffith, Peter Noel

    2016-01-01

    LHCb uses Geant4 to simulate the interactions of particles with the detector material and components. The simulation response can vary significantly due to modification of material description, of detector geometry, or of the Geant4 toolkit itself. Therefore, an extensive variety of tools have been developed to study the effects of Geant4 modification on the LHCb simulation framework and on stand-alone environments within the LHCb software infrastructure. These tools have proven to be very effective for investigating new and alternative models provided by Geant4, and also in identifying and fixing anomalous behaviours that arise from changes. The next goal is to have these validation tests run autonomously and periodically, alerting the relevant users when problems are detected. Quick and easy comparison of the results from different software versions and simulation models will be made possible through the web interface of the LHCb Performance and Regression testing system, LHCbPR.

  17. Reliability and Validity of the Inline Skating Skill Test

    Directory of Open Access Journals (Sweden)

    Ivan Radman, Lana Ruzic, Viktoria Padovan, Vjekoslav Cigrovski, Hrvoje Podnar

    2016-09-01

    Full Text Available This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male were randomized into two groups (competitive level vs. recreational level. They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8–2.6%] – 2.2% [95% CI: 0.0–4.5%]. In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2–2.4%] – 2.7% [95% CI: 2.1–4.0%] and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92–0.99] – 0.99 [95% CI: 0.98–1.00]. The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2 revealed that the skill test was able to track changes in skaters’ performances. Competitive-level skaters needed shorter time (24.4–26.4%, all p < 0.01 to complete the test in comparison to recreational-level skaters. Moreover, moderate correlation (ρ = 0.80–0.82; all p < 0.01 was observed between the participant’s self-rating and achieved performance times. In conclusion, the proposed test is a reliable and valid method to evaluate inline skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.

  18. Validation and structural analysis of the kinematics concept test

    Directory of Open Access Journals (Sweden)

    A. Lichtenberger

    2017-04-01

    Full Text Available The kinematics concept test (KCT is a multiple-choice test designed to evaluate students’ conceptual understanding of kinematics at the high school level. The test comprises 49 multiple-choice items about velocity and acceleration, which are based on seven kinematic concepts and which make use of three different representations. In the first part of this article we describe the development and the validation process of the KCT. We applied the KCT to 338 Swiss high school students who attended traditional teaching in kinematics. We analyzed the response data to provide the psychometric properties of the test. In the second part we present the results of a structural analysis of the test. An exploratory factor analysis of 664 student answers finally uncovered the seven kinematics concepts as factors. However, the analysis revealed a hierarchical structure of concepts. At the higher level, mathematical concepts group together, and then split up into physics concepts at the lower level. Furthermore, students who seem to understand a concept in one representation have difficulties transferring the concept to similar problems in another representation. Both results have implications for teaching kinematics. First, teaching mathematical concepts beforehand might be beneficial for learning kinematics. Second, instructions have to be designed to teach students the change between different representations.

  19. Validation and structural analysis of the kinematics concept test

    Science.gov (United States)

    Lichtenberger, A.; Wagner, C.; Hofer, S. I.; Stern, E.; Vaterlaus, A.

    2017-06-01

    The kinematics concept test (KCT) is a multiple-choice test designed to evaluate students' conceptual understanding of kinematics at the high school level. The test comprises 49 multiple-choice items about velocity and acceleration, which are based on seven kinematic concepts and which make use of three different representations. In the first part of this article we describe the development and the validation process of the KCT. We applied the KCT to 338 Swiss high school students who attended traditional teaching in kinematics. We analyzed the response data to provide the psychometric properties of the test. In the second part we present the results of a structural analysis of the test. An exploratory factor analysis of 664 student answers finally uncovered the seven kinematics concepts as factors. However, the analysis revealed a hierarchical structure of concepts. At the higher level, mathematical concepts group together, and then split up into physics concepts at the lower level. Furthermore, students who seem to understand a concept in one representation have difficulties transferring the concept to similar problems in another representation. Both results have implications for teaching kinematics. First, teaching mathematical concepts beforehand might be beneficial for learning kinematics. Second, instructions have to be designed to teach students the change between different representations.

  20. Validation of artificial skin equivalents as in vitro testing systems

    Science.gov (United States)

    Schmitt, Robert; Marx, Ulrich; Walles, Heike; Schober, Lena

    2011-03-01

    With the increasing complexity of the chemical composition of pharmaceuticals, cosmetics and everyday substances, the awareness of potential health issues and long term damages for humanoid organs is shifting into focus. Artificial in vitro testing systems play an important role in providing reliable test conditions and replacing precarious animal testing. Especially artificial skin equivalents ASEs are used for a broad spectrum of studies like penetration, irritation and corrosion of substances. One major challenge in tissue engineering is the qualification of each individual ASE as in vitro testing system. Due to biological fluctuations, the stratum corneum hornified layer of some ASEs may not fully develop or other defects might occur. For monitoring these effects we developed an fully automated Optical Coherence Tomography device. Here, we present different methods to characterize and evaluate the quality of the ASEs based on image and data processing of OCT B-scans. By analysing the surface structure, defects, like cuts or tears, are detectable. A further indicator for the quality of the ASE is the morphology of the tissue. This allows to determine if the skin model has reached the final growth state. We found, that OCT is a well suited technology for automatically characterizing artificial skin equivalents and validating the application as testing system.

  1. Compressive strength test for cemented waste forms: validation process

    International Nuclear Information System (INIS)

    Haucz, Maria Judite A.; Candido, Francisco Donizete; Seles, Sandro Rogerio

    2007-01-01

    In the Cementation Laboratory (LABCIM), of the Development Centre of the Nuclear Technology (CNEN/CDTN-MG), hazardous/radioactive wastes are incorporated in cement, to transform them into monolithic products, preventing or minimizing the contaminant release to the environment. The compressive strength test is important to evaluate the cemented product quality, in which it is determined the compression load necessary to rupture the cemented waste form. In LABCIM a specific procedure was developed to determine the compressive strength of cement waste forms based on the Brazilian Standard NBR 7215. The accreditation of this procedure is essential to assure reproductive and accurate results in the evaluation of these products. To achieve this goal the Laboratory personal implemented technical and administrative improvements in accordance with the NBR ISO/IEC 17025 standard 'General requirements for the competence of testing and calibration laboratories'. As the developed procedure was not a standard one the norm ISO/IEC 17025 requests its validation. There are some methodologies to do that. In this paper it is described the current status of the accreditation project, especially the validation process of the referred procedure and its results. (author)

  2. Six-minute stepper test: a valid clinical exercise tolerance test for COPD patients

    Directory of Open Access Journals (Sweden)

    Grosbois JM

    2016-03-01

    .005. Performances on the 6MST and 6MWT were significantly improved after PR (570 vs 488 steps, P=0.001 and 448 vs 406 m, respectively; P<0.0001. Improvements of the 6MST and 6MWT after PR were significantly correlated (r=0.34; P=0.03.Conclusion: The results of this study show that the 6MST is a valid test to evaluate exercise tolerance in COPD patients. The use of this test in clinical practice appears to be particularly relevant for the assessment of patients managed by home PR. Keywords: 6-minute stepper test, 6-minute walk test, exercise tolerance, pulmonary rehabilitation, cardiopulmonary exercise testing, validity

  3. Development and Validation of a Lifecycle-based Prognostics Architecture with Test Bed Validation

    Energy Technology Data Exchange (ETDEWEB)

    Hines, J. Wesley [Univ. of Tennessee, Knoxville, TN (United States); Upadhyaya, Belle [Univ. of Tennessee, Knoxville, TN (United States); Sharp, Michael [Univ. of Tennessee, Knoxville, TN (United States); Ramuhalli, Pradeep [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Jeffries, Brien [Univ. of Tennessee, Knoxville, TN (United States); Nam, Alan [Univ. of Tennessee, Knoxville, TN (United States); Strong, Eric [Univ. of Tennessee, Knoxville, TN (United States); Tong, Matthew [Univ. of Tennessee, Knoxville, TN (United States); Welz, Zachary [Univ. of Tennessee, Knoxville, TN (United States); Barbieri, Federico [Univ. of Tennessee, Knoxville, TN (United States); Langford, Seth [Univ. of Tennessee, Knoxville, TN (United States); Meinweiser, Gregory [Univ. of Tennessee, Knoxville, TN (United States); Weeks, Matthew [Univ. of Tennessee, Knoxville, TN (United States)

    2014-11-06

    On-line monitoring and tracking of nuclear plant system and component degradation is being investigated as a method for improving the safety, reliability, and maintainability of aging nuclear power plants. Accurate prediction of the current degradation state of system components and structures is important for accurate estimates of their remaining useful life (RUL). The correct quantification and propagation of both the measurement uncertainty and model uncertainty is necessary for quantifying the uncertainty of the RUL prediction. This research project developed and validated methods to perform RUL estimation throughout the lifecycle of plant components. Prognostic methods should seamlessly operate from beginning of component life (BOL) to end of component life (EOL). We term this "Lifecycle Prognostics." When a component is put into use, the only information available may be past failure times of similar components used in similar conditions, and the predicted failure distribution can be estimated with reliability methods such as Weibull Analysis (Type I Prognostics). As the component operates, it begins to degrade and consume its available life. This life consumption may be a function of system stresses, and the failure distribution should be updated to account for the system operational stress levels (Type II Prognostics). When degradation becomes apparent, this information can be used to again improve the RUL estimate (Type III Prognostics). This research focused on developing prognostics algorithms for the three types of prognostics, developing uncertainty quantification methods for each of the algorithms, and, most importantly, developing a framework using Bayesian methods to transition between prognostic model types and update failure distribution estimates as new information becomes available. The developed methods were then validated on a range of accelerated degradation test beds. The ultimate goal of prognostics is to provide an accurate assessment for

  4. Development and Validation of the Cognition Test Battery for Spaceflight.

    Science.gov (United States)

    Basner, Mathias; Savitt, Adam; Moore, Tyler M; Port, Allison M; McGuire, Sarah; Ecker, Adrian J; Nasrini, Jad; Mollicone, Daniel J; Mott, Christopher M; McCann, Thom; Dinges, David F; Gur, Ruben C

    2015-11-01

    Sustained high-level cognitive performance is of paramount importance for the success of space missions, which involve environmental, physiological, and psychological stressors that may affect brain functions. Despite subjective symptom reports of cognitive fluctuations in spaceflight, the nature of neurobehavioral functioning in space has not been clarified. We developed a computerized cognitive test battery (Cognition) that has sensitivity to multiple cognitive domains and was specifically designed for the high-performing astronaut population. Cognition consists of 15 unique forms of 10 neuropsychological tests that cover a range of cognitive domains, including emotion processing, spatial orientation, and risk decision making. Cognition is based on tests known to engage specific brain regions as evidenced by functional neuroimaging. Here we describe the first normative and acute total sleep deprivation data on the Cognition test battery as well as several efforts underway to establish the validity, sensitivity, feasibility, and acceptability of Cognition. Practice effects and test-retest variability differed substantially between the 10 Cognition tests, illustrating the importance of normative data that both reflect practice effects and differences in stimulus set difficulty in the population of interest. After one night without sleep, medium to large effect sizes were observed for 3 of the 10 tests addressing vigilant attention (Cohen's d = 1.00), cognitive throughput (d = 0.68), and abstract reasoning (d = 0.65). In addition to providing neuroimaging-based novel information on the effects of spaceflight on a range of cognitive functions, Cognition will facilitate comparing the effects of ground-based analogues to spaceflight, increase consistency across projects, and thus enable meta-analyses.

  5. Development and Validation of the Cognition Test Battery for Spaceflight

    Science.gov (United States)

    Basner, Mathias; Savitt, Adam; Moore, Tyler M.; Port, Allison M.; McGuire, Sarah; Ecker, Adrian J.; Nasrini, Jad; Mollicone, Daniel J.; Mott, Christopher M.; McCann, Thom; Dinges, David F.; Gur, Ruben C.

    2015-01-01

    Background Sustained high-level cognitive performance is of paramount importance for the success of space missions, which involve environmental, physiological and psychological stressors that may affect brain functions. Despite subjective symptom reports of cognitive fluctuations in spaceflight, the nature of neurobehavioral functioning in space has not been clarified. Methods We developed a computerized cognitive test battery (Cognition) that has sensitivity to multiple cognitive domains and was specifically designed for the high-performing astronaut population. Cognition consists of 15 unique forms of 10 neuropsychological tests that cover a range of cognitive domains including emotion processing, spatial orientation, and risk decision making. Cognition is based on tests known to engage specific brain regions as evidenced by functional neuroimaging. Here we describe the first normative and acute total sleep deprivation data on the Cognition test battery as well as several efforts underway to establish the validity, sensitivity, feasibility, and acceptability of Cognition. Results Practice effects and test-retest variability differed substantially between the 10 Cognition tests, illustrating the importance of normative data that both reflect practice effects and differences in stimulus set difficulty in the population of interest. After one night without sleep, medium to large effect sizes were observed for 3 of the 10 tests addressing vigilant attention (Cohen’s d=1.00), cognitive throughput (d=0.68), and abstract reasoning (d=0.65). Conclusions In addition to providing neuroimaging-based novel information on the effects of spaceflight on a range of cognitive functions, Cognition will facilitate comparing the effects of ground-based analogs to spaceflight, increase consistency across projects, and thus enable meta-analyses. PMID:26564759

  6. Tests for validation of fast neutron reactors safety

    International Nuclear Information System (INIS)

    Nagata, T.; Yamashita, H.

    2001-01-01

    Japanese scientific research and design enterprises in cooperation with industrial and power generating corporations implement a project on creating a fast neutron reactor of the ultimate safety. One of the basic expected results from such a development is creation of a reactor core structure that is able to eliminate recriticality occurrence in the course of reactor accident involving fuel melting. One of the possible ways to solve this problem is to include pipes (meant for specifying directed (controlled) molten fuel relocation) into fuel assembly structure. In the course of conduction and subsequent implementation of such a design the basic issue is to experimentally confirm the operating capacity of FA having such a structure and that is called FAIDUS. Within EAGLE Project on experimental basis of IAE NNC RK an activity has been started on preparation and conduction of out-of-pile and in-pile tests. During tests a sodium coolant will be used. Studies are conducted by NNC RK in cooperation with the Japanese corporations JAPC and JNC. Basic objective of out-of-pile tests was to obtain preliminary information on fuel relocation behavior under conditions simulating accident involving melting of core consisting of FAIDUS FA, which will help to clarify simulation criteria and to develop the most optimum structure of the experimental channel for reactor experiments conduction. The basic objective of in-pile tests was the experimental confirmation of operating capacity of FAIDUS FA model under reactor conditions. According to the program two tests are planned to be performed at IGR reactor: tests for validation of fast neutron reactor safety, and out-of-pile tests at EAGLE experimental facility without sodium coolant

  7. Development and validation of a new cognitive screening test: The Hong Kong Brief Cognitive Test (HKBC).

    Science.gov (United States)

    Chiu, Helen F K; Zhong, Bao-Liang; Leung, Tony; Li, S W; Chow, Paulina; Tsoh, Joshua; Yan, Connie; Xiang, Yu-Tao; Wong, Mike

    2018-07-01

    To develop and examine the validity of a new brief cognitive test with less educational bias for screening cognitive impairment. A new cognitive test, Hong Kong Brief Cognitive Test (HKBC), was developed based on review of the literature, as well as the views of an expert panel. Three groups of subjects aged 65 or above were recruited after written consent: normal older people recruited in elderly centres, people with mild NCD (neurocognitive disorder), and people with major NCD. The brief cognitive test, Mini-Mental State Examination (MMSE) and Montreal Cognitive Assessment Scale (MoCA), were administered to the subjects. The performance of HKBC in differentiating subjects with major NCD, mild NCD, and normal older people were compared with the clinical diagnosis, as well as the MMSE and MoCA scores. In total, 359 subjects were recruited, with 99 normal controls, 132 subjects with major NCD, and 128 with mild NCD. The mean MMSE, MoCA, and HKBC scores showed significant differences among the 3 groups of subjects. In the receiving operating characteristic curve analysis of the HKBC in differentiating normal subjects from those with cognitive impairment (mild NCD + major NCD), the area under the curve was 0.955 with an optimal cut-off score of 21/22. The performances of MMSE and MoCA in differentiating normal from cognitively impaired subjects are slightly inferior to the HKBC. The HKBC is a brief instrument useful for screening cognitive impairment in older adults and is also useful in populations with low educational level. Copyright © 2018 John Wiley & Sons, Ltd.

  8. Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

    Directory of Open Access Journals (Sweden)

    Penny Moss

    Full Text Available Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot. Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%. Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56 years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%. Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add

  9. Quantitative and Qualitative Responses to Topical Cold in Healthy Caucasians Show Variance between Individuals but High Test-Retest Reliability.

    Science.gov (United States)

    Moss, Penny; Whitnell, Jasmine; Wright, Anthony

    2016-01-01

    Increased sensitivity to cold may be a predictor of persistent pain, but cold pain threshold is often viewed as unreliable. This study aimed to determine the within-subject reliability and between-subject variance of cold response, measured comprehensively as cold pain threshold plus pain intensity and sensation quality at threshold. A test-retest design was used over three sessions, one day apart. Response to cold was assessed at four sites (thenar eminence, volar forearm, tibialis anterior, plantar foot). Cold pain threshold was measured using a Medoc thermode and standard method of limits. Intensity of pain at threshold was rated using a 10cm visual analogue scale. Quality of sensation at threshold was quantified with indices calculated from subjects' selection of descriptors from a standard McGill Pain Questionnaire. Within-subject reliability for each measure was calculated with intra-class correlation coefficients and between-subject variance was evaluated as group coefficient of variation percentage (CV%). Gender and site comparisons were also made. Forty-five healthy adults participated: 20 male, 25 female; mean age 29 (range 18-56) years. All measures at all four test sites showed high within-subject reliability: cold pain thresholds r = 0.92-0.95; pain rating r = 0.93-0.97; McGill pain quality indices r = 0.87-0.85. In contrast, all measures showed wide between-subject variance (CV% between 51.4% and 92.5%). Upper limb sites were consistently more sensitive than lower limb sites, but equally reliable. Females showed elevated cold pain thresholds, although similar pain intensity and quality to males. Females were also more reliable and showed lower variance for all measures. Thus, although there was clear population variation, response to cold for healthy individuals was found to be highly reliable, whether measured as pain threshold, pain intensity or sensation quality. A comprehensive approach to cold response testing therefore may add validity and

  10. Evaluation of the separate effects tests (SET) validation matrix

    International Nuclear Information System (INIS)

    1996-11-01

    This work is the result of a one year extended mandate which has been given by the CSNI on the request of the PWG 2 and the Task Group on Thermal Hydraulic System Behaviour (TG THSB) in late 1994. The aim was to evaluate the SET validation matrix in order to define the real needs for further experimental work. The statistical evaluation tables of the SET matrix provide an overview of the data base including the parameter ranges covered for each phenomenon and selected parameters, and questions posed to obtain answers concerning the need for additional experimental data with regard to the objective of nuclear power plant safety. A global view of the data base is first presented focussing on areas lacking in data and on hot topics. A new systematic evaluation has been done based on the authors technical judgments and giving evaluation tables. In these tables, global and indicative information are included. Four main parameters have been chosen as the most important and relevant parameters: a state parameter given by the operating pressure of the tests, a flow parameter expressed as mass flux, mass flow rate or volumetric flow rate in the tests, a geometrical parameter provided through a typical dimension expressed by a diameter, an equivalent diameter (hydraulic or heated) or a cross sectional area of the test sections, and an energy or heat transfer parameter given as the fluid temperature, the heat flux or the heat transfer surface temperature of the tests

  11. Validation of the Arabic Version of the Internet Gaming Disorder-20 Test.

    Science.gov (United States)

    Hawi, Nazir S; Samaha, Maya

    2017-04-01

    In recent years, researchers have been trying to shed light on gaming addiction and its association with different psychiatric disorders and psychological determinants. The latest edition version of the American Psychiatric Association's Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) included in its Section 3 Internet Gaming Disorder (IGD) as a condition for further empirical study and proposed nine criteria for the diagnosis of IGD. The 20-item Internet Gaming Disorder (IGD-20) Test was developed as a valid and reliable tool to assess gaming addiction based on the nine criteria set by the DSM-5. The aim of this study is to validate an Arabic version of the IGD-20 Test. The Arabic version of IGD-20 will not only help in identifying Arabic-speaking pathological gamers but also stimulate cross-cultural studies that could contribute to an area in need of more research for insight and treatment. After a process of translation and back-translation and with the participation of a sizable sample of Arabic-speaking adolescents, the present study conducted a psychometric validation of the IGD-20 Test. Our confirmatory factor analysis showed the validity of the Arabic version of the IGD-20 Test. The one-factor model of the Arabic IGD-20 Test had very good psychometric properties, and it fitted the sample data extremely well. In addition, correlation analysis between the IGD-20 Test and the daily duration on weekdays and weekends gameplay revealed significant positive relationships that warranted a criterion-related validation. Thus, the Arabic version of the IGD-20 Test is a valid and reliable measure of IGD among Arabic-speaking populations.

  12. Suitability Screening Test for Marine Corps Air Traffic Controllers Phase 3: Non-cognitive Test Validation and Cognitive Test Prototype

    Science.gov (United States)

    2014-06-01

    developed, pilot tested, and in its Beta form. Findings or Results The subset of NCAPS traits that demonstrated statistically significant prediction for...development and initial pilot testing of the Prototype Marine ATC Cognitive Test. Method The validation approach chosen for this project was a criterion... multitasking ability, and 5) inductive reasoning ability. A working memory capacity test was developed because working memory has been linked to

  13. Test Method Facet and the Construct Validity of Listening Comprehension Tests

    Directory of Open Access Journals (Sweden)

    Roya Khoii

    2010-05-01

    Full Text Available The assessment of listening abilities is one of the least understood, least developed and, yet, one of the most important areas of language testing and assessment. It is particularly important because of its potential wash-back effects on classroom practices. Given the fact that listening tests play a great role in assessing the language proficiency of students, they are expected to enjoy a high level of construct validity. The present study was dedicated to investigating the construct validity of three different test formats, namely, multiple-choice, gap filling on summary (also called listening summary cloze, and fill-in-the-blank, used to evaluate the listening comprehension of EFL learners. In order to achieve the purpose of the study, three passages with relatively similar readability levels were used for the construction of 9 listening tests, that is, each appeared in three formats. Following a counter-balanced design, the tests were administered to 91homogeneous EFL learners divided into three groups. The statistical analysis of the results revealed that the multiple-choice test enjoyed the highest level of construct validity. Moreover, a repeated measure one-way ANOVA demonstrated that the fill-in-the-blank task was the most difficult with the MC test as the easiest for the participants.

  14. Reliability and Validity of the Inline Skating Skill Test.

    Science.gov (United States)

    Radman, Ivan; Ruzic, Lana; Padovan, Viktoria; Cigrovski, Vjekoslav; Podnar, Hrvoje

    2016-09-01

    This study aimed to examine the reliability and validity of the inline skating skill test. Based on previous skating experience forty-two skaters (26 female and 16 male) were randomized into two groups (competitive level vs. recreational level). They performed the test four times, with a recovery time of 45 minutes between sessions. Prior to testing, the participants rated their skating skill using a scale from 1 to 10. The protocol included performance time measurement through a course, combining different skating techniques. Trivial changes in performance time between the repeated sessions were determined in both competitive females/males and recreational females/males (-1.7% [95% CI: -5.8-2.6%] - 2.2% [95% CI: 0.0-4.5%]). In all four subgroups, the skill test had a low mean within-individual variation (1.6% [95% CI: 1.2-2.4%] - 2.7% [95% CI: 2.1-4.0%]) and high mean inter-session correlation (ICC = 0.97 [95% CI: 0.92-0.99] - 0.99 [95% CI: 0.98-1.00]). The comparison of detected typical errors and smallest worthwhile changes (calculated as standard deviations × 0.2) revealed that the skill test was able to track changes in skaters' performances. Competitive-level skaters needed shorter time (24.4-26.4%, all p skating skills in amateur competitive and recreational level skaters. Further studies are needed to evaluate the reproducibility of this skill test in different populations including elite inline skaters.

  15. Development and Validation of a Persian Version of Dichotic Emotional Word Test

    Directory of Open Access Journals (Sweden)

    Atefe Davudazde

    2016-03-01

    Full Text Available Introduction: Emotional words in comparison with neutral words have different hemispheric specialization. It is assumed that the right hemisphere has a role in processing every kind of emotional word. The objective of the present study was the development of a Persian version of the dichotic emotional word test and evaluate its validation among adult Persian speakers.   Materials and Methods: The present study was done on 60 adults, with the age ranging from 18-30 years for both genders, who had no history of neurological disorders with normal hearing. The developed test included eight main lists; each had several dichotic emotional/ neutral pairs of words. Participants were asked to recall as many words in each list as they could after they listened to them. A content validity index was used to analyze the validity of the test.   Results: The mean content validity index score was 0.94. The findings showed that in the left ear, emotional words were remembered more than neutral ones (P=0.007. While in the right ear, neutral words were remembered more (P=0.009. There were no significant differences in male and female scores.   Conclusion:  Dichotic emotional word test has a high content validity. The ability to remember emotional words better in the left ear supports the dominant role of the right hemisphere in emotional word perception.

  16. BENDER GESTALT VISUALMOTOR TEST AND CARAS TEST: A EXAM OF CONSTRUCT VALIDITY

    Directory of Open Access Journals (Sweden)

    Cesar Merino Soto

    2011-12-01

    Full Text Available Research with new versions of the Bender Gestalt Test (TGB has hardly attracted attention to the researchers of the Hispanic world, onsidering that this test is one of the most widely used psychological assessments. This study evaluates the construct validity of the modified version of TGB for children, elative to sustainedattention assessed by the Caras Test. Both tests were applied to 90 children, aged between 5 and 8, in standardized conditions. The esults indicate that the shared variance between the two measures is zero, even when applied disattenuated correlations for measurement error; also, no non-linear patterns were detected between the two variables. These correlations were consistent in the total sample and among subgroups of children. We discuss these results with respect to the limits of validity of this modified version of TGB in the Spanish language.

  17. Video game addiction test: validity and psychometric characteristics.

    Science.gov (United States)

    van Rooij, Antonius J; Schoenmakers, Tim M; van den Eijnden, Regina J J M; Vermulst, Ad A; van de Mheen, Dike

    2012-09-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study used data (n=2,894) from a large-sample paper-and-pencil questionnaire study, conducted in 2009 on secondary schools in Netherlands. Thus, the main source of data was a large sample of schoolchildren (aged 13-16 years). Measurements included the proposed VAT, the Compulsive Internet Use Scale, weekly hours spent on various game types, and several psychosocial variables. The VAT demonstrated excellent reliability, excellent construct validity, a one-factor model fit, and a high degree of measurement invariance across gender, ethnicity, and learning year, indicating that the scale outcomes can be compared across different subgroups with little bias. In summary, the VAT can be helpful in the further study of video game addiction, and it contributes to the debate on possible inclusion of behavioral addictions in the upcoming DSM-V.

  18. Development and validation of a Chinese music quality rating test.

    Science.gov (United States)

    Cai, Yuexin; Zhao, Fei; Zheng, Yiqing

    2013-09-01

    The present study aims to develop and validate a Chinese music quality rating test (MQRT). In Experiment 1, 22 music pieces were initially selected and paired as a 'familiar music piece' and 'unfamiliar music piece' based on familiarities amongst the general public in the categories of classical music (6), Chinese folk music (8), and pop music (8). Following the selection criteria, one pair of music pieces from each music category was selected and used for the MQRT in Experiment 2. In Experiment 2, the MQRT was validated using these music pieces in the categories 'Pleasantness', 'Naturalness', 'Fullness', 'Roughness', and 'Sharpness'. Seventy-two adult participants and 30 normal-hearing listeners were recruited in Experiments 1 and 2, respectively. Significant differences between the familiar and unfamiliar music pieces were found in respect of pleasantness rating for folk and pop music pieces as well as in sharpness rating for pop music pieces. The comparison of music category effect on MQRT found significant differences in pleasantness, fullness, and sharpness ratings. The Chinese MQRT developed in the present study is an effective tool for assessing music quality.

  19. [Cognition-correlation indices of gender schema: tests of validity].

    Science.gov (United States)

    Ishida, E

    1994-02-01

    Four-hundred and seventy-seven subjects evaluated a set of traits and behaviors in terms of how masculine and feminine they were and in terms of how well they represented their real and ideal self-images. Within-individual correlation coefficients between these evaluations were proposed as measures of psychological gender schemata, because they would represent the degree of matching between the subjects' gender-image and ideal/real self-images of gender-related attributes. The present study aims at examining the construct validity of these measures, by testing them to psychological variables that are known to reflect gender identity. The individual difference variables used as criteria were (a) satisfaction with one's own sex, (b) general happiness, (c) self-esteem (d) gender-conflict, and (e) school and occupational achievement need. Correlations between the gender-schema indices and the criteria variables supported the construct validity of those measures. Advantages of the present measurement over the conventional simple trait approach, such as BSRI, or PAQ are discussed.

  20. Testing ESL sociopragmatics development and validation of a web-based test battery

    CERN Document Server

    Roever, Carsten; Elder, Catherine

    2014-01-01

    Testing of second language pragmatics has grown as a research area but still suffers from a tension between construct coverage and practicality. In this book, the authors describe the development and validation of a web-based test of second language pragmatics for learners of English. The test has a sociopragmatic orientation and strives for a broad coverage of the construct by assessing learners'' metapragmatic judgments as well as their ability to co-construct discourse. To ensure practicality, the test is delivered online and is scored partially automatically and partially by human raters.

  1. Boys with autism spectrum disorders show superior performance on the adult Embedded Figures Test

    NARCIS (Netherlands)

    Schlooz, W.A.J.M.; Hulstijn, W.

    2014-01-01

    Weak central coherence is frequently studied using the Embedded Figures Test (EFT) yielding mixed and ambiguous results. In this study, the performance of 36 boys (9–14 years) with Autism Spectrum Disorders (ASD) is compared with that of 46 typical peers using both the children's and the adult

  2. Test of Flow Characteristics in Tubular Fuel Assembly I - Establishment of test loop and measurement validation test

    International Nuclear Information System (INIS)

    Park, Jong Hark; Chae, H. T.; Park, C.; Kim, H.

    2005-12-01

    Tubular type fuel has been developed as one of candidates for Advanced HANARO Reactor(AHR). It is necessary to test the flow characteristics such as velocity in each flow channels and pressure drop of tubular type fuel. A hydraulic test-loop to examine the hydraulic characteristics for a tubular type fuel has been designed and constructed. It consists of three parts; a) piping-loop including pump and motor, magnetic flow meter and valves etc, b) test-section part where a simulated tubular type fuel is located, and 3) data acquisition system to get reading signals from sensors or instruments. In this report, considerations during the design and installation of the facility and the selection of data acquisition sensors and instruments are described in detail. Before doing the experiment to measure the flow velocities in flow channels, a preliminary tests have been done for measuring the coolant velocities using pitot-tube and for validating the measurement accuracy as well. Local velocities of the radial direction in circular tubes are measured at regular intervals of 60 degrees by three pitot-tubes. Flow rate inside the circular flow channel can be obtained by integrating the velocity distribution in radial direction. The measured flow rate was compared to that of magnetic flow meter. According to the results, two values had a good agreement, which means that the measurement of coolant velocity by using pitot-tube and the flow rate measured by the magnetic flow meter are reliable. Uncertainty analysis showed that the error of velocity measurement by pitot-tube is less than ±2.21%. The hydraulic test-loop also can be adapted to others such as HANARO 18 and 36 fuel, in-pile system of FTL(Fuel Test Loop), etc

  3. Validation of a Short Form of an Indecision Test: The Vocational Assessment Test

    Science.gov (United States)

    Picard, France; Frenette, Éric; Guay, Frédéric; Labrosse, Julie

    2015-01-01

    The purpose of this research was to validate the scores of a short form of a new instrument, "l'Épreuve de décision vocationnelle, forme scolaire" (EDV-9S; vocational assessment test), which measures six indecision-related problems (lack of self-knowledge, lack of readiness, lack of method in decision making, lack of information,…

  4. Validation of a telephone screening test for Alzheimer's disease.

    Science.gov (United States)

    Camozzato, Ana Luiza; Kochhann, Renata; Godinho, Claudia; Costa, Amanda; Chaves, Marcia L

    2011-03-01

    Financial constraints, mobility issues, medical conditions, crime in local areas can make cognitive assessment difficult for elders and telephone interviews can be a good alternative. This study was carried out to evaluate the reliability, validity and clinical utility of a Brazilian telephone version of the Mini Mental State Examination (Braztel-MMSE) in a community sample of healthy elderly participants and AD patients. The MMSE and the Braztel-MMSE were applied to 66 AD patients and 67 healthy elderly participants. The test-retest reliability was strong and significant (r = .92, p = .01), and the correlation between the Braztel-MMSE and the MMSE were significant (p = .01) and strong (r = .92). The general screening ability of the Braztel-MMSE was high (AUC = 0.982; CI95% = 0.964-1.001). This telephone version can therefore be used as a screening measure for dementia in older adults that need neuropsychological screening and cannot present for an evaluation.

  5. Physical Stress Echocardiography: Prediction of Mortality and Cardiac Events in Patients with Exercise Test showing Ischemia

    Directory of Open Access Journals (Sweden)

    Ana Carla Pereira de Araujo

    2014-11-01

    Full Text Available Background: Studies have demonstrated the diagnostic accuracy and prognostic value of physical stress echocardiography in coronary artery disease. However, the prediction of mortality and major cardiac events in patients with exercise test positive for myocardial ischemia is limited. Objective: To evaluate the effectiveness of physical stress echocardiography in the prediction of mortality and major cardiac events in patients with exercise test positive for myocardial ischemia. Methods: This is a retrospective cohort in which 866 consecutive patients with exercise test positive for myocardial ischemia, and who underwent physical stress echocardiography were studied. Patients were divided into two groups: with physical stress echocardiography negative (G1 or positive (G2 for myocardial ischemia. The endpoints analyzed were all-cause mortality and major cardiac events, defined as cardiac death and non-fatal acute myocardial infarction. Results: G2 comprised 205 patients (23.7%. During the mean 85.6 ± 15.0-month follow-up, there were 26 deaths, of which six were cardiac deaths, and 25 non-fatal myocardial infarction cases. The independent predictors of mortality were: age, diabetes mellitus, and positive physical stress echocardiography (hazard ratio: 2.69; 95% confidence interval: 1.20 - 6.01; p = 0.016. The independent predictors of major cardiac events were: age, previous coronary artery disease, positive physical stress echocardiography (hazard ratio: 2.75; 95% confidence interval: 1.15 - 6.53; p = 0.022 and absence of a 10% increase in ejection fraction. All-cause mortality and the incidence of major cardiac events were significantly higher in G2 (p < 0. 001 and p = 0.001, respectively. Conclusion: Physical stress echocardiography provides additional prognostic information in patients with exercise test positive for myocardial ischemia.

  6. Vancomycin-resistant enterococci: validation of susceptibility testing and in vitro activity of novel antibiotics

    DEFF Research Database (Denmark)

    Rathe, Mathias; Lise, Kristensen,; Ellermann-Eriksen, Svend

    Vancomycin-resistant enterococci: validation of susceptibility testing and in vitro activity of novel antibiotics......Vancomycin-resistant enterococci: validation of susceptibility testing and in vitro activity of novel antibiotics...

  7. Testing the Predictive Validity of the Hendrich II Fall Risk Model.

    Science.gov (United States)

    Jung, Hyesil; Park, Hyeoun-Ae

    2018-03-01

    Cumulative data on patient fall risk have been compiled in electronic medical records systems, and it is possible to test the validity of fall-risk assessment tools using these data between the times of admission and occurrence of a fall. The Hendrich II Fall Risk Model scores assessed during three time points of hospital stays were extracted and used for testing the predictive validity: (a) upon admission, (b) when the maximum fall-risk score from admission to falling or discharge, and (c) immediately before falling or discharge. Predictive validity was examined using seven predictive indicators. In addition, logistic regression analysis was used to identify factors that significantly affect the occurrence of a fall. Among the different time points, the maximum fall-risk score assessed between admission and falling or discharge showed the best predictive performance. Confusion or disorientation and having a poor ability to rise from a sitting position were significant risk factors for a fall.

  8. A Mouse Model of Hyperproliferative Human Epithelium Validated by Keratin Profiling Shows an Aberrant Cytoskeletal Response to Injury

    Directory of Open Access Journals (Sweden)

    Samal Zhussupbekova

    2016-07-01

    Full Text Available A validated animal model would assist with research on the immunological consequences of the chronic expression of stress keratins KRT6, KRT16, and KRT17, as observed in human pre-malignant hyperproliferative epithelium. Here we examine keratin gene expression profile in skin from mice expressing the E7 oncoprotein of HPV16 (K14E7 demonstrating persistently hyperproliferative epithelium, in nontransgenic mouse skin, and in hyperproliferative actinic keratosis lesions from human skin. We demonstrate that K14E7 mouse skin overexpresses stress keratins in a similar manner to human actinic keratoses, that overexpression is a consequence of epithelial hyperproliferation induced by E7, and that overexpression further increases in response to injury. As stress keratins modify local immunity and epithelial cell function and differentiation, the K14E7 mouse model should permit study of how continued overexpression of stress keratins impacts on epithelial tumor development and on local innate and adaptive immunity.

  9. Testing Delays Resulting in Increased Identification Accuracy in Line-Ups and Show-Ups.

    Science.gov (United States)

    Dekle, Dawn J.

    1997-01-01

    Investigated time delays (immediate, two-three days, one week) between viewing a staged theft and attempting an eyewitness identification. Compared lineups to one-person showups in a laboratory analogue involving 412 subjects. Results show that across all time delays, participants maintained a higher identification accuracy with the showup…

  10. Standardization, Validity and Reliability Study of Gülhane Aphasia Test-2 (GAT-2

    Directory of Open Access Journals (Sweden)

    İlknur Maviş

    2007-04-01

    Full Text Available OBJECTIVE: Gülhane Aphasia Test-2 (GAT-2 has been developed to show the presence of a language disorder ‘aphasia’ and to give the clinician implications for the accompanying speech disorders such as apraxia and dysarthria. OBJECTIVE: The aim of the study was to report standardization, validity and reliability study of GAT-2. METHODS: : 10 healthy individuals were tested initially for the pilot study. 134 healthy individual was included to the standardization study and 30 individuals with aphasia and 11 individuals with right brain injury was included to the validation study. The inter group GAT-2 score differentiations and the effects of age, years of education, sex variances were observed. GAT-2 cut-off scores were calculated by the scores of healthy individuals. GAT-2 test-retest reliability and inter-observer reliability was calculated. RESULTS: Healthy individuals’ GAT-2 scores were significantly different from the GAT-2 scores of aphasic patients, but not from right brain injured patients’. Healthy individuals’ GAT-2 scores were not affected from the sex, age variances but from years of education, so cut-off scores were calculated by this variance. GAT-2 scores of aphasic patients were not affected from age, sex and years of education. Test-retest and inter-observer reliability and internal consistency results showed that GAT-2 is a highly reliable aphasia screening test. CONCLUSION: GAT-2 was found to be a standardized, highly reliable and a valid aphasia test for Turkish stroke patients with aphasia

  11. Software test and validation of wireless sensor nodes used in nuclear power plant

    International Nuclear Information System (INIS)

    Deng Changjian; Chen Dongyi; Zhang Heng

    2015-01-01

    The software test and validation of wireless sensor nodes is one of the key approaches to improve or guarantee the reliability of wireless network application in nuclear power plants (NPPs). At first, to validate the software test, some concepts are defined quantitatively, for example the robustness of software, the reliability of software, and the security of software. Then the development tools and simulators of discrete event drive operating system are compared, in order to present robustness, reliability and security of software test approach based on input-output function. Some simple preliminary test results are given to show that different development software can obtain almost same measurement and communication results although the software of special application may be different than normal application. (author)

  12. To Show or Not to Show: The Effects of Item Stems and Answer Options on Performance on a Multiple-Choice Listening Comprehension Test

    Science.gov (United States)

    Yanagawa, Kozo; Green, Anthony

    2008-01-01

    The purpose of this study is to examine whether the choice between three multiple-choice listening comprehension test formats results in any difference in listening comprehension test performance. The three formats entail (a) allowing test takers to preview both the question stem and answer options prior to listening; (b) allowing test takers to…

  13. Construct validity of the Big Five Implicit Association Test

    Directory of Open Access Journals (Sweden)

    Gaja Zager Kocjan

    2014-02-01

    Full Text Available Psychology has recently seen a noticeable increase in interest for implicit measures of attitudes and personality characteristics. The far most known implicit measure is the Implicit Association Test – IAT. We adapted this test in order to assess the Big Five personality dimensions (B5 IAT. We examined B5 IAT measurement characteristics on two samples. Based on the findings of the first sample (N = 62, improvements were made in the B5 IAT, to be tested again on another sample (N = 75. The two studies have shown similar results. The reliabilities of the personality dimensions measured with the B5 IAT failed to achieve a satisfactory level in most cases. The reason probably lies in a lower adequacy of certain stimuli and in the considerable length of the test procedure. The convergent validity of the B5 IAT with explicit measures of personality was low, which may be due to different structures underlying implicit and explicit measures. Results obtained on the first sample have shown that the correlations between IAT adjectives are adequately explained by five latent dimensions. However, these results should be interpreted with caution due to B5 IAT low reliabilities and small sample sizes. The second sample proved to be very unstable, thus the confirmatory factor analysis could not be conducted. Since this is the first attempt to adapt B5 IAT to Slovene language, it is hardly surprising that the results are not entirely consistent with the expectations. As implicit measures currently fail to meet relevant psychometric characteristics, they are not yet applicable in psychological practice. Nevertheless, they have great potential in exploring personality and individual differences, as they overcome many limitations of existing explicit measures.

  14. Testing the rationality assumption using a design difference in the TV game show 'Jeopardy'

    OpenAIRE

    Sjögren Lindquist, Gabriella; Säve-Söderbergh, Jenny

    2006-01-01

    Abstract This paper empirically investigates the rationality assumption commonly applied in economic modeling by exploiting a design difference in the game-show Jeopardy between the US and Sweden. In particular we address the assumption of individuals’ capabilities to process complex mathematical problems to find optimal strategies. The vital difference is that US contestants are given explicit information before they act, while Swedish contestants individually need to calculate the same info...

  15. Validating a bovine brucellosis elisa test for application in Uruguay

    International Nuclear Information System (INIS)

    Silva, M.; Muller, G.; Errico, F.; Dilave, M.

    1998-01-01

    Sera from 600 cattle on Rio Negro Island, known to be free of brucellosis, and 400 sera from vaccinated cattle but known to be negative in the Rose-Bengal test were selected for validation of the FAO/IAEA test kit for detection of antibody to Brucella abortus. Two conjugates, one a polyclonal antiserum and the other a monoclonal antibody, were evaluated. When evaluated for reproducibility using the sera from uninfected cattle, the average coefficient of variation for duplicate samples was 7.1%±5.5. The serum control samples did not exceed OD limits as established for the kit, for any of the 15 plates evaluated. When evaluated by regression analysis, the control sera had an average correlation coefficient of 0.996, indicating a high degree of agreement between the observed OD values of controls on each plate vs the expected values for those controls. Specificity in the assay was >98% as calculated by the PP or regression methods. Comparison of the monoclonal and polyclonal conjugates using sera from vaccinated cattle indicated that many of the cattle must have been vaccinated as adults because of high antibody levels detected by both conjugates. Before this assay can be used on vaccinated animals, the kit will have to be evaluated using sera from animals of known age of vaccination. (author)

  16. Validation of COMMIX with Westinghouse AP-600 PCCS test data

    International Nuclear Information System (INIS)

    Sun, J.G.; Chien, T.H.; Ding, J.; Sha, W.T.

    1993-01-01

    Small-scale test data for the Westinghouse AP-600 Passive Containment Cooling System (PCCS) have been used to validate the COMMIX computer code. To evaluate the performance of the PCCS, two transient liquid-film tracking models have been developed and implemented in the CO code. A set of heat transfer models and a mass transfer model based on heat and mass transfer analogy were used for the analysis of the AP-600 PCCS. It was found that the flow of the air stream in the annulus is a highly turbulent forced convection and that the flow of the air/steam mixture in the containment vessel is a mixed convection. Accordingly, a turbulent-forced-convection heat transfer model is used on the outside of the steel containment vessel wall and a mixed-convection heat transfer model is used on the inside of the steel containment vessel wall. The results from the CO calculations are compared with the experimental data from Westinghouse PCCS small-scale tests for average wall heat flux, evaporation rate, containment vessel pressure, and vessel wall temperature and heat flux distributions; agreement is good. The CO calculations also provide detailed distributions of velocity, temperature, and steam and air concentrations

  17. Vietnamese validation of the short version of Internet Addiction Test

    Directory of Open Access Journals (Sweden)

    Bach Xuan Tran

    2017-12-01

    Full Text Available Background and aims: The main goal of the present study was to examine the psychometric properties of a Vietnamese version of the short-version of Internet Addiction Test (s-IAT and to assess the relationship between s-IAT scores and demographics, health related qualify of life and perceived stress scores in young Vietnamese. Methods: The Vietnamese version of s-IAT was administered to a sample of 589 participants. Exploratory factor and reliability analyses were performed. Regression analysis was used to identify the associated factors. Results: The two-factor model of Vietnamese version of s-IAT demonstrated good psychometric properties. The internal consistency of Factor 1 (loss of control/time management was high (Cronbach's alpha=0.82 and Factor 2 (craving/social problems was satisfactory (Cronbach's alpha=0.75. Findings indicated that 20.9% youths were addicted to the Internet. Regression analysis revealed significant associations between Internet addiction and having problems in self-care, lower quality of life and high perceived stress scores. Discussion and conclusions: The Vietnamese version of s-IAT is a valid and reliable instrument to assess IA in Vietnamese population. Due to the high prevalence of IA among Vietnamese youths, IA should be paid attention in future intervention programs. s-IAT can be a useful screening tool for IA to promptly inform and treat the IA among Vietnamese youths. Keywords: Factor analysis, Short-version, Internet Addiction Test, Psychometric properties, Vietnamese

  18. Development and Validation of a Theoretical Test in Endosonography for Pulmonary Diseases

    DEFF Research Database (Denmark)

    Savran, Mona M; Clementsen, Paul Frost; Annema, Jouke T

    2014-01-01

    evidence for this test. METHODS: Initially, 78 questions were constructed after informal conversational interviews with 4 international experts in endosonography. The clarity and content validity of the questions were tested using a Delphi-like approach. Construct validity was explored by administering......BACKGROUND: Theoretical testing provides the necessary foundation to perform technical skills. Additionally, testing improves the retention of knowledge. OBJECTIVES: The aims of this study were to develop a multiple-choice test in endosonography for pulmonary diseases and to gather validity...... consistently than the novices (p = 0.037) and the intermediates (p Validity evidence was gathered, and the test demonstrated content and construct validity....

  19. Criterion and convergent validity of the Montreal cognitive assessment with screening and standardized neuropsychological testing.

    Science.gov (United States)

    Lam, Benjamin; Middleton, Laura E; Masellis, Mario; Stuss, Donald T; Harry, Robin D; Kiss, Alex; Black, Sandra E

    2013-12-01

    To compare the validity of the Montreal Cognitive Assessment (MoCA) with the criterion standard of standardized neuropsychological testing and to compare the convergent validity of the MoCA with that of existing screening tools and global measures of cognition. Cross-sectional observational study. Tertiary care hospital-based cognitive neurology subspecialty clinic. A convenience sample of 107 individuals with mild Alzheimer's disease (AD, n=75) or mild cognitive impairment (MCI, n=32) from the Sunnybrook Dementia Study. In addition to the MoCA, all participants completed the Mini-Mental State Examination (MMSE), the Mattis Dementia Rating Scale (DRS), and detailed neuropsychological testing. Convergent validity was supported, with MoCA scores correlating well with the MMSE (correlation coefficient (r)=0.66, Pvalidity was supported, with MoCA subscores according to cognitive domain correlating well with analogous neuropsychological tests and, in the case of memory (area under the receiver operating characteristic curve (AUC)=0.86), executive (AUC=0.79), and visuospatial function (AUC=0.79), being reasonably sensitive to impairment in those domains. The MoCA is a valid assessment of cognition that shows good agreement with existing screening tools and global measures (convergent validity) and was superior to the MMSE in this regard. The MoCA domain-specific subscores align with performance on more-detailed neuropsychological tests, suggesting not only good criterion validity for the MoCA, but also that it may be useful in guiding further neuropsychological testing. © 2013, Copyright the Authors Journal compilation © 2013, The American Geriatrics Society.

  20. Test anxiety and the validity of cognitive tests: A confirmatory factor analysis perspective and some empirical findings

    NARCIS (Netherlands)

    Wicherts, J.M.; Zand Scholten, A.

    2010-01-01

    The validity of cognitive ability tests is often interpreted solely as a function of the cognitive abilities that these tests are supposed to measure, but other factors may be at play. The effects of test anxiety on the criterion related validity (CRV) of tests was the topic of a recent study by

  1. Vertical jumping tests in volleyball: reliability, validity, and playing-position specifics.

    Science.gov (United States)

    Sattler, Tine; Sekulic, Damir; Hadzic, Vedran; Uljevic, Ognjen; Dervisevic, Edvin

    2012-06-01

    Vertical jumping is known to be important in volleyball, and jumping performance tests are frequently studied for their reliability and validity. However, most studies concerning jumping in volleyball have dealt with standard rather than sport-specific jumping procedures and tests. The aims of this study, therefore, were (a) to determine the reliability and factorial validity of 2 volleyball-specific jumping tests, the block jump (BJ) test and the attack jump (AJ) test, relative to 2 frequently used and systematically validated jumping tests, the countermovement jump test and the squat jump test and (b) to establish volleyball position-specific differences in the jumping tests and simple anthropometric indices (body height [BH], body weight, and body mass index [BMI]). The BJ was performed from a defensive volleyball position, with the hands positioned in front of the chest. During an AJ, the players used a 2- to 3-step approach and performed a drop jump with an arm swing followed by a quick vertical jump. A total of 95 high-level volleyball players (all men) participated in this study. The reliability of the jumping tests ranged from 0.97 to 0.99 for Cronbach's alpha coefficients, from 0.93 to 0.97 for interitem correlation coefficients and from 2.1 to 2.8 for coefficients of variation. The highest reliability was found for the specific jumping tests. The factor analysis extracted one significant component, and all of the tests were highly intercorrelated. The analysis of variance with post hoc analysis showed significant differences between 5 playing positions in some of the jumping tests. In general, receivers had a greater jumping capacity, followed by libero players. The differences in jumping capacities should be emphasized vis-a-vis differences in the anthropometric measures of players, where middle hitters had higher BH and body weight, followed by opposite hitters and receivers, with no differences in the BMI between positions.

  2. Thyroid-specific questions on work ability showed known-groups validity among Danes with thyroid diseases

    DEFF Research Database (Denmark)

    Nexo, Mette A.; Watt, Torquil; Bonnema, Steen Joop

    2014-01-01

    and interviews with thyroid patients, 24 work ability items were selected from previous questionnaires, revised, or developed anew. Items were tested among 632 patients with thyroid disease (non-toxic goiter, toxic nodular goiter, Graves' disease (with or without orbitopathy), autoimmune hypothyroidism...

  3. Testing an emerging paradigm in migration ecology shows surprising differences in efficiency between flight modes.

    Directory of Open Access Journals (Sweden)

    Adam E Duerr

    Full Text Available To maximize fitness, flying animals should maximize flight speed while minimizing energetic expenditure. Soaring speeds of large-bodied birds are determined by flight routes and tradeoffs between minimizing time and energetic costs. Large raptors migrating in eastern North America predominantly glide between thermals that provide lift or soar along slopes or ridgelines using orographic lift (slope soaring. It is usually assumed that slope soaring is faster than thermal gliding because forward progress is constant compared to interrupted progress when birds pause to regain altitude in thermals. We tested this slope-soaring hypothesis using high-frequency GPS-GSM telemetry devices to track golden eagles during northbound migration. In contrast to expectations, flight speed was slower when slope soaring and eagles also were diverted from their migratory path, incurring possible energetic costs and reducing speed of progress towards a migratory endpoint. When gliding between thermals, eagles stayed on track and fast gliding speeds compensated for lack of progress during thermal soaring. When thermals were not available, eagles minimized migration time, not energy, by choosing energetically expensive slope soaring instead of waiting for thermals to develop. Sites suited to slope soaring include ridges preferred for wind-energy generation, thus avian risk of collision with wind turbines is associated with evolutionary trade-offs required to maximize fitness of time-minimizing migratory raptors.

  4. Testing an emerging paradigm in migration ecology shows surprising differences in efficiency between flight modes.

    Science.gov (United States)

    Duerr, Adam E; Miller, Tricia A; Lanzone, Michael; Brandes, Dave; Cooper, Jeff; O'Malley, Kieran; Maisonneuve, Charles; Tremblay, Junior; Katzner, Todd

    2012-01-01

    To maximize fitness, flying animals should maximize flight speed while minimizing energetic expenditure. Soaring speeds of large-bodied birds are determined by flight routes and tradeoffs between minimizing time and energetic costs. Large raptors migrating in eastern North America predominantly glide between thermals that provide lift or soar along slopes or ridgelines using orographic lift (slope soaring). It is usually assumed that slope soaring is faster than thermal gliding because forward progress is constant compared to interrupted progress when birds pause to regain altitude in thermals. We tested this slope-soaring hypothesis using high-frequency GPS-GSM telemetry devices to track golden eagles during northbound migration. In contrast to expectations, flight speed was slower when slope soaring and eagles also were diverted from their migratory path, incurring possible energetic costs and reducing speed of progress towards a migratory endpoint. When gliding between thermals, eagles stayed on track and fast gliding speeds compensated for lack of progress during thermal soaring. When thermals were not available, eagles minimized migration time, not energy, by choosing energetically expensive slope soaring instead of waiting for thermals to develop. Sites suited to slope soaring include ridges preferred for wind-energy generation, thus avian risk of collision with wind turbines is associated with evolutionary trade-offs required to maximize fitness of time-minimizing migratory raptors.

  5. The development and validation of a test of science critical thinking for fifth graders.

    Science.gov (United States)

    Mapeala, Ruslan; Siew, Nyet Moi

    2015-01-01

    The paper described the development and validation of the Test of Science Critical Thinking (TSCT) to measure the three critical thinking skill constructs: comparing and contrasting, sequencing, and identifying cause and effect. The initial TSCT consisted of 55 multiple choice test items, each of which required participants to select a correct response and a correct choice of critical thinking used for their response. Data were obtained from a purposive sampling of 30 fifth graders in a pilot study carried out in a primary school in Sabah, Malaysia. Students underwent the sessions of teaching and learning activities for 9 weeks using the Thinking Maps-aided Problem-Based Learning Module before they answered the TSCT test. Analyses were conducted to check on difficulty index (p) and discrimination index (d), internal consistency reliability, content validity, and face validity. Analysis of the test-retest reliability data was conducted separately for a group of fifth graders with similar ability. Findings of the pilot study showed that out of initial 55 administered items, only 30 items with relatively good difficulty index (p) ranged from 0.40 to 0.60 and with good discrimination index (d) ranged within 0.20-1.00 were selected. The Kuder-Richardson reliability value was found to be appropriate and relatively high with 0.70, 0.73 and 0.92 for identifying cause and effect, sequencing, and comparing and contrasting respectively. The content validity index obtained from three expert judgments equalled or exceeded 0.95. In addition, test-retest reliability showed good, statistically significant correlations ([Formula: see text]). From the above results, the selected 30-item TSCT was found to have sufficient reliability and validity and would therefore represent a useful tool for measuring critical thinking ability among fifth graders in primary science.

  6. POLYGON - A New Fundamental Movement Skills Test for 8 Year Old Children: Construction and Validation.

    Science.gov (United States)

    Zuvela, Frane; Bozanic, Ana; Miletic, Durdica

    2011-01-01

    Inadequately adopted fundamental movement skills (FMS) in early childhood may have a negative impact on the motor performance in later life (Gallahue and Ozmun, 2005). The need for an efficient FMS testing in Physical Education was recognized. The aim of this paper was to construct and validate a new FMS test for 8 year old children. Ninety-five 8 year old children were used for the testing. A total of 24 new FMS tasks were constructed and only the best representatives of movement areas entered into the final test product - FMS-POLYGON. The ICC showed high values for all 24 tasks (0.83-0.97) and the factorial analysis revealed the best representatives of each movement area that entered the FMS-POLYGON: tossing and catching the volleyball against a wall, running across obstacles, carrying the medicine balls, and straight running. The ICC for the FMS-POLYGON showed a very high result (0.98) and, therefore, confirmed the test's intra-rater reliability. Concurrent validity was tested with the use of the "Test of Gross Motor Development" (TGMD-2). Correlation analysis between the newly constructed FMS-POLYGON and the TGMD-2 revealed the coefficient of -0.82 which indicates a high correlation. In conclusion, the new test for FMS assessment proved to be a reliable and valid instrument for 8 year old children. Application of this test in schools is justified and could play an important factor in physical education and sport practice. Key pointsAll 21 newly constructed tasks demonstrated high intra-rater reliability (0.83-0.97) in FMS assessment. High reliability was also noted in the FMS-POLYGON test (0.98).A high correlation was found between the FMS-POLYGON and TGMD-2 which is a confirmation of the new test's concurrent validity.The research resolved the problem of long and detailed FMS assessment by adding a new dimension using quick and effective norm-referenced approach but also covering all the most important movement areas.New and validated test can be of great use

  7. SOD1 aggregation in ALS mice shows simplistic test tube behavior.

    Science.gov (United States)

    Lang, Lisa; Zetterström, Per; Brännström, Thomas; Marklund, Stefan L; Danielsson, Jens; Oliveberg, Mikael

    2015-08-11

    A longstanding challenge in studies of neurodegenerative disease has been that the pathologic protein aggregates in live tissue are not amenable to structural and kinetic analysis by conventional methods. The situation is put in focus by the current progress in demarcating protein aggregation in vitro, exposing new mechanistic details that are now calling for quantitative in vivo comparison. In this study, we bridge this gap by presenting a direct comparison of the aggregation kinetics of the ALS-associated protein superoxide dismutase 1 (SOD1) in vitro and in transgenic mice. The results based on tissue sampling by quantitative antibody assays show that the SOD1 fibrillation kinetics in vitro mirror with remarkable accuracy the spinal cord aggregate buildup and disease progression in transgenic mice. This similarity between in vitro and in vivo data suggests that, despite the complexity of live tissue, SOD1 aggregation follows robust and simplistic rules, providing new mechanistic insights into the ALS pathology and organism-level manifestation of protein aggregation phenomena in general.

  8. Ecological validity of the five digit test and the oral trails test.

    Science.gov (United States)

    Paiva, Gabrielle Chequer de Castro; Fialho, Mariana Braga; Costa, Danielle de Souza; Paula, Jonas Jardim de

    2016-01-01

    Tests evaluating the attentional-executive system are widely used in clinical practice. However, proximity of an objective cognitive test with real-world situations (ecological validity) is not frequently investigated. The present study evaluate the association between measures of the Five Digit Test (FDT) and the Oral Trails Test (OTT) with self-reported cognitive failures in everyday life as measured by the Cognitive Failures Questionnaire (CFQ). Brazilian adults from 18-to-65 years old voluntarily performed the FDT and OTT tests and reported the frequency of cognitive failures in their everyday life through the CFQ. After controlling for the age effect, the measures of controlled attentional processes were associated with cognitive failures, yet the cognitive flexibility of both FDT and OTT accounted for by the majority of variance in most aspects of the CFQ factors. The FDT and the OTT measures were predictive of real-world problems such as cognitive failures in everyday activities/situations.

  9. Urine specimen validity test for drug abuse testing in workplace and court settings.

    Science.gov (United States)

    Lin, Shin-Yu; Lee, Hei-Hwa; Lee, Jong-Feng; Chen, Bai-Hsiun

    2018-01-01

    In recent decades, urine drug testing in the workplace has become common in many countries in the world. There have been several studies concerning the use of the urine specimen validity test (SVT) for drug abuse testing administered in the workplace. However, very little data exists concerning the urine SVT on drug abuse tests from court specimens, including dilute, substituted, adulterated, and invalid tests. We investigated 21,696 submitted urine drug test samples for SVT from workplace and court settings in southern Taiwan over 5 years. All immunoassay screen-positive urine specimen drug tests were confirmed by gas chromatography/mass spectrometry. We found that the mean 5-year prevalence of tampering (dilute, substituted, or invalid tests) in urine specimens from the workplace and court settings were 1.09% and 3.81%, respectively. The mean 5-year percentage of dilute, substituted, and invalid urine specimens from the workplace were 89.2%, 6.8%, and 4.1%, respectively. The mean 5-year percentage of dilute, substituted, and invalid urine specimens from the court were 94.8%, 1.4%, and 3.8%, respectively. No adulterated cases were found among the workplace or court samples. The most common drug identified from the workplace specimens was amphetamine, followed by opiates. The most common drug identified from the court specimens was ketamine, followed by amphetamine. We suggest that all urine specimens taken for drug testing from both the workplace and court settings need to be tested for validity. Copyright © 2017. Published by Elsevier B.V.

  10. SMILE: numerical evaluation of the WPS validation test

    International Nuclear Information System (INIS)

    Moinereau, D.; Studer, V.; Dahl, A.; Wadier, Y.

    2004-01-01

    The reactor pressure vessel (RPV) is an essential component liable to limit the lifetime duration of nuclear PWR power plants. The structural integrity assessment of RPV subjected to pressurized thermal shock (PTA) transients made at an European level does not take always into account the potential beneficial effect of the load history (warm pre-stress WPS). A three-year European Research and Development program (SMILE) started in January 2002 as part of the Fifth Framework Program of the European Atomic Energy Community (EURATOM) to evaluate this effect. The SMILE project is one of a ''cluster'' of Fifth Framework Projects in the area of Plant Life Management. It aims to give sufficient elements to model and to validate the beneficial WPS effect in a RPV structural integrity assessment. Finally, this project aims to harmonize the different approaches to lay the basis for European codes and standards regarding the inclusion of the warm pre-stress (WPS) effect in the RPV assessments. Within the framework of this project, an important experimental work has been conducted including WPS type tests on CT specimens and also a PTS type transient experiment on a large cracked cylinder. The present paper describes shortly the PTS type experiment and presents the corresponding analyses based on engineering methods, finite element elastic and elastic-plastic computations, and local approach to fracture. The results are in good agreement with the experimental result. Significant margins are underlined, with an effective significant increase of the material resistance regarding the risk of brittle failure. (orig.)

  11. Portuguese validation of the Internet Addiction Test: An empirical study.

    Science.gov (United States)

    Pontes, Halley M; Patrão, Ivone M; Griffiths, Mark D

    2014-06-01

    Research into Internet addiction (IA) has increased greatly over the last decade. Despite its various definitions and general lack of consensus regarding its conceptualisation amongst researchers, instruments for measuring this phenomenon have proliferated in a number of countries. There has been little research on IA in Portugal and this may be partly due to the absence of standardised measurement tools for assessing IA. This study attempted to address this issue by adapting a Portuguese version of the Internet Addiction Test (IAT) via a translation-back translation process and Confirmatory Factor Analysis in a sample of 593 Portuguese students that completed a Portuguese version of the IAT along with questions related to socio-demographic variables. The findings suggested that the IAT appears to be a valid and reliable instrument for measuring IA among Portuguese young adults as demonstrated by its satisfactory psychometric properties. However, the present findings also suggest the need to reword and update some of the IAT's items. Prevalence of IA found in the sample was 1.2% and is discussed alongside findings relating to socio-demographic correlates. Limitations and implications of the present study are also discussed. The present study calls for a reflection of the IAT while also contributing to a better understanding of the basic aspects of IA in the Portuguese community since many health practitioners are starting to realise that Internet use may pose a risk for some individuals.

  12. Validation and testing of the VAM2D computer code

    International Nuclear Information System (INIS)

    Kool, J.B.; Wu, Y.S.

    1991-10-01

    This document describes two modeling studies conducted by HydroGeoLogic, Inc. for the US NRC under contract no. NRC-04089-090, entitled, ''Validation and Testing of the VAM2D Computer Code.'' VAM2D is a two-dimensional, variably saturated flow and transport code, with applications for performance assessment of nuclear waste disposal. The computer code itself is documented in a separate NUREG document (NUREG/CR-5352, 1989). The studies presented in this report involve application of the VAM2D code to two diverse subsurface modeling problems. The first one involves modeling of infiltration and redistribution of water and solutes in an initially dry, heterogeneous field soil. This application involves detailed modeling over a relatively short, 9-month time period. The second problem pertains to the application of VAM2D to the modeling of a waste disposal facility in a fractured clay, over much larger space and time scales and with particular emphasis on the applicability and reliability of using equivalent porous medium approach for simulating flow and transport in fractured geologic media. Reflecting the separate and distinct nature of the two problems studied, this report is organized in two separate parts. 61 refs., 31 figs., 9 tabs

  13. Vietnamese validation of the short version of Internet Addiction Test.

    Science.gov (United States)

    Tran, Bach Xuan; Mai, Hue Thi; Nguyen, Long Hoang; Nguyen, Cuong Tat; Latkin, Carl A; Zhang, Melvyn W B; Ho, Roger C M

    2017-12-01

    The main goal of the present study was to examine the psychometric properties of a Vietnamese version of the short-version of Internet Addiction Test (s-IAT) and to assess the relationship between s-IAT scores and demographics, health related qualify of life and perceived stress scores in young Vietnamese. The Vietnamese version of s-IAT was administered to a sample of 589 participants. Exploratory factor and reliability analyses were performed. Regression analysis was used to identify the associated factors. The two-factor model of Vietnamese version of s-IAT demonstrated good psychometric properties. The internal consistency of Factor 1 (loss of control/time management) was high (Cronbach's alpha = 0.82) and Factor 2 (craving/social problems) was satisfactory (Cronbach's alpha = 0.75). Findings indicated that 20.9% youths were addicted to the Internet. Regression analysis revealed significant associations between Internet addiction and having problems in self-care, lower quality of life and high perceived stress scores. The Vietnamese version of s-IAT is a valid and reliable instrument to assess IA in Vietnamese population. Due to the high prevalence of IA among Vietnamese youths, IA should be paid attention in future intervention programs. s-IAT can be a useful screening tool for IA to promptly inform and treat the IA among Vietnamese youths.

  14. Validation testing of radioactive waste drum filter vents

    Energy Technology Data Exchange (ETDEWEB)

    Weber, L.D. [Pall Corp., Port Washington, NY (United States); Rahimi, R.S. [Pall Corp., Cortland, NY (United States); Edling, D. [Edling & Associates, Inc., Russel Springs, KY (United States)

    1997-08-01

    The minimum requirements for Drum Filter Vents (DFVs) can be met by demonstrating conformance with the Waste Isolation Pilot Plant (WIPP) Trupact II Safety Assessment Report (SAR), and conformance with U.S. Federal shipping regulations 49 CFR 178.350, DOT Spec 7A, for Type A packages. These together address a number of safety related performance parameters such as hydrogen diffusivity, flow related pressure drop, filtration efficiency and, separately, mechanical stability and the ability to prevent liquid water in-leakage. In order to make all metal DFV technology (including metallic filter medium) available to DOE sites, Pall launched a product development program to validate an all metal design to meet these requirements. Numerous problems experienced by DOE sites in the past came to light during this development program. They led us to explore enhancements to DFV design and performance testing addressing these difficulties and concerns. The result is a patented all metal DFV certified to all applicable regulatory requirements, which for the first time solves operational and health safety problems reported by DOE site personnel but not addressed by previous DFV`s. The new technology facilitates operations (such as manual, automated and semi-automated drum handling/redrumming), sampling, on-site storage, and shipping. At the same time, it upgrades filtration efficiency in configurations documented to maintain filter efficiency following mechanical stress. 2 refs., 2 figs., 10 tabs.

  15. Five-Kilometers Time Trial: Preliminary Validation of a Short Test for Cycling Performance Evaluation.

    Science.gov (United States)

    Dantas, Jose Luiz; Pereira, Gleber; Nakamura, Fabio Yuzo

    2015-09-01

    The five-kilometer time trial (TT5km) has been used to assess aerobic endurance performance without further investigation of its validity. This study aimed to perform a preliminary validation of the TT5km to rank well-trained cyclists based on aerobic endurance fitness and assess changes of the aerobic endurance performance. After the incremental test, 20 cyclists (age = 31.3 ± 7.9 years; body mass index = 22.7 ± 1.5 kg/m(2); maximal aerobic power = 360.5 ± 49.5 W) performed the TT5km twice, collecting performance (time to complete, absolute and relative power output, average speed) and physiological responses (heart rate and electromyography activity). The validation criteria were pacing strategy, absolute and relative reliability, validity, and sensitivity. Sensitivity index was obtained from the ratio between the smallest worthwhile change and typical error. The TT5km showed high absolute (coefficient of variation 0.95) reliability of performance variables, whereas it presented low reliability of physiological responses. The TT5km performance variables were highly correlated with the aerobic endurance indices obtained from incremental test (r > 0.70). These variables showed adequate sensitivity index (> 1). TT5km is a valid test to rank the aerobic endurance fitness of well-trained cyclists and to differentiate changes on aerobic endurance performance. Coaches can detect performance changes through either absolute (± 17.7 W) or relative power output (± 0.3 W.kg(-1)), the time to complete the test (± 13.4 s) and the average speed (± 1.0 km.h(-1)). Furthermore, TT5km performance can also be used to rank the athletes according to their aerobic endurance fitness.

  16. Testing the Predictive Validity of the IELTS Test on Omani English Candidates’ Professional Competencies

    Directory of Open Access Journals (Sweden)

    Moza Abdullah Said Al-Malki

    2014-09-01

    Full Text Available This study has investigated the relationship between IELTS testing and Omani English teacher trainees’ professional competencies by adopting a quantitative method for data collection. A total number of 94 graduate freshmen Omani English teachers’ IELTS, CGPA and their teaching professional competencies are collected. The results reveal a moderate significant relationship between IELTS and CGPA but a weak relationship between IELTS and teaching competencies.  This study could contribute to the growing body of literature that aims to assess the construct validity of IELTS, and attempts to do so in the new terrain of teaching competencies. This study puts forwards recommendations for IELTS proficiency test in the Omani context.

  17. Testing the Predictive Validity and Construct of Pathological Video Game Use

    Science.gov (United States)

    Groves, Christopher L.; Gentile, Douglas; Tapscott, Ryan L.; Lynch, Paul J.

    2015-01-01

    Three studies assessed the construct of pathological video game use and tested its predictive validity. Replicating previous research, Study 1 produced evidence of convergent validity in 8th and 9th graders (N = 607) classified as pathological gamers. Study 2 replicated and extended the findings of Study 1 with college undergraduates (N = 504). Predictive validity was established in Study 3 by measuring cue reactivity to video games in college undergraduates (N = 254), such that pathological gamers were more emotionally reactive to and provided higher subjective appraisals of video games than non-pathological gamers and non-gamers. The three studies converged to show that pathological video game use seems similar to other addictions in its patterns of correlations with other constructs. Conceptual and definitional aspects of Internet Gaming Disorder are discussed. PMID:26694472

  18. Testing the Predictive Validity and Construct of Pathological Video Game Use

    Directory of Open Access Journals (Sweden)

    Christopher L. Groves

    2015-12-01

    Full Text Available Three studies assessed the construct of pathological video game use and tested its predictive validity. Replicating previous research, Study 1 produced evidence of convergent validity in 8th and 9th graders (N = 607 classified as pathological gamers. Study 2 replicated and extended the findings of Study 1 with college undergraduates (N = 504. Predictive validity was established in Study 3 by measuring cue reactivity to video games in college undergraduates (N = 254, such that pathological gamers were more emotionally reactive to and provided higher subjective appraisals of video games than non-pathological gamers and non-gamers. The three studies converged to show that pathological video game use seems similar to other addictions in its patterns of correlations with other constructs. Conceptual and definitional aspects of Internet Gaming Disorder are discussed.

  19. Development and content validity testing of a comprehensive classification of diagnoses for pediatric nurse practitioners.

    Science.gov (United States)

    Burns, C

    1991-01-01

    Pediatric nurse practitioners (PNPs) need an integrated, comprehensive classification that includes nursing, disease, and developmental diagnoses to effectively describe their practice. No such classification exists. Further, methodologic studies to help evaluate the content validity of any nursing taxonomy are unavailable. A conceptual framework was derived. Then 178 diagnoses from the North American Nursing Diagnosis Association (NANDA) 1986 list, selected diagnoses from the International Classification of Diseases, the Diagnostic and Statistical Manual, Third Revision, and others were selected. This framework identified and listed, with definitions, three domains of diagnoses: Developmental Problems, Diseases, and Daily Living Problems. The diagnoses were ranked using a 4-point scale (4 = highly related to 1 = not related) and were placed into the three domains. The rating scale was assigned by a panel of eight expert pediatric nurses. Diagnoses that were assigned to the Daily Living Problems domain were then sorted into the 11 Functional Health patterns described by Gordon (1987). Reliability was measured using proportions of agreement and Kappas. Content validity of the groups created was measured using indices of content validity and average congruency percentages. The experts used a new method to sort the diagnoses in a new way that decreased overlaps among the domains. The Developmental and Disease domains were judged reliable and valid. The Daily Living domain of nursing diagnoses showed marginally acceptable validity with acceptable reliability. Six Functional Health Patterns were judged reliable and valid, mixed results were determined for four categories, and the Coping/Stress Tolerance category was judged reliable but not valid using either test. There were considerable differences between the panel's, Gordon's (1987), and NANDA's clustering of NANDA diagnoses. This study defines the diagnostic practice of nurses from a holistic, patient

  20. Validation of the Stroke Specific Quality of Life Scale (SS-QOL): test of reliability and validity of the Danish version (SS-QOL-DK).

    Science.gov (United States)

    Muus, Ingrid; Williams, Linda S; Ringsberg, Karin C

    2007-07-01

    To test the reliability and validity of the Danish version of the Stroke Specific Quality of Life Scale version 2.0 (SS-QOL-DK), an instrument for evaluation of health-related quality of life. A correlational study. A stroke unit that provides acute care and rehabilitation for stroke patients in Frederiksborg County, Denmark. One hundred and fifty-two stroke survivors participated; 24 of these performed test-retest. Questionnaires were sent out and returned by mail. A subsequent telephone interview assessed functional level and missing items. Test-retest was measured using Spearman's r, internal consistency was estimated using Cronbach's alpha, and evaluation of floor and ceiling values in proportion of minimum and maximum scores. Construct validity was assessed by comparing patients' scores on the SS-QOL-DK with those obtained by other test methods: Beck's Depression Index, the General Health Survey Short Form 36 (SF-36), the Barthel Index and the National Institutes of Health Stroke Scale, evaluating shared variance using coefficient of determination, r2. Comparing groups with known scores assessed known-group validity. Convergent and discriminant validity were assessed. Test-retest of SS-QOL-DK showed excellent stability, Spearman's r = 0.65-0.99. Internal consistency for all domains showed Cronbach's alpha = 0.81-0.94. Missing items rate was 1.0%. Most SS-QOL-DK domains showed moderately shared variance with similar domains of other test methods, r2 = 0.03-0.62. Groups with known differences showed statistically significant difference in scores. Item-to-scale correlation coefficients of 0.37-0.88 supported convergent validity. SS-QOL-DK is a reliable and valid instrument for measuring self-reported health-related quality of life on group level among people with mild to moderate stroke.

  1. Validity and test-retest reliability of a novel simple back extensor muscle strength test.

    Science.gov (United States)

    Harding, Amy T; Weeks, Benjamin Kurt; Horan, Sean A; Little, Andrew; Watson, Steven L; Beck, Belinda Ruth

    2017-01-01

    To develop and determine convergent validity and reliability of a simple and inexpensive clinical test to quantify back extensor muscle strength. Two testing sessions were conducted, 7 days apart. Each session involved three trials of standing maximal isometric back extensor muscle strength using both the novel test and isokinetic dynamometry. Lumbar spine bone mineral density was examined by dual-energy X-ray absorptiometry. Validation was examined with Pearson correlations ( r ). Test-retest reliability was examined with intraclass correlation coefficients and limits of agreement. Pearson correlations and intraclass correlation coefficients are presented with corresponding 95% confidence intervals. Linear regression was used to examine the ability of peak back extensor muscle strength to predict indices of lumbar spine bone mineral density and strength. A total of 52 healthy adults (26 men, 26 women) aged 46.4 ± 20.4 years were recruited from the community. A strong positive relationship was observed between peak back extensor strength from hand-held and isokinetic dynamometry ( r  = 0.824, p  strength test, short- and long-term reliability was excellent (intraclass correlation coefficient = 0.983 (95% confidence interval, 0.971-0.990), p  strength measures with the novel back extensor strength protocol were -6.63 to 7.70 kg, with a mean bias of +0.71 kg. Back extensor strength predicted 11% of variance in lumbar spine bone mineral density ( p  strength ( p  strength is quick, relatively inexpensive, and reliable; demonstrates initial convergent validity in a healthy population; and is associated with bone mass at a clinically important site.

  2. [Validation of three screening tests used for early detection of cervical cancer].

    Science.gov (United States)

    Rodriguez-Reyes, Esperanza Rosalba; Cerda-Flores, Ricardo M; Quiñones-Pérez, Juan M; Cortés-Gutiérrez, Elva I

    2008-01-01

    to evaluate the validity (sensitivity, specificity, and accuracy) of three screening methods used in the early detection of the cervical carcinoma versus the histopathology diagnosis. a selected sample of 107 women attended in the Opportune Detection of Cervicouterine Cancer Program in the Hospital de Zona 46, Instituto Mexicano del Seguro Social in Durango, during the 2003 was included. The application of Papa-nicolaou, acetic acid test, and molecular detection of human papillomavirus, and histopatholgy diagnosis were performed in all the patients at the time of the gynecological exam. The detection and tipification of the human papillomavirus was performed by polymerase chain reaction (PCR) and analysis of polymorphisms of length of restriction fragments (RFLP). Histopathology diagnosis was considered the gold standard. The evaluation of the validity was carried out by the Bayesian method for diagnosis test. the positive cases for acetic acid test, Papanicolaou, and PCR were 47, 22, and 19. The accuracy values were 0.70, 0.80 and 0.99, respectively. since the molecular method showed a greater validity in the early detection of the cervical carcinoma we considered of vital importance its implementation in suitable programs of Opportune Detection of Cervicouterino Cancer Program in Mexico. However, in order to validate this conclusion, cross-sectional studies in different region of country must be carried out.

  3. Fundamentals of endoscopic surgery: creation and validation of the hands-on test.

    Science.gov (United States)

    Vassiliou, Melina C; Dunkin, Brian J; Fried, Gerald M; Mellinger, John D; Trus, Thadeus; Kaneva, Pepa; Lyons, Calvin; Korndorffer, James R; Ujiki, Michael; Velanovich, Vic; Kochman, Michael L; Tsuda, Shawn; Martinez, Jose; Scott, Daniel J; Korus, Gary; Park, Adrian; Marks, Jeffrey M

    2014-03-01

    The Fundamentals of Endoscopic Surgery™ (FES) program consists of online materials and didactic and skills-based tests. All components were designed to measure the skills and knowledge required to perform safe flexible endoscopy. The purpose of this multicenter study was to evaluate the reliability and validity of the hands-on component of the FES examination, and to establish the pass score. Expert endoscopists identified the critical skill set required for flexible endoscopy. They were then modeled in a virtual reality simulator (GI Mentor™ II, Simbionix™ Ltd., Airport City, Israel) to create five tasks and metrics. Scores were designed to measure both speed and precision. Validity evidence was assessed by correlating performance with self-reported endoscopic experience (surgeons and gastroenterologists [GIs]). Internal consistency of each test task was assessed using Cronbach's alpha. Test-retest reliability was determined by having the same participant perform the test a second time and comparing their scores. Passing scores were determined by a contrasting groups methodology and use of receiver operating characteristic curves. A total of 160 participants (17 % GIs) performed the simulator test. Scores on the five tasks showed good internal consistency reliability and all had significant correlations with endoscopic experience. Total FES scores correlated 0.73, with participants' level of endoscopic experience providing evidence of their validity, and their internal consistency reliability (Cronbach's alpha) was 0.82. Test-retest reliability was assessed in 11 participants, and the intraclass correlation was 0.85. The passing score was determined and is estimated to have a sensitivity (true positive rate) of 0.81 and a 1-specificity (false positive rate) of 0.21. The FES hands-on skills test examines the basic procedural components required to perform safe flexible endoscopy. It meets rigorous standards of reliability and validity required for high

  4. Reasoning with Inductive Argument Test: A Study of Validity and Reliability

    Directory of Open Access Journals (Sweden)

    Mehmet Emrah Karadere

    2013-11-01

    Full Text Available Reasoning with Inductive Argument Test:A Study of Validity and Reliability Objective: The aim of our study is to research reliability and validity and to evaluate the usability of Turkish version of Reasoning with Inductive Argument Test (RIAT in Turkish healty population. Method: 51 healty volunteers who work in Ankara Dıskapi Yildirim Beyazit Research and Training Hospital participated in this study. Reasoning with Inductive Argument Test (RIAT was translated into Turkish by three clinical good knowledge of English. Participants were given a sociodemographic data form, and RIAT were performed by clinicians. To test the reliability of the Turkish version of RIAT, Cronbach’s alpha coefficient was calculated and the halving method was used for the test. Results: The internal consistency of the Reasoning with Inductive Argument Test (RIAT items, Cronbach’s alpha internal consistency coefficient measurements of 0.73 was found to be statistically significant. Spearman-Brown coefficient that determines the reliability of the whole test r=0.74 was found. Kurtosis values of all the items was below 1.5 and the percentages in the second evaluation were mainly lower. At the same time, both change in belief between self produced RIAT options and given RIAT options (p=0.02, z=-2296 as well as changes in beliefs between related and unrelated items for Obsessive Compulsive Disorder (OCD difference (p=0.03, z=-2.199 were significant. Conclusion: The preliminary data obtained from the study of reliability and validity of the scale shows that ‘Reasoning with Inductive Argument Test’ supports reliability and validity in Turkish population.

  5. Validity of the Eating Attitudes Test and the Eating Disorders Inventory in Bulimia Nervosa.

    Science.gov (United States)

    Gross, Janet; And Others

    1986-01-01

    Assessed criterion and concurrent validity of the Eating Attitudes Test and the Eating Disorder Inventory in 82 women with bulimia nervosa. Both tests demonstrated criterion validity by discriminating bulimia nervosa subjects from normals. Only weak support was found for concurrent validity within bulimia subjects. Recommends combination of…

  6. Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

    Science.gov (United States)

    Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

    2015-12-01

    To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.

  7. Validity of the Worth 4 Dot Test in Patients with Red-Green Color Vision Defect.

    Science.gov (United States)

    Bak, Eunoo; Yang, Hee Kyung; Hwang, Jeong-Min

    2017-05-01

    The Worth four dot test uses red and green glasses for binocular dissociation, and although it has been believed that patients with red-green color vision defects cannot accurately perform the Worth four dot test, this has not been validated. Therefore, the purpose of this study was to demonstrate the validity of the Worth four dot test in patients with congenital red-green color vision defects who have normal or abnormal binocular vision. A retrospective review of medical records was performed on 30 consecutive congenital red-green color vision defect patients who underwent the Worth four dot test. The type of color vision anomaly was determined by the Hardy Rand and Rittler (HRR) pseudoisochromatic plate test, Ishihara color test, anomaloscope, and/or the 100 hue test. All patients underwent a complete ophthalmologic examination. Binocular sensory status was evaluated with the Worth four dot test and Randot stereotest. The results were interpreted according to the presence of strabismus or amblyopia. Among the 30 patients, 24 had normal visual acuity without strabismus nor amblyopia and 6 patients had strabismus and/or amblyopia. The 24 patients without strabismus nor amblyopia all showed binocular fusional responses by seeing four dots of the Worth four dot test. Meanwhile, the six patients with strabismus or amblyopia showed various results of fusion, suppression, and diplopia. Congenital red-green color vision defect patients of different types and variable degree of binocularity could successfully perform the Worth four dot test. They showed reliable results that were in accordance with their estimated binocular sensory status.

  8. Modeling dynamic acousto-elastic testing experiments: validation and perspectives.

    Science.gov (United States)

    Gliozzi, A S; Scalerandi, M

    2014-10-01

    Materials possessing micro-inhomogeneities often display a nonlinear response to mechanical solicitations, which is sensitive to the confining pressure acting on the sample. Dynamic acoustoelastic testing allows measurement of the instantaneous variations in the elastic modulus due to the change of the dynamic pressure induced by a low-frequency wave. This paper shows that a Preisach-Mayergoyz space based hysteretic multi-state elastic model provides an explanation for experimental observations in consolidated granular media and predicts memory and nonlinear effects comparable to those measured in rocks.

  9. Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

    Science.gov (United States)

    Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

    2011-08-15

    Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.

  10. Genetic Validation of Leishmania donovani Lysyl-tRNA Synthetase Shows that It Is Indispensable for Parasite Growth and Infectivity.

    Science.gov (United States)

    Chadha, Sanya; Mallampudi, N Arjunreddy; Mohapatra, Debendra K; Madhubala, Rentala

    2017-01-01

    the proper construction of peptide chains. These enzymes provide raw materials for protein translation and also ensure fidelity of translation. L. donovani is a protozoan parasite that causes visceral leishmaniasis. It is a continuously proliferating parasite that depends heavily on efficient protein translation. Lysyl-tRNA synthetase is one of the aaRSs which charges lysine to its cognate tRNA. Two different coding sequences for lysyl-tRNA synthetases ( Ld LysRS) are present in this parasite. Ld LysRS-1 is closer to apicomplexans and eukaryotes, whereas Ld LysRS-2 is closer to bacteria. Here, we have characterized Ld LysRS-1 of L. donovani . Ld LysRS-1 appears to be an essential gene, as the chromosomal null mutants did not survive. The heterozygous mutants showed slower growth kinetics and exhibited attenuated virulence. This study also provides a platform to explore Ld LysRS-1 as a potential drug target.

  11. Educational testing validity and reliability in pharmacy and medical education literature.

    Science.gov (United States)

    Hoover, Matthew J; Jung, Rose; Jacobs, David M; Peeters, Michael J

    2013-12-16

    To evaluate and compare the reliability and validity of educational testing reported in pharmacy education journals to medical education literature. Descriptions of validity evidence sources (content, construct, criterion, and reliability) were extracted from articles that reported educational testing of learners' knowledge, skills, and/or abilities. Using educational testing, the findings of 108 pharmacy education articles were compared to the findings of 198 medical education articles. For pharmacy educational testing, 14 articles (13%) reported more than 1 validity evidence source while 83 articles (77%) reported 1 validity evidence source and 11 articles (10%) did not have evidence. Among validity evidence sources, content validity was reported most frequently. Compared with pharmacy education literature, more medical education articles reported both validity and reliability (59%; particles in pharmacy education compared to medical education, validity, and reliability reporting were limited in the pharmacy education literature.

  12. VALIDITY OF THE MODIFIED CONCONI TEST FOR DETERMINING VENTILATORY THRESHOLD DURING ON-WATER ROWING

    Directory of Open Access Journals (Sweden)

    Jorge Villamil Cabo

    2011-12-01

    Full Text Available The objectives of this study were to design a field test based on the Conconi protocol to determine the ventilatory threshold of rowers and to test its reliability and validity. A group of sixteen oarsmen completed a modified Conconi test for on-water rowing. The reliability of the detection of the heart rate threshold was evaluated using heart rate breaking point in the Conconi test and retest. Heart rate threshold was detected in 88.8% of cases in the test-retest. The validity of the modified Conconi test was evaluated by comparing the heart rate threshold data acquired with that obtained in a ventilatory threshold test (VT2. No significant differences were found for the values of different intensity parameters i.e. heart rate (HR, oxygen consumption (VO2, stroke rate (SR and speed (S between the heart rate threshold and the ventilatory threshold, (170.9 ± 6.8 vs. 169.3 ± 6.4 beats·min-1; 42.0 ± 8.6 vs. 43.5 ± 8.3 ml·kg-1·min-1; 25.8 ± 3.3 vs. 27.0 ± 3.2 strokes·min-1 and 14.4 ± 0.8 vs. 14.6 ± 0.8 km·h-1. The differences in averages obtained in the Conconi test-retest were small with a low standard error of the mean. The reliability data between the Conconi test-retest showed low coefficients of variations (CV and high intraclass correlation coefficients (ICC. The total errors for the Conconi test-retest are low for the measured variables (1.31 HR, 0.87 VO2, 0.65 SR, and 0.1 S. The Bland- Altman's method for analysis validity showed a strong concordance according to the analyzed variables. We conclude that the modified Conconi test for on-water rowing is a valid and reliable method for the determination of the second ventilatory threshold (VT2.

  13. Extended histopathology in immunotoxicity testing: Interlaboratory validation studies

    NARCIS (Netherlands)

    Germolec, D.R.; Nyska, A.; Kashon, M.; Kuper, C.F.; Portier, C.; Kommineni, C.; Johnson, K.A.; Luster, M.I.

    2004-01-01

    There has been considerable interest in the use of expanded histopathology as a primary screen for immunotoxicity assessment. To determine the utility of a semiquantitative histopathology approach for examining specific structural and architectural changes in lymphoid tissues, a validation effort

  14. Verification, validation, and field testing the USEPA National Stormwater Calculator

    Data.gov (United States)

    U.S. Environmental Protection Agency — We used this dataset to verify and validate functions in the USEPA National Stormwater Calculator, and then applied field data and commonly-available datasets to...

  15. Test-retest reliability and validity of the Sniffin' TOM odor memory test.

    Science.gov (United States)

    Croy, Ilona; Zehner, Cora; Larsson, Maria; Zucco, Gesualdo M; Hummel, Thomas

    2015-03-01

    Few attempts have been made to develop an olfactory test that captures episodic retention of olfactory information. Assessment of episodic odor memory is of particular interest in aging and in the cognitively impaired as both episodic memory deficits and olfactory loss have been targeted as reliable hallmarks of cognitive decline and impending dementia. Here, 96 healthy participants (18-92 years) and an additional 19 older people with mild cognitive impairment were tested (73-82 years). Participants were presented with 8 common odors with intentional encoding instructions that were followed by a yes-no recognition test. After recognition completion, participants were asked to identify all odors by means of free or cued identification. A retest of the odor memory test (Sniffin' TOM = test of odor memory) took place 17 days later. The results revealed satisfactory test-retest reliability (0.70) of odor recognition memory. Both recognition and identification performance were negatively affected by age and more pronounced among the cognitively impaired. In conclusion, the present work presents a reliable, valid, and simple test of episodic odor recognition memory that may be used in clinical groups where both episodic memory deficits and olfactory loss are prevalent preclinically such as Alzheimer's disease. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  16. Evaluating abdominal core muscle fatigue: Assessment of the validity and reliability of the prone bridging test.

    Science.gov (United States)

    De Blaiser, C; De Ridder, R; Willems, T; Danneels, L; Vanden Bossche, L; Palmans, T; Roosen, P

    2018-02-01

    The aims of this study were to research the amplitude and median frequency characteristics of selected abdominal, back, and hip muscles of healthy subjects during a prone bridging endurance test, based on surface electromyography (sEMG), (a) to determine if the prone bridging test is a valid field test to measure abdominal muscle fatigue, and (b) to evaluate if the current method of administrating the prone bridging test is reliable. Thirty healthy subjects participated in this experiment. The sEMG activity of seven abdominal, back, and hip muscles was bilaterally measured. Normalized median frequencies were computed from the EMG power spectra. The prone bridging tests were repeated on separate days to evaluate inter and intratester reliability. Significant differences in normalized median frequency slope (NMF slope ) values between several abdominal, back, and hip muscles could be demonstrated. Moderate-to-high correlation coefficients were shown between NMF slope values and endurance time. Multiple backward linear regression revealed that the test endurance time could only be significantly predicted by the NMF slope of the rectus abdominis. Statistical analysis showed excellent reliability (ICC=0.87-0.89). The findings of this study support the validity and reliability of the prone bridging test for evaluating abdominal muscle fatigue. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  17. Understanding Student Teachers’ Behavioural Intention to Use Technology: Technology Acceptance Model (TAM Validation and Testing

    Directory of Open Access Journals (Sweden)

    Kung-Teck, Wong

    2013-01-01

    Full Text Available This study sets out to validate and test the Technology Acceptance Model (TAM in the context of Malaysian student teachers’ integration of their technology in teaching and learning. To establish factorial validity, data collected from 302 respondents were tested against the TAM using confirmatory factor analysis (CFA, and structural equation modelling (SEM was used for model comparison and hypotheses testing. The goodness-of-fit test of the analysis shows partial support of the applicability of the TAM in a Malaysian context. Overall, the TAM accounted for 37.3% of the variance in intention to use technology among student teachers and of the five hypotheses formulated, four are supported. Perceived usefulness is a significant influence on attitude towards computer use and behavioural intention. Perceived ease of use significantly influences perceived usefulness, and finally, behavioural intention is found to be influenced by attitude towards computer use. The findings of this research contribute to the literature by validating the TAM in the Malaysian context and provide several prominent implications for the research and practice of technology integration development.

  18. Validity and reliability of tests determining performance-related components of wheelchair basketball

    NARCIS (Netherlands)

    de Groot, Sonja; Balvers, Inge J.M.; Kouwenhoven, Sanne M.; Janssen, Thomas W.J.

    The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'

  19. Validity and reliability of tests determining performance-related components of wheelchair basketball

    NARCIS (Netherlands)

    De Groot, Sonja; Balvers, Inge J. M.; Kouwenhoven, Sanne M.; Janssen, Thomas W. J.

    2012-01-01

    The purpose of this study was to investigate the reliability and validity of wheelchair basketball field tests. Nineteen wheelchair basketball players performed 10 test items twice to determine the reliability. The validity of the tests was assessed by relating the scores to the players'

  20. An Integrated Approach to Establish Validity and Reliability of Reading Tests

    Science.gov (United States)

    Razi, Salim

    2012-01-01

    This study presents the processes of developing and establishing reliability and validity of a reading test by administering an integrative approach as conventional reliability and validity measures superficially reveals the difficulty of a reading test. In this respect, analysing vocabulary frequency of the test is regarded as a more eligible way…

  1. Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

    Science.gov (United States)

    Bhat, Mehraj A.

    2014-01-01

    This paper is based on the construction and evaluation of reliability and validity of reasoning ability test at secondary school students. In this paper an attempt was made to evaluate validity, reliability and to determine the appropriate standards to interpret the results of reasoning ability test. The test includes 45 items to measure six types…

  2. 40 CFR 1045.501 - How do I run a valid emission test?

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test... Procedures § 1045.501 How do I run a valid emission test? (a) Applicability. This subpart is addressed to you... maximum test speed. (g) Special and alternate procedures. If you are unable to run the duty cycle...

  3. 40 CFR 1054.501 - How do I run a valid emission test?

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test... Procedures § 1054.501 How do I run a valid emission test? (a) Applicability. This subpart is addressed to you... provisions of 40 CFR 1065.405 describes how to prepare an engine for testing. However, you may consider...

  4. Validation of Linguistic and Communicative Oral Language Tests for Spanish-English Bilingual Programs.

    Science.gov (United States)

    Politzer, Robert L.; And Others

    1983-01-01

    The development, administration, and scoring of a communicative test and its validation with tests of linguistic and sociolinguistic competence in English and Spanish are reported. Correlation with measures of home language use and school achievement are also presented, and issues of test validation for bilingual programs are discussed. (MSE)

  5. Test of Achievement in Quantitative Economics for Secondary Schools: Construction and Validation Using Item Response Theory

    Science.gov (United States)

    Eleje, Lydia I.; Esomonu, Nkechi P. M.

    2018-01-01

    A Test to measure achievement in quantitative economics among secondary school students was developed and validated in this study. The test is made up 20 multiple choice test items constructed based on quantitative economics sub-skills. Six research questions guided the study. Preliminary validation was done by two experienced teachers in…

  6. Validity and reliability testing of the Prenatal Psychosocial Profile.

    Science.gov (United States)

    Curry, M A; Campbell, R A; Christian, M

    1994-04-01

    Two studies of low-income pregnant women (N = 179) were done to examine the validity and reliability of the Prenatal Psychosocial Profile (PPP). The PPP, a composite of the Rosenberg Self-Esteem Scale, the Support Behaviors Inventory, and a newly developed measure of stress, is a brief, comprehensive clinical assessment of psychosocial risk during pregnancy. Construct validity of the stress scale was supported by theoretically predicted negative correlations with self-esteem, partner support, and support from others (N = 91). Convergent validity of the stress scale was demonstrated by a correlation of .71 with the Difficult Life Circumstances Scale. Adequate levels of internal consistency were found. Interrelationships between the four subscales were consistent with the underlying conceptualization, and there was beginning evidence of the factorial independence of the subscales.

  7. Shaking Table Tests Validating Two Strengthening Interventions on Masonry Buildings

    International Nuclear Information System (INIS)

    De Canio, Gerardo; Poggi, Massimo; Clemente, Paolo; Muscolino, Giuseppe; Palmeri, Alessandro

    2008-01-01

    numerical and experimental research has been carried out, aimed at validating two different strengthening interventions on masonry buildings: (i) the substitution of the existing roof with timber-concrete composite slabs, which are able to improve the dynamic behaviour of the structure without excessively increase the mass, and (ii) the reinforcement of masonry walls with FRP materials, which allow increasing both stiffness and strength of the construction. The experimental tests have been performed on a 1:2 scale model of a masonry building resembling a special type, the so-called 'tipo misto messinese', which is proper to the reconstruction of the city of Messina after the 1783 Calabria earthquake. The model, incorporating a novel timber-concrete composite slab, has been tested on the main shaking table available at the ENEA Research Centre 'Casaccia', both before and after the reinforcement with FRP materials. Some aspects related to the definition of the model and to the selection of an appropriate seismic input will be discussed, and numerical results confirming the effectiveness of the interventions mentioned above will be presented

  8. Development and validation of OECD test guidelines on mollusc reproductive toxicity tests

    DEFF Research Database (Denmark)

    Lagadic, Laurent; Holbech, Henrik; hutchinson, tom

    the comparison of endpoints relevant for reproduction in invertebrates often shows a much higher sensitivity in molluscs vs. e.g. daphnids. The OECD test guideline programme has thus been extended to cover reproduction effects of chemicals in molluscs. Existing mollusc toxicity test protocols have been reviewed...... in an OECD Detailed Review Paper that identifies two relevant candidate species for developing freshwater tests: Potamopyrgus antipodarum and Lymnaea stagnalis. However, this review did not clarify which toxicity test design/conditions are the most appropriate for chemicals assessment. Therefore, a mollusc...... reproduction test guideline will be developed describing partial- and full- life-cycle test protocols in these species, so as to propose a balanced suite of apical mollusc toxicity tests applicable for the assessment of any type of chemical, including endocrine disruptors, as level 4 and 5 assays of the EDTA...

  9. Recommendations for elaboration, transcultural adaptation and validation process of tests in Speech, Hearing and Language Pathology.

    Science.gov (United States)

    Pernambuco, Leandro; Espelt, Albert; Magalhães, Hipólito Virgílio; Lima, Kenio Costa de

    2017-06-08

    to present a guide with recommendations for translation, adaptation, elaboration and process of validation of tests in Speech and Language Pathology. the recommendations were based on international guidelines with a focus on the elaboration, translation, cross-cultural adaptation and validation process of tests. the recommendations were grouped into two Charts, one of them with procedures for translation and transcultural adaptation and the other for obtaining evidence of validity, reliability and measures of accuracy of the tests. a guide with norms for the organization and systematization of the process of elaboration, translation, cross-cultural adaptation and validation process of tests in Speech and Language Pathology was created.

  10. Evaporation over sump surface in containment studies: code validation on TOSQAN tests

    International Nuclear Information System (INIS)

    Malet, J.; Gelain, T.; Degrees du Lou, O.; Daru, V.

    2011-01-01

    During the course of a severe accident in a Nuclear Power Plant, water can be collected in the sump containment through steam condensation on walls and spray systems activation. The objective of this paper is to present code validation on evaporative sump tests performed on the TOSQAN facility. The ASTEC-CPA code is used as a lumped-parameter code and specific user-defined-functions are developed for the TONUS-CFD code. The tests are air-steam tests, as well as tests with other non-condensable gases (He, CO 2 and SF 6 ) under steady and transient conditions. The results show a good agreement between codes and experiments, indicating a good behaviour of the sump models in both codes. (author)

  11. Multicultural Standardization and Validation of TEMAS, a Thematic Apperception Test.

    Science.gov (United States)

    Costantino, Giuseppe; Malgady, Robert G.

    Mental health clinical services research has emphasized the urgency of developing new psychometric instruments for non-biased psychological assessment of minority and non-minority children of diverse cultural groups in the United States. Background multicultural standardization and validation information is presented for Tell-Me-A-Story (TEMAS)--a…

  12. Development and Validation of the Personality Assessment Questionnaire: Test Manual.

    Science.gov (United States)

    Rohner, Ronald P.; And Others

    Data are presented evaluating the validity and reliability of the Personality Assessment Questionnaire (PAQ), a self-report questionnaire designed to elicit respondents' perceptions of themselves with respect to seven personality and behavioral dispositions: hostility and aggression, dependence, self-esteem, self-adequacy, emotional…

  13. TESTING METHODS FOR MECHANICALLY IMPROVED SOILS: RELIABILITY AND VALIDITY

    Directory of Open Access Journals (Sweden)

    Ana Petkovšek

    2017-10-01

    Full Text Available A possibility of in-situ mechanical improvement for reducing the liquefaction potential of silty sands was investigated by using three different techniques: Vibratory Roller Compaction, Rapid Impact Compaction (RIC and Soil Mixing. Material properties at all test sites were investigated before and after improvement with the laboratory and the in situ tests (CPT, SDMT, DPSH B, static and dynamic load plate test, geohydraulic tests. Correlation between the results obtained by different test methods gave inconclusive answers.

  14. Multi Directional Repeated Sprint Is a Valid and Reliable Test for Assessment of Junior Handball Players

    Directory of Open Access Journals (Sweden)

    Amin Daneshfar

    2018-04-01

    Full Text Available The aim of the present study was to examine the validity and reliability of a 10 × (6 × 5 m multi-directional repeated sprint ability test (RSM in elite young team handball (TH players. Participants were members of the Iranian national team (n = 20, age 16.4 ± 0.7 years, weight 82.5 ± 5.5 kg, height 184.8 ± 4.6 cm, body fat 15.4 ± 4.3%. The validity of RSM was tested against a 10 × (15 + 15 m repeated sprint ability test (RSA, Yo-Yo Intermittent Recovery test Level 1 (Yo-Yo IR1, squat jump (SJ and countermovement jump (CMJ. To test the reliability of RSM, the participants repeated the testing sessions of RSM and RSA 1 week later. Both RSA and RSM tests showed good to excellent reliability of the total time (TT, best time (BT, and weakest time (WT. The results of the correlation analysis showed significant inverse correlations between maximum aerobic capacity and TT in RSA (r = −0.57, p ≤ 0.05 and RSM (r = −0.76, p ≤ 0.01. There was also a significant inverse correlation between maximum aerobic capacity with fatigue index (FI in RSA test (r = −0.64, p ≤ 0.01 and in RSM test (r = −0.53, p ≤ 0.05. BT, WT, and TT of RSA was largely-to-very largely correlated with BT (r = 0.58, p ≤ 0.01, WT (r = 0.62, p ≤ 0.01, and TT (r = 0.65, p ≤ 0.01 of RSM. BT in RSM was also correlated with FI in RSM (r = 0.88, p ≤ 0.01. In conclusion, based on the findings of the current study, the recently developed RSM test is a valid and reliable test and should be utilized for assessment of repeated sprint ability in handball players.

  15. Multi Directional Repeated Sprint Is a Valid and Reliable Test for Assessment of Junior Handball Players

    Science.gov (United States)

    Daneshfar, Amin; Gahreman, Daniel E.; Koozehchian, Majid S.; Amani Shalamzari, Sadegh; Hassanzadeh Sablouei, Mozhgan; Rosemann, Thomas; Knechtle, Beat; Nikolaidis, Pantelis T.

    2018-01-01

    The aim of the present study was to examine the validity and reliability of a 10 × (6 × 5 m) multi-directional repeated sprint ability test (RSM) in elite young team handball (TH) players. Participants were members of the Iranian national team (n = 20, age 16.4 ± 0.7 years, weight 82.5 ± 5.5 kg, height 184.8 ± 4.6 cm, body fat 15.4 ± 4.3%). The validity of RSM was tested against a 10 × (15 + 15 m) repeated sprint ability test (RSA), Yo-Yo Intermittent Recovery test Level 1 (Yo-Yo IR1), squat jump (SJ) and countermovement jump (CMJ). To test the reliability of RSM, the participants repeated the testing sessions of RSM and RSA 1 week later. Both RSA and RSM tests showed good to excellent reliability of the total time (TT), best time (BT), and weakest time (WT). The results of the correlation analysis showed significant inverse correlations between maximum aerobic capacity and TT in RSA (r = −0.57, p ≤ 0.05) and RSM (r = −0.76, p ≤ 0.01). There was also a significant inverse correlation between maximum aerobic capacity with fatigue index (FI) in RSA test (r = −0.64, p ≤ 0.01) and in RSM test (r = −0.53, p ≤ 0.05). BT, WT, and TT of RSA was largely-to-very largely correlated with BT (r = 0.58, p ≤ 0.01), WT (r = 0.62, p ≤ 0.01), and TT (r = 0.65, p ≤ 0.01) of RSM. BT in RSM was also correlated with FI in RSM (r = 0.88, p ≤ 0.01). In conclusion, based on the findings of the current study, the recently developed RSM test is a valid and reliable test and should be utilized for assessment of repeated sprint ability in handball players. PMID:29670536

  16. 15N liver function tests - concept, validity, clinical use

    International Nuclear Information System (INIS)

    Faust, H.; Jung, K.; Krumbiegel, P.; Hirschberg, K.; Reinhardt, R.; Junghans, P.

    1987-01-01

    Several liver function tests using the oral application of a nitrogen compound labelled with 15 N and the subsequent determination of 15 N in a certain fraction of urine by emission spectrometry are described. Because of the key position of the liver in the metabolism of nitrogen compounds the results of these tests allow conclusions concerning disturbances of special liver functions. Instructions for the clinical use of the '[ 15 N]Ammonium Test', '[ 15 N]Hippurate Test' the '[ 15 N]Methacetin Test', and the '[ 15 N]Glycine Test' are given. (author)

  17. VALIDITY IN COMPUTER-BASED TESTING: A LITERATURE REVIEW OF COMPARABILITY ISSUES AND EXAMINEE PERSPECTIVES

    Directory of Open Access Journals (Sweden)

    Ika Kana Trisnawati

    2015-05-01

    Full Text Available These past years have seen the growing popularity of the Computer-Based Tests (CBTs in various disciplines, for various purposes, although the Paper-and Pencil Based Tests (P&Ps are still in use. However, many question on whether the use of CBTs outperform the effectiveness of the P&Ps or if the CBTs can become a valid measuring tool compared to the PBTs. This paper tries to present the comparison on both the CBTs and the P&Ps and their respective examinee perspectives in order to figure out if doubts should arise to the emergence of the CBTs over the classic P&Ps. Findings showed that the CBTs are advantageous in that they are both efficient (reducing testing time and effective (maintaining the test reliability over the P&P versions. Nevertheless, the CBTs still need to have their variables well-designed (e.g., study design, computer algorithm in order for the scores to be comparable to those in the P&P tests since the score equivalence is one of the validity evidences needed in a CBT.

  18. A Valid Culture-Fair Test of Intelligence

    National Research Council Canada - National Science Library

    Fagan, Joseph F

    2008-01-01

    .... The technical barrier overcome was that current theories of intelligence are based on an assumption that all those taking IQ tests have had equal opportunity for exposure to the information being tested...

  19. Validation and Structural Analysis of the Kinematics Concept Test

    Science.gov (United States)

    Lichtenberger, A.; Wagner, C.; Hofer, S. I.; Stem, E.; Vaterlaus, A.

    2017-01-01

    The kinematics concept test (KCT) is a multiple-choice test designed to evaluate students' conceptual understanding of kinematics at the high school level. The test comprises 49 multiple-choice items about velocity and acceleration, which are based on seven kinematic concepts and which make use of three different representations. In the first part…

  20. The Unified Language Testing Plan: Speaking Proficiency Test. Spanish and English Pilot Validation Studies. Report Number 1.

    Science.gov (United States)

    Thornton, Julie A.

    This report describes one segment of the Federal Language Testing Board's Unified Language Testing Plan (ULTP), the validation of speaking proficiency tests in Spanish and English. The ULTP is a project to increase standardization of foreign language proficiency measurement and promote sharing of resources among testing programs in the federal…

  1. Competency measurements: testing convergent validity for two measures.

    Science.gov (United States)

    Cowin, Leanne S; Hengstberger-Sims, Cecily; Eagar, Sandy C; Gregory, Linda; Andrew, Sharon; Rolley, John

    2008-11-01

    This paper is a report of a study to investigate whether the Australian National Competency Standards for Registered Nurses demonstrate correlations with the Finnish Nurse Competency Scale. Competency assessment has become popular as a key regulatory requirement and performance indicator. The term competency, however, does not have a globally accepted definition and this has the potential to create controversy, ambiguity and confusion. Variations in meaning and definitions adopted in workplaces and educational settings will affect the interpretation of research findings and have implications for the nursing profession. A non-experimental cross-sectional survey design was used with a convenience sample of 116 new graduate nurses in 2005. The second version of the Australian National Competency Standards and the Nurse Competency Scale was used to elicit responses to self-assessed competency in the transitional year (first year as a Registered Nurse). Correlational analysis of self-assessed levels of competence revealed a relationship between the Australian National Competency Standards (ANCI) and the Nurse Competency Scale (NCS). The correlational relation between ANCI domains and NCS factors suggests that these scales are indeed used to measure related dimensions. A statistically significant relationship (r = 0.75) was found between the two competency measures. Although the finding of convergent validity is insufficient to establish construct validity for competency as used in both measures in this study, it is an important step towards this goal. Future studies on relationships between competencies must take into account the validity and reliability of the tools.

  2. Validation of a Human Papillomavirus (HPV) DNA Cervical Screening Test That Provides Expanded HPV Typing.

    Science.gov (United States)

    Demarco, Maria; Carter-Pokras, Olivia; Hyun, Noorie; Castle, Philip E; He, Xin; Dallal, Cher M; Chen, Jie; Gage, Julia C; Befano, Brian; Fetterman, Barbara; Lorey, Thomas; Poitras, Nancy; Raine-Bennett, Tina R; Wentzensen, Nicolas; Schiffman, Mark

    2018-05-01

    As cervical cancer screening shifts from cytology to human papillomavirus (HPV) testing, a major question is the clinical value of identifying individual HPV types. We aimed to validate Onclarity (Becton Dickinson Diagnostics, Sparks, MD), a nine-channel HPV test recently approved by the FDA, by assessing (i) the association of Onclarity types/channels with precancer/cancer; (ii) HPV type/channel agreement between the results of Onclarity and cobas (Roche Molecular Systems, Pleasanton, CA), another FDA-approved test; and (iii) Onclarity typing for all types/channels compared to typing results from a research assay (linear array [LA]; Roche). We compared Onclarity to histopathology, cobas, and LA. We tested a stratified random sample ( n = 9,701) of discarded routine clinical specimens that had tested positive by Hybrid Capture 2 (HC2; Qiagen, Germantown, MD). A subset had already been tested by cobas and LA ( n = 1,965). Cervical histopathology was ascertained from electronic health records. Hierarchical Onclarity channels showed a significant linear association with histological severity. Onclarity and cobas had excellent agreement on partial typing of HPV16, HPV18, and the other 12 types as a pool (sample-weighted kappa value of 0.83); cobas was slightly more sensitive for HPV18 and slightly less sensitive for the pooled high-risk types. Typing by Onclarity showed excellent agreement with types and groups of types identified by LA (kappa values from 0.80 for HPV39/68/35 to 0.97 for HPV16). Onclarity typing results corresponded well to histopathology and to an already validated HPV DNA test and could provide additional clinical typing if such discrimination is determined to be clinically desirable. This is a work of the U.S. Government and is not subject to copyright protection in the United States. Foreign copyrights may apply.

  3. Evidences of validity and reliability of the Luria-Nebraska Test for Children

    Directory of Open Access Journals (Sweden)

    Ricardo Franco de Lima

    2016-01-01

    Full Text Available Abstract This paper aimed to verify evidences of validity and reliability of Luria-Nebraska Test for Children (TLN-C, in Portuguese. Three hundred eighty-seven students aged 6–13 years old, with learning difficulties, comprised the study. They were assessed with the Wechsler Intelligence Scale for Children (WISC-III and TLN-C; and effect of age differences, as well as accuracy rating by internal consistency were investigated. Age effects were found for all subtests and in the general score, except for receptive speech subtest, even when total IQ effect was controlled. Reliability analysis had satisfactory results (0.79. The TLN-C showed evidences of validity and reliability. Receptive speech subtest requires revision.

  4. Enhancing rigour in the validation of patient reported outcome measures (PROMs: bridging linguistic and psychometric testing

    Directory of Open Access Journals (Sweden)

    Roberts Gwerfyl

    2012-06-01

    Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115 and patients with depression (n = 37 concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96; item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94 and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002; and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.

  5. Validity of FAA-approved color vision tests for class II and class III aeromedical screening.

    Science.gov (United States)

    1993-09-01

    All clinical color vision tests currently used in the medical examination of pilots were studied regarding validity for prediction of performance on practical tests of ability to discriminate the aviation signal colors, red, green, and white given un...

  6. Validity and Reliability Study of the Korean Tinetti Mobility Test for Parkinson's Disease.

    Science.gov (United States)

    Park, Jinse; Koh, Seong-Beom; Kim, Hee Jin; Oh, Eungseok; Kim, Joong-Seok; Yun, Ji Young; Kwon, Do-Young; Kim, Younsoo; Kim, Ji Seon; Kwon, Kyum-Yil; Park, Jeong-Ho; Youn, Jinyoung; Jang, Wooyoung

    2018-01-01

    Postural instability and gait disturbance are the cardinal symptoms associated with falling among patients with Parkinson's disease (PD). The Tinetti mobility test (TMT) is a well-established measurement tool used to predict falls among elderly people. However, the TMT has not been established or widely used among PD patients in Korea. The purpose of this study was to evaluate the reliability and validity of the Korean version of the TMT for PD patients. Twenty-four patients diagnosed with PD were enrolled in this study. For the interrater reliability test, thirteen clinicians scored the TMT after watching a video clip. We also used the test-retest method to determine intrarater reliability. For concurrent validation, the unified Parkinson's disease rating scale, Hoehn and Yahr staging, Berg Balance Scale, Timed-Up and Go test, 10-m walk test, and gait analysis by three-dimensional motion capture were also used. We analyzed receiver operating characteristic curve to predict falling. The interrater reliability and intrarater reliability of the Korean Tinetti balance scale were 0.97 and 0.98, respectively. The interrater reliability and intra-rater reliability of the Korean Tinetti gait scale were 0.94 and 0.96, respectively. The Korean TMT scores were significantly correlated with the other clinical scales and three-dimensional motion capture. The cutoff values for predicting falling were 14 points (balance subscale) and 10 points (gait subscale). We found that the Korean version of the TMT showed excellent validity and reliability for gait and balance and had high sensitivity and specificity for predicting falls among patients with PD.

  7. Racial Bias and Predictive Validity in Testing for Selection.

    Science.gov (United States)

    1983-07-01

    the inequa - lity rR (P.C) *0 (2) must define test bias. This definition of test bias conforms to the requirements of the Civil Rights Act of 1964 as...of Educational Measurement, 1976, 13, 43-52. Einhorn, H. J., & Bass, A. R. Methodological considerations relevant to discrimination in employment ...34unbiased" selec- tion model: A question of utilities. Journal of Applied Psychology, 1975, 60, 345-351. Guion, R. M. Employment tests and discriminatory

  8. Bradykinesia-akinesia incoordination test: validating an online keyboard test of upper limb function.

    Science.gov (United States)

    Noyce, Alastair J; Nagy, Anna; Acharya, Shami; Hadavi, Shahrzad; Bestwick, Jonathan P; Fearnley, Julian; Lees, Andrew J; Giovannoni, Gavin

    2014-01-01

    The Bradykinesia Akinesia Incoordination (BRAIN) test is a computer keyboard-tapping task that was developed for use in assessing the effect of symptomatic treatment on motor function in Parkinson's disease (PD). An online version has now been designed for use in a wider clinical context and the research setting. Validation of the online BRAIN test was undertaken in 58 patients with Parkinson's disease (PD) and 93 age-matched, non-neurological controls. Kinesia scores (KS30, number of key taps in 30 seconds), akinesia times (AT30, mean dwell time on each key in milliseconds), incoordination scores (IS30, variance of travelling time between key presses) and dysmetria scores (DS30, accuracy of key presses) were compared between groups. These parameters were correlated against total motor scores and sub-scores from the Unified Parkinson's Disease Rating Scale (UPDRS). Mean KS30, AT30 and IS30 were significantly different between PD patients and controls (p≤0.0001). Sensitivity for 85% specificity was 50% for KS30, 40% for AT30 and 29% for IS30. KS30, AT30 and IS30 correlated significantly with UPDRS total motor scores (r = -0.53, r = 0.27 and r = 0.28 respectively) and motor UPDRS sub-scores. The reliability of KS30, AT30 and DS30 was good on repeated testing. The BRAIN test is a reliable, convenient test of upper limb motor function that can be used routinely in the outpatient clinic, at home and in clinical trials. In addition, it can be used as an objective longitudinal measurement of emerging motor dysfunction for the prediction of PD in at-risk cohorts.

  9. Bradykinesia-akinesia incoordination test: validating an online keyboard test of upper limb function.

    Directory of Open Access Journals (Sweden)

    Alastair J Noyce

    Full Text Available The Bradykinesia Akinesia Incoordination (BRAIN test is a computer keyboard-tapping task that was developed for use in assessing the effect of symptomatic treatment on motor function in Parkinson's disease (PD. An online version has now been designed for use in a wider clinical context and the research setting.Validation of the online BRAIN test was undertaken in 58 patients with Parkinson's disease (PD and 93 age-matched, non-neurological controls. Kinesia scores (KS30, number of key taps in 30 seconds, akinesia times (AT30, mean dwell time on each key in milliseconds, incoordination scores (IS30, variance of travelling time between key presses and dysmetria scores (DS30, accuracy of key presses were compared between groups. These parameters were correlated against total motor scores and sub-scores from the Unified Parkinson's Disease Rating Scale (UPDRS.Mean KS30, AT30 and IS30 were significantly different between PD patients and controls (p≤0.0001. Sensitivity for 85% specificity was 50% for KS30, 40% for AT30 and 29% for IS30. KS30, AT30 and IS30 correlated significantly with UPDRS total motor scores (r = -0.53, r = 0.27 and r = 0.28 respectively and motor UPDRS sub-scores. The reliability of KS30, AT30 and DS30 was good on repeated testing.The BRAIN test is a reliable, convenient test of upper limb motor function that can be used routinely in the outpatient clinic, at home and in clinical trials. In addition, it can be used as an objective longitudinal measurement of emerging motor dysfunction for the prediction of PD in at-risk cohorts.

  10. Development and validation of a partial life-cycle test with Potamopyrgus antipodarum

    DEFF Research Database (Denmark)

    Geiss, Cornelia; Holbech, Henrik; Kinnberg, Karin Lund

    endpoints. The present study aims to develop and validate the partial life-cycle test on the reproduction of P. antipodarum. Here, results from two pre-validation studies of the reproduction test with the chemicals tributyltin (TBT) with nominal concentrations of 10 - 400 ng TBT-Sn/L and cadmium...

  11. 40 CFR 1039.501 - How do I run a valid emission test?

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test? 1039.501 Section 1039.501 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR... Procedures § 1039.501 How do I run a valid emission test? (a) Use the equipment and procedures for...

  12. Understanding Student Teachers' Behavioural Intention to Use Technology: Technology Acceptance Model (TAM) Validation and Testing

    Science.gov (United States)

    Wong, Kung-Teck; Osman, Rosma bt; Goh, Pauline Swee Choo; Rahmat, Mohd Khairezan

    2013-01-01

    This study sets out to validate and test the Technology Acceptance Model (TAM) in the context of Malaysian student teachers' integration of their technology in teaching and learning. To establish factorial validity, data collected from 302 respondents were tested against the TAM using confirmatory factor analysis (CFA), and structural equation…

  13. [Testing reliability and validity of reduced substitutes for leadership scales(rd-SLS)].

    Science.gov (United States)

    Kim, Jeong-Hee

    2005-10-01

    This paper was conducted to test the reliability and validity of rd-SLS, developed by Podsakoff, et al. (1993) which measured 'substitutes for leadership'. The subjects were 345 nurses in 5 general hospitals. Cronbach's and the Guttman split-half coefficient were used to test the reliability of rd-SLS. Factor analysis, and the correlations of the rv-SLS and SLS with rd-SLS were used for convergent and discriminant validity. Cronbach's data was 0.76 and the Guttman split-half coefficient was 0.52. Twelve factors evolved by factor analysis, which explained 70.4% of the total variance. This result was similar to previous study results. However, 'Indifference toward organizational rewards'-related items were classified two factors. It was not clear t hat the rd-SLS consisted of 13 concepts(factors). The correlations of the rv-SLS and SLS with the rd-SLS were 0.93 and 0.87 respectively. The rd-SLS showed a moderate degree of validity and reliability. Thus, it is recommended to use the rd-SLS in general nursing organizations for screening for leadership substitutes. In addition, it is necessary to clarify the concept of organizational rewards. In a further study, the factor structure of the rd-SLS may be considered.

  14. Computer-aided test selection and result validation-opportunities and pitfalls

    DEFF Research Database (Denmark)

    McNair, P; Brender, J; Talmon, J

    1998-01-01

    /or to increase cost-efficiency). Our experience shows that there is a practical limit to the extent of exploitation of the principle of dynamic test scheduling, unless it is automated in one way or the other. This paper analyses some issues of concern related to the profession of clinical biochemistry, when......Dynamic test scheduling is concerned with pre-analytical preprocessing of the individual samples within a clinical laboratory production by means of decision algorithms. The purpose of such scheduling is to provide maximal information with minimal data production (to avoid data pollution and...... implementing such dynamic test scheduling within a Laboratory Information System (and/or an advanced analytical workstation). The challenge is related to 1) generation of appropriately validated decision models, and 2) mastering consequences of analytical imprecision and bias....

  15. Validating WCAG versions 1.0 and 2.0 through usability testing with disabled users

    DEFF Research Database (Denmark)

    Rømen, Dagfinn; Svanæs, Dag

    2012-01-01

    ) and a control group (N = 6), it was found that only 27% of the identified website accessibility problems could have been identified through the use of WCAG 1.0. A similar analysis of conformance to WCAG 2.0 showed a marginal 5% improvement concerning identified website accessibility problems. Compensating...... accessibility evaluations and guidelines in many countries. WCAG version 2.0 was released in 2008. This paper reports on a study that empirically validated the usefulness of using WCAG as a heuristic for website accessibility. Through controlled usability tests of two websites with disabled users (N = 7...... for the low number of test subjects with confidence tests gave results that were still low (42% for WCAG 1.0 and 49% for WCAG 2.0, with 95% confidence). It is concluded from this that the application of WAI accessibility guidelines is not sufficient to guarantee website accessibility. It is recommended...

  16. The Smoking-Related Weight and Eating Episodes Test (SWEET): development and preliminary validation.

    Science.gov (United States)

    Adams, Claire E; Baillie, Lauren E; Copeland, Amy L

    2011-11-01

    Many smokers believe that smoking helps them to control their weight, and concerns about weight gain can interfere with smoking cessation. As researchers typically assess general weight concerns, a measure specific to smoking-related weight concerns is needed. The Smoking-related Weight and Eating Episodes Test (SWEET) was created by generating items from 4 content domains: Hunger, Craving, Overeating, and Body Image. Female undergraduate smokers (N = 280) rated their postcessation weight gain concern and completed the SWEET, Fagerström Test for Nicotine Dependence, Brief Smoking Consequences Questionnaire-Adult, Eating Attitudes Test (EAT)-26, Bulimia Test-Revised (BULIT-R), and Body Shape Questionnaire. Factor analysis of the initial items suggested a 4-factor solution, suggesting 4 subscales: Smoking to suppress appetite, smoking to prevent overeating, smoking to cope with body dissatisfaction, and withdrawal-related appetite increases. Based on these results, the SWEET subscales were revised and shortened. The resulting 10-item SWEET showed excellent internal consistency (total α = .94; mean α = .86) and evidence of validity by predicting smoking frequency, eating pathology, and body image concerns (ps < .05). Smoking frequency, eating pathology, and body image concerns were significantly predicted by the SWEET while controlling for existing measures of postcessation weight gain concern. The SWEET appears to be a reliable and valid measure of tendencies to smoke in response to body image concern and nicotine withdrawal and as a way to control appetite and overeating.

  17. Development and validation of a dissolution test for lodenafil carbonate based on in vivo data.

    Science.gov (United States)

    Codevilla, Cristiane Franco; Castilhos, Tamara dos Santos; Cirne, Carolina Araújo; Froehlich, Pedro Eduardo; Bergold, Ana Maria

    2014-04-01

    Lodenafil carbonate is a phosphodiesterase type 5 inhibitor used for the treatment of erectile dysfunction. Currently, there is no dissolution test reported for lodenafil carbonate and this drug is not listed in any pharmacopoeia. The present study focused on the development and validation of a dissolution test for lodenafil carbonate tablets, using a simulated absorption profile based on in vivo data. The appropriate conditions were determined after testing sink conditions. Different conditions as medium, surfactant concentration and rotation speed were evaluated. The percentage of dose absorbed was calculated by deconvolution, using the Wagner-Nelson method. According to the obtained results, the use of 0.1 M HCl + 1.5% SLS (900 mL, at 37 + 0.5 °C) as the dissolution medium, paddles at 25 rpm were considered adequate. The samples were quantified by UV spectroscopy at 295 nm and the validation was performed according to international guidelines. The method showed specificity, linearity, accuracy and precision, within the acceptable range. Kinetics of drug release was better described by the first-order model. The proposed dissolution test can be used for the routine quality control of lodenafil carbonate in tablets.

  18. The initial validation of a test of emergent literacy

    NARCIS (Netherlands)

    Gruhn, C.M.S.; Weideman, A.J.

    2017-01-01

    In addition to a large body of evidence supporting the relevance of the home environment for literacy development, tests of cognitive-based skills are commonly employed to predict literacy acquisition. The Test of Emergent Literacy (TEL) has been designed to account for the early interaction of

  19. Attention and Intelligence: The Validity of the Star Counting Test.

    NARCIS (Netherlands)

    Jong, de P.F.; Das-Smaal, E.A.

    1995-01-01

    The mechanisms underlying performance on the Star Counting Test (SCT) and its nomothetic span were investigated along with the relationships between working memory capacity, fluid intelligence (Gf), speed, and school achievement. The SCT is an attention test for children that requires the

  20. Race of Examiner Effects and the Validity of Intelligence Tests.

    Science.gov (United States)

    Graziano, William G.; And Others

    1982-01-01

    Recent empirical evidence for the influence of examiner's race on examinee's performance on intelligence tests is reviewed. The current literature, 1966 through 1980, offers little support for the hypothesis that examiner's race has a systematic effect on examinee's performance on intelligence tests. Conceptual and methodological issues are…

  1. Design and validation of a cardiorespiratory capacity test for ...

    African Journals Online (AJOL)

    11.00 months), that were randomly selected from three schools in the southeast of Spain participated in this study. The 10x20m test was designed to evaluate aerobic endurance. The 6-minute walk test (6MWT) was selected and used for ...

  2. Testing and Validation of the Dynamic Interia Measurement Method

    Science.gov (United States)

    Chin, Alexander; Herrera, Claudia; Spivey, Natalie; Fladung, William; Cloutier, David

    2015-01-01

    This presentation describes the DIM method and how it measures the inertia properties of an object by analyzing the frequency response functions measured during a ground vibration test (GVT). The DIM method has been in development at the University of Cincinnati and has shown success on a variety of small scale test articles. The NASA AFRC version was modified for larger applications.

  3. Test plan for validation of the radiative transfer equation.

    Energy Technology Data Exchange (ETDEWEB)

    Ricks, Allen Joseph; Grasser, Thomas W.; Kearney, Sean Patrick; Jernigan, Dann A.; Blanchat, Thomas K.

    2010-09-01

    As the capabilities of numerical simulations increase, decision makers are increasingly relying upon simulations rather than experiments to assess risks across a wide variety of accident scenarios including fires. There are still, however, many aspects of fires that are either not well understood or are difficult to treat from first principles due to the computational expense. For a simulation to be truly predictive and to provide decision makers with information which can be reliably used for risk assessment the remaining physical processes must be studied and suitable models developed for the effects of the physics. A set of experiments are outlined in this report which will provide soot volume fraction/temperature data and heat flux (intensity) data for the validation of models for the radiative transfer equation. In addition, a complete set of boundary condition measurements will be taken to allow full fire predictions for validation of the entire fire model. The experiments will be performed with a lightly-sooting liquid hydrocarbon fuel fire in the fully turbulent scale range (2 m diameter).

  4. Process-oriented tests for validation of baroclinic shallow water models: The lock-exchange problem

    Science.gov (United States)

    Kolar, R. L.; Kibbey, T. C. G.; Szpilka, C. M.; Dresback, K. M.; Tromble, E. M.; Toohey, I. P.; Hoggan, J. L.; Atkinson, J. H.

    A first step often taken to validate prognostic baroclinic codes is a series of process-oriented tests, as those suggested by Haidvogel and Beckmann [Haidvogel, D., Beckmann, A., 1999. Numerical Ocean Circulation Modeling. Imperial College Press, London], among others. One of these tests is the so-called "lock-exchange" test or "dam break" problem, wherein water of different densities is separated by a vertical barrier, which is removed at time zero. Validation against these tests has primarily consisted of comparing the propagation speed of the wave front, as predicted by various theoretical and experimental results, to model output. In addition, inter-model comparisons of the lock-exchange test have been used to validate codes. Herein, we present a high resolution data set, taken from a laboratory-scale model, for direct and quantitative comparison of experimental and numerical results throughout the domain, not just the wave front. Data is captured every 0.2 s using high resolution digital photography, with salt concentration extracted by comparing pixel intensity of the dyed fluid against calibration standards. Two scenarios are discussed in this paper, symmetric and asymmetric mixing, depending on the proportion of dense/light water (17.5 ppt/0.0 ppt) in the experiment; the Boussinesq approximation applies to both. Front speeds, cast in terms of the dimensionless Froude number, show excellent agreement with literature-reported values. Data are also used to quantify the degree of mixing, as measured by the front thickness, which also provides an error band on the front speed. Finally, experimental results are used to validate baroclinic enhancements to the barotropic shallow water ADvanced CIRCulation (ADCIRC) model, including the effect of the vertical mixing scheme on simulation results. Based on salinity data, the model provides an average root-mean-square (rms) error of 3.43 ppt for the symmetric case and 3.74 ppt for the asymmetric case, most of which can

  5. Coverage of the Test of Memory Malingering, Victoria Symptom Validity Test, and Word Memory Test on the Internet: is test security threatened?

    Science.gov (United States)

    Bauer, Lyndsey; McCaffrey, Robert J

    2006-01-01

    In forensic neuropsychological settings, maintaining test security has become critically important, especially in regard to symptom validity tests (SVTs). Coaching, which can entail providing patients or litigants with information about the cognitive sequelae of head injury, or teaching them test-taking strategies to avoid detection of symptom dissimulation has been examined experimentally in many research studies. Emerging evidence supports that coaching strategies affect psychological and neuropsychological test performance to differing degrees depending on the coaching paradigm and the tests administered. The present study sought to examine Internet coverage of SVTs because it is potentially another source of coaching, or information that is readily available. Google searches were performed on the Test of Memory Malingering, the Victoria Symptom Validity Test, and the Word Memory Test. Results indicated that there is a variable amount of information available about each test that could threaten test security and validity should inappropriately interested parties find it. Steps that could be taken to improve this situation and limitations to this exploration are discussed.

  6. Performance Validity Testing in Neuropsychology: Scientific Basis and Clinical Application-A Brief Review.

    Science.gov (United States)

    Greher, Michael R; Wodushek, Thomas R

    2017-03-01

    Performance validity testing refers to neuropsychologists' methodology for determining whether neuropsychological test performances completed in the course of an evaluation are valid (ie, the results of true neurocognitive function) or invalid (ie, overly impacted by the patient's effort/engagement in testing). This determination relies upon the use of either standalone tests designed for this sole purpose, or specific scores/indicators embedded within traditional neuropsychological measures that have demonstrated this utility. In response to a greater appreciation for the critical role that performance validity issues play in neuropsychological testing and the need to measure this variable to the best of our ability, the scientific base for performance validity testing has expanded greatly over the last 20 to 30 years. As such, the majority of current day neuropsychologists in the United States use a variety of measures for the purpose of performance validity testing as part of everyday forensic and clinical practice and address this issue directly in their evaluations. The following is the first article of a 2-part series that will address the evolution of performance validity testing in the field of neuropsychology, both in terms of the science as well as the clinical application of this measurement technique. The second article of this series will review performance validity tests in terms of methods for development of these measures, and maximizing of diagnostic accuracy.

  7. TRAC-P validation test matrix. Revision 1.0

    International Nuclear Information System (INIS)

    Hughes, E.D.; Boyack, B.E.

    1997-01-01

    This document briefly describes the elements of the Nuclear Regulatory Commission's (NRC's) software quality assurance program leading to software (code) qualification and identifies a test matrix for qualifying Transient Reactor Analysis Code (TRAC)-Pressurized Water Reactor Version (-P), or TRAC-P, to the NRC's software quality assurance requirements. Code qualification is the outcome of several software life-cycle activities, specifically, (1) Requirements Definition, (2) Design, (3) Implementation, and (4) Qualification Testing. The major objective of this document is to define the TRAC-P Qualification Testing effort

  8. TRAC-P validation test matrix. Revision 1.0

    Energy Technology Data Exchange (ETDEWEB)

    Hughes, E.D.; Boyack, B.E.

    1997-09-05

    This document briefly describes the elements of the Nuclear Regulatory Commission`s (NRC`s) software quality assurance program leading to software (code) qualification and identifies a test matrix for qualifying Transient Reactor Analysis Code (TRAC)-Pressurized Water Reactor Version (-P), or TRAC-P, to the NRC`s software quality assurance requirements. Code qualification is the outcome of several software life-cycle activities, specifically, (1) Requirements Definition, (2) Design, (3) Implementation, and (4) Qualification Testing. The major objective of this document is to define the TRAC-P Qualification Testing effort.

  9. Clinical Validation of a Test for the Diagnosis of Vaginitis.

    Science.gov (United States)

    Gaydos, Charlotte A; Beqaj, Sajo; Schwebke, Jane R; Lebed, Joel; Smith, Bonnie; Davis, Thomas E; Fife, Kenneth H; Nyirjesy, Paul; Spurrell, Timothy; Furgerson, Dorothy; Coleman, Jenell; Paradis, Sonia; Cooper, Charles K

    2017-07-01

    Vaginitis may be diagnosed as bacterial vaginosis, vulvovaginal candidiasis, trichomoniasis, or coinfection. A new molecular test assays the vaginal microbiome and organisms that cause three common infections. The objective of the trial was to evaluate the clinical accuracy of the investigational test for vaginal swabs collected by patients (self) or clinicians. The primary and secondary outcomes were to compare the investigational test with reference methods for the three most common causes of vaginitis and compare clinician-collected with self-collected swabs. We conducted a cross-sectional study in which women with symptoms of vaginitis were recruited at ten clinical centers and consented to the investigation between May and September 2015. The woman collected a vaginal swab, sheathed, and then handed it to the clinician. These swabs were to evaluate how self-collected swabs compared with clinician-collected swabs. The clinician collected an investigational test swab and reference test swabs. From 1,740 symptomatic patients, clinician-collected and self-collected vaginal swabs were evaluated by the molecular test and six tests. The reference methods for bacterial vaginosis were Nugent's score and Amsel's criteria for intermediate Nugent results. The reference methods for Candida infection were isolation of any potential Candida microorganisms from inoculation of two culture media: chromogenic and Sabouraud agar and sequencing. The reference methods for trichomoniasis were wet mount and culture. For clinician-collected swabs, by reference methods, bacterial vaginosis was diagnosed in 56.5%, vaginal candidiasis in 32.8%, trichomoniasis in 8%, and none of the three infections in 24% with a coinfection rate of 20%. The investigational test sensitivity was 90.5% (95% confidence interval [CI] 88.3-92.2%) and specificity was 85.8% (95% CI 83.0-88.3%) for bacterial vaginosis. The investigational test sensitivity was 90.9% (95% CI 88.1-93.1%) and specificity was 94

  10. Full-Scale Structural and NDI Validation Tests of Bonded Composite Doublers for Commercial Aircraft Applications

    Energy Technology Data Exchange (ETDEWEB)

    Roach, D.; Walkington, P.

    1999-02-01

    Composite doublers, or repair patches, provide an innovative repair technique which can enhance the way aircraft are maintained. Instead of riveting multiple steel or aluminum plates to facilitate an aircraft repair, it is possible to bond a single Boron-Epoxy composite doubler to the damaged structure. Most of the concerns surrounding composite doubler technology pertain to long-term survivability, especially in the presence of non-optimum installations, and the validation of appropriate inspection procedures. This report focuses on a series of full-scale structural and nondestructive inspection (NDI) tests that were conducted to investigate the performance of Boron-Epoxy composite doublers. Full-scale tests were conducted on fuselage panels cut from retired aircraft. These full-scale tests studied stress reductions, crack mitigation, and load transfer capabilities of composite doublers using simulated flight conditions of cabin pressure and axial stress. Also, structures which modeled key aspects of aircraft structure repairs were subjected to extreme tension, shear and bending loads to examine the composite laminate's resistance to disbond and delamination flaws. Several of the structures were loaded to failure in order to determine doubler design margins. Nondestructive inspections were conducted throughout the test series in order to validate appropriate techniques on actual aircraft structure. The test results showed that a properly designed and installed composite doubler is able to enhance fatigue life, transfer load away from damaged structure, and avoid the introduction of new stress risers (i.e. eliminate global reduction in the fatigue life of the structure). Comparisons with test data obtained prior to the doubler installation revealed that stresses in the parent material can be reduced 30%--60% through the use of the composite doubler. Tests to failure demonstrated that the bondline is able to transfer plastic strains into the doubler and that

  11. Diagnostic validation of selected serological tests for detecting scrub typhus.

    Science.gov (United States)

    Koraluru, Munegowda; Bairy, Indira; Varma, Muralidhar; Vidyasagar, Sudha

    2015-07-01

    Clinical diagnosis of scrub typhus is often difficult because the symptoms are very similar to those of other febrile illness such as dengue, leptospirosis, malaria and other viral hemorrhagic fevers. Though better diagnostic tests are available for rickettsial diseases and scrub typhus elsewhere, the Weil-Felix test is still commonly used in India, mainly because microimmunofluorescence assays (M-IFA) were not available in India till recently and relevant staff had insufficient training. The present study was performed to investigate the performance of M-IFA, IgM ELISA, and Weil-Felix test on 546 non-repeated serum samples from subjects suspected of having scrub typhus. One hundred and forty-three of these 546 samples were positive by M-IFA; these cases were also confirmed clinically to have scrub typhus based on their dramatic responses to doxycycline therapy. IgM ELISA was positive in 122 of the 143 M-IFA positive cases and the Weil-Felix test in 96. Though the Weil-Felix test is a heterophile agglutination test, it was found in this study to have good specificity but far too little sensitivity to use as a routine diagnostic test. IgM ELISA can be a good substitute for M-IFA. Incorporation of multiple prototype antigens on M-IFA slides is likely one of the reasons for its superior performance. As newer and better diagnostic assays become available for scrub typhus diagnosis in developed countries, it will be imperative to also use such tests in other endemic countries to prevent over- or under-diagnosis of scrub typhus. © 2015 The Societies and Wiley Publishing Asia Pty Ltd.

  12. Towards a foundation for holistic power system validation and testing

    DEFF Research Database (Denmark)

    Blank, M.; Lehnhoff, S.; Heussen, Kai

    2016-01-01

    , and intelligent solutions for system operation have transformed the power system into a smart grid. To support the development process of smart grid solutions on system level they have to be tested in a holistic manner covering the multi-domain aspect of a such complex systems. This paper introduces the concept...... of holistic power system testing and discuss first steps towards a corresponding methodology that is being developed in the European ERIGrid research infrastructure project....

  13. Validation of Alternative In Vitro Methods to Animal Testing: Concepts, Challenges, Processes and Tools.

    Science.gov (United States)

    Griesinger, Claudius; Desprez, Bertrand; Coecke, Sandra; Casey, Warren; Zuang, Valérie

    This chapter explores the concepts, processes, tools and challenges relating to the validation of alternative methods for toxicity and safety testing. In general terms, validation is the process of assessing the appropriateness and usefulness of a tool for its intended purpose. Validation is routinely used in various contexts in science, technology, the manufacturing and services sectors. It serves to assess the fitness-for-purpose of devices, systems, software up to entire methodologies. In the area of toxicity testing, validation plays an indispensable role: "alternative approaches" are increasingly replacing animal models as predictive tools and it needs to be demonstrated that these novel methods are fit for purpose. Alternative approaches include in vitro test methods, non-testing approaches such as predictive computer models up to entire testing and assessment strategies composed of method suites, data sources and decision-aiding tools. Data generated with alternative approaches are ultimately used for decision-making on public health and the protection of the environment. It is therefore essential that the underlying methods and methodologies are thoroughly characterised, assessed and transparently documented through validation studies involving impartial actors. Importantly, validation serves as a filter to ensure that only test methods able to produce data that help to address legislative requirements (e.g. EU's REACH legislation) are accepted as official testing tools and, owing to the globalisation of markets, recognised on international level (e.g. through inclusion in OECD test guidelines). Since validation creates a credible and transparent evidence base on test methods, it provides a quality stamp, supporting companies developing and marketing alternative methods and creating considerable business opportunities. Validation of alternative methods is conducted through scientific studies assessing two key hypotheses, reliability and relevance of the

  14. Validation of the Cross-Cultural Alcoholism Screening Test (CCAST).

    Science.gov (United States)

    Gorenc, K D; Peredo, S; Pacurucu, S; Llanos, R; Vincente, B; López, R; Abreu, L F; Paez, E

    1999-01-01

    When screening instruments that are used in the assessment and diagnosis of alcoholism of individuals from different ethnicities, some cultural variables based on norms and societal acceptance of drinking behavior can play an important role in determining the outcome. The accepted diagnostic criteria of current market testing are based on Western standards. In this study, the Munich Alcoholism Test (31 items) was the base instrument applied to subjects from several Hispanic-American countries (Bolivia, Chile, Ecuador, Mexico, and Peru). After the sample was submitted to several statistical procedures, these 31 items were reduced to a culture-free, 31-item test named the Cross-Cultural Alcohol Screening Test (CCAST). The results of this Hispanic-American sample (n = 2,107) empirically demonstrated that CCAST measures alcoholism with an adequate degree of accuracy when compared to other available cross-cultural tests. CCAST is useful in the diagnosis of alcoholism in Spanish-speaking immigrants living in countries where English is spoken. CCAST can be used in general hospitals, psychiatric wards, emergency services and police stations. The test can be useful for other professionals, such as psychological consultants, researchers, and those conducting expertise appraisal.

  15. Development and validation of an immunoperoxidase antigen detection test for improved diagnosis of rabies in Indonesia.

    Science.gov (United States)

    Rahmadane, Ibnu; Certoma, Andrea F; Peck, Grantley R; Fitria, Yul; Payne, Jean; Colling, Axel; Shiell, Brian J; Beddome, Gary; Wilson, Susanne; Yu, Meng; Morrissy, Chris; Michalski, Wojtek P; Bingham, John; Gardner, Ian A; Allen, John D

    2017-11-01

    Rabies continues to pose a significant threat to human and animal health in regions of Indonesia. Indonesia has an extensive network of veterinary diagnostic laboratories and the 8 National laboratories are equipped to undertake diagnostic testing for rabies using the commercially-procured direct fluorescent antibody test (FAT), which is considered the reference (gold standard) test. However, many of the Indonesian Provincial diagnostic laboratories do not have a fluorescence microscope required to undertake the FAT. Instead, certain Provincial laboratories continue to screen samples using a chemical stain-based test (Seller's stain test, SST). This test has low diagnostic sensitivity, with negative SST-tested samples being forwarded to the nearest National laboratory resulting in significant delays for completion of testing and considerable additional costs. This study sought to develop a cost-effective and diagnostically-accurate immunoperoxidase antigen detection (RIAD) test for rabies that can be readily and quickly performed by the resource-constrained Provincial laboratories. This would reduce the burden on the National laboratories and allow more rapid diagnoses and implementation of post-exposure prophylaxis. The RIAD test was evaluated using brain smears fixed with acetone or formalin and its performance was validated by comparison with established rabies diagnostic tests used in Indonesia, including the SST and FAT. A proficiency testing panel was distributed between Provincial laboratories to assess the reproducibility of the test. The performance of the RIAD test was improved by using acetone fixation of brain smears rather than formalin fixation such that it was of equivalent accuracy to that of the World Organisation for Animal Health (OIE)-recommended FAT, with both tests returning median diagnostic sensitivity and specificity values of 0.989 and 0.993, respectively. The RIAD test and FAT had higher diagnostic sensitivity than the SST (median = 0

  16. Development and validation of an immunoperoxidase antigen detection test for improved diagnosis of rabies in Indonesia.

    Directory of Open Access Journals (Sweden)

    Ibnu Rahmadane

    2017-11-01

    Full Text Available Rabies continues to pose a significant threat to human and animal health in regions of Indonesia. Indonesia has an extensive network of veterinary diagnostic laboratories and the 8 National laboratories are equipped to undertake diagnostic testing for rabies using the commercially-procured direct fluorescent antibody test (FAT, which is considered the reference (gold standard test. However, many of the Indonesian Provincial diagnostic laboratories do not have a fluorescence microscope required to undertake the FAT. Instead, certain Provincial laboratories continue to screen samples using a chemical stain-based test (Seller's stain test, SST. This test has low diagnostic sensitivity, with negative SST-tested samples being forwarded to the nearest National laboratory resulting in significant delays for completion of testing and considerable additional costs. This study sought to develop a cost-effective and diagnostically-accurate immunoperoxidase antigen detection (RIAD test for rabies that can be readily and quickly performed by the resource-constrained Provincial laboratories. This would reduce the burden on the National laboratories and allow more rapid diagnoses and implementation of post-exposure prophylaxis. The RIAD test was evaluated using brain smears fixed with acetone or formalin and its performance was validated by comparison with established rabies diagnostic tests used in Indonesia, including the SST and FAT. A proficiency testing panel was distributed between Provincial laboratories to assess the reproducibility of the test. The performance of the RIAD test was improved by using acetone fixation of brain smears rather than formalin fixation such that it was of equivalent accuracy to that of the World Organisation for Animal Health (OIE-recommended FAT, with both tests returning median diagnostic sensitivity and specificity values of 0.989 and 0.993, respectively. The RIAD test and FAT had higher diagnostic sensitivity than the SST

  17. 'Mechanical restraint-confounders, risk, alliance score': testing the clinical validity of a new risk assessment instrument.

    Science.gov (United States)

    Deichmann Nielsen, Lea; Bech, Per; Hounsgaard, Lise; Alkier Gildberg, Frederik

    2017-08-01

    Unstructured risk assessment, as well as confounders (underlying reasons for the patient's risk behaviour and alliance), risk behaviour, and parameters of alliance, have been identified as factors that prolong the duration of mechanical restraint among forensic mental health inpatients. To clinically validate a new, structured short-term risk assessment instrument called the Mechanical Restraint-Confounders, Risk, Alliance Score (MR-CRAS), with the intended purpose of supporting the clinicians' observation and assessment of the patient's readiness to be released from mechanical restraint. The content and layout of MR-CRAS and its user manual were evaluated using face validation by forensic mental health clinicians, content validation by an expert panel, and pilot testing within two, closed forensic mental health inpatient units. The three sub-scales (Confounders, Risk, and a parameter of Alliance) showed excellent content validity. The clinical validations also showed that MR-CRAS was perceived and experienced as a comprehensible, relevant, comprehensive, and useable risk assessment instrument. MR-CRAS contains 18 clinically valid items, and the instrument can be used to support the clinical decision-making regarding the possibility of releasing the patient from mechanical restraint. The present three studies have clinically validated a short MR-CRAS scale that is currently being psychometrically tested in a larger study.

  18. Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests

    NARCIS (Netherlands)

    Rijken, N.H.M.; Engelen, B.G.M. van; Weerdesteyn, V.G.M.; Geurts, A.C.H.

    2015-01-01

    OBJECTIVE: To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). DESIGN: Case-control study. SETTING: University medical center. PARTICIPANTS: Patients with various severity levels

  19. Official Position of the American Academy of Clinical Neuropsychology Social Security Administration Policy on Validity Testing: Guidance and Recommendations for Change.

    Science.gov (United States)

    Chafetz, M D; Williams, M A; Ben-Porath, Y S; Bianchini, K J; Boone, K B; Kirkwood, M W; Larrabee, G J; Ord, J S

    2015-01-01

    The milestone publication by Slick, Sherman, and Iverson (1999) of criteria for determining malingered neurocognitive dysfunction led to extensive research on validity testing. Position statements by the National Academy of Neuropsychology and the American Academy of Clinical Neuropsychology (AACN) recommended routine validity testing in neuropsychological evaluations. Despite this widespread scientific and professional support, the Social Security Administration (SSA) continued to discourage validity testing, a stance that led to a congressional initiative for SSA to reevaluate their position. In response, SSA commissioned the Institute of Medicine (IOM) to evaluate the science concerning the validation of psychological testing. The IOM concluded that validity assessment was necessary in psychological and neuropsychological examinations (IOM, 2015 ). The AACN sought to provide independent expert guidance and recommendations concerning the use of validity testing in disability determinations. A panel of contributors to the science of validity testing and its application to the disability process was charged with describing why the disability process for SSA needs improvement, and indicating the necessity for validity testing in disability exams. This work showed how the determination of malingering is a probability proposition, described how different types of validity tests are appropriate, provided evidence concerning non-credible findings in children and low-functioning individuals, and discussed the appropriate evaluation of pain disorders typically seen outside of mental consultations. A scientific plan for validity assessment that additionally protects test security is needed in disability determinations and in research on classification accuracy of disability decisions.

  20. USFDA-GUIDELINE BASED VALIDATION OF TESTING METHOD FOR RIFAMPICIN IN INDONESIAN SERUM SPECIMEN

    Directory of Open Access Journals (Sweden)

    Tri Joko Raharjo

    2010-06-01

    Full Text Available Regarding a new regulation from Indonesia FDA (Badan POM-RI, all new non patent drugs should show bioequivalence with the originator drug prior to registration. Bioequivalence testing (BE-testing has to be performed to the people that represented of population to which the drug to be administrated. BE testing need a valid bio-analytical method for certain drug target and group of population. This research report specific validation of bio-analysis of Rifampicin in Indonesian serum specimen in order to be used for BE testing. The extraction was performed using acetonitrile while the chromatographic separation was accomplished on a RP 18 column (250 × 4.6 mm i.d., 5 µm, with a mobile phase composed of KH2PO4 10 mM-Acetonitrile (40:60, v/v and UV detection was set at 333 nm. The method shown specificity compared to blank serum specimen with retention time of rifampicin at 2.1 min. Lower limit of quantification (LLOQ was 0.06 µg/mL with dynamic range up to 20 µg/mL (R>0.990. Precision of the method was very good with coefficient of variance (CV 0.58; 7.40 and 5.56% for concentration at 0.06, 5, 15 µg/mL, respectively. Accuracies of the method were 3.22; 1.94; 1.90% for concentration 0.06, 5 and 15 µg/mL respectively. The average recoveries were 97.82, 95.50 and 97.31% for concentration of rifampicin 1, 5 and 5 µg/mL, respectively. The method was also shown reliable result on stability test on freezing-thawing, short-term and long-term stability as well as post preparation stability. Validation result shown that the method was ready to be used for Rifampicin BE testing with Indonesian subject.   Keywords: Rifampicin, Validation, USFDA-Guideline

  1. A New Tool for Nutrition App Quality Evaluation (AQEL): Development, Validation, and Reliability Testing.

    Science.gov (United States)

    DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M

    2017-10-27

    The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no

  2. Brief implicit association test: Validity and utility in prediction of voting behavior

    Directory of Open Access Journals (Sweden)

    Pavlović Maša D.

    2013-01-01

    Full Text Available We employed the Brief Implicit Association Test (a recently developed short version of IAT to measure implicit political attitudes toward four political parties running for Serbian parliament. To test its criterion validity, we measured voting intention and actual voting behavior. In addition, we introduced political involvement as a potential moderator of the BIAT’s predictive and incremental validity. The BIAT demonstrated good internal and predictive validity, but lacked incremental validity over self-report measures. Predictive power of the BIAT was moderated by political involvement - the BIAT scores were stronger predictors of voting intention and behavior among voters highly involved in politics. [Projekat Ministarstva nauke Republike Srbije, br. 179018

  3. Validity of purchasing power parity for selected Latin American countries: Linear and non-linear unit root tests

    Directory of Open Access Journals (Sweden)

    Claudio Roberto Fóffano Vasconcelos

    2016-01-01

    Full Text Available The aim of this study is to examine empirically the validity of PPP in the context of unit root tests based on linear and non-linear models of the real effective exchange rate of Argentina, Brazil, Chile, Colombia, Mexico, Peru and Venezuela. For this purpose, we apply the Harvey et al. (2008 linearity test and the non-linear unit root test (Kruse, 2011. The results show that the series with linear characteristics are Argentina, Brazil, Chile, Colombia and Peru and those with non-linear characteristics are Mexico and Venezuela. The linear unit root tests indicate that the real effective exchange rate is stationary for Chile and Peru, and the non-linear unit root tests evidence that Mexico is stationary. In the period analyzed, the results show support for the validity of PPP in only three of the seven countries.

  4. Mechanical tests for validation of seismic isolation elastomer constitutive models

    International Nuclear Information System (INIS)

    Kulak, R.F.; Hughes, T.H.

    1992-01-01

    High damping laminated elastomeric bearings are becoming the preferred device for seismic isolation of large buildings and structures, such as nuclear power plants. The key component of these bearings is a filled natural rubber elastomer. This material exhibits nonlinear behavior within the normal design range. The material damping cannot be classified as either viscous or hysteritic, but it seems to fall somewhere in between. This paper describes a series of tests that can be used to characterize the mechanical response of these elastomers. The tests are designed to determine the behavior of the elastomer in the time scale of the earthquake, which is typically from 30 to 60 seconds. The test results provide data for use in determining the material parameters associated with nonlinear constitutive models. 4 refs

  5. Development, Validation, and Deployment of an Occupational Test of Color Vision for Air Traffic Control Specialists

    Science.gov (United States)

    2011-05-01

    direct sampling of form and content of critical display data. Evidence of construct validity is provided by correlation with Colour Assessment and...testing resulted in development of tests such as the Colour Assessment and Diagnosis Test (CAD; Rodriguez -Carmona, Harlow, Walker, & Barbur, 2005...Great Lakes Regional PEPC who failed initial screening on the Dvorine PIP test. Volunteers were recruited through advertisements in local

  6. Clinical Functional Capacity Testing in Patients With Facioscapulohumeral Muscular Dystrophy: Construct Validity and Interrater Reliability of Antigravity Tests.

    Science.gov (United States)

    Rijken, Noortje H; van Engelen, Baziel G; Weerdesteyn, Vivian; Geurts, Alexander C

    2015-12-01

    To evaluate the construct validity and interrater reliability of 4 simple antigravity tests in a small group of patients with facioscapulohumeral muscular dystrophy (FSHD). Case-control study. University medical center. Patients with various severity levels of FSHD (n=9) and healthy control subjects (n=10) were included (N=19). Not applicable. A 4-point ordinal scale was designed to grade performance on the following 4 antigravity tests: sit to stance, stance to sit, step up, and step down. In addition, the 6-minute walk test, 10-m walking test, Berg Balance Scale, and timed Up and Go test were administered as conventional tests. Construct validity was determined by linear regression analysis using the Clinical Severity Score (CSS) as the dependent variable. Interrater agreement was tested using a κ analysis. Patients with FSHD performed worse on all 4 antigravity tests compared with the controls. Stronger correlations were found within than between test categories (antigravity vs conventional). The antigravity tests revealed the highest explained variance with regard to the CSS (R(2)=.86, P=.014). Interrater agreement was generally good. The results of this exploratory study support the construct validity and interrater reliability of the proposed antigravity tests for the assessment of functional capacity in patients with FSHD taking into account the use of compensatory strategies. Future research should further validate these results in a larger sample of patients with FSHD. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  7. Testing the Validity of a Cognitive Behavioral Model for Gambling Behavior.

    Science.gov (United States)

    Raylu, Namrata; Oei, Tian Po S; Loo, Jasmine M Y; Tsai, Jung-Shun

    2016-06-01

    Currently, cognitive behavioral therapies appear to be one of the most studied treatments for gambling problems and studies show it is effective in treating gambling problems. However, cognitive behavior models have not been widely tested using statistical means. Thus, the aim of this study was to test the validity of the pathways postulated in the cognitive behavioral theory of gambling behavior using structural equation modeling (AMOS 20). Several questionnaires assessing a range of gambling specific variables (e.g., gambling urges, cognitions and behaviors) and gambling correlates (e.g., psychological states, and coping styles) were distributed to 969 participants from the community. Results showed that negative psychological states (i.e., depression, anxiety and stress) only directly predicted gambling behavior, whereas gambling urges predicted gambling behavior directly as well as indirectly via gambling cognitions. Avoidance coping predicted gambling behavior only indirectly via gambling cognitions. Negative psychological states were significantly related to gambling cognitions as well as avoidance coping. In addition, significant gender differences were also found. The results provided confirmation for the validity of the pathways postulated in the cognitive behavioral theory of gambling behavior. It also highlighted the importance of gender differences in conceptualizing gambling behavior.

  8. Converting Hangar High Expansion Foam Systems to Prevent Cockpit Damage: Full-Scale Validation Tests

    Science.gov (United States)

    2017-09-01

    AFCEC-CO-TY-TR-2018-0001 CONVERTING HANGAR HIGH EXPANSION FOAM SYSTEMS TO PREVENT COCKPIT DAMAGE: FULL-SCALE VALIDATION TESTS Gerard G...manufacturer, or otherwise does not constitute or imply its endorsement, recommendation , or approval by the United States Air Force. The views and...09-2017 Final Test Report May 2017 Converting Hangar High Expansion Foam Systems to Prevent Cockpit Damage: Full-Scale Validation Tests N00173-15-D

  9. AMSTERDAM-NIJMEGEN EVERYDAY LANGUAGE TEST - CONSTRUCTION, RELIABILITY AND VALIDITY

    NARCIS (Netherlands)

    BLOMERT, L; KEAN, ML; KOSTER, C; SCHOKKER, J

    1994-01-01

    The Amsterdam-Nijmegen Everyday Language Test (ANELT) is designed to measure, first, the level of verbal communicative abilities of aphasic patients and, second, changes in these abilities over time. The level of communicative effectiveness is determined by the adequacy of bringing a message across.

  10. Development and validation of dissolution test for Metoprolol ...

    African Journals Online (AJOL)

    The dissolution method which uses USP apparatus I (Basket) with rotating at 100 rpm, 900 ml of different dissolution medium, ultra violet spectroscopy for quantification was demonstrated to be robust, discriminating and transferable. Dissolution tests conditions were selected after it was demonstrated that the Metoprolol ...

  11. The Development and Validation of the Vocalic Sensitivity Test.

    Science.gov (United States)

    Villaume, William A.; Brown, Mary Helen

    1999-01-01

    Notes that presbycusis, hearing loss associated with aging, may be marked by a second dimension of hearing loss, a loss in vocalic sensitivity. Reports on the development of the Vocalic Sensitivity Test, which controls for the verbal elements in speech while also allowing for the vocalics to exercise their normal metacommunicative function of…

  12. Use of run statistics to validate tensile tests

    International Nuclear Information System (INIS)

    Eatherly, W.P.

    1981-01-01

    In tensile testing of irradiated graphites, it is difficult to assure alignment of sample and train for tensile measurements. By recording location of fractures, run (sequential) statistics can readily detect lack of randomness. The technique is based on partitioning binomial distributions

  13. Defense Waste Processing Facility Canister Closure Weld Current Validation Testing

    Energy Technology Data Exchange (ETDEWEB)

    Korinko, P. S. [Savannah River Site (SRS), Aiken, SC (United States). Savannah River National Lab. (SRNL); Maxwell, D. N. [Savannah River Site (SRS), Aiken, SC (United States). Savannah River National Lab. (SRNL)

    2018-01-29

    Two closure welds on filled Defense Waste Processing Facility (DWPF) canisters failed to be within the acceptance criteria in the DWPF operating procedure SW4-15.80-2.3 (1). In one case, the weld heat setting was inadvertently provided to the canister at the value used for test welds (i.e., 72%) and this oversight produced a weld at a current of nominally 210 kA compared to the operating procedure range (i.e., 82%) of 240 kA to 263 kA. The second weld appeared to experience an instrumentation and data acquisition upset. The current for this weld was reported as 191 kA. Review of the data from the Data Acquisition System (DAS) indicated that three of the four current legs were reading the expected values, approximately 62 kA each, and the fourth leg read zero current. Since there is no feasible way by further examination of the process data to ascertain if this weld was actually welded at either the target current or the lower current, a test plan was executed to provide assurance that these Nonconforming Welds (NCWs) meet the requirements for strength and leak tightness. Acceptance of the welds is based on evaluation of Test Nozzle Welds (TNW) made specifically for comparison. The TNW were nondestructively and destructively evaluated for plug height, heat tint, ultrasonic testing (UT) for bond length and ultrasonic volumetric examination for weld defects, burst pressure, fractography, and metallography. The testing was conducted in agreement with a Task Technical and Quality Assurance Plan (TTQAP) (2) and applicable procedures.

  14. Cultural Adaptation of the Portuguese Version of the "Sniffin' Sticks" Smell Test: Reliability, Validity, and Normative Data.

    Science.gov (United States)

    Ribeiro, João Carlos; Simões, João; Silva, Filipe; Silva, Eduardo D; Hummel, Cornelia; Hummel, Thomas; Paiva, António

    2016-01-01

    The cross-cultural adaptation and validation of the Sniffin`Sticks test for the Portuguese population is described. Over 270 people participated in four experiments. In Experiment 1, 67 participants rated the familiarity of presented odors and seven descriptors of the original test were adapted to a Portuguese context. In Experiment 2, the Portuguese version of Sniffin`Sticks test was administered to 203 healthy participants. Older age, male gender and active smoking status were confirmed as confounding factors. The third experiment showed the validity of the Portuguese version of Sniffin`Sticks test in discriminating healthy controls from patients with olfactory dysfunction. In Experiment 4, the test-retest reliability for both the composite score (r71 = 0.86) and the identification test (r71 = 0.62) was established (pPortuguese version of Sniffin`Sticks test is provided, showing good validity and reliability and effectively distinguishing patients from healthy controls with high sensitivity and specificity. The Portuguese version of Sniffin`Sticks test identification test is a clinically suitable screening tool in routine outpatient Portuguese settings.

  15. Vietnamese validation of the short version of Internet Addiction Test

    OpenAIRE

    Tran, Bach Xuan; Mai, Hue Thi; Nguyen, Long Hoang; Nguyen, Cuong Tat; Latkin, Carl A.; Zhang, Melvyn W.B.; Ho, Roger C.M.

    2017-01-01

    Background and aims: The main goal of the present study was to examine the psychometric properties of a Vietnamese version of the short-version of Internet Addiction Test (s-IAT) and to assess the relationship between s-IAT scores and demographics, health related qualify of life and perceived stress scores in young Vietnamese. Methods: The Vietnamese version of s-IAT was administered to a sample of 589 participants. Exploratory factor and reliability analyses were performed. Regression analys...

  16. Diagnostic validation of three test methods for detection of cyprinid herpesvirus 3 (CyHV-3).

    Science.gov (United States)

    Clouthier, Sharon C; McClure, Carol; Schroeder, Tamara; Desai, Megan; Hawley, Laura; Khatkar, Sunita; Lindsay, Melissa; Lowe, Geoff; Richard, Jon; Anderson, Eric D

    2017-03-06

    Cyprinid herpesvirus 3 (CyHV-3) is the aetiological agent of koi herpesvirus disease in koi and common carp. The disease is notifiable to the World Organisation for Animal Health. Three tests-quantitative polymerase chain reaction (qPCR), conventional PCR (cPCR) and virus isolation by cell culture (VI)-were validated to assess their fitness as diagnostic tools for detection of CyHV-3. Test performance metrics of diagnostic accuracy were sensitivity (DSe) and specificity (DSp). Repeatability and reproducibility were measured to assess diagnostic precision. Estimates of test accuracy, in the absence of a gold standard reference test, were generated using latent class models. Test samples originated from wild common carp naturally exposed to CyHV-3 or domesticated koi either virus free or experimentally infected with the virus. Three laboratories in Canada participated in the precision study. Moderate to high repeatability (81 to 99%) and reproducibility (72 to 97%) were observed for the qPCR and cPCR tests. The lack of agreement observed between some of the PCR test pair results was attributed to cross-contamination of samples with CyHV-3 nucleic acid. Accuracy estimates for the PCR tests were 99% for DSe and 93% for DSp. Poor precision was observed for the VI test (4 to 95%). Accuracy estimates for VI/qPCR were 90% for DSe and 88% for DSp. Collectively, the results show that the CyHV-3 qPCR test is a suitable tool for surveillance, presumptive diagnosis and certification of individuals or populations as CyHV-3 free.

  17. Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

    Science.gov (United States)

    Jongen, Stefan; Perrier, Joy; Vuurman, Eric F; Ramaekers, Johannes G; Vermeeren, Annemiek

    2015-01-01

    To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation. Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am) and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am). On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP) of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT). Large effects sizes were also found in the Divided Attention Test (DAT), the Attention Network Test (ANT), and the test for Useful Field of View (UFOV) at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV. From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.

  18. Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

    Directory of Open Access Journals (Sweden)

    Stefan Jongen

    Full Text Available To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation.Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am.On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT. Large effects sizes were also found in the Divided Attention Test (DAT, the Attention Network Test (ANT, and the test for Useful Field of View (UFOV at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV.From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.

  19. The ad-libitum alcohol ?taste test?: secondary analyses of potential confounds and construct validity

    OpenAIRE

    Jones, Andrew; Button, Emily; Rose, Abigail K.; Robinson, Eric; Christiansen, Paul; Di Lemma, Lisa; Field, Matt

    2015-01-01

    Rationale Motivation to drink alcohol can be measured in the laboratory using an ad-libitum ?taste test?, in which participants rate the taste of alcoholic drinks whilst their intake is covertly monitored. Little is known about the construct validity of this paradigm. Objective The objective of this study was to investigate variables that may compromise the validity of this paradigm and its construct validity. Methods We re-analysed data from 12 studies from our laboratory that incorporated a...

  20. Experimental Testing Procedures and Dynamic Model Validation for Vanadium Redox Flow Battery Storage System

    DEFF Research Database (Denmark)

    Baccino, Francesco; Marinelli, Mattia; Nørgård, Per Bromand

    2013-01-01

    The paper aims at characterizing the electrochemical and thermal parameters of a 15 kW/320 kWh vanadium redox flow battery (VRB) installed in the SYSLAB test facility of the DTU Risø Campus and experimentally validating the proposed dynamic model realized in Matlab-Simulink. The adopted testing...... efficiency of the battery system. The test procedure has general validity and could also be used for other storage technologies. The storage model proposed and described is suitable for electrical studies and can represent a general model in terms of validity. Finally, the model simulation outputs...

  1. Ares I-X Flight Test Validation of Control Design Tools in the Frequency-Domain

    Science.gov (United States)

    Johnson, Matthew; Hannan, Mike; Brandon, Jay; Derry, Stephen

    2011-01-01

    A major motivation of the Ares I-X flight test program was to Design for Data, in order to maximize the usefulness of the data recorded in support of Ares I modeling and validation of design and analysis tools. The Design for Data effort was intended to enable good post-flight characterizations of the flight control system, the vehicle structural dynamics, and also the aerodynamic characteristics of the vehicle. To extract the necessary data from the system during flight, a set of small predetermined Programmed Test Inputs (PTIs) was injected directly into the TVC signal. These PTIs were designed to excite the necessary vehicle dynamics while exhibiting a minimal impact on loads. The method is similar to common approaches in aircraft flight test programs, but with unique launch vehicle challenges due to rapidly changing states, short duration of flight, a tight flight envelope, and an inability to repeat any test. This paper documents the validation effort of the stability analysis tools to the flight data which was performed by comparing the post-flight calculated frequency response of the vehicle to the frequency response calculated by the stability analysis tools used to design and analyze the preflight models during the control design effort. The comparison between flight day frequency response and stability tool analysis for flight of the simulated vehicle shows good agreement and provides a high level of confidence in the stability analysis tools for use in any future program. This is true for both a nominal model as well as for dispersed analysis, which shows that the flight day frequency response is enveloped by the vehicle s preflight uncertainty models.

  2. Solar Sail Models and Test Measurements Correspondence for Validation Requirements Definition

    Science.gov (United States)

    Ewing, Anthony; Adams, Charles

    2004-01-01

    Solar sails are being developed as a mission-enabling technology in support of future NASA science missions. Current efforts have advanced solar sail technology sufficient to justify a flight validation program. A primary objective of this activity is to test and validate solar sail models that are currently under development so that they may be used with confidence in future science mission development (e.g., scalable to larger sails). Both system and model validation requirements must be defined early in the program to guide design cycles and to ensure that relevant and sufficient test data will be obtained to conduct model validation to the level required. A process of model identification, model input/output documentation, model sensitivity analyses, and test measurement correspondence is required so that decisions can be made to satisfy validation requirements within program constraints.

  3. Content validity and reliability of test of gross motor development in Chilean children

    Directory of Open Access Journals (Sweden)

    Marcelo Cano-Cappellacci

    2015-01-01

    Full Text Available ABSTRACT OBJECTIVE To validate a Spanish version of the Test of Gross Motor Development (TGMD-2 for the Chilean population. METHODS Descriptive, transversal, non-experimental validity and reliability study. Four translators, three experts and 92 Chilean children, from five to 10 years, students from a primary school in Santiago, Chile, have participated. The Committee of Experts has carried out translation, back-translation and revision processes to determine the translinguistic equivalence and content validity of the test, using the content validity index in 2013. In addition, a pilot implementation was achieved to determine test reliability in Spanish, by using the intraclass correlation coefficient and Bland-Altman method. We evaluated whether the results presented significant differences by replacing the bat with a racket, using T-test. RESULTS We obtained a content validity index higher than 0.80 for language clarity and relevance of the TGMD-2 for children. There were significant differences in the object control subtest when comparing the results with bat and racket. The intraclass correlation coefficient for reliability inter-rater, intra-rater and test-retest reliability was greater than 0.80 in all cases. CONCLUSIONS The TGMD-2 has appropriate content validity to be applied in the Chilean population. The reliability of this test is within the appropriate parameters and its use could be recommended in this population after the establishment of normative data, setting a further precedent for the validation in other Latin American countries.

  4. VALIDATION OF THE ASSR TEST THROUGH COMPLEMENTARY AUDIOLOGYICAL METHODS

    Directory of Open Access Journals (Sweden)

    C. Mârtu

    2016-04-01

    Full Text Available Introduction: Auditory Steady State Response (ASSR is an objective method for determining the auditive threshold, applicable and necessary especially in children. The test is extremely important for recommending cochlear implant in children. The aim of the study was to compare pure tone audiometry responses and auditory steady-state thresholds. Materials and method: The study was performed on a group including both patients with normal hearing and with hearing loss. The main inclusion criteria accepted only patients with normal otomicroscopic aspect, normal tympanogram, capable to respond to pure tone audiometry, and with ear conduction thresholds between 0 and 80 dB NHL. The patients with suppurative otic processes or ear malformations were excluded. The research protocol was followed, the tests being performed in soundproofed rooms, starting with pure tone audiometry followed, after a pause, by ASSR determinations at frequencies of 0.5, 1.2 and 4 KHz. The audiological instruments were provided by a single manufacturer. ASSR was recorded at least two times for both borderline intensities, namely the one defining the auditory threshold and the first no-response intensity. The recorded responses were stored in a database and further processed in Excel. Discussion: The differences observed between pure tone audiometry and ASSR thresholds are important at 500 Hz and insignificant at the other frequencies. When approaching the PTA-ASSR relation, whatever the main characteristic between the PTA and ASSR thresholds in one ear, the profile of the lines gap maintains the same shape on the opposite ear. Conclusions: ASSR is a confident objective test, maintaining attention to low frequencies, where some differences might occur.

  5. Are chiropractic tests for the lumbo-pelvic spine reliable and valid? A systematic critical literature review

    DEFF Research Database (Denmark)

    Hestbaek, L; Leboeuf-Yde, C

    2000-01-01

    OBJECTIVE: To systematically review the peer-reviewed literature about the reliability and validity of chiropractic tests used to determine the need for spinal manipulative therapy of the lumbo-pelvic spine, taking into account the quality of the studies. DATA SOURCES: The CHIROLARS database......-pelvic spine were included. DATA EXTRACTION: Data quality were assessed independently by the two reviewers, with a quality score based on predefined methodologic criteria. Results of the studies were then evaluated in relation to quality. DATA SYNTHESIS: None of the tests studied had been sufficiently...... evaluated in relation to reliability and validity. Only tests for palpation for pain had consistently acceptable results. Motion palpation of the lumbar spine might be valid but showed poor reliability, whereas motion palpation of the sacroiliac joints seemed to be slightly reliable but was not shown...

  6. Validation of an Instrument to Measure High School Students' Attitudes toward Fitness Testing

    Science.gov (United States)

    Mercier, Kevin; Silverman, Stephen

    2014-01-01

    Purpose: The purpose of this investigation was to develop an instrument that has scores that are valid and reliable for measuring students' attitudes toward fitness testing. Method: The method involved the following steps: (a) an elicitation study, (b) item development, (c) a pilot study, and (d) a validation study. The pilot study included 427…

  7. Development and testing of a cross-culturally valid instrument: food-related life style

    DEFF Research Database (Denmark)

    Brunsø, Karen; Grunert, Klaus G.

    1995-01-01

    -culturaly valid way. To this end we have developed a pool of 202 items, collected data in three countries, and have constructed scales based on cross-culturally stable factor patterns. We have then applie set of scales to a fourth country, in order to further test the cross-cultural validity of the instrument....

  8. Exploring the Reliability and Validity of the Social-Moral Awareness Test

    Science.gov (United States)

    Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth

    2012-01-01

    Background: The aim of the study was to explore the validity of the social-moral awareness test (SMAT) a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor…

  9. Validity of the Mayer-Salovey-Caruso Emotional Intelligence Test: Youth Version-Research Edition

    Science.gov (United States)

    Peters, Christine; Kranzler, John H.; Rossen, Eric

    2009-01-01

    This study examines the criterion-related validity evidence of scores on the Mayer-Salovey-Caruso Emotional Intelligence Test: Youth Version-Research Version. The authors also investigate the relationship between scores on the MSCEIT-YV and chronological age. Results provide initial support for the construct validity of the MSCEIT-YV but also…

  10. Similarity Analysis for Reactor Flow Distribution Test and Its Validation

    Energy Technology Data Exchange (ETDEWEB)

    Hong, Soon Joon; Ha, Jung Hui [Heungdeok IT Valley, Yongin (Korea, Republic of); Lee, Taehoo; Han, Ji Woong [KAERI, Daejeon (Korea, Republic of)

    2015-05-15

    The newly derived dimensionless groups are slightly different from Hetsroni's. Reynolds number, relative wall roughness, and Euler don't appear, instead, friction factor appears newly. In order to conserve friction factor Reynolds number and relative wall roughness should be conserved. Since the effect of Reynolds number in high range is small, and since the scaled model is far smaller than prototype the conservation of friction factor is easily obtained by making the model wall just smooth. It is much easier to implement the test design than Hetsroni's because the Reynolds number and relative wall roughness do not appear explicitly. In case that there is no free surface within the interested domain of the reactor, the gravity is of second importance, and in this case the pressure drops should be compensated for in order to compare them between prototype and model. The gravity head compensated pressure drop is directly same to the measured value by a differential pressure transmitter. In order to conserve the gravity effect Froude number should be conserved. In pool type SFR (Sodium Cooled Fast Reactor) there exists liquid level difference, and if the level difference is desired to be conserved, the Froude number should be conserved. Euler number, which represents pressure terms in momentum equation, should be well conserved according to Hetsroni's approach. It is not a wrong statement, but it should be noted that Euler number is NOT an independent variable BUT a dependent variable according to Hong et al. It means that if all the geometrical similarity and the dimensionless numbers are conserved, Euler number is automatically conserved. So Euler number need not be considered in case that the perfect geometrical similarity is kept. However, even in case that the geometrical similarity is not conserved, it possible to conserved the velocity field similarity by just conserve Euler number. It gives tolerance to the engineer who designs the test

  11. Familiarization, validity and smallest detectable difference of the isometric squat test in evaluating maximal strength.

    Science.gov (United States)

    Drake, David; Kennedy, Rodney; Wallace, Eric

    2018-02-06

    Isometric multi-joint tests are considered reliable and have strong relationships with 1RM performance. However, limited evidence is available for the isometric squat in terms of effects of familiarization and reliability. This study aimed to assess, the effect of familiarization, stability reliability, determine the smallest detectible difference, and the correlation of the isometric squat test with 1RM squat performance. Thirty-six strength-trained participants volunteered to take part in this study. Following three familiarization sessions, test-retest reliability was evaluated with a 48-hour window between each time point. Isometric squat peak, net and relative force were assessed. Results showed three familiarizations were required, isometric squat had a high level of stability reliability and smallest detectible difference of 11% for peak and relative force. Isometric strength at a knee angle of ninety degrees had a strong significant relationship with 1RM squat performance. In conclusion, the isometric squat is a valid test to assess multi-joint strength and can discriminate between strong and weak 1RM squat performance. Changes greater than 11% in peak and relative isometric squat performance should be considered as meaningful in participants who are familiar with the test.

  12. ASSERT validation against the Stern Laboratories' single-phase pressure drop tests

    International Nuclear Information System (INIS)

    Waddington, G.M.; Kiteley, J.C.; Carver, M.B.

    1995-01-01

    This paper describes the preliminary validation of ASSERT-IV against the single-phase pressure drop tests from the 37-element CHF (critical heat flux) experiments conducted at Stern Laboratories, and shows how this study fits into the overall ASSERT validation plan. The effects on the pressure drop of several friction and form loss models are evaluated, including the geometry-based K-factor model. The choice of friction factor has a small effect on the predicted channel pressure drop, compared to the form loss model choice. Using the uniform K-factors of Hameed, the computed pressure drops are in excellent agreement with the experimental results from the nominal pressure tube tests. For future ASSERT applications, either Hameed's uniform K-factors or the geometry-based model using Idelchik's thick-edged orifice equation are recommended, as are the friction factor correlations of Colebrook-White, Selander, and Aly and Groeneveld. More analysis of the geometry-based K-factor model is required. (author). 23 refs., 4 tabs., 9 figs

  13. Development, construct validity and test-retest reliability of a field-based wheelchair mobility performance test for wheelchair basketball

    NARCIS (Netherlands)

    de Witte, Annemarie M. H.; Hoozemans, Marco J. M.; Berger, Monique A. M.; van der Slikke, Rienk M. A.; van der Woude, Lucas H. V.; Veeger, Dirkjan (H. E. J)

    2018-01-01

    The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of

  14. Translation and validation of the Malay version of the Stroke Knowledge Test

    Directory of Open Access Journals (Sweden)

    Siti Noorkhairina Sowtali

    2016-04-01

    Conclusions: Malay version Stroke Knowledge Test was a valid and reliable tool to assess educational needs and to evaluate stroke knowledge among participants of group-based stroke education programs in Malaysia.

  15. Development and Validation of a Video-Based Social Knowledge Test for Junior Commissioned Army Officers

    National Research Council Canada - National Science Library

    Schneider, R. J; Johnson, J. W

    2004-01-01

    Social knowledge/skill are increasingly critical to the success of U.S. Army officers. In this paper, we describe development and criterion-related validation of an experimental video-based social knowledge test...

  16. The timed "up and go" test : Reliability and validity in persons with unilateral lower limb amputation

    NARCIS (Netherlands)

    Schoppen, Tanneke; Boonstra, Antje; Groothoff, JW; de Vries, J; Goeken, LNH; Eisma, Willem

    Objective: To determine the interrater and interrater reliability and the validity of the Timed "up and go" test as a measure for physical mobility in elderly patients with an amputation of the lower extremity. Design: To test interrater reliability, the test was performed for two observers at

  17. The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

    Science.gov (United States)

    Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

    2017-10-23

    Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (preliability was (ICC3,3) = 0.953 (pvalidity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.

  18. Validity of the Optometry Admission Test in Predicting Performance in Schools and Colleges of Optometry.

    Science.gov (United States)

    Kramer, Gene A.; Johnston, JoElle

    1997-01-01

    A study examined the relationship between Optometry Admission Test scores and pre-optometry or undergraduate grade point average (GPA) with first and second year performance in optometry schools. The test's predictive validity was limited but significant, and comparable to those reported for other admission tests. In addition, the scores…

  19. Reproducibility and validity of the DynaPort KneeTest

    NARCIS (Netherlands)

    Mokkink, L.B.; Terwee, C.B.; Slikke, van der R.M.; Lummel, van R.C.; Benink, R.J.; Bouter, L.M.; Vet, de H.C.W.

    2005-01-01

    OBJECTIVE: To determine the reproducibility and validity of the DynaPort KneeTest, a performance-based test that measures quality of movement of patients undergoing total knee replacement (TKR). METHODS: A total of 92 patients with osteoarthritis (OA) of the knee performed the KneeTest twice on the

  20. Proposal and validation of a clinical trunk control test in individuals with spinal cord injury.

    Science.gov (United States)

    Quinzaños, J; Villa, A R; Flores, A A; Pérez, R

    2014-06-01

    One of the problems that arise in spinal cord injury (SCI) is alteration in trunk control. Despite the need for standardized scales, these do not exist for evaluating trunk control in SCI. To propose and validate a trunk control test in individuals with SCI. National Institute of Rehabilitation, Mexico. The test was developed and later evaluated for reliability and criteria, content, and construct validity. We carried out 531 tests on 177 patients and found high inter- and intra-rater reliability. In terms of criterion validity, analysis of variance demonstrated a statistically significant difference in the test score of patients with adequate or inadequate trunk control according to the assessment of a group of experts. A receiver operating characteristic curve was plotted for optimizing the instrument's cutoff point, which was determined at 13 points, with a sensitivity of 98% and a specificity of 92.2%. With regard to construct validity, the correlation between the proposed test and the spinal cord independence measure (SCIM) was 0.873 (P=0.001) and that with the evolution time was 0.437 (P=0.001). For testing the hypothesis with qualitative variables, the Kruskal-Wallis test was performed, which resulted in a statistically significant difference between the scores in the proposed scale of each group defined by these variables. It was proven experimentally that the proposed trunk control test is valid and reliable. Furthermore, the test can be used for all patients with SCI despite the type and level of injury.

  1. Testing the validity of stock-recruitment curve fits

    International Nuclear Information System (INIS)

    Christensen, S.W.; Goodyear, C.P.

    1988-01-01

    The utilities relied heavily on the Ricker stock-recruitment model as the basis for quantifying biological compensation in the Hudson River power case. They presented many fits of the Ricker model to data derived from striped bass catch and effort records compiled by the National Marine Fisheries Service. Based on this curve-fitting exercise, a value of 4 was chosen for the parameter alpha in the Ricker model, and this value was used to derive the utilities' estimates of the long-term impact of power plants on striped bass populations. A technique was developed and applied to address a single fundamental question: if the Ricker model were applicable to the Hudson River striped bass population, could the estimates of alpha from the curve-fitting exercise be considered reliable. The technique involved constructing a simulation model that incorporated the essential biological features of the population and simulated the characteristics of the available actual catch-per-unit-effort data through time. The ability or failure to retrieve the known parameter values underlying the simulation model via the curve-fitting exercise was a direct test of the reliability of the results of fitting stock-recruitment curves to the real data. The results demonstrated that estimates of alpha from the curve-fitting exercise were not reliable. The simulation-modeling technique provides an effective way to identify whether or not particular data are appropriate for use in fitting such models. 39 refs., 2 figs., 3 tabs

  2. Validation of a Paper and Pencil Test Battery for the Diagnosis of Minimal Hepatic Encephalopathy in Korea.

    Science.gov (United States)

    Jeong, Jae Yoon; Jun, Dae Won; Bai, Daiseg; Kim, Ji Yean; Sohn, Joo Hyun; Ahn, Sang Bong; Kim, Sang Gyune; Kim, Tae Yeob; Kim, Hyoung Su; Jeong, Soung Won; Cho, Yong Kyun; Song, Do Seon; Kim, Hee Yeon; Jung, Young Kul; Yoon, Eileen L

    2017-09-01

    The aim of this study was to validate a new paper and pencil test battery to diagnose minimal hepatic encephalopathy (MHE) in Korea. A new paper and pencil test battery was composed of number connection test-A (NCT-A), number connection test-B (NCT-B), digit span test (DST), and symbol digit modality test (SDMT). The norm of the new test was based on 315 healthy individuals between the ages of 20 and 70 years old. Another 63 healthy subjects (n = 31) and cirrhosis patients (n = 32) were included as a validation cohort. All participants completed the new paper and pencil test, a critical flicker frequency (CFF) test and computerized cognitive function test (visual continuous performance test [CPT]). The scores on the NCT-A and NCT-B increased but those of DST and SDMT decreased according to age. Twelve of the cirrhotic patients (37.5%) were diagnosed with MHE based on the new paper and pencil test battery. The total score of the paper and pencil test battery showed good positive correlation with the CFF (r = 0.551, P cognitive function test. Also, this score was lower in patients with MHE compared to those without MHE (P cognitive test decreased significantly in patients with MHE compared to those without MHE. Test-retest reliability was comparable. In conclusion, the new paper and pencil test battery including NCT-A, NCT-B, DST, and SDMT showed good correlation with neuropsychological tests. This new paper and pencil test battery could help to discriminate patients with impaired cognitive function in cirrhosis (registered at Clinical Research Information Service [CRIS], https://cris.nih.go.kr/cris, KCT0000955). © 2017 The Korean Academy of Medical Sciences.

  3. Validation of the shake test for detecting freeze damage to adsorbed vaccines.

    Science.gov (United States)

    Kartoglu, Umit; Ozgüler, Nejat Kenan; Wolfson, Lara J; Kurzatkowski, Wiesław

    2010-08-01

    To determine the validity of the shake test for detecting freeze damage in aluminium-based, adsorbed, freeze-sensitive vaccines. A double-blind crossover design was used to compare the performance of the shake test conducted by trained health-care workers (HCWs) with that of phase contrast microscopy as a "gold standard". A total of 475 vials of 8 different types of World Health Organization prequalified freeze-sensitive vaccines from 10 different manufacturers were used. Vaccines were kept at 5 degrees C. Selected numbers of vials from each type were then exposed to -25 degrees C and -2 degrees C for 24-hour periods. There was complete concordance between HCWs and phase-contrast microscopy in identifying freeze-damaged vials and non-frozen samples. Non-frozen samples showed a fine-grain structure under phase contrast microscopy, but freeze-damaged samples showed large conglomerates of massed precipitates with amorphous, crystalline, solid and needle-like structures. Particles in the non-frozen samples measured from 1 microm (vaccines against diphtheria-tetanus-pertussis; Haemophilus influenzae type b; hepatitis B; diphtheria-tetanus-pertussis-hepatitis B) to 20 microm (diphtheria and tetanus vaccines, alone or in combination). By contrast, aggregates in the freeze-damaged samples measured up to 700 microm (diphtheria-tetanus-pertussis) and 350 microm on average. The shake test had 100% sensitivity, 100% specificity and 100% positive predictive value in this study, which confirms its validity for detecting freeze damage to aluminium-based freeze-sensitive vaccines.

  4. Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

    OpenAIRE

    Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

    2015-01-01

    Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administer...

  5. 40 CFR 1048.501 - How do I run a valid emission test?

    Science.gov (United States)

    2010-07-01

    ... 40 Protection of Environment 32 2010-07-01 2010-07-01 false How do I run a valid emission test... § 1048.501 How do I run a valid emission test? (a) Use the equipment and procedures for spark-ignition... 86.132-96(h) and then operate the engine for 60 minutes over repeat runs of the duty cycle specified...

  6. Building a Validity Argument for the Test of English as a Foreign Language™

    CERN Document Server

    Chapelle, Carol A; Jamieson, Joan M

    2007-01-01

    Building a Validity Argument for the Test of English as a Foreign Language™ is distinctive in its attempt to develop a coherent story of the rationale for a test or its revision, explain the research and development process, and provide the results of the validation process. This volume is particularly relevant for professionals and graduate students in educational measurement, applied linguistics, and second language acquisition as well as anyone interested in assessment issues.

  7. Validation of the Brazilian version of the childhood asthma control test (c-ACT).

    Science.gov (United States)

    Oliveira, Suelen G; Sarria, Edgar E; Roncada, Cristian; Stein, Renato T; Pitrez, Paulo M; Mattiello, Rita

    2016-04-01

    Children's perception of their symptoms has proved reliable and relevant to disease management and should be considered when assessing their asthma control. The aim of the study is to validate the Brazilian Portuguese version of the Childhood Asthma Control Test (c-ACT) in children aged 4-11 years. This is a cross-sectional study in children diagnosed with asthma undergoing treatment in a pediatric pulmonology outpatient clinic in Porto Alegre, Brazil. The translation and linguistic adaptation of the instrument were performed in accordance with international recommendations for questionnaire validation. A total of 105 participants were included, aged 4-11 years. all correlations between the total score and items on the questionnaire were significant and obtained values of r ≥ 0.3, and c-ACT means showed statistically significant differences between the GINA categories (P ACT scores than those of uncontrolled asthma group (controlled 22.0 ± 2.9 vs. uncontrolled 16.3 ± 5.3 P ACT scores than those of uncontrolled asthma group (partially controlled 20.0 ± 4.0 vs. uncontrolled 16.3 ± 5.3 P = 0.03). Correlations between the c-ACT total score and spirometry and nitric oxide were poor (r = 0.020; P = 0.866 and r = 0.035; P = 0.753, respectively). Reliability: the α-C coefficient for the c-ACT total score was 0.677 (95%CI 0.573-0763). Sensitivity to change had an effect size of 0.8 and an intraclass correlation coefficient of 0.598. No floor or ceiling effects were observed. The Brazilian version of the Childhood Asthma Control Test proved to be valid and reliable in children aged 4-11 years. © 2015 Wiley Periodicals, Inc.

  8. Development and validation of the Approach-Iron Skill Test for use in golf.

    Science.gov (United States)

    Robertson, Samuel John; Burnett, Angus F; Newton, Robert U

    2013-01-01

    The primary aim of this study was to develop and validate a golf-specific approach-iron test for use with elite and high-level amateur golfers. Elite (n=26) and high-level amateur (n=23) golfers were recruited for this study. The 'Approach-Iron Skill Test' requires players to hit a total of 27 shots. Specifically, three shots are hit at each of nine targets on a specially constructed driving range in a randomised order. A real-time launch monitor positioned behind the player, measured the carry distance for each of these shots. A scoring system was developed based on the percentage error index of each shot, meaning that 81 points was the maximum score possible (with a maximum of three points per shot). Two rounds of the test were performed. For both rounds of the test, elite-level golfers scored significantly higher than their high-level amateur counterparts (56.3 ± 5.6 and 58.5 ± 4.6 points versus 46.0 ± 6.3 and 46.1 ± 6.7 points, respectively) (P<0.05). For both elite and high-level players, 95% limits of agreement statistics also indicated that the test showed good test-retest reliability (2.1 ± 7.9 and 0.2 ± 10.8, respectively). Due to the clinimetric properties of the test, we conclude that the Approach-Iron Skill Test is suitable for further examination with the players examined in this study.

  9. The Ostomy Adjustment Scale: translation into Norwegian language with validation and reliability testing.

    Science.gov (United States)

    Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin

    2014-01-01

    The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. The instrument is

  10. The validity of upper-limb neurodynamic tests for detecting peripheral neuropathic pain.

    Science.gov (United States)

    Nee, Robert J; Jull, Gwendolen A; Vicenzino, Bill; Coppieters, Michel W

    2012-05-01

    The validity of upper-limb neurodynamic tests (ULNTs) for detecting peripheral neuropathic pain (PNP) was assessed by reviewing the evidence on plausibility, the definition of a positive test, reliability, and concurrent validity. Evidence was identified by a structured search for peer-reviewed articles published in English before May 2011. The quality of concurrent validity studies was assessed with the Quality Assessment of Diagnostic Accuracy Studies tool, where appropriate. Biomechanical and experimental pain data support the plausibility of ULNTs. Evidence suggests that a positive ULNT should at least partially reproduce the patient's symptoms and that structural differentiation should change these symptoms. Data indicate that this definition of a positive ULNT is reliable when used clinically. Limited evidence suggests that the median nerve test, but not the radial nerve test, helps determine whether a patient has cervical radiculopathy. The median nerve test does not help diagnose carpal tunnel syndrome. These findings should be interpreted cautiously, because diagnostic accuracy might have been distorted by the investigators' definitions of a positive ULNT. Furthermore, patients with PNP who presented with increased nerve mechanosensitivity rather than conduction loss might have been incorrectly classified by electrophysiological reference standards as not having PNP. The only evidence for concurrent validity of the ulnar nerve test was a case study on cubital tunnel syndrome. We recommend that researchers develop more comprehensive reference standards for PNP to accurately assess the concurrent validity of ULNTs and continue investigating the predictive validity of ULNTs for prognosis or treatment response.

  11. The ad-libitum alcohol 'taste test': secondary analyses of potential confounds and construct validity.

    Science.gov (United States)

    Jones, Andrew; Button, Emily; Rose, Abigail K; Robinson, Eric; Christiansen, Paul; Di Lemma, Lisa; Field, Matt

    2016-03-01

    Motivation to drink alcohol can be measured in the laboratory using an ad-libitum 'taste test', in which participants rate the taste of alcoholic drinks whilst their intake is covertly monitored. Little is known about the construct validity of this paradigm. The objective of this study was to investigate variables that may compromise the validity of this paradigm and its construct validity. We re-analysed data from 12 studies from our laboratory that incorporated an ad-libitum taste test. We considered time of day and participants' awareness of the purpose of the taste test as potential confounding variables. We examined whether gender, typical alcohol consumption, subjective craving, scores on the Alcohol Use Disorders Identification Test and perceived pleasantness of the drinks predicted ad-libitum consumption (construct validity). We included 762 participants (462 female). Participant awareness and time of day were not related to ad-libitum alcohol consumption. Males drank significantly more alcohol than females (p alcohol consumption (p = 0.04), craving (p alcohol consumption. The construct validity of the taste test was supported by relationships between ad-libitum consumption and typical alcohol consumption, craving and pleasantness ratings of the drinks. The ad-libitum taste test is a valid method for the assessment of alcohol intake in the laboratory.

  12. Development of an Agility Test for Badminton Players and Assessment of Its Validity and Test-Retest Reliability.

    Science.gov (United States)

    Loureiro, Luiz de França Bahia; de Freitas, Paulo Barbosa

    2016-04-01

    Badminton requires open and fast actions toward the shuttlecock, but there is no specific agility test for badminton players with specific movements. To develop an agility test that simultaneously assesses perception and motor capacity and examine the test's concurrent and construct validity and its test-retest reliability. The Badcamp agility test consists of running as fast as possible to 6 targets placed on the corners and middle points of a rectangular area (5.6 × 4.2 m) from the start position located in the center of it, following visual stimuli presented in a luminous panel. The authors recruited 43 badminton players (17-32 y old) to evaluate concurrent (with shuttle-run agility test--SRAT) and construct validity and test-retest reliability. Results revealed that Badcamp presents concurrent and construct validity, as its performance is strongly related to SRAT (ρ = 0.83, P < .001), with performance of experts being better than nonexpert players (P < .01). In addition, Badcamp is reliable, as no difference (P = .07) and a high intraclass correlation (ICC = .93) were found in the performance of the players on 2 different occasions. The findings indicate that Badcamp is an effective, valid, and reliable tool to measure agility, allowing coaches and athletic trainers to evaluate players' athletic condition and training effectiveness and possibly detect talented individuals in this sport.

  13. Validity, Reliability and Standardization Study of the Language Assessment Test for Aphasia

    Directory of Open Access Journals (Sweden)

    Bülent Toğram

    2012-09-01

    Full Text Available OBJECTIVE: Aphasia assessment is the first step towards a well- founded language therapy. Language tests need to consider cultural as well as typological linguistic aspects of a given language. This study was designed to determine the standardization, validity and reliability of Language Assessment Test for Aphasia, which consists of eight subtests including spontaneous speech and language, auditory comprehension, repetition, naming, reading, grammar, speech acts, and writing. METHODS: The test was administered to 282 healthy participants and 92 aphasic participants in age, education and gender matched groups. The validity study of the test was investigated with analysis of content, structure and criterion-related validity. For reliability of the test, the analysis of internal consistency, stability and equivalence reliability was conducted. The influence of variables on healhty participants’ sub-test scores, test score and language score was examined. According to significant differences, norms and cut-off scores based on language score were determined. RESULTS: The group with aphasia performed highly lower than healthy participants on subtest, test and language scores. The test scores of healthy group were mostly affected by age and educational level but not affected by gender. According to significant differences, age and educational level for both groups were determined. Considering age and educational levels, the reference values for the cut-off scores were presented. CONCLUSION: The test was found to be a highly reliable and valid aphasia test for Turkish- speaking aphasic patients either in Turkey or other Turkish communities around the world

  14. Validity of a cross-specialty test in basic laparoscopic techniques (TABLT)

    DEFF Research Database (Denmark)

    Thinggaard, Ebbe; Bjerrum, Flemming; Strandbygaard, Jeanett

    2015-01-01

    . The aim of this study was to establish validity evidence for the Training and Assessment of Basic Laparoscopic Techniques (TABLT) test, a tablet-based training system. METHODS: Laparoscopic surgeons and trainees were recruited from departments of general surgery, gynaecology and urology. Participants...... included novice, intermediate and experienced surgeons. All participants performed the TABLT test. Performance scores were calculated based on time taken and errors made. Evidence of validity was explored using a contemporary framework of validity. RESULTS: Some 60 individuals participated. The TABLT...... was shown to be reliable, with an intraclass correlation coefficient of 0·99 (P value of 0·73 (P 

  15. Preparation, validation and user-testing of pictogram-based patient information leaflets for tuberculosis.

    Science.gov (United States)

    Shrestha, Anmol; Rajesh, V; Dessai, Sneha Shamrao; Stanly, Sharon Mary; Mateti, Uday Venkat

    2018-05-25

    Patient education is of paramount importance with regard to the condition of the disease and the treatment given besides lifestyle remodelling in order to get the desired therapeutic outcome. When verbal information is provided to the patients, they often tend to forget it. Pictorial aids or pictograms, as they are commonly known, are tools that are widely used for imparting knowledge to the patients. The aim of the study is to prepare and validate a Pictogram-based Patient Information Leaflet (P-PILs) on Tuberculosis (TB). P-PILs have been prepared from tertiary, secondary and primary sources. The knowledge-based questions are prepared with respect to the P-PILs. The baseline knowledge of the volunteers and patients have been analyzed before administering the P-PILs by using the validated questionnaire. The post-knowledge of the volunteers and patients has been analyzed after administering the P-PILs (20 min) by using the same questionnaire and the user-opinion has also been obtained at the end. The study results show that the mean scores of the overall user-testing knowledge assessment are found to have improved significantly from the pre-P-PILs administration score of 62.67 to the post-P-PILs administration score of 91. The overall user-opinion about the P-PILs has been found to be good (75%) followed by average (25%). The present study shows that there is significant improvement in the knowledge levels of the patients and volunteers after reading the validated leaflets. The Pictogram-based Patient Information Leaflets are found to be an effective educational tool for TB patients. Copyright © 2018. Published by Elsevier Ltd.

  16. Veggie ISS Validation Test Results and Produce Consumption

    Science.gov (United States)

    Massa, Gioia; Hummerick, Mary; Spencer, LaShelle; Smith, Trent

    2015-01-01

    The Veggie vegetable production system flew to the International Space Station (ISS) in the spring of 2014. The first set of plants, Outredgeous red romaine lettuce, was grown, harvested, frozen, and returned to Earth in October. Ground control and flight plant tissue was sub-sectioned for microbial analysis, anthocyanin antioxidant phenolic analysis, and elemental analysis. Microbial analysis was also performed on samples swabbed on orbit from plants, Veggie bellows, and plant pillow surfaces, on water samples, and on samples of roots, media, and wick material from two returned plant pillows. Microbial levels of plants were comparable to ground controls, with some differences in community composition. The range in aerobic bacterial plate counts between individual plants was much greater in the ground controls than in flight plants. No pathogens were found. Anthocyanin concentrations were the same between ground and flight plants, while antioxidant and phenolic levels were slightly higher in flight plants. Elements varied, but key target elements for astronaut nutrition were similar between ground and flight plants. Aerobic plate counts of the flight plant pillow components were significantly higher than ground controls. Surface swab samples showed low microbial counts, with most below detection limits. Flight plant microbial levels were less than bacterial guidelines set for non-thermostabalized food and near or below those for fungi. These guidelines are not for fresh produce but are the closest approximate standards. Forward work includes the development of standards for space-grown produce. A produce consumption strategy for Veggie on ISS includes pre-flight assessments of all crops to down select candidates, wiping flight-grown plants with sanitizing food wipes, and regular Veggie hardware cleaning and microbial monitoring. Produce then could be consumed by astronauts, however some plant material would be reserved and returned for analysis. Implementation of

  17. Validity of a Newly-Designed Rectilinear Stepping Ergometer Submaximal Exercise Test to Assess Cardiorespiratory Fitness.

    Science.gov (United States)

    Zhang, Rubin; Zhan, Likui; Sun, Shaoming; Peng, Wei; Sun, Yining

    2017-09-01

    The maximum oxygen uptake (V̇O 2 max), determined from graded maximal or submaximal exercise tests, is used to classify the cardiorespiratory fitness level of individuals. The purpose of this study was to examine the validity and reliability of the YMCA submaximal exercise test protocol performed on a newly-designed rectilinear stepping ergometer (RSE) that used up and down reciprocating vertical motion in place of conventional circular motion and giving precise measurement of workload, to determine V̇O 2 max in young healthy male adults. Thirty-two young healthy male adults (32 males; age range: 20-35 years; height: 1.75 ± 0.05 m; weight: 67.5 ± 8.6 kg) firstly participated in a maximal-effort graded exercise test using a cycle ergometer (CE) to directly obtain measured V̇O 2 max. Subjects then completed the progressive multistage test on the RSE beginning at 50W and including additional stages of 70, 90, 110, 130, and 150W, and the RSE YMCA submaximal test consisting of a workload increase every 3 minutes until the termination criterion was reached. A metabolic equation was derived from the RSE multistage exercise test to predict oxygen consumption (V̇O 2 ) from power output (W) during the submaximal exercise test (V̇O 2 (mL·min -1 )=12.4 ×W(watts)+3.5 mL·kg -1 ·min -1 ×M+160mL·min -1 , R 2 = 0.91, standard error of the estimate (SEE) = 134.8mL·min -1 ). A high correlation was observed between the RSE YMCA estimated V̇O 2 max and the CE measured V̇O 2 max (r=0.87). The mean difference between estimated and measured V̇O 2 max was 2.5 mL·kg -1 ·min -1 , with an SEE of 3.55 mL·kg -1 ·min -1 . The data suggest that the RSE YMCA submaximal exercise test is valid for predicting V̇O 2 max in young healthy male adults. The findings show that the rectilinear stepping exercise is an effective submaximal exercise for predicting V̇O 2 max. The newly-designed RSE may be potentially further developed as an alternative ergometer for assessing

  18. Validating a UAV artificial intelligence control system using an autonomous test case generator

    Science.gov (United States)

    Straub, Jeremy; Huber, Justin

    2013-05-01

    The validation of safety-critical applications, such as autonomous UAV operations in an environment which may include human actors, is an ill posed problem. To confidence in the autonomous control technology, numerous scenarios must be considered. This paper expands upon previous work, related to autonomous testing of robotic control algorithms in a two dimensional plane, to evaluate the suitability of similar techniques for validating artificial intelligence control in three dimensions, where a minimum level of airspeed must be maintained. The results of human-conducted testing are compared to this automated testing, in terms of error detection, speed and testing cost.

  19. Validation of a fracture mechanics approach to nuclear transportation cask design through a drop test program

    International Nuclear Information System (INIS)

    Sorenson, K.B.

    1986-01-01

    Sandia National Laboratories (SNL), under contract to the Department of Energy, is conducting a research program to develop and validate a fracture mechanics approach to cask design. A series of drop tests of a transportation cask is planned for the summer of 1986 as the method for benchmarking and, thereby, validating the fracture mechanics approach. This paper presents the drop test plan and background leading to the development of the test plan including structural analyses, material characterization, and non-destructive evaluation (NDE) techniques necessary for defining the test plan properly

  20. Establishing the Test-Retest Reliability & Concurrent Validity for the Repeat Ice Skating Test (RIST) in Adolescent Male Ice Hockey Players

    Science.gov (United States)

    Power, Allan; Faught, Brent E.; Przysucha, Eryk; McPherson, Moira; Montelpare, William

    2012-01-01

    In this study the authors examine the test-retest reliability and concurrent validity of the Repeat Ice Skating Test (RIST). This was an on-ice field anaerobic test that measured average peak power and was validated with 3 anaerobic lab tests: (a) vertical jump, (b) the Margaria-Kalamen stair test, and (c) the Wingate Anaerobic Test. The…

  1. Decentral gene expression analysis: analytical validation of the Endopredict genomic multianalyte breast cancer prognosis test

    Directory of Open Access Journals (Sweden)

    Kronenwett Ralf

    2012-10-01

    Full Text Available Abstract Background EndoPredict (EP is a clinically validated multianalyte gene expression test to predict distant metastasis in ER-positive, HER2-negative breast cancer treated with endocrine therapy alone. The test is based on the combined analysis of 12 genes in formalin-fixed, paraffin-embedded (FFPE tissue by reverse transcription-quantitative real-time PCR (RT-qPCR. Recently, it was shown that EP is feasible for reliable decentralized assessment of gene expression. The aim of this study was the analytical validation of the performance characteristics of the assay and its verification in a molecular-pathological routine laboratory. Methods Gene expression values to calculate the EP score were assayed by one-step RT-qPCR using RNA from FFPE tumor tissue. Limit of blank, limit of detection, linear range, and PCR efficiency were assessed for each of the 12 PCR assays using serial samples dilutions. Different breast cancer samples were used to evaluate RNA input range, precision and inter-laboratory variability. Results PCR assays were linear up to Cq values between 35.1 and 37.2. Amplification efficiencies ranged from 75% to 101%. The RNA input range without considerable change of the EP score was between 0.16 and 18.5 ng/μl. Analysis of precision (variation of day, day time, instrument, operator, reagent lots resulted in a total noise (standard deviation of 0.16 EP score units on a scale from 0 to 15. The major part of the total noise (SD 0.14 was caused by the replicate-to-replicate noise of the PCR assays (repeatability and was not associated with different operating conditions (reproducibility. Performance characteristics established in the manufacturer’s laboratory were verified in a routine molecular pathology laboratory. Comparison of 10 tumor samples analyzed in two different laboratories showed a Pearson coefficient of 0.995 and a mean deviation of 0.15 score units. Conclusions The EP test showed reproducible performance

  2. Decentral gene expression analysis: analytical validation of the Endopredict genomic multianalyte breast cancer prognosis test

    International Nuclear Information System (INIS)

    Kronenwett, Ralf; Brase, Jan C; Weber, Karsten E; Fisch, Karin; Müller, Berit M; Schmidt, Marcus; Filipits, Martin; Dubsky, Peter; Petry, Christoph; Dietel, Manfred; Denkert, Carsten; Bohmann, Kerstin; Prinzler, Judith; Sinn, Bruno V; Haufe, Franziska; Roth, Claudia; Averdick, Manuela; Ropers, Tanja; Windbergs, Claudia

    2012-01-01

    EndoPredict (EP) is a clinically validated multianalyte gene expression test to predict distant metastasis in ER-positive, HER2-negative breast cancer treated with endocrine therapy alone. The test is based on the combined analysis of 12 genes in formalin-fixed, paraffin-embedded (FFPE) tissue by reverse transcription-quantitative real-time PCR (RT-qPCR). Recently, it was shown that EP is feasible for reliable decentralized assessment of gene expression. The aim of this study was the analytical validation of the performance characteristics of the assay and its verification in a molecular-pathological routine laboratory. Gene expression values to calculate the EP score were assayed by one-step RT-qPCR using RNA from FFPE tumor tissue. Limit of blank, limit of detection, linear range, and PCR efficiency were assessed for each of the 12 PCR assays using serial samples dilutions. Different breast cancer samples were used to evaluate RNA input range, precision and inter-laboratory variability. PCR assays were linear up to C q values between 35.1 and 37.2. Amplification efficiencies ranged from 75% to 101%. The RNA input range without considerable change of the EP score was between 0.16 and 18.5 ng/μl. Analysis of precision (variation of day, day time, instrument, operator, reagent lots) resulted in a total noise (standard deviation) of 0.16 EP score units on a scale from 0 to 15. The major part of the total noise (SD 0.14) was caused by the replicate-to-replicate noise of the PCR assays (repeatability) and was not associated with different operating conditions (reproducibility). Performance characteristics established in the manufacturer’s laboratory were verified in a routine molecular pathology laboratory. Comparison of 10 tumor samples analyzed in two different laboratories showed a Pearson coefficient of 0.995 and a mean deviation of 0.15 score units. The EP test showed reproducible performance characteristics with good precision and negligible laboratory

  3. Use of the color trails test as an embedded measure of performance validity.

    Science.gov (United States)

    Henry, George K; Algina, James

    2013-01-01

    One hundred personal injury litigants and disability claimants referred for a forensic neuropsychological evaluation were administered both portions of the Color Trails Test (CTT) as part of a more comprehensive battery of standardized tests. Subjects who failed two or more free-standing tests of cognitive performance validity formed the Failed Performance Validity (FPV) group, while subjects who passed all free-standing performance validity measures were assigned to the Passed Performance Validity (PPV) group. A cutscore of ≥45 seconds to complete Color Trails 1 (CT1) was associated with a classification accuracy of 78%, good sensitivity (66%) and high specificity (90%), while a cutscore of ≥84 seconds to complete Color Trails 2 (CT2) was associated with a classification accuracy of 82%, good sensitivity (74%) and high specificity (90%). A CT1 cutscore of ≥58 seconds, and a CT2 cutscore ≥100 seconds was associated with 100% positive predictive power at base rates from 20 to 50%.

  4. Validation of a wind tunnel testing facility for blade surface pressure measurements

    Energy Technology Data Exchange (ETDEWEB)

    Fuglsang, P.; Antoniou, I.; Soerensen, N.N.; Madsen, H.A.

    1998-04-01

    This report concerns development and validation of a 2d testing facility for airfoil pressure measurements. The VELUX open jet wind tunnel was used with a test stand inserted. Reynolds numbers until 1.3 million were achieved with an airfoil chord of 0.45 m. The aerodynamic load coefficients were found from pressure distribution measurements and the total drag coefficient was calculated from wake rake measurements. Stationary inflow as well as dynamic inflow through pitching motion was possible. Wind tunnel corrections were applied for streamline curvature and down-wash. Even though the wind tunnel is not ideal for 2d testing, the overall quality of the flow was acceptable with a uniform flow field at the test stand position and a turbulence intensity of 1 % at the inlet of the test section. Reference values for free stream static and total pressure were found upstream of the test stand. The NACA 63-215 airfoil was tested and the results were compared with measurements from FFA and NACA. The measurements agreed well except for lift coefficient values at high angles of attack and the drag coefficient values at low angles of attack, that were slightly high. Comparisons of the measured results with numerical predictions from the XFOIL code and the EllipSys2D code showed good agreement. Measurements with the airfoil in pitching motion were carried out to study the dynamic aerodynamic coefficients. Steady inflow measurements at high angles of attack were used to investigate the double stall phenomenon. (au) EFP-94; EFP-95; EFP-97. 8 tabs., 82 ills., 16 refs.

  5. Preliminary Validation of a New Measure of Negative Response Bias: The Temporal Memory Sequence Test.

    Science.gov (United States)

    Hegedish, Omer; Kivilis, Naama; Hoofien, Dan

    2015-01-01

    The Temporal Memory Sequence Test (TMST) is a new measure of negative response bias (NRB) that was developed to enrich the forced-choice paradigm. The TMST does not resemble the common structure of forced-choice tests and is presented as a temporal recall memory test. The validation sample consisted of 81 participants: 21 healthy control participants, 20 coached simulators, and 40 patients with acquired brain injury (ABI). The TMST had high reliability and significantly high positive correlations with the Test of Memory Malingering and Word Memory Test effort scales. Moreover, the TMST effort scales exhibited high negative correlations with the Glasgow Coma Scale, thus validating the previously reported association between probable malingering and mild traumatic brain injury. A suggested cutoff score yielded acceptable classification rates in the ABI group as well as in the simulator and control groups. The TMST appears to be a promising measure of NRB detection, with respectable rates of reliability and construct and criterion validity.

  6. Validation of the conceptual research utilization scale: an application of the standards for educational and psychological testing in healthcare

    Science.gov (United States)

    2011-01-01

    Background There is a lack of acceptable, reliable, and valid survey instruments to measure conceptual research utilization (CRU). In this study, we investigated the psychometric properties of a newly developed scale (the CRU Scale). Methods We used the Standards for Educational and Psychological Testing as a validation framework to assess four sources of validity evidence: content, response processes, internal structure, and relations to other variables. A panel of nine international research utilization experts performed a formal content validity assessment. To determine response process validity, we conducted a series of one-on-one scale administration sessions with 10 healthcare aides. Internal structure and relations to other variables validity was examined using CRU Scale response data from a sample of 707 healthcare aides working in 30 urban Canadian nursing homes. Principal components analysis and confirmatory factor analyses were conducted to determine internal structure. Relations to other variables were examined using: (1) bivariate correlations; (2) change in mean values of CRU with increasing levels of other kinds of research utilization; and (3) multivariate linear regression. Results Content validity index scores for the five items ranged from 0.55 to 1.00. The principal components analysis predicted a 5-item 1-factor model. This was inconsistent with the findings from the confirmatory factor analysis, which showed best fit for a 4-item 1-factor model. Bivariate associations between CRU and other kinds of research utilization were statistically significant (p use, and longitudinal work to determine CRU Scale sensitivity to change. PMID:21595888

  7. Concurrent Validity and Feasibility of Short Tests Currently Used to Measure Early Childhood Development in Large Scale Studies.

    Directory of Open Access Journals (Sweden)

    Marta Rubio-Codina

    Full Text Available In low- and middle-income countries (LIMCs, measuring early childhood development (ECD with standard tests in large scale surveys and evaluations of interventions is difficult and expensive. Multi-dimensional screeners and single-domain tests ('short tests' are frequently used as alternatives. However, their validity in these circumstances is unknown. We examined the feasibility, reliability, and concurrent validity of three multi-dimensional screeners (Ages and Stages Questionnaires (ASQ-3, Denver Developmental Screening Test (Denver-II, Battelle Developmental Inventory screener (BDI-2 and two single-domain tests (MacArthur-Bates Short-Forms (SFI and SFII, WHO Motor Milestones (WHO-Motor in 1,311 children 6-42 months in Bogota, Colombia. The scores were compared with those on the Bayley Scales of Infant and Toddler Development (Bayley-III, taken as the 'gold standard'. The Bayley-III was given at a center by psychologists; whereas the short tests were administered in the home by interviewers, as in a survey setting. Findings indicated good internal validity of all short tests except the ASQ-3. The BDI-2 took long to administer and was expensive, while the single-domain tests were quickest and cheapest and the Denver-II and ASQ-3 were intermediate. Concurrent validity of the multi-dimensional tests' cognitive, language, and fine motor scales with the corresponding Bayley-III scale was low below 19 months. However, it increased with age, becoming moderate-to-high over 30 months. In contrast, gross motor scales' concurrence was high under 19 months and then decreased. Of the single-domain tests, the WHO-Motor had high validity with gross motor under 16 months, and the SFI and SFII expressive scales showed moderate correlations with language under 30 months. Overall, the Denver-II was the most feasible and valid multi-dimensional test and the ASQ-3 performed poorly under 31 months. By domain, gross motor development had the highest concurrence

  8. The Predictive Validity of using Admissions Testing and Multiple Mini-interviews in Undergraduate University Admissions

    DEFF Research Database (Denmark)

    Makransky, Guido; Havmose, Philip S.; Vang, Maria Louison

    2017-01-01

    The aim of this study was to evaluate the predictive validity of a two-step admissions procedure that included a cognitive ability test followed by multiple mini-interviews (MMI) used to assess non-cognitive skills compared to a grade-based admissions relative to subsequent drop-out rates...... and academic achievement after one and two years of study. The participants consisted of the entire population of 422 psychology students who were admitted to the University of Southern Denmark between 2010 and 2013. The results showed significantly lower drop-out rates after the first year of study, and non......-significant lower drop-out rates after the second year of study for the admission procedure that included the assessment of non-cognitive skills though the MMI. Furthermore, this admission procedure resulted in a significant lower risk of failing the final exam after the first and second year of study, compared...

  9. Conceal, don't feel, don't let it show: intentional versus instructed cheating in the Concealed Information Test

    NARCIS (Netherlands)

    Geven, L.; Klein Selle, N.; Ben-Shakhar, G.; Kindt, M.; Verschuere, B.

    2017-01-01

    The validity of the CIT has been demonstrated in hundreds of laboratory studies. Most studies, however, lack key ingredients of real‐life deception. One such factor is self‐initiated deception; in contrast to perpetrators committing real crimes, research participants are typically instructed to

  10. Investigation of reliability, validity and normality Persian version of the California Critical Thinking Skills Test; Form B (CCTST

    Directory of Open Access Journals (Sweden)

    Khallli H

    2003-04-01

    Full Text Available Background: To evaluate the effectiveness of the present educational programs in terms of students' achieving problem solving, decision making and critical thinking skills, reliable, valid and standard instrument are needed. Purposes: To Investigate the Reliability, validity and Norm of CCTST Form.B .The California Critical Thinking Skills Test contain 34 multi-choice questions with a correct answer in the jive Critical Thinking (CT cognitive skills domain. Methods: The translated CCTST Form.B were given t0405 BSN nursing students ojNursing Faculties located in Tehran (Tehran, Iran and Shahid Beheshti Universitiesthat were selected in the through random sampling. In order to determine the face and content validity the test was translated and edited by Persian and English language professor and researchers. it was also confirmed by judgments of a panel of medical education experts and psychology professor's. CCTST reliability was determined with internal consistency and use of KR-20. The construct validity of the test was investigated with factor analysis and internal consistency and group difference. Results: The test coefficien for reliablity was 0.62. Factor Analysis indicated that CCTST has been formed from 5 factor (element namely: Analysis, Evaluation, lriference, Inductive and Deductive Reasoning. Internal consistency method shows that All subscales have been high and positive correlation with total test score. Group difference method between nursing and philosophy students (n=50 indicated that there is meaningfUl difference between nursing and philosophy students scores (t=-4.95,p=0.OOO1. Scores percentile norm also show that percentile offifty scores related to 11 raw score and 95, 5 percentiles are related to 17 and 6 raw score ordinary. Conclusions: The Results revealed that the questions test is sufficiently reliable as a research tool, and all subscales measure a single construct (Critical Thinking and are able to distinguished the

  11. Excellent cross-cultural validity, intra-test reliability and construct validity of the dutch rivermead mobility index in patients after stroke undergoing rehabilitation

    NARCIS (Netherlands)

    Roorda, Leo D.; Green, John; De Kluis, Kiki R. A.; Molenaar, Ivo W.; Bagley, Pam; Smith, Jane; Geurts, Alexander C. H.

    2008-01-01

    Objective: To investigate the cross-cultural validity of international Dutch-English comparisons when using the Dutch Rivermead Mobility Index (RMI), and the intra-test reliability and construct validity of the Dutch RMI. Methods: Cross-cultural validity was studied in a combined data-set of Dutch

  12. Known-Groups and Concurrent Validity of the Mandarin Tone Identification Test (MTIT.

    Directory of Open Access Journals (Sweden)

    Shufeng Zhu

    Full Text Available The Mandarin Tone Identification Test (MTIT is a new test designed to assess the tone identification abilities of children with hearing impairment (HI. Evidence for reliability and sensitivity has been reported. The present study aimed to evaluate the known-groups and concurrent validity of the MTIT.The MTIT and Mandarin Pediatric Speech Intelligibility test (MPSI were administered in quiet and in noise conditions. The known-groups validity was evaluated by comparing the performance of the MTIT on children with two different levels of HI. The MPSI was included to evaluate the concurrent validity of the MTIT.81 children with HI were recruited in the present study. They were Mandarin-speaking children with profound HI (mean age = 9; 0, n = 41 and with moderate to severe HI (mean age = 8; 9, n = 40.Scores on the MTIT differed between the two groups with different hearing levels suggesting good known-groups validity. A strong relationship between tone and sentence perception both in quiet and in noise provided preliminary evidence for concurrent validity.The present study confirmed that the MTIT has good known-groups validity and provided preliminary evidence for concurrent validity. The MTIT could be used to evaluate tone identification ability in children with HI with confidence.

  13. Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.

    Science.gov (United States)

    Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid

    2017-11-01

    Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenient sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.

  14. Validity and reliability of the 10-m walk test and the 6-min walk test in spinal cord injury patients.

    Science.gov (United States)

    Scivoletto, G; Tamburella, F; Laurenza, L; Foti, C; Ditunno, J F; Molinari, M

    2011-06-01

    The 10-m walk test (10MWT) and the 6-min walk test (6MWT) have been recommended for assessment of walking in spinal cord injury (SCI) patients. The study was designed on test-retest analysis of the 10MWT and 6MWT. The objective of this study was to assess validity/reliability of different methods of performing the tests. The study was set at an SCI unit of a rehabilitation hospital. A total of 37 patients; whose median age was 58.5 years (interquartile range 40-66, full range 19-77); median time since onset of SCI was 24 months (interquartile range 16.25-70.5, full range 6-109). Non-traumatic etiology in 20 out of 37 patients; level: 12C, 14T and 11L; American Spinal Injury Association Impairment Scale grade: 35D/2C. Assessment with the 10MWT (with or without dynamic start) and the 6MWT (short or long track) by two blinded raters to evaluate inter/intra-rater reliabilities. The 10MWT was performed in a median of 19 s (25th-75th interquartile range 13-28) with the dynamic start and of 18.4 s (25th-75th interquartile range 12.6-29.9) with the static start (P=0.092). The correlation between the results of the two methods was between 0.98 and 0.99. The inter- and intra-rater reliabilities were between 0.95 and 0.99 for both the methods. The 6MWT showed significant differences according to the track length: patients walked a median of 226.7 m (25th-75th interquartile range 123.2-319) on the longer track and of 187.6 m (25th-75th interquartile range 69.7-240.6) on the short one (P<0.001). The correlation between the results of the two methods was between 0.91 and 0.93. The inter- and intra-rater reliabilities were between 0.98 and 0.99. The 10MWT shows high inter/intra-rater reliability and shows comparable results with both dynamic and static start. The different testing conditions of the 6MWT (track/turns) results in significant differences that need standardization for use in future trials.

  15. [Evaluation of Suicide Risk Levels in Hospitals: Validity and Reliability Tests].

    Science.gov (United States)

    Macagnino, Sandro; Steinert, Tilman; Uhlmann, Carmen

    2018-05-01

    Examination of in-hospital suicide risk levels concerning their validity and their reliability. The internal suicide risk levels were evaluated in a cross sectional study of in 163 inpatients. A reliability check was performed via determining interrater-reliability of senior physician, therapist and the responsible nurse. Within the scope of the validity check, we conducted analyses of criterion validity and construct validity. For the total sample an "acceptable" to "good" interrater-reliability (Kendalls W = .77) of suicide risk levels were obtained. Schizophrenic disorders showed the lowest values, for personality disorders we found the highest level of interrater-reliability. When examining the criterion validity, Item-9 of the BDI-II is substantial correlated to our suicide risk levels (ρ m  = .54, p validity check, affective disorders showed the highest correlation (ρ = .77), compatible also with "convergent validity". They differed with schizophrenic disorders which showed the least concordance (ρ = .43). In-hospital suicide risk levels may represent an important contribution to the assessment of suicidal behavior of inpatients experiencing psychiatric treatment due to their overall good validity and reliability. © Georg Thieme Verlag KG Stuttgart · New York.

  16. Validation of the OECD reproduction test guideline with the New Zealand mudsnail Potamopyrgus antipodarum using trenbolone and prochloraz.

    Science.gov (United States)

    Geiß, Cornelia; Ruppert, Katharina; Askem, Clare; Barroso, Carlos; Faber, Daniel; Ducrot, Virginie; Holbech, Henrik; Hutchinson, Thomas H; Kajankari, Paula; Kinnberg, Karin Lund; Lagadic, Laurent; Matthiessen, Peter; Morris, Steve; Neiman, Maurine; Penttinen, Olli-Pekka; Sanchez-Marin, Paula; Teigeler, Matthias; Weltje, Lennart; Oehlmann, Jörg

    2017-04-01

    The Organisation for Economic Cooperation and Development (OECD) provides several standard test methods for the environmental hazard assessment of chemicals, mainly based on primary producers, arthropods, and fish. In April 2016, two new test guidelines with two mollusc species representing different reproductive strategies were approved by OECD member countries. One test guideline describes a 28-day reproduction test with the parthenogenetic New Zealand mudsnail Potamopyrgus antipodarum. The main endpoint of the test is reproduction, reflected by the embryo number in the brood pouch per female. The development of a new OECD test guideline involves several phases including inter-laboratory validation studies to demonstrate the robustness of the proposed test design and the reproducibility of the test results. Therefore, a ring test of the reproduction test with P. antipodarum was conducted including eight laboratories with the test substances trenbolone and prochloraz and results are presented here. Most laboratories could meet test validity criteria, thus demonstrating the robustness of the proposed test protocol. Trenbolone did not have an effect on the reproduction of the snails at the tested concentration range (nominal: 10-1000 ng/L). For prochloraz, laboratories produced similar EC 10 and NOEC values, showing the inter-laboratory reproducibility of results. The average EC 10 and NOEC values for reproduction (with coefficient of variation) were 26.2 µg/L (61.7%) and 29.7 µg/L (32.9%), respectively. This ring test shows that the mudsnail reproduction test is a well-suited tool for use in the chronic aquatic hazard and risk assessment of chemicals.

  17. Comprehensive validation scheme for in situ fiber optics dissolution method for pharmaceutical drug product testing.

    Science.gov (United States)

    Mirza, Tahseen; Liu, Qian Julie; Vivilecchia, Richard; Joshi, Yatindra

    2009-03-01

    There has been a growing interest during the past decade in the use of fiber optics dissolution testing. Use of this novel technology is mainly confined to research and development laboratories. It has not yet emerged as a tool for end product release testing despite its ability to generate in situ results and efficiency improvement. One potential reason may be the lack of clear validation guidelines that can be applied for the assessment of suitability of fiber optics. This article describes a comprehensive validation scheme and development of a reliable, robust, reproducible and cost-effective dissolution test using fiber optics technology. The test was successfully applied for characterizing the dissolution behavior of a 40-mg immediate-release tablet dosage form that is under development at Novartis Pharmaceuticals, East Hanover, New Jersey. The method was validated for the following parameters: linearity, precision, accuracy, specificity, and robustness. In particular, robustness was evaluated in terms of probe sampling depth and probe orientation. The in situ fiber optic method was found to be comparable to the existing manual sampling dissolution method. Finally, the fiber optic dissolution test was successfully performed by different operators on different days, to further enhance the validity of the method. The results demonstrate that the fiber optics technology can be successfully validated for end product dissolution/release testing. (c) 2008 Wiley-Liss, Inc. and the American Pharmacists Association

  18. Six factors of adult dyslexia assesed by cognitive tests and self-report questions: Very high predictive validity

    NARCIS (Netherlands)

    Tamboer, P.; Vorst, H.C.M.; de Jong, P.F.

    2017-01-01

    The Multiple Diagnostic Digital Dyslexia Test for Adults (MDDDT-A) consists of 12 newly developed tests and self-report questions in the Dutch language. Predictive validity and construct validity were investigated and compared with validity of a standard test battery of dyslexia (STB) in a sample of

  19. Validation of MATRA-S Low Flow Predictions Using PNL 2x6 Mixed Convection Test

    Energy Technology Data Exchange (ETDEWEB)

    Seo, Kyong-Won; Kwon, Hyuk; Kim, Seong-Jin; Hwang, Dae-Hyun [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

    2015-10-15

    The MATRA-S, a subchannel analysis code has been used to thermal-hydraulic design of SMART core. As the safety enhancement is getting important more and more, some features of the MATRA-S code are required to be validated in order to be applied to nonnominal operating conditions in addition to its applicability to reactor design under normal operating conditions. The MATRA-S code has two numerical schemes, SCHEME for implicit application and XSCHEM for explicit one. The implicit scheme had been developed under assumptions that the axial flow is larger enough than the crossflow. Under certain conditions, especially low flow and low pressure operating conditions, this implicit SCHEME oscillates or becomes unstable numerically and then MATRA-S fails to obtain good solution. These demerits were known as common in implicit schemes of many COBRA families. Efforts have been exerted to resolve these limitations in SCHEME of the MATRA-S such as a once through marching scheme against the multi-pass marching scheme and an adaptive multi-grid method. These remedies can reduce the numerically unstable range for SCHEME but some unstable regions still remain. The XSCHEM, an explicit scheme of MATRA-S was validated using the PNL 2x6 rod bundle flow transient test. The explicit scheme agreed with implicit scheme for steady state calculations. And it showed its capability to predict low flow conditions such as negative flow and recirculation flow.

  20. Establishing a 'Physician's Spiritual Well-being Scale' and testing its reliability and validity.

    Science.gov (United States)

    Fang, C K; Li, P Y; Lai, M L; Lin, M H; Bridge, D T; Chen, H W

    2011-01-01

    The purpose of this study was to develop a Physician's Spiritual Well-Being Scale (PSpWBS). The significance of a physician's spiritual well-being was explored through in-depth interviews with and qualitative data collection from focus groups. Based on the results of qualitative analysis and related literature, the PSpWBS consisting of 25 questions was established. Reliability and validity tests were performed on 177 subjects. Four domains of the PSpWBS were devised: physician's characteristics; medical practice challenges; response to changes; and overall well-being. The explainable total variance was 65.65%. Cronbach α was 0.864 when the internal consistency of the whole scale was calculated. Factor analysis showed that the internal consistency Cronbach α value for each factor was between 0.625 and 0.794 and the split-half reliability was 0.865. The scale has satisfactory reliability and validity and could serve as the basis for assessment of the spiritual well-being of a physician.

  1. Test and validation of the iterative code for the neutrons spectrometry and dosimetry: NSDUAZ

    International Nuclear Information System (INIS)

    Reyes H, A.; Ortiz R, J. M.; Reyes A, A.; Castaneda M, R.; Solis S, L. O.; Vega C, H. R.

    2014-08-01

    In this work was realized the test and validation of an iterative code for neutronic spectrometry known as Neutron Spectrometry and Dosimetry of the Universidad Autonoma de Zacatecas (NSDUAZ). This code was designed in a user graph interface, friendly and intuitive in the environment programming of LabVIEW using the iterative algorithm known as SPUNIT. The main characteristics of the program are: the automatic selection of the initial spectrum starting from the neutrons spectra catalog compiled by the International Atomic Energy Agency, the possibility to generate a report in HTML format that shows in graph and numeric way the neutrons flowing and calculates the ambient dose equivalent with base to this. To prove the designed code, the count rates of a spectrometer system of Bonner spheres were used with a detector of 6 LiI(Eu) with 7 polyethylene spheres with diameter of 0, 2, 3, 5, 8, 10 and 12. The count rates measured with two neutron sources: 252 Cf and 239 PuBe were used to validate the code, the obtained results were compared against those obtained using the BUNKIUT code. We find that the reconstructed spectra present an error that is inside the limit reported in the literature that oscillates around 15%. Therefore, it was concluded that the designed code presents similar results to those techniques used at the present time. (Author)

  2. Reliability and convergent validity of the five-step test in people with chronic stroke.

    Science.gov (United States)

    Ng, Shamay S M; Tse, Mimi M Y; Tam, Eric W C; Lai, Cynthia Y Y

    2018-01-10

    (i) To estimate the intra-rater, inter-rater and test-retest reliabilities of the Five-Step Test (FST), as well as the minimum detectable change in FST completion times in people with stroke. (ii) To estimate the convergent validity of the FST with other measures of stroke-specific impairments. (iii) To identify the best cut-off times for distinguishing FST performance in people with stroke from that of healthy older adults. A cross-sectional study. University-based rehabilitation centre. Forty-eight people with stroke and 39 healthy controls. None. The FST, along with (for the stroke survivors only) scores on the Fugl-Meyer Lower Extremity Assessment (FMA-LE), the Berg Balance Scale (BBS), Limits of Stability (LOS) tests, and Activities-specific Balance Confidence (ABC) scale were tested. The FST showed excellent intra-rater (intra-class correlation coefficient; ICC = 0.866-0.905), inter-rater (ICC = 0.998), and test-retest (ICC = 0.838-0.842) reliabilities. A minimum detectable change of 9.16 s was found for the FST in people with stroke. The FST correlated significantly with the FMA-LE, BBS, and LOS results in the forward and sideways directions (r = -0.411 to -0.716, p people with stroke and healthy older adults. The FST is a reliable, easy-to-administer clinical test for assessing stroke survivors' ability to negotiate steps and stairs.

  3. Development and validation status of the IFMIF High Flux Test Module

    International Nuclear Information System (INIS)

    Arbeiter, Frederik; Abou-Sena, Ali; Chen Yuming; Dolensky, Bernhard; Heupel, Tobias; Klein, Christine; Scheel, Nicola; Schlindwein, Georg

    2011-01-01

    The development of the IFMIF (International Fusion Material Irradiation Facility) High Flux Test Module in the EVEDA (Engineering Validation and Engineering Design Activities) phase up to 2013 includes conceptual design, engineering analyses, as well as design and engineering validation by building of prototypes and their testing. The High Flux Test Module is the device to facilitate the irradiation of SSTT samples of RAFM steels at temperatures 250-550 deg. C and up to an accumulated irradiation damage of 150 dpa. The requirements, the current design and the performance of the module are discussed, and the development process is outlined.

  4. Development and validation status of the IFMIF High Flux Test Module

    Energy Technology Data Exchange (ETDEWEB)

    Arbeiter, Frederik, E-mail: frederik.arbeiter@kit.edu [Karlsruhe Institute of Technology, Institute for Neutron Physics and Reactor Technology (KIT-INR), Karlsruhe (Germany); Abou-Sena, Ali; Chen Yuming; Dolensky, Bernhard; Heupel, Tobias; Klein, Christine; Scheel, Nicola; Schlindwein, Georg [Karlsruhe Institute of Technology, Institute for Neutron Physics and Reactor Technology (KIT-INR), Karlsruhe (Germany)

    2011-10-15

    The development of the IFMIF (International Fusion Material Irradiation Facility) High Flux Test Module in the EVEDA (Engineering Validation and Engineering Design Activities) phase up to 2013 includes conceptual design, engineering analyses, as well as design and engineering validation by building of prototypes and their testing. The High Flux Test Module is the device to facilitate the irradiation of SSTT samples of RAFM steels at temperatures 250-550 deg. C and up to an accumulated irradiation damage of 150 dpa. The requirements, the current design and the performance of the module are discussed, and the development process is outlined.

  5. Validation of a Standardized Multiple-Choice Multicultural Competence Test: Implications for Training, Assessment, and Practice

    Science.gov (United States)

    Gillem, Angela R.; Bartoli, Eleonora; Bertsch, Kristin N.; McCarthy, Maureen A.; Constant, Kerra; Marrero-Meisky, Sheila; Robbins, Steven J.; Bellamy, Scarlett

    2016-01-01

    The Multicultural Counseling and Psychotherapy Test (MCPT), a measure of multicultural counseling competence (MCC), was validated in 2 phases. In Phase 1, the authors administered 451 test items derived from multicultural guidelines in counseling and psychology to 32 multicultural experts and 30 nonexperts. In Phase 2, the authors administered the…

  6. Assessment of Prospective Memory – a Validity Study of Memory for Intentions Screening Test

    NARCIS (Netherlands)

    Bezdicek, O.; Raskin, S.A.; Altgassen, A.M.; Ruzicka, E.

    2014-01-01

    Aim: The goal of the present study was to validate the Czech version of the Memory for Intentions (Screening) Test (MIST, 2010). We included standardized testing material, translation of administration and scoring, and assessment of normative data for the MIST in the Czech population. Introduction:

  7. Reliability and validity of the rey visual design learning test in primary school children

    NARCIS (Netherlands)

    Wilhelm, P.

    2004-01-01

    The Rey Visual Design Learning Test (Rey, 1964, in Spreen & Strauss, 1991) assesses immediate memory span, new learning and recognition for non-verbal material. Three studies are presented that focused on the reliability and validity of the RVDLT in primary school children. Test-retest reliability

  8. Multilevel Assessment of the Predictive Validity of Teacher Made Tests in the Zimbabwean Primary Education Sector

    Science.gov (United States)

    Machingambi, Zadzisai

    2017-01-01

    The principal focus of this study was to undertake a multilevel assessment of the predictive validity of teacher made tests in the Zimbabwean primary education sector. A correlational research design was adopted for the study, mainly to allow for statistical treatment of data and subsequent classical hypotheses testing using the spearman's rho.…

  9. Review of seismic tests for qualification of components and validation of methods

    International Nuclear Information System (INIS)

    Buland, P.; Gantenbein, F.; Gibert, R.J.; Hoffmann, A.; Queval, J.C.

    1988-01-01

    Seismic tests are performed in CEA-DEMT since many years in order: to demonstrate the qualification of components, to give an experimental validation of calculation methods used for seismic design of components. The paper presents examples of these two types of tests, a description of the existing facilities and details about the new facility TAMARIS under construction. (author)

  10. Software Verification and Validation Test Report for the HEPA filter Differential Pressure Fan Interlock System

    International Nuclear Information System (INIS)

    ERMI, A.M.

    2000-01-01

    The HEPA Filter Differential Pressure Fan Interlock System PLC ladder logic software was tested using a Software Verification and Validation (VandV) Test Plan as required by the ''Computer Software Quality Assurance Requirements''. The purpose of his document is to report on the results of the software qualification

  11. Design and Validation of a Straight-Copy Typewriting Prognostic Test Using Kinesthetic Sensitivity.

    Science.gov (United States)

    Olson, Norma Jean

    1979-01-01

    Describes the development and application of a kinesthetic sensitivity test to determine whether it is a valid and reliable measure of straight-copy typing speed and accuracy. The author states that this kinesthetic sensitivity instrument may be used as a prognostic aptitude test and recommends administration methods. (MF)

  12. Measuring right-hemisphere dysfunction in children: validity of two new computer tests

    NARCIS (Netherlands)

    Sips, H.J.W.A.; C.E. Catsman-Berrevoets (Coriene); H.R. van Dongen (Huug); van der Werff, P.J.J.; Brooke, L.J.

    1994-01-01

    textabstractThe validity of two new computer‐mediated tests for the detection of right‐cerebral hemisphere lesions in children–the Right‐hemisphere Dysfunction Test and the Visual Perception Test–was evaluated. Normative data were drawn from a group of 91 children (aged five to 14 years) and 14

  13. Review of seismic tests for qualification of components and validation of methods

    Energy Technology Data Exchange (ETDEWEB)

    Buland, P; Gantenbein, F; Gibert, R J; Hoffmann, A; Queval, J C [CEA-CEN SACLAY-DEMT, Gif sur Yvette-Cedex (France)

    1988-07-01

    Seismic tests are performed in CEA-DEMT since many years in order: to demonstrate the qualification of components, to give an experimental validation of calculation methods used for seismic design of components. The paper presents examples of these two types of tests, a description of the existing facilities and details about the new facility TAMARIS under construction. (author)

  14. Validation of a Video-based Game-Understanding Test Procedure in Badminton.

    Science.gov (United States)

    Blomqvist, Minna T.; Luhtanen, Pekka; Laakso, Lauri; Keskinen, Esko

    2000-01-01

    Reports the development and validation of video-based game-understanding tests in badminton for elementary and secondary students. The tests included different sequences that simulated actual game situations. Players had to solve tactical problems by selecting appropriate solutions and arguments for their decisions. Results suggest that the test…

  15. Assessment of Advanced Life Support competence when combining different test methods--reliability and validity

    DEFF Research Database (Denmark)

    Ringsted, C; Lippert, F; Hesselfeldt, R

    2007-01-01

    Cardiac Arrest Simulation Test (CASTest) scenarios for the assessments according to guidelines 2005. AIMS: To analyse the reliability and validity of the individual sub-tests provided by ERC and to find a combination of MCQ and CASTest that provides a reliable and valid single effect measure of ALS...... that possessed high reliability, equality of test sets, and ability to discriminate between the two groups of supposedly different ALS competence. CONCLUSIONS: ERC sub-tests of ALS competence possess sufficient reliability and validity. A combined ALS score with equal weighting of one MCQ and one CASTest can...... competence. METHODS: Two groups of participants were included in this randomised, controlled experimental study: a group of newly graduated doctors, who had not taken the ALS course (N=17) and a group of students, who had passed the ALS course 9 months before the study (N=16). Reliability in terms of inter...

  16. The Validity of Value-Added Estimates from Low-Stakes Testing Contexts: The Impact of Change in Test-Taking Motivation and Test Consequences

    Science.gov (United States)

    Finney, Sara J.; Sundre, Donna L.; Swain, Matthew S.; Williams, Laura M.

    2016-01-01

    Accountability mandates often prompt assessment of student learning gains (e.g., value-added estimates) via achievement tests. The validity of these estimates have been questioned when performance on tests is low stakes for students. To assess the effects of motivation on value-added estimates, we assigned students to one of three test consequence…

  17. Some Findings from Thermal-Hydraulic Validation Tests for SMART Passive Safety System

    Energy Technology Data Exchange (ETDEWEB)

    Park, Hyun Sik; Bae, Hwang; Ryu, Sung-Uk; Ryu, Hyobong; Shin, Yong-Cheol; Min, Kyoung-Ho; Yi, Sung-Jae [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

    2014-10-15

    To satisfy the domestic and international needs for nuclear safety improvement after the Fukushima accident, an effort to improve its safety has been studied, and a Passive Safety System (PSS) for SMART has been designed. In addition, an Integral Test Loop for the SMART design (SMART-ITL, or FESTA) has been constructed and it finished its commissioning tests in 2012. Consequently, a set of Design Base Accident (DBA) scenarios have been simulated using SMARTITL. Recently, a test program to validate the performance of the SMART PSS was launched and its scaled-down test facility was additionally installed at the existing SMART-ITL facility. In this paper, some findings from the validation tests for the SMART PSS will be summarized. The acquired data will be used to validate the safety analysis code and its related models, to evaluate the performance of SMART PSS, and to provide base data during the application phase of SDA revision and construction licensing. A test program to validate the performance of SMARS PSS was launched with an additional scaleddown test facility of SMART PSS, which will be installed at the existing SMART-ITL facility. In this paper, some findings from the validation tests of the SMART passive safety system during 2013-2014 were summarized. They include a couple of SMART PSS tests using active pumps and several 1-train SMART PSS tests. From the test results it was estimated that the SMART PSS has sufficient cooling capability to deal with the SBLOCA scenario of SMART. During the SBLOCA scenario, in the CMT the water layer inventory was well stratified thermally and the safety injection water was injected efficiently into the RPV from the initial period and cools down the RCS properly.

  18. Exploring the reliability and validity of the social-moral awareness test.

    Science.gov (United States)

    Livesey, Alexandra; Dodd, Karen; Pote, Helen; Marlow, Elizabeth

    2012-11-01

    The aim of the study was to explore the validity of the social-moral awareness test (SMAT) a measure designed for assessing socio-moral rule knowledge and reasoning in people with learning disabilities. Comparisons between Theory of Mind and socio-moral reasoning allowed the exploration of construct validity of the tool. Factor structure, reliability and discriminant validity were also assessed. Seventy-one participants with mild-moderate learning disabilities completed the two scales of the SMAT and two False Belief Tasks for Theory of Mind. Reliability of the SMAT was very good, and the scales were shown to be uni-dimensional in factor structure. There was a significant positive relationship between Theory of Mind and both SMAT scales. There is early evidence of the construct validity and reliability of the SMAT. Further assessment of the validity of the SMAT will be required. © 2012 Blackwell Publishing Ltd.

  19. Test your memory-Turkish version (TYM-TR): reliability and validity study of a cognitive screening test.

    Science.gov (United States)

    Maviş, Ilknur; Özbabalik Adapinar, Belgin Demet; Yenilmez, Çinar; Aydin, Ayşe; Olgun, Engin; Bal, Cengiz

    2015-01-01

    The test your memory (TYM) is reported to be a sensitive cognitive function assessment scale for people with dementia. The aim of the present study was to investigate the reliability and validity of an adapted Turkish version of the TYM (TYM-TR) among Turkish dementia patients. The TYM-TR was given to 59 patients with dementia aged 60+ and 336 normal controls aged 23-75+. The diagnostic utility of the TYM-TR was compared with that of the mini-mental state examination (MMSE) to validate it. The internal consistency of the TYM-TR was a = 0.85. The test-retest reliability was 0.97 (P reliability and validity to distinguish dementia in the Turkish population.

  20. [COOP/WONCA: Reliability and validity of the test administered by telephone].

    Science.gov (United States)

    Pedrero-Pérez, Eduardo J; Díaz-Olalla, José Manuel

    2016-01-01

    The COOP/WONCA test was initially proposed as a self-report in which the answers were supported by drawings illustrating the state investigated. Subsequent studies have confirmed its usefulness as a mere verbal self-report face-to-face administered. No data have been found about its useful when administered by telephone interview. The aim of this study was to determine the psychometric properties of the COOP / WONCA test to measure Related Quality of Life (HRQoL) administered by telephone and compare them with those obtained in other forms of prior administration. Cross-sectional study on a random. City of Madrid. Random sample of 802 adult subjects, representative of the adult population in Madrid, obtained by stratification from the population census. Questionnaire COOP/WONCA with 9 ítems included in a broader battery, administered by telephone interview. The unrestricted factor analysis points to the unifactoriality of the scale, which measures a single latent construct (HRQOL), showing high internal consistency, not significantly different from those found by face-to-face administration, ruling out the existence of biases in the phone modality. The COOP/WONCA test appears as a reliable and valid measure of HRQOL and telephonic administration allows to assume no changes in the results, which can reduce costs in population studies, increasing efficiency without loss of quality in the information collected. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.

  1. Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.

    Science.gov (United States)

    Sawers, Andrew; Hafner, Brian

    2018-04-11

    To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  2. Publishing nutrition research: validity, reliability, and diagnostic test assessment in nutrition-related research.

    Science.gov (United States)

    Gleason, Philip M; Harris, Jeffrey; Sheean, Patricia M; Boushey, Carol J; Bruemmer, Barbara

    2010-03-01

    This is the sixth in a series of monographs on research design and analysis. The purpose of this article is to describe and discuss several concepts related to the measurement of nutrition-related characteristics and outcomes, including validity, reliability, and diagnostic tests. The article reviews the methodologic issues related to capturing the various aspects of a given nutrition measure's reliability, including test-retest, inter-item, and interobserver or inter-rater reliability. Similarly, it covers content validity, indicators of absolute vs relative validity, and internal vs external validity. With respect to diagnostic assessment, the article summarizes the concepts of sensitivity and specificity. The hope is that dietetics practitioners will be able to both use high-quality measures of nutrition concepts in their research and recognize these measures in research completed by others. Copyright 2010 American Dietetic Association. Published by Elsevier Inc. All rights reserved.

  3. Performance Validity Testing in Neuropsychology: Methods for Measurement Development and Maximizing Diagnostic Accuracy.

    Science.gov (United States)

    Wodushek, Thomas R; Greher, Michael R

    2017-05-01

    In the first column in this 2-part series, Performance Validity Testing in Neuropsychology: Scientific Basis and Clinical Application-A Brief Review, the authors introduced performance validity tests (PVTs) and their function, provided a justification for why they are necessary, traced their ongoing endorsement by neuropsychological organizations, and described how they are used and interpreted by ever increasing numbers of clinical neuropsychologists. To enhance readers' understanding of these measures, this second column briefly describes common detection strategies used in PVTs as well as the typical methods used to validate new PVTs and determine cut scores for valid/invalid determinations. We provide a discussion of the latest research demonstrating how neuropsychologists can combine multiple PVTs in a single battery to improve sensitivity/specificity to invalid responding. Finally, we discuss future directions for the research and application of PVTs.

  4. Cross-cultural validation and psychometric testing of the Norwegian version of the TeamSTEPPS® teamwork perceptions questionnaire.

    Science.gov (United States)

    Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise

    2017-12-02

    Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test - retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ 2 (df) 969.46 (546), p teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.

  5. Symptom validity testing in memory clinics: Hippocampal-memory associations and relevance for diagnosing mild cognitive impairment.

    Science.gov (United States)

    Rienstra, Anne; Groot, Paul F C; Spaan, Pauline E J; Majoie, Charles B L M; Nederveen, Aart J; Walstra, Gerard J M; de Jonghe, Jos F M; van Gool, Willem A; Olabarriaga, Silvia D; Korkhov, Vladimir V; Schmand, Ben

    2013-01-01

    Patients with mild cognitive impairment (MCI) do not always convert to dementia. In such cases, abnormal neuropsychological test results may not validly reflect cognitive symptoms due to brain disease, and the usual brain-behavior relationships may be absent. This study examined symptom validity in a memory clinic sample and its effect on the associations between hippocampal volume and memory performance. Eleven of 170 consecutive patients (6.5%; 13% of patients younger than 65 years) referred to memory clinics showed noncredible performance on symptom validity tests (SVTs, viz. Word Memory Test and Test of Memory Malingering). They were compared to a demographically matched group (n = 57) selected from the remaining patients. Hippocampal volume, measured by an automated volumetric method (Freesurfer), was correlated with scores on six verbal memory tests. The median correlation was r = .49 in the matched group. However, the relation was absent (median r = -.11) in patients who failed SVTs. Memory clinic samples may include patients who show noncredible performance, which invalidates their MCI diagnosis. This underscores the importance of applying SVTs in evaluating patients with cognitive complaints that may signify a predementia stage, especially when these patients are relatively young.

  6. Water evaporation over sump surface in nuclear containment studies: CFD and LP codes validation on TOSQAN tests

    Energy Technology Data Exchange (ETDEWEB)

    Malet, J., E-mail: jeanne.malet@irsn.fr [Institut de Radioprotection et de Sûreté Nucléaire (IRSN), PSN-RES/SCA BP 68, 91192 Gif-sur-Yvette (France); Degrees du Lou, O. [Institut de Radioprotection et de Sûreté Nucléaire (IRSN), PSN-RES/SCA BP 68, 91192 Gif-sur-Yvette (France); Arts et Métiers ParisTech, DynFluid Lab. EA92, 151, boulevard de l’Hôpital, 75013 Paris (France); Gelain, T. [Institut de Radioprotection et de Sûreté Nucléaire (IRSN), PSN-RES/SCA BP 68, 91192 Gif-sur-Yvette (France)

    2013-10-15

    Highlights: • Simulations of evaporative TOSQAN sump tests are performed. • These tests are under air–steam gas conditions with addition of He, CO{sub 2} and SF{sub 6}. • ASTEC-CPA LP and TONUS-CFD codes with UDF for sump model are used. • Validation of sump models of both codes show good results. • The code–experiment differences are attributed to turbulent gas mixing modeling. -- Abstract: During the course of a severe accident in a Nuclear Power Plant, water can be collected in the sump containment through steam condensation on walls and spray systems activation. The objective of this paper is to present code validation on evaporative sump tests performed on TOSQAN facility. The ASTEC-CPA code is used as a lumped-parameter code and specific user-defined-functions are developed for the TONUS-CFD code. The seven tests are air–steam tests, as well as tests with other non-condensable gases (He, CO{sub 2} and SF{sub 6}) under steady and transient conditions (two depressurization tests). The results show a good agreement between codes and experiments, indicating a good behavior of the sump models in both codes. The sump model developed as User-Defined Functions (UDF) for TONUS is considered as well validated and is ‘ready-to-use’ for all CFD codes in which such UDF can be added. The remaining discrepancies between codes and experiments are caused by turbulent transport and gas mixing, especially in the presence of non-condensable gases other than air, so that code validation on this important topic for hydrogen safety analysis is still recommended.

  7. On the limits of effort testing: symptom validity tests and severity of neurocognitive symptoms in nonlitigant patients

    NARCIS (Netherlands)

    Merten, Thomas; Bossink, Linda; Schmand, Ben

    2007-01-01

    Modern symptom validity tests (SVTs) use empirical cutoffs for decision making. However, limits to the applicability of these cutoffs may arise when severe cognitive symptoms are present. The purpose of the studies presented here was to explore these limits of applicability. In Experiment 1, a group

  8. Content Validity Index and Intra- and Inter-Rater Reliability of a New Muscle Strength/Endurance Test Battery for Swedish Soldiers.

    Directory of Open Access Journals (Sweden)

    Helena Larsson

    Full Text Available The objective of this study was to examine the content validity of commonly used muscle performance tests in military personnel and to investigate the reliability of a proposed test battery. For the content validity investigation, thirty selected tests were those described in the literature and/or commonly used in the Nordic and North Atlantic Treaty Organization (NATO countries. Nine selected experts rated, on a four-point Likert scale, the relevance of these tests in relation to five different work tasks: lifting, carrying equipment on the body or in the hands, climbing, and digging. Thereafter, a content validity index (CVI was calculated for each work task. The result showed excellent CVI (≥0.78 for sixteen tests, which comprised of one or more of the military work tasks. Three of the tests; the functional lower-limb loading test (the Ranger test, dead-lift with kettlebells, and back extension, showed excellent content validity for four of the work tasks. For the development of a new muscle strength/endurance test battery, these three tests were further supplemented with two other tests, namely, the chins and side-bridge test. The inter-rater reliability was high (intraclass correlation coefficient, ICC2,1 0.99 for all five tests. The intra-rater reliability was good to high (ICC3,1 0.82-0.96 with an acceptable standard error of mean (SEM, except for the side-bridge test (SEM%>15. Thus, the final suggested test battery for a valid and reliable evaluation of soldiers' muscle performance comprised the following four tests; the Ranger test, dead-lift with kettlebells, chins, and back extension test. The criterion-related validity of the test battery should be further evaluated for soldiers exposed to varying physical workload.

  9. The speed of memory errors shows the influence of misleading information: Testing the diffusion model and discrete-state models.

    Science.gov (United States)

    Starns, Jeffrey J; Dubé, Chad; Frelinger, Matthew E

    2018-05-01

    In this report, we evaluate single-item and forced-choice recognition memory for the same items and use the resulting accuracy and reaction time data to test the predictions of discrete-state and continuous models. For the single-item trials, participants saw a word and indicated whether or not it was studied on a previous list. The forced-choice trials had one studied and one non-studied word that both appeared in the earlier single-item trials and both received the same response. Thus, forced-choice trials always had one word with a previous correct response and one with a previous error. Participants were asked to select the studied word regardless of whether they previously called both words "studied" or "not studied." The diffusion model predicts that forced-choice accuracy should be lower when the word with a previous error had a fast versus a slow single-item RT, because fast errors are associated with more compelling misleading memory retrieval. The two-high-threshold (2HT) model does not share this prediction because all errors are guesses, so error RT is not related to memory strength. A low-threshold version of the discrete state approach predicts an effect similar to the diffusion model, because errors are a mixture of responses based on misleading retrieval and guesses, and the guesses should tend to be slower. Results showed that faster single-trial errors were associated with lower forced-choice accuracy, as predicted by the diffusion and low-threshold models. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

    Science.gov (United States)

    Mills, Tamara L; Holm, Margo B; Schmeler, Mark

    2007-01-01

    The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.

  11. Validation of new prognostic and predictive scores by sequential testing approach

    International Nuclear Information System (INIS)

    Nieder, Carsten; Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid

    2010-01-01

    Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)

  12. Validation of new prognostic and predictive scores by sequential testing approach

    Energy Technology Data Exchange (ETDEWEB)

    Nieder, Carsten [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway); Inst. of Clinical Medicine, Univ. of Tromso (Norway); Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway)

    2010-03-15

    Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)

  13. Reliability and criterion-related validity testing (construct) of the Endotracheal Suction Assessment Tool (ESAT©).

    Science.gov (United States)

    Davies, Kylie; Bulsara, Max K; Ramelet, Anne-Sylvie; Monterosso, Leanne

    2018-05-01

    To establish criterion-related construct validity and test-retest reliability for the Endotracheal Suction Assessment Tool© (ESAT©). Endotracheal tube suction performed in children can significantly affect clinical stability. Previously identified clinical indicators for endotracheal tube suction were used as criteria when designing the ESAT©. Content validity was reported previously. The final stages of psychometric testing are presented. Observational testing was used to measure construct validity and determine whether the ESAT© could guide "inexperienced" paediatric intensive care nurses' decision-making regarding endotracheal tube suction. Test-retest reliability of the ESAT© was performed at two time points. The researchers and paediatric intensive care nurse "experts" developed 10 hypothetical clinical scenarios with predetermined endotracheal tube suction outcomes. "Experienced" (n = 12) and "inexperienced" (n = 14) paediatric intensive care nurses were presented with the scenarios and the ESAT© guiding decision-making about whether to perform endotracheal tube suction for each scenario. Outcomes were compared with those predetermined by the "experts" (n = 9). Test-retest reliability of the ESAT© was measured at two consecutive time points (4 weeks apart) with "experienced" and "inexperienced" paediatric intensive care nurses using the same scenarios and tool to guide decision-making. No differences were observed between endotracheal tube suction decisions made by "experts" (n = 9), "inexperienced" (n = 14) and "experienced" (n = 12) nurses confirming the tool's construct validity. No differences were observed between groups for endotracheal tube suction decisions at T1 and T2. Criterion-related construct validity and test-retest reliability of the ESAT© were demonstrated. Further testing is recommended to confirm reliability in the clinical setting with the "inexperienced" nurse to guide decision-making related to endotracheal tube

  14. Screening for cognitive impairment in older individuals. Validation study of a computer-based test.

    Science.gov (United States)

    Green, R C; Green, J; Harrison, J M; Kutner, M H

    1994-08-01

    This study examined the validity of a computer-based cognitive test that was recently designed to screen the elderly for cognitive impairment. Criterion-related validity was examined by comparing test scores of impaired patients and normal control subjects. Construct-related validity was computed through correlations between computer-based subtests and related conventional neuropsychological subtests. University center for memory disorders. Fifty-two patients with mild cognitive impairment by strict clinical criteria and 50 unimpaired, age- and education-matched control subjects. Control subjects were rigorously screened by neurological, neuropsychological, imaging, and electrophysiological criteria to identify and exclude individuals with occult abnormalities. Using a cut-off total score of 126, this computer-based instrument had a sensitivity of 0.83 and a specificity of 0.96. Using a prevalence estimate of 10%, predictive values, positive and negative, were 0.70 and 0.96, respectively. Computer-based subtests correlated significantly with conventional neuropsychological tests measuring similar cognitive domains. Thirteen (17.8%) of 73 volunteers with normal medical histories were excluded from the control group, with unsuspected abnormalities on standard neuropsychological tests, electroencephalograms, or magnetic resonance imaging scans. Computer-based testing is a valid screening methodology for the detection of mild cognitive impairment in the elderly, although this particular test has important limitations. Broader applications of computer-based testing will require extensive population-based validation. Future studies should recognize that normal control subjects without a history of disease who are typically used in validation studies may have a high incidence of unsuspected abnormalities on neurodiagnostic studies.

  15. Reliability and Validity of the Turkish Language Version of the Test of Performance Strategies

    Directory of Open Access Journals (Sweden)

    Miçooğulları Bülent Okan

    2017-03-01

    Full Text Available The aim of the present study was to examine the psychometric properties of the Test of Performance Strategies (TOPS; Thomas et al., 1999 on the Turkish population. The TOPS was designed to assess eight psychological skills and strategies used by athletes in competition (activation, automaticity, emotional control, goal-setting, imagery, relaxation, self-talk, and negative thinking and the same strategies, except negative thinking is replaced by attentional control used in training. The sample of the study included athletes who were training and competing in a wide variety of sports across a broad range of performance standards. The final sample consisted of 433 males (mean ± s: age 22.47 ± 5.30 years and 187 females (mean ± s: age 20.97 ± 4.78 years, 620 athletes in total (mean ± s: age 21.25 ± 4.87 years who voluntarily participated; TOPS was administered to all participants. Afterward, Confirmatory Factor Analysis (CFA was conducted by Analysis Moments of Structures (AMOS 18. Comparative fit index (CFI, non-normed fit index (NNFI and root mean square error of approximation (RMSEA were used to verify whether the model fit the data. Goodness-of-fit statistics were CFI= .91, NNFI= .92 and RMSEA= .056. These values showed that the tested model is coherent at a satisfactory level. Moreover, results of confirmatory factor analyses revealed that a total of four items (two items from competition and two from practice within the subscale of automaticity have been removed. The 28 items within the remaining seven subscales have been validated. In conclusion, Turkish version of TOPS is a valid and reliable instrument to assess the psychological skills and strategies used by athletes in competition and practices.

  16. Simulation tests for cervical nonorganic signs: a study of face validity.

    Science.gov (United States)

    Vernon, Howard; Proctor, Dan; Bakalovski, Dianna; Moreton, Jesse

    2010-01-01

    The purpose of this study was to develop and determine the face validity of additional cervical nonorganic simulation tests. Four simulation tests were either selected from the literature or newly designed: simulated sitting trunk/shoulder rotation (SR; test no. 1), active vs passive cervical rotation (CR; test no. 2), Libman's test (LT; test no. 3) of pressure over the mastoid process, and side-lying passive shoulder abduction (SA; test no. 4). Three groups, 1 without neck pain (n = 44) and 2 with neck pain (n = 43 and 27), were formed. Outcome measures consisted of questions on provocation of pain (Yes/No) and appropriateness (Yes/No) as well as measurements of cervical rotation (goniometric) and pressure pain threshold (pressure algometer). Group test responses were evaluated and scored. A threshold of acceptance was established at 80% agreement for face validity. Ranges of rotation and pressure threshold values were analyzed with the Student t test. In nonneck pain subjects, all 4 tests were rated as nonpainful and 3 were rated as "appropriate" for neck pain examination (not SR). In neck pain subjects, this test and SA were rated as nonpainful, whereas LT was rated as painful in 26% of subjects. Only CR and LT were rated as "appropriate." In neck pain subjects, passive rotations exceeded actives by 10% to 14% (P = .000). On a second round of testing with a slightly modified method, SR and SA achieved acceptable "appropriateness." Once 2 tests were slightly modified, all 4 tests were found to have acceptable face validity. Further research into the reliability of these tests as well as into the combinations of these tests is warranted. Copyright 2010 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.

  17. Numerical Simulation and Experimental Validation of the Inflation Test of Latex Balloons

    Directory of Open Access Journals (Sweden)

    Claudio Bustos

    Full Text Available Abstract Experiments and modeling aimed at assessing the mechanical response of latex balloons in the inflation test are presented. To this end, the hyperelastic Yeoh material model is firstly characterized via tensile test and, then, used to numerically simulate via finite elements the stress-strain evolution during the inflation test. The numerical pressure-displacement curves are validated with those obtained experimentally. Moreover, this analysis is extended to a biomedical problem of an eyeball under glaucoma conditions.

  18. Validation of the Arabic Version of the Group Personality Projective Test among university students in Bahrain.

    Science.gov (United States)

    Al-Musawi, Nu'man M

    2003-04-01

    Using confirmatory factor analytic techniques on data generated from 200 students enrolled at the University of Bahrain, we obtained some construct validity and reliability data for the Arabic Version of the 1961 Group Personality Projective Test by Cassel and Khan. In contrast to the 5-factor model proposed for the Group Personality Projective Test, a 6-factor solution appeared justified for the Arabic Version of this test, suggesting some variance between the cultural groups in the United States and in Bahrain.

  19. Numerical Simulation and Experimental Validation of the Inflation Test of Latex Balloons

    OpenAIRE

    Bustos, Claudio; Herrera, Claudio García; Celentano, Diego; Chen, Daming; Cruchaga, Marcela

    2016-01-01

    Abstract Experiments and modeling aimed at assessing the mechanical response of latex balloons in the inflation test are presented. To this end, the hyperelastic Yeoh material model is firstly characterized via tensile test and, then, used to numerically simulate via finite elements the stress-strain evolution during the inflation test. The numerical pressure-displacement curves are validated with those obtained experimentally. Moreover, this analysis is extended to a biomedical problem of an...

  20. The bogus taste test: Validity as a measure of laboratory food intake

    OpenAIRE

    Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A.; Kemps, Eva; Higgs, Suzanne; Jones, Andrew

    2017-01-01

    Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The ?bogus? taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food inta...

  1. Intra-tester Reliability and Construct Validity of a Hip Abductor Eccentric Strength Test.

    Science.gov (United States)

    Brindle, Richard A; Ebaugh, D David; Milner, Clare E

    2017-11-15

    Side-lying hip abductor strength tests are commonly used to evaluate muscle strength. In a 'break' test the tester applies sufficient force to lower the limb to the table while the patient resists. The peak force is postulated to occur while the leg is lowering, thus representing the participant's eccentric muscle strength. However, it is unclear whether peak force occurs before or after the leg begins to lower. To determine intra-rater reliability and construct validity of a hip abductor eccentric strength test. Intra-rater reliability and construct validity study. Twenty healthy adults (26 ±6 years; 1.66 ±0.06 m; 62.2 ±8.0 kg) made two visits to the laboratory at least one week apart. During the hip abductor eccentric strength test, a hand-held dynamometer recorded peak force and time to peak force and limb position was recorded via a motion capture system. Intra-rater reliability was determined using intra-class correlation (ICC), standard error of measurement (SEM), and minimal detectable difference (MDD). Construct validity was assessed by determining if peak force occurred after the start of the lowering phase using a one-sample t-test. The hip abductor eccentric strength test had substantial intra-rater reliability (ICC( 3,3 ) = 0.88; 95% confidence interval: 0.65-0.95), SEM of 0.9%BWh, and a MDD of 2.5%BWh. Construct validity was established as peak force occurred 2.1s (±0.6s; range 0.7s to 3.7s) after the start of the lowering phase of the test (p ≤ 0.001). The hip abductor eccentric strength test is a valid and reliable measure of eccentric muscle strength. This test may be used clinically to assess changes in eccentric muscle strength over time.

  2. Development and content validity of a screening instrument for gaming addiction in adolescents: the Gaming Addiction Identification Test (GAIT).

    Science.gov (United States)

    Vadlin, Sofia; Åslund, Cecilia; Nilsson, Kent W

    2015-08-01

    This study describes the development of a screening tool for gaming addiction in adolescents - the Gaming Addiction Identification Test (GAIT). Its development was based on the research literature on gaming and addiction. An expert panel comprising professional raters (n = 7), experiential adolescent raters (n = 10), and parent raters (n = 10) estimated the content validity of each item (I-CVI) as well as of the whole scale (S-CVI/Ave), and participated in a cognitive interview about the GAIT scale. The mean scores for both I-CVI and S-CVI/Ave ranged between 0.97 and 0.99 compared with the lowest recommended I-CVI value of 0.78 and the S-CVI/Ave value of 0.90. There were no sex differences and no differences between expert groups regarding ratings in content validity. No differences in the overall evaluation of the scale emerged in the cognitive interviews. Our conclusions were that GAIT showed good content validity in capturing gaming addiction. The GAIT needs further investigation into its psychometric properties of construct validity (convergent and divergent validity) and criterion-related validity, as well as its reliability in both clinical settings and in community settings with adolescents. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.

  3. Prolonged ELS test with the marine flatfish sole (Solea solea) shows delayed toxic effects of previous exposure to PCB 126

    NARCIS (Netherlands)

    Foekema, E.M.; Deerenberg, C.M.; Murk, A.J.

    2008-01-01

    The effect of the dioxin-like PCB 126 (3,3¿,4,4¿,5-pentachlorobiphenyl) on the early development of the marine flatfish sole (Solea solea) was tested in a newly developed early life stage (ELS) test that includes the metamorphosis of the symmetric larvae into an asymmetrical flatfish. Early life

  4. S.E.T., CSNI Separate Effects Test Facility Validation Matrix

    International Nuclear Information System (INIS)

    1997-01-01

    1 - Description of test facility: The SET matrix of experiments is suitable for the developmental assessment of thermal-hydraulics transient system computer codes by selecting individual tests from selected facilities, relevant to each phenomena. Test facilities differ from one another in geometrical dimensions, geometrical configuration and operating capabilities or conditions. Correlation between SET facility and phenomena were calculated on the basis of suitability for model validation (which means that a facility is designed in such a way as to stimulate the phenomena assumed to occur in a plant and is sufficiently instrumented); limited suitability for model variation (which means that a facility is designed in such a way as to stimulate the phenomena assumed to occur in a plant but has problems associated with imperfect scaling, different test fluids or insufficient instrumentation); and unsuitability for model validation. 2 - Description of test: Whereas integral experiments are usually designed to follow the behaviour of a reactor system in various off-normal or accident transients, separate effects tests focus on the behaviour of a single component, or on the characteristics of one thermal-hydraulic phenomenon. The construction of a separate effects test matrix is an attempt to collect together the best sets of openly available test data for code validation, assessment and improvement, from the wide range of experiments that have been carried out world-wide in the field of thermal hydraulics. In all, 2094 tests are included in the SET matrix

  5. Victoria Symptom Validity Test performance in children and adolescents with neurological disorders.

    Science.gov (United States)

    Brooks, Brian L

    2012-12-01

    It is becoming increasingly more important to study, use, and promote the utility of measures that are designed to detect non-compliance with testing (i.e., poor effort, symptom non-validity, response bias) as part of neuropsychological assessments with children and adolescents. Several measures have evidence for use in pediatrics, but there is a paucity of published support for the Victoria Symptom Validity Test (VSVT) in this population. The purpose of this study was to examine the performance on the VSVT in a sample of pediatric patients with known neurological disorders. The sample consisted of 100 consecutively referred children and adolescents between the ages of 6 and 19 years (mean = 14.0, SD = 3.1) with various neurological diagnoses. On the VSVT total items, 95% of the sample had performance in the "valid" range, with 5% being deemed "questionable" and 0% deemed "invalid". On easy items, 97% were "valid", 2% were "questionable", and 1% was "invalid." For difficult items, 84% were "valid," 16% were "questionable," and 0% was "invalid." For those patients given two effort measures (i.e., VSVT and Test of Memory Malingering; n = 65), none was identified as having poor test-taking compliance on both measures. VSVT scores were significantly correlated with age, intelligence, processing speed, and functional ratings of daily abilities (attention, executive functioning, and adaptive functioning), but not objective performance on the measure of sustained attention, verbal memory, or visual memory. The VSVT has potential to be used in neuropsychological assessments with pediatric patients.

  6. The reliability and validity of a soccer-specific nonmotorised treadmill simulation (intermittent soccer performance test).

    Science.gov (United States)

    Aldous, Jeffrey W F; Akubat, Ibrahim; Chrismas, Bryna C R; Watkins, Samuel L; Mauger, Alexis R; Midgley, Adrian W; Abt, Grant; Taylor, Lee

    2014-07-01

    This study investigated the reliability and validity of a novel nonmotorised treadmill (NMT)-based soccer simulation using a novel activity category called a "variable run" to quantify fatigue during high-speed running. Twelve male University soccer players completed 3 familiarization sessions and 1 peak speed assessment before completing the intermittent soccer performance test (iSPT) twice. The 2 iSPTs were separated by 6-10 days. The total distance, sprint distance, and high-speed running distance (HSD) were 8,968 ± 430 m, 980 ± 75 m and 2,122 ± 140 m, respectively. No significant difference (p > 0.05) was found between repeated trials of the iSPT for all physiological and performance variables. Reliability measures between iSPT1 and iSPT2 showed good agreement (coefficient of variation: 0.80). Furthermore, the variable run phase showed HSD significantly decreased (p ≤ 0.05) in the last 15 minutes (89 ± 6 m) compared with the first 15 minutes (85 ± 7 m), quantifying decrements in high-speed exercise compared with the previous literature. This study validates the iSPT as a NMT-based soccer simulation compared with the previous match-play data and is a reliable tool for assessing and monitoring physiological and performance variables in soccer players. The iSPT could be used in a number of ways including player rehabilitation, understanding the efficacy of nutritional interventions, and also the quantification of environmentally mediated decrements on soccer-specific performance.

  7. Validation of Metagenomic Next-Generation Sequencing Tests for Universal Pathogen Detection.

    Science.gov (United States)

    Schlaberg, Robert; Chiu, Charles Y; Miller, Steve; Procop, Gary W; Weinstock, George

    2017-06-01

    - Metagenomic sequencing can be used for detection of any pathogens using unbiased, shotgun next-generation sequencing (NGS), without the need for sequence-specific amplification. Proof-of-concept has been demonstrated in infectious disease outbreaks of unknown causes and in patients with suspected infections but negative results for conventional tests. Metagenomic NGS tests hold great promise to improve infectious disease diagnostics, especially in immunocompromised and critically ill patients. - To discuss challenges and provide example solutions for validating metagenomic pathogen detection tests in clinical laboratories. A summary of current regulatory requirements, largely based on prior guidance for NGS testing in constitutional genetics and oncology, is provided. - Examples from 2 separate validation studies are provided for steps from assay design, and validation of wet bench and bioinformatics protocols, to quality control and assurance. - Although laboratory and data analysis workflows are still complex, metagenomic NGS tests for infectious diseases are increasingly being validated in clinical laboratories. Many parallels exist to NGS tests in other fields. Nevertheless, specimen preparation, rapidly evolving data analysis algorithms, and incomplete reference sequence databases are idiosyncratic to the field of microbiology and often overlooked.

  8. The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

    Science.gov (United States)

    Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

    2018-04-12

    To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (Test possessed low CV and significant (pTest possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.

  9. The design organization test: further demonstration of reliability and validity as a brief measure of visuospatial ability.

    Science.gov (United States)

    Killgore, William D S; Gogel, Hannah

    2014-01-01

    Neuropsychological assessments are frequently time-consuming and fatiguing for patients. Brief screening evaluations may reduce test duration and allow more efficient use of time by permitting greater attention toward neuropsychological domains showing probable deficits. The Design Organization Test (DOT) was initially developed as a 2-min paper-and-pencil alternative for the Block Design (BD) subtest of the Wechsler scales. Although initially validated for clinical neurologic patients, we sought to further establish the reliability and validity of this test in a healthy, more diverse population. Two alternate versions of the DOT and the Wechsler Abbreviated Scale of Intelligence (WASI) were administered to 61 healthy adult participants. The DOT showed high alternate forms reliability (r = .90-.92), and the two versions yielded equivalent levels of performance. The DOT was highly correlated with BD (r = .76-.79) and was significantly correlated with all subscales of the WASI. The DOT proved useful when used in lieu of BD in the calculation of WASI IQ scores. Findings support the reliability and validity of the DOT as a measure of visuospatial ability and suggest its potential worth as an efficient estimate of intellectual functioning in situations where lengthier tests may be inappropriate or unfeasible.

  10. The development and validation of the Closed-set Mandarin Sentence (CMS) test.

    Science.gov (United States)

    Tao, Duo-Duo; Fu, Qian-Jie; Galvin, John J; Yu, Ya-Feng

    2017-09-01

    Matrix-styled sentence tests offer a closed-set paradigm that may be useful when evaluating speech intelligibility. Ideally, sentence test materials should reflect the distribution of phonemes within the target language. We developed and validated the Closed-set Mandarin Sentence (CMS) test to assess Mandarin speech intelligibility in noise. CMS test materials were selected to be familiar words and to represent the natural distribution of vowels, consonants, and lexical tones found in Mandarin Chinese. Ten key words in each of five categories (Name, Verb, Number, Color, and Fruit) were produced by a native Mandarin talker, resulting in a total of 50 words that could be combined to produce 100,000 unique sentences. Normative data were collected in 10 normal-hearing, adult Mandarin-speaking Chinese listeners using a closed-set test paradigm. Two test runs were conducted for each subject, and 20 sentences per run were randomly generated while ensuring that each word was presented only twice in each run. First, the level of the words in each category were adjusted to produce equal intelligibility in noise. Test-retest reliability for word-in-sentence recognition was excellent according to Cronbach's alpha (0.952). After the category level adjustments, speech reception thresholds (SRTs) for sentences in noise, defined as the signal-to-noise ratio (SNR) that produced 50% correct whole sentence recognition, were adaptively measured by adjusting the SNR according to the correctness of response. The mean SRT was -7.9 (SE=0.41) and -8.1 (SE=0.34) dB for runs 1 and 2, respectively. The mean standard deviation across runs was 0.93 dB, and paired t-tests showed no significant difference between runs 1 and 2 (p=0.74) despite random sentences being generated for each run and each subject. The results suggest that the CMS provides large stimulus set with which to repeatedly and reliably measure Mandarin-speaking listeners' speech understanding in noise using a closed-set paradigm.

  11. The accomplishments of lithium target and test facility validation activities in the IFMIF/EVEDA phase

    Science.gov (United States)

    Arbeiter, Frederik; Baluc, Nadine; Favuzza, Paolo; Gröschel, Friedrich; Heidinger, Roland; Ibarra, Angel; Knaster, Juan; Kanemura, Takuji; Kondo, Hiroo; Massaut, Vincent; Saverio Nitti, Francesco; Miccichè, Gioacchino; O'hira, Shigeru; Rapisarda, David; Sugimoto, Masayoshi; Wakai, Eiichi; Yokomine, Takehiko

    2018-01-01

    As part of the engineering validation and engineering design activities (EVEDA) phase for the international fusion materials irradiation facility IFMIF, major elements of a lithium target facility and the test facility were designed, prototyped and validated. For the lithium target facility, the EVEDA lithium test loop was built at JAEA and used to test the stability (waves and long term) of the lithium flow in the target, work out the startup procedures, and test lithium purification and analysis. It was confirmed by experiments in the Lifus 6 plant at ENEA that lithium corrosion on ferritic martensitic steels is acceptably low. Furthermore, complex remote handling procedures for the remote maintenance of the target in the test cell environment were successfully practiced. For the test facility, two variants of a high flux test module were prototyped and tested in helium loops, demonstrating their good capabilities of maintaining the material specimens at the desired temperature with a low temperature spread. Irradiation tests were performed for heated specimen capsules and irradiation instrumentation in the BR2 reactor at SCK-CEN. The small specimen test technique, essential for obtaining material test results with limited irradiation volume, was advanced by evaluating specimen shape and test technique influences.

  12. Italian validation of the Purpose In Life (PIL) test and the Seeking Of Noetic Goals (SONG) test in a population of cancer patients.

    Science.gov (United States)

    Brunelli, C; Bianchi, E; Murru, L; Monformoso, P; Bosisio, M; Gangeri, L; Miccinesi, G; Scrignaro, M; Ripamonti, C; Borreani, C

    2012-11-01

    The first instruments developed to evaluate specific logotherapeutic dimensions were the Purpose In Life (PIL) and the Seeking Of Noetic Goals (SONG) tests, designed to reflect Frankl's concepts of, respectively, meaning in life attainment and will to meaning. This study aims to perform the Italian cultural adaptation and the psychometric validation of the PIL and SONG questionnaires. We administered the PIL and SONG, culturally adapted into the Italian language, to 266 cancer patients. The psychometric validation appraised construct validity, internal consistency, test-retest reliability, known-group validity, and convergent validity of the two questionnaires with respect to one another. The factorial analysis indicates that the original single-factor solution can be maintained for both instruments (proportion of variance explained by the first factor 77% and 71% for the PIL and SONG, respectively). The results show excellent internal consistency (Cronbach's alpha of 0.91 for the PIL and 0.90 for the SONG) and test-retest reliability (intraclass correlation coefficient of 0.92 for the PIL and 0.81 for the SONG). As expected, males, believers, patients nearer to the diagnosis, and patients not undergoing psychological therapy have higher PIL and lower SONG scores, while expectations for age were not confirmed. The average level for the PIL was 107.3, while for the SONG, it was 66.1, and a negative correlation (-0.47) between PIL and SONG scores indicates good convergent validity of the two instruments. Italian versions of the PIL and SONG are adequate and reliable self-report instruments for evaluating purpose in life and the motivation to find purpose for cancer patient populations.

  13. The Generalized Problematic Internet Use Scale 2: Validation and test of the model to Facebook use.

    Science.gov (United States)

    Assunção, Raquel S; Matos, Paula Mena

    2017-01-01

    The main goals of the present study were to test the psychometric properties of a Portuguese version of the GPIUS2 (Generalized Problematic Internet Use Scale 2, Caplan, 2010), and to test whether the cognitive-behavioral model proposed by Caplan (2010) replicated in the context of Facebook use. We used a sample of 761 Portuguese adolescents (53.7% boys, 46.3% girls, mean age = 15.8). Our results showed that the data presented an adequate fit to the original model using confirmatory factor analysis. The scale presented also good internal consistency and adequate construct validity. The cognitive-behavioral model was also applicable to the Facebook context, presenting good fit. Consistently with previous findings we found that preference for online social interaction and the use of Facebook to mood regulation purposes, predicted positively and significantly the deficient self-regulation in Facebook use, which in turn was a significant predictor of the negative outcomes associated with this use. Copyright © 2016 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.

  14. Validating a test to assess early childhood learners’ ability to perceive, express and appreciate emotions

    Directory of Open Access Journals (Sweden)

    Jose Miguel Mestre Navas

    2011-10-01

    Full Text Available Emotional Education, regardless of the school level, has an important mission in the goal of any educational project: socialising younger generations. However, it is also important to assess implemented programs by means of a valid, reliable measure of the progression of children’s’ cognitive and emotional development. Using a sample of 138 early childhood learners (aged from 3 to 6 this paper tested an instrument for assessing the ability to perceive, appreciate and express emotions (as defined by Mayer & Salovey’s model, 1997; 2007. Also, external criteria were developed by teachers on several issues related to children’s social and personal adaptation (school rules, achievement, impulsiveness, social acceptance of peers and hostility. Findings suggest that children from 3 to 6 years who obtain best scores in the perception and assessment of basic emotions are considered by their teachers to better adjust to school rules, to better control impulses, to achieve better academic performance and to be less problematic. It is also important to note that the study is at its initial stages and presents some limitations, as certain important variables such as personality and verbal ability are not controlled. Nevertheless, it should be pointed out that children showed great enthusiasm in taking the test.

  15. Adaptation and validation into Portuguese language of the six-item cognitive impairment test (6CIT).

    Science.gov (United States)

    Apóstolo, João Luís Alves; Paiva, Diana Dos Santos; Silva, Rosa Carla Gomes da; Santos, Eduardo José Ferreira Dos; Schultz, Timothy John

    2017-07-25

    The six-item cognitive impairment test (6CIT) is a brief cognitive screening tool that can be administered to older people in 2-3 min. To adapt the 6CIT for the European Portuguese and determine its psychometric properties based on a sample recruited from several contexts (nursing homes; universities for older people; day centres; primary health care units). The original 6CIT was translated into Portuguese and the draft Portuguese version (6CIT-P) was back-translated and piloted. The accuracy of the 6CIT-P was assessed by comparison with the Portuguese Mini-Mental State Examination (MMSE). A convenience sample of 550 older people from various geographical locations in the north and centre of the country was used. The test-retest reliability coefficient was high (r = 0.95). The 6CIT-P also showed good internal consistency (α = 0.88) and corrected item-total correlations ranged between 0.32 and 0.90. Total 6CIT-P and MMSE scores were strongly correlated. The proposed 6CIT-P threshold for cognitive impairment is ≥10 in the Portuguese population, which gives sensitivity of 82.78% and specificity of 84.84%. The accuracy of 6CIT-P, as measured by area under the ROC curve, was 0.91. The 6CIT-P has high reliability and validity and is accurate when used to screen for cognitive impairment.

  16. Validation test for CAP88 predictions of tritium dispersion at Los Alamos National Laboratory.

    Science.gov (United States)

    Michelotti, Erika; Green, Andrew; Whicker, Jeffrey; Eisele, William; Fuehne, David; McNaughton, Michael

    2013-08-01

    Gaussian plume models, such as CAP88, are used regularly for estimating downwind concentrations from stack emissions. At many facilities, the U.S. Environmental Protection Agency (U.S. EPA) requires that CAP88 be used to demonstrate compliance with air quality regulations for public protection from emissions of radionuclides. Gaussian plume models have the advantage of being relatively simple and their use pragmatic; however, these models are based on simplifying assumptions and generally they are not capable of incorporating dynamic meteorological conditions or complex topography. These limitations encourage validation tests to understand the capabilities and limitations of the model for the specific application. Los Alamos National Laboratory (LANL) has complex topography but is required to use CAP88 for compliance with the Clean Air Act Subpart H. The purpose of this study was to test the accuracy of the CAP88 predictions against ambient air measurements using released tritium as a tracer. Stack emissions of tritium from two LANL stacks were measured and the dispersion modeled with CAP88 using local meteorology. Ambient air measurements of tritium were made at various distances and directions from the stacks. Model predictions and ambient air measurements were compared over the course of a full year's data. Comparative results were consistent with other studies and showed the CAP88 predictions of downwind tritium concentrations were on average about three times higher than those measured, and the accuracy of the model predictions were generally more consistent for annual averages than for bi-weekly data.

  17. Reliability, construct and discriminative validity of clinical testing in subjects with and without chronic neck pain

    DEFF Research Database (Denmark)

    Jørgensen, René; Ris Hansen, Inge; Falla, Deborah

    2014-01-01

    -retest reliability in people with and without chronic neck pain. Moreover, construct and between-group discriminative validity of the tests were examined. METHODS: Twenty-one participants with chronic neck pain and 21 asymptomatic participants were included. Intra- and inter-reliability were evaluated for the Cranio-Cervical...... Flexion Test (CCFT), Range of Movement (ROM), Joint Position Error (JPE), Gaze Stability (GS), Smooth Pursuit Neck Torsion Test (SPNTT), and neuromuscular control of the Deep Cervical Extensors (DCE). Test-retest reliability was assessed for Postural Control (SWAY) and Pressure Pain Threshold (PPT) over......BACKGROUND: The reliability of clinical tests for the cervical spine has not been adequately evaluated. Six cervical clinical tests, which are low cost and easy to perform in clinical settings, were tested for intra- and inter-examiner reliability, and two performance tests were assessed for test...

  18. Sino-Nasal Outcome Test-22: Translation, Cross-cultural Adaptation, and Validation in Hebrew-Speaking Patients.

    Science.gov (United States)

    Shapira Galitz, Yael; Halperin, Doron; Bavnik, Yosef; Warman, Meir

    2016-05-01

    To perform the translation, cross-cultural adaptation, and validation of the Sino-Nasal Outcome Test-22 (SNOT-22) questionnaire to the Hebrew language. A single-center prospective cross-sectional study. Seventy-three chronic rhinosinusitis (CRS) patients and 73 patients without sinonasal disease filled the Hebrew version of the SNOT-22 questionnaire. Fifty-one CRS patients underwent endoscopic sinus surgery, out of which 28 filled a postoperative questionnaire. Seventy-three healthy volunteers without sinonasal disease also answered the questionnaire. Internal consistency, test-retest reproducibility, validity, and responsiveness of the questionnaire were evaluated. Questionnaire reliability was excellent, with a high internal consistency (Cronbach's alpha coefficient, 0.91-0.936) and test-retest reproducibility (Spearman's coefficient, 0.962). Mean scores for the preoperative, postoperative, and control groups were 50.44, 29.64, and 13.15, respectively (P < .0001 for CRS vs controls, P < .001 for preoperative vs postoperative), showing validity and responsiveness of the questionnaire. The Hebrew version of SNOT-22 questionnaire is a valid outcome measure for patients with CRS with or without nasal polyps. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.

  19. ASSESSMENT OF SATISFACTION IN PERITONEAL EQUILIBRATION TEST: A STUDY ON THE VALIDITY AND RELIABILITY OF THE PERITONEAL EQUILIBRATION SATISFACTION SCALE

    Directory of Open Access Journals (Sweden)

    Eylem TOPBAŞ

    2016-01-01

    Full Text Available Aim: This study has been designed to develop an assessment tool to be used in determining the patients’ satisfaction level with the peritoneal equilibration test (PET procedure. Materials and Methods: The development and validation of the peritoneal equilibration test Satisfaction Scale (PETSS was completed in two phases. Phase I focused on instrument construction and included item development and establishment of concurrent validity. Phase II included the factor analysis and psychometric assessment of the scale. In statistical evaluation of the data descriptive statistics and non-paratmetric tests were used. Results: The first version of the scale that has 3.62 Content Validity Index value was composed of 20 items. It was found that the latest version of the scale that has 14 items explained 46% of the variance. It was found that the Cronbach alfa value of this scale, which has 0.52-0.89 coefficient of item-total correlation was 0.96. Psychometric assessment of the scale revealed that except for type of the PET application, none of the demographic and clinical characteristics effect patients level of satisfaction during the PET application. Conclusion: This preliminary study showed that PETSS was a valid and reliable scale that can be used for determining satisfaction level of patients during PET application.

  20. Experience with Aero- and Fluid-Dynamic Testing for Engineering and CFD Validation

    Science.gov (United States)

    Ross, James C.

    2016-01-01

    Ever since computations have been used to simulate aerodynamics the need to ensure that the computations adequately represent real life has followed. Many experiments have been performed specifically for validation and as computational methods have improved, so have the validation experiments. Validation is also a moving target because computational methods improve requiring validation for the new aspect of flow physics that the computations aim to capture. Concurrently, new measurement techniques are being developed that can help capture more detailed flow features pressure sensitive paint (PSP) and particle image velocimetry (PIV) come to mind. This paper will present various wind-tunnel tests the author has been involved with and how they were used for validation of various kinds of CFD. A particular focus is the application of advanced measurement techniques to flow fields (and geometries) that had proven to be difficult to predict computationally. Many of these difficult flow problems arose from engineering and development problems that needed to be solved for a particular vehicle or research program. In some cases the experiments required to solve the engineering problems were refined to provide valuable CFD validation data in addition to the primary engineering data. All of these experiments have provided physical insight and validation data for a wide range of aerodynamic and acoustic phenomena for vehicles ranging from tractor-trailers to crewed spacecraft.

  1. Validation Ice Crystal Icing Engine Test in the Propulsion Systems Laboratory at NASA Glenn Research Center

    Science.gov (United States)

    Oliver, Michael J.

    2014-01-01

    The Propulsion Systems Laboratory (PSL) is an existing altitude simulation jet engine test facility located at NASA Glenn Research Center in Cleveland, OH. It was modified in 2012 with the integration of an ice crystal cloud generation system. This paper documents the inaugural ice crystal cloud test in PSL--the first ever full scale, high altitude ice crystal cloud turbofan engine test to be conducted in a ground based facility. The test article was a Lycoming ALF502-R5 high bypass turbofan engine, serial number LF01. The objectives of the test were to validate the PSL ice crystal cloud calibration and engine testing methodologies by demonstrating the capability to calibrate and duplicate known flight test events that occurred on the same LF01 engine and to generate engine data to support fundamental and computational research to investigate and better understand the physics of ice crystal icing in a turbofan engine environment while duplicating known revenue service events and conducting test points while varying facility and engine parameters. During PSL calibration testing it was discovered than heated probes installed through tunnel sidewalls experienced ice buildup aft of their location due to ice crystals impinging upon them, melting and running back. Filtered city water was used in the cloud generation nozzle system to provide ice crystal nucleation sites. This resulted in mineralization forming on flow path hardware that led to a chronic degradation of performance during the month long test. Lacking internal flow path cameras, the response of thermocouples along the flow path was interpreted as ice building up. Using this interpretation, a strong correlation between total water content (TWC) and a weaker correlation between median volumetric diameter (MVD) of the ice crystal cloud and the rate of ice buildup along the instrumented flow path was identified. For this test article the engine anti-ice system was required to be turned on before ice crystal

  2. The Validity of Graduate Management Admission Test Scores: A Summary of Studies Conducted from 1997 to 2004

    Science.gov (United States)

    Talento-Miller, Eileen; Rudner, Lawrence M.

    2008-01-01

    The validity of Graduate Management Admission Test (GMAT) scores is examined by summarizing 273 studies conducted between 1997 and 2004. Each of the studies was conducted through the Validity Study Service of the test sponsor and contained identical variables and statistical methods. Validity coefficients from each of the studies were corrected…

  3. A validity test of movie, television, and video-game ratings.

    Science.gov (United States)

    Walsh, D A; Gentile, D A

    2001-06-01

    Numerous studies have documented the potential effects on young audiences of violent content in media products, including movies, television programs, and computer and video games. Similar studies have evaluated the effects associated with sexual content and messages. Cumulatively, these effects represent a significant public health risk for increased aggressive and violent behavior, spread of sexually transmitted diseases, and pediatric pregnancy. In partial response to these risks and to public and legislative pressure, the movie, television, and gaming industries have implemented ratings systems intended to provide information about the content and appropriate audiences for different films, shows, and games. To test the validity of the current movie-, television-, and video game-rating systems. Panel study. Participants used the KidScore media evaluation tool, which evaluates films, television shows, and video games on 10 aspects, including the appropriateness of the media product for children based on age. When an entertainment industry rates a product as inappropriate for children, parent raters agree that it is inappropriate for children. However, parent raters disagree with industry usage of many of the ratings designating material suitable for children of different ages. Products rated as appropriate for adolescents are of the greatest concern. The level of disagreement varies from industry to industry and even from rating to rating. Analysis indicates that the amount of violent content and portrayals of violence are the primary markers for disagreement between parent raters and industry ratings. As 1 part of a solution to the complex public health problems posed by violent and sexually explicit media products, ratings can have value if used with caution. Parents and caregivers relying on the ratings systems to guide their children's use of media products should continue to monitor content independently. Industry ratings systems should be revised with input

  4. Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

    Science.gov (United States)

    Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

    2015-01-01

    Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions.Psycometric properties were analyzed including internal consistency, measured with Cronbach´s Alpha, test-retest reliability at 15 days with the interclass correlation coefficient. Results: The internal consistency, validation, was an Alpha of 0,808, evaluated as good. The test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and it´s cultural adaptation to Argentinian-Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.

  5. Translation and validation of the Malay version of the Stroke Knowledge Test.

    Science.gov (United States)

    Sowtali, Siti Noorkhairina; Yusoff, Dariah Mohd; Harith, Sakinah; Mohamed, Monniaty

    2016-04-01

    To date, there is a lack of published studies on assessment tools to evaluate the effectiveness of stroke education programs. This study developed and validated the Malay language version of the Stroke Knowledge Test research instrument. This study involved translation, validity, and reliability phases. The instrument underwent backward and forward translation of the English version into the Malay language. Nine experts reviewed the content for consistency, clarity, difficulty, and suitability for inclusion. Perceived usefulness and utilization were obtained from experts' opinions. Later, face validity assessment was conducted with 10 stroke patients to determine appropriateness of sentences and grammar used. A pilot study was conducted with 41 stroke patients to determine the item analysis and reliability of the translated instrument using the Kuder Richardson 20 or Cronbach's alpha. The final Malay version Stroke Knowledge Test included 20 items with good content coverage, acceptable item properties, and positive expert review ratings. Psychometric investigations suggest that Malay version Stroke Knowledge Test had moderate reliability with Kuder Richardson 20 or Cronbach's alpha of 0.58. Improvement is required for Stroke Knowledge Test items with unacceptable difficulty indices. Overall, the average rating of perceived usefulness and perceived utility of the instruments were both 72.7%, suggesting that reviewers were likely to use the instruments in their facilities. Malay version Stroke Knowledge Test was a valid and reliable tool to assess educational needs and to evaluate stroke knowledge among participants of group-based stroke education programs in Malaysia.

  6. Applicability of U.S. Army tracer test data to model validation needs of ERDA

    International Nuclear Information System (INIS)

    Shearer, D.L.; Minott, D.H.

    1976-06-01

    This report covers the first phase of an atmospheric dispersion model validation project sponsored by the Energy Research and Development Administration (ERDA). The project will employ dispersion data generated during an extensive series of field tracer experiments that were part of a meteorological research program which was conducted by the U. S. Army Dugway Proving Ground, Utah, from the late 1950's to the early 1970's. The tests were conducted at several locations in the U. S., South America, Germany, and Norway chosen to typify the effects of certain environmental factors on atmospheric dispersion. The purpose of the Phase I work of this project was to identify applicable portions of the Army data, obtain and review that data, and make recommendations for its uses for atmospheric dispersion model validations. This report presents key information in three formats. The first is a tabular listing of the Army dispersion test reports summarizing the test data contained in each report. This listing is presented in six separate tables with each tabular list representing a different topical area that is based on model validation requirements and the nature of the Army data base. The second format for presenting key information is a series of discussions of the Army test information assigned to each of the six topical areas. These discussions relate the extent and quality of the available data, as well as its prospective use for model validation. The third format is a series of synopses for each Army test report

  7. Noncredible cognitive performance at clinical evaluation of adult ADHD: An embedded validity indicator in a visuospatial working memory test.

    Science.gov (United States)

    Fuermaier, Anselm B M; Tucha, Oliver; Koerts, Janneke; Lange, Klaus W; Weisbrod, Matthias; Aschenbrenner, Steffen; Tucha, Lara

    2017-12-01

    The assessment of performance validity is an essential part of the neuropsychological evaluation of adults with attention-deficit/hyperactivity disorder (ADHD). Most available tools, however, are inaccurate regarding the identification of noncredible performance. This study describes the development of a visuospatial working memory test, including a validity indicator for noncredible cognitive performance of adults with ADHD. Visuospatial working memory of adults with ADHD (n = 48) was first compared to the test performance of healthy individuals (n = 48). Furthermore, a simulation design was performed including 252 individuals who were randomly assigned to either a control group (n = 48) or to 1 of 3 simulation groups who were requested to feign ADHD (n = 204). Additional samples of 27 adults with ADHD and 69 instructed simulators were included to cross-validate findings from the first samples. Adults with ADHD showed impaired visuospatial working memory performance of medium size as compared to healthy individuals. Simulation groups committed significantly more errors and had shorter response times as compared to patients with ADHD. Moreover, binary logistic regression analysis was carried out to derive a validity index that optimally differentiates between true and feigned ADHD. ROC analysis demonstrated high classification rates of the validity index, as shown in excellent specificity (95.8%) and adequate sensitivity (60.3%). The visuospatial working memory test as presented in this study therefore appears sensitive in indicating cognitive impairment of adults with ADHD. Furthermore, the embedded validity index revealed promising results concerning the detection of noncredible cognitive performance of adults with ADHD. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  8. Validity and reliability of skill-related fitness tests for wheelchair-using youth with Spina Bifida.

    NARCIS (Netherlands)

    Bloemen, M.A.; Takken, T.; Backx, F.J.; Vos, M.; Kruitwagen, C.L.; Groot, J.F. de

    2017-01-01

    Objectives: To determine content validity of the Muscle Power Sprint Test (MPST), and construct validity and reliability of the MPST, 10x5 Meter Sprint Test (10x5MST), slalom test, and One Stroke Push Test (1SPT) in wheelchair-using youth with spina bifida (SB). Design: Clinimetric study. Setting:

  9. Validity and Reliability of Skill-Related Fitness Tests for Wheelchair-Using Youth With Spina Bifida

    NARCIS (Netherlands)

    Bloemen, Manon A.; Takken, Tim; Backx, Frank J.; Vos, Marleen; Kruitwagen, Cas L.; de Groot, Janke F.

    OBJECTIVE: To determine content validity of the Muscle Power Sprint Test (MPST) and construct validity and reliability of the MPST, 10x5 Meter Sprint Test (10x5MST), slalom test and one stroke push test (1SPT) in wheelchair-using youth with spina bifida (SB). DESIGN: Clinimetric study SETTING:

  10. Validity and Reliability of Skill-Related Fitness Tests for Wheelchair-Using Youth with Spina Bifida

    NARCIS (Netherlands)

    Cas L.J.J. Kruitwagen; Frank J.G. Backx; Tim Takken; Janke de Groot; Marleen Vos; Manon A.T. Bloemen

    2016-01-01

    Objective: To determine content validity of the Muscle Power Sprint Test (MPST) and construct validity and reliability of the MPST, 10x5 Meter Sprint Test (10x5MST), slalom test and one stroke push test (1SPT) in wheelchair-using youth with spina bifida (SB). Design: Clinimetric study Setting:

  11. Test-retest studies of cerebral glucose metabolism using fluorine-18 deoxyglucose: validation of method

    International Nuclear Information System (INIS)

    Brooks, R.A.; Di Chiro, G.; Zukerberg, B.W.; Bairamian, D.; Larson, S.M.

    1987-01-01

    In studies using [ 18 F]deoxyglucose (FDG), one often wants to compare metabolic rates following stimulation (drug or motor-sensory) with the baseline values. However, because of reproducibility problems with baseline variations of 25% in the same individual not uncommon, the global effect of the stimulation may be difficult to see. One approach to this problem is to perform the two studies sequentially. This means that, with the 110-min half-life of 18 F, one must take into account the residual activity from the first study when calculating metabolic rates for the second. We performed TEST-RETEST baseline studies on four subjects, with a 1-hr interval between injections. These studies were done without stimulation, in order to validate the repeatability of the method. To reduce the amount of residual activity from the first study, the first injection was only 2 mCi in three cases, and only 1 mCi in one case, out of a total injected dose of 5 mCi. A correction for residual activity was included in the RETEST calculation of metabolic rate. The results showed a global metabolic shift between the two studies of 2% to 9%. An error analysis shows that the shift could be further reduced if anatomically comparable scans are done at comparable postinjection times

  12. Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

    Science.gov (United States)

    Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

    2015-01-01

    Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.

  13. Phase 1 Validation Testing and Simulation for the WEC-Sim Open Source Code

    Science.gov (United States)

    Ruehl, K.; Michelen, C.; Gunawan, B.; Bosma, B.; Simmons, A.; Lomonaco, P.

    2015-12-01

    WEC-Sim is an open source code to model wave energy converters performance in operational waves, developed by Sandia and NREL and funded by the US DOE. The code is a time-domain modeling tool developed in MATLAB/SIMULINK using the multibody dynamics solver SimMechanics, and solves the WEC's governing equations of motion using the Cummins time-domain impulse response formulation in 6 degrees of freedom. The WEC-Sim code has undergone verification through code-to-code comparisons; however validation of the code has been limited to publicly available experimental data sets. While these data sets provide preliminary code validation, the experimental tests were not explicitly designed for code validation, and as a result are limited in their ability to validate the full functionality of the WEC-Sim code. Therefore, dedicated physical model tests for WEC-Sim validation have been performed. This presentation provides an overview of the WEC-Sim validation experimental wave tank tests performed at the Oregon State University's Directional Wave Basin at Hinsdale Wave Research Laboratory. Phase 1 of experimental testing was focused on device characterization and completed in Fall 2015. Phase 2 is focused on WEC performance and scheduled for Winter 2015/2016. These experimental tests were designed explicitly to validate the performance of WEC-Sim code, and its new feature additions. Upon completion, the WEC-Sim validation data set will be made publicly available to the wave energy community. For the physical model test, a controllable model of a floating wave energy converter has been designed and constructed. The instrumentation includes state-of-the-art devices to measure pressure fields, motions in 6 DOF, multi-axial load cells, torque transducers, position transducers, and encoders. The model also incorporates a fully programmable Power-Take-Off system which can be used to generate or absorb wave energy. Numerical simulations of the experiments using WEC-Sim will be

  14. Validation of US3D for Capsule Aerodynamics using 05-CA Wind Tunnel Test Data

    Science.gov (United States)

    Schwing, Alan

    2012-01-01

    Several comparisons of computational fluid dynamics to wind tunnel test data are shown for the purpose of code validation. The wind tunnel test, 05-CA, uses a 7.66% model of NASA's Multi-Purpose Crew Vehicle in the 11-foot test section of the Ames Unitary Plan Wind tunnel. A variety of freestream conditions over four Mach numbers and three angles of attack are considered. Test data comparisons include time-averaged integrated forces and moments, time-averaged static pressure ports on the surface, and Strouhal Number. The applicability of the US3D code to subsonic and transonic flow over a bluff body is assessed on a comprehensive data set. With close comparison, this work validates US3D for highly separated flows similar to those examined here.

  15. Integration and validation testing for PhEDEx, DBS and DAS with the PhEDEx LifeCycle agent

    Science.gov (United States)

    Boeser, C.; Chwalek, T.; Giffels, M.; Kuznetsov, V.; Wildish, T.

    2014-06-01

    The ever-increasing amount of data handled by the CMS dataflow and workflow management tools poses new challenges for cross-validation among different systems within CMS experiment at LHC. To approach this problem we developed an integration test suite based on the LifeCycle agent, a tool originally conceived for stress-testing new releases of PhEDEx, the CMS data-placement tool. The LifeCycle agent provides a framework for customising the test workflow in arbitrary ways, and can scale to levels of activity well beyond those seen in normal running. This means we can run realistic performance tests at scales not likely to be seen by the experiment for some years, or with custom topologies to examine particular situations that may cause concern some time in the future. The LifeCycle agent has recently been enhanced to become a general purpose integration and validation testing tool for major CMS services. It allows cross-system integration tests of all three components to be performed in controlled environments, without interfering with production services. In this paper we discuss the design and implementation of the LifeCycle agent. We describe how it is used for small-scale debugging and validation tests, and how we extend that to large-scale tests of whole groups of sub-systems. We show how the LifeCycle agent can emulate the action of operators, physicists, or software agents external to the system under test, and how it can be scaled to large and complex systems.

  16. Integration and validation testing for PhEDEx, DBS and DAS with the PhEDEx LifeCycle agent

    International Nuclear Information System (INIS)

    Boeser, C; Chwalek, T; Giffels, M; Kuznetsov, V; Wildish, T

    2014-01-01

    The ever-increasing amount of data handled by the CMS dataflow and workflow management tools poses new challenges for cross-validation among different systems within CMS experiment at LHC. To approach this problem we developed an integration test suite based on the LifeCycle agent, a tool originally conceived for stress-testing new releases of PhEDEx, the CMS data-placement tool. The LifeCycle agent provides a framework for customising the test workflow in arbitrary ways, and can scale to levels of activity well beyond those seen in normal running. This means we can run realistic performance tests at scales not likely to be seen by the experiment for some years, or with custom topologies to examine particular situations that may cause concern some time in the future. The LifeCycle agent has recently been enhanced to become a general purpose integration and validation testing tool for major CMS services. It allows cross-system integration tests of all three components to be performed in controlled environments, without interfering with production services. In this paper we discuss the design and implementation of the LifeCycle agent. We describe how it is used for small-scale debugging and validation tests, and how we extend that to large-scale tests of whole groups of sub-systems. We show how the LifeCycle agent can emulate the action of operators, physicists, or software agents external to the system under test, and how it can be scaled to large and complex systems.

  17. Validation of the Alcohol Use Disorders Identification Test in university students: AUDIT and AUDIT-C.

    Science.gov (United States)

    García Carretero, Miguel Ángel; Novalbos Ruiz, José Pedro; Martínez Delgado, José Manuel; O'Ferrall González, Cristina

    2016-03-02

    The aim of this study was to determine the psychometric properties of the Alcohol Use Disorders Identification Test (AUDIT and AUDIT-C) in order to detect problems related to the consumption of alcohol in the university population. The sample consisted of 1309 students.A Weekly Alcohol Consumption Diary was used as a gold standard; Cronbach's Alpha, the Kappa index, Spearman's correlation coefficient and exploratory factor analysis were applied for diagnostic reliability and validity, with ROC curves used to establish the different cut-off points. Binge Drinking (BD) episodes were found in 3.9% of men and 4.0% of women with otherwise low-risk drinking patterns. AUDIT identified 20.1% as high-risk drinkers and 6.4% as drinkers with physical-psychological problems and probable alcohol dependence.Cronbach's alpha of 0.75 demonstrates good internal consistency. The best cut-off points for high-risk drinking students were 8 for males and 6 for females. As for problem drinkers and probable ADS, 13 was the best cut-off point for both sexes. In relation to AUDIT-C, 5 and 4 were the best cut-off points for males and females with high-risk patterns, respectively. The criterion validity of AUDIT and AUDIT-C to detect binge drinking episodes was found to have a moderate K value. The results obtained show that AUDIT has good psychometric properties to detect early alcohol abuse disorders in university students; however, it is recommended that the cut-off point be reduced to 8 in men. AUDIT-C improves its predictive value by raising the cut-off point by one unit. Items 2 and 3 should be reviewed to increase its predictive value for BD.

  18. Validation and Verification (V and V) Testing on Midscale Flame Resistant (FR) Test Method

    Science.gov (United States)

    2016-12-16

    materials . The results demonstrated that the Midscale test is a quick and cost-effective method for evaluation of FR performance of design features...standard and novel FR materials and design configurations during fire engulfment. Details of the test method and its development can be found in the...employed in the FRACU is a ripstop fabric blend of 65% FR rayon, 25% para- aramid and 10% nylon. The iCVC material is Nylon/Cotton/Nomex. All three

  19. Flight test techniques for validating simulated nuclear electromagnetic pulse aircraft responses

    Science.gov (United States)

    Winebarger, R. M.; Neely, W. R., Jr.

    1984-01-01

    An attempt has been made to determine the effects of nuclear EM pulses (NEMPs) on aircraft systems, using a highly instrumented NASA F-106B to document the simulated NEMP environment at the Kirtland Air Force Base's Vertically Polarized Dipole test facility. Several test positions were selected so that aircraft orientation relative to the test facility would be the same in flight as when on the stationary dielectric stand, in order to validate the dielectric stand's use in flight configuration simulations. Attention is given to the flight test portions of the documentation program.

  20. Identification of conductive hearing loss using air conduction tests alone: reliability and validity of an automatic test battery.

    Science.gov (United States)

    Convery, Elizabeth; Keidser, Gitte; Seeto, Mark; Freeston, Katrina; Zhou, Dan; Dillon, Harvey

    2014-01-01

    The primary objective of this study was to determine whether a combination of automatically administered pure-tone audiometry and a tone-in-noise detection task, both delivered via an air conduction (AC) pathway, could reliably and validly predict the presence of a conductive component to the hearing loss. The authors hypothesized that performance on the battery of tests would vary according to hearing loss type. A secondary objective was to evaluate the reliability and validity of a novel automatic audiometry algorithm to assess its suitability for inclusion in the test battery. Participants underwent a series of hearing assessments that were conducted in a randomized order: manual pure-tone air conduction audiometry and bone conduction audiometry; automatic pure-tone air conduction audiometry; and an automatic tone-in-noise detection task. The automatic tests were each administered twice. The ability of the automatic test battery to: (a) predict the presence of an air-bone gap (ABG); and (b) accurately measure AC hearing thresholds was assessed against the results of manual audiometry. Test-retest conditions were compared to determine the reliability of each component of the automatic test battery. Data were collected on 120 ears from normal-hearing and conductive, sensorineural, and mixed hearing-loss subgroups. Performance differences between different types of hearing loss were observed. Ears with a conductive component (conductive and mixed ears) tended to have normal signal to noise ratios (SNR) despite impaired thresholds in quiet, while ears without a conductive component (normal and sensorineural ears) demonstrated, on average, an increasing relationship between their thresholds in quiet and their achieved SNR. Using the relationship between these two measures among ears with no conductive component as a benchmark, the likelihood that an ear has a conductive component can be estimated based on the deviation from this benchmark. The sensitivity and

  1. The Bulimia Test--Revised: Validation with "DSM-IV" Criteria for Bulimia Nervosa.

    Science.gov (United States)

    Thelen, Mark H.; And Others

    1996-01-01

    The Bulimia Test--Revised (BULIT-R) was given to 23 female subjects who met the criteria for bulimia in the "Diagnostic and Statistical Manual of Mental Disorders" (DSM-IV) and 124 female controls. The BULIT-R appears to be a valid instruction for identifying individuals who meet DSM-IV criteria for bulimia. (SLD)

  2. Validity of 20-metre multi stage shuttle run test for estimation of ...

    African Journals Online (AJOL)

    Validity of 20-metre multi stage shuttle run test for estimation of maximum oxygen uptake in indian male university students. P Chatterjee, AK Banerjee, P Debnath, P Bas, B Chatterjee. Abstract. No Abstract. South African Journal for Physical, Health Education, Recreation and DanceVol. 12(4) 2006: pp. 461-467. Full Text:.

  3. Preliminary Process Theory does not validate the Comparison Question Test: A comment on Palmatier and Rovner

    NARCIS (Netherlands)

    Ben-Shakar, G.; Gamer, M.; Iacono, W.; Meijer, E.; Verschuere, B.

    2015-01-01

    Palmatier and Rovner (2015) attempt to establish the construct validity of the Comparison Question Test (CQT) by citing extensive research ranging from modern neuroscience to memory and psychophysiology. In this comment we argue that merely citing studies on the preliminary process theory (PPT) of

  4. Validation of a Wave-Body Interaction Model by Experimental Tests

    DEFF Research Database (Denmark)

    Ferri, Francesco; Kramer, Morten; Pecher, Arthur

    2013-01-01

    Within the wave energy field, numerical simulation has recently acquired a worldwide consent as being a useful tool, besides physical model testing. The main goal of this work is the validation of a numerical model by experimental results. The numerical model is based on a linear wave-body intera...

  5. Validity of the rey visual design test in primary and secondary school children

    NARCIS (Netherlands)

    Wilhelm, P.; van Klink, M.; van Klink, M.

    2007-01-01

    The Rey Visual Design Learning Test (Rey, 1964, cited in Spreen & Strauss, 1991, Wilhelm, 2004) assesses immediate memory span, new learning, delayed recall and recognition for nonverbal material. Two studies are presented that focused on the construct validity of the RVDLT in primary and secondary

  6. Teaching Analytical Method Transfer through Developing and Validating Then Transferring Dissolution Testing Methods for Pharmaceuticals

    Science.gov (United States)

    Kimaru, Irene; Koether, Marina; Chichester, Kimberly; Eaton, Lafayette

    2017-01-01

    Analytical method transfer (AMT) and dissolution testing are important topics required in industry that should be taught in analytical chemistry courses. Undergraduate students in senior level analytical chemistry laboratory courses at Kennesaw State University (KSU) and St. John Fisher College (SJFC) participated in development, validation, and…

  7. Validating Models of Clinical Word Recognition Tests for Spanish/English Bilinguals

    Science.gov (United States)

    Shi, Lu-Feng

    2014-01-01

    Purpose: Shi and Sánchez (2010) developed models to predict the optimal test language for evaluating Spanish/English (S/E) bilinguals' word recognition. The current study intended to validate their conclusions in a separate bilingual listener sample. Method: Seventy normal-hearing S/E bilinguals varying in language profile were included.…

  8. The validity of the circumduction test in elderly men and women

    NARCIS (Netherlands)

    Lemmink, KAPM; Kemper, HCG; de Greef, MHG; Rispens, P; Stevens, M

    2003-01-01

    This article focuses on the validity of the circumduction test for measuring shoulder flexibility in older adults. Participants included 137 community-dwelling older adults. Equipment consisted of a cord with a fixed handle on one end and a sliding handle on the other. The sliding handle was

  9. Using the U.S. "Test of Financial Literacy" in Germany--Adaptation and Validation

    Science.gov (United States)

    Förster, Manuel; Happ, Roland; Molerov, Dimitar

    2017-01-01

    In this article, the authors present the adaptation and validation processes conducted to render the American "Test of Financial Literacy" (TFL) suitable for use in Germany (TFL-G). First, they outline the translation procedure followed and the various cultural adjustments made in line with international standards. Next, they present…

  10. Validation of an Instrument and Testing Protocol for Measuring the Combinatorial Analysis Schema.

    Science.gov (United States)

    Staver, John R.; Harty, Harold

    1979-01-01

    Designs a testing situation to examine the presence of combinatorial analysis, to establish construct validity in the use of an instrument, Combinatorial Analysis Behavior Observation Scheme (CABOS), and to investigate the presence of the schema in young adolescents. (Author/GA)

  11. Validation of a Criterion Referenced Test for Young Handicapped Children: PIPER.

    Science.gov (United States)

    Strum, Irene; Shapiro, Madelaine

    The purpose of this study was to validate the Prescriptive Instructional Program for Educational Readiness (PIPER) for utilization as a criterion referenced test (CRT) among learning disabled children. The program consisted of behavioral objectives and diagnostic and/or mastery tasks and activities for each objective in the area of gross motor…

  12. Testing the Construct Validity of Proposed Criteria for "DSM-5" Autism Spectrum Disorder

    Science.gov (United States)

    Mandy, William P. L.; Charman, Tony; Skuse, David H.

    2012-01-01

    Objective: To use confirmatory factor analysis to test the construct validity of the proposed "DSM-5" symptom model of autism spectrum disorder (ASD), in comparison to alternative models, including that described in "DSM-IV-TR." Method: Participants were 708 verbal children and young persons (mean age, 9.5 years) with mild to severe autistic…

  13. Validating Score Interpretations and Uses: Messick Lecture, Language Testing Research Colloquium, Cambridge, April 2010

    Science.gov (United States)

    Kane, Michael

    2012-01-01

    The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…

  14. The Role of Policy Assumptions in Validating High-stakes Testing Programs.

    Science.gov (United States)

    Kane, Michael

    L. Cronbach has made the point that for validity arguments to be convincing to diverse audiences, they need to be based on assumptions that are credible to these audiences. The interpretations and uses of high stakes test scores rely on a number of policy assumptions about what should be taught in schools, and more specifically, about the content…

  15. Development of a Saudi Food Frequency Questionnaire and testing its reliability and validity

    OpenAIRE

    Gosadi, Ibrahim M.; Alatar, Abdullah A.; Otayf, Mojahed M.; AlJahani, Dhaherah M.; Ghabbani, Hisham M.; AlRajban, Waleed A.; Alrsheed, Abdullah M.; Al-Nasser, Khalid A.

    2017-01-01

    Objectives: To create a food frequency questionnaire specifically designed to capture the dietary habits of Saudis and test its validity and reliability. Methods: This investigation is a longitudinal, test-retest study conducted in King Saud University, Riyadh, Kingdom of Saudi Arabia between December 2015 and March 2016. A list of 140 food items was included in the questionnaire where a closed-ended and open-ended approach was used. Regarding past year food frequency consumption and 24 hours...

  16. Validity of a Newly-Designed Rectilinear Stepping Ergometer Submaximal Exercise Test to Assess Cardiorespiratory Fitness

    OpenAIRE

    Rubin Zhang, Likui Zhan, Shaoming Sun, Wei Peng, Yining Sun

    2017-01-01

    The maximum oxygen uptake (V̇O2 max), determined from graded maximal or submaximal exercise tests, is used to classify the cardiorespiratory fitness level of individuals. The purpose of this study was to examine the validity and reliability of the YMCA submaximal exercise test protocol performed on a newly-designed rectilinear stepping ergometer (RSE) that used up and down reciprocating vertical motion in place of conventional circular motion and giving precise measurement of workload, to det...

  17. The bogus taste test: Validity as a measure of laboratory food intake.

    Science.gov (United States)

    Robinson, Eric; Haynes, Ashleigh; Hardman, Charlotte A; Kemps, Eva; Higgs, Suzanne; Jones, Andrew

    2017-09-01

    Because overconsumption of food contributes to ill health, understanding what affects how much people eat is of importance. The 'bogus' taste test is a measure widely used in eating behaviour research to identify factors that may have a causal effect on food intake. However, there has been no examination of the validity of the bogus taste test as a measure of food intake. We conducted a participant level analysis of 31 published laboratory studies that used the taste test to measure food intake. We assessed whether the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. We examined construct validity by testing whether participant sex, hunger and liking of taste test food were associated with the amount of food consumed in the taste test. In addition, we also examined whether BMI (body mass index), trait measures of dietary restraint and over-eating in response to palatable food cues were associated with food consumption. Results indicated that the taste test was sensitive to experimental manipulations hypothesized to increase or decrease food intake. Factors that were reliably associated with increased consumption during the taste test were being male, have a higher baseline hunger, liking of the taste test food and a greater tendency to overeat in response to palatable food cues, whereas trait dietary restraint and BMI were not. These results indicate that the bogus taste test is likely to be a valid measure of food intake and can be used to identify factors that have a causal effect on food intake. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.

  18. A simple spatial working memory and attention test on paired symbols shows developmental deficits in schizophrenia patients.

    Science.gov (United States)

    Song, Wei; Zhang, Kai; Sun, Jinhua; Ma, Lina; Jesse, Forrest Fabian; Teng, Xiaochun; Zhou, Ying; Bao, Hechen; Chen, Shiqing; Wang, Shuai; Yang, Beimeng; Chu, Xixia; Ding, Wenhua; Du, Yasong; Cheng, Zaohuo; Wu, Bin; Chen, Shanguang; He, Guang; He, Lin; Chen, Xiaoping; Li, Weidong

    2013-01-01

    People with neuropsychiatric disorders such as schizophrenia often display deficits in spatial working memory and attention. Evaluating working memory and attention in schizophrenia patients is usually based on traditional tasks and the interviewer's judgment. We developed a simple Spatial Working Memory and Attention Test on Paired Symbols (SWAPS). It takes only several minutes to complete, comprising 101 trials for each subject. In this study, we tested 72 schizophrenia patients and 188 healthy volunteers in China. In a healthy control group with ages ranging from 12 to 60, the efficiency score (accuracy divided by reaction time) reached a peak in the 20-27 age range and then declined with increasing age. Importantly, schizophrenia patients failed to display this developmental trend in the same age range and adults had significant deficits compared to the control group. Our data suggests that this simple Spatial Working Memory and Attention Test on Paired Symbols can be a useful tool for studies of spatial working memory and attention in neuropsychiatric disorders.

  19. A Simple Spatial Working Memory and Attention Test on Paired Symbols Shows Developmental Deficits in Schizophrenia Patients

    Directory of Open Access Journals (Sweden)

    Wei Song

    2013-01-01

    Full Text Available People with neuropsychiatric disorders such as schizophrenia often display deficits in spatial working memory and attention. Evaluating working memory and attention in schizophrenia patients is usually based on traditional tasks and the interviewer’s judgment. We developed a simple Spatial Working Memory and Attention Test on Paired Symbols (SWAPS. It takes only several minutes to complete, comprising 101 trials for each subject. In this study, we tested 72 schizophrenia patients and 188 healthy volunteers in China. In a healthy control group with ages ranging from 12 to 60, the efficiency score (accuracy divided by reaction time reached a peak in the 20–27 age range and then declined with increasing age. Importantly, schizophrenia patients failed to display this developmental trend in the same age range and adults had significant deficits compared to the control group. Our data suggests that this simple Spatial Working Memory and Attention Test on Paired Symbols can be a useful tool for studies of spatial working memory and attention in neuropsychiatric disorders.

  20. Validity of Forced Eyelid Closure Test: A Novel Clinical Screening Test for Ocular Myasthenia Gravis.

    Science.gov (United States)

    Apinyawasisuk, Supanut; Zhou, Xinkai; Tian, Jack J; Garcia, Giancarlo A; Karanjia, Rustum; Sadun, Alfredo A

    2017-09-01

    Forced eyelid closure test (FECT) is a clinical screening test developed from the original Cogan lid twitch (CLT) sign to assist in the diagnosis of ocular myasthenia gravis (OMG), We evaluated the sensitivity and specificity of FECT compared with CLT and benchmarked to standard diagnostic tests. This study was a retrospective chart review of 48 patients using electronic medical records of those that presented with ptosis and/or diplopia at Doheny Eye Institute, University of California, Los Angeles between February 2015 and April 2016. Patients without FECT testing were excluded. FECT and CLT results, and final diagnosis were recorded. To perform FECT, the patient was asked to squeeze his or her eyelids shut for 5-10 seconds then open quickly and fixate in primary position. The excessive upward overshoot of eyelids movement indicated a positive FECT. The test was performed by a neuro-ophthalmologist before establishing the diagnosis. Patients who had equivocal test results and/or inconclusive final diagnosis were excluded. Of the 48 patients studied, 18 patients (37.5%) had positive FECT; 15 of whom had a final diagnosis of OMG (83.3%). Of the 30 patients with negative FECT, 1 had OMG (3.3%). Of the 48 patients, 35 patients also had a documented CLT result (72.9%). CLT was positive in 11 of these 35 patients (31.4%), and 9 of these 11 had OMG (81.8%). Of the 24 patients with negative CLT, 2 of them had OMG (8.3%). Sensitivity and specificity of FECT were 94% and 91% (joint 95% confidence region: sensitivity × specificity = [0.70, 1] × [0.75, 1]). The relative true-positive fraction (rTPF) between FECT and CLT was 1.15; the relative false-positive fraction was 1.31. FECT is a simple clinical screening test with good sensitivity and specificity for OMG.

  1. Validation of an Academic Listening Test: Effects of "Breakdown" Tests and Test Takers' Cognitive Awareness of Listening Processes

    Science.gov (United States)

    Chi, Youngshin

    2011-01-01

    This study investigated the breakdown effect of a listening comprehension test, whether test takers are affected in comprehending lectures by impediments, and collected test takers' cognitive awareness on test tasks which contain listening breakdown factors how they perceived these impediments. In this context of the study, a "Breakdown" is a test…

  2. Aerobic fitness testing in 6- to 9-year-old children: reliability and validity of a modified Yo-Yo IR1 test and the Andersen test

    DEFF Research Database (Denmark)

    Ahler, T; Bendiksen, Mads; Krustrup, Peter

    2012-01-01

    This study analysed the reliability and validity of two intermittent running tests (the Yo-Yo IR1 test and the Andersen test) as tools for estimating VO(2max) in children under the age of 10. Two groups, aged 6-7 years (grade 0, n = 18) and 8-9 years (grade 2, n = 16), carried out two repetitions...

  3. The prone bridge test: Performance, validity, and reliability among older and younger adults.

    Science.gov (United States)

    Bohannon, Richard W; Steffl, Michal; Glenney, Susan S; Green, Michelle; Cashwell, Leah; Prajerova, Kveta; Bunn, Jennifer

    2018-04-01

    The prone bridge maneuver, or plank, has been viewed as a potential alternative to curl-ups for assessing trunk muscle performance. The purpose of this study was to assess prone bridge test performance, validity, and reliability among younger and older adults. Sixty younger (20-35 years old) and 60 older (60-79 years old) participants completed this study. Groups were evenly divided by sex. Participants completed surveys regarding physical activity and abdominal exercise participation. Height, weight, body mass index (BMI), and waist circumference were measured. On two occasions, 5-9 days apart, participants held a prone bridge until volitional exhaustion or until repeated technique failure. Validity was examined using data from the first session: convergent validity by calculating correlations between survey responses, anthropometrics, and prone bridge time, known groups validity by using an ANOVA comparing bridge times of younger and older adults and of men and women. Test-retest reliability was examined by using a paired t-test to compare prone bridge times for Session1 and Session 2. Furthermore, an intraclass correlation coefficient (ICC) was used to characterize relative reliability and minimal detectable change (MDC 95% ) was used to describe absolute reliability. The mean prone bridge time was 145.3 ± 71.5 s, and was positively correlated with physical activity participation (p ≤ 0.001) and negatively correlated with BMI and waist circumference (p ≤ 0.003). Younger participants had significantly longer plank times than older participants (p = 0.003). The ICC between testing sessions was 0.915. The prone bridge test is a valid and reliable measure for evaluating abdominal performance in both younger and older adults. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. 1:50 Scale Testing of Three Floating Wind Turbines at MARIN and Numerical Model Validation Against Test Data

    Energy Technology Data Exchange (ETDEWEB)

    Dagher, Habib [Univ. of Maine, Orno, ME (United States); Viselli, Anthony [Univ. of Maine, Orno, ME (United States); Goupee, Andrew [Univ. of Maine, Orno, ME (United States); Allen, Christopher [Univ. of Maine, Orno, ME (United States)

    2017-08-15

    The primary goal of the basin model test program discussed herein is to properly scale and accurately capture physical data of the rigid body motions, accelerations and loads for different floating wind turbine platform technologies. The intended use for this data is for performing comparisons with predictions from various aero-hydro-servo-elastic floating wind turbine simulators for calibration and validation. Of particular interest is validating the floating offshore wind turbine simulation capabilities of NREL’s FAST open-source simulation tool. Once the validation process is complete, coupled simulators such as FAST can be used with a much greater degree of confidence in design processes for commercial development of floating offshore wind turbines. The test program subsequently described in this report was performed at MARIN (Maritime Research Institute Netherlands) in Wageningen, the Netherlands. The models considered consisted of the horizontal axis, NREL 5 MW Reference Wind Turbine (Jonkman et al., 2009) with a flexible tower affixed atop three distinct platforms: a tension leg platform (TLP), a spar-buoy modeled after the OC3 Hywind (Jonkman, 2010) and a semi-submersible. The three generic platform designs were intended to cover the spectrum of currently investigated concepts, each based on proven floating offshore structure technology. The models were tested under Froude scale wind and wave loads. The high-quality wind environments, unique to these tests, were realized in the offshore basin via a novel wind machine which exhibits negligible swirl and low turbulence intensity in the flow field. Recorded data from the floating wind turbine models included rotor torque and position, tower top and base forces and moments, mooring line tensions, six-axis platform motions and accelerations at key locations on the nacelle, tower, and platform. A large number of tests were performed ranging from simple free-decay tests to complex operating conditions with

  5. Sex and stress: Men and women show different cortisol responses to psychological stress induced by the Trier social stress test and the Iowa singing social stress test.

    Science.gov (United States)

    Reschke-Hernández, Alaine E; Okerstrom, Katrina L; Bowles Edwards, Angela; Tranel, Daniel

    2017-01-02

    Acute psychological stress affects each of us in our daily lives and is increasingly a topic of discussion for its role in mental illness, aging, cognition, and overall health. A better understanding of how such stress affects the body and mind could contribute to the development of more effective clinical interventions and prevention practices. Over the past 3 decades, the Trier Social Stress Test (TSST) has been widely used to induce acute stress in a laboratory setting based on the principles of social evaluative threat, namely, a judged speech-making task. A comparable alternative task may expand options for examining acute stress in a controlled laboratory setting. This study uses a within-subjects design to examine healthy adult participants' (n = 20 men, n = 20 women) subjective stress and salivary cortisol responses to the standard TSST (involving public speaking and math) and the newly created Iowa Singing Social Stress Test (I-SSST). The I-SSST is similar to the TSST but with a new twist: public singing. Results indicated that men and women reported similarly high levels of subjective stress in response to both tasks. However, men and women demonstrated different cortisol responses; men showed a robust response to both tasks, and women displayed a lesser response. These findings are in line with previous literature and further underscore the importance of examining possible sex differences throughout various phases of research, including design, analysis, and interpretation of results. Furthermore, this nascent examination of the I-SSST suggests a possible alternative for inducing stress in the laboratory. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  6. Implementation of the validation testing in MPPG 5.a "Commissioning and QA of treatment planning dose calculations-megavoltage photon and electron beams".

    Science.gov (United States)

    Jacqmin, Dustin J; Bredfeldt, Jeremy S; Frigo, Sean P; Smilowitz, Jennifer B

    2017-01-01

    The AAPM Medical Physics Practice Guideline (MPPG) 5.a provides concise guidance on the commissioning and QA of beam modeling and dose calculation in radiotherapy treatment planning systems. This work discusses the implementation of the validation testing recommended in MPPG 5.a at two institutions. The two institutions worked collaboratively to create a common set of treatment fields and analysis tools to deliver and analyze the validation tests. This included the development of a novel, open-source software tool to compare scanning water tank measurements to 3D DICOM-RT Dose distributions. Dose calculation algorithms in both Pinnacle and Eclipse were tested with MPPG 5.a to validate the modeling of Varian TrueBeam linear accelerators. The validation process resulted in more than 200 water tank scans and more than 50 point measurements per institution, each of which was compared to a dose calculation from the institution's treatment planning system (TPS). Overall, the validation testing recommended in MPPG 5.a took approximately 79 person-hours for a machine with four photon and five electron energies for a single TPS. Of the 79 person-hours, 26 person-hours required time on the machine, and the remainder involved preparation and analysis. The basic photon, electron, and heterogeneity correction tests were evaluated with the tolerances in MPPG 5.a, and the tolerances were met for all tests. The MPPG 5.a evaluation criteria were used to assess the small field and IMRT/VMAT validation tests. Both institutions found the use of MPPG 5.a to be a valuable resource during the commissioning process. The validation testing in MPPG 5.a showed the strengths and limitations of the TPS models. In addition, the data collected during the validation testing is useful for routine QA of the TPS, validation of software upgrades, and commissioning of new algorithms. © 2016 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals, Inc. on behalf of

  7. Naturalistic validation of an on-road driving test of older drivers.

    Science.gov (United States)

    Ott, Brian R; Papandonatos, George D; Davis, Jennifer D; Barco, Peggy P

    2012-08-01

    The objective was to compare a standardized road test to naturalistic driving by older people who may have cognitive impairment to define improvements that could potentially enhance the validity of road testing in this population. Road testing has been widely adapted as a tool to assess driving competence of older people who may be at risk for unsafe driving because of dementia; however, the validity of this approach has not been rigorously evaluated. For 2 weeks, 80 older drivers (38 healthy elders and 42 with cognitive impairment) who passed a standardized road test were video recorded in their own vehicles. Using a standardized rating scale, 4 hr of video was rated by a driving instructor. The authors examine weighting of individual road test items to form global impressions and to compare road test and naturalistic driving using factor analyses of these two assessments. The road test score was unidimensional, reflecting a major factor related to awareness of signage and traffic behavior. Naturalistic driving reflected two factors related to lane keeping as well as traffic behavior. Maintenance of proper lane is an important dimension of driving safety that appears to be relatively underemphasized during the highly supervised procedures of the standardized road test. Road testing in this population could be improved by standardized designs that emphasize lane keeping and that include self-directed driving. Additional information should be sought from observers in the community as well as crash evidence when advising older drivers who may be cognitively impaired.

  8. Naturalistic Validation of an On-Road Driving Test of Older Drivers

    Science.gov (United States)

    Ott, Brian R.; Papandonatos, George D.; Davis, Jennifer D.; Barco, Peggy P.

    2013-01-01

    Objective The objective was to compare a standardized road test to naturalistic driving by older people who may have cognitive impairment to define improvements that could potentially enhance the validity of road testing in this population. Background Road testing has been widely adapted as a tool to assess driving competence of older people who may be at risk for unsafe driving because of dementia; however, the validity of this approach has not been rigorously evaluated. Method For 2 weeks, 80 older drivers (38 healthy elders and 42 with cognitive impairment) who passed a standardized road test were video recorded in their own vehicles. Using a standardized rating scale, 4 hr of video was rated by a driving instructor. The authors examine weighting of individual road test items to form global impressions and to compare road test and naturalistic driving using factor analyses of these two assessments. Results The road test score was unidimensional, reflecting a major factor related to awareness of signage and traffic behavior. Naturalistic driving reflected two factors related to lane keeping as well as traffic behavior. Conclusion Maintenance of proper lane is an important dimension of driving safety that appears to be relatively underemphasized during the highly supervised procedures of the standardized road test. Application Road testing in this population could be improved by standardized designs that emphasize lane keeping and that include self-directed driving. Additional information should be sought from observers in the community as well as crash evidence when advising older drivers who may be cognitively impaired. PMID:22908688

  9. How'd they do it? Malingering strategies on symptom validity tests.

    Science.gov (United States)

    Tan, Jing Ee; Slick, Daniel J; Strauss, Esther; Hultsch, David F

    2002-12-01

    Twenty-five undergraduate students were instructed to feign believable impairment following a brain injury from a car accident and 27 students were told to perform like they had recovered from such an injury. Three forced-choice tests, the Test of Memory Malingering (TOMM), Victoria Symptom Validity Test (VSVT), and Word Memory Test (WMT) were given. Test-taking strategies were evaluated by means of a questionnaire given at the end of the test session. The results revealed that all the tasks differentiated between groups. Using conventional cut-scores, the WMT proved most efficient while the VSVT captured the most participants in the definitive below-chance category. Individuals instructed to feign injury were more likely to prepare prior to the experiment, with feigning of memory loss as the most frequently reported strategy. Regardless, preparation effort did not translate into believable performance on the tests.

  10. Psychometric properties and convergent and predictive validity of an executive function test battery for two-year-olds

    Directory of Open Access Journals (Sweden)

    Hanna eMulder

    2014-07-01

    Full Text Available Executive function (EF is an important predictor of numerous developmental outcomes, such as academic achievement and behavioral adjustment. Although a plethora of measurement instruments exists to assess executive function in children, only few of these are suitable for toddlers, and even fewer have undergone psychometric evaluation. The present study evaluates the psychometric properties and validity of an assessment battery for measuring EF in two-year-olds. A sample of 2437 children were administered the assessment battery at a mean age of 2;4 years (SD = 0;3 years in a large-scale field study. Measures of both hot EF (snack and gift delay tasks and cool EF (six boxes, memory for location, and visual search task were included. Confirmatory Factor Analyses showed that a two-factor hot and cool EF model fitted the data better than a one-factor model. Measurement invariance was supported across groups differing in age, gender, socioeconomic status (SES, home language, and test setting. Criterion and convergent validity were evaluated by examining relationships between EF and age, gender, SES, home language, and parent and teacher reports of children’s attention and inhibitory control. Predictive validity of the test battery was investigated by regressing children’s pre-academic skills and behavioral problems at age three on the latent hot and cool EF factors at age two years. The test battery showed satisfactory psychometric quality and criterion, convergent, and predictive validity. Whereas cool EF predicted both pre-academic skills and behavior problems one year later, hot EF predicted behavior problems only. These results show that EF can be assessed with psychometrically sound instruments in children as young as two years, and that EF tasks can be reliably applied in large scale field research. The current instruments offer new opportunities for investigating EF in early childhood, and for evaluating interventions targeted at improving

  11. Making Sense of Fear Testing - Validating Common Behavioral Tests used in Swine

    Science.gov (United States)

    Tests to assess fear are commonly used in laboratory animals, such as mice and rats, when researchers wish to understand the implications of specific drugs, such as anxiolytics, or specific environments which may be used to house experimental animals. Researchers who study the welfare of livestock ...

  12. Combined analysis of 19 common validated type 2 diabetes susceptibility gene variants shows moderate discriminative value and no evidence of gene-gene interaction

    DEFF Research Database (Denmark)

    Sparsø, T; Grarup, N; Andreasen, C.

    2009-01-01

    study; and additional type 2 diabetic patients and glucose-tolerant individuals. The case-control studies involved 4,093 type 2 diabetic patients and 5,302 glucose-tolerant individuals. RESULTS: Single-variant analyses demonstrated allelic odds ratios ranging from 1.04 (95% CI 0.98-1.11) to 1.33 (95% CI...... analysis of the 19 validated variants enables detection of subgroups at substantially increased risk of type 2 diabetes; however, the discrimination between glucose-tolerant and type 2 diabetes individuals is still too inaccurate to achieve clinical value.......AIMS/HYPOTHESIS: The list of validated type 2 diabetes susceptibility variants has recently been expanded from three to 19. The variants identified are common and have low penetrance in the general population. The aim of the study is to investigate the combined effect of the 19 variants by applying...

  13. Validation of the Danish Addenbrooke's Cognitive Examination as a screening test in a memory clinic

    DEFF Research Database (Denmark)

    Stokholm, Jette; Vogel, Asmus; Johannsen, Peter

    2009-01-01

    BACKGROUND: Addenbrooke's Cognitive Examination (ACE) is a cognitive screening test developed to detect dementia. It has been validated in several countries. Validation studies have predominantly included patients with various degrees of dementia and healthy controls. OBJECTIVE: The aim...... (MMSE >or=20), 30 non-demented patients diagnosed with depression (originally referred for evaluation of cognitive symptoms), and 63 healthy volunteers, all between 60 and 85 years of age, were included. All patients were given the ACE as a supplement to the standard diagnostic work-up. RESULTS: The cut...

  14. Danish validation of sniffin' sticks olfactory test for threshold, discrimination, and identification

    DEFF Research Database (Denmark)

    Niklassen, Andreas Steenholt; Ovesen, Therese; Fernandes, Henrique

    2017-01-01

    to investigate external validity of international normative values to separate hyposmia from normosmia. METHODS: The study included 388 participants. The first step was a questionnaire study in which 238 adults rated their familiarity with 125 odor descriptors. In the second step, we evaluated the original...... in improvement of familiarity and rate of I, making the test valid for use in Denmark. Furthermore, the study found a large variation in T and D scores between different countries, which should be considered when using these scores to separate hyposmia and anosmia from normosmia. LEVEL OF EVIDENCE: 2b...

  15. Recent Advances in Simulation of Eddy Current Testing of Tubes and Experimental Validations

    Science.gov (United States)

    Reboud, C.; Prémel, D.; Lesselier, D.; Bisiaux, B.

    2007-03-01

    Eddy current testing (ECT) is widely used in iron and steel industry for the inspection of tubes during manufacturing. A collaboration between CEA and the Vallourec Research Center led to the development of new numerical functionalities dedicated to the simulation of ECT of non-magnetic tubes by external probes. The achievement of experimental validations led us to the integration of these models into the CIVA platform. Modeling approach and validation results are discussed here. A new numerical scheme is also proposed in order to improve the accuracy of the model.

  16. Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

    Science.gov (United States)

    Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

    2016-01-01

    Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149

  17. [Validation of the AUDIT test for identifying risk consumption and alcohol use disorders in women].

    Science.gov (United States)

    Pérula de Torres, L A; Fernández-García, J A; Arias-Vega, R; Muriel-Palomino, M; Márquez-Rebollo, E; Ruiz-Moral, R

    2005-11-30

    To validate the AUDIT test for identifying women with excess alcohol consumption and/or dependency syndrome (DS). Descriptive study to validate a test. Two primary care centres and a county drug-dependency centre. 414 women from 18 to 75 recruited at the clinic. Interventions. Social and personal details were obtained through personal interview, their alcohol consumption was quantified and the AUDIT and MALT questionnaires were filled in. Then the semi-structured SCAN interview was conducted (gold standard; DSM-IV and CIE-10 criteria), and analyses were requested (GGT, GOT, GPT, VCM). 186 patients were given a follow-up appointment three-four weeks later (retest). Intra-observer reliability was evaluated with the Kappa index, internal consistency with Cronbach s alpha, and the validity of criteria with indexes of sensitivity and specificity, predictive values and probability quotients. To evaluate the diagnostic performance of the test and the most effective cut-off point, a ROC analysis was run. 11.4% (95% CI, 8.98-13.81) were diagnosed with alcohol abuse (0.5%) or DS (10.9%). The Kappa coefficients of the AUDIT items ranged between 0.685 and 0.795 (PAUDIT is a questionnaire with good psycho-measurement properties. It is reliable and valid for the detection of risk consumption and DS in women.

  18. [Validity of AUDIT test for detection of disorders related with alcohol consumption in women].

    Science.gov (United States)

    Pérula-de Torres, Luis Angel; Fernández-García, José Angel; Arias-Vega, Raquel; Muriel-Palomino, María; Márquez-Rebollo, Encarnación; Ruiz-Moral, Roger

    2005-11-26

    Early detection of patients with alcohol problems is important in clinical practice. The AUDIT (Alcohol Use Disorders Identification Test) questionnaire is a valid tool for this aim, especially in the male population. The objective of this study was to validate how useful is this questionnaire in females patients and to assess their test cut-off point for the diagnosis of alcohol problems in women. 414 woman were recruited in 2 health center and specialized center for addiction treatment. The AUDIT test and a semistructured interview (SCAN as gold standard) were performed to all patients. Internal consistency and criteria validity was assessed. Cronbach alpha was 0.93 (95% confidence interval [CI], 0.921-0.941). When the DSM-IV was taken as reference the most useful cut-off point was 6 points, with 89.6% (95% CI, 76.11-96.02) sensitivity and 95.07% (95% CI, 92.18-96.97) specificity. When CIE-10 was taken as reference the sensitivity was 89.58% (95% CI, 76.56-96.10) and the specificity was 95.33% (95% CI, 92.48-97.17). AUDIT is a questionnaire with good psychometrics properties and is valid for detecting dependence and risk alcohol consumption in women.

  19. The predictive validity of the BioMedical Admissions Test for pre-clinical examination performance.

    Science.gov (United States)

    Emery, Joanne L; Bell, John F

    2009-06-01

    Some medical courses in the UK have many more applicants than places and almost all applicants have the highest possible previous and predicted examination grades. The BioMedical Admissions Test (BMAT) was designed to assist in the student selection process specifically for a number of 'traditional' medical courses with clear pre-clinical and clinical phases and a strong focus on science teaching in the early years. It is intended to supplement the information provided by examination results, interviews and personal statements. This paper reports on the predictive validity of the BMAT and its predecessor, the Medical and Veterinary Admissions Test. Results from the earliest 4 years of the test (2000-2003) were matched to the pre-clinical examination results of those accepted onto the medical course at the University of Cambridge. Correlation and logistic regression analyses were performed for each cohort. Section 2 of the test ('Scientific Knowledge') correlated more strongly with examination marks than did Section 1 ('Aptitude and Skills'). It also had a stronger relationship with the probability of achieving the highest examination class. The BMAT and its predecessor demonstrate predictive validity for the pre-clinical years of the medical course at the University of Cambridge. The test identifies important differences in skills and knowledge between candidates, not shown by their previous attainment, which predict their examination performance. It is thus a valid source of additional admissions information for medical courses with a strong scientific emphasis when previous attainment is very high.

  20. Reliability and validity of videotaped functional performance tests in ACL-injured subjects

    DEFF Research Database (Denmark)

    von Porat, Anette; Holmström, Eva; Roos, Ewa

    2008-01-01

    BACKGROUND AND PURPOSE: In clinical practice, visual observation is often used to determine functional impairment and to evaluate treatment following a knee injury. The aim of this study was to evaluate the reliability and validity of observational assessments of knee movement pattern quality......, crossover hop on one leg and one-leg hop. The videos were observed by four physiotherapists, and the knee movement pattern quality, a feature of the loading strategy of the lower extremity, was scored on an 11-point rating scale. To assess the criterion validity, the observational rating was correlated...... obtained between the observers' assessment and knee flexion angle, r = 0.37-0.61. The crossover hop test or one-leg hop test was ranked as the most useful test in 172 of 192 occasions (90%) when assessing knee function. CONCLUSION: The moderate to good inter-observer reliability and the moderate criterion...