WorldWideScience

Sample records for psychometric test scores

  1. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    Science.gov (United States)

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  2. Psychometric Evaluation of the Lower Extremity Computerized Adaptive Test, the Modified Harris Hip Score, and the Hip Outcome Score.

    Science.gov (United States)

    Hung, Man; Hon, Shirley D; Cheng, Christine; Franklin, Jeremy D; Aoki, Stephen K; Anderson, Mike B; Kapron, Ashley L; Peters, Christopher L; Pelt, Christopher E

    2014-12-01

    The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Cohort study (diagnosis); Level of evidence, 2. Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior in all psychometric aspects examined in this study. Future

  3. The use of test scores from large-scale assessment surveys: psychometric and statistical considerations

    Directory of Open Access Journals (Sweden)

    Henry Braun

    2017-11-01

    Full Text Available Abstract Background Economists are making increasing use of measures of student achievement obtained through large-scale survey assessments such as NAEP, TIMSS, and PISA. The construction of these measures, employing plausible value (PV methodology, is quite different from that of the more familiar test scores associated with assessments such as the SAT or ACT. These differences have important implications both for utilization and interpretation. Although much has been written about PVs, it appears that there are still misconceptions about whether and how to employ them in secondary analyses. Methods We address a range of technical issues, including those raised in a recent article that was written to inform economists using these databases. First, an extensive review of the relevant literature was conducted, with particular attention to key publications that describe the derivation and psychometric characteristics of such achievement measures. Second, a simulation study was carried out to compare the statistical properties of estimates based on the use of PVs with those based on other, commonly used methods. Results It is shown, through both theoretical analysis and simulation, that under fairly general conditions appropriate use of PV yields approximately unbiased estimates of model parameters in regression analyses of large scale survey data. The superiority of the PV methodology is particularly evident when measures of student achievement are employed as explanatory variables. Conclusions The PV methodology used to report student test performance in large scale surveys remains the state-of-the-art for secondary analyses of these databases.

  4. Psychometric properties of the Cumulated Ambulation Score

    DEFF Research Database (Denmark)

    Ferriero, Giorgio; Kristensen, Morten T; Invernizzi, Marco

    2018-01-01

    INTRODUCTION: In the geriatric population, independent mobility is a key factor in determining readiness for discharge following acute hospitalization. The Cumulated Ambulation Score (CAS) is a potentially valuable score that allows day-to-day measurements of basic mobility. The CAS was developed...... and validated in older patients with hip fracture as an early postoperative predictor of short-term outcome, but it is also used to assess geriatric in-patients with acute medical illness. Despite the fast- accumulating literature on the CAS, to date no systematic review synthesizing its psychometric properties....... Of 49 studies identified, 17 examined the psychometric properties of the CAS. EVIDENCE SYNTHESIS: Most papers dealt with patients after hip fracture surgery, and only 4 studies assessed the CAS psychometric characteristics also in geriatric in-patients with acute medical illness. Two versions of CAS...

  5. Normalization of the psychometric hepatic encephalopathy score for ...

    African Journals Online (AJOL)

    Aim: To construct normal values for the tests of the psychometric hepatic encephalopathy score (PHES) and evaluate the prevalence of minimal hepatic encephalopathy (MHE) among Turkish patients with liver cirrhosis. Materials and Methods: One hundred and eighty-five healthy subjects and sixty patients with liver ...

  6. Normalization of the Psychometric Hepatic Encephalopathy score ...

    African Journals Online (AJOL)

    2016-05-09

    May 9, 2016 ... influenced by age, education levels, and gender.[5] Till date, the PHES ... and death. MHE also increases the risk of development ... large circles beginning from each row on the left and working to the right. The test score is the ...

  7. Smoking habit and psychometric scores: a community study.

    Science.gov (United States)

    Waal-Manning, H J; de Hamel, F A

    1978-09-13

    During the Milton health survey subjects completed a psychometric inventory consisting of the 48 questions of the Middlesex Hospital questionnaire (MHQ) and 26 from the hostility and direction of hostility questionnaire (HDHQ) designed to examine nine psychological dimensions. The 1209 subjects were classified into smoking categories and the scores for each psychometric trait were calculated. Women scored higher than men and heavy smokers scored higher than "never smokers". The psychometric traits and the scores of the four smoking categories after correcting for age and Quetelet's index showed statistically significant differences by analysis of variance in respect of somatic anxiety and depression for both men and women; and free-floating anxiety, phobic anxiety, hysteria, acting out hostility, self criticism and guilt in women. For somatic anxiety the increase in score almost exactly paralleled the increasing quantity of tobacco consumed.

  8. Testing Psychometrics of Healthcare Empowerment Questionnaires ...

    African Journals Online (AJOL)

    Testing Psychometrics of Healthcare Empowerment Questionnaires (HCEQ) among Iranian ... PROMOTING ACCESS TO AFRICAN RESEARCH ... translation and backtranslation procedures, pilot testing, and getting views of expert panel.

  9. Psychometrics behind Computerized Adaptive Testing.

    Science.gov (United States)

    Chang, Hua-Hua

    2015-03-01

    The paper provides a survey of 18 years' progress that my colleagues, students (both former and current) and I made in a prominent research area in Psychometrics-Computerized Adaptive Testing (CAT). We start with a historical review of the establishment of a large sample foundation for CAT. It is worth noting that the asymptotic results were derived under the framework of Martingale Theory, a very theoretical perspective of Probability Theory, which may seem unrelated to educational and psychological testing. In addition, we address a number of issues that emerged from large scale implementation and show that how theoretical works can be helpful to solve the problems. Finally, we propose that CAT technology can be very useful to support individualized instruction on a mass scale. We show that even paper and pencil based tests can be made adaptive to support classroom teaching.

  10. Psychometrics

    NARCIS (Netherlands)

    Borsboom, D.; Molenaar, D.; Wright, J.D.

    2015-01-01

    Psychometrics is a scientific discipline concerned with the construction of measurement models for psychological data. In these models, a theoretical construct (e.g., intelligence) is systematically coordinated with observables (e.g., IQ scores). This is often done through latent variable models,

  11. Nonparametric tests for equality of psychometric functions.

    Science.gov (United States)

    García-Pérez, Miguel A; Núñez-Antón, Vicente

    2017-12-07

    Many empirical studies measure psychometric functions (curves describing how observers' performance varies with stimulus magnitude) because these functions capture the effects of experimental conditions. To assess these effects, parametric curves are often fitted to the data and comparisons are carried out by testing for equality of mean parameter estimates across conditions. This approach is parametric and, thus, vulnerable to violations of the implied assumptions. Furthermore, testing for equality of means of parameters may be misleading: Psychometric functions may vary meaningfully across conditions on an observer-by-observer basis with no effect on the mean values of the estimated parameters. Alternative approaches to assess equality of psychometric functions per se are thus needed. This paper compares three nonparametric tests that are applicable in all situations of interest: The existing generalized Mantel-Haenszel test, a generalization of the Berry-Mielke test that was developed here, and a split variant of the generalized Mantel-Haenszel test also developed here. Their statistical properties (accuracy and power) are studied via simulation and the results show that all tests are indistinguishable as to accuracy but they differ non-uniformly as to power. Empirical use of the tests is illustrated via analyses of published data sets and practical recommendations are given. The computer code in MATLAB and R to conduct these tests is available as Electronic Supplemental Material.

  12. Assessment of Minimal HE (with emphasis on computerized psychometric tests)

    Science.gov (United States)

    Kappus, Matthew R; Bajaj, Jasmohan S

    2012-01-01

    Synopsis Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life and driving accidents. The detection of MHE requires specialized testing since it cannot by definition, be diagnosed on standard clinical examination. Psychometric (paper-pencil or computerized or a combination) and neuro-physiological techniques are often used to test for MHE. Paper-pencil psychometric batteries like the Psychometric Hepatic Encephalopathy Score (PHES) have been validated in several countries but do not have US normative values. Computerized tests such as the inhibitory control test (ICT), cognitive drug research system and Scan test have proven useful to diagnose MHE and predict outcomes. The specificity and sensitivity of these tests are similar to the recommended gold standards. Neuro-physiological tests such as the EEG and its interpretations, evoked potentials and Critical Flicker Frequency (CFF) also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike and the testing strategies depend on the normative data available, patient comfort and local expertise. PMID:22321464

  13. Psychometric testing and Human Resource Management

    Directory of Open Access Journals (Sweden)

    R. P. van der Merwe

    2002-09-01

    Full Text Available This is a cumulative report on the findings of various exploratory research that were done with regard to the practice of psychometric testing in the Eastern Cape. Recent and ongoing developments in the South African labour legislation, and especially the implications of the Employment Equity Act, highlight once again the importance of the validation of all instruments to be used for human assessment and selection purposes. Information was gathered to establish which psychometric tests are used, and for what purposes, in industry today. Biographical information on each organisation is supplied, including the number of employees. The role of psychometric testing in the selection procedure is discussed. The different tests used, as well as the test users, are also indicated. The findings of other, related research, as well as comments, recommendations and shortcomings, are discussed. Opsomming Hierdie is ‘n kumulatiewe verslag wat die resultate verstrek van verskeie verkennende ondersoeke wat gedoen is na die aanwending van psigometriese toetsing in die Oos-Kaap. Onlangse en voortdurende ontwikkelinge in die Suid-Afrikaanse arbeidswetgewing, en veral die implikasies van die Wet op Gelyke Indiensneming, beklemtoon weer eens die belangrikheid van die validering van enige instrumente wat gebruik word vir evaluerings- en keuringsdoeleindes van individue. Inligting is ingewin om te bepaal watter psigometriese toetse, sowel as vir watter doel, vandag in die bedryf gebruik word. Biografiese inligting oor die onderskeie organisasies, insluitende hul aantal werknemers, word verstrek. Die rol van psigometriese toetsing in die keuringsproses word bespreek. Die verskillende toetse wat deur die organisasies gebruik word, sowel as die toetsge-bruikers, word ook aangedui. Die bevindinge van ander, relevante navorsing, sowel as opmerkings, aanbevelings en tekortkominge word bespreek.

  14. Do Test Scores Buy Happiness?

    Science.gov (United States)

    McCluskey, Neal

    2017-01-01

    Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…

  15. Predicting occupational personality test scores.

    Science.gov (United States)

    Furnham, A; Drakeley, R

    2000-01-01

    The relationship between students' actual test scores and their self-estimated scores on the Hogan Personality Inventory (HPI; R. Hogan & J. Hogan, 1992), an omnibus personality questionnaire, was examined. Despite being given descriptive statistics and explanations of each of the dimensions measured, the students tended to overestimate their scores; yet all correlations between actual and estimated scores were positive and significant. Correlations between self-estimates and actual test scores were highest for sociability, ambition, and adjustment (r = .62 to r = .67). The results are discussed in terms of employers' use and abuse of personality assessment for job recruitment.

  16. Psychometric challenges and proposed solutions when scoring facial emotion expression codes

    OpenAIRE

    Olderbak, Sally; Hildebrandt, Andrea; Pinkpank, Thomas; Sommer, Werner; Wilhelm, Oliver

    2013-01-01

    Coding of facial emotion expressions is increasingly performed by automated emotion expression scoring software; however, there is limited discussion on how best to score the resulting codes. We present a discussion of facial emotion expression theories and a review of contemporary emotion expression coding methodology. We highlight methodological challenges pertinent to scoring software-coded facial emotion expression codes and present important psychometric research questions centered on co...

  17. A Psychometric Evaluation of the Mayer-Salovey-Caruso Emotional Intelligence Test Version 2.0

    Science.gov (United States)

    Palmer, B.R.; Gignac, G.; Manocha, R.; Stough, C.

    2005-01-01

    and discussed.There has been some debate recently over the scoring, reliability and factor structure of ability measures of emotional intelligence (EI). This study examined these three psychometric properties with the most recent ability test of EI, the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT V2.0; Mayer, Salovey, & Caruso,…

  18. Psychometric Properties of the Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite Scale.

    Science.gov (United States)

    Barnett, Carolina; Merkies, Ingemar S J; Katzberg, Hans; Bril, Vera

    2015-09-02

    The Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite are two commonly used outcome measures in Myasthenia Gravis. So far, their measurement properties have not been compared, so we aimed to study their psychometric properties using the Rasch model. 251 patients with stable myasthenia gravis were assessed with both scales, and 211 patients returned for a second assessment. We studied fit to the Rasch model at the first visit, and compared item fit, thresholds, differential item functioning, local dependence, person separation index, and tests for unidimensionality. We also assessed test-retest reliability and estimated the Minimal Detectable Change. Neither scale fit the Rasch model (X2p Myasthenia Gravis Composite had lower discrimination properties than the Quantitative Myasthenia Gravis Scale (Person Separation Index: 0.14 and 0.7). There was local dependence in both scales, as well as differential item functioning for ocular and generalized disease. Disordered thresholds were found in 6(60%) items of the Myasthenia Gravis Composite and in 4(31%) of the Quantitative Myasthenia Gravis Score. Both tools had adequate test-retest reliability (ICCs >0.8). The minimally detectable change was 4.9 points for the Myasthenia Gravis Composite and 4.3 points for the Quantitative Myasthenia Gravis Score. Neither scale fulfilled Rasch model expectations. The Quantitative Myasthenia Gravis Score has higher discrimination than the Myasthenia Gravis Composite. Both tools have items with disordered thresholds, differential item functioning and local dependency. There was evidence of multidimensionality in the QMGS. The minimal detectable change values are higher than previous studies on the minimal significant change. These findings might inform future modifications of these tools.

  19. Raven's Test Performance of Sub-Saharan Africans: Average Performance, Psychometric Properties, and the Flynn Effect

    Science.gov (United States)

    Wicherts, Jelte M.; Dolan, Conor V.; Carlson, Jerry S.; van der Maas, Han L. J.

    2010-01-01

    This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's test scores as measures of general intelligence.…

  20. Psychometrics evaluation of Charcot-Marie-Tooth Neuropathy Score (CMTNSv2) second version, using Rasch analysis.

    Science.gov (United States)

    Sadjadi, Reza; Reilly, Mary M; Shy, Michael E; Pareyson, Davide; Laura, Matilde; Murphy, Sinead; Feely, Shawna M E; Grider, Tiffany; Bacon, Chelsea; Piscosquito, Giuseppe; Calabrese, Daniela; Burns, Ted M

    2014-09-01

    Charcot-Marie-Tooth Neuropathy Score second version (CMTNSv2) is a validated clinical outcome measure developed for use in clinical trials to monitor disease impairment and progression in affected CMT patients. Currently, all items of CMTNSv2 have identical contribution to the total score. We used Rasch analysis to further explore psychometric properties of CMTNSv2, and in particular, category response functioning, and their weight on the overall disease progression. Weighted category responses represent a more accurate estimate of actual values measuring disease severity and therefore could potentially be used in improving the current version. © 2014 Peripheral Nerve Society.

  1. Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

    Science.gov (United States)

    Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

    2011-08-15

    Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.

  2. Translation, cross-cultural adaptation, and psychometric properties of the German version of the hip disability and osteoarthritis outcome score.

    Science.gov (United States)

    Blasimann, Angela; Dauphinee, Sharon Wood; Staal, J Bart

    2014-12-01

    Clinical measurement. To translate and cross-culturally adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) from English into German, and to study its psychometric properties in patients after hip surgery. There is no specific hip questionnaire in German that not only measures symptoms and function but also contains items about hip-related quality of life. The translation and cross-cultural adaptation involved forward translation, harmonization, cognitive debriefing, back translation, and comparison to the original HOOS following international guidelines. The German version was tested in 51 Swiss inpatients 8 weeks after different types of hip surgery, mainly total hip replacement. The mean age of the participants was 62.5 years, and the age range was from 27 to 87 years. Thirty (58.8%) of the participants were women. Internal consistency and test-retest reliability were estimated using Cronbach alpha and intraclass correlation coefficients for agreement. For construct validity, total scores of the German HOOS were correlated with those of the Western Ontario and McMaster Universities Osteoarthritis Index. The HOOS was also compared to the Medical Outcomes Study 36-Item Short-Form Health Survey. Cronbach alpha values for all German HOOS subscales were between .87 and .93. For test-retest reliability, the intraclass correlation coefficient for agreement was 0.85 for the total scores of the German HOOS. The Spearman rho for the Medical Outcomes Study 36-Item Short-Form Health Survey physical functioning subscale compared to the sum of all HOOS subscales was 0.71, and that for the Medical Outcomes Study 36-Item Short-Form Health Survey physical component summary was 0.97. The German HOOS has demonstrated adequate reliability and validity. Use of the German HOOS is recommended for assessment of patients after hip surgery, with the proviso that additional psychometric testing should be done in future research.

  3. The Rorschach: projective technique or psychometric test?

    Science.gov (United States)

    Aronow, E; Reznikoff, M; Moreland, K L

    1995-04-01

    Various approaches to the Rorschach Technique are described in terms of the idiographic-nomothetic axis and the perceptual-content axis. It is suggested that it is most productive to view the Rorschach as a projective tool, with perceptual scoring a secondary factor. Current efforts at objectification of the Rorschach are not seen as useful as efforts to enhance its projective qualities. Some possible ways are discussed in which the projective value of the instrument can be maximized.

  4. Development and psychometric validation of the verbal affective memory test

    DEFF Research Database (Denmark)

    Jensen, Christian Gaden; Hjordt, Liv V; Stenbæk, Dea S

    2015-01-01

    . Furthermore, larger seasonal decreases in positive recall significantly predicted larger increases in depressive symptoms. Retest reliability was satisfactory, rs ≥ .77. In conclusion, VAMT-24 is more thoroughly developed and validated than existing verbal affective memory tests and showed satisfactory...... psychometric properties. VAMT-24 seems especially sensitive to measuring positive verbal recall bias, perhaps due to the application of common, non-taboo words. Based on the psychometric and clinical results, we recommend VAMT-24 for international translations and studies of affective memory.......We here present the development and validation of the Verbal Affective Memory Test-24 (VAMT-24). First, we ensured face validity by selecting 24 words reliably perceived as positive, negative or neutral, respectively, according to healthy Danish adults' valence ratings of 210 common and non...

  5. Scoring and psychometric validation of the Perception of Anticoagulant Treatment Questionnaire (PACT-Q©

    Directory of Open Access Journals (Sweden)

    Essers B

    2009-04-01

    Full Text Available Abstract Background The 'Perception of Anti-Coagulant Treatment Questionnaire' (PACT-Q was developed to assess patients' expectations of, and satisfaction with their anticoagulant treatment. This questionnaire needs to be finalised and psychometrically validated. Methods The PACT-Q was included in the United States, the Netherlands and France into three phase III multinational clinical trials conducted to evaluate efficacy and safety of a new long-acting anticoagulant drug (idraparinux compared to vitamin K antagonist (VKA. PACT-Q was administered to patients with deep venous thrombosis (DVT, atrial fibrillation (AF or pulmonary embolism (PE at Day 1, to assess patients' expectations, and at 3 and 6 months to assess patients' satisfaction and treatment convenience and burden. The final structure of the PACT-Q (Principal Component Analysis – PCA – with Varimax Rotation was first determined and its psychometric properties were then measured with validity of the structure (Multitrait analysis, internal consistency reliability (Cronbach's alpha coefficients and known-group validity. Results PCA and multitrait analyses showed the multidimensionality of the "Treatment Expectations" dimension, comprising 7 items that had to be scored independently. The "Convenience" and "Burden of Disease and Treatment" dimensions of the hypothesised original structure of the questionnaire were combined, thus resulting in 13 items grouped into the single dimension "Convenience". The "Anticoagulant Treatment Satisfaction" dimension remained unchanged and included 7 items. All items of the "Convenience" and "Anticoagulant Treatment Satisfaction" dimensions displayed good convergent and discriminant validity. The internal consistency reliability was good, with a Cronbach's alpha of 0.84 for the "Convenience" dimension, and 0.76 for the "Anticoagulant Treatment Satisfaction" dimension. Known-group validity was good, especially with regard to occurrence of

  6. [Psychometric validation of the telephone memory test].

    Science.gov (United States)

    Ortiz, T; Fernández, A; Martínez-Castillo, E; Maestú, F; Martínez-Arias, R; López-Ibor, J J

    1999-01-01

    Several pathologies (i.e. Alzheimer's disease) that courses with memory alterations, appears in a context of impaired cognitive status and mobility. In recent years, several investigations were carried out in order to design short batteries that detect those subjects under risk of dementia. Some of this batteries were also design to be administrated over the telephone, trying to overcome the accessibility limitations of this patients. In this paper we present a battery (called Autotest de Memoria) essentially composed by episodic and semantic memory tests, administered both over the telephone and face to face. This battery was employed in the cognitive assessment of healthy controls and subjects diagnosed as probable Alzheimer's disease patients. Results show the capability of this battery in order to discriminate patients and healthy controls, a great sensibility and specificity, and a nearly absolute parallelism of telephone and face to face administrations. These data led us to claim the usefulness and practicality of our so called Memoria>.

  7. Translation, Cross-cultural Adaptation, and Psychometric Properties of the German Version of the Hip Disability and Osteoarthritis Outcome Score

    NARCIS (Netherlands)

    Blasimann, A.; Dauphinee, S.W.; Staal, J.B.

    2014-01-01

    Study Design Clinical measurement. Objectives To translate and cross-culturally adapt the Hip disability and Osteoarthritis Outcome Score (HOOS) from English into German, and to study its psychometric properties in patients after hip surgery. Background There is no specific hip questionnaire in

  8. Summary of Score Changes (in other Tests).

    Science.gov (United States)

    Cleary, T. Anne; McCandless, Sam A.

    Scholastic Aptitude Test (SAT) scores have declined during the last 14 years. Similar score declines have been observed in many different testing programs, many groups, and tested areas. The declines, while not large in any given year, have been consistent over time, area, and group. The period around 1965 is critical for the interpretation of…

  9. Patient activation in Europe: an international comparison of psychometric properties and patients' scores on the short form Patient Activation Measure (PAM-13).

    Science.gov (United States)

    Rademakers, Jany; Maindal, Helle Terkildsen; Steinsbekk, Aslak; Gensichen, Jochen; Brenk-Franz, Katja; Hendriks, Michelle

    2016-10-12

    To allow better assessment of patients' individual competencies for self-management, the Patient Activation Measure (PAM) has been developed in the USA. Because the American studies have shown the PAM to be a valuable tool, several European countries have translated the instrument into their native languages (Danish, Dutch, German, Norwegian). The aim was to compare the psychometric properties in studies from the different countries and establish whether the scores on the PAM vary between the studies. Data from the four separate studies were subjected to the same data cleaning procedures and statistical analyses. The psychometric properties of the instruments were established with measures of data quality and scale structure. The mean patient activation score and distribution across four predefined activation levels were described and the differences between the four studies were tested with ANOVA (unadjusted and adjusted) followed by a post-hoc Tukey HSD test and the Pearson chi-squared test respectively. The total N of the four studies was 5184. The percentage of missing values was low in all datasets, confirming the good quality of the datasets. Factor analyses revealed moderate to strong factor loadings on the first factor in all datasets. Cronbach's α was high for all version, ranging from .80 (German) to .88 (Dutch). Item-rest correlations varied between .32 and .66, indicating a moderate to strong correlation of the individual items to the sum scale. Both the mean PAM score and the distribution across activation levels differed between the four datasets. After adjustment of the PAM score, patients in Norway in particular had a higher patient activation level. The European translations of PAM-13 (into Danish, Dutch, German and Norwegian) resulted in four instruments with good psychometric capabilities for measuring patient activation. The mean PAM score and the distribution across activation levels differed between the four datasets.

  10. Psychometric properties of the Plutchik's EPI test (Emotions Profile Index

    Directory of Open Access Journals (Sweden)

    Ana Trebovc

    2005-04-01

    Full Text Available Authors report a study on psychometric properties of Plutchik's test, called Emotions Profile Index (EPI. A new Slovene translation and adaptation of English version of the test, consisting of combinations (pairs of 12 words reflecting eight different emotional conditions, was prepared and compared to the old one. Both versions as well as the Big Five Questionnaire (BFQ were administered on the sample of 239 participants. Different statistical analyses were performed examining psychometric features of both versions of EPI. Discriminative power was tested by cluster analysis and analysis of frequency distributions, reliability was studied via internal consistency index and correlation between the two versions, and validity was examined by correlating PIE dimensions with BFQ dimensions and subdimensions, by comparing profiles of groups on both versions of EPI and BFQ and by fitting the theoretical model proposed by Plutchik to the data. Discriminative power of EPI seems to be affected by avoiding (not choosing the socially desirable expressions in the test, parallel reliability seems to be susceptible to the use of different words (expressions in the new version of EPI having the same meaning as words in the old version. Dimensions expected to reflect similar constructs in BFQ and EPI do not correlate satisfactory. Data gathered with EPI cannot be fully explained with the model proposed by Plutchik's theory.

  11. A New Clinical Pain Knowledge Test for Nurses: Development and Psychometric Evaluation.

    Science.gov (United States)

    Bernhofer, Esther I; St Marie, Barbara; Bena, James F

    2017-08-01

    All nurses care for patients with pain, and pain management knowledge and attitude surveys for nurses have been around since 1987. However, no validated knowledge test exists to measure postlicensure clinicians' knowledge of the core competencies of pain management in current complex patient populations. To develop and test the psychometric properties of an instrument designed to measure pain management knowledge of postlicensure nurses. Psychometric instrument validation. Four large Midwestern U.S. hospitals. Registered nurses employed full time and part time August 2015 to April 2016, aged M = 43.25 years; time as RN, M = 16.13 years. Prospective survey design using e-mail to invite nurses to take an electronic multiple choice pain knowledge test. Content validity of initial 36-item test "very good" (95.1% agreement). Completed tests that met analysis criteria, N = 747. Mean initial test score, 69.4% correct (range 27.8-97.2). After revision/removal of 13 unacceptable questions, mean test score was 50.4% correct (range 8.7-82.6). Initial test item percent difficulty range was 15.2%-98.1%; discrimination values range, 0.03-0.50; final test item percent difficulty range, 17.6%-91.1%, discrimination values range, -0.04 to 1.04. Split-half reliability final test was 0.66. A high decision consistency reliability was identified, with test cut-score of 75%. The final 23-item Clinical Pain Knowledge Test has acceptable discrimination, difficulty, decision consistency, reliability, and validity in the general clinical inpatient nurse population. This instrument will be useful in assessing pain management knowledge of clinical nurses to determine gaps in education, evaluate knowledge after pain management education, and measure research outcomes. Copyright © 2017 American Society for Pain Management Nursing. Published by Elsevier Inc. All rights reserved.

  12. Development and psychometric properties of the Basic Amputee Mobility Score for use in patients with a major lower extremity amputation

    DEFF Research Database (Denmark)

    Kristensen, Morten Tange; Nielsen, Anni Østergaard; Topp, Ulla Madsen

    2018-01-01

    AIM: To develop and examine the psychometric properties, including responsiveness and interrater reliability, of a new outcome measure for the evaluation of basic mobility activities after a major lower extremity amputation - The Basic Amputee Mobility Score (BAMS). METHODS: The four following es...... a large responsiveness, excellent interrater reliability and with a change of 1 point indicating a real change in performances. Geriatr Gerontol Int 2017; ••: ••-••....

  13. Enhancing rigour in the validation of patient reported outcome measures (PROMs: bridging linguistic and psychometric testing

    Directory of Open Access Journals (Sweden)

    Roberts Gwerfyl

    2012-06-01

    Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115 and patients with depression (n = 37 concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96; item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94 and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002; and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.

  14. Psychometric properties of a Swedish translation of the VISA-P outcome score for patellar tendinopathy.

    Science.gov (United States)

    Frohm, Anna; Saartok, Tönu; Edman, Gunnar; Renström, Per

    2004-12-18

    Self-administrated patient outcome scores are increasingly recommended for evaluation of primary outcome in clinical studies. The VISA-P score, developed at the Victorian Institute of Sport Assessment in Melbourne, Australia, is a questionnaire developed for patients with patellar tendinopathy and the patients assess severity of symptoms, function and ability to participate in sport. The aim of this study was to translate the questionnaire into Swedish and to study the reliability and validity of the translated questionnaire and resultant scores. The questionnaire was translated into Swedish according to internationally recommended guidelines for cross-cultural adaptation of self-report measures. The reliability and validity were tested in three different populations. The populations used were healthy students (n = 17), members of the Swedish male national basketball team (n = 17), considered as a population at risk, and a group of non-surgically treated patients (n = 17) with clinically diagnosed patellar tendinopathy. The questionnaire was completed by 51 subjects altogether. The translated VISA-P questionnaire showed very good test-retest reliability (ICC = 0.97).The mean (+/- SD) of the VISA-P score, at both the first and second test occasions was highest in the healthy student group 83 (+/- 13) and 81 (+/- 15), respectively. The score of the basketball players was 79 (+/- 24) and 80 (+/- 23), while the patient group scored significantly (p < 0.05) lower, 48 (+/- 20) and 52 (+/- 19). The translated version of the VISA-P questionnaire was linguistically and culturally equivalent to the original version. The translated score showed good reliability.

  15. Psychometric properties of a Swedish translation of the VISA-P outcome score for patellar tendinopathy

    Directory of Open Access Journals (Sweden)

    Edman Gunnar

    2004-12-01

    Full Text Available Abstract Background Self-administrated patient outcome scores are increasingly recommended for evaluation of primary outcome in clinical studies. The VISA-P score, developed at the Victorian Institute of Sport Assessment in Melbourne, Australia, is a questionnaire developed for patients with patellar tendinopathy and the patients assess severity of symptoms, function and ability to participate in sport. The aim of this study was to translate the questionnaire into Swedish and to study the reliability and validity of the translated questionnaire and resultant scores. Methods The questionnaire was translated into Swedish according to internationally recommended guidelines for cross-cultural adaptation of self-report measures. The reliability and validity were tested in three different populations. The populations used were healthy students (n = 17, members of the Swedish male national basketball team (n = 17, considered as a population at risk, and a group of non-surgically treated patients (n = 17 with clinically diagnosed patellar tendinopathy. The questionnaire was completed by 51 subjects altogether. Results The translated VISA-P questionnaire showed very good test-retest reliability (ICC = 0.97. The mean (± SD of the VISA-P score, at both the first and second test occasions was highest in the healthy student group 83 (± 13 and 81 (± 15, respectively. The score of the basketball players was 79 (± 24 and 80 (± 23, while the patient group scored significantly (p Conclusions The translated version of the VISA-P questionnaire was linguistically and culturally equivalent to the original version. The translated score showed good reliability.

  16. A Psychometric Review of Norm-Referenced Tests Used to Assess Phonological Error Patterns

    Science.gov (United States)

    Kirk, Celia; Vigeland, Laura

    2014-01-01

    Purpose: The authors provide a review of the psychometric properties of 6 norm-referenced tests designed to measure children's phonological error patterns. Three aspects of the tests' psychometric adequacy were evaluated: the normative sample, reliability, and validity. Method: The specific criteria used for determining the psychometric…

  17. [Standardization of the Test Your Memory and evaluation of their concordance with the outcome of the psychometric examination].

    Science.gov (United States)

    Ferrero-Arias, J; Turrión-Rojo, M A

    2016-05-01

    To explore the relationship between scores on the Test Your Memory (TYM) battery and findings from a more exhaustive neurocognitive assessment. The TYM and fourteen psychometric tests were administered to 84 subjects aged 50 or older who attended an outpatient neurology clinic due to cognitive symptoms. Each patient's cognitive state was determined independently from his/her score on the TYM (CDR 0, n=25; CDR 0.5, n=45; CDR 1, n=14). We analysed concurrent validity of TYM scores and results from the psychometric tests, as well as the degree of concordance between the two types of measurement, by contrasting normalised data from each instrument. Although the intraclass correlation coefficient was 0.67 (confidence interval 95%, 0.53-0.77), analysis of the Bland-Altman plot and the curve on the survival-agreement plot (Luiz et al. method) demonstrates that the individual distances between the two methods exhibit excessive dispersion from a clinical viewpoint. TYM-based predictions of the mean z-score on psychometric tests differed substantially from real results in 30% of the subjects. Concordance of 95% can only be achieved by accepting absolute inter-instrument differences of up to 0.87 as identical values. Furthermore, the TYM underestimates cognitive performance for low values and overestimates it for high values. The TYM is a cognitive screening test which should not be used to predict results on psychometric tests or to detect cognitive changes in clinical trials. Copyright © 2014 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.

  18. The new Intragroup Conflict Scale: testing and psychometric properties.

    Science.gov (United States)

    Cox, Kathleen B

    2014-01-01

    The importance of healthy work environments has received attention. Health care organizations are plagued with conflict which is detrimental to work environments. Thus, conflict must be studied. The purpose of this article is to describe the testing of a measure of conflict. A survey was used to evaluate the psychometric properties. The sample consisted of 430 nurses at an academic medical center. Using principal component analysis (PCA) with varimax rotation, a six-factor solution (30 items) that explained 74.3% of variance emerged. Coefficient alpha ranged from .95 to .81. Correlations with existing scales supported construct validity (r = -.32(-)-.58). The results are encouraging. Use of the scale may provide insight into the impact of conflict on patient, staff, and organizational outcomes.

  19. What Do Test Scores Really Mean? A Latent Class Analysis of Danish Test Score Performance

    DEFF Research Database (Denmark)

    Munk, Martin D.; McIntosh, James

    2014-01-01

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores...... of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture and possible incentive problems make it more di¢ cult to understand what the tests measure....

  20. Psychometric properties for the Balanced Inventory of Desirable Responding: dichotomous versus polytomous conventional and IRT scoring.

    Science.gov (United States)

    Vispoel, Walter P; Kim, Han Yi

    2014-09-01

    [Correction Notice: An Erratum for this article was reported in Vol 26(3) of Psychological Assessment (see record 2014-16017-001). The mean, standard deviation and alpha coefficient originally reported in Table 1 should be 74.317, 10.214 and .802, respectively. The validity coefficients in the last column of Table 4 are affected as well. Correcting this error did not change the substantive interpretations of the results, but did increase the mean, standard deviation, alpha coefficient, and validity coefficients reported for the Honesty subscale in the text and in Tables 1 and 4. The corrected versions of Tables 1 and Table 4 are shown in the erratum.] Item response theory (IRT) models were applied to dichotomous and polytomous scoring of the Self-Deceptive Enhancement and Impression Management subscales of the Balanced Inventory of Desirable Responding (Paulhus, 1991, 1999). Two dichotomous scoring methods reflecting exaggerated endorsement and exaggerated denial of socially desirable behaviors were examined. The 1- and 2-parameter logistic models (1PLM, 2PLM, respectively) were applied to dichotomous responses, and the partial credit model (PCM) and graded response model (GRM) were applied to polytomous responses. For both subscales, the 2PLM fit dichotomous responses better than did the 1PLM, and the GRM fit polytomous responses better than did the PCM. Polytomous GRM and raw scores for both subscales yielded higher test-retest and convergent validity coefficients than did PCM, 1PLM, 2PLM, and dichotomous raw scores. Information plots showed that the GRM provided consistently high measurement precision that was superior to that of all other IRT models over the full range of both construct continuums. Dichotomous scores reflecting exaggerated endorsement of socially desirable behaviors provided noticeably weak precision at low levels of the construct continuums, calling into question the use of such scores for detecting instances of "faking bad." Dichotomous

  1. Predicting psychopharmacological drug effects on actual driving performance (SDLP) from psychometric tests measuring driving-related skills.

    Science.gov (United States)

    Verster, Joris C; Roth, Thomas

    2012-03-01

    There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving is often not determined. The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a constant speed and steady lateral position within the right traffic lane. Standard deviation of lateral position (SDLP), i.e., the weaving of the car, was determined. The subjects also performed a psychometric test battery including the DSST, Sternberg memory scanning test, a tracking test, and a divided attention test. Difference scores from placebo for parameters of the psychometric tests and SDLP were computed and correlated with each other. A stepwise linear regression analysis determined the predictive validity of the laboratory test battery to SDLP. Stepwise regression analyses revealed that the combination of five parameters, hard tracking, tracking and reaction time of the divided attention test, and reaction time and percentage of errors of the Sternberg memory scanning test, together had a predictive validity of 33.4%. The psychometric tests in this test battery showed insufficient predictive validity to replace the on-the-road driving test during normal traffic.

  2. What do educational test scores really measure?

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelate......, and possible incentive problems make it more difficult to elicit true values of what the tests measure....

  3. Health Belief Model Scale for Human Papilloma Virus and its Vaccination: Adaptation and Psychometric Testing.

    Science.gov (United States)

    Guvenc, Gulten; Seven, Memnun; Akyuz, Aygul

    2016-06-01

    To adapt and psychometrically test the Health Belief Model Scale for Human Papilloma Virus (HPV) and Its Vaccination (HBMS-HPVV) for use in a Turkish population and to assess the Human Papilloma Virus Knowledge score (HPV-KS) among female college students. Instrument adaptation and psychometric testing study. The sample consisted of 302 nursing students at a nursing school in Turkey between April and May 2013. Questionnaire-based data were collected from the participants. Information regarding HBMS-HPVV and HPV knowledge and descriptive characteristic of participants was collected using translated HBMS-HPVV and HPV-KS. Test-retest reliability was evaluated and Cronbach α was used to assess internal consistency reliability, and exploratory factor analysis was used to assess construct validity of the HBMS-HPVV. The scale consists of 4 subscales that measure 4 constructs of the Health Belief Model covering the perceived susceptibility and severity of HPV and the benefits and barriers. The final 14-item scale had satisfactory validity and internal consistency. Cronbach α values for the 4 subscales ranged from 0.71 to 0.78. Total HPV-KS ranged from 0 to 8 (scale range, 0-10; 3.80 ± 2.12). The HBMS-HPVV is a valid and reliable instrument for measuring young Turkish women's beliefs and attitudes about HPV and its vaccination. Copyright © 2015 North American Society for Pediatric and Adolescent Gynecology. Published by Elsevier Inc. All rights reserved.

  4. Herth hope index: psychometric testing of the Chinese version.

    Science.gov (United States)

    Chan, Keung Sum; Li, Ho Cheung William; Chan, Sally Wai-Chi; Lopez, Violeta

    2012-09-01

    This article is a report on psychometric testing of the Chinese version of the herth hope index. The availability of a valid and reliable instrument that accurately measures the level of hope in patients with heart failure is crucial before any hope-enhancing interventions can be appropriately planned and evaluated. There is no such instrument for Chinese people. A test-retest, within-subjects design was used. A purposive sample of 120 Hong Kong Chinese patients with heart failure between the ages of 60 and 80 years admitted to two medical wards was recruited during an 8-month period in 2009. Participants were asked to respond to the Chinese version of the herth hope index, Hamilton depression rating scale and Rosenberg's self-esteem scale. The internal consistency, content validity and construct validity and test-retest reliability of the Chinese version of the herth hope index were assessed. The newly translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly translated scale can be used as a self-report assessment tool in assessing the level of hope in Hong Kong Chinese patients with heart failure. © 2011 Blackwell Publishing Ltd.

  5. Gender, Stereotype Threat and Mathematics Test Scores

    OpenAIRE

    Ming Tsui; Xiao Y. Xu; Edmond Venator

    2011-01-01

    Problem statement: Stereotype threat has repeatedly been shown to depress womens scores on difficult math tests. An attempt to replicate these findings in China found no support for the stereotype threat hypothesis. Our math test was characterized as being personally important for the student participants, an atypical condition in most stereotype threat laboratory research. Approach: To evaluate the effects of this personal demand, we conducted three experiments. Results: ...

  6. Psychometric properties including reliability, validity and responsiveness of the Majeed pelvic score in patients with chronic sacroiliac joint pain.

    Science.gov (United States)

    Bajada, Stefan; Mohanty, Khitish

    2016-06-01

    The Majeed scoring system is a disease-specific outcome measure that was originally designed to assess pelvic injuries. The aim of this study was to determine the psychometric properties of the Majeed scoring system for chronic sacroiliac joint pain. Internal consistency, content validity, criterion validity, construct validity and responsiveness to change was assessed prospectively for the Majeed scoring system in a cohort of 60 patients diagnosed with sacroiliac joint pain. This diagnosis was confirmed with CT-guided sacroiliac joint anaesthetic block. The overall Majeed score showed acceptable internal consistency (Cronbach alpha = 0.63). Similarly, it showed acceptable floor (0 %) and ceiling (0 %) effects. On the other hand, the domains of pain, work, sitting and sexual intercourse had high (>30 %) floor effects. Significant correlation with the physical component of the Short Form-36 (p = 0.005) and Oswestry disability index (p ≤ 0.001) was found indicating acceptable criterion validity. The overall Majeed score showed acceptable construct validity with all five developed hypotheses showing significance (p ≤ 0.05). The overall Majeed score showed acceptable responsiveness to change with a large (≥0.80) effect size and standardized response mean. Overall the Majeed scoring system demonstrated acceptable psychometric properties for outcome assessment in chronic sacroiliac joint pain. Thus, its use in this condition is adequate. However, some domains demonstrated suboptimal performance indicating that improvement might be achieved with the development of an outcome measure specific for sacroiliac joint dysfunction and degeneration.

  7. Analyzing Test-Taking Behavior: Decision Theory Meets Psychometric Theory.

    Science.gov (United States)

    Budescu, David V; Bo, Yuanchao

    2015-12-01

    We investigate the implications of penalizing incorrect answers to multiple-choice tests, from the perspective of both test-takers and test-makers. To do so, we use a model that combines a well-known item response theory model with prospect theory (Kahneman and Tversky, Prospect theory: An analysis of decision under risk, Econometrica 47:263-91, 1979). Our results reveal that when test-takers are fully informed of the scoring rule, the use of any penalty has detrimental effects for both test-takers (they are always penalized in excess, particularly those who are risk averse and loss averse) and test-makers (the bias of the estimated scores, as well as the variance and skewness of their distribution, increase as a function of the severity of the penalty).

  8. [The psychometric properties of the Proverb-Metaphor Test].

    Science.gov (United States)

    Szajer, Katarzyna; Karakuła, Hanna; Grzywa, Anna; Parnas, Josef; Perzyńska, Aneta; Zaborska, Anna; Pawezka, Justyna; Sekunda, Agnieszka; Piszczek, Rafał; Skórska, Małgorzata

    2007-01-01

    Abstract thinking belongs to intellectual abilities of the highest level of the evolutionary development, thanks to which operations such a classification, systematisation and comparison are possible. An analysis of the psychometric properties of the Proverb-Metaphor Test (TPM) which has been used in the German speaking countries since 2001. The TPM was subject to the process of translation--retranslation--travesty in order to be adapted to clinical conditions in Poland. 60 patients of the Department of Psychiatry, Medical University of Lublin with diagnosed paranoid schizophrenia (according to ICD-10 criteria). PANSS and TPM was carried out amongst 15 patients at the beginning of the hospitalisation (the first stage of the research) and among all persons during the remission of syndromes (the second stage). The WAIS-R (PL) was used in the second stage. 1. The TPM is a reliable instrument, of high criteria propriety. 2. The evaluated test is a relatively homogeneous research tool. 3. The TPM is, thanks to its simple construction and the short carrying out time, a practical method of abstract thinking evaluation. 4. The TPM may be a useful instrument enabling long term prognosis.

  9. Psychometric testing of the Spiritual Well-Being Scale-Mandarin version in Taiwanese cancer patients.

    Science.gov (United States)

    Tang, Woung-Ru; Kao, Chen-Yi

    2017-06-01

    The spiritual well-being of terminally ill cancer patients is an important indicator of the quality of their lives and of the quality of hospice care, but no validated tools are available for assessing this indicator in Taiwan. The present cross-sectional study validated the Spiritual Well-Being Scale-Mandarin version (SWBS-M) by testing its psychometric properties in 243 cancer patients from five teaching hospitals throughout Taiwan. Construct validity was tested by factor analysis and hypothesis testing. Patients' spiritual well-being and quality of life were assessed using the SWBS-M and the McGill Quality of Life Questionnaire (MQoL), respectively. Overall, the SWBS-M had an internal consistency/reliability of 0.89. Exploratory factor analysis showed that the SWBS-M had an underlying two-factor structure, explaining 46.94% of the variance. SWBS-M scores correlated moderately with MQoL scores (r = 0.48, p spiritual well-being was inversely related to their average pain level during the previous 24 hours (r = -0.183, p = 0.006). Cancer patients' spiritual well-being also differed significantly with their experience of pain (t = -3.67, p spiritual well-being than those without pain. Our findings support a two-factor model for the SWBS-M in terminally ill Taiwanese cancer patients. We recommend testing the psychometric properties of the SWBS-M in different patient populations to verify its factorial structure in other Asian countries.

  10. Exploring a Source of Uneven Score Equity across the Test Score Range

    Science.gov (United States)

    Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D.

    2018-01-01

    Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…

  11. [Development of patient-reported outcome scale for myasthenia gravis: a psychometric test].

    Science.gov (United States)

    Chen, Xin-lin; Liu, Feng-bin; Guo, Li; Liu, Xiao-bin

    2010-02-01

    To investigate the scientificity of patient-reported outcome (PRO) scale for myasthenia gravis (MG), which was used to evaluate the clinical effects of traditional Chinese and Western medicine treatment on MG patients. Psychometric performance of the MG-PRO scale was also expected to be evaluated in this study. A total of 100 MG patients and 100 healthy people were face-to-face interviewed by well-trained investigators, and the data of MG-PRO scale were collected. The classical theory test (CTT) and item response theory (IRT) methods were used to analyze the psychometric performance such as validity, reliability, person separation index (PSI) and differential item functioning (DIF) in the MG-PRO scale. The results of CTT analysis showed that the split-half reliabilities of the MG-PRO scale and each dimension were greater than 0.7. In the analysis of internal consistency of each dimension, the Cronbach's alpha was greater than 0.8. Each facet had greater correlation with its dimension than the other dimensions. Four principal components were extracted by exploratory factor analysis, which represented all dimensions of the scale, and the cumulative variance was 55.54%. The scores of each of the 8 facets between MG patients and healthy people were different (Pdefinition and connotation of quality of life and contains special issues of MG patients as well, and shows good reliability (split-half reliability, Cronbach's alpha), validity (content validity, construct validity, discriminate validity) from the results of CTT, and good psychometric performance from the results of IRT.

  12. Psychometric testing of an instrument to measure the experience of home.

    Science.gov (United States)

    Molony, Sheila L; McDonald, Deborah Dillon; Palmisano-Mills, Christine

    2007-10-01

    Research related to quality of life in long-term care has been hampered by a paucity of measurement tools sensitive to environmental interventions. The primary aim of this study was to test the psychometric properties of a new instrument, the Experience of Home (EOH) Scale, designed to measure the strength of the experience of meaningful person-environment transaction. The instrument was administered to 200 older adults in diverse dwelling types. Principal components analysis provided support for construct validity, eliciting a three-factor solution accounting for 63.18% of variance in scores. Internal consistency reliability was supported with Cronbach's alpha of .96 for the entire scale. The EOH Scale is a unique research tool to evaluate interventions to improve quality of living in residential environments.

  13. Psychometric Properties of the Persian Adaptation of Mini-Cog Test in Iranian Older Adults.

    Science.gov (United States)

    Rezaei, Mohammad; Rashedi, Vahid; Lotfi, Gohar; Shirinbayan, Peymaneh; Foroughan, Mahshid

    2018-04-01

    The aim of this study was to assess the psychometric properties of the Mini-Cog in Iranian older adults. It was a cross-sectional study; 50 older people with dementia and 50 without dementia who matched for age, gender, and education entered the study. The diagnostic and statistical manual of mental disorders criteria for dementia were used as gold standard. A battery of scales included the abbreviated mental test score (AMTS), the Geriatric Depression Scale, and the Mini-Cog was performed. Validity and reliability of the Mini-Cog determined using the Pearson product-moment correlation coefficient (Pearson's r), Cronbach's alpha, and Receiver Operating Characteristic (ROC) curve analysis. The Persian version of Mini-Cog showed a good inter-rater reliability ( K = 0.76, p Mini-Cog have an acceptable sensitivity, specificity, and substantial overall agreement with the AMTS.

  14. Development and psychometric testing of the Supportive Supervisory Scale.

    Science.gov (United States)

    McGilton, Katherine S

    2010-06-01

    To describe the development and psychometric testing of the Supportive Supervisory Scale (SSS). The development of the items of the scale was based on Winnicott's relationship theory and on focus groups with 26 healthcare aides (HCAs) and 30 supervisors from six long-term care (LTC) facilities in Ontario, Canada. Content validity of the 15-item instrument was established by a panel of experts. Based on a secondary analysis of data collected from 222 HCAs in 10 LTC facilities in Ontario, Canada, the SSS was subjected to principal components analysis with oblique rotation. A two-factor solution was accepted, which is consistent with the theoretical conceptualization of the instrument. Factor I was labeled Respects Uniqueness and Factor II was labeled Being Reliable. Internal consistency of Factor I was .95, and that of Factor II was .91. Discriminant validity was also established. The focus groups revealed that "being available to staff" while "recognizing the HCA as an individual, and taking a moment to get to know them" was essential to feeling supported by their supervisor. The SSS is a reliable and valid measure of supervisory support of supervisors working in LTC facilities. At the core of supportive supervision is the supervisor's ability to develop and maintain positive relationships with each HCA. It is through respecting the uniqueness of each HCA and being reliable that supervisor-HCA relationships can flourish. Supportive leadership in LTC settings is a major contributor to HCAs' job satisfaction and retention and to quality of patient care. Therefore, a tool developed and tested to measure supervisors' supportive capacities in LTC is primal to evaluate the effectiveness of supervisors in these environments.

  15. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    Science.gov (United States)

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  16. Testing the psychometric properties of a Chinese version of the level of expressed emotion scale.

    Science.gov (United States)

    Chien, Wai Tong; Chan, Zenobia Chung-Yee; Chan, Sally Wai-Chi

    2014-01-01

    This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76-0.95 and ICC = 0.81-0.92), test-retest reliability (r = 0.89-0.95, P Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement), accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ (2)/df = 1.93, P = 0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78) to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = -0.54) and family burden (r = 0.49) and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90). The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P Chinese version demonstrates sound psychometric properties to measure families' expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.

  17. Testing the Psychometric Properties of a Chinese Version of the Level of Expressed Emotion Scale

    Directory of Open Access Journals (Sweden)

    Wai Tong Chien

    2014-01-01

    Full Text Available This study tested the psychometric properties of a Chinese version of the level of expressed emotion scale in Hong Kong Chinese patients with severe mental illness and their family caregivers. First, the semantic equivalence with the original English version and test-retest reliability at 2-week interval of the Chinese version was examined. After that, the reproducibility, construct validity, and internal consistency of the Chinese version were tested. The Chinese version indicated good semantic equivalence with the English version (kappa values = 0.76–0.95 and ICC = 0.81–0.92, test-retest reliability (r = 0.89–0.95, P<0.01, and internal consistency (Cronbach’s α = 0.86–0.92. Among 262 patients with severe mental illness and their caregivers, the 50-item Chinese version had substantial loadings on one of the four factors identified (intrusiveness/hostility, attitude towards patient, tolerance, and emotional involvement, accounting for 71.8% of the total variance of expressed emotion. In confirmatory factor analysis, the identified four-factor model showed the best fit based on all fit indices (χ2/df = 1.93, P=0.75; AGFI = 0.96; TLI = 1.02; RMSEA = 0.031; WRMR = 0.78 to the collected data. The four-factor Chinese version also indicated a good concurrent validity with significant correlations with family functioning (r = −0.54 and family burden (r = 0.49 and a satisfactory reproducibility over six months (intraclass correlation coefficient of 0.90. The mean scores of the overall and subscale of the Chinese version in patients with unipolar disorder were higher than in other illness groups (schizophrenia, psychotic disorders, and bipolar disorder; P<0.01. The Chinese version demonstrates sound psychometric properties to measure families’ expressed emotion in Chinese patients with severe mental illness, which are found varied across countries.

  18. Assessment of minimal hepatic encephalopathy (with emphasis on computerized psychometric tests).

    Science.gov (United States)

    Kappus, Matthew R; Bajaj, Jasmohan S

    2012-02-01

    Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life, and driving accidents. The detection of MHE requires specialized testing because it cannot, by definition, be diagnosed on standard clinical examination. Psychometric and neurophysiologic techniques are often used to test for MHE. Paper-pencil psychometric batteries and computerized tests have proved useful in diagnosing MHE and predicting its outcomes. Neurophysiologic tests also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike. Testing strategies depend on the normative data available, patient comfort, and local expertise. Copyright © 2012 Elsevier Inc. All rights reserved.

  19. Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

    Science.gov (United States)

    Sawaki, Yasuyo; Sinharay, Sandip

    2013-01-01

    This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

  20. Effects of common chronic medical conditions on psychometric tests used to diagnose minimal hepatic encephalopathy

    DEFF Research Database (Denmark)

    Lauridsen, M M; Poulsen, L; Rasmussen, C K

    2016-01-01

    Many chronic medical conditions are accompanied by cognitive disturbances but these have only to a very limited extent been psychometrically quantified. An exception is liver cirrhosis where hepatic encephalopathy is an inherent risk and mild forms are diagnosed by psychometric tests. The preferred...... diagnostic test battery in cirrhosis is often the Continuous Reaction Time (CRT) and the Portosystemic Encephalopathy (PSE) tests but the effect on these of other medical conditions is not known. We aimed to examine the effects of common chronic (non-cirrhosis) medical conditions on the CRT and PSE tests. We...

  1. Evidence for the Psychometric Validity, Internal Consistency and Measurement Invariance of Warwick Edinburgh Mental Well-being Scale Scores in Scottish and Irish Adolescents.

    Science.gov (United States)

    McKay, Michael T; Andretta, James R

    2017-09-01

    Mental well-being is an important indicator of current, but also the future health of adolescents. The 14-item Warwick Edinburgh Mental Well-being Scale (WEMWBS) has been well validated in adults world-wide, but less work has been undertaken to examine the psychometric validity and internal consistency of WEMWBS scores in adolescents. In particular, little research has examined scores on the short 7-item version of the WEMWBS. The present study used two large samples of school children in Scotland and Northern Ireland and found that for both forms of the WEMWBS, scores were psychometrically valid, internally consistent, factor saturated, and measurement invariant by country. Using the WEMWBS full form, males reported significantly higher scores than females, and Northern Irish adolescents reported significantly higher scores than their Scottish counterparts. Last, the lowest overall levels of well-being were observed among Scottish females. Copyright © 2017. Published by Elsevier B.V.

  2. Validating the Interpretations and Uses of Test Scores

    Science.gov (United States)

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  3. Psychometric properties of a Swedish translation of the VISA-P outcome score for patellar tendinopathy

    OpenAIRE

    Frohm, Anna; Saartok, Tönu; Edman, Gunnar; Renström, Per

    2004-01-01

    Abstract Background Self-administrated patient outcome scores are increasingly recommended for evaluation of primary outcome in clinical studies. The VISA-P score, developed at the Victorian Institute of Sport Assessment in Melbourne, Australia, is a questionnaire developed for patients with patellar tendinopathy and the patients assess severity of symptoms, function and ability to participate in sport. The aim of this study was to translate the questionnaire into Swedish and to study the rel...

  4. The factor structure and psychometric properties of the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test.

    Science.gov (United States)

    Sanchez-Garcia, Manuel; Extremera, Natalio; Fernandez-Berrocal, Pablo

    2016-11-01

    This research examined evidence regarding the reliability and validity of scores on the Spanish version of the Mayer-Salovey-Caruso Emotional Intelligence Test, Version 2.0 (MSCEIT; Mayer, Salovey, & Caruso, 2002). In Study 1, we found a close convergence of the Spanish consensus scores and the general and expert consensus scores determined with Mayer, Salovey, Caruso, and Sitarenios (2003) data. The MSCEIT also demonstrated adequate evidence of reliability of test scores as estimated by internal consistency and test-retest correlation after 12 weeks. Confirmatory factor analysis supported a 3-level higher factor model with 8 manifest variables (task scores), 4 first-level factors (corresponding to the 4-branch model of Mayer & Salovey [1997], with 2 tasks for each branch), 2 second-level factors (experiential and strategic areas, with 2 branches for each area), and 1 third-level factor (overall emotional intelligence [EI]), and multigroup analyses supported MSCEIT cross-gender invariance. Study 2 found evidence for the discriminant validity of scores on the MSCEIT subscales, which were differentially related to personality and self-reported EI. Study 3 provided evidence of the incremental validity of scores on the MSCEIT, which added significant variance to the prospective prediction of psychological well-being after controlling for personality traits. The psychometric properties of the Spanish MSCEIT are similar to those of the original English version, supporting its use for assessing emotional abilities in the Spanish population. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  5. Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

    Science.gov (United States)

    Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

    2017-01-01

    The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

  6. Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

    Science.gov (United States)

    Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

    2016-01-01

    Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149

  7. The 1-min Screening Test for Reading Problems in College Students: Psychometric Properties of the 1-min TIL.

    Science.gov (United States)

    Fernandes, Tânia; Araújo, Susana; Sucena, Ana; Reis, Alexandra; Castro, São Luís

    2017-02-01

    Reading is a central cognitive domain, but little research has been devoted to standardized tests for adults. We, thus, examined the psychometric properties of the 1-min version of Teste de Idade de Leitura (Reading Age Test; 1-min TIL), the Portuguese version of Lobrot L3 test, in three experiments with college students: typical readers in Experiment 1A and B, dyslexic readers and chronological age controls in Experiment 2. In Experiment 1A, test-retest reliability and convergent validity were evaluated in 185 students. Reliability was >.70, and phonological decoding underpinned 1-min TIL. In Experiment 1B, internal consistency was assessed by presenting two 45-s versions of the test to 19 students, and performance in these versions was significantly associated (r = .78). In Experiment 2, construct validity, criterion validity and clinical utility of 1-min TIL were investigated. A multiple regression analysis corroborated construct validity; both phonological decoding and listening comprehension were reliable predictors of 1-min TIL scores. Logistic regression and receiver operating characteristics analyses revealed the high accuracy of this test in distinguishing dyslexic from typical readers. Therefore, the 1-min TIL, which assesses reading comprehension and potential reading difficulties in college students, has the necessary psychometric properties to become a useful screening instrument in neuropsychological assessment and research. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  8. Psychometric properties of the Turkish versions of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in the prison setting.

    Science.gov (United States)

    Evren, Cuneyt; Ogel, Kultegin; Evren, Bilge; Bozkurt, Muge

    2014-01-01

    The aim of this study was to evaluate psychometric properties of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in prisoners with (n = 124) or without (n = 78) drug use disorder. Participants were evaluated with the DUDIT, the DAST-10, and the Addiction Profile Index-Short (API-S). The DUDIT and the DAST-10 were found to be psychometrically sound drug abuse screening measures with high convergent validity when compared with each other (r = 0.86), and API-S (r = 0.88 and r = 0.84, respectively), and to have a Cronbach's α of 0.93 and 0.87, respectively. In addition, a single component accounted for 58.28% of total variance for DUDIT, whereas this was 47.10% for DAST-10. The DUDIT had sensitivity and specificity scores of 0.95 and 0.79, respectively, when using the optimal cut-off score of 10, whereas these scores were 0.88 and 0.74 for the DAST-10 when using the optimal cut-off score of 4. Additionally, both the DUDIT and the DAST-10 showed good discriminant validity as they differentiated prisoners with drug use disorder from those without. Findings support the Turkish versions of both the DUDIT and the DAST-10 as reliable and valid drug abuse screening instruments that measure unidimensional constructs.

  9. Psychometric properties of the List of Threatening Experiences--LTE and its association with psychosocial factors and mental disorders according to different scoring methods.

    Science.gov (United States)

    Motrico, Emma; Moreno-Küstner, Berta; de Dios Luna, Juan; Torres-González, Francisco; King, Michael; Nazareth, Irwin; Montón-Franco, Carmen; Gilde Gómez-Barragán, María Josefa; Sánchez-Celaya, Marta; Díaz-Barreiros, Miguel Ángel; Vicens, Catalina; Moreno-Peral, Patricia; Bellón, Juan Ángel

    2013-09-25

    The List of Threatening Experiences (LTE) questionnaire is frequently used to assess stressful events; however, studies of its psychometric properties are scarce. We examined the LTE's reliability, factorial structure, construct validity and explored the association between LTE scores and psychosocial variables and mental disorders. This study involved interviewing 5442 primary care attendees from Spain. Associations between four different methods of quantifying LTE scores, psychosocial factors, major depression (CIDI), anxiety disorders (PRIME-MD), alcohol misuse and dependence (AUDIT) were measured. The LTE showed high test-retest reliability (Kappa range=0.61-0.87) and low internal consistency (α=0.44). Tetrachoric factorial analysis yielded four factors (spousal and relational problems; employment and financial problems; personal problems; illness and bereavement in close persons). Logistic multilevel regression found a strong association between greater social support and a lower occurrence of stressful events (OR range=0.36-0.79). The association between religious-spiritual beliefs and the LTE, was weaker. The association between mental disorders and LTE scores was greater for depression (OR range=1.64-2.57) than anxiety (OR range=1.35-1.97), though the highest ORs were obtained with alcohol dependence (OR range=2.86-4.80). The ordinal score (ordinal regression) was more sensitive to detect the strength of association with mental disorders. We are unable to distinguish the direction of the association between stressful events, psychosocial factors and mental disorders, due to our cross-sectional design of the study. The LTE is a valid and reliable measure of stress in mental health, and the strength of association with mental disorders depends on the method of quantifying LTE scores. © 2013 Elsevier B.V. All rights reserved.

  10. Psychometric testing of children prenatally irradiated during the Chernobyl accident

    International Nuclear Information System (INIS)

    Bajrakova, A.; Vasilev, G.; Khristova, M. N.; Chobanova, N.; Tsenova, T.; Jordanova, M.; Lalova, J.; Vasileva, F.; Mikhajlova, Z.; Trifonova, S.

    1993-01-01

    The investigation involved 50 children aged median 6 years and 6 months. The group was selected in view of the critical period for occurrence of radiation-related deviations in mental development (8-15 gestation weeks) and the period of maximum irradiation during the Chernobyl accident. Assessment of the individual exposure and analysis of possible impacts from non-radiation risk factors were based on guided parental history reports. The dose of accidental irradiation was determined using the radiological data for the country. A Bulgarian standardization of the Wechsler Intelligence Scale for Children (WISC-R) was used. The procedure includes 5 verbal and 5 nonverbal subtests. Results were compared with those from a countrywide control group of children (including a large city, a small town, a village). The analysis indicated higher mean IQ scores in the investigated children. The children were additionally studied by original tests for attention and gnosis-praxis functions using tactile and visual modalities. The tests included intra- and transmodal versions, bilateral simultaneous presentation of stimuli with verbal and nonverbal characteristics in applying analytical and global strategies. Comparisons were made with results for children in the same age range, who had been studied prior to the Chernobyl accident. The evidence surprisingly varied, taking into account the small size of the investigation group. A longitudinal follow-up of this population thus appears to be appropriate. (author)

  11. The Truth about Scores Children Achieve on Tests.

    Science.gov (United States)

    Brown, Jonathan R.

    1989-01-01

    The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)

  12. Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

    Science.gov (United States)

    Jongen, Stefan; Perrier, Joy; Vuurman, Eric F; Ramaekers, Johannes G; Vermeeren, Annemiek

    2015-01-01

    To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation. Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am) and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am). On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP) of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT). Large effects sizes were also found in the Divided Attention Test (DAT), the Attention Network Test (ANT), and the test for Useful Field of View (UFOV) at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV. From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.

  13. Sensitivity and validity of psychometric tests for assessing driving impairment: effects of sleep deprivation.

    Directory of Open Access Journals (Sweden)

    Stefan Jongen

    Full Text Available To assess drug induced driving impairment, initial screening is needed. However, no consensus has been reached about which initial screening tools have to be used. The present study aims to determine the ability of a battery of psychometric tests to detect performance impairing effects of clinically relevant levels of drowsiness as induced by one night of sleep deprivation.Twenty four healthy volunteers participated in a 2-period crossover study in which the highway driving test was conducted twice: once after normal sleep and once after one night of sleep deprivation. The psychometric tests were conducted on 4 occasions: once after normal sleep (at 11 am and three times during a single night of sleep deprivation (at 1 am, 5 am, and 11 am.On-the-road driving performance was significantly impaired after sleep deprivation, as measured by an increase in Standard Deviation of Lateral Position (SDLP of 3.1 cm compared to performance after a normal night of sleep. At 5 am, performance in most psychometric tests showed significant impairment. As expected, largest effect sizes were found on performance in the Psychomotor Vigilance Test (PVT. Large effects sizes were also found in the Divided Attention Test (DAT, the Attention Network Test (ANT, and the test for Useful Field of View (UFOV at 5 and 11 am during sleep deprivation. Effects of sleep deprivation on SDLP correlated significantly with performance changes in the PVT and the DAT, but not with performance changes in the UFOV.From the psychometric tests used in this study, the PVT and DAT seem most promising for initial evaluation of drug impairment based on sensitivity and correlations with driving impairment. Further studies are needed to assess the sensitivity and validity of these psychometric tests after benchmark sedative drug use.

  14. Evaluation of a new computerized psychometric test battery: Effects of zolpidem and caffeine

    OpenAIRE

    Raveendranadh Pilli; MUR Naidu; Usharani Pingali; J C Shobha

    2013-01-01

    Objective: To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. Materials and Methods: The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of...

  15. Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

    Science.gov (United States)

    Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

    2015-12-01

    The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.

  16. Psychometric properties of the NOMO 1.0 tested among adult powered-mobility users

    DEFF Research Database (Denmark)

    Sund, Terje; Brandt, Åse; Anttila, Heidi

    2017-01-01

    (Participation Repertoire). PURPOSE: This study aimed to investigate a range of psychometric properties of the NOMO 1.0 in a sample of adult powered mobility device (PMD) users. METHOD: Data collected from PMD users ( N = 248) in Denmark, Finland, and Norway as part of a larger study were analyzed using state...... scale and six components of the Frequency scale. IMPLICATIONS: The NOMO 1.0 should be used for research purposes and not for clinical practice. Better reliability should be established for the Need for Assistance and Ease/Difficulty scales prior to further psychometric testing to establish the validity...

  17. The Free and Cued Selective Reminding Test: evidence of psychometric adequacy

    Directory of Open Access Journals (Sweden)

    KATJA OCEPEK-WELIKSON

    2009-09-01

    Full Text Available These analyses examine the psychometric properties of the Free and Cued Selective Reminding Test with Immediate Recall (FCSRT-IR. FCSRT-IR is a measure of memory under conditions that control attention and cognitive processing in order to obtain an assessment of memory unconfounded by normal agerelated changes in cognition. FCSRT-IR performance has been associated with preclinical and early dementia in several longitudinal epidemiological studies. Factor and item response theory analyses were applied to FCSRT-IR data from patients at a geriatric primary care center who had independently established clinical diagnoses. The results provide supporting evidence for the psychometric adequacy of the FCSR-IR in terms of reliability, essential (sufficient unidimensionality, information across the continuum of memory disability/ability, and classification accuracy. The psychometric adequacy of the FCSRT-IR adds further validity to its use as a case finding strategy for dementia.

  18. Psychometric Evaluation of the Italian Adaptation of the Test of Inferential and Creative Thinking

    Science.gov (United States)

    Faraci, Palmira; Hell, Benedikt; Schuler, Heinz

    2016-01-01

    This article describes the psychometric properties of the Italian adaptation of the "Analyse des Schlussfolgernden und Kreativen Denkens" (ASK; Test of Inferential and Creative Thinking) for measuring inferential and creative thinking. The study aimed to (a) supply evidence for the factorial structure of the instrument, (b) describe its…

  19. Development and Psychometric Testing of the Dogs and WalkinG Survey (DAWGS)

    Science.gov (United States)

    Richards, Elizabeth A.; McDonough, Meghan H.; Edwards, Nancy E.; Lyle, Roseann M.; Troped, Philip J.

    2013-01-01

    Purpose: Dog owners represent 40% of the population, a promising audience to increase population levels of physical activity. The purpose of this study was to develop and test the psychometric properties of a new instrument to assess social-cognitive theory constructs related to dog walking. Method: Dog owners ("N" = 431) completed the…

  20. Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

    Science.gov (United States)

    Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

    2016-04-01

    The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.

  1. Psychometric properties of Persian version of the Sustained Auditory Attention Capacity Test in children with attention deficit-hyperactivity disorder.

    Science.gov (United States)

    Soltanparast, Sanaz; Jafari, Zahra; Sameni, Seyed Jalal; Salehi, Masoud

    2014-01-01

    The purpose of the present study was to evaluate the psychometric properties (validity and reliability) of the Persian version of the Sustained Auditory Attention Capacity Test in children with attention deficit hyperactivity disorder. The Persian version of the Sustained Auditory Attention Capacity Test was constructed to assess sustained auditory attention using the method provided by Feniman and colleagues (2007). In this test, comments were provided to assess the child's attentional deficit by determining inattention and impulsiveness error, the total scores of the sustained auditory attention capacity test and attention span reduction index. In the present study for determining the validity and reliability of in both Rey Auditory Verbal Learning test and the Persian version of the Sustained Auditory Attention Capacity Test (SAACT), 46 normal children and 41 children with Attention Deficit Hyperactivity (ADHD), all right-handed and aged between 7 and 11 of both genders, were evaluated. In determining convergent validity, a negative significant correlation was found between the three parts of the Rey Auditory Verbal Learning test (first, fifth, and immediate recall) and all indicators of the SAACT except attention span reduction. By comparing the test scores between the normal and ADHD groups, discriminant validity analysis showed significant differences in all indicators of the test except for attention span reduction (pAttention Capacity test has good validity and reliability, that matches other reliable tests, and it can be used for the identification of children with attention deficits and if they suspected to have Attention Deficit Hyperactivity Disorder.

  2. Disentangling Gratitude: A Theoretical and Psychometric Examination of the Gratitude Resentment and Appreciation Test-Revised Short (GRAT-RS).

    Science.gov (United States)

    Hammer, Joseph H; Brenner, Rachel E

    2017-07-14

    This study extended our theoretical and applied understanding of gratitude through a psychometric examination of the most popular multidimensional measure of gratitude, the Gratitude, Resentment, and Appreciation Test-Revised Short form (GRAT-RS). Namely, the dimensionality of the GRAT-RS, the model-based reliability of the GRAT-RS total score and 3 subscale scores, and the incremental evidence of validity for its latent factors were assessed. Dimensionality measures (e.g., explained common variance) and confirmatory factor analysis results with 426 community adults indicated that the GRAT-RS conformed to a multidimensional (bifactor) structure. Model-based reliability measures (e.g., omega hierarchical) provided support for the future use of the Lack of a Sense of Deprivation raw subscale score, but not for the raw GRAT-RS total score, Simple Appreciation subscale score, or Appreciation of Others subscale score. Structural equation modeling results indicated that only the general gratitude factor and the lack of a sense of deprivation specific factor accounted for significant variance in life satisfaction, positive affect, and distress. These findings support the 3 pillars of gratitude conceptualization of gratitude over competing conceptualizations, the position that the specific forms of gratitude are theoretically distinct, and the argument that appreciation is distinct from the superordinate construct of gratitude.

  3. Data-driven efficient score tests for deconvolution hypotheses

    NARCIS (Netherlands)

    Langovoy, M.

    2008-01-01

    We consider testing statistical hypotheses about densities of signals in deconvolution models. A new approach to this problem is proposed. We constructed score tests for the deconvolution density testing with the known noise density and efficient score tests for the case of unknown density. The

  4. Non invasive blood flow measurement in cerebellum detects minimal hepatic encephalopathy earlier than psychometric tests.

    Science.gov (United States)

    Felipo, Vicente; Urios, Amparo; Giménez-Garzó, Carla; Cauli, Omar; Andrés-Costa, Maria-Jesús; González, Olga; Serra, Miguel A; Sánchez-González, Javier; Aliaga, Roberto; Giner-Durán, Remedios; Belloch, Vicente; Montoliu, Carmina

    2014-09-07

    To assess whether non invasive blood flow measurement by arterial spin labeling in several brain regions detects minimal hepatic encephalopathy. Blood flow (BF) was analyzed by arterial spin labeling (ASL) in different brain areas of 14 controls, 24 cirrhotic patients without and 16 cirrhotic patients with minimal hepatic encephalopathy (MHE). Images were collected using a 3 Tesla MR scanner (Achieva 3T-TX, Philips, Netherlands). Pulsed ASL was performed. Patients showing MHE were detected using the battery Psychometric Hepatic Encephalopathy Score (PHES) consisting of five tests. Different cognitive and motor functions were also assessed: alterations in selective attention were evaluated using the Stroop test. Patients and controls also performed visuo-motor and bimanual coordination tests. Several biochemical parameters were measured: serum pro-inflammatory interleukins (IL-6 and IL-18), 3-nitrotyrosine, cGMP and nitrates+nitrites in plasma, and blood ammonia. Bivariate correlations were evaluated. In patients with MHE, BF was increased in cerebellar hemisphere (P = 0.03) and vermis (P = 0.012) and reduced in occipital lobe (P = 0.017). BF in cerebellar hemisphere was also increased in patients without MHE (P = 0.02). Bimanual coordination was impaired in patients without MHE (P = 0.05) and much more in patients with MHE (P battery and with CFF. BF in cerebellar hemisphere correlates with plasma cGMP and nitric oxide (NO) metabolites. BF in vermis cerebellar also correlates with NO metabolites and with 3-nitrotyrosine. IL-18 in plasma correlates with BF in thalamus and occipital lobe. Non invasive BF determination in cerebellum using ASL may detect MHE earlier than the PHES. Altered NO-cGMP pathway seems to be associated to altered BF in cerebellum.

  5. Psychometric validation of the Persian Bergen Social Media Addiction Scale using classic test theory and Rasch models.

    Science.gov (United States)

    Lin, Chung-Ying; Broström, Anders; Nilsen, Per; Griffiths, Mark D; Pakpour, Amir H

    2017-12-01

    Background and aims The Bergen Social Media Addiction Scale (BSMAS), a six-item self-report scale that is a brief and effective psychometric instrument for assessing at-risk social media addiction on the Internet. However, its psychometric properties in Persian have never been examined and no studies have applied Rasch analysis for the psychometric testing. This study aimed to verify the construct validity of the Persian BSMAS using confirmatory factor analysis (CFA) and Rasch models among 2,676 Iranian adolescents. Methods In addition to construct validity, measurement invariance in CFA and differential item functioning (DIF) in Rasch analysis across gender were tested for in the Persian BSMAS. Results Both CFA [comparative fit index (CFI) = 0.993; Tucker-Lewis index (TLI) = 0.989; root mean square error of approximation (RMSEA) = 0.057; standardized root mean square residual (SRMR) = 0.039] and Rasch (infit MnSq = 0.88-1.28; outfit MnSq = 0.86-1.22) confirmed the unidimensionality of the BSMAS. Moreover, measurement invariance was supported in multigroup CFA including metric invariance (ΔCFI = -0.001; ΔSRMR = 0.003; ΔRMSEA = -0.005) and scalar invariance (ΔCFI = -0.002; ΔSRMR = 0.005; ΔRMSEA = 0.001) across gender. No item displayed DIF (DIF contrast = -0.48 to 0.24) in Rasch across gender. Conclusions Given the Persian BSMAS was unidimensional, it is concluded that the instrument can be used to assess how an adolescent is addicted to social media on the Internet. Moreover, users of the instrument may comfortably compare the sum scores of the BSMAS across gender.

  6. The Development and Psychometric Properties of the Immigration Law Concerns Scale (ILCS) for HIV Testing.

    Science.gov (United States)

    Lechuga, Julia; Galletly, Carol L; Broaddus, Michelle R; Dickson-Gomez, Julia B; Glasman, Laura R; McAuliffe, Timothy L; Vega, Miriam Y; LeGrand, Sarah; Mena, Carla A; Barlow, Morgan L; Valera, Erik; Montenegro, Judith I

    2017-11-08

    To develop, pilot test, and conduct psychometric analyses of an innovative scale measuring the influence of perceived immigration laws on Latino migrants' HIV-testing behavior. The Immigration Law Concerns Scale (ILCS) was developed in three phases: Phase 1 involved a review of law and literature, generation of scale items, consultation with project advisors, and subsequent revision of the scale. Phase 2 involved systematic translation- back translation and consensus-based editorial processes conducted by members of a bilingual and multi-national study team. In Phase 3, 339 sexually active, HIV-negative Spanish-speaking, non-citizen Latino migrant adults (both documented and undocumented) completed the scale via audio computer-assisted self-interview. The psychometric properties of the scale were tested with exploratory factor analysis and estimates of reliability coefficients were generated. Bivariate correlations were conducted to test the discriminant and predictive validity of identified factors. Exploratory factor analysis revealed a three-factor, 17-item scale. subscale reliability ranged from 0.72 to 0.79. There were significant associations between the ILCS and the HIV-testing behaviors of participants. Results of the pilot test and psychometric analysis of the ILCS are promising. The scale is reliable and significantly associated with the HIV-testing behaviors of participants. Subscales related to unwanted government attention and concerns about meeting moral character requirements should be refined.

  7. Psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C) for patients with alcoholism.

    Science.gov (United States)

    Hsueh, Yu-Jung; Chu, Hsin; Huang, Chang-Chih; Ou, Keng-Liang; Chen, Chiung-Hua; Chou, Kuei-Ru

    2014-04-01

    The aim of this study was to examine the psychometric properties of the Chinese version of the Michigan Alcoholism Screening Test (MAST-C). The sensitivity, specificity, and positive and negative predictive values for the MAST-C were examined in this study. The MAST-C had an internal consistency of 0.83 and a test-retest reliability of 0.89. It had a good content validity index of 0.92. Factor analysis identified four factors and the optimal cutoff point for the MAST-C was a score of 6/7, which yielded a sensitivity of 0.92, a specificity of 0.83, a positive predictive value of 0.92, and a negative predictive value of 0.83. The MAST-C provides a fast, accurate, and sensitive method for clinically diagnosing alcoholism and clinical management. © 2013 Wiley Periodicals, Inc.

  8. Adaptive testing with equated number-correct scoring

    NARCIS (Netherlands)

    van der Linden, Willem J.

    1999-01-01

    A constrained CAT algorithm is presented that automatically equates the number-correct scores on adaptive tests. The algorithm can be used to equate number-correct scores across different administrations of the same adaptive test as well as to an external reference test. The constraints are derived

  9. Improving personality facet scores with multidimensional computer adaptive testing

    DEFF Research Database (Denmark)

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A W

    2013-01-01

    personality tests contain many highly correlated facets. This article investigates the possibility of increasing the precision of the NEO PI-R facet scores by scoring items with multidimensional item response theory and by efficiently administering and scoring items with multidimensional computer adaptive...

  10. Improving Scores on the IELTS Speaking Test

    Science.gov (United States)

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  11. An instrument assessing satisfaction with iron chelation therapy: Psychometric testing from an open-label clinical trial.

    Science.gov (United States)

    Rofail, Diana; Viala, Muriel; Gater, Adam; Abetz-Webb, Linda; Baladi, Jean-Francois; Cappellini, Maria Domenica

    2010-08-01

    The Satisfaction with Iron Chelation Therapy (SICT) instrument was developed based on a literature review, in-depth patient and clinician interviews, and cognitive debriefing interviews. An, open-label, single arm, multicenter trial evaluating the efficacy and safety of deferasirox in patients diagnosed with transfusion-dependent iron overload, provided an opportunity to assess the psychometric measurement properties of the instrument. Psychometric analyses were performed using data at baseline from 273 patients with a range of transfusion-dependent iron overload conditions who were participating in a multinational study. Responsiveness was further evaluated for all patients who also had subsequent satisfaction domain scores collected at week 4. Baseline SICT domain scores had acceptable floor and ceiling effects and internal consistency reliability (Cronbach's alpha: 0.75-0.85). Item discriminant and item convergent validity were both excellent although one item in each analysis did not meet the specified criterion. Small to moderate correlations were observed between SICT and Short Form 36 Health Survey (SF-36) domain scores. Patients with the highest levels of serum ferritin at baseline (>3100 ng/mL) were the least satisfied about the Perceived Effectiveness of ICT and vice versa. Satisfaction improved in all patients, although there were no clear differences observed between groups of patients defined according to changes in serum ferritin levels from baseline to week 4 (stable, improved, or worsened). The SICT domains are reliable and valid. Further testing using a more specific criterion (such as assessing patient global ratings of change in satisfaction domains that correspond to the SICT domains) could help to establish with greater confidence the responsiveness of the instrument.

  12. Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

    OpenAIRE

    Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff; Lenon, George Binh

    2016-01-01

    Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization’s guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability...

  13. Evaluation of a new computerized psychometric test battery: Effects of zolpidem and caffeine.

    Science.gov (United States)

    Pilli, Raveendranadh; Naidu, Mur; Pingali, Usharani; Shobha, Jc

    2013-10-01

    To evaluate the effects of centrally active drugs using a new indigenously developed automated psychometric test system and compare the results with that obtained using pencil- and paper-based techniques. The tests were standardized in 24 healthy participants. Reproducibility of the test procedure was evaluated by performing the tests by a single experimenter on two occasions (interday reproducibility). To evaluate the sensitivity of the tests, the effects of zolpidem (5 mg) and caffeine (500 mg) versus placebo were studied in 24 healthy participants in a randomized, double-blind three-way crossover design. Psychometric tests were performed at baseline and at 1, 2, and 3 h after administration of study medication. The effects of zolpidem and caffeine on the psychomotor performance were most pronounced 1 h after administration. At this time, a significant impairment of performance in the simple reaction test (SRT), choice discrimination test (CDT), digit symbol substitution test (DSST), digit vigilance test (DVT), and card sorting test (CST) was observed with zolpidem. In contrast, caffeine showed a significant improvement in performance in CDT and DVT only. The results suggest that the tests of the computerized system are more sensitive and reliable then the pencil and paper tests in detecting the effects of central acting agents and are suitable for use in clinical areas to conduct studies with patients.

  14. Applying modern psychometric techniques to melodic discrimination testing: Item response theory, computerised adaptive testing, and automatic item generation.

    Science.gov (United States)

    Harrison, Peter M C; Collins, Tom; Müllensiefen, Daniel

    2017-06-15

    Modern psychometric theory provides many useful tools for ability testing, such as item response theory, computerised adaptive testing, and automatic item generation. However, these techniques have yet to be integrated into mainstream psychological practice. This is unfortunate, because modern psychometric techniques can bring many benefits, including sophisticated reliability measures, improved construct validity, avoidance of exposure effects, and improved efficiency. In the present research we therefore use these techniques to develop a new test of a well-studied psychological capacity: melodic discrimination, the ability to detect differences between melodies. We calibrate and validate this test in a series of studies. Studies 1 and 2 respectively calibrate and validate an initial test version, while Studies 3 and 4 calibrate and validate an updated test version incorporating additional easy items. The results support the new test's viability, with evidence for strong reliability and construct validity. We discuss how these modern psychometric techniques may also be profitably applied to other areas of music psychology and psychological science in general.

  15. Psychometric properties of the Knee injury and Osteoarthritis Outcome Score for Children (KOOS-Child) in children with knee disorders

    DEFF Research Database (Denmark)

    Ortqvist, Maria; Iversen, Maura D; Janarv, Per-Mats

    2014-01-01

    -Child was developed. This study aims to evaluate psychometric properties of the final KOOS-Child when used in children with knee disorders. METHODS: 115 children (boys/girls 51/64, 7-16 years) with knee disorders were recruited. All children (n=115) completed the KOOS-Child, the Child-Health Assessment Questionnaire...... better. CONCLUSIONS: The final KOOS-Child demonstrates good psychometric properties and supports the use of the KOOS-Child when evaluating children with knee disorders....

  16. Video game addiction test: validity and psychometric characteristics.

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, D. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  17. Video Game Addiction Test: Validity and Psychometric Characteristics

    NARCIS (Netherlands)

    Rooij, A.J. van; Schoenmakers, T.M.; Eijnden, R.J.J.M. van den; Vermulst, A.A.; Mheen, H. van de

    2012-01-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study

  18. Development and psychometric testing of a Clinical Reasoning Evaluation Simulation Tool (CREST) for assessing nursing students' abilities to recognize and respond to clinical deterioration.

    Science.gov (United States)

    Liaw, Sok Ying; Rashasegaran, Ahtherai; Wong, Lai Fun; Deneen, Christopher Charles; Cooper, Simon; Levett-Jones, Tracy; Goh, Hongli Sam; Ignacio, Jeanette

    2018-03-01

    The development of clinical reasoning skills in recognising and responding to clinical deterioration is essential in pre-registration nursing education. Simulation has been increasingly used by educators to develop this skill. To develop and evaluate the psychometric properties of a Clinical Reasoning Evaluation Simulation Tool (CREST) for measuring clinical reasoning skills in recognising and responding to clinical deterioration in a simulated environment. A scale development with psychometric testing and mixed methods study. Nursing students and academic staff were recruited at a university. A three-phase prospective study was conducted. Phase 1 involved the development and content validation of the CREST; Phase 2 included the psychometric testing of the tool with 15 second-year and 15 third-year nursing students who undertook the simulation-based assessment; Phase 3 involved the usability testing of the tool with nine academic staff through a survey questionnaire and focus group discussion. A 10-item CREST was developed based on a model of clinical reasoning. A content validity of 0.93 was obtained from the validation of 15 international experts. The construct validity was supported as the third-year students demonstrated significantly higher (preasoning scores than the second-year students. The concurrent validity was also supported with significant positive correlations between global rating scores and almost all subscale scores, and the total scores. The predictive validity was supported with an existing tool. The internal consistency was high with a Cronbach's alpha of 0.92. A high inter-rater reliability was demonstrated with an intraclass correlation coefficient of 0.88. The usability of the tool was rated positively by the nurse educators but the need to ease the scoring process was highlighted. A valid and reliable tool was developed to measure the effectiveness of simulation in developing clinical reasoning skills for recognising and responding to

  19. The Psychometric Properties of English and Spanish Versions of the Life Orientation Test-Revised in Hispanic Americans.

    Science.gov (United States)

    Pan, Tonya M; Mills, Sarah D; Fox, Rina S; Baik, Sharon H; Harry, Kadie M; Roesch, Scott C; Sadler, Georgia Robins; Malcarne, Vanessa L

    2017-12-01

    The Life Orientation Test-Revised (LOT-R) is a widely used measure of optimism and pessimism, with three positively worded and three negatively worded content items. This study examined the structural validity and invariance, internal consistency reliability, and convergent and divergent validity of the English and Spanish versions of the LOT-R among Hispanic Americans. A community sample of Hispanic Americans ( N = 422) completed self-report measures, including the LOT-R, Patient Health Questionnaire-9, and Generalized Anxiety Disorder-7, in their preferred language of English or Spanish. Based on the literature, four structural models were tested: one-factor , oblique two-factor , orthogonal two-factor method effects with positive specific factor , and orthogonal two-factor method effects with negative specific factor . Baseline support for both of the English and Spanish versions was not achieved for any model; in all models, the negatively worded items in Spanish had non-significant factor loadings. Therefore, the positively worded three-item optimism subscale of the LOT-R was examined separately and fit the data, with factor loadings equivalent across language-preference groups. Coefficient alphas for the optimism subscale were consistent across both language-preference groups (αs = .61 [English] and .66 [Spanish]). In contrast, the six-item total score and three-item pessimism subscale demonstrated extremely low or inconsistent alphas. Convergent and divergent validity were established for the optimism subscale in both languages. In sum, the optimism subscale of the LOT-R demonstrated minimally acceptable to good psychometric properties across English and Spanish language-preference groups. However, neither the total score nor the pessimism subscale showed adequate psychometric properties for Spanish-speaking Hispanic Americans, likely due to translation and cultural adaptation issues, and thus are not supported for use with this population.

  20. Development, Testing, and Psychometric Qualities of the Nash Duty to Care Scale for Disaster Response.

    Science.gov (United States)

    Nash, Tracy Jeanne

    2017-08-01

    Although nurses struggle with the decision to report for work during disaster events, there are no instruments to measure nurses' duty to care for disaster situations. The purpose of this study was to describe the development, testing, and psychometric qualities of the Nash Duty to Care Scale. A convenience sample of 409 registered nurses were recruited from 3 universities in the United States. Exploratory factor analysis resulted in a 19-item, 4-factor model explaining 67.34% of the variance. Internal consistency reliability was supported by Cronbach's alpha ranging from .81 to .91 for the 4-factor subscales and .92 for the total scale. The psychometrically sound instrument for measuring nurses' perceived duty to care for disasters is applicable to contemporary nursing practice, institutional disaster management plans, and patient health outcomes worldwide.

  1. Video game addiction test: validity and psychometric characteristics.

    Science.gov (United States)

    van Rooij, Antonius J; Schoenmakers, Tim M; van den Eijnden, Regina J J M; Vermulst, Ad A; van de Mheen, Dike

    2012-09-01

    The study explores the reliability, validity, and measurement invariance of the Video game Addiction Test (VAT). Game-addiction problems are often linked to Internet enabled online games; the VAT has the unique benefit that it is theoretically and empirically linked to Internet addiction. The study used data (n=2,894) from a large-sample paper-and-pencil questionnaire study, conducted in 2009 on secondary schools in Netherlands. Thus, the main source of data was a large sample of schoolchildren (aged 13-16 years). Measurements included the proposed VAT, the Compulsive Internet Use Scale, weekly hours spent on various game types, and several psychosocial variables. The VAT demonstrated excellent reliability, excellent construct validity, a one-factor model fit, and a high degree of measurement invariance across gender, ethnicity, and learning year, indicating that the scale outcomes can be compared across different subgroups with little bias. In summary, the VAT can be helpful in the further study of video game addiction, and it contributes to the debate on possible inclusion of behavioral addictions in the upcoming DSM-V.

  2. A Human Capital Model of Educational Test Scores

    DEFF Research Database (Denmark)

    McIntosh, James; D. Munk, Martin

    Latent class Poisson count models are used to analyze a sample of Danish test score results from a cohort of individuals born in 1954-55 and tested in 1968. The procedure takes account of unobservable effects as well as excessive zeros in the data. The bulk of unobservable effects are uncorrelated...... with observable parental attributes and, thus, are environmental rather than genetic in origin. We show that the test scores measure manifest or measured ability as it has evolved over the life of the respondent and is, thus, more a product of the human capital formation process than some latent or fundamental...... measure of pure cognitive ability. We find that variables which are not closely associated with traditional notions of intelligence explain a significant proportion of the variation in test scores. This adds to the complexity of interpreting test scores and suggests that school culture, attitudes...

  3. Group differences in the heritability of items and test scores

    NARCIS (Netherlands)

    Wicherts, J.M.; Johnson, W.

    2009-01-01

    It is important to understand potential sources of group differences in the heritability of intelligence test scores. On the basis of a basic item response model we argue that heritabilities which are based on dichotomous item scores normally do not generalize from one sample to the next. If groups

  4. Testing the applicability of the SASS5 scoring procedure for ...

    African Journals Online (AJOL)

    A study was undertaken between 29th January and 17th February 2004 to test the applicability of the South African Scoring System Version 5 (SASS5) scoring and calculation procedure in nutrient-enriched palustrine wetlands in the midlands of KwaZulu-Natal, South Africa. Four reference wetlands and three dairy-effluent ...

  5. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    Science.gov (United States)

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test[R] (GMAT[R]) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  6. Clock Drawing Test and the diagnosis of amnestic mild cognitive impairment: can more detailed scoring systems do the work?

    Science.gov (United States)

    Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin

    2014-01-01

    The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.

  7. Relative Merits of Four Methods for Scoring Cloze Tests.

    Science.gov (United States)

    Brown, James Dean

    1980-01-01

    Describes study comparing merits of exact answer, acceptable answer, clozentropy and multiple choice methods for scoring tests. Results show differences among reliability, mean item facility, discrimination and usability, but not validity. (BK)

  8. A call for policy guidance on psychometric testing in doping control in sport.

    Science.gov (United States)

    Petróczi, Andrea; Backhouse, Susan H; Barkoukis, Vassilis; Brand, Ralf; Elbe, Anne-Marie; Lazuras, Lambros; Lucidi, Fabio

    2015-11-01

    One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping) into organised intelligence to protect clean sport. Beyond the foreseeable consequences of a positive identification as a doping user, this task is further complicated by the discrepancy between what constitutes a doping offence in the World Anti-Doping Code and operationalized in doping research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. In both research and forensic settings, we must acknowledge that (1) socially desirable responding confounds self-reported psychometric test results and (2) that the cognitive complexity surrounding test performance means that the response-time based measures and the lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals; nor these psychometric measures used for individual diagnostics. In this paper, we present a position statement calling for policy guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that, to date, both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better understanding of doping behaviour, not to establish evidence for doping. A false 'positive' psychological profile for doping affects not only the individual

  9. Prediction of true test scores from observed item scores and ancillary data.

    Science.gov (United States)

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.

  10. Applying cognitive acuity theory to the development and scoring of situational judgment tests.

    Science.gov (United States)

    Leeds, J Peter

    2017-11-09

    The theory of cognitive acuity (TCA) treats the response options within items as signals to be detected and uses psychophysical methods to estimate the respondents' sensitivity to these signals. Such a framework offers new methods to construct and score situational judgment tests (SJT). Leeds (2012) defined cognitive acuity as the capacity to discern correctness and distinguish between correctness differences among simultaneously presented situation-specific response options. In this study, SJT response options were paired in order to offer the respondent a two-option choice. The contrast in correctness valence between the two options determined the magnitude of signal emission, with larger signals portending a higher probability of detection. A logarithmic relation was found between correctness valence contrast (signal stimulus) and its detectability (sensation response). Respondent sensitivity to such signals was measured and found to be related to the criterion variables. The linkage between psychophysics and elemental psychometrics may offer new directions for measurement theory.

  11. [The methodological assessment and qualitative evaluation of psychometric performance tests based on the example of modern tests that assess reading and spelling skills].

    Science.gov (United States)

    Galuschka, Katharina; Rothe, Josefine; Schulte-Körne, Gerd

    2015-09-01

    This article looks at a means of objectively evaluating the quality of psychometric tests. This approach enables users to evaluate psychometric tests based on their methodological characteristics, in order to decide which instrument should be used. Reading and spelling assessment tools serve as examples. The paper also provides a review of German psychometric tests for the assessment of reading and spelling skills. This method facilitates the identification of psychometric tests.of high methodological quality which can be used for the assessment of reading and spelling skills. Reading performance should ideally be assessed with the following instruments: ELFE 1-6, LGVT 6-12, LESEN 6-7, LESEN 8-9, or WLLP-R. The tests to be used for the evaluation of spelling skills are DERET 1-2+, DERET 3-4+, WRT 1+, WRT 2+, WRT 3+, WRT 4+ or HSP 1-10.

  12. Predicting Freshman Grade Point Average From College Admissions Test Scores and State High School Test Scores

    OpenAIRE

    Koretz, Daniel; Yu, C; Mbekeani, Preeya Pandya; Langi, M.; Dhaliwal, Tasminda Kaur; Braslow, David Arthur

    2016-01-01

    The current focus on assessing “college and career readiness” raises an empirical question: How do high school tests compare with college admissions tests in predicting performance in college? We explored this using data from the City University of New York and public colleges in Kentucky. These two systems differ in the choice of college admissions test, the stakes for students on the high school test, and demographics. We predicted freshman grade point average (FGPA) from high school GPA an...

  13. A test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS: Normative data and psychometric properties

    Directory of Open Access Journals (Sweden)

    Giorgio eArcara

    2016-02-01

    Full Text Available The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS test is a new tool to evaluate pragmatic abilities in patients with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2 and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total. Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts’ evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient’s cognitive profile, given its vital role for communication and social interaction in daily life. The

  14. Development and psychometric pilot-testing of a questionnaire for the evaluation of satisfaction with continuing education in infection control nurses.

    Science.gov (United States)

    Meng, Michael; Peter, Daniel; Mattner, Frauke; Igel, Christoph; Kugler, Christiane

    2018-05-16

    Satisfaction with continuing education can be defined as positive attitudes towards educational programs, which has potential to strengthen learning outcomes. A multi-dimensional construct may enhance continuing education program evaluation processes. The objective is to describe the development and psychometric testing of the 'affective - behavioral - cognitive - satisfaction questionnaire' (ABC-SAT) for assessing participants' satisfaction with a continuing education program for nurses in infection control. The multi-staged development of a satisfaction questionnaire comprised of three subscales. The pilot tool was administered to a nationwide sample of 126 infection control nurses to assess satisfaction after participating in a continuing education program. Satisfaction scores were calculated and psychometric testing was performed to determine reliability, using Cronbach's alpha, face validity, objectivity, and economy. A principle component analysis using varimax rotation and Kaiser normalization was performed. The analysis led to a three-factor solution of the questionnaire with 11 items, explaining 61.4% of the variance. Internal consistency of three scales using Cronbach's alpha was 0.83, 0.60, and 0.66, respectively. Selectivity coefficients varied between 0.39 and 0.70. Participants needed approximately three minutes to complete the questionnaire. Initial findings refer to a satisfying scale structure and internal consistency of the 3-dimensional ABC-SAT questionnaire. Further research is required to confirm the questionnaires' psychometric properties. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. High Test Scores: The Wrong Road to National Economic Success

    Science.gov (United States)

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  16. Accountancy, teaching methods, sex, and American College Test scores.

    Science.gov (United States)

    Heritage, J; Harper, B S; Harper, J P

    1990-10-01

    This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.

  17. Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

    Science.gov (United States)

    McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

    2010-01-01

    Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.

  18. Cross-cultural adaptation and psychometric analysis of the Arabic version of the oxford knee score in adult male with knee osteoarthritis.

    Science.gov (United States)

    Alghadir, Ahmad H; Al-Eisa, Einas S; Anwer, Shahnawaz

    2017-05-15

    There are varieties of self-assessment questionnaire used for the evaluation of pain, functional disability, and health related quality of life in individuals with knee osteoarthritis (OA). The present study intended to adapt and translate the oxford knee score into the Arabic and investigated its psychometric properties in adult male with knee OA. Ninety-seven adult male (mean age 57.55 ± 11.49 years) with knee OA participated. Patients were requested to complete the adapted Arabic version of the Oxford knee score (OKS-Ar), reduced "Western Ontario and McMaster Universities Index (WOMAC)", and the Visual analogue scale (VAS). Patients were requested to complete 2 nd form of OKS-Ar at least 1 week apart to assess the reproducibility of the score. The OKS was adapted and translated into Arabic by two independent Arabic native speakers (one rehabilitation professional having experience of knee OA patients and another one a trained translator) according to the international guidelines. All the participants completed the 2 nd form of OKS-Ar (Response rate 100%). Reliability and internal consistency was high with an ICC of 0.97, and the Cronbach's alpha coefficient of 0.987, respectively. A significant relationship between the OKS-Ar and the WOMAC and VAS scores confirmed the construct validity (p < 0.001). The standard error of measurement (SEM) and the minimum detectable change (MDC) were 2.2 and 6.2, respectively. The adapted Arabic version of the OKS demonstrated acceptable psychometric properties, including reliability, internal consistency, and the validity. The present study indicates that the OKS-Ar is a suitable questionnaire to measure pain and physical function in the Arabic speaking adult male patients with knee OA.

  19. Development and psychometric testing of the Carter Assessment of Critical Thinking in Midwifery (Preceptor/Mentor version).

    Science.gov (United States)

    Carter, Amanda G; Creedy, Debra K; Sidebotham, Mary

    2016-03-01

    develop and test a tool designed for use by preceptors/mentors to assess undergraduate midwifery students׳ critical thinking in practice. a descriptive cohort design was used. participants worked in a range of maternity settings in Queensland, Australia. 106 midwifery clinicians who had acted in the role of preceptor for undergraduate midwifery students. this study followed a staged model for tool development recommended by DeVellis (2012). This included generation of items, content validity testing through mapping of draft items to critical thinking concepts and expert review, administration of items to a convenience sample of preceptors, and psychometric testing. A 24 item tool titled the XXXX Assessment of Critical Thinking in Midwifery (CACTiM) was completed by registered midwives in relation to students they had recently preceptored in the clinical environment. ratings by experts revealed a content validity index score of 0.97, representing good content validity. An evaluation of construct validity through factor analysis generated three factors: 'partnership in practice', 'reflection on practice' and 'practice improvements'. The scale demonstrated good internal reliability with a Cronbach alpha coefficient of 0.97. The mean total score for the CACTiM scale was 116.77 (SD=16.68) with a range of 60-144. Total and subscale scores correlated significantly. the CACTiM (Preceptor/Mentor version) was found to be a valid and reliable tool for use by preceptors to assess critical thinking in undergraduate midwifery students. given the importance of critical thinking skills for midwifery practice, mapping and assessing critical thinking development in students׳ practice across an undergraduate programme is vital. The CACTiM (Preceptor/Mentor version) has utility for clinical education, research and practice. The tool can inform and guide preceptors׳ assessment of students׳ critical thinking in practice. The availability of a reliable and valid tool can be used to

  20. Predicting Freshman Grade Point Average From College Admissions Test Scores and State High School Test Scores

    Directory of Open Access Journals (Sweden)

    Daniel Koretz

    2016-09-01

    Full Text Available The current focus on assessing “college and career readiness” raises an empirical question: How do high school tests compare with college admissions tests in predicting performance in college? We explored this using data from the City University of New York and public colleges in Kentucky. These two systems differ in the choice of college admissions test, the stakes for students on the high school test, and demographics. We predicted freshman grade point average (FGPA from high school GPA and both college admissions and high school tests in mathematics and English. In both systems, the choice of tests had only trivial effects on the aggregate prediction of FGPA. Adding either test to an equation that included the other had only trivial effects on prediction. Although the findings suggest that the choice of test might advantage or disadvantage different students, it had no substantial effect on the over- and underprediction of FGPA for students classified by race-ethnicity or poverty.

  1. Development and Psychometric Testing of a Sexual Concerns Questionnaire for Kidney Transplant Recipients.

    Science.gov (United States)

    Muehrer, Rebecca J; Lanuza, Dorothy M; Brown, Roger L; Djamali, Arjang

    2015-01-01

    This study describes the development and psychometric testing of the Sexual Concerns Questionnaire (SCQ) in kidney transplant (KTx) recipients. Construct validity was assessed using the Kroonenberg and Lewis exploratory/confirmatory procedure and testing hypothesized relationships with established questionnaires. Configural and weak invariance were examined across gender, dialysis history, relationship status, and transplant type. Reliability was assessed with Cronbach's alpha, composite reliability, and test-retest reliability. Factor analysis resulted in a 7-factor solution and suggests good model fit. Construct validity was also supported by the tests of hypothesized relationships. Configural and weak invariance were supported for all subgroups. Reliability of the SCQ was also supported. Findings indicate the SCQ is a valid and reliable measure of KTx recipients' sexual concerns.

  2. Measurement of overgeneral autobiographical memory: Psychometric properties of the autobiographical memory test in young and older populations.

    Science.gov (United States)

    Ros, Laura; Romero, Dulce; Ricarte, Jorge J; Serrano, Juan P; Nieto, Marta; Latorre, Jose M

    2018-01-01

    The Autobiographical Memory Test (AMT) is the most widely used measure of overgeneral autobiographical memory (OGM). The AMT appears to have good psychometric properties, but more research is needed on the influence and applicability of individual cue words in different languages and populations. To date, no studies have evaluated its usefulness as a measure of OMG in Spanish or older populations. This work aims to analyze the applicability of the AMT in young and older Spanish samples. We administered a Spanish version of the AMT to samples of young (N = 520) and older adults (N = 155). We conducted confirmatory factor analysis (CFA), item response theory-based analysis (IRT) and differential item functioning (DIF). Results confirm the one-factor structure for the AMT. IRT analysis suggests that both groups find the AMT easy given that they generally perform well, and that it is more precise in individuals who score low on memory specificity. DIF analysis finds three items differ in their functioning depending on age group. This differential functioning of these items affects the overall AMT scores and, thus, they should be excluded from the AMT in studies comparing young and older samples. We discuss the possible implications of the samples and cue words used.

  3. Psychometric Investigation of the Raven's Colored Progressive Matrices Test in a Sample of Preschool Children.

    Science.gov (United States)

    Lúcio, Patrícia Silva; Cogo-Moreira, Hugo; Puglisi, Marina; Polanczyk, Guilherme Vanoni; Little, Todd D

    2017-11-01

    The present study investigated the psychometric properties of the Raven's Colored Progressive Matrices (CPM) test in a sample of preschoolers from Brazil ( n = 582; age: mean = 57 months, SD = 7 months; 46% female). We investigated the plausibility of unidimensionality of the items (confirmatory factor analysis) and differential item functioning (DIF) for sex and age (multiple indicators multiple causes method). We tested four unidimensional models and the one with the best-fit index was a reduced form of the Raven's CPM. The DIF analysis was carried out with the reduced form of the test. A few items presented DIF (two for sex and one for age), confirming that the Raven's CPM items are mostly measurement invariant. There was no effect of sex on the general factor, but increasing age was associated with higher values of the g factor. Future research should indicate if the reduced form is suitable for evaluating the general ability of preschoolers.

  4. Another Look at the PART-O Using the Traumatic Brain Injury Model Systems National Database: Scoring to Optimize Psychometrics.

    Science.gov (United States)

    Malec, James F; Whiteneck, Gale G; Bogner, Jennifer A

    2016-02-01

    To integrate previous approaches to scoring the Participation Assessment with Recombined Tools-Objective (PART-O) in a unidimensional scale. Retrospective analysis of PART-O data from the Traumatic Brain Injury Model Systems. Community. Data from individuals (N=469) selected randomly from participants who completed 1-year follow-up in the Traumatic Brain Injury Model Systems were used in Rasch model development. The model was subsequently tested on data from additional random samples of similar size at 1-, 2-, 5-, 10-, and >15-year follow-ups. Not applicable. PART-O. After combining items for productivity and social interaction, the initial analysis at 1-year follow-up indicated relatively good fit to the Rasch model (person reliability=.80) but also suggested item misfit and that the 0-to-5 scale used for most items did not consistently show clear separation between rating levels. Reducing item rating scales to 3 levels (except combined and dichotomous items) resolved these issues and demonstrated good item level discrimination, fit, and person reliability (.81), with no evidence of multidimensionality. These results replicated in analyses at each additional follow-up period. Modifications to item scoring for the PART-O resulted in a unidimensional parametric equivalent measure that addresses previous concerns about competing item relations, and it fit the Rasch model consistently across follow-up periods. The person-item map shows a progression toward greater community participation from solitary and dyadic activities, such as leaving the house and having a friend through social and productivity activities, to group activities with others who share interests or beliefs. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  5. A prognostic scoring system for arm exercise stress testing.

    Science.gov (United States)

    Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

    2016-01-01

    Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all pstatistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.

  6. Psychometric evaluation of the adolescent and parent versions of the Gaming Addiction Identification Test (GAIT).

    Science.gov (United States)

    Vadlin, Sofia; Åslund, Cecilia; Rehn, Mattias; Nilsson, Kent W

    2015-12-01

    The objective of the study is to evaluate the psychometric properties of the Gaming Addiction Identification Test (GAIT) and its parent version (GAIT-P), in a representative community sample of adolescents and parents in Västmanland, Sweden. Self-rated and parent-rated gaming addictive symptoms identified by GAIT and GAIT-P were analyzed for frequency of endorsement, internal consistency, concordance, factor structure, prevalence of Internet gaming disorder (IGD), concurrence with the Gaming Addiction Scale for Adolescents, 7-item version (GAS) and the parent version of GAS (GAS-P), and for sex differences. The 12-month prevalence of IGD was found to be 1.3% with GAIT and 2.4% with GAIT-P. Results also indicate promising psychometric results within this population, with high internal consistency, and high concurrent validity with GAS and GAS-P. Concordance between adolescents and parents ratings was high, although moderate in girls. Although exploratory factor analysis indicated poor model fit, it also indicated unidimensionality and high factor loadings in all analyses. GAIT and GAIT-P are suitable for continued use in measuring gaming addiction in adolescents, and, with the additional two items, they now cover all nine IGD criteria. © 2015 Scandinavian Psychological Associations and John Wiley & Sons Ltd.

  7. Development and psychometric testing of a new geriatric spiritual well-being scale.

    Science.gov (United States)

    Dunn, Karen S

    2008-09-01

    Aims and objectives.  Assess the psychometric properties of a new geriatric spiritual well-being scale (GSWS), specifically designed for older adults. Background.  Religiosity and spiritual wellness must be measured as two distinct concepts to prevent confounding them as synonymous among atheist and agnostic population. Design.  A test-retest survey design was used to estimate the psychometric properties. Methods.  A convenience sample of 138 community-dwelling older adults was drawn from the inner city of Detroit. Data were collected using telephone survey interviews. Data analyses included descriptive statistics, structural equation modelling, reliability analyses, and point-biserial correlations. Results.  The factorial validity of the proposed model was not supported by the data. Fit indices were χ(2)  = 185.98, d.f. = 98, P atheists have spiritual needs that do not include religious beliefs or practices. Thus, assessing patients' religious beliefs and practices prior to assessing spiritual well-being is essential to prevent bias. © 2008 The Author. Journal compilation © 2008 Blackwell Publishing Ltd.

  8. Construction and psychometric testing of the EMPATHIC questionnaire measuring parent satisfaction in the pediatric intensive care unit

    NARCIS (Netherlands)

    Latour, Jos M.; van Goudoever, Johannes B.; Duivenvoorden, Hugo J.; Albers, Marcel J. I. J.; van Dam, Nicolette A. M.; Dullaart, Eugenie; van Heerde, Marc; de Neef, Marjorie; Verlaat, Carin W. M.; van Vught, Elise M.; Hazelzet, Jan A.

    To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument

  9. Construction and psychometric testing of the EMPATHIC questionnaire measuring parent satisfaction in the pediatric intensive care unit

    NARCIS (Netherlands)

    Latour, J.M.; van Goudoever, J.B.; Duivenvoorden, H.J.; Albers, M.J.I.J.; van Dam, N.A.M.; Dullaart, E.; van Heerde, M.; de Neef, M.; Verlaat, C.W.M.; van Vught, E.M.; Hazelzet, J.A.

    2011-01-01

    To construct and test the reliability and validity of the EMpowerment of PArents in THe Intensive Care (EMPATHIC) questionnaire measuring parent satisfaction in the pediatric intensive care unit (PICU). Structured development and psychometric testing of a parent satisfaction-with-care instrument

  10. Modeling the Test-Taking Motivation Construct through Investigation of Psychometric Properties of an Expectancy-Value-Based Questionnaire

    Science.gov (United States)

    Knekta, Eva; Eklöf, Hanna

    2015-01-01

    The aim of this study was to evaluate the psychometric properties of an expectancy-value-based questionnaire measuring five aspects of test-taking motivation (effort, expectancies, importance, interest, and test anxiety). The questionnaire was distributed to a sample of Swedish Grade 9 students taking a low-stakes (n = 1,047) or a high-stakes (n =…

  11. A Latent Class Approach to Estimating Test-Score Reliability

    Science.gov (United States)

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

  12. Source Country Differences in Test Score Gaps: Evidence from Denmark

    Science.gov (United States)

    Rangvid, Beatrice Schindler

    2010-01-01

    We combine data from three studies for Denmark in the PISA 2000 framework to investigate differences in the native-immigrant test score gap by country of origin. In addition to the controls available from PISA data sources, we use student-level data on home background and individual migration histories linked from administrative registers. We find…

  13. Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

    Science.gov (United States)

    Minor, Elizabeth Covay

    2016-01-01

    Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…

  14. Manual for Scoring the Test of Directed Imagination.

    Science.gov (United States)

    Veldman, Donald J.; And Others

    A scoring manual for the Directed Imagination Test, a projective technique wherein the subject is instructed to write four fictional stories (four minutes are allowed for each) about teachers and their experiences, is presented. The manual provides detailed instructions for rating each story by fifteen dimensions relevant to teacher education…

  15. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    Science.gov (United States)

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  16. [Quality of advanced practice nurse counseling in home care settings (APN-BQ): psychometric testing of the instrument].

    Science.gov (United States)

    Petry, Heidi; Suter-Riederer, Susanne; Kerker-Specker, Carmen; Imhof, Lorenz

    2014-12-01

    Patient centred and individually-tailored counselling of older people with a chronic condition who live at home is a useful intervention to support their independence. The paper presents the development and psychometric testing of the APN-BQ Instrument, to measure patient-centeredness. To measure the quality of an in-home counselling intervention, a 23-item questionnaire was developed and tested with 206 people 80 years and older. Principal component analysis with Varimax Rotation was conducted (n = 206). Analysis revealed a four factor (fs = 0.91) model scoring in 19 items. All factors loaded > 0.45. Cronbach's alpha was 0.86. The utility and acceptance of the instrument was confirmed by the high response rate (100 %) and the fact that participants answered 98.8 % of all questions. The APN-BQ has shown to be a reliable Instrument with good content and construct validity. It is a tool for APNs to measure structure, process, and outcome quality of a patient-centred and individually-tailored counselling program, including the degree of patient participation, and patient empowerment.

  17. Psychometric properties of the Alcohol Use Disorders Identification Test (AUDIT) and prevalence of alcohol use among Iranian psychiatric outpatients.

    Science.gov (United States)

    Noorbakhsh, Simasadat; Shams, Jamal; Faghihimohamadi, Mohamadmahdi; Zahiroddin, Hanieh; Hallgren, Mats; Kallmen, Hakan

    2018-01-30

    Iran is a developing and Islamic country where the consumption of alcoholic beverages is banned. However, psychiatric disorders and alcohol use disorders are often co-occurring. We used the Alcohol Use Disorders Identification Test (AUDIT) to estimate the prevalence of alcohol use and examined the psychometric properties of the test among psychiatric outpatients in Teheran, Iran. AUDIT was completed by 846 consecutive (sequential) patients. Descriptive statistics, internal consistency (Cronbach alpha), confirmatory and exploratory factor analyses were used to analyze the prevalence of alcohol use, reliability and construct validity. 12% of men and 1% of women were hazardous alcohol consumers. Internal reliability of the Iranian version of AUDIT was excellent. Confirmatory factor analyses showed that the construct validity and the fit of previous factor structures (1, 2 and 3 factors) to data were not good and seemingly contradicted results from the explorative principal axis factoring, which showed that a 1-factor solution explained 77% of the co-variances. We could not reproduce the suggested factor structure of AUDIT, probably due to the skewed distribution of alcohol consumption. Only 19% of men and 3% of women scored above 0 on AUDIT. This could be explained by the fact that alcohol is illegal in Iran. In conclusion the AUDIT exhibited good internal reliability when used as a single scale. The prevalence estimates according to AUDIT were somewhat higher among psychiatric patients compared to what was reported by WHO regarding the general population.

  18. Psychometric characteristics of single-word tests of children's speech sound production.

    Science.gov (United States)

    Flipsen, Peter; Ogiela, Diane A

    2015-04-01

    Our understanding of test construction has improved since the now-classic review by McCauley and Swisher (1984). The current review article examines the psychometric characteristics of current single-word tests of speech sound production in an attempt to determine whether our tests have improved since then. It also provides a resource that clinicians may use to help them make test selection decisions for their particular client populations. Ten tests published since 1990 were reviewed to determine whether they met the 10 criteria set out by McCauley and Swisher (1984), as well as 7 additional criteria. All of the tests reviewed met at least 3 of McCauley and Swisher's (1984) original criteria, and 9 of 10 tests met at least 5 of them. Most of the tests met some of the additional criteria as well. The state of the art for single-word tests of speech sound production in children appears to have improved in the last 30 years. There remains, however, room for improvement.

  19. Propriedades psicométricas apresentadas em manuais de testes de inteligência Psychometric parameters in intelligence test directions

    Directory of Open Access Journals (Sweden)

    Ana Paula Porto Noronha

    2003-06-01

    Full Text Available A pesquisa teve como objetivo verificar quais os parâmetros psicométricos apresentados nos manuais de 19 instrumentos de avaliação da inteligência. Os elementos avaliados nos instrumentos foram: análise de itens, padronização, validade e precisão. Os resultados encontrados mostraram que, dos 19 testes avaliados, 89,5% apresentaram estudos de padronização, sendo que o procedimento mais utilizado na escolha dos sujeitos foi o não aleatório (62,2% dos testes. No que se refere à validade, a de construto foi a mais freqüente dentre os testes (94,7%. Observou-se que todos os instrumentos apresentaram verifica��ão da precisão, sendo o método de consistência interna o mais aplicado (78,9%. Conclui-se que, embora os autores concordem que todos os testes devam realizar estudos de verificação dos parâmetros psicométricos e devam possuir normas regionais, tal prática ainda não se encontra totalmente difundida na avaliação psicológica brasileira,This research aimed to verify the psychometric parameters presented in manuals of 19 intelligence tests. The psychometric properties included in the analysis were: item analysis, validity, reliability, and norms studies. The results indicated that 89.5% of the 19 tests presented norming studies. The procedure of sample selection was mostly non-random (62.2% of the tests. Construct validity was the most frequent method used among the studies (94.7%. All tests presented reliability studies, most of them using internal consistency coefficient (78.9%. It is concluded that although the authors agree that all tests need studies to verify psychometric parameters and studies to obtain regional norms this action isn’t divulged totally yet in the Brazilian psychological assessment.

  20. Development and psychometric testing of the Nursing Workplace Relational Environment Scale (NWRES).

    Science.gov (United States)

    Duddle, Maree; Boughton, Maureen

    2009-03-01

    The aim of this study was to develop and test the psychometric properties of the Nursing Workplace Relational Environment Scale (NWRES). A positive relational environment in the workplace is characterised by a sense of connectedness and belonging, support and cooperation among colleagues, open communication and effectively managed conflict. A poor relational environment in the workplace may contribute to job dissatisfaction and early turnover of staff. Quantitative survey. A three-stage process was used to design and test the NWRES. In Stage 1, an extensive literature review was conducted on professional working relationships and the nursing work environment. Three key concepts; collegiality, workplace conflict and job satisfaction were identified and defined. In Stage 2, a pool of items was developed from the dimensions of each concept and formulated into a 35-item scale which was piloted on a convenience sample of 31 nurses. In Stage 3, the newly refined 28-item scale was administered randomly to a convenience sample of 150 nurses. Psychometric testing was conducted to establish the construct validity and reliability of the scale. Exploratory factor analysis resulted in a 22-item scale. The factor analysis indicated a four-factor structure: collegial behaviours, relational atmosphere, outcomes of conflict and job satisfaction which explained 68.12% of the total variance. Cronbach's alpha coefficient for the NWRES was 0.872 and the subscales ranged from 0.781-0.927. The results of the study confirm the reliability and validity of the NWRES. Replication of this study with a larger sample is indicated to determine relationships among the subscales. The results of this study have implications for health managers in terms of understanding the impact of the relational environment of the workplace on job satisfaction and retention.

  1. Cross-cultural validation and psychometric testing of the Norwegian version of the TeamSTEPPS® teamwork perceptions questionnaire.

    Science.gov (United States)

    Ballangrud, Randi; Husebø, Sissel Eikeland; Hall-Lord, Marie Louise

    2017-12-02

    Teamwork is an integrated part of today's specialized and complex healthcare and essential to patient safety, and is considered as a core competency to improve twenty-first century healthcare. Teamwork measurements and evaluations show promising results to promote good team performance, and are recommended for identifying areas for improvement. The validated TeamSTEPPS® Teamwork Perception Questionnaire (T-TPQ) was found suitable for cross-cultural validation and testing in a Norwegian context. T-TPQ is a self-report survey that examines five dimensions of perception of teamwork within healthcare settings. The aim of the study was to translate and cross-validate the T-TPQ into Norwegian, and test the questionnaire for psychometric properties among healthcare personnel. The T-TPQ was translated and adapted to a Norwegian context according to a model of a back-translation process. A total of 247 healthcare personnel representing different professionals and hospital settings responded to the questionnaire. A confirmatory factor analysis was carried out to test the factor structure. Cronbach's alpha was used to establish internal consistency, and an Intraclass Correlation Coefficient was used to assess the test - retest reliability. A confirmatory factor analysis showed an acceptable fitting model (χ 2 (df) 969.46 (546), p teamwork dimension clearly represents that specific construct. The Cronbach's alpha demonstrated acceptable values on the five subscales (0.786-0.844), and test-retest showed a reliability parameter, with Intraclass Correlation Coefficient scores from 0.672 to 0.852. The Norwegian version of T-TPQ was considered to be acceptable regarding the validity and reliability for measuring Norwegian individual healthcare personnel's perception of group level teamwork within their unit. However, it needs to be further tested, preferably in a larger sample and in different clinical settings.

  2. Psychometric properties of the Turkish version of the Internet Addiction Test (IAT).

    Science.gov (United States)

    Boysan, Murat; Kuss, Daria J; Barut, Yaşar; Ayköse, Nafi; Güleç, Mustafa; Özdemir, Osman

    2017-01-01

    Of many instruments developed to assess Internet addiction, the Internet Addiction Test (IAT), an expanded version of the Internet Addiction Diagnostic Questionnaire (IADQ), has been the most widely used scale in English and non-English speaking populations. In this study, our aim was to investigate the psychometric properties of short and expanded versions of the IAT in a Turkish undergraduate sample. Overall, 455 undergraduate students from Turkey aged between 18 and 30 participated in the study (63.53% were females). Explanatory and confirmatory factor analytic procedures investigated factor structures of the IADQ and IAT. The Internet Addiction Scale (IAS), Coping Inventory for Stressful Situations (CISS), Obsessive Compulsive Inventory-Revised (OCI-R) and Dissociative Experiences Scale (DES) were administered to assess convergent and divergent validities of the IADQ and IAT. Internal consistency and 15-day test-retest reliability were computed. In the factorial analytic investigation, we found a unidimensional factor structure for each measure fit the current data best. Significant but weak to moderate correlations of the IADQ and the IAT with the CISS, OCI-R and DES provided empirical evidence for divergent validity, whereas strong associations with the subscales of the IAS pointed to the convergent validity of Young's Internet addiction construct. Internal consistency of the IADQ was weak (α=0.67) and of the IAT was high (α=0.93). Temporal reliability of both instruments was very high (α=0.81 and α=0.87; respectively). The IAT revealed promising and sound psychometric properties in a Turkish sample. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. The Weighted Airman Promotion System: Standardizing Test Scores

    Science.gov (United States)

    2008-01-01

    u th o ri ze d Top 3/E6 ratio, inventory 1401206040 100 70 130 5R 2F 2G 3N 2M 2A 4J 4C 4P 4T 4B 1W 2T 3P 1T 4A 2S 5J 1A 1S1C 6F 4N 7S 4R 4E 1N 3A 3V...System: Standardizing Test Scores AFHRL convened a panel to identify the relevant factors to consider, and then sit as a promotion board and rank...Costs If the Air Force decided to standardize test scores, there would be three basic types of costs: implementation costs, marketing costs, and

  4. Investigation of Psychometric Properties of the Test for Creative Thinking-Drawing Production: Evidence from Study in Latvia

    Science.gov (United States)

    Kalis, Emils; Roke, Liga; Krumina, Indra

    2016-01-01

    The Test for Creative Thinking-Drawing Production (TCT-DP) is designed as an effective drawing-based instrument for measuring creative potential. Many studies report adaptation efforts in other cultures pointing out good psychometric properties of the instrument nonetheless revealing also some trouble spots. The present study includes adaptation…

  5. D-Catch instrument : development and psychometric testing of a measurement instrument for nursing documentation in hospitals

    NARCIS (Netherlands)

    Paans, Wolter; Sermeus, Walter; Nieweg, Roos; van der Schans, Cees P.

    AIM: This paper is a report of the development and testing of the psychometric properties of an instrument to measure the accuracy of nursing documentation in general hospitals. BACKGROUND: Little information is available about the accuracy of nursing documentation. None of the existing instruments

  6. Psychological theory testing versus psychometric nay-saying: comment on Neuberg et al.'s (1997) critique of the need for closure scale.

    Science.gov (United States)

    Kruglanski, A W; Atash, M N; DeGrada, E; Mannetti, L; Pierro, A; Webster, D M

    1997-11-01

    S. L. Neuberg, T. N. Judice, and S. G. West (1997) faulted our work with the Need for Closure Scale (NFCS) on grounds that the NFCS lacks discriminant validity relative to S. L. Neuberg's and J. T. Newsom's (1993) Personal Need for Structure (PNS) Scale and is multidimensional, which, so they claim, renders the use of its total score inadmissible. By contrast, the present authors show that neither of the above assertions is incompatible with the underlying need for closure theory. Relations between NFCS and the PNS are to be expected, as these were designed to operationalize the very same construct (of need for closure). Furthermore, no unidimensionality of the NFCS has been claimed, and none is required to use its total score for testing various theoretically derived predictions. An instrument's ultimate utility hinges on theoretical considerations and empirical evidence rather than on questionable psychometric dogma unrelated to the substantive matters at hand.

  7. Using the Nursing Culture Assessment Tool (NCAT) in Long-Term Care: An Update on Psychometrics and Scoring Standardization.

    Science.gov (United States)

    Kennerly, Susan; Heggestad, Eric D; Myers, Haley; Yap, Tracey L

    2015-07-29

    An effective workforce performing within the context of a positive cultural environment is central to a healthcare organization's ability to achieve quality outcomes. The Nursing Culture Assessment Tool (NCAT) provides nurses with a valid and reliable tool that captures the general aspects of nursing culture. This study extends earlier work confirming the tool's construct validity and dimensionality by standardizing the scoring approach and establishing norm-referenced scoring. Scoring standardization provides a reliable point of comparison for NCAT users. NCAT assessments support nursing's ability to evaluate nursing culture, use results to shape the culture into one that supports change, and advance nursing's best practices and care outcomes. Registered nurses, licensed practical nurses, and certified nursing assistants from 54 long-term care facilities in Kentucky, Nevada, North Carolina, and Oregon were surveyed. Confirmatory factor analysis yielded six first order factors forming the NCAT's subscales (Expectations, Behaviors, Teamwork, Communication, Satisfaction, Commitment) (Comparative Fit Index 0.93) and a second order factor-The Total Culture Score. Aggregated facility level comparisons of observed group variance with expected random variance using rwg(J) statistics is presented. Normative scores and cumulative rank percentages and how the NCAT can be used in implementing planned change are provided.

  8. Using the Nursing Culture Assessment Tool (NCAT in Long-Term Care: An Update on Psychometrics and Scoring Standardization

    Directory of Open Access Journals (Sweden)

    Susan Kennerly

    2015-07-01

    Full Text Available An effective workforce performing within the context of a positive cultural environment is central to a healthcare organization’s ability to achieve quality outcomes. The Nursing Culture Assessment Tool (NCAT provides nurses with a valid and reliable tool that captures the general aspects of nursing culture. This study extends earlier work confirming the tool’s construct validity and dimensionality by standardizing the scoring approach and establishing norm-referenced scoring. Scoring standardization provides a reliable point of comparison for NCAT users. NCAT assessments support nursing’s ability to evaluate nursing culture, use results to shape the culture into one that supports change, and advance nursing’s best practices and care outcomes. Registered nurses, licensed practical nurses, and certified nursing assistants from 54 long-term care facilities in Kentucky, Nevada, North Carolina, and Oregon were surveyed. Confirmatory factor analysis yielded six first order factors forming the NCAT’s subscales (Expectations, Behaviors, Teamwork, Communication, Satisfaction, Commitment (Comparative Fit Index 0.93 and a second order factor—The Total Culture Score. Aggregated facility level comparisons of observed group variance with expected random variance using rwg(J statistics is presented. Normative scores and cumulative rank percentages and how the NCAT can be used in implementing planned change are provided.

  9. Allele-sharing models: LOD scores and accurate linkage tests.

    Science.gov (United States)

    Kong, A; Cox, N J

    1997-11-01

    Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested.

  10. Psychometric evaluation of a motor control test battery of the craniofacial region.

    Science.gov (United States)

    von Piekartz, H; Stotz, E; Both, A; Bahn, G; Armijo-Olivo, S; Ballenberger, N

    2017-12-01

    The primary objective of this study was to determine the structural and known-group validity as well as the inter-rater reliability of a test battery to evaluate the motor control of the craniofacial region. Seventy volunteers without TMD and 25 subjects with TMD (Axes I) per the DC/TMD were asked to execute a test battery consisting of eight tests. The tests were video-taped in the same sequence in a standardised manner. Two experienced physical therapists participated in this study as blinded assessors. We used exploratory factor analysis to identify the underlying component structure of the eight tests. Internal consistency (Cronbach's α), inter-rater reliability (intra-class correlation coefficient) and construct validity (ie, hypothesis testing-known-group validity) (receiver operating curves) were also explored for the test battery. The structural validity showed the presence of one factor underlying the construct of the test battery. The internal consistency was excellent (0.90) as well as the inter-rater reliability. All values of reliability were close to 0.9 or above indicating very high inter-rater reliability. The area under the curve (AUC) was 0.93 for rater 1 and 0.94 for rater two, respectively, indicating excellent discrimination between subjects with TMD and healthy controls. The results of the present study support the psychometric properties of test battery to measure motor control of the craniofacial region when evaluated through videotaping. This test battery could be used to differentiate between healthy subjects and subjects with musculoskeletal impairments in the cervical and oro-facial regions. In addition, this test battery could be used to assess the effectiveness of management strategies in the craniofacial region. © 2017 John Wiley & Sons Ltd.

  11. Psychometric test of the Team Climate Inventory-short version investigated in Dutch quality improvement teams

    Directory of Open Access Journals (Sweden)

    Nieboer Anna P

    2009-07-01

    Full Text Available Abstract Background Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Results Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales – vision, participative safety, task orientation and support for innovation – acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. Conclusion The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.

  12. Psychometric test of the Team Climate Inventory-short version investigated in Dutch quality improvement teams.

    Science.gov (United States)

    Strating, Mathilde M H; Nieboer, Anna P

    2009-07-24

    Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. This study included quality improvement teams participating in the Care for Better improvement program for home care, care for the handicapped and the elderly in the Netherlands between 2006 and 2008. As part of a larger evaluation study 270 written questionnaires from team members were collected at baseline and 139 questionnaires at end measurement. Confirmatory factor analyses, reliability, Pearson correlations and paired samples t-tests were conducted to investigate construct validity, reliability, predictive validity and temporal stability. Confirmatory factor analyses revealed the expected four-factor structure and good fit indices. For the four subscales--vision, participative safety, task orientation and support for innovation--acceptable Cronbach's alpha coefficients and high inter-item correlations were found. The four subscales all proved significant predictors of perceived team effectiveness, with participatory safety being the best predictor. As expected the four subscales were found to be stable over time; i.e. without significant changes between baseline and end measurement. The psychometric properties of the Dutch version of the TCI-14 are satisfactory. Together these results show that the TCI-14 is a useful instrument to assess to what extent aspects of team climate influence perceived team effectiveness of quality improvement teams.

  13. Psychometric testing of the properties of the spiritual health scale short form.

    Science.gov (United States)

    Hsiao, Ya-Chu; Chiang, Yi-Chien; Lee, Hsiang-Chun; Han, Chin-Yen

    2013-11-01

    To further examine the psychometric properties of the spiritual health scale short form, including its reliability and validity. Spirituality is one of the main factors associated with good health outcomes. A reliable and valid instrument to measure spirituality is essential to identify the spiritual needs of an individual and to evaluate the effect of spiritual care. A cross-sectional study design was used. The study was conducted in six nursing schools in northern, central and southern Taiwan. The inclusion criterion for participants was nursing students with clinical practice experience. Initially, 1141 participants were recruited for the study, but 67 were absent and 48 did not complete the questionnaires. A total of 1026 participants were finally recruited, indicating a response rate of 89·9%. The psychometric testing of the spiritual health scale short form included construct validity with confirmatory factor analysis, known-group validity and internal consistency reliability. The results of the confirmatory factor analysis supported the five-factor model as an acceptable model fit. In the known-group validity, the results indicated that people who are in the category of primary religious affiliation have better spiritual health than people in the category of secondary religious affiliation and atheism. The result also indicated that the 24-item spiritual health scale short form achieved an acceptable internal consistency coefficient. The findings suggest that the spiritual health scale short form is a valid and reliable instrument for the appraisal of individual spiritual health. The spiritual health scale short form could provide useful information to guide clinical practice in assessing and managing people's spiritual health in Taiwan. © 2013 John Wiley & Sons Ltd.

  14. Development and psychometric testing the Health of Body, Mind and Spirit Scale for assessing individuals who have drug abuse histories.

    Science.gov (United States)

    Sun, Fan-Ko; Chiang, Chun-Ying; Lu, Chu-Yun; Yu, Pei-Jane; Liao, Tzu-Chiao; Lan, Chu-Mei

    2018-03-01

    To develop the Health of Body, Mind and Spirit Scale (HBMSS), which was designed to assess drug abusers' health condition. Helping drug abusers to become healthy is important to healthcare professionals. However, no instrument exists to assess drug abusers' state of health. A cross-sectional questionnaire survey was implemented to examine the validity of the HBMSS. Data were collected from 2015-2016 at one drug abuse prevention centre in Taiwan. Participants (N = 320) who had abused drugs were invited to complete a preliminary 64-item version of the HBMSS. An item analysis, criterion-related validity analysis (using the Relapse Prediction Scale [RPS] score), split-half reliability testing and confirmatory factor analysis (CFA) were conducted to examine the psychometric properties of the HBMSS. The final version of the HBMSS contained 15 items that were divided into three subscales: the health of the body, mind and spirit. Cronbach's α and split-half reliability coefficients were all above .85. The factor loading of each item was between .74-.95. The HBMSS had satisfactory criterion-related validity with the RPS score (r = -.50, p < .001). A second-order CFA was conducted on the HBMSS. The fit indexes were good, χ 2  = 184.060, df = 94, χ 2 /df = 1.958 (p = .000). The entire HBMSS and the subscales had satisfactory reliability and validity. Healthcare professionals could use the HBMSS to evaluate the condition of the health of individuals with a drug abuse history. © 2017 John Wiley & Sons Ltd.

  15. The Italian version of the 16-item prodromal questionnaire (iPQ-16): Field-test and psychometric features.

    Science.gov (United States)

    Lorenzo, Pelizza; Silvia, Azzali; Federica, Paterlini; Sara, Garlassi; Ilaria, Scazza; Pupo, Simona; Andrea, Raballo

    2018-03-20

    Among current early screeners for psychosis-risk states, the Prodromal Questionnaire-16 items (PQ-16) is often used. We aimed to assess validity and reliability of the Italian version of the PQ-16 in a young adult help-seeking population. We included 154 individuals aged 18-35years seeking help at the Reggio Emilia outpatient mental health services in a large semirural catchment area (550.000 inhabitants). Participants completed the Italian version of the PQ-16 (iPQ-16) and were subsequently evaluated with the Comprehensive Assessment of At-Risk Mental States (CAARMS). We examined diagnostic accuracy (i.e. specificity, sensitivity, negative and positive likelihood ratios, and negative and positive predictive values) and content, convergent, and concurrent validity between PQ-16 and CAARMS using Cronbach's alpha, Spearman's rho, and Cohen's kappa, respectively. We also tested the validity of the adopted PQ-16 cut-offs through Receiver Operating Characteristic (ROC) curves plotted against CAARMS diagnoses and the 1-year predictive validity of the PQ-16. The iPQ-16 showed high internal consistency and acceptable diagnostic accuracy and concurrent validity. ROC analyses pointed to a cut-off score of ≥5 as best cut-off. After 12months of follow-up, 8.7% of participants with a PQ-16 symptom total score of ≥5 who were below the CAARMS psychosis threshold at the baseline, developed a psychotic disorder. Psychometric properties of the iPQ-16 were satisfactory. Copyright © 2018. Published by Elsevier B.V.

  16. Biering-Sorensen test scores in coal miners

    Energy Technology Data Exchange (ETDEWEB)

    Tekin, Y.; Ortancil, O.; Ankarali, H.; Basaran, A.; Sarikaya, S.; Ozdolap, S. [Zonguldak Karaelmas University, Zonguldak (Turkey)

    2009-05-15

    Biering-Sorensen test is an isometric back endurance test. Biering-Sorensen test scores have varied in different cultural and occupational groups. The aims of this study were to collect normative data on Biering-Sorensen holding times, to determine the discriminative ability of the Biering-Sorensen test in Turkish coal miners, and to examine the association between Biering-Sorensen test result and functional disability. One hundred and fifty male coal miners participated in this study. Trunk extensor muscle strength was measured using the Biering-Sorensen test. Oswestry disability index was used to measure the functional disability level of low back pain. The mean Biering-Sorensen holding time for the total subject group was 107.3 {+-} 22.5 s. The mean time of Biering-Sorensen test of the subjects with and without low back pain were 99.9 {+-} 19.8 and 128.6 {+-} 15.2 s, respectively. The difference between the subjects with and without low back pain was statistically significant (p < 0.001). There was a statistically significant negative correlation between Oswestry functional disability score and Biering-Sorensen holding time (R = -0.824, p < 0.001). Turkish coal miners have low mean back extensor endurance holding times. Biering-Sorensen test had a good discriminative ability in our study group. Trunk muscle strength has a significant effect on the disability level of low back pain. Thus trunk muscle endurance training exercise therapy may be effective for the reduction of disability in patients with low back pain.

  17. Parent Ratings of Impulsivity and Inhibition Predict State Testing Scores

    Directory of Open Access Journals (Sweden)

    Rebecca A. Lundwall

    2018-03-01

    Full Text Available One principle of cognitive development is that earlier intervention for educational difficulties tends to improve outcomes such as future educational and career success. One possible way to help students who struggle is to determine if they process information differently. Such determination might lead to clues for interventions. For example, early information processing requires attention before the information can be identified, encoded, and stored. The aim of the present study was to investigate whether parent ratings of inattention, inhibition, and impulsivity, and whether error rate on a reflexive attention task could be used to predict child scores on state standardized tests. Finding such an association could provide assistance to educators in identifying academically struggling children who might require targeted educational interventions. Children (N = 203 were invited to complete a peripheral cueing task (which measures the automatic reorienting of the brain’s attentional resources from one location to another. While the children completed the task, their parents completed a questionnaire. The questionnaire gathered information on broad indicators of child functioning, including observable behaviors of impulsivity, inattention, and inhibition, as well as state academic scores (which the parent retrieved online from their school. We used sequential regression to analyze contributions of error rate and parent-rated behaviors in predicting six academic scores. In one of the six analyses (for science, we found that the improvement was significant from the simplified model (with only family income, child age, and sex as predictors to the full model (adding error rate and three parent-rated behaviors. Two additional analyses (reading and social studies showed near significant improvement from simplified to full models. Parent-rated behaviors were significant predictors in all three of these analyses. In the reading score analysis

  18. Psychometric evaluation of the Chinese version of short-form Test of Functional Health Literacy in Adolescents.

    Science.gov (United States)

    Chang, Li-Chun; Hsieh, Pei-Lin; Liu, Chieh-Hsing

    2012-09-01

    The purpose of this study is to develop and evaluate the psychometric properties of the Chinese version of short-form Test of Functional Health Literacy in Adolescents. Assessing health literacy is vital to design health education programme; however, there are no measurement tools exist for use specifically in Chinese adolescents. A non-experimental design was used to test the psychometric properties of the Test of Functional Health Literacy in Adolescents. The short-form Test of Functional Health Literacy in Adolescents was translated and back translated into a Chinese language version. Thirty high school students were recruited to validate the scenario of Test of Functional Health Literacy in Adolescents. Based on the multiple-stage stratified random sampling method, 300 high school students from four counties in Taiwan were invited to participate in this study to evaluate the psychometric properties of Test of Functional Health Literacy in Adolescents. The Functional Health Literacy in Adolescents had good internal consistency reliability and excellent test-retest reliability. Confirmatory factor analysis resulted in a one-factor solution. Contrary to the original version of the Test of Functional Health Literacy in Adolescents, the findings revealed that the 36-item, one-factor model for the Test of Functional Health Literacy in Adolescents is the best-fit model. This is a suitable instrument to assess health literacy levels in Chinese adolescents before health education programmes can be appropriately planned, implemented and evaluated. © 2012 Blackwell Publishing Ltd.

  19. Validity of GRE General Test scores and TOEFL scores for graduate admission to a technical university in Western Europe

    Science.gov (United States)

    Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

    2018-01-01

    Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while additional 7% were explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed only little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.

  20. The paced auditory serial addition test for working memory assessment: Psychometric properties

    Science.gov (United States)

    Nikravesh, Maryam; Jafari, Zahra; Mehrpour, Masoud; Kazemi, Roozbeh; Amiri Shavaki, Younes; Hossienifar, Shamim; Azizi, Mohamad Parsa

    2017-01-01

    Background: The paced auditory serial addition test (PASAT) was primarily developed to assess the effects of traumatic brain injury on cognitive functioning. Working memory (WM) is one of the most important aspects of cognitive function, and WM impairment is one of the clinically remarkable signs of aphasia. To develop the Persian version of PASAT, an initial version was used in individuals with aphasia (IWA). Methods: In this study, 25 individuals with aphasia (29-60 years) and 85 controls (18-60 years) were included. PASAT was presented in the form of recorded 61 single-digit numbers (1 to 9). The participants repeatedly added the 2 recent digits. The psychometric properties of PASAT including convergent validity (using the digit memory span tasks), divergent validity (using results in the control group and IWA group), and face validity were investigated. Test-retest reliability was considered as well. Results: The relationship between the PASAT and digit memory span tests was moderate to strong in the control group (forward digit memory span test: r= 0.52, p< 0.0001; backward digit memory span test: r = 0.48, p< 0.0001). A strong relationship was found in IWA (forward digit memory span test: r= 0.72, p< 0.0001; backward digit memory span test: r= 0.53, p= 0.006). Also, strong testretest reliability (intraclass correlation= 0.95, p< 0.0001) was observed. Conclusion: According to our results, the PASAT is a valid and reliable test to assess working memory, particularly in IWA. It could be used as a feasible tool for clinical and research applications. PMID:29445690

  1. Center for Epidemiologic Studies Depression Scale for Children: psychometric testing of the Chinese version.

    Science.gov (United States)

    Li, Ho Cheung William; Chung, Oi Kwan Joyce; Ho, Ka Yan

    2010-11-01

    This paper is a report of psychometric testing of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children. The availability of a valid and reliable instrument that accurately detects depressive symptoms in children is crucial before any psychological intervention can be appropriately planned and evaluated. There is no such an instrument for Chinese children. A test-retest, within-subjects design was used. A total of 313 primary school students between the ages of 8 and 12 years were invited to participate in the study in 2009. Participants were asked to respond to the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children, short form of the State Anxiety Scale for Children and Rosenberg's Self-Esteem Scale. The internal consistency, content validity and construct validity and test-retest reliability of the Chinese version of the Center for Epidemiologic Studies Depression Scale for Children were assessed. The newly-translated scale demonstrated adequate internal consistency, good content validity and appropriate convergent and discriminant validity. Confirmatory factor analysis added further evidence of the construct validity of the scale. Results suggest that the newly-translated scale can be used as a self-report assessment tool in detecting depressive symptoms of Chinese children aged between 8 and 12 years. © 2010 Blackwell Publishing Ltd.

  2. Revision and psychometric testing of the City of Hope Quality of Life-Ostomy Questionnaire.

    Science.gov (United States)

    Grant, Marcia; Ferrell, Betty; Dean, Grace; Uman, Gwen; Chu, David; Krouse, Robert

    2004-10-01

    Ostomies may be performed for bowel or urinary diversion, and occur in both cancer and non-cancer patients. Impact on physical, psychological, social and spiritual well-being is not unexpected, but has been minimally described in the literature. The City of Hope Quality of Life (COH-QOL)-Ostomy Questionnaire is an adult patient self-report instrument designed to assess quality of life. This report focuses on the revision and psychometric testing of this questionnaire. The revised COH-QOL-Ostomy Questionnaire involved in-depth patient interviews and expert panel review. The format consisted of a 13-item disease and demographic section, a 34-item forced-choice section, and a 41-item linear analogue scaled section. A mailed survey to California members of the United Ostomy Association resulted in a 62% response rate (n = 1513). Factor analysis was conducted to refine the instrument. Construct validity involved testing a number of hypotheses identifying contrasting groups. Factor analysis confirmed the conceptual framework. Reliability of subscales ranged from 0.77 to 0.90. The questionnaire discriminated between subpopulations with specific concerns. Overall, the analyses provide evidence for the validity and reliability of the COH-QOL-Ostomy Questionnaire as a comprehensive, multidimensional self-report questionnaire for measuring quality of life in patients with intestinal ostomies.

  3. Computerized scoring algorithms for the Autobiographical Memory Test.

    Science.gov (United States)

    Takano, Keisuke; Gutenbrunner, Charlotte; Martens, Kris; Salmon, Karen; Raes, Filip

    2018-02-01

    Reduced specificity of autobiographical memories is a hallmark of depressive cognition. Autobiographical memory (AM) specificity is typically measured by the Autobiographical Memory Test (AMT), in which respondents are asked to describe personal memories in response to emotional cue words. Due to this free descriptive responding format, the AMT relies on experts' hand scoring for subsequent statistical analyses. This manual coding potentially impedes research activities in big data analytics such as large epidemiological studies. Here, we propose computerized algorithms to automatically score AM specificity for the Dutch (adult participants) and English (youth participants) versions of the AMT by using natural language processing and machine learning techniques. The algorithms showed reliable performances in discriminating specific and nonspecific (e.g., overgeneralized) autobiographical memories in independent testing data sets (area under the receiver operating characteristic curve > .90). Furthermore, outcome values of the algorithms (i.e., decision values of support vector machines) showed a gradient across similar (e.g., specific and extended memories) and different (e.g., specific memory and semantic associates) categories of AMT responses, suggesting that, for both adults and youth, the algorithms well capture the extent to which a memory has features of specific memories. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  4. School accountability and the black-white test score gap.

    Science.gov (United States)

    Gaddis, S Michael; Lauen, Douglas Lee

    2014-03-01

    Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality. Copyright © 2013 Elsevier Inc. All rights reserved.

  5. ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

    Science.gov (United States)

    Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C.

    2016-01-01

    Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…

  6. An Investigation into the Psychometric Properties of the Proportional Reduction of Mean Squared Error and Augmented Scores

    Science.gov (United States)

    Stephens, Christopher Neil

    2012-01-01

    Augmentation procedures are designed to provide better estimates for a given test or subtest through the use of collateral information. The main purpose of this dissertation was to use Haberman's and Wainer's augmentation procedures on a large-scale, standardized achievement test to understand the relationship between reliability and…

  7. Development and psychometric testing of the active aging scale for Thai adults.

    Science.gov (United States)

    Thanakwang, Kattika; Isaramalai, Sang-Arun; Hatthakit, Urai

    2014-01-01

    Active aging is central to enhancing the quality of life for older adults, but its conceptualization is not often made explicit for Asian elderly people. Little is known about active aging in older Thai adults, and there has been no development of scales to measure the expression of active aging attributes. The aim of this study was to develop a culturally relevant composite scale of active aging for Thai adults (AAS-Thai) and to evaluate its reliability and validity. EIGHT STEPS OF SCALE DEVELOPMENT WERE FOLLOWED: 1) using focus groups and in-depth interviews, 2) gathering input from existing studies, 3) developing preliminary quantitative measures, 4) reviewing for content validity by an expert panel, 5) conducting cognitive interviews, 6) pilot testing, 7) performing a nationwide survey, and 8) testing psychometric properties. In a nationwide survey, 500 subjects were randomly recruited using a stratified sampling technique. Statistical analyses included exploratory factor analysis, item analysis, and measures of internal consistency, concurrent validity, and test-retest reliability. Principal component factor analysis with varimax rotation resulted in a final 36-item scale consisting of seven factors of active aging: 1) being self-reliant, 2) being actively engaged with society, 3) developing spiritual wisdom, 4) building up financial security, 5) maintaining a healthy lifestyle, 6) engaging in active learning, and 7) strengthening family ties to ensure care in later life. These factors explained 69% of the total variance. Cronbach's alpha coefficient for the overall AAS-Thai was 0.95 and varied between 0.81 and 0.91 for the seven subscales. Concurrent validity and test-retest reliability were confirmed. The AAS-Thai demonstrated acceptable overall validity and reliability for measuring the multidimensional attributes of active aging in a Thai context. This newly developed instrument is ready for use as a screening tool to assess active aging levels among older

  8. Development and psychometric testing of the Nurse Practitioner Primary Care Organizational Climate Questionnaire.

    Science.gov (United States)

    Poghosyan, Lusine; Nannini, Angela; Finkelstein, Stacey R; Mason, Emanuel; Shaffer, Jonathan A

    2013-01-01

    Policy makers and healthcare organizations are calling for expansion of the nurse practitioner (NP) workforce in primary care settings to assure timely access and high-quality care for the American public. However, many barriers, including those at the organizational level, exist that may undermine NP workforce expansion and their optimal utilization in primary care. This study developed a new NP-specific survey instrument, Nurse Practitioner Primary Care Organizational Climate Questionnaire (NP-PCOCQ), to measure organizational climate in primary care settings and conducted its psychometric testing. Using instrument development design, the organizational climate domain pertinent for primary care NPs was identified. Items were generated from the evidence and qualitative data. Face and content validity were established through two expert meetings. Content validity index was computed. The 86-item pool was reduced to 55 items, which was pilot tested with 81 NPs using mailed surveys and then field-tested with 278 NPs in New York State. SPSS 18 and Mplus software were used for item analysis, reliability testing, and maximum likelihood exploratory factor analysis. Nurse Practitioner Primary Care Organizational Climate Questionnaire had face and content validity. The content validity index was .90. Twenty-nine items loaded on four subscale factors: professional visibility, NP-administration relations, NP-physician relations, and independent practice and support. The subscales had high internal consistency reliability. Cronbach's alphas ranged from.87 to .95. Having a strong instrument is important to promote future research. Also, administrators can use it to assess organizational climate in their clinics and propose interventions to improve it, thus promoting NP practice and the expansion of NP workforce.

  9. The development and psychometric testing of East Asian Acculturation Scale among Asian immigrant women in Taiwan.

    Science.gov (United States)

    Kuo, Shu-Fen; Chang, Wen-Yin; Chang, Lu-I; Chou, Yu-Hua; Chen, Ching-Min

    2013-01-01

    This is a report of development and psychometric testing of the East Asian Acculturation Measure-Chinese version (EAAM-C) scale. An instrument validation design with a cross-sectional survey was conducted. The process was carried in two phases. In Phase 1, Barry's East Asian Acculturation Measure was translated and back translated to evaluate its content, face validity, and feasibility validity. In Phase 2, the 16-item EAAM-C was pilot-tested among 485 female immigrants for test-retest reliability, internal consistency, theoretically-supported construct validity and concurrent validity. The pilot work and the survey results indicated the tools possessed adequate content and face validity. The Cronbach's Alphas for the EAAM-C was 0.72, and 0.76-0.79 for its subscales, and the correlation of test-retest reliability (at 3 weeks) was 0.75. After dropping one item, four theoretically-supported factors which explained 61.82% of the variance were abstracted using exploratory factor analysis: assimilation, integration, separation, and marginalization. Based on the underlying four-factor theoretical structures of the EAAM, the confirmatory factor analysis of the EAAM-C was further examined. The analysis revealed that the four-factor model was an acceptable fit for the data which demonstrated adequate finding in its construct validity. These factors were inter-correlated, and showed statistically significant correlation with the Chinese Health Questionnaire, indicating adequate concurrent validity. The scale shows acceptable validity and consistency, and suggests that immigrant acculturation is a complex construct. This quick evaluation instrument can be applied to assess clients' acculturation and in further developing certain interventions to improve their health.

  10. Development and psychometric testing of a new tool for detecting moral distress: the Moral Distress Thermometer.

    Science.gov (United States)

    Wocial, Lucia D; Weaver, Michael T

    2013-01-01

    To report the development and psychometric testing of the Moral Distress Thermometer. The Moral Distress Thermometer is a new screening tool to measure moral distress in nurses who practise in the hospital setting. Moral distress occurs when one knows the ethically correct thing to do, but is prevented from acting on that perceived obligation. It is a well documented phenomenon with negative consequences that may be experienced by nurses. Creating an instrument to effectively and efficiently measure moral distress in a timely way has been identified as a priority for nursing. This study used a cross-sectional survey design. Data collection for this research occurred in 2009. Participants simultaneously completed either the adult or pediatric version of the Moral Distress Scale version 2009 and the Moral Distress Thermometer. A total of 529 participants from various clinical areas completed both tools. Coefficients alpha were adequate for both Adult (0·90) and Pediatric (0·92) Moral Distress Scale 2009 scales. Statistically significant Pearson correlations were found for the Moral Distress Thermometer with Adult Moral Distress Scale 2009 and Pediatric Moral Distress Scale 2009 and higher Moral Distress Thermometer, Adult Moral Distress Scale 2009 and Pediatric Moral Distress Scale 2009 means for participants who had left or who considered leaving a position because of moral distress. These findings provide support for the validity of the Moral Distress Thermometer. © 2012 Blackwell Publishing Ltd.

  11. Psychometric testing of the modified Care Dependency Scale (Neuro-CDS).

    Science.gov (United States)

    Piredda, Michela; Biagioli, Valentina; Gambale, Giulia; Porcelli, Elisa; Barbaranelli, Claudio; Palese, Alvisa; De Marinis, Maria Grazia

    2016-01-01

    Effective measures of nursing care dependency in neurorehabilitation are warranted to plan nursing interventions to help patients avoid increasing dependency. The Care Dependency Scale (CDS) is a theory-based, comprehensive tool to evaluate functional disability. This study aimed to modify the CDS for neurological and neurorehabilitation patients (Neuro-CDS) and to test its psychometric properties in adult neurorehabilitation inpatients. Exploratory factor analysis (EFA) was performed using a Maximum Likelihood robust (MLR) estimator. The Barthel Index (BI) was used to evaluate concurrent validity. Stability was measured using the Intra-class Correlation Coefficient (ICC). The sample included 124 patients (mean age = 69.7 years, 54% male). The EFA revealed a two-factor structure with good fit indexes, Factor 1 (Physical care dependence) loaded by 11 items and Factor 2 (Psycho-social care dependence) loaded by 4 items. The correlation between factors was 0.61. Correlations between Factor 1 and the BI and between Factor 2 and the BI were r = 0.843 and r = 0.677, respectively (p dependence in neurorehabilitation patients as a basis for individualized and holistic care.

  12. Psychometric Testing of the Self-Efficacy for Interdisciplinary Plans of Care Scale.

    Science.gov (United States)

    Molle, Elizabeth; Froman, Robin

    2017-01-01

    Computerized interdisciplinary plans of care have revitalized nurse-centric care plans into dynamic and meaningful electronic documents. To maximize the benefits of these documents, it is important to understand healthcare professionals' attitudes, specifically their confidence, for making computerized interdisciplinary care plans useful and meaningful documents. The purpose of the study was to test the psychometric properties of the Self-Efficacy for Interdisciplinary Plans of Care instrument intended to measure healthcare professionals' self-efficacy for using such documents. Content validity was assessed by an expert review panel. Content validity indices ranged from 0.75 to 1.00, with a scale CVI of 0.94. A sample of 389 healthcare providers completed the 14-item instrument. Principal axis factoring was used to assess factor structure. The exploratory factor analysis yielded a single-factor structure accounting for 71.76% of covariance. Cronbach internal consistency coefficient for the single factor solution was .97. The corrected item-total correlations ranged from 0.71 to 0.90. The coefficient of stability, during a 2-week period, with a subset of the sample (n = 38), was estimated at 0.82. The results of this study suggest that the Self-Efficacy for Interdisciplinary Plans of Care has sturdy reliability and validity for measuring the self-efficacy of healthcare providers to make computerized interdisciplinary plans of care meaningful and useful documents.

  13. Your move: The effect of chess on mathematics test scores.

    Science.gov (United States)

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1-3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  14. Your move: The effect of chess on mathematics test scores

    DEFF Research Database (Denmark)

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla Trille

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1–3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We...... use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who...... are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students....

  15. Your move: The effect of chess on mathematics test scores.

    Directory of Open Access Journals (Sweden)

    Michael Rosholm

    Full Text Available We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1-3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  16. Motivational Interviewing Skills in Health Care Encounters (MISHCE): Development and psychometric testing of an assessment tool.

    Science.gov (United States)

    Petrova, Tatjana; Kavookjian, Jan; Madson, Michael B; Dagley, John; Shannon, David; McDonough, Sharon K

    2015-01-01

    Motivational interviewing (MI) has demonstrated a significant impact as an intervention strategy for addiction management, change in lifestyle behaviors, and adherence to prescribed medication and other treatments. Key elements to studying MI include training in MI of professionals who will use it, assessment of skills acquisition in trainees, and the use of a validated skills assessment tool. The purpose of this research project was to develop a psychometrically valid and reliable tool that has been designed to assess MI skills competence in health care provider trainees. The goal was to develop an assessment tool that would evaluate the acquisition and use of specific MI skills and principles, as well as the quality of the patient-provider therapeutic alliance in brief health care encounters. To address this purpose, specific steps were followed, beginning with a literature review. This review contributed to the development of relevant conceptual and operational definitions, selecting a scaling technique and response format, and methods for analyzing validity and reliability. Internal consistency reliability was established on 88 video recorded interactions. The inter-rater and test-retest reliability were established using randomly selected 18 from the 88 interactions. The assessment tool Motivational Interviewing Skills for Health Care Encounters (MISHCE) and a manual for use of the tool were developed. Validity and reliability of MISHCE were examined. Face and content validity were supported with well-defined conceptual and operational definitions and feedback from an expert panel. Reliability was established through internal consistency, inter-rater reliability, and test-retest reliability. The overall internal consistency reliability (Cronbach's alpha) for all fifteen items was 0.75. MISHCE demonstrated good inter-rater reliability and good to excellent test-retest reliability. MISHCE assesses the health provider's level of knowledge and skills in brief

  17. Development and psychometric testing of a new instrument to measure the caring behaviour of nurses in Italian acute care settings.

    Science.gov (United States)

    Piredda, Michela; Ghezzi, Valerio; Fenizia, Elisa; Marchetti, Anna; Petitti, Tommasangelo; De Marinis, Maria Grazia; Sili, Alessandro

    2017-12-01

    To develop and psychometrically test the Italian-language Nurse Caring Behaviours Scale, a short measure of nurse caring behaviour as perceived by inpatients. Patient perceptions of nurses' caring behaviours are a predictor of care quality. Caring behaviours are culture-specific, but no measure of patient perceptions has previously been developed in Italy. Moreover, existing tools show unclear psychometric properties, are burdensome for respondents, or are not widely applicable. Instrument development and psychometric testing. Item generation included identifying and adapting items from existing measures of caring behaviours as perceived by patients. A pool of 28 items was evaluated for face validity. Content validity indexes were calculated for the resulting 15-item scale; acceptability and clarity were pilot tested with 50 patients. To assess construct validity, a sample of 2,001 consecutive adult patients admitted to a hospital in 2014 completed the scale and was split into two groups. Reliability was evaluated using nonlinear structural equation modelling coefficients. Measurement invariance was tested across subsamples. Item 15 loaded poorly in the exploratory factor analysis (n = 983) and was excluded from the final solution, positing a single latent variable with 14 indicators. This model fitted the data moderately. The confirmatory factor analysis (n = 1018) returned similar results. Internal consistency was excellent in both subsamples. Full scalar invariance was reached, and no significant latent mean differences were detected across subsamples. The new instrument shows reasonable psychometric properties and is a promising short and widely applicable measure of inpatient perceptions of nurse caring behaviours. © 2017 John Wiley & Sons Ltd.

  18. Score Gains on g-loaded Tests: No g

    NARCIS (Netherlands)

    te Nijenhuis, J.; van Vianen, A.E.M.; van der Flier, H.

    2007-01-01

    IQ scores provide the best general predictor of success in education, job training, and work. However, there are many ways in which IQ scores can be increased, for instance by means of retesting or participation in learning potential training programs. What is the nature of these score gains? Jensen

  19. Simple shoulder test and Oxford Shoulder Score: Persian translation and cross-cultural validation.

    Science.gov (United States)

    Naghdi, Soofia; Nakhostin Ansari, Noureddin; Rustaie, Nilufar; Akbari, Mohammad; Ebadi, Safoora; Senobari, Maryam; Hasson, Scott

    2015-12-01

    To translate, culturally adapt, and validate the simple shoulder test (SST) and Oxford Shoulder Score (OSS) into Persian language using a cross-sectional and prospective cohort design. A standard forward and backward translation was followed to culturally adapt the SST and the OSS into Persian language. Psychometric properties of floor and ceiling effects, construct convergent validity, discriminant validity, internal consistency reliability, test-retest reliability, standard error of the measurement (SEM), smallest detectable change (SDC), and factor structure were determined. One hundred patients with shoulder disorders and 50 healthy subjects participated in the study. The PSST and the POSS showed no missing responses. No floor or ceiling effects were observed. Both the PSST and POSS detected differences between patients and healthy subjects supporting their discriminant validity. Construct convergent validity was confirmed by a very good correlation between the PSST and POSS (r = 0.68). There was high internal consistency for both the PSST (α = 0.73) and the POSS (α = 0.91 and 0.92). Test-retest reliability with 1-week interval was excellent (ICCagreement = 0.94 for PSST and 0.90 for POSS). Factor analyses demonstrated a three-factor solution for the PSST (49.7 % of variance) and a two-factor solution for the POSS (61.6 % of variance). The SEM/SDC was satisfactory for PSST (5.5/15.3) and POSS (6.8/18.8). The PSST and POSS are valid and reliable outcome measures for assessing functional limitations in Persian-speaking patients with shoulder disorders.

  20. Psychometric properties and convergent and predictive validity of an executive function test battery for two-year-olds

    Directory of Open Access Journals (Sweden)

    Hanna eMulder

    2014-07-01

    Full Text Available Executive function (EF is an important predictor of numerous developmental outcomes, such as academic achievement and behavioral adjustment. Although a plethora of measurement instruments exists to assess executive function in children, only few of these are suitable for toddlers, and even fewer have undergone psychometric evaluation. The present study evaluates the psychometric properties and validity of an assessment battery for measuring EF in two-year-olds. A sample of 2437 children were administered the assessment battery at a mean age of 2;4 years (SD = 0;3 years in a large-scale field study. Measures of both hot EF (snack and gift delay tasks and cool EF (six boxes, memory for location, and visual search task were included. Confirmatory Factor Analyses showed that a two-factor hot and cool EF model fitted the data better than a one-factor model. Measurement invariance was supported across groups differing in age, gender, socioeconomic status (SES, home language, and test setting. Criterion and convergent validity were evaluated by examining relationships between EF and age, gender, SES, home language, and parent and teacher reports of children’s attention and inhibitory control. Predictive validity of the test battery was investigated by regressing children’s pre-academic skills and behavioral problems at age three on the latent hot and cool EF factors at age two years. The test battery showed satisfactory psychometric quality and criterion, convergent, and predictive validity. Whereas cool EF predicted both pre-academic skills and behavior problems one year later, hot EF predicted behavior problems only. These results show that EF can be assessed with psychometrically sound instruments in children as young as two years, and that EF tasks can be reliably applied in large scale field research. The current instruments offer new opportunities for investigating EF in early childhood, and for evaluating interventions targeted at improving

  1. Validity of GRE General Test Scores and TOEFL Scores for Graduate Admission to a Technical University in Western Europe

    Science.gov (United States)

    Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

    2018-01-01

    Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the…

  2. Revision and psychometric testing of the Incivility in Nursing Education (INE) survey: introducing the INE-R.

    Science.gov (United States)

    Clark, Cynthia M; Barbosa-Leiker, Celestina; Gill, Larecia Money; Nguyen, Danh

    2015-06-01

    Academic incivility is a serious challenge for nursing education, which needs to be empirically measured and fully addressed. A convenience sample of nursing faculty and students from 20 schools of nursing in the United States participated in a mixed-methods study to test the psychometric properties of the Incivility in Nursing Education-Revised (INE-R) Survey. A factor analysis and other reliability analyses support the use of the INE-R as a valid and reliable measurement of student and faculty perceptions of incivility in nursing education. The INE-R is a psychometrically sound instrument to measure faculty and student perceptions of incivility; to examine differences regarding levels of nursing education, program type, gender, age, and ethnicity; to compare perceptions of incivility between and among adjunct, clinical, teaching, and research faculty; and to conduct pre- and postassessments of the perceived levels of faculty and student incivility in nursing programs to inform evidence-based interventions. Copyright 2015, SLACK Incorporated.

  3. Psychometric properties of a test in evidence based practice: the Spanish version of the Fresno test

    Directory of Open Access Journals (Sweden)

    Jiménez-Villa Josep

    2010-06-01

    Full Text Available Abstract Background Validated instruments are needed to evaluate the programmatic impact of Evidence Based Practice (EBP training and to document the competence of individual trainees. This study aimed to translate the Fresno test into Spanish and subsequently validate it, in order to ensure the equivalence of the Spanish version against the original English version. Methods Before and after study performed between October 2007 and June 2008. Three groups of participants: (a Mentors of family medicine residents (expert group (n = 56; (b Family medicine physicians (intermediate experience group (n = 17; (c Family medicine residents (novice group (n = 202; Medical residents attended an EBP course, and two sets of the test were administered before and after the course. The Fresno test is a performance based measure for use in medical education that assesses EBP skills. The outcome measures were: inter-rater and intra-rater reliability, internal consistency, item analyses, construct validity, feasibility of administration, and responsiveness. Results Inter-rater correlations were 0.95 and 0.85 in the pre-test and the post-test respectively. The overall intra-rater reliability was 0.71 and 0.81 in the pre-test and post-test questionnaire, respectively. Cronbach's alpha was 0.88 and 0.77, respectively. 152 residents (75.2% returned both sets of the questionnaire. The observed effect size for the residents was 1.77 (CI 95%: 1.57-1.95, the standardised response mean was 1.65 (CI 95%:1.47-1.82. Conclusions The Spanish version of the Fresno test is a useful tool in assessing the knowledge and skills of EBP in Spanish-speaking residents of Family Medicine.

  4. GamTest: Psychometric Evaluation and the Role of Emotions in an Online Self-Test for Gambling Behavior.

    Science.gov (United States)

    Jonsson, Jakob; Munck, Ingrid; Volberg, Rachel; Carlbring, Per

    2017-06-01

    Recent increases in the number of online gambling sites have made gambling more available, which may contribute to an increase in gambling problems. At the same time, online gambling provides opportunities to introduce measures intended to prevent problem gambling. GamTest is an online test of gambling behavior that provides information that can be used to give players individualized feedback and recommendations for action. The aim of this study is to explore the dimensionality of GamTest and validate it against the Problem Gambling Severity Index (PGSI) and the gambler's own perceived problems. A recent psychometric approach, exploratory structural equation modeling (ESEM) is used. Well-defined constructs are identified in a two-step procedure fitting a traditional exploratory factor analysis model as well as a so-called bifactor model. Using data collected at four Nordic gambling sites in the autumn of 2009 (n = 10,402), the GamTest ESEM analyses indicate high correspondence with the players' own understanding of their problems and with the PGSI, a validated measure of problem gambling. We conclude that GamTest captures five dimensions of problematic gambling (i.e., overconsumption of money and time, and monetary, social and emotional negative consequences) with high reliability, and that the bifactor approach, composed of a general factor and specific residual factors, reproduces all these factors except one, the negative consequences emotional factor, which contributes to the dominant part of the general factor. The results underscore the importance of tailoring feedback and support to online gamblers with a particular focus on how to handle emotions in relation to their gambling behavior.

  5. The Effect of Mock Tests on Iranian EFL learners’ Test Scores

    OpenAIRE

    Hossein Khodabakhshzadeh; Reza Zardkanloo

    2016-01-01

    The effect of using tests in test preparation courses has been subject to debate. While some scholars such as Yang and Badger (2015) believe it is a cause of positive washback effect, others argue that this issue is tentative and context-bound (Green, 2007). Therefore, this study investigated the effect of using Mock tests in International English Language Testing System (IELTS) preparation courses on students’ overall IELTS scores. Fifty one IELTS students were selected non-randomly through ...

  6. Adapting tests of sign language assessment for other sign languages--a review of linguistic, cultural, and psychometric problems.

    Science.gov (United States)

    Haug, Tobias; Mann, Wolfgang

    2008-01-01

    Given the current lack of appropriate assessment tools for measuring deaf children's sign language skills, many test developers have used existing tests of other sign languages as templates to measure the sign language used by deaf people in their country. This article discusses factors that may influence the adaptation of assessment tests from one natural sign language to another. Two tests which have been adapted for several other sign languages are focused upon: the Test for American Sign Language and the British Sign Language Receptive Skills Test. A brief description is given of each test as well as insights from ongoing adaptations of these tests for other sign languages. The problems reported in these adaptations were found to be grounded in linguistic and cultural differences, which need to be considered for future test adaptations. Other reported shortcomings of test adaptation are related to the question of how well psychometric measures transfer from one instrument to another.

  7. Test/score/report: Simulation techniques for automating the test process

    Science.gov (United States)

    Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

    1994-01-01

    A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary

  8. Psychometric test of the Team Climate Inventory-short version investigated in Dutch quality improvement teams

    NARCIS (Netherlands)

    M.M.H. Strating (Mathilde); A.P. Nieboer (Anna)

    2009-01-01

    textabstractAbstract BACKGROUND: Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate

  9. Psychometric Properties of the Malay Language Version of Knee Injury and Osteoarthritis Outcome Score (KOOS Questionnaire among Knee Osteoarthritis Patients: A Confirmatory Factor Analysis

    Directory of Open Access Journals (Sweden)

    Zulkifli MM

    2017-07-01

    Full Text Available INTRODUCTION: This study aimed to cross-culturally adapt a Malay version of Knee Injury and Osteoarthritis Outcome Score (KOOS and to evaluate its psychometric properties in patients with knee osteoarthritis (OA. MATERIALS AND METHODS: The English version KOOS was translated into a Malay version using forward and backward translation process, followed by face validity and content validity. Two hundred and twenty-six knee OA patients attending the Outpatient and Orthopaedic Clinics, Universiti Sains Malaysia Hospital, completed the Malay version KOOS. Construct validity using confirmatory factor analysis and internal reliability assessment were performed. RESULTS: The results showed that the original five-factor model with 42 items failed to achieve acceptable values of the goodness of fit indices, indicating poor model fit. A new five-factor model of 26 items demonstrated acceptable level of goodness of fit (comparative fit index= 0.929, incremental fit index= 0.930, Tucker Lewis fit index= 0.920, root mean square error of approximation= 0.073 and Chisquared/ degree of freedom= 2.183 indices to signify a model fit. The Cronbach’s alpha value for the new model ranged from 0.776 to 0.946. The composite reliability values of each construct ranged between 0.819 and 0.921, indicating satisfactory to high level of convergent validity. CONCLUSION: The five-factor model with 26 items in the Malay version of KOOS questionnaire demonstrated a good degree of goodness of fit and was found to be valid, reliable and simple as an assessment tool for symptoms, pain, activity of daily living, sports and recreational activity and quality of life for Malaysian adults suffering from knee osteoarthritis.

  10. The "Reading the Mind in the Eyes" Test: Investigation of Psychometric Properties and Test-Retest Reliability of the Persian Version

    Science.gov (United States)

    Khorashad, Behzad S.; Baron-Cohen, Simon; Roshan, Ghasem M.; Kazemian, Mojtaba; Khazai, Ladan; Aghili, Zahra; Talaei, Ali; Afkhamizadeh, Mozhgan

    2015-01-01

    The psychometric properties of the Persian "Reading the Mind in the Eyes" test were investigated, so were the predictions from the Empathizing-Systemizing theory of psychological sex differences. Adults aged 16-69 years old (N = 545, female = 51.7%) completed the test online. The analysis of items showed them to be generally acceptable.…

  11. Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

    Science.gov (United States)

    Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

    2010-01-01

    Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

  12. Field Psychometric Testing of the Instrument for Assessment of Psychological Predictors of Well-Being and Quality of Life in People with HIV or AIDS.

    Science.gov (United States)

    Remor, Eduardo; Fuster-RuizdeApodaca, Maria José; Ballester-Arnal, Rafael; Gómez-Martínez, Sandra; Fumaz, Carmina R; González-Garcia, Marian; Ubillos-Landa, Silvia; Aguirrezabal-Prado, Arrate; Molero, Fernando; Ruzafa-Martínez, Maria

    2016-06-01

    The Screenphiv, a screening measure for psychological issues related to HIV, was psychometrically tested in a study involving 744 HIV-infected people in Spain. Participants ages 18-82 (M = 43.04, 72 % men, 28 % women) completed an assessment protocol that included the Screenphiv and the MOS-HIV. A trained interviewer also collected relevant illness-related clinical data and socio-demographics from the participants. A confirmatory factor analysis was used to evaluate the goodness of fit of the Screenphiv's theoretical model and confirmed six first-order factors and two second-order factors [RMSEA (IC 90 %) = 0.07 (0.07-0.08)]. No floor or ceiling effects were observed for the scores. Cronbach's alphas were acceptable for all of the factors (from 0.65 to 0.92). Criterion-related validity also achieved; Screenphiv scores were related to socio-demographic and clinical variables and MOS-HIV summary scores. The Screenphiv is a reliable and valid measure, ready to use in research and clinical settings in Spain.

  13. The Apgar score has survived the test of time.

    Science.gov (United States)

    Finster, Mieczyslaw; Wood, Margaret

    2005-04-01

    In 1953, Virginia Apgar, M.D. published her proposal for a new method of evaluation of the newborn infant. The avowed purpose of this paper was to establish a simple and clear classification of newborn infants which can be used to compare the results of obstetric practices, types of maternal pain relief and the results of resuscitation. Having considered several objective signs pertaining to the condition of the infant at birth she selected five that could be evaluated and taught to the delivery room personnel without difficulty. These signs were heart rate, respiratory effort, reflex irritability, muscle tone and color. Sixty seconds after the complete birth of the baby a rating of zero, one or two was given to each sign, depending on whether it was absent or present. Virginia Apgar reviewed anesthesia records of 1025 infants born alive at Columbia Presbyterian Medical Center during the period of this report. All had been rated by her method. Infants in poor condition scored 0-2, infants in fair condition scored 3-7, while scores 8-10 were achieved by infants in good condition. The most favorable score 1 min after birth was obtained by infants delivered vaginally with the occiput the presenting part (average 8.4). Newborns delivered by version and breech extraction had the lowest score (average 6.3). Infants delivered by cesarean section were more vigorous (average score 8.0) when spinal was the method of anesthesia versus an average score of 5.0 when general anesthesia was used. Correlating the 60 s score with neonatal mortality, Virginia found that mature infants receiving 0, 1 or 2 scores had a neonatal death rate of 14%; those scoring 3, 4, 5, 6 or 7 had a death rate of 1.1%; and those in the 8-10 score group had a death rate of 0.13%. She concluded that the prognosis of an infant is excellent if he receives one of the upper three scores, and poor if one of the lowest three scores.

  14. Analysing model fit of psychometric process models: An overview, a new test and an application to the diffusion model.

    Science.gov (United States)

    Ranger, Jochen; Kuhn, Jörg-Tobias; Szardenings, Carsten

    2017-05-01

    Cognitive psychometric models embed cognitive process models into a latent trait framework in order to allow for individual differences. Due to their close relationship to the response process the models allow for profound conclusions about the test takers. However, before such a model can be used its fit has to be checked carefully. In this manuscript we give an overview over existing tests of model fit and show their relation to the generalized moment test of Newey (Econometrica, 53, 1985, 1047) and Tauchen (J. Econometrics, 30, 1985, 415). We also present a new test, the Hausman test of misspecification (Hausman, Econometrica, 46, 1978, 1251). The Hausman test consists of a comparison of two estimates of the same item parameters which should be similar if the model holds. The performance of the Hausman test is evaluated in a simulation study. In this study we illustrate its application to two popular models in cognitive psychometrics, the Q-diffusion model and the D-diffusion model (van der Maas, Molenaar, Maris, Kievit, & Boorsboom, Psychol Rev., 118, 2011, 339; Molenaar, Tuerlinckx, & van der Maas, J. Stat. Softw., 66, 2015, 1). We also compare the performance of the test to four alternative tests of model fit, namely the M 2 test (Molenaar et al., J. Stat. Softw., 66, 2015, 1), the moment test (Ranger et al., Br. J. Math. Stat. Psychol., 2016) and the test for binned time (Ranger & Kuhn, Psychol. Test. Asess. , 56, 2014b, 370). The simulation study indicates that the Hausman test is superior to the latter tests. The test closely adheres to the nominal Type I error rate and has higher power in most simulation conditions. © 2017 The British Psychological Society.

  15. The development and psychometric testing of a Disaster Response Self-Efficacy Scale among undergraduate nursing students.

    Science.gov (United States)

    Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling

    2017-12-01

    Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, pself-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.

  16. Psychometric properties of the Neck OutcOme Score, Neck Disability Index, and Short Form-36 were evaluated in patients with neck pain

    DEFF Research Database (Denmark)

    Juul, Tina; Søgaard, Karen; Davis, Aileen M.

    2016-01-01

    Objective:To assess reliability, construct validity, responsiveness, and interpretability for Neck OutcOme Score (NOOS), Neck Disability Index (NDI), and Short Form–36 (SF-36) in neck pain patients. Study Design and Setting: Internal consistency was assessed by Cronbach alpha. Test-retest reliabi...

  17. ISSUE PAPER: What Do Test Scores in Texas Tell Us?

    National Research Council Canada - National Science Library

    Klein, Stephen

    2000-01-01

    ...) about possible unintended consequences of these programs. We conducted several analyses to examine the issue of whether TAAS scores can be trusted to provide an accurate index of student skills and abilities...

  18. Bayesian psychometric scaling

    NARCIS (Netherlands)

    Fox, Gerardus J.A.; van den Berg, Stéphanie Martine; Veldkamp, Bernard P.; Irwing, P.; Booth, T.; Hughes, D.

    2015-01-01

    In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item

  19. Future of Psychometrics: Ask What Psychometrics Can Do for Psychology

    Science.gov (United States)

    Sijtsma, Klaas

    2012-01-01

    I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…

  20. Development and psychometric testing of a breast cancer patient-profiling questionnaire

    Directory of Open Access Journals (Sweden)

    Gorini A

    2015-06-01

    Full Text Available Alessandra Gorini,1,2 Ketti Mazzocco,1,2 Sara Gandini,2 Elisabetta Munzone,2 Gordon McVie,2 Gabriella Pravettoni1,2 1Department of Health Science, University of Milan, Milan, Italy; 2European Institute of Oncology, Milan, Italy Introduction: The advent of “personalized medicine” has been driven by technological advances in genomics. Concentration at the subcellular level of a patient's cancer cells has meant inevitably that the “person” has been overlooked. For this reason, we think there is an urgent need to develop a truly personalized approach focusing on each patient as an individual, assessing his/her unique mental dimensions and tailoring interventions to his/her individual needs and preferences. The aim of this study was to develop and test the psychometric properties of the ALGA-Breast Cancer (ALGA-BC, a new multidimensional questionnaire that assesses the breast cancer patient's physical and mental characteristics in order to provide physicians, prior to the consultation, with a patient's profile that is supposed to facilitate subsequent communication, interaction, and information delivery between the doctor and the patient. Methods: The specific validation processes used were: content and face validity, construct validity using factor analysis, reliability and internal consistency using test–retest reliability, and Cronbach's alpha correlation coefficient. The exploratory analysis included 100 primary breast cancer patients and 730 healthy subjects. Results: The exploratory factor analysis revealed eight key factors: global self-rated health, perceived physical health, anxiety, self-efficacy, cognitive closure, memory, body image, and sexual life. Test–retest reliability and internal consistency were good. Comparing patients with a sample of healthy subjects, we also observed a general ability of the ALGA-BC questionnaire to discriminate between the two. Conclusion: The ALGA-BC questionnaire with 29 items is a valid

  1. Psychometric Testing of the Persian Version of the Perceived Perioperative Competence Scale-Revised.

    Science.gov (United States)

    Ajorpaz, Neda Mirbagher; Tafreshi, Mansoureh Zagheri; Mohtashami, Jamileh; Zayeri, Farid; Rahemi, Zahra

    2017-12-01

    The clinical competence of nursing students in operating room (OR) is an important issue in nursing education. The purpose of this study was to evaluate the psychometric properties of the Persian Perceived Perioperative Competence Scale-Revised (PPCS-R) instrument. This cross-sectional study was conducted across 12 universities in Iran. The psychometric properties and factor structure of the PPCS-R for OR students was examined. Based on the results of factor analysis, seven items were removed from the original version of the scale. The fitness indices of the Persian scale include comparative fit index (CFI) = .90, goodness-of-fit-index (GFI) = .86, adjusted goodness-of-fit index (AGFI) = .90, normed fit index (NFI) = .84, and root mean square error of approximation (RMSEA) = .04. High validity and reliability indicated the scale's value for measuring perceived perioperative competence of Iranian OR students.

  2. Effects of white noise on Callsign Acquisition Test and Modified Rhyme Test scores.

    Science.gov (United States)

    Blue-Terry, Misty; Letowski, Tomasz

    2011-02-01

    The Callsign Acquisition Test (CAT) is a speech intelligibility test developed by the US Army Research Laboratory. The test has been used to evaluate speech transmission through various communication systems but has not been yet sufficiently standardised and validated. The aim of this study was to compare CAT and Modified Rhyme Test (MRT) performance in the presence of white noise across a range of signal-to-noise ratios (SNRs). A group of 16 normal-hearing listeners participated in the study. The speech items were presented at 65 dB(A) in the background of white noise at SNRs of -18, -15, -12, -9 and -6 dB. The results showed a strong positive association (75.14%) between the two tests, but significant differences between the CAT and MRT absolute scores in the range of investigated SNRs. Based on the data, a function to predict CAT scores based on existing MRT scores and vice versa was formulated. STATEMENT OF RELEVANCE: This work compares performance data of a common speech intelligibility test (MRT) with a new test (CAT) in the presence of white noise. The results here can be used as a part of the standardisation procedures and provide insights to the predictive capabilities of the CAT to quantify speech intelligibility communication in high-noise military environments.

  3. Psychometric test of the Team Climate Inventory-short version investigated in Dutch quality improvement teams

    OpenAIRE

    Nieboer Anna P; Strating Mathilde MH

    2009-01-01

    Abstract Background Although some studies have used the Team Climate Inventory within teams working in health care settings, none of these included quality improvement teams. The aim of our study is to investigate the psychometric properties of the 14-item version of the Team Climate Inventory in healthcare quality improvement teams participating in a Dutch quality collaborative. Methods This study included quality improvement teams participating in the Care for Better improvement program for...

  4. The Team Climate Inventory (TCI): A psychometric test on a Taiwanese sample of work groups

    OpenAIRE

    Tseng, Hsu-Min; Liu, F-C; West, Michael

    2009-01-01

    Understanding the feasibility of applying the Team Climate Inventory (TCI) in non-Western cultures is essential for researchers attempting to understand the influence of culture on workers' perceived climate. This study describes the application of the TCI in such a setting using data from 203 administrators employed in a Taiwanese medical center. Reliability and factor analyses were performed to establish the feasibility and psychometric properties of the TCI Taiwan version. Reliabilities of...

  5. The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

    Science.gov (United States)

    Molenaar, Dylan; Borsboom, Denny

    2013-01-01

    Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…

  6. Translation and Psychometric Testing of the Persian Version of the Spiritual Needs Questionnaire Among Elders With Chronic Diseases.

    Science.gov (United States)

    Moeini, Babak; Zamanian, Hadi; Taheri-Kharameh, Zahra; Ramezani, Tahereh; Saati-Asr, Mohamadhasan; Hajrahimian, Mohamadhasan; Amini-Tehrani, Mohammadali

    2018-01-01

    Spirituality plays an important role in coping with chronic diseases for patients and they often report unmet spiritual and existential needs, which should be considered for a holistic view of their health. Studying spiritual needs in this generation requires culturally appropriate and valid instruments. The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The aim of this study was to determine the psychometric properties, such as validity, reliability, and factor structure of the Persian version of Spiritual Needs Questionnaire (SpNQ). The "forward-backward" procedure was applied to translate the SpNQ from English into Persian. The SpNQ-Persian Version (SpNQ-PV) was checked in terms of validity and reliability with a convenience sample of 100 elders with chronic diseases who were recruited from the inpatient wards at two university hospitals in Qom, Iran. The validity was assessed using content, face, and construct validity. The Cronbach alpha and test-retest were used to assess the reliability of the questionnaire. The results of the exploratory factor analysis indicated a five-factor solution for the questionnaire, which included religious needs, existential needs, forgiveness/generativity needs, need for inner peace, and emotional needs. These accounted for 60.1% of the total observed variance. One item was removed (factor loading Spiritual Well-being Scale. Cronbach alpha of the subscales ranged from 0.56 to 0.78 and the test-retest reliability ranged from 0.72 to 0.91, which indicated an acceptable range of reliability. The SpNQ-PV showed a minor difference in structuring and indicated good psychometric properties, which can be used to assess the spiritual needs of Iranian elders suffering from chronic diseases. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.

  7. The Effect of Mock Tests on Iranian EFL learners’ Test Scores

    Directory of Open Access Journals (Sweden)

    Hossein Khodabakhshzadeh

    2016-07-01

    Full Text Available The effect of using tests in test preparation courses has been subject to debate. While some scholars such as Yang and Badger (2015 believe it is a cause of positive washback effect, others argue that this issue is tentative and context-bound (Green, 2007. Therefore, this study investigated the effect of using Mock tests in International English Language Testing System (IELTS preparation courses on students’ overall IELTS scores. Fifty one IELTS students were selected non-randomly through the quota sampling approach out of 76 students at Mahan Language Institute in Birjand, Iran.  These participants were distributed into Group 1 (n=25 and Group 2 (n=26. A complete IELTS test was administered to ensure that the Groups were homogeneous and to serve as pretest. After 10 sessions of intervention, a different IELTS test was administered as posttest. The results of between subject analysis through independent samples t-test revealed that using Mock tests in the IELTS preparation courses can positively affect the participants scores on IELTS exam. Pedagogical implications are discussed.

  8. Psychometric Quality of the Dutch Version of the Children's Eating Attitude Test in a Community Sample and a Sample of Overweight Youngsters

    Directory of Open Access Journals (Sweden)

    Lotte Theuwis

    2010-12-01

    Full Text Available Introduction. Disturbed eating attitudes may be important precursors of pathological eating patterns and, therefore need to be researched adequately. The Children's Eating Attitude Test (ChEAT is indicated for detecting at-risk attitudes and concerns in youngsters. Method. The present study was designed to provide a preliminary psychometric evaluation of the Dutch version of the ChEAT, by examining reliability and validity in a sample of 166 youngsters. Results. Generally the ChEAT seems to be a reliable instrument. Concurrent validity was demonstrated by positive correlations with measures assessing pathological eating behaviour and with related psychological problems. The discriminant validity was good. Based on ChEAT scores we can distinguish overweight youngsters from the community sample and “dieters” from “non dieters”. Divergent validity and factor structure show still shortcomings. Discussion. The Dutch version of the ChEAT seems to be a promising screening- and research instrument. Future prospective research could focus on a cut-off score for identifying at-risk youngsters.

  9. Psychometric Properties of the Physical Activity Questionnaire for Older Children in Italy: Testing the Validity among a General and Clinical Pediatric Population.

    Directory of Open Access Journals (Sweden)

    Erica Gobbi

    Full Text Available The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It. Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170 examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA and construct validity with enjoyment perception during physical activity. Study 2 (n = 59 reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry over the span of seven consecutive days. Study 3 (n = 58 examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD. In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83. Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36, with BMI (r = -.30 and -.79 for CHD simple form, and with the VO2max (r = .55 for CHD simple form. Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p < .05. Findings of the EFA suggested a two-factor structure for the PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.

  10. Psychometric Properties of the Physical Activity Questionnaire for Older Children in Italy: Testing the Validity among a General and Clinical Pediatric Population.

    Science.gov (United States)

    Gobbi, Erica; Elliot, Catherine; Varnier, Maurizio; Carraro, Attilio

    2016-01-01

    The purpose of this research was to assess an Italian version of the Physical Activity Questionnaire for Older Children (PAQ-C-It). Three separate studies were conducted, whereby testing general psychometric properties, construct validity, concurrent validity and the factor structure of the PAQ-C-It among general and clinical pediatric population. Study 1 (n = 1170) examined the psychometric properties, internal consistency, factor structure (exploratory factor analysis, EFA) and construct validity with enjoyment perception during physical activity. Study 2 (n = 59) reported on reliability, construct validity with enjoyment and BMI, and on cross-sectional concurrent validity with objectively measured MVPA (tri-axial accelerometry) over the span of seven consecutive days. Study 3 (n = 58) examined the PAQ-C-It reliability, construct validity with BMI and VO2max as the objective measurement among a population of children with congenital heart defects (CHD). In study 2 and 3, the factor structure of the PAQ-C-It was then re-examined with an EFA. The PAQ-C-It showed acceptable to good reliability (alpha .70 to .83). Results on construct validity showed moderate but significant association with enjoyment perception (r = .30 and .36), with BMI (r = -.30 and -.79 for CHD simple form), and with the VO2max (r = .55 for CHD simple form). Significant concurrent validity with the objectively measured MVPA was reported (rho = .30, p PAQ-C-It, with items 2, 3, and 4 contributing little to the total score. This study supports the PAQ-C-It as an appropriate instrument to assess the MVPA levels of Italian children, including children with simple forms of CHD. Support is given to the possible instrument effectiveness on a large international perspective in order to level out data gathering across the globe.

  11. Psychometric testing of the Agitation Severity Scale for acute presentation behavioral management patients in the emergency department.

    Science.gov (United States)

    Strout, Tania D

    2014-01-01

    Agitation is a vexing problem frequently observed in emergency department acute psychiatric patients, yet no instruments to measure agitation in this setting and population were found upon review of the literature. Previously developed agitation rating scales are limited by the length of observation they require, their need for participation by the patient, complexity in scoring, and a lack of validity in this setting and population. The purpose of this study was to psychometrically evaluate and refine an observation-based agitation scale for use with emergency department acute psychiatric patients. Using a methodological design, the 21-item Agitation Severity Scale was utilized to assess 270 adult psychiatric patients in the emergency setting in a prospective, observational fashion. Reliability analysis, item analysis, exploratory factor analysis, and validity assessments were completed. The relationship between Agitation Severity Scale scores and scores on the previously established Overt Agitation Severity Scale was evaluated. The instrument was reduced to 17 items representing four factors (Aggressive Behaviors, Interpersonal Behaviors, Involuntary Motor Behaviors, and Physical Stance) that accounted for nearly 70% of observed variance, Cronbach's α = 0.91. Evidence of internal consistency reliability, equivalence reliability, construct validity, and convergent validity was established. Through this study, the 17-item Agitation Severity Scale demonstrated acceptable levels of reliability and validity when used with acute psychiatric patients in the emergency setting. This instrument holds promise as a method of enhancing clinical communication about agitation, evaluating the efficacy of interventions aimed at decreasing agitation, and as a research tool.

  12. Measuring use of positive thinking skills: psychometric testing of a new scale.

    Science.gov (United States)

    Bekhet, Abir K; Zauszniewski, Jaclene A

    2013-09-01

    Positive thinking interventions improve adaptive functioning and quality of life in many populations. However, no direct measure of positive thinking skills taught during intervention exists. This psychometric study of a convenience sample of 109 autism spectrum disorder (ASD) caregivers examined a new eight-item Positive Thinking Skills Scale (PTSS), which measures the frequency of use of positive thinking skills. The PTSS was found to be internally consistent (α = .90). Construct validity was supported by significant correlations (p thinking skills could provide direction for future intervention.

  13. Validating internet research: a test of the psychometric equivalence of internet and in-person samples.

    Science.gov (United States)

    Meyerson, Paul; Tryon, Warren W

    2003-11-01

    This study evaluated the psychometric equivalency of Web-based research. The Sexual Boredom Scale was presented via the World-Wide Web along with five additional scales used to validate it. A subset of 533 participants that matched a previously published sample (Watt & Ewing, 1996) on age, gender, and race was identified. An 8 x 8 correlation matrix from the matched Internet sample was compared via structural equation modeling with a similar 8 x 8 correlation matrix from the previously published study. The Internet and previously published samples were psychometrically equivalent. Coefficient alpha values calculated on the matched Internet sample yielded reliability coefficients almost identical to those for the previously published sample. Factors such as computer administration and uncontrollable administration settings did not appear to affect the results. Demographic data indicated an overrepresentation of males by about 6% and Caucasians by about 13% relative to the U.S. Census (2000). A total of 2,230 participants were obtained in about 8 months without remuneration. These results suggest that data collection on the Web is (1) reliable, (2) valid, (3) reasonably representative, (4) cost effective, and (5) efficient.

  14. Work environment impact scale: testing the psychometric properties of the Swedish version.

    Science.gov (United States)

    Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée

    2014-01-01

    The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.

  15. Psychometrically equivalent bisyllabic words for speech recognition threshold testing in Vietnamese.

    Science.gov (United States)

    Harris, Richard W; McPherson, David L; Hanson, Claire M; Eggett, Dennis L

    2017-08-01

    This study identified, digitally recorded, edited and evaluated 89 bisyllabic Vietnamese words with the goal of identifying homogeneous words that could be used to measure the speech recognition threshold (SRT) in native talkers of Vietnamese. Native male and female talker productions of 89 Vietnamese bisyllabic words were recorded, edited and then presented at intensities ranging from -10 to 20 dBHL. Logistic regression was used to identify the best words for measuring the SRT. Forty-eight words were selected and digitally edited to have 50% intelligibility at a level equal to the mean pure-tone average (PTA) for normally hearing participants (5.2 dBHL). Twenty normally hearing native Vietnamese participants listened to and repeated bisyllabic Vietnamese words at intensities ranging from -10 to 20 dBHL. A total of 48 male and female talker recordings of bisyllabic words with steep psychometric functions (>9.0%/dB) were chosen for the final bisyllabic SRT list. Only words homogeneous with respect to threshold audibility with steep psychometric function slopes were chosen for the final list. Digital recordings of bisyllabic Vietnamese words are now available for use in measuring the SRT for patients whose native language is Vietnamese.

  16. The development and psychometric testing of a theory-based instrument to evaluate nurses' perception of clinical reasoning competence.

    Science.gov (United States)

    Liou, Shwu-Ru; Liu, Hsiu-Chen; Tsai, Hsiu-Min; Tsai, Ying-Huang; Lin, Yu-Ching; Chang, Chia-Hao; Cheng, Ching-Yu

    2016-03-01

    The purpose of the study was to develop and psychometrically test the Nurses Clinical Reasoning Scale. Clinical reasoning is an essential skill for providing safe and quality patient care. Identifying pre-graduates' and nurses' needs and designing training courses to improve their clinical reasoning competence becomes a critical task. However, there is no instrument focusing on clinical reasoning in the nursing profession. Cross-sectional design was used. This study included the development of the scale, a pilot study that preliminary tested the readability and reliability of the developed scale and a main study that implemented and tested the psychometric properties of the developed scale. The Nurses Clinical Reasoning Scale was developed based on the Clinical Reasoning Model. The scale includes 15 items using a Likert five-point scale. Data were collected from 2013-2014. Two hundred and fifty-one participants comprising clinical nurses and nursing pre-graduates completed and returned the questionnaires in the main study. The instrument was tested for internal consistency and test-retest reliability. Its validity was tested with content, construct and known-groups validity. One factor emerged from the factor analysis. The known-groups validity was confirmed. The Cronbach's alpha for the entire instrument was 0·9. The reliability and validity of the Nurses Clinical Reasoning Scale were supported. The scale is a useful tool and can be easily administered for the self-assessment of clinical reasoning competence of clinical nurses and future baccalaureate nursing graduates. Study limitations and further recommendations are discussed. © 2015 John Wiley & Sons Ltd.

  17. Psychometric testing of the Chinese version of the medical outcomes study social support survey (MOS-SSS-C).

    Science.gov (United States)

    Yu, Doris S F; Lee, Diana T F; Woo, Jean

    2004-04-01

    The purpose of this study was to assess the psychometric properties of the Chinese version of the Medical Outcomes Study Social Support Survey (MOS-SSS-C) in a sample of 110 patients. Criterion-related and construct validities of the MOS-SSS-C were evaluated by correlations with the Chinese version of the Multidimensional Perceived Social Support Survey (r =.82) and the Hospital Anxiety and Depression Scale (r = -.58). Confirmatory factor analysis affirmed the four-factor structure of the MOS-SSS-C in measuring the functional aspects of perceived social support. Cronbach's alphas for the subscales ranged from.93 to.96, whereas the alpha for the overall scale was.98. The 2-week test-retest reliability of the MOS-SSS-C as measured by the intraclass correlation coefficient was.84. The MOS-SSS-C is a psychometrically sound multidimensional measure for the evaluation of functional aspects of perceived social support by Chinese patients with chronic disease. Copyright 2004 Wiley Periodicals, Inc.

  18. Examination of Psychometric Properties of a Translated Social-Emotional Screening Test: The Taiwanese Version of The Ages and Stages Questionnaires: Social-Emotional

    Science.gov (United States)

    Chen, Chieh-Yu

    2017-01-01

    Investigating the psychometric properties of a screening instrument for young children is necessary to ascertain its quality and accuracy. In light of the important role culture plays on human beliefs and parenting styles, a newly translated and adapted test needs to be studied. Evaluating outcomes on a translated version of a test may reveal…

  19. Achievement Testing with the Wechsler Quicktest: An Examination of Its Psychometric Properties and Applied Utility with a Greek-Cypriot Sample

    Science.gov (United States)

    Vrachimi-Souroulla, Andry; Panayiotou, Georgia; Kokkinos, Constantinos M.; Lamprianou, Iasonas

    2011-01-01

    The study aimed to field-test a Greek version of the Wechsler Quicktest and to examine its psychometric properties. The Quicktest was individually administered to 208 students, aged 5-14 years, along with a reading test. Based on the Rasch analysis, data for the Quicktest subtests showed acceptable fit to the model. Also, correlations were found…

  20. Development and psychometric testing of the rural pregnancy experience scale (RPES).

    Science.gov (United States)

    Kornelsen, Jude; Stoll, Kathrin; Grzybowski, Stefan

    2011-01-01

    Rural pregnant woman who lack local access to maternity care due to their remote living circumstances may experience stress and anxiety related to pregnancy and parturition. The Rural Pregnancy Experience Scale (RPES) was designed to assess the unique worry and concerns reflective of the stress and anxiety of rural pregnant women related to pregnancy and parturition. The items of the scale were designed based on the results of a qualitative study of the experiences of pregnant rural women, thereby building a priori content validity into the measure. The relevancy content validity index (CVI) for this instrument was 1.0 and the clarity CVI was .91, as rated by maternity care specialists. A field test of the RPES with 187 pregnant rural women from British Columbia indicated that it had two factors: financial worries and worries/concerns about maternity care services, which were consistent with the conceptual base of the tool. Cronbach's alpha for the total RPES was .91; for the financial worries subscale and the worries/concerns about maternity care services subscale, alpha were .89 and .88, respectively. Construct validity was supported by significant correlations between the total scores of the RPES and the Depression Anxiety Stress Scales (DASS [r =.39, p DASS supporting convergent validity (correlations ranged between .20; p < .05 and .43; p < .01). Construct validity was also supported by findings that the level of access and availability of maternity care services were significantly associated with RPES scores. It was concluded that the RPES is a reliable and valid measure of worries and concerns reflective of rural pregnant women's stress and anxiety related to pregnancy and parturition.

  1. Development and psychometric evaluation of an information literacy self-efficacy survey and an information literacy knowledge test.

    Science.gov (United States)

    Tepe, Rodger; Tepe, Chabha

    2015-03-01

    To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.

  2. Reduce, Reuse, Recycle: The Longitudinal Value of Local Cut Scores Using State Test Data

    Science.gov (United States)

    Nelson, Peter M.; Van Norman, Ethan R.; VanDerHeyden, Amanda

    2017-01-01

    We used existing reading (n = 1,498) and math (n = 2,260) data to evaluate state test scores for screening middle school students. In Phase 1, state test data were used to create a research-derived cut score that was optimal for predicting state test performance the following year. In Phase 2, those cut scores were applied with future cohorts.…

  3. The Effect of Black Peers on Black Test Scores

    Science.gov (United States)

    Armor, David J.; Duck, Stephanie

    2007-01-01

    Recent studies have used increasingly complex methodologies to estimate the effect of peer characteristics--race, poverty, and ability--on student achievement. A paper by Hanushek, Kain, and Rivkin using Texas state testing data has received particularly wide attention because it found a large negative effect of school percent black on black math…

  4. Psychometric properties of the Neck OutcOme Score, Neck Disability Index, and Short Form-36 were evaluated in patients with neck pain.

    Science.gov (United States)

    Juul, Tina; Søgaard, Karen; Davis, Aileen M; Roos, Ewa M

    2016-11-01

    To assess reliability, construct validity, responsiveness, and interpretability for Neck OutcOme Score (NOOS), Neck Disability Index (NDI), and Short Form-36 (SF-36) in neck pain patients. Internal consistency was assessed by Cronbach alpha. Test-retest reliability was evaluated by intraclass correlation coefficient (ICC), and measurement error was estimated from the standard error of measurement. Responsiveness was assessed as standardized response mean (SRM) and interpretability from the minimal important difference (MID). Construct validity was tested correlating subscale scores from NOOS and SF-36 and NDI items. At baseline, 196 neck pain patients were included. Cronbach α was adequate for most NOOS subscales, NDI, and SF-36 with few exceptions. Good to excellent reliability was found for NOOS subscales (ICC 0.88-0.95), for NDI, and for SF-36 with few exceptions. For NOOS, minimal detectable changes varied between 1.1 and 1.9, and construct validity was supported. SRMs were higher for NOOS subscales (0.19-0.42), compared to SF-36 and NDI. MID values varied between 15.0 and 24.1 for NOOS subscales. In conclusion, the NOOS is a reliable, valid, and responsive measure of self-reported disability in neck pain patients, performing at least as well or better than the commonly used SF-36 and NDI. Copyright © 2016 Elsevier Inc. All rights reserved.

  5. A Computer-Based Sustained Visual Attention Test for Pre-School Children: Design, Development and Psychometric Properties

    Directory of Open Access Journals (Sweden)

    Roohollah Zahedian Nasb

    2016-06-01

    Full Text Available Background: Sustained visual attention is a prerequisite for learning and memory. The early evaluation of attention in childhood is essential for their school and career success in the future. The aim of this study was to design, development and investigation of psychometric properties (content, face and convergent validity and test-retest and internal consistency reliability of the computer - based sustained visual attention test (SuVAT for healthy preschool children aged 4-6 with their special needs. Methods: This study was carried out in two stages: in the first stage computerbased SuVAT in two versions original and parallel were developed. Then the test-retest and internal consistency reliability using intra-class correlation and Cronbach’s alpha coefficients respectively were examined; Face validity was calculated through ideas gathering from 10 preschool children and content validity evaluated using CVI and CVR method and convergent validity of SuVAT with CPT was assessed using Pearson correlation. Results: The developed test showed a good content and faces validity, and also had excellent test-retest reliability. In addition, the assessment of internal consistency indicated the high internal consistency of the test (Cronbach’s alpha=0.869. SuVAT and CPT test demonstrated a positive correlation upon the convergent validity testing. Conclusion: SuVAT with good reliability and validity could be used as an acceptable sustained attention assessment in preschool children.

  6. Assessment of nutrition and physical activity environments in family child care homes: modification and psychometric testing of the Environment and Policy Assessment and Observation

    Directory of Open Access Journals (Sweden)

    Amber E. Vaughn

    2017-08-01

    development (r = 0.21, and nutrition policy (r = 0.18. Child MVPA was significantly associated with overall time provided for activity (r = 0.18 and outdoor playtime (r = 0.20. There was also an unexpected negative association between child MVPA and screen time (−0.16 and screen time practices (r = −0.21. Conclusions The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. Trial registration NCT01814215 , March 15, 2013.

  7. Assessment of nutrition and physical activity environments in family child care homes: modification and psychometric testing of the Environment and Policy Assessment and Observation.

    Science.gov (United States)

    Vaughn, Amber E; Mazzucca, Stephanie; Burney, Regan; Østbye, Truls; Benjamin Neelon, Sara E; Tovar, Alison; Ward, Dianne S

    2017-08-29

     = 0.18). Child MVPA was significantly associated with overall time provided for activity (r = 0.18) and outdoor playtime (r = 0.20). There was also an unexpected negative association between child MVPA and screen time (-0.16) and screen time practices (r = -0.21). The EPAO for the FCCH instrument is a useful tool for researchers working with this unique type of ECE setting. It has undergone rigorous development and testing and appears to have good psychometric properties. NCT01814215 , March 15, 2013.

  8. The Effects of Listening to Music Just Before Reading Test on Students’ Test Score

    OpenAIRE

    MAHDAVI, Mojtaba

    2015-01-01

    Abstract. In this study the researcher  examined  the  effect  of  music  on  reading  comprehension played just before the test .  Because the emotional consequences of music listening are evident in stress and anxiety removal, it was used as a tool to pacify the mind of the tastes and boost their memory and the related cognitive processes. Experimental group did well with the mean score of) and control group (). This study confirmed that using multimedia devices such as music can not only i...

  9. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    Science.gov (United States)

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  10. The Internet Addiction Test: assessing its psychometric properties in Bangladeshi culture.

    Science.gov (United States)

    Rezaul Karim, A K M; Nigar, Naima

    2014-08-01

    There is growing importance of the Internet Addiction Test (IAT) in Internet addiction research around the world. Since the development of the IAT (Young, 1996, 1998), a number of validation studies have been done in various cultures. The aim of this study was to translate the instrument into Bangla and validate in Bangladeshi culture, a culture vulnerable to Internet addiction. A total of 177 Internet users (77 females and 100 males) participated in the study. Exploratory factor analysis (EFA) of the data from 172 participants (who provided complete responses) identified a four factor structure of the IAT with 18 items. The four factors namely 'Neglect of duty', 'Online dependence', 'Virtual fantasies', and 'Privacy and self-defense' together explained 55.68% of the total variance. Problematic (moderate/excessive) users on the IAT scored, on average, higher on each of the four IAT factors as compared to average or non-problematic (minimal) users consistently across genders. The IAT and its factors showed good internal consistency (Cronbach's α=.89 for the IAT, and .60-.84 for the factors), strong convergent and discriminant validity. Thus, the Bangla version IAT appears to be valid and reliable and therefore may be used in further research on Internet addiction in the country. Copyright © 2013 Elsevier B.V. All rights reserved.

  11. 'Mechanical restraint-confounders, risk, alliance score': testing the clinical validity of a new risk assessment instrument.

    Science.gov (United States)

    Deichmann Nielsen, Lea; Bech, Per; Hounsgaard, Lise; Alkier Gildberg, Frederik

    2017-08-01

    Unstructured risk assessment, as well as confounders (underlying reasons for the patient's risk behaviour and alliance), risk behaviour, and parameters of alliance, have been identified as factors that prolong the duration of mechanical restraint among forensic mental health inpatients. To clinically validate a new, structured short-term risk assessment instrument called the Mechanical Restraint-Confounders, Risk, Alliance Score (MR-CRAS), with the intended purpose of supporting the clinicians' observation and assessment of the patient's readiness to be released from mechanical restraint. The content and layout of MR-CRAS and its user manual were evaluated using face validation by forensic mental health clinicians, content validation by an expert panel, and pilot testing within two, closed forensic mental health inpatient units. The three sub-scales (Confounders, Risk, and a parameter of Alliance) showed excellent content validity. The clinical validations also showed that MR-CRAS was perceived and experienced as a comprehensible, relevant, comprehensive, and useable risk assessment instrument. MR-CRAS contains 18 clinically valid items, and the instrument can be used to support the clinical decision-making regarding the possibility of releasing the patient from mechanical restraint. The present three studies have clinically validated a short MR-CRAS scale that is currently being psychometrically tested in a larger study.

  12. Psychometric evaluation and validation of the Serbian version of “Reading the mind in the eyes” test

    Directory of Open Access Journals (Sweden)

    Đorđević Jelena

    2017-01-01

    Full Text Available “Reading the Mind in the Eyes” test (RMET is one of the most popular and widely used measures of individual differences in Theory of Mind (ToM capabilities. Despite demonstrating good validity in differentiating various clinical groups exhibiting ToM deficits from unimpaired controls, previous studies raised the question of the RMET’s homogeneity, latent structure, and reliability. The aim of this study is to provide evidence on psychometric properties, latent structure, and validity of the newly adapted Serbian version of the RMET. In total, 260 participants (61.9% females took part in the study. The sample consisted of both unimpaired controls (76.5%, and a clinical group of participants that are believed to demonstrate ToM deficits (23.5%, namely, persons diagnosed with schizophrenia and bipolar disorder (54.1% females. RMET has demonstrated fair psychometric properties (KMO = .723; α = .747; H1 = .076; H5 = .465, successfully differentiating between clinical group and control [F (1,254 = 26.175, p <.001, η2 p = .093], while typical gender differences in performance were found only in control group. Tests of several models based on the previous literature revealed that the affect-specific factors underlying performance on RMET demonstrate poor fit. The best fitting model obtained included reduced scale with a single-factor underlying the test’s performance (TLI = .953, CFI = .958, RMSEA = .020. Based on the fit parameters we propose 18-item short-form of the Serbian version of RMET (KMO = .797; α = .728; H1 = .129; H5 = .677 for economic, reliable and valid measurement of ToM abilities.

  13. International field testing of the psychometric properties of an EORTC quality of life module for oral health: the EORTC QLQ-OH15.

    Science.gov (United States)

    Hjermstad, Marianne J; Bergenmar, Mia; Bjordal, Kristin; Fisher, Sheila E; Hofmeister, Dirk; Montel, Sébastien; Nicolatou-Galitis, Ourania; Pinto, Monica; Raber-Durlacher, Judith; Singer, Susanne; Tomaszewska, Iwona M; Tomaszewski, Krzysztof A; Verdonck-de Leeuw, Irma; Yarom, Noam; Winstanley, Julie B; Herlofson, Bente B

    2016-09-01

    This international EORTC validation study (phase IV) is aimed at testing the psychometric properties of a quality of life (QoL) module related to oral health problems in cancer patients. The phase III module comprised 17 items with four hypothesized multi-item scales and three single items. In phase IV, patients with mixed cancers, in different treatment phases from 10 countries completed the EORTC QLQ-C30, the QLQ-OH module, and a debriefing interview. The hypothesized structure was tested using combinations of classical test theory and item response theory, following EORTC guidelines. Test-retest assessments and responsiveness to change analysis (RCA) were performed after 2 weeks. Five hundred seventy-two patients (median age 60.3, 54 % females) were analyzed. Completion took issues were addressed. Analyses suggested a revision of the phase III hypothesized scale structure. Two items were deleted based on a high degree of item misfit, together with negative patient feedback. The remaining 15 items formed one eight-item scale named OH-QoL score, a two-item information scale, a two-item scale regarding dentures, and three single items (sticky saliva/mouth soreness/sensitivity to food/drink). Face and convergent validity and internal consistency were confirmed. Test-retest reliability (n = 60) was demonstrated as was RCA for patients undergoing chemotherapy (n = 117; p = 0.06). The resulting QLQ-OH15 discriminated between clinically distinct patient groups, e.g., low performance status vs. higher (p < 000.1), and head-and-neck cancer versus other cancers (p < 0.03). The EORTC module QLQ-OH15 is a short, well-accepted assessment tool focusing on oral problems and QoL to improve clinical management. ClinicalTrials.gov Identifier: NCT01724333.

  14. Initial development and psychometric testing of an instrument to measure the quality of children's end-of-life care.

    Science.gov (United States)

    Widger, Kimberley; Tourangeau, Ann E; Steele, Rose; Streiner, David L

    2015-01-01

    The field of pediatric palliative care is hindered by the lack of a well-defined, reliable, and valid method for measuring the quality of end-of-life care. The study purpose was to develop and test an instrument to measure mothers' perspectives on the quality of care received before, at the time of, and following a child's death. In Phase 1, key components of quality end-of-life care for children were synthesized through a comprehensive review of research literature. These key components were validated in Phase 2 and then extended through focus groups with bereaved parents. In Phase 3, items were developed to assess structures, processes, and outcomes of quality end-of-life care then tested for content and face validity with health professionals. Cognitive testing was conducted through interviews with bereaved parents. In Phase 4, bereaved mothers were recruited through 10 children's hospitals/hospices in Canada to complete the instrument, and psychometric testing was conducted. Following review of 67 manuscripts and 3 focus groups with 10 parents, 141 items were initially developed. The overall content validity index for these items was 0.84 as rated by 7 health professionals. Based on feedback from health professionals and cognitive testing with 6 parents, a 144-item instrument was finalized for further testing. In Phase 4, 128 mothers completed the instrument, 31 of whom completed it twice. Test-retest reliability, internal consistency, and construct validity were demonstrated for six subscales: Connect With Families, Involve Parents, Share Information With Parents, Share Information Among Health Professionals, Support Parents, and Provide Care at Death. Additional items with content validity were grouped in four domains: Support the Child, Support Siblings, Provide Bereavement Follow-up, and Structures of Care. Forty-eight items were deleted through psychometric testing, leaving a 95-item instrument. There is good initial evidence for the reliability and

  15. Governing by Testing: Circulation, Psychometric Knowledge, Experts and the "Alliance for Progress" in Latin America during the 1960s and 1970s

    Science.gov (United States)

    Alarcón, Cristina

    2015-01-01

    This paper analyzes the activities, members, and effects of an inter-American expert network for the diffusion of psychometric knowledge, specifically of standardized aptitude testing for university admission in Latin America during the 1960s and 1970s. Within the framework of educational transfer studies, the role of international,…

  16. Attitudes toward Science: Measurement and Psychometric Properties of the Test of Science-Related Attitudes for Its Use in Spanish-Speaking Classrooms

    Science.gov (United States)

    Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina

    2016-01-01

    Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the "Test of Science-Related Attitudes" (TOSRA), such as its construct validity, its…

  17. Psychometric Features of the General Aptitude Test-Verbal Part (GAT-V): A Large-Scale Assessment of High School Graduates in Saudi Arabia

    Science.gov (United States)

    Dimitrov, Dimiter M.; Shamrani, Abdul Rahman

    2015-01-01

    This study examines the psychometric features of a General Aptitude Test-Verbal Part, which is used with assessments of high school graduates in Saudi Arabia. The data supported a bifactor model, with one general factor and three content domains (Analogy, Sentence Completion, and Reading Comprehension) as latent aspects of verbal aptitude.

  18. [Can Psychometric Tests Predict Success in the Selection Interview for Medical School? A Cross-Sectional Study at One German Medical School].

    Science.gov (United States)

    Kötter, T; Obst, K U; Brüheim, L; Eisemann, N; Voltmer, E; Katalinic, A

    2017-07-01

    Background The final exam grade is the main selection criterion for medical school application in Germany. For academic success, it seems to be a reliable predictor. Its use as the only selection criterion is, however, criticised. At some universities, personal interviews are part of the selection process. However, these are very time consuming and are of doubtful validity. The (additional) use of appropriate psychometric instruments could reduce the cost and increase the validity. This study investigates the extent to which psychometric instruments can predict the outcome of a personal selection interview. Methods This is a cross-sectional study on the correlation of the results of psychometric instruments with those of the personal selection interview as part of the application process. As the outcome, the score of the selection interview was used. The NEO - Five Factor Inventory, the Hospital Anxiety and Depression Scale (HADS) and the questionnaire to identify work-related behaviour and experience patterns (AVEM) were used as psychometric interviews. Results There was a statistically significant correlation with the results of the personal selection interview for the sum score of the depression scale from the HADS and the sum score for the dimension of life satisfaction of the AVEM. In addition, those participants who did not previously complete an application training achieved a better result in the selection interview. Conclusion The instruments used measure different aspects than the interviews and cannot replace them. It remains to be seen whether the selected parameters are able to predict academic success. © Georg Thieme Verlag KG Stuttgart · New York.

  19. Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

    Science.gov (United States)

    King, Molly Elizabeth

    2016-01-01

    The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…

  20. The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

    Science.gov (United States)

    Silles, Mary A.

    2010-01-01

    This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at age 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…

  1. Using Raters from India to Score a Large-Scale Speaking Test

    Science.gov (United States)

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  2. AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

    Science.gov (United States)

    Cech, Scott J.

    2008-01-01

    More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…

  3. Psychometric testing of inventory of beliefs and attitudes on domestic violence.

    Science.gov (United States)

    Hutchinson, Marie; Doran, Frances

    2017-06-22

    Background Domestic violence (DV) is an international public health issue associated with adverse health outcomes for adults and children. There have been widespread calls to increase nurses' capacity to respond to DV and improve undergraduate nursing education in this area. However, there are few valid, reliable and contemporary measures of nursing attitudes towards and beliefs concerning DV that are suited for use in evaluating education programmes. Aim To establish the psychometric properties of a newly developed inventory designed to measure nursing students' beliefs about and attitudes towards DV. Discussion Exploratory factor analysis identified five factors, with a Cronbach's alpha of 0.646. The few factors loading>.80 suggest that the instrument has good discriminate validity. The absence of cross-loadings indicate good convergent validity. Conclusion The inventory provides one of the first validated and reliable measures for examining undergraduate nursing students' attitudes towards and beliefs about DV. Implications for practice The instrument is suited for use by nurse educators in assessing the influence of curriculum design and teaching strategies on student beliefs and attitudes. It would also be useful in studies investigating nurses' clinical practice on domestic violence.

  4. Cross-cultural adaptation and psychometric testing of the Quality of Dying and Death Questionnaire for the Spanish population.

    Science.gov (United States)

    Gutiérrez Sánchez, Daniel; Cuesta-Vargas, Antonio I

    2018-04-01

    Many measurements have been developed to assess the quality of death (QoD). Among these, the Quality of Dying and Death Questionnaire (QODD) is the most widely studied and best validated. Informal carers and health professionals who care for the patient during their last days of life can complete this assessment tool. The aim of the study is to carry out a cross-cultural adaptation and a psychometric analysis of the QODD for the Spanish population. The translation was performed using a double forward and backward method. An expert panel evaluated the content validity. The questionnaire was tested in a sample of 72 Spanish-speaking adult carers of deceased cancer patients. A psychometric analysis was performed to evaluate internal consistency, divergent criterion-related validity with the Mini-Suffering State Examination (MSSE) and concurrent criterion-related validity with the Palliative Outcome Scale (POS). Some items were deleted and modified to create the Spanish version of the QODD (QODD-ESP-26). The instrument was readable and acceptable. The content validity index was 0.96, suggesting that all items are relevant for the measure of the QoD. This questionnaire showed high internal consistency (Cronbach's α coefficient = 0.88). Divergent validity with MSSE (r = -0.64) and convergent validity with POS (r = -0.61) were also demonstrated. The QODD-ESP-26 is a valid and reliable instrument for the assessment of the QoD of deceased cancer patients that can be used in a clinical and research setting. Copyright © 2018 Elsevier Ltd. All rights reserved.

  5. A process dissociation approach to objective-projective test score interrelationships.

    Science.gov (United States)

    Bornstein, Robert F

    2002-02-01

    Even when self-report and projective measures of a given trait or motive both predict theoretically related features of behavior, scores on the 2 tests correlate modestly with each other. This article describes a process dissociation framework for personality assessment, derived from research on implicit memory and learning, which can resolve these ostensibly conflicting results. Research on interpersonal dependency is used to illustrate 3 key steps in the process dissociation approach: (a) converging behavioral predictions, (b) modest test score intercorrelations, and (c) delineation of variables that differentially affect self-report and projective test scores. Implications of the process dissociation framework for personality assessment and test development are discussed.

  6. Study protocol of psychometric properties of the Spanish translation of a competence test in evidence based practice: The Fresno test

    Directory of Open Access Journals (Sweden)

    Villafafila-Ferrero Rosa

    2009-02-01

    Full Text Available Abstract Background There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. Methods and design The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU. Participants will be: (a tutors of Family Medicine residents (expert group; (b Family Medicine residents in their second year of the Family Medicine training program (novice group, and (c Family Medicine physicians (intermediate group. The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. Discussion The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.

  7. Study protocol of psychometric properties of the Spanish translation of a competence test in evidence based practice: the Fresno test.

    Science.gov (United States)

    Argimon-Pallàs, Josep M; Flores-Mateo, Gemma; Jiménez-Villa, Josep; Pujol-Ribera, Enriqueta; Foz, Gonçal; Bundó-Vidiella, Magda; Juncosa, Sebastià; Fuentes-Bellido, Cruz M; Pérez-Rodríguez, Belén; Margalef-Pallarès, Francesc; Villafafila-Ferrero, Rosa; Forès-Garcia, Dolors; Roman-Martínez, Josep; Vilert-Garroga, Esther

    2009-02-24

    There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP) curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU). Participants will be: (a) tutors of Family Medicine residents (expert group); (b) Family Medicine residents in their second year of the Family Medicine training program (novice group), and (c) Family Medicine physicians (intermediate group). The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.

  8. Development and psychometric testing of the 'Motivation and Self-Efficacy in Early Detection of Skin Lesions' index.

    Science.gov (United States)

    Dyson, Judith; Cowdell, Fiona

    2014-12-01

    To develop and psychometrically test the Motivation and Self-Efficacy in Early Detection of Skin Lesions Index. Skin cancer is the most frequently diagnosed cancer worldwide. The primary strategy used to prevent skin cancer is promotion of sun avoidance and the use of sun protection. However, despite costly and extensive campaigns, cases of skin cancer continue to increase. If found and treated early, skin cancer is curable. Early detection is, therefore, very important. The study was conducted in 2013. Instrument Development. A literature review and a survey identified barriers (factors that hinder) and levers (factors that help) to skin self-examination. These were categorized according to a the Theoretical Domains Framework and this formed the basis of an instrument, which was tested for validity and reliability using confirmatory factor analysis and Cronbach's alpha respectively. A five-factor 20-item instrument was used that tested well for reliability and construct validity. Test-retest reliability was good for all items and domains. The five factors were: (i) Outcome expectancies; (ii) Intention; (iii) Self-efficacy; (iv) Social influences; (v) Memory. The Motivation and Self-Efficacy in Early Detection of Skin Lesions Index provides a reliable and valid method of assessing barriers and levers to skin self-examination. The next step is to design a theory-based intervention that can be tailored according to individual determinants to behaviour change identified by this instrument. © 2014 John Wiley & Sons Ltd.

  9. The Mother's Autonomy in Decision Making (MADM) scale: Patient-led development and psychometric testing of a new instrument to evaluate experience of maternity care.

    Science.gov (United States)

    Vedam, Saraswathi; Stoll, Kathrin; Martin, Kelsey; Rubashkin, Nicholas; Partridge, Sarah; Thordarson, Dana; Jolicoeur, Ganga

    2017-01-01

    To develop and validate a new instrument that assesses women's autonomy and role in decision making during maternity care. Through a community-based participatory research process, service users designed, content validated, and administered a cross-sectional quantitative survey, including 31 items on the experience of decision-making. Pregnancy experiences (n = 2514) were reported by 1672 women who saw a single type of primary maternity care provider in British Columbia. They described care by a midwife, family physician or obstetrician during 1, 2 or 3 maternity care cycles. We conducted psychometric testing in three separate samples. We assessed reliability, item-to-total correlations, and the factor structure of the The Mothers' Autonomy in Decision Making (MADM) scale. We report MADM scores by care provider type, length of prenatal appointments, preferences for role in decision-making, and satisfaction with experience of decision-making. The MADM scale measures a single construct: autonomy in decision-making during maternity care. Cronbach alphas for the scale exceeded 0.90 for all samples and all provider groups. All item-to-total correlations were replicable across three samples and exceeded 0.7. Eigenvalue and scree plots exhibited a clear 90-degree angle, and factor analysis generated a one factor scale. MADM median scores were highest among women who were cared for by midwives, and 10 or more points lower for those who saw physicians. Increased time for prenatal appointments was associated with higher scale scores, and there were significant differences between providers with respect to average time spent in prenatal appointments. Midwifery care was associated with higher MADM scores, even during short prenatal appointments (maternity care. This new scale was developed and content validated by community members representing various populations of childbearing women in BC including women from vulnerable populations. MADM measures women's ability to lead

  10. A comparison of likelihood ratio tests and Rao's score test for three separable covariance matrix structures.

    Science.gov (United States)

    Filipiak, Katarzyna; Klein, Daniel; Roy, Anuradha

    2017-01-01

    The problem of testing the separability of a covariance matrix against an unstructured variance-covariance matrix is studied in the context of multivariate repeated measures data using Rao's score test (RST). The RST statistic is developed with the first component of the separable structure as a first-order autoregressive (AR(1)) correlation matrix or an unstructured (UN) covariance matrix under the assumption of multivariate normality. It is shown that the distribution of the RST statistic under the null hypothesis of any separability does not depend on the true values of the mean or the unstructured components of the separable structure. A significant advantage of the RST is that it can be performed for small samples, even smaller than the dimension of the data, where the likelihood ratio test (LRT) cannot be used, and it outperforms the standard LRT in a number of contexts. Monte Carlo simulations are then used to study the comparative behavior of the null distribution of the RST statistic, as well as that of the LRT statistic, in terms of sample size considerations, and for the estimation of the empirical percentiles. Our findings are compared with existing results where the first component of the separable structure is a compound symmetry (CS) correlation matrix. It is also shown by simulations that the empirical null distribution of the RST statistic converges faster than the empirical null distribution of the LRT statistic to the limiting χ 2 distribution. The tests are implemented on a real dataset from medical studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Perceived Breastfeeding Support Assessment Tool (PBSAT): development and testing of psychometric properties with Pakistani urban working mothers.

    Science.gov (United States)

    Hirani, Shela Akbar Ali; Karmaliani, Rozina; Christie, Thomas; Parpio, Yasmin; Rafique, Ghazala

    2013-06-01

    breast feeding is an essential source of nutrition among young babies; however, in Pakistan a gradual decline in prevalence of breast feeding, especially among urban working mothers, has been reported. Previous studies among Pakistani urban working mothers have revealed that ensuring exclusivity and continuation of breast feeding is challenging if social and/or workplace environmental support is minimal or absent. This problem indicated a crucial need to assess availability of breast-feeding support for Pakistani urban working mothers by using a comprehensive, reliable, and validated tool in their national language (Urdu). to develop and test the psychometric properties of the 'Perceived Breastfeeding Support Assessment Tool' (PBSAT) that can gauge Pakistani urban working mothers' perceptions about breast-feeding support. this methodological research was undertaken in five phases. During phase I, a preliminary draft of the PBSAT was developed by using the Socio-ecological model, reviewing literature, and referring to two United States based tools. In Phase II, the instrument was evaluated by seven different experts, and, in Phase III, the instrument was revised, translated, and back translated. In Phase IV, the tool was pilot tested among 20 participants and then modified on the basis of statistical analysis. In Phase V, the refined instrument was tested on 200 breast-feeding working mothers recruited through purposive sampling from the government and private health-care settings in Karachi, Pakistan. Approvals were received from the Ethical Review Committees of the identified settings. the 29-item based PBSAT revealed an acceptable inter-rater reliability of 0.95, and an internal consistency reliability coefficient (Cronbach's alpha) of 0.85. A construct validity assessment through Exploratory Factor Analysis revealed that the PBSAT has two dimensions, 'workplace environmental support' (12 items; α=0.86) and 'social environmental support' (17 items; α=0.77). the

  12. Effects of Analytical and Holistic Scoring Patterns on Scorer Reliability in Biology Essay Tests

    Science.gov (United States)

    Ebuoh, Casmir N.

    2018-01-01

    Literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable and this unreliability is more likely to be more in internal examinations than in the external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…

  13. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    Science.gov (United States)

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  14. Individual Differences in Digit Span, Susceptibility to Proactive Interference, and Aptitude/Achievement Test Scores.

    Science.gov (United States)

    Dempster, Frank N.; Cooney, John B.

    1982-01-01

    Individual differences in digit span, susceptibility to proactive interference, and various aptitude/achievement test scores were investigated in two experiments with college students. Results indicated that digit span was strongly correlated with aptitude/achievement scores, but did not indicate that susceptibility to proactive interference…

  15. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    Science.gov (United States)

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  16. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    Science.gov (United States)

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  17. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    Science.gov (United States)

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  18. The End-Stage Renal Disease Adherence Questionnaire (ESRD-AQ): testing the psychometric properties in patients receiving in-center hemodialysis.

    OpenAIRE

    Kim, Y; Evangelista, LS; Phillips, LR; Pavlish, C; Kopple, JD

    2010-01-01

    Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and v...

  19. Contributions of Hamstring Stiffness to Straight-Leg-Raise and Sit-and-Reach Test Scores.

    Science.gov (United States)

    Miyamoto, Naokazu; Hirata, Kosuke; Kimura, Noriko; Miyamoto-Mikami, Eri

    2018-02-01

    The passive straight-leg-raise (PSLR) and the sit-and-reach (SR) tests have been widely used to assess hamstring extensibility. However, it remains unclear to what extent hamstring stiffness (a measure of material properties) contributes to PSLR and SR test scores. Therefore, we aimed to clarify the relationship between hamstring stiffness and PSLR and SR scores using ultrasound shear wave elastography. Ninety-eight healthy subjects completed the study. Each subject completed PSLR testing, and classic and modified SR testing of the right leg. Muscle shear modulus of the biceps femoris, semitendinosus, and semimembranosus was quantified as an index of muscle stiffness. The relationships between shear modulus of each muscle and PSLR or SR scores were calculated using Pearson's product-moment correlation coefficients. Shear modulus of the semitendinosus and semimembranosus showed negative correlations with the two PSLR and two SR scores (absolute r value≤0.484). Shear modulus of the biceps femoris was significantly correlated with the PSLR score determined by the examiner and the modified SR score (absolute r value≤0.308). The present findings suggest that PSLR and SR test scores are strongly influenced by factors other than hamstring stiffness and therefore might not accurately evaluate hamstring stiffness. © Georg Thieme Verlag KG Stuttgart · New York.

  20. Relationships between spatial activities and scores on the mental rotation test as a function of sex.

    Science.gov (United States)

    Ginn, Sheryl R; Pickens, Stefanie J

    2005-06-01

    Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.

  1. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    Science.gov (United States)

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  2. Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test

    KAUST Repository

    Cai, T.; Lin, X.; Carroll, R. J.

    2012-01-01

    the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least

  3. Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

    Science.gov (United States)

    Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

    2015-11-01

    In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. (c) 2015 APA, all rights reserved).

  4. Cross-Cultural Adaptation and Psychometric Testing of the Brazilian Version of the Self-Care of Heart Failure Index Version 6.2

    Science.gov (United States)

    Ávila, Christiane Wahast; Riegel, Barbara; Pokorski, Simoni Chiarelli; Camey, Suzi; Silveira, Luana Claudia Jacoby; Rabelo-Silva, Eneida Rejane

    2013-01-01

    Objective. To adapt and evaluate the psychometric properties of the Brazilian version of the SCHFI v 6.2. Methods. With the approval of the original author, we conducted a complete cross-cultural adaptation of the instrument (translation, synthesis, back translation, synthesis of back translation, expert committee review, and pretesting). The adapted version was named Brazilian version of the self-care of heart failure index v 6.2. The psychometric properties assessed were face validity and content validity (by expert committee review), construct validity (convergent validity and confirmatory factor analysis), and reliability. Results. Face validity and content validity were indicative of semantic, idiomatic, experimental, and conceptual equivalence. Convergent validity was demonstrated by a significant though moderate correlation (r = −0.51) on comparison with equivalent question scores of the previously validated Brazilian European heart failure self-care behavior scale. Confirmatory factor analysis supported the original three-factor model as having the best fit, although similar results were obtained for inadequate fit indices. The reliability of the instrument, as expressed by Cronbach's alpha, was 0.40, 0.82, and 0.93 for the self-care maintenance, self-care management, and self-care confidence scales, respectively. Conclusion. The SCHFI v 6.2 was successfully adapted for use in Brazil. Nevertheless, further studies should be carried out to improve its psychometric properties. PMID:24163765

  5. Methods for Examining the Psychometric Quality of Subscores: A Review and Application

    Science.gov (United States)

    Wedman, Jonathan; Lyrén, Per-Erik

    2015-01-01

    When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…

  6. Psychometric properties of the Purpose-In-Life Test and age-related differences among women diagnosed with eating disorders.

    Science.gov (United States)

    García-Alandete, Joaquín; Ros, Montserrat Cañabate; Salvador, José Heliodoro Marco; Rodríguez, Sandra Pérez

    2018-03-01

    The aim of this study was to analyze the psychometric properties of the Purpose-In-Life Test (PIL), as well as the age-related differences in meaning in life in women diagnosed with eating disorders. Participants were 250 Spanish women diagnosed with eating disorders who ranged from 12 to 60 years old. Confirmatory Factor Analysis, descriptive analyses, estimation of the internal consistency of the PIL, correlations between the PIL and the Beck Hopelessness Scale (BHS), Overweight Preoccupation Scale (OPS), and Body Investment Scale (BIS), and age differences were calculated. A 19-item model that showed a good fit and internal consistency, a negative correlation between the PIL and both the BHS and OPS, and a positive correlation with the BIS, as well as significant differences between the adolescents and the mature adults, were found. It would be advisable to increase the inclusion of meaning in life in psychotherapeutic interventions with women diagnosed with eating disorders. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Development and Psychometric Testing of a Scale for Evaluating Self-Management Needs of Knee Osteoarthritis (SMNKOA) in Taiwan.

    Science.gov (United States)

    Kao, Mei-Hua; Tsai, Yun-Fang

    2017-06-01

    Self-management of osteoarthritis (OA) of the knee is important for treating this chronic disease. This study developed and psychometrically tested a new instrument for measuring adult patients' self-management needs of knee osteoarthritis (SMNKOA). The theoretical framework of self-care guided the development of the 35-item SMNKOA scale. Participants ( N = 372) were purposively sampled from orthopedic clinics at medical centers in Taiwan. The content validity index was 0.83. Principal components analysis identified a three-factor solution, accounting for 53.19% of the variance. The divergent validity was -0.67; convergent validity was -0.51. Cronbach's alpha was .95, Pearson's correlation coefficient was .88, and the intraclass correlation coefficient was .95. The scale's reliability and validity supports the SMNKOA, as a tool to measure self-management needs of adults with knee OA. Nurses and other health care providers can use this instrument to evaluate knee OA patients and identify strategies for improving health-related outcomes and patient education.

  8. Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André

    2016-01-01

    Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…

  9. THE SIMULATED SOCIAL-INTERACTION TEST - A PSYCHOMETRIC EVALUATION WITH DUTCH SOCIAL PHOBIC PATIENTS

    NARCIS (Netherlands)

    MERSCH, PPA; BREUKERS, P; EMMELKAMP, PMG

    1992-01-01

    The Simulated Social Interaction Test (SSIT) was translated and adjusted for use on a population of Dutch males and females. Seventy-four social phobic patients were assessed with the SSIT, a conversation test, and an interview with an independent observer. Results show that the SSIT is a relatively

  10. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    Science.gov (United States)

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

  11. Does breastfeeding contribute to the racial gap in reading and math test scores?

    Science.gov (United States)

    Peters, Kristen E; Huang, Jin; Vaughn, Michael G; Witko, Christopher

    2013-10-01

    The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0-12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented. Copyright © 2013 Elsevier Inc. All rights reserved.

  12. Depressive status explains a significant amount of the variance in COPD assessment test (CAT) scores.

    Science.gov (United States)

    Miravitlles, Marc; Molina, Jesús; Quintano, José Antonio; Campuzano, Anna; Pérez, Joselín; Roncero, Carlos

    2018-01-01

    COPD assessment test (CAT) is a short, easy-to-complete health status tool that has been incorporated into the multidimensional assessment of COPD in order to guide therapy; therefore, it is important to understand the factors determining CAT scores. This is a post hoc analysis of a cross-sectional, observational study conducted in respiratory medicine departments and primary care centers in Spain with the aim of identifying the factors determining CAT scores, focusing particularly on the cognitive status measured by the Mini-Mental State Examination (MMSE) and levels of depression measured by the short Beck Depression Inventory (BDI). A total of 684 COPD patients were analyzed; 84.1% were men, the mean age of patients was 68.7 years, and the mean forced expiratory volume in 1 second (%) was 55.1%. Mean CAT score was 21.8. CAT scores correlated with the MMSE score (Pearson's coefficient r =-0.371) and the BDI ( r =0.620), both p CAT scores and explained 45% of the variability. However, a model including only MMSE and BDI scores explained up to 40% and BDI alone explained 38% of the CAT variance. CAT scores are associated with clinical variables of severity of COPD. However, cognitive status and, in particular, the level of depression explain a larger percentage of the variance in the CAT scores than the usual COPD clinical severity variables.

  13. Validation of Patient-Reported Outcomes Measurement Information System Computerized Adaptive Tests Against the Foot and Ankle Outcome Score for 6 Common Foot and Ankle Pathologies.

    Science.gov (United States)

    Koltsov, Jayme C B; Greenfield, Stephen T; Soukup, Dylan; Do, Huong T; Ellis, Scott J

    2017-08-01

    The field of foot and ankle surgery lacks a widely accepted gold-standard patient-reported outcome instrument. With the changing infrastructure of the medical profession, more efficient patient-reported outcome tools are needed to reduce respondent burden and increase participation while providing consistent and reliable measurement across multiple pathologies and disciplines. The primary purpose of the present study was to validate 3 Patient-Reported Outcomes Measurement Information System computer adaptive tests (CATs) most relevant to the foot and ankle discipline against the Foot and Ankle Outcome Score (FAOS) and the Short Form 12 general health status survey in patients with 6 common foot and ankle pathologies. Patients (n = 240) indicated for operative treatment for 1 of 6 common foot and ankle pathologies completed the CATs, FAOS, and Short Form 12 at their preoperative surgical visits, 1 week subsequently (before surgery), and at 6 months postoperatively. The psychometric properties of the instruments were assessed and compared. The Patient-Reported Outcomes Measurement Information System CATs each took less than 1 minute to complete, whereas the FAOS took 6.5 minutes, and the Short Form 12 took 3 minutes. CAT scores were more normally distributed and had fewer floor and ceiling effects than those on the FAOS, which reached as high as 24%. The CATs were more precise than the FAOS and had similar responsiveness and test-retest reliability. The physical function and mobility CATs correlated strongly with the activities subscale of the FAOS, and the pain interference CAT correlated strongly with the pain subscale of the FAOS. The CATs and FAOS were responsive to changes with operative treatment for 6 common foot and ankle pathologies. The CATs performed as well as or better than the FAOS in all aspects of psychometric validity. The Patient-Reported Outcomes Measurement Information System CATs show tremendous potential for improving the study of patient

  14. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    Science.gov (United States)

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  15. Peabody Picture Vocabulary Test (PPVT-III: Psychometric properties and significance for application

    Directory of Open Access Journals (Sweden)

    Nataša Bucik

    2003-12-01

    Full Text Available The purpose of this article is to present the content, conceptual structure and methodological steps of the latest revision of the Peabody Picture Vocabulary Test (PPVT-III, which is a highly functional and valuable vocabulary test that has been in use since 1959 in different language and cultural surroundings. On the case of the PPVT-III we are presenting the procedure of development and standardization of such vocabulary tests as well as its translation and adaptation from one language and cultural milieu to another. We also note the practical use of the PPVT-III for research purposes. In Slovenian language no vocabulary tests were developed or adapted so far; PPVT-III is presented in this context, too.

  16. Development and psychometric testing of the Mariani Nursing Career Satisfaction Scale.

    Science.gov (United States)

    Mariani, Bette; Allen, Lois Ryan

    2014-01-01

    The Mariani Nursing Career Satisfaction Scale (MNCSS) was developed to explore the influence of mentoring on career satisfaction of registered nurses (RNs). A review of the literature revealed no contemporary valid and reliable measure of career satisfaction. The MNCSS is a semantic differential of 16 opposite adjective pairs on which participants rate feelings about their nursing career. The MNCSS was used in a pilot study and three major studies exploring career satisfaction of RNs. Validity, reliability, and exploratory factor analysis (FA) were computed to explore the internal structure of the instrument. The newly developed instrument had a content validity index (CVI) of .84 and Cronbach's alpha internal consistency reliabilities of .93-.96 across three major studies. Exploratory FA (N = 496) revealed a univocal instrument with one factor that explains 57.8% of the variance in career satisfaction scores. The MNCSS is a valid and reliable instrument for measuring career satisfaction. FA of the combined data from three studies yielded one factor that measures the concept of career satisfaction.

  17. The Diabetes Symptom Self-Care Inventory: development and psychometric testing with Mexican Americans.

    Science.gov (United States)

    García, Alexandra A

    2011-04-01

    Type 2 diabetes is prevalent throughout the world. In previous studies of Mexican Americans with type 2 diabetes, 95%-97% of those sampled reported having symptoms they believe were caused by diabetes and most self-treated their symptoms. To more accurately capture Mexican Americans' symptom prevalence and their self-treatments, the Diabetes Symptom Self-Care Inventory (DSSCI) was adapted from the Diabetes Self-Care Instrument. This article describes the modification process used to perfect the DSSCI for use in improving self-care among people with type 2 diabetes. This instrumentation study used qualitative and quantitative methods. The study was completed in four phases that used focus groups, cognitive interviews, and survey administration. Four convenience samples were drawn from community-dwelling Mexican American adults, aged 25-75 years, with type 2 diabetes in an urban area and a rural location in Texas. Phase I: Seven focus groups (n=45) generated data for revising items. Phase II: Cognitive interviews with 16 participants were used to evaluate four revisions of the questionnaire. Phase III: Surveys were administered to 81 participants. Total number of symptoms on the DSSCI correlated with scores on the Centers for Epidemiological Studies-Depression scale (r=0.65, PPerception Questionnaire-Revised Diabetes Symptom subscale (r=0.57, PAmericans' diabetes symptoms and the actions they take to address them. Copyright © 2011 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.

  18. The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

    Science.gov (United States)

    Longenbecker, Sueann; Wood, Peter H.

    1984-01-01

    Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)

  19. Standardised test protocol (Constant Score) for evaluation of functionality in patients with shoulder disorders

    DEFF Research Database (Denmark)

    Ban, Ilija; Troelsen, Anders; Christiansen, David Høyrup

    2013-01-01

    INTRODUCTION: The Constant Score (CS), developed as a scoring system to evaluate overall functionality of patients with shoulder disorders, is widely used but has been criticised for relying on an imprecise terminology and for lack of a standardised methodology. A modified guideline was therefore...... differences. One of the authors of the modified CS approved both the English and the Danish test protocol. CONCLUSION: A simple test protocol of the modified CS was developed in both English and Danish. With precise terminology and definitions, the test protocol is the first of its kind. We suggest its use...

  20. Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules.

    Science.gov (United States)

    Bereby-Meyer, Yoella; Meyer, Joachim; Budescu, David V

    2003-02-01

    This paper assesses framing effects on decision making with internal uncertainty, i.e., partial knowledge, by focusing on examinees' behavior in multiple-choice (MC) tests with different scoring rules. In two experiments participants answered a general-knowledge MC test that consisted of 34 solvable and 6 unsolvable items. Experiment 1 studied two scoring rules involving Positive (only gains) and Negative (only losses) scores. Although answering all items was the dominating strategy for both rules, the results revealed a greater tendency to answer under the Negative scoring rule. These results are in line with the predictions derived from Prospect Theory (PT) [Econometrica 47 (1979) 263]. The second experiment studied two scoring rules, which allowed respondents to exhibit partial knowledge. Under the Inclusion-scoring rule the respondents mark all answers that could be correct, and under the Exclusion-scoring rule they exclude all answers that might be incorrect. As predicted by PT, respondents took more risks under the Inclusion rule than under the Exclusion rule. The results illustrate that the basic process that underlies choice behavior under internal uncertainty and especially the effect of framing is similar to the process of choice under external uncertainty and can be described quite accurately by PT. Copyright 2002 Elsevier Science B.V.

  1. Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

    Science.gov (United States)

    Curtis, David

    2018-01-01

    Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.

  2. Psychometric evaluation of the EORTC computerized adaptive test (CAT) fatigue item pool

    DEFF Research Database (Denmark)

    Petersen, Morten Aa; Giesinger, Johannes M; Holzner, Bernhard

    2013-01-01

    Fatigue is one of the most common symptoms associated with cancer and its treatment. To obtain a more precise and flexible measure of fatigue, the EORTC Quality of Life Group has developed a computerized adaptive test (CAT) measure of fatigue. This is part of an ongoing project developing a CAT...

  3. Psychometric Properties and Factorial Structure of the Chinese Version of the Gratitude Resentment and Appreciation Test

    Science.gov (United States)

    Lin, Shu-Hui; Huang, Yun-Chen

    2016-01-01

    The purpose of this study was to validate a Chinese version of the Gratitude Resentment and Appreciation Test (GRAT) with Taiwanese students. In Study 1, a total of 2511 Taiwanese students participated and completed the translated GRAT. Exploratory factor analysis, confirmatory factor analysis and reliability analysis were undertaken to assess the…

  4. The Caregiver Contribution to Heart Failure Self-Care (CACHS): Further Psychometric Testing of a Novel Instrument.

    Science.gov (United States)

    Buck, Harleah G; Harkness, Karen; Ali, Muhammad Usman; Carroll, Sandra L; Kryworuchko, Jennifer; McGillion, Michael

    2017-04-01

    Caregivers (CGs) contribute important assistance with heart failure (HF) self-care, including daily maintenance, symptom monitoring, and management. Until CGs' contributions to self-care can be quantified, it is impossible to characterize it, account for its impact on patient outcomes, or perform meaningful cost analyses. The purpose of this study was to conduct psychometric testing and item reduction on the recently developed 34-item Caregiver Contribution to Heart Failure Self-care (CACHS) instrument using classical and item response theory methods. Fifty CGs (mean age 63 years ±12.84; 70% female) recruited from a HF clinic completed the CACHS in 2014 and results evaluated using classical test theory and item response theory. Items would be deleted for low (.95) endorsement, low (.7) corrected item-total correlations, significant pairwise correlation coefficients, floor or ceiling effects, relatively low latent trait and item information function levels ( .5), and differential item functioning. After analysis, 14 items were excluded, resulting in a 20-item instrument (self-care maintenance eight items; monitoring seven items; and management five items). Most items demonstrated moderate to high discrimination (median 2.13, minimum .77, maximum 5.05), and appropriate item difficulty (-2.7 to 1.4). Internal consistency reliability was excellent (Cronbach α = .94, average inter-item correlation = .41) with no ceiling effects. The newly developed 20-item version of the CACHS is supported by rigorous instrument development and represents a novel instrument to measure CGs' contribution to HF self-care. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  5. Development and psychometric testing of a new instrument to measure factors influencing women's breast cancer prevention behaviors (ASSISTS).

    Science.gov (United States)

    Khazaee-Pool, Maryam; Majlessi, Fereshteh; Montazeri, Ali; Pashaei, Tahereh; Gholami, Ali; Ponnet, Koen

    2016-07-22

    Breast cancer preventive behaviors have an extreme effect on women's health. Despite the benefits of preventive behaviors regarding breast cancer, they have not been implemented as routine care for healthy women. To assess this health issue, a reliable and valid scale is needed. The aim of the present study is to develop and examine the psychometric properties of a new scale, called the ASSISTS, in order to identify factors that affect women's breast cancer prevention behaviors. A multi-phase instrument development method was performed to develop the questionnaire from February 2012 to September 2014. The item pool was generated based on secondary analyses of previous qualitative data. Then, content and face validity were applied to provide a pre-final version of the scale. The scale validation was conducted with a sample of women recruited from health centers affiliated with Tehran University of Medical Sciences. The construct validity (both exploratory and confirmatory), convergent validity, discriminate validity, internal consistency reliability and test-retest analysis of the questionnaire were tested. Fifty-eight items were initially extracted from the secondary analysis of previous qualitative data. After content validity, this was reduced to 49 items. The exploratory factor analysis revealed seven factors (Attitude, supportive systems, self-efficacy, information seeking, stress management, stimulant and self-care) containing 33 items that jointly accounted for 60.62 % of the observed variance. The confirmatory factor analysis showed a model with appropriate fitness for the data. The Cronbach's alpha coefficient for the subscales ranged from 0.68 to 0.85, and the Intraclass Correlation Coefficient (ICC) ranged from 0.71 to 0.98; which is well above the acceptable thresholds. The findings showed that the designed questionnaire was a valid and reliable instrument for assessing factors affecting women's breast cancer prevention behaviors that can be used both

  6. Chronic Widespread Back Pain is Distinct From Chronic Local Back Pain: Evidence From Quantitative Sensory Testing, Pain Drawings, and Psychometrics.

    Science.gov (United States)

    Gerhardt, Andreas; Eich, Wolfgang; Janke, Susanne; Leisner, Sabine; Treede, Rolf-Detlef; Tesarz, Jonas

    2016-07-01

    Whether chronic localized pain (CLP) and chronic widespread pain (CWP) have different mechanisms or to what extent they overlap in their pathophysiology is controversial. The study compared quantitative sensory testing profiles of nonspecific chronic back pain patients with CLP (n=48) and CWP (n=29) with and fibromyalgia syndrome (FMS) patients (n=90) and pain-free controls (n = 40). The quantitative sensory testing protocol of the "German-Research-Network-on-Neuropathic-Pain" was used to measure evoked pain on the painful area in the lower back and the pain-free hand (thermal and mechanical detection and pain thresholds, vibration threshold, pain sensitivity to sharp and blunt mechanical stimuli). Ongoing pain and psychometrics were captured with pain drawings and questionnaires. CLP patients did not differ from pain-free controls, except for lower pressure pain threshold (PPT) on the back. CWP and FMS patients showed lower heat pain threshold and higher wind-up ratio on the back and lower heat pain threshold and cold pain threshold on the hand. FMS showed lower PPT on back and hand, and higher comorbidity of anxiety and depression and more functional impairment than all other groups. Even after long duration CLP presents with a local hypersensitivity for PPT, suggesting a somatotopically specific sensitization of nociceptive processing. However, CWP patients show widespread ongoing pain and hyperalgesia for different stimuli that is generalized in space, suggesting the involvement of descending control systems, as also suggested for FMS patients. Because mechanisms in nonspecific chronic back pain with CLP and CWP differ, these patients should be distinguished in future research and allocated to different treatments.

  7. The Impact of the Use of Hierarchical Teaching on Test Scores of Students’ Technology

    Directory of Open Access Journals (Sweden)

    Zhao Guorong

    2015-01-01

    Full Text Available Test scores of students’ technology is the main basis for physical examination of college students’ physical, fitness evaluation based on test results. To change the view by the stratified teaching method consistent system of teaching mode, special movement technical level of students is improved significantly.

  8. How Well Does the Sum Score Summarize the Test? Summability as a Measure of Internal Consistency

    NARCIS (Netherlands)

    Goeman, J.J.; De, Jong N.H.

    2018-01-01

    Many researchers use Cronbach's alpha to demonstrate internal consistency, even though it has been shown numerous times that Cronbach's alpha is not suitable for this. Because the intention of questionnaire and test constructers is to summarize the test by its overall sum score, we advocate

  9. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    Science.gov (United States)

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  10. From Test Scores to Language Use: Emergent Bilinguals Using English to Accomplish Academic Tasks

    Science.gov (United States)

    Rodriguez-Mojica, Claudia

    2018-01-01

    Prominent discourses about emergent bilinguals' academic abilities tend to focus on performance as measured by test scores and perpetuate the message that emergent bilinguals trail far behind their peers. When we remove the constraints of formal testing situations, what can emergent bilinguals do in English as they engage in naturally occurring…

  11. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    Science.gov (United States)

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  12. A disease-specific measure of health-related quality of life in adults with chronic immune thrombocytopenic purpura: psychometric testing in an open-label clinical trial.

    Science.gov (United States)

    Mathias, Susan D; Bussel, James B; George, James N; McMillan, Robert; Okano, Gary J; Nichol, Janet L

    2007-05-01

    The Immune Thrombocytopenic Purpura Patient Assessment Questionnaire (ITP-PAQ) was developed to assess disease-specific quality of life (QoL) in adults with ITP. It is a 44-item questionnaire that includes scales for physical health (symptoms, fatigue/sleep, bother, and activity), emotional health (psychological and fear), overall QoL, social activity, women's reproductive health, and work. A previous study reported preliminary evidence of its reliability and validity. The present study was conducted to ascertain the responsiveness (ability to detect a clinically important treatment effect), reliability, and validity of the ITP-PAQ and to corroborate the earlier findings. The women's reproductive health scale was evaluated for psychometric evidence of the existence of separate menstrual symptoms and fertility subscales. The ITP-PAQ was evaluated in the context of an ongoing open-label extension study assessing the tolerability and durability of increases in the platelet count with AMG 531 (a thrombopoiesis peptibody that increases platelet production by targeting the thrombopoietin receptor) administered by subcutaneous injection once weekly in adult patients with ITP It was self-administered at baseline and at weeks 4, 12, and 24. The responsiveness of the questionnaire was evaluated by calculating and comparing the change scores of patients who showed clinical improvement-categorized as platelet responders (those with a platelet count > or =50 x 10(9) cells/L and a doubling of baseline values at week 24) and durable platelet responders (those with a platelet count > or =50 x 10(9) cells/L and a doubling of baseline values on > or =6 occasions during weeks 17-24)-with the change scores of patients wh did not show clinical improvement. The reliability (internal consistency and test-retest) and validity (convergent, discriminant, and known groups) of the questionnaire were also evaluated. Validity was examined in terms of correlations between the ITP-PAQ and the 36

  13. A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

    Science.gov (United States)

    Kosinski, Andrzej S

    2013-03-15

    Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.

  14. The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.

    Science.gov (United States)

    Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary

    2017-10-01

    Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. Our results suggest a statistically significant difference in ImPACT composite scores between

  15. The Impact of Correction for Guessing Formula on MC and Yes/No Vocabulary Tests' Scores

    Directory of Open Access Journals (Sweden)

    abdollah baradaran

    2009-10-01

    Full Text Available A standard correction for random guessing (cfg formula on multiple-choice and Yes/Noexaminations was examined retrospectively in the scores of the intermediate female EFL learners in an English language school. The correctionwas a weighting formula for points awarded for correct answers,incorrect answers, and unanswered questions so that the expectedvalue of the increase in test score due to guessing was zero. The researcher compared uncorrected and corrected scores on examinationsusing multiple-choice and Yes/No formats. These short-answer formats eliminatedor at least greatly reduced the potential for guessing the correctanswer. The expectation for students to improve their grade by guessingon multiple-choice and Yes/No format examinations is well known. The researcher examined a method for correcting for random guessing (cfg " no knowledge" on multiple- choice and Yes/No vocabulary examinations by comparing application and non-application of correction for guessing (cfg formula on scores on these examinations. It was done to determine whether the test takers really knew the correct answer, or they had resorted to a kind of guessing. This study represented a unique opportunity to compare scores from multiple-choice and Yes/No examinations in a settingin which students were given the same number of questions ineach of the two format types testing their knowledge over thesame subject matter. The results of this study indicated that the significant differences were highlighted between the subjects' scores when cfg formula was applied and when it was not.

  16. Validation of new prognostic and predictive scores by sequential testing approach

    International Nuclear Information System (INIS)

    Nieder, Carsten; Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid

    2010-01-01

    Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)

  17. Validation of new prognostic and predictive scores by sequential testing approach

    Energy Technology Data Exchange (ETDEWEB)

    Nieder, Carsten [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway); Inst. of Clinical Medicine, Univ. of Tromso (Norway); Haukland, Ellinor; Pawinski, Adam; Dalhaug, Astrid [Radiation Oncology Unit, Nordland Hospital, Bodo (Norway)

    2010-03-15

    Background and Purpose: For practitioners, the question arises how their own patient population differs from that used in large-scale analyses resulting in new scores and nomograms and whether such tools actually are valid at a local level and thus can be implemented. A recent article proposed an easy-to-use method for the in-clinic validation of new prediction tools with a limited number of patients, a so-called sequential testing approach. The present study evaluates this approach in scores related to radiation oncology. Material and Methods: Three different scores were used, each predicting short overall survival after palliative radiotherapy (bone metastases, brain metastases, metastatic spinal cord compression). For each scenario, a limited number of consecutive patients entered the sequential testing approach. The positive predictive value (PPV) was used for validation of the respective score and it was required that the PPV exceeded 80%. Results: For two scores, validity in the own local patient population could be confirmed after entering 13 and 17 patients, respectively. For the third score, no decision could be reached even after increasing the sample size to 30. Conclusion: In-clinic validation of new predictive tools with sequential testing approach should be preferred over uncritical adoption of tools which provide no significant benefit to local patient populations. Often the necessary number of patients can be reached within reasonable time frames even in small oncology practices. In addition, validation is performed continuously as the data are collected. (orig.)

  18. Psychometric evaluation of an item bank for computerized adaptive testing of the EORTC QLQ-C30 cognitive functioning dimension in cancer patients

    DEFF Research Database (Denmark)

    Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J. B.

    2017-01-01

    on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). METHODS: In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients...... model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study...... sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. CONCLUSION: A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient...

  19. Development and psychometric testing of a chinese version of the caregiver burden index for parents of children with allergies.

    Science.gov (United States)

    Liu, Pei-Ching; Gau, Bih-Shya; Hung, Chao-Chia

    2015-01-01

    BACKGROUNG: No specific instrument has thus far been developed for measuring the caregiver burden perceived by parents of children with allergies (CWA). To determine the psychometric properties of the Caregiver Burden Index (CBI). A mixed-methods design was adopted to evaluate the psychometric properties of the scale. The content validity index was 0.89, and the internal consistency was high with a coefficient alpha of 0.98. Three factors were extracted after exploratory factor analysis. The study findings suggest that the CBI has sufficient reliability and validity to evaluate the caregiver burden of parents of CWA. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. A score based on screening tests to differentiate mild cognitive impairment from subjective memory complaints

    Directory of Open Access Journals (Sweden)

    Fábio Henrique de Gobbi Porto

    2013-09-01

    Full Text Available It is not easy to differentiate patients with mild cognitive impairment (MCI from subjective memory complainers (SMC. Assessments with screening cognitive tools are essential, particularly in primary care where most patients are seen. The objective of this study was to evaluate the diagnostic accuracy of screening cognitive tests and to propose a score derived from screening tests. Elderly subjects with memory complaints were evaluated using the Mini Mental State Examination (MMSE and the Brief Cognitive Battery (BCB. We added two delayed recalls in the MMSE (a delayed recall and a late-delayed recall, LDR, and also a phonemic fluency test of letter P fluency (LPF. A score was created based on these tests. The diagnoses were made on the basis of clinical consensus and neuropsychological testing. Receiver operating characteristic curve analyses were used to determine area under the curve (AUC, the sensitivity and specificity for each test separately and for the final proposed score. MMSE, LDR, LPF and delayed recall of BCB scores reach statistically significant differences between groups (P=0.000, 0.03, 0.001 and 0.01, respectively. Sensitivity, specificity and AUC were MMSE: 64%, 79% and 0.75 (cut off <29; LDR: 56%, 62% and 0.62 (cut off <3; LPF: 71%, 71% and 0.71 (cut off <14; delayed recall of BCB: 56%, 82% and 0.68 (cut off <9. The proposed score reached a sensitivity of 88% and 76% and specificity of 62% and 75% for cut off over 1 and over 2, respectively. AUC were 0.81. In conclusion, a score created from screening tests is capable of discriminating MCI from SMC with moderate to good accurancy.

  1. The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

    Science.gov (United States)

    Ockey, Gary J.

    2009-01-01

    The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

  2. Do Standardized Tests Penalize Deep-Thinking, Creative, or Conscientious Students?: Some Personality Correlates of Graduate Record Examinations Test Scores

    Science.gov (United States)

    Powers, Donald E.; Kaufman, James C.

    2004-01-01

    The objective of the study reported here was to explore the relationship of Graduate Record Examinations (GRE) General Test scores to selected personality traits--conscientiousness, rationality, ingenuity, quickness, creativity, and depth. A sample of 342 GRE test takers completed short personality inventory scales for each trait. Analyses…

  3. Development and Psychometric Testing of a Novel Food Service Satisfaction Questionnaire for Food Service Staff of Aged Care Homes.

    Science.gov (United States)

    Miller, M; Hamilton, J; Scupham, R; Matwiejczyk, L; Prichard, I; Farrer, O; Yaxley, A

    2018-01-01

    Food service staff are integral to delivery of quality food in aged care homes yet measurement of their satisfaction is unable to be performed due to an absence of a valid and reliable questionnaire. The aim of this study was to develop and perform psychometric testing for a new Food Service Satisfaction Questionnaire developed in Australia specifically for use by food service staff working in residential aged care homes (Flinders FSSQFSAC). A mixed methods design utilizing both a qualitative (in-depth interviews, focus groups) and a quantitative approach (cross sectional survey) was used. Content validity was determined from focus groups and interviews with food service staff currently working in aged care homes, related questionnaires from the literature and consultation with an expert panel. The questionnaire was tested for construct validity and internal consistency using data from food service staff currently working in aged care homes that responded to an electronic invitation circulated to Australian aged care homes using a national database of email addresses. Construct validity was tested via principle components analysis and internal consistency through Cronbach's alpha. Temporal stability of the questionnaire was determined from food service staff undertaking the Flinders FSSQFSAC on two occasions, two weeks apart, and analysed using Pearson's correlations. Content validity for the Flinders FSSQFSAC was established from a panel of experts and stakeholders. Principle components analysis revealed food service staff satisfaction was represented by 61-items divided into eight domains: job satisfaction (α=0.832), food quality (α=0.871), staff training (α=0.922), consultation (α=0.840), eating environment (α=0.777), reliability (α=0.695), family expectations (α=0.781) and resident relationships (α=0.429), establishing construct validity in all domains, and internal consistency in all (α>0.5) except for "resident relationships" (α=0.429). Test

  4. Measuring patients’ satisfaction with their anti-TNF treatment in severe Crohn’s disease: scoring and psychometric validation of the Satisfaction for PAtients in Crohn’s diseasE Questionnaire (SPACE-Q©

    Directory of Open Access Journals (Sweden)

    Gilet H

    2014-12-01

    Full Text Available Hélène Gilet,1 Benoit Arnould,1 Fatoumata Fofana,1 Pierre Clerson,2 Jean-Frédéric Colombel,10 Olivier D’Hondt,2 Patrick Faure,4 Hervé Hagège,5 Maria Nachury,3 Stéphane Nahon,6 Gilbert Tucat,7 Luc Vandromme,8 Ines Cazala-Telinge,9 Emmanuel Thibout9 1HEOR and Strategic Market Access, Mapi, Lyon, France; 2Orgamétrie, Roubaix, France; 3Hôpital Claude Huriez, Lille, France; 4Clinique Saint-Jean du Languedoc, Toulouse, France; 5Centre Hospitalier Intercommunal, Créteil, France; 6Centre Hospitalier Intercommunal, Le Raincy Montfermeil, France; 7Gastroenterologist, Private Clinical Practice, Paris, France; 8Gastroenterologist, Private Clinical Practice, Reims, France; 9Abbvie France, Rungis, France; 10Icahn School of Medicine at Mount Sinai, New York, NY, USA Background: Severe Crohn’s disease management includes anti-tumor necrosis factor (anti-TNF drugs that differ from early-stage treatments regarding efficacy, safety, and convenience. This study aimed to finalize and psychometrically validate the Satisfaction for PAtients in Crohn’s diseasE Questionnaire (SPACE-Q©, developed to measure satisfaction with anti-TNF treatment in patients with severe Crohn’s disease. Methods: A total of 279 patients with severe Crohn’s disease receiving anti-TNF therapy completed the SPACE-Q 62-item pilot version at inclusion and 12 and 13 weeks after first anti-TNF injection. The final SPACE-Q scoring was defined using multitrait and regression analyses and clinical relevance considerations. Psychometric validation included clinical validity against Harvey–Bradshaw score, concurrent validity against Treatment Satisfaction Questionnaire for Medication (TSQM, internal consistency reliability, test–retest reliability, and responsiveness against the patient global impression of change (PGIC.Results: Quality of completion was good (55%–67% of patients completed all items. Four items were removed from the questionnaire. Eleven scores were defined

  5. Online pre-race education improves test scores for volunteers at a marathon.

    Science.gov (United States)

    Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

    2017-09-01

    This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5 df = 53 p online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.

  6. Psychometric Properties of Brief Screening Tests for Alcohol Use Disorders during Pregnancy in Argentina.

    Science.gov (United States)

    López, Mariana Beatriz; Lichtenberger, Aldana; Conde, Karina; Cremonte, Mariana

    2017-07-01

    Background  Considering the physical, mental and behavioral problems related to fetal alcohol exposure, prenatal clinical guides suggest a brief evaluation of alcohol consumption during pregnancy to detect alcohol intake and to adjust interventions, if required. Even if any alcohol use should be considered risky during pregnancy, identifying women with alcohol use disorders is important because they could need a more specific intervention than simple advice to abstain. Most screening tests have been developed and validated in male populations and focused on the long-term consequences of heavy alcohol use, so they might be inappropriate to assess consumption in pregnant women. Objective  To analyze the internal reliability and validity of the alcohol screening instruments Alcohol Use Disorders Identification Test (AUDIT), Alcohol Use Disorders Identification Test - Consumption (AUDIT-C), Tolerance, Worried, Eye-Opener, Amnesia and Cut-Down (TWEAK), Rapid Alcohol Problems Screen - Quantity Frequency (RAPS-QF) and Tolerance, Annoyed, Cut-Down and Eye-Opener (T-ACE) to identify alcohol use disorders in pregnant women. Methods  A total of 641 puerperal women were personally interviewed during the 48 hours after delivery. The receiver operating characteristics (ROC) curves and the sensitivity and specificity of each instrument using different cut-off points were analyzed. Results  All instruments showed areas under the ROC curves above 0.80. Larger areas were found for the TWEAK and the AUDIT. The TWEAK, the T-ACE and the AUDIT-C showed higher sensitivity, while the AUDIT and the RAPS-QF showed higher specificity. Reliability (internal consistency) was low for all instruments, improving when optimal cut-off points were used, especially for the AUDIT, the AUDIT-C and the RAPS-QF. Conclusions  In other cultural contexts, studies have concluded that T-ACE and TWEAK are the best instruments to assess pregnant women. In contrast, our results evidenced the low

  7. A call for policy guidance on psychometric testing in doping control in sport

    DEFF Research Database (Denmark)

    Petróczi, Andrea; Backhouse, Susan H.; Barkoukis, Vassilis

    2015-01-01

    research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. At the same time, we must acknowledge that socially desirable responding...... guidance aims to protect the global athletic community against social, ethical and legal consequences from potential misuse of psychological tests, including applications as forensic diagnostic tools in both practice and research.......One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping...

  8. Modeling Floor Effects in Standardized Vocabulary Test Scores in a Sample of Low SES Hispanic Preschool Children under the Multilevel Structural Equation Modeling Framework

    Directory of Open Access Journals (Sweden)

    Leina Zhu

    2017-12-01

    Full Text Available Researchers and practitioners often use standardized vocabulary tests such as the Peabody Picture Vocabulary Test-4 (PPVT-4; Dunn and Dunn, 2007 and its companion, the Expressive Vocabulary Test-2 (EVT-2; Williams, 2007, to assess English vocabulary skills as an indicator of children's school readiness. Despite their psychometric excellence in the norm sample, issues arise when standardized vocabulary tests are used to asses children from culturally, linguistically and ethnically diverse backgrounds (e.g., Spanish-speaking English language learners or delayed in some manner. One of the biggest challenges is establishing the appropriateness of these measures with non-English or non-standard English speaking children as often they score one to two standard deviations below expected levels (e.g., Lonigan et al., 2013. This study re-examines the issues in analyzing the PPVT-4 and EVT-2 scores in a sample of 4-to-5-year-old low SES Hispanic preschool children who were part of a larger randomized clinical trial on the effects of a supplemental English shared-reading vocabulary curriculum (Pollard-Durodola et al., 2016. It was found that data exhibited strong floor effects and the presence of floor effects made it difficult to differentiate the invention group and the control group on their vocabulary growth in the intervention. A simulation study is then presented under the multilevel structural equation modeling (MSEM framework and results revealed that in regular multilevel data analysis, ignoring floor effects in the outcome variables led to biased results in parameter estimates, standard error estimates, and significance tests. Our findings suggest caution in analyzing and interpreting scores of ethnically and culturally diverse children on standardized vocabulary tests (e.g., floor effects. It is recommended appropriate analytical methods that take into account floor effects in outcome variables should be considered.

  9. Opportunity to learn: Investigating possible predictors for pre-course Test Of Astronomy STandards TOAST scores

    Science.gov (United States)

    Berryhill, Katie J.

    As astronomy education researchers become more interested in experimentally testing innovative teaching strategies to enhance learning in introductory astronomy survey courses ("ASTRO 101"), scholars are placing increased attention toward better understanding factors impacting student gain scores on the widely used Test Of Astronomy STandards (TOAST). Usually used in a pre-test and post-test research design, one might naturally assume that the pre-course differences observed between high- and low-scoring college students might be due in large part to their pre-existing motivation, interest, experience in science, and attitudes about astronomy. To explore this notion, 11 non-science majoring undergraduates taking ASTRO 101 at west coast community colleges were interviewed in the first few weeks of the course to better understand students' pre-existing affect toward learning astronomy with an eye toward predicting student success. In answering this question, we hope to contribute to our understanding of the incoming knowledge of students taking undergraduate introductory astronomy classes, but also gain insight into how faculty can best meet those students' needs and assist them in achieving success. Perhaps surprisingly, there was only weak correlation between students' motivation toward learning astronomy and their pre-test scores. Instead, the most fruitful predictor of TOAST pre-test scores was the quantity of pre-existing, informal, self-directed astronomy learning experiences.

  10. Gender Gaps in High School GPA and ACT Scores: High School Grade Point Average and ACT Test Score by Subject and Gender. Information Brief 2014-12

    Science.gov (United States)

    ACT, Inc., 2014

    2014-01-01

    Female students who graduated from high school in 2013 averaged higher grades than their male counterparts in all subjects, but male graduates earned higher scores on the math and science sections of the ACT. This information brief looks at high school grade point average and ACT test score by subject and gender

  11. The Alliance Negotiation Scale: A psychometric investigation.

    Science.gov (United States)

    Doran, Jennifer M; Safran, Jeremy D; Muran, J Christopher

    2016-08-01

    This study investigates the utility and psychometric properties of a new measure of psychotherapy process, the Alliance Negotiation Scale (ANS; Doran, Safran, Waizmann, Bolger, & Muran, 2012). The ANS was designed to operationalize the theoretical construct of negotiation (Safran & Muran, 2000), and to extend our current understanding of the working alliance concept (Bordin, 1979). The ANS was also intended to improve upon existing measures such as the Working Alliance Inventory (WAI; Horvath & Greenberg, 1986, 1989) and its short form (WAI-S; Tracey & Kokotovic, 1989) by expanding the emphasis on negative therapy process. The present study investigates the psychometric validity of the ANS test scores and interpretation-including confirming its original factor structure and evaluating its internal consistency and construct validity. Construct validity was examined through the ANS' convergence and divergence with several existing scales that measure theoretically related constructs. The results bolster and extend previous findings about the psychometric integrity of the ANS, and begin to illuminate the relationship between negotiation and other important variables in psychotherapy research. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  12. Combination of classical test theory (CTT) and item response theory (IRT) analysis to study the psychometric properties of the French version of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF).

    Science.gov (United States)

    Bourion-Bédès, Stéphanie; Schwan, Raymund; Epstein, Jonathan; Laprevote, Vincent; Bédès, Alex; Bonnet, Jean-Louis; Baumann, Cédric

    2015-02-01

    The study aimed to examine the construct validity and reliability of the Quality of Life Enjoyment and Satisfaction Questionnaire-Short Form (Q-LES-Q-SF) according to both classical test and item response theories. The psychometric properties of the French version of this instrument were investigated in a cross-sectional, multicenter study. A total of 124 outpatients with a substance dependence diagnosis participated in the study. Psychometric evaluation included descriptive analysis, internal consistency, test-retest reliability, and validity. The dimensionality of the instrument was explored using a combination of the classical test, confirmatory factor analysis (CFA), and an item response theory analysis, the Person Separation Index (PSI), in a complementary manner. The results of the Q-LES-Q-SF revealed that the questionnaire was easy to administer and the acceptability was good. The internal consistency and the test-retest reliability were 0.9 and 0.88, respectively. All items were significantly correlated with the total score and the SF-12 used in the study. The CFA with one factor model was good, and for the unidimensional construct, the PSI was found to be 0.902. The French version of the Q-LES-Q-SF yielded valid and reliable clinical assessments of the quality of life for future research and clinical practice involving French substance abusers. In response to recent questioning regarding the unidimensionality or bidimensionality of the instrument and according to the underlying theoretical unidimensional construct used for its development, this study suggests the Q-LES-Q-SF as a one-dimension questionnaire in French QoL studies.

  13. Measuring Sexual Motives: A Test of the Psychometric Properties of the Sexual Motivations Scale.

    Science.gov (United States)

    Jardin, Charles; Garey, Lorra; Zvolensky, Michael J

    2017-01-01

    Sexual motives refer to functions served by sexual behavior. The Sex Motivations Scale (SMS) has frequently been used to assess sexual motives. At its development, the SMS demonstrated good internal consistency; convergent, divergent, and criterion validity; and configural invariance across sex, age, and Caucasians and African Americans. Yet the metric and scalar invariance of the SMS has not been examined, nor has the measurement invariance of the SMS across Hispanic and Asian Americans, sexual minority status, and relationship status been tested. The criterion validity of the SMS also has yet to be examined for nonintercourse sexual behaviors, such as sexting. The present study aimed to address these gaps in a diverse sample of 2,201 college students (77.60% female; M age  = 22.06; 27.84% Caucasian). Results further affirmed the configural, metric, and scalar invariance of the SMS. The convergent and divergent validity of the SMS was supported in relation to positive and negative affect and attachment patterns; and specific SMS subscales demonstrated associations with sexual intercourse behaviors and sexting, supporting the criterion validity of the SMS. These findings suggest the relevance of the SMS in assessing sexual motives across diverse populations and behaviors.

  14. Raven’s test performance of sub-Saharan Africans: average performance, psychometric properties, and the Flynn Effect

    NARCIS (Netherlands)

    Wicherts, J.M.; Dolan, C.V.; Carlson, J.S.; van der Maas, H.L.J.

    2010-01-01

    This paper presents a systematic review of published data on the performance of sub-Saharan Africans on Raven's Progressive Matrices. The specific goals were to estimate the average level of performance, to study the Flynn Effect in African samples, and to examine the psychometric meaning of Raven's

  15. A Psychometric Review of Measures Assessing Discrimination Against Sexual Minorities.

    Science.gov (United States)

    Morrison, Todd G; Bishop, C J; Morrison, Melanie A; Parker-Taneo, Kandice

    2016-08-01

    Discrimination against sexual minorities is widespread and has deleterious consequences on victims' psychological and physical wellbeing. However, a review of the psychometric properties of instruments measuring lesbian, gay, and bisexual (LGB) discrimination has not been conducted. The results of this review, which involved evaluating 162 articles, reveal that most have suboptimal psychometric properties. Specifically, myriad scales possess questionable content validity as (1) items are not created in collaboration with sexual minorities; (2) measures possess a small number of items and, thus, may not sufficiently represent the domain of interest; and (3) scales are "adapted" from measures designed to examine race- and gender-based discrimination. Additional limitations include (1) summed scores are computed, often in the absence of scale score reliability metrics; (2) summed scores operate from the questionable assumption that diverse forms of discrimination are necessarily interrelated; (3) the dimensionality of instruments presumed to consist of subscales is seldom tested; (4) tests of criterion-related validity are routinely omitted; and (5) formal tests of measures' construct validity are seldom provided, necessitating that one infer validity based on the results obtained. The absence of "gold standard" measures, the attendant difficulty in formulating a coherent picture of this body of research, and suggestions for psychometric improvements are noted.

  16. Associations of maximal strength and muscular endurance test scores with cardiorespiratory fitness and body composition.

    Science.gov (United States)

    Vaara, Jani P; Kyröläinen, Heikki; Niemi, Jaakko; Ohrankämmen, Olli; Häkkinen, Arja; Kocay, Sheila; Häkkinen, Keijo

    2012-08-01

    The purpose of the present study was to assess the relationships between maximal strength and muscular endurance test scores additionally to previously widely studied measures of body composition and maximal aerobic capacity. 846 young men (25.5 ± 5.0 yrs) participated in the study. Maximal strength was measured using isometric bench press, leg extension and grip strength. Muscular endurance tests consisted of push-ups, sit-ups and repeated squats. An indirect graded cycle ergometer test was used to estimate maximal aerobic capacity (V(O2)max). Body composition was determined with bioelectrical impedance. Moreover, waist circumference (WC) and height were measured and body mass index (BMI) calculated. Maximal bench press was positively correlated with push-ups (r = 0.61, p strength (r = 0.34, p strength correlated positively (r = 0.36-0.44, p test scores were related to maximal aerobic capacity and body fat content, while fat free mass was associated with maximal strength test scores and thus is a major determinant for maximal strength. A contributive role of maximal strength to muscular endurance tests could be identified for the upper, but not the lower extremities. These findings suggest that push-up test is not only indicative of body fat content and maximal aerobic capacity but also maximal strength of upper body, whereas repeated squat test is mainly indicative of body fat content and maximal aerobic capacity, but not maximal strength of lower extremities.

  17. Association between the gait pattern characteristics of older people and their two-step test scores.

    Science.gov (United States)

    Kobayashi, Yoshiyuki; Ogata, Toru

    2018-04-27

    The Two-Step test is one of three official tests authorized by the Japanese Orthopedic Association to evaluate the risk of locomotive syndrome (a condition of reduced mobility caused by an impairment of the locomotive organs). It has been reported that the Two-Step test score has a good correlation with one's walking ability; however, its association with the gait pattern of older people during normal walking is still unknown. Therefore, this study aims to clarify the associations between the gait patterns of older people observed during normal walking and their Two-Step test scores. We analyzed the whole waveforms obtained from the lower-extremity joint angles and joint moments of 26 older people in various stages of locomotive syndrome using principal component analysis (PCA). The PCA was conducted using a 260 × 2424 input matrix constructed from the participants' time-normalized pelvic and right-lower-limb-joint angles along three axes (ten trials of 26 participants, 101 time points, 4 angles, 3 axes, and 2 variable types per trial). The Pearson product-moment correlation coefficient between the scores of the principal component vectors (PCVs) and the scores of the Two-Step test revealed that only one PCV (PCV 2) among the 61 obtained relevant PCVs is significantly related to the score of the Two-Step test. We therefore concluded that the joint angles and joint moments related to PCV 2-ankle plantar-flexion, ankle plantar-flexor moments during the late stance phase, ranges of motion and moments on the hip, knee, and ankle joints in the sagittal plane during the entire stance phase-are the motions associated with the Two-Step test.

  18. Robust joint score tests in the application of DNA methylation data analysis.

    Science.gov (United States)

    Li, Xuan; Fu, Yuejiao; Wang, Xiaogang; Qiu, Weiliang

    2018-05-18

    Recently differential variability has been showed to be valuable in evaluating the association of DNA methylation to the risks of complex human diseases. The statistical tests based on both differential methylation level and differential variability can be more powerful than those based only on differential methylation level. Anh and Wang (2013) proposed a joint score test (AW) to simultaneously detect for differential methylation and differential variability. However, AW's method seems to be quite conservative and has not been fully compared with existing joint tests. We proposed three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM, and have made extensive comparisons with the joint likelihood ratio test (jointLRT), the Kolmogorov-Smirnov (KS) test, and the AW test. Systematic simulation studies showed that: 1) the three improved tests performed better (i.e., having larger power, while keeping nominal Type I error rates) than the other three tests for data with outliers and having different variances between cases and controls; 2) for data from normal distributions, the three improved tests had slightly lower power than jointLRT and AW. The analyses of two Illumina HumanMethylation27 data sets GSE37020 and GSE20080 and one Illumina Infinium MethylationEPIC data set GSE107080 demonstrated that three improved tests had higher true validation rates than those from jointLRT, KS, and AW. The three proposed joint score tests are robust against the violation of normality assumption and presence of outlying observations in comparison with other three existing tests. Among the three proposed tests, iAW.BF seems to be the most robust and effective one for all simulated scenarios and also in real data analyses.

  19. IQs Are Very Strong but Imperfect Indicators of Psychometric "g": Results from Joint Confirmatory Factor Analysis

    Science.gov (United States)

    Farmer, Ryan L.; Floyd, Randy G.; Reynolds, Matthew R.; Kranzler, John H.

    2014-01-01

    The most global score yielded by intelligence tests, IQs, are supported by substantial validity evidence and have historically been central to the identification of intellectual disabilities, learning disabilities, and giftedness. This study examined the extent to which IQs measure the ability they target, psychometric "g." Data from…

  20. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    OpenAIRE

    Omid Talebi Rezaabadi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening Anxiety Scale and a newly developed Foreign Language Social Anxiety Scale. The potential correlation between social anxiety and listening-test perfor...

  1. Integrating GIS in the Middle School Curriculum: Impacts on Diverse Students' Standardized Test Scores

    Science.gov (United States)

    Goldstein, Donna; Alibrandi, Marsha

    2013-01-01

    This case study conducted with 1,425 middle school students in Palm Beach County, Florida, included a treatment group receiving GIS instruction (256) and a control group without GIS instruction (1,169). Quantitative analyses on standardized test scores indicated that inclusion of GIS in middle school curriculum had a significant effect on student…

  2. Intelligence Test Scores and Birth Order among Young Norwegian Men (Conscripts) Analyzed within and between Families

    Science.gov (United States)

    Bjerkedal, Tor; Kristensen, Petter; Skjeret, Geir A.; Brevik, John I.

    2007-01-01

    The present paper reports the results of a within and between family analysis of the relation between birth order and intelligence. The material comprises more than a quarter of a million test scores for intellectual performance of Norwegian male conscripts recorded during 1984-2004. Conscripts, mostly 18-19 years of age, were born to women for…

  3. International Test Score Comparisons and Educational Policy: A Review of the Critiques

    Science.gov (United States)

    Carnoy, Martin

    2015-01-01

    Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…

  4. Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests

    Science.gov (United States)

    Almond, Russell G.

    2014-01-01

    Assessments consisting of only a few extended constructed response items (essays) are not typically equated using anchor test designs as there are typically too few essay prompts in each form to allow for meaningful equating. This article explores the idea that output from an automated scoring program designed to measure writing fluency (a common…

  5. Changes in Student Populations and Average Test Scores of Dutch Primary Schools

    Science.gov (United States)

    Luyten, Hans; de Wolf, Inge

    2011-01-01

    This article focuses on the relation between student population characteristics and average test scores per school in the final grade of primary education from a dynamic perspective. Aggregated data of over 5,000 Dutch primary schools covering a 6-year period were used to study the relation between changes in school populations and shifts in mean…

  6. Evaluation of Two Methods for Modeling Measurement Errors When Testing Interaction Effects with Observed Composite Scores

    Science.gov (United States)

    Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C.

    2018-01-01

    Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…

  7. Comprehensive School Reform and Standardized Test Scores in Illinois Elementary and Middle Schools

    Science.gov (United States)

    McEnroe, James D.

    2010-01-01

    The study examined the effects of the federally funded Comprehensive School Reform (CSR) program on student performance on mandated standardized tests. The study focused on the mathematics and reading scores of Illinois public elementary and middle and junior high school students. The federal CSR program provided Illinois schools with an annual…

  8. Test Score Gaps between Private and Government Sector Students at School Entry Age in India

    Science.gov (United States)

    Singh, Abhijeet

    2014-01-01

    Various studies have noted that students enrolled in private schools in India perform better on average than students in government schools. In this paper, I show that large gaps in the test scores of children in private and public sector education are evident even at the point of initial enrollment in formal schooling and are associated with…

  9. Classroom Organizational Structure in Fifth Grade Math Classrooms and the Effect on Standardized Test Scores

    Science.gov (United States)

    Lane, Dallas Marie

    2017-01-01

    The purpose of this study was to determine if there is a relationship between the classroom organizational structure and MCT2 test scores of fifth-grade math students. The researcher gained insight regarding which structure teachers believe is most beneficial to them and students, and whether or not their belief of classroom organizational…

  10. Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

    Science.gov (United States)

    Flug, Susanna

    2010-01-01

    In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take…

  11. Validating Score Interpretations and Uses: Messick Lecture, Language Testing Research Colloquium, Cambridge, April 2010

    Science.gov (United States)

    Kane, Michael

    2012-01-01

    The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…

  12. Virginia tech freshman class becoming more competitive; Rise in grades and test scores noted

    OpenAIRE

    Virginia Tech News

    2004-01-01

    Admission to Virginia Tech continues to become more competitive as applicants report higher grade point averages and test scores than previous years. The incoming class of 4,975 students has an average grade point average (GPA) of 3.68 and SAT 1203, up from 3.60 GPA and 1197 SAT in 2003.

  13. Specific algorithm method of scoring the Clock Drawing Test applied in cognitively normal elderly

    Directory of Open Access Journals (Sweden)

    Liana Chaves Mendes-Santos

    Full Text Available The Clock Drawing Test (CDT is an inexpensive, fast and easily administered measure of cognitive function, especially in the elderly. This instrument is a popular clinical tool widely used in screening for cognitive disorders and dementia. The CDT can be applied in different ways and scoring procedures also vary. OBJECTIVE: The aims of this study were to analyze the performance of elderly on the CDT and evaluate inter-rater reliability of the CDT scored by using a specific algorithm method adapted from Sunderland et al. (1989. METHODS: We analyzed the CDT of 100 cognitively normal elderly aged 60 years or older. The CDT ("free-drawn" and Mini-Mental State Examination (MMSE were administered to all participants. Six independent examiners scored the CDT of 30 participants to evaluate inter-rater reliability. RESULTS AND CONCLUSION: A score of 5 on the proposed algorithm ("Numbers in reverse order or concentrated", equivalent to 5 points on the original Sunderland scale, was the most frequent (53.5%. The CDT specific algorithm method used had high inter-rater reliability (p<0.01, and mean score ranged from 5.06 to 5.96. The high frequency of an overall score of 5 points may suggest the need to create more nuanced evaluation criteria, which are sensitive to differences in levels of impairment in visuoconstructive and executive abilities during aging.

  14. The Relationships between Social Class, Listening Test Anxiety and Test Scores

    Science.gov (United States)

    Rezaabadi, Omid Talebi

    2016-01-01

    This study investigated the relationships between the social anxiety, social class and listening-test anxiety of students learning English as a foreign language. The aims of the study were to examine the relationship between listening-test anxiety and listening-test performance. The data were collected using an adapted Foreign Language Listening…

  15. Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

    Directory of Open Access Journals (Sweden)

    Ulla Haverinen-Shaughnessy

    Full Text Available Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms from Southwestern United States, and student level data (N = 3109 on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person. The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points were increased by up to eleven points (0.5% per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points. There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points. Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.

  16. Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

    Science.gov (United States)

    Haverinen-Shaughnessy, Ulla; Shaughnessy, Richard J

    2015-01-01

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.

  17. Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

    Science.gov (United States)

    2015-01-01

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643

  18. Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease

    Directory of Open Access Journals (Sweden)

    Elaheh Moradi

    2017-01-01

    Full Text Available Rey's Auditory Verbal Learning Test (RAVLT is a powerful neuropsychological tool for testing episodic memory, which is widely used for the cognitive assessment in dementia and pre-dementia conditions. Several studies have shown that an impairment in RAVLT scores reflect well the underlying pathology caused by Alzheimer's disease (AD, thus making RAVLT an effective early marker to detect AD in persons with memory complaints. We investigated the association between RAVLT scores (RAVLT Immediate and RAVLT Percent Forgetting and the structural brain atrophy caused by AD. The aim was to comprehensively study to what extent the RAVLT scores are predictable based on structural magnetic resonance imaging (MRI data using machine learning approaches as well as to find the most important brain regions for the estimation of RAVLT scores. For this, we built a predictive model to estimate RAVLT scores from gray matter density via elastic net penalized linear regression model. The proposed approach provided highly significant cross-validated correlation between the estimated and observed RAVLT Immediate (R = 0.50 and RAVLT Percent Forgetting (R = 0.43 in a dataset consisting of 806 AD, mild cognitive impairment (MCI or healthy subjects. In addition, the selected machine learning method provided more accurate estimates of RAVLT scores than the relevance vector regression used earlier for the estimation of RAVLT based on MRI data. The top predictors were medial temporal lobe structures and amygdala for the estimation of RAVLT Immediate and angular gyrus, hippocampus and amygdala for the estimation of RAVLT Percent Forgetting. Further, the conversion of MCI subjects to AD in 3-years could be predicted based on either observed or estimated RAVLT scores with an accuracy comparable to MRI-based biomarkers.

  19. Lower Quarter Y-Balance Test Scores and Lower Extremity Injury in NCAA Division I Athletes.

    Science.gov (United States)

    Lai, Wilson C; Wang, Dean; Chen, James B; Vail, Jeremy; Rugg, Caitlin M; Hame, Sharon L

    2017-08-01

    Functional movement tests that are predictive of injury risk in National Collegiate Athletic Association (NCAA) athletes are useful tools for sports medicine professionals. The Lower Quarter Y-Balance Test (YBT-LQ) measures single-leg balance and reach distances in 3 directions. To assess whether the YBT-LQ predicts the laterality and risk of sports-related lower extremity (LE) injury in NCAA athletes. Case-control study; Level of evidence, 3. The YBT-LQ was administered to 294 NCAA Division I athletes from 21 sports during preparticipation physical examinations at a single institution. Athletes were followed prospectively over the course of the corresponding season. Correlation analysis was performed between the laterality of reach asymmetry and composite scores (CS) versus the laterality of injury. Receiver operating characteristic (ROC) analysis was used to determine the optimal asymmetry cutoff score for YBT-LQ. A multivariate regression analysis adjusting for sex, sport type, body mass index, and history of prior LE surgery was performed to assess predictors of earlier and higher rates of injury. Neither the laterality of reach asymmetry nor the CS correlated with the laterality of injury. ROC analysis found optimal cutoff scores of 2, 9, and 3 cm for anterior, posteromedial, and posterolateral reach, respectively. All of these potential cutoff scores, along with a cutoff score of 4 cm used in the majority of prior studies, were associated with poor sensitivity and specificity. Furthermore, none of the asymmetric cutoff scores were associated with earlier or increased rate of injury in the multivariate analyses. YBT-LQ scores alone do not predict LE injury in this collegiate athlete population. Sports medicine professionals should be cautioned against using the YBT-LQ alone to screen for injury risk in collegiate athletes.

  20. Effects of correcting for prematurity on cognitive test scores in childhood.

    Science.gov (United States)

    Wilson-Ching, Michelle; Pascoe, Leona; Doyle, Lex W; Anderson, Peter J

    2014-03-01

    The American Academy of Pediatrics recommends that test scores should be corrected for prematurity up to 3 years of age, but this practice varies greatly in both clinical and research settings. The aim of this study was to contrast the effects of using chronological age and those of using corrected age on measures of cognitive outcome across childhood. A theoretical model was constructed using norms from the Bayley Scales of Infant and Toddler Development, Third Edition; the Wechsler Preschool and Primary Scale of Intelligence, Third Edition Australian; and the Wechsler Intelligence Scales for Children, Fourth Edition Australian. Baseline scores representing different levels of functioning (70, below average; 85, borderline; and 100, average) were recalculated using the normative data for ages 6 months to 16 years to account for 1, 2, 3 and 4 months of prematurity. The model created depicted the difference in standardised scores between chronological and corrected age. Compared with scores corrected for prematurity, the absolute reduction in scores using chronological age was greater for increasing degree of prematurity, younger ages at assessment and higher baseline scores and was substantial even beyond 3 years of age. However, the pattern was erratic, with considerable fluctuation evident across different ages and baseline scores. Chronological age results in a lowering of scores at all ages for preterm-born subjects that is greater in the first few years and in those born at earlier gestational ages. Whether or not to correct for prematurity depends upon the context of the assessment. © 2014 The Authors. Journal of Paediatrics and Child Health © 2014 Paediatrics and Child Health Division (Royal Australasian College of Physicians).

  1. Identifying genetic marker sets associated with phenotypes via an efficient adaptive score test

    KAUST Repository

    Cai, T.

    2012-06-25

    In recent years, genome-wide association studies (GWAS) and gene-expression profiling have generated a large number of valuable datasets for assessing how genetic variations are related to disease outcomes. With such datasets, it is often of interest to assess the overall effect of a set of genetic markers, assembled based on biological knowledge. Genetic marker-set analyses have been advocated as more reliable and powerful approaches compared with the traditional marginal approaches (Curtis and others, 2005. Pathways to the analysis of microarray data. TRENDS in Biotechnology 23, 429-435; Efroni and others, 2007. Identification of key processes underlying cancer phenotypes using biologic pathway analysis. PLoS One 2, 425). Procedures for testing the overall effect of a marker-set have been actively studied in recent years. For example, score tests derived under an Empirical Bayes (EB) framework (Liu and others, 2007. Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models. Biometrics 63, 1079-1088; Liu and others, 2008. Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC bioinformatics 9, 292-2; Wu and others, 2010. Powerful SNP-set analysis for case-control genome-wide association studies. American Journal of Human Genetics 86, 929) have been proposed as powerful alternatives to the standard Rao score test (Rao, 1948. Large sample tests of statistical hypotheses concerning several parameters with applications to problems of estimation. Mathematical Proceedings of the Cambridge Philosophical Society, 44, 50-57). The advantages of these EB-based tests are most apparent when the markers are correlated, due to the reduction in the degrees of freedom. In this paper, we propose an adaptive score test which up- or down-weights the contributions from each member of the marker-set based on the Z-scores of

  2. Effect on intelligence test score of prenatal exposure to ionizing radiation in Hiroshima and Nagasaki

    International Nuclear Information System (INIS)

    Schull, W.J.; Otake, Masanori; Yoshimaru, Hiroshi.

    1988-10-01

    Analyses of intelligence test scores (Koga) at 10-11 years of age of individuals exposed prenatally to the atomic bombing of Hiroshima and Nagasaki using estimates of the uterine absorbed dose based on the recently introduced system of dosimetry, the Dosimetry System 1986 (DS86), reveal the following: 1) there is no evidence of a radiation-related effect on intelligence among those individuals exposed within 0-7 weeks after fertilization or in the 26th or subsequent weeks; 2) for individuals exposed at 8-15 weeks after fertilization, and to a lesser extent those exposed at 16-25 weeks, the mean tests scores but not the variances are significantly heterogeneous among exposure categories; 3) the cumulative distribution of test scores suggests a progressive shift downwards in individual scores with increasing exposure; and 4) within the group most sensitive to the occurrence of clinically recognizable severe mental retardation, individuals exposed 8 through 15 weeks after fertilization, the regression of intelligence score on estimated DS86 uterine absorbed dose is more linear than with T65DR fetal dose, the diminution in intelligence score under the linear model is 21-29 points at 1Gy. The effect is somewhat greater when the controls receiving less than 0.01 Gy are excluded, 24-33 points at 1 Gy. These findings are discussed in the light of the earlier analysis of the frequency of occurrence of mental retardation among the prenatally exposed survivors of the A-bombing of Hiroshima and Nagasaki. It is suggested that both are the consequences of the same underlying biological process or processes. (author)

  3. A Study on Variables that Affect Class Scores of Primary Education Students in Placement Test

    OpenAIRE

    Yavuz, Mustafa

    2010-01-01

    This study aims to determine the variables that predict class scores which are obtained by adding 70 % of the Placement Test (PT) scores of the primary education sixth and seventh grade students who took it for the first time in the 2007-2008 academic year within the framework of the system of passing to secondary education reorganized by the MNE, 25 % of their end-of-the-year passing grades. The study is of general survey model. The study group consists of students who took the PT in the 200...

  4. ACER Mathematics Profile Series: Number Test. (Test Booklet, Answer and Record Sheet, Score Key, and Teachers Handbook).

    Science.gov (United States)

    Cornish, Greg; Wines, Robin

    The Number Test of the ACER Mathematics Profile Series, contains 30 items, for each of three suggested grade levels: 7-8, 8-9, and 9-10. Raw scores on all tests in the ACER Mathematics Profile Series (Number, Operations, Space and Measurement) are converted to a common scale called MAPS, a major feature of the Series. Based on the Rasch Model,…

  5. Association of Health Sciences Reasoning Test scores with academic and experiential performance.

    Science.gov (United States)

    Cox, Wendy C; McLaughlin, Jacqueline E

    2014-05-15

    To assess the association of scores on the Health Sciences Reasoning Test (HSRT) with academic and experiential performance in a doctor of pharmacy (PharmD) curriculum. The HSRT was administered to 329 first-year (P1) PharmD students. Performance on the HSRT and its subscales was compared with academic performance in 29 courses throughout the curriculum and with performance in advanced pharmacy practice experiences (APPEs). Significant positive correlations were found between course grades in 8 courses and HSRT overall scores. All significant correlations were accounted for by pharmaceutical care laboratory courses, therapeutics courses, and a law and ethics course. There was a lack of moderate to strong correlation between HSRT scores and academic and experiential performance. The usefulness of the HSRT as a tool for predicting student success may be limited.

  6. Effects of Public Preschool Expenditures on the Test Scores of 4th Graders: Evidence from TIMSS

    Science.gov (United States)

    Waldfogel, Jane; Zhai, Fuhua

    2011-01-01

    This study examines the effects of public preschool expenditures on the math and science scores of 4th graders, holding constant child, family, and school characteristics, other relevant social expenditures, and country and year effects, in seven Organization for Economic Co-operation and Development (OECD) countries -- Australia, Japan, Netherlands, New Zealand, Norway, U.K., and U.S -- using data from the 1995 and 2003 Trends in International Mathematics and Science Study (TIMSS). Our results indicate that there are small but significant positive effects of public preschool expenditures on the math and science scores of 4th graders and preschool expenditures reduce the risk of children scoring at the low level of proficiency. We also find some evidence that children from low-resource homes and homes where the test language is not always spoken may tend to gain more from increased public preschool expenditures than other children,. PMID:21442008

  7. Effects of Public Preschool Expenditures on the Test Scores of 4 Graders: Evidence from TIMSS.

    Science.gov (United States)

    Waldfogel, Jane; Zhai, Fuhua

    2008-02-01

    This study examines the effects of public preschool expenditures on the math and science scores of 4(th) graders, holding constant child, family, and school characteristics, other relevant social expenditures, and country and year effects, in seven Organization for Economic Co-operation and Development (OECD) countries -- Australia, Japan, Netherlands, New Zealand, Norway, U.K., and U.S -- using data from the 1995 and 2003 Trends in International Mathematics and Science Study (TIMSS). Our results indicate that there are small but significant positive effects of public preschool expenditures on the math and science scores of 4(th) graders and preschool expenditures reduce the risk of children scoring at the low level of proficiency. We also find some evidence that children from low-resource homes and homes where the test language is not always spoken may tend to gain more from increased public preschool expenditures than other children,.

  8. Testing statistical significance scores of sequence comparison methods with structure similarity

    Directory of Open Access Journals (Sweden)

    Leunissen Jack AM

    2006-10-01

    Full Text Available Abstract Background In the past years the Smith-Waterman sequence comparison algorithm has gained popularity due to improved implementations and rapidly increasing computing power. However, the quality and sensitivity of a database search is not only determined by the algorithm but also by the statistical significance testing for an alignment. The e-value is the most commonly used statistical validation method for sequence database searching. The CluSTr database and the Protein World database have been created using an alternative statistical significance test: a Z-score based on Monte-Carlo statistics. Several papers have described the superiority of the Z-score as compared to the e-value, using simulated data. We were interested if this could be validated when applied to existing, evolutionary related protein sequences. Results All experiments are performed on the ASTRAL SCOP database. The Smith-Waterman sequence comparison algorithm with both e-value and Z-score statistics is evaluated, using ROC, CVE and AP measures. The BLAST and FASTA algorithms are used as reference. We find that two out of three Smith-Waterman implementations with e-value are better at predicting structural similarities between proteins than the Smith-Waterman implementation with Z-score. SSEARCH especially has very high scores. Conclusion The compute intensive Z-score does not have a clear advantage over the e-value. The Smith-Waterman implementations give generally better results than their heuristic counterparts. We recommend using the SSEARCH algorithm combined with e-values for pairwise sequence comparisons.

  9. Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

    Science.gov (United States)

    Jacob, Brian A.

    2016-01-01

    Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…

  10. The Shame and Guilt Scales of the Test of Self-Conscious Affect-Adolescent (TOSCA-A): Psychometric Properties for Responses from Children, and Measurement Invariance Across Children and Adolescents

    Science.gov (United States)

    Watson, Shaun D.; Gomez, Rapson; Gullone, Eleonora

    2016-01-01

    This study examined various psychometric properties of the items comprising the shame and guilt scales of the Test of Self-Conscious Affect-Adolescent (TOSCA-A) in a group children between 8 and 11 years of age. A total of 699 children (367 females and 332 males) completed these scales, and also measures of depression and empathy. Confirmatory factor analysis (CFA) provided support for an oblique two-factor model, with the originally proposed shame and guilt items comprising shame and guilt factors, respectively. There was good internal consistency reliability for the shame and guilt scales, with omega coefficient values of 0.77 and 0.81 for shame and guilt, respectively. Also, shame correlated with depression symptoms positively (0.34, p Guilt correlated with depression symptoms negatively (-0.28, p guilt factors. Multiple-group CFA comparing this group of children with a separate group of adolescents (320 females and 242 males), based on the chi-square difference test, supported full metric invariance, the intercept invariance of 17 of the 30 shame and guilt items, and higher latent mean scores among children for both shame and guilt. The non-equivalency for intercepts and mean scores were of small effect sizes. Comparisons based on the difference in root mean squared error of approximation values supported full measurement invariance and no group difference for latent mean scores. The findings in the current study support the use of the TOSCA-A in children and the valid comparison of scores between children and adolescents, thereby opening up the possibility of evaluating change in the TOSCA-A shame and guilt factors over these developmental age groups. PMID:27242573

  11. A systematic review protocol investigating tests for physical or physiological qualities and game-specific skills commonly used in rugby and related sports and their psychometric properties.

    Science.gov (United States)

    Chiwaridzo, Matthew; Ferguson, Gillian D; Smits-Engelsman, Bouwien C M

    2016-07-27

    Scientific focus on rugby has increased over the recent years, providing evidence of the physical or physiological characteristics and game-specific skills needed in the sport. Identification of tests commonly used to measure these characteristics is important for the development of test batteries, which in turn may be used for talent identification and injury prevention programmes. Although there are a number of tests available in the literature to measure physical or physiological variables and game-specific skills, there is limited information available on the psychometric properties of the tests. Therefore, the purpose of this study is to systematically review the literature for tests commonly used in rugby to measure physical or physiological characteristics and rugby-specific skills, documenting evidence of reliability and validity of the identified tests. A systematic review will be conducted. Electronic databases such as Scopus, MEDLINE via EBSCOhost and PubMed, Academic Search Premier, CINAHL and Africa-Wide Information via EBSCOhost will be searched for original research articles published in English from January 1, 1995, to December 31, 2015, using a pre-defined search strategy. The principal investigator will select potentially relevant articles from titles and abstracts. To minimise bias, full text of titles and abstracts deemed potentially relevant will be retrieved and reviewed by two independent reviewers based on the inclusion criteria. Data extraction will be conducted by the principal investigator and verified by two independent reviewers. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) checklist will be used to assess the methodological quality of the selected studies. Choosing an appropriate test to be included in the screening test battery should be based on sound psychometric properties of the test available. This systematic review will provide an overview of the tests commonly used in rugby union

  12. Pediatric residents' learning styles and temperaments and their relationships to standardized test scores.

    Science.gov (United States)

    Tuli, Sanjeev Y; Thompson, Lindsay A; Saliba, Heidi; Black, Erik W; Ryan, Kathleen A; Kelly, Maria N; Novak, Maureen; Mellott, Jane; Tuli, Sonal S

    2011-12-01

    Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that 5 affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P  =  .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents.

  13. [Relationship between unipedal stance test score and center of pressure velocity in elderly].

    Science.gov (United States)

    Rodrigo Antonio, Guzmán; Rony, Silvestre; Francisco Aniceto, Rodríguez; David Andrés, Arriagada; Pablo Andrés, Ortega

    2011-01-01

    Frequent falls are one of the most important health problems in the elderly population. The unipedal stance test (UPST), asses postural stability and is used in fall risk measures. Despite this, there is little information about its relationship with posturographic parameters (PP) that characterizes postural stability. Center of pressure velocity (CoPV) is one of the best PP that describes postural stability. The aim of this study was to analyze the relation between UST score and CoPV in elderly population. A sample of 38 healthy elderly subjects where divided in two groups according to their UPST score, low performance (LP, n=11) and high performance (HP, n=27). The correlation between UPST score and COP mean velocity (CoPmV), recorded from a posturographic test, was analyzed between both groups. An inverse correlation between UPST score and CoPmV was found in both groups. However, this was higher in the LP group (r=-0.69, P=.02) compared to the HP (r=-0.39, P=.04). Based on the results of this investigation, it may be concluded that the achievement on UPST has an inverse relationship with CoPmV, especially in subjects with low performance in the UPST. Copyright © 2010 SEGG. Published by Elsevier Espana. All rights reserved.

  14. The effect of instructional methodology on high school students natural sciences standardized tests scores

    Science.gov (United States)

    Powell, P. E.

    Educators have recently come to consider inquiry based instruction as a more effective method of instruction than didactic instruction. Experience based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness on preparing students to perform well on standardized tests. The purpose of the study to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi experimental quantitative study was comprised of two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.

  15. Effect of Mindfulness Meditation on Perceived Stress Scores and Autonomic Function Tests of Pregnant Indian Women.

    Science.gov (United States)

    Muthukrishnan, Shobitha; Jain, Reena; Kohli, Sangeeta; Batra, Swaraj

    2016-04-01

    Various pregnancy complications like hypertension, preeclampsia have been strongly correlated with maternal stress. One of the connecting links between pregnancy complications and maternal stress is mind-body intervention which can be part of Complementary and Alternative Medicine (CAM). Biologic measures of stress during pregnancy may get reduced by such interventions. To evaluate the effect of Mindfulness meditation on perceived stress scores and autonomic function tests of pregnant Indian women. Pregnant Indian women of 12 weeks gestation were randomised to two treatment groups: Test group with Mindfulness meditation and control group with their usual obstetric care. The effect of Mindfulness meditation on perceived stress scores and cardiac sympathetic functions and parasympathetic functions (Heart rate variation with respiration, lying to standing ratio, standing to lying ratio and respiratory rate) were evaluated on pregnant Indian women. There was a significant decrease in perceived stress scores, a significant decrease of blood pressure response to cold pressor test and a significant increase in heart rate variability in the test group (pwomen. The results of this study suggest that mindfulness meditation improves parasympathetic functions in pregnant women and is a powerful modulator of the sympathetic nervous system during pregnancy.

  16. A knowledge-based theory of rising scores on "culture-free" tests.

    Science.gov (United States)

    Fox, Mark C; Mitchum, Ainsley L

    2013-08-01

    Secular gains in intelligence test scores have perplexed researchers since they were documented by Flynn (1984, 1987). Gains are most pronounced on abstract, so-called culture-free tests, prompting Flynn (2007) to attribute them to problem-solving skills availed by scientifically advanced cultures. We propose that recent-born individuals have adopted an approach to analogy that enables them to infer higher level relations requiring roles that are not intrinsic to the objects that constitute initial representations of items. This proposal is translated into item-specific predictions about differences between cohorts in pass rates and item-response patterns on the Raven's Matrices (Flynn, 1987), a seemingly culture-free test that registers the largest Flynn effect. Consistent with predictions, archival data reveal that individuals born around 1940 are less able to map objects at higher levels of relational abstraction than individuals born around 1990. Polytomous Rasch models verify predicted violations of measurement invariance, as raw scores are found to underestimate the number of analogical rules inferred by members of the earlier cohort relative to members of the later cohort who achieve the same overall score. The work provides a plausible cognitive account of the Flynn effect, furthers understanding of the cognition of matrix reasoning, and underscores the need to consider how test-takers select item responses. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  17. Psychometric properties of the Fagerström Test for Nicotine Dependence As propriedades psicométricas do Teste de Fagerström para Dependência de Nicotina

    Directory of Open Access Journals (Sweden)

    Izilda Carolina de Meneses-Gaya

    2009-01-01

    Full Text Available OBJECTIVE: The Fagerström Test for Nicotine Dependence (FTND is a screening instrument for physical nicotine dependence and is extensively used in various countries. The objective of the present report was to review articles related to the psychometric properties of the FTND. METHODS: A systematic search for articles published up through December of 2007 was carried out in various electronic databases. The following search terms were used: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; and "factors". We included articles published in English, Spanish or Portuguese and in which the psychometric properties of the FTND were evaluated. RESULTS: Twenty-six studies related to the psychometric properties of the FTND were identified in the indexed literature. Analysis of the studies confirmed the reliability of the FTND for the assessment of nicotine dependence in different settings and populations. CONCLUSIONS: Further validation studies using previously validated instruments as a comparative measure are needed before the extensive use of the FTND can be justified on the basis of its psychometric qualities.OBJETIVO: O Fagerström Test for Nicotine Dependence (FTND, Teste de Fagerström para Dependência de Nicotina é um instrumento de rastreamento para dependência física de tabaco, amplamente utilizado em diversos países. Objetivou-se realizar uma revisão de artigos relacionados às propriedades psicométricas do FTND. MÉTODOS: Uma busca sistemática foi realizada usando-se vários indexadores eletrônicos até dezembro de 2007, com os seguintes descritores: "Fagerström Test for Nicotine Dependence"; "FTND"; "psychometric"; "validity"; "reliability"; "feasibility"; e "factors". Foram incluídos os artigos relacionados à avaliação das propriedades psicométricas do FTND publicados em inglês, espanhol e português. RESULTADOS: Vinte e seis estudos relativos às propriedades psicom

  18. The Analysis of a Teacher Test Preparation Tutorial to Learner Test Scores: An Action Research Study

    Science.gov (United States)

    Mild, Toni L. Hittle

    2014-01-01

    Many Pennsylvania colleges and universities require that teacher candidates pass a standardized assessment in order to gain formal entry in to their education programs. Standardized tests are also required for Level I teacher certification within Pennsylvania. The initial assessment required of all Pennsylvania preservice teachers for…

  19. Quick screening tool for patients with severe negative emotional reactions to chronic illness: psychometric study of the negative emotions due to chronic illness screening test (NECIS).

    Science.gov (United States)

    Huang, Yun-Hsin; Wu, Chih-Hsun; Chen, Hsiu-Jung; Cheng, Yih-Ru; Hung, Fu-Chien; Leung, Kai-Kuan; Lue, Bee-Horng; Chen, Ching-Yu; Chiu, Tai-Yuan; Wu, Yin-Chang

    2018-01-16

    Severe negative emotional reactions to chronic illness are maladaptive to patients and they need to be addressed in a primary care setting. The psychometric properties of a quick screening tool-the Negative Emotions due to Chronic Illness Screening Test (NECIS)-for general emotional problems among patients with chronic illness being treated in a primary care setting was investigated. Three studies including 375 of patients with chronic illness were used to assess and analyze internal consistency, test-retest reliability, criterion-related validity, a cut-off point for distinguishing maladaptive emotions and clinical application validity of NECIS. Self-report questionnaires were used. Internal consistency (Cronbach's α) ranged from 0.78 to 0.82, and the test-retest reliability was 0.71 (P analysis reference, the receiver-operating characteristic curve analysis revealed an area under the curve of 0.81 and 0.82 (ps emotions, with a sensitivity and specificity of 83.3 and 69.0%, and 68.5 and 83.0%, respectively. The clinical application validity analysis revealed that low NECIS group showed significantly better adaptation to chronic illness on the scales of subjective health, general satisfaction with life, self-efficacy of self-care for disease, illness perception and stressors in everyday life. The NECIS has satisfactory psychometric properties for use in the primary care setting. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  20. Psychometric Properties of the Chinese Version of the Eating Attitudes Test in Young Female Patients with Eating Disorders in Mainland China.

    Science.gov (United States)

    Kang, Qing; Chan, Raymond C K; Li, Xiaoping; Arcelus, Jon; Yue, Ling; Huang, Jiabin; Gu, Lian; Fan, Qing; Zhang, Haiyin; Xiao, Zeping; Chen, Jue

    2017-11-01

    The study aimed to investigate the reliability and validity of the Chinese version of the eating attitudes test (EAT-26) among female adolescents and young adults in Mainland China. This scale was administered to 396 female eating disorder patients and 406 noneating disorder healthy controls, in addition 35 healthy controls completed a retest after a 4-week intervals. Tests for reliability, convergent validity and receiver operating characteristic analysis were performed to detect the psychometric properties. The EAT-26 demonstrated good internal consistency (Cronbach's alpha = 0.822-0.922), test-retest reliability (interclass correlation coefficient = 0.817) and convergent validity(r = 0.450-0.750). The receiver operating characteristic analysis showed that the cut-off 14 for anorexia nervosa and 15 for bulimia nervosa represented good compromises with approximate sensitivity (0.66-0.68) and specificity (0.85-0.86). Our findings provided evidence that the Chinese version of the EAT-26 was a psychometrically reliable and valid self-rating instrument for identifying people suffering from an eating disorder in Mainland China. A clinical cut-off range between 14 and 15 could be used, but caution should be exercised because of the low sensitivity of the tool. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association. Copyright © 2017 John Wiley & Sons, Ltd and Eating Disorders Association.

  1. A Psychometric Analysis of the Reading the Mind in the Eyes Test: Towards a Brief Form for Research and Applied Settings

    Directory of Open Access Journals (Sweden)

    Sally eOlderbak

    2015-10-01

    Full Text Available The Reading the Mind in the Eyes Test is a popular measure of individual differences in Theory of Mind that is often applied in the assessment of particular clinical populations (primarily, individuals on the autism spectrum. However, little is known about the test’s psychometric properties, including factor structure, internal consistency, and convergent validity evidence. We present a psychometric analysis of the test followed by an evaluation of other empirically proposed and statistically identified structures. We identified, and cross-validated in a second sample, an adequate short-form solution that is homogeneous with adequate internal consistency, and is moderately related to Cognitive Empathy, Emotion Perception, and strongly related to Vocabulary. We recommend the use of this short-form solution in normal adults as a more precise measure over the original version. Future revisions of the test should seek to reduce the test’s reliance on one’s vocabulary and evaluate the short-form structure in clinical populations.

  2. An analysis of aviation test scores to characterize Student Naval Aviator disqualification

    OpenAIRE

    Wahl, Erich J.

    1998-01-01

    Approved for public release; distribution is unlimited The U.S. Navy uses the Aviation Selection Test Battery (ASTh) to identify those Student Naval Aviator (SNA) applicants most likely to succeed in flight training. Using classification and regression trees, this thesis concludes that individual answers to an ASTh subtest, the Biographical Inventory, are not good predictors of SNA primary flight grades. It also concludes that those SNA who score less than a 6 on the Pilot Biographical Inv...

  3. Impact of Answer-Switching Behavior on Multiple-Choice Test Scores in Higher Education

    Directory of Open Access Journals (Sweden)

    Ramazan BAŞTÜRK

    2011-06-01

    Full Text Available The multiple- choice format is one of the most popular selected-response item formats used in educational testing. Researchers have shown that Multiple-choice type test is a useful vehicle for student assessment in core university subjects that usually have large student numbers. Even though the educators, test experts and different test recourses maintain the idea that the first answer should be retained, many researchers argued that this argument is not dependent with empirical findings. The main question of this study is to examine how the answer switching behavior affects the multiple-choice test score. Additionally, gender differences and relationship between number of answer switching behavior and item parameters (item difficulty and item discrimination were investigated. The participants in this study consisted of 207 upper-level College of Education students from mid-sized universities. A Midterm exam consisted of 20 multiple-choice questions was used. According to the result of this study, answer switching behavior statistically increase test scores. On the other hand, there is no significant gender difference in answer-switching behavior. Additionally, there is a significant negative relationship between answer switching behavior and item difficulties.

  4. Are students' impressions of improved learning through active learning methods reflected by improved test scores?

    Science.gov (United States)

    Everly, Marcee C

    2013-02-01

    To report the transformation from lecture to more active learning methods in a maternity nursing course and to evaluate whether student perception of improved learning through active-learning methods is supported by improved test scores. The process of transforming a course into an active-learning model of teaching is described. A voluntary mid-semester survey for student acceptance of the new teaching method was conducted. Course examination results, from both a standardized exam and a cumulative final exam, among students who received lecture in the classroom and students who had active learning activities in the classroom were compared. Active learning activities were very acceptable to students. The majority of students reported learning more from having active-learning activities in the classroom rather than lecture-only and this belief was supported by improved test scores. Students who had active learning activities in the classroom scored significantly higher on a standardized assessment test than students who received lecture only. The findings support the use of student reflection to evaluate the effectiveness of active-learning methods and help validate the use of student reflection of improved learning in other research projects. Copyright © 2011 Elsevier Ltd. All rights reserved.

  5. Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.

    Science.gov (United States)

    Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid

    2017-11-01

    Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenient sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.

  6. Comparison of physical therapy anatomy performance and anxiety scores in timed and untimed practical tests.

    Science.gov (United States)

    Schwartz, Sarah M; Evans, Cathy; Agur, Anne M R

    2015-01-01

    Students in health care professional programs face many stressful tests that determine successful completion of their program. Test anxiety during these high stakes examinations can affect working memory and lead to poor outcomes. Methods of decreasing test anxiety include lengthening the time available to complete examinations or evaluating students using untimed examinations. There is currently no consensus in the literature regarding whether untimed examinations provide a benefit to test performance in clinical anatomy. This study aimed to determine the impact of timed versus untimed practical tests on Master of Physical Therapy student anatomy performance and test anxiety. Test anxiety was measured using the State-Trait Anxiety Inventory (STAI). Differences in performance, anxiety scores, and time taken were compared using paired sample Student's t-tests. Eighty-one of the 84 students completed the study and provided feedback. Students performed significantly higher on the untimed test (P = 0.005), with a significant reduction in test anxiety (P anxiety. If the intended goal of evaluating health care professional students is to determine fundamental competencies, these factors should be considered when designing future curricula. © 2014 American Association of Anatomists.

  7. Consequences of the Chernobyl accident in Russia: search for effects of radiation exposure in utero using psychometric tests

    International Nuclear Information System (INIS)

    Ryabukhin, Yu.S.; Ryabukhin, V.Yu.

    2001-01-01

    Psychometric indicators for mental development of children in towns distinguished by radioactive contamination resulting from the Chernobyl accident are studied. Using some radiological information obtained after the Chernobyl accident, values of expected intelligence quotient (IQ) reduction have been assessed as a result of brain exposure in utero due to various components of dose. Comparing the results of examinations in Novozybkov, Klintsy and Obninsk, no confident evidence has been obtained that radiation exposure of the developing brain exerts influence on indicators for mental development [ru

  8. Translation and psychometric testing of the Korean Versions of the Spiritual Perspective Scale and the Self-transcendence Scale in Korean elders.

    Science.gov (United States)

    Kim, Suk Sun; Reed, Pamela G; Kang, Youngmi; Oh, Jina

    2012-12-01

    The purpose of this study was to translate the Spiritual Perspective Scale (SPS) and Self-transcendence Scale (STS) into Korean and test the psychometric properties of the instruments with Korean elders. A cross-sectional survey design was used to implement the three stages of the study. Stage I consisted of translating and reviewing the scales by six experts. In Stage II, equivalence was tested by comparing the responses between the Korean and English versions among 71 bilingual adults. Stage III established the psychometric properties of the Korean versions SPS-K and STS-K among 154 Korean elders. Cronbach's alpha of the SPS-K and the STS-K .97, and .85 respectively with Korean elders. Factor analysis showed that the SPS-K had one factor; the STS-K had four factors with one factor clearly representing self-transcendence as theorized. Both scales showed good reliability and validity for the translated Korean versions. However, continued study of the construct validity of the STS-K is needed. Study findings indicate that the SPS-K and the STS-K could be useful for nurses and geriatric researchers to assess a broadly defined spirituality, and to conduct research on spirituality and health among Korean elders. Use of these scales within a theory-based study may contribute to further knowledge about the role of spirituality in the health and well-being of Korean people facing health crises.

  9. Psychometric evaluation of an item bank for computerized adaptive testing of the EORTC QLQ-C30 cognitive functioning dimension in cancer patients.

    Science.gov (United States)

    Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J B; Conroy, Thierry; Tomaszewski, Krzysztof A; Young, Teresa; Petersen, Morten Aa

    2017-11-01

    The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients. This evaluation included an assessment of dimensionality, fit to the item response theory (IRT) model, differential item functioning (DIF), and measurement properties. A total of 1030 cancer patients completed the 44 candidate items on CF. Of these, 34 items could be included in a unidimensional IRT model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient assessment of HRQOL of cancer patients, without loss of comparability of results.

  10. The Performance of the Upper Limb scores correlate with pulmonary function test measures and Egen Klassifikation scores in Duchenne muscular dystrophy.

    Science.gov (United States)

    Lee, Ha Neul; Sawnani, Hemant; Horn, Paul S; Rybalsky, Irina; Relucio, Lani; Wong, Brenda L

    2016-01-01

    The Performance of the Upper Limb scale was developed as an outcome measure specifically for ambulant and non-ambulant patients with Duchenne muscular dystrophy and is implemented in clinical trials needing longitudinal data. The aim of this study is to determine whether this novel tool correlates with functional ability using pulmonary function test, cardiac function test and Egen Klassifikation scale scores as clinical measures. In this cross-sectional study, 43 non-ambulatory Duchenne males from ages 10 to 30 years and on long-term glucocorticoid treatment were enrolled. Cardiac and pulmonary function test results were analyzed to assess cardiopulmonary function, and Egen Klassifikation scores were analyzed to assess functional ability. The Performance of the Upper Limb scores correlated with pulmonary function measures and had inverse correlation with Egen Klassifikation scores. There was no correlation with left ventricular ejection fraction and left ventricular dysfunction. Body mass index and decreased joint range of motion affected total Performance of the Upper Limb scores and should be considered in clinical trial designs. Copyright © 2016 Elsevier B.V. All rights reserved.

  11. Association testing for next-generation sequencing data using score statistics

    DEFF Research Database (Denmark)

    Skotte, Line; Korneliussen, Thorfinn Sand; Albrechtsen, Anders

    2012-01-01

    computationally feasible due to the use of score statistics. As part of the joint likelihood, we model the distribution of the phenotypes using a generalized linear model framework, which works for both quantitative and discrete phenotypes. Thus, the method presented here is applicable to case-control studies...... of genotype calls into account have been proposed; most require numerical optimization which for large-scale data is not always computationally feasible. We show that using a score statistic for the joint likelihood of observed phenotypes and observed sequencing data provides an attractive approach...... to association testing for next-generation sequencing data. The joint model accounts for the genotype classification uncertainty via the posterior probabilities of the genotypes given the observed sequencing data, which gives the approach higher power than methods based on called genotypes. This strategy remains...

  12. A high COPD assessment test score may predict anxiety in COPD

    Directory of Open Access Journals (Sweden)

    Harryanto H

    2018-03-01

    Full Text Available Hilman Harryanto,1 Sally Burrows,2 Yuben Moodley1,2 1Department of Respiratory Medicine, Fiona Stanley Hospital, Perth, WA, Australia; 2Faculty of Health and Medical Sciences, Medical School, University of Western Australia, Perth, WA, AustraliaThe prevalence of anxiety is 55% in patients with COPD,1 and it is associated with worse disease control. Therefore, early recognition and institution of treatment of this comorbidity significantly improve patient’s quality of life. Recently, a questionnaire called the COPD assessment test (CAT has been incorporated into the Global Initiative for Chronic Obstructive Lung Disease (GOLD guidelines for the management of COPD, and a higher score is associated with increased COPD symptoms.2 Considering the regular use of CAT, it was evaluated whether this tool can also be used to identify anxiety. The CAT score was correlated with the Hospital Anxiety and Depression Scale (HADS to determine the level at which CAT may predict anxiety.

  13. Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success.

    Science.gov (United States)

    Niu, Sunny X; Tienda, Marta

    2012-04-01

    Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success-high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reduce the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe.

  14. COMPARISON BETWEEN WOOD DRYING DEFECT SCORES: SPECIMEN TESTING X ANALYSIS OF KILN-DRIED BOARDS

    Directory of Open Access Journals (Sweden)

    Djeison Cesar Batista

    2015-04-01

    Full Text Available It is important to develop drying technologies for Eucalyptus grandis lumber, which is one of the most planted species of this genus in Brazil and plays an important role as raw material for the wood industry. The general aim of this work was to assess the conventional kiln drying of juvenile wood of three clones of Eucalyptus grandis. The specific aims were to compare the behavior between: i drying defects indicated by tests with wood specimens and conventional kiln-dried boards; and ii physical properties and the drying quality. Five 11-year-old trees of each clone were felled, and only flatsawn boards of the first log were used. Basic density and total shrinkage were determined, and the drying test with wood specimens at 100 °C was carried out. Kiln drying of boards was performed, and initial and final moisture content, moisture gradient in thickness, drying stresses and drying defects were assessed. The defect scoring method was used to verify the behavior between the defects detected by specimen testing and the defects detected in kiln-dried boards. As main results, the drying schedule was too severe for the wood, resulting in a high level of boards with defects. The behavior between the defects in the drying test with specimens and the defects of kiln-dried boards was different, there was no correspondence, according to the defect scoring method.

  15. Development and Testing of Screen-Based and Psychometric Instruments for Assessing Resident Performance in an Operating Room Simulator

    Directory of Open Access Journals (Sweden)

    Richard R. McNeer

    2016-01-01

    Full Text Available Introduction. Medical simulators are used for assessing clinical skills and increasingly for testing hypotheses. We developed and tested an approach for assessing performance in anesthesia residents using screen-based simulation that ensures expert raters remain blinded to subject identity and experimental condition. Methods. Twenty anesthesia residents managed emergencies in an operating room simulator by logging actions through a custom graphical user interface. Two expert raters rated performance based on these entries using custom Global Rating Scale (GRS and Crisis Management Checklist (CMC instruments. Interrater reliability was measured by calculating intraclass correlation coefficients (ICC, and internal consistency of the instruments was assessed with Cronbach’s alpha. Agreement between GRS and CMC was measured using Spearman rank correlation (SRC. Results. Interrater agreement (GRS: ICC = 0.825, CMC: ICC = 0.878 and internal consistency (GRS: alpha = 0.838, CMC: alpha = 0.886 were good for both instruments. Subscale analysis indicated that several instrument items can be discarded. GRS and CMC scores were highly correlated (SRC = 0.948. Conclusions. In this pilot study, we demonstrated that screen-based simulation can allow blinded assessment of performance. GRS and CMC instruments demonstrated good rater agreement and internal consistency. We plan to further test construct validity of our instruments by measuring performance in our simulator as a function of training level.

  16. Hurlbert Index of Sexual Assertiveness: a study of psychometric properties in a Spanish sample.

    Science.gov (United States)

    Santos-Iglesias, Pablo; Sierra, Juan Carlos

    2010-08-01

    The study analyzed psychometric properties of a Spanish version of the Hurlbert Index of Sexual Assertiveness in a Spanish sample of 400 men and 453 women who had had a partner for the last 6 mo. or longer at the time of the study. Exploratory and confirmatory factor analyses suggested a two-factor solution with the factors Initiation and No shyness/Refusal. Internal consistency values for total scores were .87 and .83 for the factors, respectively. Convergent validity tests were also satisfactory. It is therefore reasonable to conclude that the Spanish version of the scale has appropriate psychometric properties.

  17. Determining the Sensitivity of CAT-ASVAB (Computerized Adaptive Testing- Armed Services Vocational Aptitude Battery) Scores to Changes in Item Response Curves with the Medium of Administration

    Science.gov (United States)

    1986-08-01

    most examinees. Therefore it appears psychometrically ac - ceptable for the CAT -ASVAB project to proceed without item recalibration based on...MEMORANDUM DETERMINING THE SENSITIVITY OF CAT -ASVAB SCORES TO CHANGES IN ITEM RESPONSE CURVES WITH THE MEDIUM OF ADMINISTRATION D. R. Divgi...Subj: Center for Naval Analyses Research Memorandum 86-189 End: (1) CNA Research Memorandum 86-189, "Determining the Sensitivity of CAT -ASVAB

  18. Attitudes toward science: measurement and psychometric properties of the Test of Science-Related Attitudes for its use in Spanish-speaking classrooms

    Science.gov (United States)

    Navarro, Marianela; Förster, Carla; González, Caterina; González-Pose, Paulina

    2016-06-01

    Understanding attitudes toward science and measuring them remain two major challenges for science teaching. This article reviews the concept of attitudes toward science and their measurement. It subsequently analyzes the psychometric properties of the Test of Science-Related Attitudes (TOSRA), such as its construct validity, its discriminant and concurrent validity, and its reliability. The evidence presented suggests that TOSRA, in its Spanish-adapted version, has adequate construct validity regarding its theoretical referents, as well as good indexes of reliability. In addition, it determines the attitudes toward science of secondary school students in Santiago de Chile (n = 664) and analyzes the sex variable as a differentiating factor in such attitudes. The analysis by sex revealed low-relevance gender difference. The results are contrasted with those obtained in English-speaking countries. This TOSRA sample showed good psychometric parameters for measuring and evaluating attitudes toward science, which can be used in classrooms of Spanish-speaking countries or with immigrant populations with limited English proficiency.

  19. Micronucleus test for radiation biodosimetry in mass casualty events: Evaluation of visual and automated scoring

    Energy Technology Data Exchange (ETDEWEB)

    Bolognesi, Claudia, E-mail: claudia.bolognesi@istge.i [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Balia, Cristina; Roggieri, Paola [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Cardinale, Francesco [Clinical Epidemiology Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Department of Health Sciences, University of Genoa, Genoa (Italy); Bruzzi, Paolo [Clinical Epidemiology Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Sorcinelli, Francesca [Environmental Carcinogenesis Unit, National Cancer Research Institute, Largo R. Benzi 10, 16132 Genoa (Italy); Laboratory of Genetics, Histology and Molecular Biology Section, Army Medical and Veterinary, Research Center, Via Santo Stefano Rotondo 4, 00184 Roma (Italy); Lista, Florigio [Laboratory of Genetics, Histology and Molecular Biology Section, Army Medical and Veterinary, Research Center, Via Santo Stefano Rotondo 4, 00184 Roma (Italy); D' Amelio, Raffaele [Sapienza, Universita di Roma II Facolta di Medicina e Chirurgia and Ministero della Difesa, Direzione Generale Sanita Militare (Italy); Righi, Enzo [Frascati National Laboratories, National Institute of Nuclear Physics, Via Enrico Fermi 40, 00044 Frascati, Rome (Italy)

    2011-02-15

    In the case of a large-scale nuclear or radiological incidents a reliable estimate of dose is an essential tool for providing timely assessment of radiation exposure and for making life-saving medical decisions. Cytogenetics is considered as the 'gold standard' for biodosimetry. The dicentric analysis (DA) represents the most specific cytogenetic bioassay. The micronucleus test (MN) applied in interphase in peripheral lymphocytes is an alternative and simpler approach. A dose-effect calibration curve for the MN frequency in peripheral lymphocytes from 27 adult donors was established after in vitro irradiation at a dose range 0.15-8 Gy of {sup 137}Cs gamma rays (dose rate 6 Gy min{sup -1}). Dose prediction by visual scoring in a dose-blinded study (0.15-4.0 Gy) revealed a high level of accuracy (R = 0.89). The scoring of MN is time consuming and requires adequate skills and expertise. Automated image analysis is a feasible approach allowing to reduce the time and to increase the accuracy of the dose estimation decreasing the variability due to subjective evaluation. A good correlation (R = 0.705) between visual and automated scoring with visual correction was observed over the dose range 0-2 Gy. Almost perfect discrimination power for exposure to 1-2 Gy, and a satisfactory power for 0.6 Gy were detected. This threshold level can be considered sufficient for identification of sub lethally exposed individuals by automated CBMN assay.

  20. Test and Score Data Summary for TOEFL[R] Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

    Science.gov (United States)

    Educational Testing Service, 2008

    2008-01-01

    The Test of English as a Foreign Language[TM], better known as TOEFL[R], is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…

  1. Scoring in genetically modified organism proficiency tests based on log-transformed results.

    Science.gov (United States)

    Thompson, Michael; Ellison, Stephen L R; Owen, Linda; Mathieson, Kenneth; Powell, Joanne; Key, Pauline; Wood, Roger; Damant, Andrew P

    2006-01-01

    The study considers data from 2 UK-based proficiency schemes and includes data from a total of 29 rounds and 43 test materials over a period of 3 years. The results from the 2 schemes are similar and reinforce each other. The amplification process used in quantitative polymerase chain reaction determinations predicts a mixture of normal, binomial, and lognormal distributions dominated by the latter 2. As predicted, the study results consistently follow a positively skewed distribution. Log-transformation prior to calculating z-scores is effective in establishing near-symmetric distributions that are sufficiently close to normal to justify interpretation on the basis of the normal distribution.

  2. The End-Stage Renal Disease Adherence Questionnaire (ESRD-AQ): testing the psychometric properties in patients receiving in-center hemodialysis.

    Science.gov (United States)

    Kim, Youngmee; Evangelista, Lorraine S; Phillips, Linda R; Pavlish, Carol; Kopple, Joel D

    2010-01-01

    Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and validity) of the 46-item ESRD-Adherence Questionnaire (ESRD-AQ) in a cohort of patients receiving maintenance HD at dialysis centers in Los Angeles County. The ESRD-AQ is the first self-report instrument to address all components of adherence behaviors of patients with ESRD. The findings support that the instrument is reliable and valid and is easy to administer. Future studies are needed in a larger sample to determine whether additional modifications are needed.

  3. Increased correlation coefficient between the written test score and tutors’ performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia

    Directory of Open Access Journals (Sweden)

    Heethal Jaiprakash

    2016-03-01

    Full Text Available This paper is aimed at finding if there was a change of correlation between the written test score and tutors’ performance test scores in the assessment of medical students during a problem-based learning (PBL course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group’s tutors did not receive tutor training; while the second group’s tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors’ performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors’ scores in group 1 was 0.099 (p<0.001 and for group 2 was 0.305 (p<0.001. The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.

  4. Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

    Science.gov (United States)

    Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

    2016-03-01

    This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (pcorrelation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.

  5. A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

    Science.gov (United States)

    Lee, Guemin; Park, In-Yong

    2012-01-01

    Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…

  6. REPRODUCIBILITY OF THE MODIFIED STAR EXCURSION BALANCE TEST COMPOSITE AND SPECIFIC REACH DIRECTION SCORES.

    Science.gov (United States)

    van Lieshout, Remko; Reijneveld, Elja A E; van den Berg, Sandra M; Haerkens, Gijs M; Koenders, Niek H; de Leeuw, Arina J; van Oorsouw, Roel G; Paap, Davy; Scheffer, Else; Weterings, Stijn; Stukstette, Mirelle J

    2016-06-01

    The mSEBT is a screening tool used to evaluate dynamic balance. Most research investigating measurement properties focused on intrarater reliability and was done in small samples. To know whether the mSEBT is useful to discriminate dynamic balance between persons and to evaluate changes in dynamic balance, more research into intra- and interrater reliability and smallest detectable change (synonymous with minimal detectable change) is needed. To estimate intra- and interrater reliability and smallest detectable change of the mSEBT in adults at risk for ankle sprain. Cross-sectional, test-retest design. Fifty-five healthy young adults participating in sports at risk for ankle sprain participated (mean ± SD age, 24.0 ± 2.9 years). Each participant performed three test sessions within one hour and was rated by two physical therapists (session 1, rater 1; session 2, rater 2; session 3, rater 1). Participants and raters were blinded for previous measurements. Normalized composite and reach direction scores for the right and left leg were collected. Analysis of variance was used to calculate intraclass correlation coefficient values for intra- and interrater reliability. Smallest detectable change values were calculated based on the standard error of measurement. Intra- and interrater reliability for both legs was good to excellent (intraclass correlation coefficient ranging from 0.87 to 0.94). The intrarater smallest detectable change for the composite score of the right leg was 7.2% and for the left 6.2%. The interrater smallest detectable change for the composite score of the right leg was 6.9% and for the left 5.0%. The mSEBT is a reliable measurement instrument to discriminate dynamic balance between persons. Most smallest detectable change values of the mSEBT appear to be large. More research is needed to investigate if the mSEBT is usable for evaluative purposes. Level 2.

  7. A Follow-Up Web-Based Survey: Test and Measurement Expert Opinions on the Psychometric Properties of Out-of-Level Tests. Out-of-Level Testing Report.

    Science.gov (United States)

    Bielinski, John; Minnema, Jane; Thurlow, Martha

    A Web-based survey of 25 experts in testing theory and large-scale assessment examined the utility of out-of-level testing for making decisions about students and schools. Survey respondents were given a series of scenarios and asked to judge the degree to which out-of-level testing would affect the reliability and validity of test scores within…

  8. Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

    Directory of Open Access Journals (Sweden)

    Yota Uno

    Full Text Available OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70 was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. RESULTS: The Cronbach's alpha for the new Tanaka B Intelligence Scale IQ (BIQ was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96. In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9, and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4. Thus, intellectual disability could be ruled out or determined. CONCLUSION: The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests have previously been difficult.

  9. Reliability and validity of the new Tanaka B Intelligence Scale scores: a group intelligence test.

    Science.gov (United States)

    Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

    2014-01-01

    The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQIntelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85-0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9-48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03-0.4). Thus, intellectual disability could be ruled out or determined. The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests have previously been difficult.

  10. Performance on large-scale science tests: Item attributes that may impact achievement scores

    Science.gov (United States)

    Gordon, Janet Victoria

    Significant differences in achievement among ethnic groups persist on the eighth-grade science Washington Assessment of Student Learning (WASL). The WASL measures academic performance in science using both scenario and stand-alone question types. Previous research suggests that presenting target items connected to an authentic context, like scenario question types, can increase science achievement scores especially in underrepresented groups and thus help to close the achievement gap. The purpose of this study was to identify significant differences in performance between gender and ethnic subgroups by question type on the 2005 eighth-grade science WASL. MANOVA and ANOVA were used to examine relationships between gender and ethnic subgroups as independent variables with achievement scores on scenario and stand-alone question types as dependent variables. MANOVA revealed no significant effects for gender, suggesting that the 2005 eighth-grade science WASL was gender neutral. However, there were significant effects for ethnicity. ANOVA revealed significant effects for ethnicity and ethnicity by gender interaction in both question types. Effect sizes were negligible for the ethnicity by gender interaction. Large effect sizes between ethnicities on scenario question types became moderate to small effect sizes on stand-alone question types. This indicates the score advantage the higher performing subgroups had over the lower performing subgroups was not as large on stand-alone question types compared to scenario question types. A further comparison examined performance on multiple-choice items only within both question types. Similar achievement patterns between ethnicities emerged; however, achievement patterns between genders changed in boys' favor. Scenario question types appeared to register differences between ethnic groups to a greater degree than stand-alone question types. These differences may be attributable to individual differences in cognition

  11. The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach.

    Science.gov (United States)

    Xu, Jian

    2017-01-01

    The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers' listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.

  12. The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

    Directory of Open Access Journals (Sweden)

    Jian Xu

    2017-12-01

    Full Text Available The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers’ listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.

  13. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    Energy Technology Data Exchange (ETDEWEB)

    Zain, Zakiyah, E-mail: zac@uum.edu.my; Ahmad, Yuhaniz, E-mail: yuhaniz@uum.edu.my [School of Quantitative Sciences, Universiti Utara Malaysia, UUM Sintok 06010, Kedah (Malaysia); Azwan, Zairul, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com; Raduan, Farhana, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com; Sagap, Ismail, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com [Surgery Department, Universiti Kebangsaan Malaysia Medical Centre, Jalan Yaacob Latif, 56000 Bandar Tun Razak, Kuala Lumpur (Malaysia); Aziz, Nazrina, E-mail: nazrina@uum.edu.my

    2014-12-04

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.

  14. Genome scan for linkage to asthma using a linkage disequilibrium-lod score test.

    Science.gov (United States)

    Jiang, Y; Slager, S L; Huang, J

    2001-01-01

    We report a genome-wide linkage study of asthma on the German and Collaborative Study on the Genetics of Asthma (CSGA) data. Using a combined linkage and linkage disequilibrium test and the nonparametric linkage score, we identified 13 markers from the German data, 1 marker from the African American (CSGA) data, and 7 markers from the Caucasian (CSGA) data in which the p-values ranged between 0.0001 and 0.0100. From our analysis and taking into account previous published linkage studies of asthma, we suggest that three regions in chromosome 5 (around D5S418, D5S644, and D5S422), one region in chromosome 6 (around three neighboring markers D6S1281, D6S291, and D6S1019), one region in chromosome 11 (around D11S2362), and two regions in chromosome 12 (around D12S351 and D12S324) especially merit further investigation.

  15. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    Science.gov (United States)

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. © 2016 John Wiley & Sons Ltd/University College London.

  16. The achievement impact of the inclusion model on the standardized test scores of general education students

    Science.gov (United States)

    Garrett-Rainey, Syrena

    The purpose of this study was to compare the achievement of general education students within regular education classes to the achievement of general education students in inclusion/co-teach classes to determine whether there was a significant difference in the achievement between the two groups. The school district's inclusion/co-teach model included ongoing professional development support for teachers and administrators. General education teachers, special education teachers, and teacher assistants collaborated to develop instructional strategies to provide additional remediation to help students to acquire the skills needed to master course content. This quantitative study reviewed the end-of course test (EoCT) scores of Grade 10 physical science and math students within an urban school district. It is not known whether general education students in an inclusive/co-teach science or math course will demonstrate a higher achievement on the EoCT in math or science than students not in an inclusive/co-teach classroom setting. In addition, this study sought to determine if students classified as low socioeconomic status benefited from participating in co-teaching classrooms as evidenced by standardized tests. Inferential statistics were used to determine whether there was a significant difference between the achievements of the treatment group (inclusion/co-teach) and the control group (non-inclusion/co-teach). The findings can be used to provide school districts with optional instructional strategies to implement in the diverse classroom setting in the modern classroom to increase academic performance on state standardized tests.

  17. Science Teacher Efficacy and Outcome Expectancy as Predictors of Students' End-of-Instruction (EOI) Biology I Test Scores

    Science.gov (United States)

    Angle, Julie; Moseley, Christine

    2009-01-01

    The purpose of this study was to compare teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the statewide End-of-Instruction (EOI) Biology I test met or exceeded the state academic proficiency level (Proficient Group) to teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the…

  18. Utilizing the Six Realms of Meaning in Improving Campus Standardized Test Scores through Team Teaching and Strategic Planning

    Science.gov (United States)

    Stevenson, Rosnisha D.; Kritsonis, William Allan

    2009-01-01

    This article will seek to utilize Dr. William Allan Kritsonis' book "Ways of Knowing Through the Realms of Meaning" (2007) as a framework to improve a campus's standardized test scores, more specifically, their TAKS (Texas Assessment of Knowledge and Skills) scores. Many campuses have an improvement plan, also known as a Campus…

  19. Implications of Deployed and Nondeployed Fathers on Seventh Graders' California Achievement Test Scores during a Military Crisis.

    Science.gov (United States)

    Pisano, Mark C.

    The differences in California Achievement Test (CAT) scores from 1990 to 1991 in seventh graders, currently enrolled in Albritton Junior High School in the Fort Bragg Schools, of deployed and nondeployed fathers were analyzed. CAT percentile scores from 1990 and 1991 (1991 being the year of "Desert Storm") were obtained in reading, math…

  20. The Test Anxiety Measure for Adolescents (TAMA): Examination of the Reliability and Validity of the Scores of a New Multidimensional Measure of Test Anxiety for Middle and High School Students

    Science.gov (United States)

    Lowe, Patricia A.

    2014-01-01

    A new multidimensional measure of test anxiety, the Test Anxiety Measure for Adolescents (TAMA), specifically designed for U.S. adolescents in Grades 6 to 12 was developed and its psychometric properties were examined. The TAMA consists of five scales (Cognitive Interference, Physiological Hyperarousal, Social Concerns, Task Irrelevant Behavior,…

  1. A Case Study About Why It Can Be Difficult To Test Whether Propensity Score Analysis Works in Field Experiments

    Directory of Open Access Journals (Sweden)

    William R. Shadish

    2013-02-01

    Full Text Available Peikes, Moreno and Orzol (2008 sensibly caution researchers that propensity score analysis may not lead to valid causal inference in field applications. But at the same time, they made the far stronger claim to have performed an ideal test of whether propensity score matching in quasi-experimental data is capable of approximating the results of a randomized experiment in their dataset, and that this ideal test showed that such matching could not do so. In this article we show that their study does not support that conclusion because it failed to meet a number of basic criteria for an ideal test. By implication, many other purported tests of the effectiveness of propensity score analysis probably also fail to meet these criteria, and are therefore questionable contributions to the literature on the effects of propensity score analysis. DOI: 10.2458/azu_jmmss.v3i2.16475

  2. A Case Study About Why It Can Be Difficult To Test Whether Propensity Score Analysis Works in Field Experiments

    Directory of Open Access Journals (Sweden)

    Thomas D. Cook

    2012-01-01

    Full Text Available Peikes, Moreno and Orzol (2008 sensibly caution researchers that propensity score analysis may not lead to valid causal inference in field applications. But at the same time, they made the far stronger claim to have performed an ideal test of whether propensity score matching in quasi-experimental data is capable of approximating the results of a randomized experiment in their dataset, and that this ideal test showed that such matching could not do so. In this article we show that their study does not support that conclusion because it failed to meet a number of basic criteria for an ideal test. By implication, many other purported tests of the effectiveness of propensity score analysis probably also fail to meet these criteria, and are therefore questionable contributions to the literature on the effects of propensity score analysis.

  3. Similar predictions of etravirine sensitivity regardless of genotypic testing method used: comparison of available scoring systems.

    Science.gov (United States)

    Vingerhoets, Johan; Nijs, Steven; Tambuyzer, Lotke; Hoogstoel, Annemie; Anderson, David; Picchio, Gaston

    2012-01-01

    The aims of this study were to compare various genotypic scoring systems commonly used to predict virological outcome to etravirine, and examine their concordance with etravirine phenotypic susceptibility. Six etravirine genotypic scoring systems were assessed: Tibotec 2010 (based on 20 mutations; TBT 20), Monogram, Stanford HIVdb, ANRS, Rega (based on 37, 30, 27 and 49 mutations, respectively) and virco(®)TYPE HIV-1 (predicted fold change based on genotype). Samples from treatment-experienced patients who participated in the DUET trials and with both genotypic and phenotypic data (n=403) were assessed using each scoring system. Results were retrospectively correlated with virological response in DUET. κ coefficients were calculated to estimate the degree of correlation between the different scoring systems. Correlation between the five scoring systems and the TBT 20 system was approximately 90%. Virological response by etravirine susceptibility was comparable regardless of which scoring system was utilized, with 70-74% of DUET patients determined as susceptible to etravirine by the different scoring systems achieving plasma viral load <50 HIV-1 RNA copies/ml. In samples classed as phenotypically susceptible to etravirine (fold change in 50% effective concentration ≤3), correlations with genotypic score were consistently high across scoring systems (≥70%). In general, the etravirine genotypic scoring systems produced similar results, and genotype-phenotype concordance was high. As such, phenotypic interpretations, and in their absence all genotypic scoring systems investigated, may be used to reliably predict the activity of etravirine.

  4. Translation and Adaptation of Knee Injury and Osteoarthritis Outcome Score (KOOS in to Persian and Testing Persian Version Reliability Among Iranians with Osteoarthritis

    Directory of Open Access Journals (Sweden)

    Solaleh Saraei-Pour

    2007-04-01

    Full Text Available Objective: To achieve a reliable tool for measuring health related quality of life among Iranians with knee osteoarthritis, by translating and culturally adapting the Knee injury and Osteoarthritis Outcome Score(KOOS to Persian and testing the reliability and internal consistency of the Iranian version. Materials & Methods: It was a non experimental methodology study. KOOS was translated and adapted culturally to Persian language and culture in three phases with respect to IQOLA project. For examining test-retest reliability Iranians version of KOOS was corresponded twice with in at least two days or at most one week interval, by 30 Iranian people with knee OA whom were referred to Municipality and 110 physiotherapy clinics of Tehran with PT order by physicians. It was a non experimental methodological research and we used sample of convenience and non probability design for sampling. Psychometric evaluation: the collected data from the questionnaires was rated and analyzed with SPSS software from the aspects of test-retest reliability, absolute reliability, subscale and item internal consistency. Results: Internal consistency which was calculated by Cronbach '&alpha was high for all the subscales (at least 0.76, except for "symptom" subscale which was moderate, and showed that items of each subscale measured the same construct. Item internal consistency after correction for overlap, was higher than optimal value (0.4, except for the items of" symptom" subscale , which demonstrated good item internal consistency. SEM and ICC which were used for evaluating the absolute and test-retest reliability in respect showed that all the subscales had good test-retest reliability (0.7 and the absolute reliability was also very good in such away that the highest calculated SEM for Persian version was 7.44 which was less than Minimal Perceptible Clinical Improvement (MPCI that is estimated 8 to 10 for the KOOS questionnaire. Conclusion: With the Persian

  5. The TSCA interagency testing committee`s approaches to screening and scoring chemicals and chemical groups: 1977-1983

    Energy Technology Data Exchange (ETDEWEB)

    Walker, J.D. [Environmental Protection Agency, Washington, DC (United States)

    1990-12-31

    This paper describes the TSCA interagency testing committee`s (ITC) approaches to screening and scoring chemicals and chemical groups between 1977 and 1983. During this time the ITC conducted five scoring exercises to select chemicals and chemical groups for detailed review and to determine which of these chemicals and chemical groups should be added to the TSCA Section 4(e) Priority Testing List. 29 refs., 1 fig., 2 tabs.

  6. The Dysexecutive Questionnaire advanced: item and test score characteristics, 4-factor solution, and severity classification.

    Science.gov (United States)

    Bodenburg, Sebastian; Dopslaff, Nina

    2008-01-01

    The Dysexecutive Questionnaire (DEX, , Behavioral assessment of the dysexecutive syndrome, 1996) is a standardized instrument to measure possible behavioral changes as a result of the dysexecutive syndrome. Although initially intended only as a qualitative instrument, the DEX has also been used increasingly to address quantitative problems. Until now there have not been more fundamental statistical analyses of the questionnaire's testing quality. The present study is based on an unselected sample of 191 patients with acquired brain injury and reports on the data relating to the quality of the items, the reliability and the factorial structure of the DEX. Item 3 displayed too great an item difficulty, whereas item 11 was not sufficiently discriminating. The DEX's reliability in self-rating is r = 0.85. In addition to presenting the statistical values of the tests, a clinical severity classification of the overall scores of the 4 found factors and of the questionnaire as a whole is carried out on the basis of quartile standards.

  7. Hippocampal dose volume histogram predicts Hopkins Verbal Learning Test scores after brain irradiation

    Directory of Open Access Journals (Sweden)

    Catherine Okoukoni, PhD

    2017-10-01

    Full Text Available Purpose: Radiation-induced cognitive decline is relatively common after treatment for primary and metastatic brain tumors; however, identifying dosimetric parameters that are predictive of radiation-induced cognitive decline is difficult due to the heterogeneity of patient characteristics. The memory function is especially susceptible to radiation effects after treatment. The objective of this study is to correlate volumetric radiation doses received by critical neuroanatomic structures to post–radiation therapy (RT memory impairment. Methods and materials: Between 2008 and 2011, 53 patients with primary brain malignancies were treated with conventionally fractionated RT in prospectively accrued clinical trials performed at our institution. Dose-volume histogram analysis was performed for the hippocampus, parahippocampus, amygdala, and fusiform gyrus. Hopkins Verbal Learning Test-Revised scores were obtained at least 6 months after RT. Impairment was defined as an immediate recall score ≤15. For each anatomic region, serial regression was performed to correlate volume receiving a given dose (VD(Gy with memory impairment. Results: Hippocampal V53.4Gy to V60.9Gy significantly predicted post-RT memory impairment (P < .05. Within this range, the hippocampal V55Gy was the most significant predictor (P = .004. Hippocampal V55Gy of 0%, 25%, and 50% was associated with tumor-induced impairment rates of 14.9% (95% confidence interval [CI], 7.2%-28.7%, 45.9% (95% CI, 24.7%-68.6%, and 80.6% (95% CI, 39.2%-96.4%, respectively. Conclusions: The hippocampal V55Gy is a significant predictor for impairment, and a limiting dose below 55 Gy may minimize radiation-induced cognitive impairment.

  8. Rugby versus Soccer in South Africa: Content Familiarity Contributes to Cross-Cultural Differences in Cognitive Test Scores

    Science.gov (United States)

    Malda, Maike; van de Vijver, Fons J. R.; Temane, Q. Michael

    2010-01-01

    In this study, cross-cultural differences in cognitive test scores are hypothesized to depend on a test's cultural complexity (Cultural Complexity Hypothesis: CCH), here conceptualized as its content familiarity, rather than on its cognitive complexity (Spearman's Hypothesis: SH). The content familiarity of tests assessing short-term memory,…

  9. The Effect of Computer-Based Self-Access Learning on Weekly Vocabulary Test Scores

    Directory of Open Access Journals (Sweden)

    Jordan Dreyer

    2014-09-01

    Full Text Available This study sets out to clarify the effectiveness of using an online vocabulary study tool, Quizlet, in an urban high school language arts class. Previous similar studies have mostly dealt with English Language Learners in college settings (Chui, 2013, and were therefore not directed at the issue self-efficacy that is at the heart of the problem of urban high school students in America entering remedial writing programs (Rose, 1989. The study involves 95 students over the course of 14 weeks. Students were tested weekly and were asked to use the Quizlet program in their own free time. The result of this optional involvement was that many students did not participate in the treatment and therefore acted as an elective control group. The resultant data collected shows a strong correlation between the use of an online vocabulary review program and short-term vocabulary retention. The study also showed that students who paced themselves and spread out their study sessions outperformed those students who used the program only for last minute “cram sessions.” The implications of the study are that students who take advantage of tools outside of the classroom are able to out perform their peers. The results are also in line with the call to include technology in the Basic Writing classroom not simply as a tool, but as a “form of discourse” (Jonaitis, 2012. Weekly vocabulary tests, combined with the daily online activity as reported by Quizlet, show that: 1 utilizing the review software improved the scores of most students, 2 those students who used Quizlet to review more than a single time (i.e., several days before the test outperformed those who only used the product once, and 3 students who professed proficiency with the “notebook” system of vocabulary learning appeared not to need the treatment.

  10. Sequential Neighborhood Effects: The Effect of Long-Term Exposure to Concentrated Disadvantage on Children's Reading and Math Test Scores.

    Science.gov (United States)

    Hicks, Andrew L; Handcock, Mark S; Sastry, Narayan; Pebley, Anne R

    2018-02-01

    Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure.

  11. Linkage analysis in nuclear families. 2: Relationship between affected sib-pair tests and lod score analysis.

    Science.gov (United States)

    Knapp, M; Seuchter, S A; Baur, M P

    1994-01-01

    It is believed that the main advantage of affected sib-pair tests is that their application requires no information about the underlying genetic mechanism of the disease. However, here it is proved that the mean test, which can be considered the most prominent of the affected sib-pair tests, is equivalent to lod score analysis for an assumed recessive mode of inheritance, irrespective of the true mode of the disease. Further relationships of certain sib-pair tests and lod score analysis under specific assumed genetic modes are investigated.

  12. Evaluation of Factors Affecting Continuous Performance Test Identical Pairs Version Score of Schizophrenic Patients in a Japanese Clinical Sample

    Directory of Open Access Journals (Sweden)

    Takayoshi Koide

    2012-01-01

    Full Text Available Aim. Cognitive impairment in schizophrenia strongly relates to social outcome and is a good candidate for endophenotypes. When we accurately measure drug efficacy or effects of genes or variants relevant to schizophrenia on cognitive impairment, clinical factors that can affect scores on cognitive tests, such as age and severity of symptoms, should be considered. To elucidate the effect of clinical factors, we conducted multiple regression analysis using scores of the Continuous Performance Test Identical Pairs Version (CPT-IP, which is often used to measure attention/vigilance in schizophrenia. Methods. We conducted the CPT-IP (4-4 digit and examined clinical information (sex, age, education years, onset age, duration of illness, chlorpromazine-equivalent dose, and Positive and Negative Symptom Scale (PANSS scores in 126 schizophrenia patients in Japanese population. Multiple regression analysis was used to evaluate the effect of clinical factors. Results. Age, chlorpromazine-equivalent dose, and PANSS-negative symptom score were associated with mean d′ score in patients. These three clinical factors explained about 28% of the variance in mean d′ score. Conclusions. As conclusion, CPT-IP score in schizophrenia patients is influenced by age, chlorpromazine-equivalent dose and PANSS negative symptom score.

  13. Development and psychometric testing of the childhood obesity perceptions (COP) survey among African American caregivers: A tool for obesity prevention program planning.

    Science.gov (United States)

    Alexander, Dayna S; Alfonso, Moya L; Cao, Chunhua

    2016-12-01

    Currently, public health practitioners are analyzing the role that caregivers play in childhood obesity efforts. Assessing African American caregiver's perceptions of childhood obesity in rural communities is an important prevention effort. This article's objective is to describe the development and psychometric testing of a survey tool to assess childhood obesity perceptions among African American caregivers in a rural setting, which can be used for obesity prevention program development or evaluation. The Childhood Obesity Perceptions (COP) survey was developed to reflect the multidimensional nature of childhood obesity including risk factors, health complications, weight status, built environment, and obesity prevention strategies. A 97-item survey was pretested and piloted with the priority population. After pretesting and piloting, the survey was reduced to 59-items and administered to 135 African American caregivers. An exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) was conducted to test how well the survey items represented the number of Social Cognitive Theory constructs. Twenty items were removed from the original 59-item survey and acceptable internal consistency of the six factors (α=0.70-0.85) was documented for all scales in the final COP instrument. CFA resulted in a less than adequate fit; however, a multivariate Lagrange multiplier test identified modifications to improve the model fit. The COP survey represents a promising approach as a potentially comprehensive assessment for implementation or evaluation of childhood obesity programs. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. The Sinonasal Outcome Test 22 score in persons without chronic rhinosinusitis

    DEFF Research Database (Denmark)

    Lange, Bibi; Thilsing, T; Baelum, J

    2016-01-01

    -67 with a mean score of 10.5 (CI: 9.1 - 11.9) and the median score was 7. Persons with allergic rhinitis and blue collar workers had a significant higher score. CONCLUSION: The median value of 7 is taken as the normal SNOT 22 score in persons without CRS and can be used as a reference in clinical settings...... and research. Allergic rhinitis and occupation affects SNOT 22 in persons without CRS. This article is protected by copyright. All rights reserved....

  15. Exploration of analysis methods for diagnostic imaging tests: problems with ROC AUC and confidence scores in CT colonography.

    Science.gov (United States)

    Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G

    2014-01-01

    Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.

  16. Development and psychometric testing of a barriers to HIV testing scale among individuals with HIV infection in Sweden; The Barriers to HIV testing scale-Karolinska version.

    Science.gov (United States)

    Wiklander, Maria; Brännström, Johanna; Svedhem, Veronica; Eriksson, Lars E

    2015-11-19

    Barriers to HIV testing experienced by individuals at risk for HIV can result in treatment delay and further transmission of the disease. Instruments to systematically measure barriers are scarce, but could contribute to improved strategies for HIV testing. Aims of this study were to develop and test a barriers to HIV testing scale in a Swedish context. An 18-item scale was developed, based on an existing scale with addition of six new items related to fear of the disease or negative consequences of being diagnosed as HIV-infected. Items were phrased as statements about potential barriers with a three-point response format representing not important, somewhat important, and very important. The scale was evaluated regarding missing values, floor and ceiling effects, exploratory factor analysis, and internal consistencies. The questionnaire was completed by 292 adults recently diagnosed with HIV infection, of whom 7 were excluded (≥9 items missing) and 285 were included (≥12 items completed) in the analyses. The participants were 18-70 years old (mean 40.5, SD 11.5), 39 % were females and 77 % born outside Sweden. Routes of transmission were heterosexual transmission 63 %, male to male sex 20 %, intravenous drug use 5 %, blood product/transfusion 2 %, and unknown 9 %. All scale items had <3 % missing values. The data was feasible for factor analysis (KMO = 0.92) and a four-factor solution was chosen, based on level of explained common variance (58.64 %) and interpretability of factor structure. The factors were interpreted as; personal consequences, structural barriers, social and economic security, and confidentiality. Ratings on the minimum level (suggested barrier not important) were common, resulting in substantial floor effects on the scales. The scales were internally consistent (Cronbach's α 0.78-0.91). This study gives preliminary evidence of the scale being feasible, reliable and valid to identify different types of barriers to HIV testing.

  17. Conceptual Scoring and Classification Accuracy of Vocabulary Testing in Bilingual Children

    Science.gov (United States)

    Anaya, Jissel B.; Peña, Elizabeth D.; Bedore, Lisa M.

    2018-01-01

    Purpose: This study examined the effects of single-language and conceptual scoring on the vocabulary performance of bilingual children with and without specific language impairment. We assessed classification accuracy across 3 scoring methods. Method: Participants included Spanish-English bilingual children (N = 247) aged 5;1 (years;months) to…

  18. Psychometric properties and clinical usefulness of the Oswestry Disability Index.

    Science.gov (United States)

    Vianin, Michael

    2008-12-01

    Outcome measures with good reliability, validity, responsiveness, and low burden of administration are clinically useful. The Oswestry Disability Index (ODI) is one of the most commonly used outcome measures for individuals with low back pain. Psychometric properties of the ODI will determine the questionnaire's suitability as a useful clinical tool. A literature search of relevant databases on psychometric evaluation of the ODI was performed. The search was done using the key words disability evaluation, and low back pain, and questionnaires, and reproducibility of results, and the term Oswestry. Inclusion criterion was direct reference regarding psychometric property, interpretability, and burden being included in the abstract. Eight articles met the inclusion criterion. The ODI shows good construct validity; internal consistency is rated as acceptable; test-retest reliability and responsiveness have been shown to be high; and burden of administration is low. The ODI is a valid, reliable, and responsive condition-specific assessment tool that is suited for use in clinical practice. It is easy to administer and score, objectifies clients' complaints, and monitors effects of therapy.

  19. Development and psychometric testing of the Canine Owner-Reported Quality of Life questionnaire, an instrument designed to measure quality of life in dogs with cancer.

    Science.gov (United States)

    Giuffrida, Michelle A; Brown, Dorothy Cimino; Ellenberg, Susan S; Farrar, John T

    2018-05-01

    OBJECTIVE To describe development and initial psychometric testing of an owner-reported questionnaire designed to standardize measurement of general quality of life (QOL) in dogs with cancer. DESIGN Key-informant interviews, questionnaire development, and field trial. SAMPLE Owners of 25 dogs with cancer for item development and pretesting and owners of 90 dogs with cancer for reliability and validity testing. PROCEDURES Standard methods for development and testing of questionnaire instruments intended to measure subjective states were used. Items were generated, selected, scaled, and pretested for content, meaning, and readability. Response items were evaluated with exploratory factor analysis and by assessing internal consistency (Cronbach α) and convergence with global QOL as determined with a visual analog scale. Preliminary tests of stability and responsiveness were performed. RESULTS The final questionnaire-which was named the Canine Owner-Reported Quality of Life (CORQ) questionnaire-contained 17 items related to observable behaviors commonly used by owners to evaluate QOL in their dogs. Several items pertaining to physical symptoms performed poorly and were omitted. The 17 items were assigned to 4 factors-vitality, companionship, pain, and mobility-on the basis of the items they contained. The CORQ questionnaire and its factors had high internal consistency (Cronbach α = 0.68 to 0.90) and moderate to strong correlations (r = 0.49 to 0.71) with global QOL as measured on a visual analog scale. Preliminary testing indicated good test-retest reliability and responsiveness to improvements in overall QOL. CONCLUSIONS AND CLINICAL RELEVANCE The CORQ questionnaire was a valid, reliable owner-reported questionnaire that measured general QOL in dogs with cancer and showed promise as a clinical trial outcome measure for quantifying changes in individual dog QOL occurring in response to cancer treatment and progression.

  20. Visual-Constructional Ability in Individuals with Severe Obesity: Rey Complex Figure Test Accuracy and the Q-Score

    Directory of Open Access Journals (Sweden)

    Hanna L. Sargénius

    2017-09-01

    Full Text Available The aims of this study were to investigate visual-construction and organizational strategy among individuals with severe obesity, as measured by the Rey Complex Figure Test (RCFT, and to examine the validity of the Q-score as a measure for the quality of performance on the RCFT. Ninety-six non-demented morbidly obese (MO patients and 100 healthy controls (HC completed the RCFT. Their performance was calculated by applying the standard scoring criteria. The quality of the copying process was evaluated per the directions of the Q-score scoring system. Results revealed that the MO did not perform significantly lower than the HC on Copy accuracy (mean difference −0.302, CI −1.374 to 0.769, p = 0.579. In contrast, the groups did statistically differ from each other, with MO performing poorer than the HC on the Q-score (mean −1.784, CI −3.237 to −0.331, p = 0.016 and the Unit points (mean −1.409, CI −2.291 to −0.528, p = 0.002, but not on the Order points score (mean −0.351, CI −0.994 to 0.293, p = 0.284. Differences on the Unit score and the Q-score were slightly reduced when adjusting for gender, age, and education. This study presents evidence supporting the presence of inefficiency in visuospatial constructional ability among MO patients. We believe we have found an indication that the Q-score captures a wider range of cognitive processes that are not described by traditional scoring methods. Rather than considering accuracy and placement of the different elements only, the Q-score focuses more on how the subject has approached the task.

  1. The Validity of Graduate Management Admission Test Scores: A Summary of Studies Conducted from 1997 to 2004

    Science.gov (United States)

    Talento-Miller, Eileen; Rudner, Lawrence M.

    2008-01-01

    The validity of Graduate Management Admission Test (GMAT) scores is examined by summarizing 273 studies conducted between 1997 and 2004. Each of the studies was conducted through the Validity Study Service of the test sponsor and contained identical variables and statistical methods. Validity coefficients from each of the studies were corrected…

  2. Comparison of proficiency in an anesthesiology course across distinct medical student cohorts: Psychometric approaches to test equating

    Directory of Open Access Journals (Sweden)

    Shu-Wei Liao

    2014-03-01

    Conclusion: Although both the chained linear equating method and Rasch analysis can be readily applied to practical test-equating issues in medical education, Rasch analysis exhibited more versatility in test parameter estimation and item bank development for clinical curriculums.

  3. The Effects of Video Game Experience on Computer-Based Air Traffic Controller Specialist, Air Traffic Scenario Test Scores.

    Science.gov (United States)

    1997-02-01

    application with a strong resemblance to a video game , concern has been raised that prior video game experience might have a moderating effect on scores. Much...such as spatial ability. The effects of computer or video game experience on work sample scores have not been systematically investigated. The purpose...of this study was to evaluate the incremental validity of prior video game experience over that of general aptitude as a predictor of work sample test

  4. The Dutch Technical-Tactical Tennis Test (D4T) for Talent Identification and Development : Psychometric Characteristics

    NARCIS (Netherlands)

    Kolman, Nikki; Huijgen, Barbara; Kramer, Tamara; Elferink-Gemser, Marije; Visscher, Chris

    2017-01-01

    This study examined the test-retest reliability, validity and feasibility of the newly developed Dutch Technical Tactical Tennis Test (D4T). This new test is relevant for talent identification and development in tennis. Thirty-two youth male tennis players (age 13.4 +/- 0.5) were classified as elite

  5. Power and sample size evaluation for the Cochran-Mantel-Haenszel mean score (Wilcoxon rank sum) test and the Cochran-Armitage test for trend.

    Science.gov (United States)

    Lachin, John M

    2011-11-10

    The power of a chi-square test, and thus the required sample size, are a function of the noncentrality parameter that can be obtained as the limiting expectation of the test statistic under an alternative hypothesis specification. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C-discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean scores test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is an monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C-ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups that yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.

  6. Preliminary Psychometric Testing of the Postpartum Depression Predictors Inventory-Revised (PDPI-R) in Portuguese Women.

    Science.gov (United States)

    Alves, Stephanie; Fonseca, Ana; Canavarro, Maria Cristina; Pereira, Marco

    2018-04-01

    Introduction Postpartum depression (PPD) is a prevalent condition with a serious impact. The early identification of women at risk for developing PPD allows for primary prevention and the delivery of timely appropriate referrals. This study investigated the validity and reliability of the postnatal version of the Postpartum Depression Predictors Inventory-Revised (PDPI-R), an instrument widely studied internationally, in Portuguese women. Methods The sample consisted of 204 women who participated in an online cross-sectional survey. Participants completed the European Portuguese versions of the PDPI-R, the Edinburgh Postnatal Depression Scale (EPDS), and the Postnatal Negative Thoughts Questionnaire at 1-2 months postpartum. Additionally, ROC analyses were performed to conduct an exploratory analysis of the instruments' predictive validity. Results The prevalence rates of clinical postpartum depressive symptoms were 27.5 and 14.2% using the cut-off scores of 9 and 12, respectively, on the EPDS. The European Portuguese postnatal version of the PDPI-R demonstrated acceptable reliability and satisfactory construct and convergent validity. When using the EPDS > 9 cut-off score, the exploratory analyses yielded a sensitivity of 76.8% and a specificity of 73.0% with a cut-off score of 5.5 [area under the curve = 0.816]. Discussion These preliminary findings encourage the use of the postnatal version of the PDPI-R as a screening tool to identify Portuguese women at high risk for developing PPD. Subsequent assessments are needed to support the routine application of the PDPI-R both in research and for clinical purposes.

  7. A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

    Science.gov (United States)

    Bersabé, Rosa; Rivas, Teresa

    2010-05-01

    The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.

  8. The Effects of Teaching Descriptive Geometry in General Engineering 103 on Spatial Relations Tests Scores.

    Science.gov (United States)

    Stallings, William M.

    It was hypothesized that instruction in descriptive geometry produces an increase in SRT scores. The resultant data do not firmly support this hypothesis. It is suggested that this study be replicated with the use of randomly selected control groups. (MS)

  9. Lead exposure and the 2010 achievement test scores of children in New York counties

    Directory of Open Access Journals (Sweden)

    Strayhorn Jillian C

    2012-01-01

    Full Text Available Abstract Background Lead is toxic to cognitive and behavioral functioning in children even at levels well below those producing physical symptoms. Continuing efforts in the U.S. since about the 1970s to reduce lead exposure in children have dramatically reduced the incidence of elevated blood lead levels (with elevated levels defined by the current U.S. Centers for Disease Control threshold of 10 μg/dl. The current study examines how much lead toxicity continues to impair the academic achievement of children of New York State, using 2010 test data. Methods This study relies on three sets of data published for the 57 New York counties outside New York City: school achievement data from the New York State Department of Education, data on incidence of elevated blood lead levels from the New York State Department of Health, and data on income from the U.S. Census Bureau. We studied third grade and eighth grade test scores in English Language Arts and mathematics. Using the county as the unit of analysis, we computed bivariate correlations and regression coefficients, with percent of children achieving at the lowest reported level as the dependent variable and the percent of preschoolers in the county with elevated blood lead levels as the independent variable. Then we repeated those analyses using partial correlations to control for possible confounding effects of family income, and using multiple regressions with income included. Results The bivariate correlations between incidence of elevated lead and number of children in the lowest achievement group ranged between 0.38 and 0.47. The partial correlations ranged from 0.29 to 0.40. The regression coefficients, both bivariate and partial (both estimating the increase in percent of children in the lowest achievement group for every percent increase in the children with elevated blood lead levels, ranged from 0.52 to 1.31. All regression coefficients, when rounded to the nearest integer, were

  10. Longitudinal analysis of standardized test scores of students in the Science Writing Heuristic approach

    Science.gov (United States)

    Chanlen, Niphon

    The purpose of this study was to examine the longitudinal impacts of the Science Writing Heuristic (SWH) approach on student science achievement measured by the Iowa Test of Basic Skills (ITBS). A number of studies have reported positive impact of an inquiry-based instruction on student achievement, critical thinking skills, reasoning skills, attitude toward science, etc. So far, studies have focused on exploring how an intervention affects student achievement using teacher/researcher-generated measurement. Only a few studies have attempted to explore the long-term impacts of an intervention on student science achievement measured by standardized tests. The students' science and reading ITBS data was collected from 2000 to 2011 from a school district which had adopted the SWH approach as the main approach in science classrooms since 2002. The data consisted of 12,350 data points from 3,039 students. The multilevel model for change with discontinuity in elevation and slope technique was used to analyze changes in student science achievement growth trajectories prior and after adopting the SWH approach. The results showed that the SWH approach positively impacted students by initially raising science achievement scores. The initial impact was maintained and gradually increased when students were continuously exposed to the SWH approach. Disadvantaged students who were at risk of having low science achievement had bigger benefits from experience with the SWH approach. As a result, existing problematic achievement gaps were narrowed down. Moreover, students who started experience with the SWH approach as early as elementary school seemed to have better science achievement growth compared to students who started experiencing with the SWH approach only in high school. The results found in this study not only confirmed the positive impacts of the SWH approach on student achievement, but also demonstrated additive impacts found when students had longitudinal experiences

  11. A Comparison of Scores on the WISC-R and Lorge-Thorndike Intelligence Test for Disadvantaged Black Elementary School Children

    Science.gov (United States)

    Lowe, James D.; Karnes, Frances A.

    1976-01-01

    It is indicated that, although the scores [obtained on both tests] are significantly correlated, the tests yield significantly different scores with the Lorge-Thorndike consistently overestimating the WISC-R full scale I.Q. (Author)

  12. Critique of the Watson-Glaser Critical Thinking Appraisal Test: The More You Know, the Lower Your Score

    Directory of Open Access Journals (Sweden)

    Kevin Possin

    2014-12-01

    Full Text Available The Watson-Glaser Critical Thinking Appraisal Test is one of the oldest, most frequently used, multiple-choice critical-thinking tests on the market in business, government, and legal settings for purposes of hiring and promotion. I demonstrate, however, that the test has serious construct-validity issues, stemming primarily from its ambiguous, unclear, misleading, and sometimes mysterious instructions, which have remained unaltered for decades. Erroneously scored items further diminish the test’s validity. As a result, having enhanced knowledge of formal and informal logic could well result in test subjects receiving lower scores on the test. That’s not how things should work for a CT assessment test.

  13. R in Psychometrics and Psychometrics in R

    OpenAIRE

    Leeuw, Jan de

    2006-01-01

    In psychometrics, and in the closely related fields of quantititative methods for the social and educational sciences, R is not yet used very often. Traditional mainframe packages such as SAS and SPSS are still dominant at the user-level, Stata has made inroads at the teaching level, and Matlab is quite prominent at the research level. In this paper we define the most visible techniques in the psychometrics area, we give an overview of what is available in R, and we discuss what is m...

  14. A physical function test for use in the intensive care unit: validity, responsiveness, and predictive utility of the physical function ICU test (scored).

    Science.gov (United States)

    Denehy, Linda; de Morton, Natalie A; Skinner, Elizabeth H; Edbrooke, Lara; Haines, Kimberley; Warrillow, Stephen; Berney, Sue

    2013-12-01

    Several tests have recently been developed to measure changes in patient strength and functional outcomes in the intensive care unit (ICU). The original Physical Function ICU Test (PFIT) demonstrates reliability and sensitivity. The aims of this study were to further develop the original PFIT, to derive an interval score (the PFIT-s), and to test the clinimetric properties of the PFIT-s. A nested cohort study was conducted. One hundred forty-four and 116 participants performed the PFIT at ICU admission and discharge, respectively. Original test components were modified using principal component analysis. Rasch analysis examined the unidimensionality of the PFIT, and an interval score was derived. Correlations tested validity, and multiple regression analyses investigated predictive ability. Responsiveness was assessed using the effect size index (ESI), and the minimal clinically important difference (MCID) was calculated. The shoulder lift component was removed. Unidimensionality of combined admission and discharge PFIT-s scores was confirmed. The PFIT-s displayed moderate convergent validity with the Timed "Up & Go" Test (r=-.60), the Six-Minute Walk Test (r=.41), and the Medical Research Council (MRC) sum score (rho=.49). The ESI of the PFIT-s was 0.82, and the MCID was 1.5 points (interval scale range=0-10). A higher admission PFIT-s score was predictive of: an MRC score of ≥48, increased likelihood of discharge home, reduced likelihood of discharge to inpatient rehabilitation, and reduced acute care hospital length of stay. Scoring of sit-to-stand assistance required is subjective, and cadence cutpoints used may not be generalizable. The PFIT-s is a safe and inexpensive test of physical function with high clinical utility. It is valid, responsive to change, and predictive of key outcomes. It is recommended that the PFIT-s be adopted to test physical function in the ICU.

  15. Psychometrics in Support of a Valid Assessment of Linguistic Minorities: Implications for the Test and Sampling Designs

    Science.gov (United States)

    Oliveri, María Elena; von Davier, Alina A.

    2016-01-01

    In this study, we propose that the unique needs and characteristics of linguistic minorities should be considered throughout the test development process. Unlike most measurement invariance investigations in the assessment of linguistic minorities, which typically are conducted after test administration, we propose strategies that focus on the…

  16. Classifying and scoring of molecules with the NGN: new datasets, significance tests, and generalization

    Directory of Open Access Journals (Sweden)

    Cameron Christopher JF

    2010-10-01

    Full Text Available Abstract This paper demonstrates how a Neural Grammar Network learns to classify and score molecules for a variety of tasks in chemistry and toxicology. In addition to a more detailed analysis on datasets previously studied, we introduce three new datasets (BBB, FXa, and toxicology to show the generality of the approach. A new experimental methodology is developed and applied to both the new datasets as well as previously studied datasets. This methodology is rigorous and statistically grounded, and ultimately culminates in a Wilcoxon significance test that proves the effectiveness of the system. We further include a complete generalization of the specific technique to arbitrary grammars and datasets using a mathematical abstraction that allows researchers in different domains to apply the method to their own work. Background Our work can be viewed as an alternative to existing methods to solve the quantitative structure-activity relationship (QSAR problem. To this end, we review a number approaches both from a methodological and also a performance perspective. In addition to these approaches, we also examined a number of chemical properties that can be used by generic classifier systems, such as feed-forward artificial neural networks. In studying these approaches, we identified a set of interesting benchmark problem sets to which many of the above approaches had been applied. These included: ACE, AChE, AR, BBB, BZR, Cox2, DHFR, ER, FXa, GPB, Therm, and Thr. Finally, we developed our own benchmark set by collecting data on toxicology. Results Our results show that our system performs better than, or comparatively to, the existing methods over a broad range of problem types. Our method does not require the expert knowledge that is necessary to apply the other methods to novel problems. Conclusions We conclude that our success is due to the ability of our system to: 1 encode molecules losslessly before presentation to the learning system, and 2

  17. The Dutch Technical-Tactical Tennis Test (D4T for Talent Identification and Development: Psychometric Characteristics

    Directory of Open Access Journals (Sweden)

    Kolman Nikki

    2017-01-01

    Full Text Available This study examined the test-retest reliability, validity and feasibility of the newly developed Dutch Technical-Tactical Tennis Test (D4T. This new test is relevant for talent identification and development in tennis. Thirty-two youth male tennis players (age 13.4 ± 0.5 were classified as elite (n = 15 or sub-elite (n = 17 according to their position on the national youth ranking list under 15 years (cut-off rank 50 in the Netherlands. Games, rallies and different tactical situations (i.e. offensive, neutral and defensive were simulated with a ball machine. Players had to return 72 balls to predetermined target areas. Stroke quality was recorded based on ball velocity and accuracy (VA-index, as well as percentage errors. Test-retest reliability was assessed by comparing differences between the first and second test-session (n = 10. An intraclass-correlation coefficient of .78 for the VA-index was found (p < .05, indicating excellent test-retest reliability. Independent t-tests revealed that elite players outscored sub-elite players for the VA-index, ball velocity, accuracy and percentage errors (p < .05, supporting good validity. Furthermore, a high correlation was found between the VA-index and individual positions on the youth ranking list (p = -.75; p < .001. The assessment of feasibility indicated that the D4T was applicable for instructors and coaches. In conclusion, the D4T was shown to be a reliable, valid and feasible test to measure technical-tactical characteristics of tennis performance in youth players.

  18. Predicting psychopharmacological drug effects on actual driving performance (SDLP) from psychometric tests measuring driving-related skills

    OpenAIRE

    Verster, Joris C.; Roth, Thomas

    2011-01-01

    Rationale There are various methods to examine driving ability. Comparisons between these methods and their relationship with actual on-road driving is often not determined. Objective The objective of this study was to determine whether laboratory tests measuring driving-related skills could adequately predict on-the-road driving performance during normal traffic. Methods Ninety-six healthy volunteers performed a standardized on-the-road driving test. Subjects were instructed to drive with a ...

  19. Psychometric Properties of the Autism-Spectrum Quotient for Assessing Low and High Levels of Autistic Traits in College Students.

    Science.gov (United States)

    Stevenson, Jennifer L; Hart, Kari R

    2017-06-01

    The current study systematically investigated the effects of scoring and categorization methods on the psychometric properties of the Autism-Spectrum Quotient. Four hundred and three college students completed the Autism-Spectrum Quotient at least once. Total scores on the Autism-Spectrum Quotient had acceptable internal consistency and test-retest reliability using a binary or Likert scoring method, but the results were more varied for the subscales. Overall, Likert scoring yielded higher internal consistency and test-retest reliability than binary scoring. However, agreement in categorization of low and high autistic traits was poor over time (except for a median split on Likert scores). The results support using Likert scoring and administering the Autism-Spectrum Quotient at the same time as the task of interest with neurotypical participants.

  20. CaPTHUS scoring model in primary hyperparathyroidism: can it eliminate the need for ioPTH testing?

    Science.gov (United States)

    Elfenbein, Dawn M; Weber, Sara; Schneider, David F; Sippel, Rebecca S; Chen, Herbert

    2015-04-01

    The CaPTHUS model was reported to have a positive predictive value of 100 % to correctly predict single-gland disease in patients with primary hyperparathyroidism, thus obviating the need for intraoperative parathyroid hormone (ioPTH) testing. We sought to apply the CaPTHUS scoring model in our patient population and assess its utility in predicting long-term biochemical cure. We retrospective reviewed all parathyroidectomies for primary hyperparathyroidism performed at our university hospital from 2003 to 2012. We routinely perform ioPTH testing. Biochemical cure was defined as a normal calcium level at 6 months. A total of 1,421 patients met the inclusion criteria: 78 % of patients had a single adenoma at the time of surgery, 98 % had a normal serum calcium at 1 week postoperatively, and 96 % had a normal serum calcium level 6 months postoperatively. Using the CaPTHUS scoring model, 307 patients (22.5 %) had a score of ≥ 3, with a positive predictive value of 91 % for single adenoma. A CaPTHUS score of ≥ 3 had a positive predictive value of 98 % for biochemical cure at 1 week as well as at 6 months. In our population, where ioPTH testing is used routinely to guide use of bilateral exploration, patients with a preoperative CaPTHUS score of ≥ 3 had good long-term biochemical cure rates. However, the model only predicted adenoma in 91 % of cases. If minimally invasive parathyroidectomy without ioPTH testing had been done for these patients, the cure rate would have dropped from 98 % to an unacceptable 89 %. Even in these patients with high CaPTHUS scores, multigland disease is present in almost 10 %, and ioPTH testing is necessary.

  1. Testing measurement invariance of the schizotypal personality questionnaire-brief scores across Spanish and Swiss adolescents.

    Directory of Open Access Journals (Sweden)

    Javier Ortuño-Sierra

    Full Text Available BACKGROUND: Schizotypy is a complex construct intimately related to psychosis. Empirical evidence indicates that participants with high scores on schizotypal self-report are at a heightened risk for the later development of psychotic disorders. Schizotypal experiences represent the behavioural expression of liability for psychotic disorders. Previous factorial studies have shown that schizotypy is a multidimensional construct similar to that found in patients with schizophrenia. Specifically, using the Schizotypal Personality Questionnaire-Brief (SPQ-B, the three-dimensional model has been widely replicated. However, there has been no in-depth investigation of whether the dimensional structure underlying the SPQ-B scores is invariant across countries. METHODS: The main goal of this study was to examine the measurement invariance of the SPQ-B scores across Spanish and Swiss adolescents. The final sample was made up of 261 Spanish participants (51.7% men; M = 16.04 years and 241 Swiss participants (52.3% men; M = 15.94 years. RESULTS: The results indicated that Raine et al.'s three-factor model presented adequate goodness-of-fit indices. Moreover, the results supported the measurement invariance (configural and partial strong invariance of the SPQ-B scores across the two samples. Spanish participants scored higher on Interpersonal dimension than Swiss when latent means were compared. DISCUSSION: The study of measurement equivalence across countries provides preliminary evidence for the Raine et al.'s three-factor model and of the cross-cultural validity of the SPQ-B scores in adolescent population. Future studies should continue to examine the measurement invariance of the schizotypy and psychosis-risk syndromes across cultures.

  2. Differences in distribution of T-scores and Z-scores among bone densitometry tests in postmenopausal women (a comparative study)

    International Nuclear Information System (INIS)

    Wendlova, J.

    2002-01-01

    To determine the character of T-score and Z-score value distribution in individually selected methods of bone densitometry and to compare them using statistical analysis. We examined 56 postmenopausal women with an age between 43 and 68 years with osteopenia or osteoporosis according to the WHO classification. The following measurements were made in each patient: T-score and Z-score for: 1) Stiffness index (S) of the left heel bone, USM (index). 2) Bone mineral density of the left heel bone (BMDh), DEXA (g of Ca hydroxyapatite per cm 2 ). 3) Bone mineral density of trabecular bone of the L1 vertebra (BMDL1). QCT (mg of Ca hydroxyapatite per cm 3 ). The densitometers used in the study were: ultrasonometer to measure heel bone, Achilles plus LUNAR, USA: DEXA to measure heel bone, PIXl, LUNAR, USA: QCT to measure the L1 vertebra, CT, SOMATOM Plus, Siemens, Germany. Statistical analysis: differences between measured values of T-scores (Z-scores) were evaluated by parametric or non-parametric methods of determining the 95 % confidence intervals (C.I.). Differences between Z-score and T-score values for compared measurements were statistically significant; however, these differences were lower for Z-scores. Largest differences in 95 % C.I., characterizing individual measurements of T-score values (in comparison with Z-scores), were found for those densitometers whose age range of the reference groups of young adults differed the most, and conversely, the smallest differences in T-score values were found when the differences between the age ranges of reference groups were smallest. The higher variation in T-score values in comparison to Z-scores is also caused by a non-standard selection of the reference groups of young adults for the QCT, PIXI and Achilles Plus densitometers used in the study. Age characteristics of the reference group for T-scores should be standardized for all types of densitometers. (author)

  3. Questionnaire-based assessment of executive functioning: Psychometrics.

    Science.gov (United States)

    Castellanos, Irina; Kronenberger, William G; Pisoni, David B

    2018-01-01

    The psychometric properties of the Learning, Executive, and Attention Functioning (LEAF) scale were investigated in an outpatient clinical pediatric sample. As a part of clinical testing, the LEAF scale, which broadly measures neuropsychological abilities related to executive functioning and learning, was administered to parents of 118 children and adolescents referred for psychological testing at a pediatric psychology clinic; 85 teachers also completed LEAF scales to assess reliability across different raters and settings. Scores on neuropsychological tests of executive functioning and academic achievement were abstracted from charts. Psychometric analyses of the LEAF scale demonstrated satisfactory internal consistency, parent-teacher inter-rater reliability in the small to large effect size range, and test-retest reliability in the large effect size range, similar to values for other executive functioning checklists. Correlations between corresponding subscales on the LEAF and other behavior checklists were large, while most correlations with neuropsychological tests of executive functioning and achievement were significant but in the small to medium range. Results support the utility of the LEAF as a reliable and valid questionnaire-based assessment of delays and disturbances in executive functioning and learning. Applications and advantages of the LEAF and other questionnaire measures of executive functioning in clinical neuropsychology settings are discussed.

  4. The Health Professions Admission Test (HPAT) score and leaving certificate results can independently predict academic performance in medical school: do we need both tests?

    LENUS (Irish Health Repository)

    Halpenny, D

    2010-11-01

    A recent study raised concerns regarding the ability of the health professions admission test (HPAT) Ireland to improve the selection process in Irish medical schools. We aimed to establish whether performance in a mock HPAT correlated with academic success in medicine. A modified HPAT examination and a questionnaire were administered to a group of doctors and medical students. There was a significant correlation between HPAT score and college results (r2: 0.314, P = 0.018, Spearman Rank) and between leaving cert score and college results (r2: 0.306, P = 0.049, Spearman Rank). There was no correlation between leaving cert points score and HPAT score. There was no difference in HPAT score across a number of other variables including gender, age and medical speciality. Our results suggest that both the HPAT Ireland and the leaving certificate examination could act as independent predictors of academic achievement in medicine.

  5. The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

    OpenAIRE

    Xu, Jian

    2017-01-01

    The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating...

  6. Psychometric Characteristics of a Measure of Emotional Dispositions Developed to Test a Developmental Propensity Model of Conduct Disorder

    Science.gov (United States)

    Lahey, Benjamin B.; Applegate, Brooks; Chronis, Andrea M.; Jones, Heather A.; Williams, Stephanie Hall; Loney, Jan; Waldman, Irwin D.

    2008-01-01

    Lahey and Waldman proposed a developmental propensity model in which three dimensions of children's emotional dispositions are hypothesized to transact with the environment to influence risk for conduct disorder, heterogeneity in conduct disorder, and comorbidity with other disorders. To prepare for future tests of this model, a new measure of…

  7. Examining Construct Congruence for Psychometric Tests: A Note on an Extension to Binary Items and Nesting Effects

    Science.gov (United States)

    Raykov, Tenko; Marcoulides, George A.; Dimitrov, Dimiter M.; Li, Tatyana

    2018-01-01

    This article extends the procedure outlined in the article by Raykov, Marcoulides, and Tong for testing congruence of latent constructs to the setting of binary items and clustering effects. In this widely used setting in contemporary educational and psychological research, the method can be used to examine if two or more homogeneous…

  8. Measuring Parental Meta-Emotion: Psychometric Properties of the Emotion-Related Parenting Styles Self-Test

    Science.gov (United States)

    Hakim-Larson, Julie; Parker, Alison; Lee, Catharine; Goodwin, Jacqueline; Voelker, Sylvia

    2006-01-01

    Parental meta-emotion, assessed through interviews, involves parents' philosophy about emotions and has been found to be related to parenting behaviors and children's emotional and social competence (e.g., Gottman, Katz, & Hooven, 1996; Katz & Windecker-Nelson, 2004). The Emotion-Related Parenting Styles Self-Test is a true-false…

  9. Differences in physical-fitness test scores between actively and passively recruited older adults : Consequences for norm-based classification

    NARCIS (Netherlands)

    van Heuvelen, M.J.G.; Stevens, M.; Kempen, G.I.J.M.

    This study investigated differences in physical-fitness test scores between actively and passively recruited older adults and the consequences thereof for norm-based classification of individuals. Walking endurance, grip strength, hip flexibility, balance, manual dexterity, and reaction time were

  10. The Score Reliability of Draw-a-Person Intellectual Ability Test (DAP: IQ) for Rural Malawi Students

    Science.gov (United States)

    Khasu, Denis S.; Williams, Thomas O., Jr.

    2016-01-01

    In this brief article, the reliability of scores for the Draw-A-Person Intellectual Ability Test for Children, Adolescents, and Adults (DAP: IQ; Reynolds & Hickman, 2004) was examined through several analyses with a sample of 147 children from rural Malawi, Africa using a Chichewa translation of instructions. Cronbach alpha coefficients for…

  11. Differential Predictive Validity of High School GPA and College Entrance Test Scores for University Students in Yemen

    Science.gov (United States)

    Al-Hattami, Abdulghani Ali Dawod

    2012-01-01

    High school grade point average and college entrance test scores are two admission criteria that are currently used by most colleges in Yemen to select their prospective students. Given their widespread use, it is important to investigate their predictive validity to ensure the accuracy of the admission decisions in these institutions. This study…

  12. Automated Scoring for the "TOEFL Junior"® Comprehensive Writing and Speaking Test. Research Report. ETS RR-15-09

    Science.gov (United States)

    Evanini, Keelan; Heilman, Michael; Wang, Xinhao; Blanchard, Daniel

    2015-01-01

    This report describes the initial automated scoring results that were obtained using the constructed responses from the Writing and Speaking sections of the pilot forms of the "TOEFL Junior"® Comprehensive test administered in late 2011. For all of the items except one (the edit item in the Writing section), existing automated scoring…

  13. Predicting Pre-Service Classroom Teachers' Civil Servant Recruitment Examination's Educational Sciences Test Scores Using Artificial Neural Networks

    Science.gov (United States)

    Demir, Metin

    2015-01-01

    This study predicts the number of correct answers given by pre-service classroom teachers in Civil Servant Recruitment Examination's (CSRE) educational sciences test based on their high school grade point averages, university entrance scores, and grades (mid-term and final exams) from their undergraduate educational courses. This study was…

  14. The pornography craving questionnaire: psychometric properties.

    Science.gov (United States)

    Kraus, Shane; Rosenberg, Harold

    2014-04-01

    Despite the prevalence of pornography use, and recent conceptualization of problematic use as an addiction, we could find no published scale to measure craving for pornography. Therefore, we conducted three studies employing young male pornography users to develop and evaluate such a questionnaire. In Study 1, we had participants rate their agreement with 20 potential craving items after reading a control script or a script designed to induce craving to watch pornography. We dropped eight items because of low endorsement. In Study 2, we revised both the questionnaire and cue exposure stimuli and then evaluated several psychometric properties of the modified questionnaire. Item loadings from a principal components analysis, a high internal consistency reliability coefficient, and a moderate mean inter-item correlation supported interpreting the 12 revised items as a single scale. Correlations of craving scores with preoccupation with pornography, sexual history, compulsive internet use, and sensation seeking provided support for convergent validity, criterion validity, and discriminant validity, respectively. The enhanced imagery script did not impact reported craving; however, more frequent users of pornography reported higher craving than less frequent users regardless of script condition. In Study 3, craving scores demonstrated good one-week test-retest reliability and predicted the number of times participants used pornography during the following week. This questionnaire could be applied in clinical settings to plan and evaluate therapy for problematic users of pornography and as a research tool to assess the prevalence and contextual triggers of craving among different types of pornography users.

  15. Psychometrics of social cognitive measures for psychosis treatment research.

    Science.gov (United States)

    Davidson, Charlie A; Lesser, Rebecca; Parente, Lori T; Fiszdon, Joanna M

    2018-03-01

    Social cognition represents an important treatment target, closely linked to everyday social function. While a number of social cognitive interventions have recently been developed, measures used to evaluate these treatments are only beginning to receive psychometric scrutiny. Study goals were to replicate recently-published psychometrics for several social cognitive measures, and to provide information for additional social cognitive measures not included in recent reports. Forty-eight outpatients with psychotic-spectrum disorders completed measures of emotion perception, theory of mind, and attributional bias on two occasions, one month apart. Measures were tested for distributional characteristics, test-retest reliability, utility as a repeated measure, and relationship to symptoms and functioning. For a subgroup of participants, information about sensitivity to social cognitive treatment was also available. We replicated aspects of prior work, including largely favorable psychometric characteristics for the Bell-Lysaker Emotion Recognition Task, and promising but weaker characteristics for The Awareness of Social Inferences Test subscales and Reading the Mind in the Eyes Task. The Hinting Task had adequate test-retest statistics but a more pronounced ceiling effect. Ambiguous Intentions and Hostility Questionnaire data showed evidence of validity but were limited by inconsistency over time. Our results strongly support the Davos Assessment of Cognitive Biases Scale for future evaluation as a social cognitive treatment outcome measure. Its scores were adequately distributed, consistent over time, related to symptoms and functioning, and sensitive to treatment effects. Other relatively novel assessments of attributional bias and theory of mind showed some promise, although more work is needed. Published by Elsevier B.V.

  16. Pulmonary Exacerbation Score in Cystlc Fibrosis Patients: Reliability and Validity Testing

    OpenAIRE

    Keller, F.

    2016-01-01

    Background: Lung disease in cystic fibrosis (CF) is characterized by recurrent pulmonary exacerbations (PEs), but consensus on diagnostic criteria for PE is lacking. The use of a consistent definition of PE as an outcome measure in CF clinical trials would allow meaningful comparison across centers. The aim of this study was to assess the reliability and validity of a simplified version of the Seattle Pulmonary Exacerbation Score (SPEX). Materials and Methods: A cross-sectional observational ...

  17. Psychometric Testing of the Personal Internet Gaming Disorder Evaluation-9: A New Measure Designed to Assess Internet Gaming Disorder.

    Science.gov (United States)

    Pearcy, Benjamin T D; Roberts, Lynne D; McEvoy, Peter M

    2016-05-01

    Internet Gaming Disorder (IGD) is in the early stages of recognition as a disorder, following its inclusion in the Diagnostic and Statistical Manual for Mental Disorders (DSM-5; American Psychiatric Association(1)) as a condition for further study. Existing measures of Internet gaming pathology are limited in their ability to measure IGD as defined in the DSM-5. We present the initial development and validation of a new measure derived from the proposed DSM-5 criteria for IGD, the Personal Internet Gaming Disorder Evaluation-9 (PIE-9). A student sample (n = 119) and a community sample (n = 285), sourced through a variety of online gaming forums, completed an online survey comprising the new measure, existing measures of IGD, and a range of health and demographic questions. Exploratory and confirmatory factor analysis supported a single factor structure for the 9-item PIE-9. Internal consistency (α = 0.89) and test-retest reliability (intraclass correlation [ICC] = 0.77) were high. Convergent validity was demonstrated with similar gaming addiction measures. Predictive validity was established through significant differences in distress and disability between those who met the criteria for IGD and those who did not. The distress and disability associated with meeting IGD criteria fell within the range of other common DSM-5 disorders. Preliminary testing of the PIE-9 has demonstrated that it is an efficient and straightforward measure for use in further research of IGD, and as a potential screening measure in clinical practice.

  18. Are WISC IQ scores in children with mathematical learning disabilities underestimated? The influence of a specialized intervention on test performance.

    Science.gov (United States)

    Lambert, Katharina; Spinath, Birgit

    2018-01-01

    Intelligence measures play a pivotal role in the diagnosis of mathematical learning disabilities (MLD). Probably as a result of math-related material in IQ tests, children with MLD often display reduced IQ scores. However, it remains unclear whether the effects of math remediation extend to IQ scores. The present study investigated the impact of a special remediation program compared to a control group receiving private tutoring (PT) on the WISC IQ scores of children with MLD. We included N=45 MLD children (7-12 years) in a study with a pre- and post-test control group design. Children received remediation for two years on average. The analyses revealed significantly greater improvements in the experimental group on the Full-Scale IQ, and the Verbal Comprehension, Perceptual Reasoning, and Working Memory indices, but not Processing Speed, compared to the PT group. Children in the experimental group showed an average WISC IQ gain of more than ten points. Results indicate that the WISC IQ scores of MLD children might be underestimated and that an effective math intervention can improve WISC IQ test performance. Taking limitations into account, we discuss the use of IQ measures more generally for defining MLD in research and practice. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Timed up & go test score in patients with hip fracture is related to the type of walking aid

    DEFF Research Database (Denmark)

    Kristensen, Morten T; Bandholm, Thomas; Holm, Bente

    2009-01-01

    Kristensen MT, Bandholm T, Holm B, Ekdahl C, Kehlet H. Timed Up & Go test score in patients with hip fracture is related to the type of walking aid. OBJECTIVE: To determine the relationship between Timed Up & Go (TUG) test scores and type of walking aid used during the test, and to determine...... the feasibility of using the rollator as a standardized walking aid during the TUG in patients with hip fracture who were allowed full weight-bearing (FWB). DESIGN: Prospective methodological study. SETTING: An acute orthopedic hip fracture unit at a university hospital. PARTICIPANTS: Patients (N=126; 90 women......, 36 men) with hip fracture with a mean age +/- SD of 74.8+/-12.7 years performed the TUG the day before discharge from the orthopedic ward. INTERVENTIONS: Not applicable. MAIN OUTCOME MEASURES: The TUG was performed with the walking aid the patient was to be discharged with: a walker (n=88) or elbow...

  20. Comparison of Physical Therapy Anatomy Performance and Anxiety Scores in Timed and Untimed Practical Tests

    Science.gov (United States)

    Schwartz, Sarah M.; Evans, Cathy; Agur, Anne M.R.

    2015-01-01

    Students in health care professional programs face many stressful tests that determine successful completion of their program. Test anxiety during these high stakes examinations can affect working memory and lead to poor outcomes. Methods of decreasing test anxiety include lengthening the time available to complete examinations or evaluating…

  1. Relationship Between Broiler Body Weights, Eimeria maxima Gross Lesion Scores, and Microscores in Three Anticoccidial Sensitivity Tests.

    Science.gov (United States)

    Barrios, Miguel A; Da Costa, Manuel; Kimminau, Emily; Fuller, Lorraine; Clark, Steven; Pesti, Gene; Beckstead, Robert

    2017-06-01

    Anticoccidial sensitivity tests (ASTs) serve to determine the efficacy of anticoccidial drugs against Eimeria field isolates in a controlled laboratory setting. The most commonly measured parameters are body weight gain, feed conversion ratio, gross intestinal lesion scores, and mortality. Due to the difficulty in reliably scoring gross lesion scores of Eimeria maxima , microscopic analysis of intestinal scrapings (microscores) can be used in the field to indicate the presence of this particular Eimeria. The goal of this study was to determine the relationship between E. maxima microscores and broiler body weights and gross E. maxima lesion scores in three ASTs. Day-old broiler chicks were raised for 12 days on a standard corn-soy diet. On Day 12, chicks were placed in Petersime batteries and treatment diets were provided. There were six birds per pen, four pens per treatment, and 12 treatments, for a total of 288 chicks per AST. The treatments were as follows: 1) nonmedicated, noninfected; 2) nonmedicated, infected; 3) lasalocid, infected; 4) salinomycin, infected; 5) diclazuril, infected; 6) monensin, infected; 7) decoquinate, infected; 8) narasin + nicarbazin, infected; 9) narasin, infected; 10) nicarbazin, infected; 11) robenidine, infected; and 12) zoalene, infected. On Day 14, chicks were challenged with an Eimeria field isolate by oral gavage. On Day 20, broilers were weighed, and gross lesion scores and microscores were classified from 0 to 4 depending on the severity of the gross lesion scores and E. maxima microscores. Data from three trials using different field isolates were statistically analyzed using a logarithmic regression model. There was no relationship (P = 0.1224) between microscores and body weight gain. There was a positive relationship between microscores and gross lesion scores (P = 0.004). However, there was also an interaction between isolate and treatment (P Eimeria or the amount of E. maxima in the inoculum.

  2. The Standard Error of a Proportion for Different Scores and Test Length.

    Directory of Open Access Journals (Sweden)

    David A. Walker

    2005-06-01

    Full Text Available This paper examines Smith's (2003 proposed standard error of a proportion index..associated with the idea of reliability as sufficiency of information. A detailed table..indexing all of the standard error values affiliated with assessments that range from 5 to..100 items, where students scored as low as 50% correct and 50% incorrect to as high as..95% correct and 5% incorrect, calculated in increments of 1 percentage point, is..presented, along with distributional qualities. Examples using this measure for classroom..teachers and higher education instructors of assessment are provided.

  3. Birth order and its relationship to depression, anxiety, and self-concept test scores in children.

    Science.gov (United States)

    Gates, L; Lineberger, M R; Crockett, J; Hubbard, J

    1988-03-01

    Children (N = 404), 7 to 12 years old, were given the Children's Depression Inventory, the State-Trait Anxiety Inventory for Children, and the Piers-Harris Self-Concept Scale. First-born children scored significantly lower on depression than second-, third-, fourth-born, and youngest children. First borns showed significantly less trait anxiety than third-born children. First-born children also showed significantly higher levels of self-esteem than second-born and youngest children. Girls in this study showed significantly more trait anxiety than boys.

  4. Chronic obstructive pulmonary disease (COPD) assessment test scores corresponding to modified Medical Research Council grades among COPD patients.

    Science.gov (United States)

    Lee, Chang-Hoon; Lee, Jinwoo; Park, Young Sik; Lee, Sang-Min; Yim, Jae-Joon; Kim, Young Whan; Han, Sung Koo; Yoo, Chul-Gyu

    2015-09-01

    In assigning patients with chronic obstructive pulmonary disease (COPD) to subgroups according to the updated guidelines of the Global Initiative for Chronic Obstructive Lung Disease, discrepancies have been noted between the COPD assessment test (CAT) criteria and modified Medical Research Council (mMRC) criteria. We investigated the determinants of symptom and risk groups and sought to identify a better CAT criterion. This retrospective study included COPD patients seen between June 20, 2012, and December 5, 2012. The CAT score that can accurately predict an mMRC grade ≥ 2 versus COPD patients, the percentages of patients classified into subgroups A, B, C, and D were 24.5%, 47.2%, 4.2%, and 24.1% based on CAT criteria and 49.3%, 22.4%, 8.9%, and 19.4% based on mMRC criteria, respectively. More than 90% of the patients who met the mMRC criteria for the 'more symptoms group' also met the CAT criteria. AUROC and CART analyses suggested that a CAT score ≥ 15 predicted an mMRC grade ≥ 2 more accurately than the current CAT score criterion. During follow-up, patients with CAT scores of 10 to 14 did not have a different risk of exacerbation versus those with CAT scores COPD patients.

  5. Zero Calcium Score as a Filter for Further Testing in Patients Admitted to the Coronary Care Unit with Chest Pain.

    Science.gov (United States)

    Correia, Luis Cláudio Lemos; Esteves, Fábio P; Carvalhal, Manuela; Souza, Thiago Menezes Barbosa de; Sá, Nicole de; Correia, Vitor Calixto de Almeida; Alexandre, Felipe Kalil Beirão; Lopes, Fernanda; Ferreira, Felipe; Noya-Rabelo, Márcia

    2017-06-12

    The accuracy of zero coronary calcium score as a filter in patients with chest pain has been demonstrated at the emergency room and outpatient clinics, populations with low prevalence of coronary artery disease (CAD). To test the gatekeeping role of zero calcium score in patients with chest pain admitted to the coronary care unit (CCU), where the pretest probability of CAD is higher than that of other populations. Patients underwent computed tomography for calcium scoring, and obstructive CAD was defined by a minimum 70% stenosis on invasive angiography. In 146 patients studied, the prevalence of CAD was 41%. A zero calcium score was present in 35% of the patients. The sensitivity and specificity of zero calcium score yielded a negative likelihood ratio of 0.16. After logistic regression adjustment for pretest probability, zero calcium score was independently associated with lower odds of CAD (OR = 0.12, 95%CI = 0.04-0.36), increasing the area under the ROC curve of the clinical model from 0.76 to 0.82 (p = 0.006). Zero calcium score provided a net reclassification improvement of 0.20 (p = 0.0018) over the clinical model when using a pretest probability threshold of 10% for discharging without further testing. In patients with pretest probability zero calcium score had a negative predictive value of 95% (95%CI = 83%-99%), with a number needed to test of 2.1 for obtaining one additional discharge. Zero calcium score substantially reduces the pretest probability of obstructive CAD in patients admitted to the CCU with acute chest pain. (Arq Bras Cardiol. 2017; [online].ahead print, PP.0-0). A acurácia do escore de cálcio coronário zero como um filtro nos pacientes com dor torácica aguda tem sido demonstrada na sala de emergência e nos ambulatórios, populações com baixa prevalência de doença arterial coronariana (DAC). Testar o papel do escore de cálcio zero como filtro nos pacientes com dor torácica admitidos numa unidade coronariana intensiva (UCI), na

  6. The development and psychometric validation of the Ethical Awareness Scale.

    Science.gov (United States)

    Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela

    2018-04-19

    To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February-April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.

  7. Percentiles of the null distribution of 2 maximum lod score tests.

    Science.gov (United States)

    Ulgen, Ayse; Yoo, Yun Joo; Gordon, Derek; Finch, Stephen J; Mendell, Nancy R

    2004-01-01

    We here consider the null distribution of the maximum lod score (LOD-M) obtained upon maximizing over transmission model parameters (penetrance values, dominance, and allele frequency) as well as the recombination fraction. Also considered is the lod score maximized over a fixed choice of genetic model parameters and recombination-fraction values set prior to the analysis (MMLS) as proposed by Hodge et al. The objective is to fit parametric distributions to MMLS and LOD-M. Our results are based on 3,600 simulations of samples of n = 100 nuclear families ascertained for having one affected member and at least one other sibling available for linkage analysis. Each null distribution is approximately a mixture p(2)(0) + (1 - p)(2)(v). The values of MMLS appear to fit the mixture 0.20(2)(0) + 0.80chi(2)(1.6). The mixture distribution 0.13(2)(0) + 0.87chi(2)(2.8). appears to describe the null distribution of LOD-M. From these results we derive a simple method for obtaining critical values of LOD-M and MMLS. Copyright 2004 S. Karger AG, Basel

  8. A quantitative assessment of alkaptonuria: testing the reliability of two disease severity scoring systems.

    Science.gov (United States)

    Cox, Trevor F; Ranganath, Lakshminarayan

    2011-12-01

    Alkaptonuria (AKU) is due to excessive homogentisic acid accumulation in body fluids due to lack of enzyme homogentisate dioxygenase leading in turn to varied clinical manifestations mainly by a process of conversion of HGA to a polymeric melanin-like pigment known as ochronosis. A potential treatment, a drug called nitisinone, to decrease formation of HGA is available. However, successful demonstration of its efficacy in modifying the natural history of AKU requires an effective quantitative assessment tool. We have described two potential tools that could be used to quantitate disease burden in AKU. One tool describes scoring the clinical features that includes clinical assessments, investigations and questionnaires in 15 patients with AKU. The second tool describes a scoring system that only includes items obtained from questionnaires used in 44 people with AKU. Statistical analyses were carried out on the two patient datasets to assess the AKU tools; these included the calculation of Chronbach's alpha, multidimensional scaling and simple linear regression analysis. The conclusion was that there was good evidence that the tools could be adopted as AKU assessment tools, but perhaps with further refinement before being used in the practical setting of a clinical trial.

  9. Evaluation of the Discrepancy between the European Pharmacopoeia Test and an Adopted United States Pharmacopoeia Test Regarding the Weight Uniformity of Scored Tablet Halves: Is Harmonization Required?

    Science.gov (United States)

    Zaid, Abdel Naser; Ghoush, Abeer Abu; Al-Ramahi, Rowa'; Are'r, Mohammed

    2012-01-01

    The aim of this study was to evaluate whether there exists any discrepancy between the European Pharmacopoeia (Ph. Eur.) and adopted United States Pharmacopeia (USP) tests concerning the weight uniformity measurements of tablet halves after splitting. The USP method does not contain provisions to evaluate split tablets, so here we adopt their whole tablet weight uniformity method. Twenty-nine different commercial scored tablets (local and imported) were divided. The split units were individually weighed and the relative standard deviation (RSD) for each product was calculated and then evaluated according to both the adopted USP and the Ph. Eur. tests of weight uniformity. Twenty out of the 29 products tested failed the USP test, while 14 of them failed the Ph. Eur. test. Nine products passed both the USP and Ph. Eur. tests. Six products passed the Ph. Eur. test but failed the USP test, with all of these products having an RSD greater than 6%. The correlation coefficient between the weight and content of split halves for three randomly selected products-corotenol 100 mg, corotenol 50 mg, and lorazepam 2.5 mg-was found to be 0.986, 0.998, and 0.72, respectively. A clear difference can be seen between outcomes obtained by the two compendial tablet splitting methods with regard to weight uniformity. Results from the USP test showed that tighter measures are needed to pass the test. Our results argue that the Ph. Eur. should revise the existing weight uniformity test on scored tablets to include the RSD parameter in it. The USP should include this adopted test as a specific test for scored tablet halves, not just whole tablets. Manufacturers in some cases will need to improve the quality of the produced scored tablets in order to pass the USP test, especially those with low therapeutic indices. Finally, harmonization between the pharmacopoeias regarding the weight uniformity testing of split tablets is warranted. The aim of this study was to evaluate whether there

  10. A risk score for predicting coronary artery disease in women with angina pectoris and abnormal stress test finding.

    Science.gov (United States)

    Lo, Monica Y; Bonthala, Nirupama; Holper, Elizabeth M; Banks, Kamakki; Murphy, Sabina A; McGuire, Darren K; de Lemos, James A; Khera, Amit

    2013-03-15

    Women with angina pectoris and abnormal stress test findings commonly have no epicardial coronary artery disease (CAD) at catheterization. The aim of the present study was to develop a risk score to predict obstructive CAD in such patients. Data were analyzed from 337 consecutive women with angina pectoris and abnormal stress test findings who underwent cardiac catheterization at our center from 2003 to 2007. Forward selection multivariate logistic regression analysis was used to identify the independent predictors of CAD, defined by ≥50% diameter stenosis in ≥1 epicardial coronary artery. The independent predictors included age ≥55 years (odds ratio 2.3, 95% confidence interval 1.3 to 4.0), body mass index stress imaging (odds ratio 2.8, 95% confidence interval 1.5 to 5.5), and exercise capacity statistic of 0.745 (95% confidence interval 0.70 to 0.79), and an optimized cutpoint of a score of ≤2 included 62% of the subjects and had a negative predictive value of 80%. In conclusion, a simple clinical risk score of 7 characteristics can help differentiate those more or less likely to have CAD among women with angina pectoris and abnormal stress test findings. This tool, if validated, could help to guide testing strategies in women with angina pectoris. Copyright © 2013 Elsevier Inc. All rights reserved.

  11. Psychometric properties of a scale to measure alexithymia.

    Science.gov (United States)

    Blanchard, E B; Arena, J G; Pallmeyer, T P

    1981-01-01

    Four studies were conducted on a sample of 230 undergraduates to determine the psychometric properties of a measure of alexithymia, the Schalling-Sifneos Scale. In the first study it was found that scores on the scale are approximately normally distributed for each sex with 8.2% of males and 1.8% of females in the alexithymia range. In the second study a factor analysis of the scale revealed three distinct factors: (1) 'difficulty in expression of feelings'; (2) 'the importance of feelings especially about people'; (3) 'day-dreaming or introspection'. In the second factor analytic study, scores from several standard psychological tests on the same subjects were introduced with the scale items. Two factors in this analysis were comprised almost entirely of the other test scores: a 'general psychological distress factor' and a 'concerns about physical symptoms factor'. The other two factors were similar to factors 1 and 2 above in terms of items. The Rathus Assertiveness Scale loaded positively on the equivalent of factor 1. In the lst study, it was shown that Schalling-Sifneos Scale score is relatively orthogonal to other psychological tests with the exception of a Psychosomatic Symptom Checklist and thus is measuring something other than depression, anxiety, etc.

  12. A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.

    Science.gov (United States)

    Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael

    2014-01-01

    This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.

  13. Psychometric and screening properties of the WHO-5 well-being index in adult outpatients with Type 1 or Type 2 diabetes mellitus

    DEFF Research Database (Denmark)

    Hajós, Tibor R S; Pouwer, F; Skovlund, S E

    2013-01-01

    OBJECTIVE: The 5-item World Health Organization well-being index is a commonly used measure of emotional well-being, but research on psychometric properties in outpatients with diabetes is scarce. We examined psychometric and screening properties for depression of this index in a large sample...... of the WHO-5 index was determined by Cronbach's alpha. The factor structure was tested by confirmatory factor analysis. Concurrent validity was assessed by correlations with the Patient Health Questionnaire, Problem Areas in Diabetes and the Short Form-12 mental component scores. Sensitivity and specificity...... and Type 2 diabetes. Moderate to strong correlations were observed between the WHO-5 index and the Patient Health Questionnaire scores, the Problem Areas in Diabetes scores and the Short Form-12 mental component scores (r = 0.55-0.69, P

  14. The Impact of Linking Distinct Achievement Test Scores on the Interpretation of Student Growth in Achievement

    Science.gov (United States)

    Airola, Denise Tobin

    2011-01-01

    Changes to state tests impact the ability of State Education Agencies (SEAs) to monitor change in performance over time. The purpose of this study was to evaluate the Standardized Performance Growth Index (PGIz), a proposed statistical model for measuring change in student and school performance, across transitions in tests. The PGIz is a…

  15. EAP Study Recommendations and Score Gains on the IELTS Academic Writing Test

    Science.gov (United States)

    Green, Anthony

    2005-01-01

    The IELTS test is widely accepted by university admissions offices as evidence of English language ability. The test is also used to guide decisions about the amount of language study required for students to satisfy admissions requirements. Guidelines currently published by the British Association of Lecturers in English for Academic Purposes…

  16. Examining the Relationship between Students' Mathematics Test Scores and Computer Use at Home and at School

    Science.gov (United States)

    O'Dwyer, Laura M.; Russell, Michael; Bebell, Damian; Seeley, Kevon

    2008-01-01

    Over the past decade, standardized test results have become the primary tool used to judge the effectiveness of schools and educational programs, and today, standardized testing serves as the keystone for educational policy at the state and federal levels. This paper examines the relationship between fourth grade mathematics achievement and…

  17. Evaluation of psychometric properties of Tinetti performance-oriented mobility assessment scale in subjects with knee osteoarthritis

    OpenAIRE

    Parveen, Huma; Noohu, Majumi M.

    2017-01-01

    Objective: The objective of this study was to determine the psychometric properties of the Tinetti Performance-Oriented Mobility Assessment (POMA) scale to measure balance and gait impairments in individuals with knee osteoarthritis (OA). Methods: A convenient sample of 25 individuals with bilateral OA knee were recruited. The convergent validity was determined by correlation analysis between scores of Berg Balance Scale (BBS) with balance subscale (POMA-B) and the Timed Up and Go Test (TU...

  18. Developing a model of competence in the operating theatre: psychometric validation of the perceived perioperative competence scale-revised.

    Science.gov (United States)

    Gillespie, Brigid M; Polit, Denise F; Hamlin, Lois; Chaboyer, Wendy

    2012-01-01

    This paper describes the development and validation of the Revised Perioperative Competence Scale (PPCS-R). There is a lack of a psychometrically tested sound self-assessment tools to measure nurses' perceived competence in the operating room. Content validity was established by a panel of international experts and the original 98-item scale was pilot tested with 345 nurses in Queensland, Australia. Following the removal of several items, a national sample that included all 3209 nurses who were members of the Australian College of Operating Room Nurses was surveyed using the 94-item version. Psychometric testing assessed content validity using exploratory factor analysis, internal consistency using Cronbach's alpha, and construct validity using the "known groups" technique. During item reduction, several preliminary factor analyses were performed on two random halves of the sample (n=550). Usable data for psychometric assessment were obtained from 1122 nurses. The original 94-item scale was reduced to 40 items. The final factor analysis using the entire sample resulted in a 40 item six-factor solution. Cronbach's alpha for the 40-item scale was .96. Construct validation demonstrated significant differences (pperceived competence scores relative to years of operating room experience and receipt of specialty education. On the basis of these results, the psychometric properties of the PPCS-R were considered encouraging. Further testing of the tool in different samples of operating room nurses is necessary to enable cross-cultural comparisons. Copyright © 2011 Elsevier Ltd. All rights reserved.

  19. The Mental Vulnerability Questionnaire: a psychometric evaluation

    DEFF Research Database (Denmark)

    Eplov, Lene Falgaard; Petersen, Janne; Jørgensen, Torben

    2010-01-01

    The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim of the pre......The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim...... 0.30 for the 12 and the 22 item scales. All five Mental Vulnerability scales had positively skewed score distributions which were associated significantly with both SCL-90-R symptom scores and NEO-PI-R personality scales (primarily Neuroticism and Extraversion). Coefficient alpha was highest...

  20. The effect of an intervention program on functional movement screen test scores in mixed martial arts athletes.

    Science.gov (United States)

    Bodden, Jamie G; Needham, Robert A; Chockalingam, Nachiappan

    2015-01-01

    This study assessed the basic fundamental movements of mixed martial arts (MMA) athletes using the functional movement screen (FMS) assessment and determined if an intervention program was successful at improving results. Participants were placed into 1 of the 2 groups: intervention and control groups. The intervention group was required to complete a corrective exercise program 4 times per week, and all participants were asked to continue their usual MMA training routine. A mid-intervention FMS test was included to examine if successful results were noticed sooner than the 8-week period. Results highlighted differences in FMS test scores between the control group and intervention group (p = 0.006). Post hoc testing revealed a significant increase in the FMS score of the intervention group between weeks 0 and 8 (p = 0.00) and weeks 0 and 4 (p = 0.00) and no significant increase between weeks 4 and 8 (p = 1.00). A χ analysis revealed that the intervention group participants were more likely to have an FMS score >14 than participants in the control group at week 4 (χ = 7.29, p < 0.01) and week 8 (χ = 5.2, p ≤ 0.05). Finally, a greater number of participants in the intervention group were free from asymmetry at week 4 and week 8 compared with the initial test period. The results of the study suggested that a 4-week intervention program was sufficient at improving FMS scores. Most if not all, the movements covered on the FMS relate to many aspects of MMA training. The knowledge that the FMS can identify movement dysfunctions and, furthermore, the fact that the issues can be improved through a standardized intervention program could be advantageous to MMA coaches, thus, providing the opportunity to adapt and implement new additions to training programs.

  1. An Examination of Psychometric Properties of Positive Functional Attitudes Scale

    Directory of Open Access Journals (Sweden)

    Saide Umut ZEYBEK

    2017-08-01

    Full Text Available The aim of this study is to investigate the applicability of Coping Attitudes Scale: Measure of Positive Attitudes in Depression (CAS among Turkish young adult community sample and determine the psychometric properties (validity and reliability of this scale. This study was conducted with 419 students attending different departments in Mugla Sitki Kocman University, Faculty of Education in the spring semester of academic year of 2015-2016. Positive Functional Attitudes Scale, Beck Depression Scale, Beck Hopelessness Scale, Automatic Thoughts Scale, Positivity Scale and Developed Automatic Thoughts Scale.were used as data collection tools. Confirmatory factor analysis (CFA were used for investigation of the psychometric properties of the PFAS. Also, criterion-related validity, test-retest validity, and internal consistency were used calculated. The CFA results showed that standardized item estimates of the CAS ranged between 0.45 and 0.47. Also the CFA results showed that the original factor structure of the PFAS confirmed on the Turkish sample. internal consistency was calculated using the total community sample’s PFAS score. Cronbach’s alpha coefficient ort he total scale (.93 was high. Test-retest results of the subscales were 0.76. The findings showed that factor structures of the PFAS’ life perspective, personal accomplishment, positive future, self-worth, coping with problems had psychometric quality in Turkish version. As a result of the study, the Turkish version of PFAS has good validity and reliability for young adult community sample. [JCBPR 2017; 6(2.000: 59-66

  2. The Effect of Presentation Medium on Pilot Selection Test Battery Scores

    National Research Council Canada - National Science Library

    Biggerstaff, S

    1998-01-01

    .... The American Psychological Association (APA) has set guidelines to be followed to ensure both qualitative and quantitative equivalence of new test formats prior to their use in applied settings...

  3. Back extensor muscle endurance test scores in coal miners in Australia

    Energy Technology Data Exchange (ETDEWEB)

    Stewart, M.; Latimer, J.; Jamieson, M. [University of Sydney, Sydney, NSW (Australia). Faculty of Health and Science, School of Physiotherapy

    2003-06-01

    Low back pain is a common complaint among those working in the Australian coal mining industry. One test that may be predictive of first-time episodes of low back pain is the Biering-Sorensen test of back extensor endurance strength. While this test has been evaluated in overseas sedentary populations, normative data and the discriminative ability of the test have not been evaluated with coal miners. Eighty-eight coal miners completed a questionnaire for known risk factors for low back pain, performed the Biering-Sorensen test, and undertook a test of aerobic fitness. Data analysis was performed to describe the groups and to determine whether any significant difference existed between those with a past history of low back pain and those without. Significantly lower than expected holding times were found in this group of coal miners (mean 113 s). This result was significantly lower than demonstrated in previous studies. When holding times for those with a past history of low back pain were compared with times for those with no history of low back pain, the difference was not statistically significant, nor was there a significant difference in fitness between those with a past history of low back pain and those without. It is concluded that coal miners in Australia have lower than normal Biering-Sorensen holding times. This lower back holding time does not differ between coal miners with a past history of low back pain and those without.

  4. A powerful score-based test statistic for detecting gene-gene co-association.

    Science.gov (United States)

    Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

    2016-01-29

    The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.

  5. Parâmetros psicométricos: estudo comparativo entre testes de inteligência e de personalidade Psychometric parameters: comparative study between intelligence and personality tests

    Directory of Open Access Journals (Sweden)

    Ana Paula Porto Noronha

    2003-01-01

    Full Text Available A validade e a precisão de testes psicológicos vêm sendo bastante questionadas e discutidas atualmente. O presente estudo teve como objetivo avaliar a validade, a precisão e a existência de padronização brasileira em 43 testes psicológicos comercializados no Brasil, sendo 22 de inteligência e 21 de personalidade. Os testes foram comparados quanto ao período de publicação no Brasil. Os resultados indicaram que existe maior número de instrumentos publicados nas décadas de 1980 e 1990, que os testes de inteligência apresentam mais estudos de padronização, validade e precisão, embora não tenha havido diferença significante entre os grupos de testes (inteligência e personalidade. Novos estudos devem ser desenvolvidos com o intuito de promover os testes psicológicos e a área de avaliação psicológica, como um todo.Nowadays, the validity and the reliability of psychological tests are being very questioned and discussed. The current study aims to evaluate the validity, the reliability and the existence of Brazilian standardization in 43 psychological tests commercialized in Brazil, being 22 of intelligence and 21 of personality. The tests have been compared concearning the publication period in Brazil. The results have indicated that there is a bigger number of instruments published in the 1980´s and 1990´s, that the intelligence tests present more studies of standardization, validity and reliability, even though there hasn´t been significant difference among the groups tests (intelligence and personality. New studies must be developed aiming to promote the psychological tests and the area of psychological assessment, as a whole.

  6. Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

    Science.gov (United States)

    Haberman, Shelby J.

    2011-01-01

    Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

  7. Cognitive Ability and Personality Variables as Predictors of School Grades and Test Scores in Adolescents

    Science.gov (United States)

    Hofer, Manfred; Kuhnle, Claudia; Kilian, Britta; Fries, Stefan

    2012-01-01

    The predictive power of cognitive ability and self-control strength for self-reported grades and an achievement test were studied. It was expected that the variables use of time structure, academic procrastination, and motivational interference during learning further aid in predicting students' achievement because they are operative in situations…

  8. Nasalance Scores of Children with Repaired Cleft Palate Who Exhibit Normal Velopharyngeal Closure during Aerodynamic Testing

    Science.gov (United States)

    Zajac, David J.

    2013-01-01

    Purpose: To determine if children with repaired cleft palate and normal velopharyngeal (VP) closure as determined by aerodynamic testing exhibit greater acoustic nasalance than control children without cleft palate. Method: Pressure-flow procedures were used to identify 2 groups of children based on VP closure during the production of /p/ in the…

  9. A Comparison of Eighth Grade Students' Testing Scores between the "Jeopardy" and "Seatwork" Types of Review.

    Science.gov (United States)

    Daft, Lee T.

    This study focused on the review process before social studies testing. The students involved in the study were 71 13 and 14-year olds and came from predominantly middle to upper class social status in a Knoxville, Tennessee suburb. The influence of an interactive review based on the quiz show "Jeopardy" was compared with that of a "seatwork"…

  10. Interpreting Mathematics Scores on the New Jersey College Basic Skills Placement Test.

    Science.gov (United States)

    Dass, Jane; Pine, Charles

    The New Jersey College Basic Skills Placement Test (NJCBSPT) is designed to measure certain basic language and mathematics skills of students entering New Jersey colleges. The primary purpose of the two mathematics sections is to determine whether students are prepared to begin certain college-level work without a handicap in computation or…

  11. The Effects of Specific Reading Interventions on Elementary Students' Test Scores

    Science.gov (United States)

    Griffin, Jacqueline Laverne Meeks

    2016-01-01

    Many students in third, fourth and fifth grades struggle at the lowest levels of reading proficiency. In fact, fewer than 40% of fourth graders in the United States read at or above the "proficient" level on state standardized tests in 2009 (D'Ardenne, Barnes, Hightower, Lamason, Mason, Patterson, Stephens, Wilson, Smith & Erickson,…

  12. Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests

    Science.gov (United States)

    Sinharay, Sandip

    2018-01-01

    Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…

  13. THE MODERN RACISM SCALE: PSYCHOMETRIC

    Directory of Open Access Journals (Sweden)

    MANUEL CÁRDENAS

    2007-08-01

    Full Text Available An adaption of McConahay, Harder and Batts’ (1981 moderm racism scale is presented for Chilean population andits psychometric properties, (reliability and validity are studied, along with its relationship with other relevantpsychosocial variables in studies on prejudice and ethnic discrimination (authoritarianism, religiousness, politicalposition, etc., as well as with other forms of prejudice (gender stereotypes and homophobia. The sample consistedof 120 participants, students of psychology, resident in the city of Antofagasta (a geographical zone with a highnumber of Latin-American inmigrants. Our findings show that the scale seems to be a reliable instrument to measurethe prejudice towards Bolivian immigrants in our social environment. Likewise, important differences among thesubjects are detected with high and low scores in the psychosocial variables used.

  14. Psychometric Properties of the Drive for Muscularity Attitudes Questionnaire Among Irish Men

    Directory of Open Access Journals (Sweden)

    Travis A. Ryan

    2014-09-01

    Full Text Available The Drive for Muscularity Attitudes Questionnaire (DMAQ was developed to measure men’s desire to attain an idealized muscular body. To date, the cross-cultural suitability of this measure has received limited attention. The current study addressed this omission by testing the psychometric properties of the DMAQ using an online sample of Irish men (N = 327. Confirmatory factor analysis revealed that a unidimensional model adequately matched observed data (i.e., fit indices suggested acceptable model fit. Analyses also showed that the DMAQ yielded reliable and construct valid scores, suggesting that the scale holds promise as an indicant of the drive for muscularity among Irish men. Strengths and limitations associated with this study are discussed, such as advantages and disadvantages of Internet research. Directions for future research are given, including the need for more psychometric work.

  15. Detection of acute deterioration in health status visit among COPD patients by monitoring COPD assessment test score

    Directory of Open Access Journals (Sweden)

    Pothirat C

    2015-02-01

    Full Text Available Chaicharn Pothirat, Warawut Chaiwong, Atikun Limsukon, Athavudh Deesomchok, Chalerm Liwsrisakun, Chaiwat Bumroongkit, Theerakorn Theerakittikul, Nittaya PhetsukDivision of Pulmonary, Critical Care and Allergy, Department of Internal Medicine, Faculty of Medicine, Chiang Mai University, Chiang Mai, ThailandBackground: The Chronic Obstructive Pulmonary Disease Assessment Test (CAT could play a role in detecting acute deterioration in health status during monitoring visits in routine clinical practice.Objective: To evaluate the discriminative property of a change in CAT score from a stable baseline visit for detecting acute deterioration in health status visits of chronic obstructive pulmonary disease (COPD patients.Methods: The CAT questionnaire was administered to stable COPD patients routinely attending the chest clinic of Chiang Mai University Hospital who were monitored using the CAT score every 1–3 months for 15 months. Acute deterioration in health status was defined as worsening or exacerbation. CAT scores at baseline, and subsequent visits with acute deterioration in health status were analyzed using the t-test. The receiver operating characteristic curve was performed to evaluate the discriminative property of change in CAT score for detecting acute deterioration during a health status visit.Results: A total of 354 follow-up visits were made by 140 patients, aged 71.1±8.4 years, with a forced expiratory volume in 1 second of 47.49%±18.2% predicted, who were monitored for 15 months. The mean CAT score change between stable baseline visits, by patients’ and physicians’ global assessments, were 0.05 (95% confidence interval [CI], -0.37–0.46 and 0.18 (95% CI, -0.23–0.60, respectively. At worsening visits, as assessed by patients, there was significant increase in CAT score (6.07; 95% CI, 4.95–7.19. There were also significant increases in CAT scores at visits with mild and moderate exacerbation (5.51 [95% CI, 4.39–6

  16. The quantitative LOD score: test statistic and sample size for exclusion and linkage of quantitative traits in human sibships.

    Science.gov (United States)

    Page, G P; Amos, C I; Boerwinkle, E

    1998-04-01

    We present a test statistic, the quantitative LOD (QLOD) score, for the testing of both linkage and exclusion of quantitative-trait loci in randomly selected human sibships. As with the traditional LOD score, the boundary values of 3, for linkage, and -2, for exclusion, can be used for the QLOD score. We investigated the sample sizes required for inferring exclusion and linkage, for various combinations of linked genetic variance, total heritability, recombination distance, and sibship size, using fixed-size sampling. The sample sizes required for both linkage and exclusion were not qualitatively different and depended on the percentage of variance being linked or excluded and on the total genetic variance. Information regarding linkage and exclusion in sibships larger than size 2 increased as approximately all possible pairs n(n-1)/2 up to sibships of size 6. Increasing the recombination (theta) distance between the marker and the trait loci reduced empirically the power for both linkage and exclusion, as a function of approximately (1-2theta)4.

  17. Report test scores assessment of the functioning of dosimetry service staff of the CNLV

    International Nuclear Information System (INIS)

    Alvarez R, J.T.; Tovar M, V.M.

    2002-12-01

    The ININ realized the evaluation of the service of personal dosimetry in the CNLV, in the categories: IV. (Photons of high energy of 137 CS) and the VA. (Particles beta of 90 Sr/ 90 Y); in the category IV the test was satisfactory, however in the chart 1 has an underestimation a the American Standard HP over the value true conventional of a 9%; for this irregularity it is recommended to revise the procedures of evaluation of the process and the determination of the chart 1 of the HP. In the category VA, the test is also satisfactory, however the results contrasted with the chart 2 and the HP, the values were overestimated in 29% of the true conventional value, and for that problem is recommended to revise the evaluation procedures in contrast with the values determined by the standard HP. (Author)

  18. Sistem Scoring Conversion TOEFL Paper Based Test (PBT Politeknik Negeri Cilacap Menggunakan Metode User Centered Design

    Directory of Open Access Journals (Sweden)

    Cahya Vikasari

    2017-06-01

    Full Text Available Sistem komputer interaktif untuk dipakai oleh useruntuk mendukung pekerjannya. User merupakan object yang penting didalam pengembangan dan pembangun sistem. User adalah personal-personal yang terlibat langsung dalam pemakaian aplikasi. Konsep dari UCD adalah user sebagai pusat dari proses pengembangan sistem, dan tujuan/sifat-sifat, konteks dan lingkungan sistem semua didasarkan dari pengalaman pengguna Pembangunan sistem skoring test TOEFL paper based test (PBT di UPT bahasa politeknik negeri cilacapmenggunakan metode UCD. Dengan menggunakan metode UCD sistem dapat   mempermudah dan mempercepat pendaftaran oleh calon pendaftar dengan tampilan antarmuka yang user friendly , mempermudah proses pengelolaan data dan rekap data pendaftar, mempermudah pengkonversian skor TOEFL yang dilakukan secara otomatis, serta  meminimalisir terjadinya kesalahan, duplikasi data dan duplikasi kegiatan.

  19. Reliability and Validity of the New Tanaka B Intelligence Scale Scores: A Group Intelligence Test

    OpenAIRE

    Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

    2014-01-01

    OBJECTIVE: The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. METHODS: The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2 ± 0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach's alpha coefficient, and concurre...

  20. Refining Ovarian Cancer Test accuracy Scores (ROCkeTS): protocol for a prospective longitudinal test accuracy study to validate new risk scores in women with symptoms of suspected ovarian cancer

    Science.gov (United States)

    Sundar, Sudha; Rick, Caroline; Dowling, Francis; Au, Pui; Rai, Nirmala; Champaneria, Rita; Stobart, Hilary; Neal, Richard; Davenport, Clare; Mallett, Susan; Sutton, Andrew; Kehoe, Sean; Timmerman, Dirk; Bourne, Tom; Van Calster, Ben; Gentry-Maharaj, Aleksandra; Deeks, Jon

    2016-01-01

    Introduction Ovarian cancer (OC) is associated with non-specific symptoms such as bloating, making accurate diagnosis challenging: only 1 in 3 women with OC presents through primary care referral. National Institute for Health and Care Excellence guidelines recommends sequential testing with CA125 and routine ultrasound in primary care. However, these diagnostic tests have limited sensitivity or specificity. Improving accurate triage in women with vague symptoms is likely to improve mortality by streamlining referral and care pathways. The Refining Ovarian Cancer Test Accuracy Scores (ROCkeTS; HTA 13/13/01) project will derive and validate new tests/risk prediction models that estimate the probability of having OC in women with symptoms. This protocol refers to the prospective study only (phase III). Methods and analysis ROCkeTS comprises four parallel phases. The full ROCkeTS protocol can be found at http://www.birmingham.ac.uk/ROCKETS. Phase III is a prospective test accuracy study. The study will recruit 2450 patients from 15 UK sites. Recruited patients complete symptom and anxiety questionnaires, donate a serum sample and undergo ultrasound scored as per International Ovarian Tumour Analysis (IOTA) criteria. Recruitment is at rapid access clinics, emergency departments and elective clinics. Models to be evaluated include those based on ultrasound derived by the IOTA group and novel models derived from analysis of existing data sets. Estimates of sensitivity, specificity, c-statistic (area under receiver operating curve), positive predictive value and negative predictive value of diagnostic tests are evaluated and a calibration plot for models will be presented. ROCkeTS has received ethical approval from the NHS West Midlands REC (14/WM/1241) and is registered on the controlled trials website (ISRCTN17160843) and the National Institute of Health Research Cancer and Reproductive Health portfolios. PMID:27507231